Basic information

Full name
cathepsin F
Ensembl
ENSG00000174080.11
Summary
Cathepsins are papain family cysteine proteinases that represent a major component of the lysosomal proteolytic system. Cathepsins generally contain a signal sequence, followed by a propeptide and then a catalytically active mature region. The very long (251 amino acid residues) proregion of the cathepsin F precursor contains a C-terminal domain similar to the pro-segment of cathepsin L-like enzymes, a 50-residue flexible linker peptide, and an N-terminal domain predicted to adopt a cystatin-like fold. The cathepsin F proregion is unique within the papain family cysteine proteases in that it contains this additional N-terminal segment predicted to share structural similarities with cysteine protease inhibitors of the cystatin superfamily. This cystatin-like domain contains some of the elements known to be important for inhibitory activity. CTSF encodes a predicted protein of 484 amino acids which contains a 19 residue signal peptide. Cathepsin F contains five potential N-glycosylation sites, and it may be targeted to the endosomal/lysosomal compartment via the mannose 6-phosphate receptor pathway. The cathepsin F gene is ubiquitously expressed, and it maps to chromosome 11q13, close to the gene encoding cathepsin W. [provided by RefSeq, Jul 2008]
Annotation
Druggable target (Tier T4)

Protein product

  • ENST00000310325.10 Primary ENSP00000310832.5 (0 phosphosite)
Phosphosites on the primary protein product
Loading...

Tumor and normal comparison

Signed p-values
Data type Meta P BRCA CCRCC COAD GBM HNSCC LSCC LUAD OV PDAC UCEC

* P-values are from Wilcoxon rank sum test and can be clicked to show the box plots. Positive values mean higher abundance in tumor. BRCA and GBM do not have normal samples.

mRNA expression at gene level
Protein expression

* Mild outlier: filled circle; Extreme outlier: empty circle.

Phenotype and mutation association

Manhattan plot summarizing associations of phenotypes and mutations across all cohorts and omics data types

* Data points of significant associations above and below the dotted lines can be hovered to show the phenotype.

Associations of the protein abundance of CTSF with phenotypes and mutations

Signed p-values
Phenotype Meta P BRCA CCRCC COAD GBM HNSCC LSCC LUAD OV PDAC UCEC

* P-values could be from test for Spearman correlation, Wilcoxon rank sum test, Jonckheere-Terpstra trend test or Cox regression depending on the data type. P-values for individual cohorts can be clicked to show the data plots. The matrix icons in each row can be clicked to show a heatmap summary of associations across all cohorts and omics. The rows in the table can be expanded to show results from other omics.

Cis-association

Associations between omics data of CTSF

* The numbers are Spearman correlation coefficients and can be clicked to show the scatter plots. The color and size of the circles correlate with the coefficients.

Trans-association

Associations of the protein abundance of CTSF and the protein abundance of other genes

Signed p-values
Gene Meta P BRCA CCRCC COAD GBM HNSCC LSCC LUAD OV PDAC UCEC

* P-values are from test for Spearman correlation. P-values for individual cohorts can be clicked to show the data plots. The matrix icons in each row can be clicked to show a heatmap summary of associations across all cohorts and omics. The rows in the table can be expanded to show results from other omics.

Gene set enrichment analysis

Submit genes and the common logarithm of the p-values of their association with to WebGestalt.