Gene loci information

Transcript annotation

  • This transcript has been annotated as Putative cysteine proteinase CG12163.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g4405 g4405.t14 TTS g4405.t14 2001061 2001061
chr_2 g4405 g4405.t14 isoform g4405.t14 2001294 2003272
chr_2 g4405 g4405.t14 exon g4405.t14.exon1 2001294 2001377
chr_2 g4405 g4405.t14 cds g4405.t14.CDS1 2001294 2001377
chr_2 g4405 g4405.t14 exon g4405.t14.exon2 2001442 2001689
chr_2 g4405 g4405.t14 cds g4405.t14.CDS2 2001442 2001689
chr_2 g4405 g4405.t14 exon g4405.t14.exon3 2001757 2002007
chr_2 g4405 g4405.t14 cds g4405.t14.CDS3 2001757 2002007
chr_2 g4405 g4405.t14 exon g4405.t14.exon4 2002061 2003089
chr_2 g4405 g4405.t14 cds g4405.t14.CDS4 2002061 2003046
chr_2 g4405 g4405.t14 exon g4405.t14.exon5 2003160 2003272
chr_2 g4405 g4405.t14 TSS g4405.t14 NA NA

Sequences

>g4405.t14 Gene=g4405 Length=1725
CGTCAAATCGGTGTTCCCGGTGGAATCAGTCCTGTAGAAAACTTTGAAGATGTAAAAATT
TATGTTCAAGAAGCTATTGATGAAATTAATGATAATGAAGATCCTGATTACATTTTGAAA
CATATCGTTGAAGCAACCCAACAAGTTGTTGCAGGCATGAGTTATAAAATTAAAGCAGTG
TTTTCCAGAGATGGAAGCGACATTGAATGTGATTTTGATGTATGGGAGCAAGCTTGGATT
AAAGATGGACGTAAAGTTTCAGTTTCTTGCAAAAATGATAAGAAATATAAGTTGACCCAA
TCACCATCTAATCAGCGTGTCAAACGTGATAACACGCTTGAAAGAGTTCTTGGTTTACCA
TCCAATACTGATGATCATGACGATTTGATAAAAATACTTTCTGAACATTTGAAGAGACTC
GATACTGGAAGTGATGCACAATTTGAATTGGTAAAACTTGAAAAGGTAACTCAACAAGTA
GTAGCTGGAATAAAATATAAAGCAACAGGTATTTTTAAAATTGGCAATGAAGAGAAAAAA
TGTGTTATCGATGTATGGCATCGCTCATGGATTAAGGGAGATGAAGGCACTCAATTAAGC
GCTGATTGTGATAAAGGTGCAACAACTTTCAAGACAAAATCTTCTAGAAAAAGGAGATCA
GTTCATCACCACACACACAATCGTCACAATAGACAATCAGTAAGCGATCATTTTGATGAC
CATCATCATCATACTGATAGACATCATCATCAATACTCAGCTACTGAAGAAATGAAAGAA
ATAAAATCTGAAATTTTATTTAACAATTTCATAACTAAATATAATCGTAAATATGCCAAT
GAACTTGAACATAAAATGAGAATGAGAATTTTCAAGAAGAATTTACATAAAATTGAAATG
TTGAATAAGCATGAACAAGGCACTGCAAAGTATGGAATTACAGAATTCGCTGATTTAACT
GAAAAGGAATACTTGCATAAAACTGGTTTGAGAGTGCGTGAAAGACATGAGAATGAATTA
GAAAATCCAATTGCACATATTCCAGAAGTTGAAGATTTACCAACCGAATTTGATTGGAGA
GATAAATCAGCAGTTACAAGTGTAAAAAATCAAGGAAATTGTGGATCATGCTGGAGTTTT
TCTGTTACAGGAAATATTGAAGGCTTACATGCTATTAAAACTGGAAAACTTGAAGCTTAT
TCTGAACAAGAACTTTTGGACTGTGATACAACTGATAATGCTTGCAATGGTGGTTATATG
GATGATGCTTTTAAAGCAATTGAAAAAATTGGTGGTCTAGAATTAGAAGATGAATATCCT
TATCAAGCAAGGAAACAAAAGAAATGCTTGTTTAATGCTACTATGAGTCATGTTAAAGTT
AAAGGTGTTGTAGATTTGCCTAAAGGTGATGAAATTGCAATGCAAAAGTTTTTAGTCTCA
ACTGGTCCGATTTCCATTGGCATAAATGCTAATGCTATGCAATTTTATCGTGGTGGTGTT
TCGCATCCATGGAAAGTTCTTTGCAGAAAATCTAATTTAGATCATGGTGTTTTGATTGTT
GGATATGGAATAAAAGAGTATCCCATGTTTAATAAAACTTTACCTTATTGGACTATTAAA
AATTCATGGGGTCCAAAATGGGGTGAACAAGGATATTATCGAGTTTATCGTGGAGATAAC
AGTTGTGGAGTTGCAGAAATGGCAAGCAGCGCAGTACTTGAATAA

>g4405.t14 Gene=g4405 Length=522
MSYKIKAVFSRDGSDIECDFDVWEQAWIKDGRKVSVSCKNDKKYKLTQSPSNQRVKRDNT
LERVLGLPSNTDDHDDLIKILSEHLKRLDTGSDAQFELVKLEKVTQQVVAGIKYKATGIF
KIGNEEKKCVIDVWHRSWIKGDEGTQLSADCDKGATTFKTKSSRKRRSVHHHTHNRHNRQ
SVSDHFDDHHHHTDRHHHQYSATEEMKEIKSEILFNNFITKYNRKYANELEHKMRMRIFK
KNLHKIEMLNKHEQGTAKYGITEFADLTEKEYLHKTGLRVRERHENELENPIAHIPEVED
LPTEFDWRDKSAVTSVKNQGNCGSCWSFSVTGNIEGLHAIKTGKLEAYSEQELLDCDTTD
NACNGGYMDDAFKAIEKIGGLELEDEYPYQARKQKKCLFNATMSHVKVKGVVDLPKGDEI
AMQKFLVSTGPISIGINANAMQFYRGGVSHPWKVLCRKSNLDHGVLIVGYGIKEYPMFNK
TLPYWTIKNSWGPKWGEQGYYRVYRGDNSCGVAEMASSAVLE

Protein features from InterProScan

Transcript Database ID Name Start End E.value
12 g4405.t14 CDD cd02248 Peptidase_C1A 302 519 3.2955E-101
10 g4405.t14 Gene3D G3DSA:3.10.450.10 - 60 150 7.0E-11
11 g4405.t14 Gene3D G3DSA:3.90.70.10 Cysteine proteinases 167 521 2.9E-101
19 g4405.t14 MobiDBLite mobidb-lite consensus disorder prediction 157 203 -
20 g4405.t14 MobiDBLite mobidb-lite consensus disorder prediction 160 178 -
18 g4405.t14 MobiDBLite mobidb-lite consensus disorder prediction 179 203 -
3 g4405.t14 PANTHER PTHR13814:SF16 CYSTATIN 214 520 5.2E-99
4 g4405.t14 PANTHER PTHR13814 FETUIN 214 520 5.2E-99
7 g4405.t14 PRINTS PR00705 Papain cysteine protease (C1) family signature 319 334 4.1E-9
6 g4405.t14 PRINTS PR00705 Papain cysteine protease (C1) family signature 463 473 4.1E-9
5 g4405.t14 PRINTS PR00705 Papain cysteine protease (C1) family signature 484 490 4.1E-9
1 g4405.t14 Pfam PF08246 Cathepsin propeptide inhibitor domain (I29) 215 272 2.9E-12
2 g4405.t14 Pfam PF00112 Papain family cysteine protease 301 519 1.9E-71
15 g4405.t14 ProSitePatterns PS00139 Eukaryotic thiol (cysteine) proteases cysteine active site. 319 330 -
14 g4405.t14 ProSitePatterns PS00639 Eukaryotic thiol (cysteine) proteases histidine active site. 461 471 -
13 g4405.t14 ProSitePatterns PS00640 Eukaryotic thiol (cysteine) proteases asparagine active site. 484 503 -
17 g4405.t14 SMART SM00848 Inhibitor_I29_2 215 272 3.0E-18
16 g4405.t14 SMART SM00645 pept_c1 301 520 2.0E-100
8 g4405.t14 SUPERFAMILY SSF54403 Cystatin/monellin 73 140 4.54E-8
9 g4405.t14 SUPERFAMILY SSF54001 Cysteine proteinases 208 519 2.67E-100

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0008234 cysteine-type peptidase activity MF
GO:0006508 proteolysis BP

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values