Gene loci information

Transcript annotation

  • This transcript has been annotated as Tight junction protein ZO-1.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms is shown below. CDS regions are colored green. Transcription start sites (TSS) and transcription termination sites (TTS) predicted from CTR-Seq data are marked with solid circles and squares, respectively. Exact coordinates are listed in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g2239 g2239.t1 isoform g2239.t1 16241573 16254975
chr_3 g2239 g2239.t1 exon g2239.t1.exon1 16241573 16242028
chr_3 g2239 g2239.t1 cds g2239.t1.CDS1 16241573 16242028
chr_3 g2239 g2239.t1 exon g2239.t1.exon2 16242291 16244277
chr_3 g2239 g2239.t1 cds g2239.t1.CDS2 16242291 16244277
chr_3 g2239 g2239.t1 exon g2239.t1.exon3 16244342 16244436
chr_3 g2239 g2239.t1 cds g2239.t1.CDS3 16244342 16244436
chr_3 g2239 g2239.t1 exon g2239.t1.exon4 16244507 16244581
chr_3 g2239 g2239.t1 cds g2239.t1.CDS4 16244507 16244581
chr_3 g2239 g2239.t1 exon g2239.t1.exon5 16254351 16254609
chr_3 g2239 g2239.t1 cds g2239.t1.CDS5 16254351 16254609
chr_3 g2239 g2239.t1 exon g2239.t1.exon6 16254707 16254975
chr_3 g2239 g2239.t1 cds g2239.t1.CDS6 16254707 16254975
chr_3 g2239 g2239.t1 TSS g2239.t1 NA NA
chr_3 g2239 g2239.t1 TTS g2239.t1 NA NA
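
The coordinates in the table can be checked against the sequence lengths reported in the Sequences section. The sketch below assumes the Start and End columns are 1-based and inclusive (the table does not say so explicitly); the names `EXONS`, `exon_lengths` and `intron_lengths` are illustrative only.

```python
# Exon coordinates copied from the table above.
# Assumption: Start/End are 1-based and inclusive (GFF-style).
EXONS = [
    (16241573, 16242028),
    (16242291, 16244277),
    (16244342, 16244436),
    (16244507, 16244581),
    (16254351, 16254609),
    (16254707, 16254975),
]


def exon_lengths(exons):
    """Length of each exon under 1-based inclusive coordinates."""
    return [end - start + 1 for start, end in exons]


def intron_lengths(exons):
    """Length of each gap between consecutive exons, sorted by start."""
    ordered = sorted(exons)
    return [nxt[0] - cur[1] - 1 for cur, nxt in zip(ordered, ordered[1:])]


lengths = exon_lengths(EXONS)
total = sum(lengths)
print(lengths)                 # per-exon lengths in bp
print(intron_lengths(EXONS))   # per-intron lengths in bp
print(total)                   # spliced length in bp
print(total // 3 - 1)          # codons minus one stop codon
```

Under that assumption the exon lengths sum to 3141 bp, which matches the nucleotide record, and 3141 / 3 gives 1047 codons, i.e. 1046 residues plus a stop codon, which matches the protein record.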

Sequences

>g2239.t1 Gene=g2239 Length=3141
ATGATGGAAATAACAAATGAAATTGATGATAATGAAACTGAAAGTGATGTAATTGATGAT
GATGATGATTTGACTGAATCATCAGAACAAATTTTTCTACAAAATCTTTCAAAAATTATT
GCCGATAGCGAAAGAACGTCTGCCAATGAGAGTCTTTTATCTATAGCGGAAGTTAAATCA
ATTACAAATCAGCCACAAAATCAACAACCGCAATTTGGTTACTTTCTACCAATTTCTTAT
GCGCCACCAATACGACTACGTTCAAAAAGAGATACACAACATGAGTCTAGCCGTTCGCCC
ACACCTCTAACGCAACCAAATCACAATCACCATCAGCAGTATAGTAGCAACAACGTAAAT
AATCATATCAATGGCAGATCAAAACGATTCAGTTCTCCTGCGTATAGCGATTATGGATGT
GATAAAGTAAACAGTCGCAAATTAAGTGGAAAACATGAAAAACGAAGTATTCGTTCGATG
AATGAGGCTATTGAAGTTCTTGCTGATCAAGTTGAAGAAGAAAATTTGGGCGAGAGAACT
TCATGGGAATATCACACTGTGATTTTATTCAGAGTTGCTGGATATGGTTTTGGCATCGCT
GTGTCGGGTGGAAGAGATAATCCGCATTTTGCTAATGGAGATCCCTCGATTGCAGTTAGT
GATGTGCTTAAAGGTGGTCCAGCAGAGGGAAAACTTGAAACAAACGATCGAATAATAACA
GCAAATGGAATCTCATTGGAAAATGTTGAATATGCAACAGCAGTTCAAGTTTTAAGAGAT
AGTGGCAATACAGTGACACTTGTTGTTAAAAGACGTGCACCATTGCAATCAAGTGGAAAT
TATCAACAAGCTGGTATCAGTAATAGTGCAATTCCAACTCATCAGCATCAGCAAAGTCTC
AGTTCAATCGGATCTCAACAGCAAATTAAACTTGTCATCAATAAGAGCAGTAAGAAAGAA
GATTTTGGAATCGTTCTTGGCTGTAGATTATTCATCAAAGAGATCTCATCGAAAACACGT
GAGCAACTTGCACTTAATGGTTACTCACTGCAAGAAGGTGACATTGTTACTCGAGTTCAT
AATACAAATTGCAATGACATGATGAGCATAAAGGAAGCAAGAAAAATTATGGATAGTTGC
AAAGAACGATTAAATCTTGCTGTAATAAGAGATCCAAATGCAATAGTGCCACCACCTCAG
CCAAACACTTCAATTTACTCTCATCAACAGCAAATGTCAAATTGCTCGAATATCGAAGAT
GCTTTCAATTCATCAGCTTATTCAACACAGAATCTCTATGTGCAACCACCAACTAGACCA
TCATTAAGCACATTACTTGATGACAAGTGCAATCTTACACCACGAGGTAGAGCTCGAGGA
CCAATAACTGATATGTCACAGCTTTCACAACTTGATCGTCCATCATCACCACCTCATCAT
TCAAGAAGTCGTAGTGGAATTGAAATGATAGATGAACCGCCTCGACCGCCACCACCTCGT
GATGAATTTTATGGCACAAGACGAATGCAAGCAGAAACGACTGAACCAAGATATATTACA
TTCCAGAAAGAAGGTTCTGTTGGTATCAGACTAACAGGTGGCAATGAAGTTGGAATTTTT
GTGACTGCAGTTCAACAAAATAGTCCAGCATCAATGCAAGGACTTGTGCCTGGTGATAAA
TTGCTAAAAGTTAATGACATGGATATGAATGGAGTGACTAGAGAAGAAGCAGTTTTATTC
CTTCTTTCACTTCAAGACCGCATTGATTTAATTGTGCAATATTGCAAAGATGAATATGAA
AATGTCGTTCAAAATCAACGTGGTGATTCATTTCATATTAAGACACATTTTCATTATGAT
GCTCCAACCAAGAATGAACTTTCATTTAAATCAGGTGATGTGTTTAGAGTGATTGATACT
CTTCACAATGGAGTTGTCGGTTCATGGCAAGTTATGAGAATTGGAAGAGGACAACAAGAA
CTTCAGCGTGGAATCATTCCAAATAAAGCAAGAGCTGAAGAACTTGCAACAGCTCAGTTT
AATGCTACAAAGAAAGAAACAACAAATACAGAATCAAGAATGAATTTCTTTAGACGTAAA
AGAACAAATCATCGCAGATCGAAGTCTTTATCACGTGAAAATTGGGATGATGTTGTTTTT
GCTGATTCAGTCTCGAAATTCCCAGCTTATGAACGAGTTGTTTTAAGACACCCAGGATTC
ATTCGTCCAGTAGTTTTATTTGGACCAGTTGCAGATCTCGCTAGAGAAAGATTGATCAAA
GATCATCCAGAAAAGTTTACAGCACCATTACAAGATACTGATAAATCAAAATGTGGTATT
GTCAGATTGTCAAATATTCGTGACATTATGGATAGAGGAAAGCATGCATTACTTGATATC
ACACCAAATGCAGTTGATCGTCTGAATTATGCTCAATTTTATCCAATTGTTGTGTTTTTG
AAAGCCGATTCAAAGCATACAATTAAACAATTGCGACAAGGTATTCCAAAAACAGCACAC
AAAAGTTCAAAGAAACTTTTTGAGCAATGCCAGAAACTTGATCGAATGTGGTCACATGTG
TTTAGCACTACAATTAATTTAAATGATGCAGAATCGTGGTATCGAAAAACTAAAGAGACT
GTCGATAAGCAACAAGCTGGTGCTGTGTGGATGTCAGAGACAAAGGAATCAAGATTTACC
TCAGATATTTTTCTTCCTTATTTATCACCGCCATTATGTCCTTATGCTTGTTGTAATCCA
ACACGTCCGAGAGTTCATGTTGCTAGTTCATTAGCTTATTATCAAAGGCCTCGCTATTCT
ATGCAATTGCCTGCTCCTGTGAATTATCAGATTGTTCCGATTCGTCACTCAAATAGCTTT
TATCGAACGCCAACAAGAAGCTTACAAAGTTTAAATACTTTTGGCAATTTCCGTGCAATT
GATGATAATAATAATAGCTCGCGTACTGGAACTCTGAGAAATCACAATGAAGAGCGTAAA
TCGCCATATTATTATAATGAACTAACTCAAATGCATGCAACTAATCCGCAACAACAATTT
ATACCAATTACTAATCACATTAATGATGATAACTTTACAGATTTCATTAATACCATTCAT
CATTCTGAAACAATGTCATGA

>g2239.t1 Gene=g2239 Length=1046
MMEITNEIDDNETESDVIDDDDDLTESSEQIFLQNLSKIIADSERTSANESLLSIAEVKS
ITNQPQNQQPQFGYFLPISYAPPIRLRSKRDTQHESSRSPTPLTQPNHNHHQQYSSNNVN
NHINGRSKRFSSPAYSDYGCDKVNSRKLSGKHEKRSIRSMNEAIEVLADQVEEENLGERT
SWEYHTVILFRVAGYGFGIAVSGGRDNPHFANGDPSIAVSDVLKGGPAEGKLETNDRIIT
ANGISLENVEYATAVQVLRDSGNTVTLVVKRRAPLQSSGNYQQAGISNSAIPTHQHQQSL
SSIGSQQQIKLVINKSSKKEDFGIVLGCRLFIKEISSKTREQLALNGYSLQEGDIVTRVH
NTNCNDMMSIKEARKIMDSCKERLNLAVIRDPNAIVPPPQPNTSIYSHQQQMSNCSNIED
AFNSSAYSTQNLYVQPPTRPSLSTLLDDKCNLTPRGRARGPITDMSQLSQLDRPSSPPHH
SRSRSGIEMIDEPPRPPPPRDEFYGTRRMQAETTEPRYITFQKEGSVGIRLTGGNEVGIF
VTAVQQNSPASMQGLVPGDKLLKVNDMDMNGVTREEAVLFLLSLQDRIDLIVQYCKDEYE
NVVQNQRGDSFHIKTHFHYDAPTKNELSFKSGDVFRVIDTLHNGVVGSWQVMRIGRGQQE
LQRGIIPNKARAEELATAQFNATKKETTNTESRMNFFRRKRTNHRRSKSLSRENWDDVVF
ADSVSKFPAYERVVLRHPGFIRPVVLFGPVADLARERLIKDHPEKFTAPLQDTDKSKCGI
VRLSNIRDIMDRGKHALLDITPNAVDRLNYAQFYPIVVFLKADSKHTIKQLRQGIPKTAH
KSSKKLFEQCQKLDRMWSHVFSTTINLNDAESWYRKTKETVDKQQAGAVWMSETKESRFT
SDIFLPYLSPPLCPYACCNPTRPRVHVASSLAYYQRPRYSMQLPAPVNYQIVPIRHSNSF
YRTPTRSLQSLNTFGNFRAIDDNNNSSRTGTLRNHNEERKSPYYYNELTQMHATNPQQQF
IPITNHINDDNFTDFINTIHHSETMS

Protein features from InterProScan

No. Transcript Database ID Name Start End E.value
18 g2239.t1 CDD cd00992 PDZ_signaling 185 270 1.85983E-19
17 g2239.t1 CDD cd00992 PDZ_signaling 516 593 1.35058E-17
19 g2239.t1 CDD cd11859 SH3_ZO 612 673 1.7456E-34
14 g2239.t1 Gene3D G3DSA:2.30.42.10 - 172 283 9.8E-28
15 g2239.t1 Gene3D G3DSA:2.30.42.10 - 291 395 2.1E-6
16 g2239.t1 Gene3D G3DSA:2.30.42.10 - 503 610 1.1E-24
13 g2239.t1 Gene3D G3DSA:2.30.30.40 SH3 Domains 611 893 1.9E-93
12 g2239.t1 Gene3D G3DSA:3.40.50.300 - 739 885 1.9E-93
28 g2239.t1 MobiDBLite mobidb-lite consensus disorder prediction 1 24 -
26 g2239.t1 MobiDBLite mobidb-lite consensus disorder prediction 88 109 -
25 g2239.t1 MobiDBLite mobidb-lite consensus disorder prediction 438 498 -
27 g2239.t1 MobiDBLite mobidb-lite consensus disorder prediction 463 482 -
29 g2239.t1 MobiDBLite mobidb-lite consensus disorder prediction 484 498 -
5 g2239.t1 PANTHER PTHR13865 TIGHT JUNCTION PROTEIN 89 970 0.0
6 g2239.t1 PANTHER PTHR13865:SF28 POLYCHAETOID, ISOFORM O 89 970 0.0
4 g2239.t1 Pfam PF00595 PDZ domain 187 269 7.8E-15
3 g2239.t1 Pfam PF00595 PDZ domain 523 593 3.3E-12
1 g2239.t1 Pfam PF07653 Variant SH3 domain 612 673 7.4E-10
2 g2239.t1 Pfam PF00625 Guanylate kinase 784 883 4.0E-12
34 g2239.t1 ProSiteProfiles PS50106 PDZ domain profile. 186 273 19.561
33 g2239.t1 ProSiteProfiles PS50106 PDZ domain profile. 310 392 9.994
32 g2239.t1 ProSiteProfiles PS50106 PDZ domain profile. 523 596 15.702
31 g2239.t1 ProSiteProfiles PS50002 Src homology 3 (SH3) domain profile. 608 676 12.504
30 g2239.t1 ProSiteProfiles PS50052 Guanylate kinase-like domain profile. 781 882 16.611
20 g2239.t1 SMART SM00228 pdz_new 195 273 2.0E-14
21 g2239.t1 SMART SM00228 pdz_new 320 392 1.1
22 g2239.t1 SMART SM00228 pdz_new 525 596 3.0E-17
23 g2239.t1 SMART SM00326 SH3_2 611 675 0.0046
24 g2239.t1 SMART SM00072 gk_7 704 885 3.1E-40
7 g2239.t1 SUPERFAMILY SSF50156 PDZ domain-like 180 272 1.26E-21
8 g2239.t1 SUPERFAMILY SSF50156 PDZ domain-like 296 398 1.86E-6
9 g2239.t1 SUPERFAMILY SSF50156 PDZ domain-like 511 610 3.06E-26
11 g2239.t1 SUPERFAMILY SSF50044 SH3-domain 611 676 4.33E-10
10 g2239.t1 SUPERFAMILY SSF52540 P-loop containing nucleoside triphosphate hydrolases 783 892 1.59E-15

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 predicts a disordered region.
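
As a rough illustration of how the 0.5 cutoff turns per-residue scores into regions, the sketch below scans a list of scores and reports contiguous runs above the threshold. The function name, the `min_length` parameter and the example scores are hypothetical; the scores are made up and are not the IUPred3 output for g2239.t1.

```python
def disordered_regions(scores, threshold=0.5, min_length=1):
    """Return 1-based inclusive (start, end) runs where score > threshold."""
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos          # a new run begins here
        elif start is not None:
            if pos - start >= min_length:
                regions.append((start, pos - 1))
            start = None             # the run ended at the previous residue
    # Close a run that extends to the last residue.
    if start is not None and len(scores) - start + 1 >= min_length:
        regions.append((start, len(scores)))
    return regions


# Made-up scores for illustration only.
example = [0.9, 0.8, 0.4, 0.3, 0.6, 0.7, 0.55, 0.5]
print(disordered_regions(example))
```

A score of exactly 0.5 is treated as ordered here, following the "above 0.5" wording; a stricter `min_length` can be used to ignore very short runs.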

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005515 protein binding MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean ± standard deviation.
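
A minimal sketch of how such a summary could be computed from replicate TPM values. It assumes the sample standard deviation (n - 1 denominator), which the page does not specify; the function names and the replicate values are hypothetical.

```python
from statistics import mean, stdev


def summarize_tpm(replicates):
    """Return (mean, sample standard deviation) of replicate TPM values."""
    m = mean(replicates)
    # stdev() needs at least two values; report 0.0 for a single replicate.
    s = stdev(replicates) if len(replicates) > 1 else 0.0
    return m, s


def format_tpm(replicates, digits=2):
    """Format replicate TPM values as 'mean +/- sd'."""
    m, s = summarize_tpm(replicates)
    return f"{m:.{digits}f} +/- {s:.{digits}f}"


# Hypothetical replicate values, not taken from the Pv11 data set.
print(format_tpm([10.0, 12.0, 14.0]))
```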

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. A transcript was considered differentially expressed when (1) FDR < 0.05 and (2) fold change > 2, with TPM values calculated by RSEM. DE status and fold change between conditions are shown in the plot below.
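
The two criteria can be expressed as a simple filter. The sketch below is an illustration under stated assumptions, not the pipeline's own code: it assumes the fold change is available on a log2 scale and that "fold change > 2" applies in either direction (up or down), neither of which the page states. The function and parameter names are hypothetical.

```python
import math


def is_differentially_expressed(fdr, log2_fold_change,
                                fdr_cutoff=0.05, fold_cutoff=2.0):
    """Apply both criteria: FDR below cutoff and fold change above cutoff.

    Assumption: fold change is symmetric, so a 2-fold decrease
    (log2FC < -1) counts the same as a 2-fold increase (log2FC > 1).
    """
    if fdr is None or log2_fold_change is None:
        return False
    if math.isnan(fdr) or math.isnan(log2_fold_change):
        return False  # untested or filtered transcripts are not called DE
    return fdr < fdr_cutoff and abs(log2_fold_change) > math.log2(fold_cutoff)


print(is_differentially_expressed(0.01, 1.5))   # passes both criteria
print(is_differentially_expressed(0.20, 3.0))   # fails the FDR criterion
print(is_differentially_expressed(0.01, 1.0))   # exactly 2-fold, not > 2
```

Both comparisons are strict, matching the "<" and ">" in the criteria, so a transcript with exactly a 2-fold change is not called differentially expressed.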

Raw TPM values