Gene loci information

Transcript annotation

  • This transcript has been annotated as N-acetylgalactosaminyltransferase 6.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms is shown below. CDS regions are colored green. TSSs and TTSs predicted from CTR-Seq data are marked with solid circles and squares, respectively. Exact coordinates are listed in the table below.

Chromosome Gene Transcript Category ID Start End
chr_4 g15214 g15214.t1 isoform g15214.t1 4500247 4502638
chr_4 g15214 g15214.t1 exon g15214.t1.exon1 4500247 4500517
chr_4 g15214 g15214.t1 cds g15214.t1.CDS1 4500247 4500517
chr_4 g15214 g15214.t1 exon g15214.t1.exon2 4500657 4501076
chr_4 g15214 g15214.t1 cds g15214.t1.CDS2 4500657 4501076
chr_4 g15214 g15214.t1 exon g15214.t1.exon3 4501129 4501262
chr_4 g15214 g15214.t1 cds g15214.t1.CDS3 4501129 4501262
chr_4 g15214 g15214.t1 exon g15214.t1.exon4 4501407 4501461
chr_4 g15214 g15214.t1 cds g15214.t1.CDS4 4501407 4501461
chr_4 g15214 g15214.t1 exon g15214.t1.exon5 4501523 4501657
chr_4 g15214 g15214.t1 cds g15214.t1.CDS5 4501523 4501657
chr_4 g15214 g15214.t1 exon g15214.t1.exon6 4501811 4502205
chr_4 g15214 g15214.t1 cds g15214.t1.CDS6 4501811 4502205
chr_4 g15214 g15214.t1 exon g15214.t1.exon7 4502267 4502638
chr_4 g15214 g15214.t1 cds g15214.t1.CDS7 4502267 4502638
chr_4 g15214 g15214.t1 TSS g15214.t1 NA NA
chr_4 g15214 g15214.t1 TTS g15214.t1 NA NA
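The coordinates in the table can be cross-checked against the transcript length reported in the Sequences section. The sketch below assumes the Start and End columns are 1-based and inclusive (GFF-style), which the page does not state; the function names are illustrative only.

```python
# Exon coordinates copied from the table above.
# Assumption: positions are 1-based and inclusive (GFF-style).
EXONS = [
    (4500247, 4500517),
    (4500657, 4501076),
    (4501129, 4501262),
    (4501407, 4501461),
    (4501523, 4501657),
    (4501811, 4502205),
    (4502267, 4502638),
]


def exon_lengths(exons):
    """Length of each exon under inclusive coordinates."""
    return [end - start + 1 for start, end in exons]


def intron_lengths(exons):
    """Length of each gap between consecutive exons."""
    return [
        next_start - prev_end - 1
        for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
    ]


spliced_length = sum(exon_lengths(EXONS))
print(exon_lengths(EXONS))    # per-exon lengths
print(intron_lengths(EXONS))  # per-intron lengths
print(spliced_length)         # total spliced length
```

Under that assumption the exons sum to 1782 nt, which agrees with the Length=1782 given in the nucleotide FASTA header.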

Sequences

>g15214.t1 Gene=g15214 Length=1782
ATGTCAATTAGACGAATTGTAGTGAAAATTAATTTTTTAATTCTCAAACTTTTACGAACA
CGTGGCGGAATTTCAATTCTTCTTTTATGGCTTCTTCTTTCATTTTTATTAATTTTCTCA
ATTGTGCTGCATATCAATAATGATCCATTTAAAGTTGATAAACCATTTATTTTTATTGAA
CCAATTACGTCATATTATAGATTTCATCATAGTAATTTTAAAAGAGATTGGCATGATTAT
GAATTGATTAAAAGGGAAGCTTTAAGAACTGGTCCAGGTGAGCAAGGTCGTGGCGTATAC
ATCCCACTAGAAGAGCAAGAACTTGCTAGCAAAATCTACGTAGAAAATAAGCACAATGGA
CTTGCAAGTGACAAAATAGCTCGTGATCGATCGTTGCCAGACACTCGGCCATCTGCATGT
AGAAATAGAACATACCTCAATGAATTGCCATCTGTTTCTGTCATAATTCCATTTCATAAT
GAAGTTTTAAGTACACTGACAAGGACAATTCATAGTGTCTTTAATCGATCACCACCAGAA
TTATTGACGGAAGTTATTTTAGTGAATGATCATAGTGATAAAGAACATTGTTATGGTGAA
CTAGAAGAATATATTAAAGAATTTTTTGATCCATCTAAAATAAGATTATTAGTGATGGAT
CGAAGATATGGCTTAATGTGGGCAAGATTAGCTGGTGCACGAGCTGCAATTGGTGATGTC
TTAGTGTTTATGGATTGCCATACAGAAGCAAATGTTAATTGGTTACCACCTCTAATTGAA
CCAATTGCTTTGAATTATAGGACTTGTGTTTGTCCTTACATTGATGTCATTAGTGCACGT
GACTATTCATATATGGGAATAGGTCAAGGATCTCGAGGAGTGTTTAATTGGCAATTTTAT
TATCAATTTCTGCCTTTAAGACCAGGTGATCAAGATGATGAAACAGAACCTTTTCAATCA
CCAGTCATGATGGGATGTGTTTTTGCTATTTCTGCAAAATTCTTTTGGGAATTAGGTGGT
TACGATCCAGGTCTTTCAGTTTGGGGTGGTGAGCAATACGAATTAAGTTTTAAAACTTGG
CTATGTCATGGTCAACAACTTGATGCACCATGTTCTCGTGTTGGTCATCTTTATCGACCT
CGTCCATTTCTTGAAAATCTCAATGACACAAATTATTGCCATAGAAATTATAAAAGAGTT
GCTGAAGTTTGGATGGATGAATATGCACATCATGTTTATGATAAAGATCCAGAAGAATGG
TATCCACTTGAAATTGGTGATGTTTCATACATGAAAAGTATCAAGAAGAAATTAAATTGC
AAACCATTCAAATATTTCTTAGAAGTTGTCGCTCCTGATATGCTTGATAAATTTCCACCT
TTTGAGCCTGTTGTTTTTGCTTCTGGTGCTATTCAAAGTCTAGCATATCCAAAATATTGC
ATTGACACACTCGGATCATCAGAAGGTGAACCAATTGGTCTCTATTCTTGCAAATCTGAA
AATCTTACAGAATTTGAATTTCGTCAATATTTTATACTTAGACAACATCGTGATATTCTA
GTTGAAAATTCAAACAATGAATGTTTTGATGCCAATTATGAAAAAGTTTCAATTTTTCAT
TGTAAATTTACACAAGACAATCAATATTTTCGTTATGATGTTGATACACAGCAAATTATT
GTAGGACCAAAAAGAAAAAATAAATGTATGGATTTAAGTGAATCTAAAACAATAATTATT
GCTGCTTGTGATGCTGAAAAAATAACACAAAAATTCCATTAG

>g15214.t1 Gene=g15214 Length=593
MSIRRIVVKINFLILKLLRTRGGISILLLWLLLSFLLIFSIVLHINNDPFKVDKPFIFIE
PITSYYRFHHSNFKRDWHDYELIKREALRTGPGEQGRGVYIPLEEQELASKIYVENKHNG
LASDKIARDRSLPDTRPSACRNRTYLNELPSVSVIIPFHNEVLSTLTRTIHSVFNRSPPE
LLTEVILVNDHSDKEHCYGELEEYIKEFFDPSKIRLLVMDRRYGLMWARLAGARAAIGDV
LVFMDCHTEANVNWLPPLIEPIALNYRTCVCPYIDVISARDYSYMGIGQGSRGVFNWQFY
YQFLPLRPGDQDDETEPFQSPVMMGCVFAISAKFFWELGGYDPGLSVWGGEQYELSFKTW
LCHGQQLDAPCSRVGHLYRPRPFLENLNDTNYCHRNYKRVAEVWMDEYAHHVYDKDPEEW
YPLEIGDVSYMKSIKKKLNCKPFKYFLEVVAPDMLDKFPPFEPVVFASGAIQSLAYPKYC
IDTLGSSEGEPIGLYSCKSENLTEFEFRQYFILRQHRDILVENSNNECFDANYEKVSIFH
CKFTQDNQYFRYDVDTQQIIVGPKRKNKCMDLSESKTIIIAACDAEKITQKFH
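The two records are consistent with each other: 1782 nt is 593 codons plus one stop codon. A minimal sketch of that check follows, assuming the standard genetic code (the page does not say which translation table was used); only the first and last few codons of the CDS are included here to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
# Assumption: the standard table applies to this transcript.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGTCAATTAGACGAATTGTAGTGAAAATT"  # first 30 nt of the CDS above
cds_end = "AAAATAACACAAAAATTCCATTAG"          # last 24 nt of the CDS above

print(translate(cds_start))  # compare with the start of the protein record
print(translate(cds_end))    # compare with the end of the protein record
print((1782 - 3) // 3)       # codons excluding the stop
```

Passing the full nucleotide sequence to `translate` should reproduce the whole protein record in the same way.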

Protein features from InterProScan

Row Transcript Database ID Name Start End E-value
13 g15214.t1 CDD cd02510 pp-GalNAc-T 153 451 2.99003E-149
12 g15214.t1 CDD cd00161 RICIN 469 592 7.17051E-16
8 g15214.t1 Gene3D G3DSA:3.90.550.10 Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A 72 461 7.8E-147
7 g15214.t1 Gene3D G3DSA:2.80.10.50 - 464 593 7.6E-20
3 g15214.t1 PANTHER PTHR11675:SF41 POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE 10 70 592 1.6E-149
4 g15214.t1 PANTHER PTHR11675 N-ACETYLGALACTOSAMINYLTRANSFERASE 70 592 1.6E-149
2 g15214.t1 Pfam PF00535 Glycosyl transferase family 2 153 338 5.3E-27
1 g15214.t1 Pfam PF00652 Ricin-type beta-trefoil lectin domain 468 592 3.3E-19
9 g15214.t1 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 1 20 -
11 g15214.t1 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 21 45 -
10 g15214.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 46 593 -
16 g15214.t1 ProSiteProfiles PS50231 Lectin domain of ricin B chain profile. 467 593 16.767
15 g15214.t1 SMART SM00458 ricin_3 468 591 4.5E-12
6 g15214.t1 SUPERFAMILY SSF53448 Nucleotide-diphospho-sugar transferases 129 453 3.66E-55
5 g15214.t1 SUPERFAMILY SSF50370 Ricin B-like lectins 460 592 6.49E-18
14 g15214.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 21 43 -

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 predicts a disordered region.
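As a rough illustration of how that threshold turns per-residue scores into regions, the sketch below collects runs of consecutive residues scoring above 0.5. The function name and the example scores are invented for illustration; they are not IUPred3 output for this transcript.

```python
def disordered_regions(scores, threshold=0.5):
    """Return (start, end) pairs, 1-based and inclusive, for runs of
    consecutive residues whose score is strictly above the threshold."""
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos  # a new run begins here
        elif start is not None:
            regions.append((start, pos - 1))  # the run ended at the previous residue
            start = None
    if start is not None:
        regions.append((start, len(scores)))  # run extends to the last residue
    return regions


# Made-up scores, for illustration only.
example_scores = [0.2, 0.6, 0.7, 0.4, 0.5, 0.9]
print(disordered_regions(example_scores))
```

Note that a score of exactly 0.5 is not counted as disordered here, following the "above 0.5" wording.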

GO terms from InterProScan

There are no GO annotations for this transcript.

KEGG

Orthology

This gene did not have any KEGG ortholog annotations (KAAS, GHOSTZ).

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean ± standard deviation.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. A transcript was called differentially expressed when (1) FDR < 0.05 and (2) fold change > 2 (TPM calculated by RSEM). DE status and fold change between conditions are shown in the plot below.
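The two criteria can be expressed as a simple filter. This is a sketch only: it assumes the fold-change cutoff applies in both directions (at least 2-fold up or down), which the text does not state, and the function and parameter names are hypothetical rather than part of Trinity or DESeq2.

```python
import math


def is_differentially_expressed(fdr, fold_change, fdr_cutoff=0.05, fc_cutoff=2.0):
    """Apply both criteria: FDR below the cutoff and fold change beyond it.

    Assumption: the fold-change cutoff is symmetric, so a ratio of 0.25
    (4-fold down) passes just like a ratio of 4.0 (4-fold up).
    """
    if fold_change <= 0:
        raise ValueError("fold_change must be a positive ratio")
    return fdr < fdr_cutoff and abs(math.log2(fold_change)) > math.log2(fc_cutoff)


print(is_differentially_expressed(0.01, 4.0))   # strong up-regulation, low FDR
print(is_differentially_expressed(0.01, 0.25))  # strong down-regulation, low FDR
print(is_differentially_expressed(0.01, 1.5))   # fold change too small
print(is_differentially_expressed(0.20, 8.0))   # FDR too high
```

Both comparisons are strict, so a transcript with a fold change of exactly 2 or an FDR of exactly 0.05 is not called differentially expressed.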

Raw TPM values