Gene loci information

Transcript annotation

  • This transcript has been annotated as Transcription-associated protein 1.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g4787 g4787.t1 TSS g4787.t1 4964358 4964358
chr_2 g4787 g4787.t1 isoform g4787.t1 4964773 4976704
chr_2 g4787 g4787.t1 exon g4787.t1.exon1 4964773 4964841
chr_2 g4787 g4787.t1 cds g4787.t1.CDS1 4964773 4964841
chr_2 g4787 g4787.t1 exon g4787.t1.exon2 4964896 4965021
chr_2 g4787 g4787.t1 cds g4787.t1.CDS2 4964896 4965021
chr_2 g4787 g4787.t1 exon g4787.t1.exon3 4965142 4967747
chr_2 g4787 g4787.t1 cds g4787.t1.CDS3 4965142 4967747
chr_2 g4787 g4787.t1 exon g4787.t1.exon4 4967804 4967871
chr_2 g4787 g4787.t1 cds g4787.t1.CDS4 4967804 4967871
chr_2 g4787 g4787.t1 exon g4787.t1.exon5 4967930 4969252
chr_2 g4787 g4787.t1 cds g4787.t1.CDS5 4967930 4969252
chr_2 g4787 g4787.t1 exon g4787.t1.exon6 4969314 4975302
chr_2 g4787 g4787.t1 cds g4787.t1.CDS6 4969314 4975302
chr_2 g4787 g4787.t1 exon g4787.t1.exon7 4975361 4975562
chr_2 g4787 g4787.t1 cds g4787.t1.CDS7 4975361 4975562
chr_2 g4787 g4787.t1 exon g4787.t1.exon8 4975620 4976342
chr_2 g4787 g4787.t1 cds g4787.t1.CDS8 4975620 4976342
chr_2 g4787 g4787.t1 exon g4787.t1.exon9 4976402 4976704
chr_2 g4787 g4787.t1 cds g4787.t1.CDS9 4976402 4976704
chr_2 g4787 g4787.t1 TTS g4787.t1 NA NA

Sequences

>g4787.t1 Gene=g4787 Length=11409
ATGAAAACCTTCTTAAAAGTATTACAAGAAGGAGAATGCCATTTCTTTGCTGATTTCAAT
ATACAACAGATGAGGAAGCTGATTTTGGAAATGATTCATCGTTTGCCAGTCTCTGAAGTT
GTTCGTCCTTATGTTAAAAGCATATTAACACTTATGCTAAAGCTATTGAAAACTGACAAT
GAAGATAATGTTTTAATTCAACATTTTATCGCTTATGTACGCAATATTTATTCAGATCTG
CCAAAAAATCTTAACAAAATCTTTGAACCTCGCGATCAAATTAGAGTGAAAGATTTAAAA
GAGATAAATCTTGAACAACTGTTGGGTGAAATTTATTCAGTAACGACTATTTATGTTGAG
AAACAAGAAATTGAAGGGAAAGCTGCAAATTCATTCACTTTGCTTCCTCGTGGTGTAAAT
TCTCTTAAAGTTCTGCAAGAATTGCCAATAATTTTGGTTCTGATGTATCAAATTTACAAG
CAAAGTGTGCATCAAGAAATTGTAGAATTTATTCCATTAATTATGACAACAATTGCATTA
CAACCATCACCAGTACATCGTGCAATGCCAAATTTTAATCGAGAAATCTTTGTTGATTTC
ATGAGTGCTCAAATCAAAGCACTTTCATTTTTGGCTTATATCGTTCGAACATTTTTGGCG
CAATTTCAAGAAGTAATTAATGCTCATGCAACACAAATGGTCGATGGCATGATTTCACTT
CTTCGTTTATGTCCAATGGAAGTTGCAAGCACAAGAAAAGAATTACTTGTTGCTACTCGA
CATATTTTAGCAACTGATTTAAGGAACAAATTTGTCAGTTCTATAGAATTGTTATTTGAT
GAAGATTTACTTCTTGGTAGATGCTCATATAGCAATGAATCATTAAAACCACTTGCATAT
TCTACACTTGCTGACTTAATTCATCATGTTCGTCAACATCTTAAACTTGATATCCTTACT
AAAGCAGTATATCTCTTCTCAAAAAATGTTCATGATGAATCTCTTCCAACATCGATTCAA
ACAATGTCTTGCAAACTTCTTCTCAATTTGGTTGATTGCATTCGAGTACAACGAAATGAC
GATTCAACGCCATCACCAACACCAAGAGAACTTTTGATGACTATGCTCAAAGTTTTTACT
CTCAAATTTCAAACAATAGCAAAACTTCAATTGCCATTAATTATGCAGAAATGGAAGAAT
CTTAACACACAAGCACAAACGAGCACTGTTGATTTTAATAAAGAAATGATTGGAATTGAT
GTCTCACCAGAGAGTATAAATAAATTAACATCTATTGGTTTTCCGACAGTTAATACATTG
AATGTTTGTGAATATAAGAGTTTAGTAAAAACTTTAATATGTGGAGTAAAAACAATCACT
TTTGGAATCAATCTTACGGACAATACTCAACAAAAAATGCCCGTTACGTTTCAACCAGAA
GAACTTCTCGTATTTATTGATTTGTTCAAATGGGCATTGGAAGCGCTTGATATTTATATG
ATAAACATTCCTTCTCCAGGAACACCAGCAGGAGTGCAACAGAAAACAATCACACAATTA
CCAAGATCGAAAGATGAAAAAGAATTGCTCGAACATTTTAGTGGGCTTTTCCTTACAATG
TCACCACAAAATTTTCAAGAGATTTTCTCATCGACAATTGATTTTATGGTAGATCGTATT
TCTCAAAATGTTGCATTACAAGTTATCGCAAATTCTTTCCTTGCATCACCAACAACTTCA
CCATTATTCGCAACAGTTCTGATTGAATATTTGTTGGAAAGAATGGAAGAAATGGGCTCA
AATATTGAAAGATCGAATTTATATTTAAGACTTTTTAAACTCGTCTTTGGAAGTGTATCA
TTATTTGCAAATCAAAATGAACAAATGCTAAGGCCACATTTACATAATATCGTCAATCGA
TCAATGGATTTAGCAATGACAGCAAAAGAACCTTATAATTATTTCCTTTTACTAAGGGCA
TTATTCAGATCAATTGGAGGGGGAAGTCATGATAAGCTTTATAAAGAATTTCTTCCACTT
TTGCCGAATCTTCTTGAAGGACTCAATCGTCTACAGAGTGGCTATCATAAGCAACACATG
AAAGATTTATTTGTTGAATTGTGTTTGACTGTGCCAGTTCGATTATCTTCGCTTTTGCCC
TATCTTCCAATGTTAATGGATCCACTTGTCTCTGCTCTTAACGGCTCACCAACACTCGTA
AATCAAGGATTACGCACATTAGAACTATGTGTTGATAATCTTCAACCTGATTTTCTTTAT
GATCATATTCAACCTGTACGAGCAGAATTAATGCAAGCATTATGGAAAACTTTAAGGAAC
ACTGATACAGCTGCATTAGTTGCATTTAGAGTTCTTGGTAAATTTGGTGGTGGAAATAGA
AAAATGATGATTGAACCGCAACGATTAGAATATAATCAATGCAATAGTAAGGAATCTTCA
TCATCAGCAATTGTTGCATACTTTCAAGAGCAACGAAAACCAATTGACTTTCCTGTTGAT
AAAGTCATTGAAACTGCATTTAATGCTTTGAAAACATCATCAACTGATCCTTATTATTGG
GCACAAAGTTGGGAAATCATTCGTTGCTATTTGTCAGCTTCAATTTGCTTATCAGATGAG
AAACACATTTTGCAAAAACTTTTTACTCATCCAAGTTTCACTGAAGGACCAGTATTGAAT
ATTAACTTGACATCATCTCATTTTATTGAGAATCAAGCAAGAAAAACTCATCAAACAGCA
CTCACTGCTATGTTTGTCGCTGCTGCTACAAAAGAATTGAGACAATCTGTGCTACCAGTG
TTTGTCGATGTTGTAAGACATTATACGATGGTTGCTATTGCTCAACAAGCTGGGCCGTTT
CCTCTCAAACAAAATCAGCAATACAACAATATCGATCCGCTTGTTTTAATTGATGCTCTT
GCAACAATTATGGGACATGAAGAGAAAGAACTCTGCAAAGCAGCTAATTTTGCGATGATT
TTAATAATCAAAACAGCAACAACTATTATGGGTTCAAAAGAACGTGCATGTCGCCTTGCT
ATGATGCAGTATCTGGCTGAAAAAATGTCAAATTTATGCTACGAACGTCCTTGGTATGCA
AAACTTGGTGGTGTTATTGCATTAAAATTTTTGTGTGAACACATGTCAATGAGATGGCTC
TTTCAACATTTATACACATTCCTCAAAGCATATCTCTTTGTTATTATGGATTTAACTGGT
GAAGTTTCAAGTGGAGCGATTTCAATGGCTCGTGAGTATTTGGCCAATATGTTAAACATT
TGCATGGTTCCACTCACTGATAACAGTAATGAAGAGCTTGTTGCATTACAAAGGAAAGCA
ACGTTCGATGTAACACATGAACTTGTACGTAGAATTACTTCACCTCATCAGTTAGTAAGA
GAAACTGCAATGGATTCATTAAAACAAATTGCAACTTTACAAAAAATCACTGTTACTGAA
GTTATGATGCCACATAAAGAAGTTCTTGAAGATATTGTGCCACCAAAAAAGCATTTATTG
AAGCATCAACCGGCAGGTGCACAAATCGGTTTAATGGATGCATTCTCATTTACATCAATG
GAGCCAAGATTATTTACACTTGATCTCAATTTAGTAACACATAAATTCTTCTTTCATGAA
GTTATTACTCTTGCAGAAGTTGATGATGTATTTCTCAGTAAATTGGATTGCTTCAAAAAC
ATTAATAATTTGATCCCATTGAGAAAAAGTGCATTGAGAGCATTAGCAAGTTGTCATTAT
TTAGGACAAGAACGTAAAAATGTTAAAGAAAAAATTTTCACAATTTTATACAAAGCACTT
GAAAAGAGCTCAGAGTTACAAGAAACAGCATTTGAATGCATGCAAAAGTTCAATTTAGGA
AGTACAGCTGAGAAAGAATTTTTGCTACAGAATTTACGACCACTTTTATTGGCTCTTGGG
GATCATAGAAATTTGACAATAAACATCGTCAAAAGATTATCATATTTAACACAACTGTTT
CCTGCAATGTTCAATGAAAAATTATGCGATCAACTATTAGAAATAATCAAGAAAATGCTT
CAATCATCTGTTACTGCAAATAAAAATCAAAACTTTTTAAAAGTCTCTAAAACTGGAGAG
ACTGAAGTTAAAATTGCTACAATTATTGAAATATTCCACCAAATTCCTGCTGCATCAGCG
AAATTTGCTCCAAGTTTAATTTCATTGGTTCTTGCTGCTGAGAAGGAGATAATGATTGAG
CCATCGAGTCCATATCGTGCACCATTAATTAAATTTCTGAAACGTTATCCAGAAGACACA
GTTGAAATATTTTTGAATGAGATTAACATAAAAAATCCGCAATACAACAGATTCCTCATT
TATCTCTTAAAACACAAAGATGGTGCAGCATTCGTAAGCGTAATAGAAAAGAAGGAAGAC
CGATTAGTTGAGTTAATATTGAAAGAGAAGACTAATCAAACGTCAGTATTACCCGAATAT
ACCCCAGAGGATGAAAACGAAGCTCAACATCAAGCAATTTTGTTGATTTACACTCTCACA
GAAATCACAGAAAAATGGATTGATTCACAAGTAAATATAATCAATGGTTTGACACTCATT
TGGAGTCGTGAATTAGGCAGTTCACTTACAAGCGAAAATATTACGTGTGATTTGTGGCAT
TTAGTTGGAAAAATATTACTTCGATATTTTGAAGGAAATCCTAGCAATATCAAACTCTTA
TATGAGCTTCTTAAAGCATTTTGTGTACGTTTCATACCAGACTTTCAATTTCTACGTGAT
TTCATACAAAACACAGTTTGTCAAACTTACAGCGTTGAATGGAAACGAAATGCTTTCTTC
CTATTTGTGGAATTCAATAAAAACCCCATGATTTCAGTAGACTTGAAAGTAAAAATTGTC
ACGATGATTATTATTCCAAGTTTTGCTGTCAGCTTTGATAATGGTCATGGTGAAGAATTG
ATAGGAGCCCCAGCTGCTATAATTCAAAATCCACAACAACCGATTGCAACATTAAGCATG
ACAAGTCTTTCTTCTAGCACATCAGCCGATGCTGATGATGAAAACATTGTAAGCGTATTT
GTTAACAAAGTAATTGAACCAGAAAAATCAATCGACGATGATGATGATGCACTGAGAATA
GCTTTACTTCAATTTGCATGTCTTCTTGTTGAACGTTCAGCTGGCCATATTCATGATGGA
TCAGGGACAAGCACAAATAAAAAAGAAGGGAATAAATTGAGACGACTTATGACTTATGCA
TGGCCATGTTTATTAGCAAAGAATTATGTCGATCCATCAGCAAGATACCATGGACATCTT
TTACTTGCACATATTATCTCAAAATTGGCTATCAATAAAAGAATTGTGTTGCAAGTTTTT
CACTCTTTATTGAAAGCACATGCTGTTGATGCACGAAATTTGGTTAGACAAGCTCTTGAA
ATAATAACACCAGCACTTCCATTCAGAATGGATGATGGAAATCAAATGCTGACTCATTGG
ACAAGAAAAATAATTTTAGAAGAAGGACACTCGATGCAACAATTGAATCATATTTTGCAA
TTGATTGTAAGACACTATAAAGTTTATTATCCTGTTCGTCATCAACTAGTGCAGCAAATG
ATTTCTTCAATGAATAGGCTCGGATTCTCTCCAAATAGTTCAGTTGATTATAGAAAATTA
GCTGTTGAATTAGCTGAAGTTGTCATCAAATGGGAATTACAAAGAATTCGTGAAGAAACA
GATGGGCCATCTGATGAAGATCCTTTATTGACAATTACTGAGAAAACAACAGCAACAGGT
GCTGTCAAGCGATCTATTCAAGATGATGACGGAAGAAAAAAGCCGAGCGTAAGTGAACCT
GCAACTTCAACAACTTCAATAGTTCCACCACATCAACGAAATGAAGACTACTATAGACCA
ATTGATCGAGTTCATTGTGATGCCGTGCTTAATTTCTTATTTCGTTTATCATGTCAGTTA
GGTGATCTTCAACCTCAAATTCCTGGCATTCAATCACCAAGCGAAGGACTTTCAAGGCGT
TGTGTAATGTTAATCAAACGTGTAATGGATCAAAATGTTTGGCTATCTAATTGTGATTTG
AAATTGCAATGGCTTGATAAAATTTTCATCTCAATTTCAACTGAACCAACTCAAGCAAAC
ATTGGAAATATTTGTACAGCTCTTGAACTTCTGACATATTTATTAGGTTGTATGCGAAGA
GATCAGCTTTTAGCTATATTCAGACCACTTCAACGAGGATTATCAGTTTGTGTTACAAAT
CATAATGCAAAAATCATTAAAATGATGCATGCATTGCTCACTAAATTGATGTCAATTTTT
TCACCTGAACATAAACATAGCAGTGAAGATCATCTCGATCAACTTTATGCTACAATAAGT
AAGACAATTTTTGATGGGTTAACACAATTTGAAAAGAATCCGAAAGCAAATCCAAGTTCT
TTATTTGGTACACTGATGATACTCAAAGCAGCTTGTACAAATAATCAATCCTATATTGAT
CGTCTAATTGTTCCATTTATGCATGTACTTGATCGATTAAAGAAAGAACATATTGGACCA
ATAAATCAACAATCTCAACCAACTCAGCAGCAACAGCAAATACAAGGACAACAGCAGCTA
CAACAAGGGCAATCGAATTTAAGCAACTCTGAAATCACACTTGAACTCTTGATTTTGTCA
TTAGATTTAGTGAAGAATCGAGTGATTGTGATGGGTGTTGATTTGCGAAAAATGTTTATT
GGAACTATCCTTGTTGGTTTGATTGAAAAATCCACTGATCCAAAAGTTATTCGTGCCATC
ATAAAAATGATTGAAGAATGGATGAAAAACAAAAATACACCAGTCACTGTTTTGCAAGCA
CCAACACTTAGAGAAAAATCTATTTTACTCGTCAAATTGATGCATTATGTTGAAAAAAGA
TTTTCTGATGATGCGGAATTGAATGCACAATTTTTGGAGCTTGTTAATTTTATTTATCGG
GATGATCAATTAAAAGTCAGTGAATTGACATCAAAACTCGAACAAGCTTTCCTCGCTGGT
CTTCGATGTTCGCAGCCTCACATTCGAAGCAAATTCTTTGAAGTATTTGATGGTTCAATG
AGAAGAAAATTGCATGATCGATTACTTTATATTGTTTGTTCGCATTCATGGGACTCGATT
CAACATCATTATTGGATAAAACAATGCATTGAGCTTCTTTTGCTTACTACTAACACTACT
CATTCACAAATTAAGAATTCCAATGAAAACCATTTATTGCCAAGTATTACATCAGTAATT
AATCAAGCTGATTCAGAGGAAAAGAACAATTTTGTCATTTACACATCACTTCAACCAGAA
CAACATGAAATGAACTTGACTGAAGATAAAGAAGATGCAATTGATATGGAAATGAATGTT
GACAATAATATCATTAAGCGAGAAGAGAGTGAACGTCCAGTAGAAAATCGAATGGCTGTT
TTATGCAAACTTTTAAATCGAGAAAGTGAATTTTTAGATAATTCAAGAAAAATCAAAACT
GAACAATTTTTAATTGCAAGCTCTCAATTATGTCATATGGACACACAACTAGCAGAATCT
GTTTGGTTGAATGTTTTTCCTAGATTATGGTCAATATTGGATGAAGGTCAACAACAATCG
CTAGATCGAGAATTTGTAGCTTTTCTCTCATCTGGGACACATGTTATTCAAAAAGATTGT
AATCCAAGCGCAATAAATACTTTTGTCGAAGCACTCACTCAATGTCAACCACCAATTTAC
ATTCCTCCGAGTTTAATGACATATTTAGGAAAGTCACATAATTTATGGCACAGCATGACA
TTGCAACTCGAAAATATGGCATTAGAATGGCCAAATCGAAAAGATGCAATAAATGGTAAT
GAATATCGTAGTGAATATGATTTTGATGTCGAATTGCAACAAGAAATTGCTAGTTCAAAT
AAATCAGACACTGTTTTGGCAGTTTTAGAACCTCTATCACAAATGTATTCTGCTTTATAT
GAAGAAGATTTATGGGCGGGATTGTGGCAGAAACATGCAAAATATTCTGAAACCAATGCA
GCTATTTTTTACGAACAAATGGGTCATTTTGATGAAGCACAAAAAATGTATGAAACTGCC
ATGACTAAATTTAAACAAGATGCTCAAGCAGGAAATATTACAAGCGATATGAATAGTGAA
TTATTGCTCTGGGAAAATCATTGGATAAGGTGTACAAAAGAGTTGAACGAGTGGCAAATT
TTACTCGATTATGGTCAATCAAATAAGGATAAGCATGCATTTTTGATAATGGATAGTGCA
TGGAGAGTTCCCGATTGGAATTTAATGAAACAAGCTTTGCTTAAGGTTGAGCAGACAAGT
TCAAAACAGATGGGTCATAAAGTGAGTCTCTACAGAGGATACTTGGCCATTTTGAATCAA
GAAGATGCACAACATATTCCATCAGTTGAAAAATATGTTGAAATTTCATCAGCACTTTGT
ATGAGAGAATGGCGAAGATTACCGAAAGTAGTTTCTCACATTCATTTACCGATTCTTCAG
TCAGCACAGCAAATAATGGAATTACAAGAAGCAAGCCAAATTCATCAACAACCTTTGAAA
GTTCCTGATATTAAAGCAGTTGTAAAGACATGGAGAAATCGATTGCCAGTTATTTCGGAT
GATTTATCACATTGGAGTGATATATTTACATGGAGACAACATCATTATCAAATAATTACT
GATAGTATGGCAAATTGTGATTTAGGTCCGCATGCATCGGCACAAACAATTATTCAATTT
GGAAAAGTCGCAAGAAAACAAAATCTCACAACTGTATGTCATAATTTACTTTCAAGAATT
CACACAATTCCTACTGTGCCAGTCATCGATTGTTTCCAAAAAATTCGACAAGAAGTGAAG
TGTTTTATACAAATGGCAAATAATAATCAAGATCAAAGTTCATTGTCTGAAGCATTAGAA
GTGATTGAGACAACAAACATTAAATATTTCAAACCTGAACTTGTCGCCGAAATTTATGCA
TTGAAAGGAAATTTATTGAAACAAATGTCTAAATCAGATGAAGCAAATAAAGCATTTAGT
GCAGCTGTTCAACTATCAGATGTTTTACCATCAGCATGGGGAATGTGGGGTAATTATCTC
GAAGGAAATTTTGCTCGAGATCCAAAAAATGCTCAACTAGGTCTATCTGCTATGGTTTGT
TTCTTACACGCTTGTCGTCAACAAAATGAATCAAAGACGAGAAAATATTTTGCAAAAATA
ATTTGGCTTCTAACTTATGAAGAAAAACGTTCTGAAATGTTAGAACTTATTGATAAATAT
GGACAAGCGATTCCACCAATTCAATGGCTTCCATGGATACCTCAATTATTAAATTGCCTC
ATACAATACGAAGGTGATATCATTATGAATCTTTTGAGTAATATTGCAAGAAGTTTCCCA
CAAGCTGTTTACTTTCCAATTCGAACACTATATCTTATGCTAAAAATTGAACAACGTGAA
CGATACAAGAGTGTAGAACAAGCAATGGCCAAAAATCAACAGCAAGCTTCTGTTGATCAG
CAAACAAGTTCGCAAGACTCAACAGGTGGATCAAGTATTCCTTCTACAACATCAAATTTG
AGTTCTCAATCAATTCAAATCAAAGCAAGTCCACCAATGTATCGATGTTCAAAAATAATG
CATATGCAACGAGATATTCATCCTACAACTTTATCATCACTAGAAGGCATCGTCGATCAA
ATGGTATGGTTCCGAGAAAATTGGTATGAGGAAGTTCTTCGTCAATTAAAACAGGGATTA
GCTAAATGCTATGGAATTGCATTTGACAATAGAGGTGCTGTCAATGAGGCAAAAATAACT
CCACATACACTCAATTTTGTTCGCAAACTTGTTTCAACATTTGGAATCGGTATTGAAAAT
ATCTCAAATAATACAAACATTATGGCTGGTTCTTCAGCATCTGAATCATTGGCAAGAAGA
GCACAAGCAACTGTTCAAGATCCAGTTTTCCAAAAATTGAAAGGACAATTTACAAGTGAT
TTTGACTTTTCACAACCAGATGCAACTAAAATGCAAAATCTCATTACAAAATTAAAGATT
TGGATAAAAATTTTAGATGTTAAAACGAAGCAATTACCAAAATCATTCTTGATAGAAGAA
AAATGCAGATTTTTATCAAATTTCAATCAAAAAACAGCAGAGGTAGATTTGCCAGGAGAA
CTTTTACTTCCAAGACATTCACACTATCACGTTAAAATTGCCAAATTCATGCCTCGAGTT
GAAATTGTACAGAAGCATAACACATCAGCAAGACGACTTTTTATTCAAGGAACTAACGGA
AAGATTTATCCATATCTTGTGGTAAATGATTCGGGATTGGCTGATGCAAGACGAGAAGAA
AGAGTACTTCAACTACTTAGAATGCTTAATAGTTATTTGACAAAATACAAAGAAACTAGC
AAACGATTCCTTCATATTACAGTTCCACGAGTTGTTGCTGTTTCACCACAAATGCGATTA
GTGGAAGATGATCCATCAAGTATTTCATTATTAGATGTTCTCAAAACATATTGCTCAAGA
CTCAATATTGAATATGATTCACCAATAACGAGGTATTATGAGAGATTAGCTGAAATACAA
CAAAGAGGAGTACAAACAACGCATGCAACTTTGAGAGACATCTTTAAAGAGATTCAAATT
AATATGGTTCCAAAAACTGTACTCAAAGATTGGGCATCAAAGACATTCATGTCATCAACT
GATTACTGGACATTTAGAAAAATGTTTACGCTTCAATTATCACTTTGGTCAATTGTTGAA
TATGCTTTTCTCTTGACACGACTTAATCCTGATATGATGTACTTACATCAAGATTCAGGT
TTAATTAATGTCTCATATTTCAAATTTGATCTTGATGATATTAATGGAGAAATGAATCCG
AAAAGACCAGTGCCATTCCGTTTAACTCCAAATATTGCAGAATTTATCACAAATATGGGC
ATCTTAGGTCCGCTTTCAGCTTCAATAATGGCAACAGCTAGATGCTTTTTACAACCAAAT
TATCAGATTCCATCAATTCTCAAAACAATTTTATGTGATGAAATCATTACTATTCATCGT
AAGCACATCATTAATACAAAACCGATCGATCCAAATGAAGATTTATCGCAGGATAAAACA
TTTAGTGAAATCAATGTTGAGAATGTTATTCAAACGGTCAATAATGGAGTCAATCAAATA
ACTGAACGTCTCAATCATTTATCAAATTACGATTATAATGAAGGGAATAAAGTATCCCAA
CTCATTCAAATGGCTCGTAATCCTGATTCTTTGTGTCGTATGGATCCAATTTGGTTCCCT
TGGATGTAA

>g4787.t1 Gene=g4787 Length=3802
MKTFLKVLQEGECHFFADFNIQQMRKLILEMIHRLPVSEVVRPYVKSILTLMLKLLKTDN
EDNVLIQHFIAYVRNIYSDLPKNLNKIFEPRDQIRVKDLKEINLEQLLGEIYSVTTIYVE
KQEIEGKAANSFTLLPRGVNSLKVLQELPIILVLMYQIYKQSVHQEIVEFIPLIMTTIAL
QPSPVHRAMPNFNREIFVDFMSAQIKALSFLAYIVRTFLAQFQEVINAHATQMVDGMISL
LRLCPMEVASTRKELLVATRHILATDLRNKFVSSIELLFDEDLLLGRCSYSNESLKPLAY
STLADLIHHVRQHLKLDILTKAVYLFSKNVHDESLPTSIQTMSCKLLLNLVDCIRVQRND
DSTPSPTPRELLMTMLKVFTLKFQTIAKLQLPLIMQKWKNLNTQAQTSTVDFNKEMIGID
VSPESINKLTSIGFPTVNTLNVCEYKSLVKTLICGVKTITFGINLTDNTQQKMPVTFQPE
ELLVFIDLFKWALEALDIYMINIPSPGTPAGVQQKTITQLPRSKDEKELLEHFSGLFLTM
SPQNFQEIFSSTIDFMVDRISQNVALQVIANSFLASPTTSPLFATVLIEYLLERMEEMGS
NIERSNLYLRLFKLVFGSVSLFANQNEQMLRPHLHNIVNRSMDLAMTAKEPYNYFLLLRA
LFRSIGGGSHDKLYKEFLPLLPNLLEGLNRLQSGYHKQHMKDLFVELCLTVPVRLSSLLP
YLPMLMDPLVSALNGSPTLVNQGLRTLELCVDNLQPDFLYDHIQPVRAELMQALWKTLRN
TDTAALVAFRVLGKFGGGNRKMMIEPQRLEYNQCNSKESSSSAIVAYFQEQRKPIDFPVD
KVIETAFNALKTSSTDPYYWAQSWEIIRCYLSASICLSDEKHILQKLFTHPSFTEGPVLN
INLTSSHFIENQARKTHQTALTAMFVAAATKELRQSVLPVFVDVVRHYTMVAIAQQAGPF
PLKQNQQYNNIDPLVLIDALATIMGHEEKELCKAANFAMILIIKTATTIMGSKERACRLA
MMQYLAEKMSNLCYERPWYAKLGGVIALKFLCEHMSMRWLFQHLYTFLKAYLFVIMDLTG
EVSSGAISMAREYLANMLNICMVPLTDNSNEELVALQRKATFDVTHELVRRITSPHQLVR
ETAMDSLKQIATLQKITVTEVMMPHKEVLEDIVPPKKHLLKHQPAGAQIGLMDAFSFTSM
EPRLFTLDLNLVTHKFFFHEVITLAEVDDVFLSKLDCFKNINNLIPLRKSALRALASCHY
LGQERKNVKEKIFTILYKALEKSSELQETAFECMQKFNLGSTAEKEFLLQNLRPLLLALG
DHRNLTINIVKRLSYLTQLFPAMFNEKLCDQLLEIIKKMLQSSVTANKNQNFLKVSKTGE
TEVKIATIIEIFHQIPAASAKFAPSLISLVLAAEKEIMIEPSSPYRAPLIKFLKRYPEDT
VEIFLNEINIKNPQYNRFLIYLLKHKDGAAFVSVIEKKEDRLVELILKEKTNQTSVLPEY
TPEDENEAQHQAILLIYTLTEITEKWIDSQVNIINGLTLIWSRELGSSLTSENITCDLWH
LVGKILLRYFEGNPSNIKLLYELLKAFCVRFIPDFQFLRDFIQNTVCQTYSVEWKRNAFF
LFVEFNKNPMISVDLKVKIVTMIIIPSFAVSFDNGHGEELIGAPAAIIQNPQQPIATLSM
TSLSSSTSADADDENIVSVFVNKVIEPEKSIDDDDDALRIALLQFACLLVERSAGHIHDG
SGTSTNKKEGNKLRRLMTYAWPCLLAKNYVDPSARYHGHLLLAHIISKLAINKRIVLQVF
HSLLKAHAVDARNLVRQALEIITPALPFRMDDGNQMLTHWTRKIILEEGHSMQQLNHILQ
LIVRHYKVYYPVRHQLVQQMISSMNRLGFSPNSSVDYRKLAVELAEVVIKWELQRIREET
DGPSDEDPLLTITEKTTATGAVKRSIQDDDGRKKPSVSEPATSTTSIVPPHQRNEDYYRP
IDRVHCDAVLNFLFRLSCQLGDLQPQIPGIQSPSEGLSRRCVMLIKRVMDQNVWLSNCDL
KLQWLDKIFISISTEPTQANIGNICTALELLTYLLGCMRRDQLLAIFRPLQRGLSVCVTN
HNAKIIKMMHALLTKLMSIFSPEHKHSSEDHLDQLYATISKTIFDGLTQFEKNPKANPSS
LFGTLMILKAACTNNQSYIDRLIVPFMHVLDRLKKEHIGPINQQSQPTQQQQQIQGQQQL
QQGQSNLSNSEITLELLILSLDLVKNRVIVMGVDLRKMFIGTILVGLIEKSTDPKVIRAI
IKMIEEWMKNKNTPVTVLQAPTLREKSILLVKLMHYVEKRFSDDAELNAQFLELVNFIYR
DDQLKVSELTSKLEQAFLAGLRCSQPHIRSKFFEVFDGSMRRKLHDRLLYIVCSHSWDSI
QHHYWIKQCIELLLLTTNTTHSQIKNSNENHLLPSITSVINQADSEEKNNFVIYTSLQPE
QHEMNLTEDKEDAIDMEMNVDNNIIKREESERPVENRMAVLCKLLNRESEFLDNSRKIKT
EQFLIASSQLCHMDTQLAESVWLNVFPRLWSILDEGQQQSLDREFVAFLSSGTHVIQKDC
NPSAINTFVEALTQCQPPIYIPPSLMTYLGKSHNLWHSMTLQLENMALEWPNRKDAINGN
EYRSEYDFDVELQQEIASSNKSDTVLAVLEPLSQMYSALYEEDLWAGLWQKHAKYSETNA
AIFYEQMGHFDEAQKMYETAMTKFKQDAQAGNITSDMNSELLLWENHWIRCTKELNEWQI
LLDYGQSNKDKHAFLIMDSAWRVPDWNLMKQALLKVEQTSSKQMGHKVSLYRGYLAILNQ
EDAQHIPSVEKYVEISSALCMREWRRLPKVVSHIHLPILQSAQQIMELQEASQIHQQPLK
VPDIKAVVKTWRNRLPVISDDLSHWSDIFTWRQHHYQIITDSMANCDLGPHASAQTIIQF
GKVARKQNLTTVCHNLLSRIHTIPTVPVIDCFQKIRQEVKCFIQMANNNQDQSSLSEALE
VIETTNIKYFKPELVAEIYALKGNLLKQMSKSDEANKAFSAAVQLSDVLPSAWGMWGNYL
EGNFARDPKNAQLGLSAMVCFLHACRQQNESKTRKYFAKIIWLLTYEEKRSEMLELIDKY
GQAIPPIQWLPWIPQLLNCLIQYEGDIIMNLLSNIARSFPQAVYFPIRTLYLMLKIEQRE
RYKSVEQAMAKNQQQASVDQQTSSQDSTGGSSIPSTTSNLSSQSIQIKASPPMYRCSKIM
HMQRDIHPTTLSSLEGIVDQMVWFRENWYEEVLRQLKQGLAKCYGIAFDNRGAVNEAKIT
PHTLNFVRKLVSTFGIGIENISNNTNIMAGSSASESLARRAQATVQDPVFQKLKGQFTSD
FDFSQPDATKMQNLITKLKIWIKILDVKTKQLPKSFLIEEKCRFLSNFNQKTAEVDLPGE
LLLPRHSHYHVKIAKFMPRVEIVQKHNTSARRLFIQGTNGKIYPYLVVNDSGLADARREE
RVLQLLRMLNSYLTKYKETSKRFLHITVPRVVAVSPQMRLVEDDPSSISLLDVLKTYCSR
LNIEYDSPITRYYERLAEIQQRGVQTTHATLRDIFKEIQINMVPKTVLKDWASKTFMSST
DYWTFRKMFTLQLSLWSIVEYAFLLTRLNPDMMYLHQDSGLINVSYFKFDLDDINGEMNP
KRPVPFRLTPNIAEFITNMGILGPLSASIMATARCFLQPNYQIPSILKTILCDEIITIHR
KHIINTKPIDPNEDLSQDKTFSEINVENVIQTVNNGVNQITERLNHLSNYDYNEGNKVSQ
LIQMARNPDSLCRMDPIWFPWM

Protein features from InterProScan

Transcript Database ID Name Start End E.value
20 g4787.t1 CDD cd05163 PIKK_TRRAP 3433 3719 1.8984E-105
16 g4787.t1 MobiDBLite mobidb-lite consensus disorder prediction 1940 1972 -
14 g4787.t1 MobiDBLite mobidb-lite consensus disorder prediction 1957 1971 -
13 g4787.t1 MobiDBLite mobidb-lite consensus disorder prediction 2201 2222 -
15 g4787.t1 MobiDBLite mobidb-lite consensus disorder prediction 3193 3225 -
3 g4787.t1 PANTHER PTHR11139:SF1 TRANSFORMATION/TRANSCRIPTION DOMAIN-ASSOCIATED PROTEIN 38 444 0.0
5 g4787.t1 PANTHER PTHR11139 ATAXIA TELANGIECTASIA MUTATED ATM -RELATED 38 444 0.0
4 g4787.t1 PANTHER PTHR11139:SF1 TRANSFORMATION/TRANSCRIPTION DOMAIN-ASSOCIATED PROTEIN 395 3798 0.0
6 g4787.t1 PANTHER PTHR11139 ATAXIA TELANGIECTASIA MUTATED ATM -RELATED 395 3798 0.0
2 g4787.t1 Pfam PF02259 FAT domain 2775 3102 4.7E-59
1 g4787.t1 Pfam PF00454 Phosphatidylinositol 3- and 4-kinase 3475 3716 9.4E-18
18 g4787.t1 ProSiteProfiles PS51189 FAT domain profile. 2605 3173 25.533
17 g4787.t1 ProSiteProfiles PS50290 Phosphatidylinositol 3- and 4-kinases family profile. 3465 3676 12.858
19 g4787.t1 ProSiteProfiles PS51190 FATC domain profile. 3770 3802 10.554
12 g4787.t1 SMART SM00146 pi3k_hr1_6 3474 3767 2.3E-9
11 g4787.t1 SMART SM01343 FATC_2 3768 3802 3.4E-4
8 g4787.t1 SUPERFAMILY SSF48371 ARM repeat 26 783 8.64E-9
9 g4787.t1 SUPERFAMILY SSF48371 ARM repeat 527 1414 7.41E-17
7 g4787.t1 SUPERFAMILY SSF48452 TPR-like 2697 3061 8.06E-8
10 g4787.t1 SUPERFAMILY SSF56112 Protein kinase-like (PK-like) 3370 3764 1.22E-45

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005634 nucleus CC
GO:0005515 protein binding MF
GO:0016573 histone acetylation BP
GO:0000123 histone acetyltransferase complex CC

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values