Gene loci information

Transcript annotation

  • This transcript has been annotated as Unconventional myosin-Ib.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms is indicated below. CDS regions are colored green. TSS and TTS positions predicted from CTR-Seq data are indicated by solid circles and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g1950 g1950.t1 TTS g1950.t1 14022700 14022700
chr_3 g1950 g1950.t1 isoform g1950.t1 14023516 14030145
chr_3 g1950 g1950.t1 exon g1950.t1.exon1 14023516 14023642
chr_3 g1950 g1950.t1 cds g1950.t1.CDS1 14023516 14023642
chr_3 g1950 g1950.t1 exon g1950.t1.exon2 14023706 14023833
chr_3 g1950 g1950.t1 cds g1950.t1.CDS2 14023706 14023833
chr_3 g1950 g1950.t1 exon g1950.t1.exon3 14023927 14023983
chr_3 g1950 g1950.t1 cds g1950.t1.CDS3 14023927 14023983
chr_3 g1950 g1950.t1 exon g1950.t1.exon4 14024062 14024214
chr_3 g1950 g1950.t1 cds g1950.t1.CDS4 14024062 14024214
chr_3 g1950 g1950.t1 exon g1950.t1.exon5 14024270 14024405
chr_3 g1950 g1950.t1 cds g1950.t1.CDS5 14024270 14024405
chr_3 g1950 g1950.t1 exon g1950.t1.exon6 14024463 14024569
chr_3 g1950 g1950.t1 cds g1950.t1.CDS6 14024463 14024569
chr_3 g1950 g1950.t1 exon g1950.t1.exon7 14024647 14024871
chr_3 g1950 g1950.t1 cds g1950.t1.CDS7 14024647 14024871
chr_3 g1950 g1950.t1 exon g1950.t1.exon8 14024998 14025054
chr_3 g1950 g1950.t1 cds g1950.t1.CDS8 14024998 14025054
chr_3 g1950 g1950.t1 exon g1950.t1.exon9 14025148 14025517
chr_3 g1950 g1950.t1 cds g1950.t1.CDS9 14025148 14025517
chr_3 g1950 g1950.t1 exon g1950.t1.exon10 14025608 14025794
chr_3 g1950 g1950.t1 cds g1950.t1.CDS10 14025608 14025794
chr_3 g1950 g1950.t1 exon g1950.t1.exon11 14025861 14025997
chr_3 g1950 g1950.t1 cds g1950.t1.CDS11 14025861 14025997
chr_3 g1950 g1950.t1 exon g1950.t1.exon12 14026058 14026194
chr_3 g1950 g1950.t1 cds g1950.t1.CDS12 14026058 14026194
chr_3 g1950 g1950.t1 exon g1950.t1.exon13 14026264 14026500
chr_3 g1950 g1950.t1 cds g1950.t1.CDS13 14026264 14026500
chr_3 g1950 g1950.t1 exon g1950.t1.exon14 14026552 14026757
chr_3 g1950 g1950.t1 cds g1950.t1.CDS14 14026552 14026757
chr_3 g1950 g1950.t1 exon g1950.t1.exon15 14026814 14027056
chr_3 g1950 g1950.t1 cds g1950.t1.CDS15 14026814 14027056
chr_3 g1950 g1950.t1 exon g1950.t1.exon16 14027212 14027296
chr_3 g1950 g1950.t1 cds g1950.t1.CDS16 14027212 14027296
chr_3 g1950 g1950.t1 exon g1950.t1.exon17 14027523 14027536
chr_3 g1950 g1950.t1 cds g1950.t1.CDS17 14027523 14027536
chr_3 g1950 g1950.t1 exon g1950.t1.exon18 14027635 14027698
chr_3 g1950 g1950.t1 cds g1950.t1.CDS18 14027635 14027698
chr_3 g1950 g1950.t1 exon g1950.t1.exon19 14027807 14028872
chr_3 g1950 g1950.t1 cds g1950.t1.CDS19 14027807 14028872
chr_3 g1950 g1950.t1 exon g1950.t1.exon20 14029113 14029231
chr_3 g1950 g1950.t1 cds g1950.t1.CDS20 14029113 14029231
chr_3 g1950 g1950.t1 exon g1950.t1.exon21 14030041 14030145
chr_3 g1950 g1950.t1 cds g1950.t1.CDS21 14030041 14030145
chr_3 g1950 g1950.t1 TSS g1950.t1 14030432 14030432
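
The coordinates in the table can be cross-checked against the sequence lengths reported in the Sequences section. The following is a minimal sketch, assuming the Start and End columns are 1-based and inclusive (the page does not state the convention). The names `EXONS`, `interval_length`, and `intron_lengths` are illustrative and not part of the database.

```python
# Exon intervals for g1950.t1, copied from the table above.
# Every exon has an identical CDS interval, so one list covers both.
# Assumption: coordinates are 1-based and inclusive.
EXONS = [
    (14023516, 14023642), (14023706, 14023833), (14023927, 14023983),
    (14024062, 14024214), (14024270, 14024405), (14024463, 14024569),
    (14024647, 14024871), (14024998, 14025054), (14025148, 14025517),
    (14025608, 14025794), (14025861, 14025997), (14026058, 14026194),
    (14026264, 14026500), (14026552, 14026757), (14026814, 14027056),
    (14027212, 14027296), (14027523, 14027536), (14027635, 14027698),
    (14027807, 14028872), (14029113, 14029231), (14030041, 14030145),
]


def interval_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def intron_lengths(exons):
    """Lengths of the gaps between consecutive exons, ordered by start."""
    ordered = sorted(exons)
    return [nxt[0] - cur[1] - 1 for cur, nxt in zip(ordered, ordered[1:])]


cds_length = sum(interval_length(s, e) for s, e in EXONS)
# The final codon is a stop codon, so the protein is one residue shorter.
protein_length = cds_length // 3 - 1

print(cds_length)       # 3960
print(cds_length % 3)   # 0
print(protein_length)   # 1319
```

Under that assumption the 21 CDS intervals sum to 3960 nt, a multiple of three, which corresponds to 1319 residues plus a stop codon.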

Sequences

>g1950.t1 Gene=g1950 Length=3960
ATGGATCAAAATGTTGGCTGTTGGGATTCAGTTTTGCTTGAAAACGAGTCAGAAAATTGT
TTCATTTCAAATTTGCACCAGCGATATAAGCGAGATTTTATATATACTTTTTTAGGATCG
CATATAATCTTTTTAAATCCTTATTGTAAACCATCGACAATATTTTCTTCAGATTTAATT
AGTTCGTATGCGGAAAAAAGTTTGTTTCAATTACCGCCACACATATATTCGTTGACAAAT
AATGTTTATAAATCGTTACAAGATAATAATGAAGATCAATGTATAGTGATGCTTGGTGAA
TCTTCAAGTGGAAAAACTGAAAATGCCCGCATGGTCATAAGATTTCTATCAAAAATCTCG
GGAAGATTTATACCACTTCAACGTCAGAGAAGTTCAAACTCTATTGCTAGCTATAAGAGC
TCTCCTAAATCGACGTGTTCGACACCTAAGCATAAATCTCCAACATCAACGATTCAATCA
GTATTATCGCAAGAGAAAACGAGTTGTTTTAAAAGCGAAGGAAGTGGAAATGTTCCTGGT
GTCAAAAGTAAAAAACTTTCAAGAGTTGAATTTGATTTTTCTTATCAAAAGTGCAATGAT
ATAGATGCAAAGCATGATCTCATTAAGTATTGTCCAAAACACAATTGCTGTAACGTGTCA
TCTTCATCTTCAACCGCATCAAATCCAATTGATATTCCAATACGGCGTAAAAGCACTGGA
TATCAACTCCAACAACAACATCTACATCAATATCCAAATCTTCCAGAACTCCCAGGAATC
TCAAAAAGTTTTACCATTTATGAAACAATGAATCGTGTTCAAGTCAATAATAACAACAAA
AAGGTGCCAAATTGTCTTGATCATGTTCAACAGCAGCCGCAGCCAAGTGAATCATCGCGA
CAAACTCGATGTGAAAGCCTAGATCTCATTAAAATGAGTAATCCAAATTCCTTACAAAAA
TCAGATGATGCAATGGCAGCAACAAATTTCAGCAAATTATTTGATGAATGTATACAATTG
TCAAAAAATCAAACTAATAGCAGCTATAATAGAAGCGATGAAAAACGAAATCATTATACA
CAAATTCGTGATCTATCATACGATCGAAATAAAATTAATTTGGATAATTTTAAGAGTGCC
AAACGAAAAGTGCCAATTAAGAATTCTAGAAATGTTGAACTTAGCAATTTAGAAATTCAA
ACTATGAAAGAACGAATCGCACAAGCCGAAATATTTCTCGAAGCAATGGGTAATGCTTCA
ACTTCAAAGAATCGTGATTCAAGCCGATACGGAAAATATTTTGATCTAGAAATCGATTAT
CGTGGCGATCTAATCGGAGGTCATATAATGCATTTTTTATTGGAAAAGACTCGAGTTACG
AAGCAGTTGGAACGAGAAAGAAACTTTCATATATTTTATCAACTTTTGGCCGGAGCCGAT
ATACATTTTCTAAAATCCTTGAAACTACAAAGAAATATCAACAAGTATGATATACTCAAG
GACACAAGCTCAGATGAAGATGACAAATTTCAATTTGCCTTTACTAGAAAGAGCCTAGAC
ATTCTTGGCTTTACGACTGAGGAAACAACTTCAATCTTTAAAATCATTGCAGTGATTTTA
AAATTAGGCAATTTGAATTTTATACCAATTACTAACATCGATGGAACTGAAGGTTGCGAA
ATTTCAAATGATTATGAAATTCGTGATATATCACAACTGCTTGATATCGAAGAACAAATA
CTACTTAATTGTCTTACAAAATCAGGATCATCATGGATGCAATTAGAAAATGGGTCAGAA
CTTGATGCCATTAATGCAGCACTTATTAACAAAGCTTTATGTCGTACACTTTACGGTCGA
CTTTTTACATATGTGGTTAATCGAATTAATGAATCAATGAAGATCAAAAACTTAACAAAT
CGAGGTAGAAATTTGGGTATACTAGATTTTTTTGGTTTCGAATCGCTTGAAAAAAATTCA
TTCGAGGAATTTAACATAAATTATTGCAATGAACGTATCCATCAGAGCTATATTCAAATT
GTGTTAAAAAGTCAGCAAGATTTATATATTAAAGAGGGATTAGAATGGACCAAAATTGAT
TTCTATGATAATTTAGCAGTATGCGATATGATTGATAAGCTGCCACATGGAATATTTTTG
TTGATGGAGGAGCCAAAAGTCATAAACGATGAAATTTTATTACAAAGACTAGGACAGTGT
TGGTCAGGAAATGCTAGTTTTTCCACACAAGATCATATACCACCAAAGTGTTTTCAAATA
CGTCATTTTGCTGGAGCCTTAAATTACAGTATTGAGGGATTTGTAGAAAAAAATTCAGAT
AAAATCCCTAAACATCTAAGTTCTAGTTTATTTCAAAGCAAATTATCAATAGTACAAAAT
TTATTTCCTGAAGGAAATCCAAAACGAGCTTCAAAAAAACCAACAAATTCGAGTTCTATT
TTACGTTCATCACTGCAAAATTTATTATCTCAAATTGAACTGAGAAAATGCCATTACGTA
TTCTGCGTTAAATCGAATGATAAATGTATGCCAAAAGTATTTGAAGTACCAATTGTTCAA
CATCAAGTTCGATTTATGAGTCTTATGCCAGTTGTTGCTCTCTGGAGAAACGGATTCTAT
TTCAATTTTAGTCATTTGAAATTTTTGAGTCGCTATAAGATTTTAAGTCCATTCACATGG
CCTCATTTTCATTCAAGTATTATTGTAGAATCGATTGCTCAAATTATTCGAAGTGTACCT
CTTCCAGCTGCTGAATTTGCAATTGGACTTACCAAAGTTTTCATCAGAAGTCCTCGAACG
CTTTATGAGTTAAATGAATTTCGTAATCATCGATTGAATTCTTTAGCGACACTCATTCAA
AAAGCATTTCGCCGATATTCACAAAGAAAACTTTTTCTTAGAATGAAAAGAAGTCAAATT
ATAATATCAAGTGCATGGAGAACGTGGCGAGAATGTTGGGCAATTCCAGTATCGGAAAGA
AAACATTTATGGGGTTTATATAAAGTGGCTCGCGAAGAATATCGTTTCATAAAATACCGC
AAGCAAGTTGAGTGGGCTGTCAACACAATCCAACGAAATTACATTACATGGAAACGAAGA
CAATTTCTCATGACACTTCCGATGAGATTACACGCAAATAGTCTCAGTCCAATTTCGACA
GAATGGCCAACGGGTCCTAAGTTTCTTTCCGAGTGCTCGCAATTGTTAAAGATAATTTTT
CATAGATGGAGATGCTATAAATATCGTAAAATGTTTGACCAAACAGCTAGAAATCGCATG
CGAGAAAAAGTTACAGCAAGTATTCTATTCAAGGATCGGAAAGCGTCGTATGTTAAAAGT
GTGTCGCATCCGTTTCTTGGTGATTATGTTCGTTTACGACAAAATGTTCAATGGAAAAAG
ATTTGTGTCGAGAATAATGATCAATATGTAGTGTTTGCTGATATTATTAACAAAATTGCT
CGTTCTAGTGGAAAATATGTTCCGATTCTATTGGTATTATCAACCTCTTCAATGTTACTG
TTGGATCAGCGAACTCTTCAAATTAAATATCGTGTACCAGCTTCTGAAATTTATCGCATG
TCATTGAGTCCATATTTGGATGATATTGCTGTTTTTCATGTTAAAGCGGAAGATATATCA
TCAAATATTTCAACAACCTCAGACAATGGTGGTTGTTTATTTCAGTCTGAGCTTGGCAAA
AAGAAGGGAGATTTTGTGTTTCAAACGGGACATGTGATTGAAATTGTAACAAAAATGTTT
TTAGTAATACAAAATGCAACAAGTAAACCACCTGAGATTCAAATCAATCCTGAATTTGAA
GCAAATTTTGGCAATAATGTTGTAATAATGAGCTTTAAACAGCAAATGATGACAGATTTA
AATAATCAACAATTAACTCGTGTTTCACGAAAAGGAAATCGAATGGAAGTTATTGTCTAG

>g1950.t1 Gene=g1950 Length=1319
MDQNVGCWDSVLLENESENCFISNLHQRYKRDFIYTFLGSHIIFLNPYCKPSTIFSSDLI
SSYAEKSLFQLPPHIYSLTNNVYKSLQDNNEDQCIVMLGESSSGKTENARMVIRFLSKIS
GRFIPLQRQRSSNSIASYKSSPKSTCSTPKHKSPTSTIQSVLSQEKTSCFKSEGSGNVPG
VKSKKLSRVEFDFSYQKCNDIDAKHDLIKYCPKHNCCNVSSSSSTASNPIDIPIRRKSTG
YQLQQQHLHQYPNLPELPGISKSFTIYETMNRVQVNNNNKKVPNCLDHVQQQPQPSESSR
QTRCESLDLIKMSNPNSLQKSDDAMAATNFSKLFDECIQLSKNQTNSSYNRSDEKRNHYT
QIRDLSYDRNKINLDNFKSAKRKVPIKNSRNVELSNLEIQTMKERIAQAEIFLEAMGNAS
TSKNRDSSRYGKYFDLEIDYRGDLIGGHIMHFLLEKTRVTKQLERERNFHIFYQLLAGAD
IHFLKSLKLQRNINKYDILKDTSSDEDDKFQFAFTRKSLDILGFTTEETTSIFKIIAVIL
KLGNLNFIPITNIDGTEGCEISNDYEIRDISQLLDIEEQILLNCLTKSGSSWMQLENGSE
LDAINAALINKALCRTLYGRLFTYVVNRINESMKIKNLTNRGRNLGILDFFGFESLEKNS
FEEFNINYCNERIHQSYIQIVLKSQQDLYIKEGLEWTKIDFYDNLAVCDMIDKLPHGIFL
LMEEPKVINDEILLQRLGQCWSGNASFSTQDHIPPKCFQIRHFAGALNYSIEGFVEKNSD
KIPKHLSSSLFQSKLSIVQNLFPEGNPKRASKKPTNSSSILRSSLQNLLSQIELRKCHYV
FCVKSNDKCMPKVFEVPIVQHQVRFMSLMPVVALWRNGFYFNFSHLKFLSRYKILSPFTW
PHFHSSIIVESIAQIIRSVPLPAAEFAIGLTKVFIRSPRTLYELNEFRNHRLNSLATLIQ
KAFRRYSQRKLFLRMKRSQIIISSAWRTWRECWAIPVSERKHLWGLYKVAREEYRFIKYR
KQVEWAVNTIQRNYITWKRRQFLMTLPMRLHANSLSPISTEWPTGPKFLSECSQLLKIIF
HRWRCYKYRKMFDQTARNRMREKVTASILFKDRKASYVKSVSHPFLGDYVRLRQNVQWKK
ICVENNDQYVVFADIINKIARSSGKYVPILLVLSTSSMLLLDQRTLQIKYRVPASEIYRM
SLSPYLDDIAVFHVKAEDISSNISTTSDNGGCLFQSELGKKKGDFVFQTGHVIEIVTKMF
LVIQNATSKPPEIQINPEFEANFGNNVVIMSFKQQMMTDLNNQQLTRVSRKGNRMEVIV
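
The nucleotide and protein records above can be checked against each other by translating the coding sequence. The sketch below assumes the standard nuclear genetic code (the page does not name the translation table) and translates only the first and last few codons of the record. `translate` is a hypothetical helper, not a function of any tool mentioned here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: this table applies to g1950.t1.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nt and last 15 nt of the nucleotide record above.
print(translate("ATGGATCAAAATGTTGGC"))  # MDQNVG
print(translate("GAAGTTATTGTCTAG"))     # EVIV
```

Both fragments agree with the start (MDQNVG) and end (EVIV) of the protein record.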

Protein features from InterProScan

No. Transcript Database ID Name Start End E.value
15 g1950.t1 Coils Coil Coil 1317 1319 -
13 g1950.t1 Gene3D G3DSA:3.40.850.10 Kinesin 2 142 1.9E-32
12 g1950.t1 Gene3D G3DSA:3.40.850.10 Kinesin 398 878 7.8E-129
11 g1950.t1 Gene3D G3DSA:1.10.10.820 - 467 520 7.8E-129
10 g1950.t1 Gene3D G3DSA:1.20.120.720 - 521 632 7.8E-129
14 g1950.t1 Gene3D G3DSA:1.20.58.530 - 656 835 7.8E-129
21 g1950.t1 MobiDBLite mobidb-lite consensus disorder prediction 134 158 -
4 g1950.t1 PANTHER PTHR13140:SF802 MYOSIN IB 7 1300 2.6E-279
5 g1950.t1 PANTHER PTHR13140 MYOSIN 7 1300 2.6E-279
6 g1950.t1 PRINTS PR00193 Myosin heavy chain signature 35 54 1.5E-13
7 g1950.t1 PRINTS PR00193 Myosin heavy chain signature 92 117 1.5E-13
8 g1950.t1 PRINTS PR00193 Myosin heavy chain signature 697 725 1.5E-13
1 g1950.t1 Pfam PF00063 Myosin head (motor domain) 9 122 3.8E-30
2 g1950.t1 Pfam PF00063 Myosin head (motor domain) 397 936 2.8E-130
3 g1950.t1 Pfam PF06017 Unconventional myosin tail, actin- and lipid-binding 1103 1295 7.2E-32
17 g1950.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 1 1162 -
18 g1950.t1 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 1163 1181 -
16 g1950.t1 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 1182 1319 -
23 g1950.t1 ProSiteProfiles PS51456 Myosin motor domain profile. 5 949 141.841
24 g1950.t1 ProSiteProfiles PS50096 IQ motif profile. 952 979 6.705
22 g1950.t1 ProSiteProfiles PS51757 Class I myosin tail homology (TH1) domain profile. 1114 1319 18.278
20 g1950.t1 SMART SM00242 MYSc_2a 1 950 1.3E-151
19 g1950.t1 SMART SM00015 iq_5 951 973 0.0015
9 g1950.t1 SUPERFAMILY SSF52540 P-loop containing nucleoside triphosphate hydrolases 4 990 4.65E-189

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 predicts a disordered region.
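
The threshold rule can be applied to a per-residue score list as in the sketch below. The function name `disordered_regions` and the example scores are made up for illustration. They are not IUPred3 output for g1950.t1 and the function is not part of IUPred3.

```python
def disordered_regions(scores, threshold=0.5):
    """Return 1-based, inclusive (start, end) runs where score > threshold."""
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos  # open a new run
        elif start is not None:
            regions.append((start, pos - 1))  # close the current run
            start = None
    if start is not None:
        regions.append((start, len(scores)))  # run extends to the last residue
    return regions


# Hypothetical scores for a five-residue peptide.
example_scores = [0.2, 0.6, 0.7, 0.4, 0.9]
print(disordered_regions(example_scores))  # [(2, 3), (5, 5)]
```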

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005524 ATP binding MF
GO:0005515 protein binding MF
GO:0016459 myosin complex CC
GO:0003774 cytoskeletal motor activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean ± standard deviation.
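
A summary of this kind can be computed from replicate TPM values as sketched below. The replicate values are invented, and the use of the sample standard deviation (n - 1 denominator) is an assumption, since the page does not say which estimator was used.

```python
from statistics import mean, stdev


def summarize_tpm(replicates):
    """Return (mean, sample standard deviation) of replicate TPM values."""
    # Assumption: sample standard deviation; needs at least two replicates.
    return mean(replicates), stdev(replicates)


# Hypothetical TPM values for three replicates of one condition.
avg, sd = summarize_tpm([10.0, 12.0, 14.0])
print(f"{avg:.2f} +/- {sd:.2f}")  # 12.00 +/- 2.00
```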

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. A transcript was considered differentially expressed when (1) FDR < 0.05 and (2) fold change > 2, with TPM calculated by RSEM. DE information and fold changes between conditions are shown in the plot below.
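
The two criteria can be expressed as a simple filter. This is a sketch of the stated thresholds only, not the Trinity or DESeq2 implementation. It assumes that a fold change greater than 2 is meant in either direction (up- or down-regulation), which the text does not state explicitly. `is_differentially_expressed` is a hypothetical helper.

```python
import math


def is_differentially_expressed(fdr, fold_change, fdr_cutoff=0.05, fc_cutoff=2.0):
    """Apply the FDR and fold-change thresholds to one transcript.

    fold_change is a linear ratio between two conditions and must be > 0.
    Assumption: the fold-change cutoff is symmetric, so a ratio below
    1 / fc_cutoff also passes.
    """
    if fold_change <= 0:
        raise ValueError("fold_change must be a positive ratio")
    abs_log2_fc = abs(math.log2(fold_change))
    return fdr < fdr_cutoff and abs_log2_fc > math.log2(fc_cutoff)


print(is_differentially_expressed(0.01, 4.0))   # True
print(is_differentially_expressed(0.01, 0.25))  # True
print(is_differentially_expressed(0.01, 1.5))   # False
print(is_differentially_expressed(0.20, 8.0))   # False
```

Working on the log2 scale makes the up- and down-regulation cases symmetric, so a single comparison handles both.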

Raw TPM values