Gene loci information

Transcript annotation

  • This transcript has been annotated as F-box/WD repeat-containing protein 7.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms is shown below. CDS regions are colored green. TSSs and TTSs predicted from CTR-Seq data are indicated by solid circles and squares, respectively. Exact coordinates are given in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g10044 g10044.t1 isoform g10044.t1 7017750 7021301
chr_1 g10044 g10044.t1 exon g10044.t1.exon1 7017750 7020160
chr_1 g10044 g10044.t1 cds g10044.t1.CDS1 7017750 7020160
chr_1 g10044 g10044.t1 exon g10044.t1.exon2 7020378 7020603
chr_1 g10044 g10044.t1 cds g10044.t1.CDS2 7020378 7020603
chr_1 g10044 g10044.t1 exon g10044.t1.exon3 7020669 7020879
chr_1 g10044 g10044.t1 cds g10044.t1.CDS3 7020669 7020879
chr_1 g10044 g10044.t1 exon g10044.t1.exon4 7020950 7021121
chr_1 g10044 g10044.t1 cds g10044.t1.CDS4 7020950 7021121
chr_1 g10044 g10044.t1 exon g10044.t1.exon5 7021190 7021301
chr_1 g10044 g10044.t1 cds g10044.t1.CDS5 7021190 7021301
chr_1 g10044 g10044.t1 TTS g10044.t1 7022219 7022219
chr_1 g10044 g10044.t1 TSS g10044.t1 NA NA
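
The coordinates in the table can be cross-checked against the reported transcript length. The sketch below assumes the Start and End columns are 1-based with inclusive ends (an assumption; the page does not state the convention) and uses illustrative variable names. Under that assumption the five exons sum to 3132 nt, which agrees with the Length=3132 given in the nucleotide FASTA header.

```python
# Exon coordinates copied from the table above.
# Assumption: 1-based coordinates with inclusive ends.
exons = [
    (7017750, 7020160),  # exon1
    (7020378, 7020603),  # exon2
    (7020669, 7020879),  # exon3
    (7020950, 7021121),  # exon4
    (7021190, 7021301),  # exon5
]


def span_length(start, end):
    """Length of an inclusive 1-based interval."""
    return end - start + 1


exon_lengths = [span_length(s, e) for s, e in exons]
total_exonic = sum(exon_lengths)

# Intron length = gap between the end of one exon and the start of the next.
intron_lengths = [
    exons[i + 1][0] - exons[i][1] - 1 for i in range(len(exons) - 1)
]

print("exon lengths:  ", exon_lengths)
print("total exonic:  ", total_exonic)
print("intron lengths:", intron_lengths)
```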

Sequences

>g10044.t1 Gene=g10044 Length=3132
ATGGCTGATGATGAATGTGCAATCAAAAATAAAATCACATCAGATGACACATTTTATCAT
CATCAAAGAAATATTGTAGTTGATGAGATTGTATATGATGATGAAAATACTGGAACAGAA
ATTACTATCACACAAGCTACTATCGTCGATAGTACTGGCTCAATTCAACTTGAGACACCA
ATGGAAACTAGTGTAGTGTCTCTGATAGCTGAAACTTCCAATAATATTGTTAAACCAAAA
CGCTTAAGTGATGAATTTTCATCATTGGATGAAGAAACTGACGACATTGAGGACGAAGAG
AAAATGAATGATGAGTCAACAAAGTGCGATGATGTTGCTTGTTGTAGCAAATCAAGTTCT
ATTATTGAACGTGAACAAGATGAAATGGATGAAAATTCACTTAATCCACCATTCTTAAGT
GATAAGGCCGATACATCGTCGTGCATAAGTTTGTGCTTCAGCAATAGTAGCAACAATATT
TTTGTGCAATCAATTGATGATGCAATTGAACGTGCGGATTTTGAGAATGAACAAAAGCTG
ACAGGTGGTATAATTATTCGAGCATCATCGCTGACAGGAGACGAGGATAGACTGAATTTT
GATTTGATTAATAACTCTAGAAGATCTGCAAACAATAATAATAGTAATAATAGTAGTAGG
AGTAGTAATAATAGTAATAGTGAAGATATTGGTACTAATGAAACATTGGTACAACAAAAT
CAAGATAATGACGATGAGGATGAGGATGAAGAAGAGGAAGACGAATTAGTAGATGATGAT
AATAATGAAAAAAATGAAACTTCAAACACTCATCGTAAAGAAATTACATCAACACCAACA
AGTAGCCGCTGTAATATAAATGATATGAAAAGCAAAAGTACAATTCGTTCGACATCAAGT
GCAAAGCAAAAATTATCTGATAAATATTTAAGTATAATAAAGAATGTCGCCAGTGCTTCA
TCAATGACTGATTACGATGATAATGGTGAATCTTCAAAGTCAATAAGCGTCAATAAAAGT
AGCATTAGTGGATCACAGCATTCCAGTCGTAATGTCAAAATCAGTCTATTTGACGATGAA
CCTTCGACATCATCTGGTAGTCGTAATTTGCATCATCGTCACAATCATCATTTACATGTA
AAGGACGAAGACGATCCAATTGATTGTCTGAAATTAAATGACGATTATATGGACAACAAT
ACAAACCATGATTTAAGTGACGAATTGGACCCAGAAGATACATGGGCTGATTGTGAAGAA
GGCTCCGACTGTGAAGAAATTTGCACCTGTCGAAATGACGATGAAGATGATGAGTATGGT
GTGTGCAGTAGTAGCAGTGAAGATGAACTTCCATCACGAGATGTGGATTTATCAAGTTAT
ACACAACTTGATCCAATTTCTGATGATATTCTTCAAGATGGTGGTTGCGATGGAACCCCA
AAGATTCAACGAAAACGCAAGCTCACTGAGCAATCAATGATTCATGTTACTGCCGAAAGT
CCAATTGCTCTTGGCGGCAATCGTAAGCGCCTTGCATTAGATTCAATTTCATCCCCGCAT
TCTAGCATCATTTCACCAATTGTTACAAGTCAAAATCTTGCCACACCTAAATCTTCAACA
AGTCATCTCAGTGATAAACGAACACCACGATTGATTCCAACAAAAGACAATCCACCACCT
GATTTAATGGATTGGTTGCTTACTTTCCAACGTTGGACAAATGCTGAACGTCTTGTCGCT
ATTGATAAATTGATTGAACAATGCGAACCAACTCAAGTTCGTTTTATGATGAAAGTCATC
GAACCGCAATTTCAACGTGATTTTATTTCTCTACTGCCTAAAGAACTCGCTCTTCATGTG
CTCTCATATCTAGAGCCTAAAGATTTGCTTCGTGCTGCACAAACATGCCGCAGTTGGCGT
TTTTTGGCTGATGATAATCTCTTATGGAAAGAAAAATGCAGACAATCTGGAATTATATCA
GATACATGTCCTGATAAGCCAAAACGTGGAAGAACAGGTAATATGCCCAAAATTTCATCA
CCTTGGAAGGCAGCATATATGCGACATCACACAATAGAAATGAATTGGCGTTCAAACCCA
ATTAGAACACCGAAAATTCTCAATGGTCATGATGATCATGTAATTACTTGCTTGCAATTT
AGTGGAAATCGTATTGTCTCAGGATCAGATGATAATACACTTAAAGTATGGTCAGCCGTG
ACTGGAAAGTGCCTTCGCACGTTAGTGGGTCACACAGGAGGTGTGTGGAGCTCACAAATG
TCAGGAAATCTTATAATAAGTGGTAGCACAGATCGTACGCTAAAAGTATGGGATGCTGAA
TCTGGAATTTGTAAACATACGCTTTATGGACACACTTCAACAGTTCGTTGCATGCATTTA
CATGGAAATAAAGTTGTTAGTGGTAGTCGAGACGCAACACTTCGCGTGTGGGATATTAAT
GATGGAACGTGTTTGCACATTCTTGTTGGACATTTAGCTGCGGTAAGGTGTGTGCAATAT
GATGGAAAATTAGTAGTATCAGGCGCCTATGACTATCAAGTCAAAGTATGGAATCCAGAA
AGACAAGAATGTATTCATACATTACAAGGTCATACTAATCGAGTGTATTCACTTCAGTTT
GATGGTGTCCATGTCGTATCAGGCTCACTTGATACATCGATTAGAGTTTGGGATGCAGAA
ACTGGCGCTCTGAAGCATACGCTAATGGGTCATCAATCACTTACATCGGGTATGGAGCTA
AAAAATAATATATTAGTTAGTGGAAATGCTGATTCAACTGTCAAAGTTTGGGATATCATA
ACAGGACAATGCTTAGCCACACTTGCTGGTCGCAATAAACATCATTCTGCTGTAACGTGT
CTTCAATTTAATAATCGCTTTGTCATTACAAGCTCAGATGATGGAACAGTTAAATTATGG
GATGTCAAAACTGGTGAATTTATTCGAAACCTTGTCGCTTTAGATAGTGGAGGTTCAGGG
GGAGTTGTCTGGCGCATACGAGCAAACGATACAAAATTAATATGTGCTGTCGGCTCACGC
AATGGAACAGAAGAAACTAAATTAATGGTCCTAGATTTTGACATCGAAGGAGCTTGTTCA
AAATGCTCATAG

>g10044.t1 Gene=g10044 Length=1043
MADDECAIKNKITSDDTFYHHQRNIVVDEIVYDDENTGTEITITQATIVDSTGSIQLETP
METSVVSLIAETSNNIVKPKRLSDEFSSLDEETDDIEDEEKMNDESTKCDDVACCSKSSS
IIEREQDEMDENSLNPPFLSDKADTSSCISLCFSNSSNNIFVQSIDDAIERADFENEQKL
TGGIIIRASSLTGDEDRLNFDLINNSRRSANNNNSNNSSRSSNNSNSEDIGTNETLVQQN
QDNDDEDEDEEEEDELVDDDNNEKNETSNTHRKEITSTPTSSRCNINDMKSKSTIRSTSS
AKQKLSDKYLSIIKNVASASSMTDYDDNGESSKSISVNKSSISGSQHSSRNVKISLFDDE
PSTSSGSRNLHHRHNHHLHVKDEDDPIDCLKLNDDYMDNNTNHDLSDELDPEDTWADCEE
GSDCEEICTCRNDDEDDEYGVCSSSSEDELPSRDVDLSSYTQLDPISDDILQDGGCDGTP
KIQRKRKLTEQSMIHVTAESPIALGGNRKRLALDSISSPHSSIISPIVTSQNLATPKSST
SHLSDKRTPRLIPTKDNPPPDLMDWLLTFQRWTNAERLVAIDKLIEQCEPTQVRFMMKVI
EPQFQRDFISLLPKELALHVLSYLEPKDLLRAAQTCRSWRFLADDNLLWKEKCRQSGIIS
DTCPDKPKRGRTGNMPKISSPWKAAYMRHHTIEMNWRSNPIRTPKILNGHDDHVITCLQF
SGNRIVSGSDDNTLKVWSAVTGKCLRTLVGHTGGVWSSQMSGNLIISGSTDRTLKVWDAE
SGICKHTLYGHTSTVRCMHLHGNKVVSGSRDATLRVWDINDGTCLHILVGHLAAVRCVQY
DGKLVVSGAYDYQVKVWNPERQECIHTLQGHTNRVYSLQFDGVHVVSGSLDTSIRVWDAE
TGALKHTLMGHQSLTSGMELKNNILVSGNADSTVKVWDIITGQCLATLAGRNKHHSAVTC
LQFNNRFVITSSDDGTVKLWDVKTGEFIRNLVALDSGGSGGVVWRIRANDTKLICAVGSR
NGTEETKLMVLDFDIEGACSKCS
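
The relationship between the two records above can be checked with a short translation sketch. It assumes the standard nuclear genetic code (an assumption, not stated on this page) and translates only the first and last few codons of the CDS rather than the full sequence. The 3132 nt CDS holds 1044 codons, and dropping the terminal stop codon gives the 1043 residues reported for the protein.

```python
# Standard genetic code, built from the conventional TCAG ordering.
# Assumption: the standard nuclear code applies to this transcript.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a nucleotide string codon by codon; '*' marks a stop."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 18 nt and last 12 nt of the CDS, copied from the nucleotide record.
first = translate("ATGGCTGATGATGAATGT")
last = translate("AAATGCTCATAG")

# 3132 nt / 3 = 1044 codons, minus one stop codon.
expected_protein_length = 3132 // 3 - 1

print(first, last, expected_protein_length)
```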

Protein features from InterProScan

Row Transcript Database ID Name Start End E-value/score
17 g10044.t1 CDD cd00200 WD40 705 981 9.63883E-83
16 g10044.t1 Gene3D G3DSA:1.20.1280.50 - 591 698 1.8E-30
15 g10044.t1 Gene3D G3DSA:2.130.10.10 - 700 1038 2.1E-121
34 g10044.t1 MobiDBLite mobidb-lite consensus disorder prediction 209 304 -
35 g10044.t1 MobiDBLite mobidb-lite consensus disorder prediction 209 239 -
32 g10044.t1 MobiDBLite mobidb-lite consensus disorder prediction 240 261 -
33 g10044.t1 MobiDBLite mobidb-lite consensus disorder prediction 272 304 -
8 g10044.t1 PANTHER PTHR19849 PHOSPHOLIPASE A-2-ACTIVATING PROTEIN 561 1035 3.6E-228
9 g10044.t1 PANTHER PTHR19849:SF1 F-BOX/WD REPEAT-CONTAINING PROTEIN 7 561 1035 3.6E-228
11 g10044.t1 PRINTS PR00320 G protein beta WD-40 repeat signature 805 819 1.2E-8
12 g10044.t1 PRINTS PR00320 G protein beta WD-40 repeat signature 925 939 1.2E-8
10 g10044.t1 PRINTS PR00320 G protein beta WD-40 repeat signature 968 982 1.2E-8
1 g10044.t1 Pfam PF12937 F-box-like 610 654 4.2E-14
4 g10044.t1 Pfam PF00400 WD domain, G-beta repeat 705 738 5.5E-6
3 g10044.t1 Pfam PF00400 WD domain, G-beta repeat 742 778 1.2E-5
6 g10044.t1 Pfam PF00400 WD domain, G-beta repeat 784 818 6.4E-5
2 g10044.t1 Pfam PF00400 WD domain, G-beta repeat 822 858 4.4E-6
5 g10044.t1 Pfam PF00400 WD domain, G-beta repeat 863 898 1.3E-6
7 g10044.t1 Pfam PF00400 WD domain, G-beta repeat 942 981 1.9E-6
30 g10044.t1 ProSitePatterns PS00678 Trp-Asp (WD) repeats signature. 765 779 -
29 g10044.t1 ProSitePatterns PS00678 Trp-Asp (WD) repeats signature. 805 819 -
28 g10044.t1 ProSitePatterns PS00678 Trp-Asp (WD) repeats signature. 885 899 -
31 g10044.t1 ProSitePatterns PS00678 Trp-Asp (WD) repeats signature. 925 939 -
27 g10044.t1 ProSitePatterns PS00678 Trp-Asp (WD) repeats signature. 968 982 -
44 g10044.t1 ProSiteProfiles PS50181 F-box domain profile. 606 652 13.975
36 g10044.t1 ProSiteProfiles PS50294 Trp-Asp (WD) repeats circular profile. 707 990 64.593
38 g10044.t1 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 707 747 10.341
39 g10044.t1 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 748 787 12.146
37 g10044.t1 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 788 827 13.349
41 g10044.t1 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 828 867 11.244
43 g10044.t1 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 868 907 13.817
42 g10044.t1 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 908 947 12.346
40 g10044.t1 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 951 990 14.051
26 g10044.t1 SMART SM00256 fbox_2 612 652 1.6E-11
20 g10044.t1 SMART SM00320 WD40_4 700 738 1.5E-5
25 g10044.t1 SMART SM00320 WD40_4 741 778 1.3E-8
24 g10044.t1 SMART SM00320 WD40_4 781 818 1.4E-7
19 g10044.t1 SMART SM00320 WD40_4 821 858 3.6E-5
18 g10044.t1 SMART SM00320 WD40_4 861 898 2.4E-8
22 g10044.t1 SMART SM00320 WD40_4 901 938 0.029
21 g10044.t1 SMART SM00320 WD40_4 941 981 3.1E-7
23 g10044.t1 SMART SM00320 WD40_4 984 1032 330.0
14 g10044.t1 SUPERFAMILY SSF81383 F-box domain 592 694 4.32E-26
13 g10044.t1 SUPERFAMILY SSF50978 WD40 repeat-like 696 1023 1.74E-93
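
Not every hit in the table is equally well supported. As a minimal sketch, the SMART rows can be filtered on the value in the last column. The cutoff of 1e-3 is a hypothetical choice for illustration, not a threshold used by this page. With that cutoff, six SMART WD40 repeats remain, which matches the six Pfam PF00400 hits listed above.

```python
# SMART rows copied from the table above: (ID, name, start, end, E-value).
smart_hits = [
    ("SM00256", "fbox_2", 612, 652, 1.6e-11),
    ("SM00320", "WD40_4", 700, 738, 1.5e-5),
    ("SM00320", "WD40_4", 741, 778, 1.3e-8),
    ("SM00320", "WD40_4", 781, 818, 1.4e-7),
    ("SM00320", "WD40_4", 821, 858, 3.6e-5),
    ("SM00320", "WD40_4", 861, 898, 2.4e-8),
    ("SM00320", "WD40_4", 901, 938, 0.029),
    ("SM00320", "WD40_4", 941, 981, 3.1e-7),
    ("SM00320", "WD40_4", 984, 1032, 330.0),
]

# Hypothetical cutoff chosen only for this illustration.
EVALUE_CUTOFF = 1e-3

kept = [hit for hit in smart_hits if hit[4] < EVALUE_CUTOFF]
dropped = [hit for hit in smart_hits if hit[4] >= EVALUE_CUTOFF]

kept_wd40 = [hit for hit in kept if hit[1] == "WD40_4"]
dropped_starts = [hit[2] for hit in dropped]

print("WD40 repeats passing cutoff:", len(kept_wd40))
print("start positions of dropped hits:", dropped_starts)
```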

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 predicts a disordered region.
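
The thresholding rule can be sketched as a scan over per-residue scores that reports contiguous stretches above 0.5. The function name and the toy scores below are hypothetical, since the actual per-residue scores for this transcript are not included in the text.

```python
def disordered_regions(scores, threshold=0.5):
    """Return (start, end) pairs, 1-based inclusive, where score > threshold."""
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos  # open a new region
        elif start is not None:
            regions.append((start, pos - 1))  # close the current region
            start = None
    if start is not None:
        regions.append((start, len(scores)))  # region runs to the last residue
    return regions


# Hypothetical scores for a 7-residue toy sequence.
toy_scores = [0.2, 0.6, 0.7, 0.5, 0.4, 0.9, 0.8]
toy_regions = disordered_regions(toy_scores)
print(toy_regions)
```

A score of exactly 0.5 is treated as ordered here, following the "above 0.5" wording.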

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005515 protein binding MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean +/- standard deviation.
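
A minimal sketch of that summary, using hypothetical replicate values (the raw TPM values are not included in this text). It assumes the sample standard deviation, which is what `statistics.stdev` computes; whether the page uses the sample or the population form is not stated.

```python
import statistics


def summarize_tpm(replicates):
    """Format replicate TPM values as 'mean +/- stdev' (sample stdev)."""
    mean = statistics.mean(replicates)
    sd = statistics.stdev(replicates)  # requires at least two replicates
    return f"{mean:.2f} +/- {sd:.2f}"


# Hypothetical TPM values for three replicates of one condition.
example_summary = summarize_tpm([10.0, 12.0, 14.0])
print(example_summary)
```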

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. A transcript was called differentially expressed when it met both criteria: (1) FDR < 0.05 and (2) fold change > 2, with TPM calculated by RSEM. DE status and fold change between conditions are shown in the plot below.
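
The two criteria can be expressed as a simple predicate. This is a sketch under stated assumptions, not the Trinity or DESeq2 implementation: the function name is hypothetical, the fold change is taken as a positive ratio between two conditions, and a change of more than 2-fold in either direction is assumed to count.

```python
import math


def is_differentially_expressed(fdr, fold_change, fdr_cutoff=0.05, fc_cutoff=2.0):
    """Apply the two cutoffs: FDR below fdr_cutoff and fold change above fc_cutoff.

    Assumption: fold_change is a positive ratio, and down-regulation
    (ratio below 1 / fc_cutoff) is treated the same as up-regulation.
    """
    if fold_change <= 0:
        raise ValueError("fold_change must be a positive ratio")
    return fdr < fdr_cutoff and abs(math.log2(fold_change)) > math.log2(fc_cutoff)


print(is_differentially_expressed(0.01, 4.0))   # strong up-regulation
print(is_differentially_expressed(0.01, 0.25))  # strong down-regulation
print(is_differentially_expressed(0.01, 1.5))   # fold change too small
print(is_differentially_expressed(0.20, 8.0))   # FDR too high
```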

Raw TPM values