Gene loci information

Transcript annotation

  • This transcript has been annotated as histone-arginine N-methyltransferase.

Parent gene

Gene structure

  • The exon-intron structures of all isoforms are shown below. CDS regions are colored green. TSSs and TTSs predicted from CTR-Seq data are indicated by solid circles and squares, respectively. Exact coordinates are listed in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g9573 g9573.t1 isoform g9573.t1 4099449 4105085
chr_1 g9573 g9573.t1 exon g9573.t1.exon1 4099449 4099593
chr_1 g9573 g9573.t1 cds g9573.t1.CDS1 4099449 4099593
chr_1 g9573 g9573.t1 exon g9573.t1.exon2 4099670 4099833
chr_1 g9573 g9573.t1 cds g9573.t1.CDS2 4099670 4099833
chr_1 g9573 g9573.t1 exon g9573.t1.exon3 4099896 4099969
chr_1 g9573 g9573.t1 cds g9573.t1.CDS3 4099896 4099969
chr_1 g9573 g9573.t1 exon g9573.t1.exon4 4100026 4100303
chr_1 g9573 g9573.t1 cds g9573.t1.CDS4 4100026 4100303
chr_1 g9573 g9573.t1 exon g9573.t1.exon5 4100367 4100540
chr_1 g9573 g9573.t1 cds g9573.t1.CDS5 4100367 4100540
chr_1 g9573 g9573.t1 exon g9573.t1.exon6 4100602 4101651
chr_1 g9573 g9573.t1 cds g9573.t1.CDS6 4100602 4101651
chr_1 g9573 g9573.t1 exon g9573.t1.exon7 4101931 4105085
chr_1 g9573 g9573.t1 cds g9573.t1.CDS7 4101931 4105085
chr_1 g9573 g9573.t1 TSS g9573.t1 4105289 4105289
chr_1 g9573 g9573.t1 TTS g9573.t1 NA NA
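
The coordinates in the table can be checked against the sequence lengths reported in the Sequences section. The following Python sketch sums the CDS segment lengths and derives the intron lengths. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly); the variable and function names are illustrative only.

```python
# CDS segments copied from the table above as (start, end) pairs.
# Assumption: coordinates are 1-based and inclusive, as in GFF.
cds_segments = [
    (4099449, 4099593),
    (4099670, 4099833),
    (4099896, 4099969),
    (4100026, 4100303),
    (4100367, 4100540),
    (4100602, 4101651),
    (4101931, 4105085),
]


def segment_length(start, end):
    """Length of a 1-based inclusive interval."""
    return end - start + 1


# Total coding length across all seven CDS segments.
cds_length = sum(segment_length(s, e) for s, e in cds_segments)

# Intron lengths are the gaps between consecutive segments.
intron_lengths = [
    next_start - prev_end - 1
    for (_, prev_end), (next_start, _) in zip(cds_segments, cds_segments[1:])
]

print(cds_length)       # 5040
print(intron_lengths)   # [76, 62, 56, 63, 61, 279]
```

Under that assumption the CDS total is 5040 nt, which matches the Length=5040 value on the nucleotide record and equals 3 × (1679 + 1), that is, 1679 codons plus one stop codon.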

Sequences

>g9573.t1 Gene=g9573 Length=5040
ATGACTGATTTAATAAAAGGATTGTTATCGCAGATGAGCTCTAAATTTAATGAGCAAACA
AATCCAGAATTGGTAAAGAAATTGGATTTTCATGACGATAAGACATTGAAGTGGAAAACA
TTAATTAACAATACATATGCAAAAAAAGATCGCGAGAAACGAAATAAAAATAGAAATAAA
GATGGTGAAGATTCGACGGACGATAAAAGTCAACAGCAAGAACCATCAACATCTACTTCA
TTATTAAATAGAAGAGATACAATGTCCGCTGATAATTCAGATAGTGAAAGAAATAACAGT
GAAGAAGAAGAGAGCATGGATAATGCCGATGAATTAAAAGCTAGCGCTTCAATAAGTGAG
AAGTTTGGTAATAATGTTAAAGAGAAAAAACTCGTAATTGACGATGATTTACATACACTC
GAACAAAAAAAAAGTGCTGCTGCACAACAAGCAAGCAGCACACACACAACTATATCGTCT
ACATGCGACGACGACAACGACGATGAAAAAAATAAAATTATTTCAACTAATTTACCTCAA
TTGAATACAAACAGTCAAGCAATTGAAAGCAACACCAAAATAACACAGACCACTGTAACA
ACATCAACATTAACTGCTGATAAGAAGCATGATGATAATACGACTATAATGAACAGTAAA
AATACAGAGAATATAAATAGTATTGGTGTTGCTCTTAGTGACGATGATAAGAAAGCTGGC
TCCATAGCCAAAACATCAAAAATTAATTCAAATTCTTCTACAATTACTGTAATTTCTTCT
ACGTCCTTTTCTTCTTCAACAACATTAATTTCTGCAAATAATAATAATAGCAGTAATAAT
AAAGTTATTGATAAAACAATCGTTGCAAGTTCGTCACGAAATTTAAGAAGTAATAAAAAC
TCCTCAACTCAAAATTCGACAACACCAACACATAGGCAAATTTCGAATAGCAGTAGTATG
TCGGATGATAAAATTAAAAGTGATGAAATGGCAATCAAAAGTGATTTAGAAGAGAATGAT
GATTCTCAAAATAGTTCCATTGCTGTTCATCGTAAAGGACGTGGAAGACCAAGATTAGAT
GTTGCTAAGAATAATAATAATAACAATACTTCTACGGCAATTATCACGAAAAATTCAGTA
GATAGTGTTGTCGTTTGTGGTAGTGGTGATGGAAATAATGGAAATGAAAATGAATCGGAT
GTAAACCAAAAACGTGGTCGTGGTAGAATGCCGAAAACGCGATCGATAATTGAATCGTCG
TCATCATCTTCGGTAGTGAATGAAGAGAAAAAATTAAATGAGGCAGTAATGACAGAAGAG
ACGATTGAAGTGAAAAAAGAAGAAAGCACAGAAGAAACACCAAAACGTCGCGGAAGATCG
CGAAAATTAGTTGAAGAAACTATTTCTACTGATTCTTCAAATGTTGTCAAAGATGAAAAG
GATGCAGAATTGAGTCAAAGTGGTGATGAAAGTCTAAAATTGCAAAGAAAACGCGGTCGA
TTTACAAAAAAATTGAATGAAATTGAAGAAGATAATAAAACAGAATCAGCGACAAATAAA
GCAGCATCTATTACAAATACAAGTACAGCTGTTGTCGTAGAAAATTCAGACGATAGCAAA
CCGCGCCTGTTAATGACGATTCGCACTGATAAATCAGCTGTTCCTAAAATAATTACCACA
AATAATACACCATCACCACCCGCAGTTGAGAATGATGATCAGGAAAAAATAATTAAAAAA
CGAGGTCGACGACCTAAAGCGATTACTGCTGTAAAACAACAAACACAAATAAAAAAATCA
ACGAGATTGACGAAAGACGATGGTGATACGGGTGCTACTTCTTCCGAGTCACAACAACCC
AGTCCTAAAAAACTCGATCCAGTGTTATTAGCACCGCCTATTGGTAGAATGCGTTCACAA
CGTCGAATAAAACCAACTGCCAAAATTCTCGAAAATGAAGAACTACGTCAAGGTTTTGAA
GTCACAAATTGCAAACGATTGAGTCTAAGTAATGAAAATTTAATGGATTCATTTGGAATT
GATAGAAGTCCACCAACAGCAAGTAGAAGTGGAAATGATAGCAGTACTCAAAGATCACCA
AATGTCTCATCACCAGAGTCAAAAACTCAAGAAGCATCACACGATAACAGCGATACAATA
CTCAATAAAATCAATTTTACTAACAAGAAACCCTGCCGTGATCCTCAGGATTTTCTTAAT
GAAATTAAAAGTTTTAAGATAGGAACAAATAGATCTCCAGAAGATAACAAAAAATTAACA
AAAAGCCAACAAAGGCGATTATTAAAGCAAAAAGAAAAGCATTTAGGAATGCTTGGATTG
CGTCCTAAAAATGATTATACAAGCAGCTCTAATGAATCAGACACTGAAGAATTTAATCCT
AATCAAGCATCCACATCAAGACGTCCACCAATTACCTTGAGGTTGCGCAATCAACACAAG
TCTAATAATGATATGAAGCATCGTAATTTAAATCTTAGAAAACGTGAGAATGACCGAAAT
ATCGCACAATCTGAGAAGCGAATGAAAATGGCTGCTGCCAACACTACAGTGTCACATCAT
CATGAAAGAGAAATTGAATCAAGCGACGACGATTGTATTGTGATCGAGAACAATACAAAA
GTGACAGGCAAGGAAAATAATCATGTTAATCTTATTTGCTCGTGTCATCAAAAGACAAAG
TATTACATCAAGCAAGGTCAAAGTTTGTCATTATCAGTTAACGGGAAAATTTTCTGTTGT
GCAATTGATGAAATTGAAAAAAGAAAAATCGGATGTACTAATGAATTGACAGAATCATTA
ATTTGTCTCTATCGACCAAGTGTTAAAGTTAGCTATATGGTTCTTTGCGCAAGTCATAAG
AAACGACTCTTGTCGCATAATTGCTGCTGTGGTTGTGGATTATTTTGCACACAAGGAACA
TTTGTCATTTGCAATAATAAACATTTTTTCCATCGCAATTGTGCTACAAAATATATTTTA
AATACTCCTTATGAACCTGACAATCCTAATTACACGGGACCGACATTGCTTCTAAAGTGC
CCTCACTGCGGAATTGATGCACCTGATTTCGATTATCGCGTTACTATGCGTTGCGAAAAT
TTACCAGTTTTCGTACAACATCGCAGCAATGTTCCTAAACCAGCCAAAATGGGCAACATG
CTTCGTCAGTCACAACACAATCTTCAAGTGCAAATAAATAGCCTCACATTGAATATTGAA
AAGTTGATTCCTGAAAATGTCATGCAAATTTTAAAAACAGCTTATGAAAGACTCAAGTCA
CAACATGGATCAACTTCTGACTTGTCAAAATTATTTGCACCAAAAGATGTGTTCTATGCC
ATTTATCGTAAAGATGGTGATGAAAGAATGGCCGAAATTGTTGCTTCCGGTTTTAACTTG
ATGACGCCGCTTAAAGATTTTCATAATGGCACATGCTTGCATTTAATTTCCAATTTTGGA
TCTCTCACAATGGCTTATCTTATTCTAAGCCGAGCAAATTCTCATGATTTTGTAAACATG
ATGGATAAAGAAATGCGAACGCCAATCATGTACTCTGTTGCTGGTAAAAAACATGAGATT
TTAAAACTTTTATGTCAATGTGGTGCAGATGTCACAATCAAAGGTCCTGATGGCATGACT
GTTCTTCATTTAGCTGCAAAAAGTGGAAATCTTACAGCTACTCAAATTGTACTTGAAAAT
TATCGACAAATAGCTACAATTTCAAAACTTCACAAGTTCATCAACACAACAGATGATGGA
CATTGGACGCCACTTGTATGGGCGGCTGAAAATGGTCATGGAGAGATTGTCAATTATCTC
ATTAGTTTAGGTGCAGATCCAAATATTTGTGATTCTGAAAACAATACAGTACTTCATTGG
GCATCGCTAGCTGGAAAATTAGAATCTATTTATCCATTAATGAGCAATAGCGATTTGAAT
ATACAAAATATACATGGAGATACGCCACTGCATATCGCAACACGCCAATCAAAGCCAAGA
ATTTGCATGCTGCTAATGGCTCATGGTGCAAATTTAAATATTCGTAATCTTGCTAATGAA
ACACCGCTTGATGTTGCCGATGAAAAGGGTGAATGTGCAAAATTGTTGCGATTCAATATG
GACTTAAGAAGCATTGGAATTGGTGGATGGAAATGTGCCATTGGCGAAAAATTAATTTTA
TGCAATGATATTTCAAATGGCAGAGAAGTTTATCCAATTCAAATAATGAAAAATTTACAA
CACAGTGATGAAGTTATTTTGCCCGATTTTAAATACATCACAAAGAATATTCTTCTTCAG
AATTCAATTCAAATTGACCAAAGAATTTCTCAAATGAGAATTTGTTCCTGTTCAGACAAT
TGCATTTCAGAAAACTGCCAATGTGCACAAATATCATTACAAAATTGGTACAATATTGAT
GGACGTCTTATCTCGAATTTTAATTATGCCGATCCACCAATGCTTTTTGAATGCAACGAT
GTTTGTGGTTGTAATAAACTCTTATGTAAAAATCGCATCGTTCAGAATGGTATTAAATTT
CCACTTACCATTTTTGAATGTGATGATAAAATTAAAGGCTTTGGAGTCAAGTGCCTCACA
AGAATTCGTAAAGGATCATTTGTTGCACAATATTTAGGTGAAATTCTAACAGATCAAGAA
GCAGATCGCCGTACTGACGACAGTTATTTCTTTGATCTCGGTGCATCTGATCATTGCATC
GATGCAAATTTTTACGGAAATGTGAGCAGATTTTTCAATCATTCGTGTTCACCAAATGTC
GTTCCAGTTCGTGTGTATTATGAACATCAAGATCTACGTTTTCCAAAGATTGCGTTCTTT
GCATCAAAAGACATTGAAGCGGGTGAAGAAATTGTATTCGATTATGGCGAAAAATTCTGG
ATGATTAAATATAAATTCTTCAAATGCTTGTGCAAATCAGACAAGTGCCGTTACTCAGCT
GAAACAATCGATAAGACAGTTGCTGAATACAATCAGCGTCATGGAGCAATGAACAATTAA

>g9573.t1 Gene=g9573 Length=1679
MTDLIKGLLSQMSSKFNEQTNPELVKKLDFHDDKTLKWKTLINNTYAKKDREKRNKNRNK
DGEDSTDDKSQQQEPSTSTSLLNRRDTMSADNSDSERNNSEEEESMDNADELKASASISE
KFGNNVKEKKLVIDDDLHTLEQKKSAAAQQASSTHTTISSTCDDDNDDEKNKIISTNLPQ
LNTNSQAIESNTKITQTTVTTSTLTADKKHDDNTTIMNSKNTENINSIGVALSDDDKKAG
SIAKTSKINSNSSTITVISSTSFSSSTTLISANNNNSSNNKVIDKTIVASSSRNLRSNKN
SSTQNSTTPTHRQISNSSSMSDDKIKSDEMAIKSDLEENDDSQNSSIAVHRKGRGRPRLD
VAKNNNNNNTSTAIITKNSVDSVVVCGSGDGNNGNENESDVNQKRGRGRMPKTRSIIESS
SSSSVVNEEKKLNEAVMTEETIEVKKEESTEETPKRRGRSRKLVEETISTDSSNVVKDEK
DAELSQSGDESLKLQRKRGRFTKKLNEIEEDNKTESATNKAASITNTSTAVVVENSDDSK
PRLLMTIRTDKSAVPKIITTNNTPSPPAVENDDQEKIIKKRGRRPKAITAVKQQTQIKKS
TRLTKDDGDTGATSSESQQPSPKKLDPVLLAPPIGRMRSQRRIKPTAKILENEELRQGFE
VTNCKRLSLSNENLMDSFGIDRSPPTASRSGNDSSTQRSPNVSSPESKTQEASHDNSDTI
LNKINFTNKKPCRDPQDFLNEIKSFKIGTNRSPEDNKKLTKSQQRRLLKQKEKHLGMLGL
RPKNDYTSSSNESDTEEFNPNQASTSRRPPITLRLRNQHKSNNDMKHRNLNLRKRENDRN
IAQSEKRMKMAAANTTVSHHHEREIESSDDDCIVIENNTKVTGKENNHVNLICSCHQKTK
YYIKQGQSLSLSVNGKIFCCAIDEIEKRKIGCTNELTESLICLYRPSVKVSYMVLCASHK
KRLLSHNCCCGCGLFCTQGTFVICNNKHFFHRNCATKYILNTPYEPDNPNYTGPTLLLKC
PHCGIDAPDFDYRVTMRCENLPVFVQHRSNVPKPAKMGNMLRQSQHNLQVQINSLTLNIE
KLIPENVMQILKTAYERLKSQHGSTSDLSKLFAPKDVFYAIYRKDGDERMAEIVASGFNL
MTPLKDFHNGTCLHLISNFGSLTMAYLILSRANSHDFVNMMDKEMRTPIMYSVAGKKHEI
LKLLCQCGADVTIKGPDGMTVLHLAAKSGNLTATQIVLENYRQIATISKLHKFINTTDDG
HWTPLVWAAENGHGEIVNYLISLGADPNICDSENNTVLHWASLAGKLESIYPLMSNSDLN
IQNIHGDTPLHIATRQSKPRICMLLMAHGANLNIRNLANETPLDVADEKGECAKLLRFNM
DLRSIGIGGWKCAIGEKLILCNDISNGREVYPIQIMKNLQHSDEVILPDFKYITKNILLQ
NSIQIDQRISQMRICSCSDNCISENCQCAQISLQNWYNIDGRLISNFNYADPPMLFECND
VCGCNKLLCKNRIVQNGIKFPLTIFECDDKIKGFGVKCLTRIRKGSFVAQYLGEILTDQE
ADRRTDDSYFFDLGASDHCIDANFYGNVSRFFNHSCSPNVVPVRVYYEHQDLRFPKIAFF
ASKDIEAGEEIVFDYGEKFWMIKYKFFKCLCKSDKCRYSAETIDKTVAEYNQRHGAMNN
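
The two records above share the same header ID, so they are distinguished only by content: the first is the coding sequence and the second is its translation. The sketch below translates the first 30 and last 18 nucleotides of the coding sequence to show how the two records relate. It assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used, and `translate` is an illustrative helper, not part of any pipeline described here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 18 nt of the coding sequence shown above.
cds_start = "ATGACTGATTTAATAAAAGGATTGTTATCG"
cds_end = "GGAGCAATGAACAATTAA"

print(translate(cds_start))  # MTDLIKGLLS
print(translate(cds_end))    # GAMNN*
```

Both fragments agree with the start (MTDLIKGLLS) and end (GAMNN) of the protein record, with the final TAA codon accounting for the three nucleotides not represented in the 1679-residue protein.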

Protein features from InterProScan

Row Transcript Database ID Name Start End E.value
15 g9573.t1 CDD cd10543 SET_EHMT 1428 1659 6.99052E-120
14 g9573.t1 Coils Coil Coil 92 112 -
12 g9573.t1 Gene3D G3DSA:1.25.40.20 - 1119 1243 4.0E-17
13 g9573.t1 Gene3D G3DSA:1.25.40.20 - 1244 1382 1.2E-34
11 g9573.t1 Gene3D G3DSA:2.170.270.10 SET domain 1392 1672 2.5E-89
33 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 48 111 -
41 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 48 70 -
43 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 71 88 -
31 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 143 166 -
37 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 143 160 -
29 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 289 375 -
44 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 289 321 -
34 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 322 359 -
38 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 361 375 -
40 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 388 497 -
35 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 414 429 -
30 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 432 497 -
32 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 562 643 -
42 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 607 622 -
45 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 676 719 -
28 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 744 812 -
36 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 748 779 -
39 g9573.t1 MobiDBLite mobidb-lite consensus disorder prediction 783 812 -
6 g9573.t1 PANTHER PTHR46307:SF3 G9A, ISOFORM B 897 1059 0.0
8 g9573.t1 PANTHER PTHR46307 G9A, ISOFORM B 897 1059 0.0
5 g9573.t1 PANTHER PTHR46307:SF3 G9A, ISOFORM B 1074 1660 0.0
7 g9573.t1 PANTHER PTHR46307 G9A, ISOFORM B 1074 1660 0.0
2 g9573.t1 Pfam PF12796 Ankyrin repeats (3 copies) 1217 1289 5.4E-11
3 g9573.t1 Pfam PF12796 Ankyrin repeats (3 copies) 1298 1377 1.2E-7
1 g9573.t1 Pfam PF05033 Pre-SET motif 1404 1512 9.9E-15
4 g9573.t1 Pfam PF00856 SET domain 1533 1636 4.2E-22
47 g9573.t1 ProSiteProfiles PS50297 Ankyrin repeat region circular profile. 1148 1378 40.505
49 g9573.t1 ProSiteProfiles PS50088 Ankyrin repeat profile. 1217 1249 9.003
50 g9573.t1 ProSiteProfiles PS50088 Ankyrin repeat profile. 1260 1292 11.22
51 g9573.t1 ProSiteProfiles PS50088 Ankyrin repeat profile. 1325 1357 12.796
46 g9573.t1 ProSiteProfiles PS50867 Pre-SET domain profile. 1453 1517 10.581
48 g9573.t1 ProSiteProfiles PS50280 SET domain profile. 1520 1636 17.961
19 g9573.t1 SMART SM00384 AT_hook_2 351 363 6.6
17 g9573.t1 SMART SM00384 AT_hook_2 404 416 15.0
16 g9573.t1 SMART SM00384 AT_hook_2 454 466 21.0
18 g9573.t1 SMART SM00384 AT_hook_2 578 590 31.0
23 g9573.t1 SMART SM00248 ANK_2a 1148 1177 1500.0
20 g9573.t1 SMART SM00248 ANK_2a 1184 1213 0.061
22 g9573.t1 SMART SM00248 ANK_2a 1217 1246 1.9
24 g9573.t1 SMART SM00248 ANK_2a 1260 1289 2.4E-5
25 g9573.t1 SMART SM00248 ANK_2a 1293 1321 140.0
21 g9573.t1 SMART SM00248 ANK_2a 1325 1354 0.0015
26 g9573.t1 SMART SM00468 preset_2 1401 1503 3.5E-19
27 g9573.t1 SMART SM00317 set_7 1520 1642 1.4E-37
9 g9573.t1 SUPERFAMILY SSF48403 Ankyrin repeat 1123 1370 8.19E-51
10 g9573.t1 SUPERFAMILY SSF82199 SET domain 1396 1651 2.35E-69
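
Several MobiDBLite rows in the table are nested inside longer ones (for example, 48-70 and 71-88 both lie within 48-111). The following sketch collapses the listed intervals into non-overlapping regions. It is a generic interval merge written for illustration, not a feature of InterProScan, and it assumes the Start and End columns are 1-based inclusive residue positions.

```python
# MobiDBLite (start, end) pairs copied from the table above.
mobidb_hits = [
    (48, 111), (48, 70), (71, 88), (143, 166), (143, 160),
    (289, 375), (289, 321), (322, 359), (361, 375),
    (388, 497), (414, 429), (432, 497),
    (562, 643), (607, 622), (676, 719),
    (744, 812), (748, 779), (783, 812),
]


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current region if this hit overlaps or touches it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


regions = merge_intervals(mobidb_hits)
for start, end in regions:
    print(start, end, end - start + 1)
```

With the rows listed here, the merge yields seven distinct predicted disordered regions, all located before the ankyrin repeat and SET domain hits in the C-terminal part of the protein.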

Transmembrane regions from TMHMM

Disordered region

A residue with an IUPred3 score above 0.5 is predicted to lie in a disordered region.
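
The 0.5 cutoff can be turned into residue ranges by scanning the per-residue scores. The sketch below shows one way to do that. The score list is made up for illustration (the actual scores for this transcript are not reproduced on this page), and `disordered_regions` is a hypothetical helper rather than part of IUPred3.

```python
def disordered_regions(scores, threshold=0.5):
    """Return 1-based inclusive (start, end) runs where score > threshold."""
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos  # a new disordered run begins here
        elif start is not None:
            regions.append((start, pos - 1))
            start = None
    if start is not None:
        # The last run extends to the end of the sequence.
        regions.append((start, len(scores)))
    return regions


# Hypothetical per-residue scores, for illustration only.
example_scores = [0.2, 0.6, 0.7, 0.5, 0.9, 0.8, 0.1, 0.55]
print(disordered_regions(example_scores))  # [(2, 3), (5, 6), (8, 8)]
```

A score of exactly 0.5 is treated as ordered here, following the "over 0.5" wording; whether the page's plot uses a strict or inclusive cutoff is an assumption.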

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005634 nucleus CC
GO:0003677 DNA binding MF
GO:0005515 protein binding MF
GO:0002039 p53 binding MF
GO:0008270 zinc ion binding MF
GO:0034968 histone lysine methylation BP
GO:0018024 histone-lysine N-methyltransferase activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean ± standard deviation.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. A transcript was called differentially expressed when (1) FDR < 0.05 and (2) fold change > 2 (TPM calculated by RSEM). Differential expression status and fold change between conditions are shown in the plot below.
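
The two criteria can be expressed as a simple filter. The sketch below assumes that the FDR is taken from an adjusted p-value (DESeq2 reports this as `padj`), that fold change is available on a log2 scale, and that "fold change > 2" applies in either direction (|log2 fold change| > 1). None of these details is stated on this page, and the function name and example values are hypothetical.

```python
import math


def is_differentially_expressed(padj, log2_fold_change,
                                fdr_cutoff=0.05, fold_cutoff=2.0):
    """Apply both criteria: FDR below the cutoff and fold change above it.

    Assumption: the fold-change criterion is symmetric, so a 2-fold
    decrease counts as well as a 2-fold increase.
    """
    if padj is None:
        # No adjusted p-value available (for example, a filtered transcript).
        return False
    return padj < fdr_cutoff and abs(log2_fold_change) > math.log2(fold_cutoff)


# Hypothetical (padj, log2 fold change) pairs, for illustration only.
examples = [(0.01, 1.5), (0.01, 0.8), (0.2, 3.0), (0.001, -2.0), (None, 5.0)]
calls = [is_differentially_expressed(p, lfc) for p, lfc in examples]
print(calls)  # [True, False, False, True, False]
```

Both conditions must hold at once, so a large fold change with FDR ≥ 0.05, or a significant but small change, is not reported as differential.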

Raw TPM values