LOCUS HSCKIIBE 5917 bp DNA PRI 10-SEP-1998 DEFINITION Human gene for casein kinase II subunit beta (EC 2.7.1.37). ACCESSION X57152 NID g29968 KEYWORDS casein kinase; cytoplasmic protein; nuclear protein. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5917) AUTHORS Voss,H., Wirkner,U., Jakobi,R., Hewitt,N.A., Schwager,C., Zimmermann,J., Ansorge,W. and Pyerin,W. TITLE Structure of the gene encoding human casein kinase II subunit beta JOURNAL J. Biol. Chem. 266 (21), 13706-13711 (1991) MEDLINE 91310643 REFERENCE 2 (bases 1 to 5917) AUTHORS Pyerin,W. TITLE Human casein kinase II: structures, genes, expression and requirement in cell growth stimulation JOURNAL Adv. Enzyme Regul. 34, 225-246 (1994) MEDLINE 95028755 REFERENCE 3 (bases 1 to 5917) AUTHORS Pyerin,W. TITLE Direct Submission JOURNAL Submitted (09-JAN-1991) W. Pyerin, DEUTSCHES KREBSFORSCHUNGSZENTRUM, BIOCHEMICAL CELL PHYSIOLOGY, INST OF EXPERIMENTAL PATHOLOGY, GERMAN CANCER RESEARCH CENTER, IM NEUENHEIMER FELD 280, D-6900 HEIDELBERG F R G FEATURES Location/Qualifiers source 1..5917 /organism="Homo sapiens" /db_xref="taxon:9606" GC_signal 328..332 GC_signal 414..418 CAAT_signal 481..487 GC_signal 589..593 exon 683..1011 /note="casein kinase II subunit beta" /number=1 mRNA join(683..1011,1623..1705,2672..2774,3344..3459, 3906..3981,4128..4317,4645..4875) /note="casein kinase II subunit beta; major start sites" /evidence=experimental GC_signal 698..702 mRNA join(715..1011,1623..1705,2672..2774,3344..3459, 3906..3981,4128..4317,4645..4875) /note="casein kinase II subunit beta; major start site" /evidence=experimental CAAT_signal 759..765 mRNA join(795..1011,1623..1705,2672..2774,3344..3459, 3906..3981,4128..4317,4645..4875) /note="casein kinase II subunit beta; minor start site" /evidence=experimental intron 1012..1622 /number=1 exon 1623..1705 /note="casein kinase II subunit beta" /number=2 CDS join(1634..1705,2672..2774,3344..3459,3906..3981, 4128..4317,4645..4735) /EC_number="2.7.1.37" /note="protein kinase" /codon_start=1 /product="casein kinase II subunit beta" /db_xref="PID:g29969" /db_xref="SWISS-PROT:P13862" /translation="MSSSEEVSWISWFCGLRGNEFFCEVDEDYIQDKFNLTGLNEQVP HYRQALDMILDLEPDEELEDNPNQSDLIEQAAEMLYGLIHARYILTNRGIAQMLEKYQ QGDFGYCPRVYCENQPMLPIGLSDIPGEAMVKLYCPKCMDVYTPKSSRHHHTDGAYFG TGFPHMLFMVHPEYRPKRPANQFVPRLYGFKIHPMAYQLQLQAASNFKSPVKTIR" intron 1706..2671 /number=2 repeat_region 1934..2212 /rpt_family="ALU" exon 2672..2774 /note="casein kinase II subunit beta" /number=3 intron 2775..3343 /number=3 exon 3344..3459 /note="casein kinase II subunit beta" /number=4 intron 3460..3905 /number=4 exon 3906..3981 /note="casein kinase II subunit beta" /number=5 intron 3982..4127 /number=5 exon 4128..4317 /note="casein kinase II subunit beta" /number=6 intron 4318..4644 /number=6 exon 4645..4875 /note="casein kinase II subunit beta" /number=7 BASE COUNT 1363 a 1452 c 1672 g 1430 t ORIGIN 1 gatctgtcgg ttggggtcct acttttacat aacgccccca caatgccctt cgccttcctc 61 aacgtggccc ccgctccaag cccattttct ggagccagga atccactctg tgggttagga 121 aaggccctca ggaggcggag ggaaacctgt ggaatgccga gaagccgtgt aatgaaataa 181 cggtcacggc ctggcccctc accattactc tgaccagggt tcgaaggtca cacttagagc 241 ctaaggggaa atggagaagt gcaaagggac gagcagaatg gctggcacca cctcaggtta 301 gcgcactggg acgttccagt tctcacaccg cccaccccac cccacccaag tcctacgcac 361 ggagccaagc cgcacctctc ccctcatgag gcaggagccc cggaggaaac agtacgcccg 421 tcaagggtct ctggcgggac tgattcgcac taggggccca acaggcaata aggacccagc 481 ggattggccg aggataggcc agtcccctgg gcagcagcgc cgcgccggga ctagagggga 541 acgtgaggag agctgcggaa agagatccag cctggctccc tcctttcccc gccctaagtc 601 agcctcttca cccagtgagc acaaaactgt attgcccaga ctcccgggcc ccgaacgcca 661 tacctggctt ccgcttccgg tggcttctcg ttgtgccccg cccgcaagcg ccctcctccg 721 ggccttcgtg acagccaggt cgtgcgcggg tcatcctggg attggtagtt cgctttctct 781 catttagcca gtttctttct ctaccgggga ctccgtgtcc cggcatccac cgcggcacct 841 gacccttggc gcttgcgtgt tgccctcttc cccaccctcc ctaatttcca ctccccccac 901 cccacttcgc ctgccgcggt cgggtccgcg gcctgcgctg tagcggtcgc cgccgttccc 961 tggaagtagc aacttcccta ccccacccca gtcctggtcc ccgtccagcc ggtgagtctg 1021 aagtcgtcgc tgctccgagt cccttgtcgc tgggagcggc acatggggtc tccggacttt 1081 gatgtggggc gggggaggaa gcgaccaggt ccggcacgaa ggagggagag gtggcctgag 1141 gagcggaggg gggatgtgtg gattccggtg aaagggacct gacaatcgcc cccaacccgt 1201 gagaaaagga ggagcccagt tcttgcttga gaatgataaa cttggaaacc cttgggaaag 1261 gcgtgggggt catgcagaga cttgtattgg tagggagcct gagtcgaggt ccctgccgga 1321 gttgacacag aggagagagg gccctggcct tcgggagctc cagggatgtg ggtcgggctg 1381 gtgggtcaaa gtatctgttg gcttctttca agtggtggga ccccaaagaa tgtttaactt 1441 caaagaaaag gggctgagat gtaaattaga ggagctggag aggagtgctt cagagtttgg 1501 gttgctttaa gaaagggtgg ttccgaattc tcccgtggtt ggagggccga atgtgggagg 1561 agggaggata ccagaggcag ggaaggagaa cttgagcttt actgacactg ttctttttct 1621 agctgacgtg aagatgagca gctcagagga ggtgtcctgg atttcctggt tctgtgggct 1681 ccgtggcaat gaattcttct gtgaagtgag ttctcttcaa cctccctact tgccagcttc 1741 acatatcttc ccaccagacg ttccttcaca tattccactt ctacactgtt ctcttacatg 1801 ctatttgaaa acttcctatc agcaaagagt cccccctata aaccccgacg aacctgtgct 1861 aaagtggcaa aactggggcc caagtcctga gtctgccacc gtccagcaat ataacgttgg 1921 gctagtcaat ttgtgtcttt ttcttttttt tgagactggg tctcactctg tcaccgaggc 1981 tggagggtag tggtgcgatc tcggcttact gccacctctg cctcccaggt tcaagcgatt 2041 ctcctgctcc agcctcccaa gtagctggga ttacaagtgc ctgccaccat gcctggctaa 2101 tttttgtatt tttagtagag acagggtttc actatgttgg caaggctggt ctcgaactcc 2161 agacctcagg gtgatctgcc tgcctcgggc ctcccaaagt gctgggatta caggcgtgag 2221 cattgcgccc ggcctgtatc ttttgttact aaagtggcac tgctagtact tgtctcaggt 2281 ggcctttagg aaaactgaaa tgctacacat tgaaatgttt tgttcagaaa ccatgctgtt 2341 cagcttccac cttccttagc cagctgagag gacaaaactg gttcctagag acgggataca 2401 ggagtggagt agggacaaag atcttggaaa agaatgtcta agaaaaagat tgctgtatct 2461 acttatcctt agaaaagaaa agccaaagct tttatgggag agagtgtagg tgaactaggg 2521 agagacacaa gtacttctgc tgagttggga gtgagaaaca agcacaacag atgcagttgt 2581 gttgatgata aggcatcact tagagcattt tgcccaggtc aaagatgagg attttgatat 2641 gggttccctc ttggcttcca tgtcctgaca ggtggatgaa gactacatcc aggacaaatt 2701 taatcttact ggactcaatg agcaggtccc tcactatcga caagctctag acatgatctt 2761 ggacctggag cctggtgagg caccctcagg gttgttttgt gtgtgtgcgt gcactatttt 2821 tctcttcaaa tctctattca cttgcctgaa ttttgccaaa tttcctttgg ttctctgatt 2881 tctttaaccc caaattcatg ctttattttg atcctccacc tgactcttgt ctagttttgt 2941 gacgtatatc acttgttctc atgttttcta aatccgcaat tcagacctat tccaaaatgc 3001 gtttcctcag ggtctggttt gttgtctgtt tctcctgctt tgcaccttcc agtctagagt 3061 ttcatcttct gcattgacat tgttgcagtt atgtattgag gagggagttg ggagggagag 3121 caaggagcag aggctgaaaa ggtgtgaggg gaaggcagag ctgtcttcgt ttgatgcaag 3181 ggtcagaagc ccaggtttct gggtcccatg cccagatgtt ggatggggta aggcccaaaa 3241 gtaggtgcta ggcaaactga atagcccgca gcccctggat atgggcaggg cacctaggaa 3301 agctgaaaaa caagtagttg catttggccg ggctgtggtt cagatgaaga actggaagac 3361 aaccccaacc agagtgacct gattgagcag gcagccgaga tgctttatgg attgatccac 3421 gcccgctaca tccttaccaa ccgtggcatc gcccagatgg tgaggcctct ctgctcctac 3481 ctgcctcctt ctgagcagta agagacacag gttcctgcag caagaagtca tgtttaagcc 3541 ctgtttaagg aagctagctg agaagagggg aagaacccca gaacttgggc ctgggaattg 3601 aattctgatt gggggtcatc ctgaagggat tgttttcagg gagggagaca gaccttgaat 3661 cagagagttg tgatagactg cctcttcctc aaggaacaaa caacaaatgg ctctgatggt 3721 ttgtagccct gccctaattt ggaagaaagg caacacagaa gtttgagagc ccatctagtc 3781 cagagaaggg ggcctctgga cagagttgga aggagtgccg acagagttgg tatgggttgg 3841 gctgcgaagg gagttgcctc ttctttacat ctacctgcca accccttcca ttgtattcac 3901 ctcagttgga aaagtaccag caaggagact ttggttactg tcctcgtgtg tactgtgaga 3961 accagccaat gcttcccatt ggtgagtgtt gaagaaggga aaggaaagca ccgtgtggca 4021 gtcttatggg aaggagttgg ggctcaacac attggagcct gagtcctgag gggaggttag 4081 gtaggaatag ggggatacct ggcctgctga gtctggctgt ctcccaggcc tttcagacat 4141 cccaggtgaa gccatggtga agctctactg ccccaagtgc atggatgtgt acacacccaa 4201 gtcatcaaga caccatcaca cggatggcgc ctacttcggc actggtttcc ctcacatgct 4261 cttcatggtg catcccgagt accggcccaa gagacctgcc aaccagtttg tgcccaggta 4321 gggagcaggg agagtcatta agggtcaaag gaaaggccca agatccccca gagaggggag 4381 gacagggcat ggccctttct tgaggtctgc ttctcccaga atcagggcat ctccctgctg 4441 agtgactgtg ggaaagttat ttgattatct gtgcttgagt taccttattg tagaatgttc 4501 ttgagctgag aagttgggaa ccacgaggct ttagctctga gcaggtccat agaggagctc 4561 aggtggggag gtgggaatgc aggtgactgg cagggcctgg atggggctca tgctgctgcc 4621 tctctgacct ctgccctggc ctaggctcta cggtttcaag atccatccga tggcctacca 4681 gctgcagctc caagccgcca gcaacttcaa gagcccagtc aagacgattc gctgattccc 4741 tcccccacct gtcctgcagt ctttgtcttt tcctttcttt tttgccaccc tttcaggaac 4801 cctgtatggt ttttagttta aattaaagga gtcgttatcg tggtgggaat atgaaataaa 4861 gtagaagaaa aggccatgag ctagtctgct ggtgcttgct gttggggaag ggaaggtgat 4921 ggtgtgttgg actccagggg ccctcatggc ccagcccacc ctccccagat tgaaaaccag 4981 gacagatttg tgctcagtgg attgggtggt gtttttagta tggagcagaa cagaattcct 5041 aggactgcgt gtgatgaaat gcaaggtcaa aaggaaaaga caaagcatat ttcaaagatg 5101 agaaatattt gtttggatat ctatgactgt ctgtttatac tgtaaggggc ttaatcagca 5161 gctccatctt ttagttttag ttctaaagga aaagtagcct aaagtcagta taactaaagg 5221 gtggaacgag gtgggacaag gtccggaatt gctgctcagt gatgtgtgtg tgcctgccgc 5281 tggtggagct gagactgctc actctcagaa ggatggggat gcttgatttc ctggccaggt 5341 tgtcccagca cagtggggat tggccctgtt gtatgacgaa gacagcacat ggtggcagag 5401 atagatacta acccatggac tttccaaggg agggaatagg tctttggagg gtatgcaaga 5461 caaaggtaga cactggataa agaacccggt agtgcccagg tattacccca tctgggccat 5521 tactcccaca ctcaggaacc agacgttgtg ggtgaggaca tgctgtccct cctgccaagt 5581 aataacttcc ttcccagcca ggatcctgcc ccaagtagga atatagctct gcatttacag 5641 cagctcctgc tcagaccttg tcaaaaccac cctgcagctt aggattaagg agcatggtca 5701 caggaaggtg gggtttcagg gcatcccctc aggaactgcc catctcccca gaattccaaa 5761 atgaaggtcc atatgcttgt aggtgtgctg gtcatggtgg gctcacagta ggaaagggta 5821 agtggggccc aggggcaggg agggaggaag gggtaactga gtccaggaag ggggtggagc 5881 gtggccatgg aaatcgggct ccacggccca gggatgg // LOCUS HUMSAACT 3778 bp DNA PRI 09-JAN-1995 DEFINITION Human skeletal alpha-actin gene, complete cds. ACCESSION M20543 NID g337745 KEYWORDS alpha-actin; alpha-skeletal actin. SOURCE Human skeletal fibroblast DNA, clone pHSA.400. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3778) AUTHORS Taylor,A., Erba,H.P., Muscat,G.E. and Kedes,L. TITLE Nucleotide sequence and expression of the human skeletal alpha-actin gene: evolution of functional regulatory domains JOURNAL Genomics 3 (4), 323-336 (1988) MEDLINE 89212595 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A.Taylor 07-SEPT-1988. FEATURES Location/Qualifiers source 1..3778 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p21-qter" prim_transcript 708..3542 /gene="ACTA1" /note="alpha-skeletal actin mRNA and introns; G00-120-535" intron 799..1667 /gene="ACTA1" /note="alpha-skeletal actin, intron A; G00-120-535" gene join(1680..1808,1915..2239,2367..2528,2608..2799, 2886..3067,3146..3289) /gene="ACTA1" exon <1680..1808 /gene="ACTA1" /note="alpha-skeletal actin precursor, (first expressed exon); G00-120-535" /number=2 sig_peptide 1680..1685 /gene="ACTA1" /note="alpha-skeletal actin signal peptide; G00-120-535" CDS join(1680..1808,1915..2239,2367..2528,2608..2799, 2886..3067,3146..3289) /gene="ACTA1" /note="alpha-skeletal actin precursor" /codon_start=1 /db_xref="GDB:G00-120-535" /db_xref="PID:g337746" /translation="MCDEDETTALVCDNGSGLVKAGFAGDDAPRAVFPSIVGRPRHQG VMVGMGQKDSYVGDEAQSKRGILTLKYPIEHGIITNWDDMEKIWHHTFYNELRVAPEE HPTLLTEAPLNPKANREKMTQIMFETFNVPAMYVAIQAVLSLYASGRTTGIVLDSGDG VTHNVPIYEGYALPHAIMRLDLAGRDLTDYLMKILTERGYSFVTTAEREIVRDIKEKL CYVALDFENEMATAASSSSLEKSYELPDGQVITIGNERFRCPETLFQPSFIGMESAGI HETTYNSIMKCDIDIRKDLYANNVMSGGTTMYPGIADRMQKEITALAPSTMKIKIIAP PERKYSVWIGGSILASLSTFQQMWITKQEYDEAGPSIVHRKCF" mat_peptide 1686..1808 /gene="ACTA1" /note="alpha-skeletal actin; G00-120-535" intron 1809..1914 /gene="ACTA1" /note="alpha-skeletal actin, intron B; G00-120-535" mat_peptide 1915..2239 /gene="ACTA1" /note="alpha-skeletal actin; G00-120-535" exon 1915..2239 /gene="ACTA1" /note="G00-120-535" /number=3 intron 2240..2366 /gene="ACTA1" /note="alpha-skeletal actin, intron C; G00-120-535" mat_peptide 2367..2528 /gene="ACTA1" /note="alpha-skeletal actin; G00-120-535" exon 2367..2528 /gene="ACTA1" /note="G00-120-535" /number=4 intron 2529..2607 /gene="ACTA1" /note="alpha-skeletal actin, intron D; G00-120-535" mat_peptide 2608..2799 /gene="ACTA1" /note="alpha-skeletal actin; G00-120-535" exon 2608..2799 /gene="ACTA1" /note="G00-120-535" /number=5 intron 2800..2885 /gene="ACTA1" /note="alpha-skeletal actin, intron E; G00-120-535" mat_peptide 2886..3067 /gene="ACTA1" /note="alpha-skeletal actin; G00-120-535" exon 2886..3067 /gene="ACTA1" /note="G00-120-535" /number=6 intron 3068..3145 /gene="ACTA1" /note="alpha-skeletal actin, intron F; G00-120-535" exon 3146..>3289 /gene="ACTA1" /note="alpha-skeletal actin precursor; G00-120-535" /number=7 mat_peptide 3146..3289 /gene="ACTA1" /note="alpha-skeletal actin; G00-120-535" BASE COUNT 733 a 1221 c 1131 g 693 t ORIGIN Chromosome 1p21-qter. 1 ctgcgccctc cggccgccgg tggccctctg tgcggtgggg gaaggggtcg acgtggctca 61 gctttttgga ttcagggagc tcgggggtgg gaagagagaa atggagttcc aggggcgtaa 121 aggagaggga gttcgccttc cttcccttcc tgagactcag gagtgactgc ttctccaatc 181 ctcccaagcc caccactcca cacgactccc tcttcccggt agtcgcaagt gggagtttgg 241 ggatctgagc aaagaacccg aagaggagtt gaaatattgg aagtcagcag tcaggcacct 301 tcccgagcgc ccagggcgct cagagtggac atggttgggg aggcctttgg gacaggtgcg 361 gttcccggag cgcaggcgca cacatgcacc caccggcgaa cgcggtgacc ctcgccccac 421 cccatcccct ccggcgggca actgggtcgg gtcaggaggg gcaaacccgc tagggagaca 481 ctccatatac ggcccggccc gcgttacctg ggaccgggcc aacccgctcc ttctttggtc 541 aacgcagggg acccgggcgg gggcccaggc cgcgaaccgg ccgagggagg gggctctagt 601 gcccaacacc caaatatggc tcgagaaggg cagcgacatt cctgcggggt ggcgcggagg 661 gaatcgcccg cgggctatat aaaacctgag cagagggaca agcggccacc gcagcggaca 721 gcgccaagtg aagcctcgct tcccctccgc ggcgaccagg gcccgagccg agagtagcag 781 ttgtagctac ccgcccaggt agggcaggag ttgggagggg acagggggac agggcactac 841 cgaggggaac ctgaaggact ccggggcaga acccagtcgg ttcacctggt cagccccagg 901 cctcgccctg agcgctgtgc ctcgtctccg gagccacacg cgctttaaaa aggaggcaag 961 acagtcagcc tctggaaatt agacttctcc aaatttttct ctagcccttt gggctccttt 1021 acctggcatg taggatgtgc ctagggagat aaacggtttt gctttagttg tcgccaaggc 1081 agttcccttc caaactagcg ctagagcgaa tgagcgagca gccaggacca ccattctggg 1141 tttccaacag gcgaaaaggc cctttctgag tttgaaatgt cacagggttc ctaacaggcc 1201 actcttccct ggatggggtg ccaacgcctt tcccatgggc atctccttcc accctcacgc 1261 tggcccagca agcaggcagt gctgaggcct tatctcccta ggtgacagat gtggtcaggg 1321 aggcgcagag aggatgggca ctagcgtcca gctcctggaa caggtgtcag gcagggaggg 1381 cagacaggtc ttgggaacat gttcccctgg ctatgtggac agaggacttc tcagtgggtc 1441 tcgcgaccct gtgccccttt tcctggttca gggcagcctt agccggggca aaggtcgaga 1501 agagaacccc tggtcgccgc cctggcagaa tttgagtggc tccggcagga gatgtcccta 1561 ggttcctggg gagggaggac gtcggggcca gccaggctta cccccccctg ccgctgagac 1621 ttctgcgctg atgcacgcgc ctcttcgcgg tctccctgtc cttgcagaaa ctagacacaa 1681 tgtgcgacga agacgagacc accgccctcg tgtgcgacaa tggctccggc ctggtgaaag 1741 ccggcttcgc cggggatgac gcccctaggg ccgtgttccc gtccatcgtg ggccgccccc 1801 gacaccaggt caggctgccc ctccgcagag ggagccggct cggggtcccc gcgtaagcca 1861 gcctggtgcc acccggagcg gcgttaacgg gtgcgtggtg tctcggctct gcagggcgtc 1921 atggtcggta tgggtcagaa agattcctac gtgggcgacg aggctcagag caagagaggt 1981 atcctgaccc tgaagtaccc tatcgagcac ggcatcatca ccaactggga tgacatggag 2041 aagatctggc accacacctt ctacaacgag cttcgcgtgg ctcccgagga gcaccccacc 2101 ctgctcaccg aggcccccct caatcccaag gccaaccgcg agaagatgac ccagatcatg 2161 tttgagacct tcaacgtgcc cgccatgtac gtggccatcc aggccgtgct gtccctctac 2221 gcctccggaa ggaccaccgg tgagtgcccg ctggccccca gtcccctcgt cccgcccccg 2281 cccccgcccc cgcccccggc cgctagcgct gagcgcctag cctcggcctc gcccccagcc 2341 actcactctc tcccgcgcgc gcacaggcat cgtgctggac tccggcgacg gcgtcaccca 2401 caacgtgccc atttatgagg gctacgcgct gccgcacgcc atcatgcgcc tggacctggc 2461 gggccgcgat ctcaccgact acctgatgaa gatcctcact gagcgtggct actccttcgt 2521 gaccacaggt gcgcggcgcc cctgcacccg ggcggagggc cgcggcggcc tgagtgaggg 2581 ctcctctcct gcttctgccc tccgcagctg agcgcgagat cgtgcgcgac atcaaggaga 2641 agctgtgcta cgtggccctg gacttcgaga acgagatggc gacggccgcc tcctcctcct 2701 ccctggaaaa gagctacgag ctgccagacg ggcaggtcat caccatcggc aacgagcgct 2761 tccgctgccc ggagacgctc ttccagccct ccttcatcgg tgagccccgc tcgccctcgc 2821 cccggccccc aggcccgcgc cccccggccc gagcttctgc tcacgctccc cgccgcggtc 2881 cccaggtatg gagtcggcgg gcattcacga gaccacctac aacagcatca tgaagtgtga 2941 catcgacatc aggaaggacc tgtatgccaa caacgtcatg tcggggggca ccacgatgta 3001 ccctgggatc gctgaccgca tgcagaaaga gatcaccgcg ctggcaccca gcaccatgaa 3061 gatcaaggtg ggtggtggcc tgcgcgggct gtcggcgggg tgggctccag ggtgaggtct 3121 ccccacctca cgcgctgtct tgcagatcat cgccccgccg gagcgcaaat actcggtgtg 3181 gatcggcggc tccatcctgg cctcgctgtc caccttccag cagatgtgga tcaccaagca 3241 ggagtacgac gaggccggcc cttccatcgt ccaccgcaaa tgcttctaga cacactccac 3301 ctccagcacg cgacttctca ggacgacgaa tcctctcaat gggggggcgg ctgagctcca 3361 gccaccccgc agtcactttc tttgtaacaa ctttccgttg ctgccatcgt aaactgacac 3421 agtgtttata acgtgtacat acattaactt attacctcat tttgttattt ttcgaaacaa 3481 agccctgtgg aagaaaatgg aaaacttgaa gaagcattaa agtcattctg ttaagctgcg 3541 taaagtggtc gtgtttattt gcttggggcg ggagtggagc aggaagaggg attcccatcc 3601 cccacatcct cttaagtcac ttttcacgat accccaaatg aatgggctcc ttggaagaca 3661 aaacttacat cttcccatgc tccctgccgg tttctgcagt ggatcagatc cattccagat 3721 cactggcagc tagtggtggc ctgacttgac ctctggggtg tggcgaggcg agctttct // LOCUS HSH4EHIS 859 bp DNA PRI 09-NOV-1992 DEFINITION H.sapiens H4/e gene for H4 histone. ACCESSION X60484 NID g32000 KEYWORDS H4/e gene; histone H4. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 859) AUTHORS Doenecke,D. TITLE Direct Submission JOURNAL Submitted (08-JUL-1991) D. Doenecke, Georg-August Univer, Inst fuer Biochemie, Zentrum 3 des Fachbereichs Medizin Bioce, Humboldtallee 23, 3400 Goettingen, FRG REFERENCE 2 (bases 1 to 859) AUTHORS Doenecke,D. and Kardalinou,E. JOURNAL Unpublished FEATURES Location/Qualifiers source 1..859 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Leucocyte" /clone="14.1" TATA_signal 247..252 gene 311..622 /gene="H4/e" CDS 311..622 /gene="H4/e" /codon_start=1 /product="H4 histone" /db_xref="PID:g32001" /db_xref="SWISS-PROT:P02304" /translation="MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGV KRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFG G" /note="Histone mRNA" terminator 645..660 exon 311..622 BASE COUNT 185 a 236 c 227 g 211 t ORIGIN 1 cccggtcact tttttgtatt ccccacagta ttgatgtata tcttctgcgt tcaaaagcaa 61 ttttttaaag cctcataacg tggtaacaga atactttgca cattacaaaa ttcagaacac 121 ggaaacaaga agctcgcttt tttttccccc ctatttcggt ttggcccttt agatttcccc 181 tcccccaccg gggcgggact tcccgccgac ttctttcagg ttctcagttc ggtccgccaa 241 ctgtcgtata aaggcgctgc ctcaggccag aggcctcaca aagcgttggg tgagactcct 301 cttgctcgtc atgtctggcc gcggcaaagg cgggaagggt cttggcaaag gcggcgctaa 361 gcgccaccgt aaagtactgc gcgacaatat ccagggcatc accaagccgg ccatccggcg 421 ccttgctcgc cgcggcggcg tgaagcgcat ctccggcctc atctacgagg agactcgcgg 481 ggtgctgaag gtgttcctgg agaacgtgat ccgggacgcc gtgacctata cagagcacgc 541 caagcgcaag acggtcaccg ccatggatgt ggtctacgcg ctcaagcgcc agggccgcac 601 cctctacggt ttcggtggtt gagcgtcctt ttctaccaat aaaaggccct tttcagggcc 661 accctacttt ctcagctgaa gagtggtaac actgaggagt ggttttggta ggtacggaat 721 tttgcttggt tctgagtcag ttctgggggg aacagttttt tgaacacagc ggcacacgtg 781 tggccattca cccggggtca ctgtaggcag gactaattac gagatgtaat gtctaaactt 841 gctcaaaatt cgtaagctt // LOCUS HSU12202 4942 bp DNA PRI 13-SEP-1996 DEFINITION Human ribosomal protein S24 (rps24) gene, complete cds. ACCESSION U12202 NID g517220 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4942) AUTHORS Xu,W.B. and Roufa,D.J. TITLE The gene encoding human ribosomal protein S24 and tissue-specific expression of differentially spliced mRNAs JOURNAL Gene 169 (2), 257-262 (1996) MEDLINE 96194813 REFERENCE 2 (bases 1 to 4942) AUTHORS Roufa,D.J. TITLE Direct Submission JOURNAL Submitted (11-JUL-1994) Donald J. Roufa, Kansas State University, Division of Biology, Manhattan, KS 66506, USA FEATURES Location/Qualifiers source 1..4942 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pGS24-1" /sex="male" /cell_line="HT1080" /tissue_type="fibrosarcoma" /dev_stage="adult" exon 175..213 /gene="rps24" mRNA join(175..213,1657..1722,1816..2025,3406..3516,4544..4605) /gene="rps24" /note="alternatively spliced isoform" /product="ribosomal protein S24" gene 175..4614 /gene="rps24" 5'UTR 175..210 /gene="rps24" /evidence=experimental mRNA join(175..213,1657..1722,1816..2025,3406..3516, 4544..>4614) /gene="rps24" /note="alternatively spliced isoform" CDS join(211..213,1657..1722,1816..2025,3406..3516,3922..3924) /gene="rps24" /note="alternatively spliced isoform" /codon_start=1 /evidence=experimental /product="ribosomal protein S24" /db_xref="PID:g517221" /translation="MNDTVTIRTRKFMTNRLLQRKQMVIDVLHPGKATVPKTEIREKL AKMYKTTPDVIFVFGFRTHFGGGKTTGFGMIYDSLDYAKKNEPKHRLARHGLYEKKKT SRKQRKERKNRMKKVRGTAKANVGAGKK" intron 214..1656 /gene="rps24" exon 1657..1722 /gene="rps24" intron 1723..1815 /gene="rps24" exon 1816..2025 /gene="rps24" intron 2026..3405 /gene="rps24" exon 3406..3516 /gene="rps24" intron 3517..3921 /gene="rps24" exon 3922..3943 /gene="rps24" /note="alternative exon; residues 3925-3905 are not translated" intron 3944..4543 /gene="rps24" exon 4544..4605 /gene="rps24" /note="alternative exon; residues 4555-4605 are not translated" /evidence=experimental polyA_site 4605 /gene="rps24" BASE COUNT 1225 a 1129 c 1217 g 1371 t ORIGIN 1 gaattcctga tgcagtcgat cccgacttcg tcgtcagtca gtcgatcgaa tgcggtcgta 61 cgtcgatccg atcaaaaatg tagtttattg agatggtttc ccactcatct tgactcagag 121 tggcttttag tgctgtgcat tcctcctgaa ggaacatcct tctgtaagcc ttgcttttcc 181 tccttggctg tctgaagata gatcgccatc atggtgagtc tccctgggcc cgtgcagtta 241 tctgccgcgt atccgagcca tccgtggtcc ctgggtccca gtacttgagc tataggcacg 301 cgaagcccgg ttgctcttct ctggccgttt ccgtcagagg atggttgtcg aggggctcgg 361 ggctgttggc agggcgtccg gttggccggg ctggcagggc ctgcgcatgt agacccggac 421 acgctgcatt tcgggctcca gcgcctgggc agtgcaggag ctgttgcgct tgttgacttc 481 gtggagcacg gtggatgggg gtagggcatg ggcgggatag atggacactg ggaggcacat 541 ctcgtcttgc agttcctcat tgggctaggt aggcggcttg caggtgatgg cagtacaaga 601 ggtgaagaat tcggtgcggg cagcgctggg cgagttgagg caagtgaaac cattcccacg 661 tgtaagctag aaaacttgtc agctggatgg atcttcatgt ttcatagtgg ggaaacaggc 721 cctggaaacg tgatgttata aaaaggtagt cctgccctaa gtcatgcgtt attagctcct 781 ccagcatccg ttttaacttt accagcaatc ttttttttac tccttaagcc tttaaccggc 841 cgttactttc aagaattgag atccactgac gtggaattta atttggcgtg gttttccaag 901 taaatgagaa gcattggaag tattgctggc agtaagccca tttggccctc ttttgaaagt 961 gactgggtca ggccctaaac ttccgtttac aacggttcca gctacgttca ggcccatact 1021 tattttcccg acaaattccg gtcagtctgt actgctaacc cctcaggttc cacacgtcct 1081 tgggaaatct tctacaacgt tgcttccagt cgaagggttt ctctttggga aagcctctaa 1141 atgaggaacc tcttttgggg acccccaata atcgatctgg cctctgatga tctacgtgat 1201 ctgatctgct gcaaagactc gatccagctg tcgacgatta ctgctgacga tctgatcgat 1261 ccaagtctga tcgtcagtca acgacgtacg tacgatcgtg gttttcccag tctgggcagt 1321 cccaaaagtt tgcgacccgg ggtgcgatcg atcgatgtcg tgatctacgt accctttgat 1381 gaaagtctgg aacccctgac agtctgactg atacgtcgat acgtgcatgc ttcaagtcaa 1441 ggctctgaac gtcagcaaaa gctgggtctc agtctgcatg atcgaactga ctgctagctg 1501 actacgtacg tatgctcagg tgggtttatt tactttagtt cgcatcatga ctaatggcca 1561 tcaaattgga ctgcaacatt ctaggtcctt ggaaaatcac agcaaggcca agaaaagttg 1621 gagtagtttt attaaccaga gtatttatgt tttcagaacg acaccgtaac tatccgcact 1681 agaaagttca tgaccaaccg actacttcag aggaaacaaa tggtaaggaa gggcacatca 1741 atctttgctt aattgtcctt tactctaaag atctatttta tcatactgaa tgctaaactt 1801 gatatctcct tttaggtcat tgatgtcctt caccccggga aggcgacagt gcctaagaca 1861 gaaattcggg aaaaactagc caaaatgtac aagaccacac cggatgtcat ctttgtattt 1921 ggattcagaa ctcattttgg tggtggcaag acaactggct ttggcatgat ttatgattcc 1981 ctggattatg caaagaaaaa tgaacccaaa catagacttg caagagtagg tgtcttttca 2041 tttgttgatc agctcctgaa gacctatttt ttcaatagcg ttgtgttgtg agtgtggtaa 2101 aaagggcaag accaagcaat ctgggataca actctgaaag gattaagaga aaaagttatt 2161 tcataaaatg cacaggtgga gtacgggggt tcaaaattga agtctgacat ttgagctgag 2221 ttcagaaggg cagctgaagt agtgttcttg gagatgggct aggggtaagg attgttttcc 2281 agaggagaag taaggggatg tgtttggaaa ggtagcctgg caccaaattc acttagtttg 2341 gattaagtct gcatctgggc ttaaacccat aggtagtcag gaaccaggaa aagtttttga 2401 ttgtatactg agtgtcaaca ctgagagtga actacagcag tgatctgggc aagctcccca 2461 gcttacattc tgtaaacaag taagcatcat cacctcgaag agcttgtttt gtttgttcct 2521 ttaactcact ttatcctatc ctgttagata aaccttccca tacgatagct gtgagggaaa 2581 atcctggtga tgtcataggt agaggctgca cttttttttt ttagagagag agtctagctc 2641 tgtcgccagg ctggagtgca gtggtacaat ctcctgctaa tgctagctag ctagcccaat 2701 tggttcacga tcgttacggg gactgcagtc gggccctaaa attttttcct gactaaacgt 2761 cgtactactg atagtcgtac gtacttggcc catgcatgct agctgactgc atccgtacgt 2821 acgtacgtac gtagcatgac tgaggctgca ctatttttaa ctgacgttcg gtcatgttaa 2881 aggccttgct gaggtggtga cattgaagct gagatccttg taaagatttg gaagtagtct 2941 ttgtaaagat acagaaacag tcttgtaggc cccaggaaca gcaagtgcaa aggttgggca 3001 gagtggatcc gatgagtaga tccggaatct ccatgatgtt ggggttgtaa ttgcttgtct 3061 gtcataaggg gagtaatggc tgcactagcc attgacatga ctgacgtcag tcatgcatga 3121 aatagtgatc tgtgagctca agtctggaca agaacagcca gtctcccaaa ctccctgttc 3181 agagaatgtt gagtggacca tttgtttcct gtgaggctat tccataccta aaatcccttg 3241 atgatccttt ctgttccagc aaaaacatgc attaaatgta cttgaaaagt tttggtttag 3301 aaagctgtgt ttcttaagct cagactgggt cagtttccct accagagtgg tgggtaatga 3361 ttttaatgtt tacaagtcac ctggatgtac tcttttctca ttcagcatgg cctgtatgag 3421 aagaaaaaga cctcaagaaa gcaacgaaag gaacgcaaga acagaatgaa gaaagtcagg 3481 gggactgcaa aggccaatgt tggtgctggc aaaaaggtgc tcagtccagt cgtacgtcca 3541 atgttacgta aatgggcccc tggggcatgg ttttaaaact gcaatcccgt acgattggcc 3601 ctttgatata aaaaagctga tctgtttgct ccctgatacg ttgccagtct aaggtccacg 3661 atcattgata caatgccctt tgatcgatac gtaaagtccc gtacgtcgac tgatcagttt 3721 gacccgtagc tgatcataag tacgtacgta cggtacgatc aggactaaag catgcagtcc 3781 agacgtcatt gaaagtccct gggtcaagct ttgacgtacg atcaaagtcc ctgacttttg 3841 actaaagggt catcgatccc cggggtacgc gcagctgcgg cgactgactg attttgaccc 3901 atttactaac gatccagtca gtgagctgga tactggctca caggtcagtc gatcgatcga 3961 tcccattttc gtcgtccccg ttttgaaaac gtcgcgtcgt aaaggctttc cccgtgtttg 4021 cgtcccgacg tacgtagggg gccaagtcga tcggtaaaaa aacgtccgac gtacgatcgg 4081 atcgacggac tttgcagctg gactgcatcc gatcatgtcg ttacgtcgat cgaacgtttg 4141 ccccgttttg caaacattcg tttcacccgt cccgtcgttt tggggactcc caaaagtcta 4201 cacccgggca aagtcttcgt cgatcgatag tcgtaaattg ccgtcgatac gtttcgtggt 4261 cgactgacgt acgtacgttc gtcagctgac gtttcgatcg tacgcagtca gtcaaagtcg 4321 tcagactccc attttaaagt ctcccgtttt gcccgaaagg gtcgtttcgt cgagctgcta 4381 agtctcgtcc cgttaccccg tcccattttt gaaagttgtt tgtacgtcca aacgtccttc 4441 gtcgttcgta gtaagtcgta cgtcgatcgt acgtcgttca ggtaccctac gtacgtacca 4501 ttactacgta cgccagctac ccgatcaacg atcaactatc cagccgaagg agtaaaggtg 4561 ctgcaatgat gttagctgtg gccactgtgg atttttcgca agaacattaa taaactaaaa 4621 acttcatgtg ctgcagtacg acttgcagtc gatcgatcga acctgcactt gatttacggg 4681 tctacgtaaa cgtaaacgta cgtacccctt ttgcacccag cccgggacgt cgtctgatcg 4741 atctttcaaa gtcgataaac gttccgatat gatacgtacg atacgatatt cgatacgtac 4801 gttcccgttt acctaaacta aacgtacgga tcctttaccc tgctcccgta aagggctttt 4861 gcagctagct gactcccatt ttgcatcaaa gtccgtacgt acgtttacgt cgatcgactg 4921 actttacgtc gtaccagaat tc // LOCUS HUMHIS4 1098 bp DNA PRI 08-NOV-1994 DEFINITION Human histone H4 gene, complete cds, clone FO108. ACCESSION M16707 NID g184063 KEYWORDS histone H4. SOURCE Homo sapiens Foetus liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1098) AUTHORS Pauli,U., Chrysogelos,S., Stein,G., Stein,J. and Nick,H. TITLE Protein-DNA interactions in vivo upstream of a cell cycle-regulated human H4 histone gene JOURNAL Science 236 (4806), 1308-1311 (1987) MEDLINE 87234336 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by U.Pauli, 14-AUG-1987. FEATURES Location/Qualifiers source 1..1098 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Foetus" /tissue_type="liver" /map="1q21" mRNA 585..974 /note="his4 mRNA" gene 613..924 /gene="H4F2" CDS 613..924 /gene="H4F2" /codon_start=1 /db_xref="GDB:G00-120-032" /product="histone H4" /db_xref="PID:g386773" /translation="MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGV KRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFG G" exon 613..924 BASE COUNT 257 a 293 c 323 g 225 t ORIGIN EcoRI site; chromosome 1cen-q31. 1 aattctcctg tgtgagctaa aatacagtgg ctcggtccaa caaaacagag cctggagcca 61 ggaattatgg cgaacctgct ccctccgtcc tccttcggcg aagatccctg gcgcgcgtcc 121 ttgaggtcgc cttcggtgtt gacctcatcg tcggaacggc gcttcctgaa gctttatata 181 agcacggctc tgaatccgct cgtcggatta aatcctgcgc tggcgtcctg ccagtctctc 241 gctccatttg ctcttcctga ggctccctcc agagaccttt cccttagcct cagtgcgaat 301 gcttccgggc gtcctcagaa ccagagcaca gccaaagcca ctacagaatc cggaagcccg 361 gttgggatct gaattctccc ggggaccgtt gcgtaggcgt taaaaaaaaa aaagagtgag 421 agggacctga gcagagtgga ggaggaggga gaggaaaaca gaaaagaaat gacgaaatgt 481 cgagagggcg gggacaattg agaacgcttc ccgccggcgc gctttcggtt ttcaatctgg 541 tccgatactc ttgtatatca ggggaagacg gtgctcgcct tgacagaagc tgtctatcgg 601 gctccagcgg tcatgtccgg cagaggaaag ggcggaaaag gcttaggcaa agggggcgct 661 aagcgccacc gcaaggtctt gagagacaac attcagggca tcaccaagcc tgccattcgg 721 cgtctagctc ggcgtggcgg cgttaagcgg atctctggcc tcatttacga ggagacccgc 781 ggtgtgctga aagtgttctt ggagaatgtg attcgggacg cagtcaccta caccgagcac 841 gccaagcgca agaccgtcac agccatggat gtggtgtacg cgctcaagcg ccaggggaga 901 accctctacg gcttcggagg ctaggcgccg ctccagcttt gcacgtttcg atcccaaagg 961 ccctttttgg gccgaccact tgctcatcct gaggagttgg acacttgact gcgtaaagtg 1021 caacagtaac gatgttggaa ggtaactttg gcagtggggc gacaatcgga tctgaagtta 1081 acggaaagac ataaccgc // LOCUS HSHISH3 698 bp DNA PRI 12-SEP-1993 DEFINITION Human histone H3 gene. ACCESSION X00090 NID g32114 KEYWORDS histone; histone H3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 698) AUTHORS Zhong,R., Roeder,R.G. and Heintz,N. TITLE The primary structure and expression of four cloned human histone genes JOURNAL Nucleic Acids Res. 11 (21), 7409-7425 (1983) MEDLINE 84069776 FEATURES Location/Qualifiers source 1..698 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 93..97 /note="CAAT box" promoter 114..120 /note="TATA box" precursor_RNA 148..659 /note="put. primary transcript" CDS 186..596 /note="histone H3" /codon_start=1 /db_xref="PID:g32115" /db_xref="SWISS-PROT:P16106" /translation="MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRP GTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEACEAYLV GLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA" misc_signal 634..649 /note="dyad symmetry" exon 186..596 BASE COUNT 169 a 183 c 177 g 169 t ORIGIN 1 acggtaatga caggaatctc tcttaatctg caactaggca cagagatggg ccaatccaag 61 aagggcgcgg ggatttttga attttcttgg gtccaatagt tggtggtctg actctataaa 121 agaagagtag ctctttcctt tcctccacag acgtctctgc aggcaagctt ttctgtggtt 181 ttgccatggc tcgtactaaa cagacagctc ggaaatccac cggcggtaaa gcgccacgca 241 agcagctggc taccaaggct gctcgcaaga gcgcgccggc taccggcggt gtgaaaaagc 301 ctcaccgtta ccgtccgggt actgtggctc tgcgtgagat ccgccgctac caaaagtcga 361 ccgagttgct gattcggaag ctgccgttcc agcgcctggt gcgagaaatc gcccaagact 421 tcaagaccga tcttcgcttc cagagctctg cggtaatggc gctgcaggag gcttgtgagg 481 cctacttggt agggctcttt gaggacacaa acctttgcgc catccatgct aagcgagtga 541 ctattatgcc caaagacatc cagctcgctc gccgcattcg cggagaaaga gcgtaaatgt 601 aaagttactt tttcatcagt cttaaaaccc aaaggctctt ttcagagcca cccacttatt 661 ccaacgaaag tagctgtgat aattttttgt tgtctcaa // LOCUS HSHSC70 5408 bp DNA PRI 09-MAY-1995 DEFINITION Human hsc70 gene for 71 kd heat shock cognate protein. ACCESSION Y00371 NID g32466 KEYWORDS heat shock cognate protein; hsc70 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5408) AUTHORS Dworniczak,B. and Mirault,M.E. TITLE Structure and expression of a human gene coding for a 71 kd heat shock 'cognate' protein JOURNAL Nucleic Acids Res. 15 (13), 5181-5197 (1987) MEDLINE 87259994 REFERENCE 2 (bases 1 to 5408) AUTHORS Rensing,S.A. and Maier,U.G. TITLE Phylogenetic analysis of the stress-70 protein family JOURNAL J. Mol. Evol. 39 (1), 80-86 (1994) MEDLINE 94343547 FEATURES Location/Qualifiers source 1..5408 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="phage521" /clone_lib="human liver DNA in lambda L47.1" misc_feature 4..17 /note="heat shock consensus element" misc_feature 109..113 /note="regulatory sequence" misc_feature 178..191 /note="heat shock consensus element" TATA_signal 199..203 mRNA join(227..304,1035..1244,1567..1772,2097..2249,2337..2892, 3104..3306,3535..3733,3881..4113,4445..>4630) exon 227..304 /number=1 prim_transcript 227..>4630 intron 305..1034 /number=1 exon 1035..1244 /number=2 CDS join(1040..1244,1567..1772,2097..2249,2337..2892, 3104..3306,3535..3733,3881..4113,4445..4630) /codon_start=1 /product="71 Kd heat shock cognate protein" /db_xref="PID:g32467" /db_xref="SWISS-PROT:P11142" /translation="MSKGPAVGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAF TDTERLIGDAAKNQVAMNPTNTVFDAKRLIGRRFDDAVVQSDMKHWPFMVVNDAGRPK VQVEYKGETKSFYPEEVSSMVLTKMKEIAEAYLGKTVTNAVVTVPAYFNDSQRQATKD AGTIAGLNVLRIINEPTAAAIAYGLDKKVGAERNVLIFDLGGGTFDVSILTIEDGIFE VKSTAGDTHLGGEDFDNRMVNHFIAEFKRKHKKDISENKRAVRRLRTACERAKRTLSS STQASIEIDSLYEGIDFYTSITRARFEELNADLFRGTLDPVEKALRDAKLDKSQIHDI VLVGGSTRIPKIQKLLQDFFNGKELNKSINPDEAVAYGAAVQAAILSGDKSENVQDLL LLDVTPLSLGIETAGGVMTVLIKRNTTIPTKQTQTFTTYSDNQPGVLIQVYEGERAMT KDNNLLGKFELTGIPPAPRGVPQIEVTFDIDANGILNVSAVDKSTGKENKITITNDKG RLSKEDIERMVQEAEKYKAEDEKQRDKVSSKNSLESYAFNMKATVEDEKLQGKINDED KQKILDKCNEIINWLDKNQTAEKEEFEHQQKELEKVCNPIITKLYQSAGGMPGGMPGG FPGGGAPPSGGASSGPTIEEVD" intron 1245..1566 /number=2 exon 1567..1772 /number=3 intron 1773..2096 /number=3 exon 2097..2249 /number=4 intron 2250..2336 /number=4 exon 2337..2892 /number=5 intron 2893..3103 /number=5 exon 3104..3306 /number=6 intron 3307..3534 /number=6 exon 3535..3733 /number=7 intron 3734..3880 /number=7 exon 3881..4113 /number=8 intron 4114..4444 /number=8 exon 4445..>4630 /number=9 polyA_signal 4843..4848 BASE COUNT 1411 a 1079 c 1420 g 1498 t ORIGIN 1 gagcttgaaa gttccagaac gctgcggtga gtgcgttatc gtgaggcggc gcggtggggt 61 gggtgcggaa gggggcgagg cgaggagtgg agccgcgttg tgattgtgat tgggtcttgt 121 aagggcagcc ggactctatt ggccgggaac ctaatgcagg aagcaggcgg accccttctg 181 gaaggttcta agatagggta taagaggcag ggtggcgggc ggaaaccggt gctcagttga 241 actgcgctgc agctcttggt tttttgtggc ttccttcgtt attggagcca ggcctacacc 301 ccaggtaaaa cctctgctca agagttgggt tgtgggtctg ggagcgtgca gcctccacac 361 aggcctgttg ggcttgctga ggcttggggg ttctgagaat ctcgtcgagg cgagtgtgcg 421 gctccttcta ccggcttaaa gggcctcagt tttcggtggg atggcagcgg tatttggttg 481 cagccggcag acggaaatgt agggagtggg ccgcatggcc ccaggggagg ctgggagacg 541 cccggccgcg tggcggggga gggttgctgc atcggtttgc ctggcgcgcg gggaagtgga 601 gccagcgttt tctttcaccc agttccctgc ttagtccagt cccaccgtgg ttcttcagag 661 ctgttcttgg cgtgcttcca gtatgggggt acattccgga gtagttaaaa gcccgttgac 721 tcccgggggg cactggcacc tggcgaggga ggggaacaga cagtgctcag ttcggggtaa 781 gaccacgtgt tgagcaacgc cccacgccgt ctgggtcgat gggtccttca tctagggcgt 841 gctgtgctgc ggttggcacg gcaacctgga ctgcagcact agttctggac ctcgcgcgtg 901 cttagacagg aggtgatggg cactattacc tcttggcagt ggccatacgt ttttcctggt 961 taagtgttct gttaagggat gagggaaata ttttgattaa ttgaattttt aaaccagatt 1021 tttctttttt tcagcaacca tgtccaaggg acctgcagtt ggtattgatc ttggcaccac 1081 ctactcttgt gtgggtgttt tccagcacgg aaaagtcgag ataattgcca atgatcaggg 1141 aaaccgaacc actccaagct atgtcgcctt tacggacact gaacggttga tcggtgatgc 1201 cgcaaagaat caagttgcaa tgaaccccac caacacagtt tttggtgagt tcctaatttt 1261 aaatgacaga acaaatataa acagggctag gaagcacaaa agtttatgaa acgtgaggag 1321 ggaacttttt gattttagaa aaactgagct gagagacttg ttatcaagtc tgttataaaa 1381 caggttgtag aaacctttca ggctgaaatc tggataacgt aggaggttga agtttgaacc 1441 tttgctaggt atatggtagt tgaattcacc tacctatgaa ctgttaggta tttgagtaat 1501 catggacttg agttttatct gaagagctat gaaattgaaa gtgttttcat ttgacacctt 1561 ttacagatgc caaacgtctg attggacgca gatttgatga tgctgttgtc cagtctgata 1621 tgaaacattg gccctttatg gtggtgaatg atgctggcag gcccaaggtc caagtagaat 1681 acaagggaga gaccaaaagc ttctatccag aggaggtgtc ttctatggtt ctgacaaaga 1741 tgaaggaaat tgcagaagcc taccttggga aggtgaggtt ggtttttcag tatggggtgc 1801 attccggagt agttaaaagc ccgatgactc ccgggggcac tggcacctgg cgagggaggg 1861 gaacagatgg ggctcagctc agggttaaga ccacgtgccc aacagtgccc taggctctct 1921 aggtagatgg gtctgtcaac accagaaacc agtgaatctt gacaattaca cagtaattta 1981 cattttggtg gggggggtgc tccagctgtt gtttcaccag cattaatcca tttgctggag 2041 tttgcatata tgtaagtata atagttacca atctgtggtc ttttccttat tcctagactg 2101 ttaccaatgc tgtggtcaca gtgccagctt actttaatga ctctcagcgt caggctacca 2161 aagatgctgg aactattgct ggtctcaatg tacttagaat tattaatgag ccaactgctg 2221 ctgctattgc ttacggctta gacaaaaagg tatgtaccat ttgtgatgca agttcggatt 2281 attttaagat taatttgatc catcgtaaat ttaaatgaga ttgtttttaa cggcaggttg 2341 gagcagaaag aaacgtgctc atctttgacc tgggaggtgg cacttttgat gtgtcaatcc 2401 tcactattga ggatggaatc tttgaggtca agtctacagc tggagacacc cacttgggtg 2461 gagaagattt tgacaaccga atggtcaacc attttattgc tgagtttaag cgcaagcata 2521 agaaggacat cagtgagaac aagagagctg taagacgcct ccgtactgct tgtgaacgtg 2581 ctaagcgtac cctctcttcc agcacccagg ccagtattga gatcgattct ctctatgaag 2641 gaatcgactt ctatacctcc attacccgtg cccgatttga agaactgaat gctgacctgt 2701 tccgtggcac cctggaccca gtagagaaag cccttcgaga tgccaaacta gacaagtcac 2761 agattcatga tattgtcctg gttggtggtt ctactcgtat ccccaagatt cagaagcttc 2821 tccaagactt cttcaatgga aaagaactga ataagagcat caaccctgat gaagctgttg 2881 cttatggtgc aggtaacaat ggtatctcaa ttaaccctaa aggcaggcag gcccaaggtg 2941 actcgctgtg atgagtgatt gttaaacatt cgtagtttcc accaaaagct tggctaatga 3001 tggcaacacc ttccttggat gtctgagcga gtgatagtta aaacaggagc tatgtactgg 3061 gttttctttt aacttctttt aacgttaact ttttgtttgc tagctgtcca ggcagccatc 3121 ttgtctggag acaagtctga gaatgttcaa gatttgctgc tcttggatgt cactcctctt 3181 tcccttggta ttgaaactgc tggtggagtc atgactgtcc tcatcaagcg taataccacc 3241 attcctacca agcagacaca gaccttcact acctattctg acaaccagcc tggtgtgctt 3301 attcaggtat gtttctgtac ttctcttgtt tggcttactg ataacagata aagggaagtc 3361 ttgactgact cgctatgatg atggattcca aaaccattcg tagtttccac cagaaagtct 3421 tatgttggcc agttccttcc ttggatgttt gagcgaccat tcttccttag caggacccta 3481 gcactgtcac agacctggag tccattgtag taatttgttt tatttcctac caaggtttat 3541 gaaggcgagc gtgccatgac aaaggataac aacctgcttg gcaagtttga actcacaggc 3601 atacctcctg caccccgagg tgttcctcag attgaagtca cttttgacat tgatgccaat 3661 ggtatactca atgtctctgc tgtggacaag agtacgggaa aagagaacaa gattactatc 3721 actaatgaca agggtaagga ggcactgtca tctggtcttg acagggataa tggtatttca 3781 attgagttac tggtgaataa gggcgtctag ctaagagaaa ctagagttac acatacacag 3841 gtaatttaag gcttttactt agagttaatt tctttcctag gccgtttgag caaggaagac 3901 attgaacgta tggtccagga agctgagaag tacaaagctg aagatgagaa gcagagggac 3961 aaggtgtcat ccaagaattc acttgagtcc tatgccttca acatgaaagc aactgttgaa 4021 gatgagaaac ttcaaggcaa gattaacgat gaggacaaac agaagattct ggacaagtgt 4081 aatgaaatta tcaactggct tgataagaat caggtttgtg tttttttttt tttttttcct 4141 cccccacgca atggagggga aggggatggt aaaccaagct tgagctggat ttcagtgtag 4201 ggtcacaatg atgaatggtc caaaacattc gcggtttcca ccagaattca aggtgttggc 4261 aactaccttc cttggatgtc tgagtgaccc aagatgttaa ggaagaataa ggccctattt 4321 taatgttggt atgggccctc ttgtaagagt ttgctccaga cttttagtat cagattgcgt 4381 cagggagaaa gaagggttat taacattaaa agaacttgca gtaattcctt tttctcttcc 4441 tcagactgct gagaaggaag aatttgaaca tcaacagaaa gagctggaga aagtttgcaa 4501 ccccatcatc accaagctgt accagagtgc aggaggcatg ccaggaggaa tgcctggggg 4561 atttcctggt ggtggagctc ctccctctgg tggtgcttcc tcagggccca ccattgaaga 4621 ggttgattaa gccaaccaag tgtagatgta gcattgttcc acacatttaa aacatttgaa 4681 ggacctaaat tcgtagcaaa ttctgtggca gttttaaaaa gttaagctgc tatagtaagt 4741 tactgggcat tctcaatact tgaatatgga acatatgcac aggggaagga aataacattg 4801 cactttatac actgtattgt aagtggaaaa tgcaatgtct taaataaaac tatttaaaat 4861 tggcaccata caattgcttt gagtctttaa ataatctccc aggccagcgg tgggagaagt 4921 aggcttaggt gattatgtga ctcttacttt ctccttcctc ttaagcttga gttaacaagg 4981 gctgggtggc aagttgccct tcagagcatg tggatggtac attttggaat tcagagcttt 5041 gagaagggga gcataagaaa ttggatctgg atcaaactaa ccttagtcct taggctggag 5101 aggcagaagc tgacttaatg gtgttttcta aacttattct gtgtgtaagc ctgcctagga 5161 gcagaggctt tcctggaggg ttgtgctaga tgagtaagaa tttagataca gaatcaaata 5221 atgggcagtg aatattaagc tacatggcag aggtatctga atgtcaatcc cttatatgag 5281 ccactgccct gtgggcttcc atttcttctg agttaagatt attcagaagg tcggggattg 5341 gagctaagct gccacctggt taattaaggt cccaacagtg agttgtgata gcctagggga 5401 gcaggctg // LOCUS HUMNOCT 4878 bp DNA PRI 19-JAN-1996 DEFINITION Homo sapiens POU-domain transcription factor (N-Oct-3), complete cds. ACCESSION L37868 NID g972766 KEYWORDS N-Oct 3 protein; POU domain; POU domain transcription factor; POU3F2 gene; homeodomain protein; transcription factor. SOURCE Homo sapiens (clone: pBSKB2-10 4.8 kb SacI) (tissue library: Promega Cat No. C2091) liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Schreiber,E., Tobler,A., Malipiero,U., Schaffner,W. and Fontana,A. TITLE cDNA cloning of human N-Oct3, a nervous-system specific POU domain transcription factor binding to the octamer DNA motif JOURNAL Nucleic Acids Res. 21 (2), 253-258 (1993) MEDLINE 93181199 REFERENCE 2 (bases 1 to 4878) AUTHORS Angus,J., Thomson,F., Murphy,K., Baker,E., Sutherland,G.R., Parsons,P.G. and Sturm,R.A. TITLE The brn-2 gene regulates the melanocytic phenotype and tumorigenic potential of human melanoma cells JOURNAL Oncogene 11 (4), 691-700 (1995) MEDLINE 95380176 COMMENT Ref [1] reports bases 2359-4296. FEATURES Location/Qualifiers source 1..4878 /organism="Homo sapiens" /note="clone: lambda GEM-11" /db_xref="taxon:9606" /clone="pBSKB2-10 4.8 kb SacI" /tissue_type="liver" /tissue_lib="Promega Cat No. C2091" /map="6q16" gene 2511..3842 /gene="N-Oct-3" CDS 2511..3842 /gene="N-Oct-3" /note="bp 3312..3743 POU domain" /codon_start=1 /db_xref="GDB:G00-222-816" /product="POU-domain transcription factor" /db_xref="PID:g972767" /translation="MATAASNHYSLLTSSASIVHAEPPGGMQQGAGGYREAQSLVQGD YGALQSNGHPLSHAHQWITALSHGGGGGGGGGGGGGGGGGGGGGDGSPWSTSPLGQPD IKPSVVVQQGGRGDELHGPGALQQQHQQQQQQQQQQQQQQQQQQQQQRPPHLVHHAAN HHPGPGAWRTAAAAAHLPPSMGASNGGLLYSQPSFTVNGMLGAGGQPAGLHHHGLRDA HDEPHHADHHPHPHSHPHQQPPPPPPPQGPPGHPGAHHDPHSDEDTPTSDDLEQFAKQ FKQRRIKLGFTQADVGLALGTLYGNVFSQTTICRFEALQLSFKNMCKLKPLLNKWLEE ADSSSGSPTSIDKIAAQGRKRKKRTSIEVSVKGALESHFLKCPKPSAQEITSLADSLQ LEKEVVRVWFCNRRQKEKRMTPPGGTLPGAEDVYGGSRDTPPHHGVQTPVQ" conflict 2587..2588 /gene="N-Oct-3" /citation=[1] /replace="cg" exon 2511..3842 BASE COUNT 1085 a 1389 c 1525 g 879 t ORIGIN 1 gagctccagg tccgcgaaaa gaaaggcacg acatgtcagg agaagatgcg gctggcgggc 61 tgtctgtgcg ctccccaagg cccagggaaa gtcacgcgcc tagcacaggg gacgcagggc 121 cgaagccggg agaaggcggt tctgcccgcc acccagcccc ttggctcgcg gttgcggagc 181 cgcgggcacc agtcgctgag tctgcgcgct gaggctcgga cttgttcatt ttttcctagc 241 tagacaaagt tatgggattc ctagcagccc ccaggcgcgc ggccagagtg gccccgcgtt 301 gcgctgggac ggagcgacac gctgtgtgtg tcctcgcctc gcctcgccgc gcgggtgcgc 361 gcgtggtggg ggtgcaggtc cgtcctgctg gcggcctgtg ggctcgcccc tcccctatcc 421 atttctctcc cccacacacg ctcataaaca caggacacgt ggaaattctc agccaaagcg 481 tcttgcccgg gcgggtgctc aacgtggtga tttataaaca gctcagccac tctccaaaga 541 gcatgactcc ttgttagtca tgatgaattt tctggcccgc tccagtctgc tgtctggatt 601 cagatgctca gaggccccaa aatgctccct ctgagcgttg atttggctgc cctctcaagc 661 ctacgaccgc cggcaaaacc agcgatagag cgaggggtga gaggtgggga cgcgatccag 721 gttccctggg gtctgtcatt cccttcccgg cagtaactgt tgattcgctt tacagcgccc 781 atgtacacac ccggccctgc gtggcggtca cgccgctcca gcctgggtcc gcgcccaagt 841 tctagctacc gggcttgcag gctcggaaac tcgcaggcgg cacctgctct gggcggctaa 901 agaactgatg gcgcgacgcg cccgcgctcc gtagagcggc gctgcgggtg catggtccca 961 tagtcttgct ctggagttca ctgctgaaag ctgagctggg agcttggcca gggagcgcca 1021 aagacataga ctaaccccac ccgctagcgc ccaccgaagt tactgattag gaaggttccc 1081 ctggcatcag cccatttgct tcacacactc ttcctccttc ccaaggggct aacggaaaag 1141 ttgcacctac cagcggtagg agcacgggga atcgcgcacg caggccctcc gcgcggcttc 1201 ccggagcggc ctttggcgac tgcgccgccc ctacaaggcc gcacacccct cgccagcact 1261 ccggcagcct cgcacaagac ggagcccacg caggcttcta gttttctaaa ccaaaaactg 1321 cacgcgccgg cgtgtgaatc tgacggagag acttttagag ttttgtttgg ttttcagttc 1381 agaccgctac gctgtcagag caagcaactc cgagccttga tgggaatgac ttacaagccg 1441 cggtgcgggc tgagcctgcc tctctcgggg ttctttcccc aagggagtgg gagacaattg 1501 ggcctggaac ctaaaaggaa gagagcctgt gctagccgcg gggtagaggg gaaggagaag 1561 agctttgggc cgaagtgtcc gcacaccctg gagcaagaga cacttgtgca ctgaaattct 1621 aaccgtggga tgcaaggaga gttgaacagg ggaacaggtc tggctcttgg ctctaggttt 1681 aagagataca cactgtgtgt gtgtccctag gtgaataaaa atccgactcc cgctggctcg 1741 gtgcgaccca gcttttccct taccccctcc gtttccccat cttaaagccc tattgggaaa 1801 cgaaagtgga caccactgcc tcccgccccc caccccaatc caagtcttcc aagtgtctgt 1861 ggccgataag agcaccggga ccgcccccct gccggtcttc tctgcctggg agacagatgg 1921 ggggcggggc ccatccgaga gagggcggag gaggccccgg gtgaggaaga agcggggggg 1981 tgagaactga gagaatctga atcgggaggc gaaggggacg gggaggaggg ctaggaggac 2041 tccgagcccg ggggaggggg agggagtagc tctgcgccaa tcagtgcgcc ggcctgggag 2101 gttgctagcg gtatccacgt aaatcaaagg gcgcggagcc aatgggaggg ggcggagggg 2161 gcggggccca ggcgcgtgcc gctgcgagcc ggcgctgcca agagagcggg agagagctgg 2221 agagagcagg gagagggggg agcgccgagc tagtcagaga gtgagcgaga gcgagaagga 2281 gggagaggag gagaaagaga gcgagggcgg gcgggaggcg gcggcggcgg cagcagcagc 2341 agtaatagca ggagcagcaa cagaaggcgt cggagcgggc gtcggagctg cccgctgtgg 2401 gagagagagg agacagaaag agcgagcgag gagagggagc ccgaggcgaa aaagtaactg 2461 tcaaatgcgc ggctccttta accggagcgc tcagtccggc tccgagagtc atggcgaccg 2521 cagcgtctaa ccactacagc ctgctcacct ccagcgcctc catcgtgcac gccgagccgc 2581 ccggcggcat gcagcagggc gcggggggct accgcgaagc gcagagcctg gtgcagggcg 2641 actacggcgc tctgcagagc aacggacacc cgctcagcca cgctcaccag tggatcaccg 2701 cgctgtccca cggcggcggc ggcgggggcg gtggcggcgg cggggggggc gggggcggcg 2761 gcgggggcgg cggcgacggc tccccgtggt ccaccagccc cctgggccag ccggacatca 2821 agccctcggt ggtggtgcag cagggcggcc gcggagacga gctgcacggg ccaggcgccc 2881 tgcagcagca gcatcagcag cagcaacagc aacagcagca gcaacagcag caacagcagc 2941 agcagcagca gcaacagcgg ccgccgcatc tggtgcacca cgccgctaac caccacccgg 3001 gacccggggc atggcggacg gcggcggctg cagcgcacct cccaccctcc atgggagcgt 3061 ccaacggcgg cttgctctac tcgcagccca gcttcacggt gaacggcatg ctgggcgccg 3121 gcgggcagcc ggccggtctg caccaccacg gcctgcggga cgcgcacgac gagccacacc 3181 atgccgacca ccacccgcac ccgcactcgc acccacacca gcagccgccg cccccgccgc 3241 ccccgcaggg tccgcctggc cacccaggcg cgcaccacga cccgcactcg gacgaggaca 3301 cgccgacctc ggacgacctg gagcagttcg ccaagcagtt caagcagcgg cggatcaaac 3361 tgggatttac ccaagcggac gtggggctgg ctctgggcac cctgtatggc aacgtgttct 3421 cgcagaccac catctgcagg tttgaggccc tgcagctgag cttcaagaac atgtgcaagc 3481 tgaagccttt gttgaacaag tggttggagg aggcggactc gtcctcgggc agccccacga 3541 gcatagacaa gatcgcagcg caagggcgca agcggaaaaa gcggacctcc atcgaggtga 3601 gcgtcaaggg ggctctggag agccatttcc tcaaatgccc caagccctcg gcccaggaga 3661 tcacctccct cgcggacagc ttacagctgg agaaggaggt ggtgagagtt tggttttgta 3721 acaggagaca gaaagagaaa aggatgaccc ctcccggagg gactctgccg ggcgccgagg 3781 atgtgtacgg ggggagtagg gacactccac cacaccacgg ggtgcagacg cccgtccagt 3841 gaactcgagc tgggggaggg gcagagcgcg gggctccccc tccccttcgg tccttggccc 3901 tttcccggcc ctcttgttcc ctctctaact tctgattgtt cttttatttt taattattat 3961 ttccccgtcc cttaaaaaga caaaaaaaat aaggcaaaag gaaagcaact aagacactgg 4021 actatccttt aaaggtagca ggtgtaatga tgtgttttga cctttgcagg cgagtaacca 4081 ggcaatggag tggagtgtct cctggagaga gtgaggagag tgtgtgatag ctagaaagag 4141 agagagacag agagatggca agcactgaga taaatacctg gcaaaactaa ataaattacc 4201 aaaaaggaaa aaaaatccac caaaccatga taaacacaaa atgcagcttc ctgatgctta 4261 gagttggcac atgctgctgt gtttatttat tgtggattcc catcaggaaa gaggaaaaaa 4321 tacacatgtt ctttcatata ggcaaaattt aaccacataa atttgcactg caagaaaatt 4381 gaagtttacg tgaacaaatt catgagcata ttttctcttt ctccccaccg ttaatttggg 4441 agttgccgtt ttgggggatt ttgttttgct ttgctttatt catcggagag agttgaagcc 4501 agctcttggc cactctccat ttctaatgtt cttgtgttgc cccttcttcg tactgtttgt 4561 gaactttggt taccttcaca ttccccttac gagggtgtaa catctatttg ttcctcttac 4621 caaagcaaaa ggattggctt catacaaaat agacaattct ctgatttcag gaaatgtgca 4681 tggtctaccc gctttatcga aggcaagaat ccggtttgga atataaaaat aagcattggt 4741 tgttcttacc agccacaaag taaacttcat tttcaggcag tgtttctggg ggaggttatg 4801 gagggaagaa aaaagaaaaa tcgatagtga gtgactgatt gcttcatttt atcaggcggg 4861 cccattgtga aagagctc // LOCUS HUMTROC 4567 bp DNA PRI 11-JAN-1991 DEFINITION Human slow twitch skeletal muscle/cardiac muscle troponin C gene, complete cds. ACCESSION M37984 NID g339945 KEYWORDS cardiac muscle troponin C; slow twitch skeletal muscle troponin C. SOURCE Human blood (buffy coat) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4567) AUTHORS Schreier,T., Kedes,L. and Gahlmann,R. TITLE Cloning, structural analysis, and expression of the human slow twitch skeletal muscle/cardiac troponin C gene JOURNAL J. Biol. Chem. 265, 21247-21253 (1990) MEDLINE 91065942 COMMENT Draft entry and computer-readable sequence for [Unpublished (1990)] kindly submitted by R.J.Gahlmann, 23-AUG-1990. University of Southern California School of Medicine HMR413 2011 Zonal Ave. Los Angeles, CA 90033. FEATURES Location/Qualifiers source 1..4567 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood (buffy coat)" exon 1448..1498 /gene="TnC" /note="putative" /number=1 mRNA join(1448..1498,2965..2995,3226..3372,3621..3735, 3952..4088,4173..4377) /partial /gene="TnC" /note="putative" /product="troponin C" gene join(1448..1498,2965..2995,3226..3372,3621..3735, 3952..4088,4173..4377) /gene="TnC" CDS join(1475..1498,2965..2995,3226..3372,3621..3735, 3952..4088,4173..4204) /gene="TnC" /note="putative" /codon_start=1 /product="slow twitch skeletal/cardiac muscle troponin C" /db_xref="PID:g339946" /translation="MDDIYKAAVEQLTEEQKNEFKAAFDIFVLGAEDGCISTKELGKV MRMLGQNPTPEELQEMIDEVDEDGSGTVDFDEFLVMMVRCMKDDSKGKSEEELSDLFR MFDKNADGYIDLDELKIMLQATGETITEDDIEELMKDGDKNNDGRIDYDEFLEFMKGV E" intron 1499..2964 /gene="TnC" /number=1 exon 2965..2995 /gene="TnC" /note="putative" /number=2 intron 2996..3225 /gene="TnC" exon 3226..3372 /gene="TnC" /note="putative" /number=3 intron 3373..3620 /gene="TnC" exon 3621..3735 /gene="TnC" /note="putative" /number=4 intron 3736..3951 /gene="TnC" exon 3952..4088 /gene="TnC" /note="putative" /number=5 intron 4089..4172 /gene="TnC" /number=5 exon 4173..4377 /gene="TnC" /note="putative" /number=6 BASE COUNT 896 a 1291 c 1424 g 956 t ORIGIN 1 cctcgcccgc cccgcgcgtg actgacaggg ccactcaggg cgcgcgtgcg aggtgctcgc 61 ttgggtaatc tacctgcgtg ggcccgccgg cggtaccctg cacagcctgc tagaaactga 121 gaccccgggt ggtgacagct ctggcatcgc ccctgggtcc tcgggaagag gggacagaag 181 gtcccgagtc tcccaggcca cacgaagcaa gtcactgctc ttcctggcct cagtttactc 241 ctcctgataa aggaggccat aatagtgcct cacctggctg ttggctcttt ctctttaggg 301 caaggcaggt tggaggggaa aataggacct gtgcttaccg ccggagcagg gcgagagtga 361 ttctgggcca gttctgaacc tctctgagat tcggagatct cttgtcagtg gggcttctgg 421 acaactgagt gggctgattg atgcgcggcc cagcacgcgg cccagtgctc gaggcaggga 481 gcgtgtttat caagagggat aaacttgata cgaactctgt acgaaggaag gtgtaggtgg 541 atggaggggt gtgtgctgcc actgagcaca agaacccacg gggtggcctg ccaaagttca 601 aaacgaggga gacaggttga tctggaccca ggaactacag tgctgaatcc taaaccgggg 661 aaagatgaga cctagaagag ggaggtggta acctaattgg agggtgagga gggaaagagc 721 ctgccacaga tggggcatct ataggggtgc tgttgataac agagcagctg acttaagccc 781 gaagtgggta cttctccctg ggcagatggg aggtctggga caggctcctc tggcagaagg 841 gctcctggcc accctgtcct aaggtgggtc agtcacttcc tccttcacca gttccacagc 901 atcttactat gagcttggca ttcgaggctt ctcttggcag ggccctgcac tcctagcctc 961 tccttgcaca ttgcaccccc attccagaga ggtttagtta aaggcggggg ttaccaagtc 1021 agtcagatct tgggcaagtc accactcctc cagagcctca gtttccttat ctggaaagtg 1081 gaggtcatgg caacccgcca acctggttgg atgggagcct gagctgttgt gttgcacctt 1141 gcctggggcc cacgactttg tagctcctgt cctgcactgg gcttatgttt tcattcattc 1201 cagaaacctt ttcagagagt ccctttgggg agtgtggggg acaggaggga aagaaacctg 1261 gtccttgtag ccgttcgtct gctccctgcc ctgggcagag gacatgggga ctcaggccag 1321 cctgagatca ctgggaccag aggaggggct ggaggatact acacgcaggg gtgggctggg 1381 ctgggctggg ctgggccagg aatgcagcgg ggcagggcta tttaagtcaa gggccggctg 1441 gcaaccccag caagctgtcc tgtgagccgc cagcatggat gacatctaca aggctgcggt 1501 gagggacagg gctgggtagg gctggggtgg gcaggcccac tgggggctca ctcagctgag 1561 agtgcggggt tagtagcccc agggaagtgg tggggaccaa ggagaaggcc tacgtgcctt 1621 caacccaggc cctcacaggg acagtgattc tggtgtttga ggatgcagaa gggggtaggg 1681 ggttccgggt ctgaagggtg gtggaggagg ttgcagcttt ctgatcgtgt ctcactctct 1741 gtttccaagt gtctgtggtc tgtggcactg tcgctcagcc acatgtctct gcatttgtct 1801 ctggacgttt ttgccttcct cttttcatct cttcctcctg agctgtctga gtccccatta 1861 ctgtctccct gtccccaacc cccactttct gcccctcaca ttctgcttct cacatgctca 1921 aaatctgcca cccactccag cccttggcgg gccgaagatg cttggagggt ggagggtgtg 1981 agaggagggg tctgtagagc ctgagtcctg ggctggagat ggggctttga agtttgaggc 2041 agggaagttc tggacatgag ggagaaccaa ggaagaagga acagagaact ggggccccag 2101 ctcccatcat gcctggcagg ctcagggctc agtggcttag ctaggggtga gagcgaggga 2161 atgagggctg gagagtggtc accccaagcc cctgcaacct cctgggtcac tgagggtctt 2221 cagatgctat tctatcctgg gtggtggtac ctccccaacc cagagcaagg acatcctggc 2281 atggccagct gtccccaggg gaacccctcc ctcagcctcc ctcactcctg ggcagggaag 2341 tgctatagcc agctctgggg gcacgcctgc ttatcctgtg ggagtccatg gagccggggt 2401 ggggacagcc ctccacccag tgcccataca aggcctggcg gagttgggga ctaattttgg 2461 cttctgaggc ggcactagca gccagggggc cagataacgc tgccccctgc atgccaaagt 2521 ccccagaaca atcaccaggt ttcactttgt tcctcgttaa aaatagccca gtggccaccc 2581 tggtcaggtt accgtgggtg gcttgcctgc ctccacactg gttttattat cccaacttag 2641 ggacagctgt ccttccggcc cacccagctt gagtttcatc aggggccgaa agggcattga 2701 gtggtcactg actattgtta ctgagggtca ccttggtcct gaagggggtg cccacctgtc 2761 accctggccc tgagcccagt cgcagtgagg ccagctgggt cacgtcaggg ctttgggggc 2821 agggagggag gactgagacc tccactctgt ggcctggaaa tagccagcct cctccagctc 2881 cagccttctc acctgtggaa tgggttggtt cctacgcagc agctatacct gagtctgaga 2941 ccttgagatt ccctttcctt ctaggtagag cagctgacag aagagcagaa aaatggtgag 3001 aatccctatc acacatgtgg gagaccagcg ggtccaggct ggcatgggga ccccttatca 3061 gaagaggacc ccaggccaga gaccagaggc ttggtccctc ttgctctgcc ctcagagagg 3121 tctccgaggg aggtgggcag gttggcaggt ggccccaggg ttctggccct ccgtggtcct 3181 ggctgctgag ccctgactac cgtgcccccc aacccctgaa cacagagttc aaggcagcct 3241 tcgacatctt cgtgctgggc gctgaggatg gctgcatcag caccaaggag ctgggcaagg 3301 tgatgaggat gctgggccag aaccccaccc ctgaggagct gcaggagatg atcgatgagg 3361 tggacgagga cggtgagccc ccctcctccc caggctccag aagaacccca gctggctggg 3421 ggctggaatg ctggctctgt ttagctggga gcaatttagc ctatccgagc cttggttgcc 3481 tcatctataa aatgggcata agggctacac aagcctggcg tttggtgtga ggatgcggtg 3541 agaacatggg ggttcgtgtc gaaggtgctg cctgcagtac ctaccctggc ctctgtaacg 3601 gccatgctgc ccacccccag gcagcggcac ggtggacttt gatgagttcc tggtcatgat 3661 ggttcggtgc atgaaggacg acagcaaagg gaaatctgag gaggagctgt ctgacctctt 3721 ccgcatgttt gacaagtgag cacgtgaccc ttgacctctg accctgaccc acactcaagc 3781 cgagctgtac aggagggcag tctcagattc caggcctagg gaccctgtgg cctctgcctg 3841 ataggggaga gggatgcccc atctcccagt gtccctgctc tgcctcctgg ggcatgggtg 3901 gggctgcctc atgccctccc cacagcccta ccctgagccc cctccccaca gaaatgctga 3961 tggctacatc gacctggatg agctgaagat aatgctgcag gctacaggcg agaccatcac 4021 ggaggacgac atcgaggagc tcatgaagga cggagacaag aacaacgacg gccgcatcga 4081 ctatgatggt aagcgggtgg gtgggctgat ctcctgcctc catgccctgc ccagccccta 4141 ccctcaaccc acacctgccc ctctttccac agagttcctg gagttcatga agggtgtgga 4201 gtagatgctg accttcaccc agagctgcct atgcccagcc tccaactcca gctgagtcct 4261 ggggttgggg agggggtcgg ggtcccagga cctgagcctg gccatgtcct caaccccaaa 4321 tcccccgact ccctccccag atctgtcctg ggggatgcaa ataaagcctg ctctcccaag 4381 gtctgctatc tggctctggt gtccctgggc cgtggactca tccccaggac ccactcttac 4441 ccaatggccg cttccttccc tgtcctaggc aggctggctg cagagcctgg cgcctgacca 4501 ccgctccaca ctgccttctg caggggggtg agatgagatc ggagactgcc gtgtggcctg 4561 ccctgct // LOCUS HSINT1G 4522 bp DNA PRI 03-JAN-1991 DEFINITION Human int-1 mammary oncogene. ACCESSION X03072 NID g33935 KEYWORDS int-1 oncogene; oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4522) AUTHORS van Ooyen,A., Kwee,V. and Nusse,R. TITLE The nucleotide sequence of the human int-1 mammary oncogene; evolutionary conservation of coding and non-coding sequences JOURNAL EMBO J. 4 (11), 2905-2909 (1985) MEDLINE 86055728 COMMENT Data kindly reviewed (15-JUN-1986) by R. Nusse. FEATURES Location/Qualifiers source 1..4522 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 259..263 /note="pot. TATA-box" CDS join(465..568,1282..1535,2238..2503,2966..3454) /codon_start=1 /product="int-1 protein" /db_xref="PID:g33936" /db_xref="SWISS-PROT:P04628" /translation="MGLWALLPGWVSATLLLALAALPAALAANSSGRWWGIVNVASST NLLTDSKSLQLVLEPSLQLLSRKQRRLIRQNPGILHSVSGGLQSAVRECKWQFRNRRW NCPTAPGPHLFGKIVNRGCRETAFIFAITSAGVTHSVARSCSEGSIESCTCDYRRRGP GGPDWHWGGCSDNIDFGRLFGREFVDSGEKGRDLRFLMNLHNNEAGRTTVFSEMRQEC KCHGMSGSCTVRTCWMRLPTLRAVGDVLRDRFDGASRVLYGNRGSNRASRAELLRLEP EDPAHKPPSPHDLVYFEKSPNFCTYSGRLGTAGTAGRACNSSSPALDGCELLCCGRGH RTRTQRVTERCNCTFHWCCHVSCRNCTHTRVLHECL" intron 569..1281 /note="intron I" intron 1536..2237 /note="intron II" intron 2504..2965 /note="intron III" misc_feature 4410..4415 /note="pot. polyadenylation signal" BASE COUNT 805 a 1523 c 1320 g 874 t ORIGIN 1 cagctgagtg aggcgggcgc gcgtgggagg gtgtcccaag gggaggggtc cgcggccagt 61 gcaggcccgg aggcgggggc caccgggcag ggggcggggg tgagccccga cggccaaccc 121 gtcagctctc ggctcagacg ggcgggaacc acagccccgc tcgctgccca ttgtctgcgc 181 ccctaaccgg tgcgccctgg tgccacagtg cggcccggag gggcagcctc ctcccgtcac 241 ttcagccagc gccgcaacta taagaggcgg tgccgcccgc cgtggccgcc tcagcccacc 301 agccgggacc gcgagccatg ctgtccgccg cccgccccca gggttgttaa agccagactg 361 cgaactctcg ccactgccgc caccgccgcg tcccgtccca ccgtcgcggg caacaaccaa 421 agtcgccgca actgcagcac agagcgggca aagccaggca ggccatgggg ctctgggcgc 481 tgttgcctgg ctgggtttct gctacgctgc tgctggcgct ggccgctctg cccgcagccc 541 tggctgccaa cagcagtggc cgatggtggt aagtgagctg gtgcggggtc gccacttgtc 601 ccgcggcaca gagccagggg ccaaccctac ccagctccca cgctctggga tccgtctgcc 661 gacaggctcc ctccccgctc tgacttccct ccgcgacacc gaagggcgat ctggcatgaa 721 actgccccag actccagctc tgtacaagtg gggcgaatga tccgcccgcg gaggcctaag 781 ataccccagg cagggagccc actctcatct agcaccgccc ttcccctttg agcgccaact 841 ccagcctcac ggcggtggct caccacaggt ttccccacct cgggaagtga agggccagga 901 gttcgcctag aaaggagggg agaagagggt gggactccta agcatttcac gccttgggtg 961 ggcaagaact gcaggccatg attatctcgc tcaggctgac cggaagaggc tcggagatcc 1021 aaggtagaca ctcggtctcc gggtacctcc tctgtccagt ctccggacct agggctcagg 1081 cgagcagccc tgggactact gggcacacac aagtctggac gcccagttct ttcaaattag 1141 tgagcctggg agagcgggta ttattaatct cccgccattc tctccagcca cataccccca 1201 ggaagaggac cgggtggcac agtttttatg gttagggtgc ggatcccctt cctgagcctg 1261 agctatcata cgtcccacca ggggtattgt gaacgtagcc tcctccacga acctgcttac 1321 agactccaag agtctgcaac tggtactcga gcccagtctg cagctgttga gccgcaaaca 1381 gcggcgcctg atacgccaaa atccggggat cctgcacagc gtgagtgggg ggctgcagag 1441 tgccgtgcgc gagtgcaagt ggcagttccg gaatcgccgc tggaactgtc ccactgctcc 1501 agggccccac ctcttcggca agatcgtcaa ccgaggtggg tgcccaggaa ggcgacgctt 1561 ccgggagcag gggaaacgcg gggtcacccc cagggcatgg gcgggcgagt tcagagaagg 1621 tgtcccaggc gcctggaggg tcacacaatc aaccttgcca agtgcctcgt gcccagcgcc 1681 agctcggggc cagacttcta ccaggcgttt tccagccgtg caccctggaa acgaagctta 1741 acttttctga gctactgccc cagataaaga aagtttcggg tcgcggacgc cggctgaccg 1801 ccgctttccc ccagcctctc tcaaaagcgc ctgggaagct gctctctgca ggcgtgtgtc 1861 tggcctctcg cccagcaagg cttgcaccgc caaaatgggc cgaaagtttt gggctgcgaa 1921 gaagtcttgg ggatgtatgg ttcttccgct cccctctctt cggtttgtct ctctggggct 1981 gctccacttc cgctatcgag ccaaaatgcg ccctagaatc tcccagtaag gtgtgattac 2041 gcccgtggac gtggctgcgt gcccacgcac ctgctttctc tactagccct agagaccagc 2101 tttccagcac tgccggccct ggtcctcagg actcaaagtg cggagtcggg ggtgggattc 2161 cggtcccaag cccttcatga gggtgctggc cgcgccccgc gtaccccctc gctgatcccc 2221 gctcccttct cccacaggct gtcgagaaac ggcgtttatc ttcgctatca cctccgccgg 2281 ggtcacccat tcggtggcgc gctcctgctc agaaggttcc atcgaatcct gcacgtgtga 2341 ctaccggcgg cgcggccccg ggggccccga ctggcactgg gggggctgca gcgacaacat 2401 tgacttcggc cgcctcttcg gccgggagtt cgtggactcc ggggagaagg ggcgggacct 2461 gcgcttcctc atgaaccttc acaacaacga ggcaggccgt acggtgagct ttgagaggct 2521 ccgcacccta agcggagcgg caggggccaa cctcgggctg gggaagtgac ggtcggtgag 2581 ataaggcaag gggcaccagg agagggcgtc ctgggagagc cggaggcttg gaacgaagac 2641 ggagaataga ggagacagtg gctgagggca aaggtatgtc tggcccgcgg acaggtagaa 2701 gaggttgcaa atcaagcaca gtctcttcgc tgtacagatt cgaaaaataa gcctgagagg 2761 ccgagactga ctcgccgcgg cggagcaggg ttgggcaggg tttccaaatc tcagcggaac 2821 atttcgcgcc tcccttcccc tgggctcagc taggcctggg cctttgctga ggtccggccc 2881 ccgtggcgtc cgggagaggg cagtgtctgg gagggtgact ctggcccggt gccctgggac 2941 actctttctt cccctatccc cgcagaccgt attctccgag atgcgccagg agtgcaagtg 3001 ccacgggatg tccggctcat gcacggtgcg cacgtgctgg atgcggctgc ccacgctgcg 3061 cgccgtgggc gatgtgctgc gcgaccgctt cgacggcgcc tcgcgcgtcc tgtacggcaa 3121 ccgcggcagc aaccgcgctt cgcgagcgga gctgctgcgc ctggagccgg aagacccggc 3181 ccacaaaccg ccctcccccc acgacctcgt ctacttcgag aaatcgccca acttctgcac 3241 gtacagcgga cgcctgggca cagcaggcac ggcagggcgc gcctgtaaca gctcgtcgcc 3301 cgcgctggac ggctgcgagc tgctctgctg cggcaggggc caccgcacgc gcacgcagcg 3361 cgtcaccgag cgctgcaact gcaccttcca ctggtgctgc cacgtcagct gccgcaactg 3421 cacgcacacg cgcgtactgc acgagtgtct gtgaggcgct gcgcggactc gcccccagga 3481 acgctctcct cgagccctcc cccaaacaga ctcgctagca ctcaagaccc ggttattcgc 3541 ccacccgagt acctccagtc acactccccg cggttcatac gcatcccatc tctcccactt 3601 cctcctacct ggggactcct caaaccactt gcctggggcg gcatgaaccc tcttgccatc 3661 ctgatggacc tgccccggac ctaacctccc tccctctccg cgggagaccc cttgttgcac 3721 tgccccctgc ttggccagga ggtgagagaa ggatgggtcc cctccgccat ggggtcggct 3781 cctgatggtg tcattctgcc tgctccatcg cgccagcgac ctctctgcct ctcttcttcc 3841 cctttgtcct gcgttttctc cgggtcctcc taagtccctt cctattctcc tgccatgggt 3901 gcagaccctg aacccacacc tgggcatcag ggcctttctc ctccccacct gtagctgaag 3961 caggaggtta cagggcaaaa gggcagctgt gatgatgtgg gaatgaggtt gggggaacca 4021 gcagaaatgc ccccattctc ccagtctctg tcgtggagcc attgaacagc tgtgagccat 4081 gcctccctgg gccacctcct accccttcct gtcctgcctc ctcatcagtg tgtaaataat 4141 ttgcactgaa acgtggatac agagccacga gtttggatgt tgtaaataaa actatttatt 4201 gtgctgggtc ccagcctggt ttgcaaagac cacctccaac ccaacccaat ccctctccac 4261 tcttctctcc tttctccctg cagccttttc tggtccctct tctctcctca gtttctcaaa 4321 gatgcgtttg cctcctggaa tcagtatttc cttccactgt agctattagc ggctcctcgc 4381 ccccaccagt gtagcatctt cctctgcaga ataaaatctc tatttttatc gatgacttgg 4441 tggcttttcc ttgaatccag aacacaacct tgtttgtggt gtcccctatc ctcccctttt 4501 accactccca gcttggaagc tt // LOCUS HUMSRI1A 1634 bp DNA PRI 29-DEC-1994 DEFINITION Human somatostatin receptor isoform 1 gene, complete cds. ACCESSION M81829 NID g307433 KEYWORDS somatostatin receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1634) AUTHORS Yamada,Y., Post,S.R., Wang,K., Tager,H.S., Bell,G.I. and Seino,S. TITLE Cloning and functional characterization of a family of human and mouse somatostatin receptors expressed in brain, gastrointestinal tract, and kidney JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (1), 251-255 (1992) MEDLINE 92108031 COMMENT genomic sequence; gene lacks introns. FEATURES Location/Qualifiers source 1..1634 /organism="Homo sapiens" /db_xref="taxon:9606" gene 100..1275 /gene="SSTR2" CDS 100..1275 /gene="SSTR2" /codon_start=1 /db_xref="GDB:G00-134-186" /product="somatostatin receptor isoform 1" /db_xref="PID:g307434" /translation="MFPNGTASSPSSSPSPSPGSCGEGGGSRGPGAGAADGMEEPGRN ASQNGTLSEGQGSAILISFIYSVVCLVGLCGNSMVIYVILRYAKMKTATNIYILNLAI ADELLMLSVPFLVTSTLLRHWPFGALLCRLVLSVDAVNMFTSIYCLTVLSVDRYVAVV HPIKAARYRRPTVAKVVNLGVWVLSLLVILPIVVFSRTAANSDGTVACNMLMPEPAQR WLVGFVLYTFLMGFLLPVGAICLCYVLIIAKMRMVALKAGWQQRKRSERKITLMVMMV VMVFVICWMPFYVVQLVNVFAEQDDATVSQLSVILGYANSCANPILYGFLSDNFKRSF QRILCLSWMDNAAEEPVDYYATALKSRAYSVEDFQPENLESGGVFRNGTCTSRITTL" BASE COUNT 283 a 513 c 495 g 343 t ORIGIN 1 ctgcaggcaa gcggtcgggt ggggagggag ggcgcaggcg gcgggtgcgc gaggagaaag 61 ccccagccct ggcagcccca ctggcccccc tcagctggga tgttccccaa tggcaccgcc 121 tcctctcctt cctcctctcc tagccccagc ccgggcagct gcggcgaagg cggcggcagc 181 aggggccccg gggccggcgc tgcggacggc atggaggagc cagggcgaaa tgcgtcccag 241 aacgggacct tgagcgaggg ccagggcagc gccatcctga tctctttcat ctactccgtg 301 gtgtgcctgg tggggctgtg tgggaactct atggtcatct acgtgatcct gcgctatgcc 361 aagatgaaga cggccaccaa catctacatc ctaaatctgg ccattgctga tgagctgctc 421 atgctcagcg tgcccttcct agtcacctcc acgttgttgc gccactggcc cttcggtgcg 481 ctgctctgcc gcctcgtgct cagcgtggac gcggtcaaca tgttcaccag catctactgt 541 ctgactgtgc tcagcgtgga ccgctacgtg gccgtggtgc atcccatcaa ggcggcccgc 601 taccgccggc ccaccgtggc caaggtagta aacctgggcg tgtgggtgct atcgctgctc 661 gtcatcctgc ccatcgtggt cttctctcgc accgcggcca acagcgacgg cacggtggct 721 tgcaacatgc tcatgccaga gcccgctcaa cgctggctgg tgggcttcgt gttgtacaca 781 tttctcatgg gcttcctgct gcccgtgggg gctatctgcc tgtgctacgt gctcatcatt 841 gctaagatgc gcatggtggc cctcaaggcc ggctggcagc agcgcaagcg ctcggagcgc 901 aagatcacct taatggtgat gatggtggtg atggtgtttg tcatctgctg gatgcctttc 961 tacgtggtgc agctggttaa cgtgtttgct gagcaggacg acgccacggt gagtcagctg 1021 tcggtcatcc tcggctatgc caacagctgc gccaacccca tcctctatgg ctttctctca 1081 gacaacttca agcgctcttt ccaacgcatc ctatgcctca gctggatgga caacgccgcg 1141 gaggagccgg ttgactatta cgccaccgcg ctcaagagcc gtgcctacag tgtggaagac 1201 ttccaacctg agaacctgga gtccggcggc gtcttccgta atggcacctg cacgtcccgg 1261 atcacgacgc tctgagcccg ggccacgcag gggctctgag cccgggccac gcaggggccc 1321 tgagccaaaa gagggggaga atgagaaggg aaggccgggt gcgaaaggga cggtatccag 1381 ggcgccaggg tgctgtcggg ataacgtggg gctaggacac tgacagcctt tgatggagga 1441 acccaagaaa ggcgcgcgac aatggtagaa gtgagagctt tgcttataaa ctgggaaggc 1501 tttcaggcta cctttttctg ggtctcccac tttctgttcc ttcctccact gcgcttgctc 1561 ctctgaccct ccttctattt tccccaccct gcaacttcta tcctttcttc cgcaccgtcc 1621 cgccagtgca gatc // LOCUS HSMIMAR 2100 bp DNA PRI 01-OCT-1996 DEFINITION H. sapiens M1 gene for muscarinic acetylcholine receptor. ACCESSION Y00508 M35128 NID g297405 KEYWORDS M1 gene; M1 muscarinic acetylcholine receptor; muscarinic acetylcholine receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2100) AUTHORS Allard,W.J., Sigal,I.S. and Dixon,R.A. TITLE Sequence of the gene encoding the human M1 muscarinic acetylcholine receptor JOURNAL Nucleic Acids Res. 15 (24), 10604 (1987) MEDLINE 88096607 FEATURES Location/Qualifiers source 1..2100 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="P-23" /clone_lib="human genomic DNA" gene 451..1833 /gene="M1" CDS 451..1833 /gene="M1" /codon_start=1 /product="muscarinic acetylcholine receptor" /db_xref="PID:g297406" /db_xref="SWISS-PROT:P11229" /translation="MNTSAPPAVSPNITVLAPGKGPWQVAFIGITTGLLSLATVTGNL LVLISFKVNTELKTVNNYFLLSLACADLIIGTFSMNLYTTYLLMGHWALGTLACDLWL ALDYVASNASVMNLLLISFDRYFSVTRPLSYRAKRTPRRAALMIGLAWLVSFVLWAPA ILFWQYLVGERTVLAGQCYIQFLSQPIITFGTAMAAFYLPVTVMCTLYWRIYRETENR ARELAALQGSETPGKGGGSSSSSERSQPGAEGSPETPPGRCCRCCRAPRLLQAYSWKE EEEEDEGSMESLTSSEGEEPGSEVVIKMPMVDPEAQAPTKQPPRSSPNTVKRPTKKGR DRAGKGQKPRGKEQLAKRKTFSLVKEKKAARTLSAILLAFILTWTPYNIMVLVSTFCK DCVPETLWELGYWLCYVNSTINPMCYALCNKAFRDTFRLLLLCRWDKRRWRKIPKRPG SVHRTPSRQC" BASE COUNT 458 a 663 c 571 g 408 t ORIGIN 1 agtatagctt ataagtggat gaatgcttga gaagttgcag attatacaaa gtagttccca 61 actcctgcaa cccagtatgt aagatagaat tgtagttaat ttcccagtaa gaaaatgagc 121 ctgagtctga aaggtaaaac tgaatgaagt attcaaaccc tggatcccaa agccactcca 181 cgctgctggc aaatccactt atggctggga aagtgccact gcataaatga ccatgagtgg 241 gcaccggtaa gggagggtga tgctatctgg tctgaagctc tggaagggca agaattacat 301 cccatgcatc ttccaataag gtctatcaga aatgtccagt ggcccaacca aagcccatgt 361 cctctctttt aggtgatgac tttcccctga ggaagccctg tagcgtgcct ggaggaaggg 421 gctctccaac cccagcccca cctagccacc atgaacactt cagccccacc tgctgtcagc 481 cccaacatca ccgtcctggc accaggaaag gggccctggc aagtggcctt cattgggatc 541 accacgggcc tcctgtcgct agccacagtg acaggcaacc tgctggtact catctccttc 601 aaggtcaaca cggagctcaa gacagtcaat aactacttcc tgctgagcct ggcctgtgct 661 gacctcatca tcggtacctt ctccatgaac ctctatacca cgtacctgct catgggccac 721 tgggctctgg gcacgctggc ttgtgacctc tggctggccc tggactatgt ggccagcaat 781 gcctccgtca tgaatctgct gctcatcagc tttgaccgct acttctccgt gactcggccc 841 ctgagctacc gtgccaagcg cacaccccgc cgggcagctc tgatgatcgg cctggcctgg 901 ctggtttcct ttgtgctctg ggccccagcc atcctcttct ggcagtacct ggtaggggag 961 cggacagtgc tagctgggca gtgctacatc cagttcctct cccagcccat catcaccttt 1021 ggcacagcca tggctgcctt ctacctccct gtcacagtca tgtgcacgct ctactggcgc 1081 atctaccggg agacagagaa ccgagcacgg gagctggcag cccttcaggg ctccgagacg 1141 ccaggcaaag ggggtggcag cagcagcagc tcagagaggt ctcagccagg ggctgagggc 1201 tcaccagaga ctcctccagg ccgctgctgc cgctgctgcc gggcccccag gctgctgcag 1261 gcctacagct ggaaggaaga agaggaagag gacgaaggct ccatggagtc cctcacatcc 1321 tcagagggag aggagcctgg ctccgaagtg gtgatcaaga tgccaatggt ggaccccgag 1381 gcacaggccc ccaccaagca gcccccacgg agctccccaa atacagtcaa gaggccgact 1441 aagaaagggc gtgatcgagc tggcaagggc cagaagcccc gtggaaagga gcagctggcc 1501 aagcggaaga ccttctcgct ggtcaaggag aagaaggcgg ctcggaccct gagtgccatc 1561 ctcctggcct tcatcctcac ctggacaccg tacaacatca tggtgctggt gtccacgttc 1621 tgcaaggact gtgttcccga gaccctgtgg gagctgggct actggctgtg ctacgtcaac 1681 agcaccatca accccatgtg ctacgcactc tgcaacaaag ccttccggga cacctttcgc 1741 ctgctgctgc tttgccgctg ggacaagaga cgctggcgca agatccccaa gcgccctggc 1801 tccgtgcacc gcactccctc ccgccaatgc tgatagtccc ctctcctgca tccctccacc 1861 ccagtccccg ggaaaaggcc ggtcggaaga gggcaggggc tgcatcctca gccccagggc 1921 cctgctcagg cctcacctgg cttcccagga ccctgggtca ccttcctggg cagcccagag 1981 agacctgcca actttccaga cttcgctatt cccaggcagg gagggaaacc cggggaactg 2041 gtttttctgt tccctgctgg gtgggaatgc gctcttcaca ggaagaaggc ccgggaggag // LOCUS HSFAU1 2016 bp DNA PRI 21-JUL-1993 DEFINITION H.sapiens fau 1 gene. ACCESSION X65921 S45242 NID g31304 KEYWORDS fau 1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2016) AUTHORS Kas,K. TITLE Direct Submission JOURNAL Submitted (29-APR-1992) K. Kas, University of Antwerp, Dept of Biochemistry T3.22, Universiteitsplein 1, 2610 Wilrijk, BELGIUM REFERENCE 2 (bases 1 to 2016) AUTHORS Kas,K., Michiels,L. and Merregaert,J. TITLE Genomic structure and expression of the human fau gene: encoding the ribosomal protein S30 fused to a ubiquitin-like protein JOURNAL Biochem. Biophys. Res. Commun. 187 (2), 927-933 (1992) MEDLINE 92412144 FEATURES Location/Qualifiers source 1..2016 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="CML cosmid" /clone="15.1" mRNA join(408..504,774..856,951..1095,1557..1612,1787..>1912) /gene="fau 1" exon 408..504 /gene="fau 1" /number=1 gene 408..1912 /gene="fau 1" intron 505..773 /gene="fau 1" /number=1 exon 774..856 /gene="fau 1" /number=2 CDS join(782..856,951..1095,1557..1612,1787..1912) /gene="fau 1" /codon_start=1 /db_xref="PID:g31305" /db_xref="SWISS-PROT:P35544" /db_xref="SWISS-PROT:Q05472" /translation="MQLFVRAQELHTFEVTGQETVAQIKAHVASLEGIAPEDQVVLLA GAPLEDEATLGQCGVEALTTLEVAGRMLGGKVHGSLARAGKVRGQTPKVAKQEKKKKK TGRAKRRMQYNRRFVNVVPTFGKKKGPNANS" intron 857..950 /gene="fau 1" /number=2 exon 951..1095 /gene="fau 1" /number=3 intron 1096..1556 /gene="fau 1" /number=3 exon 1557..1612 /gene="fau 1" /number=4 intron 1613..1786 /gene="fau 1" /number=4 exon 1787..>1912 /gene="fau 1" /number=5 polyA_signal 1938..1943 BASE COUNT 421 a 562 c 538 g 495 t ORIGIN 1 ctaccatttt ccctctcgat tctatatgta cactcgggac aagttctcct gatcgaaaac 61 ggcaaaacta aggccccaag taggaatgcc ttagttttcg gggttaacaa tgattaacac 121 tgagcctcac acccacgcga tgccctcagc tcctcgctca gcgctctcac caacagccgt 181 agcccgcagc cccgctggac accggttctc catccccgca gcgtagcccg gaacatggta 241 gctgccatct ttacctgcta cgccagcctt ctgtgcgcgc aactgtctgg tcccgccccg 301 tcctgcgcga gctgctgccc aggcaggttc gccggtgcga gcgtaaaggg gcggagctag 361 gactgccttg ggcggtacaa atagcaggga accgcgcggt cgctcagcag tgacgtgaca 421 cgcagcccac ggtctgtact gacgcgccct cgcttcttcc tctttctcga ctccatcttc 481 gcggtagctg ggaccgccgt tcaggtaaga atggggcctt ggctggatcc gaagggcttg 541 tagcaggttg gctgcggggt cagaaggcgc ggggggaacc gaagaacggg gcctgctccg 601 tggccctgct ccagtcccta tccgaactcc ttgggaggca ctggccttcc gcacgtgagc 661 cgccgcgacc accatcccgt cgcgatcgtt tctggaccgc tttccactcc caaatctcct 721 ttatcccaga gcatttcttg gcttctctta caagccgtct tttctttact cagtcgccaa 781 tatgcagctc tttgtccgcg cccaggagct acacaccttc gaggtgaccg gccaggaaac 841 ggtcgcccag atcaaggtaa ggctgcttgg tgcgccctgg gttccatttt cttgtgctct 901 tcactctcgc ggcccgaggg aacgcttacg agccttatct ttccctgtag gctcatgtag 961 cctcactgga gggcattgcc ccggaagatc aagtcgtgct cctggcaggc gcgcccctgg 1021 aggatgaggc cactctgggc cagtgcgggg tggaggccct gactaccctg gaagtagcag 1081 gccgcatgct tggaggtgag tgagagagga atgttctttg aagtaccggt aagcgtctag 1141 tgagtgtggg gtgcatagtc ctgacagctg agtgtcacac ctatggtaat agagtacttc 1201 tcactgtctt cagttcagag tgattcttcc tgtttacatc cctcatgttg aacacagacg 1261 tccatgggag actgagccag agtgtagttg tatttcagtc acatcacgag atcctagtct 1321 ggttatcagc ttccacacta aaaattaggt cagaccaggc cccaaagtgc tctataaatt 1381 agaagctgga agatcctgaa atgaaactta agatttcaag gtcaaatatc tgcaactttg 1441 ttctcattac ctattgggcg cagcttctct ttaaaggctt gaattgagaa aagaggggtt 1501 ctgctgggtg gcaccttctt gctcttacct gctggtgcct tcctttccca ctacaggtaa 1561 agtccatggt tccctggccc gtgctggaaa agtgagaggt cagactccta aggtgagtga 1621 gagtattagt ggtcatggtg ttaggacttt ttttcctttc acagctaaac caagtccctg 1681 ggctcttact cggtttgcct tctccctccc tggagatgag cctgagggaa gggatgctag 1741 gtgtggaaga caggaaccag ggcctgatta accttccctt ctccaggtgg ccaaacagga 1801 gaagaagaag aagaagacag gtcgggctaa gcggcggatg cagtacaacc ggcgctttgt 1861 caacgttgtg cccacctttg gcaagaagaa gggccccaat gccaactctt aagtcttttg 1921 taattctggc tttctctaat aaaaaagcca cttagttcag tcatcgcatt gtttcatctt 1981 tacttgcaag gcctcaggga gaggtgtgct tctcgg // LOCUS HUMCRYABA 4206 bp DNA PRI 01-NOV-1994 DEFINITION Human alpha-B-crystallin gene, 5' end. ACCESSION M28638 NID g181075 KEYWORDS alpha-crystallin; crystallin. SOURCE Human DNA, clones 730 and cp8. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4206) AUTHORS Dubin,R.A., Ally,A.H., Chung,S. and Piatigorsky,J. TITLE Human alpha B-crystallin gene and preferential promoter function in lens JOURNAL Genomics 7 (4), 594-601 (1990) MEDLINE 90353958 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.A.Dubin, 04-OCT-1989. FEATURES Location/Qualifiers source 1..4206 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q22.3-q23.1" TATA_signal 924..930 prim_transcript 949..>4040 /note="A-B2-cry mRNA and introns" exon <999..1199 /gene="CRYA2" /note="alpha-B2-crystallin; G00-119-805" /number=1 gene 999..1199 /gene="CRYA2" CDS join(999..1199,2274..2396,3754..3957) /note="alpha-B2-crystallin" /codon_start=1 /db_xref="PID:g181076" /translation="MDIAIHHPWIRRPFFPFHSPSRLFDQFFGEHLLESDLFPTSTSL SPFYLRPPSFLRAPSWFDTGLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIEVHG KHEERQDEHGFISREFHRKYRIPADVDPLTITSSLSSDGVLTVNGPRKQVSGPERTIP ITREEKPAVTAAPKK" intron 1200..2273 /note="A-B2-cry, intron A" exon 2274..2396 /number=2 intron 2397..3753 /note="A-B2-cry, intron B" exon 3754..>3957 /note="alpha-B2-crystallin" /number=3 polyA_signal 4075..4080 BASE COUNT 1028 a 1041 c 985 g 1152 t ORIGIN 1 gtcgacacca cccaaaatag tgccgagcct cttggggggg gaggggctgg gagtgggggc 61 cctgagtgag agcaacgagg gtgtgaccag cgccgcccgg acccctagtc ccctcccccg 121 cacactcttc agctgtcgca gggggcctga gaggacagct gagggtcctg gctgggaacg 181 agctggggag ggggagctgg tggtgcctgg ggcatgaaga ggcctcgctg agaccctcac 241 aaacggtttg cacgtttcca cacctcattt tctcctcttc ggtggcaggc actgtgcacc 301 caattcctaa agcactcctg gatttaatgt tctgagagcc acatagaacg aaagatgcaa 361 gaaatctgtt tgctcttttt tcagggggtg gggtctttct gcccagatgt gggatcctct 421 cctaaaccca ggtcaaccca gggcacgagg cagatggctg gtgctgacat gttgaccatc 481 actgctctct tccaaggact cacaaagagt taatgtccct ggggctcagc ctaggaagat 541 tccagtccct gcccaggccc aagatagttg ctggcctgat tcccctggca ttcaggactg 601 gaaaggagga ggaggggcac actacgccgg ctcccatcct ccccccaccc cgcgtgcctg 661 cttgggattc ctgactctgt accagcttca gagaacaggg gtgggggtgg gtgccattgg 721 gtgtggacag aaagctagtg aaacaagacc atgacaagtc actggccggc tcagacgtgt 781 ttgtgtctct cttttcttag ctcagtgagt actgggtatg tgtcacattg ccaaatcccg 841 gatcacaagt ctccatgaac tgctggtgag ctaggataat aaaacccctg acatcaccat 901 tccagaagct tcacaagact gcatatataa ggggctggct gtagctgcag ctgaaggagc 961 tgaccagcca gctgacccct cacactcacc tagccaccat ggacatcgcc atccaccacc 1021 cctggatccg ccgccccttc tttcctttcc actcccccag ccgcctcttt gaccagttct 1081 tcggagagca cctgttggag tctgatcttt tcccgacgtc tacttccctg agtcccttct 1141 accttcggcc accctccttc ctgcgggcac ccagctggtt tgacactgga ctctcagagg 1201 tgagtctccc cacagctagg acgggagagt ccttactgga acctcctgga aacttctcca 1261 tccattttcc tttcctaccc tgcctaaacc attttaggca catgtgtgtc caaatgtgaa 1321 gaaaaatgag gaggttgcta gtgccttcct cccccatcac ctgtttctat ttgatagtcc 1381 tctgtatccc atttattaca ttttttcatg cactgtcaag tttatcctcc gtcccctaac 1441 ttctctacag gatacccctt tctggtttgg ttcatgacaa tctgcaggga aagagctgcc 1501 ttcaaactcc tttgcttatc tcttccaaca ccttggactc ttgaccgatt ttaccatctc 1561 aggtttcaga gccaggagag agccctgcct catcctgagc tgttcatccc catgggtatt 1621 ttctgccttt ctattccctc ttctatgatt ttctgggttt ctcagggcta cgacagggcg 1681 ctggcctggg tccaatcaag ccctacgagg aaacaatata gggacgccca tttgtcctaa 1741 gagggtggaa gaacagggtg aacaaataag gttgacagag ctgtcacaga taacactctg 1801 gtttaaaaat attcaagtgt gagtaaacag gagctgagtg ggcaagggct ttggaaggac 1861 aagcaggacc agcagaacat tccagattgg gtgggtggaa aactggcaaa gagacctgag 1921 ccagaagaag aggcctttgt ctcacagaca aaccacaaag ccaggcattg gagtcagaga 1981 ggcagcagat gccaggcttg cacccatcct tgcgactggt cccctgggtg atctgtcttc 2041 ttctctgtcc ctgtaaataa agtttgggtc tgatcaccat gagccttagg tatcactgtg 2101 gtggctccct gaagcagaca gctatgttta tttaaaaagg agatttttta agcagagaag 2161 agaaggatga attacccgga cagaaagcag ctctgcagaa taagacagca cctgtgtaat 2221 cagtattttt gccctctttc tcccatccca ttcccttacc ttgctatttc tagatgcgcc 2281 tggagaagga caggttctct gtcaacctgg atgtgaagca cttctcccca gaggaactca 2341 aagttaaggt gttgggagat gtgattgagg tgcatggaaa acatgaagag cgccaggtat 2401 gtagcttgtt tttttgtttt ctgctcattc attcagtgat actgtaatag tccaggtagt 2461 gctatcagct ttggaggctg gctacattcc agtcccaagc cataacagtc gggatcaggg 2521 gttacaaatc aatgtctaga agactaagtt aggatagaca tattgctgtt gttactatta 2581 tggccagaga tgtggccttt gatttgatcg ccttagatgg gatgatggga tgctgatgcc 2641 ccatttaagc cagtggttct gaatctgggc cacattagaa tcaccagggg aactttcaaa 2701 aacctaatgc tcgggcatcc tccagaccaa ttagcatatg tgctgccgaa gcgagcacta 2761 ctccagacca attaaatcag catttttaag ggtgggaccc aggcatcagc aatttttaag 2821 gtaattctaa tctacagtca aggttgagaa ccactgatta ggtatagggc tgtcagacac 2881 ctagttgctt tgcataatta cattaactac aggtacccta aaagcacttg agttgtgact 2941 tctcttttag ctgtgcaaga atccgtgtct cttctttagc ccatcttaat gctgaactac 3001 ttggtttgtc taaatttcag agctgtgctc agtctttaat cccctacagc ccatgtggta 3061 atcagttaac gagagcctgt ttggctacat gcttgagagt cagcaggcat acgggttaag 3121 gtcatctact ctttggggga gttctgacaa atggaacagc ttgttatgac tttataagag 3181 ggctttaaaa ttgcttctca ccatttaacg atagctcaga acctgtgcgt caaccagtac 3241 agtttgtcct cagtaatgtc ctcaggctgt ttcaattttg cttatatgat ttaggtttgg 3301 gtcatagtct ccttggatgg agtcattttt tttttttttt aatttcagca gcagtcctat 3361 tgttctggaa ccttctggga cattcctgaa gagtcaggac aatttcaggg cttcctcagg 3421 gactcagatt ctaaatgaga ttccaaattc tgtaggccca gccaacattg atctaaacct 3481 ttgggaaata cccctaaaca tatctatgcc tcagggtttg aaaaacaatg aagtgttgga 3541 ctgtttcaga cttctcagat tctcactggt aggagtgact acctaggcaa tttcatctta 3601 gctgcaaccc tgaaacgaag ctctatttat ttttcctatg ttgtcatggc atttggtctc 3661 acctaagggg aaatcaggat gcctgagttc tgggcaggtg ataatagttc ctgttcttat 3721 ctctctgcct ctttcctcat tcttttgggt taggatgaac atggtttcat ctccagggag 3781 ttccacagga aataccggat cccagctgat gtagaccctc tcaccattac ttcatccctg 3841 tcatctgatg gggtcctcac tgtgaatgga ccaaggaaac aggtctctgg ccctgagcgc 3901 accattccca tcacccgtga agagaagcct gctgtcaccg cagcccccaa gaaatagatg 3961 ccctttcttg aattgcattt tttaaaacaa gaaagtttcc ccaccagtga atgaaagtct 4021 tgtgactagt gctgaagctt attaatgcta agggcaggcc caaattatca agctaataaa 4081 atatcattca gcaacagata actgtcttgt gtttgaatat tccacacact tttaaataaa 4141 tatacagata ccacagatct atttatgatt gcattatgat ttagagggct ccaaggattt 4201 tagagt // LOCUS HSENO3 7194 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens ENO3 gene for muscle specific enolase. ACCESSION X56832 NID g31166 KEYWORDS beta-enolase; ENO3 gene; enolase; glycolytic enzyme; isoform; muscle specific enolase; muscle specific protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7194) AUTHORS Feo,S. TITLE Direct Submission JOURNAL Submitted (21-NOV-1990) S. Feo, ISTITUTO DI BIOLOGIA DELLO SVILUPPO, DEL CONSIGLIO NAZIONALE DELLE RICERCHE, VIA ARCHIRAFI 20, 90123 PALERMO, ITALY REFERENCE 2 (bases 1 to 7194) AUTHORS Giallongo,A., Venturella,S., Oliva,D., Barbieri,G., Rubino,P. and Feo,S. TITLE Structural features of the human gene for muscle-specific enolase. Differential splicing in the 5'-untranslated sequence generates two forms of mRNA JOURNAL Eur. J. Biochem. 214 (2), 367-374 (1993) MEDLINE 93292497 REMARK Erratum:[Eur J Biochem 1993 Dec 15;218(3):1095] COMMENT Related sequences: X16504 and X51957. FEATURES Location/Qualifiers source 1..7194 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /haplotype="2N" /dev_stage="adult" /tissue_type="haematopoietic" /cell_type="T-cell" /cell_line="Jurkat-T" /clone_lib="partial Sau3A genomic library in EMBL-3" /clone="lambda B.21" /chromosome="17" /map="p11" TATA_signal 837..842 mRNA join(868..973,1577..1663,2540..2635,2796..2854,3016..3085, 3455..3588,4820..5042,5153..5350,5688..5889,6318..6426, 6576..6634,6723..6872) /gene="ENO3" gene 868..7194 /gene="ENO3" prim_transcript 868..>7194 /gene="ENO3" exon 868..973 /gene="ENO3" /number=1 intron 974..1576 /gene="ENO3" /number=1 exon 1577..1663 /gene="ENO3" /number=2 CDS join(1579..1663,2540..2635,2796..2854,3016..3085, 3455..3588,4820..5042,5153..5350,5688..5889,6318..6426, 6576..6634,6723..6792) /gene="ENO3" /EC_number="4.2.1.11" /codon_start=1 /product="muscle specific enolase" /db_xref="PID:g31167" /db_xref="SWISS-PROT:P13929" /translation="MAMQKIFAREILDSRGNPTVEVDLHTAKGRFRAAVPSGASTGIY EALELRDGDKGRYLGKGVLKAVENINNTLGPALLQKKLSVVDQEKVDKFMIELDGTEN KSKFGANAILGVSLAVCKAGAAEKGVPLYRHIADLAGNPDLILPVPAFNVINGGSHAG NKLAMQEFMILPVGASSFKEAMRIGAEVYHHLKGVIKAKYGKDATNVGDEGGFAPNIL ENNEALELLKTAIQAAGYPDKVVIGMDVAASEFYRNGKYDLDFKSPDDPARHITGEKL GELYKSFIKNYPVVSIEDPFDQDDWATWTSFLSGVNIQIVGDDLTVTNPKRIAQAVEK KACNCLLLKVNQIGSVTESIQACKLAQSNGWGVMVSHRSGETEDTFIADLVVGLCTGQ IKTGAPCRSERLAKYNQLMRIEEALGDKAIFAGRKFRNPKAK" intron 1664..2539 /gene="ENO3" /number=2 repeat_region complement(2096..2368) /note="ALU repeat element" exon 2540..2635 /gene="ENO3" /number=3 intron 2636..2795 /gene="ENO3" /number=3 exon 2796..2854 /gene="ENO3" /number=4 intron 2855..3015 /gene="ENO3" /number=4 exon 3016..3085 /gene="ENO3" /number=5 intron 3086..3454 /gene="ENO3" /number=5 exon 3455..3588 /gene="ENO3" /number=6 intron 3589..4819 /gene="ENO3" /number=6 repeat_region 4379..4662 /note="ALU repeat element" exon 4820..5042 /gene="ENO3" /number=7 intron 5043..5152 /gene="ENO3" /number=7 exon 5153..5350 /gene="ENO3" /number=8 intron 5351..5687 /gene="ENO3" /number=8 exon 5688..5889 /gene="ENO3" /number=9 intron 5890..6317 /gene="ENO3" /number=9 exon 6318..6426 /gene="ENO3" /number=10 intron 6427..6575 /gene="ENO3" /number=10 exon 6576..6634 /gene="ENO3" /number=11 intron 6635..6722 /gene="ENO3" /number=11 exon 6723..6872 /gene="ENO3" /number=12 polyA_signal 6853..6858 /gene="ENO3" polyA_site 6872 /gene="ENO3" BASE COUNT 1553 a 1992 c 1971 g 1678 t ORIGIN 1 agatctctac cgagggcaga gacctacctc cccgcagtgc tacaagtggg gcgccggaag 61 agccccaggc gtgcagaagc tcacaaaagg ccacccgtcc tcggtccatt catttttgtt 121 cactgttgat tcagccccat tcattgatgg gctggggccg tgcgctgagc gcccacagtc 181 gatggggaaa ggggctctga ccgacagtcc ccacgccggg cgacaagtgc tgtcccagcg 241 ttatcagtcg ggcgccttgc cagccgaaag ggcctgtcta aattcgtttc ctgtccccta 301 actcatcccg gcgctggctg gcctggagag ggtaggatgg ggcggcgccg agaatggccg 361 ttatgaggac cctaagaggt gagaccctct cgccttctgg ggtggggggg tcccgtcctt 421 tcccccactg aggacagagg cccgcccagc gatctgagca tgtgtggacg tcaatcttgc 481 agcccctctt ccaggccccc tccccagcct tgcagggctc aggttacccc tggcctttcc 541 taaaggtcac tcattcctct tgacgtttgc aaaaggggaa tgtaatcctg gggtgggggg 601 agacccctca tctgtagccc ctcccttgct cctcccaaag ggtggaatta gaacagggac 661 tgttattggg agacagaaag tgggggatag tagttgacct ttggtaaggg ggcaggtgcc 721 cagggccaga ggcttctgct tcaggctgta gtgggcactt ggctgccagc ccagtgtgaa 781 ggggggagga tggagagaaa gagaggcggg gctggctggg gaccgagtgg ctcagggata 841 aatgcgcagc ctgagagggg gtgagctgac actgtcccag ctgccaccta gactcggagc 901 tccatccaaa cctccagcga agacatccca ggtcgggtga atcttccagc cctgggggtg 961 gaggtagtaa agggtgagca tggtattggc ttggaggaag tgggggacat ttctgctttt 1021 tttcctcctg ggactggaga tgcttgaaaa agctggggga aggggcggct ggagcaagca 1081 gatgggacaa actctgggaa caccgaagga tctagggaaa ggaggctgtg aggagggcag 1141 cagggatgga tagaaaaggg cagctagagc tggaacctga tagggaattg ggggcccaag 1201 gagatttcgg agcaggaaaa tgagaaccag aaaggatttg aaggccacca gccatggaga 1261 acagactgct tgaccagagg ggtggaagga gaaggcctaa gtggaggctt gggggaggtg 1321 ggggcttggt gagcggtggc atcccaggag ctatagataa gaggcccctg gattcttagg 1381 atgggagggt ggaataagag ctgttctgag tgggggaggg ggctgcgcct gcctctttgg 1441 tctgtgacct ttttgtaggg tatttttagc tccagcacct gccttcttgg agtggggaag 1501 aatcttaaag ggcaagggat ttctggttcc ttaagagatc aactgtctac actcactcac 1561 acctcctgtc ctgcagccat ggccatgcag aaaatctttg cccgggaaat cttggactcc 1621 aggggcaacc ccacggtgga ggtggacctg cacacggcca agggtaacac aaggcccatt 1681 ggataggctc gctccgaaga ccccaaccct ttggcctttg cccccagttc tgtgccatat 1741 cctctccttt ctctcgggtt ccctttccca gacttcttcc caagccccct tcttccaacg 1801 tggaaccaga gctggaagct aggagaagca ggagtctctc tctgcactgc ctcctgtccc 1861 tgagctcaga gaggacacct cagccctttg agaggtagag agactccctg tgaggtcctt 1921 ttttttgttt tgttttgttt tgagatggag tctcgctctg tggcccgggc tggagtgcag 1981 tggtgcaatc tcagctcact gcaacctctg cctcccgggt ttaagtgatt cttgtgcctc 2041 agcctcccaa gtagctggga ttacaggcct gtgccatcac gcccagctaa tttttttttt 2101 ttttttgaga tggagtttca ctctgtcgcc caggctggag tgcagtggcg cgatctaggc 2161 tcactgcaag ctccgcctcc cgggttcacg ccattctcct gcctcagcct cccgagtagc 2221 tgggactaca gctaattttt tgtattttta gtagagatgg ggtttcacgg tgtcagccag 2281 gattgtctcg atctcctgat ctcgtgatcc gcccgccttg gcctcccaaa gtgctgggat 2341 tacaggcgtg agccaccgcg cccggcctca gctaattttt gtatttttag cagagacggt 2401 ttcgccatgt tagccaggct ggtcttgaac tcctgatctc aagtgatcca cctgcctagg 2461 cctctcaaag tgctgggatt acaggcatgg gccaccgcgc ccagccatcc ctgtgatctt 2521 ccaattcctc ctgtcccagg ccgattccga gcagctgtgc ccagtggggc ttccacgggt 2581 atctatgagg ctctggaact aagagacgga gacaaaggcc gctacctggg gaaaggtgag 2641 gagacaccag cgcagaagga gcctgtgtgg gcggctttag gacatgggtg cacaatgggt 2701 agaggactgg aacccccaag gctcttgagg agctggggtc cacagaaagg ggcccagttg 2761 attgagctcc aaaactcatc ctctggcctg tctaggagtc ctgaaggctg tggagaacat 2821 caacaatact ctgggccctg ctctgctgca aaaggcaagt ggggaagccc gctcgctgca 2881 gcctcctccc catgcccctg ctccctcagc ccagacaggc ctctcccgaa acattttccc 2941 ttatccttcc cctgcatgtg ccctgacttc tgagaaatct gacctctgct ctcccttctc 3001 aaactcacct tccagaaact aagcgttgtg gatcaagaaa aagttgacaa atttatgatt 3061 gagctagatg ggaccgagaa taagtgtgag tgaagggcta gcggtgggga agggatgagg 3121 tgtgggagag atggcgagag gccatggggt gaggcctgat gggttatttc tgggtccccc 3181 attttgggtc acaccgcagc tggatggatt tgtgttcatt ccacaggcat ttcctatgcc 3241 tgccttctgc caggcctggg ccgggctgtg gacacagaca tgcagcggac acattccccc 3301 gggtgggggt tcccagggct actcaggctg ttggggagag atctgttaac caattcttca 3361 tgctcagtgg tttggagctc tggcgtcttc ctggagtagc caccccaaac cctgcagaag 3421 ctctcatcct ttcttcccgc ttgcctcctt ccagccaagt ttggggccaa tgccatcctg 3481 ggcgtgtcct tggccgtgtg taaggcggga gcagctgaga agggggtccc cctgtaccgc 3541 cacatcgcag atctcgctgg gaaccctgac ctcatactcc cagtgccagt gagtgcagct 3601 acccgccctt cccagatctc gcctggacag agccaacccc gcccagccag gttggccccc 3661 ctggaaaatc cacctttcag accagctcct aagaagcagt ttcctgaaat tgaatctctc 3721 ctgctgtggt cccacccctc cctctcttcc acaatccaaa ccttccctga aggctctatg 3781 tgcttccttc cctgccatat aacccagttc cttcccattc caatcctagg ctcgataact 3841 ccagcctcgt tccacacccc ccaccaccct acacccaaat catattgaga gtttctttga 3901 ggcaaagatc ttggcatttc tttgtatctt cttgagtcct tcttaaacca gtgccttctt 3961 gatgaatgga aactctttga atttaatctc aataataatg ttaccatttc ttgttactta 4021 ctaggaggat tgcataagag catggtctct agagatagat tgctgggttt gaatcctggc 4081 tctgcctgtt attagttgta tgattttagg caagttcttt aactgctctg tctcagtttc 4141 cccatctgta aaatgggggg taataacgat accaatgaat agaggtgttt tgagaatgaa 4201 atgggctaat acatgtaaag tacttagtgc gtggcatgta gtaagtgcta tataaaagtg 4261 ttagctatta tatactaggc acatgctaag aactttacat tcaatttctc taattttcag 4321 caaaagatcc tacagggaag taggtgctat ccccatttta cagaaatgag gaaagagagg 4381 ctgggcacag tcgtcacgcc tgtaatccca gcacttcggg aggccgaggc gggcagatca 4441 caaggtcagg agttcgagac cagcctgacc aacatggtga aaccccgtct ctactaaaaa 4501 tacaaaaatt agccaggcat ggtggcaccc acctatagtc ccagctacta gggaggctaa 4561 ggcaggagaa tcacttgaac ccgggaggcg gaggttgcgc cattgcactc cagcctggtg 4621 acaaagtgag actccgtctc aaaaaaaaaa aaaaaaaaag aaagaaatga ggaaagaggc 4681 tcttgagcaa aactactcat cctagaccac tgagctagta agtagggaag ccaggtttcc 4741 accccaacac cccccgcccc tgtcccttct tgagctctca tgccccggcc caggtccaga 4801 caccctctcc ccatctcagg ccttcaatgt gatcaacggg ggctcccatg ctggaaacaa 4861 gctggccatg caggagttca tgattctgcc tgtgggagcc agctccttca aggaagccat 4921 gcgcattggc gccgaggtct accaccacct caagggggtc atcaaggcca agtatgggaa 4981 ggatgccacc aatgtgggtg atgaaggtgg cttcgcaccc aacatcctgg agaacaatga 5041 gggtcagtgc tgagcaccct ggggggcaga ccccctggat ctccacatgg gcaggggagg 5101 ctgcagacaa ggggacactg gagtctcagg tcctttcttg gtcctccccc agccctggag 5161 ctgctgaaga cggccatcca ggcggctggt tacccagaca aggtggtgat cggcatggat 5221 gtggcagcat ctgagttcta tcgcaatggg aagtacgatc ttgacttcaa gtcgcctgat 5281 gatcccgcac ggcacatcac tggggagaag ctcggagagc tgtataagag ctttatcaag 5341 aactatcctg gtgaggcgtt cgggtgtccc agtgttcctg cccgaatccc gtgcagctgc 5401 ctaatatact gatttcagtg acctgctttg ccatcgactt ggatccttcc aattcttagc 5461 cccattaaaa tccccattta agctcttctg ccctgtcacc cctccatgag gctccttctg 5521 acctctagcc gtgtctctgc cctgtctctg ccttgtctct gccctgtctc tgctctgtct 5581 ctgccctgtc tctgccctgt ctctgccctg tctctgctct gtctctgccc tgtctctgcc 5641 ctgtctctgc tctgtctctg ccctgtctct gctccaaacc ccaccagtgg tctccatcga 5701 agaccccttt gaccaggatg actgggccac ttggacctcc ttcctctcgg gggtgaacat 5761 ccagattgtg ggggatgact tgacagtcac caaccccaag aggattgccc aggccgttga 5821 gaagaaggcc tgcaactgtc tgctgctgaa ggtcaaccag atcggctcgg tgaccgaatc 5881 gatccaggcg tgagtgcctc ctgaccctga ggctcaccat agcctgcctc tgccccagct 5941 ctgcccactc cagctacagt cttacccacc aactccaagc ttacctttcc cctggcacct 6001 gacttaccca caccttgatc tcaagacttt gtttgactaa tccttggcct gactccaaga 6061 gctttgcccc tgtggctctg tcatgacccc ccacccccag cccacaccat tcctctctca 6121 gattccgctg ttctcagcag catctcaaga gcccgaaatc aagataagtc ttttcctctc 6181 ctacttccca aagaacttag tcactgccct ccctgcagag tgcttgctac ccaaaacaga 6241 agggagaggc ccacccaacc cctgctttcc cactgaggag gttctagaag gccacatcaa 6301 atgtcctctc cactcaggtg caaactggct cagtctaatg gctggggggt gatggtgagc 6361 caccgctctg gggagactga ggacacattc attgctgacc ttgtggtggg gctctgcaca 6421 ggacaggtac ttgtagcttc tctctactga gtgtctcacc aagttttctt ggggtccctg 6481 gcctcctgcc tttgaggtta atgctccctt ggggccaggt ccaacccctc ctttccagcc 6541 tcacctaacc ctccaaattc ttcttccctc atcagatcaa gactggcgcc ccctgccgct 6601 cggagcgtct ggccaaatac aaccaactca tgaggtacag cgggaacagt gggcctgggc 6661 attggggtgc tggaggctgt taggttggaa gttcagcagc cctaaccttg cctgcattct 6721 aggatcgagg aggctcttgg ggacaaggca atctttgctg gacgcaagtt ccgtaacccg 6781 aaggccaagt gagaagctgg aggctccagg actccactgg acagacccag gtcttccaga 6841 cctgcttcct gaaataaaca ctggtgccaa ccaagacagc tgtgtgcttc tttgtgggag 6901 ctaggggatg ctggtttctg ggctgagctg gggaaaacag gaggtggcga gatgggggcg 6961 gggaggggag gcaaatagat atgtaacctg ctaggacaaa gaggaggtga gggactcaca 7021 gaggaagctg ggtggttcta gaacaacaga ggtgttcaca tttccaaagg aggtagggga 7081 agcagatgct atatggtaaa atgaagctgc agagttgcca atggagccgg gcatggtggc 7141 gctcgccttg tagtcccaga taccttggga ggctgagttt gagaggatcg cttt // LOCUS HUMPPIB 1083 bp DNA PRI 08-JAN-1995 DEFINITION Homo sapiens 21 kDa protein gene, complete cds, clone D4S234. ACCESSION M98529 NID g190260 KEYWORDS 21 kDa protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1083) AUTHORS Carlock,L., Vo,T., Wisniewski,D. and Lorincz,M. TITLE The Identification of a neuron-specific gene that maps adjacent to the Huntington's Disease marker D4S10 that shows homology to protein phosphatase inhibitors JOURNAL Unpublished (1992) FEATURES Location/Qualifiers source 1..1083 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KO2" /map="4p16.3" gene 223..928 /gene="21 kDa protein" CDS 223..780 /gene="21 kDa protein" /note="putative" /codon_start=1 /function="unknown" /product="21 kDa protein" /db_xref="PID:g190261" /translation="MVKLGNNFAEKGTKQPLLEDGFDTIPLMTPLDVNQLQFPPPDKV VVKTKTEYEPDRKKGKARPPQIAEFTVSITEGVTERFKVSVLVLFALAFLTCVVFLVV YKVYKYDRACPDGFVLKNTQCIPEGLESYYAEQDSSAREKFYTVINHYNLAKQSITRS VSPWMSVLSEEKLSEQETEAAEKSA" polyA_signal 923..928 /gene="21 kDa protein" BASE COUNT 258 a 312 c 277 g 236 t ORIGIN 1 ataatctaga aatacacagc accacccgac ccccgcatcg ggccgtgacc accgcgtccc 61 cacgagccct ccccgagacg aagcggggcc ggggagctcg cggacgccgg gacgccggtg 121 ggtgtgggtg cccacttccc ccccgccccg ccccgggtct tgcttgtggt gactcccccc 181 ggccctcccg ccgcaggctg cagcctcgga gctcccggaa cgatggtgaa gttggggaac 241 aatttcgcag agaagggcac caagcagccg ctgctggagg atggcttcga caccattccc 301 ctgatgacgc ccctcgatgt caatcagctg cagttcccgc ccccggataa ggtggtcgtg 361 aaaactaaga ccgagtatga acctgaccgc aagaaaggga aagcacgtcc tccccaaatt 421 gctgagttca ccgtcagcat cacggagggt gtcaccgaga ggtttaaggt ctccgtgttg 481 gtcctcttcg ccctggcctt cctcacctgc gtcgtcttcc tggttgtcta caaggtgtac 541 aagtatgacc gcgcctgccc cgatgggttc gtcctcaaga acacccagtg catcccagaa 601 ggcttggaga gctactacgc ggagcaagac tccagtgccc gggagaaatt ttacacagtc 661 ataaaccact acaacctggc caagcagagc atcacgcgct ccgtatcgcc ctggatgtca 721 gttctgtcag aagagaagct gtccgagcag gagactgaag cggctgagaa gtcagcttag 781 cgggatgggc aagttcctta caatgtgtca cttgcaaata acaaagggac tttgagggac 841 atttcattaa atataattac tgatacttta gaggttactc atttacggtg caattgcttc 901 tgtttgctaa tgctgctttg caaataaaac ttgctgccga ccacccacgg gcataaaatc 961 aagtgcattt cagcattgcc taaagagctc tgacaccact tttcatgtta agatcttcat 1021 ttagctcctt tactgggatt tattggatgc tgtaaaaaaa taaatttaca ctggatatgc 1081 gaa // LOCUS HUMKCHN 2397 bp DNA PRI 03-DEC-1997 DEFINITION Homo sapiens voltage-gated potassium channel (HGK5) gene, complete cds. ACCESSION M38217 X57342 NID g186670 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2397) AUTHORS Cai,Y.C., Osborne,P.B., North,R.A., Dooley,D.C. and Douglass,J. TITLE Characterization and functional expression of genomic DNA encoding the human lymphocyte type n potassium channel JOURNAL DNA Cell Biol. 11 (2), 163-172 (1992) MEDLINE 92189730 COMMENT Draft entry and computer-readable sequence for [Unpublished (1990)] kindly submitted by Y.-C.Cai, 30-AUG-1990. Oregon Health Sciences University Vollum Institute Mail Code: L474 3181 S.W. Sam Jackson Park Road Portland, OR 97201-3098 USA. FEATURES Location/Qualifiers source 1..2397 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda-HGK5" /tissue_type="liver" /tissue_lib="lambda-Fix" gene 19..1590 /gene="HGK5" CDS 19..1590 /gene="HGK5" /codon_start=1 /product="voltage-gated potassium channel" /db_xref="PID:g186671" /translation="MTVVPGDHLLEPEVADGGGAPPQGGCGGGGCDRYEPLPPSLPAA GEQDCCGERVVINISGLRFETQLKTLCQFPETLLGDPKRRMRYFDPLRNEYFFDRNRP SFDAILYYYQSGGRIRRPVNVPIDIFSEEIRFYQLGEEAMEKFREDEGFLREEERPLP RRDFQRQVWLLFEYPESSGPARGIAIVSVLVILISIVIFCLETLPEFRDEKDYPASTS QDSFEAAGNSTSGSRAGASSFSDPFFVVETLCIIWFSFELLVRFFACPSKATFSRNIM NLIDIVAIIPYFITLGTELAERQGNGQQAMSLAILRVIRLVRVFRIFKLSRHSKGLQI LGQTLKASMRELGLLIFFLFIGVILFSSAVYFAEADDPTSGFSSIPDAFWWAVVTMTT VGYGDMHPVTIGGKIVGSLCAIAGVLTIALPVPVIVSNFNYFYHRETEGEEQSQYMHV GSCQHLSSSAEELRKARSNSTLSKSEYMVIEEGGMNHSAFPQTPFKTGNSTATCTTNN NPNSCVNIKKIFTDV" BASE COUNT 532 a 675 c 603 g 587 t ORIGIN 1 cgcgagctgc cgcccgacat gaccgtggtg cccggggacc acctgctgga gccggaggtg 61 gccgatggtg gaggggcccc gcctcaaggc ggctgtggcg gcggcggctg cgaccgctac 121 gagccgctgc cgccctcact gccggccgcg ggcgagcagg actgctgcgg ggagcgcgtg 181 gtcatcaaca tctccgggct gcgcttcgag acgcagctga agaccctttg ccagttcccc 241 gagacgctgc tgggcgaccc caagcggcgc atgaggtact tcgacccgct ccgcaacgag 301 tacttcttcg accgcaaccg gcccagcttc gacgccatcc tctactacta tcagtccggg 361 ggccgcatcc gccggccggt caacgtgccc atcgacattt tctccgagga gatccgcttc 421 taccagctgg gcgaggaggc catggagaag ttccgcgagg acgagggctt cctgcgggag 481 gaggagcggc ccttgccccg ccgcgacttc cagcgccagg tgtggctgct cttcgagtac 541 cccgagagct ccgggccggc ccggggcatc gccatcgtgt ccgtgctggt catcctcatc 601 tccattgtca tcttctgcct ggagacgctg ccggagttcc gcgacgagaa ggactacccc 661 gcctcgacgt cgcaggactc attcgaagca gccggcaaca gcacgtcggg gtcccgcgca 721 ggagcctcca gcttctccga tcccttcttc gtggtggaga cgctgtgcat catctggttc 781 tccttcgaac tgctggtgcg gttcttcgct tgtcctagca aagccacctt ctcgcgaaac 841 atcatgaacc tgatcgacat tgtggccatc attccttatt ttatcactct gggtaccgag 901 ctggccgaac gacagggcaa tggacagcag gccatgtctc tggccatcct gagggtcatc 961 cgcctggtaa gggtcttccg catcttcaag ctgtcgcgcc actccaaggg gctgcagatc 1021 ctcgggcaaa cgctgaaggc gtccatgcgg gagctgggat tgctcatctt cttcctcttt 1081 attggggtca tccttttctc cagcgcggtc tactttgccg aggcagacga ccccacttca 1141 ggtttcagca gcatcccgga tgccttctgg tgggcagtgg taaccatgac aacagtgggt 1201 tacggcgata tgcacccagt gaccataggg ggcaagattg tgggatctct ctgtgccatc 1261 gccggtgtct tgaccatcgc attgccagtt cccgtgattg tttccaactt caattacttc 1321 taccaccggg agacagaagg ggaagagcaa tcccagtaca tgcacgtggg aagttgccag 1381 cacctctcct cttcagccga ggagctccga aaagcaagga gtaactcgac tctgagtaag 1441 tcggagtata tggtgatcga agaggggggt atgaaccata gcgctttccc ccagacccct 1501 ttcaaaacgg gcaattccac tgccacctgc accacgaaca ataatcccaa ctcttgtgtc 1561 aacatcaaaa agatattcac cgatgtttaa tatgtgatac aagtgacatg ctgtgctcag 1621 tattgtgtgg aacgtgcccc cttggtctgc ctatgccctt gttttataca tttccagacc 1681 attcatcaag gaaaggacct gaagaagtgg aaagcacact tcattctccc tctccctgct 1741 gcttcatact gaaacaggtg cctgttttgc aagtgggctg cattctctca gctctccttt 1801 tccctcttac cctctctctc tgaacattgt aaacaacaga cttacgttaa acttcatttc 1861 tagtacacgc cctatttaaa aaagagcagt acatcctggg aggaaatgaa actaaagaac 1921 agttagagta actgtttaac ctcagaattt taaaggcagt tgtttctttc ctaagcacat 1981 caattcgtag taaatgatgc ttcggtttga tggacctttc aacgttattt attgaatatg 2041 tatttcggtt gcctaccctg tagatatgtg gatgaagagt ctaactagaa taatgacttg 2101 taaacccacc atgagttatt tggtttttga cttaaattcc tatttgaatc ccctttcccg 2161 gaattttaag tgtctctaca actttgaata aagggaaatg cccaagatgt cctgatctga 2221 ctaattagtt taattctttc gggcttgcta agcatttcta aagcattaga ctaacagatt 2281 cctgtgaagt tcagagcata tgtcccagcc ccaacaacta tcaaagtcta gaaacagatg 2341 ttttcagtgt tgctgagaga aacaaaaaat ttcctaatgc atctgagaga taagctt // LOCUS HUMPCNA 6340 bp DNA PRI 07-JAN-1995 DEFINITION Human proliferating cell nuclear antigen (PCNA) gene, complete cds. ACCESSION J04718 NID g189681 KEYWORDS proliferating cell nuclear antigen. SOURCE Human leukocyte DNA, (library of C.Croce), and cDNA to mRNA, clone EMBL3-S2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6340) AUTHORS Travali,S., Ku,D.H., Rizzo,M.G., Ottavio,L., Baserga,R. and Calabretta,B. TITLE Structure of the human gene for the proliferating cell nuclear antigen JOURNAL J. Biol. Chem. 264 (13), 7466-7472 (1989) MEDLINE 89214190 COMMENT Draft entry and printed copy of sequence [1] kindly provided by D.-H.Ku, 15-MAR-1989. FEATURES Location/Qualifiers source 1..6340 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20pter-p12" misc_signal 748..753 /note="GC box" misc_signal 1090..1095 /note="GC box" misc_signal 1110..1115 /note="GC box" CAAT_signal 1124..1129 misc_signal 1145..1150 /note="GC box" prim_transcript 1268..6191 /note="PCNA mRNA and introns" exon <1427..1647 /gene="PCNA" /note="proliferating cell nuclear antigen (PCNA)" /number=1 gene join(1427..1647,2356..2453,2550..2617,3556..3750, 5636..5759,5846..5924) /gene="PCNA" CDS join(1427..1647,2356..2453,2550..2617,3556..3750, 5636..5759,5846..5925) /partial /gene="PCNA" /note="proliferating cell nuclear antigen (PCNA)" /codon_start=1 /db_xref="GDB:G00-120-261" /db_xref="PID:g387005" /translation="MFEARLVQGSILKKVLEALKDLINEACWDISSSGVNLQSMDSSH VSLVQLTLRSEGFDTYRCDRNLAMGVNLTSMSKILKCAGNEDIITLRAEDNADTLALV FEAPNQEKVSDYEMKLMDLDVEQLGIPEQEYSCVVKMPSGEFARICRDLSHIGDAVVI SCAKDGVKFSASGELGNGNIKLSQTSNVDKEEEAVTIEMNEPVQLTFALRYLNFFTKA TPLSSTVTLSMSADVPLVVEYKIADMGHLKYYLAPKIEDEEGS" intron 1648..2355 /note="PCNA intron A" exon 2356..2453 /gene="PCNA" /note="proliferating cell nuclear antigen" intron 2454..2549 /note="PCNA intron B" exon 2550..2617 /gene="PCNA" /note="proliferating cell nuclear antigen" /number=3 intron 2618..3555 /note="PCNA intron C" exon 3556..3750 /gene="PCNA" /note="proliferating cell nuclear antigen" /number=4 intron 3751..5635 /note="PCNA intron D" exon 5636..5759 /gene="PCNA" /note="proliferating cell nuclear antigen" /number=5 intron 5760..5845 /note="PCNA intron E" exon 5846..>5925 /note="proliferating cell nuclear antigen" /number=6 BASE COUNT 1756 a 1280 c 1457 g 1847 t ORIGIN 1 bp upstream of EcoRI site; chromosome 20pter-20q13. 1 gaattctgct gaccaaggta ttaaaagtaa ctaaagagaa gtggtgtgaa gaaagcaaga 61 gagaaacaac aaatcctgtc catcctgtaa caattgaaaa tttctggctg ggcgtggtgg 121 ctcaggcctg taatcccagc actttgagag gccgaggcag gtggatcacc tgaggtcagg 181 tgttcaagac cagcctggcc aacatggtga aaccccgtct ctactaaaaa aaaataataa 241 taataataca aaaattagcc gggtgtggtg gtaggcacct gtaatcccag atactcggga 301 ggctgaggca ggagactcac ttgaacctgg gaggcggagg ttgcaatgag ctgagatcgc 361 gcgactgtac tccagcctgg atgacagagc aggactccat ctcaaaaagg aaggcgggga 421 aaaggggaaa tattaaatgt gtacgctctt tgactcagct gtattacttc aaggagttga 481 tatcaccaaa attgcctaag tgctcaaagg tgtttgtagt taaacaacag gagattgata 541 aattatgtta tatacatgtg atgctatgtt ttaaagaggt actgatatga taaaaagatg 601 tacgtggcat aaaattaaat gtacttatta agtacttttc caagtgttta cggaatgagt 661 gcatttttga aaaaaaaaaa gtgtattcga acttttaaaa aagctttaaa agctttatac 721 aataacgatt gagtgattat aagagctggc gggggaatgt taagaggatg atagggagct 781 aagtttaaca gaacaattca cctctttatc ttgtgacacc tacgagcgca tcaattctgt 841 aattgaaaaa taaagtgcat atttgcagca gctgtactct cttcaggctg caaggaggct 901 tttcctcccg gtaggcttga tttgcatttc actttcactt tcgtggctgg aaactttcta 961 cccacgtagt gaggctagag gagccaccta aagctggggc ttgacgaagc cgggaccggg 1021 acccgatctc cacatatgcc cggacttctt ctgcggccgg gttcaggagt caaagaggcg 1081 gggagacctg cgcgacgctg ccccgccctg cgcccgcttc ctccaatgta tgctctaggg 1141 ggcgggcctc gcggggagca tggacacgat tggccctaaa gtcttccccg caaggccgtg 1201 ggctggacag cgtggtgacg tcgcaacgcg gcgcagggtg agagcgcgcg cttgcggacg 1261 cggcggcatt aaacggttgc aggcgtagag agtggtcgtt gtctttctag gtctcagccg 1321 gtcgtcgcga cgttcgcccg ctcgctctga ggctcctgaa gccgaaacta gctagacttt 1381 cctccttccc gcctgcctgt agcggcgttg ttgccactcc gccaccatgt tcgaggcgcg 1441 cctggtccag ggctccatcc tcaagaaggt gttggaggca ctcaaggacc tcatcaacga 1501 ggcctgctgg gatattagct ccagcggtgt aaacctgcag agcatggact cgtcccacgt 1561 ctctttggtg cagctcaccc tgcggtctga gggcttcgac acctaccgct gcgaccgcaa 1621 cctggccatg ggcgtgaacc tcaccaggtg agcctcgcgc cccgggaagc cgccccggcc 1681 cgcctgcacc tccggctgtg gcgagcgctt cgagcctagc cctcattggc tggcgtgggc 1741 atccagagct tctcattggc ctgcacgcag tggtggggcc caagctgaga tgagcggtta 1801 cggaaaagcc cgcgctggct gctgcgcgaa cctgcttttt cgcgccaaag tcacaaagcg 1861 ggtggtggcg ggaaaatcaa gggtttttcc gcagtgccag gaacactgtt ccagggactc 1921 tttgctcact aaacctgttg gccttgaatg gacgctttag ctgtggcttt cttgtttctg 1981 agacggtctc ggtctcggtg tgttgcccgg gctggtctcc aacttctggg ctcaagcgat 2041 cctcccggct cagtcgcgtc gactttaaat gctttataat gcccttgcga gaaatgtggc 2101 agcctgtcat cctacttagt ggtaggagat tgtttctatc cagaagggac actgctggtg 2161 gtattttagt ataaatactg ccagatgcgt ccaaaacgtc tgcattaata atggcatcct 2221 ccagcagtcc gtttaccctc caccagttct gagacggcct gacgggtgag agtggtaacc 2281 ccttctaacc gcgttcgaaa tacagccctt cagcagacgg cgttgatttt aaagcatgtg 2341 tctcctgtct tctagtatgt ccaaaatact aaaatgcgcc ggcaatgaag atatcattac 2401 actaagggcc gaagataacg cggatacctt ggcgctagta tttgaagcac caagtaagtc 2461 gtaccttttt accgagtcac gaagctacag gaaaatcaaa actctgtgtg agtagaaact 2521 caaaagctat ctgcgtttct tttggtaaga ccaggagaaa gtttcagact atgaaatgaa 2581 gttgatggat ttagatgttg aacaacttgg aattccagtg agtatcagtt tctcattgta 2641 gagagtgctg tacacaggca cgatagttat gtcatagaat gtttgtttat ttttacagac 2701 agggtcttgg ctctgttgcc caggctggag tgcagtagtg ccatatagct ctctctaacc 2761 tgggattcct gggctcaagc agtcctcttg ccttagtctc ctaagtggct aggaaggact 2821 acgggcctgt cccaccacac ctggctaatt tttttcattt ttgtgtgtgg gacgtggggg 2881 cagtctagcc aggctggctg gaactcctgg cctcaagtga tcctcctccg tcaagatatg 2941 ttaatataat ttaaagccta cttcataaca acttttctag aaatatatct actggtgcat 3001 gtttcaaaga gatgatttta gtatttggat agttgttcac cacaagtcta ataatctcca 3061 caggttaaat ttattgttta tgccagttgt ctatttgcat taacttccat gaactcttta 3121 aattgttctc tagaatgctt gctttttatt aatgaggttt taaagctagc ttgagagaaa 3181 tttatccagg ttaggttata aacaccaaag gagagaagaa atgtttgaat gttgaaaatg 3241 cctaatatat tctcttgctt tcttttagaa agtgattagg cctgcttgcg ccatcatgat 3301 ttctgtgcca tactctaatg ttctcttact ttatccctgg aggatgagga ggaggaggct 3361 cttgttccct ggatggtgca tttaatagcc atttattttt ttgagtggag tttgttaaga 3421 aattacgcaa gtcatatttt aaagtaatca gaaaatatga ttctgagttg tttaggtgtt 3481 gccttttaag aaagtgaggg tgccaaatca ttaaatttct aacaattaac ttttggaaaa 3541 ttttgttctt aataggaaca ggagtacagc tgtgtagtaa agatgccttc tggtgaattt 3601 gcacgtatat gccgagatct cagccatatt ggagatgctg ttgtaatttc ctgtgcaaaa 3661 gacggagtga aattttctgc aagtggagaa cttggaaatg gaaacattaa attgtcacag 3721 acaagtaatg tcgataaaga ggaggaagct gtaagtagtt tttaagtaaa aagaaaatag 3781 tttgaagaga attataatac tgcttattag gttaattgct aaaattaaaa gtagacagaa 3841 ttggatccca agtaatttct gaaaattgag atactgttga aatctgtgaa tgatttataa 3901 gtgtcatcca atttagaatt atatttgcaa gaagggaata caaattcagc acgtgtacat 3961 accacagcaa cagtggttta tggatcaagt ccacaccggc tcttaagggt aggattggga 4021 agttaggcgt ataacttagc ttctggagat acttactctc ttaccaaata attgagcata 4081 ggacagcagc tcaatagaag gatatgttag gagtaaaagt ctacctgttt ggagcactta 4141 tgtaatccta atatagctta catgttgtgg gtccaattgg tagcccattt taaaggtgga 4201 gaagcaggct gagcaacctt aagtgacaat ttagccaaag tcacaggctg taggaatcaa 4261 aggttaaaca ggaaggagac tctcactaag gctagaaagc agactccatg caactttgag 4321 agtacctaga gagaccctta tttaaccaaa atagaaagaa catagcaaaa ccccatctct 4381 cataaaaata taaaaattga ccgggtgatg agtggcgaca cacctgtaaa cccgactact 4441 ggaatgcatg agatgggaga atgacttgaa ccgaggaggc ggaggttgca gtgagcccag 4501 atcatgccac tcccctccag cctgggtgac agagcaagat tccatcttaa acaaacaaaa 4561 aaaactcgct aacctgggca taaattaaaa ctttgtaaat caaggacaaa ggtcctaaac 4621 ctcataactt gcattaggat taaatacggt agcattaaag agcttagcat atctgtgtgt 4681 ggcatattat aagcttacaa taaatactat atattgctct cttgtccctt gaatgggtag 4741 tcaacattta gtttaaataa aggtaaaatt cagttgaaag gttttttttt aaattaataa 4801 agtctaggag ctgattcttt atctgtttcc tgaatcacat ttccactcct gccaacctcg 4861 tttttttctt ttgctgtttt tctttgtttt tgagacaggg cttgctctgt gccacccagg 4921 ctggagtgcg gtggtgcagt cggttcacta cagcctcaaa ctccagggct taagtgatcc 4981 tcctgcctca gtttcccaag agccgggaca caggtgtgtg ccaacacact agcctggttt 5041 ccctaatttc attttcccct tgaccattac aactatttgt tgaagaaatt agatcattta 5101 ttagtttcag agtttggatt ttacctgatt gcattcctgt gtatctaata acctctacct 5161 gtgtgtccta cagactggta gctatagcct ggagccttga tatcagggtg ttttgtttcg 5221 ggggtgagag agcaagaata tggtggtggt gtgtgtgcct ctagtaggag gcacagggtg 5281 tctggatgtg tttgcaatgt tagcagctat aagtcattgt ctagatccat taagtcatta 5341 attagagttt gcagagctga aattaatacg ttttatcact tattggctgc ttattagaaa 5401 acttccataa gaaaagcttc ccattatata atttggttat ctaaattata gctataccaa 5461 aagacaaagg ctagataatc gagtcttttt gcatttatgt atcagtcttc aaaattttca 5521 tagcgtccct ccaaagtgac caatacaagt gtttgtgggt ttttataaat atataatgag 5581 ctaatagatt gcaactttct tgatgttttt caatgatgaa tcttttgttt tgtaggttac 5641 catagagatg aatgaaccag ttcaactaac ttttgcactg aggtacctga acttctttac 5701 aaaagccact ccactctctt caacggtgac actcagtatg tctgcagatg taccccttgg 5761 taagataata aatttgaacc ttgttttgat ggtagtcata tgtgatacat actcctcagt 5821 aattaaccat cttcctgtct ttcagttgta gagtataaaa ttgcggatat gggacactta 5881 aaatactact tggctcccaa gatcgaggat gaagaaggat cttaggcatt cttaaaattc 5941 aagaaaataa aactaagctc tttgagaact gcttctaaga tgccagcata tactgaagtc 6001 ttttctgtca ccaaatttgt acctctaagt acatatgtag atattgtttt ctgtaaataa 6061 cctatttttt ttctctattc tctccaattt gtttaaagaa taaagtccaa agtctgatct 6121 ggtctagtta acctagaagt atttttgtct cttagaaata cttgtgattt ttataataca 6181 aaagggtctt gactctaaat gcagttttaa gaattgtttt tgaatttaaa taaagttact 6241 tgaatttcaa agatcacagg gcagtgtctt catttgacca ggactgttga aagtatccta 6301 ctgaattccc agctacagtc accctttgtt caaactgttc // LOCUS HSU73304 5665 bp DNA PRI 05-NOV-1996 DEFINITION Human CB1 cannabinoid receptor (CNR1) gene, complete cds. ACCESSION U73304 NID g1657840 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5665) AUTHORS Hoehe,M.R., Caenazzo,L., Martinez,M.M., Hsieh,W.T., Modi,W.S., Gershon,E.S. and Bonner,T.I. TITLE Genetic and physical mapping of the human cannabinoid receptor gene to chromosome 6q14-q15 JOURNAL New Biol. 3 (9), 880-885 (1991) MEDLINE 92031291 REFERENCE 2 (bases 1 to 5665) AUTHORS Bonner,T.I. TITLE The coding exon of the human CB1 cannabinoid receptor JOURNAL Unpublished REFERENCE 3 (bases 1 to 5665) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (03-OCT-1996) Lab of Cell Biology, NIMH, Bldg. 36, Rm 3A-17, MSC 4090, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..5665 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q14-15" intron <1..58 /note="5 exons located 19-23 kb upstream" mRNA <59..5530 /gene="CNR1" gene 59..5530 /gene="CNR1" CDS 122..1540 /gene="CNR1" /note="G protein-coupled receptor" /codon_start=1 /product="CB1 cannabinoid receptor" /db_xref="PID:g1657841" /translation="MKSILDGLADTTFRTITTDLLYVGSNDIQYEDIKGDMASKLGYF PQKFPLTSFRGSPFQEKMTAGDNPQLVPADQVNITEFYNKSLSSFKENEENIQCGENF MDIECFMVLNPSQQLAIAVLSLTLGTFTVLENLLVLCVILHSRSLRCRPSYHFIGSLA VADLLGSVIFVYSFIDFHVFHRKDSRNVFLFKLGGVTASFTASVGSLFLTAIDRYISI HRPLAYKRIVTRPKAVVAFCLMWTIAIVIAVLPLLGWNCEKLQSVCSDIFPHIDETYL MFWIGVTSVLLLFIVYAYMYILWKAHSHAVRMIQRGTQKSIIIHTSEDGKVQVTRPDQ ARMDIRLAKTLVLILVVLIICWGPLLAIMVYDVFGKMNKLIKTVFAFCSMLCLLNSTV NPIIYALRSKDLRHAFRSMFPSCEGTAQPLDNSMGDSDCLHKHANNAASVHRAAESCI KSTVKIAKVTMSVSTDTSAEAL" polyA_signal 5499..5504 /gene="CNR1" polyA_site 5530 /gene="CNR1" /note="location established by comparison with ESTs with GenBank Accession Numbers R20626, R42346, H06205, H10202" BASE COUNT 1603 a 1185 c 1118 g 1759 t ORIGIN 1 tttgttttta ttcttcctgt ttctcaccat tcggcttatt tgttttccct cctcttagga 61 ttgccccctg tgggtcactt tctcagtcat tttgagctca gcctaatcaa agactgaggt 121 tatgaagtcg atcctagatg gccttgcaga taccaccttc cgcaccatca ccactgacct 181 cctgtacgtg ggctcaaatg acattcagta cgaagacatc aaaggtgaca tggcatccaa 241 attagggtac ttcccacaga aattcccttt aacttccttt aggggaagtc ccttccaaga 301 gaagatgact gcgggagaca acccccagct agtcccagca gaccaggtga acattacaga 361 attttacaac aagtctctct cgtccttcaa ggagaatgag gagaacatcc agtgtgggga 421 gaacttcatg gacatagagt gtttcatggt cctgaacccc agccagcagc tggccattgc 481 agtcctgtcc ctcacgctgg gcaccttcac ggtcctggag aacctcctgg tgctgtgcgt 541 catcctccac tcccgcagcc tccgctgcag gccttcctac cacttcatcg gcagcctggc 601 ggtggcagac ctcctgggga gtgtcatttt tgtctacagc ttcattgact tccacgtgtt 661 ccaccgcaaa gatagccgca acgtgtttct gttcaaactg ggtggggtca cggcctcctt 721 cactgcctcc gtgggcagcc tgttcctcac agccatcgac aggtacatat ccattcacag 781 gcccctggcc tataagagga ttgtcaccag gcccaaggcc gtggtagcgt tttgcctgat 841 gtggaccata gccattgtga tcgccgtgct gcctctcctg ggctggaact gcgagaaact 901 gcaatctgtt tgctcagaca ttttcccaca cattgatgaa acctacctga tgttctggat 961 cggggtcacc agcgtactgc ttctgttcat cgtgtatgcg tacatgtata ttctctggaa 1021 ggctcacagc cacgccgtcc gcatgattca gcgtggcacc cagaagagca tcatcatcca 1081 cacgtctgag gatgggaagg tacaggtgac ccggccagac caagcccgca tggacattag 1141 gttagccaag accctggtcc tgatcctggt ggtgttgatc atctgctggg gccctctgct 1201 tgcaatcatg gtgtatgatg tctttgggaa gatgaacaag ctcattaaga cggtgtttgc 1261 attctgcagt atgctctgcc tgctgaactc caccgtgaac cccatcatct atgctctgag 1321 gagtaaggac ctgcgacacg ctttccggag catgtttccc tcttgtgaag gcactgcgca 1381 gcctctggat aacagcatgg gggactcgga ctgcctgcac aaacacgcaa acaatgcagc 1441 cagtgttcac agggccgcag aaagctgcat caagagcacg gtcaagattg ccaaggtaac 1501 catgtctgtg tccacagaca cgtctgccga ggctctgtga gcctgatgcc tccctggcag 1561 cacaggaaaa gaattttttt ttttaagctc aaaatctaga agagtctatt gtctccttgg 1621 ttatattttt ttaactttac catgctcaat gaaaaggtga ttgtcaccat gatcacttat 1681 cagtttgcta atgtttccat agtttaggta ctcaaactcc attctccagg ggtttacagt 1741 gaagaaagcc tgttgtttaa gtgactgaac gatccttcaa agtctcaatg aaataggagg 1801 gaaacctttg gctacacaat tggaagtcta agaacccatg gaaaaatgcc atcaaatgaa 1861 taatgccttt gtaaccacaa ctttcactat aatgtgaaat gtaactgtcc gtagtatcag 1921 agatgtccat ttttacaagt tatagtacta gagatatttt gtaaaatgta ttatgtcctg 1981 tgagatgtgt atcagtgttt atgtgctatt aatatttgtt tagttcagca aaactgaaag 2041 gtagactttt atgagaacaa tggacaagca gtggatacgt gtcaatgtgt gcactttttt 2101 tctatattat tgcccatgat ataactttag aaataaacct taatatttct tcaaatatct 2161 ctatttaatt ttgacactga aataaccgta aaggtttatt tttctgttac ctcaacaaga 2221 agaatttgaa gacttcaaaa tattgagcag aattcattca tacttaaaaa tttattagcc 2281 ctgcattttc ataggaagac acattatctt ctggactata gctgttctaa tggattataa 2341 tcagaatgga agagagaaag catattgact ttttttgagc gacatctctg actttcttta 2401 gtctttagct attactggat ctcttaagac agcatgtgtt aatcttaatg tatatcgtta 2461 tcactgtgca gttgctgttt acttgaatag tattgtgttc ctatattcca ggtttaagta 2521 gatttcatgc ctgggtggcc aaacaacagt cttcattttt tttaattgaa aagaagtagt 2581 gtctggatca gtaaaattat actgtgtgtg agtgtgaata taaatgtgtg tatgtgtgtt 2641 tctgtccgta actgttacag taatgtcata aagtgagaaa actgtgacca agtataaact 2701 tttaccactt gctgcactct tgcacatgga ttcagtttct aaaattgagt tcttcctgta 2761 atcttgttga taaaaatact gactccaacc attcaaaaat ttcaccccat ccctccttaa 2821 gagattggat caagtattac taaattgacc tttaggtatt acacaagacc agtgcttagc 2881 aaaaaataat gacaggcatc caaggaaggg atgtatttgt agtgttattg ccaggaaagg 2941 agagtacttt ggtttctgag caccgaatat tgagcaatat gtcagtcact aaaaggaaga 3001 cagttctaca gaaaaacaaa tggtaacatt tttcaatagc gtgtgtagat agtatgcact 3061 atatacatca cgttaaagta ggactatcac acccagccca tgtggctaaa aaagctgaat 3121 cagacagtgg atgagacaca caacggcagt gaagaaccga tacacttggc attgacgtct 3181 agctatgctg tatctgtgct ttgcccacat gcccttggtg acagctgagc acccagctct 3241 gtcttggtag gtttgggcta aggaacaaat ctctcctttg ctcgtggtta gcaagataca 3301 ctcaagcatg aagataaaca cagctgcttt cttcttacac cccggtctca tgctccttaa 3361 tggcgccatg ggtgcttgtt gggccttttt ccagtaagga atgatattgc tgaagaatct 3421 acttaaccct gacaaatttt aattataatc tcttcttata cagataaaac atgactccta 3481 caaggcccca aggtttacat agtctgaagt gaagtacaga gctggcatct atctggtgat 3541 ttctagctct cgagataccc aagcagcctg atggggcagt tccccttctt acggttcacg 3601 ctctaaggca ggatgtggct tatgagatac tttgcattgt ctgtctgcac accttgaatc 3661 tgcctgctgg ctcccttact ttacctctct gtcatgtgca gatgaaggct cagggtgcta 3721 gaggattagt aagatctctt tctaaagaca ggagagatta tttacaagaa gaactcacca 3781 gggtttagtt tgcatttaag aattgccagt cttttgtcct gcatcatctt gaacattaat 3841 ccacatgttt cagagctcac caggcagtac caatgctctt ttcacagcta tgaagagcta 3901 gagaaattct tgttatggta gaaaaatttc acggttcatt tttgaaactg catttgtgcg 3961 tatgcagtgt agattttata gtgtgttgtg ctttcaagat ctaaatcata tataataaat 4021 taagggacaa tggggctgac agcactaaac ttggtgctta ttgatattct aagaaatatc 4081 tgtgaaatat catcacgtat gttatacaac cttcatttaa aaaggtttaa aactagttag 4141 attcactttg acacttttca tatcatttct taacccaagt gacgaaaaca ttgtccccaa 4201 tgaatatact cattagaatt accatttgtt aatatcactc attaattaac cccataatta 4261 gatccattaa tttaaatgat ttaaatttaa gtaagtttta taaggtctga catcagaggt 4321 atcttacttt cctctgagga tgatgtactt gccctgacca tgcattttac catcacacat 4381 gttcagaaag ggccaaattc ccaacctgct catttttttt tttatcagag tcatgatgaa 4441 tcagtcctag aatgtttcat ttgcacaagt agggctgcct ccaagaggaa cctctgattt 4501 attttgtatg aaatatatgt gaaaggatat gaatctgaga gatgctgtag acatctgtcc 4561 tacacttgag atgatttcca agcctctctg gcactttgag ttaagtctat ctggtattaa 4621 atgccaagga ccttttgctg cctaaatcca ctctgcagga aataggccca accaccagat 4681 gagaattagg ccctggatga gtagcgctat agttactgtc ctgttgatta atttctgcca 4741 tttcatgtcc ataaaagaga ccacccatat catgcacaca attagatttc tcacactcta 4801 actgtatatt tgtatgatat tttaaaatct cctaaatgct gggcaatggc tattaacaat 4861 taattgtctt gcactggcct tctgatgaaa tgttaacaat gcctattgta atatagaaaa 4921 aaacattcta tctactgatt tgggctgaat gtatgtaaat aggtttctaa aaagtcagat 4981 gtttgagcag tggcctacaa atcagtaatt ttcgggtggg agagtttctt tacattgccg 5041 tggcatctta aaagctatct tcatgtaaat tgactgtact aggcctactg gggatcagag 5101 ttcccaagaa aggaaacctt ttcttgtatc tggattcaaa tttatttcca atgtttcaag 5161 cgggaaacat gactctttat tgtctgtaaa tctaacatta ttacttttcc tcttagaaga 5221 atattgtatt gttagatgtt tgttgagctg gtaacatcgt tgcaaccact gcaatatctt 5281 cgttagtaat ctgtataata ctttgtatac aagtactggt aagattgtta ttaaatgtag 5341 cttcagtcat taaattacta tagcaaagta gtacttcttc tgtaatattt acaatgtatt 5401 aagcccacag tatattttat ttcaatgtaa ttaaactgtt aacttattca aagagaaaac 5461 atctcatcat gtctattgtc caaagttacc tggaatcaaa taaaaattct agattaccat 5521 gaagaacata aaatgccttt gaactctgcc ttatttcaca gtctgatggc aaaatactaa 5581 ggatttaatt tctaaaagat tgctgaacta atttattcct caaaaagcac taatgactac 5641 ttgaaaagtg gggacatatt ggatt // LOCUS AF007876 7894 bp DNA PRI 07-APR-1998 DEFINITION Homo sapiens Na,K-ATPase beta 2 subunit gene, complete cds. ACCESSION AF007876 NID g3025476 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7894) AUTHORS Avila,J., Alvarez de la Rosa,D., Gonzalez-Martinez,L.M., Lecuona,E. and Martin-Vasallo,P. TITLE Structure and expression of the human Na,K-ATPase beta 2-subunit gene JOURNAL Gene 208 (2), 221-227 (1998) MEDLINE 98201617 REFERENCE 2 (bases 1 to 7894) AUTHORS Avila,J., Alvarez de la Rosa,D., Gonzalez-Martinez,L.M., Lecuona,E. and Martin-Vasallo,P. TITLE Direct Submission JOURNAL Submitted (11-JUN-1997) Bioquemica y Biologia Molecular, Universidad de La Laguna, c/ Astrofesico F. Sinchez s/n, La Laguna, S/C de Tenerife 38206, Spain FEATURES Location/Qualifiers source 1..7894 /organism="Homo sapiens" /db_xref="taxon:9606" protein_bind 646..652 /bound_moiety="Sp1" TATA_signal 672..677 exon 766..1406 /number=1 mRNA join(766..1406,3437..3565,3903..4007,4114..4319, 4691..4747,5589..5687,5793..7811) /product="Na,K-ATPase beta 2 subunit" CDS join(1295..1406,3437..3565,3903..4007,4114..4319, 4691..4747,5589..5687,5793..5957) /codon_start=1 /product="Na,K-ATPase beta 2 subunit" /db_xref="PID:g3025477" /translation="MVIQKEKKSCGQVVEEWKEFVWNPRTHQFMGRTGTSWAFILLFY LVFYGFLTAMFTLTMWVMLQTVSDHTPKYQDRLATPGLMIRPKTENLDVIVNVSDTES WDQHVQKLNKFLEPYNDSIQAQKNDVCRPGRYYEQPDNGVLNYPKRACQFNRTQLGNC SGIGDSTHYGYSTGQPCVFIKMNRVINFYAGANQSMNVTCAGKRDEDAENLGNFVMFP ANGNIDLMYFPYYGKKFHVNYTQPLVAVKFLNVTPNVEVNVECRINAANIATDDERDK FAGRVAFKLRINKT" intron 1407..3436 /number=1 exon 3437..3565 /number=2 intron 3566..3902 /number=2 exon 3903..4007 /number=3 intron 4008..4113 /number=3 exon 4114..4319 /number=4 intron 4320..4690 /number=4 exon 4691..4747 /number=5 intron 4748..5588 /number=5 exon 5589..5687 /number=6 intron 5688..5792 /number=6 exon 5793..7811 /number=7 repeat_region complement(6726..7039) /rpt_family="Alu" polyA_signal 7783..7789 BASE COUNT 1564 a 2353 c 1975 g 2002 t ORIGIN 1 tcccgaccct agagtcccgt cacgacccct gacccttaca ccacaactct cccgaagtcc 61 ctctgcacta cccttctacc ccttcggaga catccacctg tctcggtcca ccacacctgt 121 ccccgacact ctaacctctt cctcctcaga cccctgacca tcgccgaggt tgtccccttg 181 caggagaccc ccacgggaaa taggtagaga cgccccggac aggatcgaag ggaggaccga 241 ggccgctcag ggccgaccag agaagcgaga gaaagaacta gagatcgatc tctagctctt 301 tattcctctg acattctgcc ccatctgctc ccggactctt ccgccgcatc tgtatcttag 361 tctctttcta actccctgcc tggggccccc acctttgagc atatatcggc ttctctctct 421 gccgttgtgt ttctctgggg ttatctttcc ctctactccg cccggacacg ccctctatct 481 cttcctccca ccctgtccca gtctagtacc tggggtgggg gtaatggaga gacagctagg 541 tcctaagaga agtcaggggg gctgggccaa cctctccata tttacatatg tatgagggtc 601 gcctgggcca gtggcgagga ggcggacgtt ctgggggtgg gaagggggcg ggcaccccca 661 gagccgcaga gtataaagac cgcgctcggc gaccgcgggc cccgactgct gaggagcgga 721 cgctccgcct ggggggcccc ccatccctgg ctgtccccca gctgcgcgtc cccgccccac 781 ccccgcggct gagccaccac cggtgcagtg gtctccgctt ggcggagcga gccttgagct 841 tcgttccaca gcttctttgc atcttggatt tcggggcggc cccctccccc acctctctct 901 gcctttttgt accccgcttt ttttctgcgt tctgctcggt ttttgtagcc gtctgttttt 961 gcaccccatt tcgttttgtt tctagacggt ttggcggggg gtgaagctgc attcataccc 1021 cttcctcttg ttattctccc ctgctctgac agcacccctt ttcatcgcag ttggggggcc 1081 taggatcggt gcatcttccg ccgcgctgcc agcaccccgc agcgcgtggt cgtgcacccc 1141 ggaatctgca gcagctgcat atctgagggg ggtctccttt gcccgcgccg ccttcgctcc 1201 ccgtgctttt gggtgtgtgg agggcttcag tcgcggcgcc cccgcttctc cgcaaccccc 1261 cgccccgcgc ccggactcgc cccgcgccac caagatggtc atccagaaag agaagaagag 1321 ctgcgggcag gtggttgagg agtggaagga gttcgtgtgg aacccgagga cgcaccagtt 1381 tatgggccgc accgggacca gctggggtac gcagggccgg cacgcaaggg gcgggggaaa 1441 gccgcggggc gacgcctcgg gggcgcaggg tcccgccgac gcgccccagc tcccctcccg 1501 ggtcccggcg tccagctccc tgccgggctc tgggctggga gggggccgaa tcgccagtct 1561 aactccccgg ctggccgtgc ggaggcggag aaagtaggtc acagccgcct tccgcccccc 1621 gcggagcccc ctcgggcggc gcgggtcgcc agctccgcct gcgtgccgcg ccgcgcgctc 1681 acactcccct ctcggggcct gtcgctccac acgggcgtcc cccacctcca aagagcgccc 1741 cttccctccc tccggctctc actagctccg cagccccgtc tatttttagc tcgtgcccac 1801 cccctggacc ctgggaacgt tcatgagggg gcgggtcttg gggggtgtgt taggggggtt 1861 cttcacggcg gaagttgtct gtatcccacc gcctggcctt gggagccttc tgggactgct 1921 ttgtgggttg gggggctgct gatagtatga gttttaccga ggctgcaggt tttagcctcc 1981 catgtcggtg acggagggag gagtggtcgc tgtggtgatt tgtgtgcatc agccagccag 2041 gtgtctgtga cagtcggatg acttggaagc ctccccaggc tgaccatggc aggactcagg 2101 gagctgtagt ggtcgggggt tgggggggtg gagggggtct ggtgaccggc acaggtgcag 2161 gtgaggggtg gaattcactt tacattttcc ccatagaaac aaagttataa atagtgactg 2221 catactgcac ctaaaatgcc acgtatctac aacgaaatta ttacaatgta tttataatat 2281 attatttatt ctaaacatgt atgcatttga aattatccaa tgaaatctaa cctattgatg 2341 tacctatttc tacattaata cattatatca tatattaata tgctaataat tatatcacat 2401 taatatatta ataatatgtt aattaaccaa acgaaaagta tatactgatc acaacacttg 2461 aattcttcca agagggatgg tcaagctggg acgttgagac acaggggaca gaggacactg 2521 tgtgacacga tttacaatct ttccacactg ggcaccgtcc ccatcagtcc acccattcgg 2581 ggcctacacg aagtgggtcc catgcaatcc attccctcag ggaactcaaa ctccagcccc 2641 tgggatgaga agaatccagc aatgcttggg agagccagag gacttcatgg aagaagtgtc 2701 ctctgagatg gaaggattgg gagtccaggg tggtgggaac agccggccct tgggtcctta 2761 cttcaggcgg gggagccatg gagagatccc accaagggaa ggctgtggag attctgcctt 2821 tcctccctgc ctctgcccag ggtgctgggt gtgaactgag ggtggggtga ctgttgaagg 2881 ttctaacaag ccgtctctga gagatttgta gctaggctag tgttaggtct ttcatttcag 2941 gaactgtgtt caaagtttgg cttctgaagg gcaccaggag agagatgttg ctattcaaat 3001 ctgagggtcc agtctctgcg gggtggtatg agggtttgct tgtgaatggt ggccagtacc 3061 cgctttaaaa ggcaccatgc tagcacagct ttaagcatga gtacgaatgc agaggtaaca 3121 gatgtgtgcc ttgtcaggac tatgcatggt tgagaagttg gaaatgtaat tggaggcaaa 3181 ataacagacc tccacaaggt cgggcttcac tgtgccctag gaccaggagg gggctgggag 3241 tcatggctag aagccagaca caactgcctg tttccagttt gtctcatttt gcctccagag 3301 gaaggctcta agacatccct gtggctctgt gatcagtccc agtgcagaac ttcagagtgg 3361 gtagaggggt gtgtggggat agttgaggtt atggtgggaa ccttgggccc tgctgaccct 3421 gtttcctcct ccctagcctt tatcctcctc ttctacctcg ttttttatgg gttcctcacc 3481 gccatgttca ccctcaccat gtgggtgatg ctgcagactg tctccgacca tacccccaag 3541 taccaggacc gactggccac accgggtgag tgtggaggct ccccctgcca gctactctaa 3601 ctgctcttgt gcccccaaac ctccagaagg aactcatagt tccttccagg agtttgattt 3661 tgatgaccca atccccacgt gcttggaagt tcttgaaatc tgtccacctt cccatttact 3721 gcagttggga gctgtgtgat ttgggcatgt ggcagatagc cacaggagat caccctccca 3781 tgaagacgat ctcagatatt caccgctgtc cccactctgg tgttccctgt gtccattttc 3841 ttccctctgt gtgcagtccc tcatcttata gatcccccaa cttctgcctt tgttggctgt 3901 aggcttgatg attcgcccca agactgagaa ccttgatgtc attgtcaatg tcagtgacac 3961 tgaaagctgg gaccagcatg ttcagaagct caacaagttc ttggagcgtg agtgtgggcc 4021 tggttatgtg tcagttcaag acttcgggca ggggactggg gaccttggaa gtggaacatc 4081 tggcccctga gtctctccct cccacctctt tagcttacaa cgactctatc caagcccaaa 4141 agaatgatgt ctgccgccct gggcgctatt acgaacagcc agataatgga gtcctcaact 4201 accccaaacg tgcctgccaa ttcaaccgga cccagctggg caactgctcc ggcattgggg 4261 actccaccca ctatggttac agcactgggc agccctgtgt cttcatcaag atgaaccggg 4321 tatctatgac cttggtcccc agggtgaatg gaggaaggat ctggggacac cacctgcaga 4381 caattgcatc ctttcactgg ggctaatggg catgagaaag acttggatgt ttgtgtagct 4441 gagagaaaaa gagaggtggg actcttggag gctaagctct gcagttagaa ggctggatgc 4501 cttttgtttt ttttttagac agggtcttgc tatgttgccc aggctggcct tgaactcctg 4561 gaattaagct attcctcctg cctcaacctc cctagtagtt gactacagtc cccggcttag 4621 cttgggtctg gatgcccatc ttcgacaact tcttcctctg actctcttca ccttccaccc 4681 tcaactccag gtcatcaact tctatgcagg agcaaaccag agcatgaatg ttacctgtgc 4741 tgggaaggtg agttcgttgg gccttgtctg cctgctcacc tgagtggctc acctgagtgt 4801 ccttcatggt ttctgtgtaa ttcacctgtc tctccctatc ttctttgctc ctagaggccc 4861 catcaccata gaaacaaggg ggtaagagtg ggcttttggg ctccactgta gcttgaactc 4921 cgagggcccc gcactcctct tgcttctctc tgggatgcag aggcctgctc tcctaggggc 4981 cagacacacg ccctcctcca ccaacgccct ggcctctggc ttctctccct aagcttccac 5041 cttctccttc attcccgagt tgtccgtatc gttcgctctc cctcccatat ggcccccacc 5101 cgtcagttcc ttctaggtgt ctggtagctg ctgatctgtt agcaccatct gccaccagtt 5161 cgttgtcctt gggaccctgt tccccgcttc ccccccggtc tgtcctttct agaaactggc 5221 tgctccctcc acatcccctt ccttgcttcc tattcaaccc ttaatcatgt atctcttctt 5281 tcttggctct gctccagaaa ctgattcctg aggatggtgt aagaacttgg ggtaggagtg 5341 gagtaggagg ctttcgtgcc attagcagcc ttcaggagtt cctagaatga tgacagggac 5401 agaccatccc ctcatctaca cacgcacatg cagacacaca cacacagact cacagctcca 5461 gggaggtaga cttgagacag gaataattgg ggcgaaggaa gaagaaaaga acaaatggaa 5521 gtctggtgag ctcctgggtg cctgccatcc ctaactggct aaccccctat cttcctgcac 5581 ccccacagcg agatgaagat gctgagaatc tcggcaactt cgtcatgttc cccgccaacg 5641 gcaacatcga cctcatgtac ttcccctact atggcaaaaa gttccacgta agtcccaggg 5701 gaggcccagg ctggatggcg ggtgcgggtg gtgagctagg gaaggaggcc gcgttgcccc 5761 aggcctagac cctgcactgc tctccggccc aggtgaacta cacacagccc ctggtggctg 5821 tgaagttcct gaatgtgacc cccaacgtgg aggtgaatgt agaatgtcgc atcaacgccg 5881 ccaacatcgc cacagacgat gagcgagaca agttcgccgg ccgcgtggcc ttcaaactcc 5941 gcatcaacaa aacctgaggc cccttcctcc caccccatct ctctcctgtg gatgctcctg 6001 gaatgtccct gaccctgcct gatccctccc tcacccaccc caaaggtatt tttgataaca 6061 gagctatgac ttgtctgagc ctcacatcct tttccttgac ttctcaaccc agcctgaagt 6121 ccattgcggt tccgtcactc gcctttccca ccaacttctc ccaacctcag atcagtcaga 6181 cagggagctg ggctaagatg gccacagagg agttaggagc ctttctagtt ctggtttagc 6241 tgtgagagct atccactctc ctgcctgcat atcccctgag agttatagga agtgcccact 6301 gacccaccca cccacctaca ccccccgcca cacacacaca caaacgtgca cacgcgtctc 6361 atttgacccc tttgcttcca gagatgaatg tggcactccc tccttccatt cctaagctct 6421 agccaccgtc ccttgatctc tcatactttc tccctgtcta cacagtcgcc atcttggtga 6481 ctttgaattt atctggctcc tgggaggtct tctcctcctc tccatcccta ttccctcctc 6541 tgaaagcacc ccttgtaatt gaggacaagg tggttctgtg gccttttccc tctttgctgg 6601 cacgttctgc ttctcaccct ctggtgactc tgtgagctgg gaaatgaggg actggaagtg 6661 aggcctgtgt tgacccttcc tgaaaatcct ctagcagccc ccgacttcag cagtttcttt 6721 ctttgttttt ttgagatgga gtttcgctct tgttgcccag gctggagtgc aatggtgcaa 6781 tctcagctca ctgcaacttc cgcatcccag gttcaagcga ttctcccgcc tcaggttccc 6841 gagtagctgg gactacaggc atgtgccacc atgcccggct taatttcttt cttttttttt 6901 tttttttttg cattttttag tagagatggg ggtttctcct tgttggtcag gctggtctcg 6961 aactcccgac ctcaggtgat ccacctgcct cggcctccca aagtgttggg attacaggcg 7021 tgagccaccg cgcccggcct tcagtttctt cctaggccgt tctgtcaccc aaatagctgc 7081 tacccagagg ggcggggttg acctaggctg aatatccact ttgtttttat ggatggctcc 7141 cttcccccat tcgccttccc agaatatcct tcaagttcca cttcccaggg agctctgggg 7201 gaggggcggc cattctggct ccgtccccag tggccacctt ggaaacatcg gctggctttg 7261 ggactattcc acctccttcc cctgagccca gatctgcccc caccatcctt tctctggctt 7321 cttttagcaa gttatcaact aatcactaac tccttccttt tcctctgcat gccagcctga 7381 aaattccaaa tctagcctct gaatgtcttg gctccatctc ttcagacccc tttgccttta 7441 aaaaaaaaac aaaaacaaaa acaaaaaaac ccataatgcc catagaatgt caaatgaggg 7501 gcctcctgcc tcctgctctg aatattctgt agctgtagag gcattttaac cctttgtcct 7561 ccagcatccc ttcacgtcct catcctctct aacctccttt ttcttttttt aatgctgcag 7621 cctccacact ccacccacag gtgaccctta cctttttctc tagctggatc tgtgtttctt 7681 cccttcgggc ccccatgttt tcctgcaccc gccctaccat ggtctctctc tgcagttatt 7741 taatgcctgt gtcagatcta ctgtaaaaag aggattaagt aaaataaaat gagagcaatt 7801 atatatataa atatatatca tacacagagc cctgtgtgtg ggttgttcct ttctgagcag 7861 ttggaagagc caggaaggag tggagggtgg atct // LOCUS HUMNT3A 1029 bp DNA PRI 07-MAR-1995 DEFINITION Human neurotrophin-3 gene, complete cds, from 1.8 kb HindIII fragment. ACCESSION M61180 NID g189302 KEYWORDS neurotrophic factor; neurotrophin-3. SOURCE Human DNA, clone phi-hN(G1). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1029) AUTHORS Maisonpierre,P.C., Le Beau,M.M., Espinosa,R. III., Ip,N.Y., Belluscio,L., de la Monte,S.M., Squinto,S., Furth,M.E. and Yancopoulos,G.D. TITLE Human and rat brain-derived neurotrophic factor and neurotrophin-3: gene structures, distributions, and chromosomal localizations JOURNAL Genomics 10 (3), 558-568 (1991) MEDLINE 91365361 FEATURES Location/Qualifiers source 1..1029 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="phi-hN3(G1)" /map="12p13" gene 116..889 /gene="NTF3" CDS 116..889 /gene="NTF3" /codon_start=1 /db_xref="GDB:G00-125-917" /product="neurotrophin-3" /db_xref="PID:g189303" /translation="MSILFYVIFLAYLRGIQGNNMDQRSLPEDSLNSLIIKLIQADIL KNKLSKQMVDVKENYQSTLPKAEAPREPERGGPAKSAFQPVIAMDTELLRQQRRYNSP RVLLSDSTPLEPPPLYLMEDYVGSPVVANRTSRRKRYAEHKSHRGEYSVCDSESLWVT DKSSAIDIRGHQVTVLGEIKTGNSPVKQYFYETRCKEARPVKNGCRGIDDKHWNSQCK TSQTYVRALTSENNKLVGWRWIRIDTSCVCALSRKIGRT" mat_peptide 530..886 /gene="NTF3" /note="G00-125-917" /product="brain-derived neurotrophic factor" BASE COUNT 289 a 248 c 251 g 241 t ORIGIN Chromosome 12p13. 1 cctcacaggg ctactcagcc tcaggtagct ggtgccagaa taacacagac tcagctgcca 61 gagcctgctc ttaacacctg tgtttccttt tcagatctta caggtgaaca aggtgatgtc 121 catcttgttt tatgtgatat ttctcgctta tctccgtggc atccaaggta acaacatgga 181 tcaaaggagt ttgccagaag actcgctcaa ttccctcatt attaagctga tccaggcaga 241 tattttgaaa aacaagctct ccaagcagat ggtggacgtt aaggaaaatt accagagcac 301 cctgcccaaa gctgaggctc cccgagagcc ggagcgggga gggcccgcca agtcagcatt 361 ccagccagtg attgcaatgg acaccgaact gctgcgacaa cagagacgct acaactcacc 421 gcgggtcctg ctgagcgaca gcaccccctt ggagcccccg cccttgtatc tcatggagga 481 ttacgtgggc agccccgtgg tggcgaacag aacatcacgg cggaaacggt acgcggagca 541 taagagtcac cgaggggagt actcggtatg tgacagtgag agtctgtggg tgaccgacaa 601 gtcatcggcc atcgacattc ggggacacca ggtcacggtg ctgggggaga tcaaaacggg 661 caactctccc gtcaaacaat atttttatga aacgcgatgt aaggaagcca ggccggtcaa 721 aaacggttgc aggggtattg atgataaaca ctggaactct cagtgcaaaa catcccaaac 781 ctacgtccga gcactgactt cagagaacaa taaactcgtg ggctggcggt ggatacggat 841 agacacgtcc tgtgtgtgtg ccttgtcgag aaaaatcgga agaacatgaa ttggcatctc 901 tccccatata taaattatta ctttaaatta tatgatatgc atgtagcata taaatgttta 961 tattgttttt atatattata agttgacctt tatttattaa acttcagcaa ccctacagta 1021 tataagctt // LOCUS HSCKBG 4200 bp DNA PRI 24-APR-1993 DEFINITION Human gene for creatine kinase B (EC 2.7.3.2). ACCESSION X15334 NID g29962 KEYWORDS creatine kinase; creatine kinase B. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4200) AUTHORS Mariman,E.C.M. TITLE Direct Submission JOURNAL Submitted (23-MAY-1989) Mariman E.C.M., University of Nijmegen, Dept of Human Genetics, Geert Grooteplein Z18, 6525 GA Nijmegen, Netherlands REFERENCE 2 (bases 1 to 4200) AUTHORS Mariman,E.C., Schepens,J.T. and Wieringa,B. TITLE Complete nucleotide sequence of the human creatine kinase B gene JOURNAL Nucleic Acids Res. 17 (15), 6385 (1989) MEDLINE 89366665 REFERENCE 3 (bases 1 to 4200) AUTHORS Mariman,E. and Wieringa,B. TITLE Expression of the gene encoding human brain creatine kinase depends on sequences immediately following the transcription start point JOURNAL Gene 102 (2), 205-212 (1991) MEDLINE 91340154 COMMENT The sequence overlaps with that reported by Mariman et. al. in Genomics 1:126-137(1987)J03036 and by Daouk et. al. in J. Biol. Chem. 263:2442-2446(1988)J03531. Data kindly reviewed (13-NOV-1989) by Mariman E.C.M. FEATURES Location/Qualifiers source 1..4200 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="CK3-1" /tissue_type="brain" /clone_lib="EMBL3" /map="chromosome 14q32.3" CAAT_signal 724..728 TATA_signal 742..749 CAAT_signal 754..758 exon 808..875 /number=1 mRNA join(808..875,1136..1340,1464..1618,1691..1823,2208..2379, 3050..3173,3331..3520,3600..4075) intron 876..1135 /number=1 exon 1136..1340 /number=2 CDS join(1148..1340,1464..1618,1691..1823,2208..2379, 3050..3173,3331..3520,3600..3778) /EC_number="2.7.3.2" /codon_start=1 /product="creatine kinase B" /db_xref="PID:g29963" /db_xref="SWISS-PROT:P12277" /translation="MPFSNSHNALKLRFPAEDEFPDLSAHNNHMAKVLTPELYAELRA KSTPSGFTLDDVIQTGVDNPGHPYIMTVGCVAGDEESYEVFKDLFDPIIEDRHGGYKP SDEHKTDLNPDNLQGGDDLDPNYVLSSRVRTGRSIRGFCLPPHCSRGERRAIEKLAVE ALSSLDGDLAGRYYALKSMTEAEQQQLIDDHFLFDKPVSPLLLASGMARDWPDARGIW HNDNKTFLVWVNEEDHLRVISMQKGGNMKEVFTRFCTGLTQIETLFKSKDYEFMWNPH LGYILTCPSNLGTGLRAGVHIKLPNLGKHEKFSEVLKRLRLQKRGTGGVDTAAVGGVF DVSNADRLGFSEVELVQMVVDGVKLLIEMEQRLEQGQAIDDLMPAQK" intron 1341..1463 /number=2 exon 1464..1618 /number=3 intron 1619..1690 /number=3 exon 1691..1823 /number=4 intron 1824..2207 /number=4 exon 2208..2379 /number=5 intron 2380..3049 /number=5 exon 3050..3173 /number=6 intron 3174..3330 /number=6 exon 3331..3520 /number=7 intron 3521..3599 /number=7 exon 3600..4075 /number=8 polyA_signal 3952..3957 BASE COUNT 586 a 1443 c 1477 g 694 t ORIGIN 1 gatcagtttt tttttttaat cgcacttatg cttattgttt attagcgttt cctcccatct 61 ttgcctgaag tctccgggga ctgcctttgg gggtcgggta aacttgtccc ctgcgaagag 121 ggcccagggt tggggtctgg aaactccgag gctgcacttg ccagcggcct cttaaggcca 181 cagcgtcccc gtggtttctg gctcgcagcc ccccgagacc caggacttgt ccaaggtcag 241 ggcaccgcgg gtgcccccgg gctgggccgc agcagactgc gcttcccgcg cgccttcgct 301 ttgcaccagg atcgcccagg aaatgcctgc gggcaccttg aggaaggtcg gcggctccgg 361 gccagctcgc actggccggg gtggggcggg ggccgtacct gctgcggaag ccccgaaagc 421 tttcgcccgg cccctcgccg ccgccgcggg ggctggctgg actaggcggg caggctcgag 481 gatgcggatg aacccaagcg tcctcgagtg cccggaggct ctccgcctca gtttcccgcc 541 cagaggcaag ggcgtgcgag gggatccaga tatccaagga cctgaggttt cggcctcgag 601 gtcttgggcg ggggactggg caggctgcgc ggggtcccag cgaggggaca gctcgggtgg 661 gcggccaggg tgttgggggc tgcgggcggc ggacaaagcg gcggcaccac cccgcggcgc 721 gggccaatgg aatgaatggg ctataaatag ccgccaatgg gcggcccgcg ttgtgcccct 781 taagagccgc gggagcgcgg agcggccgct gttcgcctgc gtcgctccgg gagctgccga 841 cggacggagc gcccccgccc ccgcccggcc gcccggtgag tgggcccggg ggccgggggc 901 gtccgcgccc gggctagggg cgctgcgagc aaagggggcg cgtcgcctgg agcgcgcgcc 961 ggaccggccg ggggtccccg gcgatgatgg cgctccccgc gcgcgctgcg gaccccgctg 1021 accttggccg cgtcccgggg ggcgccgggg ggcccggcgg cgggggcctg agtggtacgc 1081 gggagcccgg gaaccccggc gtgccggtcc cctctgaccc cgcgtctccc cgcagcccgc 1141 cgccgccatg cccttctcca acagccacaa cgcactgaag ctgcgcttcc cggccgagga 1201 cgagttcccc gacctgagcg cccacaacaa ccacatggcc aaggtgctga cccccgagct 1261 gtacgcggag ctgcgcgcca agagcacgcc gagcggcttc acgctggacg acgtcatcca 1321 gacaggcgtg gacaacccgg gtacgcgacc cctcggggcc ggggtcccgg ccccccctcc 1381 ccccgcgcag ccgcagggtc ctcagcagcg cgctcgggcc cggcagtgac gtcactgtcc 1441 ccgtcccgcg ccccctcccc caggccaccc gtacatcatg accgtgggct gcgtggcggg 1501 cgacgaggag tcctacgaag tgttcaagga tctcttcgac cccatcatcg aggaccggca 1561 cggcggctac aagcccagcg atgagcacaa gaccgacctc aaccccgaca acctgcaggt 1621 gcggggctgc gggcgggccg ggcgggcggg gccggggtct tcgggcgctc actcccgtct 1681 cgcctcccag ggcggcgacg acctggaccc caactacgtg ctgagctcgc gggtgcgcac 1741 gggccgcagc atccgtggct tctgcctccc cccgcactgc agccgcgggg agcgccgagc 1801 catcgagaag ctcgcggtgg aaggtagggg ccgggcgggc cgaggggcgg cggcggccgc 1861 gtccccctcc cggcgcggtc cccgcccgct tttgtttacg tcgcccggga gcggcagccg 1921 ccgtcgcgct cttatctgcg cgcgcccggg ttcagtttcc cggacccacc gagggacgga 1981 ggcccagccc ccgcgcccac agcggcctgg ggcccaggga gggcgggtcc tggcgcgggg 2041 tcaccgcctg ggaccgtcgc ccgggccgtg aggactggac gcccgcagat ccgggcgggt 2101 ggggccctct gacgtccccc gaggtggggc acgggggcgg gcgggtccgc gctgcgggct 2161 ggaggggcgg gcgcgggagc ccagcgtcct gagcgcaccc ctcgcagccc tgtccagcct 2221 ggacggcgac ctggcgggcc gatactacgc gctcaagagc atgacggagg cggagcagca 2281 gcagctcatc gacgaccact tcctcttcga caagcccgtg tcgcccctgc tgctggcctc 2341 gggcatggcc cgcgactggc ccgacgcccg cggtatctgg tgcgtgtccc tctgcgccct 2401 ctcgcggcgt cctccctccc cgctacctcc gctttccctc tcgcccccct cgcgggggtg 2461 gggcccctcg cggcgaggag gaggaggagg aggagggagg ggccggccgc gctccgggtc 2521 tgggttccgt gccgcgcctc ctcctgcgcc ggtgaccttg gccgagcagg tgcgttaagg 2581 gactgggccc cggcccgtgg gggctcagga ctcagcaaca cctccccacc ccgagacgtg 2641 aggtgggggc ggggctctct ggcgcctctc cccgacggcc ctgggagctg gagctctttg 2701 ttttcttttc tcactcctcc gccgctggga ttctaccagg ggctggtgac gccaaagctt 2761 ctccaggggc agggctccta cccccactgt ggggggcggg tcgggctgtc ctggcggtcc 2821 ctggccccgc cccacctcgg gccacagcgc atgatggcag ctggggttct cctgctgtga 2881 ggcgtcccgg ttcccccgcc cgccccgtgt tggcgggtgg agtcttggca gcagcctcca 2941 ctcctgggca tggcagggag cagcacctca gggacttggg aagttccttt ggtctggggg 3001 cggcctgggg cttttttctg ggtatgccct gagaccagcc ctcccgcagg cacaatgaca 3061 ataagacctt cctggtgtgg gtcaacgagg aggaccacct gcgggtcatc tccatgcaga 3121 aggggggcaa catgaaggag gtgttcaccc gcttctgcac cggcctcacc caggtgccag 3181 ggacggggca ggcccagacc ccagggcccc agcagggatg tgggtgcccc agcatcagtc 3241 cccccggggg atttccggca ctggggagtc tcagggcctg taggggtttc aggcaggcct 3301 tctccctcat accctcttct ccgtctgcag attgaaactc tcttcaagtc taaggactat 3361 gagttcatgt ggaaccctca cctgggctac atcctcacct gcccatccaa cctgggcacc 3421 gggctgcggg caggtgtgca tatcaagctg cccaacctgg gcaagcatga gaagttctcg 3481 gaggtgctta agcggctgcg acttcagaag cgaggcacag gtgagcaggg caggtgctgc 3541 ggcttcccgt ggcctttggg cagccctgtt tcctccgccc tgacttgctg tctccccagg 3601 cggtgtggac acggctgcgg tgggcggggt cttcgacgtc tccaacgctg accgcctggg 3661 cttctcagag gtggagctgg tgcagatggt ggtggacgga gtgaagctgc tcatcgagat 3721 ggaacagcgg ctggagcagg gccaggccat cgacgacctc atgcctgccc agaaatgaag 3781 cccggcccac acccgacacc agccctgctg cttcctaact tattgcctgg gcagtgccca 3841 ccatgcaccc ctgatgttcg ccgtctggcg agcccttagc cttgctgtag agacttccgt 3901 cacccttggt agagtttatt tttttgatgg ctaagatact gctgatgctg aaataaacta 3961 gggttttggc ctgcctgcgt ctgagtggtg cctctccttt cccagggggg agggggaagg 4021 gcagcagcca ggccccagga gtcttgagtc ctgggcctgc tgtgggcctc gccttctgtg 4081 agatgggaca agagccagga ggtggccact ctgttctgcc tgccctacct agtccatggg 4141 ccccttccct cgtgtctatc gggctgtgca ggcaggaaca tgggagagag cgagggagga // LOCUS HUMACHRM4 2595 bp DNA PRI 30-OCT-1994 DEFINITION Human m4 muscarinic acetylcholine receptor gene. ACCESSION M16405 NID g177991 KEYWORDS acetylcholine receptor; muscarinic acetylcholine receptor; neurotransmitter. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2595) AUTHORS Bonner,T.I., Buckley,N.J., Young,A.C. and Brann,M.R. TITLE Identification of a family of muscarinic acetylcholine receptor genes [published erratum appears in Science 1987 Sep 25;237(4822):237] JOURNAL Science 237 (4814), 527-532 (1987) MEDLINE 87263421 COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by T.I.Bonner, 17-JUL-1987. FEATURES Location/Qualifiers source 1..2595 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11p12-p11.2" intron <1..771 /note="ACHR-m4 intron A" prim_transcript <1..2595 /note="ACHR-m4 pre-mRNA" gene 801..2237 /gene="CHRM4" CDS 801..2237 /gene="CHRM4" /note="muscarinic acetylcholine receptor m4" /codon_start=1 /db_xref="GDB:G00-125-216" /db_xref="PID:g177992" /translation="MANFTPVNGSSGNQSVRLVTSSSHNRYETVEMVFIATVTGSLSL VTVVGNILVMLSIKVNRQLQTVNNYFLFSLACADLIIGAFSMNLYTVYIIKGYWPLGA VVCDLWLALDYVVSNASVMNLLIISFDRYFCVTKPLTYPARRTTKMAGLMIAAAWVLS FVLWAPAILFWQFVVGKRTVPDNHCFIQFLSNPAVTFGTAIAAFYLPVVIMTVLYIHI SLASRSRVHKHRPEGPKEKKAKTLAFLKSPLMKQSVKKPRPGGRPGGLRNGKLEEAPP PALPPPPRPVADKDTSNESSSGSATQNTKERPATELSTTEATTPAMPAPPLQPRALNP ASRWSKIQIVTKQTGNECVTAIEIVPATPAGMRPAANVARKFASIARNQVRKKRQMAA RERKVTRTIFAILLAFILTWTPYNVMVLVNTFCQSCIPDTVWSIGYWLCYVNSTINPA CYALCNATFKKTFRHLLLCQYRNIGTAR" BASE COUNT 528 a 839 c 674 g 552 t 2 others ORIGIN 1 bp upstream of XbaI site. 1 tctagaccac cagcctggac aacataccaa gaccctgtct ctacaaataa atagataaat 61 aaatagacac tttttttaag tgtcaaaagt gcttggcact tagtagacca tcagtgttag 121 gtgctcatac ataccccgat tattgccttg tcccagtgtc ttgtacaggg gttggagagn 181 aggtgttaag aaatgaccga atgggtaaat ggatgaacag aacacctccc tccagagccc 241 acatgctcgt gggcctctgg gaccactctc ctcctcctct tgcttccctg agctccccca 301 gcatggcctc tgtccaggcc ttgcgctgcc tccaggcctt tgctgtggct actgcccctg 361 gagcgccatn tccacagctc ctcctgtggc tggctcctca tcacccagat gacctggtgg 421 gtgaggccac ctagcaagga gtcatgcctg tcctgccttc tgactcactc tctcatcacc 481 ctgccttttt tttcttttgt ggctcacgtg tttgcatgtc tccccccatg aggcaggggg 541 ccatgtgtgt cttattcact tctgtagcca cagcaccctg agcaatgctt gccacatagt 601 aggtgctcaa ttaatgttga atgaatgggc aaaatgcggg atggcgggac agagttctct 661 caaggcattc tgccagagaa tgtccctctg tcaccttgaa tccagtgtac ctccagatga 721 ctcccccatt ccctcctgta gttcatgctt ttctctcccc ttcctcccca gacacggcct 781 acccacccct ggcaaccaac atggccaact tcacacctgt caatggcagc tcgggcaatc 841 agtccgtgcg cctggtcacg tcatcatccc acaatcgcta tgagacggtg gaaatggtct 901 tcattgccac agtgacaggc tccctgagcc tggtgactgt cgtgggcaac atcctggtga 961 tgctgtccat caaggtcaac aggcagctgc agacagtcaa caactacttc ctcttcagcc 1021 tggcgtgtgc tgatctcatc ataggcgcct tctccatgaa cctctacacc gtgtacatca 1081 tcaagggcta ctggcccctg ggcgccgtgg tctgcgacct gtggctggcc ctggactacg 1141 tggtgagcaa cgcctccgtc atgaaccttc tcatcatcag ctttgaccgc tacttctgcg 1201 tcaccaagcc tctcacctac cctgcccggc gcaccaccaa gatggcaggc ctcatgattg 1261 ctgctgcctg ggtactgtcc ttcgtgctct gggcgcctgc catcttgttc tggcagtttg 1321 tggtgggtaa gcggacggtg cccgacaacc actgcttcat ccagttcctg tccaacccag 1381 cagtgacctt tggcacagcc attgctgcct tctacctgcc tgtggtcatc atgacggtgc 1441 tgtacatcca catctccctg gccagtcgca gccgagtcca caagcaccgg cccgagggcc 1501 cgaaggagaa gaaagccaag acgctggcct tcctcaagag cccactaatg aagcagagcg 1561 tcaagaagcc ccgcccggga ggccgcccgg gaggactgcg caatggcaag ctggaggagg 1621 cccccccgcc agcgctgcca ccgccaccgc gccccgtggc tgataaggac acttccaatg 1681 agtccagctc aggcagtgcc acccagaaca ccaaggaacg cccagccaca gagctgtcca 1741 ccacagaggc caccactccc gccatgcccg cccctcccct gcagccgcgg gccctcaacc 1801 cagcctccag atggtccaag atccagattg tgacgaagca gacaggcaat gagtgtgtga 1861 cagccattga gattgtgcct gccacgccgg ctggcatgcg ccctgcggcc aacgtggccc 1921 gcaagttcgc cagcatcgct cgcaaccagg tgcgcaagaa gcggcagatg gcggcccggg 1981 agcgcaaagt gacacgaacg atctttgcca ttctgctagc cttcatcctc acctggacgc 2041 cctacaacgt catggtcctg gtgaacacct tctgccagag ctgcatccct gacacggtgt 2101 ggtccattgg ctactggctc tgctacgtca acagcaccat caaccctgcc tgctatgctc 2161 tgtgcaacgc cacctttaaa aagaccttcc ggcacctgct gctgtgccag tatcggaaca 2221 tcggcactgc caggtaggca ggcaggagtg ccctaggagg tgcggtgtgc gtgcgtgtgc 2281 tgggggacca cacggctcac ttgctgtggg gaagagtgca ggcaccattc tgcgttcacg 2341 tttgctgagg aggaagttca gaagaggctc tgtggctgca ttcagagacc agatctctgc 2401 tcacccgtga ggaggctcac cccagggagt gtctgaactg gggctgcctg gcccacctct 2461 gtggccctgc ttcagcgagc tgcggggcac tggcctgggt gggcacctgc ccactgtgac 2521 caaccatcag cagtgctgga agaatggaga tctggatggg ggccgaagcc cagggccccc 2581 tcaggaagaa caaag // LOCUS HUMMHHSP2 2876 bp DNA PRI 07-MAR-1995 DEFINITION Human MHC class III HSP70-2 gene (HLA), complete cds. ACCESSION M59830 M34269 NID g188489 KEYWORDS class III gene; complement system protein; heat shock-induced protein; major histocompatibility complex. SOURCE Human DNA, clone I81. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2876) AUTHORS Milner,C.M. and Campbell,R.D. TITLE Structure and expression of the three MHC-linked HSP70 genes JOURNAL Immunogenetics 32 (4), 242-251 (1990) MEDLINE 91055806 FEATURES Location/Qualifiers source 1..2876 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="I81" /haplotype="HLA:A2,B7,C2C,Bfs,C4A3,C4BQ0,DR2" misc_feature 67..92 /gene="HSP70-2" gene 67..2411 /gene="HSP70-2" misc_feature 168..175 /gene="HSP70-2" CDS 486..2411 /gene="HSP70-2" /codon_start=1 /product="heat shock-induced protein" /db_xref="PID:g188490" /translation="MAKAAAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAF TDTERLIGDAAKNQVALNPQNTVFDAKRLIGRKFGDPVVQSDMKHWPFQVINDGDKPK VQVSYKGETKAFYPEEISSMVLTKMKEIAEAYLGYPVTNAVITVPAYFNDSQRQATKD AGVIAGLNVLRIINEPTAAAIAYGLDRTGKGERNVLIFDLGGGTFDVSILTIDDGIFE VKATAGDTHLGGEDFDNRLVNHFVEEFKRKHKKDISQNKRAVRRLRTACERAKRTLSS STQASLEIDSLFEGIDFYTSITRARFEELCSDLFRSTLEPVEKALRDAKLDKAQIHDL VLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDKSENVQDLL LLDVAPLSLGLETAGGVMTALIKRNSTIPTKQTQIFTTYSDNQPGVLIQVYEGERAMT KDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKANKITITNDKG RLSKEEIERMVQEAEKYKAEDEVQRERVSAKNALESYAFNMKSAVEDEGLKGKISEAD KKKVLDKCQEVISWLDANTLAEKDEFEHKRKELEQVCNPIISGLYQGAGGPGPGGFGA QGPKGGSGSGPTIEEVD" BASE COUNT 640 a 798 c 879 g 559 t ORIGIN Chromosome 6p21.3. 1 tgccatgaga ccaacaccct tcccaccacc actccccctt ctctcagggc ccctgtcccc 61 tccagtgaat cccagaagac tctggagagt tctgagcaga gggcggcacc ctgccctctg 121 attggtccaa ggaaggctgg ggggcaggac gggaggcgaa acccctggaa tattcccgac 181 ctggcagcct catcgagctt ggtgattggc tcagaagggg aaaggcgggt ctccacgacg 241 acttataaaa gccgaggggc gcgcggtccg gaaaacggcc agcctgagga gctgctgcga 301 gggtccgctt cgtctttcga gagtgactcc cgcggtccca aggctttcca gagcgaacct 361 gtgcggctgc aggcaccggc gtgttgagtt tccggcgttc cgaaggactg agctcttgtc 421 gcggatcccg tccgccgttt ccagccccca gtctcagagc ggagcccaca gagcagggca 481 ccggcatggc caaagccgcg gcgatcggca tcgacctggg caccacctac tcctgcgtgg 541 gggtgttcca acacggcaag gtggagatca tcgccaacga ccagggcaac cgcaccaccc 601 ccagctacgt ggccttcacg gacaccgagc ggctcatcgg ggatgcggcc aagaaccagg 661 tggcgctgaa cccgcagaac accgtgtttg acgcgaagcg cctgatcggc cgcaagttcg 721 gcgacccggt ggtgcagtcg gacatgaagc actggccttt ccaggtgatc aacgacggag 781 acaagcccaa ggtgcaggtg agctacaagg gggagaccaa ggcattctac cccgaggaga 841 tctcgtccat ggtgctgacc aagatgaagg agatcgccga ggcgtacctg ggctacccgg 901 tgaccaacgc ggtgatcacc gtgccggcct acttcaacga ctcgcagcgc caggccacca 961 aggatgcggg tgtgatcgcg gggctcaacg tgctgcggat catcaacgag cccacggccg 1021 ccgccatcgc ctacggcctg gacagaacgg gcaaggggga gcgcaacgtc ctgatctttg 1081 acctgggcgg gggcaccttc gacgtgtcca tcctgacgat cgacgacggc atcttcgagg 1141 tgaaggccac ggccggggac acccacctgg gtggggagga ctttgacaac aggctggtga 1201 accacttcgt ggaggagttc aagagaaaac acaagaagga catcagccag aacaagcgag 1261 ccgtgaggcg gctgcgcacc gcctgcgaga gggccaagag gaccctgtcg tccagcaccc 1321 aggccagcct ggagatcgac tccctgtttg agggcatcga cttctacacg tccatcacca 1381 gggcgaggtt cgaggagctg tgctccgacc tgttccgaag caccctggag cccgtggaga 1441 aggctctgcg cgacgccaag ctggacaagg cccagattca cgacctggtc ctggtcgggg 1501 gctccacccg catccccaag gtgcagaagc tgctgcaaga cttcttcaac gggcgcgacc 1561 tgaacaagag catcaacccc gacgaggctg tggcctacgg ggcggcggtg caggcggcca 1621 tcctgatggg ggacaagtcc gagaacgtgc aggacctgct gctgctggac gtggctcccc 1681 tgtcgctggg gctggagacg gccggaggcg tgatgactgc cctgatcaag cgcaactcca 1741 ccatccccac caagcagacg cagatcttca ccacctactc cgacaaccaa cccggggtgc 1801 tgatccaggt gtacgagggc gagagggcca tgacgaaaga caacaatctg ttggggcgct 1861 tcgagctgag cggcatccct ccggccccca ggggcgtgcc ccagatcgag gtgaccttcg 1921 acatcgatgc caacggcatc ctgaacgtca cggccacgga caagagcacc ggcaaggcca 1981 acaagatcac catcaccaac gacaagggcc gcctgagcaa ggaggagatc gagcgcatgg 2041 tgcaggaggc ggagaagtac aaagcggagg acgaggtgca gcgcgagagg gtgtcagcca 2101 agaacgccct ggagtcctac gccttcaaca tgaagagcgc cgtggaggat gaggggctca 2161 agggcaagat cagcgaggcc gacaagaaga aggttctgga caagtgtcaa gaggtcatct 2221 cgtggctgga cgccaacacc ttggccgaga aggacgagtt tgagcacaag aggaaggagc 2281 tggagcaggt gtgtaacccc atcatcagcg gactgtacca gggtgccggt ggtcccgggc 2341 ctggcggctt cggggctcag ggtcccaagg gagggtctgg gtcaggccct accattgagg 2401 aggtggatta ggggcctttg ttctttagta tgtttgtctt tgaggtggac tgttgggact 2461 caaggacttt gctgctgttt tcctatgtca tttctgcttc agctctttgc tgcttcactt 2521 ctttgtaaag ttgtaacctg atggtaatta gctggcttca ttatttttgt agtacaaccg 2581 atatgttcat tagaattctt tgcatttaat gttgatactg taagggtgtt tcgttccctt 2641 taaatgaatc aacactgcca ccttctgtac gagtttgttt gttttttttt tttttttttt 2701 tttttgcttg gcgaaaacac tacaaaggct gggaatgtat gtttttataa tttgtttatt 2761 taaatatgaa aaataaaatg ttaaactttt tcttgtctgt taatatgtga agataatgga 2821 tatttgcgga gggatagtgt ctgaatacca tctatcttta tagtctgaaa agaaca // LOCUS HUMAPEXN 3730 bp DNA PRI 18-JAN-1995 DEFINITION Human APX gene encoding APEX nuclease, complete cds. ACCESSION D13370 NID g219473 KEYWORDS 3'-5' exonuclease; AP endonuclease; APEX nuclease; DNA 3' repair diesterase; DNA 3'-phosphatase; DNA repair enzyme. SOURCE Homo sapiens (library: human leukocyte genomic) blood leukocyte DNA, clone lambda 35. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Seki,S., Hatsushika,M., Watanabe,S., Akiyama,K., Nagao,K. and Tsutsui,K. TITLE cDNA cloning, sequencing, expression and possible domain structure of human APEX nuclease homologous to Escherichia coli exonuclease III JOURNAL Biochim. Biophys. Acta 1131 (3), 287-299 (1992) MEDLINE 92329542 REFERENCE 2 (bases 1 to 3730) AUTHORS Akiyama,K., Seki,S., Oshida,T. and Yoshida,M.C. TITLE Structure, promoter analysis and chromosomal assignment of the human APEX gene JOURNAL Biochim. Biophys. Acta 1219 (1), 15-25 (1994) MEDLINE 94368844 REFERENCE 3 (bases 1 to 3730) AUTHORS Seki,S. TITLE Direct Submission JOURNAL Submitted (06-OCT-1992) to the DDBJ/EMBL/GenBank databases. Shuji Seki, Okayama Univ. Medical School, Inst. of Cell. and Mol. Biol., Department of Molecular Biology; 2-5-1, Shikata-cho, Okayama, Okayama 700, Japan (Tel:0862-23-7151, Fax:0862-22-2846) COMMENT Submitted (06-Oct-1992) to DDBJ by: Shuji Seki Department of Molecular Biology Institute of Cellular and Molecular Biology Okayama University Medical School 2-5-1 Shikata-cho Okayama 700 Japan Phone: 086-223-7151 Fax: 086-222-2846. FEATURES Location/Qualifiers source 1..3730 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" /clone_lib="human leukocyte genomic library" misc_feature 226..1645 /note="CpG island" exon 892..1155 /number=1 prim_transcript 892..3527 /note="APX mRNA and introns" intron 1156..1338 /number=1 exon 1339..1464 /number=2 gene 1407..3269 /gene="APX" CDS join(1407..1464,1675..1862,2429..2621,2752..3269) /gene="APX" /codon_start=1 /product="APEX nuclease" /db_xref="PID:d1003137" /db_xref="PID:g219474" /translation="MPKRGKKGAVAEDGDELRTEPEAKKSKTAAKKNDKEAAGEGPAL YEDPPDQKTSPSGKPATLKICSWNVDGLRAWIKKKGLDWVKEEAPDILCLQETKCSEN KLPAELQELPGLSHQYWSAPSDKEGYSGVGLLSRQCPLKVSYGIGEEEHDQEGRVIVA EFDSFVLVTAYVPNAGRGLVRLEYRQRWDEAFRKFLKGLASRKPLVLCGDLNVAHEEI DLRNPKGNKKNAGFTPQERQGFGELLQAVPLADSFRHLYPNTPYAYTFWTYMMNARSK NVGWRLDYFLLSHSLLPALCDSKIRSKALGSDHCPITLYLAL" intron 1465..1674 /gene="APX" /number=2 exon 1675..1862 /gene="APX" /number=3 misc_difference 1769 /gene="APX" /note="replace(1769,'c')" /citation=[1] intron 1863..2428 /gene="APX" /number=3 exon 2429..2621 /gene="APX" /number=4 intron 2622..2751 /gene="APX" /number=4 exon 2752..3527 /number=5 misc_difference 2756 /gene="APX" /note="replace(2756,'t')" /citation=[1] BASE COUNT 920 a 920 c 886 g 1004 t ORIGIN 1 aaacagaata tttcgagtcc atctctatta tactgtttat catgctgtag tataaacatc 61 tatttatatg tctgactcca cagcagtata acatctattt ctgaatactt agcacataaa 121 atgcgctcag taaaaggtaa atgaactcat cagatccacg tccaagtgcc tgtaactctt 181 ctagtcatca gccttgtgac gttaataatg tcacataatt tcaatgagac tcggatcgaa 241 agctgcacct cactagtatt taggaagatg gaaggctgat tctgtgcggg gaagtgcgcg 301 gaaaactctt aagtccccta ctcaccaacc tgtgccagga ggcgtgacgt aagtccgccg 361 cgggttcgcc agcaccttgc catcccgcac cacgcccacg ccaatcttat tggcgctgcc 421 ttcaaaaccc agcaccgccg gcatggcgga ggctgggaga aaacgccgac aggactcctg 481 gcaatgtcag gagctgtgga ggtcctcact agtccgcgct gggccgcagc tttccggagc 541 gcagaggaag ctggccagcc tgcagatagc actgggaaag acaccgcgga actccagcga 601 gcggagacac gccaaggccc ctccagggac ctgtcttcct aactgccagg gacgccgagc 661 caactctgtg ccttacattc gtatccgttt tcctatctct ttcccgtggt ccagcccagc 721 cttctccact gtttttttcc ctcttgcaca gagttagaat cttaagtcag tgtcacacaa 781 tgtgctgtgc atctggcaca acgataaaca gcccgaggga gggttgggga cctaagtgtc 841 ctagagaatt agaggaggga ggcgaggcta agcgtctccg tcacgtggtg tcagacagac 901 caatcacgcg cattcttcgg ccacgacaag cgcgcctctg atcacgtgac caggtccgct 961 acccacgtgg gggctcagcg tgcacccttc tttgtgctcg ggttaggagg agctaggctg 1021 ccatcgggcc ggtgcagata cggggttgct cttttgctca taagaggggc ttcgctggca 1081 gtctgaacgg caagcttgag tcaggaccct taattaagat cctcaattgg ctggagggca 1141 gatctcgcga gtagggtaca aggcactatg aaatgatcta gtttcgtggg tgaggggctg 1201 aagggcctat gatgcacgga ggcggggaaa ggatttagag ataacgtggt ttgaaaggcg 1261 ggacctggtg cggggacgct cttgggagga gtcttctccc cagccttagc tggtttcatg 1321 atttctttgc gtctgtaggc aacgcggtaa aaatattgct tcggtgggtg acgcggtaca 1381 gctgcccaag ggcgttcgta acgggaatgc cgaagcgtgg gaaaaaggga gcggtggcgg 1441 aagacgggga tgagctcagg acaggtaagg gaatgaaatc agcccttctt cctagaagct 1501 gcggcggggg tgtttgtcat tcccttgatg tacggtaagt acgggccgac tcatttttgc 1561 aggggtttgt gaagaagtcg caggaaccgt aggctttcgt tgggtctata gttaacgccg 1621 gatcgcagtt ggaaaccacc agctttttgt cagtatatat tactcatttt atagagccag 1681 aggccaagaa gagtaagacg gccgcaaaga aaaatgacaa agaggcagca ggagagggcc 1741 cagccctgta tgaggacccc ccagatcaga aaacctcacc cagtggcaaa cctgccacac 1801 tcaagatctg ctcttggaat gtggatgggc ttcgagcctg gattaagaag aaaggattag 1861 atgtgagtgg aatttgaggg aaagagacat tttttagtat tgaatggtct tagggtttag 1921 tcaccccttt tctccgttta gccttcaggc tgttttattt ttctcctgcc cgtagttttc 1981 tgtggggctt ccccagtctt gccagttgta tttcctaaat gtctgttcct tcacttccat 2041 tgccattttc ttttttagtg ttctctcctc ttcccagaat gttgcaaaaa cctcttcact 2101 atacttcctc cattttatct tcctgcattg cattccatat gaagcatgtc ctccattcca 2161 ttaaccatag cttaaaaatc ttagcttgct atccactgcc tatagaaaaa acacatctcc 2221 ttggcatagc atgtaagact ttcttacctc tctatatttg ttttcattta tctagcttag 2281 aattgtttga atattgtgct gcttgactcg aactccttag gccaagagac tgtttaaccc 2341 gtgcgtatct atgacttagc atatagatta ttcaataaat gttctgctga attgataata 2401 cgttttccac ctttcttttc acttacagtg ggtaaaggaa gaagccccag atatactgtg 2461 ccttcaagag accaaatgtt cagagaacaa actaccagct gaacttcagg agctgcctgg 2521 actctctcat caatactggt cagctccttc ggacaaggaa gggtacagtg gcgtgggcct 2581 gctttcccgc cagtgcccac tcaaagtttc ttacggcata ggtgagaccc tattgatgcc 2641 taatgcctga actcttcaaa accaattgct aattctctat ctctgcccca cctcttgatt 2701 gctttccctt ttcttatagt tttttatgct aattctgttt catttctata ggcgaggagg 2761 agcatgatca ggaaggccgg gtgattgtgg ctgaatttga ctcgtttgtg ctggtaacag 2821 catatgtacc taatgcaggc cgaggtctgg tacgactgga gtaccggcag cgctgggatg 2881 aagcctttcg caagttcctg aagggcctgg cttcccgaaa gccccttgtg ctgtgtggag 2941 acctcaatgt ggcacatgaa gaaattgacc ttcgcaaccc caaggggaac aaaaagaatg 3001 ctggcttcac gccacaagag cgccaaggct tcggggaatt actgcaggct gtgccactgg 3061 ctgacagctt taggcacctc taccccaaca caccctatgc ctacaccttt tggacttata 3121 tgatgaatgc tcgatccaag aatgttggtt ggcgccttga ttactttttg ttgtcccact 3181 ctctgttacc tgcattgtgt gacagcaaga tccgttccaa ggccctcggc agtgatcact 3241 gtcctatcac cctataccta gcactgtgac accaccccta aatcactttg agcctgggaa 3301 ataagccccc tcaactacca ttccttcttt aaacactctt cagagaaatc tgcattctat 3361 ttctcatgta taaaactagg aatcctccaa ccaggctcct gtgatagagt tcttttaagc 3421 ccaagatttt ttatttgagg gttttttgtt ttttaaaaaa aaattgaaca aagactacta 3481 atgactttgt ttgaattatc cacatgaaaa taaagagcca tagtttcagc cttgctgtct 3541 tcgtgtctta ccccttcgtg gggctacaca ttctcttcct catattttca tgcacacaag 3601 ttaacaagtg aaaagcgtag agtcatgacc ttatttattt acaagcacag gataagtccc 3661 taacctcccc caaagactga gcaaccctac ccagcccagt taaatactgc aactgggggg 3721 gtaaaaaagg // LOCUS HUMGAD45A 5378 bp DNA PRI 25-JAN-1994 DEFINITION Human gadd45 gene, complete cds. ACCESSION L24498 NID g403127 KEYWORDS . SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5378) AUTHORS Hollander,M.C., Alamo,I., Jackman,J., Wang,M.G., McBride,O.W. and Fornace,A.J. TITLE Analysis of the mammalian gadd45 gene and its response to DNA damage JOURNAL J. Biol. Chem. 268 (32), 24385-24393 (1993) MEDLINE 94043278 FEATURES Location/Qualifiers source 1..5378 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="WI38" /cell_type="fibroblast" /tissue_type="lung" /tissue_lib="Stratagene cat 944201" /map="1p31.1-31.2" exon 225..2595 misc_signal 2158..2165 /note="octamer transcription binding site" misc_signal 2185..2192 /note="octamer transcription binding site" 5'UTR 2257..2551 CDS join(2552..2595,3082..3183,3407..3644,4718..4831) /codon_start=1 /db_xref="PID:g403128" /translation="MTLEEFSAGEQKTERMDKVGDALEEVLSKALSQRTITVGVYEAA KLLNVDPDNVVLCLLAADEDDDRDVALQIHFTLIQAFCCENDINILRVSNPGRLAELL LLETDAGPAASEGAEQPPDLHCVLVTNPHSSQWKDPALSQLICFCRESRYMDQWVPVI NLPER" intron 2596..3081 exon 3082..3183 intron 3184..3406 exon 3407..3644 intron 3645..4717 misc_signal 3832..3851 /note="p53 binding site" misc_signal 3881..3887 /note="AP1 binding site" exon 4718..>5378 3'UTR 4829..>5378 BASE COUNT 1381 a 1247 c 1375 g 1375 t ORIGIN 1 gtcgacttgg gtggggcact ttaggactgt ggttcatttg aattggtgta aacaatacac 61 cggttctact gtcctacagc ctccattcag atgactgaag tcatgggact ttcagcatag 121 ctagctgatg acagtgcata ctattttgtc ccaaaatcca gttcaagcat ggacatacca 181 ataagagcct aagctcttta aaggcaaagg accaggaatt gtacagttct tggtatagaa 241 gaagacaggc aaaagtgttt ttgaactaac gttaaatgtg caatatgtta gaattcatgc 301 aatgcacagg actgcaggat tctgatatct tatttaactc tcaaattcta ttcaactcaa 361 taaaccttga ctgtgcttct actaaatgca ggtattgtac taggagctga ggacaccaaa 421 ctgatgaagt ccttgctgtc aagaaactca catgattccc taattctttg tcagcttgct 481 gtgatcacat tttcttccca agaacctcta agaaatgcct agtggataga accttggagt 541 tccacggaac atattaacaa tcgccaaatg atgactcagg ctagattgtg taattcaggt 601 tttgtctgca aaactgaaaa tgcttcggta acctacctaa atttcaatgt tgaggaattc 661 tttaagaaag acatcaaatg ttaagattta aggcatagat atgagataca tagtcatgct 721 taggtgaatt atgcactgac catgaccatt tctttactca aatgttgtcc atggctgaca 781 acacagtgaa aaaatgagtg caaaatgaca actcaaataa atgaaccaga aaacctatca 841 cttttctttt ccaccaaatt aagatcaaga gagctggaga atattttgtc tagagtgata 901 aaaacataag ggtgcaaaac ttccaggtac ctttgcagaa attacttctg tgacctttgg 961 ctgtacagca accttaataa tgcaagcact gttttgaatg caagcatgtg ggagccattt 1021 tcaccacttt tgatgacttc agtaggttta agaaatgttt ttgcttttat tgcataaacc 1081 ataaaacaaa ggaagggact tttgaactac tcagtgagag tctatatatt aaagtttgtt 1141 tttcaaaaat gtgtaactac catttgcagt tttaaaggtc tgctttccac ctacaagttg 1201 ccattatctc aaaggtgaaa ttttagcata tgactaaaaa cttcctatag ttacagcttc 1261 atgattcagc atctaacatc aataattcac agtgagatca taggaggctc tctgtggaag 1321 gtaacgacat acatacgtta ggaaaggaag cttagggcat atcgagagca ttttgaattt 1381 agacttgtgg gctgtgtggg tgtcagatgg ttgtctctca gctggtgggc gtccagaagg 1441 atccttgttt gggcaaggct ctttgagaaa ggagaatctg ggttgccagg gattcccaca 1501 tgtggtcacc agctccccac gcagaccagc tcacgatttc ccagttacac cgggcaggtg 1561 ggaaaccgtt ctgctttctg tggaaaagat tctaacttgg ttccctgcca tccctgaata 1621 caaacgggtt ggtttttctt ttttcagctt ccaacccttg cagctttcca aaaataaatc 1681 aaaccagcca tcagggcacc gaaataatac tactgctaat aagcagcttc gcctagactt 1741 agataaacaa cacttctgag gtaaactttg ccccggaggt ctggagacac ttttttaatg 1801 taacctgctt actaataatt actagacttc agtgcattaa ccctggaaat agattttaat 1861 agccacccct taaaacaaaa gacatgaaaa gataataaga aaaaagtgcc gcaactatta 1921 tagaaaaaca cttggcagcc tgcttcagcc caagctgagg ccacctctag cctctgctaa 1981 agccccccac tcccaatggt ccccgccaac cggataagag tgcgcgcggg acccgccttc 2041 ccctctcggc accgcccccg cccccgcccc ctcggctcgc ctcccgcgtg gctcctccct 2101 tttccgctcc tctcaacctg actccaggag ctggggtcaa attgctggag caggctgatt 2161 tgcatagccc aatggccaag ctgcatgcaa atgaggcgga aggtggttgg ctgagggttg 2221 gcaggataac cccggagagc ggggcccttt gtcctccagt ggctggtagg cagtggctgg 2281 gaggcagcgg cccaattagt gtcgtgcggc ccgtggcgag gcgaggtccg gggagcgagc 2341 gagcaagcaa ggcgggaggg gtggccggag ctgcggcggc tggcacagga ggaggagccc 2401 gggcgggcga ggggcggccg gagagcgcca gggcctgagc tgccggagcg gcgcctgtga 2461 gtgagtgcag aaagcaggcg cccgcgcgct agccgtggca ggagcagccc gcacgccgcg 2521 ctctctccct gggcgacctg cagtttgcaa tatgactttg gaggaattct cggctggaga 2581 gcagaagacc gaaaggtgag tcggcctgcg gactcttccg gcccgaactt ctcttaccta 2641 ccccgcgctc cccggtgcag ccgggctgtg gaaggcttgc aggggaggaa gctaaaaagt 2701 ttgcacaggg caactcccgc ccttgctccc tcgggactct ccgtggagct cccacggact 2761 gaaagagcgt gccccccaac ccgaacgagc cccgccgggg cctttgcaaa gggcagcagt 2821 ggccgtcgct gcccgtgcgg ctcccgtggc tggcagcctg tggcaggggc actctcggga 2881 cttctcacgg gacgcccggt ccttgggcgt gcaggggtca tggggggtga cggggccgcg 2941 ggagcgccgg gttttcgtag agcccaggtg cgcggtggtg cttgcattcg agagggaggg 3001 gcgtggtacc ggacgagggg ggcggcgatg gccccgaggg caccggggct gacgggaccc 3061 ctcgcccttg cccgcgtgta ggatggataa ggtgggggat gccctggagg aagtgctcag 3121 caaagccctg agtcagcgca cgatcactgt cggggtgtac gaagcggcca agctgctcaa 3181 cgtgtaagtg gggcccttgc gcgtccccca tggcacccct tcccgcccca gcccgggagg 3241 tcgccttggc tgggcgcccc tcgcccggcc gcgccacttc ctgtcgcttt tctgcctgtc 3301 tcggaaggga gggggcgagc gggccgggcg gcgaccccca gggacccggg cagtggttga 3361 gggcgcccgc gcttctgcgc tcactggccc cgcccgctgc ccccagcgac cccgataacg 3421 tggtgttgtg cctgctggcg gcggacgagg acgacgacag agatgtggct ctgcagatcc 3481 acttcaccct gatccaggcg ttttgctgcg agaacgacat caacatcctg cgcgtcagca 3541 acccgggccg gctggcggag ctcctgctct tggagaccga cgctggcccc gcggcgagcg 3601 agggcgccga gcagcccccg gacctgcact gcgtgctggt gacggtaagg gactggggga 3661 ctgcagcctg cagggtagag ccccggaagg acgggagtca gggctgggtt gcctgattgt 3721 ggatctgtgg taggtggggg tcaggagggt ggctgccttt gtccgactag agtgtggctg 3781 gactttcagc cgagatgtgc tagtttcatc accaggattt tctgtggtac agaacatgtc 3841 taagcatgct ggggactgcc agcagcggaa gagatccctg tgagtcagca gtcagcccag 3901 ctactcccta cctacatctg cactgcctcc cgtgactaat tcctttagca gggcagatta 3961 gataaagcca aatgaattcc tggctcaccc ctcattaagg agtcagcttc attctctgcc 4021 agtcagagct aaaaatagaa attgtgtagg agacaaacct tgttaattcc ctagaaatac 4081 attaagagga tagagtggaa ttttttttct ctgcaatctt gcattttttt aatggctctt 4141 tttttttttc ctgataaaaa cctttggtag gtagggaagt tatgttttca ggggtaaatg 4201 tgctactttt gtcttctaaa ttttgctctt ttttgactgg tctagtcaag tgacagcccg 4261 attattttgc tactccttaa aagtactatt ctgtctcttg gagtatggtt gatggcaatt 4321 ccagttaact gctgtgcagc tctcatctca ttgtgcacac agcatggaaa tctttctcaa 4381 aactgtttca ctcaggtcag ggtaacaagt ttggtagagc aaaccggtga atgatactct 4441 catgcaaaac tgaacagata tgcaaacata tgtatgtggt tcagcttggg ttgcatgggt 4501 tcagactttg caatgtgtag tttaataggt aattaccctt aacgcttttg cagggaaccc 4561 aactaccttg aagaaacttt aatttttttg tgcttctaat ttgtctccat gtcacatagc 4621 caaaatatag aatgttcaag tgttttctcc tcaaaagtat aattactaga atatactggt 4681 ttttaaaata agtttatttt tataaatttg tttccagaat ccacattcat ctcaatggaa 4741 ggatcctgcc ttaagtcaac ttatttgttt ttgccgggaa agtcgctaca tggatcaatg 4801 ggttccagtg attaatctcc ctgaacggtg atggcatctg aatgaaaata actgaaccaa 4861 attgcactga agtttttgaa atacctttgt agttactcaa gcagttactc cctacactga 4921 tgcaaggatt acagaaactg atgccaaggg gctgagtgag ttcaactaca tgttctgggg 4981 gcccggagat agatgacttt gcagatggaa agaggtgaaa atgaagaagg aagctgtgtt 5041 gaaacagaaa aataagtcaa aaggaacaaa aattacaaag aaccatgcag gaaggaaaac 5101 tatgtattaa tttagaatgg ttgagttaca ttaaaataaa ccaaatatgt taaagtttaa 5161 gtgtgcagcc atagtttggg tatttttggt ttatatgccc tcaagtaaaa gaaaagccga 5221 aagggttaat catatttgaa aaccatattt tattgtattt tgatgagata ttaaattctc 5281 aaagttttat tataaattct actaagttat tttatgacat gaaaagttat ttatgctata 5341 aattttttga aacacaatac ctacaataaa ctggtatg // LOCUS HUMMHHSPHO 3330 bp DNA PRI 07-MAR-1995 DEFINITION Human MHC class III HSP70-HOM gene (HLA), complete cds. ACCESSION M59829 M34268 NID g188491 KEYWORDS class III gene; complement system protein; heat shock-induced protein; major histocompatibility complex. SOURCE Human DNA, clone H92. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3330) AUTHORS Milner,C.M. and Campbell,R.D. TITLE Structure and expression of the three MHC-linked HSP70 genes JOURNAL Immunogenetics 32 (4), 242-251 (1990) MEDLINE 91055806 FEATURES Location/Qualifiers source 1..3330 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="H92" /haplotype="HLA:A2,B7,C2C,Bfs,C4A3,C4BQ0,DR2" gene 960..2885 /gene="HSP70-HOM" CDS 960..2885 /gene="HSP70-HOM" /codon_start=1 /product="heat shock-induced protein" /db_xref="PID:g188492" /translation="MATAKGIAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYV AFTDTERLIGDAAKNQVAMNPQNTVFDAKRLIGRKFNDPVVQADMKLWPFQVINEGGK PKVLVSYKGENKAFYPEEISSMVLTKLKETAEAFLGHPVTNAVITVPAYFNDSQRQAT KDAGVIAGLNVLRIINEPTAAAIAYGLDKGGQGERHVLIFDLGGGTFDVSILTIDDGI FEVKATAGDTHLGGEDFDNRLVSHFVEEFKRKHKKDISQNKRAVRRLRTACERAKRTL SSSTQANLEIDSLYEGIDFYTSITRARFEELCADLFRGTLEPVEKALRDAKMDKAKIH DIVLVGGSTRIPKVQRLLQDYFNGRDLNKSINPDEAVAYGAAVQAAILMGDKSEKVQD LLLLDVAPLSLGLETVGGVMTALIKRNSTIPPKQTQIFTTYSDNQPGVLIQVYEGERA MTKDNNLLGRFDLTGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKVNKITITND KGRLSKEEIERMVLDAEKYKAEDEVQREKIAAKNALESYAFNMKSVVSDEGLKGKISE SDKNKILDKCNELLSWLEVNQLAEKDEFDHKRKELEQMCNPIITKLYQGGCTGPACGT GYVPGRPATGPTIEEVD" BASE COUNT 951 a 738 c 867 g 774 t ORIGIN Chromosome 6p21.3. 1 ggatcctatg agcctgggag gtcaggactg cagtgagcca tgattacacc actgcagtgc 61 agcctgcgtg acaaaacgag accctgtctc taaaaaatga gaaaaaaaaa tggttgttac 121 caggcgataa agggagggga aaacgggagt tacttaatga gtatacagtt tcagttttgc 181 gagatgaaca gaattctgga aattggttga acaccgctgt gattgaactc actaccaaac 241 tctacactta aaaatggtta agatggtaca atttgtatgt attttaccac aataaaaaat 301 aaaaaaaagg ctgggcgaga tgttcactcc tgtaatccca gtacttgggg aggctggggc 361 tgaaggatcg tttgagccct gaaggagttt gagaccagcc tgagcaacat aaggagaccc 421 catctgtaca caaaattaaa acattagcca ggcagagagc tggtcacggt ggctcacgta 481 tgtaatccca gcactttggg aggccgaggc gggcgggcgg atcacctgag gtcaggagtt 541 tgagaccagc ctggccaaca tagtgaaacc gtgaaacccc atctctacta aaaatacaaa 601 aattagctgg gcgtggtggt gccctcataa tcccagccac tcgggaggct gagacaggag 661 aatcgcttga actcaggagg tggaggttgc agtgagccta gatcacacca ctgcagtcca 721 aagcaagact ccgtctcaaa aaaaaaaaaa attagcccgg ctgttgtctc cagttattct 781 ggaggctaag gcaggaagat tgctggagcc taggagatca aagctgcagt gagctatgac 841 tgcgcctctg cactccaacc tgggtgacag aggaagaccc tgtctcaaaa aaataaataa 901 cattgaaaag gaactctccc aaaagtatct tattctttct ccataggcct cagagaacca 961 tggctactgc caagggaatc gccataggaa tcgacctggg caccacctac tcctgtgtgg 1021 gggtgttcca gcacggcaag gtggagatca tcgccaacga ccagggcaac cgcaccaccc 1081 ccagctacgt ggccttcaca gacaccgagc ggctcattgg ggatgcggcc aagaaccagg 1141 tagcaatgaa tccccagaac actgtttttg atgctaaacg tctgatcggc aggaaattta 1201 atgatcctgt tgtacaagca gatatgaaac tttggccttt tcaagtgatt aatgaaggag 1261 gcaagcccaa agtccttgtg tcctacaaag gggagaataa agctttctac cctgaggaaa 1321 tctcttcgat ggtattgact aagttgaagg agactgctga ggcctttttg ggccaccctg 1381 tcaccaatgc agtgattacc gtgccagcct atttcaatga ctctcaacgt caggctacta 1441 aggatgcagg tgtgattgct ggacttaatg tgctaagaat catcaatgag cccacggctg 1501 ctgccattgc ctatggttta gataaaggag gtcaaggaga acgacatgtc ctgatttttg 1561 atctgggtgg aggcacattt gatgtgtcaa ttctgaccat agatgatggg atttttgagg 1621 taaaggccac tgctggggac actcacctgg gtggggagga ctttgacaac aggcttgtga 1681 gccacttcgt ggaggagttc aagaggaaac acaaaaagga catcagccag aacaagcgag 1741 ccgtgaggcg gctgcgcacc gcctgcgaga gggccaagag gaccctgtcg tccagcaccc 1801 aggccaacct agaaattgat tcactttatg aaggcattga cttctataca tccatcacca 1861 gagctcgatt tgaagagttg tgtgcagacc tgtttagggg taccctggag cctgtagaaa 1921 aagcgcttcg ggatgccaag atggataagg ctaaaatcca tgacattgtt ttagtagggg 1981 gctccacccg catccccaag gtgcagcggc tgcttcagga ctacttcaat ggacgtgatc 2041 tcaacaagag catcaaccct gatgaggccg tagcatatgg ggctgcggta caagcagcca 2101 tcctgatggg ggacaagtct gagaaggtac aggacctgct gctgctggac gtggctcccc 2161 tgtccctggg tctggagacg gttgggggcg tgatgactgc cctgataaag cgcaactcca 2221 ccatcccacc caagcagaca cagattttca ccacctactc tgacaaccaa cccggggtgc 2281 tgatccaggt gtatgagggc gagagggcca tgacaaagga caacaacctg ctggggcggt 2341 ttgatctgac tggaatccct ccagcaccca ggggagttcc tcagatcgag gtgacgtttg 2401 acattgatgc caatggtatt ctcaatgtca cagccacgga caagagcacc ggcaaggtga 2461 acaagatcac catcaccaat gacaagggcc gcctgagcaa ggaggagatt gagcggatgg 2521 ttctggatgc tgagaaatat aaagctgaag atgaggtcca gagggagaaa attgctgcaa 2581 agaatgcctt agaatcctat gcttttaaca tgaagagtgt tgtgagtgat gaaggtttga 2641 agggcaagat tagtgagtct gataaaaata aaatattgga taaatgcaac gagctccttt 2701 cgtggctgga ggtcaatcaa ctggcagaga aagatgagtt tgatcataag agaaaggaat 2761 tggagcagat gtgtaaccct atcatcacaa aactctacca aggaggatgc actgggcctg 2821 cctgcggaac agggtatgtg cctggaaggc ctgccacagg ccccacaatt gaagaagtag 2881 attaattctt tttagaactg aagcatccta ggatgcctct acatgtattt cattcccctc 2941 atgttgaaac atcattatta ttcttgacca gacctgaatc taagttacca tcccttggaa 3001 attctggaga aggagtctca tgcaccacct atcacactcc ctcacatcct gtttctgact 3061 ttggaatgga ctcaggaaaa ctaggcccct ctttaaccgt gtgatgtatt tgaatgtctg 3121 ttatttccag ccaccctaac attcttcttc ctgtgtggat gcttatttgt caatcagtaa 3181 atttgttcgt aaagaaaatt acttctggta tttaggctgt gaatgtacct tgaaggggag 3241 agttcatgga gagagcatgt gttctctgat tgtgaggtca ctgtgaatga ttaaattggt 3301 aagggtaaag tatttgaatt ttcatgaact // LOCUS HSHOX51 6305 bp DNA PRI 25-JUN-1997 DEFINITION Human HOX 5.1 gene for HOX 5.1 protein. ACCESSION X17360 NID g32394 KEYWORDS DNA-binding protein; homeobox; Hox 5.1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6305) AUTHORS Cianetti,L. TITLE Direct Submission JOURNAL Submitted (20-DEC-1989) Cianetti L., Dept. of Hematology-Oncology, Istituto Superiore Di Sanita', Viale Regina Elena, 299, 00161 Rome, Italy REMARK (revised by [3]) REFERENCE 2 (bases 1 to 6305) AUTHORS Cianetti,L., Di Cristofaro,A., Zappavigna,V., Bottero,L., Boccoli,G., Testa,U., Russo,G., Boncinelli,E. and Peschle,C. TITLE Molecular mechanisms underlying the expression of the human HOX-5.1 gene JOURNAL Nucleic Acids Res. 18 (15), 4361-4368 (1990) MEDLINE 90356367 REMARK (revised by [3]) REFERENCE 3 (bases 1 to 6305) AUTHORS Cianetti,L. TITLE Direct Submission JOURNAL Submitted (14-SEP-1990) to the EMBL/GenBank/DDBJ databases COMMENT See also (HSHOM4) for HHo.c13 clone. FEATURES Location/Qualifiers source 1..6305 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda 13G." /dev_stage="adult" /chromosome="2" /map="q31-37" TATA_signal 347..350 exon 376..2051 /number=1 mRNA join(376..2051,2593..6053) prim_transcript 376..6053 CDS join(1619..2051,2593..2927) /codon_start=1 /product="hox 5.1 protein" /db_xref="PID:g296652" /db_xref="SWISS-PROT:P09016" /translation="MVMSSYMVNSKYVDPKFPPCEEYLQGGYLGEQGADYYGGGAQGA DFQPPGLYPRPDFGEQPFGGSGPGPGSALPARGHGQEPGGPGGHYAAPGEPCPAPPAP PPAPLPGARAYSQSDPKQPPPGTALKQPAVVYPWMKKVHVNSVNPNYTGGEPKRSRTA YTRQQVLELEKEFHFNRYLTRRRRIEIAHTLCLSERQIKIWFQNRRMKWKKDHKLPNT KGRSSSSSSSSSCSSSVAPSQHLQPMAKDHHTDLTTL" intron 2052..2592 /number=1 exon 2593..6053 /number=2 misc_feature 2619..2801 /note="homeobox" polyA_signal 3192..3197 polyA_signal 5971..5976 polyA_signal 6039..6044 BASE COUNT 1587 a 1433 c 1686 g 1599 t ORIGIN 1 ggatcctggt gggggagggt ggttaataaa gccgccatcc ttgggatgga ttatttttct 61 ttctttcttt ctttttttct ttcttaagaa gaatattctg gttgttcgcc tgcttggtaa 121 ccctgaccct ggcagaagaa tgagggaact cattgcttca aattgtcgcc aagcccatta 181 ggctacctga actgtctcag aaagtgcggg tggctgcgtc gaacggtggt ggctcagagg 241 aagagattgg ggccggcagc gacctaggta cctcactctg ggtgggaccc agaggttgta 301 acgttgtcta tatataccct gtagaaccga atttgtgtgg tatccgtata gtcacagatt 361 cgattctagg ggaatatatg gtcgatgcaa aaacttcacg tttcttcgga atagccagag 421 accaaagtgc gacatggaga ctagaagcag ccggcgctgg tcagccgcct cgttctgttt 481 tattaccttg gactccagga ggatcagctg cgcctggtga catagagcag cttttcctct 541 ccagaagctc ctcaccttta aacagagtat cctctgggtg ctgaaaagaa agaaagacag 601 aaagagagaa agagagagag agagagaaag agagaatgca agcctaattg gttgcatgga 661 tgcagggcca aagggctagg ttttggggta ctagggagtg aggtacaagg ccagcttgcc 721 cagtcccagc tctgccctcc aggaacatga ggtgcaaagg tacccaaatg ggggcttgct 781 tgtatttggg gcctgtggga agaaagcaag cttcaaagaa gcccagtggg gagctctagg 841 gtgcattttg acaaggtgga ggtgcccttg ccaccatccc agcccacccc cagctacatg 901 ggcaagggca gcgagggccc cctgctattt tggcagggcc cagctttggc tgggaacccc 961 cgggcctggg cactggtaga aagcatggcg gttactcatt gcctaatttg attcaagctg 1021 gccagattct ggtaactttt gggtgaccct gatgaagaca aagccaggac ggcggccttt 1081 gtatggcaga tccctgctcc cgccggctgc aggcagggcg ggcaggcagg aaccctcctc 1141 gcctggggca ctctgcccaa ctcagaggcg agttcaccca cccacctttc attgctctgt 1201 accccaatag gaggattcat tctcccttga gctgtgccta cttggtgtcg gggggcgggg 1261 gttgcattca gctgggggtg agtggaaggg ccacggaagg ttggcaaaat cagtggcaga 1321 caaaagctgg gattacctga ggggaatggg gtgctgggga ctggaactac attaatatct 1381 ggcaggggct ctcaaatgtg ccatagcaag ctacttgatt acacgtatgt tatttagtta 1441 aatttgtgaa aattatgaga tgctcaccaa cccggtgata aacttgctcc ctcgccattg 1501 gctggcctgg tcacatggct gcccaacttt attcagttga cagcaagtag gagggcccta 1561 tggaaggaga aaaaaagaca acacgagaaa aattagtatt ttctaccttc tgaaattaat 1621 ggtcatgagt tcgtatatgg tgaactccaa gtatgtggac cccaagttcc ctccgtgcga 1681 ggagtatttg cagggcggct acctaggcga gcagggcgcc gactactacg gcggcggcgc 1741 gcagggcgca gacttccagc ccccggggct ctacccacgg cccgacttcg gtgagcagcc 1801 tttcggaggc agcggccccg ggcctggctc ggcgctgcct gcgcggggtc acggacaaga 1861 gccaggcggc cccggcggtc actacgccgc tccaggagag ccttgcccag ctcccccggc 1921 gcctccgccg gcgcccctgc ctggcgcccg ggcctacagt cagtccgacc ccaagcagcc 1981 gccccccggg acggcactca agcagccggc cgtggtctac ccctggatga agaaggtgca 2041 cgtgaattcg ggtaaggcta gggtccagta acctttctgt ccacatccca gcccgttagc 2101 ctgggtcctc tggaaggggg tgcgagtagg tgggggcgtg tggagcttcc atgggcgccg 2161 caattactct ccccataaat ttttatagct gagggagcag gtcaggacca tgtggctggc 2221 tgctcggctg tgggcgcaaa agggggtggg gatggggggg tgggggagga ctccattttc 2281 agagcagggg gaaggctgtg gaggagcggg ggatttccaa aatgcttgag ggttccggac 2341 ctggtggtgg gcccagaaga aggagcacat ttggggatcc cgcaagcctg gggtatgtgg 2401 gtgtgtttga ggaggtgggt gggagtgagc gtgtgcgccg gggagagggc gggagggagg 2461 aagcaagcga gcttgggagc gcgcggggag ggccgcgggc ctcggggcgc gccaggaagt 2521 gagcggcgga ggcgaggggc ctaactagtg gccgggcgct gacctgcctg tcctgtctgt 2581 tttgtctcgc agtgaacccc aactacaccg gtggggaacc caagcggtcc cgaacggcct 2641 acacccggca gcaagtccta gaactggaaa aagaatttca ttttaacagg tatctgacaa 2701 ggcgccgtcg gattgaaatc gctcacaccc tgtgtctgtc ggagcgccag atcaagatct 2761 ggttccagaa ccggaggatg aagtggaaaa aagatcataa gctgcccaac actaaaggca 2821 ggtcatcgtc ctcatcttcc tcctcatctt gctcctcctc agtcgccccc agccagcatt 2881 tacagccgat ggccaaagac caccacacgg acctgacgac cttatagaag tggggaccct 2941 gggcccatct ctccctgcgc accaggctga gccgaagctg cggggcaggc cgggcctgct 3001 gtcacctcgc tgggctctaa ggtactgtgg ggtggacctg ggacaagcag gccgccctcg 3061 gactaggtta gcatcctgcc cgagggcagc cccctcccta gagcgggatg gggatgggag 3121 ggggggcggg attctctctc taagtatatt atatggcagg agctactgag aacataaaat 3181 cttggcgagt cattaaactt atgaaaatca ccgctcttgg attttgaatt tgcaaatgaa 3241 ggttggatgc tttatcccac tgtgaatttg gacattctcc cccactccac ccctccaggg 3301 tgctttgtgg cttaataatg tgggggagtt gaggcagaag gttggccacc cctgtgctag 3361 gtgctttcag tggaagccag agagctgggt caggatttct ggactttctg ggttgtctat 3421 ggaatttcat gtgattaaaa aatatatatt ttgctcccag tggccccacc tccaaagaaa 3481 tgggtctaag aaggaagtaa aaatgggtta ttttatgttt agatatttgc ttaaatattt 3541 atttgttggg aaatgtggta cagaaataac tgacaccttc atgccaaaaa tcttaaaaag 3601 gtgaaagggg ctgaacttca gggagcagaa tcagagatat gtgcacttac ttctgaactc 3661 cacccctccc cactctctgg aaatgtatat aggggggcct taacccttcc agaaggaagc 3721 aaaggattca ctcaaagttg cattcttgaa aatatatttc cacatgtgtt tttttcagca 3781 ctgtgcttac aaccagttct gggtgattaa aggaaaggga aaaaaaccaa caaatggtcc 3841 aacattttcc ttctggggaa agaaaacaaa acctctatgc actgggtcat tagataatga 3901 ctgaattttc tgttccaact ggattccaaa tgccctaaat accctcatat agcagtgttt 3961 tacaggaatt agtgtatggc ccgtgtaggg gaggggctgt gcagtgggga gaaagtggga 4021 aggtgaggaa ctcttgcttt aagaaggaaa aaaaaaaaac cctaattgaa tctagaagtc 4081 cacaaaagtt agccttagag tttttttccc cctgaagttt taattttttt aaaaaccaaa 4141 tctaaggaag ttttcctcag ctcattaatt agaagcagaa tttgtaaaag tataaaagtt 4201 ttcaagcact cgtctttgcc ttgagaatag tggtttttta aagaatcact ctcaacaggg 4261 gagatgtcct ctagtcgttt ttcttctgcc tctcctggga agggttcaaa gttcattttt 4321 ctaaaatgct gaccctcaag cataaggagg aagaagtcaa agttaatggc cagagttcat 4381 atactcagat gaaaccagtc ttcccaaggc ctcaggctcc aaaaaaggtt gtagctatca 4441 aaaagtgacc aaagtgggaa agggagaaag gataagctta aaatttaatt ttaagatcca 4501 gaaggggggt atttttttca gtacttcaaa aacactttag aaggtttctg ttgtaattta 4561 aaaaatatat ttaagtggga gggaaaaaag agtttctctg taggcttgtt ctttggctgt 4621 gtctcctgag agctgagggc aggtattcac tgcagtccct aggctgaaat tccgcttctc 4681 tgaagtgtct tccaagcctt ggtcttttgt attagaccct ggggactgct ctttgtttct 4741 ccttggggtg agcctggctc tcagacttgc acatggcaat acttgaatgt caccacgtcg 4801 ggatattaaa gatggatatt cgtgcattat tcacatcatt gtttctatga caaaaagcac 4861 agagttcata catagtcaag acgtcttttt ctgacgccct cacgttgaga agctgaaaag 4921 gtattttacc gaagttcggg taaattacag aatcaggttc atccagagga caaattttct 4981 atttgattag ctgtatttca gccgggagga ctgacctcta aacccctaac cttttggact 5041 ctaactaccc ttctcttctt ttttcctctc taacatggag agcagtcttt ggatgtacca 5101 tttgaaagga gccgctatcc ttaggcaagt tggaaagttg ctaagctgct ttcctaaaac 5161 ccaaatctgt ctatacattg aaccttctct ttgagaaggg gaaaaaggta tatattttca 5221 caacatccaa ttacatatat ataatagaga tttgttgtat agattttccc ccacctcaga 5281 agttcaggtt actcaccccc agtttcatac caaatgccac acaggcttaa ctgactgcat 5341 ccctgcccca gaggaaagcc aagaaacatg ttttcatgag gaaaacccaa gctccttctc 5401 aaacatagcc ccactacttt ggaaagtaac ttaatcagag aaacaacttc tttgtttata 5461 agtctcagct ctccttctca gcttggaggg attcttttga aatgttaatg gagcctggat 5521 ggcccagagt gcagccccca accctgaggt cccagtcgga ccccagcatc catttgggcc 5581 cacaggagtg ggccagggaa ggggtagggc cccgtaacca cttagggcag ggaaggaaat 5641 gggtttccat ctgagaacgt gctttggaga aagctaggtg tggaaaagct ccaatgccca 5701 tttgctatta tttgtttcca gtttgttcct ttaaatatga gccagaagtg tttgtgttgg 5761 tgttttaaaa acaaaaacaa aaaccgtgtt ggggtcctga ctgggggagg gggagagtga 5821 agtgtttgct gaggacattg ctcctctgac tcccatctca ctttgtccat cgcagccttt 5881 tgttgggaga tgacactgtc agtcagccca tgatgtctgt tcacacgaga tgctttttta 5941 atagaattga ccaatgtttt gctgccactg attaaagtat tatttatact aattgttgct 6001 tgtagttttg atgtaattca ttgatctata tttaaaataa taaaaggtgt agcaaaatct 6061 ccctcctgtt tggtgcctta acagaagcat tcatcctttg ttaagtcttc taaaagctaa 6121 catttaacat aaacaagtta ttattttctg caataaatta ggcaccattt tttggggggt 6181 gcctaaagtg tgaaggttaa accatgtaag gcttagcaat tctattatta ccacctcctt 6241 aatgtacaca cactcccagt tggccaccat attttgtgag cattgggaag cctggggttg 6301 aattc // LOCUS HUMHISAC 1978 bp DNA PRI 07-MAR-1995 DEFINITION Human histone H1 (H1F4) gene, complete cds. ACCESSION M60748 NID g184073 KEYWORDS histone H1. SOURCE Human blood DNA, clone C3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1978) AUTHORS Albig,W., Kardalinou,E., Drabent,B., Zimmer,A. and Doenecke,D. TITLE Isolation and characterization of two human H1 histone genes within clusters of core histone genes JOURNAL Genomics 10 (4), 940-948 (1991) MEDLINE 92009931 FEATURES Location/Qualifiers source 1..1978 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="C3" /tissue_type="blood" /map="12q11-q21" gene 730..1389 /gene="H1F4" CDS 730..1389 /gene="H1F4" /note="putative" /codon_start=1 /db_xref="GDB:G00-120-030" /product="histone H1" /db_xref="PID:g184074" /translation="MSETAPAAPAAPAPAEKTPVKKKARKSAGAAKRKASGPPVSELI TKAVAASKERSGVSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGAS GSFKLNKKAASGEAKPKAKKAGAAKAKKPAGAAKKPKKATGAATPKKSAKKTPKKAKK PAAAAGAKKAKSPKKAKAAKPKKAPKSPAKAKAVKPKAAKPKTAKPKAAKPKKAAAKK K" BASE COUNT 532 a 494 c 544 g 408 t ORIGIN 1 aagggaaaga attatccaag aattgtttaa aaactcagat gtagcggaca gatgtaaaac 61 catggctgta tagattgatg tcccaggggt ccaaaactta atctcaaatg ggcaataatt 121 tgtttggcat taaactaaac cagtttgatg aactcaaatg ccctcggctc aataggcagg 181 actctccgag gagcctgtgt tacttccctc acttaagtgc agatttgtaa taaaaatctt 241 aatgccagtg gcatgctttt tggatatata agaagctaac cacttggagt atcatatttg 301 agaggtcaga aaagtccaca gttaaagatc ggtttataat ttacgaagaa atagaaagtt 361 ttgtttcctc ctgagttgaa atttgccaag cacggaggaa atattgcaag tttttggcac 421 aaggctttct gcttcccctt ataatttgag atctgcgtga agcctgaggg ttcggggatc 481 attatctgag aaaaaccggg cagttcggtg tagacaattt ttatattttt ggcttttttt 541 gaggtgtaac aaacacaact cgggatccga gaggacactc tgcggctgcc agcgaggcgg 601 gctggacagc gcaccaatca cggcgcacgt ccgccctata taaacgggcg ggcgcagcgc 661 cgcggctcga gtcccggcca gtgcctctgc ttccggctcg aattgctctc gctcacgctt 721 gccttcaaca tgtccgagac tgcgcctgcc gcgcccgctg ctccggcccc tgccgagaag 781 actcccgtga agaagaaggc ccgcaagtct gcaggtgcgg ccaagcgcaa agcgtctggg 841 cccccggtgt ccgagctcat tactaaagct gttgccgcct ccaaggagcg cagcggcgta 901 tctttggccg ctctcaagaa agcgctggca gccgctggct atgacgtgga gaaaaacaac 961 agccgcatca agctgggtct caagagcctg gtgagcaagg gcaccctggt gcagaccaag 1021 ggcaccggcg cgtcgggttc cttcaaactc aacaagaagg cggcctctgg ggaagccaag 1081 cctaaggcta aaaaggcagg cgcggccaag gccaagaagc cagcaggagc ggcgaagaag 1141 cccaagaagg cgacgggggc ggccaccccc aagaagagcg ccaagaagac cccaaagaag 1201 gcgaagaagc cggctgcagc tgctggagcc aaaaaagcga aaagcccgaa aaaggcgaaa 1261 gcagccaagc caaaaaaggc gcccaagagc ccagcgaagg ccaaagcagt taaacccaag 1321 gcggctaaac caaagaccgc caagcccaag gcagccaagc caaagaaggc ggcagccaag 1381 aaaaagtaga aagttccttt ggccaactgc ttagaagccc aacacaaccc aaaggctctt 1441 ttcagagcca cccaccgctc tcagtaaaag agctgttgca ctattagggg gcgtggctcg 1501 ggaaaacgct gctaagcagg ggcgggtctc ccgggaacaa agtcggggag aggagtggga 1561 ttttgtgtgt ctccggagct atttttgact atggcgtcgc gtcgcccaag ccggagtgca 1621 gtggcgtcat ctcgattttg cgttctcgag tgtcggagtt gaacccattt gggcctccct 1681 tgtgctttgc cttttagcag gccctggctc cagatagcat gggaaaaaaa atgttgggat 1741 tttccccggg tttctaagct gggtttttcc gagttccaaa cacggcacag tgtatcagtt 1801 tctgtgctgg ttacaagcct actggttatc cctatcgagt atggcaggca gtgagggact 1861 tcagaggagt acgtcttagg acaagtggca tagtactgac attatttccg aagggctaca 1921 tttcaagtgc ttggggagac tactgccaca taactgaaat tagaaaccaa cactgcag // LOCUS HUMSPERSYN 7623 bp DNA PRI 13-JAN-1995 DEFINITION Human spermidine synthase gene, complete cds. ACCESSION M64231 NID g338393 KEYWORDS spermidine synthase. SOURCE Homo sapiens blood DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Myohanen,S., Kauppinen,L., Wahlfors,J., Alhonen,L. and Janne,J. TITLE Human spermidine synthase gene: structure and chromosomal localization JOURNAL DNA Cell Biol. 10 (6), 467-474 (1991) MEDLINE 91299162 FEATURES Location/Qualifiers source 1..7623 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Sultan 20D" /cell_type="IgG myeloma" /tissue_type="blood" /map="1p36-p22" CAAT_signal 1212..1216 /gene="SRM" /note="box 1; G00-127-983" CAAT_signal 1242..1246 /gene="SRM" /note="box 2; G00-127-983" exon 1315..1563 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=1 /product="spermidine synthase" mRNA join(1315..1563,1995..2115,2448..2540,4584..4737, 5247..5330,5415..5560,6255..6377,6454..7132) /gene="SRM" /note="G00-127-983" /product="spermidine synthase" gene join(1315..1563,1995..2115,2448..2540,4584..4737, 5247..5330,5415..5560,6255..6377,6454..7132) /gene="SRM" CDS join(1397..1563,1995..2115,2448..2540,4584..4737, 5247..5330,5415..5560,6255..6377,6454..6474) /gene="SRM" /EC_number="2.5.1.16" /codon_start=1 /db_xref="GDB:G00-127-983" /product="spermidine synthase" /db_xref="PID:g338394" /translation="MEPGPDGPAASGPAAIREGWFRETCSLWPGQALSLQVEQLLHHR RSRYQDILVFRSKTYGNVLVLDGVIQCTERDEFSYQEMIANLPLCSHPNPRKVLIIGG GDGGVLREVVKHPSVESVVQCEIDEDVIQVSKKFLPGMAIGYSSSKLTLHVGDGFEFM KQNQDAFDVIITDSSDPMGPAESLFKESYYQLMKTALKEDGVLCCQGECQWLHLDLIK EMRQFCQSLFPVVAYAYCTIPTYPSGQIGFMLCSKNPSTNFQEPVQPLTQQQVAQMQL KYYNSDVHRAAFVLPEFARKALNDVS" exon 1995..2115 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=2 /product="spermidine synthase" exon 2448..2540 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=3 /product="spermidine synthase" exon 4584..4737 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=4 /product="spermidine synthase" exon 5247..5330 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=5 /product="spermidine synthase" exon 5415..5560 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=6 /product="spermidine synthase" exon 6255..6377 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=7 /product="spermidine synthase" exon 6454..7132 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=8 /product="spermidine synthase" polyA_signal 7126..7132 /gene="SRM" /note="putative" BASE COUNT 1609 a 2287 c 2259 g 1468 t ORIGIN 1 ctgcaggcgc gcactaccat gcccagctaa tttttgtatt tttagtacag acggggtttc 61 accatgttgg ccaggatcgt cttgatctct tgacctcgtg atccgcccgc ctccgcctcc 121 gcctcccaca gtgctgggat taccggggtg aaccatcacg cccaggcacc tagcaattct 181 ttagcggtct tggtttacct cccctttaga aggagcttaa aagcaagcaa ggcacattgt 241 tgcctaggct tgaggcttgc tctcacccat aaacaacgct gttactctgc tctggggacg 301 acacaggaaa cgttccccac ctccaggtgg aggctgcaaa acgtgtcaaa accatccctg 361 acataatgtc aagagtagct tatactagtt tcatcttcct tccttggcat tcgaactgcg 421 tgtggaacaa ttagcattta atatttggta attagcatgc ttaatgtgat tctgagaagt 481 tctttgacat tctcataaaa acagcacatt cccacccacc cttcaaagag caagacccag 541 tttgtcaaga aaaattgcgt gccagtcttt tctggtgctg aatatgtatg ttctgggcct 601 cttcctggac actgctggta aatttagaaa ctcgtttaga aaagcacttc tctcgtattc 661 aacagcctat aggctcatgg cgcagaatct aagggaaaat ggctaaatcc agcttgttaa 721 ttcgcgggct gtgatgattc tttccaagta aataaaaacc ctcggttcgc cccgacgagc 781 cacataatct gttcaaatcc aacaaggaac cagattttgg acgcaaagaa ggatacgttc 841 tactcgcccc gtgcaacaac gtaaaccact gtagccgccc gctcccgtgt ctccagccca 901 aaggctgact ctccagtccg cacgtcgcag cgctcttgcc ctccacacca agcccgagtc 961 ccgcagcccc tcgaggccct cggtgcctcc caaccccgag aaggaagcgg gggccggtgg 1021 tgcaccgccc cggctgcttg gggcggagga aggacccgga ccccttccgc cggcccagcc 1081 cgccccggaa cccgacccgg ccgcccggcc ccggccgggc cccacgtggc ccctggagcg 1141 ggccgcacta ccctgctgcc gccgacggac ggcgcgccac agccactctg cgccgctctg 1201 cgagccggtg gccaatgagc gccaggcgag gccgctttgc cattggcgag cgcgggctcc 1261 gcccccgccg gcaggccccg ccccgcgccc gggttaggtt gcggcgcggg cggcgggcgg 1321 agctggtccc gttgtgctgc ggcgccgcgc ggcctgcagt cccgggcccg cgccccgcgc 1381 cgcccgcccg cccgccatgg agcccggccc cgacggcccc gccgcctccg gccccgccgc 1441 catccgcgag ggctggttcc gcgagacctg cagcctgtgg cccggccagg ccctgtcgct 1501 gcaggtggag cagctgctcc accaccggcg ctcgcgctac caggacatcc tcgtcttccg 1561 caggtaccgc cgctgcccgc aggcgcctgc cccctaggct cagcccgggc cgcctgctgc 1621 ccgcctcacg ggcctctcca cgccgggacc caagcgggct ggacctcgtc ctgccctggc 1681 cccctcgcca cccctcacac cgcctccctg ggctggggct gggactgcgg gctggcctct 1741 tgggtggggg agtcggagtc tgcgccccgc tccacgtgtc agccctcagg acacgtcaga 1801 gcccgaggag accccgggtc ccaccccggc ctccacccgg cggcccgcct gccgttcctc 1861 gccacgtgtc accatcgctc ctcatccctg ggacccctag gcgggatggg gagaccctcc 1921 tcacccaggg cggcttgggg tacgttttcc ccaccccaga gaacccaggt ccccgactgt 1981 cactccgccc gcagtaagac ctatggcaac gtgctggtgt tggacggtgt catccagtgc 2041 acggagagag acgagttctc ctaccaggag atgatcgcca acctgcctct ctgcagccac 2101 cccaacccgc gaaaggtacc ccagtgtccc ctggaacagt gccggacgag gggcggcccc 2161 aggtgtgctc cgggctcttc ccagatgctg cctgcatggt tgtcagagaa agtgctagca 2221 aggccagggg cgtcccgcgg aggggtgggg gccgacactg acgcggcctc ggaatcctag 2281 ggcagccctg gaaggaactt ccaggaaagg ggacaccggc acgaaagcgt ttccgagggt 2341 agaaaaagat gaggcccgtg ggtccgaggg gtcagggggt ctgcttcagg ggcctggggg 2401 ctcccagtcc tgccagggcc cctgccttga ctgccccctc ctcccaggtg ctgatcatcg 2461 ggggcggaga tggaggtgtc ctgcgggagg tggtgaagca cccctccgtg gagtccgtgg 2521 tccagtgtga gatcgacgag gtgagtgccg gcgtagagcc aggtttgagt cctggttctc 2581 ccagcggcca gctgtgccct gaaatggctg cacacccccg agcaaggcag gtagggcctg 2641 tttctccatc tggaaaacac ctggtcgggg agggttcagt aggaaaacca gatggcagag 2701 ggcctggcag gtggtgaggg cacctgcgtg gcgagctctt actaaaactg agctgatttt 2761 tttttttttt tttttttgag acagagtttc gctcttgttg cccaggctgg agtgtgatgg 2821 tgcgatctcg gctcactgca acctccacct cctgggctca agtagttctt ctgcctcagc 2881 ctccggagta gctgggatta cagacatgcg ctaccatgcc cggctaattt tgtattttta 2941 gtagagacag ggtttctcca tgttggtcag gctggtctcg gacctggcga ccacaggtga 3001 tccgccagcc tcgtcctccc aaagtactgg gattacaggc gtgagccacc acgcccagcc 3061 gactaagctg atttttaatc tgagccccag gcagggcccc aagacagctc aactatttgt 3121 acgttacccc ttacactcag tagctgctca ctaaaatcat gctacgtgcc aggtgttgcc 3181 cgggtatggg gacagtggta gacgacagat cagtccctgc cctctaggag ctgatgtcgt 3241 agttaaagga gacatcagat ggccagacgt ggtggctcac acctgtaatc ccagcacttt 3301 gggacgccaa ggcgggcaga tcacctgagg tcaggagttc aagagcagcc tggccaacct 3361 ggtgaaaccc catttctact aaaaatacaa aaattagccg ggcatggtag tgcatgtctg 3421 taaacccagc tactagggag gctgaggtgg aagaattact tgagcccggg aggcggaagt 3481 tgcaatgaac cgagatctcg ccactgcact ccagcctggg tgacagagga agaatctgtc 3541 tcaaaaaaaa caaacaacaa aaatagagac atcaaaggat ggtctgatga aggcaagaca 3601 ggggctgggg gacaggagaa ggcagggttc ctgtgaatgc atggggggtg gtcagggcag 3661 gcctccagga ggtggcgttt gagctgagac ctcagtgaaa agcaggtggc cgtgtgcagg 3721 gagggggagg ttctcctggc cagaggttgg aattgcatcc ttctaaaata ggaaacaggc 3781 caagcgctgg tggctcacac ctgtaatctc agcactctgg gaggctgagg cgggcagatc 3841 acaaggtctc tactaaaaat acaaaaaaaa aaaaaaatgg cccagcttgg tggcgtgtgc 3901 ctgtaatccc aactactcgg gaggctgagg caggagggtg cagtgagctg agatcgtgcc 3961 actgcactcc agcctgggca gcagagcaag actgtctcga aaaataaata aataaaatag 4021 gaagcgacaa gaaagccact cagatggggc gatttggtct ggaaggaggg ggtagggatg 4081 ggagagagga gcccaagccg ccggcaggag ccagctctca aggagaaatg gaggtaccag 4141 agttctcccc gctttacaca ttataaactg aggttcccaa aaggggccag gtgtggtggc 4201 tcacagctgt aatcctagca cttttggagg ccgaggtggg aggatcgagg agttcgaggc 4261 aacataggga gactctatct ctaccaaaaa tttaaaaagt agccaagtat gattgaacac 4321 acttgtccca gctactcagg aggctgatgg gggaggatca cctgagcccc ggaggccgag 4381 ggtgtagtga gccatgatcg atgccactgc actccagcct gggccacaaa gtgagaccct 4441 gtctcaaaaa aaaataataa aaaaaaggga aggggttggc caaggcggct tgcctgtgag 4501 gcacttggag agtcccacgt ggctgtgctg gctccaggtc ccccagcccc ttggcccaga 4561 ctggtccctc ccatcccctc caggatgtca tccaagtctc caagaagttc ctgccaggca 4621 tggccattgg ctactctagc tcgaagctga ccctacatgt gggtgacggt tttgagttca 4681 tgaaacagaa tcaggatgcc ttcgacgtga tcatcactga ctcctcagac cccatgggta 4741 agcagtggat gggccccagg gttttctggc agctgcaggt ctggaggtca gcctccccca 4801 ggccttcaga gtaaaggata gagcggcctc ccaccccccg aactagagct gtacttttcc 4861 cttctcattt gttacctgcc ctctgaaaca tggctcagga cagtaggcag gagccaggcg 4921 actgcccaga ttcacaagct ggtgaccaag gagagtgggg atctggcatt gggacactga 4981 ggaccctgtg tcctcttcag cctcccctct gctctgaagt ggtcagcact ggagtggggg 5041 caggttctag tcttgaacga aggcctaggt tagaggttcc tctgctgtgg tgccaatgag 5101 actcccccaa gaatgggatt caggtgtgga tcccccacag acctgggttc agatcctggc 5161 tctggccacc tggtagctgt gtgggtgaca gtggccctgg aggtcacagc cagactgttc 5221 agtgtgtctc cctctgtctt ccccaggccc cgccgaaagt ctcttcaagg agtcctatta 5281 ccagctcatg aagacagccc tcaaggaaga tggtgtcctc tgctgccagg gtgagccaca 5341 ggcctggagc actggggcgg ggcggggtgg ggcagggcag gccctgccgg atgctgatgc 5401 ttaggggccc ccaggcgagt gccagtggct gcacctggac ctcatcaagg agatgcggca 5461 gttctgccag tccctgttcc ccgtggtggc ctatgcctac tgcaccatcc ccacctaccc 5521 cagcggccag atcggcttca tgctgtgcag caagaacccg gtgagatggg ggtgtctggg 5581 ggtgggggtt ggggggaagg tgggcataaa tagagatccc tgcccctgcc gggcgcggtg 5641 gctcacacct gtaacccagc actttgggag gctgaggcgg gcagatcaca aggtcaggag 5701 atcgagacca tcttggctaa cacggtgaaa ccccgtctct actaaaaata caaaaaaatt 5761 agccaggcat ggcagcgcgc gcctgtagtc ccagctgctg gggaggctga ggcaggagaa 5821 tggcgtgaac ccgggaggcg gagcttgcag tgagccgaga ttgcgccact acattccagc 5881 ctgggtaaca gaggaagatt ccgactcaaa aaaaaaaaaa aaaccctccc caggccaggt 5941 gcggtgtctc atgcctgtaa ttccagcatt ttgggagacc aaggtgggcg gatcacttga 6001 ggtcaggagt acaagaccag cctgaccaac atggagaaac cccatcacta ctcaaaatac 6061 aaaaaaaaaa aaaattagcc gggcgtggtg gcgcgtacct gaggctgagg caggagaatc 6121 acttgaacct gggagacaga ggttgcagtg agctgagatg acgccactgc actccagcgt 6181 ggcaacagtg agactccgtc tcaaaaaaaa aaaaaaaagt gccccccctg atgtgcccct 6241 ggcccggtcc ccagagcacg aacttccagg agccggtgca gccgctgaca cagcagcagg 6301 tggcgcagat gcagctgaag tactacaact ccgacgtgca ccgcgccgcc tttgtgctgc 6361 ccgagtttgc ccgcaaggtg ggtggcctgc ggggctgggt ggtgggaccc agggacccag 6421 agcgccctcc tgactggcct catgtccctc caggcactga atgatgtgag ctgagcccag 6481 gcgccaccac tgatgccacc caggacctcg gaccttggag cctgcggggt gcctcggccc 6541 ctccagcccc gggccggacc tcctgctggc tctcgcccac caaccaagtg ttacaagccc 6601 cagaatgctg cccggcctgc cctgctgggc ggactgtctg tgtgtctgtc tctctggcgt 6661 tccacctcca agcctatacc agctgtgtac agcgccatct ctctgccttc tgttgcccct 6721 cactcaccaa acacgtgtat ttatagcaaa gattggagtc ctgtgtctcc tgaccttggc 6781 tgggcccagg cagggccaca ttcaccattg ggtgcctctg gggtgagggt ctgcagaggc 6841 cttgctggct gacccccaag tgtctgctgc agggctgagg ctgcaggcgg gccatcgtgg 6901 atagcctggg gcacagaggg tcaccgcagt cgtcacgtgg gacccagagc tgtcctggga 6961 agctgactta gctgtccttt taccaagccc ttcacaaggc cactggtgac agccccccag 7021 ggcagtgggg tgggtgagat cagggtgggg ctgcccggga gcattctcag aaaaattggg 7081 gacactcaca ggtgtaagtc aggtcccatc caggtactcc agggcaaata caggaagggg 7141 tggcggggct ggttaccttc ggccttttta agcacatcag gagcttaaca ctggcccagt 7201 gactgtgccc tgactccacc cggcattcag acttgggttc aaattcccac catgccccgc 7261 cccctatgtg gacaaattga gaaagcaagt gtgggcaccc caccagggac tgcgaggacc 7321 agggctgtcc cctctccaag gtgctgaact cccgccttcc aggacccaac ggtggtggga 7381 ggacaggaaa ggaaccctct ttgcatgggc ctgagttgcc aacccctttc cccaccctgg 7441 gcaggggctg ggctagcgga cgcatcaggg agggaggccc cactcccagc cgaggcagcc 7501 accttggagc cctaactcac ccgggtatgt tttctgggac accagtgtaa gggggattca 7561 gtttcgccat caactctggc ttcaggccag tcatagccct ccagtctcca cctgcccccc 7621 act // LOCUS HSHIS10G 2530 bp DNA PRI 12-SEP-1993 DEFINITION Human gene for histone H1(0). ACCESSION X03473 NID g32106 KEYWORDS histone; histone H1(0). SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1810) AUTHORS Doenecke,D. and Tonjes,R. TITLE Differential distribution of lysine and arginine residues in the closely related histones H1 and H5. Analysis of a human H1 gene JOURNAL J. Mol. Biol. 187 (3), 461-464 (1986) MEDLINE 86200226 REFERENCE 2 (bases 1811 to 2530) AUTHORS Toenjes,R. TITLE Direct Submission JOURNAL Submitted (14-OCT-1986) to the EMBL/GenBank/DDBJ databases COMMENT Data kindly reviewed (14-OCT-1986) by Toenjes R. FEATURES Location/Qualifiers source 1..2530 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 371..955 /note="histone H1(0) (aa 1-194)" /codon_start=1 /db_xref="PID:g32107" /db_xref="SWISS-PROT:P07305" /translation="MTENSTSAPAAKPKRAKASKKSTDHPKYSDMIVAAIQAEKNRAG SSRQSIQKYIKSHYKVGENADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKSDEPK KSVAFKKTKKEIKKVATPKKASKPKKAASKAPTKKPKATPVKKAKKKLAATPKKAKKP KTVKAKPVKASKPKKAKPVKPKAKSSAKRAGKKK" BASE COUNT 607 a 626 c 728 g 569 t ORIGIN 1 cggcggccct gtcctcaccg cggtccgccc gccgccgcta aatacccgga tgcgccgccc 61 aagcgccgga cgcggagctg ggaaaaggga ggcagaggag gcggaggcag aggcagaggc 121 agagcccggt gccgagacca agcgacagac cggcggggct gggcctcgca aagccggctc 181 ggcgagctct cccgacaccc gagccgggga ggaaaagcag cgactcctcg ctcgcatccc 241 cgggagccgc actccagact ggcccggtag tcaggggctc aggagcagat cccgaggcag 301 gctttgctca gcctccgacg agggctggcc cttggaaggc gccttcaaca gccggaccag 361 acaggccacc atgaccgaga attccacgtc cgcccctgcg gccaagccca agcgggccaa 421 ggcctccaag aagtccacag accaccccaa gtattcagac atgatcgtgg ctgccatcca 481 ggccgagaag aaccgcgctg gctcctcgcg ccagtccatt cagaagtata tcaagagcca 541 ctacaaggtg ggtgagaacg ctgactcgca gatcaagttg tccatcaagc gcctggtcac 601 caccggtgtc ctcaagcaga ccaaaggggt gggggcctcg gggtccttcc ggctagccaa 661 gagcgacgaa cccaagaagt cagtggcctt caagaagacc aagaaggaaa tcaagaaggt 721 agccacgcca aagaaggcat ccaagcccaa gaaggctgcc tccaaagccc caaccaagaa 781 acccaaagcc accccggtca agaaggccaa gaagaagctg gctgccacgc ccaagaaagc 841 caaaaaaccc aagactgtca aagccaagcc ggtcaaggca tccaagccca aaaaggccaa 901 accagtgaaa cccaaagcaa agtccagtgc caagagggcc ggcaagaaga agtgacaatg 961 aagtcttttc ttgcggacac tccctcctgt ctcctatttt ctgtaaataa ttttctcctt 1021 ttttctctct tgatgctcac caccaccttt tgcccccttc tgttctgact ttataagaga 1081 caggatttgg attcttcaga aattacagaa taattcattt ttccttaacc agttgtgcaa 1141 ggacagcaac aaccaatcta atgatgagaa tgtacttata ttttgttttg ctattaacct 1201 acttacgggg ttagggactt gcgggggggg cttgtgtgtt ttgttggctt gtttgccatg 1261 aaggtagatg tgggtgggga gaagacacaa ggcagtttgt tctggctaga tgagagggaa 1321 cccaggaatt gtgaggttag caggaatatc tttagggtga gtgagttttc cttgagttgg 1381 gcacccgttg tgagagtttc agaacctttg gccagcagga gagaggtggt agggagcagc 1441 cagccggcaa aggaaggagg tggaaaaaaa ccgccaccgg gctgacttcc acctcccagt 1501 ggtgagcagt gggggcccaa acccagtttc cttctcattt ttgttagttt gccctttcgg 1561 cctccctatt ttcttaggga aggggagtgg ggtccaagtg acagctggat gggagaagcc 1621 atagtttctc ccagtcagct agggtgtagc cattggggga tctttgtggc ttcagcaaat 1681 tctcttgtta aaccggagtg aaaacttcag gggaagggtg gggagtcagc caagtgcctc 1741 agtgtgccct gttgaaactt aggtttttcc acgcaatcga tggattgtgt cctaggaaga 1801 cttttctttt cttttctttt cctctggatt tttgttcctc ctgtacaaga ggtgtctttg 1861 cttggtttgg tggggctgcg gccacttaaa acctcccgat ctctttttga gtcctttatt 1921 ataagtagtt gtagctgcgg gagggggagg gggagtgggc gggcagtgga tagtaagact 1981 tactgcagtc gatttgggat ttgctaagta gttttacaga gctagatctg tgtgcatgtg 2041 tgtgtttgtg tatatataca tatctagggc tagtacttag tttcacaccc gggagctggg 2101 agaaaaaacc tgtacagttg tctttctctt atttttaata aaatagaaaa atcgcgcact 2161 tgcgcgtccc ccccccccca cccccttttt taaacaagtg ttacttgtgc cgggaaaatt 2221 ttgctgtctt tgtaatttta aaactttaaa ataaattgga aaagggagaa actgagcggt 2281 gtatttttcc tcactttgaa gactggagaa tgaatgcgga gcggttaggc gggcaggcag 2341 tggggacatc tggggcgttt gcacctgaga atggcggggg agggttcagg agctggacag 2401 aagccgagaa tccttagtcc cggaggctgc ggaacgtgcc ttagccggct tctaggcttc 2461 tgccttccgg ggtccagagg gtaccccatt tccggccgag aggtgagtac gtgggaatgc 2521 ggtggggagt // LOCUS HSCFOS 3565 bp DNA PRI 21-NOV-1994 DEFINITION Human cellular oncogene c-fos (complete sequence). ACCESSION V01512 NID g29903 KEYWORDS oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3565) AUTHORS van Straaten,F., Muller,R., Curran,T., Van Beveren,C. and Verma,I.M. TITLE Complete nucleotide sequence of a human c-onc gene: deduced amino acid sequence of the human c-fos protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80 (11), 3183-3187 (1983) MEDLINE 83221560 COMMENT Data kindly reviewed (10-OCT-1983) by F. van Straaten. FEATURES Location/Qualifiers source 1..3565 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA join(132..429,1183..1434,1866..1973,2088..3239) /gene="c-fos" precursor_RNA 132..>3515 /gene="c-fos" /note="possible transcript" precursor_RNA 132..>3259 /gene="c-fos" /note="possible transcript" mRNA join(132..429,1183..1434,1866..1973,2088..3515) /gene="c-fos" gene 132..3515 /gene="c-fos" exon 136..429 /gene="c-fos" /note="(alternate start site)" /number=1 precursor_RNA 136..>3259 /gene="c-fos" /note="possible transcript" mRNA join(136..429,1183..1434,1866..1973,2088..3239) /gene="c-fos" precursor_RNA 136..>3515 /gene="c-fos" /note="possible transcript" mRNA join(136..429,1183..1434,1866..1973,2088..3515) /gene="c-fos" CDS join(289..429,1183..1434,1866..1973,2088..2729) /gene="c-fos" /codon_start=1 /db_xref="PID:g29904" /db_xref="SWISS-PROT:P01100" /translation="MMFSGFNADYEASSSRCSSASPAGDSLSYYHSPADSFSSMGSPV NAQDFCTDLAVSSANFIPTVTAISTSPDLQWLVQPALVSSVAPSQTRAPHPFGVPAPS AGAYSRAGVVKTMTGGRAQSIGRRGKVEQLSPEEEEKRRIRRERNKMAAAKCRNRRRE LTDTLQAETDQLEDEKSALQTEIANLLKEKEKLEFILAAHRPACKIPDDLGFPEEMSV ASLDLTGGLPEVATPESEEAFTLPLLNDPEPKPSVEPVKSISSMELKTEPFDDFLFPA SSRPSGSETARSVPDMDLSGSFYAADWEPLHSGSLGMGPMATELEPLCTPVVTCTPSC TAYTSSFVFTYPEADSFPSCAAAHRKGSSSNEPSSDSLSSPTLLAL" intron 430..1182 /gene="c-fos" /note="intron I" exon 1183..1434 /gene="c-fos" /number=2 intron 1435..1865 /gene="c-fos" /note="intron II" exon 1866..1973 /gene="c-fos" /number=3 intron 1974..2087 /gene="c-fos" /note="intron III" exon 2088..3239 /gene="c-fos" /note="(alternate stop site)" /number=4 exon 2088..3515 /gene="c-fos" /note="(alternate stop site)" /number=4 BASE COUNT 780 a 954 c 978 g 853 t ORIGIN 1 gcagccgggc ggccgcagaa gcgcccaggc ccgcgcgcca cccctctggc gccaccgtgg 61 ttgagcccgt gacgtttaca ctcattcata aaacgcttgt tataaaagca gtggctgcgg 121 cgcctcgtac tccaaccgca tctgcagcga gcaactgaga agccaagact gagccggcgg 181 ccgcggcgca gcgaacgagc agtgaccgtg ctcctaccca gctctgcttc acagcgccca 241 cctgtctccg cccctcggcc cctcgcccgg ctttgcctaa ccgccacgat gatgttctcg 301 ggcttcaacg cagactacga ggcgtcatcc tcccgctgca gcagcgcgtc cccggccggg 361 gatagcctct cttactacca ctcacccgca gactccttct ccagcatggg ctcgcctgtc 421 aacgcgcagg taaggctggc ttcccgtcgc cgcggggccg ggggcttggg gtcgcggagg 481 aggagacacc gggcgggacg ctccagtaga tgagtagggg gctcccttgt gcctggaggg 541 aggctgccgt ggccggagcg gtgccggctc gggggctcgg gacttgctct gagcgcacgc 601 acgcttgcca tagtaagaat tggttccccc ttcgggaggc aggttcgttc tgagcaacct 661 ctggtctgca ctccaggacg gatctctgac attagctgga gcagacgtgt cccaagcaca 721 aactcgctaa ctagagcctg gcttcttcgg ggaggtggca gaaagcggca atcccccctc 781 ccccggcagc ctggagcacg gaggagggat gagggaggag ggtgcagcgg gcgggtgtgt 841 aaggcagttt cattgataaa aagcgagttc attctggaga ctccggagcg gcgcctgcgt 901 cagcgcagac gtcagggata tttataacaa accccctttc aagcaagtga tgctgaaggg 961 ataacgggaa cgcagcggca ggatggaaga gacaggcact gcgctgcgga atgcctggga 1021 ggaaaagggg gagacctttc atccaggatg agggacattt aagatgaaat gtccgtggca 1081 ggatcgtttc tcttcactgc tgcatgcggc actgggaact cgccccacct gtgtccggaa 1141 cctgctcgct cacgtcggct ttccccttct gttttgttct aggacttctg cacggacctg 1201 gccgtctcca gtgccaactt cattcccacg gtcactgcca tctcgaccag tccggacctg 1261 cagtggctgg tgcagcccgc cctcgtctcc tctgtggccc catcgcagac cagagcccct 1321 caccctttcg gagtccccgc cccctccgct ggggcttact ccagggctgg cgttgtgaag 1381 accatgacag gaggccgagc gcagagcatt ggcaggaggg gcaaggtgga acaggtgagg 1441 aactctagcg tactcttcct gggaatgtgg gggctgggtg ggaagcagcc ccggagatgc 1501 aggagcccag tacagaggat gaagccactg atggggctgg ctgcacatcc gtaactggga 1561 gccctggctc caagcccatt ccatcccaac tcagactctg agtctcaccc taagaagtac 1621 tctcatagtt tcttccctaa gtttcttacc gcatgctttc agactgggct cttctttgtt 1681 ctcttgctga ggatcttatt ttaaatgcaa gtcacaccta ttctgcaact gcaggtcaga 1741 aatggtttca cagtggggtg ccaggaagca gggaagctgc aggagccagt tctactgggg 1801 tgggtgaatg gaggtgatgg cagacacttt tactgaatgt cggtcttttt ttgtgattat 1861 tctagttatc tccagaagaa gaagagaaaa ggagaatccg aagggaaagg aataagatgg 1921 ctgcagccaa atgccgcaac cggaggaggg agctgactga tacactccaa gcggtaggta 1981 ctctgtgggt tgctcctttt taaaacttaa gggaaagttg gagattgagc ataagggccc 2041 ttgagtaaga ctgtgtctta tgctttcctt tatccctctg tatacaggag acagaccaac 2101 tagaagatga gaagtctgct ttgcagaccg agattgccaa cctgctgaag gagaaggaaa 2161 aactagagtt catcctggca gctcaccgac ctgcctgcaa gatccctgat gacctgggct 2221 tcccagaaga gatgtctgtg gcttcccttg atctgactgg gggcctgcca gaggttgcca 2281 ccccggagtc tgaggaggcc ttcaccctgc ctctcctcaa tgaccctgag cccaagccct 2341 cagtggaacc tgtcaagagc atcagcagca tggagctgaa gaccgagccc tttgatgact 2401 tcctgttccc agcatcatcc aggcccagtg gctctgagac agcccgctcc gtgccagaca 2461 tggacctatc tgggtccttc tatgcagcag actgggagcc tctgcacagt ggctccctgg 2521 ggatggggcc catggccaca gagctggagc ccctgtgcac tccggtggtc acctgtactc 2581 ccagctgcac tgcttacacg tcttccttcg tcttcaccta ccccgaggct gactccttcc 2641 ccagctgtgc agctgcccac cgcaagggca gcagcagcaa tgagccttcc tctgactcgc 2701 tcagctcacc cacgctgctg gccctgtgag ggggcaggga aggggaggca gccggcaccc 2761 acaagtgcca ctgcccgagc tggtgcatta cagagaggag aaacacatct tccctagagg 2821 gttcctgtag acctagggag gaccttatct gtgcgtgaaa cacaccaggc tgtgggcctc 2881 aaggacttga aagcatccat gtgtggactc aagtccttac ctcttccgga gatgtagcaa 2941 aacgcatgga gtgtgtattg ttcccagtga cacttcagag agctggtagt tagtagcatg 3001 ttgagccagg cctgggtctg tgtctctttt ctctttctcc ttagtcttct catagcatta 3061 actaatctat tgggttcatt attggaatta acctggtgct ggatattttc aaattgtatc 3121 tagtgcagct gattttaaca ataactactg tgttcctggc aatagtgtgt tctgattaga 3181 aatgaccaat attatactaa gaaaagatac gactttattt tctggtagat agaaataaat 3241 agctatatcc atgtactgta gtttttcttc aacatcaatg ttcattgtaa tgttactgat 3301 catgcattgt tgaggtggtc tgaatgttct gacattaaca gttttccatg aaaacgtttt 3361 attgtgtttt taatttattt attaagatgg attctcagat atttatattt ttattttatt 3421 tttttctacc ttgaggtctt ttgacatgtg gaaagtgaat ttgaatgaaa aatttaagca 3481 ttgtttgctt attgttccaa gacattgtca ataaaagcat ttaagttgaa tgcgaccaac 3541 cttgtgctct tttcattctg gaagt // LOCUS HUMHGCR 2635 bp DNA PRI 17-SEP-1992 DEFINITION Human gene for serotonin 1B receptor, complete cds. ACCESSION D10995 NID g219678 KEYWORDS human 5HT1B-type receptor; serotonergic receptor; serotonin 1B receptor. SOURCE Homo sapiens placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2635) AUTHORS Mochizuki,D., Yuyama,Y., Tsujita,R., Komaki,H. and Sagai,H. TITLE Cloning and Expression of the human 5-HT1B-type receptor gene JOURNAL Unpublished (1992) REFERENCE 2 (bases 1 to 2635) AUTHORS Mochizuki,D. TITLE Direct Submission JOURNAL Submitted (21-APR-1992) to the DDBJ/EMBL/GenBank databases. Daisuke Mochizuki, Asahi Chemical Ind. Co.,Ltd., Institute for Life Science Research; 632-1 Mifuku, Ohito-cho, Tagata-gun, Shizuoka 410-23, Japan (Tel:0558-76-7079, Fax:0558-76-5755) COMMENT Submitted (21-Apr-1992) to DDBJ by: Daisuke Mochizuki Asahi Chemical Ind. Co.,Ltd. Institute for Life Science Research 632-1 Mifuku, Ohito-cho Tagata-gun Shizuoka 410-23 Japan Phone: 0558-76-7079 Fax: 0558-76-5755. FEATURES Location/Qualifiers source 1..2635 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" gene 82..1254 /gene="HGCR1" CDS 82..1254 /gene="HGCR1" /codon_start=1 /evidence=experimental /product="serotonin 1B receptor" /db_xref="PID:d1002238" /db_xref="PID:g219679" /translation="MEEPGAQCAPPPPAGSETWVPQANLSSAPSQNCSAKDYIYQDSI SLPWKVLLVMLLALITLATTLSNAFVIATVYRTRKLHTPANYLIASLAVTDLLVSILV MPISTMYTVTGRWTLGQVVCDFWLSSDITCCTASILHLCVIALDRYWAITDAVEYSAK RTPKRAAVMIALVWVFSISISLPPFFWRQAKAEEEVSECVVNTDHILYTVYSTVGAFY FPTLLLIALYGRIYVEARSRILKQTPNRTGKRLTRAQLITDSPGSTSSVTSINSRVPD VPSESGSPVYVNQVKVRVSDALLEKKKLMAARERKATKTLGIILGAFIVCWLPFFIIS LVMPICKDACWFHLAIFDFFTWLGYLNSLINPIIYTMSNEDFKQAFHKLIRFKCTS" BASE COUNT 633 a 674 c 612 g 716 t ORIGIN 1 cgatcgccac ggtccttccg ccctctcctt cgtccgctcc atgcccaaga gctgcgctcc 61 ggagctgggg cgaggagagc catggaggaa ccgggtgctc agtgcgctcc accgccgccc 121 gcgggctccg agacctgggt tcctcaagcc aacttatcct ctgctccctc ccaaaactgc 181 agcgccaagg actacattta ccaggactcc atctccctac cctggaaagt actgctggtt 241 atgctattgg cgctcatcac cttggccacc acgctctcca atgcctttgt gattgccaca 301 gtgtaccgga cccggaaact gcacaccccg gctaactacc tgatcgcctc tctggcggtc 361 accgacctgc ttgtgtccat cctggtgatg cccatcagca ccatgtacac tgtcaccggc 421 cgctggacac tgggccaggt ggtctgtgac ttctggctgt cgtcggacat cacttgttgc 481 actgcctcca tcctgcacct ctgtgtcatc gccctggacc gctactgggc catcacggac 541 gccgtggagt actcagctaa aaggactccc aagagggcgg cggtcatgat cgcgctggtg 601 tgggtcttct ccatctctat ctcgctgccg cccttcttct ggcgtcaggc taaggccgaa 661 gaggaggtgt cggaatgcgt ggtgaacacc gaccacatcc tctacacggt ctactccacg 721 gtgggtgctt tctacttccc caccctgctc ctcatcgccc tctatggccg catctacgta 781 gaagcccgct cccggatttt gaaacagacg cccaacagga ccggcaagcg cttgacccga 841 gcccagctga taaccgactc ccccgggtcc acgtcctcgg tcacctctat taactcgcgg 901 gttcccgacg tgcccagcga atccggatct cctgtgtatg tgaaccaagt caaagtgcga 961 gtctccgacg ccctgctgga aaagaagaaa ctcatggccg ctagggagcg caaagccacc 1021 aagaccctag ggatcatttt gggagccttt attgtgtgtt ggctaccctt cttcatcatc 1081 tccctagtga tgcctatctg caaagatgcc tgctggttcc acctagccat ctttgacttc 1141 ttcacatggc tgggctatct caactccctc atcaacccca taatctatac catgtccaat 1201 gaggacttta aacaagcatt ccataaactg atacgtttta agtgcacaag ttgacttgcc 1261 atttgcagtg gggtcgccta agcgaccttt ggggaccaag ttgtgtctgg ttccacaggt 1321 aggtcgaatc ttctttcgcg gtttctgggt cccagcgagg ctctctctcc tgggcaaggg 1381 caatggatcc tgagaagcca gaatagtcct gagagagagc tctgaaagga gaagtgttga 1441 aactaaatgt agagcttccc tgcccaggag gaggctcact tcctcccctc aagccccggg 1501 ctcagcactg accctgcggt agccaatccc aaagggggtt gcaactttta aaaattgata 1561 atggaaggga atccctgccc tgctttggta tcgtggataa tgcccactag aagcagtgta 1621 cttgtaattg ttgtctgaag cctgtctgag acagatctac atacagcctg gcagtacttg 1681 aactagacgc ttaatgccct gtgtttttgg ggagaacttt gtgttacagc ttaatttaag 1741 aacagttact ttggcatcat tcagtcttca ctttttgtct atttaaactt ggttggagaa 1801 acttgtggat ttggtgcttc aaaccctatg tgtggcttgg atggcgcaga gaaaccttga 1861 agagttaaca gcaaaattct gatgctgaga tctctatttt tattatactt gaaactatat 1921 gggggtgggt gggtgggaat gggagatgag gagtgttaaa ctgagaatca acacctatga 1981 ttgtttgttt tctgcagatt tacaattttg taattcctgt ttagcgattg tcaagccaca 2041 actctaacaa acaaaccatt atgtgtgcta gtgccaaagt ctgcagactg ctttattttt 2101 tctcttaatt tcatgtacct gtcactttac acatttaaat ccccataaat gaagggtatg 2161 atgggtgact cagcccacac tgctgctata tttcttacta atgcaattgg taaaaccgat 2221 tagtattgga aatatactgt ttcttaacaa gaaaagtgtc tttatttctt atccaattta 2281 gtgagatgtg aaggagactg atgacatggg gatagttctt acacaattga ggaatggggt 2341 gggggcaata ggaggatgta tattttgact tgtaaaaaaa tcttaaagtg catgaaactt 2401 ttatctgata gtcatttgca ctctccttcc catctgtgat tccttgtgtg ctaacatata 2461 aagaaaccaa gagaactatc ttccttctcc agaaacctta aaaatacagt taagggccct 2521 aaaaacgata ttgaaaagaa aataaacttg tttctttttt gttgttgttg ttattgaagt 2581 ttgggcagga gaaaagattg ctagaaaatg acatataaga actttagaaa agctt // LOCUS HUMUDPCNA 4705 bp DNA PRI 19-SEP-1995 DEFINITION Human alpha-1,3-mannosyl-glycoprotein beta-1, 2-N-acetylglucosaminyltransferase (MGAT) gene, complete cds. ACCESSION M61829 NID g340075 KEYWORDS alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltrae. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4705) AUTHORS Hull,E., Sarkar,M., Spruijt,M.P., Hoppener,J.W., Dunn,R. and Schachter,H. TITLE Organization and localization to chromosome 5 of the human UDP-N-acetylglucosamine:alpha-3-D-mannoside beta-1,2-N-acetylglucosaminyltransferase I gene JOURNAL Biochem. Biophys. Res. Commun. 176 (2), 608-615 (1991) MEDLINE 91222222 COMMENT From EMBL entry HSUDPCNA; dated 23-JUL-1991. FEATURES Location/Qualifiers source 1..4705 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5" mRNA 2056..4592 /partial /gene="MGAT" /note="G00-128-225" /product="alpha-1,3-mannosyl-glycoprotein beta-1, 2-N-acetylglucosaminyltransferase" exon 2056..4592 /gene="MGAT" /EC_number="2.4.1.101" /note="G00-128-225" /number=2 /product="alpha-1,3-mannosyl-glycoprotein beta-1, 2-N-acetylglucosaminyltransferase" gene 2056..4592 /gene="MGAT" CDS 2182..3519 /gene="MGAT" /EC_number="2.4.1.101" /codon_start=1 /db_xref="GDB:G00-128-225" /product="alpha-1,3-mannosyl-glycoprotein beta-1, 2-N-acetylglucosaminyltransferase" /db_xref="PID:g340076" /translation="MLKKQSAGLVLWGAILFVAWNALLLLFFWTRPAPGRPPSVSALD GDPASLTREVIRLAQDAEVELERQRGLLQQIGDALSSQRGRVPTAAPPAQPRVPVTPA PAVIPILVIACDRSTVRRCLDKLLHYRPSAELFPIIVSQDCGHEETAQAIASYGSAVT HIRQPDLSSIAVPPDHRKFQGYYKIARHYRWALGQVFRQFRFPAAVVVEDDLEVAPDF FEYFRATYPLLKADPSLWCVSAWNDNGKEQMVDASRPELLYRTDFFPGLGWLLLAELW AELEPKWPKAFWDDWMRRPEQRQGRACIRPEISRTMTFGRKGVSHGQFFDQHLKFIKL NQQFVHFTQLDLSYLQREAYDRDFLARVYGAPQLQVEKVRTNDRKELGEVRVQYTGRD SFKAFAKALGVMDDLKSGVPRAGYRGIVTFQFRGRRVHLAPPPTWEGYDPSWN" BASE COUNT 958 a 1217 c 1234 g 1296 t ORIGIN 1 tgatttctgt aatatagatt agtttctatt ttcggcagtt ttatgtcaat ggaatcacag 61 tgtatgtgct cttttttgtt tttcccaaaa agacccatat tgtgtaatta tttggagatt 121 gtgttgtgta tgtcagtggt ttattcagtt ttattgctga atattattcc attctatgag 181 taaaccacaa tttgtttatc tgtatttccc tgttgatggg cctttgggtt atttccagtt 241 ttgagtgatt atgaataaag ttaccatcaa cactcctgtg taggtttttg tagggacatg 301 gctgaaagta gaaagaaatg gggcatgcag gagagactgg gcttggttat aacagcctgt 361 tatccctggg gaactagctc actcaccaga ggacgttagg aaggatctgc ccctgccccc 421 cttgtgatcc aaacacctac cactagaccc cacctccggc agcaccacca cacaggggat 481 caaatcccaa catgtgtttc agagaacaaa ccatattcaa gctatagcat agctacgtaa 541 ttattataac actattaata agaccatctt tgccttggga attttaagta gcacctttat 601 tacatattag atttttttgt gtaatgagtc tgtttctgga gcttaaattc tatttcactg 661 gatggtctgt ttatttatgt accagtacca cgcagttgta attatggagg ttttgtagta 721 tgccatatat atatatatat aataaaaaaa aatttttttg agacagagtc tcactctctt 781 gcccaggctg gagtgcaatg gcacaatcca atcacagctc actacagcct cagcctccca 841 gactcaggtg atcccatctc agcctcctga gtagctggga ccacaggtgc gtgccaccat 901 gcccagcttt tttttttttt ttttttaatt gttacttgta gagacagggt ctccgtgtgc 961 tgcccaggct attcttgaac tcctgggctc aagtgatcct cctgtattgg cctcccaaag 1021 tgctgggatt gcaggtagga gctactgcat ctggccagta tgcagtattt tcatatctgg 1081 tagggctgag tccctcctca cagcttattt ttattgtttt cctagttttt catgtgaact 1141 ttactgtcag ttcgtctagc tccataaaaa ggtattttta ttgggatttt tgatgatgct 1201 gaaatgtcct gaccaagaac aagggatatt cagatgttct tttgtgtctt tcaggaatga 1261 tagttttgtt cacataagtt ttgaatgttt aagtttattt aagtttattt ctaaatattt 1321 tctcatttct ctggcttttg taagtagggt tttctcatcc atgttttctt ctcatgagtt 1381 atttgtggat atgaaggcta tccattagta tatgttgatt tttatattac acttccttgc 1441 tcagttcatt attgattctt tttgagtttt ccaggcatat tctcacaagt aaagataata 1501 gaaatagttt gcttcctttc cacttctgct ttgaattttt ttttcttggt tcatttgcat 1561 tggctgcttc ctccagcaaa atgttaaata accctggaga tgatgggcaa cttcgttttg 1621 ctcctgacat tcgtggggtg cctctggtgc ttccctgttg gtaaggggtt aactgtagcc 1681 ctgaggtggg acatttgatt ttaaaaatca gtcatcttgg ggcgcttagg ttagaggaat 1741 ggtaggcaga tgctgtcact ccttgcccct cccctcctcc ttcccacctg gaggggaaat 1801 gaaatctgac aggtagaaag aggggagttg gggttctttt tctctctccc tccaccagca 1861 tcactctctg cctctccctc aaaaatacgt tcctgggtca ggatatatgt tgactcccta 1921 gagagctctg gagtcaacct cctggccttc ctccaccctc actcttggcc ttttcctgcc 1981 cccatttcct ctacctgtgg ggcatggagc cacgagcctt tgtgtgacgg tttgctttct 2041 ctctcctgtc tttaggtgca tggctgcctc ctaatcccat agtccagagg aggcatccct 2101 aggactgcgg gcaagggagc cgggcaagcc cagggcagcc ttgaaccgtc ccctggcctg 2161 ccctccccgg tgggggccag gatgctgaag aagcagtctg cagggcttgt gctgtggggc 2221 gctatcctct ttgtggcctg gaatgccctg ctgctcctct tcttctggac gcgcccagca 2281 cctggcaggc caccctcagt cagcgctctc gatggcgacc ccgccagcct cacccgggaa 2341 gtgattcgcc tggcccaaga cgccgaggtg gagctggagc ggcagcgtgg gctgctgcag 2401 cagatcgggg atgccctgtc gagccagcgg gggagggtgc ccaccgcggc ccctcccgcc 2461 cagccgcgtg tgcctgtgac ccccgcgccg gcggtgattc ccatcctggt catcgcctgt 2521 gaccgcagca ctgttcggcg ctgcctggac aagctgctgc attatcggcc ctcggctgag 2581 ctcttcccca tcatcgttag ccaggactgc gggcacgagg agacggccca ggccatcgcc 2641 tcctacggca gcgcggtcac gcacatccgg cagcccgacc tgagcagcat tgcggtgccg 2701 ccggaccacc gcaagttcca gggctactac aagatcgcgc gccactaccg ctgggcgctg 2761 ggccaggtct tccggcagtt tcgcttcccc gcggccgtgg tggtggagga tgacctggag 2821 gtggccccgg acttcttcga gtactttcgg gccacctatc cgctgctgaa ggccgacccc 2881 tccctgtggt gcgtctcggc ctggaatgac aacggcaagg agcagatggt ggacgccagc 2941 aggcctgagc tgctctaccg caccgacttt ttccctggcc tgggctggct gctgttggcc 3001 gagctctggg ctgagctgga gcccaagtgg ccaaaggcct tctgggacga ctggatgcgg 3061 cggccggagc agcggcaggg gcgggcctgc atacgccctg agatctcaag aacgatgacc 3121 tttggccgca agggtgtgag ccacgggcag ttctttgacc agcacctcaa gtttatcaag 3181 ctgaaccagc agtttgtgca cttcacccag ctggacctgt cttacctgca gcgggaggcc 3241 tatgaccgag atttcctcgc ccgcgtctac ggtgctcccc agctgcaggt ggagaaagtg 3301 aggaccaatg accggaagga gctgggggag gtgcgggtgc agtatacggg cagggacagc 3361 ttcaaggctt tcgccaaggc tctgggtgtc atggatgacc ttaagtcggg ggttccgaga 3421 gctggctacc ggggtattgt caccttccag ttccggggcc gccgtgtcca cctggcgccc 3481 ccaccgacgt gggagggcta tgatcctagc tggaattagc acctgcctgt ccttcctggg 3541 cccctccttg ccacatcatg agctgaggtg ggaccacagt ccccaggctg catcggcctg 3601 cctgtgtttc cctcttaggt gcatttatct ttttgatttt tccgagtggc atttaagtgc 3661 acaaatgata acaagaggat tattctcccg ttctcaaggg agtcagatca ggggaactat 3721 tctagggtat gttgcggggt attaagcagg aaaccactgt gtggtggggg gcactgggct 3781 tgttggggcc agaaatgtcc acgtcctgag ctttctcctg gagcatgtgc agagagtttg 3841 gcaacgttcg ctctcttgac cagacccctt ctccctgacc tggctcttcc agccagggca 3901 cgagccctcc ttctatacct gctccccttc ccccagtggg gactgagtta tgggagaagg 3961 ggacatattt gtggccaaaa tgatactaac caaaggggct tccttgtcag ggcctggtgg 4021 agttggtggg tcatcggggc tcactgcctc ctgcccttct ctcctgtctg acccccactt 4081 agcccttctc tccttgcagc ctagcagttt atagttctga gatggaaagt tgaagggggc 4141 aagcaagacc tctcctcagc ccatgcccag ctgtcaggag agaggtgcag ggaggaaggc 4201 cttgtgctgg gacaacctct ctcttgcctt acctcagaga gggactatgc cctgacccct 4261 cctttctgaa aatcagtgcc ctccctgttg ctctaggagg ctcctgctgg cttggtagaa 4321 gacagaattc gatctgcctg tccctttttc ccctggggtt tgacacacag gctcctctca 4381 gcatgaggtg gagcagtgac caggtggagc agtgaccagg acgcctctgg cccagtgctg 4441 cccagcctcc ccgcccgctc ccaggcgccc catgtcctca caggccagga cgccatggca 4501 ggatggagag gacttggtgg atttttgttt cttgcctgac ctcagtttca tgaaagaaag 4561 tggaagctac agaattattt tctaaaataa aggctgaatt gtctgaaaaa tatttatgtg 4621 tgtgtgtcct ggaaaagaag gtggcaggca gggaaagaaa ggaaaaggga gaataaagag 4681 ttaagaagag gtctagacgg gtggg // LOCUS HSODCG 9043 bp DNA PRI 24-APR-1993 DEFINITION Human gene for ornithine decarboxylase ODC (EC 4.1.1.17). ACCESSION X16277 NID g35137 KEYWORDS Alu repetitive sequence; ornithine decarboxylase; repetitive sequence. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9043) AUTHORS Steeg van,H. TITLE Direct Submission JOURNAL Submitted (21-AUG-1989) Steeg van H., Institute of Public Health and Environmental Protection, P O Box 1, 3720 BA Bulthoven, Netherland REFERENCE 2 (bases 1 to 9043) AUTHORS van Steeg,H., van Oostrom,C.T., Martens,J.W., van Kreyl,C., Schepens,J. and Wieringa,B. TITLE Nucleotide sequence of the human ornithine decarboxylase gene JOURNAL Nucleic Acids Res. 17 (21), 8855-8856 (1989) MEDLINE 90067851 FEATURES Location/Qualifiers source 1..9043 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood" /cell_type="CML cells." TATA_signal 763..767 mRNA join(795..1001,3858..3967,4073..4191,4475..4648, 4855..5027,5286..5420,5551..5632,5809..5892,6948..7110, 7193..7305,7399..7613,8254..8740) prim_transcript 795..8740 exon 795..1001 /number=1 intron 1002..3857 /number=1 misc_feature 2682..2980 /note="Alu repeat I" misc_feature 3211..3477 /note="Alu repeat II" exon 3858..3967 /number=2 intron 3968..4072 /number=2 exon 4073..4191 /number=3 CDS join(4090..4191,4475..4648,4855..5027,5286..5420, 5551..5632,5809..5892,6948..7110,7193..7305,7399..7613, 8254..8398) /EC_number="4.1.1.17" /codon_start=1 /product="ornithine decarboxylase (ODC)" /db_xref="PID:g296667" /db_xref="SWISS-PROT:P11926" /translation="MNNFGNEEFDCHFLDEGFTAKDILDQKINEVSSSDDKDAFYVAD LGDILKKHLRWLKALPRVTPFYAVKCNDSKAIVKTLAATGTGFDCASKTEIQLVQSLG VPPERIIYANPCKQVSQIKYAANNGVQMMTFDSEVELMKVARAHPKAKLVLRIATDDS KAVCRLSVKFGATLRTSRLLLERAKELNIDVVGVSFHVGSGCTDPETFVQAISDARCV FDMGAEVGFSMYLLDIGGGFPGSEDVKLKFEEITGVINPALDKYFPSDSGVRIIAEPG RYYVASAFTLAVNIIAKKIVLKEQTGSDDEDESSEQTFMYYVNDGVYGSFNCILYDHA HVKPLLQKRPKPDEKYYSSSIWGPTCDGLDRIVERCDLPEMHVGDWMLFENMGAYTVA AASTFNGFQRPTIYYVMSGPAWQLMQQFQNPDFPPEVEEQDASTLPVSCAWESGMKRH RAACASASINV" intron 4192..4474 /number=3 exon 4475..4648 /number=4 intron 4649..4854 /number=4 exon 4855..5027 /number=5 intron 5028..5285 /number=5 exon 5286..5420 /number=6 intron 5421..5550 /number=6 exon 5551..5632 /number=7 intron 5633..5808 /number=7 exon 5809..5892 /number=8 intron 5893..6947 /number=8 misc_feature 6123..6410 /note="Alu repeat III" exon 6948..7110 /number=9 intron 7111..7192 /number=9 exon 7193..7305 /number=10 intron 7306..7398 /number=10 exon 7399..7613 /number=11 intron 7614..8253 /number=11 exon 8254..8740 /number=12 BASE COUNT 2280 a 1970 c 2372 g 2421 t ORIGIN 1 ggatccgggt cccctcacgc tcctggctga gtccctggct tcacagggga aactacctcc 61 gcaggccagg acccatctag ttacaggata cctcgatgtt acaaagacga ggcttccagc 121 gcgggggcgt ggaggcggct gccagccctg cccgcagcgt gctggcgacc cccgggacgc 181 cccttccctc ccgcgcctct gctccctagc tggtgggagc agagcgcacc gggatcactt 241 ccaggtccct tgcaccggag gaatgggcgg cagcagggtc cggagtcggc ccggcggggc 301 ccacgtggcc agcacatcgg tcctccgctc gcgatttccc ttttccgctc tcgggcacga 361 ggtactgaac gccaggtgga agcacagctg tgcagctaca ggctctgccg ttcagctgcc 421 gcgggccggg gccggggcct gcggcgtcgt gcgcgtgcgc ggaccagttc caggcgggcg 481 agaccgccgc agggcggggc ggggcgaggc ggccgcaggg cggggagggc ggggagaggc 541 ggccgcaggg cggggagggc ggggcgcgaa gccgggggcg ggggccacgc gtggggcagg 601 cggtgctcgg ctcggctgac gtcggcccgc cggcgcccca ccagctccgc gcgggcccgg 661 gttggccacc gccgggcccc cgcccctccc ccggccgtgt cccggccgga accgatcgtg 721 gctggtttga gctggtgcgt ctccatggcg acccgccggt gctataagta gggagcggcg 781 tgccgtgggg ctttgtcagt ccctcctgta gccgccgccg ccgccgcccg ccgcccctct 841 gccagcagct ccggcgccac ctcgggccgg cgtctccggc gggcgggagc caggcgctga 901 cgggcgcggc gggggcggcc gagcgctcct gcggctgcga ctcaggctcc ggcgtctgcg 961 cttccccatg gggctggcct gcggcgcctg ggcgctctga ggtgagggac tccccggccg 1021 cggaggaagg gagggagcga gggcgggagc cggggcgggc tgcgggcccc gggccccggg 1081 cacgtgtgcg gcgcgcctcg ccggcctgca gagacacgtg gtcgccgagc gggccacgac 1141 cttgaggcgc cgcttcctcc cggcccgggg ttctcccgcg gctggataag ggtgatccgg 1201 gcgcctcgtt ctgcccccgt cttcacagct cggggctgga ggggcctagg ggagacccac 1261 ccggagaccc tgcggccccg cgccggcctc tttcccaacc cttcggcggc cgcgcgctgg 1321 ccggggagcc gttggggagg ccctggcggc cgcgcagcag gtgcaggggc gcagagcctg 1381 ggctcgcctt ggtacagacg agcggccccg gccttggcgc cttcagtttc cttccagttt 1441 ttattttcgc tgtgtctaca gagcagatga caccaatttg gaaacccgcg agagtgggta 1501 gagctaagat agtcttgctg tagtagctgt gatattagat gctcggccat gacttagagg 1561 tgtttattta aggactgtga atgactcggt gatttcggaa aagcttggct tagatgaacg 1621 gacatacaca ggggagacag ccctaaggtt tgcagaaaag gctgattgtg ctgtttgcga 1681 agtcgaaata attggtgaaa gtgtagaagg cagaacctct caggaatgtc tggggaggac 1741 aaagaatgtg ttggctgact ttgtttaaac ataaaattgg gcagacttta attgatttgt 1801 gaaatttttt tcaaagtttg tttgaattag cccctatctc ttctaacatt atcctcttgt 1861 gctaattgat tgaccatttt aaataactta gctgttacag aaagaccgaa aggtgttctt 1921 cagtaaaata tattcaagta agttacttaa gtaacgcctt aaaagataca gaaaagcaaa 1981 aaagtattgg cgtattaaaa agaaatcaaa actttccaag tttaggcctg aacattgcct 2041 taaaaatatt taataaggcc tcaaatgacc cagtccgaga ctgcatgagc ctatttatta 2101 ttaaattgta aatattcttc atataaacaa aaatatataa ccatgtctgt aacaaaaatg 2161 gttttgctag cgttgttact ctcttccctt ctccgagggg tgatttaggc aacttcggag 2221 gttgacaatg ccaagcagtc acaatagata gagctttaaa gcaaattcta tgcatgggtt 2281 tggatttatg acaggcccgt caccctgggc ctgtcatagt accccatgcc agagcaaact 2341 gtgtccccga accattgcct ggcctctgtg cccgtaggct gctggcactg aagtgggttg 2401 cacagtggaa aagaagaaag ctctacctgg cagaaatttt taaaggttaa aataaataat 2461 tttaagaaag ctggttcaca aggtgccaca tttgatgaaa gcaaaataca gtggctttta 2521 ttgttactag agtgatgttc ttgcttgttt ttcttttttg gtgaagttag ccccaaatta 2581 ttctcatagc taagcaaata cgagagtgac tgtaaggaca gttggcattc ccggaattgc 2641 taaacttggt aggcaacgct ggtttaagaa tactgagttc tagccgggcg tggtggctca 2701 cgcctgtaat cccaacactt tgggaggctg aggcaggcgg atcacctgag gtcgggagtt 2761 ggagaccagc ctgactaaca tggagaaacg ccatctccac taaaaatata aaattagcca 2821 ggccccgggt gtggtggcac atgccggtaa tcccagctac tcgggagact gaggcaggag 2881 aatcgcttga acccaggagg cggaggttga ggtgagccga gatcatgcca ttgcactcca 2941 gcctgggcaa caagagtaaa actctgtctc aaaaaaaaaa aaaaaaaaat actgaattct 3001 gatcaggtaa cagcaactgt aatacaatgt gataagttga cttgaagatt acagttttta 3061 agaagtatat acccagctaa tacatgaaaa ttaactcgta aaatctcaaa tgctccagac 3121 atttccatga tgcctgttgg tcagtaaaaa tcattctaag acttagtgga agtaggaaat 3181 gtttgtatgg ctgtgtataa aggctataat gtaatcccag cactttggaa gaccgaggcg 3241 ggtggatcac ctggggtcag gagtttgaga cccacctgga caacgtggtg aaatcctgtc 3301 tctactaaaa acacaaaaat tagccgggca tggtggcagg cgcctgtaat cccagctgct 3361 ggggaggctg aggcaggaga atcgcttgaa cccgggaggc agaggttgca gtgagccaag 3421 attgcaccgc tgcactccag cctgggtgac agcgtgagac tctgtctcaa aaaaaataaa 3481 aaagtctata atgctatttt aagtttctaa ggaactgaaa ctgctctgaa ataaatcaga 3541 ccattataag acttttttcc atatcagtga gctaagtgca gataagcttc tgaaacttgc 3601 atgctagatt tttttggtac aaatatttga aatgcttagt gtgctgcctt ggaaaaacct 3661 ggtatttttt gttgtgtcct tatactgcca aggtttatgg aatcatgtac cttatgccta 3721 gtaataatta ggatgaccag gccagtgagt ggttcatatc cggggcatga ttagctctgc 3781 gtgtgctcag ccagtgcccc atcttcaact cgatgtgttc ctaaggtaga cagcaaattc 3841 cctattttat ttctcagatt gtcactgctg ttccaagggc acacgcagag ggatttggaa 3901 ttcctggaga gttgcctttg tgagaagctg gaaatatttc tttcaattcc atctcttagt 3961 tttccatgta agtattcagt ttacatttat gttgcaggtt aatcttaaga attgtattgc 4021 taaggcttct aagtgaattt ctccactcta tttgcatttt gttgcatttc agaggaacat 4081 caagaaatca tgaacaactt tggtaatgaa gagtttgact gccacttcct cgatgaaggt 4141 tttactgcca aggacattct ggaccagaaa attaatgaag tttcttcttc tgtaagtata 4201 tgaggcccat gctggcagtg cagctgagag tgccaggcaa gtggaaaact ttggcaaggt 4261 ctaaggaaga gcaatgaggc ttacatgtct tgttatggaa tgtagaaatt aattcactgg 4321 tggtaaatta atagtgataa tggtgatact catatcagtg gctagactca aaagagcagg 4381 attcattgtg actgatggga atgaaggtcg ctggctattg gtgtggtgtg tggtgaggct 4441 gctagtgagt cacctgtgac cactcttgtt tcaggatgat aaggatgcct tctatgtggc 4501 agacctggga gacattctaa agaaacatct gaggtggtta aaagctctcc ctcgtgtcac 4561 ccccttttat gcagtcaaat gtaatgatag caaagccatc gtgaagaccc ttgctgctac 4621 cgggacagga tttgactgtg ctagcaaggt aagcgatagc agcaggcctc aaaagcgttg 4681 tataaaatgg gcctggtatt ccccacgagg cagatacaag ttgtgttttt tgggcaataa 4741 atgctcacta aaggcaaatg gggcgggggg gtacatgaca acttcccatg cttttctgtt 4801 tattccacgt gttaagccac atatggatag catgacacca ctcttctttt tcagactgaa 4861 atacagttgg tgcagagtct gggggtgcct ccagagagga ttatctatgc aaatccttgt 4921 aaacaagtat ctcaaattaa gtatgctgct aataatggag tccagatgat gacttttgat 4981 agtgaagttg agttgatgaa agttgccaga gcacatccca aagcaaagtg agttattccc 5041 ccatctgagg gcaagatcgg gagcataaga tatgtggatt cttatcaaac aaacttaaat 5101 ttctgattat tatatttcta tactttagta gaaagtagtt gaaaccccca ttgagtcatg 5161 aagcctggga ctcaaactac agaatatatc agcgacagta tttagaacag gattgttttt 5221 attttaattg tggctataag tgaacatcta tcatgagaca tttgctgcac tttccttgct 5281 tgtaggttgg ttttgcggat tgccactgat gattccaaag cagtctgtcg tctcagtgtg 5341 aaattcggtg ccacgctcag aaccagcagg ctccttttgg aacgggcgaa agagctaaat 5401 atcgatgttg ttggtgtcag gtgagatttt ggtgggatag ctagaggtca agacattgaa 5461 cagtttgagt tttacaggct ttctcctagt gtttgctatt attttaagaa atactaagac 5521 acagtgtctc gtctctttat tttaccccag cttccatgta ggaagcggct gtaccgatcc 5581 tgagaccttc gtgcaggcaa tctctgatgc ccgctgtgtt tttgacatgg gggtgagtat 5641 acgtgaccct gttagggaag ggcgggacac aactgacaat aactagtctt aattctagag 5701 ttaacttttt atggcagttg gttctgtatt acatgggttt cagcctatct gctgcataca 5761 tttttgttat tagctgtgga tctggctgac ttattttctt gattctaggc tgaggttggt 5821 ttcagcatgt atctgcttga tattggcggt ggctttcctg gatctgagga tgtgaaactt 5881 aaatttgaag aggtaattta gaacaaaact gtaatactca gtagccgttc taataaattc 5941 ctttttggaa tatttcaaaa tttaagtgtc ttaactaata ccacaatggg ctgaagtgtc 6001 ttggtgtgat attttgagtg atttctttgt gctgtctgac attacacttg ataccatttg 6061 gttttctaaa gtgtgaatca gctttcccag aagtcttgga taattggtta cattggaaat 6121 catggctcac acctgtaatc cagcacttgg ggaggccaag gtggtaggat cacttgagcc 6181 caggagtttg agaccagcct gggcaacaca gtgagacccc atctctacaa aaaaaatttt 6241 aaaattagcc tggtgtggtg gcgggcacct gtaatcccag ctacttggaa ggctgaggtg 6301 ggaggatcac ttgagcccag gaggttgagg ctgcagtgag ccatgatcat gccactgcac 6361 tcagcctggg ctacagagtg agaccctgtc tcaaaaaaaa aaaagaaaaa gcatgttgct 6421 gtgggcttcc tagagaatat gctgactgta gcacatcatc accccaaatg tgctttgcta 6481 gacctatgct tcctctcctt aaaatacttg aaatgtttag tcacttagga agttaagcca 6541 ttatattggt gcttgaattt ataaaataca tccacatggt ttgttaaaat catgacgtag 6601 gcagaatagg atttttatcc tgttggcatg tatttgttaa aatgttttga catcttgatg 6661 ccttcctagg tagtagttag ttgcgtactg ttctttgata aaaatcatac ccataacatc 6721 ctaaaggaga tagggtgcct ggaggggaat gaaaacgagc cacctgggat atgtagcctg 6781 gttttcaggg agatgttgat gtttttttgc ttttgttact ttaatgataa acctgtctgt 6841 tgatgcctgg tctcatgatg tcatgtcaca aggccctgtg atgttactcc cccatgtgaa 6901 tttcccacaa tgaaggctgc tctttctttt ctgtttcact ctcttagatc accggcgtaa 6961 tcaacccagc gttggacaaa tactttccgt cagactctgg agtgagaatc atagctgagc 7021 ccggcagata ctatgttgca tcagctttca cgcttgcagt taatatcatt gccaagaaaa 7081 ttgtattaaa ggaacagacg ggctctgatg gtatgtataa aggacgaatc acttcatgta 7141 taactgaaag ctgatgcaaa aagtcattaa gattgttgat ctgcctttct agacgaagat 7201 gagtcgagtg agcagacctt tatgtattat gtgaatgatg gcgtctatgg atcatttaat 7261 tgcatactct atgaccacgc acatgtaaag ccccttctgc aaaaggtaat ttctgagcat 7321 actgtataaa acaattaaga ggactggtca caacacgtgt aattaagtag tacttcctct 7381 ctccgtctct ttatatagag acctaaacca gatgagaagt attattcatc cagcatatgg 7441 ggaccaacat gtgatggcct cgatcggatt gttgagcgct gtgacctgcc tgaaatgcat 7501 gtgggtgatt ggatgctctt tgaaaacatg ggcgcttaca ctgttgctgc tgcctctacg 7561 ttcaatggct tccagaggcc gacgatctac tatgtgatgt cagggcctgc gtggtaagta 7621 agccatgcat gttgatggtg ctgccaagaa taggcacctt cttggatgtg tgcttcttgt 7681 ctagacgaat aagaaattgt cttgcctaag attaaatata tatggatatt tttcctaaga 7741 aaagttttag aaaagactga tgagtgtatt tctatgtaat tggaatatat ttaagttcat 7801 gccatgtgtc ttgtggtttc cttattacca aaacggtgac tgaagaaacg cttgctttag 7861 aaatacattg aattggccag gtgtgctggc tcacacctga aatcacaaca cattgggagg 7921 ccaaggcaga aggatcactt gagcccagga gttcgagcct gggcaacata gtgagaccct 7981 gtctctacaa aaaattaaaa aattagttgg ccatggtagt gggcgcctgt agtcccagct 8041 gcttggctaa ggtgagaggt ttgcttgagc ctgggaggtt gaggctgcgg tgagctatga 8101 tagcaccatt gtattccagc ctgagtaaca gagaaagacc ctgtctcaga aaaaaaaaaa 8161 atacattgaa ttgtttcctg atgggaagta aatactctca tgcccagtta ggagtgagtc 8221 agggttttta atatgccact ttttctttct caggcaactc atgcagcaat tccagaaccc 8281 cgacttccca cccgaagtag aggaacagga tgccagcacc ctgcctgtgt cttgtgcctg 8341 ggagagtggg atgaaacgcc acagagcagc ctgtgcttcg gctagtatta atgtgtagat 8401 agcactctgg tagctgttaa ctgcaagttt agcttgaatt aagggatttg gggggaccat 8461 gtaacttaat tactgctagt tttgaaatgt ctttgtaaga gtagggtcgc catgatgcag 8521 ccatatggaa gactaggata tgggtcacac ttatctgtgt tcctatggaa actatttgaa 8581 tatttgtttt atatggattt ttattcactc ttcagacacg ctactcaaga gtgcccctca 8641 gctgctgaac aagcatttgt agcttgtaca atggcagaat gggccaaaag cttagtgttg 8701 tgacctgttt ttaaaataaa gtatcttgaa ataattaggc attgggacgt ttttatggtg 8761 tgttcattcc agacagttca cgaatcccgt atagctcgct ctgattctca gagaacaatg 8821 agtgggtcca cccacacaca ggtaggagga caggtgagac ggaagcccca tcctcccatg 8881 tggacggtgc acatctgctc agcccacccc acatgtccag agttggctgc aaactccttg 8941 tccagagcct ctggtggtgg gacctactta agtctgacgg acctgtcctg tccaggccag 9001 tgcccaggga aggtgtggga ggccctttga gcctggcctg cag // LOCUS HUMGALTB 4286 bp DNA PRI 14-AUG-1995 DEFINITION Homo sapiens galactose-1-phosphate uridyl transferase (GALT) gene, complete cds. ACCESSION M96264 NID g945216 KEYWORDS galactose-1-phosphate uridyl transferase. SOURCE Homo sapiens placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4286) AUTHORS Leslie,N.D., Immerman,E.B., Flach,J.E., Florez,M., Fridovich-Keil,J.L. and Elsas,L.J. TITLE The human galactose-1-phosphate uridyltransferase gene JOURNAL Genomics 14 (2), 474-480 (1992) MEDLINE 93052353 COMMENT On Aug 17, 1995 this sequence version replaced gi:306758. FEATURES Location/Qualifiers source 1..4286 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="9p13" exon 298..406 /gene="GALT" /note="G00-119-971" /number=1 gene 298..4209 /gene="GALT" mRNA join(298..406,715..884,1117..1192,1288..1336,1462..1591, 1745..1801,1964..2086,2392..2524,2628..2711,3039..3193, 4004..4209) /gene="GALT" /note="G00-119-971" CDS join(325..406,715..884,1117..1192,1288..1336,1462..1591, 1745..1801,1964..2086,2392..2524,2628..2711,3039..3193, 4004..4084) /gene="GALT" /EC_number="2.7.7.12" /codon_start=1 /db_xref="GDB:G00-119-971" /product="galactose-1-phosphate uridyl transferase" /db_xref="PID:g306759" /translation="MSRSGTDPQQRQQASEADAAAATFRANDHQHIRYNPLQDEWVLV SAHRMKRPWQGQVEPQLLKTVPRHDPLNPLCPGAIRANGEVNPQYDSTFLFDNDFPAL QPDAPSPGPSDHPLFQAKSARGVCKVMCFHPWSDVTLPLMSVPEIRAVVDAWASVTEE LGAQYPWVQIFENKGAMMGCSNPHPHCQVWASSFLPDIAQREERSQQAYKSQHGEPLL MEYSRQELLRKERLVLTSEHWLVLVPFWATWPYQTLLLPRRHVRRLPELTPAERDDLA SIMKKLLTKYDNLFETSFPYSMGWHGAPTGSEAGANWNHWQLHAHYYPPLLRSATVRK FMVGYEMLAQAQRDLTPEQAAERLRALPEVHYHLGQKDRETATIA" misc_feature 390..406 /gene="GALT" /label=primer intron 407..714 /gene="GALT" /note="G00-119-971" /number=1 exon 715..884 /gene="GALT" /note="G00-119-971" /number=2 intron 885..1116 /gene="GALT" /note="G00-119-971" /number=2 exon 1117..1192 /gene="GALT" /note="G00-119-971" /number=3 intron 1193..1287 /gene="GALT" /note="G00-119-971" /number=3 exon 1288..1336 /gene="GALT" /note="G00-119-971" /number=4 intron 1337..1461 /gene="GALT" /note="G00-119-971" /number=4 exon 1462..1591 /gene="GALT" /note="G00-119-971" /number=5 intron 1592..1744 /gene="GALT" /note="G00-119-971" /number=5 exon 1745..1801 /gene="GALT" /note="G00-119-971" /number=6 intron 1802..1963 /gene="GALT" /note="G00-119-971" /number=6 exon 1964..2086 /gene="GALT" /note="G00-119-971" /number=7 intron 2087..2391 /gene="GALT" /note="G00-119-971" /number=7 misc_feature complement(2097..2114) /gene="GALT" /label=primer misc_feature 2342..2357 /gene="GALT" /label=primer exon 2392..2524 /gene="GALT" /note="G00-119-971" /number=8 intron 2525..2627 /gene="GALT" /note="G00-119-971" /number=8 exon 2628..2711 /gene="GALT" /note="G00-119-971" /number=9 intron 2712..3038 /gene="GALT" /note="G00-119-971" /number=9 exon 3039..3193 /gene="GALT" /note="G00-119-971" /number=10 intron 3194..4003 /gene="GALT" /note="G00-119-971" /number=10 misc_feature complement(3247..3266) /gene="GALT" /label=primer exon 4004..4209 /gene="GALT" /note="G00-119-971" /number=11 BASE COUNT 948 a 1168 c 1179 g 991 t ORIGIN 1 gaattccgga tcaaatgaat gattgcagca agcaagtcct gtaggcatcc tggagcccaa 61 ggattctgca gtaggcagct ttcacagagg ttcttccagt gtagtggctc tagctctggg 121 tgaagtagga tcatcaatgt cggcccccag ggttcacagc tgttctgagc cccgccccct 181 ggtggcagcc gacgggagtc agtcagtcac gtgctggcgg ctggccaatc atcgggggcg 241 gcgcggggag gggtggtgtg gacggagaaa gtgaaaggtg aggcacggcc ctgcagattt 301 tccagcggat cccccggtgg cctcatgtcg cgcagtggaa ccgatcctca gcaacgccag 361 caggcgtcag aggcggacgc cgcagcagca accttccggg caaacggtaa ctgcaccgcg 421 gcagggactc gctggggcgc ggagccgagc cctccccttc cttaggaagc tttcgtcccc 481 ctccgaaggt tggaacgctc atcccgagcc agaccgacaa ggcgtacagt ctgcaggcct 541 ctacgagcag caggccaatt ggcgctggga aagtccaatc ctgggcctct agctcctgag 601 cgggacaggg ccgagagggc gctcccgagc ttgggcctgc tggtgggtga gacccaggag 661 agagggagct agagggggga gctctgagga ctgatcttga ctgtctgccc ccagaccatc 721 agcatatccg ctacaacccg ctgcaggatg agtgggtgct ggtgtcagct caccgcatga 781 agcggccctg gcagggtcaa gtggagcccc agcttctgaa gacagtgccc cgccatgacc 841 ctctcaaccc tctgtgtcct ggggccatcc gagccaacgg agaggtaagc ctgtagagcc 901 ctgcatctgc aggctgggcc acggggagta gttccctctt agaactgtcc tccacccaca 961 ggatagtgaa cctccttctg ggtcatatcc caccaagctt tttggtcccc tagggtgggc 1021 cttccctact cccttgtagc ctgtccagtc tttgaagccc accaggtaac tggtggtatg 1081 gggcagtgag tgcttctagc ctatccttgt cggtaggtga atccccagta cgatagcacc 1141 ttcctgtttg acaacgactt cccagctctg cagcctgatg cccccagtcc aggtaacctg 1201 gctccaactg ctgctgggga ggagggtggc tagacctctt gagggacttc tgctgcagag 1261 atgctgagtg atactccttt acctcaggac ccagtgatca tccccttttc caagcaaagt 1321 ctgctcgagg agtctggtaa ctatggattt cccctcttac aactttcaaa ccagagttgg 1381 agactcagca ttggggttcg ccctgcccgt agcacagcca agccctacct ctcggttatc 1441 ttttctcccg tcaccaccca gtaaggtcat gtgcttccac ccctggtcgg atgtaacgct 1501 gccactcatg tcggtccctg agatccgggc tgttgttgat gcatgggcct cagtcacaga 1561 ggagctgggt gcccagtacc cttgggtgca ggtttgtgag gtcgcccctt cccctggatg 1621 ggcagggagg gggtgatgaa gctttggttc tggggagtaa catttctgtt tccacagggt 1681 gtggtcagga gggagttgac ttggtgtctt ttggctaaca gagctccgta tccctatctg 1741 atagatcttt gaaaacaaag gtgccatgat gggctgttct aacccccacc cccactgcca 1801 ggtaagggtg tcaggggctc cagtgggttt cttggctgag tctgagccag cactgtggac 1861 atgggaacag gattaatgga tgggacagag gaaatatgcc aatgatgtgg aggcttggag 1921 gtaaaggacc tgcctgttct tctctgcttt tgccccttga caggtatggg ccagcagttt 1981 cctgccagat attgcccagc gtgaggagcg atctcagcag gcctataaga gtcagcatgg 2041 agagcccctg ctaatggagt acagccgcca ggagctactc aggaaggtgg gagagagcca 2101 agccctgtgt ccccaaggag tccctaactt tcttatccca tgagagaggt gtgtaaagga 2161 gaaagctaga ggtgaactag tagagagaga cttgctagga ggccttagca ataatccagt 2221 aatctaaagg aaagatgatg gtgacttaga ctcgggtggt tagtggtaga ggtggtgaga 2281 agacatcaga tcctgggcac attcttttct tctgcttccc ttgcctattt gctgaccaca 2341 ctccggctcc tatgtcacct tgatgacttc ctatccattc tgtcttccta ggaacgtctg 2401 gtcctaacca gtgagcactg gttagtactg gtccccttct gggcaacatg gccctaccag 2461 acactgctgc tgccccgtcg gcatgtgcgg cggctacctg agctgacccc tgctgagcgt 2521 gatggtcagt ctcccaagta ggatcctggg gctaggcact ggatggaggt tgctcccagt 2581 agggtcagca tctggacccc aggctgagag tcaggctctg attccagatc tagcctccat 2641 catgaagaag ctcttgacca agtatgacaa cctctttgag acgtcctttc cctactccat 2701 gggctggcat ggtgaggctt ttcaagtacc tatatttagc cccaacacca tttctgggct 2761 cctgggctca gcctagtgaa ctgcaacctc aaaggagcaa gccttgaaac agttgctggg 2821 ggaagtggcc agagtagaga tgctgggact gagggtggag cagcaaactt ggtgaaacta 2881 catctccaat gtgctttcta atctcctgcc agctcttctc aagcagggga tcctgggaga 2941 tgtagttttc agatacctgg ttgggtttgg gagtaggtgc taacctggat aactgtaaaa 3001 gggctctctc tccccactgt ctctcttctt tctgtcaggg gctcccacag gatcagaggc 3061 tggggccaac tggaaccatt ggcagctgca cgctcattac taccctccgc tcctgcgctc 3121 tgccactgtc cggaaattca tggttggcta cgaaatgctt gctcaggctc agagggacct 3181 cacccctgag caggtcagga ctcagaacag tctggcgtct ccagactctc acatgcagta 3241 tgtgcaggca cctgatactt ctgttgccct tgtgctccag tcattgcaca aggcagaaac 3301 agctctggca ggaagggact gccaaagtta ggagccctag ggcctggaag gagagtatgg 3361 tcctcagatc ccccttctct cctgcttcct ccagggaacc caacagtcat gaccctgata 3421 gtttcccata acaacctggg cattccttgg gactcaggag ctgctaaact ctttcatccc 3481 ctggtggctt cagcagtcct tatcaccagc ctcacaatcc cacaggccca cccccagtgg 3541 gcctgtggca ttcatatttc atattcatat ttcaaaccac aatatccagc aaaatgtctc 3601 ctgagcaccc agaactccat accatcggcc gggtgtggtg gctcatgcct taatcccagc 3661 actttgggag gtcaagatgg gaggattgct tgagcccaga agttcgagac tagcctggga 3721 aacataggaa gccctcgtct ctacaaaaaa aatttaaaaa gttagccagg tatggtggca 3781 tatacgatgc tttgtggtcc cagatacttg ggaggctgag ataggatcac ttgggcccag 3841 gagtttgagg ctgcagtgag ccatcatcat ggcatcattg cattccagcc tgggcaacag 3901 agcaagacct cgtctcaaaa aaaaaaaaaa aaatgaagtc catgccacca ttcttggcag 3961 cccagccctt atcctcctta attgctccct gtcccttttc caggctgcag agagactaag 4021 ggcacttcct gaggttcatt accacctggg gcagaaggac agggagacag caaccatcgc 4081 ctgaccacgc cgaccacagg gccttgaatc cttttttgtt ttcaacagtc ttgctgaatt 4141 aagcagaaag ggccttgaat cctggcctgg aatttgggca gatatagcat taataaaact 4201 gtgcatctca aacttttatc acatactcta atatcagagg agtgtgaacc ttcagagatc 4261 tagggttaaa agctaaaggc atagct // LOCUS HSU29185 35522 bp DNA PRI 19-FEB-1998 DEFINITION Homo sapiens prion protein (PrP) gene, complete cds. ACCESSION U29185 NID g2865216 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 35522) AUTHORS Lee,I.Y., Westaway,D., Smit,A.F., Wang,K., Cooper,C., Yao,H., Prusiner,S.B. and Hood,L. TITLE Structure and Organization of Chromosomal Regions Carrying the Mammalian Prion Gene from Three Species JOURNAL Unpublished REFERENCE 2 (bases 1 to 35522) AUTHORS Lee,I.Y., Westaway,D., Prusiner,S.B. and Hood,L. TITLE Direct Submission JOURNAL Submitted (31-DEC-1995) Molecular Biotechnology, University of Washington, Seattle, Washington 98195, USA COMMENT Interspersed Repeats were identified with RepeatMasker (available from http://ftp.genome.washington.edu/RM/RepeatMasker.html) Simple sequence repeats were identified with sputnik (available from http://serac.mbt.washington.edu/ chrisa/software/sputnik.html). FEATURES Location/Qualifiers source 1..35522 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela cell line S3" /clone="pGPrP1" /chromosome="20" /map="20pter-12" repeat_region complement(4..268) /rpt_family="MER74B" repeat_region 284..402 /rpt_family="MER3" repeat_region 1619..1685 /rpt_family="L1MA9" repeat_region 2106..2526 /rpt_family="MER65A" repeat_region complement(2812..3124) /rpt_family="MLT1A1" repeat_region 3131..3424 /rpt_family="AluSx" repeat_region complement(3425..3469) /rpt_family="MSTC" repeat_region complement(5410..5640) /rpt_family="LTR16C" repeat_region complement(5410..5640) /rpt_family="MSTC" repeat_region 5969..6265 /rpt_family="AluSx" repeat_region complement(6973..7266) /rpt_family="AluSx" repeat_region 7960..8253 /rpt_family="LINE2" repeat_region 8258..8440 /rpt_family="MER5A" repeat_region 8485..8891 /rpt_family="LINE2" repeat_region 8951..9253 /rpt_family="AluJb" repeat_region 9254..9349 /rpt_family="LINE2" repeat_region 9885..9982 /rpt_family="MLT1G" repeat_region complement(9983..10281) /rpt_family="AluSx" repeat_region 10286..10479 /rpt_family="MLT1G" repeat_region 10552..10759 /rpt_family="MLT1F" repeat_region complement(11478..11800) /rpt_family="AluJo" mRNA join(12634..12767,15390..15488,25464..27817) /gene="PrP" /product="prion protein" gene 12634..27817 /gene="PrP" exon 12634..12767 /gene="PrP" /number=1 repeat_region 14413..14498 /rpt_family="L1MC1" repeat_region 14414..14499 /rpt_family="L1MC1" repeat_region 14583..14653 /rpt_family="MIR" repeat_region 14584..14654 /rpt_family="MIR" repeat_region 14752..14947 /rpt_family="L1ME3" repeat_region 14753..14948 /rpt_family="L1ME3" exon 15390..15488 /gene="PrP" /number=2 repeat_region complement(16201..16334) /rpt_family="MER5B" repeat_region complement(16202..16335) /rpt_family="MER5B" repeat_region 16594..16695 /rpt_family="MIR" repeat_region 16595..16696 /rpt_family="MIR" repeat_region 16945..17564 /rpt_family="L1M2_orf2" repeat_region 16946..17565 /rpt_family="L1M2_orf2" repeat_region complement(17566..18025) /rpt_family="L1M2_orf2" repeat_region complement(17567..18026) /rpt_family="L1M2_orf2" repeat_region complement(18020..18289) /rpt_family="L1M2_orf2" repeat_region complement(18021..18290) /rpt_family="L1M2_orf2" repeat_region complement(18325..18663) /rpt_family="L1M2_orf2" repeat_region complement(18326..18664) /rpt_family="L1M2_orf2" repeat_region 19154..19584 /rpt_family="LINE2" repeat_region 19155..19585 /rpt_family="LINE2" repeat_region complement(19597..19708) /rpt_family="AluJb" repeat_region complement(19598..19709) /rpt_family="AluJb" repeat_region 19709..19925 /rpt_family="LINE2" repeat_region 19710..19926 /rpt_family="LINE2" repeat_region 20075..20349 /rpt_family="AluJo" repeat_region 20076..20350 /rpt_family="AluJo" repeat_region 20472..20604 /rpt_family="FLAM_C" repeat_region 20473..20605 /rpt_family="FLAM_C" repeat_region 20618..20663 /rpt_family="LINE2" repeat_region 20619..20664 /rpt_family="LINE2" repeat_region complement(21394..21928) /rpt_family="L1PA9" repeat_region complement(21395..21929) /rpt_family="L1PA9" repeat_region complement(21916..22898) /rpt_family="L1M2_orf2" repeat_region complement(21917..22899) /rpt_family="L1M2_orf2" repeat_region complement(23507..24188) /rpt_family="LINE2" repeat_region complement(23508..24189) /rpt_family="LINE2" repeat_region 24671..24821 /rpt_family="L1MB8" repeat_region 24672..24822 /rpt_family="L1MB8" exon 25464..27817 /gene="PrP" /number=3 CDS 25474..26211 /gene="PrP" /codon_start=1 /product="prion protein" /db_xref="PID:g2865217" /translation="MANLGCWMLVLFVATWSDLGLCKKRPKPGGWNTGGSRYPGQGSP GGNRYPPQGGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQGGGTHSQWNKPSKPKTNMK HMAGAAAAGAVVGGLGGYMLGSAMSRPIIHFGSDYEDRYYRENMHRYPNQVYYRPMDE YSNQNNFVHDCVNITIKQHTVTTTTKGENFTETDVKMMERVVEQMCITQYERESQAYY QRGSSMVLFSSPPVILLISFLIFLIVG" repeat_region complement(28160..28231) /rpt_family="TIGGER2" repeat_region complement(28162..28233) /rpt_family="TIGGER2" repeat_region complement(28237..28534) /rpt_family="AluSx" repeat_region complement(28239..28536) /rpt_family="AluSx" repeat_region complement(28537..28907) /rpt_family="MER28" repeat_region complement(28539..28909) /rpt_family="MER28" repeat_region 29185..29635 /rpt_family="MER88" repeat_region 29187..29637 /rpt_family="MER88" repeat_region 29743..30047 /rpt_family="AluSq" repeat_region 29745..30049 /rpt_family="AluSq" repeat_region complement(30816..30998) /rpt_family="MER5A" repeat_region complement(30818..31000) /rpt_family="MER5A" repeat_region complement(31331..31431) /rpt_family="MER5A" repeat_region complement(31333..31433) /rpt_family="MER5A" repeat_region complement(31818..31949) /rpt_family="FLAM_A" repeat_region complement(31820..31951) /rpt_family="FLAM_A" repeat_region 33143..33315 /rpt_family="AluJo" repeat_region 33145..33317 /rpt_family="AluJo" repeat_region 33322..33619 /rpt_family="AluSg" repeat_region 33324..33621 /rpt_family="AluSg" repeat_region 33599..33631 /rpt_unit=caaaa repeat_region 33847..34134 /rpt_family="AluY" repeat_region 33849..34136 /rpt_family="AluY" repeat_region 34288..34748 /rpt_family="MLT2CB" repeat_region 34290..34750 /rpt_family="MLT2CB" repeat_region complement(34883..34958) /rpt_family="MIR" repeat_region complement(34885..34960) /rpt_family="MIR" BASE COUNT 9811 a 7951 c 7700 g 10060 t ORIGIN 1 gatctttggc ctatagtagt gatctctagt tgcctagtac ctggccctgg gatgattaag 61 gcagaggaat attgcttcct gagctgcatg ggccagatag ggaaggtctg gtttaggact 121 ggtttgtttt catatcaaag gcacgttcct ggctgggtcc ttcgttatct ctaagaattg 181 gctggcccag ggaggaacag tctctctcca gctaaaaagg tttttcagga tcaaagtata 241 cagacaactt aaaaaaaaac aagagtaaaa gttatttgac caacatcagc catgtgtggc 301 taatgaacac ttaaaatgtg acaggtgtga ttgaggaatt aaattttagt tttatttaaa 361 tgttttaatg taaatttaaa gagcacatgt ggcttatagc tatggaacat gcagctgtga 421 acactatagt taaaactgac attttgatct gaggattcct tttatttaat cagttctgaa 481 actatagttt tctacattta tatatttgtt ctattgtgaa ttttaggatt tattttgatg 541 atgtactttt tctttgtgaa gtttagaaat gaccatgggg gtcatgtggg ccaaattaac 601 tcaattcctg gacaaaaact atttccccga ccaccccaag cttcttgcca atgtgcgccc 661 ttgggacaag aaaggctagg agaagctgtg gacaaattaa gataaacctg gcaatatcat 721 tagacaaata acccaaagtg aatccctctt ctgatggtca acatcagatg gtcacacaga 781 aatgttattt tcctatgagt aattttgcat tattgttaca tatgatttcc acttacaaat 841 gccacaaata ttatgcaagt acctctctac tgggaaggaa tataacaatt ctaacaaata 901 aacataaaag cagtgtgtac ataagaatgc ctctatacaa ttttttgagc atcaaaatct 961 tagaaagttg aaaaatacca atgaaatata gcattcttat tctttctcat ttcttttatc 1021 tgttctttag agaaaaaaag atggagcaga aagcaagaaa attgtccaac agaattattg 1081 caactgactt agtttcttat aaacaaagaa tgcattgaaa gagaaaacct tgcgtcagca 1141 aagtatatta cacaataaga aagaagaatt tcaaagataa tagagataat ttctcacaga 1201 tgtctgacca gaaagaaatt ctcaactttc taaacttttc tatcataatt tcacatattt 1261 ctctggttca tttataaagg gaaaataact tgagaatatt ccaagatgaa tacatcagca 1321 ctcacatagg taaccagaaa catggggtgt taaatcaatt acaggagtag ctgggttcct 1381 tatccagata agacagcgaa aaaagaccaa gaagccagta gccggtgact ttaaaaattg 1441 ttcgaatatg cttcctcctc actttgtctc ttgaggttct ttgtaaaagc cagggtttca 1501 caattaactt tacagcaatc aaaaagcttt aatcagtcac taaaacaaat gctattttca 1561 cttatcaaat tggtaataat taggaaaaaa gtgataatat catgctggga tattcagagc 1621 acgcttttgt gggaatctaa attggtgcaa ccactttggc tgcatgcctc aaaagcttaa 1681 aaatatgccc atcttttctg caccttctac tcctgacaac ttatttaact acagggatgt 1741 tcttcccagc tatgtgatat acagccatta aatggctgaa tactatgcag ccattaaaat 1801 attgttgcag aatatattgt gatatggggt ggaggtggag cctagctcaa aaattatgat 1861 tgtatttggg taaggaaaaa ttatatgtat acataaaagt aatatgaagg ggatagactt 1921 atgaatattt tataaatatt ttagatattt tcttcatttt cttttattct tttctttagt 1981 cttatctata ttttctaata tgtctatttg atatgtattg cttttacaag aaaaaataac 2041 ttcaatgatc ctataaaagg aggagagagt ggggagcaag tgttttaaat actgcgacat 2101 gaggctgtga aagctctcag aatcaaaatg aagtcaccta tgttaaaaat cccaacaaac 2161 agagccagac gagaccttga agagagggtt gtcatgtaca aaaactattg caagactgca 2221 aaaatcacaa ccttgcacaa agtccatctc aaccttacgc aaaaaaatac ctctacaagg 2281 acatctgccc agcaactgcc tgtccaacct cagactggtg ctaccctcgt tattgaacct 2341 tgtagtcaag gataattaac ccaaaacagt tatgtaaccc tccttatttt tcctctaaaa 2401 acccttgtct tcctttacct ccctgaatag gtacatagtg tattatagta catgcatttc 2461 catcgcaatg cctattccca aataaatatc attttctttt caaaatgttt gttatttagg 2521 ttgacaaggc ctggctgccc actccgtggt ggttgccatc tgttgcttta ttataacaaa 2581 cttcatttaa ctcaagccag gctgagtggc tttctttatc ttgctatcga tgctccctaa 2641 tactctaacc ccaactttgt atcatctgga atccggagat attttttagc caaacaagaa 2701 acccacatgt tgatttgtaa catttcagca gcaccacctg catcaccaaa gactttgaga 2761 cttatggaaa taaattcaaa tcctcaacat ctggaatggt tgagacacat gtataaagaa 2821 cagaaattta tttctcacag tcctggaggc tgggaagtcc aaaatcaaga ctctggcatg 2881 tttggtgtct ggtgagggct gctctttgct tccaagatgg caccttgatg ctgcatcgtc 2941 tagcggggag gaacagtgtg tcctcacatg gcacgaggca gaagggcaag agggccagac 3001 gctgctgtgt gaagcctctt ttaaaagggc ctgaatccca ttcacaagga aagaatcctt 3061 gtgactgaat cacctcttaa agcctccact tcctaatact gtcccatggg ccattaagtt 3121 tcaatgcctg ggccgggtgt ggtggctcac acctgtaatc ccaacacttt gggaggctga 3181 ggcgggcaga tcacctgagg tcaggagttt gaaaccagcc tggccaacat ggcaaaaaca 3241 ctgtctctac taaaaaatac aaaaattagc cgggtgtgtt ggcacgtgcc tataatccca 3301 gctacttggg gggctgaggc aggagagtca cttgaacccg ggaggcagag gttgcagtga 3361 gacaagatca tgccactgca ctccagcctg gggaacagag cgaaactccg tctaaaaaaa 3421 aaaagtttca atgcctggat cttggagggg acacattcaa accgtagcac cattttatat 3481 tatttgtgga ttagggttaa tactttgtca tctcagacaa accatttcta agaaggtaga 3541 ggtttgcgtg tgaggcttga gccacaggta tgacactcca ttgtcatgca caaattgtta 3601 cttccttaag atatgatcca acagacctgg agacctgaat gcatttaaca caaatgaaag 3661 tttctgcctc ctccctcagc ccatttgagc ttccatttcc ctctgaccca gtgcgcattc 3721 taccctttgc agtctcaaga tgcttctccc tggcagaaaa atcaaaagca aaatatgaac 3781 aagtataagc cagaattttc agagtaactt tccagattgc ttcaaatggc agcatttcca 3841 atcctggttt gtctcatttg ctttattcaa taaaagaaca agaacaagaa caagaaaaaa 3901 agaacactac tgtgtatgag ctttgacatc acttctgacc atttttactt tcatgttgtt 3961 aattttgcca tttgattttg tttccattac atgctattaa cttctgttca gatgtcactc 4021 atttaaaaat gtgggggtat ttcccattgt tgtcccagtc aggcaattat aacttaacct 4081 aaagacagcc ctgctctcag gtcagccttg ccataaagtc cgattcaaag aaataacagt 4141 ccatcagaaa aatgaaaacc attaaaatct ttgtgtgtct ctcaagaaaa taccacagga 4201 tgagtaaaca aagctgaaaa caagatagag aatagagtct ctcaagaaaa taccacagga 4261 tgagtaaaca aagctgaaaa caagatagag gatagaagtt gatagagagt tagaatttta 4321 acatgtgaca cgccattagt aagtgttttg aacaccaagg gtttggaggg aaagaactgt 4381 gtggtcagga cagcaagcct ggaagctgac cctggctggc aacactcaaa gtaacaagac 4441 aaagatctag ctgagggagg aggctcagag ggcctcagga tgctgccatc ctaatgaaga 4501 acaatcacac ggagggtgaa ggcagtgttc accctctgtg tgaacacatc ctcctaagag 4561 gatgcactgc cctcctccta gaacactgtg tgagggatcc cagctaccca aggtccagga 4621 gccacatgaa agctgcttga agatgtccct tgcaaaccta gcatctgaga atatggaccc 4681 acaaaataca ttcattaagg agaggcccat tttatataat gtacatttac aaagttaata 4741 cacttttgac atattttcac acttttattt caaaaacatc ttgaattacc tgcaaacttt 4801 caggaactca tcaccaacaa ggtccattca tgatgtagtg ggaaataacc actgtgtaat 4861 gatagcataa gcaaattcag ttttaatgag aggaaatagg ggaagaaagt tgaagataaa 4921 acatgcttcc tttataaatc cttctttatt ttattgtccc cactcccaac caacccccaa 4981 ggccaagtaa agagctggga gtaaaaagtt agggcactga agatagattc gggagacgtc 5041 tgggttcctc acctgctaat gcaggaccca gtttcctaca aaacagcgtc tgcctgtcca 5101 aggattgtgg aaatacaggt gtatgttcct gcatgtgtat ccaggaagaa caagtccctt 5161 tctcaggctc gagcccctgg agccaggccc tcttcatagg ccaccccaca gtcaccaccg 5221 tggaggttct ggaagcctca ggatgcccca gggacttccc acccccaagc caggtagtgc 5281 cccaagcata agtgaaaaca cccttttcac tcctttgttt ccctagaaat ccagcagctt 5341 ctaaggttca tgtgctacaa ctctttccat ttttaatatc cacacatgtg caggtgtggt 5401 aatggatcaa ctactgtgag caactggggc ttcacccccc acagaacatt ctacgagata 5461 gagggtggta gcacacttca ggtcatccag ctgcaagagt gaggagttgg agcatttatc 5521 cacagactcc catctcccac tggggagaac tgccctggga cccctgacac cacacacttt 5581 gggggtgctt ctagggggag tagggcaagt tcagatgatg ccagaaaaag tcctcaggca 5641 gagaagagag aggcaggcat cagggtggga aggcatcagc attctggaag ccttagttac 5701 aggtgaactc agatgggcca aggggacaca ggacaaggta tcgacagcat ctgccacaac 5761 tctcttggga ggtacacaaa accaactgag taaacatttg tattatttcc tggatatagg 5821 agcaatacat tttacataaa gatgatgatt cttcacaaat ttattcttaa gtgcaatgcc 5881 atttcaataa aaaaacccaa ccataggtat catggaattt gacaagatta caaaattaat 5941 ctggaaagag aaataagcaa gaatatcggg ctgggcacag tggctcatgc ctgtaatccc 6001 agcactttgg gaggctgagg cgggcagatc acttgaggtc aggagttcga gaccagcctg 6061 ggcaatacgg tgaaacccag tctctactaa aaacacaaaa attagctagg catagtggtg 6121 catgactgta atcccagcta gttaggaggc tgaggcagga gaatcgcttg aacccaggag 6181 gtggaggttg cagtgagatt gtgccactgc actccagcct gggtgacaga gcgagactcc 6241 atctaagaaa aaaaaaatca gaaaatattc tgaaacaaag agaaagtaat gggtgatatg 6301 agtcctacca ggcattgaag tgcattataa agcctttacc tacctcatgg atctaataaa 6361 gcaagaaagg aaaagcctct cctcacgtgg catcccctcc cttggttgtg tcaatttgaa 6421 tcacactgat atgccttatg cggaatctac ataaaaagaa accttttagg cacacaccga 6481 ggaaggggct gaatgagggt agctgcggat cattcactgg tatcagggac aagaaaatta 6541 ccatggcctc aggaactgcc agtaaagaca caggaaggtt ctggttccca gaaaaataaa 6601 gtcagaagcc aagcccaggg tgggaacaaa gaagatcaga aaaaatcaga ccatttcata 6661 aaagctgtga tgcagaagaa gtccagtgga ggagggccca agtgccacac aggttccagg 6721 tgatgctgaa attgtctcgg gtaactgaga tacctcagac ccgcctagac agttagaaat 6781 ggctagaaag agaaaaggaa atttttttct agacaccaag gagaaaatag gaagaaaagc 6841 aatatgagtg gatgggagag gttgccccat attcaggata ccatttaact gattttcccg 6901 ggcaaagcac taggagtcat ctcccaagga atttatagtc cagttaaaga ccaaaagggc 6961 tttaaataat aatttttttt ttttgagatg gagtcttgct ctgtcaccca ggctggagtg 7021 cagtggcaca atcttggctc actgcaacct ccgcctcctg ggttcaagcg attctcctgc 7081 ctcagcctcc ttagtagctg ggactacagg catgcaccac cacgcccagc taatttttgt 7141 atttttagta gagacagggt ttcaccatgt tggccaggct gttcttgaac tcttgacctt 7201 aggtgatcca cccgcctcgg cctcccaaag tgctgggact ataggcgtga gccacttcac 7261 caggccaata ataaattttt aaataataaa gcagtttaaa atgggactgg atgatacaag 7321 ccagttgctg ctggagctag aaggaaggag gctgacttct gatgggaaga tatggaagga 7381 tttcagagaa aaggatgcat cagagccaga actccacaga agagaagcag ggtcaagaaa 7441 ttcagggctg agagaactac tcaagcagac cttcagcaag gcaattaaag gaggacaagt 7501 ggtcctgcag ggcttcgccc agggaaatgc aaagagaaag agagagagag agaaagaggc 7561 atcactggaa atgtttcaca gataccttga ctgtaaacat taactgcagg cactgaaatc 7621 tagccattaa ggaagcattg aatgctttta aagggtgggg agacaggaac cattctgcag 7681 caactgtgca atagtttctc cttaaattca tgattcacat ctttagagaa aatgttccta 7741 acgcctaaga gaagaccata caaacagatt agctgttttc ttcctctgtc tgaagtgcta 7801 tgccatgtga tgcctcaaat tgctacagct gtcaacacca gcttgggtag ggcagagtgg 7861 aaacagggaa gaaacgaggt ctgtgaagat ataaatatac cattgaatgt accctgaccc 7921 gaagaggatc ctccagtatt caaaacaccc tttccagcgc ccaccctctc ctccagctcc 7981 tgcccactcc tctgctcccc gcagggtaaa actcagaaat gttgtcgcta ctccctcacc 8041 ttccattccc tcttcaattt gcttctgccc ccattaatcc attgaaactg ctctcctcaa 8101 gtccccaaaa acctccacgt caccaaatcc agtgagcact atcctgtctt cacatgactt 8161 ggcctctcag tagctggctc tgccttcttg aaacaccctc ctctcagctt ccagaaccct 8221 cttatttcac tggctgtccc ttttcagtct cttcctctgg ttcaaagtgt ggtctctggc 8281 ccagcagcat cagcatcacc taggcacttg ttagaaatgc aggttcctgg accccatccc 8341 agacctgctg aatcaaatct ctggtccggg aggggcgagg cagcaatctg tgttttagcc 8401 aagtcttcca ggtgatgcac actagagact gagaaggact aatccgaaac tgtgggcatt 8461 cctcattacg aaatcctgat tcctttctct tccctggcta cattttctcc catgatgatg 8521 tcatctaagc ccgtgactgg aaatcatcta tactctgaca gcttctaaat gtgtatcttc 8581 agcacccatc tggggattac acagccatct caaattcaac atgtccaaat ggaaggaagg 8641 aactccagat gttccctcta aaatctggtc gccatccatt tttttccatt taagaaaata 8701 gctccaccat tcccccagct gtttattaaa aaaacttaag tcatccacga gtcctccttt 8761 tccccaactc cttaccctat tagcaatagt atatctagag ggtctatctc taaattgtca 8821 tgaatctgtc ctctattacc actataggcc aagccacaat tttctctccc cagcccactg 8881 caaaagcctc cagccaggta ctgctcatgc ccacaaaaca accagagtgg acttttaaga 8941 aggtaaatca ggctaagcat ggtggctcac acctgtgatc ccagcaattt gagaggccga 9001 gataagcaga ttgcttgagc tcaagagttc aaaaccaacc tggacaacat agtgagaccc 9061 ccgtctctaa aacaaataca aaaatcagcc aggcatggtg gctcacgcct gtggtcccag 9121 ctactcagaa gactgaggta agaggatagc ttaagcccag gaggcagagg ttgtagtgag 9181 ccgagatcac cccacggcac tccagcctgg gcaacagagt gagaccctat ctaaaaaaaa 9241 agaagaaggt aaatcatgtc gcatcattcc ccttcttaaa tcttcaaggg cttcctatca 9301 tatttaatta ttaaaaccca aacatctgtc atgggctgtc aggccctatg gtcactgtct 9361 gtatggcttc ctgtatcctc tttgaggtct gaagtcagag actgtaacta tttaattcac 9421 actaagtgct caataaagtg gaggaaataa ttctcctttt gtgagaagat gaatttagag 9481 tctaacttct ttagtttctc cttacttgct gaggaggatc tatatgatgt ggtcaaatca 9541 ctttcagcca ggacacagga gatgggaagg gagacttcca aagaatgact tgatgatgct 9601 gtgaacctca gcacaatagg gctctgtggg caaacacagt agggtttgac ctccaaggac 9661 aaaccctaca ccccactccc cacccagtcc ggaaaacgac tttcactgtt tatagtgaga 9721 tatgggtggt tgtggtcatg ggagaggcaa tgggagtgag tttggaatga ggaattatgc 9781 agggccattt gaatagccac ttcctgtggg ttgcccaccc ttggagtggt tcataaggtc 9841 aataaaaacc tctaagaatc cataggccag accatcaacc ccattgtggc agcttttaaa 9901 actggcaaca aggtctttga tactcttccc attgaggagt gggctccatg tcctctgtcc 9961 ttgaatctgg gggctatatg actttttttt tttcttttct gagacagagt ctcgctctgt 10021 cgcccaggct gaagtgcaat ggtgccatct tggctcactg caacctctgc ctcccgggtt 10081 caagcgattc tcctgcctca ccctcctgag tagctgagat tacaggtgcg catcaccatg 10141 cctggctaat ttttgtattt ttaatggaga cggggtttca ccatgttgcc caggctggtc 10201 ttgaactcct gacctcaagt aatccatggg cctcggcctc ccaaagtgct gggattacag 10261 gcatgagcca ccgcgcccgg caatggggct atataactgt tacgtcaaca gcatacagtg 10321 gaagtgatgc tacctgacta ctgagcccag gcaggaagtg gcattgcagc ttcttgtttg 10381 ctgaaactct cacacttgga gtcctgagcc aacatgtaag aagttcatcc aagccgagac 10441 caccagacca tgaggaaacc aggccacaca gagaagccgc agctaggcac cgcaatcagc 10501 acatctgatc ttcaagtcct cctagcccag gtgccactca ggcaagtgca taggcttcag 10561 atgactgcaa cccccggcta tcaaggtacc cctagactta gagtcttctc aactgacacc 10621 ccaaacataa tggggcagaa tcaagtcccc gctgtgcctg tccagctccc tggcttaaac 10681 aatccattgg cataataaaa tggtagttgt tttaaaccac ctaagttgtg gggtatttgc 10741 agcacatcag taataaccag aataggtcat aagtcaatcc cctttggttc cctagttctt 10801 tccctccaag aaatcccagg ccatttgagc cctttcctga tgggaggaaa gcagtcgacc 10861 agaaaatacg attcttttga ggaaaagatg ggttttagcg tcttggaacc aagcttcaca 10921 gagcgccttc tgccagtggg cagccagccc cttgcctagg ctgatctggt cacctacttc 10981 cagccccacc ccagctctgt cctcccaccc agctctgtcc tccctgcagg ccctgagcct 11041 ctaggagact caaagagaca ctgtgctgcc cgctgaattc atctcccagt gagccatctc 11101 tgaggaaagg aggtaaggct ctgaatgcga tgtcaaccta agcattacat aacactgata 11161 ggttgtaaaa tggcctcgct gcctagtgct cattctaact ggccacaatc cagtcctccc 11221 tgttgcactg agtcagctcc ctccaaaggc ggcgctttcc aggcctcctg gttgcccttt 11281 cccagcagac ctttcaagtg ctcacccaca ctcacataaa catggcccag gcactgttta 11341 cagcagctct cctctgtgca ttctcctggc ttctcctgtc tctcgccatt tgtacccctc 11401 ccctgcaact tgagtgacag tgattggtcc tgagatgcaa atatttgatg cacatatgtt 11461 tacaatgcag cctcagcttt tttttttttt ttttgtaaag agacaaagag tgagacaggg 11521 tcttggccta tagccctgtg gcccaggctg gagtgcagtg gcacaattaa agctcactgc 11581 agcctctacc tcctgggctc aagcaatcct cccatctcag cctccccagt agctgggact 11641 acaggactgt gccaccgttc ccagcaattt tttttatttt ttgtagaaat ggggtctcac 11701 tatgttgctc actctggtct caaactcctg agctcaagca atcttcctgc cttggcctct 11761 gaaagtgctg ggattacagg cctgagccac tgcacctggc tgtcaacatt cttaaatctc 11821 tttccttaca tgcttcctaa acctctcacc caaaactagg agactagatg tcctattttc 11881 cccagggcat gcctggttta cgcccatttc actttaaaag tgcccaattt gggtaataat 11941 ttataagatc cccctccctc taaatcctgt ccttctatca cttcatcctt cgctctcctt 12001 taaaatgaga cagttgtcag caggaatcct gcgcaagaac acaccaccct gtttcataga 12061 agatatctca ggtaatgtgc aaacacgggt ttttaaacgg agcgcatttt tctcatttgt 12121 taatatcacc acctaaatca tctcttgcct aaaacaagga gtagaaagtg aatgaaggaa 12181 ggaacaggtg atggtcagtg tcctttctac gcctcaaaat ttaagagttt atgtgaaaat 12241 tcataaatat taatctcaat ccaggttaag caaaattttt tgctctcctc tttagaaatt 12301 tctggttgcc aaagttccag aaattgcttc ctcattcctg agcctttcat tttctcgatt 12361 tctccattat gtaacgggga gctggagctt tgggccgaat ttccaattaa agatgatttt 12421 tacagtcaat gagccacgtc agggagcgat ggcacccgca ggcggtatca actgatgcaa 12481 gtgttcaagc gaatctcaac tcgttttttc cggtgactca ttcccggccc tgcttggcag 12541 cgctgcaccc tttaacttaa acctcggccg gccgcccgcc gggggcacag agtgtgcgcc 12601 gggccgcgcg gcaattggtc cccgcgccga cctccgcccg cgagcgccgc cgcttccctt 12661 ccccgccccg cgtccctccc cctcggcccc gcgcgtcgcc tgtcctccga gccagtcgct 12721 gacagccgcg gcgccgcgag cttctcctct cctcacgacc gaggcaggta aacgcccggg 12781 gtgggaggaa cgcgggcggg ggcaggggag ccgcgggggc cgagtgagga ccccgggcct 12841 cgggtcccag gcgcaagggt gcccggccgg gcggggtcgg gaccccagtg aggaggggcc 12901 gggggctgcc ccgcgggcgc gtgacggtct cgggcctgcc cggctgcgct ggtctccgct 12961 cgggtgaggc ggcttggctt cgcttttcag gttaggaaag ctccctttac tgcgcgttgg 13021 ggggctgggg gagctggcgg agccacgtta gggaggtcgg tggcgccggg gtgtctcagc 13081 gccccctgca ccccgcgcgg gtccggccca gcgggcgatc gctggcgccc agggaactcc 13141 gggagggccg ccagcgggct ccgcaggcgc ggggcgggga ggggcgcctg ggggccgcgg 13201 ggctcgcgct ccccgcccgt tggccgcccc tcggaggccg agatcggggc ccagaacgcc 13261 ccttggcaaa gcctggcgct tccgcgatgc ccagagggtg cttgggggga tggagagagg 13321 ggcgcccgcc ggggtagttc cgggagcctc ggtgcctccc gccgcagctg cagcgttcct 13381 cccgggaggc ggcccagccc ttcatcctcg ccgcctgagc ttctccgagg ggggctgcag 13441 ccttgcggcc gttgccaccg cctggagaag cggcccacgc ggactgacgg gcgggggcgg 13501 ggcctcgggc ctcggcgggg gcggggtccg gggaggcccc accctctgtt ctccaggggc 13561 ggggagagag gagctgcagg tctgcggcct ggccccaggt gcgatggcgg accccagctt 13621 ggccagtcac attcctccca gtccccctgg agggagaacg ctggccatgg ggggctccaa 13681 ggaacaacca gcctcggatg acgacccttg ggtcaccggt ctccccacct gtgcggcagg 13741 cgccttcacg tttcattatt aaacaatggg gagaaatcca tgtttactgt cctttttagg 13801 aattttttgc tcttctcttt gaggtggctg taggaaatag attttttttt taacctcgca 13861 attccaccac ggtcacatcc atcctcgcca tcgcagagcc acagctctcc gtttttgttt 13921 cctagcctcc agattctcac acaacacagt gcagtttcac tgctgtaatg atgaggatct 13981 tcatggccgc gttattttct tgttctgaga gcatcacggt ttaattagca gttccccata 14041 tgatttgaag tgtttcccgt ttccttaggg aaaactcctg gtagaatagg attaaggatt 14101 tttacaaata taattatcaa aaacatagga acagggaatt ggataaatat gttaaacttc 14161 tggaaaaatc aacaacgctc ttagatttgt agaagaaagg aaaaaatcac cagtggaaag 14221 gagcaatttt acttacacaa acacagagaa ggtcttacag tgaaaaaaag ctaaccagta 14281 aggggaaaag caggcagagg ggtaggatgt gatttgtatg ttatttatat ctaacacaag 14341 tcttccacac cgaaaggaaa atattaagat tataatagat aaatggcaaa atgatgagtc 14401 atttacacaa taaaatgcaa attagagcat gtttgggtta tcattttaca tctattaaaa 14461 taaccaaaat aattaatagt aacagcaacc cttgctggaa ggttgcccaa aacttggcat 14521 tttcaagtgt ctggggaggt ggcagggctt tggggtcaca aagatggttc tgcagtcaat 14581 tttgtgacct tggacaggct acctaatttc ctgatcctcc ttttgtccat tcatagaatg 14641 gaggaaatga tagctacttt ctgcgtctgt atgtatgagt tattgggggc atttcgaacc 14701 agtgacaaac attttgttaa gcaatctggt gatgcattaa gaagctggaa gctgtgaccc 14761 agaaacccca ctcctgagaa cttacctgca atggaagaaa caaacaaaca aaaacaggca 14821 tgtattccta gcagaatgat ctaaaattag aacacctgga aaagagccta aatgtataac 14881 accagggcag tagctaagaa aattatgaca cattaactga aatgaacatt atgtaaccac 14941 taaaaatcat gattttggag cctgtgatat gtggggaaaa actgacaagt aaaaaagtgg 15001 gttattaact gcacctgctt actctaacgt gaacgcatat gtgaaaaatc tgaaaggaaa 15061 agcacagaaa atggacgttt tcattgaaat tgtcggtgat cttaattttc ctttggtgaa 15121 tatattgctc tcactaaggg cgtttaaaaa atagttcaca ggttttaatt ttttagatga 15181 aatggaccca cagttttctg taagagaaag gagagattgt tatatttgct acttagaata 15241 aaagattttt agccaacgtt gtttcctttt tcaaatattt ttccattttt ttagttgatt 15301 aatgatttag taatttgtgt attgggtttt tttaagaatc agttcttaga ttcatttatc 15361 aattctagtt ttttgttgtt gtttttaagg actcctgaat atttttcaaa actgaacaat 15421 ttcagccatg tctgagcttt ccgtcttcct ggaggcacaa atctagttta gctgaaccac 15481 aacagattgt acatatcctg cagaacctct gtggtcttag gaaggttgaa agtcaccaaa 15541 tgtcacagaa aatgagggtc aggaaaggct gtgatcacaa gctcttgatc cagaggccac 15601 tcgggtgcat ctgtgtgact gacacctcat tgcatcatca tttcttgtaa aatcagctta 15661 attttgcaga gaaatggcca cagattttta gaattatgct gaattatgta agggagcagc 15721 cattttaaaa ataatgtaat caaataacaa taataacata acagaagttt ctgcaccaca 15781 ctgacactga acatggctgt caaccaacta gccaaaccaa ggtgaaagat tctctgaact 15841 tcagttttaa cttacttcgt ccattctccc ttttcagatc ttatttcttc ttccaaggca 15901 actatatcca ctgtgaactg ctgcatatta aacattcaaa acatgtctgt atctaaccca 15961 gtgatggcta cagcattata aaataactcc taagtgaata gtgtgtgcag caattctcac 16021 agctaggaga attttttttg tcgatgtgtt aaccattcct ttttctccaa ctgcattcta 16081 cttaaactgt tacagtattg tgtgatcact ttaagggagg taatccttca gtgaaaagca 16141 ttataaatgt tgaagattag acccctgagg acctagcact gaagcctgat cacgggtgcc 16201 aattccctga gaagctttta tagattccca aatcccatcg ctaggcactc tctgactggc 16261 ttgtgatggg ttccagcaat gttgtttttt aaaagctcct caggtaattc tgatcatgga 16321 ccaggttggg gaccctttgc tctgttcagt ttgtggtttc taggtgccag tgagaaaccc 16381 ctacccccaa tttttgtgca aaattttttt cagcttacat caagttcttt atgaacctca 16441 gaaaattaaa atataacaag ggaatttact tagctcagaa gagacaaggg acattatggg 16501 gtgaaggtcg ggggttgttt gacatttgcc ccttcccgat gtaactaatt aagcttcaag 16561 ttgtaatgtg ttttatgggc tagaaggctt tagtgcctat atttggatct cagctccaac 16621 accttggact cttacaacct tgggataagt tcttactagc tctgactctc actccccatc 16681 tataaatggg aataagatga acggggtaat ttatagattt tttagtaccc tctctctagt 16741 tccgctgttt tgccattctg ctcatgcacc aatgccatgt gatcttattg tagctttgca 16801 ctctgacacc tggcaggtca agtgcaccct tgattatctt ctcattaaaa attttcttca 16861 gtcttttaag ttttgagtat aatttcataa tgtcttttaa aatctctttt gaattctttt 16921 tcaaatttat ttattttttc tttgacagat aagattatat gtattgtgta caaaattaga 16981 aagagatctt ttcctctaag atctgaaaca agacaagaat gctcacacgc actagttcta 17041 ttcaacatat ggtactgcaa gtcctagcca ggacagttag acaagagaaa gaaataaaaa 17101 gcatccaaat ttgaaaggaa aaagttaaat tgttcctatt tggagacgac atgattttac 17161 acttataaaa ccctaaagac tccaccagaa aaccattaga actaataaat tcagtaaagt 17221 tgcaggatac aaaattaaca tacaaaaatc agtagtgttt ctatatgcca acagtgaact 17281 atcaggaaag aaataaaaag aaagtgccat ttacaatagc tacaaaataa atatttcaga 17341 ataaatttag ccaagaaggt gaaagatctc tacactgaaa actataaaac actgatgaaa 17401 gaaattcaag gagacacaaa tagaaagata gcctgtgctc atggggggat aattaatgtt 17461 gttaaaatgt ccatactacc caaagtgacc tacagattcc atagaatccc tatcaaaata 17521 ccaatgacat tcttcacaga aatagaaaaa acaatcctaa aattgggtga aggtggcatc 17581 tttgtctcgt ttcagatcta agaggaaaag ctctcaattt ttcaccatta agtatgatgt 17641 tagctgtgac cttgtcatat gtggccttta ttgtgttgag gtacaatgct tctacaccta 17701 attcattgag agtttttatc atgaaagaat gttgaatttt gtcaaacgct ttttctgcat 17761 ctattgaaat gatcgtatgg ttattgtctc actctgttac tgtgatgtat cacacttatt 17821 gatttgtgta tgttgaacca tccttgcagc cctgggatga atcccactgt atcatagtaa 17881 atgatctttt taatgagctg ttgaatttgc ttcgctagaa ttttgttaag aatttttgca 17941 cctatgttca tcagggatat tggcctatac tgccattttc ttgtgtgttt ttgtgggact 18001 ttgacatcag gataatgctg gtttctcctg ttattaattt ctagttttat accatcatgc 18061 ttagaaaaaa aacttgatac gatttcagtc taattaagtt tgttaagact tgttttatgg 18121 cctaacatat gatctgtctt ggagcatgct ccatgtgctg ttgagaagaa tgtgtattct 18181 gcagctgttg gatggaatgt cctgtatttg tctgtttgga tccatttggt ctaagtgaag 18241 tgtaagtctg atgcctcctt atttattttc tttctaaatc tcccgtccaa aaatctaaga 18301 gatccaaagc aaacaggggg ttataaaatt acctttgaat tctgactgaa attgtgttta 18361 ctttgtagat tatttgctgg tgacttggta tctttcaata ttggtttccc aaacaaaaaa 18421 cattgtatgt ctttctcttt ttccagtctt cttttatgtc tcagtaaaat tttgtagctt 18481 gcctactact tattttcagg ttaaatccta gatattttca attcttactg ccattatgaa 18541 tgaatgtttt tccccatgaa atgtttaact acagctgatt tagggactca ctgagttttg 18601 atttgttgat ttgaaaaact accaccctgt tgagctcatg tatttattct aagacatttt 18661 ttgagatttt ccaggtaaaa aaattatacc tagatgaatt tatagaacaa cctaatagca 18721 taaaagcaat agattattgt tcaccacatt aatgggaatg cctttcctct taaaattatg 18781 atatcttggg ctttgtgttt ctgacagcat ttctttatga tggaaagaga ctatccttcc 18841 agttctatag aaacgtttta aaaatagctt acatttttaa aaatttagga atgatattga 18901 ttttcattga aaacttttca acatctgaag aaaatctttt cttcctaacc tattaataga 18961 gctcctgatt ttcaaccatt cttgtattcc taaaataatc catattggtc ataatgtatt 19021 cttcttttac taaactgtta aaataagttg ggtgctgttt ttttagattt tggcacctgt 19081 gtgagaggag ggtgccatgg cagggtggct aagtgtgccc acctgtgtac tagcattgaa 19141 ttttgaccct ttatcccatt ggctctaaac actacatctc aggtgaagac gcccaaatgt 19201 ttatctctac tgagttacag attcctataa caattgctta aattgatact cccacttgaa 19261 tgtctcatag gtgtggcaaa gtttacatgt ctatactgaa actcttggtt ttctctccca 19321 ggcatgaccc tccaccagtt tgtcctattt cagtagatgg catcaccgcc atctccctgg 19381 ttatggaagt cccagacccg ggcaactctc actcctccat ccactcaacc agcaggtctt 19441 attggctctt tctgtaaaat atgtctcatt tattcacagc ctgctagctg catggactcc 19501 atcccaggcc acgccaccat cttcctgcag cagtttctta actggcctct ccattccctt 19561 tcttgtccca ctaaattttc ttctatttaa gaaaaaatgt tttgtagaca tggggtttca 19621 ccatgttgcc caggctggtc tcaaactcct gggctcaagc aatcctcctg cctgagcttc 19681 ccaaagtgct gggattatag gcttgagctg agcttttgaa aaaacattct atgatgttac 19741 tgtcctacta agacccttta atggctcccc actgctttca gaggaaaata gaaatttctc 19801 tctgtggacc atgaggccct catgatctga ccctcaccac ctccctagtc ttagcagtcc 19861 acctgccact gtctcatggg gctccacaga tttgatctcc tttttggtcc ttaagcccac 19921 caagctaagt atcttattca aaactgtgtc ctcagcccct agcacaattc ttgaccctta 19981 gaattctgac acttgaaatg tgtgttaagc taatcaataa attattgtgc caggcttggt 20041 ggcacaataa tgtgttgtgg gtcacctata tatagctgtg gctcacacct ataatcccag 20101 cactttggga ggctgaggcc tgcagattgg ttgagcccag gaatttgaga ccagcctggg 20161 caacatggca agaccccatc actattaaaa atacaaaaaa aaaaaaaaac caggcgtggt 20221 gttgcatacc tgtggtccca gctactcagg aggatcactt gagcccactg ggtggaggtt 20281 gcagagagct gagatcatgc cactacactt tggcctgggt aagaccctgt ctcaagaaaa 20341 aaaaaaaaag gttatggtac tgtaaatatt atattacagt gtataatttt tgacagcttt 20401 ttttctccat ttcttctatg gttattggtc agttctagcc ttctattttt ttttttaagc 20461 tgacacatac tggctgggca tggtggctca cacctgtaat ctcagcactt tggggggcca 20521 agacaggaag atctcttgag cccagaagtt tgaggcgagc ctgggcaaca tagcaagacc 20581 ccatctctac cagaaaagaa acaacaacaa aaagaaactg gcacatacta gttattccat 20641 aaatatttct taaatgaatt aattctaaag ttgattttta taatttacat cttaatagaa 20701 agacaatcca tctcagagat atttttattc attaaggaaa gttgcataaa gtaatttatt 20761 aaccttttaa agcattctct aagcctattt ccccttatag agatatgttt tctagtgatg 20821 tgtgtatgtt acattttcaa ataactctta attttattta gaaatttcac atttttgtgg 20881 gttttattta ttaaattcga ttttcatctc tattattaat ttatgacagt ttattttgaa 20941 gttttctgtt ttcttccttt tgagttgaat aatcatttac tttattttac ttcttctatt 21001 cagttgtatt atccatgttc ctctgatagc catgcaggtt ccaaatcctc acatttttca 21061 acttctttaa aagatttttt ttgtgctatc ctgataagcg ttaagtgtta tatcattttt 21121 tttcattttt attatattct taatgatttt gattctcttt ttctatcctt ttggttttct 21181 ttcttaggta aatgatcagt tatatctttt ggccattttt ttctgttgga attactgtct 21241 cttactgctc tgtaagaatt ccttgcaagt tctagaaatg aatcctctgt tgattttaag 21301 tactaaagtc ccctttacca ttgtgtactc tttctattaa ctttgtccag agtcagcttt 21361 atcgtgctga acaagaatct ttacatttta attttttttt ttaattttta agttcaaggg 21421 tacaagtgca ggtttgttcc ataggtaaac tcgtgtcatg ggggtttgtc atacagatta 21481 tttcatcact caggtactaa gcctagtacc cattagttat ttttcctgat cctctccctc 21541 ccttagccct ccgataggcc ctagtatgtg ttgttcctct gcatgcgtcc atgtgttctt 21601 atcatttagg tcccacttat aaatgagaac atgtagaatt gggttttctg ttcctgtgtc 21661 agtttgcata tccctgcaaa gaacatgatc tcgttctttt ctggggctgc atagtattcg 21721 atagcatata tgtaccacac tttcattatc ccagctatca ctgatgggca tttaggttga 21781 ttccatgtct ttgctattgt ggatagtttt gcaatgaaca tacacgtgca tgtgtctttg 21841 taataggatg atttatattc ctttgggtat atacctagta atgggattgc tgagtcgaat 21901 gatatttccg tctttaggtt tttgaggatg taatcaaatt catcaattat ttgccttatg 21961 gtttgtgctt tttgtttaaa aagttcttcc cccattcctc ctccaggtca caaagacatt 22021 tcccaatatt ttcctcttct agcgccaaac ttgtaccttt cacatgtaag tcttggatcc 22081 atctagagtt cacttttgta tgtagttagg gatctagttt tctttttctt gtttcccaga 22141 agcatgaaac cttcctcctt tattggttcg tggtgcccat gtcagcacac atcaagtccc 22201 tgttaaagtg tatctacgtg ggtttctctc tcaacttgct tttctgtttc ctttggtctc 22261 tccgtctatt cctgagcttt accgtccagt cttcgtcact gtggttttgt agtgatttca 22321 tgtctgcttt ggcagattcc ctcctcttta cacttttttt ttaaggttga ccttattatt 22381 catggacatt ttcttcccca tagaagttta ctgaacttct caaaaaattg agattacctg 22441 gaattttaac tggcatttta ttgcaactac cagttcattt aggataatgg acatctttat 22501 aatactgagt tatcgccatc aagaccatgg aatgcctctc catttttttc agaccatctt 22561 ctgtgttctc tgtataatgt gtcacccatc agaagtcttt tcttagttaa gtcaattcct 22621 agacgcttta cagtttggtt gttactttga atatctgtat tttcttctgg tcagttattg 22681 ctactgtaga ggaatgccgt tggttttcgt aggctgacct tcagtcccac aaccttgcac 22741 tctagcgttg gttctcatgt tatccctgtt attctgttgg attttctatg tgcatgatca 22801 cactaattgc aaatagtgtc agctagagca ctttccttgc cattcttaca cctgttactt 22861 ctttgcatat ggccttggac agcaccttga gtcccatgac aatggatgtc cctgttgtat 22921 ttctgagtgt attcttagca ttcttcttag taataaacag gtattgacca ttaccatata 22981 cttaatagga tatcttgaaa tatgccagag gactaggcca ctacttgact ggtgacagag 23041 agtatcttag atgtataggc ccccagacat gccagtgtcc tctctcacca cccatctcca 23101 ctccaaagtt gtctggcctc acccaccttc tgtgcctggt tcttctcctc tgtgcttctc 23161 tcactatgcc atgtcctcgc ctgttcaggt gtctctaaga gctcatagat ccttaactct 23221 tacagcttta aatgaggcac attcaaagcc cgactcagtg tcattccctg cagtctcctc 23281 ttcctcccat cttacccatt tcaggtaata tctccagcca gccagagtct actgcagtag 23341 ctggcacatg aaataaatgt ttatttaact gaattgaaaa tgttagctgg ccattgtcaa 23401 aaatactatt ttgtaaattt ataaatatgg ttctctttgt atatccatta taaatattta 23461 gtatttagtt tgattattcc ttatgcttta taacatgtgt tttcccattc attcatttat 23521 ccagcattta ttttgcatgt ctatgggtca agcgctctgt ctaggaacac tgctgtgaac 23581 aaaaccaaac tcccgcccta aaggagcctg cactcccgtg gagaacatga ataataagca 23641 cagaggaaat aacataatat ctcaagtagc tgtaactgct ccagagaata atgaagccag 23701 gaaagggggt gggctagggg gtgctgtttt aggtagagtg atgggaacag ccccactgag 23761 caaactttag ccacatgagt agctggaaga aaagccttct aggaccaggg aacagcaagt 23821 gcaacagccc tgagacagga tgggcttgtc agtttgagga gcagtgggag gcctgaacca 23881 ggttacatgg ggcccagcca gtatggccac gactttgtgt tttatccaga gtacaaagga 23941 gcctcactga gggacaaggg aagtggcatg atgtgacccg catattaaga ggagagcgct 24001 caatggcagc caggggagga gcagggaggc tggttgggag gctgttgaag aaatcaggtg 24061 agaagtgatg gaagcaccga ataagatggt catgttggaa aaattgagaa gctgaggtgc 24121 ttagcattga ttttcaaggt agagctactg agatttgctg atagatccaa tgtatgctgg 24181 gagagaaaat tcagtcactc tagagcattg gctggatttg tcacccattg cagcgaatgg 24241 agaaggtgct gacataaaag ccctttagac tgaaagctac tgactgaggg gatggtgccc 24301 tagtttgatt tcctggggtg tatgagtagc agggaggcca agagctgggc ctcacgagat 24361 tgtggggact acatcaggga agcaggcctg caaagaacct gcaccacaga cccccaccta 24421 aaagagcctc caaaaaccct aactggcatg acctgagtac tttaactctg cttgtaactt 24481 tcaaatatat tttgatttgc ttatgaccac caaggcaaaa cttcccattt caataatgtt 24541 agtagaaaca taaacaggat gttaatttat ttatttggta attcttttta tgaatattat 24601 ccagtttaat cattagctct gaaggagatg aaaaataatt ttctaatttt tagaaaaatt 24661 tgcagctaat tgggtgataa aggtaagggg tttctgagtt cacaaaaatg ttctaaaatt 24721 ggcaacagtt tcggttgcac gtttcattaa atgtactaaa aaccattgaa ctgtacatgg 24781 tatatggtga attatatggt atgtgaatta tatctcaaca agctgggtat tgttttttta 24841 aaaaataaaa ataaaaaagg agaaagagag agagaaaaac aattgcagat catcccagca 24901 ctgaggacaa gactaacttc agtgttccag tatatgccat tataggtttt acggcacaca 24961 gtgatgattt ggagcctatg atttgaccta gggtacagca ggtactgttt agcaatcatt 25021 ttactattgt cataggtctc tgctcttgga gctaagtgcc cagggtaaat gagatcttta 25081 atttgaaaag agatttttga tttgatgaat gtacattctc caaagggtca taaattgtca 25141 ttctggatgt ttgatctgtt tgttgttttg gtacaaaaat tagaagaaaa taattcactc 25201 ataaaatgtt aaataatgaa ctaaaagtca ttcatcaagt ccataactta gggtcacatt 25261 tgtccttgga gcaggagaaa gagttgtgtt cacccttttc ttacttttgc ttttgtccta 25321 agtgcttcag agaagtacag ggtggcaaca gtgtttctac tgagcagctg ataccattgc 25381 tatgcactca ttcattatgc aggaaacatt tagtaatttc aacataaata tgggactctg 25441 acgttctcct cttcattttg cagagcagtc attatggcga accttggctg ctggatgctg 25501 gttctctttg tggccacatg gagtgacctg ggcctctgca agaagcgccc gaagcctgga 25561 ggatggaaca ctgggggcag ccgatacccg gggcagggca gccctggagg caaccgctac 25621 ccacctcagg gcggtggtgg ctgggggcag cctcatggtg gtggctgggg gcagcctcat 25681 ggtggtggct gggggcagcc ccatggtggt ggctggggtc aaggaggtgg cacccacagt 25741 cagtggaaca agccgagtaa gccaaaaacc aacatgaagc acatggctgg tgctgcagca 25801 gctggggcag tggtgggggg ccttggcggc tacatgctgg gaagtgccat gagcaggccc 25861 atcatacatt tcggcagtga ctatgaggac cgttactatc gtgaaaacat gcaccgttac 25921 cccaaccaag tgtactacag gcccatggat gagtacagca accagaacaa ctttgtgcac 25981 gactgcgtca atatcacaat caagcagcac acggtcacca caaccaccaa gggggagaac 26041 ttcaccgaga ccgacgttaa gatgatggag cgcgtggttg agcagatgtg tatcacccag 26101 tacgagaggg aatctcaggc ctattaccag agaggatcga gcatggtcct cttctcctct 26161 ccacctgtga tcctcctgat ctctttcctc atcttcctga tagtgggatg aggaaggtct 26221 tcctgttttc accatctttc taatcttttt ccagcttgag ggaggcggta tccacctgca 26281 gcccttttag tggtggtgtc tcactctttc ttctctcttt gtcccggata ggctaatcaa 26341 tacccttggc actgatgggc actggaaaac atagagtaga cctgagatgc tggtcaagcc 26401 ccctttgatt gagttcatca tgagccgttg ctaatgccag gccagtaaaa gtataacagc 26461 aaataaccat tggttaatct ggacttattt ttggacttag tgcaacaggt tgaggctaaa 26521 acaaatctca gaacagtctg aaataccttt gcctggatac ctctggctcc ttcagcagct 26581 agagctcagt atactaatgc cctatcttag tagagatttc atagctattt agagatattt 26641 tccattttaa gaaaacccga caacatttct gccaggtttg ttaggaggcc acatgatact 26701 tattcaaaaa aatcctagag attcttagct cttgggatgc aggctcagcc cgctggagca 26761 tgagctctgt gtgtaccgag aactggggtg atgttttact tttcacagta tgggctacac 26821 agcagctgtt caacaagagt aaatattgtc acaacactga acctctggct agaggacata 26881 ttcacagtga acataactgt aacatatatg aaaggcttct gggacttgaa atcaaatgtt 26941 tgggaatggt gcccttggag gcaacctccc attttagatg tttaaaggac cctatatgtg 27001 gcattccttt ctttaaacta taggtaatta aggcagctga aaagtaaatt gccttctaga 27061 cactgaaggc aaatctcctt tgtccattta cctggaaacc agaatgattt tgacatacag 27121 gagagctgca gttgtgaaag caccatcatc atagaggatg atgtaattaa aaaatggtca 27181 gtgtgcaaag aaaagaactg cttgcatttc tttatttctg tctcataatt gtcaaaaacc 27241 agaattaggt caagttcata gtttctgtaa ttggcttttg aatcaaagaa tagggagaca 27301 atctaaaaaa tatcttaggt tggagatgac agaaatatga ttgatttgaa gtggaaaaag 27361 aaattctgtt aatgttaatt aaagtaaaat tattccctga attgtttgat attgtcacct 27421 agcagatatg tattactttt ctgcaatgtt attattggct tgcactttgt gagtattcta 27481 tgtaaaaata tatatgtata taaaatatat attgcatagg acagacttag gagttttgtt 27541 tagagcagtt aacatctgaa gtgtctaatg cattaacttt tgtaaggtac tgaatactta 27601 atatgtggga aacccttttg cgtggtcctt aggcttacaa tgtgcactga atcgtttcat 27661 gtaagaatcc aaagtggaca ccattaacag gtctttgaaa tatgcatgta ctttatattt 27721 tctatatttg taactttgca tgttcttgtt ttgttatata aaaaaattgt aaatgtttaa 27781 tatctgactg aaattaaacg agcgaagatg agcaccacct cccgtgtctg cagttgtatt 27841 tcctggtgct tgccctgtgt tggggactgt tttgggggtt aatctgagcc aagtggcgct 27901 ttctgtcctc ccttctcaag tgatggccga tggttcacgc acttccccct gttcctgccc 27961 ttgtcctcac ttcccagtca cccactagtt catctctgcg gcttttgcat tttctccaca 28021 agcatctaag tgggcttagc actggtaaac tgcaaaggca ctattgcagc aggaggaaca 28081 gtctgggagc ttttttcagt cctggattta gaaatagatt ttcttgatta aaatgaaaat 28141 taacaagctc taaagaactg ttgacccttg aactacacag ggattagagg cactgacctg 28201 ccgcacagtc gaaaatctgc agagaagttt tttttgtttt gttttgtttt ttttgagacg 28261 gagtctcgct ctgtcgccca ggctggagtg cagtggcggg atctcggctc actgcaacct 28321 ccgcctcccg ggttcaggcg attctcctgc ctcagcctcc tgagtagctg ggactacagg 28381 catatgccac catgcccggc taatttttgt atttttagta gagatggagt ttcaccatat 28441 tggccaggct gttctcaaac tcggcctcaa gtgatctgct cgcctcagcc acccaaagtg 28501 ctaggattac aagcatgagc caccgcgccc ggcctgcata gaacttttaa ctcccccaaa 28561 acttaattgc taatagatta attgcctgct gttggctgga agccttacca ataacgtaaa 28621 cagttggtta gcacatattt cacatgtcat atatactata tactgtattc ttaccataaa 28681 gtaagttaga gaaaatgttg ttaaaacaat gaggaaggaa aagtatattt actattcact 28741 gactggaagt ggatcatcat aaagatcttc atctcatctt cctcacattg aggaaggctg 28801 aggaggaagg ggaggggttg gtcttgctgt ctcctgggtg gcagaggcag aagaaaatct 28861 gtgtctcgtg gactcagttc caacccgtgg tgttcaaggg tcacctgtgc ctgctttatg 28921 caccccaaat caggctctaa aatagtctca ggtcctgtga gccttagtga cacactcttc 28981 tcagattcaa tacctttgat ctgaaaggtg actttgattt gtcataatct tccaactcat 29041 tctctacatg tttcgacgca ccagcactgt gatgttcttg gttatccttg gcagaagttt 29101 tctcctgcaa ctaatttgct aaaggcaggg ggtaatcaga agtgacagtg gttgaaatca 29161 caggccaata tcttagcaaa cgtctgtgtt gtaaatgggg aagtgcctct ggtggggtct 29221 tcaaggatca ctccagcctg gctagggaaa ctctagggga gaggcacttg aaattattgt 29281 actcataggt cctagagagg gaggcatgtc atgccaagca gggggctgga ttggaggcgt 29341 gcccagggat tgggtgcaac tcagcgtgta tgagacagaa agagagagaa ctcctgggca 29401 agtgccttta ctgggagcca ggatggagga cacaagcaca aggtgtaagg ggatctcact 29461 ggtgtgtttg aatgtcacta agtcacagtc aggggaagac aagaagtgga acttgtggca 29521 gggaccagcc ttattacact ggcacctggt tacctggaca gggtgcccac tgcctatttg 29581 taggatgtca aggtatcagg aaaatatgaa gtttttaaaa atttacaata taatgccaac 29641 ctcctacggt gatcagagga tcagctttgt attaaagatg gaaagatacc aatcatcacc 29701 cacaaaacta ctaaaacaga aagaatgaca aatacaattc tcggccaggc acagttgctc 29761 acgcctgtaa tccctgcact ttgggaggcc gaggcaggca gatcacttga ggtcaggagt 29821 tcaagaccag cctggccaac atggtgaaac ccccattccc tactaaaatt acaaaaaatt 29881 agctgggtgt ggtcgcacgt gcctgtaatc ccagctactc aggaggctga ggcaggagaa 29941 tcgcttataa ccgggaggca gaggctgcag tgagccgaga tcccgccatt gcactccagc 30001 ctggcggtaa gagcgaaact ccgtctcaaa aaaataaaat aaaataaact tttcattacc 30061 tgcctataat aatttctcaa aaagagtcaa ctgaggagac ttaaagtgtt cgaagcatag 30121 ctacttgtag gaccagagtg aataatatcc acctgtatct ttcccaggcc tcatgtttcc 30181 tttgttattt caaacgtgtg agtcatgcaa aaggaatttt tactaaaaag gaaaagaaaa 30241 aaatttaaaa tgaccagcat aaacataatt gagttcaatc ccccagaaag ggttattcgg 30301 ggaacagcca ggcgctgccc ttgtttttcc ccctccccat ctggggctaa gaccaaaagg 30361 ggtctgtctt tggagggaaa tctgggtctg tacagcttcc cactctaagg ggagccttaa 30421 tcccagccgg aacctgagct tctccaggca aggcctgacc ttgccgactg atctaagcat 30481 tctgcgcttc attttcaccc cagaaaggtg acttcctcct ggctgggggg ctcctcccct 30541 aggactcagc tcatcctgaa ctaataaccg cgttatgtcc atcctttgtt ccgttctttg 30601 agtcccaccc atgttcttca ggaaagctgc agctggtcca cgtgccttgc ctgtgcttct 30661 gtagaactac ccctcccttc tctgcctgca gcagggctcc tggaacttga atgtgcactt 30721 gaatctgcag gtcctgactg taggtctgtg tttctaacat gctgctgggt gacgctgtgc 30781 ccgtcctgtg gtcggcattg taacaaggct ccgcgggttc ccacgccgga gtctgcgtca 30841 gaaacctctg gagggcttgt tatacacaga tggctgaacc ccatccccag agtttctgat 30901 tccctagtac cagggtgggg tctgagaatt tgcatttcta acaaattccc aggtgatact 30961 gatgctgctg gtgcggagag gtcacatgga aaccactgct cggcagatgc aaaagaagca 31021 gaagccgccc caaccctgaa atactcacaa agctgctgtg gagaccagac acctgccggg 31081 ggaatgagag cctggggatc ctgcctggga ggaagagaga gctgggagct ctatgggtga 31141 atgccaatgg aggtgcaagg aggtgactaa ctcctgccgt ctgaaaatag tgaagcacag 31201 ccgaggcctg aagcccaccc ttagtgagag atatagttaa acttggaaat ggtgcccagt 31261 cacttgaaaa cagcacttct cagattcaga tgtacacatg aaccacctga ggatcttatt 31321 aaaatgtggg ttctgaggct gtagctctga ggtgagctca gagattctgc agttctagaa 31381 gcttccaggt gatgcaatgc tgctggtgca aggaccgctc tctgagaaac aaggacctag 31441 aggtccaggt catcttggag agctgttcta accctggcca agagcttcgg gaggcactgg 31501 ggctcaggca cccggcaggg gagtcttcac tttccttgtc tgaactctaa gtgctgcctc 31561 tcacacacaa ttcccttgtc tggacccaga caacagaatc agtgcagcgg cttatctcag 31621 aactaaagac agttgactgc ccctggggaa gggagggctc tctctgagga agtctcaaac 31681 accctgctgc aaaattttca ccctcaggat ccccaatttc cctgaacctt aacagacttg 31741 cttgttctgt agaaattgcc tgtaatttcc aagataaaag gactagaaag gtggcctttt 31801 cacctatgtg gcttctcttt tttttttttc ttttgagata ggatcttgct atattgccca 31861 ggctggtctc aaattcctgg gctcaagcaa tcctcccatc tcagcctccc aagtggctga 31921 gatgacaggc acgtgccact atgtctggcc ctatgtggct tctccaaggt gaatatgggc 31981 cttgtggggc tccaggtggc acatgtaagg tgtgtatagt agttttgtgt tcagacttaa 32041 atgagactca aaattatact ggggaattct caggggcaga cccctccaaa atcaaagacc 32101 tgtggggcaa agccccagca tcctgcccca atgagctggg gctgactccg acagctgctg 32161 cttctgggtc tatcatctcc tgatgctttg acaggccaaa gcattagcct caggttagcc 32221 tctgaggcct gtcctgtcct atgcagtccc aggcagtact gctccctgtg cctgagacac 32281 tggcttctcc acatagacct cagacacaga actggggcca ctgccctcac tgtgcacttt 32341 aaagggtcct cctccaggcc tccactcaga gcctactgtc ccgctcttcc taaaccacat 32401 tccaggtgcc cagggtgttg gaattatctg cctcagggcc cctgttgtcc tcctatctgt 32461 cctcctacga gtggaccaca ccaccagtat acacagctca atgtccaaga ggcagccacg 32521 acatcactgt ttgggtctgg tggctaagaa gaaacttgga catgtaggct gggagtccac 32581 atccatgaac tcaaggcccc caggaggcac acgatgaatt cgagtgggaa gagaaggggg 32641 taggccacgg gacagaggcc tccctctgga gtcttcctct aaccgctgca aatgttagag 32701 gcagttctac ccccaaagct tctaggggaa aggagtgaag gatttggcaa taaagtcctt 32761 agtttttgca aatgtctgca agtcataaaa gtgcttttaa gttctccaca ggtcaggact 32821 ggtcatgtta ggttcttaca cctctcccat agtccctcca cccctagaca ttgggagact 32881 cccagtcaat tcaccaccca ggatggctgc ccctcacctt cttcccaggg ccccacatcc 32941 aattaatcca aatacaccca tgtattcaag cacctttctg atgaacctct agaaagtagc 33001 cagtgctgca gaaatttctt gtagcatgtg ctgaaatttc taggatccta ccagtcaaag 33061 aacaacagtc tccttgcttc caccagggaa tttgctaaca cttgcttcta agatctgacc 33121 tctcagagcc actgtggtcc atcaatcaat tcaaaaatta tagtggcctg gggttgtgtg 33181 ggtgaggtgg ctgcttggga ggctgaggtg ggaggatggc ttaagcccag tagtgcaagg 33241 ctgttgcaag ctgtgaccgc accactgcac tcctgcctgt gcaacagagc aagacaccat 33301 ctttataaaa acaaaccagc agccagacgc ggtggctgac acctgtaatc ccaacacttt 33361 gggaggccga ggtgggcgga tcacgaggtg aggagtttga gaccagcctg gccaatatgg 33421 tgaaaccctg tctctcccaa aaacacaaaa actatctggg tgtggtggtg tgtgcctgta 33481 gtcccagcca ctcgggaggc tgaggcagaa gaatcgcttg aacccaggag gcggaggctg 33541 cagtgagccg agatcgcgcc actgcactcc agcctgggtg acagagtgag actctgtctc 33601 aaaacaaaac aaaacaaaac aaaacaaaac accagcaaac aaacattgtg aactacaatt 33661 aaatttattt aatcacagta ctaaatgcta cacagcaact gtgtagaata ctgcatagca 33721 atgaaaagcc atgctttaaa agagttgtgc cttacatgag gaagtgtgca taatataatt 33781 ttaattgaaa atcagattgg caggataatc attttttata aaacaacaac aacaaaaaaa 33841 caatgtccgg gcgcggtggc tcacacctgt aatcccagca ctttgggagg ccgaggcggg 33901 cagatcagga ggtcaggagc tcgagaccat cctggctaac acggtgaaac cccgtctcta 33961 ctaaaaatac aaaaaattag ccgggcgtgg tggcgggcgc ctgtagtccc agctactcgg 34021 gaggctgagg caggagaatg gcgtgaaccc gggaggcgga gcttgcagtg agccgagatt 34081 gtgcctctgc actccaacct gggtgacaga gcgagactcc gtctcaaaaa taaagtatat 34141 atatatatac tgtacaaata cacaggaaaa aaaataacag aaatcttcag gccttactgt 34201 taacagtggt tattgctggg tggtggataa caggtgattt ttttatttcc ttctgtgttg 34261 tttttatatt ttccaaaaaa tttacagtgt atcaaattga ctggatcaag gggtatgtgt 34321 ttgcgtgtga gtgtgtctat tctgagtgtg tctatgaggg tgtttctgga taaggttagc 34381 atttgaatca gtaacctgag tgatgaaaat tgttctcacc catgtgggtg ggcaccaccc 34441 gatcttctga ggtctgaata gaacaaaaag gcagaggaag gaggagttca ccccttgttg 34501 cctcctgcct gcctgcttca gctgatgcat ccgtctcctc atgcccttgg actgggattt 34561 acactgccac taccctggtt ctcaggcctt tggacttagg atggaaacac aaccccagct 34621 ttcctgagtc tccagcttga agacagcagc aggtcatggt acttcagagc ctccataatt 34681 gcatgagtga gttcttcctg tatacatgtc cttctggttc tgtttctccg agaggtctgg 34741 ctgatacata cctgaccaaa tattgctttt acaatgagaa aaagattggt aactattatt 34801 tttaatcagt aaatggtatg caccaaacct ggccggtctg ccttcagggg ggcctctctg 34861 tagcattttc tcagacataa gtacaacttt tgagagtgta acatgccagg ctctgtgccc 34921 aacatttggc acatatagcc ttacctgatc ttcacaacgc tgtccatttt aggaagctca 34981 cattcagaaa cgtttattct gtcacttacc cagggtacac agtatgctcc ccaaaagtat 35041 attatggaaa cacatggcca ttgatttgtg atgatattta agagtttggt atagaggtcc 35101 tgagcccgga gtgctatggg cacaactcat gacctatgtg aatggtgcct cctggacttg 35161 tgcagggcac cagcttcaag ggactctgct gactcccagc agtacctggg ggaattgtca 35221 gcaatcctaa aagcaagctc agcctgcaag gctgcctacc ccggacagat gatgccccct 35281 ccaattgagc catgatccca tagagccgta gaaaatgggt cacctcctgc aggaagccac 35341 cagggtgctt gcatggctgg gaactctttt ttctctaact gtcccagacc atctgctgct 35401 ttcctccctg cttccaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaagtca ttctctaaat 35461 ttttatggtg ggcaaagtta tcagttttca gtaagcccat tccttagctc ccctatttga 35521 tc // LOCUS HSU01212 3718 bp DNA PRI 03-AUG-1994 DEFINITION Human olfactory marker protein (OMP) gene, complete cds. ACCESSION U01212 NID g520739 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3718) AUTHORS Buiakova,O.I., Rama Krishna,N.S., Getchell,T.V. and Margolis,F.L. TITLE Human and rodent OMP genes: Conservation of structural and regulatory motifs and cellular localization JOURNAL Genomics 20, 452-462 (1994) MEDLINE 94307732 REFERENCE 2 (bases 1 to 3718) AUTHORS Margolis,F.L. TITLE Direct Submission JOURNAL Submitted (02-SEP-1993) Frank L. Margolis, Roche Institute of Molecular Biology, 340 Kingsland Street, Nutley, NJ 07110-1199, USA COMMENT On Aug 3, 1994 this sequence version replaced gi:457938. FEATURES Location/Qualifiers source 1..3718 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HOMP2, HOMP3, HOMP5" /clone_lib="genomic library in EMBL3 from one caucasian female, Clontech Labs." /chromosome="11" /map="11q13.5" enhancer 449..459 /note="distal Olf-1 binding site" enhancer 510..534 /note="UBE binding site, putative NF-1 element" enhancer 976..986 /note="proximal Olf-1 binding site" CDS 1245..1736 /note="intronless open reading frame" /codon_start=1 /product="olfactory marker protein" /db_xref="PID:g520740" /translation="MAEDRPQQPQLDMPLVLDQGLTRQMRLRVESLKQRGEKRQDGEK LLQPAESVYRLNFTQQQRLQFERWNVVLDKPGKVTITGTSQNWTPDLTNLMTRQLLDP TAIFWRKEDSDAIDWNEADALEFGERLSDLAKIRKVMYFLVTFGEGVEPANLKASVVF NQL" polyA_signal 3593..3598 BASE COUNT 716 a 1063 c 1117 g 822 t ORIGIN 1 ggatcccact gattgattag ccaccatctc acaattgacc gtgcttgtgg tccactagac 61 ccttcagatt gttttccgtt gtgatacctt ggccttgact ctgtcctctt ttctgtgtgt 121 ggcgtgttgt ggcgaggggc gctccccaac tcccatcccc actctctccc caactccggc 181 tccactcaca ctccagttct ttcatttccc cagtataaag gctgagcttc tggttccgcc 241 ccgggccctg gggatataaa catttgccag attcttcctc ggcccctggg ggaaactgag 301 gattaattca ggtggagtaa gtggtgggat ttgggtagaa gtgaagcctt gtcctgttgt 361 ggccatggtg cagggctgcg gcacagccag ccatcagtgt catccgggtc agtaatgctc 421 aaggcacagt ccctggccca gcagcatgtc acctgggagg tggttaggaa tgcagattct 481 caggcccaca gagccctgat aaaccaggag ttctgggagg gggtccagca atctgtgtgt 541 taagtcctga gagtgagtct tgatgctcac tcaagtcttg agaaccacgg gtctgggtga 601 gagatacggt agctgggctg agatcctgtc aatgggactg gaggggaagg gtcccggggt 661 gtttgggaag cagaatcgac aggctttggt gattgggtgt ggaggagtga gagggaggcg 721 ggcgtcaggg gtagctccaa ggtttaactt aggtgacttc agatctccaa tcaccaagcc 781 ctctctggtc ctgccttctc cacctgctcc tgcgggtctt gcatcttctc ctgtgtacct 841 ccagtgagga gtggtcccca ccaccctccc catcagtgca cttacgaagt gctctcatct 901 tcacaaacaa gccagcaccc agcccagccc tggtagtcag ggcggttgcc acagcaattg 961 acatcagcga cctggtcccc aaggaacctg ccaccttccg cctgcctgca gggcctgcat 1021 tatcgcttct gcggggactg gagtggaggc agatggggac tcccacccct gacacacacc 1081 ccattttgag aactgagtgg ggctgggaag agccagtgcc aaagggaggg gaagagggaa 1141 gggcagaaag taggtggggc ccccctttgg tggcctcttc tctccacggc cccaggctcc 1201 agcccacttg ggtccttggc gttggtggca gcagcacttg ggccatggcg gaggacaggc 1261 cgcagcagcc gcagctggac atgccgctgg tcctggacca gggcctgacc aggcagatgc 1321 ggctacgcgt ggagagcctg aagcagcgcg gggagaagcg ccaggatggg gagaagctgc 1381 tgcagccagc ggagtctgtg taccgcctca acttcaccca gcagcagcgg ctacagttcg 1441 agcgctggaa tgtcgtgctg gacaagccgg gcaaggtcac catcacaggc acctcgcaga 1501 actggacgcc tgacctcacc aacctcatga cacgccagct gctggacccc actgccatct 1561 tctggcgcaa ggaggactcg gatgccatag attggaatga ggccgacgcc ctggagtttg 1621 gggagcgcct gtcggacctg gccaagatcc gcaaggtcat gtacttcctc gtcacctttg 1681 gcgagggtgt ggagcccgcc aacctcaagg cctccgtggt ttttaaccag ctctgacagc 1741 agctgccagc tgctgctctc ctctagccca cctgtgctct cccctgcccc tgccactttc 1801 ccccctgtat tttgggggcc attattctcg ctgctcagcc tgtcctctgc ttgcccagag 1861 gccccctgag tcccacacct ttcctcctct gcttctccct ggggccagca ctccagctca 1921 caggaagaag attctgaggc tccatagcct agaagctgga ctggctgctg cattgctata 1981 gacgatagag gcctactagg ggccagtgtg catggacagt gaggccaggg ccatctgcct 2041 tctctctgct tcattgtggg agagagagac tgagaaagac caagagagac acagagacag 2101 agattgaaaa acccagcatc cacttcctcc agagtcaggg agacagagat gatggggcgt 2161 ctccacgggg agtccagcaa gccggcattc actgctccct ggccttggtg ccctttgccg 2221 gagcctgtgt ctgggctgct ggtcccataa cacgtcgaca accctcagga tatggggcag 2281 ggttgctgca ggggtggatt tgggcagtgg agagtggctg gcaccctgga ggctgtgtag 2341 gcccagctgt ggctcttctg ggcctgactt cagggtggag aagtgaaggg ggaggttaca 2401 cagagatctg tctctacgca cacatatcca tgagacagag tgtgctgtat tcatatggat 2461 gtattctaga ggtctattcc taccctagga acaagtgcag ttttagatta tctgttcatc 2521 attgctgctg gttcaaggat ggctcttaac aggggcctgg tccggatgac cttggcctgg 2581 gggcttgctg agctaggaga ctgcagttca gatagtgaaa cagggagtgg attagtaaag 2641 ggggttccct ttgccttgag ggaagttgga gctggagaga gtggattctc cagggcctca 2701 ggtatcccct gctggggagt caggctcttt agagcttgca ggtcagggaa ggcaagtgct 2761 tcgtcctgac atagcatctg ttggcatttc ttgggcttct tcaatgcagc tgaggggggc 2821 agggcgaagg cgtggtgggc agttacgacg gctgatagtc ccaagtgggc tgcaggcggc 2881 agtggtgtga cggcagaatg gtaacctctg gggtcattgg atgcaactca ctcaccaaac 2941 agatggggaa actgaggcac aattttcatc agattcagtt ctgactctta gcctcattcc 3001 ccttcgcatt gcgcagtccc agagagcccc cccttttggg ggagtgcctg acctgcacct 3061 aacatcagcc aagtacagct aagccactgt ccccagcacc ctgacttaag gccagccctg 3121 tgttttgtcc tcagccagtc agggatgtgt ccaagacatt tcccctcatg aagcaaagct 3181 gtcaaggaac ttgccggctc tggaacagat gcactgaggg ccagagggtc agggccatcc 3241 cctgtggctg gggctgccgg gagggtgagc cccacctcgg aggtgtgcag gctggagcag 3301 catgctggag ctgagattct gtgggtgaga gagtgggaga gtgtctgtgg gctgagcact 3361 ggtcctttct gactcacagc tctggggccc attccgggac aggcttgaag aagtctcggc 3421 cattgcctgc cctgctgagc acgaggggag gccagaaccg tgtgcagtgg ccctgccctt 3481 ctgcttgagc tcttcctgca gctctgggga ccctcttagt cccgactgcc tgtctcccca 3541 gcctgtctgt cccggggcct gagtccctct gctgtgcccg ctgcaggtcc ccaataaagc 3601 ctgtgccctg gcctcggtgg tgtgcagtgt ctcgccatca gcccccatcc ctttcacaat 3661 ccctcacggc cccgagcact tgctccctgg ccacttccca cactccccca gcccttgc // LOCUS HUMMIF 2167 bp DNA PRI 29-SEP-1994 DEFINITION Homo sapiens macrophage migration inhibitory factor (MIF) gene, complete cds. ACCESSION L19686 NID g307284 KEYWORDS macrophage migration inhibitory factor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2167) AUTHORS Paralkar,V. and Wistow,G. TITLE Cloning the human gene for macrophage migration inhibitory factor (MIF) JOURNAL Genomics 19 (1), 48-51 (1994) MEDLINE 94245178 REFERENCE 2 (bases 1 to 2167) AUTHORS Wistow,G.J. TITLE Direct Submission JOURNAL Submitted (19-JUN-1993) Wistow G.J., Molecular Sturcture and Function, NEI, NIH, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2167 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA join(1076..1280,1470..1642,1738..1920) /gene="MIF" gene join(1076..1280,1470..1642,1738..1920) /gene="MIF" exon 1076..1280 /gene="MIF" /number=1 CDS join(1173..1280,1470..1642,1738..1804) /gene="MIF" /codon_start=1 /product="macrophage migration inhibitory factor" /db_xref="PID:g307285" /translation="MPMFIVNTNVPRASVPDGFLSELTQQLAQATGKPPQYIAVHVVP DQLMAFGGSSEPCALCSLHSIGKIGGAQNRSYSKLLCGLLAERLRISPDRVYINYYDM NAANVGWNNSTFA" intron 1281..1469 /gene="MIF" /number=1 exon 1470..1642 /gene="MIF" /number=2 intron 1643..1737 /gene="MIF" /number=2 exon 1738..1920 /gene="MIF" /number=3 BASE COUNT 392 a 657 c 717 g 401 t ORIGIN 1 ctgcaggaac caatacccat aggctatttg tataaatggg ccatggggcc tcccagctgg 61 aggctggctg gtgccacgag ggtcccacag gcatgggtgt ccttcctata tcacatggcc 121 ttcactgaga ctggtatatg gattgcacct atcagagacc aaggacagga cctccctgga 181 aatctctgag gacctggcct gtgatccagt tgctgccttg tcctcttcct gctatgtcat 241 ggcttatctt ctttcaccca ttcattcatt cattcattca ttcagcagta ttagtcaatg 301 tctcttgata tgcctggcac ctgctagatg gtccccgagt ttaccattag tggaaaagac 361 atttaagaaa ttcaccaagg gctctatgag aggccataca cggtggacct gactagggtg 421 tggcttccct gaggagctga agttgcccag aggcccagag aaggggagct gagcacgttt 481 gaaccactga acctgctctg gacctcgcct ccttccttcg gtgcctccca gcatcctatc 541 ctctttaaag agcaggggtt cagggaagtt ccctggatgg tgattcgcag gggcagctcc 601 cctctcacct gccgcatgac taccccgccc catctcaaac acacaagctc acgcatgcgg 661 gactggagcc cttgaggaca tgtggcccaa agacaggagg tacaggggct cagtgcgtgc 721 agtggaatga actgggcttc atctctggaa gggtaagggg ccatcttccg ggttcaccgc 781 cgcatcccca cccccggcac agcgcctcct ggcgactaac atcggtgact tagtgaaagg 841 actaagaaag acccgaggcg aggccggaac aggccgattt ctagccgcca agtggagaac 901 aggttggagc ggtgcgccgg gcttagcggc ggttgctgga ggaacgggcg gagtcgccca 961 gggtcctgcc ctgcgggggt cgagccgagg caggcggtga cttccccact cggggcggag 1021 ccgcagcctc gcgggggcgg ggcctggcgc cggcggtggc gtcacaaaag gcgggaccac 1081 agtggtgtcc gagaagtcag gcacgtagct cagcggcggc cgcggcgcgt gcgtctgtgc 1141 ctctgcgcgg gtctcctggt ccttctgcca tcatgccgat gttcatcgta aacaccaacg 1201 tgccccgcgc ctccgtgccg gacgggttcc tctccgagct cacccagcag ctggcgcagg 1261 ccaccggcaa gcccccccag gtttgccggg aggggacagg aagagggggg tgcccaccgg 1321 acgaggggtt ccgcgctggg agctggggag gcgactcctg aacggagctg gggggcgggg 1381 cggggggagg acggtggctc gggcccgaag tggacgttcg gggcccgacg aggtcgctgg 1441 ggcgggctga ccgcgccctt tcctcgcagt acatcgcggt gcacgtggtc ccggaccagc 1501 tcatggcctt cggcggctcc agcgagccgt gcgcgctctg cagcctgcac agcatcggca 1561 agatcggcgg cgcgcagaac cgctcctaca gcaagctgct gtgcggcctg ctggccgagc 1621 gcctgcgcat cagcccggac aggtacgcgg agtcgcggag gggcggggga ggggcggcgg 1681 cgcgcggcca ggcccgggac tgagccaccc gctgagtccg gcctcctccc cccgcagggt 1741 ctacatcaac tattacgaca tgaacgcggc caatgtgggc tggaacaact ccaccttcgc 1801 ctaagagccg cagggaccca cgctgtctgc gctggctcca cccgggaacc cgccgcacgc 1861 tgtgttctag gcccgcccac cccaaccttc tggtggggag aaataaacgg tttagagact 1921 aggagtgcct cggggttcct tggcttgcgg gaggaattgg tgcagagccg ggacattggg 1981 gagcgaggtc gggaaacggt gttgggggcg ggggtcaggg ccgggttgct ctcctcgaac 2041 ctgctgttcg ggagcccttt tgtccagcct gtccctccta cgctcctaac agaggagccc 2101 cagtgtcttt ccattctatg gcgtacgaag ggatgaggag aagttggcac tctgccctgg 2161 gctgcag // LOCUS AF049259 5698 bp DNA PRI 16-SEP-1998 DEFINITION Homo sapiens keratin 13 gene, complete cds. ACCESSION AF049259 NID g3603252 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5698) AUTHORS Waseem,A., Alam,Y., Dogan,B., White,K.N., Leigh,I.M. and Waseem,N.H. TITLE Isolation, sequence and expression of the gene encoding human keratin 13 JOURNAL Gene 215 (2), 269-279 (1998) MEDLINE 98382520 REFERENCE 2 (bases 1 to 5698) AUTHORS Waseem,A., Alam,Y. and Waseem,N.H. TITLE Direct Submission JOURNAL Submitted (19-FEB-1998) Craniofacial Development, UMDS, Guy's Dental School, Floor 28, Guy's Tower, London Bridge, London, England SE1 9RT, United Kingdom FEATURES Location/Qualifiers source 1..5698 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q12-q21.2" /tissue_type="suprabasal layers of mucosal epithelia" mRNA join(465..1006,2331..2413,2613..2769,2960..3121, 3246..3371,3464..3684,4694..5075) /product="keratin 13" CDS join(512..1006,2331..2413,2613..2769,2960..3121, 3246..3371,3464..3684,4694..4712) /note="ectopically induced by retinoids" /codon_start=1 /product="keratin 13" /db_xref="PID:g3603253" /translation="MSLRLQSSSASYGGGFGGGSCQLGGGRGVSTCSTRFVSGGSAGG YGGGVSCGFGGGAGSGFGGGYGGGLGGGYGGGLGGGFGGGFAGGFVDFGACDGGLLTG NEKITMQNLNDRLASYLEKVRALEEANADLEVKIRDWHLKQSPASPERDYSPYYKTIE ELRDKILTATIENNRVILEIDNARLAVDDFRLKYENELALRQSVEADINGLRRVLDEL TLSKTDLEMQIESLNEELAYMKKNHEEEMKEFSNQVVGQVNVEMDATPGIDLTRVLAE MREQYEAMAERNRRDAEEWFHAKSAELNKEVSTNTAMIQTSKTEITELRRTLQGLEIE LQSQLSMKAGLENTVAETECRYALQLQQIQGLISSIEAQLSELRSEMECQNQEYKMLL DIKTRLEQEIATYRSLLEGQDAKKRQPP" BASE COUNT 1316 a 1503 c 1575 g 1304 t ORIGIN 1 ggatccagga catcccagtc agaagtttta ggtatagaaa aaaggaaggt cgagggctac 61 ggtgaccttg caaagcacag agccactctg cacccaccac tttcctcgca gaagctcctg 121 gcaggcctcc gcatctcagg tcccgttcta atactgggtg gggctcagag cctgcacagt 181 gaactcctta agtggaggtg aaacagaatt cacttgtcaa aagggcagtc tcaggcagaa 241 tgctgtcccc tctgaatcat tctttttgtt aaggattaaa aaaaaagcca cccctaaaag 301 gcacaaccca ccctggagag atcagagata actaagcttg tgggaaacag aagtgtagtt 361 ggcaccaagt cagaggttgg ggaagggagg agagaagata accagcccct atggaggtgt 421 ataaaaggtg tccactctgg ggaagagcca cagtcctcgg cccaggccaa gcaagcttct 481 atctgcacct gctctcaatc ctgctctcac catgagcctc cgcctgcaga gctcctctgc 541 cagctatgga ggtggtttcg ggggtggctc ttgccagctg ggaggaggcc gtggtgtctc 601 tacctgttca actcggtttg tgtctggggg atcagctggg ggctatggag gcggcgtgag 661 ctgtggtttt ggtggagggg ctggtagtgg ctttggaggt ggctatggag gtggccttgg 721 aggtggctat ggaggtggcc ttggaggtgg ctttggtggg ggttttgctg gtggctttgt 781 tgactttggt gcttgtgatg gcggcctcct cactggcaat gagaagatca ccatgcagaa 841 cctcaacgac cgcctggctt cctacctgga gaaggtgcgc gccctggagg aggccaacgc 901 tgacctggag gtgaagatcc gtgactggca cctgaagcag agcccagcta gccctgagcg 961 ggactacagc ccctactaca agaccattga agagctccgg gacaaggtga gcccttggaa 1021 gctgaggagg ggtcctccat gaagggcaga gcctccaagg ccaagaaaca cagtgttgtg 1081 ggggactagg tgtggacgga attcacaaac ttcttgcagg ggaaaagcct actgtttgac 1141 taccaagagt ctctacctct gatgggcttt ctcatgccac aactccccaa atgcattttc 1201 tctcattatt catttttatg attgttatta ctcaggtcag tttttttact gctttccaga 1261 cagttatgtt attcatgact cgatctctat aatcctaacc tactgagtta tctggaaaca 1321 tgacaggttt aaaagccatt aagaatgcat gaaatcaaca gagaaagaga gacgtgttca 1381 ttcattgccc tgggaataat tgtttacttt gtaggggacc cattcaaact ggtattctca 1441 actggcgatg gcaagttggg ggagagagag gcagaggctt aggcagtgcc gagcctgggt 1501 gccctttgtt gagccctgtc tgttgtgctg tgataccagc ctgtggagca aaatctacca 1561 gcctggctga gtctcagcca tgtggagcag gctttcactg ctttggaata taaacgttta 1621 gacaagaaat tcaaaggatg ggaatatttg ttggatttcc cctccactca gctgggaagt 1681 cagctctagc aaagtgattg tgagatatgt ggcaccaagc cgatccctcc tgactttcag 1741 gggatggatc tcaagacagt ggccatggga gggtgctagt ctgaggctgc caagacatga 1801 cgaaccaaag atgttctcag caggtcgttc ctcagaaatg cactccctgc atctccccca 1861 gggtgctcag acatggaata ttaaagtgtt agaaacgatt ggaggggatg cagccccact 1921 cctcacttta cagaagagca atggaggccc agagggaaat gccttgtccc agctcacagg 1981 acagtttaga ggtagagccc catgggaacc aggtcctgcc catgttttcc caggccacct 2041 cctcaagcac gtggaccccc cagagcaagc ttgtctaacc tgtggccggt gggctgcatg 2101 tggcccagga cagctttgat tgaggcctaa cacaaatttg taagctttct taaaacattg 2161 agtttttttt tattttttat ttttgccaat ttctttttag cccatcagct atcgttagtg 2221 ttagcgtatt tgatgtgttg cccaagacaa ttcttcttcc aatgtggccc agggaagcca 2281 acagactgga cacccatgcc ctagaggtta ccttcttctg cattttgtag atcctgaccg 2341 ccaccattga aaacaaccgg gtcatcctgg agattgacaa tgccaggctg gctgtggacg 2401 acttcaggct caagtgagtt ttttttttcc ccctccctct cctttcccag gagtctctta 2461 gtttagacaa aagctcatct gaccaatgac actggagcat cccaggagtg aaggtctgga 2521 agtactggag tctcgtgcag gggtagggag aatgtctcat atcatttatg tcttgctttt 2581 tcaaaaaaca tttcattttt ttactatttt aggtatgaga atgagctggc cctgcgccag 2641 agcgtggagg ccgacatcaa cggcctgcgc cgggtgctgg atgagctcac tctgtctaag 2701 actgacctgg agatgcagat cgagagcctg aatgaagagc tagcctacat gaagaagaac 2761 catgaagagg tgagcgggga ttgcaggctg ccctgggctc aacaacctcc agtgagaagg 2821 aggagccctc ccctgcaact ggacacagtc ctccactgtg ggaccttgga aacactctcc 2881 agaacactct tgccagcact cagagccaag gagagggagg atgagtagcc tcacacatct 2941 tcctttccca tccccacagg agatgaagga atttagcaac caggtggtcg gccaggtcaa 3001 cgtggagatg gatgccaccc caggcattga cctgacccgc gtgctggcag agatgaggga 3061 gcagtacgag gccatggcag agaggaaccg ccgggatgct gaggaatggt tccacgccaa 3121 ggtacctggc cctcccaccc cacatagccc atcccatata gccagggggg ccgggcggac 3181 acagcctctt ggtcaggctg ctcgagctga gcttccccac caccttcttc ctgcttgcat 3241 ttcagagtgc agagctgaac aaggaggtgt ctaccaacac tgccatgatt cagaccagca 3301 agacagagat cacggagctc aggcgcacgc tccaaggcct ggagattgag ctgcagtccc 3361 agctgagcat ggtatgcctg cagcctcctg cccggtggcg accacctcta ggtttccctc 3421 tggcctggct ccctctgtaa cttcttgtct gtcgctccac cagaaagcgg ggctggagaa 3481 cacggtggca gagacggagt gccgctatgc cctgcagctg cagcagatcc agggactcat 3541 cagcagcatc gaggcccagc tgagcgagct ccgcagtgag atggagtgcc agaaccaaga 3601 gtacaagatg ctgctggaca tcaagacacg tctggagcag gagatcgcca cctaccgcag 3661 cctgctcgag ggccaggacg ccaagtaggt cctctccagc ctctttctta ggatagtgac 3721 tgcaggaccc gtgggtgtca gagcaggcag ggccttagag atccaccccc tcatgtcaga 3781 gaggagaccc tcactgtctg actttacggg gacatttcct tggtaggctg caggttaaca 3841 cctttaaact ttctaatttg ggcctccaag ttagactgga gggaacagag accatggagg 3901 gaacatctga tattggaaaa acatgagctt ggatacagct cagctgaaca aagctattac 3961 tatggtgctt taactgcctt tagacccttt gcagaaatca agccacccct gtgggctcag 4021 ggctcaggtg gagggttgcg ggtttgggcg atccagaggg aagacaaaca ccctgaatct 4081 tgtggctaat aatatctccc cttgagacct ctcagacaga cagacaggaa agacagacag 4141 tcccagtggg cacctgatgc tgctgtccct tgagctgggg gcacagctag gaggcactag 4201 agtccagccc aggggcttga aatgataagc cgaggcactt ccccagagga acaggtaagc 4261 acctgccctc agttaccctc tgctcatggc cacctctgtc ctctctccca ggatgattgg 4321 tttcccttcc tcagcaggta agggaacagc cccgtggtgt agggggaggc agagttagcc 4381 tcccattcac tgccccaatg ggggtgggag ctggcgggga gggagggagc cagtgctggg 4441 ctctccagtg tagggagggg gagagctggg gtggaagtgg tcattcgctc actgattttc 4501 cttttgaaac tgatggctct gccccacaga gctatttggc tgtcttggga agagccatct 4561 ttggggacaa taaagagtct ccatccacgc accacctgac taatgaggag tttgtgagcc 4621 ctccaggggg gccctgccct gtcctgaggt gggtcttctc cagcctgagc atgtcttgtc 4681 tcttgtttcc caggaagcgt cagcccccgt agcacctctg ttaccacgac ttctagtgcc 4741 tctgttacca ccacctctaa tgcctctggt cgccgcactt ctgatgtccg taggccttaa 4801 atctgcctgg cgtcccctcc ctctgtcttc agcacccaga ggaggagaga gccggcagtt 4861 ccctgcagga gagaggaggg gctgctggac ccaaggctca gtccctctgc tctcaggacc 4921 ccctgtcctg actctctcct gatggtgggc cctctgtgct cttctcttcc tgtcggatct 4981 ctctcctctc tgacctggat acgctttggt ttctcaactt ctctacccca aagaaaagat 5041 tattcaataa agtttcctgc ctttctgcaa acatattttt gattctggcc tggtttctta 5101 ggtggcccct gctaggccct ggtttagcgg gtgaggggga cacagctggg cggagtagct 5161 aatccagaca acgacaagcc tggggagctt cctatctagg aaggacagac ataggcccat 5221 gggccatctg agcacacaga cacaacaacg gcaaacacag gggacacagg cagctcacag 5281 acgatggtgt ccagcagaca tggacagacg ggacacacca gagacagcgc aggggcacgt 5341 gagccagttc aacagccttg cagattagga tgttcagtga gaatggcagc tgatagtaga 5401 gtcaagggag cgtgaaacat gggcaaaaag tacagatgcc tctggggcag gctgcaaggg 5461 gccagtgaag gggtgatctg gggagaagga ggccagcccc cacctgccta cctccctgct 5521 tcctgcaaag agtttattct caagcaaatg gctccttgtt ggcctgaggc atccactcct 5581 aatgtcctcc ccagcccaac cctagtctgc cactccacac tctggacagc tcctgctggc 5641 cgctgcacta acctggcctt ctctctgtgc tccccaatct ctccctgccg ctttggct // LOCUS HSH12 1391 bp DNA PRI 09-NOV-1992 DEFINITION H.sapiens H1.2 gene for histone H1. ACCESSION X57129 NID g31967 KEYWORDS H1.2 gene; histone H1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1391) AUTHORS Kardalinou,E. TITLE Direct Submission JOURNAL Submitted (19-DEC-1990) Kardalinou E., Zentrum Biochemie, Abteilung Molecularbiologie, Humboldallee 23, D-3400 Goettinger, FRG REFERENCE 2 (bases 1 to 1391) AUTHORS Eick,S., Nicolai,M., Mumberg,D. and Doenecke,D. TITLE Human H1 histones: conserved and varied sequence elements in two H1 subtype genes JOURNAL Eur. J. Cell Biol. 49 (1), 110-115 (1989) MEDLINE 89338424 FEATURES Location/Qualifiers source 1..1391 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="genomic DNA library in EMBL3" misc_signal 365..371 CAAT_signal 427..431 TATA_signal 459..464 gene 526..1167 /gene="H1.2" CDS 526..1167 /gene="H1.2" /codon_start=1 /product="histone H1" /db_xref="PID:g31968" /db_xref="SWISS-PROT:P16403" /translation="MSETAPAAPAAAPPAEKAPVKKKAAKKAGGTPRKASGPPVSELI TKAVAASKERSGVSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGAS GSFKLNKKAASGEAKPKVKKAGGTKPKKPVGAAKKPKKAAGGATPKKSAKKTPKKAKK PAAATVTKKVAKSPKKAKVAKPKKAAKSAAKAVKPKAAKPKVVKPKKAAPKKK" terminator 1195..1210 /note="histone mRNA terminator" BASE COUNT 375 a 343 c 385 g 288 t ORIGIN 1 ggacctgtgt tacttccctt gtgaagaaac agaattatca tgaaaattta ggtggaaacc 61 atttcgcttt tttcttcaaa aataagggaa gcatgtgccc aaccacccct gggaaaaaga 121 accttcaggg gcaaaggagc gaacaggtaa tttataagaa aaacagaaag tggtcttctt 181 gactgcccca gacttccttc ggagttgggg gaattgggga cgcctggacg cgttgttttt 241 gtgtttgtgg aaaaaataaa tgaaggagca tgaagcccga ggcttctgag atcctttcct 301 gaccaaaccc aagtgatttg gtgtcgggga attttaatat ttttcccctt ttgtgaggtg 361 gaacaaacac aacttgggag cagcgcagcg gctcagagcc tgccagccag gcgggcgacc 421 agagcaccaa tcagagcgcg cctgcgctct atatatacag cggccctgcc caggcgctgc 481 ttcatcggcg ctttgccact tgtacccgag tttttgattc tcaacatgtc cgagactgct 541 cctgccgctc ccgctgccgc gcctcctgcg gagaaggccc ctgtaaagaa gaaggcggcc 601 aaaaaggctg ggggtacgcc tcgtaaggcg tccggtcccc cggtgtcaga gctcatcacc 661 aaggctgtgg ccgcctctaa agagcgtagc ggagtttctc tggctgctct gaaaaaagcg 721 ttggctgccg ccggctatga tgtggagaaa aacaacagcc gtatcaaact tggtctcaag 781 agcctggtga gcaagggcac tctggtgcaa acgaaaggca ccggtgcttc tggctccttt 841 aaactcaaca agaaggcagc ctccggggaa gccaagccca aggttaaaaa ggcgggcgga 901 accaaaccta agaagccagt tggggcagcc aagaagccca agaaggcggc tggcggcgca 961 actccgaaga agagcgctaa gaaaacaccg aagaaagcga agaagccggc cgcggccact 1021 gtaaccaaga aagtggctaa gagcccaaag aaggccaagg ttgcgaagcc caagaaagct 1081 gccaaaagtg ctgctaaggc tgtgaagccc aaggccgcta agcccaaggt tgtcaagcct 1141 aagaaggcgg cgcccaagaa gaaataggcg aacgcctact tctaaaaccc aaaaggctct 1201 tttcagagcc accactgatc tcaataaaag agctggataa tttctttact atctgccttt 1261 tcttgttctg ccctgttact taaggttagt cgtatgggag ttactgaggt atcagacgaa 1321 ttgggtgacg gggttggaga gtggccgtgg tgaggttaca gcatttaaac ctttattgcg 1381 gcttctaggt c // LOCUS HSACTHR 1850 bp DNA PRI 18-SEP-1992 DEFINITION H.sapiens ACTH-R gene for adrenocorticotropic hormone receptor. ACCESSION X65633 NID g28343 KEYWORDS hormone receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1850) AUTHORS Cone,R.D. TITLE Direct Submission JOURNAL Submitted (22-APR-1992) R.D. Cone, Vollum Inst. for Advanced Biomedical Res, Oregon Health Sciences University, 3181 S.W.Sam Jackson Park Road, Portland OR 97201, USA REFERENCE 2 (bases 1 to 1850) AUTHORS Mountjoy,K.G., Robbins,L.S., Mortrud,M.T. and Cone,R.D. TITLE The cloning of a family of genes that encode the melanocortin receptors JOURNAL Science 257 (5074), 1248-1251 (1992) MEDLINE 92390715 FEATURES Location/Qualifiers source 1..1850 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skin" /cell_type="keratinocyte" /cell_line="primary human keratinocytes" /clone_lib="HK EMBL3" gene 696..1589 /gene="ACTH-R" CDS 696..1589 /gene="ACTH-R" /codon_start=1 /evidence=experimental /product="candidate adrenocorticotropic hormone receptor" /db_xref="PID:g28344" /db_xref="SWISS-PROT:Q01718" /translation="MKHIINSYENINNTARNNSDCPRVVLPEEIFFTISIVGVLENLI VLLAVFKNKNLQAPMYFFICSLAISDMLGSLYKILENILIILRNMGYLKPRGSFETTA DDIIDSLFVLSLLGSIFSLSVIAADRYITIFHALRYHSIVTMRRTVVVLTVIWTFCTG TGITMVIFSHHVPTVITFTSLFPLMLVFILCLYVHMFLLARSHTRKISTLPRANMKGA ITLTILLGVFIFCWAPFVLHVLLMTFCPSNPYCACYMSLFQVNGMLIMCNAVIDPFIY AFRSPELRDAFKKMIFCSRYW" BASE COUNT 471 a 457 c 372 g 546 t 4 others ORIGIN 1 acaacacttt atatatattt ttataaatgt aaggggtaca aargtgccat tttgttacat 61 ggatataccg tgtagtggtg aagcctgggc ttttagtgta tctgtcatca gaataacata 121 cgtgttaccc ataggaattt ctcatcaccc gccccctcca cccttcgagt ctccaatgtc 181 cattccacac tctatatcca cgtgtatgca tatagctcca catataagtg agaacatgta 241 gtatttgact tcctctttct gagttatttc actttgataa tggcctccac ttccatccat 301 gttgctgcaa aagacatgac cttattcttt ttgatagctg gggagtactc cattgtgtat 361 atgtaccaca tttnctttat ccattcaccc attgangaac acttagttga ttccatatct 421 ttgctattgt cactagtgct gcaataaaca tacatgtgca ggctccttct aatatactga 481 tttatatttt atggagagag atagagttct tagcgagtgt gctgtttatt tctagtgtac 541 ttgcaactaa tattctgtat actcccttta ggtgattgga gatttaactt agatctccag 601 caagtgctac aagaagaaaa gatcctgaag aatcaatcaa gtttccgtga agtcaagtcc 661 aagtaacatc cccgccttaa ccacaagcag gagaaatgaa gcacattatc aactcgtatg 721 aaaacatcaa caacacagca agaaataatt ccgactgtcc tcgtgtggtt ttgccggagg 781 agatattttt cacaatttcc attgttggag ttttggagaa tctgatcgtc ctgctggctg 841 tgttcaagaa taagaatctc caggcaccca tgtacttttt catctgtagc ttggccatat 901 ctgatatgct gggcagccta tataagatct tggaaaatat cctgatcata ttgagaaaca 961 tgggctatct caagccacgt ggcagttttg aaaccacagc cgatgacatc atcgactccc 1021 tgtttgtcct ctccctgctt ggctccatct tcagcctgtc tgtgattgct gcggaccgct 1081 acatcaccat cttccacgca ctgcggtacc acagcatcgt gaccatgcgc cgcactgtgg 1141 tggtgcttac ggtcatctgg acgttctgca cggggactgg catcaccatg gtgatcttct 1201 cccatcatgt gcccacagtg atcaccttca cgtcgctgtt cccgctgatg ctggtcttca 1261 tcctgtgcct ctatgtgcac atgttcctgc tggctcgatc ccacaccagg aagatctcca 1321 ccctccccag agccaacatg aaaggggcca tcacactgac catcctgctc ggggtcttca 1381 tcttctgctg ggcccccttt gtgcttcatg tcctcttgat gacattctgc ccaagtaacc 1441 cctactgcgc ctgctacatg tctctcttcc aggtgaacgg catgttgatc atgtgcaatg 1501 ccgtcattga ccccttcata tatgccttcc ggagcccaga gctcagggac gcattcaaaa 1561 agatgatctt ctgcagcagg tactggtaga atggctgatc cctggtttta gaatccatgg 1621 gaataacgtt gccaagtgcc agaatagtgt aacattccaa caaatgccag tgctcctcac 1681 tggccttcct tccctaatgg atgcaaggat gacccaccag ctagtgtttc tgaatactat 1741 ggccaggaac agtctattgt aggggcaact ctatttgtga ctggacagat aaaacgtgta 1801 gtaaaagaag gatagaatac aaagtattag gtacaaaagt aattanggtt // LOCUS AF027148 12825 bp DNA PRI 08-AUG-1998 DEFINITION Homo sapiens myogenic determining factor 3 (MYOD1) gene, complete cds. ACCESSION AF027148 NID g3403164 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12825) AUTHORS Chen,B., Dias,P., Jenkins,J.J. III, Savell,V.H. and Parham,D.M. TITLE Methylation alterations of the MyoD1 upstream region are predictive of subclassification of human rhabdomyosarcomas JOURNAL Am. J. Pathol. 152 (4), 1071-1079 (1998) MEDLINE 98206444 REFERENCE 2 (bases 1 to 12825) AUTHORS Chen,B. TITLE Direct Submission JOURNAL Submitted (26-SEP-1997) Pathology, University of Arkansas for Medical Sciences, 4301 West Markham St., Little Rock, AR 72205, USA COMMENT Methylation alterations in the 5' region are found in embryonal rhabdomyosarcoma and alveolar rhabdomyosarcoma. Dynamic methylation changes in this region are found in normal myogenesis. FEATURES Location/Qualifiers source 1..12825 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15.4" repeat_region 1866..2513 /note="VNTR; polymorphic in the general population" /rpt_type=tandem gene 8721..12783 /note="expressed in early myogenesis and in rhabdomyosarcomas" /gene="MYOD1" mRNA join(10264..11065,11555..11633,11908..12783) /gene="MYOD1" /product="myogenic determining factor 3" CDS join(10436..11065,11555..11633,11908..12161) /gene="MYOD1" /note="Myf3; MyoD1" /codon_start=1 /product="myogenic determining factor 3" /db_xref="PID:g3403165" /translation="MELLSPPLRDVDLTAPDGSLCSFATTDDFYDDPCFDSPDLRFFE DLDPRLMHVGALLKPEEHSHFPAAVHPAPGAREDEHVRAPSGHHQAGRCLLWACKACK RKTTNADRRKAATMRERRRLSKVNEAFETLKRCTSSNPNQRLPKVEILRNAIRYIEGL QALLRDQDAAPPGAAAAFYAPGPLPPGRGGEHYSGDSDASSPRSNCSDGMMDYSGPPS GARRRNCYEGAYYNEAPSEPRPGKSAAVSSLDCLSSIVERISTESPAAPALLLADVPS ESPPRRQEAAAPSEGESSGDPTQSPDAAPQCPAGANPNPIYQVL" BASE COUNT 2979 a 3315 c 3511 g 3019 t 1 others ORIGIN 1 ggatccagcg tcagccacct gaggaacatc ccaacacatg tcatgcctcc gggccttcac 61 ccatgctgtt ctgtaagccc ggaaagcctt tctctacagc agccccccga cccactttgg 121 gcactcctat tcatccttca gtaaccaaga agaaggatag caatatttcc tccttgggga 181 actcttcttg ggtgtccaca agagagttca ttgttccctc ttggtgctcc cccagggtgt 241 tttgattatt catttatgga ttgctctttc cccaacagac agagctcttt gaggctgggg 301 atgctgtctg aatcatccct agatcttact gcacctaacc tggggcctgg aaacagggtg 361 ggtgtctggg gaatgcttga ggggttgggg gaggttaggc ctgttgaggc gcatgggcca 421 taaatcacct tcccaagcca gggggaaagc agcaatccag gagagcttca cggaggtggc 481 aggacgggat gtaggacgga ggcaaaccat agactcgcag gcggtgaggg catcattcat 541 gagacctctg ccctccgttc ttctgccagg aaacccctgt cctgggtgct attggccagg 601 gataagcaga ttttggaggg gggaatcagg cttcttcaag gcgttaggtc tccactcaga 661 ggtatgtggc tggggcagct gctgggggct gcagctggtg tctgtcccag agcccaacgg 721 ctgtgtgtgc cttaatccca gtccttggtg ccccccaggc tggcaggtgg actgatgagg 781 cagaaaggag gcaacaggag aggggtggag agccgagccc cctctccagg tccccacagc 841 cgccctctgg atcttgctac atgtgccacc ccatcaccag gtcctcgtca ccagctatcc 901 cctcagccct gagctgctcc tccctcagcc cttagaggga ggtctgacct ccttgaggta 961 ggaagattca gcttaaaaag ttcaaactgg aaaacacaaa gaagagatgg atgtcacggc 1021 atctgtgccc cttagcgtct tctttctgaa tggggctggg gatgtttagg cacataggtg 1081 gagacaagca ttcacactgg gactgaggtg tggacgtacc taacttagaa tatgagcagg 1141 tgggacgcat ctgtggtgtg cacatggcat gtgaagtgtg tgtgttctgt aggtgagggg 1201 tgcacagcat gggaggtata cgcatgtgtg aggtgtgcat ggactatgaa gcatgcacag 1261 tatatgaggt gtaatatgtg tgatatgcac atgatgtgag agaggaaata tcccatatgt 1321 gggacatact gctgttggag gtttacatgg catgtgagtt tacatagagt gtcaagtgta 1381 tccggagtgt taggtgtaca tggcatgtaa tgtgtaaaca atgtgagact tgtacatagt 1441 atgtgaggta acatggagtg tgaggtgtgt gtgtgtgtga ggtgtacatg gagtgggata 1501 tgtacacaat ataagagatg tacccagtgt gtaacatatt catggagtgt gaggtagaca 1561 tggcacaaga gagatacaca cagtgcttga gatatatgtg gtatgtgagg tgtacagaga 1621 gggtgaactg tacattatgt gagaggtgtg cagtatgtga gaggtgtaca gtatgtgaaa 1681 ggtgtgcagt atgtgaaaga tgcacattat gagagaggta gatggagcat aaggtgtaca 1741 ttatgtgaca ggcatagagg gaacgtgagc tgtacattct gtgagaggca cataggaatt 1801 gtgaggtgta cactatgtga gaggtataca ttatgtgaaa gatgtacatt atgagagaag 1861 ctgcaggcag catgtggtgt acatcctgtg agaggcgtgc agggagtgtg aggcgtgcgt 1921 tctgtgagag gcatgcaggg agtgtgaggt gtgtattctg tgagaggcgt gcagggggtg 1981 tgcggcgtgc attctgtgag aggcatgcag ggagtgtgca gcatgcattc tgtgagaagc 2041 gtgcagggag tgtgaggcgt gcattctgtg agaggcgtgc agggaatgtg aggcgtgcat 2101 tctgtgagag gcgtgcaggg agtgtgaggc gtgtattctg tgagaggctt gcagggaatg 2161 tgaggcgtgt attctgtgag aggcgtgcag ggagtgtgag gtgtgcattc tgtgagaggc 2221 gtgcagggaa tgtgaggcgt gtattctgtg agaggtgtgc agggagtgtg aggcgtgcat 2281 tctgtgagag gcgtgcaggg aatgtgaggc gtgtattctg tgagaggtgt gcagggagtg 2341 tgaggcatgc attctgtgag aggcatgcag ggagtatgtg gcatgcattc tgtgagaggc 2401 gtgcaggggg tgtgtggtgt gcattctgtg agaggtgtgc agggagtgtg aggtgtacat 2461 tatcagagag gtgtacaggg agcgtgcact gtatattgtg tgagaggtgt gcacagtgta 2521 aggtgagcac tgtatgtgac ttgtctgcag tttatcaggt gcacatagct atgacaccac 2581 aaaggcatat ggaagaatgc aggtgaggac aaactgtcct tccaagaaaa tgtgcttgtc 2641 agctctgtcc tgggctcaca ctctggacgc atggcacaac ctcctgagca gtgtcacagg 2701 cagaggagac agagggacgt cctctggctt ttcaggagtc cttttataca taaaggagaa 2761 ggcagctttg agaggccttc tcctccatcc ttctgcctgc atccatacca cgttagctca 2821 agggcaatgt gctctttgag gaataccata cgttggtaat attattttta catctgactt 2881 aaatccctca tgctgcaagt tgaattcgca tttttttggt gctttcaaat gtgaaggtag 2941 tgaactctcc tttcatgtag acctcctcac cccgggagcc taatgtcttt aggctagaag 3001 aataattagc accttcatga gctctgcctg cttgaacact tctagtcatt aagttattgt 3061 caccttattt gtgttgttta gtaaacagcg catgtgtatt tcaaacccca ggaatctagt 3121 cccacaaaga ccccagggct cacctctgag gcagagaaag tctttaatta cataaagaat 3181 aagcttgaag aagagaggct ggatgaatga atgaacgcct caggtctttc cagaagccag 3241 gcatcagtgg gttcttaagc ctggcctcag cttcccctac ctggtgggca gtctcagctc 3301 ctgctatttt cactcatctg gcttcctagt gtctgggaaa gaacccggag tgagaggcat 3361 tgtgagggct gcatggaatg tgaggtgcac atggcatgtg gggcctcgtg ggggaaagag 3421 ctgggcgggc tgcagggagg gtccccccat aacaccaagc tcacatgggt cacatcccat 3481 gccctttagc ctcttcccct gggtagcagc ccaccctcat ctcacatcct gaattgggct 3541 aaagcttagt tctagatgaa gttataatta aatttaaaat cacatctaga gaattccctc 3601 ttaaatttaa gtactatgtc agttttggag ggagttttta taaacctatt gaagagggga 3661 ataaacagct ctagcctatt atgcctagtt cttcaagaat ctcttggttt gtaagatatt 3721 ctttcattag aagtaacaaa ttggccaggt gcgggtggct catgcctgta atcccagcac 3781 tttgggangc tgaggcaggt agatcacctg aggtcaggaa ttcgagaaca gcctggccaa 3841 catggtgaaa ccccatctct actaaaaaca cacaaaaaat agccagacat ggtggctcgc 3901 gcctgttgtc ccagctactc aggaggctga ggcacgagaa tcgcttgaac cccggcagca 3961 aaggttgcag tgagctgaga tcgcgccact gcactccagc ctgggcgaca gagcgagatt 4021 ctgtttcaat tttttaaaaa agaagaagaa gtaacagatt gcccatcaaa ccccatgccc 4081 aaactctatc tcgtgccacc cccacacccc ccagccccta cagctccagg agccatgcct 4141 gtctgtaccc acagaaatcc tctgctccta gcagaagcga ttgtcccata gaaggtgctc 4201 aataattatt ttgttctctc tgccgggagc attcttcccc agatgccaca tggctaactc 4261 ccaagcctct tttaactctt tgctcaaatg ttaccttgcc aacctgacca ctctgtttaa 4321 cactacagcc gaccccaacc tggcacctca atcctgctca tcttgttctg cttgtaatat 4381 ttttctgcgt gtatcgcctt ctaacatcta tataatttat ttattattat gtgtattgct 4441 tattgtctgc cctgccctgc ccgcatgtca gttccacagg aacaaggaac atggtctgtt 4501 tcattctcca tcacattccc agcacctgaa taaatgtttg ttgtataagt gaatgaatca 4561 tagactggac aactgagagt gggaatatcc ccaccacccc aatagtgccc tgcccccacc 4621 ccctgcacac tgggtgggaa gggcacatgc tgttggttgt cttcctgttc cctccctcca 4681 cagtgtggac tcctgttctt caactcccag ctcagctgcc agactcctaa gccctgcttg 4741 tctgtgggag gctggagagt acccccaaag ggggaaatgt ggccttctgt gaggaatctc 4801 tgggaccctg tccctaatct gggaccatgt ctatatcctg gcaatatcac agtccctcct 4861 gaccaaccca gactgggccc agagaaggat ctatacccat gtggtgggtg gattttggct 4921 ttcccaggga gcaagtttgt caggggacag agggaggcac tcaggttgga cccaggaaca 4981 ggaagggaaa ggctggggac agagagggga cctggagctg gccctgcccc accaggccca 5041 ctcatgcttt taccttctgg ccctttggcg ccccccactt cccggccaga tacgcagcct 5101 gtgtcagccc cagtgcagag ccacaggccc agcttgggca ggggcagggt gcgtgaagac 5161 tggggcaggt gcaggctgga ttgggtttcc agaggctata tatataaagg ctgccgggag 5221 ccccagggcc gctccctgag ggcacaacac tgtgggggcc cagccaggcc cgcattcctt 5281 tccagaggcc agctttccat ttatagcccc tgggcagagc agccaaggga gctgagaggg 5341 gaggactgga aagggcagag ggagaagggg cagcccaggc agcactccct ccccactccc 5401 caccaaatga gcccctcatc atgaagacag cagaagccag gcccagggcg aggtgtgcac 5461 atgcccccaa gcacagagcc taccattctg gtcagacctg cgttgagggg tgagggggct 5521 gccagggatc cctcaaagtc ctcagcccat tgctagtggc ccctcacaga acaagtccag 5581 cacctgtgga caaagggcac ccttgactag actctgcagt ataagagttt gaatgttttc 5641 agcttccaaa cttggtatcc tttttccctc cgcccccaac ccagcactgg gactaaaagg 5701 acaacatgtc ccaggttgga catacttctc cctgctctgt gggcagcagg gaagagatga 5761 tggtgttgac aaacctctct ccaaagagga gacgcaacca gaagggtgat tccaggcagg 5821 tgtggatgcc aggcatggag aggtctgaaa tggtcaccga gttcagtgag ttccaatctt 5881 tttttgagca acggaagcct ggtagcaaac aaaaatccca cttgaaagcc taatataaaa 5941 atggcatttt acccctagaa tgtctgtgtg ctttaaaaca gcgcttccta attatgggaa 6001 gaagatgtag ctgcaaatca agcttaaaac tgtcaaagca gtttagattt ataagccata 6061 agtgataaaa tattaaatgt gtttggtaag ttcaaacata taacatttac ttatttattg 6121 taaaggcaac ttgatgacag ccctgaggaa gtttttagaa actcaaagca caaaaagcaa 6181 agttgtattc acttgtctca gcatccaatt tattttgtag tttcttgctt attcagattt 6241 ggggaaaatc tagatttgca tagataagtg gtttgaatag ctcacctggg aatctcagag 6301 tactctttaa ttaagtagac tcattcattc atttgcccaa gaaatattta ttgagtgcct 6361 actatgcacc tggctttctg ctgagcctga gcaagtagga agagtatttg tttctaaatc 6421 atcaatacaa caattactta cattcctcat tgtgagtata gtgaaaaaac aataagaaga 6481 cacatcgaga tgccaaatct ctctatagta taccactaca atattgcagc atgccatgta 6541 taatgcccag ttcaagagaa catgccccag gcagatgggg tctgagcctt tcctgggaag 6601 agcaagtgta atagaacatg gatgtctaat catgtatgtg acttcccagt tttgaagaaa 6661 ttcaaccatc ttcattaatg tcacagttca agacgttcaa atatatgcat agagatgcaa 6721 agggcgtgag gccactcatt cattcactgg acatttatca ggtgcctact gtgtgactgc 6781 cactgtgatg gtttttggaa atgtcattgt gaacaaaaca tctttgcctt catggggcta 6841 agtcagtcat gaacaaacta atgaagaaga ttataaagag tggcagtgcc tggaaggaaa 6901 ttaacagggt ggtgtgaacc agagtatggg gggcagttat cttagaggga atggtcagtg 6961 atgtcccaag gaggcaagaa gatctattat ataaaatgca gtcaatcttg ggaaaagccc 7021 aggggataat gctatgagtt tagagcagca aggacaaggc tccaaggcag gatcctgttc 7081 agagtgttag aagacaagaa aggaggtcag gctaactgga gtagagcaat caaaacggat 7141 ggtgcctcca ggtgtagtaa gagacacagg ctgaggctag ctcatgcagg gcctgatagg 7201 ccaggggatg gagatgagat tttactctaa atgcaagaag ctgctgggga gtttgaagca 7261 ggaggatggc ataatcccat ttacattgta aaggtctact caggctgcta tgtaacgaat 7321 ggattatact gaggctacag aggaagtgag gagggcagcc aggttattgt ggtcatccag 7381 agcagcagtg tgacagctta gtccaggatg gtgacactag ggatggactg acttgagaca 7441 ggtcttagag gtagaatgga cagctcttcc tcatactcag acctctcctt gggaactcca 7501 gccttaaaaa tccccctgcc taatccacac ttccatctgg atatccgata tgcatttcaa 7561 atttttgccc aaacctggac acttaatcta ccacccacaa acctagtctt ctcaactcca 7621 ttcttgcagg agtcatcctt agctccttgc ttatctcact tcccacatgc aatcgtgagt 7681 aaatactgct agctccatct tgaaaacata tatctagaat tcaaccacca cttttcacca 7741 gctccactca ctggggatta ttgcaatagc ctcctaattg gtctccctcc ttctgccctt 7801 tccctccagt aaccagcatg ctcctgttaa aatgtaaatc agatagtgat attcttctct 7861 tccgctcaag ccctccagtg agatggctca agccatctca gaatataatc ctgtttctct 7921 ctctctcacc ccatttccca ctatctcttt tactcaccct gctgcagaca tatgagcctc 7981 cttgccatac tttgaatctg ccaaacatgg tcctgcctca ggacttttgc acatgcttat 8041 ccctctatct ggccctacat atctccaggt atctgcttcc aattctcttc attcaggtct 8101 ctgcttaaat gtagcagaga ggccttctct aatcactcca taatcttcat cactgtgtct 8161 ccctgccttg ctttttcttc ataaggcttc taacctcctg gcatcacata cttccttgct 8221 tatttgttca ttgtctatca ccctgcttac attgacaatg taagtgccat gtaagcagag 8281 tcttggtttt gttcattgct ctagccccac acttggactg gtacctgcca cataatggat 8341 acctaacaat atgtggattt ccgggtgacc cctgcagctt gggtggggtg aggcagaact 8401 tgctggctct gccaggattt agagttactg tcactgctgc tccatgatgt agactttact 8461 gaatgaacaa atacaggtgg gccctatgga gtaaagcgag gtgagtactt catgaaggga 8521 gaccttcagc accactacca gcagcagaga agtgaagaag ttaggacccc aacagagccc 8581 tctgagtttt gtgggaaggg aggacttctt agggcccaga acggccagct agaatgcctt 8641 ccagaagtta gtgggaaagg cacaaagatc actcctgctt aaatttctgg tttccaggag 8701 gggagatcca ggcagggatg tatggccaac ggagattcct agccagagtg ctgagaggac 8761 tgtgtgaact gcagattcag gaagaggctg agagaccccc atgggggtgg ccggtatgct 8821 gaggcttgta tgggagccag atatcccaca tcccatgggg tggttgcctc ctcctgtttc 8881 cagcctttcc agtgaggctg caggaaagag acacagctaa ggcctggaga ctcgtggcac 8941 tccgtcaggg catggtacca cagatgagtt gtaagcctgc gggacacagc atccaactct 9001 gaaagcccct tgctcgaata accctacatc accgcctgag ggcttccata tccttggtct 9061 cttcagactg tcatccccac cacaattact ccaagaaatt actgtcatcc ccaaatctat 9121 aactggaaac tgaggctcag gaaggagaca tgacttccac aaaatcacac agttgggaaa 9181 ctctggagtc tgcactcaac tggtctgcaa accgactctc ggagacttca ggtgagatga 9241 ggtcaggttc tcaggccagg tcctgaagtt tgacaccttg gcgaaatgca ctttccttga 9301 ctcagcaccg cagtgacggc ggaacgaagc cccgagcaga agggcttttc ttcccagctg 9361 aagaggcagc tcagcctaga ccccaggcat ggcactggac acccctgctg tggaaacgtg 9421 cagatttaga tggaggggat tcctaacctg ggcaggatcc gagtttggag agattggcgc 9481 gaagtttagc agcaatctcc gattcctgta caaccatagc tgggtttcta agcgtctagg 9541 gaagaaggac tgggcccacg acctgctgag caactcccag gtcggggact ggcggaatat 9601 cagagcctct acgacccgtt tgtctcgggc tcgcccactt caactctcgg ggtctctccg 9661 cctgttgttg cactcgtgcg ttctctgccc ctgacgctct aagctttctg ctttctgcgt 9721 gtctctcagc ctctttcggt ccctctttca cggtctcact cctcagctct gtgcccccaa 9781 tgccttgcct ctctccaaat ctctcacgac ctgatttcta cagccgctct acccatgggt 9841 cccccacaaa tcaggggtac agaggagtat tgaaagtcag ctcagaggtg agcgcgcgca 9901 gccagcgttt cccgcggata cagcagtcgg gtgttggaga ggtttggaaa gggcgtgccg 9961 gagagccaag tgtcagccgc ctagggcttg ccggtcgctc cctccctccc tgcccggtag 10021 gggacctagc gcgcacgcca gtgtggaggg gcgggctggc tggccagtct cgggcccctc 10081 ggccaccccg gggacccccc ccaagccccg cccccgagtg ttcctattgg cctcggactc 10141 cccctccccc agctgcccgc ctgggctccg gggcgtttag gctactacgg ataaatagcc 10201 cagggcgcct ggccgagaag ctaggggtga ggaagccctg gggcgctgcc gccgctttcc 10261 ttaaccacaa atcaggccgg acaggagagg gaggggtggg ggacagtggg tggggattca 10321 gactgccagc actttgctat ctacagccgg ggctcccgag cggcagaaag ttccggccac 10381 tctctgccgc ttgggttggg cgaaagccag gaccgtgccg cgccaccgcc aggatatgga 10441 gctactgtcg ccaccgctcc gcgacgtaga cctgacggcc cccgacggct ctctctgctc 10501 ctttgccaca acggacgact tctatgacga cccgtgtttc gactccccgg acctgcgctt 10561 cttcgaagac ctggacccgc gcctgatgca cgtgggcgcg ctcctgaaac ccgaagagca 10621 ctcgcacttc cccgcggcgg tgcacccggc cccgggcgca cgtgaggacg agcatgtgcg 10681 cgcgcccagc gggcaccacc aggcgggccg ctgcctactg tgggcctgca aggcgtgcaa 10741 gcgcaagacc accaacgccg accgccgcaa ggccgccacc atgcgcgagc ggcgccgcct 10801 gagcaaagta aatgaggcct ttgagacact caagcgctgc acgtcgagca atccaaacca 10861 gcggttgccc aaggtggaga tcctgcgcaa cgccatccgc tatatcgagg gcctgcaggc 10921 tctgctgcgc gaccaggacg ccgcgccccc tggcgccgca gccgccttct atgcgccggg 10981 cccgctgccc ccgggccgcg gcggcgagca ctacagcggc gactccgacg cgtccagccc 11041 gcgctccaac tgctccgacg gcatggtaag gccgggaccc caggaagtga ggaagttagg 11101 gcggcgctcg ggatatcagg gacgcgtttc cgagggcggg gagctggcct tgcgggaggt 11161 ttgggccagg atccttcccg agagagagga cccccttgtc ctgggcagct gtcactgggg 11221 tagcctgttt tggaagtgtg cgggcaagcg ttcgagctgc cccattgggg gcgctattag 11281 aacactgcag cgcgaacgtg aagatctttt tctctactta tccctacttc caaaatgtaa 11341 atttgcgccc cttggtgact gtccgccctt ggtttggccc tgcatgttgc agacctcatc 11401 tcctacccac ccgtaattac ccccccaacc aggacaggtc tgggcccgga actagagcct 11461 taggctagag ttagggaggg ggcggctaca ggaattggtg ttcgggcctc gagccgtccc 11521 gcgggcctga ctcagtcgcc cttgctgttt gcagatggac tacagcggcc ccccgagcgg 11581 cgcccggcgg cggaactgct acgaaggcgc ctactacaac gaggcgccca gcggtgggta 11641 ttccgggcct ctccctgctc gctcctcctc cttcatggag ctgtcctggc ctctatctag 11701 gacgctccca cccccactca cacacgccta tgtcctggga agtggtgcag gagatgaaat 11761 actaagcaag tagctccctg tcttttcgat tgtcccggac tctaactaaa gtcctcagtt 11821 tccaatctgt ctcaaagtac tgggcccggg ggtgggaggc ttgtcgcggc cccacccctg 11881 cttactaacc gagccctccc cgcgcagaac ccaggcccgg gaagagtgcg gcggtgtcga 11941 gcctagactg cctgtccagc atcgtggagc gcatctccac cgagagccct gcggcgcccg 12001 ccctcctgct ggcggacgtg ccttctgagt cgcctccgcg caggcaagag gctgccgccc 12061 ccagcgaggg agagagcagc ggcgacccca cccagtcacc ggacgccgcc ccgcagtgcc 12121 ctgcgggtgc gaaccccaac ccgatatacc aggtgctctg aggggatggt ggccgcccac 12181 ccgcccgagg gatggtgccc ctagggtccc tcgcgcccaa aagattgaac ttaaatgccc 12241 ccctcccaac agcgctttaa aagcgacttc tcttgaggta ggagaggcgg gagaactgaa 12301 gtttccgccc ccgccccaca gggcaaggac acagcgcggt tttttccacg cagcaccctt 12361 ctcggagacc cattgcgatg gccgctccgt gttcctcggt gggccagagc tgaaccttga 12421 ggggctaggt tcagctttct cgcgccctcc cccatggggg tgagaccctc gcagacctaa 12481 ccctgccccg ggatgcaccg gttatttggg ggggcgtgag acccagtgca ctccggtccc 12541 aaatgtagca ggtgtaaccg taacccaccc ccaacccgtt tcccggttca ggaccacttt 12601 ttgtaatact tttgtaatct attcctgtaa ataagagttg ctttgccaga gcaggagccc 12661 ctggggctgt atttatctct gaggcatggt gtgtggtgct acagggaatt tgtacgttta 12721 taccgcaggc gggcgagccg cgggcgctcg ctcaggtgat caaaataaag gcgctaattt 12781 ataccgccgt ggctccggct ttccctggac atgggtgtgg gatcc // LOCUS HUMKER18 6520 bp DNA PRI 09-JAN-1995 DEFINITION Human keratin 18 (K18) gene, complete cds. ACCESSION M24842 M19353 X12799 NID g186686 KEYWORDS Alu repeat; keratin; keratin 18; type I intermediate filament. SOURCE Human HeLa cell line DNA, clone 18.23. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 2281 to 3394; 3402 to 3920) AUTHORS Kulesh,D.A. and Oshima,R.G. TITLE Cloning of the human keratin 18 gene and its expression in nonepithelial mouse cells JOURNAL Mol. Cell. Biol. 8 (4), 1540-1550 (1988) MEDLINE 88246424 REFERENCE 2 (bases 1 to 6520) AUTHORS Kulesh,D.A. and Oshima,R.G. TITLE Complete structure of the gene for human keratin 18 JOURNAL Genomics 4 (3), 339-347 (1989) MEDLINE 89233125 COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by D.A.Kulesh, 24-MAY-1988 and 16-MAY-1989. The K18 gene contains an unusual ag/gc donor splice site of intron 3 instead of the consensus ag/gt sequence. FEATURES Location/Qualifiers source 1..6520 /organism="Homo sapiens" /db_xref="taxon:9606" /map="12 or 17p12-p11" misc_binding 913..919 /bound_moiety="AP1 protein" LTR 913..1253 /note="Alu repetitive sequence" misc_binding 1319..1324 /bound_moiety="SP1 protein" LTR 1930..2180 /note="Alu repetitive sequence" misc_binding 2137..2142 /bound_moiety="SP1 protein" misc_binding 2394..2399 /bound_moiety="SP1 protein" misc_binding 2409..2414 /bound_moiety="SP1 protein" misc_binding 2469..2474 /bound_moiety="SP1 protein" misc_binding 2475..2480 /bound_moiety="SP1 protein" prim_transcript 2533..6316 /note="keratin 18 mRNA and introns" CDS join(2580..2996,3738..3820,4158..4314,4887..5051, 5137..5262,5525..5748,6128..6248) /partial /note="keratin 18" /codon_start=1 /db_xref="PID:g386844" /translation="MSFTTRSTFSTNYRSLGSVQAPSYGARPVSSAASVYAGAGGSGS RISVSRSTSFRGGMGSGGLATGIAGGLAGMGGIQNEKETMQSLNDRLASYLDRVRSLE TENRRLESKIREHLEKKGPQVRDWSHYFKIIEDLRAQIFANTVDNARIVLQIDNARLA ADDFRVKYETELAMRQSVENDIHGLRKVIDDTNITRLQLETEIEALKEELLFMKKNHE EEVKGLQAQIASSGLTVEVDAPKSQDLAKIMADIRAQYDELARKNREELDKYWSQQIE ESTTVVTTQSAEVGAAETTLTELRRTVQSLEIDLDSMRNLKASLENSLREVEARYALQ MEQLNGILLHLESELAQTRAEGQRQAQEYEALLNIKVKLEAEIATYRRLLEDGEDFNL GDALDSSNSMQTIQKTTTRRIVDGKVVSETNDTKVLRH" exon <2580..2996 /gene="KRT18" /note="keratin 18; G00-120-127" /number=1 gene 2580..2996 /gene="KRT18" intron 2997..3737 /note="keratin 18 intron A" exon 3738..3820 /number=2 intron 3821..4157 /note="keratin 18 intron B" exon 4158..4314 /number=3 intron 4315..4886 /note="keratin 18 intron C (no splice consensus); putative; does not fit consensus" exon 4887..5051 /number=4 intron 5052..5136 /note="keratin 18 intron D" exon 5137..5262 /number=5 intron 5263..5524 /note="keratin 18 intron E" exon 5525..5748 /number=6 intron 5749..6127 /note="keratin 18 intron F" exon 6128..>6248 /note="keratin 18" /number=7 BASE COUNT 1490 a 1699 c 1870 g 1461 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttagca ataacagtaa aaggcagtac atagcttgtt gactccacat actttattat 61 aaaatactgc ccaacttgac agttctggaa tccagtgggg gaatataaag gtgaaagcag 121 gagagacccc tctgactgga acctcttacc tcccagaagc cttgtatgca aaaccagtgg 181 gcattcattt gtatgttatt ttgcatcccg tttgcctccc agccttcagc aggccccgac 241 cctcccctgg ccagcttcca ccctgactgc cccctggctg gctcccattg agcactgtgg 301 gctctcccca ccattaggtg acagatcagg aacaatccag gctcaggctc tttatctgtg 361 ctctgcctcc cacctggcag gtccactggc caggcttttc cagggtccct tctctcccag 421 gtctgcccta ctatttgtcc tccccttccc cctcagctgg tagctcgata agaatcaata 481 ggtccactcc agagcaaaga acacagccaa atgtgtcata ccaggccctg ccagaaaaac 541 gagctgctgg agctgacaaa cttgaaggcc aaacacctaa ggttcccccc aacacttcat 601 tcagcaggga tggtcattca gcttcagggg gcaggcagca tgaaagcctc cctacctcca 661 tccttctcac acagaggctg gggagagcat cttggaggat gcagtcccct ggggccaggc 721 ttctaatcca gacagccctt acaagggggg acaggggaag gactggcttg gagaaaagtc 781 ctagaaaaga ggggaggggc actggccacc agggctgggt cgctgctatg atggtcctag 841 gagtgcctgc ctgtcctctc aggccccatg cgatgtagga cacattactt ttatttattt 901 atttatttat tttgagtcag agtttcgctc tggttgccca ggctggagcg cgacggcacg 961 atcttggctc actgcaacct ctgcctcctg ggttcaagcg attctcctgc ctcagcctcc 1021 tgagtagctg ggattacagg cacacactgt gctggttaat ttttgtattt ttagtagaga 1081 aggggtgtca ccatgttggt caggctggtc tcaaattttt tttttttttt tttttttttt 1141 ttgagacaga gtcttgctct gttgtctagg ctggagtgca gtggcatcga actcttgacc 1201 tcaagtgatc cacccgcctc ggcctcccaa agtgcttgga ttacaggcat gagccactgt 1261 gcccggcgat gtgggacaca ttatcatctc tgtgagagat ttttggtctc ttttgtcacc 1321 gcccttctct cccagctcct agaactgggc ctggctcaca gtaggtgctg aatgcatact 1381 ggttgaattg taaatgctca ggatttgttt aattaaggat gcaggaaagg tgatataccg 1441 gtgtgcagaa gtcaggatgc attccctgtc caaatcacag tgttccactg aggcaaggcc 1501 cttgggagtg aggtcgggag aggggagggt ggtggagggg gctcagagac tgggtttgtt 1561 ttggggagtc tgcacctatt tgctgagtga atgtatgtgt gtgtgcattt gagagcacac 1621 ctctgtatga ttcgggtgtg agtgtgtgtg aggaaacgtg ggcaggcgag gagtgtttgg 1681 gagccaggtg cagctggggt gtgagtgtgt aagcaagcag ctatgaggct gggcattgct 1741 tctcctcctc ttctccagct cccagccttt cttccccggg actcctgggg ctccaggatg 1801 cccccaagat cccctccaca agtggataat ttgggctgca ggttaaggac agctagaggg 1861 actcacaggc cattccaccc gcacaccacc agacccccaa atttcttttt tctttttttt 1921 ttgagacaga gtctcactct gtcgccaggc tgcagtggcg cgatctcggc tcactgcaac 1981 ctccgcctcc caggttcaag cgattcccct tcctcagcct cccaagtagc tgagactaca 2041 ggcgtgcacc atcacgtccg gctaattttt tgtattttag tagagagggg tttcaccatg 2101 ttggctagga tggtctcgat ctcctgacct cgtgatccgc ccacctaggc ctcccaaagt 2161 gctgagatta caggcgtgag ccactgcgcc cggtcaagac tcccaaattt caaactcgcc 2221 agcacctcct ccacctgggg gagaagagca taataacgtc atttcctgcc ctgaaagcag 2281 cctcgagggc caacaacacc tgctgtccgt gtccatgccc ggttggccac cccgtttctg 2341 gggggtgagc ggggcttggc agggctgcgc ggagggcgcg ggggtggggc ccggggcgga 2401 gcggcccggg gcggagggcg cgggctccga gccgtccacc tgtggctccg gcttccgaag 2461 cggctccggg gcgggggcgg ggcctcactc tgcgatataa ctcgggtcgc gcggctcgcg 2521 caggccgcca ccgtcgtccg caaagcctga gtcctgtcct ttctctctcc ccggacagca 2581 tgagcttcac cactcgctcc accttctcca ccaactaccg gtccctgggc tctgtccagg 2641 cgcccagcta cggcgcccgg ccggtcagca gcgcggccag cgtctatgca ggcgctgggg 2701 gctctggttc ccggatctcc gtgtcccgct ccaccagctt caggggcggc atggggtccg 2761 ggggcctggc caccgggata gccgggggtc tggcaggaat gggaggcatc cagaacgaga 2821 aggagaccat gcaaagcctg aacgaccgcc tggcctctta cctggacaga gtgaggagcc 2881 tggagaccga gaaccggagg ctggagagca aaatccggga gcacttggag aagaagggac 2941 cccaggtcag agactggagc cattacttca agatcatcga ggacctgagg gctcaggtaa 3001 ggggtaggag ggacctcaac tcccagcctt gtctgaccct ccaattatac actcctttgc 3061 ctctttccgt cattccataa ccaccccaac ccctactcca ccgggagggg gttgggcata 3121 cctggatttc catccgcgca cctagccaca gggtccctaa gagcagcagc agctaggcat 3181 gggagggctc tttcccagga gagaggggga aggggacagg gttgagagct ttacagagga 3241 agtggacagc atggagggag gtaaggaaag gcctgtaaag aggaggagac actggctctg 3301 gcggaatggg gactattgga gggttaagcg gatgtggcta aggctgagtc atctaggagt 3361 aaacaagagg ccttcctttg ggaggagcca atccagggtg tagggggccc agagtgacca 3421 ggtgcactag ggaaaaaatg ccaggagagg gccaggaaga ggacttgtta gtagcgactc 3481 acttctgggc aggcaggcca gccagctagc cagcctgctg aggcttccca agaggggcag 3541 agtgctggga tctgggaatc caggaaagga gggaatgggg tggggctaga tgaaaaggga 3601 taggtgtcca gggagagcct ctggctattc ctgggaccag gaagttttca ctaggataca 3661 taacactttt tacacactca ccccacccat ccctggcttt ctattcatgg aacaacctct 3721 ctctacaatc cctccagatc ttcgcaaata ctgtggacaa tgcccgcatc gttctgcaga 3781 ttgacaatgc ccgtcttgct gctgatgact ttagagtcaa gtaagtttgg gggctagaga 3841 gctgggggtc caggggtgga gctaagaagg actgctcccc aggctgggta gttaggggca 3901 cacagtggga tcctgttagg tgtgggtgga tgagagtcag ggtccatcag tgtattcatt 3961 taactgttca tttgtataac cccgtttaag aatactgtcc tccaagtgcc aagaatggtg 4021 ctcaggggat taccacctaa ttgctgactc aagttgctgg tttgcaatgg gcacagaact 4081 tctcttagta ggtggcatga gttgagaagg ttctggatca gagatagggg cccctctgat 4141 cacctccact cctataggta tgagacagag ctggccatgc gccagtctgt ggagaacgac 4201 atccatgggc tccgcaaggt cattgatgac accaatatca cacgactgca gctggagaca 4261 gagatcgagg ctctcaagga ggagctgctc ttcatgaaga agaaccacga agaggcaagc 4321 aggggccact ggccaggcca gggattgagg ggccaagaga agtctgggtc ggagaataga 4381 caagacaaac caactgcaag tagccttgct aagacgttta gaaatagcag cctgggctct 4441 tcttaaataa gaccgttctg atgaagagca ttctcagggg gtcgagtaca ccctggctca 4501 cctgaattac aggtcaaaat gatatgggtt agaaaatgag atgagaacag aagtagaagc 4561 agctaacact aggagctggg ggtgataaag gaatgacagc agatggagtt ggcagcttcc 4621 taaaagatgg tagaaggagc aggtttgtga aggggagggt ggataaagga acagggtgaa 4681 gttacagaga aaccatcagt gagtgggtgg catttctacc cactggagta gaaaggccag 4741 aactggcatt gccctgagtg caagccaagg ggttcctcct gtctcttctc caactgtagg 4801 cctcctagaa gaggcaatca cagaagaaag gccttgttgg agctctgacc ctgaaccctc 4861 ctcacttttg cccctgtcac ctttaggaag taaaaggcct acaagcccag attgccagct 4921 ctgggttgac cgtggaggta gatgccccca aatctcagga cctcgccaag atcatggcag 4981 acatccgggc ccaatatgac gagctggctc ggaagaaccg agaggagcta gacaagtact 5041 ggtctcagca ggtgcgtgag gggaggggat ggctgccaag gtgtgggagg gaggcagacg 5101 gaatgagggg cctgatggac tgtccccatc ctgcagattg aggagagcac cacagtggtc 5161 accacacagt ctgctgaggt tggagctgct gagacgacgc tcacagagct gagacgtaca 5221 gtccagtcct tggagatcga cctggactcc atgagaaatc tggtgagtgc cttcacatca 5281 cctgcccagc tcctccttca cttggcctca gacccaaccc tgtctcaacc caaatcctat 5341 ccctcatatc atgagttcct ttagctcaga aagagtcagt ttcctctttg catttccctc 5401 cactcctatc ccttatccca gtacttggca catagcaggt gcccaaaaaa gtttccaaaa 5461 gtgaagggat gagcagtcct gggactctgg gctcaccctg cccctcctct ctgtgcccct 5521 gcagaaggcc agcttggaga acagcctgag ggaggtggag gcccgctacg ccctacagat 5581 ggagcagctc aacgggatcc tgctgcacct tgagtcagag ctggcacaga cccgggcaga 5641 gggacagcgc caggcccagg agtatgaggc cctgctgaac atcaaggtca agctggaggc 5701 tgagatcgcc acctaccgcc gcctgctgga agatggcgag gactttaagt gagtggggct 5761 ctcctaccca cacgtgctgg gatcaggaga tcacttctcc ccaaagtctg agcttttgga 5821 agcaccccat gtgtctgttc actggtatcc actgagcact gggccgttgc tccgtgggtg 5881 ctcctgtgtc ttcaagggag taacagttac agaggtctcc cccttgaaga aagcaaacta 5941 agtattgtcc ctagctgtac ttagtatgca aatgaagttt ggccttgagt ttcccttttc 6001 tggaggaaga ggctgagggt gatttggaga taaaggtaga ggtcaggagg ctttttccct 6061 ctacctttct tgtctccctt ctactccacg gggctgttta taacttgggc ttggtcttct 6121 gttacagtct tggtgatgcc ttggacagca gcaactccat gcaaaccatc caaaagacca 6181 ccacccgccg gatagtggat ggcaaagtgg tgtctgagac caatgacacc aaagttctga 6241 ggcattaagc cagcagaagc agggtaccct ttgggagcag gaggccaata aaaagttcag 6301 agttcattgg atgtcacttt gtcttctttt ggctgttttc attgtgcaca aatgccctaa 6361 cccaacagtc ccatccctga tccagcagaa accacctctg acccctgagg tttcatatag 6421 attggggtgt agaaggaaga gggatctgta ttcttggaaa cacttctgag agacagagga 6481 gggagcagta gatgtgatgg gtcacaggct gtggggatcc // LOCUS HUMADRA 1521 bp DNA PRI 30-OCT-1994 DEFINITION Human platelet alpha-2-adrenergic receptor gene, complete cds. ACCESSION M18415 NID g178191 KEYWORDS alpha-2-adrenergic receptor; alpha-adrenergic receptor. SOURCE Human (lambda-EMBL 3 library) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1521) AUTHORS Kobilka,B.K., Matsui,H., Kobilka,T.S., Yang-Feng,T.L., Francke,U., Caron,M.G., Lefkowitz,R.J. and Regan,J.W. TITLE Cloning, sequencing, and expression of the gene coding for the human platelet alpha 2-adrenergic receptor JOURNAL Science 238 (4827), 650-656 (1987) MEDLINE 88042789 FEATURES Location/Qualifiers source 1..1521 /organism="Homo sapiens" /db_xref="taxon:9606" /map="10q23-q25" gene 59..1411 /gene="ZNF32" CDS 59..1411 /gene="ZNF32" /note="alpha-2-adrenergic receptor old gene name 'ADRA2R'" /codon_start=1 /db_xref="GDB:G00-125-339" /db_xref="PID:g178192" /translation="MGSLQPDAGNASWNGTEAPGGGARATPYSLQVTLTLVCLAGLLM LLTVFGNVLVIIAVFTSRALKAPQNLFLVSLASADILVATLVIPFSLANEVMGYWYFG KTWCEIYLALDVLFCTSSIVHLCAISLDRYWSITQAIEYNLKRTPRRIKAIIITCWVI SAVISFPPLISIEKKGGGGGPQPAEPRCEINDQKWYVISSCIGSFFAPCLIMILVYVR IYQIAKRRTRVPPSRRGPDAVAAPPGGTERRPNGLGPERSAGPGGAEAEPLPTQLNGA PGEPAPAGPRDTDALDLEESSSSDHAERPPGPRRPERGPRGKGKARASQVKPGDSLRG AGRGRRGSGRRLQGRGRSASGLPRRRAGAGGQNLEKRFTFVLAVVIGVFVVCWFPFFF TYTLTAVGCSVPRTLFKFFFWFGYCNSSLNPVIYTIFNHDFRRAFKKILCRGDRKRIV " BASE COUNT 223 a 546 c 499 g 253 t ORIGIN Chromosome 10q23-q25. 1 cccgccttca tcttccgcca ggaggccaag gccgttggcc gagggcagct ttgcgcccat 61 gggctccctg cagccggacg cgggcaacgc gagctggaac gggaccgagg cgccgggggg 121 cggcgcccgg gccacccctt actccctgca ggtgacgctg acgctggtgt gcctggccgg 181 cctgctcatg ctgctcaccg tgttcggcaa cgtgctcgtc atcatcgccg tgttcacgag 241 ccgcgcgctc aaggcgcccc aaaacctctt cctggtgtct ctggcctcgg ccgacatcct 301 ggtggccacg ctcgtcatcc ctttctcgct ggccaacgag gtcatgggct actggtactt 361 cggcaagact tggtgcgaga tctacctggc gctcgacgtg ctcttctgca cgtcgtccat 421 cgtgcacctg tgcgccatca gcctggaccg ctactggtcc atcacacagg ccatcgagta 481 caacctgaag cgcacgccgc gccgcatcaa ggccatcatc atcacctgtt gggtcatctc 541 ggccgtcatc tccttcccgc cgctcatctc catcgagaag aagggcggcg gcggcggccc 601 gcagccggcc gagccgcgct gcgagatcaa cgaccagaag tggtacgtca tctcgtcgtg 661 catcggctcc ttcttcgctc cctgcctcat catgatcctg gtctacgtgc gcatctacca 721 gatcgccaag cgtcgcaccc gcgtgccacc cagccgccgg ggtccggacg ccgtcgccgc 781 gccgccgggg ggcaccgagc gcaggcccaa cggtctgggc cccgagcgca gcgcgggccc 841 ggggggcgca gaggccgaac cgctgcccac ccagctcaac ggcgcccctg gcgagcccgc 901 gccggccggg ccgcgcgaca ccgacgcgct ggacctggag gagagctcgt cttccgacca 961 cgccgagcgg cctccagggc cccgcagacc cgagcgcggt ccccggggca aaggcaaggc 1021 ccgagcgagc caggtgaagc cgggcgacag cctgcgcggc gcgggccggg ggcgacgggg 1081 atcgggacgc cggctgcagg gccgggggag gagcgcgtcg gggctgccaa ggcgtcgcgc 1141 tggcgcgggc gggcagaacc tcgagaagcg cttcacgttc gtgctggccg tggtcatcgg 1201 agtgttcgtg gtgtgctggt tccccttctt cttcacctac acgctcacgg ccgtcgggtg 1261 ctccgtgcca cgcacgctct tcaaattctt cttctggttc ggctactgca acagctcgtt 1321 gaacccggtc atctacacca tcttcaacca cgatttccgc cgcgccttca agaagatcct 1381 ctgtcggggg gacaggaagc ggatcgtgtg aggtttccgc tggcgcccgc gtagactcac 1441 gctgactgca ggcagcgggg ggcatcgagg ggtgcttagc ccgagggcac tcagaaaccc 1501 gggcgctgct gctctgcgtt t // LOCUS HSMHCPU15 5833 bp DNA PRI 29-JUL-1993 DEFINITION H.sapiens gene for major histocompatibility complex encoded proteasome subunit LMP2. ACCESSION Z14977 S47822 S47823 S47869 S47870 S47871 NID g34655 KEYWORDS major histocompatibility complex; proteasome subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5833) AUTHORS Fruh,K., Yang,Y., Arnold,D., Chambers,J., Wu,L., Waters,J.B., Spies,T. and Peterson,P.A. TITLE Alternative exon usage and processing of the major histocompatibility complex-encoded proteasome subunits JOURNAL J. Biol. Chem. 267 (31), 22131-22140 (1992) MEDLINE 93054490 REFERENCE 2 (bases 1 to 5833) AUTHORS Fruh,K. TITLE Direct Submission JOURNAL Submitted (04-AUG-1992) Fruh K., Scripps Research Institute, Immunology, 10666 North Torrey Pines Road, La Jolla, California, USA, 92037 FEATURES Location/Qualifiers source 1..5833 /organism="Homo sapiens" /db_xref="taxon:9606" /haplotype="HLA DR 7" /cell_line="MANN" /clone="U15" /chromosome="6" prim_transcript 465..5833 /gene="MHC-encoded proteasome subunit gene" exon 465..570 /gene="MHC-encoded proteasome subunit gene" /number=1 mRNA join(465..570,2424..2491,3547..3678,4289..4418,4648..4789, 5688..5833) /gene="MHC-encoded proteasome subunit gene" gene 465..5833 /gene="MHC-encoded proteasome subunit gene" CDS join(511..570,2424..2491,3547..3678,4289..4418,4648..4789, 5688..5815) /gene="MHC-encoded proteasome subunit gene" /codon_start=1 /db_xref="PID:g34656" /db_xref="SWISS-PROT:P28065" /translation="MLRAGAPTGDLPRAGEVHTGTTIMAVEFDGGVVMGSDSRVSAGE AVVNRVFDKLSPLHERIYCALSGSAADAQAVADMAAYQLELHGIELEEPPLVLAAANV VRNISYKYREDLSAHLMVAGWDQREGGQVYGTLGGMLTRQPFAIGGSGSTFIYGYVDA AYKPGMSPEECRRFTTDAIALAMSRDGSSGGVIYLVTITAAGVDHRVILGNELPKFYD E" intron 571..2423 /gene="MHC-encoded proteasome subunit gene" /number=1 exon 2424..2491 /gene="MHC-encoded proteasome subunit gene" /number=2 intron 2492..3546 /gene="MHC-encoded proteasome subunit gene" /number=2 repeat_unit 3208..3505 /gene="MHC-encoded proteasome subunit gene" /rpt_family="ALU" exon 3547..3678 /gene="MHC-encoded proteasome subunit gene" /number=3 intron 3679..4288 /gene="MHC-encoded proteasome subunit gene" /number=3 exon 4289..4418 /gene="MHC-encoded proteasome subunit gene" /number=4 intron 4419..4647 /gene="MHC-encoded proteasome subunit gene" /number=4 exon 4648..4789 /gene="MHC-encoded proteasome subunit gene" /number=5 intron 4790..5687 /gene="MHC-encoded proteasome subunit gene" /number=5 repeat_unit 5076..5354 /gene="MHC-encoded proteasome subunit gene" /rpt_family="ALU" exon 5688..5833 /gene="MHC-encoded proteasome subunit gene" /number=6 BASE COUNT 1490 a 1311 c 1545 g 1487 t ORIGIN 1 gtcctcccct actggcggct gggggaggga acgagggcgg ggctctcgga aagtcccagg 61 aacaggctga tcctgcgctg gcgagaagct cagccattta ggggaaagcg aaatcgaaag 121 cggccgctac tcactagata acgcctactt ccaaaagtgg cctgcccaga ctattttggt 181 agcaagcgtg gaaatcagat ctgagaatct cgggagcagc cctggtgccc aattttctcc 241 atcacgcaca cccttctcgc ctctccctgc ctcctgcctt tccacttgca ccagttttcc 301 caccccagcc tcagggcggg gctgcctcgt cacttgtctc ggggcagatc tgccctacac 361 acgttagcgc cgcgcgcaaa gcagccccgc agcacccagg cgcctcctgg cggcgccgcg 421 aaggggcggg gctgtcggct gcgcgttgtg cgctgtccca ggttggaaac cagtgcccca 481 ggcggcgaag gagagcggtg ccttgcaggg atgctgcggg cgggagcacc aaccggggac 541 ttaccccggg cgggagaagt ccacaccggg gttaatgggt ctgggcttga gggttggcag 601 aggggtggag gagatgcagc ggccagggga ccctggaagc gcgcgcggag aagtgaatgc 661 agagaccaac gggagcgcag ggaggtcgcc tgtagcagcc agcgcttgca acccgcaatg 721 agcatagagt atttcttttc tgaggggggt cgtctagagt gtccgtgaag ggaacaggca 781 cgcgaggctg gtggaaaaag cgggtgcttt gactcttagc tggaagcgtc aacgggaagc 841 tactctaaag cgctttcgct ttcactctgg tcccggacag tgggggctgg ttaaatcaag 901 aaagggggtt ggggatggtg caaagagatg aggaaatggt gccctgggtg aagtagaaca 961 gcacttggga gaaggaaata taggcactta ttgagaagga ccaactcatc acacagactt 1021 ttgataaact tgccactggg caactcttag cccaagcact gataatgggc gttctgtgtt 1081 aactagtgat gcccttccct agcttgaccc aggaaggtct ctccttggcc cagatgctgc 1141 cttactccct tccctgtgtc ttccctgccc actcccatgt gcccactggg ggggactttg 1201 cttaggatgg gcgcctggtg cagatggcag ccccaagact ggctggctgg cttctgctct 1261 ggactactgc caccactcgt ggcttggggg cggctttgtt agagaggaat agcctctaac 1321 ttgaagttaa ccctgttctt tgaccctcta ttcatgataa gtcggtccgt cggaaagcat 1381 actcagagga gcgtcctttg gggccagagt aacttacggc ctggtaagaa agacacagtg 1441 aaaccactta taatttggga aatctcccct cactgccaaa tgagcagtgg caagtaggaa 1501 gtagaagtgg aaacaaggga taagagttag acctgaattt tagtcccagg tctactatta 1561 actctgtgtg actttgcata agtcgtttgc attttctgtg acttggtttc ctcatttgaa 1621 ccgaggatct ttaaggctcc ttccaactca atagtagaat aaatgtagct ttatcttccc 1681 tcacttcttc ttgattcttt tcttgacctg gaaaagtcag cttaaacttc tcagtcaaat 1741 tatctcttgg tacaaattta cctccctggg tgctgagata tgtatttacc tcttgatcgg 1801 aaattccata actgaaactt ttattttcac ccatctgtat gtgttccttg ctgcttctct 1861 cctgccttgc ccctggacat gctaaccact gccctcctcg attttttcca atgtacagta 1921 aattggaaga gctcacttct gatgaaatgg ggggtgagag tggaggattg tgggaccaaa 1981 aaaaaaaaaa tagactgacc ttgtttccca agatcatagt caattactct gtgttgggtc 2041 tacaccacat ctgcacatac tatgagccct tccgttggag ataattttca cttgcggagc 2101 tgcttcactt ctacctgtag gagcctcatc tccacctctc tacagtggag aggattccac 2161 taggcaagtt ggaacttagg gacacagttc tttctgtgtt gtatcacagc tgggctgtgg 2221 cattcccctg cagccggatg aagcaataga gaaagtggaa agatgaaggg aaaaaaagcc 2281 tgtactgaca gtcagctctg gcctgttact gtgtaatctt tgagccagtc acttcgcctc 2341 tctgggaatg tttcttcttc tctaacatga gggcatcaag gctgttcttg ccctgacatt 2401 ccatattgct gtgtgctctg cagaccacca tcatggcagt ggagtttgac gggggcgttg 2461 tgatgggttc tgattcccga gtgtctgcag ggtgagtaaa agtgaagatg tatgcatttg 2521 gaaagaagct aatggcctca aatacacact ttccttaccc attcatgaaa agactggcaa 2581 actggagcct tggaggaatg gagttgacct tccccaaaag ccactatgat aagctatttg 2641 gtgggtgctt gggtctctga atttgtggag gaggatctgg ggtctgaatg tgtatgtgac 2701 ctgtcccagt agtgtacagg gatgagtaaa ggaatagggt ctgagagggg gacaggagat 2761 agatttttga gggtcttctt tccatctgtg cttagggatc aaaaagatga ttctgtcaag 2821 cagatacctg gtttctcatt taccatatat tgaactattt tggctcttct cccactccta 2881 accaatttcc tcacatgcaa aatgagtata tggggttagg tcaatattac tgacattatg 2941 ttccatagaa cataactctc tcaagattgt taatagcaaa gaaaattgat gaggcatatt 3001 tttcttacct tagcattttt tgctttgtta taaaatctaa gcctgaaaaa taagcctaat 3061 tttgattaac atctgcagtg attaataata tctgagatga ttatttgcct cctgctttaa 3121 tccaagcatt aaacttcatg ctattctctt gtcaaagaaa tttgagagac attgaatgat 3181 caccctcaaa aattcctgag ttctggttgg gtgcagtggc tcacatctat aatctcagca 3241 ctttgggatg ccgaggtggg cagatattta aggtcaggag tttgagacca gcctggccaa 3301 catgttggga ccttgtctct actgaaaata caaacattag ctgggcttgg tggtgggtgc 3361 ctgtaatccc agctattcgg gaggctgagg caggagaatc acttgaacca gggaggcgaa 3421 gtttgcagtg agcccaagat tgatccactg cactccagcc tgggtgacag agtgagactg 3481 tctcaaaaaa aaaaaaaaaa aaaaacctga gttttaactt ggtgactgtt gactccctcc 3541 tgacagcgag gcggtggtga accgagtgtt tgacaagctg tccccgctgc acgagcgcat 3601 ctactgtgca ctctctggtt cagctgctga tgcccaagcc gtggccgaca tggccgccta 3661 ccagctggag ctccatgggt atgaagctct ggagttctga ctccccaccc actagagctc 3721 ccccaacctg catgaatccc tgtacagtgt gctgttccag gagctggaca ctgggaaatg 3781 gaaaagtctt gtttcggctc ttgctggcac ttgaatctgt cagtttctgc atctgtaaag 3841 tggagataat atagtacctc atgagacggt tattttgaga accacattct atatgtgaac 3901 acagtttaaa agctgtaaat cactatcctg atataaataa tcaggaagaa ggtgatattg 3961 tgacccacca taatatcagg cagttaccat acgagaaatc aaggtcgttg ggacggaagt 4021 aaccttatct gcttttcccc ataagagcag ggtccttgca gccacaagaa agttatgtgg 4081 gtggggctga ccaaaagagt gagcaattga aagcttctta ccagttggtg gtgtgggact 4141 ctggttcccc tgtacatgtg ggagggaggc tgcagtttga gctattgcag ttacagtttt 4201 caggggtcgt ttagcaggga tgatggtaac agtataggag aatgagactt aaaattctat 4261 caacctttat tcctaatatt tccctcagga tagaactgga ggaacctcca cttgttttgg 4321 ctgctgcaaa tgtggtgaga aatatcagct ataaatatcg agaggacttg tctgcacatc 4381 tcatggtagc tggctgggac caacgtgaag gaggtcaggt gagtttctcc caaagcactc 4441 tctcctctgg gcttccccac tctcctgcag aggaagatgg aagtcctatg tcattctagc 4501 aatgagttcc aaggacacta cctctgaaag catagtactt tggggatatg agataccagg 4561 gcttcattgc agggtgcaga gaccacttaa tgtctcagtg ggaaggaagg gcttgatgat 4621 tctttaacct gaggatccct ttcccaggta tatggaaccc tgggaggaat gctgactcga 4681 cagccttttg ccattggtgg ctccggcagc acctttatct atggttatgt ggatgcagca 4741 tataagccag gcatgtctcc cgaggagtgc aggcgcttca ccacagacgg taaccagcca 4801 agtggaaggg tacctgggga gggctttgaa acatgggaag gaagtagatt atgaggaaca 4861 ggaagagaaa tacaggggtg gccatttaag ttaatgccgg gcctggtaca cttttaagag 4921 tgaaaagggg caggacaaat gcaaagctca atggggttct tgggcaatac ggataaccca 4981 gggctgttct gagtaaatca aatgaggata cacagtcact gtgagaccca gtggtgtgct 5041 aagcacagtg gctcacacct gtaatgccaa caatttggga ggctgaggca ggaggattac 5101 ttgaccccag gagtttgagg ccagcctagg caagatggtg aaaccctgtc tccacaaaaa 5161 aacaataaaa aaaagtaaaa aaaaaattga cctgggcatg gtggtgaaca cctgtggtcc 5221 cagttactca ggaggttgag gtggaaagat catctgagcc ggggagatca aggttgtagt 5281 gagcggtgat tgaaccactg cgctgcagcc taggtgacag agagagaccc tgtctggaga 5341 aaaaaaaaaa aaaagaacca gtggtgtgct gaggtgtact gaggctggct tgggaccact 5401 catgagagcg gactgttaaa tagtccagga tttgtgaact gcttagctat ttgtaacttg 5461 caattcatca tagcgggagc atttacacca cggacatcag cagatgccac atatggaagc 5521 ctttttgtaa aaaaactgat ttaccagcac accactaaat atgccttcct ggaagatgag 5581 ttttgaggtg aaagtggtag taggcatatg gatggagggg gagtaaaaag atttttgaag 5641 ctaagccatc ctctctctcc ctctctccaa cttgaaaccc tctgcagcta ttgctctggc 5701 catgagccgg gatggctcaa gcgggggtgt catctacctg gtcactatta cagctgccgg 5761 tgtggaccat cgagtcatct tgggcaatga actgccaaaa ttctatgatg agtgaacctt 5821 ccccagactt ctc // LOCUS HSU72648 4850 bp DNA PRI 19-DEC-1996 DEFINITION Human alpha2-C4-adrenergic receptor gene, complete cds. ACCESSION U72648 NID g1737180 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4850) AUTHORS Schaak,S., Devedjian,J.C., Cayla,C., Sender,Y. and Paris,H. TITLE Direct Submission JOURNAL Submitted (26-SEP-1996) Unite 317, INSERM, CHU Rangueil, Toulouse 31054, France COMMENT On Dec 19, 1996 this sequence version replaced gi:1628637. FEATURES Location/Qualifiers source 1..4850 /organism="Homo sapiens" /note="in superCos 1 vector" /db_xref="taxon:9606" /chromosome="4" /dev_stage="fetus" /tissue_type="brain" /clone_lib="Stratagene # 961200" TATA_signal 1853..1858 CDS 2807..4192 /codon_start=1 /product="alpha2-C4-adrenergic receptor" /db_xref="PID:g1628638" /translation="MASPALAAALAVAAAAGPNASGAGERGSGGVANASGASWGPPRG QYSAGAVAGLAAVVGFLIVFTVVGNVLVVIAVLTSRALRAPQNLFLVSLASADILVAT LVMPFSLANELMAYWYFGQVWCGVYLALDVLFCTSSIVHLCAISLDRYWSVTQAVEYN LKRTPRRVKATIVAVWLISAVISFPPLVSLYRQPDGAAYPQCGLNDETWYILSSCIGS FFAPCLIMGLVYARIYRVAKRRTRTLSEKRAPVGPDGASPTTENGLGAAAGEARTGTA RPRPPTWSRTRAAQRPRGGAPGPLRRGGRRRAGAEGGAGGADGQGAGPGAAQSGALTA SRSPGPGGRLSRASSRSVEFFLSRRRRARSSVCRRKVAQAREKRFTFVLAVVMGVFVL CWFPFFFIYSLYGICREACQVPGPLFKFFFWIGYCNSSLNPVIYTVFNQDFRPSFKHI LFRRRRRGFRQ" polyA_signal 4727..4732 polyA_site 4746 BASE COUNT 769 a 1686 c 1636 g 759 t ORIGIN 1 ggatcctaga tagatcccca tgcccctgcg catcccccaa agcacttctc aaaaagagag 61 cccttcagga tgggggtggg aggcacgctg atgccgaggg agagagacct tctccaaagc 121 ctgcatctgg cccctgccac tgctctttcc tagtcctcct gtctccccac gcccccagta 181 caaagcccac tcccactgga ggaaggacag cctacagagg cctccccctt cctcataaca 241 cctcctcatt ctctaagcac agtttggtca cctcctccag gaaggcttct ttgacaaccc 301 cgctcccagg ctgggggagg agcccagggc actgatggtt gtagcgcatt cacctcagtg 361 gtggccccat ccacccacag atggtgagca cctcaactga ggccaggccc ccctcactct 421 ctgccatcgc agagggctca atagagacaa gggctccagc gggccgacca tgaaggaagc 481 aggtgtgtgt gggcttccct gaaaaatcga cctccaggtg ccctggacag gtggcacctt 541 ttaggcaacc agaaccagga gggcctttag taaggatggt gaccctccac catccccagg 601 tcacacacct cccacagaca gccatggcct gttgcctccc gaaggcatat gtgccctaga 661 ctagcacagg actggcacag aaggctttgg ctccaacgtg cgccctgctt ggcaggccca 721 tcatctgaaa tatttgcttt ctccggggcc cggctcggcc tccacacccc caatgcaggc 781 cctgctggtg acaaacatgg tgaagaacga agccgtagct cctggcagtg tctctgcgga 841 gacagacatg ctccccagta gggctgagca gcagagaaca cgacaggctc tgaggctaaa 901 gctggcgccc ttcaggagct cagagagaag ctgccccagg ccacgggcat ggataggcat 961 cccgggggta gggcagtcac tccagccgct cttcatgatg acctcccggg cactcagaag 1021 cgccccttcc ccgccggttc tagccacagg atggacagac cttccctccc tctctcctgc 1081 gggcccatgc agggagaggc cagccagctg gggttcgttt ctggacgttt gggagaaggg 1141 tcgtcccttc ggtcctagca gagaaacgtc aggcctgaga actccagctc tcctcctgga 1201 tcacctcgtc cctggaagtc cctctcccac agaccccagg ctgcctggag gcagaagcca 1261 gcacccaaca tgggccacag agtggcagtg cacgtggcag tgcacatctg cagaatggag 1321 gtgcgcggac gggggcgagg gtgcacgggg gctacctcaa cggccgctct cccctgagcc 1381 cacccttgca aaaagctcca tcatccaggc agctccgttc tgggtgtttg tcagaggtgg 1441 tgactttgaa cacctctcca tcctttaaaa gcagggcctg agagcctaag gattaaactt 1501 caatcactgc taacatgggt gcagagcttg ggagcttgcc cagggcttcc acagacaccg 1561 ccgttagtta tgattcatgc tccagcagcg ctgggaagtg ggccgaggga actgaggccg 1621 gggtgggtgc ggaggtggct ctgacgggga gcgccgatcc aggatctggt ccggaccctg 1681 agctcggcag gacctgtgcg cacgcaggta gcgtcgcgta ctcccccagg ctgcgagagg 1741 ttgctctgcc atcaggccat ggaccccgag ccgcccgtgc tgctcgcccg cgtgtgcgcg 1801 cccccgcctg cgcccatgcc tgcgccgggg gaggcgaagg aggctccagt cttagaaaga 1861 gcagcttctg gaactcaccg cccaggccgg ccgccgctcg gccccgtccc gcaggctgca 1921 ggcggccctg gagggggcgc cctcgccgag cgcgcgcccc gcgccgccgc cccggactcc 1981 tccccggcgc cgcgcgggca ggttcgacca ggcggccgcg ggctccggtt cccggccagc 2041 tccccagggc ccgcggcgcc ccgccccgcg cgcccgcccc gctgcgctaa ctcgacccaa 2101 gttggaagcc gatcgcaggc ggccgcactc gcgcccagcg agggcggcgg cggcggcggc 2161 ggcgcagctc cggcgagcga ggcggcggcc gcacggcaag cgtggaccgc ggggggcgcc 2221 cgcgccggga gcagccggag gactcgcggc ggcgccggcg ccccgcccgg gaaagtaaag 2281 ttggagacgg agggagcgcg cggggcgggc ccggaggagc ggcggccggc gccccggcgc 2341 gcgcagccct agccgccgga tgggaggcgg acggcccggg ccgccgccgc cttgtcgcct 2401 gcgccccggc tgggctccgg gaccgcgggg ccgctacggc accgccgctc ggcccgcgtc 2461 gcgtgggctc gccgccgggg cgctcccgtg agccgggccg aggcggggcg cgcgaggacc 2521 ccgggacctg cccccctccc cccgcagccg cgtcgccgct cgctccgggc gcctcctgct 2581 ctgcacttac acgctcggca gctgcgggga gcccggcagc cacgctctcc ggcgcgccgc 2641 ccgcggagcc accacggccg agggccggct gctgggcgcc gcggtccccg gcgggcgcgc 2701 ccgagcagca ggcggcgatg cgggcgccga ccccggctgg ggggcgcccg agctgccgcg 2761 gctgcgcccc ggctccagga gggacggcgt agctcgcggg aggaccatgg cgtccccggc 2821 gctggcggcg gcgctggcgg tggcggcagc ggcgggcccc aatgcgagcg gcgcgggcga 2881 gaggggcagc ggcggggttg ccaatgcctc gggggcttcc tgggggccgc cgcgcggcca 2941 gtactcggcg ggcgcggtgg cagggctggc tgccgtggtg ggcttcctca tcgtcttcac 3001 cgtggtgggc aacgtgctgg tggtgatcgc cgtgctgacc agccgggcgc tgcgcgcgcc 3061 acagaacctc ttcctggtgt cgctggcctc ggccgacatc ctggtggcca cgctggtcat 3121 gcccttctcg ttggccaacg agctcatggc ctactggtac ttcgggcagg tgtggtgcgg 3181 cgtgtacctg gcgctcgatg tgctgttttg cacctcgtcg atcgtgcatc tgtgtgccat 3241 cagcctggac cgctactggt cggtgacgca ggccgtcgag tacaacctga agcgcacacc 3301 acgccgcgtc aaggccacca tcgtcgccgt gtggctcatc tcggccgtca tctccttccc 3361 gccgctggtc tcgctctacc gccagcccga cggcgccgcc tacccgcagt gcggcctcaa 3421 cgacgagacc tggtacatcc tgtcctcctg catcggctcc ttcttcgcgc cctgcctcat 3481 catgggcctg gtctacgcgc gcatctaccg agtggccaag cgtcgcacgc gcacgctcag 3541 cgagaagcgc gcccccgtgg gccccgacgg tgcgtccccg actaccgaaa acgggctggg 3601 cgcggcggca ggcgaggcga gaacgggcac tgcgcgcccc cgcccgccga cgtggagccg 3661 gacgagagca gcgcagcggc cgagaggcgg cgcgccgggg ccgttgcggc ggggcgggcg 3721 gcggcgagcg ggcgcggagg ggggcgcggg cggtgcggac gggcaggggg cggggccggg 3781 ggcggctcag tcgggggcgc tgaccgcctc caggtccccg gggcccggtg gccgcctctc 3841 gcgcgccagc tcgcgctccg tcgagttctt cctgtcgcgc cggcgccggg cgcgcagcag 3901 cgtgtgccgc cgcaaggtgg cccaggcgcg cgagaagcgc ttcacctttg tgctggctgt 3961 ggtcatgggc gtgttcgtgc tctgctggtt ccccttcttc ttcatctaca gcctgtacgg 4021 catctgccgc gaggcctgcc aggtgcccgg cccgctcttc aagttcttct tctggatcgg 4081 ctactgcaac agctcgctca acccggtcat ctacacggtc ttcaaccagg atttccggcc 4141 atccttcaag cacatcctct tccgacggag gagaaggggc ttcaggcagt gactcgcacc 4201 cgtctgggaa tcctggacag ctccgcgctc ggggctgggc agaaggggcg gcccggacgc 4261 gggggagctt tcccagagac ccggggagct ttcccagaga cccggggatg gattggcctc 4321 cagggcgcag gggagggtgc ggcagggcag gagcttggca gagagatagc cgggctccag 4381 ggagtgggga ggagagaggg ggagacccct ttgccttccc ccctcagcaa ggggctgctt 4441 ctggggctcc ctgcctggat ccagctctgg gagccctgcc gaggtgtggc tgtgaggtca 4501 gggttttaga gagcagtggc agaggtagcc ccctaaatgg gcaagcaagg agccccccaa 4561 agacactacc actccccatc cccgtctgac caagggctga cttctccagg acctagtcgg 4621 ggggtggctg ccagggggca aggagaaagc accgacaatc tttgattact gaaagtattt 4681 aaatgtttgc caaaaacaac agccaaaaca accaaactat tttctaaata aacctttgta 4741 atctaatgtt gggtgccgca gtcagtcctt gagcttgggg gctgggggta tcttccagac 4801 cctcaccctg cctgacaccc tcccccatct cccccacccc acaatgctgg // LOCUS HUMMK 4638 bp DNA PRI 12-SEP-1992 DEFINITION Human midkine gene, complete cds. ACCESSION D10604 D90540 NID g219928 KEYWORDS midkine. SOURCE Homo sapiens placenta DNA, clone_lib:genomic library. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4638) AUTHORS Uehara,K., Matsubara,S., Kadomatsu,K., Tsutsui,J. and Muramatsu,T. TITLE Genomic structure of human midkine (MK), a retinoic acid-responsive growth/differentiation factor JOURNAL J. Biochem. 111 (5), 563-567 (1992) MEDLINE 92348340 REFERENCE 2 (bases 1 to 4638) AUTHORS Uehara,K. TITLE Direct Submission JOURNAL Submitted (21-OCT-1991) to the DDBJ/EMBL/GenBank databases. Kazuyoshi Uehara, Faculty of Medicine, Kagoshima University, Department of Biochemistry; 8-35-1 Sakuragaoka, Kagoshima, Kagoshima 890, Japan (Tel:0992-64-2211(ex.2079), Fax:0992-64-5618) COMMENT Submitted (21-Oct-1991) to DDBJ by: Kazuyoshi Uehara Department of Biochemistry Faculty of Medicine, Kagoshima University 8-35-1 Sakuragaoka Kagoshima 890 Japan Phone: 0992-64-2211 x2079 Fax: 0992-64-5618. FEATURES Location/Qualifiers source 1..4638 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="genomic library" /tissue_type="placenta" protein_bind 708..712 /bound_moiety="retinoic acid receptor" enhancer 1094..1100 /note="IgG enhancer element" protein_bind 1104..1110 /bound_moiety="AP-1" protein_bind 1452..1461 /bound_moiety="NF-kB" protein_bind 1582..1586 /bound_moiety="retinoic acid receptor" GC_signal 2117..2122 GC_signal 2133..2138 GC_signal 2154..2159 GC_signal 2178..2188 GC_signal 2216..2221 protein_bind 2223..2235 /bound_moiety="steroid/thyroid hormon receptor" exon 2286..2330 /number=1 exon 2627..2703 /number=2 gene 2628..4058 /gene="MK" CDS join(2628..2703,2864..3031,3154..3315,4033..4058) /gene="MK" /codon_start=1 /product="midkine" /db_xref="PID:d1001932" /db_xref="PID:g219929" /translation="MQHRGFLLLTLLALLALTSAVAKKKDKVKKGGPGSECAEWAWGP CTPSSKDCGVGFREGTCGAQTQRIRCRVPCNWKKEFGADCKYKFENWGACDGGTGTKV RQGTLKKARYNAQCQETIRVTKPCTPKTKAKAKAKKGKGKD" exon 2864..3031 /gene="MK" /number=3 exon 3154..3315 /gene="MK" /number=4 exon 4033..4188 /number=5 polyA_signal 4342..4347 polyA_signal 4364..4369 BASE COUNT 814 a 1430 c 1637 g 757 t ORIGIN SauSAI site. 1 gatcagggga cgggatgggg tacacagcca gcccctgctc ccccagcggg gagacctgtt 61 tgcaccaagc agcggccctg ggccagcgca ccatctgcca ctacatcgtg gaggccgggg 121 cctcgctcat gaagacagac cagcaggtga gcagacggca ggcagggagc ccacgagggc 181 accaaccaaa cctttcccaa ggtcctaggc gggagctggg gctgggggct gtccctggga 241 agacacagtc cagaccctgg gaaacctgag ccagcagggg aggagctggt gggcagagag 301 gcctccctcc ctgaccaggc cacagggagg tagagcccct gcctctcagc ctgctagggg 361 ttaggcctgc ctctggcccc tgctgatcgc agctccgccc tcctccaggg cgacactccc 421 cggcagcggg ctgagaaggc tcaggacacc gagctggccg cctacctgga gaaccggcag 481 cactaccaga tgatccagcg ggaggaccag gagacggctg tgtagcgggc cgcccacggg 541 cagcaggagg gacaatgcgg ccaggggacg agcgccttcc ttgcccacct cactgccaca 601 ttccagtggg acggccacgg ggggacctag gccccaggga aagagcccca tgccgccccc 661 taaggagccg cccagaccta gggctggact caggagctgg gggggcctca cctgttcccc 721 tgaggacccc gccggacccg gaggctcaca gggaacaaga cacggctggg ttggatatgc 781 ctttgccggg gttctggggc agggcgctcc ctggccgcag cagatgccct cccaggagtg 841 ggaggggctg gagaggggga ggccttcggg aagaggcttc ctgggccccc tggtcttcgg 901 ccgggtcccc agcccccgct cctgccccac cccacctcct ccgggcttcc tcccggaaac 961 tcagcgcctg ctgcacttgc ctgccctgcc ttgcttggca cccgctccgg cgaccctccc 1021 cgctcccctg tcatttcatc gcggactgtg cggcctgggg gtggggggcg ggactctcac 1081 ggtgacatgt ttacagctgg gtgtgactca gtaaagtgga tttttttttc ttttctgctt 1141 ttcttctttt gcgggggagg tctaacaacc agcgggggct gcggggttgt cctcggggtg 1201 ggggactgga cgctgtcgac agcaccttcc tggggccccg gctcccgttt ggtggttggt 1261 cccagggcct gcccggttcc tgacctctgc ccgcggccgc gctcgtcggg gccgggcggg 1321 ggccgatccc tccggcttcc cttcccgcgg agaacaacaa tgaaagtgaa agaggggtgg 1381 ggcgggggcg agcccgggtt ctgtggccca tttgccctgt ggccttgagc aagcccctcc 1441 cccaggcctc gggggctctc ccggtttggg ggaaccgggc gaggcaatgc cacaggccca 1501 gggttagagg gggtgggcac ttgcagctgc cgatgtggct ggatctggaa cttctcagac 1561 ggctcctgtc agcgccaagt ttcaccaaat ccaggcctgc gggctcctcc cccaggaccc 1621 ccactcgcag tccctcaagc ctgtgctccc ggaaaggcac tgggcgaccg cacccgtggc 1681 tttctctggg cgaccgggtc ccagactccc cccagcacag cagagcgctt ccctgcccac 1741 ccgcggaaac cgccccaggt ggccgcgccc cctccccagc agccagcagg gcgccagggc 1801 tgagccggcc gtggagggga gcgggtcccg ggggttatac aggcgccggg cgtccgcggc 1861 aggcaagaga agctgaggcc tgagaacggc ccgggccttg gcgtacggca ggggacgacc 1921 tgggatgggg gcagcgggcg gcggcgcagg gagtgggccg gggccggtgt gcgcgggcgg 1981 gacggggccg gggtcgggag accaccgctc ggaagatggg gccgggagag gccgccgtcg 2041 cagcgcagag ggcaccggcg gggagacgcg aggacgcggg gccgggaaca cggacgccgg 2101 agtagaagcg cggggggggc gggctggagc gggggcgggg acgccggggt cgggggcggt 2161 gcgggtttga ggggaggggg cggggcgggt ccttccctgg gggggtgggg agagggggcg 2221 ggggcccatg tgaccggctc agaccggttc tggagacaaa aggggccgcg gcggccggag 2281 cgggacgggc ccggcgcggg agggagcgaa gcagcgcggg cagcgagcga gtgagcgcgc 2341 ggcggcccct ggtccgcccg ccgcggccga tctaggggct gggggctgga ggcgggggtg 2401 ggggtctgag ctgcgtcctg ggctcgaggc gtcccccggg gagtcgcctc ttagcggtgc 2461 gtccgggcta gcggcgaggg gccgccccaa gtcttcccac cgccgccacc ttagcagccc 2521 gacttggggc ctggaaagtg gagcactctg aggtgggagg gccctgcacg cggccccggt 2581 ggggaagggg acgggccagg gattcagact cgggctctcc cctcaggatg cagcaccgag 2641 gcttcctcct cctcaccctc ctcgccctgc tggcgctcac ctccgcggtc gccaaaaaga 2701 aaggttatgg gggatgatcg aaggagggct ggggacgggc aggcgaggcc cctccacttc 2761 tgggctgggc cgcctgggtt cctagcctgg aaccccagga aggcgggtcc cgagggagtc 2821 tccccgtgcc ccagtcctga actctgttcc tcgcgcgttg tagataaggt gaagaagggc 2881 ggcccgggga gcgagtgcgc tgagtgggcc tgggggccct gcacccccag cagcaaggat 2941 tgcggcgtgg gtttccgcga gggcacctgc ggggcccaga cccagcgcat ccggtgcagg 3001 gtgccctgca actggaagaa ggagtttgga ggtgaggcgg ggggcagtca gagggcagga 3061 gacgggggca cagcctcgcc gaagcctggg gggacccttg gcggagggcg gggccgcggg 3121 cgcgcagcct gacctgggcc gctctctcgc cagccgactg caagtacaag tttgagaact 3181 ggggtgcgtg tgatgggggc acaggcacca aagtccgcca aggcaccctg aagaaggcgc 3241 gctacaatgc tcagtgccag gagaccatcc gcgtcaccaa gccctgcacc cccaagacca 3301 aagcaaaggc caaaggtcag cgaaaggaga agggggtggg gctgtcgcgg ggggctgccc 3361 cccccccccc cgcctgtgag gggacaattc caagttaaac cttaagtttt gagtcctggc 3421 cagtggcttc ctgacatcgc ctcacttggc ttccctgcct ggaaaagtct gaagatgggc 3481 actacaagag aggccgcagg tgatgctggg gacataaatc ctccctggcc caaataggga 3541 ccaactcaaa ctactccatt ggagcatctg gcttaggacc cagggagagt gtcctggaac 3601 ggcttgcctt tgtcagctct ccagccacgg gcagcatttg gtcagctctg cctttctagt 3661 gttgggagga ggaggtcaag ccaccctggg cctctcagct cactcgtgac tcagcccagc 3721 gaggccagca gggcaggggt gaatctgccc gcttctcagg tgaggaggct gaggatgccc 3781 agggctgctg tgaccaggac taggactgga aacttgaagg ttttctgatc ccaagtggaa 3841 ataggaagct ggggatgtcc catgtccaca tcacaatggc tgccccatcc cctgcttccg 3901 agtcagctga ttggaaacac atagggggca gaatcttctc cttccctgat gcccgggtgt 3961 ttgtggagcc ggcggtctgc aatgggtcag cctaactgct gatatggtat taatattctt 4021 cttgttttac agccaagaaa gggaagggaa aggactagac gccaagcctg gatgccaagg 4081 agcccctggt gtcacatggg gcctggccca cgccctccct ctcccaggcc cgagatgtga 4141 cccaccagtg ccttctgtct gctcgttagc tttaatcaat catgcccctg ccttgtccct 4201 ctcactcccc agccccaccc ctaagtgccc aaagtgggga gggacaaggg attctgggaa 4261 gcttgagcct cccccaaagc aatgtgagtc ccagagcccg ctttgttctt ccccacaatt 4321 ccattactaa gaaacacatc aaataaactg actttttccc cccaataaaa gctcttcttt 4381 tttaatatat aaaagcccct tcccaaggag ttttgctgtg gaaatgtgtt tgggagtggg 4441 aaggtgggga gaaagaccag gctgtaggga ctggtgggtt tcagggggct tggtggtggg 4501 tgctctccag agctcatgga aaaagcagaa caattacaaa catttcttcc agggcccctg 4561 aaagggtgct ccccatcaag tcacctaagc ctttcggtcc tcatctccct cagggaccag 4621 ggctggggaa gggaacgt // LOCUS HSMYF4G 2804 bp DNA PRI 27-SEP-1996 DEFINITION Human myf4 gene for skeletal muscle-specific transcription factor. ACCESSION X62155 NID g34833 KEYWORDS muscle specific protein; Myf-4 gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2804) AUTHORS Arnold,H.H. TITLE Direct Submission JOURNAL Submitted (16-SEP-1991) H.H. Arnold, Dep of Toxicology, Medical School, University of Hamburg, Griudelallee 117, D-2000 Hamburg 13, FRG REFERENCE 2 (bases 1 to 2804) AUTHORS Salminen,A., Braun,T., Buchberger,A., Jurs,S., Winter,B. and Arnold,H.H. TITLE Transcription of the muscle regulatory gene Myf4 is regulated by serum components, peptide growth factors and signaling pathways involving G proteins JOURNAL J. Cell Biol. 115 (4), 905-917 (1991) MEDLINE 92064650 REFERENCE 3 (bases 1 to 2804) AUTHORS Braun,T., Bober,E., Buschhausen-Denker,G., Kohtz,S., Grzeschik,K.H. and Arnold,H.H. TITLE Differential expression of myogenic determination genes in muscle cells: possible autoactivation by the Myf gene products JOURNAL EMBO J. 8 (12), 3617-3625 (1989) MEDLINE 90059960 REMARK Erratum:[EMBO J 1989 Dec;8(13):4358] REFERENCE 4 (bases 1 to 2804) AUTHORS Arnold,H.H. TITLE Corrigendum for EMBO J. 8, 3617-3625 (1989) JOURNAL EMBO J. 8, 4358-4358 (1989) FEATURES Location/Qualifiers source 1..2804 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda EMBL3" exon 1179..1649 /gene="myf4" /number=1 CDS join(1179..1649,1782..1862,1989..2111) /gene="myf4" /function="skeletal muscle-specific transcription factor" /codon_start=1 /product="Myf4 protein" /db_xref="PID:g34834" /db_xref="SWISS-PROT:P15173" /translation="MELYETSPYFYQEPRFYDGENYLPVHLQGFEPPGYERTELTLSP EAPGPLEDKGLGTPEHCPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEAFE ALKRSTLLNPNQRLPKVEILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPQPGVPS ECSSHSASCSPEWGSALEFSANPGDHLLTADPTDAHNLHSLTSIVDSITVEDVSVAFP DETMP" gene 1179..2111 /gene="myf4" mRNA join(1179..1649,1782..1862,1989..2107) /gene="myf4" intron 1650..1781 /gene="myf4" /number=1 exon 1782..1862 /gene="myf4" /number=2 intron 1863..1988 /gene="myf4" /number=2 exon 1989..2111 /gene="myf4" /number=3 BASE COUNT 629 a 809 c 779 g 586 t 1 others ORIGIN 1 ccatggcctg agaggcctga agattaaata gtccctgctc tacctgacat ttgagacctg 61 catgatacgt gtaaaattga tggctcagac agtaagaggc agaacaacca agccccggtt 121 ggccaaggag agaggagatt taaactgagt ttgaagggct ctgttttttc tactttttca 181 cccccgtcca cattaatgag ccaggaaaat tctggtgttg gaaaaagagt tgaggaggta 241 acatagaagg acagatcaat ggtccggagt gagacaggga ggagacaggg agcttaggac 301 atcttgacct tcccgctctg gggaaggtgg caacgtgtag gctgtaaatt ctaatctctg 361 ctctgaccac ggctgctgta ttaaagaggg aaatttctct ttacccctgc caagaaaacc 421 aaaacccaaa taattttctc ttgcctcaat ttaccccccc agatgagacc taagagaaca 481 tacagctcaa agctgcttgg gaaccccaaa agtggaatct gttagctgct ctgagtcttg 541 atccttctta tccctggata ctgagtgccc atgaatgccc agaatctgaa gcagtcccat 601 tctttcccag gaagtcttta aaagagtctc atcgactgat gtagtgtggt taaggtgctg 661 tcaggaagca aggagatgga taagttggct cttaaggccc cttccagcct aagcctaccc 721 ttccttgttc cctcctcccc ctatacccac cagcttcttc gggactcgaa ggagacagca 781 ggcaggccgc ccagctagga gtaattgaaa ggagcagatg agaggggaat gtgtcctccc 841 ccaacgcccc tgccccacag gggctgctga gaaatgaaaa ctaatcaaat tacacccgac 901 ggcctcccga cccgtgcaca ggagcccgcc tgggccaggg gcaggctggc agggtggggt 961 gggggccatg cgggagaaag aaggggaatc acatctaatc cactgtaaac gccttgatgt 1021 gcagcaacag cttagagggg gctcaggttt ctgtggcgtt ggctatattt atctctggtt 1081 ccatgccagc ggggagggtt taaatggcac ccagcagttg gcgtgagggg ctgctggagc 1141 ttgggggctg gtggcaggaa caagcccttt ccgaccccat ggagctgtat gagacatccc 1201 cctacttcta ccaggaaccc cgcttctatg atggggaaaa ctacctgcct gtccacctcc 1261 agggcttcga accaccaggc tacgagcgga cggagctcac cctgagcccc gaggccccag 1321 ggccccttga ggacaagggg ctggggaccc ccgagcactg tccaggccag tgcctgccgt 1381 gggcgtgtaa ggtgtgtaag aggaagtcgg tgtccgtgga ccggcggcgg gcggccacac 1441 tgagggagaa gcgcaggctc aagaaggtga atgaggcctt cgaggccctg aagagaagca 1501 ccctgctcaa ccccaaccag cggctgccca aggtggagat cctgcgcagt gccatccagt 1561 acatcgagcg cctccaggcc ctgctcagct ccctcaacca ggaggagcgt gacctccgct 1621 accggggcgg gggcgggccc cagccagggg taagtggcca tcccatcccc ctgccccaag 1681 gggtaagtgg ccatcccatc cccctgcccc aagggggact aggcagagag agacatcaaa 1741 gaatngctgt gccccagcct tgccttcgcc ctgtcttgca ggtgcccagc gaatgcagct 1801 ctcacagcgc ctcctgcagt ccagagtggg gcagtgcact ggagttcagc gccaacccag 1861 gggtaagtga ggcctgactg gtaaccttgc gtccaatccc tcagtccctc cctgggccag 1921 gttctcccct cttgcctcaa gaacccccac tgctcaccca ggtaccacct tcccacctcc 1981 ccttacagga tcatctgctc acggctgacc ctacagatgc ccacaacctg cactccctca 2041 cctccatcgt ggacagcatc acagtggaag atgtgtctgt ggccttccca gatgaaacca 2101 tgcccaactg agattgtctt ccaagccggg catccttgcg agccccccaa gctggccaca 2161 gatgccacta cttctgtagc aggggcctcc taagccaggc tgccctgatg ctaggaagcc 2221 agctctgggg tgccataggc cagactatcc ccttcctcat ccatgtaagg ttaacccacc 2281 ccccagcaag ggactggacg ccctcattca gctgcctcct tagaggagag ggcatccctt 2341 tccagggagg taaagcaggg gaccagagcg ccccctcgtg tatgccccag ctcagggggc 2401 aaactcagga gcttcctttt tatcataacg cggcctctaa ttccaccccc caagtgaaac 2461 ggtttgagag acgccgtgcc ctgacctgga caagctgtgc acgtctcctg ttctggtctc 2521 ttcccgatgc agtggctggc tggcctgccc tgaattgaga gagaagaagg gggagaggaa 2581 cagccctctg ttcccaagtc ctggggggcc aaacttttgc agtgaatatt gggaaccttc 2641 cagtggtttt atgttttgtt ttgtttcgtg tgttgtttgt aaagctgcca tccgaccaag 2701 gtctcctgtg ctgaagttgc cggggacagg cagggaaaag gggttggggc ctcttggggg 2761 tgatttcttt tgttaacaaa gcatcgtgtg gttttgccgg aatt // LOCUS HUMHISAB 1314 bp DNA PRI 07-MAR-1995 DEFINITION Human histone H1 (H1F3) gene, complete cds. ACCESSION M60747 NID g184071 KEYWORDS histone H1. SOURCE Human blood DNA, clone C5. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1314) AUTHORS Albig,W., Kardalinou,E., Drabent,B., Zimmer,A. and Doenecke,D. TITLE Isolation and characterization of two human H1 histone genes within clusters of core histone genes JOURNAL Genomics 10 (4), 940-948 (1991) MEDLINE 92009931 FEATURES Location/Qualifiers source 1..1314 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="C5" /tissue_type="blood" /map="6p21" gene 445..1110 /gene="H1F3" CDS 445..1110 /gene="H1F3" /note="putative" /codon_start=1 /db_xref="GDB:G00-120-029" /product="histone H1" /db_xref="PID:g184072" /translation="MSETAPLAPTIPAPAEKTPVKKKAKKAGATAGKRKASGPPVSEL ITKAVAASKERSGVSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGA SGSFKLNKKAASGEGKPKAKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVK KPATAAGTKKVAKSAKKVKTPQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAP KKK" BASE COUNT 395 a 312 c 351 g 256 t ORIGIN 1 gaattcacag aacaatgccc tggaagagag atttgcaaat gaaagaggga gaaatagtat 61 ttttaaggaa aacatgaaag ttgtcttcga tttgaccgaa gtttgaaaga gagttggggg 121 aataagaaaa gtttcaagat gccggtttta aaactatgca tagaaacaaa ggcaggaatg 181 aagcccgaac tctctcggat cagtttcccc caagtctcta attatttcat gcccagattt 241 cttatatttt tactcttttt taaggggcaa caaacacagc agcgcggtag atacgaggag 301 tccttttcca gcagcgccgc gcatggagca aggaaccaat catcactcag cgtctctcta 361 tataaaccct cagcctccct cgtagaggac atgctgttct gacagtttga gattacttat 421 tgtcttttct gggaagacaa aaacatgtcg gagactgctc cacttgctcc taccattcct 481 gcacccgcag aaaaaacacc tgtgaagaaa aaggcgaaga aggcaggcgc aactgctggg 541 aaacgcaaag catccggacc cccagtatct gagcttatca ccaaggcagt ggcagcttct 601 aaggagcgca gcggcgtttc tctggccgcg cttaagaaag cgcttgcggc tgctggctac 661 gatgtagaaa aaaacaacag ccgtatcaag cttggcctca agagcttggt gagcaaaggt 721 actctggtgc agaccaaagg taccggtgct tctggctcct tcaaactcaa caagaaagcg 781 gcttccgggg aaggcaaacc caaggccaaa aaggctggcg cagccaagcc taggaagcct 841 gctggggcag ccaagaagcc caagaaggtg gctggcgccg ctaccccgaa gaaaagcatc 901 aaaaagactc ctaagaaggt aaagaagcca gcaaccgctg ctgggaccaa gaaagtggcc 961 aagagtgcga aaaaggtgaa aacacctcag ccaaaaaaag ctgccaagag tccagctaag 1021 gccaaagccc ctaagcccaa ggcggccaag cctaagtcgg ggaagccgaa ggttacaaag 1081 gcaaagaagg cagctccgaa gaaaaagtga aactggcggg acgttcccct ttgaaaattt 1141 taaacggctc ttttcagagc cacccacagg tctcagtcaa aagagctgaa gctttttgga 1201 ggggggagtg gggtggagag gggtgctgcg gtgttgtgcg gccacggtct atcctagtcg 1261 tgctggttgg gggtcagtta ttaacagctc ccagcagcct gggcgcaagg atcc // LOCUS HUMOTNPI 1338 bp DNA PRI 03-MAY-1996 DEFINITION Human prepro-oxytocin-neurophysin I (OXT) gene, complete cds. ACCESSION M11186 NID g189414 KEYWORDS neurophysin; oxytocin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1338) AUTHORS Sausville,E., Carney,D. and Battey,J. TITLE The human vasopressin gene is linked to the oxytocin gene and is selectively expressed in a cultured lung cancer cell line JOURNAL J. Biol. Chem. 260 (18), 10236-10241 (1985) MEDLINE 85261445 COMMENT A draft entry and printed copy of the sequence in [1] were kindly provided by J.Battey 03-FEB-1986. FEATURES Location/Qualifiers source 1..1338 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hAVP4." /tissue_type="placenta" /map="20pter-p12.2" gene 382..1278 /gene="OXT" exon 382..537 /gene="OXT" /number=1 CDS join(418..537,838..1036,1121..1176) /gene="OXT" /note="precursor" /codon_start=1 /product="oxytocin-neurophysin I" /db_xref="PID:g386991" /translation="MAGPSLACCLLGLLALTSACYIQNCPLGGKRAAPDLDVRKCLPC GPGGKGRCFGPNICCAEELGCFVGTAEALRCQEENYLPSPCQSGQKACGSGGRCALGL CCSPDGCHADPACDAEATFSQR" sig_peptide 418..474 /gene="OXT" mat_peptide 475..501 /gene="OXT" /product="oxytocin" mat_peptide join(511..537,838..1036,1121..1170) /gene="OXT" /product="neurophysin I" intron 538..837 /gene="OXT" /number=1 exon 838..1036 /gene="OXT" /number=2 intron 1037..1120 /gene="OXT" /number=2 exon 1121..1278 /gene="OXT" /number=3 BASE COUNT 223 a 531 c 391 g 193 t ORIGIN 1 bp upstream of Sau3A site. 1 ggatcctgcc agagcctcct cccacctgga ggggtcccag cgtccacctt ccctgcccca 61 gcccccctcc tcgaggtact gggaggctgg ataaagtctt cggctgggcc acaccccacc 121 ccaaattctc cctgtcccac cctagtgccc aggccacccc ggcctgctcc cttccgcaag 181 gcacctcacc ttctgtgccc agaccattag ccaacgcggt gaccttgacc ccggcccagg 241 ccctgctaat gaagaggaaa gcccgtacgc actcggcctg acccacggcg accctctgtg 301 accaatcata ctaccaacct cttaaacaga gctccaccga cgcaatgccc aggcataaaa 361 aggccaggcc gagagaccgc caccagtcac ggaccctgga cccagcgcac ccgcaccatg 421 gccggcccca gcctcgcttg ctgtctgctc ggcctcctgg cgctgacctc cgcctgctac 481 atccagaact gccccctggg aggcaagagg gccgcgccgg acctcgacgt gcgcaaggtg 541 agtccccagc cctggtcccg cggcgctccg gggagggagg gacccgcagc cacaggggcg 601 cgccccgctc cggcctcgcc tgagaactcc aggagctgag cggattttga cgccccgccc 661 ttgaccgcgg tcgaggcccc cacggcgccc cagcgtctca gccccgctgt ccccgcccga 721 actccgaacc ccggacccca gcatccttgc ccggcgcacc ccggccggcc tcgcagggtc 781 ctccgagcga gtccccagcg ccgccccgcg tcccgctcac cccgcccgtc ccccgagtgc 841 ctcccctgcg gccccggggg caaaggccgc tgcttcgggc ccaatatctg ctgcgcggaa 901 gagctgggct gcttcgtggg caccgccgaa gcgctgcgct gccaggagga gaactacctg 961 ccgtcgccct gccagtccgg ccagaaggcg tgcgggagcg ggggccgctg cgccttgggc 1021 ctctgctgca gcccgggtga gcggggcaag gcgctccggg gccaggggga ggcgggcggg 1081 ggtgcggccg ggattcccct gactccacct cttcctccag acggctgcca cgccgaccct 1141 gcctgcgacg cggaagccac cttctcccag cgctgaaact tgatggctcc gaacaccctc 1201 gaagcgcgcc actcgcttcc cccatagcca ccccagaaat ggtgaaaata aaataaagca 1261 ggtttttctc ctctaccttg actcgtgtct aagtgccaga aatgggacgg ggagggggca 1321 ttgtgggact ggaagatc // LOCUS HUMTKRA 13500 bp DNA PRI 14-JAN-1995 DEFINITION Human thymidine kinase gene, complete cds, with clustered Alu repeats in the introns. ACCESSION M15205 M15206 NID g339718 KEYWORDS Alu repeat; repeat region; thymidine kinase. SOURCE Human DNA (library of Y.-F.Lau), clone lambda-tk46 [1]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 13500) AUTHORS Flemington,E., Bradshaw,H.D. Jr., Traina-Dorge,V., Slagel,V. and Deininger,P.L. TITLE Sequence, structure and promoter characterization of the human thymidine kinase gene JOURNAL Gene 52 (2-3), 267-277 (1987) MEDLINE 87277399 REFERENCE 2 (sites) AUTHORS Slagel,V., Flemington,E., Traina-Dorge,V., Bradshaw,H. and Deininger,P. TITLE Clustering and subfamily relationships of the Alu family in the human genome JOURNAL Mol. Biol. Evol. 4 (1), 19-29 (1987) MEDLINE 88188974 REFERENCE 3 (sites) AUTHORS Barik,S. and Banerjee,A.K. TITLE Cloning and expression of the vesicular stomatitis virus phosphoprotein gene in Escherichia coli: analysis of phosphorylation status versus transcriptional activity JOURNAL J. Virol. 65 (4), 1719-1726 (1991) MEDLINE 91162717 REFERENCE 4 (sites) AUTHORS Kim,Y.K. and Lee,A.S. TITLE Identification of a protein-binding site in the promoter of the human thymidine kinase gene required for the G1-S-regulated transcription JOURNAL J. Biol. Chem. 267 (4), 2723-2727 (1992) MEDLINE 92129365 COMMENT [2] sites; Alu repeats only. [1] exons, intron/exon boundaries, 5', 3' flanks. Draft entry and computer-readable sequence for [2],[1] kindly provided by P.L.Deininger, 07-APR-1987. FEATURES Location/Qualifiers source 1..13500 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q23.2-q25.3" prim_transcript 451..13417 /note="TK mRNA (alt.)" prim_transcript 458..13417 /note="TK mRNA (alt.)" exon <520..585 /gene="TK1" /note="thymidine kinase; G00-120-439" /number=1 gene 520..585 /gene="TK1" CDS join(520..585,696..727,2360..2470,4845..4938,11901..11990, 12348..12467,12567..12758) /note="thymidine kinase" /codon_start=1 /db_xref="PID:g339719" /translation="MSCINLPTVLPGSPSKTRGQIQVILGPMFSGKSTELMRRVRRFQ IAQYKCLVIKYAKDTRYSSSFCTHDRNTMEALPACLLRDVAQEALGVAVIGIDEGQFF PDIMEFCEAMANAGKTVIVAALDGTFQRKPFGAILNLVPLAESVVKLTAVCMECFREA AYTKRLGTEKEVEVIGGADKYHSVCRLCYFKKASGQPAGPDNKENCPVPGKPGEAVAA RKLFAPQQILQCSPAN" intron 586..695 /note="TK cds intron A" exon 696..727 /number=2 intron 728..2359 /note="TK cds intron B" repeat_region complement(1268..1546) /note="Alu-tkA repeat" repeat_region 1697..1994 /note="Alu-tkB repeat" exon 2360..2470 /number=3 intron 2471..4844 /note="TK cds intron C" repeat_region complement(3813..4109) /note="Alu-tkC repeat" repeat_region 4393..4661 /note="Alu-tkD repeat" exon 4845..4938 /number=4 intron 4939..11900 /note="TK cds intron D" repeat_region complement(5141..5434) /note="Alu-tkE repeat" repeat_region 5471..5765 /note="Alu-tkF repeat" repeat_region complement(5962..6259) /note="Alu-tkG repeat" repeat_region complement(6263..6552) /note="Alu-tkH repeat" repeat_region complement(7434..7731) /note="Alu-tkI repeat" repeat_region 8658..8879 /note="Alu-tkJ repeat" repeat_region 9350..9646 /note="Alu-tkK repeat" repeat_region 9646..9939 /note="Alu-tkL repeat" repeat_region 10327..10397 /note="Alu-tkM repeat" exon 11901..11990 /number=5 intron 11991..12347 /note="TK cds intron E" exon 12348..12467 /number=6 intron 12468..12566 /note="TK cds intron F" exon 12567..>12758 /note="thymidine kinase" /number=7 BASE COUNT 2842 a 3614 c 3578 g 3463 t 3 others ORIGIN Chromosome 17q23.2-q25.3; 1 bp upstream of HindIII site. 1 aagcttcctt cttggaattc caaactaata aatgagctaa ctccgcccca gccccttagt 61 ccctccctgc aatccaccta cctctgcaga catcttcttc caaggaacct tgcttgggaa 121 acccacacca gacacatcca tcatggcgtc tacagccgca tgggcgtgcg tccctctgtt 181 tatatggcca gagccccgcc tcgctccgcc cctttaaact tggtgggcgg accgaggcgg 241 ggctcagacc aggccccacc ccgatcagcc acgtccatcg ccctgatttc caggccctcc 301 cagtccctgg gcgcacgtcc cggattcctc ccacgagggg gcgggctgcg gccaaatctc 361 ccgccaggtc agcggccggg cgctgattgg ccccatggcg gcggggccgg ctcgtgattg 421 gccagcacgc cgtggtttaa agcggtcggc gcgggaccag gggcttactg cgggacggcc 481 ttggagagta ctcgggttcg tgaacttccc ggaggcgcaa tgagctgcat taacctgccc 541 actgtgctgc ccggctcccc cagcaagacc cgggggcaga tccaggtgcg ggggccagcc 601 ctgcgcgtgg ctggggatga ggtggtcgtg gtgatagcct gtgtccaggc atccgcgcag 661 ggcgggccct caaatgacct caccttctct cctaggtgat tctcgggccg atgttctcag 721 gaaaaaggta atggcttcgc ggggctgggg tggagctcct tcctcttctc cggggacccc 781 ttgtccctcc cctcccctcc cctcccctcc cctcccctcc cctccccttc cctccccttc 841 ccttccctcc ccttcccttc ccctagaagg accagcacag cctcctacag ctcccgcccg 901 gggtgctcct cccttgaatt cagtccagga ggaagtctct gccctcttct gcccaggcca 961 agcccctcgt cctgtgtgga cgccactccc tcctggagct ggtgacagct gcttacagct 1021 tagctgtctt ccccaccaag tcctctgaga aggtggcaac cagttgtgtc ccctgtaggc 1081 caggcctttt tgtacacccc tattcaatgt ggctgtttcc ttctaaggcc aaggaaacgt 1141 agtcgctttc taaaccaagg agtctgaagc cgtggagcct ctgctctcct gaggtgatag 1201 aaccattccc tgacccgggt ggggctagtg agtttcttga gtaaactacc cacgcaccat 1261 tctttttgtt ttgtttttgt tcttctagag gtaggatctt gctatgttgc ccaggctggt 1321 ctcaaactcc tgggctcaag caattctctc acctcagcct cccaagtagc tgggactaca 1381 ggcgtgcacc ccccccgcct ccacccagct aattttattt tatttttata gagctggggt 1441 cttgctatgt tgcccaagct ggtcttgaac tcctggtctc aagcaatcct cctacttcag 1501 catcccaaag tgctgggatt acagatgtta gccaccatgc cctgccccaa cattctttta 1561 tggccctggg gatcacttca gctcaaaccc cttgctcagg aagatgtggc tcagagttgg 1621 acttcttgga cccagaagca agtgcttttg acgctgcaca caaagacttt ctgaaattaa 1681 tttagaaaag ctgtatgcca ggtgtggtgg cccacgcctt taatcccagc gctttggaag 1741 gctgaggtgc gttgatcact tgaggttagg agtttgagac caccctggtc aacgtggtga 1801 aaccccatct ctactgaaaa aaaaaaccaa aaattatctg ggcatggtgg cagcctcctg 1861 taatcccagc tactcgggag gttgaggcag gagaatctct tgaacccgga aggcaggggt 1921 tgcagtgagc tgagatcgct ccactgcact ctaacctagg caacagagcg agactccacc 1981 ccaaaaagaa agaaagaaaa actctgaact ctgggaacaa ctctgggatg aggttacttt 2041 ggaatgcagt cgcaggttcc ctctacatgt agcctttgct tctgccttcc ccactacatc 2101 ttggagaagg ttactcctcc cacacttcct gggaccacct gagtaccatt cctggacctc 2161 ttccccatag agaattctga cttccaaccc tctttgtagg gatattatac cctgcctgct 2221 ctgccctgct cttttctggc tgtggtgggc tcagtctgca taccactagg gacaatgagg 2281 agccaggctt gttggggagg ggtctccttc tcccactcct cccgccgtgg acctcacctg 2341 accctctctc ctcttgcagc acagagttga tgagacgcgt ccgtcgcttc cagattgctc 2401 agtacaagtg cctggtgatc aagtatgcca aagacactcg ctacagcagc agcttctgca 2461 cacatgaccg gtcagtccct gccccctgca gtcctgtcca gtggaaaatc acaaggcaca 2521 ggacacactg ttaggactct ctttaatggg gatggttaat catttgaaca ttgaatgatt 2581 caaatcagca cactttccaa ggtgcttggc aaggtagcgc acactctcca ctccctgggc 2641 tggagccagt ggttctccac tgagggtgat tttgccgcca gggtccattt gacaatgttt 2701 gaagacattt ctagttgttg caactggagg ggggagggga tgcttttggg ctttaatgtg 2761 tagaaatcag ggacactgct gctaagggtc ctatggtgca gaggacggcc cccatgcaag 2821 aacgagctgg ccccaaatgt caggagcctg ccagtgttca gaaactctgc cgtagggttt 2881 cagcttcaca caggctgcag actggtttgg tttggcctgc acgttgattt ttgtttaatt 2941 ttttagttgt ccgttgttgg ctggctcccc cgtcacctgg cagccttcac gcttccctgt 3001 tttatgtgta gctgtttgag ctcgctggac atttccgcct gcaacctcag tttgggagtt 3061 aaattcactt ccttggcagc agatgtgggc ccgatgtttc tgagcctgag acgctttgct 3121 tggtcctctg gacttgtcca cctgggcacc cagtggcaaa gccatgctgt gccacacatt 3181 atagggcttc agcctcagag ccctggctgg gagctgtatc cgagagttgc tatggctgtg 3241 cagagaacag atccacccgg cgtgtggcct tcggtgggag ctgaggggct cctgaagcca 3301 gatgctggtg gagtggaggg tgcttggggc ttggagttgc atgtgggaat ttaaccgcac 3361 cttcgtgacc atgctgtctg atgtaggtca tttacttttc caaatttgct tcctcattcc 3421 taagatgcga tgtccacggc acagggtggt gttacacctg gtggggacag ggaaagcaga 3481 ggaggtcact tcgttccagc tgttggaagt acaacttctg gagtcagtca gatccgggat 3541 taaatatgag ttctgcccgt gtgtcacaag tcatctctaa cacgggccac agaggccaag 3601 gctgggccag cagcattgat ggctcgagag gctgcccttg caggggccac agctggcctc 3661 ccacctgccc tcactttgtc tttctctgtt tagggaggga agagggaatt taaaatgccc 3721 aaaatactgt ttcacacatt ctttccagaa ctcgaagtag gattatagca aggtaataac 3781 gaaacaatag ttgtaaagta tgtttttttg tttgtttgtt gtttgttttt gggacagggt 3841 ctctctctgt cacccaggct ggagtgcagt ggctcaatca tagcttactg ttacgtgacc 3901 ccaaaccctt gggctcaagt gatcgtccca cctcagcccc ctgagcaggt gggactacag 3961 gcgcacacca ccacacccag ttaattttta catttttttc acacagtgtc tcgctgtgtt 4021 acccaggctg gtctcgaact cctgagttca agtgatcctc ccgtcttggc ctccccaaag 4081 attacgggca tgagctgctg tgtctggcca gaatacagga ttttaaaaat ttatgttttg 4141 caacataatt aatataaaga caaatataac ccaggcccag ttctagttat tcattcttct 4201 gaattttaaa aggaaacatt tggctggccc ctaatggtat catgggccct ggtacctgat 4261 gaagttggcc tagtctgccc ccagctcctg aacagtggaa gagtttttag tctcattgag 4321 ctttgtactg gacattacta atttctaatc caaagcatca agtgaagtgg cttgtataaa 4381 taactggttt tcctctggga ggctaaggcg ggtggatcac ttaaaagtta ggagtctgag 4441 accagcctgg ccaacatggt gaaaccccat gtctgctaaa aatacaaaaa ttagctgggt 4501 gtgatggtgt gtggccagta gtcccagcta ctcttgtggc tgaggtggga gaatcgcttg 4561 agacccttga gaattgggag gtagagattg cagggagccg agatggcgcc actgcactcc 4621 agcctgggtg acagagcaag actctgtttc ataaaaaata aataaataac tggttttctg 4681 gacgagggcc tttcccatag gtgctaactt ctcaaagccc ggctgggtga acactgagcc 4741 tgctttgcag gtagcaggtg gtcacgacag tgccattccc tggcccctgc attgtggctt 4801 ctggcctccc tggccctgct cacgctctgg ctttctcttc ccaggaacac catggaggcg 4861 ctgcccgcct gcctgctccg agacgtggcc caggaggccc tgggcgtggc tgtcataggc 4921 atcgacgagg ggcagtttgt aagttggctt gtcttggcat cactcttcct gccttccgct 4981 gtgtcctccc gttttccctc gctgacttgg aagttatctg anncttttag taaaataaca 5041 aggttaaata gctacaacta gtgttggaat accctctgaa ggcccctttc tagtttccct 5101 gtcatagtgt catagtcttg taggattcgt tttacttttt tttttttttt ttttgagacg 5161 gagttttgct cttgttgccc aggccggagt acgatggcac aatctcaccg caaactttgc 5221 ttcctgggtt caagcaattc tctcctgtct cagcctcccg agtagctggg attacaggca 5281 tgcgccacca cgcccagcta attttatatt tttagtagag atggggtttc tccatgttgg 5341 tcaagctggt ctcaaactcc caacctcagg tgatccgccc cgccttgaac tcccaaagcg 5401 ctgggattac aggcatgagc taccacacct ggccattgta cctttttaaa aatacatata 5461 tctatttact ggcaagatgc agtgactcac acctgtaatc tcagcctgtg ggaggccaag 5521 gtggacagat cacttgagcc caggagttgg agactcacct gggcaacata gtaaaacccc 5581 atctctacca aaaaaaaaaa gaaattagcc agtcatagca gcgcacacct gtggtccctg 5641 ctactcagga ggctgaggca gaaggatgga gcctgggagg tcgaggctgc agtgagtggt 5701 gatagcacca ctgcactcca gcccgggcga caaggccaga ccctgtctca aaaaaaaaag 5761 ggggaggtgg ggagtaatgt ttggtttgcc tcatggttcc ttttgcttgt ttcttatacg 5821 tttattttct tgttgttgaa gtaccttttt tagtagtttt tgcagccagg aggtatagat 5881 gggaagctgc cagtctttgt atggaaatct ttcttttgtc atctagttta agctgggcag 5941 caagaggtag gttgatcttg tgtgggtttg ggtttttttt tttttttgag acggagtctt 6001 actctgtcgc ccaggctgga gtgcaatggt gtgatctcgg ctcactgcaa cctctgccac 6061 ccggattcaa gcgattttcc cacctcgcct cccaagtagg tgggattaca ggcacccacc 6121 atcatgcctg gctaattttt gtagagacaa gggttcacca tgttggctag gctggtcttg 6181 aactcctgac ctcaggtgat ccacccgcct tggcttccca aagtgttgga attacaggca 6241 tgagccgccg tgcccggcct tttttatttt tatttttttt gagatggagt cttgctctgt 6301 tgccctggct ggagtggagt gacgtgatct tagctcacag caacctccgc cttttgggtt 6361 caagcagttc tgcctcatcc ttccgggtag ctgggatcac aggtgcgtgc cacatgcgta 6421 mtcatttatg tatttttaat agagatgggg tttcaccatg ttggccagct ggtctggaac 6481 tcctgacctc aggtgatccg catgcctcag ctcccaaagt gctgggatta caggcgtgaa 6541 ccacgcctgg tcttgatctt gttgctttga aaagtagcag cgctggtcat tgtgtttttg 6601 ctcagaggaa ggccgccatc tctctaatgt tacctctggt caggtattct atctgttctc 6661 tctcagcaca atgtgtgtag gggaagcttt gtttcattta tcctgcttta tagctggtgt 6721 gccttttcat ttctggggaa ggaatgaagc cattatcact tcaggtattt ctctcctcat 6781 ccatctctga ggtgttctgg gttccatctt ccagagtgtg ttttgtttca gtgactattt 6841 ttacatctgc tgctctaatt catcatgctc cgttttgttt gacaagttac tgttgggtta 6901 tttttaaatt tatgctgttc cttccattat gttcctgaaa atcttttctt agacttttcc 6961 agatttttct atttcctcag gaacatattc tgtggttgag tttctgggtt attttctgtt 7021 atcttagttt tctttcctct gctttggaga ttttattttt gttagtttat cacaaagaat 7081 gaaactgaaa ctctctccaa ggggtttagc agacttgacc tcttaggtac ttttagggtt 7141 gcctcgaagt acacaatgtg gtggtttgat ataaacataa caggaattta tttctcgctc 7201 acagaccccc tacgtggttc caggccggtt gatggggagg ccgcccacga ggcggcttag 7261 gtcgccctgg ctggctgtat acagacacgg aggggaagag acgtggcgga gcccctgggt 7321 gtgaggtttt catgggcctg accagaagct gcaaacgtca cttctgctga tctttcaaag 7381 actagaacct gggcacaggg ccacctatac gtttagtata cttagtccag ttcgtttttt 7441 gtttgttttt aaaaacagtc ttgctctgtg gcccaggctg gagtgcagtg gcgcagtctc 7501 ggctcactat aacctccatg tcccaggttc aagtgattct cccgcctcag cctcctgagt 7561 agctgggatt acaggcttct gccaccatgc ccagctaacc ttttgtattt ttagtagaga 7621 cggggtttca tcatgttgac cgggctggtc tggaactcct aacctcaggt gatctgcctg 7681 cctcagcctc ccaaagtgct gggattacag cgtgagccac cacgcctggc cacacttagt 7741 ctagttctat accctggagg aagaataaat gagtttgttt ggtgagtgct tcaaggtctc 7801 tacccgccct gcctcccagc acagagccag gccgctctgg cctgaatacc ctgcccggac 7861 gtcacagggc ctgtcccctc aaaaggccag tcctgccttc ctggttctgt tcttgcccaa 7921 cattctgtat gagtcacagc tgcaaattcc attcccgtgg ggaggctgac gggtcccttc 7981 ccctgtgcgg ggcatctgcc ctgtggagtt gaggctgcca gtgtccgctc tgggttcccg 8041 accacccggc agctggcatc tcctccccgc ttgggtatgg ccattccgtt tctgaccttc 8101 agaggtgcgc ccctgagcac ccccatgcct ctgcgtacgt ggagacgtcg ttgttgctgc 8161 cccgtgcttg agggactcct ggcgagaaag tgagcccagg ctgggaatag ggctgcagct 8221 gttctctttt gctcccaaac tgtggcctca gaatgcatcc agggattttg catcagcttt 8281 ggggacatgg ccctctcaga acaaggaagc ttcagctttg gcaaggctct ccctccttca 8341 gacctgccgc tgtgagttgt tcaatagctc tgttctcctg gctctgcgta aaccttgttg 8401 acagaggctg acccagaccc ccgaggcaga aacctttccc ttctccttcc tcgacatcca 8461 aatgccctga gtcaggagcc agcgtatgaa gtcctgtccc ctgttcagcc tgtaggaggg 8521 atttctcggt ctacttcctc cctggccagc aagtaaaact tgagttcatt cagtgagtat 8581 ttattacacc ctacccagac atcagcattc tgccctggcc tctgtgtgcc cttgttctct 8641 tcaagaagtt ccgggtcacc agcctgacca acatggagaa actccgtctc tactaaaaat 8701 acaaaaatta gccgggcgtg gtggcgcact gcctgtaatc ccagctactt gggaggctga 8761 ggcaggagaa tcgcttgaac ccggtaggcg aaggttgcag tgagccaaga tcgccccatt 8821 gcactccaag cctgggcaac aacaagagca aaactcagtc tcaaaacaaa acaaaacaaa 8881 agaagttcag ggtcttccca ttgcaagcag ttctagatcg aggagagggg ttcctagcat 8941 gggacccagc agaaggactg tccttcgctc cttcattgtc tacgtggaca gtggatgaag 9001 ctcagccgaa cctgccttgt tcccgttttc tgggtcagca gggaaagcct ttcacagagt 9061 agccaccgtg ccatcctgag gaaggccctg ggtcagaagc ttctgtgctt ctttgtaccc 9121 cgggcaagac acacaggtgc tcacactgct ctgtagaaac tgttggcatc caagagagac 9181 tcacctggaa atctctggaa aacctgaagc tcctagctgg gggtgctgtg cttcagatgc 9241 tggtggtggg tgggcaccct tgcatcaaca gctgcacagt gtgtggtggg cttgcagggt 9301 cgcttggcaa tagtaggagc tctgatttat ttttttaaac tttttttctg gctgggcagg 9361 tggctcacac ctgtaatccc agcactttgg aaggcctagg cgggcggatc acttgaggtc 9421 aggagtttga gaccagccag gccaacatgg tgaaacccca tctctactaa aaatacaaaa 9481 attagccaag cgtggtggca cacacctgta attccagcta cttgggaggc agaggcacaa 9541 gaattgcttg aacctgggag gcagaggttg cagtgagcca agattatgcc actgcactcc 9601 agcctggatg acagagcgag actctgtctc aaaaaaaata gacaaagcca ggcgcagtgg 9661 ctcatgcctg taatcccaac actttgggag gccgaggtgg gtgaatcacg aggtcaggag 9721 atcgagacca tcctggctaa cacggtgaaa ccccgtctct actgaaaata caaaaaaatt 9781 agccaggcgt ggtggtgggc acctgtagtc tcagctactc gggaggctga ggcaggagag 9841 tggcgtgaac ccaggaggcg gagcttgcag tgagctgaga tcacgccact gcactccagc 9901 ctgggcgaca gagcgagact ccgtctcaaa aaaaaaaaaa aaatagacct ttttgtgttt 9961 tctgttctac tacacaagta atacaggttg agtattcctt aacctaaatg cctgggacca 10021 gaagtgtttc ggatttcagg ttttcgaata tttgcatgtt cataatataa tgagaccttg 10081 ggaatgagcc ccaagtgtaa acacaaaatc catttatgtt ttatagacat cttaggcaca 10141 tagcctgaga gtaattttat gtatttagta atttgggcgt gagccacagt ttttgactgt 10201 gacctgtccc atgaggtcag gtgtggaatt ttccacttgt ggtgggcgct caaaaagttt 10261 cagattttgg agcctttcag gttagagaca tgcaatctat aataagttta atctaggaaa 10321 agttagggtc tggcacagag gctcacgtct gtgatcccag cactttggga ggctgaggca 10381 ggcagatcac tggaagtgct ggacgggtgg ggaagtgccg ggtgcaagaa ccaagctctt 10441 tgactatgga cctcagcctg aggttggtca agaggtggag tgagtggggg ctgaggacct 10501 tcatcctgaa accctgatgc aggagagtct ggggtctgcc ttctaccctc atgtggcggg 10561 tgaaggagca aggttctcaa ctcaggaggg ttcttcccct ctccattccc acccagggga 10621 catctcacaa caactagaaa caattttgtc gcagctgggg ggtgggaggt gtgttcctgg 10681 catctatcta atgggtgggg gcgagggacg cagcccaaca ccctacagtg cacaggacac 10741 agcgagatcc ggcctcaaac tggcagccat ggcagcgtca gccctccagg gggcgcgccc 10801 tggcgcaggt ggtgtgccgg cccacagctc cttgcaggct gggagctgca ttttcgtgac 10861 atgtcatgag tcctcagaga aaaagaggga acgagtgcat ggtggggagg ggccctggcg 10921 tgctggagtc tctgggtttc cttctccaga gacccctgca gtcagctgag cgcaatcagt 10981 cacgttgggc tttgcttgga tctcactgga atttttcgag ccacccctta gtcctcacct 11041 tgctaagccc tcacgtctca ataacctcaa acctcagtac ctgggctgag aaagcctgag 11101 tggccctggg agagagaccc tgcacccaag gacaaggaca tccctgcttc acccaaccca 11161 aaggccagtc tggacatatg aactcaacca gctaagagtg atatgattga ttgatgagaa 11221 tcaccagagc acttgccaga gtttcagctt ctccctgggc caaagtgaag tttgctttac 11281 acagtaaatg tgctctgtgc aggtcctgaa tttagaaggc tgtgctgtgt catcctgctc 11341 tgtaaatggc cagtaggacc cccgcccctt ctcaaggcac attacccgtt taaaacgggg 11401 gaggcaagag cacaaagcgc ccacctattc accgaagagc atgtatataa cttagggcct 11461 tccatcctta aacaacagga ccttccttgc tcttacggaa aaggaaacag gttcagagac 11521 gttaattcat tgccaaggtc acacagataa tgggtccagc gaagagtggt gtccgagccc 11581 aaggcagcag gcctttggcc actgcagtgt taaacagcac agctggtgtg gaagtccggt 11641 gctgagtcct gggtacctgg actcggaggg aagctggctg cagggggaag gggctgcgca 11701 gttgtggatg tacctgtcgt ctgctggggg gcgtgcgggt ggacacagtc ccccggcctg 11761 gggagcctcg tgggagaatt aagagttact ccgggccaaa tggccggagt tgtcagatct 11821 ggcagcgtct tcgctggggc tccagggagc tgctgctggg gtggaagctc tcacactctt 11881 tctccacgtg ccctttccag ttccctgaca tcatggagtt ctgcgaggcc atggccaacg 11941 ccgggaagac cgtaattgtg gctgcactgg atgggacctt ccagaggaag gtaaggcgtc 12001 tgatccaggt ctggagctgg gattgaggag ggcaagaggc ttctggatgg gcacagagac 12061 accagctctg ggtgaccagg gctcagccac cacagggtta cggccgagct gctcaggctt 12121 ggctgagcca agggactcca tggtctgtgc agactgcgtg ccatctgttg tggcaggtgc 12181 tttgaattgg caaagggaca gagccgggca tggtgctctg ggggttgggg gaaggactaa 12241 ggtcagagca aactctcctg gcttcagtac ttgtgaatca gagggtttaa aagaaaaacc 12301 cacctggtaa ggtgctgagc gccctctgtc tttccatggg agcacagcca tttggggcca 12361 tcctgaacct ggtgccgctg gccgagagcg tggtgaagct gacggcggtg tgcatggagt 12421 gcttccggga agccgcctat accaagaggc tcggcacaga gaaggaggta gctccacctg 12481 ccttccctgc aggccggcgg ggtgggggta tggctctgcc tccttcctgt cctggccctt 12541 cacccatccc ctgtccctgc ggccaggtcg aggtgattgg gggagcagac aagtaccact 12601 ccgtgtgtcg gctctgctac ttcaagaagg cctcaggcca gcctgccggg ccggacaaca 12661 aagagaactg cccagtgcca ggaaagccag gggaagccgt ggctgccagg aagctctttg 12721 ccccacagca gattctgcaa tgcagccctg ccaactgagg gacctgcaag ggccgcccgc 12781 tcccttcctg ccactgccgc ctactggacg ctgccctgca tgctgcccag ccactccagg 12841 aggaagtcgg gaggcgtgga gggtgaccac accttggcct tctgggaact ctcctttgtg 12901 tggctgcccc acctgccgca tgctccctcc tctcctaccc actggtctgc ttaaagcttc 12961 cctctcagct gctgggacga tcgcccaggc tggagctggc cccgcttggt ggcctgggat 13021 ctggcacact ccctctcctt ggggtgaggg acagagcccc acgctgttga catcagcctg 13081 cttcttcccc tctgcggctt tcactgctga gtttctgttc tccctgggaa gcctgtgcca 13141 gcacctttga gccttggccc acactgaggc ttaggcctct ctgcctggga tgggctccca 13201 ccctcccctg aggatggcct ggattcacgc cctcttgttt ccttttgggc tcaaagccct 13261 tcctacctct ggtgatggtt tccacaggaa caacagcatc tttcaccaag atgggtggca 13321 ccaaccttgc tgggacttgg atcccagggg cttatctctt caagtgtgga gagggcaggg 13381 tccacgcctc tgctgtagct tatgaaatta actaattgaa aattcactgg ttggtggacg 13441 cacatttctc tttcacctgg gtttccctgg gtctcatgga cagctccaac ttgatttggg // LOCUS HUMADRBRA 3458 bp DNA PRI 13-FEB-1996 DEFINITION Human beta-2-adrenergic receptor gene, complete cds. ACCESSION J02960 NID g178203 KEYWORDS adrenergic receptor; beta-2 adrenergic receptor. SOURCE Homo sapiens (clone: H-beta-R-[9,10,11].) epidermis DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3458) AUTHORS Emorine,L.J., Marullo,S., Delavier-Klutchko,C., Kaveri,S.V., Durieu-Trautmann,O. and Strosberg,A.D. TITLE Structure of the gene for human beta 2-adrenergic receptor: expression and promoter characterization JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (20), 6995-6999 (1987) MEDLINE 88041037 COMMENT Draft entry and computer-readable copy of sequence [1] kindly provided by L.J.Emorine, 25-AUG-1987. FEATURES Location/Qualifiers source 1..3458 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="H-beta-R-[9,10,11]." /cell_line="A431" /tissue_type="epidermis" /map="5q31-q32" mRNA 1045..3057 /note="beta-2-adrenergic receptor mRNA (alt.)" mRNA 1055..3057 /note="beta-2-adrenergic receptor mRNA (alt.)" mRNA 1064..3057 /note="beta-2-adrenergic receptor mRNA (alt.)" gene 1264..2505 /gene="ADRB2" CDS 1264..2505 /gene="ADRB2" /codon_start=1 /db_xref="GDB:G00-120-541" /product="beta-2 adrenergic receptor" /db_xref="PID:g178204" /translation="MGQPGNGSAFLLAPNGSHAPDHDVTQQRDEVWVVGMGIVMSLIV LAIVFGNVLVITAIAKFERLQTVTNYFITSLACADLVMGLAVVPFGAAHILMKMWTFG NFWCEFWTSIDVLCVTASIETLCVIAVDRYFAITSPFKYQSLLTKNKARVIILMVWIV SGLTSFLPIQMHWYRATHQEAINCYANETCCDFFTNQAYAIASSIVSFYVPLVIMVFV YSRVFQEAKRQLQKIDKSEGRFHVQNLSQVEQDGRTGHGLRRSSKFCLKEHKALKTLG IIMGTFTLCWLPFFIVNIVHVIQDNLIRKEVYILLNWIGYVNSGFNPLIYCRSPDFRI AFQELLCLRRSSLKAYGNGYSSNGNTGEQSGYHVEQEKENKLLCEDLPGTEDFVGHQG TVPSDNIDSQGRNCSTNDSLL" BASE COUNT 777 a 890 c 886 g 905 t ORIGIN 1 bp upstream of EcoRI site; chromosome 5q31-q32. 1 gaattctcat tgcatctcca gttcaacaga taatgagtga gtgatgccac actctcaaga 61 gttaaaaaca aaacaacaaa aaaattaaaa caaaagcaca caactttctc tctctgtccc 121 aaaatacata cttgcatacc cccgctccag ataaaatcca aagggtaaaa ctgtcttcat 181 gcctgcaaat tcctaaggag ggcacctaaa gtacttgaca gcgagtgtgc tgaggaaatc 241 ggcagctgtt gaagtcacct cctgtgctct tgccaaatgt ttgaaaggga atacactggg 301 ttaccgggtg tatgttggga ggggagcatt atcagtgctc gggtgaggca agttcggagt 361 acccagatgg agacatccgt gtctgtgtcg ctctggatgc ctccaagcca gcgtgtgttt 421 actttctgtg tgtgtcacca tgtctttgtg cttctgggtg cttctgtgtt tgtttctggc 481 cgcgtttctg tgttggacag gggtgacttt gtgccggatg gcttctgtgt gagagcgcgc 541 gcgagtgtgc atgtcggtga gctgggaggg tgtgtctcag tgtctatggc tgtggttcgg 601 tataagtctg agcatgtctg ccagggtgta tttgtgcctg tatgtgcgtg cctcggtggg 661 cactctcgtt tccttccgaa tgtggggcag tgccggtgtg ctgccctctg ccttgagacc 721 tcaagccgcg caggcgccca gggcaggcag gtagcggcca cagaagagcc aaaagctccc 781 gggttggctg gtaagcacac cacctccagc tttagccctc tggggccagc cagggtagcc 841 gggaagcagt ggtggcccgc cctccaggga gcagttgggc cccgcccggg ccagcctcag 901 gagaaggagg gcgaggggag gggagggaaa ggggaggagt gcctcgcccc ttcgcggctg 961 ccggcgtgcc attggccgaa agttcccgta cgtcacggcg agggcagttc ccctaaagtc 1021 ctgtgcacat aacgggcaga acgcactgcg aagcggcttc ttcagagcac gggctggaac 1081 tggcaggcac cgcgagcccc tagcacccga caagctgagt gtgcaggacg agtccccacc 1141 acacccacac cacagccgct gaatgaggct tccaggcgtc cgctcgcggc ccgcagagcc 1201 ccgccgtggg tccgcctgct gaggcgcccc cagccagtgc gcttacctgc cagactgcgc 1261 gccatggggc aacccgggaa cggcagcgcc ttcttgctgg cacccaatgg aagccatgcg 1321 ccggaccacg acgtcacgca gcaaagggac gaggtgtggg tggtgggcat gggcatcgtc 1381 atgtctctca tcgtcctggc catcgtgttt ggcaatgtgc tggtcatcac agccattgcc 1441 aagttcgagc gtctgcagac ggtcaccaac tacttcatca cttcactggc ctgtgctgat 1501 ctggtcatgg gcctagcagt ggtgcccttt ggggccgccc atattcttat gaaaatgtgg 1561 acttttggca acttctggtg cgagttttgg acttccattg atgtgctgtg cgtcacggcc 1621 agcattgaga ccctgtgcgt gatcgcagtg gatcgctact ttgccattac ttcacctttc 1681 aagtaccaga gcctgctgac caagaataag gcccgggtga tcattctgat ggtgtggatt 1741 gtgtcaggcc ttacctcctt cttgcccatt cagatgcact ggtacagggc cacccaccag 1801 gaagccatca actgctatgc caatgagacc tgctgtgact tcttcacgaa ccaagcctat 1861 gccattgcct cttccatcgt gtccttctac gttcccctgg tgatcatggt cttcgtctac 1921 tccagggtct ttcaggaggc caaaaggcag ctccagaaga ttgacaaatc tgagggccgc 1981 ttccatgtcc agaaccttag ccaggtggag caggatgggc ggacggggca tggactccgc 2041 agatcttcca agttctgctt gaaggagcac aaagccctca agacgttagg catcatcatg 2101 ggcactttca ccctctgctg gctgcccttc ttcatcgtta acattgtgca tgtgatccag 2161 gataacctca tccgtaagga agtttacatc ctcctaaatt ggataggcta tgtcaattct 2221 ggtttcaatc cccttatcta ctgccggagc ccagatttca ggattgcctt ccaggagctt 2281 ctgtgcctgc gcaggtcttc tttgaaggcc tatggcaatg gctactccag caacggcaac 2341 acaggggagc agagtggata tcacgtggaa caggagaaag aaaataaact gctgtgtgaa 2401 gacctcccag gcacggaaga ctttgtgggc catcaaggta ctgtgcctag cgataacatt 2461 gattcacaag ggaggaattg tagtacaaat gactcactgc tataaagcag tttttctact 2521 tttaaagacc cccccccgcc caacagaaca ctaaacagac tatttaactt gagggtaata 2581 aacttagaat aaaattgtaa aattgtatag agatatgcag aaggaagggc atccttctgc 2641 cttttttatt tttttaagct gtaaaaagag agaaaactta tttgagtgat tatttgttat 2701 ttgtacagtt cagttcctct ttgcatggaa tttgtaagtt tatgtctaaa gagctttagt 2761 cctagaggac ctgagtctgc tatattttca tgacttttcc atgtatctac ctcactattc 2821 aagtattagg ggtaatatat tgctgctggt aatttgtatc tgaaggagat tttccttcct 2881 acacccttgg acttgaggat tttgagtatc tcggaccttt cagctgtgaa catggactct 2941 tcccccactc ctcttatttg ctcacacggg gtattttagg cagggatttg aggagcagct 3001 tcagttgttt tcccgagcaa agtctaaagt ttacagtaaa taaattgttt gaccatgcct 3061 tcattgcacc tgtttctcca aaaccccttg actggagtgc tgttgcctcc cccactggaa 3121 accgcaggta actacttgta attactgccc atgacttaat gtagaatgat acaagaatga 3181 catgcacaga ttgcttaacc ctttcatttg cctttgagtc tgctgctgca aagctgcatc 3241 tctcctgaca cttgtgcccc aaatcagttc tgcctgctct tagtatagct caactctccc 3301 tatggttatt gttctgtgtt gttacctcag aaacactgac tcacagaagc ggagttaagg 3361 ggatatgttt ttttctctcc acgtgcaccc accacccacc ttccagttct acttgtttca 3421 aaactgttta tatttctgtc ttggccatgt gtttacag // LOCUS HUMMETIII 2167 bp DNA PRI 01-JUL-1992 DEFINITION Human metallothionein-III gene, complete cds. ACCESSION M93311 NID g187546 KEYWORDS metallothionein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2167) AUTHORS Palmiter,R.D., Findley,S.D., Whitmore,T.E. and Durnam,D.M. TITLE MT-III, a brain-specific member of the metallothionein gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 6333-6337 (1992) MEDLINE 92335292 FEATURES Location/Qualifiers source 1..2167 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 387..392 CDS join(482..512,761..826,1731..1840) /codon_start=1 /product="metallothionein-III" /db_xref="PID:g187547" /translation="MDPETCPCPSGGSCTCADSCKCEGCKCTSCKKSCCSCCPAECEK CAKDCVCKGGEAAEAEAEKCSCCQ" polyA_signal 1957..1962 BASE COUNT 424 a 663 c 666 g 414 t ORIGIN 1 gaattctaga atgaagggga agagaggcag ggaagagctg ggaaatacgc aaagcgcctt 61 tttctccact ttcggagatg gtacgtgcgc gcttccacgc agtggcggct gctgcggcga 121 gcacgtcccc tgcgggaccc acgcggggag tgggctggca gtgcgcgcat agcggcggcg 181 agtgggtcgt gcacgcggat gcggggtggg agtgggggcg cacgcgcggg cgtgggcgag 241 cgggccccgg cagtgcacac acacggcagg ggcgggcgac agatgcagtg cgtgcgccgg 301 agcccaagcg cacaaacgga aagagcgggc gcggtgcgca ggggcgggcg cccagcgggc 361 ttggcatgcg cgcccccgcc cgaggctata aaagcatcgc cacctgctgc cactagccaa 421 gccgcgcgtc cagttgcttg gagaagcccg ttcaccgcct ccagctgctg ctctcctcga 481 catggaccct gagacctgcc cctgcccttc tggtgagccc ccgcccccgc tcgcatcctg 541 cgcactgcgc gcccttgtac ctgcaaagaa acccacgccc tgcgccttcg ctcaaggaca 601 cttgggggaa gggcccctga ttccctattc ttcacctcgt gaagggcggg catgcctgtg 661 tcgcggagaa cagggagact tggcacccca tctcctcgtg acaggcgtgg ggacccgagt 721 tcgtccacat taacccttcc tgtggcgtcg ccctctctag gtggctcctg cacctgcgcg 781 gactcctgca agtgcgaggg atgcaaatgc acctcctgca agaagagtga gtgcggggac 841 ccttcccctc tgccgccgcc ccctctgctc tgcggagtcg gtgtctcacc acgcaggatg 901 tggagagaca gccaggcccc gatcccgtgg tttctgactc ttgctggaat agaaccaccc 961 gggcagacat taaaatacag atgcccctgc cccacaccca ggaattgatg gcctaggcgt 1021 gggacgagag gtttttgtaa atccccagaa gactccagtg gcagctggga ttgaggactg 1081 ccacactggt ccaacctttc ccgacccatc cctcaaaaac tccccaaaac ctggggaagc 1141 cattagtgag gctgcggctc agctctggag ttccggtccc ttggcctctc tccggcctta 1201 cggatcctct cagtttgatc tcaaaatctc cccagctcac ggcagtgtaa tatgcatagg 1261 gagtaagagg tgggaggaag gctcccttct tccctaggta tgaaaccagg cacaggtccc 1321 caccgcctcc ccacggcttt cctgggcccc ttacatctgc accatatctg cccctcccaa 1381 gtttatcctt tgaggcccct ttcaaggtcg tttaccctgt gatgctttct ccttaaggcc 1441 tagtacccac ctaagctggg cttaagccca ccagccccca ggacttcctg gcatccaccc 1501 aaggggctct ggtcatttcc tgggtacctc actcctgtgg aggtgcagga tgccactgcc 1561 gcgacataga tgctgagtca aagcaggtgt gagactaaga agggggctgc gactagccct 1621 ggctgaatga accaggatga ctccccaccc agcacccttc cctccccttt gatggggacg 1681 aattggggga gtgtgcatca gagagtggtc atcttccatt ttatctgcag gctgctgctc 1741 ctgctgccct gcggagtgtg agaagtgtgc caaggactgt gtgtgcaaag gcggagaggc 1801 agctgaggca gaagcagaga agtgcagctg ctgccagtga gaaggcaccc ctccgtgtgg 1861 agcacgtgga gatagtgcca ggtggctcag tgccacctat gcctgtggtg aagtgtggct 1921 ggtgtcccct tcccctgctg accttggagg aatgacaata aatcccatga acagcatgag 1981 ccaaggactg gtctcttctt aaagggggga aggatgtgga gcagtggggg agcctattcc 2041 aagggagcca cacagttaag agtgaaaccc tggctgggtg cagtggctca cgcctgtagt 2101 cccagcactt tgggaggccg aggcaggtgg atcacctgag gtcaggagtt cgagaccagc 2161 ctggcca // LOCUS HUMXRCC1G 37785 bp DNA PRI 30-JAN-1995 DEFINITION Human XRCC1 DNA repair gene, genomic. ACCESSION L34079 NID g642116 KEYWORDS Alu repeat; DNA repair protein; tandem satellite array. SOURCE Homo sapiens (tissue library: LL19NC02-F2) adult blood DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 37785) AUTHORS Lamerdin,J.E., Carrano,A.V., Thompson,L.H., Montgomery,M.A., Stilwagen,S.A., Scheidecker,L. and Tebbs,R.S. TITLE Genomic sequence comparison of the human and mouse XRCC1 DNA repair gene regions JOURNAL Genomics (1995) In press FEATURES Location/Qualifiers source 1..37785 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="UV5HL9-5B" /cell_type="lymphocyte" /dev_stage="adult" /tissue_type="blood" /tissue_lib="LL19NC02-F2" /map="19q13.2" gene 4268..36195 /gene="XRCC1" exon 4268..4318 /partial /gene="XRCC1" /note="exon 1; G00-120-737" exon 4724..4816 /gene="XRCC1" /note="exon 2; G00-120-737" exon 18706..18816 /partial /gene="XRCC1" /note="exon 3; G00-120-737" exon 24922..25080 /partial /gene="XRCC1" /note="exon 4; G00-120-737" exon 26043..26117 /partial /gene="XRCC1" /note="exon 5; G00-120-737" exon 26214..26325 /partial /gene="XRCC1" /note="exon 6; G00-120-737" satellite complement(26342..26484) /partial /gene="XRCC1" /note="human chromosome 19-specific tandem repeat, pE670" exon 26635..26744 /partial /gene="XRCC1" /note="exon 7; G00-120-737" exon 26818..26929 /partial /gene="XRCC1" /note="exon 8; G00-120-737" satellite complement(27008..27410) /partial /gene="XRCC1" /note="human chromosome 19-specific tandem repeat, pE670" exon 27451..27709 /partial /gene="XRCC1" /note="exon 9; G00-120-737" exon 28039..28155 /partial /gene="XRCC1" /note="exon 10; G00-120-737" exon 32609..32702 /partial /gene="XRCC1" /note="exon 11; G00-120-737" exon 32859..32991 /partial /gene="XRCC1" /note="exon 12; G00-120-737" satellite complement(33001..33360) /partial /gene="XRCC1" /note="human chromosome 19-specific tandem repeat, pE670" exon 33480..33534 /partial /gene="XRCC1" /note="exon 13; G00-120-737" exon 33629..33768 /partial /gene="XRCC1" /note="exon 14; G00-120-737" exon 35353..35443 /partial /gene="XRCC1" /note="exon 15; G00-120-737" exon 35900..35975 /partial /gene="XRCC1" /note="exon 16; G00-120-737" exon 36082..36195 /partial /gene="XRCC1" /note="exon 17; G00-120-737" BASE COUNT 8717 a 9329 c 9806 g 9929 t 4 others ORIGIN 1 gatctctttt tgttgtccag gctggtctca aactcctggg ctcaaacgat cctcccacct 61 caacctccca aagtgctggg attacaggcg tgaaccaccg tgcccggcct ccctttcatt 121 ttactcatcc acatctcccc ctccctgtct ctgcttctcc tctttacgat gatttgctca 181 gccaggcacg gtagctcatg cctgtaatcc cagcactttg ggaggtcgag gcgggagaat 241 cacttgaggt caggagttag agaccagcct ggccaacatg gcgaaacccc gtctctacta 301 aaaaatacaa aaattagcca ggtgtggtgg cacgcacctg taattccagc tactcgggag 361 gctgaggcag gagaatcgct tgaacctggg aagtggaggt tgcagtgagc agagatcgcg 421 ccattgcact ccatcctggg tgacagagcg agactctgtc tcaaaaaaga aaaaaaatga 481 tttgctccat acttcaaggt tatcatccat ctcctgtttg gggtgatccc ttttctgtct 541 cttccaccac tccaacactc caccatcatc tctcctccct cggtcaggaa tgacctcttc 601 cctgattaca cttacacttt cccaccccag cccagcctct tccttctcta tccactcccc 661 accatttcca cctcctgcca gttagattct tcccctctca ctatttctct gccccccaca 721 gtgtctttcc ttacccctct tctgcctaag aaagtctatt aactctcacc cctcccacat 781 tttccttctc tttaggacag tgtcattttc agaattctgt gacaataatg atttctatca 841 caccacttat ttctgttccc tccatctcat gatggtcttt ccctctttcc tgctctccac 901 tgatttcttc ccacgctccc gtcttgagaa ccatgatgat cccccctcag catggtagtc 961 tcttccaccc tgaaggtaca ccccacctcc ccaccaggtc ccaggccaca tcctcttact 1021 tgaagtagcc ttcccgacca ggagcacaca tgtgtccttg tcactgctgc aggtcttcat 1081 ttggccatgg cacctgctcc ccgccgccgt acatatttcg cagtgtagtg ggcaccctgc 1141 agtgtagaca gagggcgagg ctgtagtcat gggtggtgtg gtcttcggac cacgactttg 1201 acttaggact tttgtgtcct gagagaggag ggaccagcag ctccctggtt tttcatgggc 1261 gggatgttga ccactgccca aggaactgtc cagagttgga cgaagaagca gtggtgatgc 1321 cagcaaagct gcttccaggg gcgggaagcg actccaccat gaagcctgtt tgcctaatta 1381 catggctggg gattttggat gccagggtcc gctgagaaag ctcaggggca gggaggggtc 1441 gtgatggagg tttccatgga cttttgtgaa acttcaggtc cagctggttc ttgaatgtgg 1501 gattctcctc cttgaaggtg ccatgtaggt cttgtagaca gcaggagctc aggcaggggg 1561 ccctgaggct agggtctgca tcccaggtca aggagggaga gtgaggacca ggagggcaga 1621 ggacttgaac ttcttggttc tgagagaggg gacagctagg ggcccagaaa cctgggtcct 1681 ggggaaagtg aggcttactt ggactcctgt gtgcagggga ggagggggct ggggcctgaa 1741 cttcttagtc taaggagaaa ggaaatctct ctacctcttt taaaaaaata tatcatatga 1801 tattaatgta ttttttttga gatggagttt tgctctgtca cccaggctgg ggagtgcagt 1861 ggcacgatct tggctcactg caacctttgc ctcccagttt caagcaatcc tctcacctca 1921 gcctcccaag tagctaggat tacaggcgcc caccaccaag ctcagctaac tttttttgta 1981 tttttagtaa agacggggtt tcaccatgtt ggccaggctg gtcttgaact cctgacctca 2041 agtgatccac ctgcctcggc ctcttaaagt gctggggtta caggcatgag ccaccgcacc 2101 tggccttctc cctctttttt tttttttaat tatattttta aagatttccc tttccagcca 2161 tgctctcatc tgatttccta gaattgagat gtgtttggga tccttcctgg aggatggagg 2221 attcagagcc caccaagccc ctctctcagg gatccaatca caggagcttt tcttctctgg 2281 gacccccagg ctttctcttc ctttaccctt ggcctcttcc atgaatctga atttccagaa 2341 tgcctccccg ttcaccttga ggacttgctg aaggctcttt ctctctctat atctctctct 2401 ctctcaatct gggacctctc tcttcctgag gtcccagaga cagttcaggt cccttgtcac 2461 ttaccaagac ccaggagggt gcagagcaac acaaaggcca gcagaaaggt ctctggtctc 2521 ctggagagcc tcatggtgtg tgtatcagca gtacctggtc cccagctttt ataggaagct 2581 gaggaggggt gggtctggac ccacattcac cctacgggga gaaatagctg aggtcagaac 2641 ccagggcagt cctctgtggc ctccgggttg ggaaggcaga accagaatcc aaaagtgacc 2701 tgggcatctt gtgcccagcc ccccacctcc atggagaccc caggagtctg ggcttccatc 2761 cccctcttct attagaaccc agaaagtcac caccctgttt tctcaccttc ccacggccca 2821 ttgttgtttg tttgtttgct tgaagaccaa gacggagttg ggcctcttga ttcccagtgg 2881 ctgcaagaac tgggattccc tctccttctc tctcttcccc tctcctccca aggaagcacc 2941 cagtgtgggt agggccccac cttggagcac ccagcatgtc caggctggtc ctgcgggtgc 3001 ggggtgcctt gtctcccttc ccctctcttc atgtgctcgg ctcacactcc ttgtgtgttt 3061 gggctgtctc cagctgcgac cccacccagg gctgcaccct tgggtctcca aggcaccgaa 3121 tgaacagagg gatgtctgtc ggcagtcata ggcgggcgta gtaaaagaca gatgcccaca 3181 gtccacatat tgggaaactg aggcttttgt ggaagaactc caggataggg gtgtgtgtgt 3241 gtgtgtgtgg tgggggttgg gcacatgctt tgtgttatat ttaggacgca gaacccttct 3301 cttttggcct caggcataag gctgaaagag atctgctaat ttttttcgcg cgtgcgcgcg 3361 cgcgtatgta tgtatatgtg cgtgtgtgtg tgtgtggcaa ggggacagag agaagagtgc 3421 agccgcctga gccatttgaa gagatcctgt tgcgtagaat ccaggttccc tacgaaacta 3481 cgaaaattca ctcagatttg ctgtcctagc ccagcgcagt cgctcacgcc tgtagttcca 3541 gcactttgag aggccgaggc aggtggatcg cttgagctga ggagttcgag accagtctgg 3601 gcaacatggc aagacccgcc gccctctcac cccatgtctc cacaaaaaat acaaaaatta 3661 gccgaatgtg gtggcgcgtg cgtgtaatcc cagctactta ggaggctgaa gtgggaggat 3721 cccttggccc caggagacag gggttgcaga aagccgagat cgtgccactg cactccatcc 3781 tgggtgagag agcaagaccc tgtctcaaca aaaaattttt aaaaaataaa ataaataata 3841 atacagcaaa aagatttgct ttctcggctt cagtgtgggc ggtaactcca tcgtgcaatg 3901 agaaaggcga atttcttcca gacaccaatc ccggaggtcg cttctgttgc taggctccca 3961 gaaagcaggg ttcggacgtc attgggaggc gaggctagag cggggttgtg tgtggcggag 4021 ggaggcgggg ctggaggaaa cgctcgttgc taaggaacgc agcgctcttc ccgctctgga 4081 gaggcgcgac tgggcttgcg cagtgtcgac gccggcgccg gcgcgccggg gtttgaaagg 4141 cccgagcctc gcgcgcttgc gcactttagc cagcgcaggg cgcaccccgc cccctcccac 4201 tctccctgcc cctcggaccc catactctac ctcatccttc tggccaggcg aagcccacga 4261 cgttgacatg ccggagatcc gcctccgcca tgtcgtgtcc tgcagcagcc aggactcggt 4321 gagggacctg catgggggag gttggaagac gtgagggaat taatgaggat agctcttaaa 4381 agagcttttg gggttctctt ggaagtctaa ggagagtcat ggggggattt ccaggaggct 4441 aaaagtgaat tgtggagaga tgttgaatta gggacgttcc ggcgtctagg gaacttccgg 4501 agagaaaagg aggaataggg aagctttctt ggagagccca taggtgaact tccacgggaa 4561 agtgttgtag gggaaactcc aggcagagtg tcgggggatt aacgggagag cttaaatctg 4621 ggggtcgtag aggccttggg aggactcaaa ggaatcatct gagagctgga ggcctagctg 4681 tcatctctgt gccctaattc tcctctccca ctgttgtttc cagactcact gtgcagaaaa 4741 tcttctcaag gcagacactt accgaaaatg gcgggcagcc aaggcaggcg agaagaccat 4801 ctctgtggtc ctacaggtga tcctgccacc ccacctcctc ctcactagga gtctggagtc 4861 cggggttcca gattctgctc ctgagccaac tggcagtggg tttagtcact ttctttgctg 4921 ggcctcagct tccttactcg gaaaatgggc agcataatcc ctgtgcccct cagcagagag 4981 gaaaggagac ccaaggatta agtccaggat gaccacagga atcagtgtgt ggtcctgggc 5041 aggtggcatc tactttctgg gttccagttt cctcagctgg aacaaaatga atcattgcca 5101 cgtactctgc ctcctgcagc cagtgggctc agggctggaa ggtgtccagg aaggtcgtct 5161 aagccaatca actccacttt aaagaagggg aaactggccg gatgccgtgg ctcatgcctg 5221 taaacccagc actttcagag cctgagatgg gaggattgct tgagctcagg agttccagac 5281 cagcctgggc aacatatcaa gaccctgtct ctatttttat taaaataaat aatttttttt 5341 aaagaataat agccaggtat ggtggttcac acctgtaatc ccatcacttt gggaggccga 5401 agctggagta ttgcatgagc ccaggagttt gagaccagcc tgggcaatac agtaagatgt 5461 catctctacc aaaattttat agattagctg agtatggtgg tgtgcacctg caatcccagc 5521 tactggcgag tctgaggtga gagaattgct tgagcccagg agatggaggc tgcagtgagc 5581 tgagattgtg ccactgcact ccagcctggg caccagagtg agacccccaa aatataaaaa 5641 ataaaataat aataatgaga gctggcatta ttcaacgtta cctgtcagtc gctgctctac 5701 acacctcact cacatggatt agctcattga gtcctcataa aaaaccccaa gaggcaggct 5761 ctgttatccc cacttttaca gatgaggaaa ctgaggccca aagaagttat acaatttgcc 5821 tgacagtcaa tcttttattt atctgttgag acagagtctc actctgttcc caggctgtag 5881 tgcagtggca tgatcttggc ttggctcact gcagcctcta cctcctgggc tcaagcgatc 5941 ctcccacctc agcctcctga gtagctggga ctgcaggtgt gcaccaccat gcctggctag 6001 ttttttttgt atttttttag agatggggtt tccaccatgt tgccaggctg gtctcaaact 6061 cctgggctca agcgttcctc ctgcttcagc ctcccaaagt gctcggatga caggcatgaa 6121 ccattgttcc tggcctgggg gctgctttat gactataagg agcagaacac tctgtggata 6181 gtgctagggt gacggaaaaa ttaacctttt gtgaatgggc agtcattcag caaatatcca 6241 ctcctctccc tcccactgga gggggagtat atttccctgc tccattgatg agtttgacca 6301 tatgatttac ttaatggaat ttcagcacat ctaacttcag caagggcctt aaatgcactt 6361 ggcttttgtg ctcccataat atgccatgag aacatttgcc aggttactgc taatcccaga 6421 aaactgtgga atctcatgga agagacctga atcctcccaa actgcagcca aagcagccat 6481 cccacaaatt cttataagaa tgcttgttgg tataagtctc agttttcatg ttgggaggtc 6541 gaggtgggtg aatcgcttga acccaggagt ttgagaccag cctgggtaac atggggaaac 6601 cctatctcta cgaaaaatta taaaaaatag ctgggcttgg tggcgtatgc ctgtagtccc 6661 agctaccagg gaggctgagc ggggaggatt gattgagccc ataaggttga agctgcagtg 6721 aactgtgatg gtttcactgc actccggcct gtgagacaga gtgagactgt atctcaaaaa 6781 aaaaaaaaaa aaaaaaaaga aaagatctca gttttaggat gatttgttac acagcattat 6841 tacagtgtaa gttaatacaa cttggttgtt ttaaaccact gagattaggg actcctttgt 6901 tactgtaaca tcatctggtc caccccaatt gatgtcagtg tttagtgcct gtcaactaat 6961 ggcctccact tcccaggttc aaatgattct catgcctcag ccttctgagt agttggaatt 7021 tacatgcctg gctaattttt gtatttataa tagagaccga gttttgccat gttggctagg 7081 ctggtcttga actcctgacc tcaggagttc aaggcaaatg agcaccaaat gctaatttaa 7141 gtgatccacc caccttggcc tcccaaaatg ttgggattac aggcatgagc cactgcacct 7201 gacctcagtc ttttaaactg tgatagatac agattgctga cagttttagt cttgagatct 7261 agattcaagc ccccagctct tcctcttaat cggtgacctc aagaaaattg tttcatttct 7321 ctaacctgtt ttctcatgtg tccattggga ataatgacat ccaagcccgg agttgtgaaa 7381 attcggtgac ataatgtctg cacaagtgct ctgtcaacat gtaggtgttt gtgtagaggg 7441 agggatttgt gtttttatta cttagtgggt gatgattcca ctgggctctc ttatgatcaa 7501 gtgactccag ggaaatcaca tttcattctg ttaccccagt tgctgtggaa acagagtgac 7561 tggaggcaga agagacagaa cctgtctctg gcctgggaga gactgaggga gtattgcagc 7621 ggtcaccaac atggtaaacc ccagagggag cagcttgggg aggtgatgtg tttcagttgg 7681 gatgtccagg tttgatgtga ctttgggaca ttctaggggt ggcatccagg aggcagttgg 7741 ggatcacatg tggagctgaa gacagaatgt ggggaagttg tcagtgcaga gaaagaacct 7801 atagactcgg gagtagatga tggatttatt tgttcattca ctcatttatt tagtcaagta 7861 tttattaagt gccttctgta ggccaagcac tgttttaata cagcagtaca cagcagaaga 7921 caaaaaatat tcactctcag tggggtgcag taagctcacg cctataatcc cagcactttg 7981 ggaggccaag gcctggcagt tcgagaccag cttgggcaac acagggaaac ttcatctcta 8041 ctaaaaatac aaaaatctgc caagcgtggt ggcctatgcc tgtggtccca gctactccag 8101 aggctgaggc gggaggatca cttgagcctg gaagatcaag gctgtagtga gctatgatcg 8161 caccattgcg ctccagcctg ggcaacagaa tgagatgctg tctcttaaaa acaaaagaaa 8221 ttcagtcttg tggagctttc acgttagtag ggaaagactg acaaatgagt cagtgaaaca 8281 tatgttggat gatgagtgct atggaggaac ttaatgaagg ggaatgagag tgccccagct 8341 ggagctggag cagtggcaag gatttttatt taaatagtag ggtccgggaa ggcctccctg 8401 aaaaggggac atttgagcaa catgagctat gtatgtatct agggaaaaga ctctgaagct 8461 ggcttgtgac agcaaggagg ccagtatggc tggggtggag tcagccaggg ggagagaggt 8521 aggagatgag gtaacagggt gggactggga acccctaaaa aaccacttaa ggcctggaaa 8581 gccattgaaa ggacttgagc ttttactcca ggtgtgagat aggagccacc ggagggctcc 8641 gagcagagga gggacaggat ctgccttagg ccttcacagg atcctgctgg ctccatgttg 8701 acaatcaact gcaggggcca cacacaggag tgggagatca gcattgcagc agttgttttg 8761 ttaacttact ccacgtcaga gaccctttgg aatgactgat tgcatctggc aatgactaat 8821 ttgtggggtt actctggatt catcaaatgc taattttgat tcttttgacg taactgataa 8881 aggcagacat ttctggccac caaggcagta ggcttatctc tgggttgtta tgttatgttt 8941 tgagacaggg tctcactctc ttgcccaggt tggagtgcag ttgcaaagtc atggctcact 9001 gtagcctcaa cctcctgggc tcaggcaatc ctcccacctc agcctcccag atagctggga 9061 ctacaggcgt gtgccgccag gcccagctaa ttttaaaaat ttttcgtaga gatggggtct 9121 cactatgttg cccagcctgg tctcggactc ctgagctcaa gcagtcctcc tgtcttggcc 9181 tcccaaagtg ctggcattac aggcatgagc cactgcacct ggccacatct gtgggttctt 9241 ctactaccct gcattttctc ctctaattat tgttatgcaa aatctcgatc ttaccacatg 9301 gggcttagcc atctgcttat gacagcccca cacacctggt tgtttgcagt agccacagca 9361 ggctctgttg tctgttcatc acctgctccc agagggaaat gccaacctct ttgctaacta 9421 ctcctgggct ggaggggtgg cttggacaag acccagatat aactctggtg tgtcactgga 9481 ggtgtgtcat ttttagaaga aatgttcatc agcaataggg cctatggtca tttttgattg 9541 ttattggttt tctaatcact cagttcgcca ctgtggtgtc ttaaatttgt ttttactgtt 9601 ttttgatgtg taacatacat agagtgaagt gaaggtgtaa gtaagagtac agctcagtga 9661 ctctttgcaa actgcagccc tgtgtactcc acacccagat gcagcacaga acatgcctag 9721 cccccagcag ttccctcagg ttctttctgt cagtacccct gcaagggtaa ccactgtcct 9781 gattgttaac agcatagatt cattttgcct gattttgaat tgcatataaa atggaatcgt 9841 gtggtataaa cttttcttat ctggcttccc ttgttatgtt tgttcatttc atctatgtcg 9901 ttcagtgtag ctgtagatga tttattgtga tttttttttt ttctgagacg gagtttcgtc 9961 ttgttgttct ggctggagtg caatggcaca atctcagctc actgcaacct ccgcctccca 10021 ggttcaagtg gttctcttgc ctcagcctcc caagtagctg ggattacagg catgcaccac 10081 cacgcctggc taattttttg tatttagtag agacaggttt tcaccatgtt ggtcaggctg 10141 gtctcgaact cctgacctct agtgatccac ctgccttggc ctcccaaagt gctgggatta 10201 caggcatgag ccactgtgct cgactcctat tgtctttttt tttttttttt gagacagggt 10261 ctcactatgt tgcccaggct ggagtgcagt ggcaccatct cagctcacta caggctcgac 10321 ctcccgggct caagcgatcc tcccacctca gcctcctcag tagttgggac tacaggcatg 10381 tgccagttca ccagtctaat ttttgtattt tttttgcaga gatgggggtc tcactgtgtt 10441 gtctaggctg gtcttgaact cctggactca agcattcctc ctgcctcagc ttcttaaatc 10501 cttgggatta cagacatgag ccactgcacc tggcctattg tcatttctgt atagtattct 10561 gttgtattga gataactatt atggttttat ttatttatat ttgattttct tttttaaacg 10621 gagtctcact atgttgccaa ggctggtctt gaactgttga tctcaggtga tcccccaact 10681 tggcctcctg agtagctagg actataggtg cacaccacca cacccaacta attttttttt 10741 ttttttttaa gctttttgga ggcagcttcc tgagtagctg ggattacagg gattacagac 10801 agagccacaa agacctggcc attggggttt tattatattt gctttctgaa tgggtattat 10861 atgcatatag tgcaaaattc aaaaggtgca aaagggtata cactaaaaag aaaatctccc 10921 ctcacccttt ccatccctcc ctgctccctc cccaactcct gctccctccc caactcctgc 10981 tccctcattc ttctctcctg atgcaaccac cattagcagt ttttttgggt ggtcttcctg 11041 ggtcattctg tacacataca agcatatata cctaccgcct ccatcccttt ttaaaaaatt 11101 catagtatat acactgttcc cttgtgtgtt ttttttcaag taacagtgta ttttggagat 11161 gcacactgca tgaagttcta ccttatttat ttatttattt atttacttac ttacttactt 11221 acttactttg agacagaatc tcgctctgtc actcaggctg gagtgtagta gtgcgatcgt 11281 ggctcactgc aacctccccc tcccagcttc aagtgattct cctgcctcaa tctccctagt 11341 agctgggatt acgggcacat gccaccaggc ttggctaatt tttgtatttt tggtagagat 11401 ggggtttccc catgttagcc aggctggtct tgaattcctg acctcaagtg atcctcctgc 11461 cttggcctcc caaagtgttg ggattacagg cgtcagccac gccgtgcgca gcctacctta 11521 ttcttttaaa tgatgcatac aattgcattt catggccggg cgcggtggct cacgcctgta 11581 atcccagcac tttgggaggc tgaggcaggt ggatcacgag gtcaggtgtt caagaccagc 11641 ctggccaaga tggtggaacc ctgtctctac taaaaatata aaaattagcc aggtgcagtg 11701 gtgggcacct gtaatcccag ctacctggga ggctgaggca ggagaattgt ttgaacccgg 11761 gaggcggagg ttgcagagag ccgagatcgt gccactgtac tctagcctgg gtgacagagc 11821 aagactctgt ctcaaaaaaa aaaaaaaaag aaaagattgc atttcatgga tgtgaaacag 11881 ttttttagca atatatctca ggaactttgc cttcctgtgc catgtttgtt tcaggaacag 11941 ttcatatcag aactgtgtct ccctgagaaa gctgcatgca cacgaaccca ggccatcggt 12001 gatagtcatc ttgtgctctt atcctcacag ctgatacata ctgagcactc atctcttatt 12061 ttgagaaaaa ttcacataaa atttactgtt ttaaccattt aaagtgcaca tttcagccag 12121 atgcggtagc tcacacctgt aatcccagca ctttgggagg ccaaggcggg tggattgctt 12181 gaggtcaggg attcgagact aggctggcca acatggtgaa accccatctc tactgaaaat 12241 acaaaaatta gccgggggtg gtggtgcgtg cctgtagtcc cagctattcg ggaggctgag 12301 gcaagagaat cacttgaact ggaaggcaga gtttgcagtg agctgagatt gcaccactgc 12361 actccagcct gggcaacaga gtgagactcc gtctcaacaa aaaacataaa taaataaata 12421 aataaagtgc acatttcagg gttttttttt tttttttagc atattgacaa tgttcttcaa 12481 ccatctccat tctctaattc agaatatttt catcactcca aaaagaaacc acataaccat 12541 taaacagtca cttcttattc tctccccaat gttgcccttg gcaaccactg atctactttc 12601 tgcttctctg gatttgcgta ttctggactt ataaatgtat acgtggtctt ttacacttaa 12661 ccttgttaca ctttgcataa tgtttctgag gttcatccat gttgtagcat atatcacaac 12721 ttcagttttt taaataactg gatgatattc cataaatata ctgtacaatg tttatctatt 12781 catcaactga tggacattta ttgggttgtt accatgtata tataaacctt tttttttttt 12841 tttttttgag attgagtctt gctctgttgc ccaggctgga atgcagtggt gcaatctcgg 12901 ctcactgcaa cctctgcttc ccaggttcaa gtgattctcg tgcctcagtc tcctgagtag 12961 ctgggactac aggcgcatgc caccatgccc aactaatttt tctgttttta gtagagacgg 13021 ggtttcgccg tgttgcccag gttggtcttg aactcctggg ctcaagaaat ccacctgcct 13081 tggcctccca aagtgttggg attacaggcg tgagccacca ccccagcctc catgtataaa 13141 catttttatc agattcttaa aaagtattta tttacttttt tttttttttt ttttgagaca 13201 gagtcttgct ctgccaccca ggctggaatg cagtggcatg atcttggctc actgcaacct 13261 ccgcctcctg ggttcaagca attctctgcc tcagcctccc gagtagctgg gattacaggc 13321 gcctgccacc atgcctggtt aatttttgta tttttactag agacggggtt tcaccctctt 13381 ggccagcctg gtcttgaact ctcctgacct tgtgatctgc caacctctgc ctcccgaagt 13441 gctaggatta caggcctgag ccaccgtgcc tggccacttt tttttttttt tagatggagt 13501 ctcactctgt cacccaggct ggagtgcagt ggcatgattt cggttcgctg caacttctgc 13561 ctcccaggtt caagtgattc tctggcctca gcctcctgtg tagctgggat tacaggtgca 13621 cgccaccaag tctggctaat ttttatattt ttagtagaga cagggtttca ccatattggt 13681 catgctggtc ttgaactcct ggcctcaagt gatctgcctg ccttggcctc ccaaagtgct 13741 gagattacag gtgtgagcca ctgtgcccag cctgttgttt tctctcttga gacagagtct 13801 cactttatca cccaggctgg ggtgtagtgg ctcgtctcaa ctcactgcaa cctccacctc 13861 ccaggttcaa gtaattcttg tgccttagcc tcctgagtag ctgggattat tgaggtgcgt 13921 gtcaccacac ccagctaact ttttttattt ttggtagaga cagggttttg ctatgttggc 13981 caggctggtc tcgaactcct ggcctcaagt gattctcccg cctccacctc ccacggtgct 14041 gggattacag gcgtgagcca gtgctcctag ccaggacttt ttttatatac aaagaagaaa 14101 tgtcagggag tgcctgttta aagtagaaat agttatatat gcagtaggaa ttgttcattt 14161 aaaatacaag catctccatc ctaccacatt taggagctat gctgaatatt gtgggagagg 14221 taaagatgtt gagtttgact cagggatgga aagaaaggag ttgagaacag tggtggtatt 14281 ggacagtgct ttgcagtcag gcagttgcgt ccgaagtact ctgctagtta ttagcttgtg 14341 tgccatgtaa gctattgatt aaacctgtct gagcctcagt ttccccatct gtgaaactgg 14401 agtcgtaaca gacccttcta tttaaagtgg ttttgaggct tcagtgatgc ccggcccatg 14461 cttagccaag tgtggtggac agaacagtgg tgctcaaaag atgttcatgt cctcatcccc 14521 agaacctgga aatattttaa tggccaaagg agctctgtgg gtttgattaa attaaggatc 14581 ttgagatggg gagagtgccc tggattatct ggtgggcctc acgtaacccc cagaggcctt 14641 ataagggaaa gagggaggca agagagctgg aaccagagct gtgaagatgc aagcagagcc 14701 tggagtgatg ccagcgcttt cagcagcttt gaagacgagg aagggactgt gagccaagga 14761 aggcaggtgg cctcgagaag ccgaagatgc aaggagacac gttctcctct agtgctcccg 14821 gaaggaagga atgcagccct gccgcatcta gattttagcc ctgtaaggcc gagtttcaga 14881 cttttgacct ccagatctgt aagatattaa atttgtgtgt tttttttttg tttttttgag 14941 acggagtttc gctcttcttg cccaggctgg agtgcagtgg tgtgatctcg gctcactgca 15001 acctccgccc ccccggttta agcaattctc ctgcctcagc ctcctgagta gctgggatta 15061 caggcatgcg ccaccacgcc tggctaattt tgtattttta gtagagacgg agtttctgca 15121 ttttggtcag gctggtctcg aactccctac ctcaggtgat ccgcctgcct cagtctccca 15181 aagttctggg attacaggcg tgagccactg cgcctggcca aatttgtgtt tttagccaca 15241 aagtttgtgc taatttgtac aacagtaaca ggaaactcat acaccaagca cctggcacat 15301 ggtgagaatc cgaaaatgtt agccatgatg atcctcttgt gacagatggg acacccaggc 15361 aagcagcaga gaaggggctc aacagggcca cttggtaagg gaggacagag ccagagttac 15421 tgcttaataa ctcttgagta gtaacttctg caaacggttg ctgggcgcct gttctgcgtc 15481 tgtctgtaca tccattggga atgcagtaga ggcctgctgt gttaatgata ttcgcagctg 15541 acatttatgt agtgtttgct cggtgccagg cagtattcca ggcacttggt atgttatttc 15601 tcattcttgg tatggatttg aatgttctca acaactttat gagatagacg ttgttattgt 15661 ccccatttca taggttcgaa aactgaggca caaggggtga aatgaaacca tttgctccag 15721 gatcacacag caggaaatca gcagagctgg gaaggaccag acactgttct tggccttggg 15781 gacacagtag ggaacaactc agacgatact gtccttgctc ttgtggagct taaaagtctg 15841 gcgggagaga cagatgctag acaaatacaa acaaaaaaga tgtaactgtg taacaccagg 15901 cagtaatcat gaagaaaagt aaagtcgggt agggagatag agagtggcac caggtagggg 15961 gcgaggtagg ttacttatgg tctgaagtga catctgagtg ggcacctgaa gcacagaagg 16021 caaactttta cccagctatg gatatgtttg agggaagagt ggtctaggca gggggattgg 16081 caggtgcaga ggccctgagg tgggaaggag ccagactgtt cttggcacag aagggaggcc 16141 gttgtggctg gtggaaagtc accaagcctg gcctggggag cagtcagact gggacttctt 16201 gttattgttg tttttgttgt tgttgttgtt tttgagacag agtctcactc catcgcccaa 16261 gctggagtgc agtggcacaa tctcggctca ctgcaacctc tgcctcccgg gttcaagtga 16321 ttctcctgcc tcagcctccc cagtatctgg gattacaggc gctcaccacc atgcccggct 16381 aattttaatt ttcatatttt tagtagagac agggtttcac catgttggcc aggctagtct 16441 caaactcctg acctcaagcg attcacctgt ctcagcctcc caaagtgctg gtattacggg 16501 tgtgagccac cacacctggc ccagactggg acatctcatg agcccaggca gcactgtaca 16561 ctgcccaggg cccagggctg ggaatggcgg tggaggcagg ggagtcagca ggaggtctgc 16621 cggcctcagc cttaggcagg cccttcctcc agcaccatct tcctcctcta gcacgtcagt 16681 ggtgtgatgg aggagcacgc gtggactgtt gccagcctgc ctgggttccg gtccggcttt 16741 gccacccacc agctgtgaag ttacataacc acttcctgcc tccgtttgct catgtgtaag 16801 atggagataa ttatagcaac tgcctggcag agccattgca ggagagtaga tacatgtatt 16861 attagcagga gtctggcaca ccaggagcgc tcagtgcact cggatgttgg ccgcagttgt 16921 ggttatagtt gttaattctg tgtcatctca acttccttca gagggagggt aagttttttt 16981 ttgaagacaa ggtctcatca ctctgtcgcc cagagtacag tgatgcgaac atgactcact 17041 gcagccttga cctccccagg ctcacgcgat cctccttcct cagcctcctg agtagctggg 17101 accacaggta tgtgccacca tgccttgcta atttttgtaa tttttgtaga gacaaggtct 17161 ggctatgttg cccaggctgg tcttgaactc ccagcctcaa gtgatcctcc caccccagcc 17221 tcccaaagtg ctggttatag agacgtgagc cactgcgcct ggccaaggag ggagggttgt 17281 taatgaccat cttccctttt tgcatgtgag gaaactgcct cacagagacg ttgaatacgt 17341 cacccaaggt tgtgcaggtt ggaaatggca ttgctgtatt tgagcccagg cctacctcac 17401 cccaagtcca ggctccctcc ttccatacag tggtgttcct gttgatagca gtgtgcaggt 17461 gatgtctaga tgatctgaca gggaggcaac agagcagatt ggagcatctg tcatcaccag 17521 gggtgggaga gggcagtgta gggtggacta ggtaccttgg tttttttttt tttttttttt 17581 gtgacagagt ctcgctctgt cacccaggct ggagtgcagt ggcatgatct ctgcttactg 17641 aaacctctgc ctcctaggtt caagagattc ttgtgcctca gcctcttgaa gagctgggac 17701 tacaggcatg tgtcaccatg cccggctata ttttatattt ttagtagaga aggggttttg 17761 ccatgttggc caggctggcc tcaagtggtc cactgccttg gcctcgcaga gtgctggatt 17821 tacaggcttg agccactgca cctggcccag gtacctttga gagttgtaat aatgcgtcat 17881 tctaggatcc taataaaaga gccccaaact ggaaggcagg acacttgcct caacctaact 17941 gcctcccgtg cccgcatggc ccgcacggct ttctgcatct tcatctgtca gacaggaatg 18001 gcagatcctt tgagatgagc cctcctatag gtagctcttt tcttctttca acaaatagtt 18061 gtctagcact gactaataat acctcctgac tgacagcaga taattgccta ctcttcgcca 18121 tgccctgtgg ccattattta tgtgcatgat cctagatatg gtggtcattc ccattttaca 18181 gatgagaaaa ctgaggctcc aaaaggttaa ctggtatggt catacagcta gtgtgcagca 18241 cagctgggat tcaaacccag gaaccagcgg tgagcaagat cggacaaagc ccctgccctc 18301 agggaggttt cctggagtcc tctgcgattg tgggtgaggg gtgacatgtg ctgtttgtct 18361 gcatgatgat ttgaataaag ttataactct ggcaagtgct tatcaaggag agctctatgg 18421 cagtgtgtga taagctgacc tggtggagtc tggagaggtc agggaagcag gtatgtgtga 18481 ggtgggcgcg tatgcattca ttcagtgact attttgtgag tttctgctcc atgccatacc 18541 ctgtgctgca tggaggggac acatggcacc attcaaccct ggtgtgcccc ctcacagaat 18601 tgacattaca gggacatggt gatccaaagg agattcatcc tgaggtccca cagtgagccc 18661 agaggcagag caggcactga cagtggcctc tctctctgcc cacagttgga gaaggaggag 18721 cagatacaca gtgtggacat tgggaatgat ggctcagctt tcgtggaggt gctggtgggc 18781 agttcagctg gaggcgctgg ggagcaagac tatgaggtaa gcagggctat gagatggatc 18841 ccatactgac ctctgcccct catgctcgca tcaccgtctc ccccaggggc tggggctggg 18901 aagacatggg gcagatgtga gtcacacctg attgctgccc tctggaggat cacccagtgt 18961 acggaggaga cagaaggcag acacagggcg gtaaactgca catcagggaa gagagcaagt 19021 ttggatcatg gcttaaaatg acacaaatgt attattcgac tgttctggag gtcagaagtc 19081 tgaaatcact ctcactgggc taaaatctag gtgtaggcag agccctgttc cttctggagg 19141 ctctagggga gaatccattt ccttgcctat ttcagcttct agagtccatc tgccctcctc 19201 agcttgtgac ccttccttca tcaaagccag cactcctgtc actctaacgc ctgcttccac 19261 ttccattgtt acttctccaa ctctgactct cttgccttct ttttttcttt ttcttttttt 19321 ttttttgaga cggagtctcg ctctgtcgcc caggctggag tgagtgcact ggcgcgatct 19381 cagctccttg caacctctgc ctcccgggtt caagctattc ttctgcctca gcctcccgag 19441 ttgctgggac tacaggtgcc caccaccacg cccggctaac tttttttttt tgtattttta 19501 gtagagacgg ggtttcacca tattggccag gctggtctcg aactcctgac ctcgtcatct 19561 gcccgcctca gcctcccaaa gtgctgggat tacaggcgtg agccactgcg cctggccttc 19621 tttttctttt gttcattttt ttttgagaca gggtttcatt tccgttgccc aggctggagt 19681 acagtggcgt gttctcagct cactgcaact tctgcctctg aggttcaagt gattctccca 19741 cctcagcctc ccaagtagcc agggctacag gtgcacacca ccactcctgg ctagcttttg 19801 ttatttttta tagagagagg gtttcactgt gttgcccagg ctggtctcaa actcctgggc 19861 tcaagtgatc cacccacccc agcctcccaa agtgctagga ttacaggcat gagccaccac 19921 acccagccct cctttcttat aaagaccctt atgatgacac ttggcctacc cggataatcc 19981 aggggcatct cccccatttc aacatcttca agttcatcac atctgcaaag tgccttctgc 20041 cacatttcca ggttcaggga ttaggatgta aacatctttg aggagctgtt attctgcctg 20101 tctcctatgt tttcttctag aagctttcta tttttagatt ttacatttag atctctgttt 20161 catctcaagt gaatttttgt gtattgtgtg aagcagggga taaggtttat tttttctgta 20221 tctttttttt tttttttttt ttttttgaga tggagtctca ctctgttgcc caggctggag 20281 tgcagtggcg cgatctcggc tcactgcaag ctccacctcc cgggttcacg ccattctgcc 20341 tcagcctccg gagtagctgg gactacaggc acgtgccacc atgcccggct aattttttgt 20401 atttttagta gagacagggt ttcactgtgt tagccaggat ggtctcgatc tcctgacctt 20461 gtgatctgcc caccttagcc tcccaaagtg ctgggattac aggtgtgagc caccgcaccc 20521 ggcccttttc tgtatctttt ttatatgttt gttgacaatg cttatttatt tattttattt 20581 attttaagac agagtctcac tctgttggcc aggctggagt gcagtggcat gatcttggct 20641 cactgcaacc tccgcttccc agactcaagg gattctccag cctcagcctc ctgagtagct 20701 gggactatag gtgtgagccg ccatacccgg ctaatttttt aaaaaatttt ttgtagaaat 20761 ggggtttcac tacattgcct ggctggtctt gaactcctga gcttaaagtg atcctcccac 20821 cttggcctcc caaagtattg ggattacaga cctgagccac tgcacctggc tgaaaatact 20881 ttgaaaaaaa aaaaaagact tgccttttct cattgaatca ttttgactcc ttgttgagaa 20941 ttaattaacc atatgtatgg gtttatttct gtactctctt ctctgttcca ttgatatgta 21001 tagttttctt tatgccaata ccaaactgtc tcttttttta accatattat ttatttatgt 21061 atatatgtat ttatttattt atttatttat ttatttgaga tgggggtctc actctgttgc 21121 tcaggttgga gtacagtggc atgatctcag ctcactgcaa cctctgcctc ctgggctcaa 21181 gtgatcctcc cgcctcagcc ccccaagtag ctagtgggag ccatcatgcc cggctaattt 21241 ttaaatattt tttgtagaaa tagggtttca ctgtgttgcc caggctggtc ttgaactcct 21301 gaggtcaggt gatccacctg ctgcggcctc ccaaagtgct aggattacac atgtgagctg 21361 ctgtgtgtac ctggcctact cttttttttt tgaaacgagg tctcactttg tcacccaggc 21421 tggagtgcag tggcacgatc tcagctcact gcagccttgt cctcccgggt tcaagtgatc 21481 ctccttccac agcccctcaa gtagctggga ctacaggctt gcaccaccat gcccaacttt 21541 gcactagtca gtgtgacaag gtgagaaaaa gaaaaaagaa aactatcttt tttcacagat 21601 ggcatgattg tctatgcaga aaatcccaag agatctagac aaactacaag gtttagtaag 21661 tgaaggtcac aagatgagaa gtcaatatac aaaataaact gtatattaca catcacttct 21721 gtggtattct ggccaaagtg tgcaacctaa attaatcaca aggaaacatc agatactttg 21781 tacaataagt gatatctgac aaaaatcact gacctctact cttcaaaaac atcaaaggca 21841 caaacccaaa gtcacaggag ccttccttcc ttccattctc cctcccctcc ttccctccct 21901 ccttcctttt tttttttttt tttttttttt tttgagatgg aattttgctc ctgttgccca 21961 ggctggagtg cagtggtgca atctcggttc actgcaacct ccgcctccca ggtacaaacg 22021 attctccatc ttagcctccc aagtagctcg gattacaggc ctgtgccacc atgcatggct 22081 attttttttg tatttagtag gacagggttt tcccatgtta gtcaccagat gataaatgca 22141 gcttccctgt cagctccaaa tcaggccttt tataaacaag tatagtcaag gtcaggaaga 22201 aaacttccct tctgtataat tgtctttctt gtttcctttg cataatcagc tgtgcacaaa 22261 gagagctata gttatgttct gtttttctca gcaaaaggct ttattgtcca attttataat 22321 ttcacagcat gtcagttaga aaagagacta agtcagtaac aaaatatacc caaccaaaag 22381 tggctgcatt catctttttt tttttttttt ttttttttga gacggagtct cgctctattg 22441 ccaggctgga gtgcagtggc gcaatcttgg ctcactgcaa cctccgcctc ctgggttcaa 22501 gtgattctcc tgcctcagcc tcccgagtag ctgggactac aggcacgcgc caccatgccc 22561 agctaatttt tttgtatttt tagtagagat ggggtttcac tatgttggcc aggatggtct 22621 cgatctcctg acctcgtgat ttgcctgctt tggcctccca aagtgctggg attacaggtg 22681 tgagccactg tgcccagtct ttttgtttcg ttttttgttt tttggcagag tcttgctccg 22741 tcacccaggc tggaatgcag tggcacaatc tcggctcacc atcacctcca actcctgggt 22801 tcaagcgatt cttgtgcctc agcctctcga gtagctggga ttataggtgc ctaccaccat 22861 gtccagctaa tttttgtatt tttagtaaag acggtgtttc accatgttgg ctaggctggt 22921 cttgaactcc tggcctcttg tccaccaggc ttaagtgcaa tggtgcgatc tcagctcact 22981 gcaacctccg cctcctgggt tcaagcaatt ctcgtgcctc agcctcccga gtagctggga 23041 caacaggcgt ctgccaccat gcctggctta aattttgtat ttttagtaga gatggggttt 23101 taccatgttg gccaggctgg tattgaactc ctaacctcaa atgatccgcc tgccttggcc 23161 tcccaaagtg ctgggattac aggcgtgagc caccgcgcct ggcctcagct tgggtttttc 23221 aatgcagatt gctaacatga attgctactc tcaattcctt gagagaacag acttgtgagg 23281 agttctcccg agaagaattt tcacctcttc tttggtgtcc ccccaagcta aggccaagac 23341 agacaggcct cccagggact ctttccacag gcttatttcc ccttcatgta actagtgtgg 23401 gcgttgcctt tggggagtcc tcagttctat gcagtgtggt gaggatctct gagatgccac 23461 tctcccttgc ttggagccta ggttttctct cttgttctct ttcaccacca cgtgcataac 23521 ccttaaagat caagttctag ctgggtgtgg tggcttacac ttgtacttct atagctttgg 23581 gaggctgagg tgagaggatc atttgaggcc aggagttcag gaccagcctg ggcaacatag 23641 caagaccatg cctctacaaa aaataaaaaa ttagccaggt gtggtgtcgt gcatctctag 23701 acccagctac tcagaggcca aggtgggaga atcacttgag acccagaggt cgaggctgca 23761 gtgagctatg atcgcaccac tgtgctccag cctgggtgac agagaaagac cctgtctctt 23821 aaaaaaataa taataataaa taaaatcaag ttatttgcta tctgggatca gcaccatgac 23881 aaactaccaa catgagtgct gccgggtcag tcagggtcct tgcccggaga ttaaaaccac 23941 accagttatt ttaacagata atttgataga aagaaccagt aactaggtgt tactagaaga 24001 ctaagatatg agggagaagc aaccgcaggg agtagctacc actcccaggg ttgggggaac 24061 aaaagaaagg agcaggaatt attgaaacgt agatgtttgg aaactgggag gtggggctcc 24121 aggagctgaa attcagatcg tttgaggagg gcttcctccc ccaccggctg ctgagagggg 24181 ctgaatgagg ctgtttctgg aggtattggg acactgcaaa ctggattcat ctgctgctgc 24241 aggcaggaac cactgcagca gcagaagctt aactgggggt gatgctcaca ggatctgcaa 24301 tgcacgagaa gctggtcctt cctccctcct gcagccttgc aaactctctc tcttgccccc 24361 cctattggca gagcctggtc acgatcagct agggaagcag aaaggtggtt tgccaagtgc 24421 gggcccagat cacagagatg agttgtagag ggtaggtttg agctgagaaa caatcactta 24481 atacttagtg cagctcgcct gactctctgg attctgtttt ctcatttcca gcctctagta 24541 attccccttg cctttccgct gactcacctg tgcatttcca aaaggtgttt gaatattgtg 24601 ccagtttact gaatattctc tgccgggaaa gttccttagg acatgtagcc cgctgtattg 24661 ccagatatgg aagactccat gggagaattt agataagggt cgctcacatc tggaggataa 24721 agagatccct ctggggcctg tgtcagggca ggactgactc aaggcctaga aacaaagggt 24781 ctggagacat ctaggcccaa gctacattct agtgactccc agtcccctgt gtgggccctg 24841 tgccctcttc cccagcccct agaagctctc atctaagcca acctgccctg gttctcatgc 24901 actgtggctt ctcccaccca ggtccttctg gtcacctcat ctttcatgtc cccttccgag 24961 agccgcagtg gctcaaaccc caaccgcgtt cgcatgtttg ggcctgacaa gctggtccgg 25021 gcagccgccg agaagcgctg ggaccgggtc aaaattgttt gcagccagcc ctacagcaag 25081 gtacctaggg tgggagccag ggtgcagagt ccttggtcct gggcaaggag ggaaataagg 25141 gccaatattg agtcccatag ggctgggagg agtagacagc tggtgtccca tgggattatg 25201 gggaggctgg gtccctgcgt tccctgaaat gggtttggga tctttcctgt aatgagaagt 25261 ttcatctccc ggaggaatct aggaagggtg aactgcagca tctcagaggg gaatcggagg 25321 aggggaactg acttgttaga gaggtggcct agtagagtgg gcctagtagt taacatcagg 25381 agatctggcg tctgattact gagttcacgt cctggctata ccacttttta gctgtgtgat 25441 cttgggcaag tgacttacgc tctctgggcc tccattgcct gatccttaca gggttgctgt 25501 aggttaaggg atgctaaaag ctgtggtaac agtgagactg gagtatgtga tgatgccaca 25561 cactaaaggt gtgcttcttg cccaggtaac tgtcctgggg agtgtttagc ctgtcagctg 25621 gcttctcctt atgaggtcgt gcagggaccc agacttctcc tattttctgg ctctgcatcc 25681 agctggcaga ggagagagcc tggaacactt ctgtgttctg gaagtggcac gcagcactcc 25741 ctcccattcc acaggcagga gtttgtgcat agggcagtga attcgctggg acacctagta 25801 gactctgcca attgcccaaa gagctaatgc gtgaaaaggg cttggagcca gactcagagt 25861 tagggctcag tcataacagc ttaatcctat aattgtcatc accatcgcca gggcccctcc 25921 ttcaacatgg gtgttcatga gagggaagga gccaggaaga ggtttccctg agtgaaaagg 25981 gtcttgggcc ctggcctctg tccttgggcc agactccctg actcccaccc ctcctttccc 26041 aggactcccc ctttggcttg agttttgtac ggtttcatag ccccccagac aaagatgagg 26101 cagaggcccc gtcccaggta agctgtacct gtcactcccc atggccttct ccctgcctct 26161 ccacccccac ctgccagcag cccacctata atactgacct tgcgggacct tagaaggtga 26221 cagtgaccaa gcttggccag ttccgtgtga aggaggagga tgagagcgcc aactctctga 26281 ggccgggggc tctcttcttc agccggatca acaagacatc cccaggtgag ctcggacaac 26341 gtgggtcctg agtgagtagg gttgagacct agactcgtgg gtctgagggt agagggggct 26401 ggggctcaga ttcctgggtc tgagggagga gggtagtgga gtcctggact cctgggtctc 26461 agggtggagg aaggtgggcc tggcctcttg tgtcctgagg gttgagaggt ctggaggctg 26521 ggactcctgg atcactggtg ggttttggca acaatattct gtgtcccata gataggagtg 26581 aaagggtctt ggggctcagc cctctcatcc tggatccact ttctcccttc atagtcacag 26641 ccagcgaccc ggcaggacct agctatgcag ctgctaccct ccaggcttct agtgctgcct 26701 cctcagcctc tccagtctcc agggccatag gcagcacctc caaggtgaaa tcatcagact 26761 ttggtggggt ggaggaggag agaagctgga ggcctcaatc catccccatc ccctcagccc 26821 caggagtctc ccaaagggaa gaggaagttg gatttgaacc aagaagaaaa gaagaccccc 26881 agcaaaccac cagcccagct gtcgccatct gttcccaaga gacctaaatg tgagctaact 26941 agatcccttg tttccacagg ggcctgtgct cctgtgtcct aggggaagcg gggctggagc 27001 ctgctgtcct gaatctgagg gaggaagggc tgggggcctg gacccctggg tctgagggag 27061 gaggggctgg gggcctggat tcctgaatcc gagggaggag gggctgaggg cctaaacccc 27121 tgggtctgag ggaggagggg ctgggagcct ggacccctgg gtctgagaga ggaggggttg 27181 ggggcctgga tccctgggtc tgagggagga ggggctgggg gcctggactc ctgggcctga 27241 gggaggaggg gctgggggcc tggactcctg ggtctgaggg aggaggggct gggggcctgg 27301 acccctgggt ctgagggggg aggggctggg gcctggattg ctgggtctga gggaggaggg 27361 tctggggcct ggactgctgg gtctgaggga ggaggggctg ggggttgacc cccagtggtg 27421 ctaacctaat ctactctttg tcttctccag tgccagctcc aactcgtacc ccagccacag 27481 ccccagtccc tgcccgagca cagggggcag tgacaggcaa accccgagga gaaggcaccg 27541 agcccagacg accccgagct ggcccagagg agctggggaa gatccttcag ggtgtggtag 27601 tggtgctgag tggcttccag aaccccttcc gctccgagct gcgagataag gccctagagc 27661 ttggggccaa gtatcggcca gactggaccc gggacagcac gcacctcatg taggcttgcg 27721 cccccctccc tgcgccgctg cagtttctcc cccagctccc tgtgtctcct ccaccttgtg 27781 ctttctctgt gtccactatg ctgcatgctt tctctctctc tcactcgctt tctttctcac 27841 tgcattctgt agcctttgtc ttctctctga tttttgcatc tctcccttgg tctccaacct 27901 ctttttgttt ctcccacctc aatctcatga tctgtctgtc tgtctgtctc tctctctctc 27961 tgtctgtctc ccctgtctca ttcccctttg cccctcagat cacacctaac tggcatcttc 28021 acttctgccc cccaccagct gtgcctttgc caacaccccc aagtacagcc aggtcctagg 28081 cctgggaggc cgcatcgtgc gtaaggagtg ggtgctggac tgtcaccgca tgcgtcggcg 28141 gctgccctcc cggaggtaag gcctcacacg ccaaccctgc tccttatcct gtgctgggca 28201 atgccaggaa tctggagggg agtcagactg gggcctgcca gcggagaaca caggtggtcc 28261 cagcccagag ccagcagact actgagagga gcgggacagg ggctggggta ctccagatga 28321 gggaaggagg acctgcctgg gaggtcacag agggctctgt gaagcctgct tatcagaaaa 28381 ggctggagga gcagtttgtg caaaaactca ggggtgggag aacaagtgtt agggtccata 28441 ttcattcatt caaagtgagt cccggtgcag tggctcacgc ctgtaatccc agcactctgg 28501 gaggctgagg caggtggatc acttgaggtc agaagttcga gaccagcctg gccaacatgg 28561 tgaaacccca tctctactaa aaacacaaaa attagctggg cttggtggcg cacgcctgta 28621 atcccagtta ctcggaaggc tgaggcagga gaatcgcttg aacctggaag ccaaaatcgt 28681 gccactgcac tctggcctgg gtgacagagc gagactccca tctcaacaag aaaaaaaaaa 28741 cccaaaaacc aaaatgagtc ccagaagccc ctgcttggtg ccaggctgga atgtgcaagg 28801 ctgaggatcc agtcattgag ccctgccctt gaaggcttct cactttgatg agggatcaga 28861 attgttgtaa cagtgcagtg gaagccccac agggtggccc gcagagagcg aggggtcttc 28921 aaggtccaag agaggttttt caggctgaac agaggcagag aacatgctgg gcagaggaga 28981 caaggtcagg tcagcctgtt gtctgcctgg agttgtaggt gaggctgagc ttacacaggg 29041 gcaagaagtg gcaggaaaaa tgggactggg gcattaggaa ccacagaagg ctttggactg 29101 agcaaaggct agctcatatc tgaggtccat gaattggagg ggagactgga ttgtactcca 29161 tggcctgagt gtgttcagaa gttagttgta cctggtgtat ttgagaaatt atttgccttg 29221 ggggatagcg aggctgtgag gtgggtgggc tcaggtcaga aggggcttcc ggagcccagg 29281 ctacacactg ggctccatgc agcttcacag aaccctcatc tgttattcag gaatccaaac 29341 atctctagaa gctgaaagtt cttttctagg ttcagcacaa actcatttcg ctgcagtcat 29401 cctgaactaa tgtgaggcta tttgcggact tttttttttt tatcccctta agtgtgtcat 29461 ggtcttaact gcagaaatat tagtgggttt gatcatgggg tgctccccca ggccctcctg 29521 ggagtttatg taacaatctg tatgccagcc atttgcctct ctaaaacctg aaagattcaa 29581 aattctgaag cccatccagc tccagaggcc tcctcttgtg agcagagggg agtcacagaa 29641 ggtttgatgg gggcaggggc aggctctgtt gggagggatg aagtgtcaag gggaggctgc 29701 aggtggggac tcctgagtag gctggggcct caggaacaga agaggggatg ggctggtgct 29761 atatctgggg cagaaggaac cagatttggg ttattatctg cctgggagag cttaaggagg 29821 gcatgaaaga aagggaaaaa gacaaacagt gttggatttg caactggtct taacaaaaag 29881 aattcacaaa gcattttata tttttaaaaa ttctggcttg tactcattta taaactaaaa 29941 aaaaataact atatgatgca ggcaacgtgg tgagacccca tccctacaaa acatgaaaaa 30001 aattatccag gtgtggtggc acatacctgt agtcgtagat acttgggagg ctgaggtggt 30061 aggatcactt gagcccagga ggttcaggct gcaatgagcc atgattgcac cgctgcactc 30121 cagtctaggt ggcaaatcaa gaccttatct caaaaataaa aatttggctg agtgcagtgg 30181 ctcacgcctg taatcccagc actttgggag gccaaggtgg gcagatcacg tgacatcagg 30241 agtttgagac cagcctggcc aacatggtaa aaccccatct gtaccaaaaa atacaaaaat 30301 tagctgggca tggtggcgca cacctgtagc cccagctact ctagaggctg aggcaggaga 30361 atcccttgaa cctgggagga aggggttgaa gtgagtcgag atcatgccac tgcactccag 30421 cctggggaca gagcgagact ctgtctcaaa aaataaaaat aaaataaaaa ttcaactata 30481 gaaggaagac cccaagagga tgactctcca tacctaataa gcacaaattt agnttttttt 30541 ttaatctttg aatagagcac tttgaagaaa aagttaagta aacctgctat tgactgttaa 30601 tgaactacta cattttaaat cctctgaatc tcctgtaggc accgttttat gagtgtgact 30661 tacctaaaca gcatttcacc agcttgaaaa caaccaaatt aaatgcagaa gggtcttaac 30721 atgtatgtct cagaaaaatg gagctaatac attcatttgt gtctgcatac cttccctgtg 30781 gcttagggca gattaaataa tctctctgtg ccttagtctc ctcacctgta aaatggggcc 30841 aaaaatagga cctaccccag gattcagtga ctcagtattt gtcacgtgct acctggcatg 30901 taggaagtcc catgtaagtg tcaacctgct actgttgtgt tttttttttt tttttttttt 30961 tttttttnnn tgatcattct tgggtgtttc tcacagaggg ggatttggca gggtcatagg 31021 acaatagtga agggaaggtc agcagataaa caagtgaaca aaggtctctg gctttcctag 31081 gcagaggacc ctgcggcctt ccgcagtgtt tgtgtccctg ggtacttgag attagggagt 31141 ggtgatgact cttaatgagc atgctgcctt caagcatctg tttaacaaag cacatcttgc 31201 accgccctta atccatttaa ccctgagtgg acacaacaca tgtttcagag agcacagggt 31261 tgagggtaag gtcgtagatt aacagcatcc caaggcagaa gagtttttct tagtacagaa 31321 caaaatgaag tctcccatgt ctacttcttt ctacacagac acagcaacaa tctgatttct 31381 ctatcttttc cccacctttc ccccttttct attccacaaa accgccatcg tcatcatggc 31441 ccgttctcaa tgagctgttg ggtacacctc ccagacgggg tggtggccag gcagaggggc 31501 tcctcacttc ccagaagggg cggccgggca gaggcgcccc ccacctccct cccggacggg 31561 aacctgctgc tgttgttaga gctaggaaaa tccttttcct ggaatgttag tggcttaagg 31621 aaccatgatt atagtgttgt agtctttaaa atgtgtttgc tttgagattt tccccttaaa 31681 tggtgggtat tttcctaatg agaaactgcc tttgattctt aaaggagaaa gttctgattt 31741 ttcattcctt tctgaattga gacatttctt ccccaagagg gaggggaatt ctgggtactg 31801 tggttcatgc ctgtaatccc agcactttgg gaggctgagg tgggtggatc actcgaggtc 31861 aggagttcga gaccatcctg gccaacgtga tgaaacctcg tatgtactta aaaaacaaaa 31921 caacacaaaa aacacaaaat tagccaggcg tggtggcaca tgcctgtaat cgcagttact 31981 ggggaggctg aggcaggaga ctcgctcgaa ccctgggagg tggaggttgc agtgagccaa 32041 gatcacgcca ctgcactcta tcctgggtga cagagtgaga tcttgtctaa aaaaaaaaaa 32101 agcggccagg agtggtggct aacgcctgta attccagcac tttcagaggc tgacgggggc 32161 agagcatgag gtcaggagat cgagaccatc ctggccaaca tggtgaaacc gtgtctctac 32221 taaaatacaa aaagttggcg gggcatggtg gtgtgcacct gtaatcccag ctactaggga 32281 ggctgaggca ggggagtcgc tcgaacctgg aggcagagat tgcagtgagc cgagatgagg 32341 ccactgcact ccagcctggc gatggaatgg actctgtctc aaaaaaaaaa gagggagggg 32401 aagaaagggg gtacccaggt ctgcagggca ggtggccaga ctgactgggt tcactggggc 32461 cttcctgagc tgggagagaa tgggctggga acaagaccat gaggcccact agagtgagta 32521 ttttagtaaa tgccaatggc tgttgcctcc tgaacgccct ccccacttcc cttgggcctg 32581 tttgtctgag gcccgatttg tcccctaggt acctcatggc agggccaggt tccagcagtg 32641 aggaggatga ggcctctcac agcggtggca gcggagatga agcccccaag cttcctcaga 32701 aggtctgatg ccccctgtcc agtgggggag ggtgtagtgg gaggatgaca tgagtccgct 32761 gtggcacccc aaacttccca ctcttgcctc ttgcaggtac ccccccaaca cacccacact 32821 gttcctgccc tcctcaccca cccactctcc ctccttagca accccagacc aaaaccaagc 32881 ccactcaggc agctggaccc agctcacccc agaagccccc aacccctgaa gagaccaaag 32941 cagcctcacc agtgctccag gaagatatag acattgaggg ggtacagtca ggtcagactc 33001 tgagggagga ggggctgggg cctgggctcc tgagtctgag ggaggagggc ctgcggcctg 33061 ggctcctgag tctgagggag gaggggctgg ggcctggact cctgagtctg agggaagaga 33121 gcctggggac ctggactcct gggtctgagg gatgagggcc tgggggcctg gactcctggg 33181 tctgagggag gaggggctgg gggcctggcc ttctgggtct gagggaggag gggctggggg 33241 ctggactcct gtgtctgagg gaggaagggc tgagtcctgg actcctgggt ctgagggagg 33301 aggggctggg gcctggactc ctggatcaga gggaggaggg gctgggggcc cagacgactg 33361 gatctggagg gcagttgagg ctgggggcat gagatgcctg ggtctgtgcc atactgggag 33421 agctgccagt cttcccagga gacaaggcag ccttgcattg actgagcttc tacttgcaga 33481 aggacaggac aatggggcgg aagattctgg ggacacagag gatgagctga ggaggtaggg 33541 ctagggctgg gggttggggt gtccctggcc ttgggtgtag gggaggctgc tctgcatgct 33601 cactctctac tcctcttggc acccccaggg tggcagagca gaaggaacac agactgcccc 33661 ctggccagga ggagaatggg gaagacccgt atgcaggctc cacggatgag aacacggaca 33721 gtgaggaaca ccaggagcct cctgatctgc cagtccctga gctcccaggt gggaaatccc 33781 cccatccctc ctgaagtggc attttctctt ctcagttctc agctgggact tgccctggcc 33841 agccctgggg tcactgctgc ccccgtgtct tgcctcccct ctccttcctc aacccactcc 33901 ctgcctcagc accttcaggt tctgccaggc ttgttccttt cctttcagca tcctctctgt 33961 gacttgcctt tttctcccct ccagatttct tgttctgacc actcccctcg cagccatgtg 34021 ccaggtgtga cctccttttt ttcacgtcag gggtgagggc cccaccctca tgctagtgct 34081 tcacttgctc aggtgccgtt ctgttggtct ggggtgaagc cgggcacacc aggtgacagg 34141 ggcactgctc tctgtcaggc cctgggtggg tgcacaccag tgtctgcttt aacatcagcc 34201 atcactctgg acgtggatgc tgtctgcaca cattgtgtgt ttgcatttcc agtgtaaaag 34261 aataagaaca aaaggaaatg gggaaactct gcagggagtt gtgggatgag gggcagggag 34321 gcagagcttc ttcgagaatg aagggagcag aggaaagaca gtcacgggct gtgcagaggt 34381 gtggttctgg ggcttccaga agagccgttt ccacccagtt cctcgtgggt gatgggcagg 34441 aagatgcttt gattctccca agcccctaaa gtggcatcag caaattgatg gcctagagag 34501 tcaccttaaa atggaaagaa ggagccaccc caaaggtcag aacaagaaag gcactggttg 34561 gaaaggagag caaaggtgaa gaagtggcag atactgacgt tttcagcctg gactaagctc 34621 attaagtggc tccaggcagg ccaggcgtgg tggctcaccc tgtactttga gaggccgagg 34681 tgggaggact gcttgagtcc aagagttcaa gaccagcctg ggcaacatag tgagaccccc 34741 ccccaacccc cgcctttaca aaacatttta aaaattagcc gagcgtggtg gtgtgctact 34801 cagtagtcct ggctactcag gaggctgagg tgggaggatt gcttgagccc aagagtttga 34861 gaccagcctg agcaacatgt caataccccg tctctctaca aaaagtttaa aaattagtct 34921 ggcttggtgg catgcacctg tagtcccagc tactggggag gctgagatgg gagaatcact 34981 tgagcccagg aggcagaggt tgcagtgagg caagatcacg ccactgcact ccagcctggg 35041 cgacagagtg agtccttgtc ttaaaaaaaa aaaagagaga gagagaggta agtggctcca 35101 gcctgtcggt cagaaacaaa tcacaataat aggcattgta tcttctacca ttaaagcaaa 35161 ccaaccaacc aaccaaccta cctccccagg aggctctgat gggccgctgg gcagggtcca 35221 ctgcctgcca tcaatccctg cccacatgtg gacagccctc tcctcagctg ctgggtcagc 35281 ctcgtcactc ctgtttggtg cctgggaact cctggggcct catgcttacc ttagccccct 35341 tctctcctgc agatttcttc cagggcaagc acttctttct ttacggggag ttccctgggg 35401 acgagcggcg gaaactcatc cgatacgtca cagccttcaa tgggtgagtc tccagggact 35461 ggggtgggga aggggtgatg gatccaaggg aacgctcagg gacctgctgg gaggggtggg 35521 ggtgtgggaa agaagaccat caggaggatg gagaagggcc atgcccagca ggtgggcaca 35581 ggtcatggaa tccgaatgct agcttgccca ccaccggcag cccagggtgc attgcttgcc 35641 tcagtttcct cttctgtatg gcgaggatcc ctacccctac ccctgcttct gaaagggaca 35701 gcctgaggat tagccactgt gggagctctg gtcattgtgg ctgtggcaag atggatacat 35761 gaaagaatgg agcctagagg caacctagag gcgcataggt ggggacagtg tggggacagc 35821 cattgagagt ggctggggag taggacgtca gtgctgattc cctgatgtgc cttctacctc 35881 ttttcttctc tcccgccagg gagctcgagg actatatgag tgaccgggtt cagtttgtga 35941 tcacagcaca ggaatgggat cccagctttg aggaggtgag taccaaagag gcagagaatg 36001 ggagacccgg gcacaccttt gcatctcctc cgtcctcccc tcgatgacac attcccgtat 36061 ggactccacc ccaccctgca ggccctgatg gacaacccct ccctggcatt cgttcgtccc 36121 cgatggatct acagttgcaa tgagaagcag aagttacttc ctcaccagct ctatggggtg 36181 gtgccgcaag cctgaagtat gtgctataca cacacacaca cacacacaca cacacacaca 36241 cgatgcattt aataaagatg agttggttct catccaagag tctcccaaaa ctctaagagg 36301 ctccctggga cctggggaag aatgctgggc acctccgtca gagatctggt agagaaggaa 36361 ctctttgtct cttctgcttg gccccttatc cctgtgttgg caagaggcag ggaactggga 36421 atctgaccct cagcactgcc cctcaacttt ttctggccct ctgagccaca cctgtatctt 36481 ggctgtccct ttgtggctgg aggcctgggt acccatgagg cttgtctctc tcctgaagcc 36541 tcagcgtccc caggccaggg atagtgcctt tcctccaggg ttttcagaag tagctgtata 36601 atgaatgtca ccactgtggt tttacatcac ctaatttagg aggtagaatg ggaggaaggg 36661 acattttaag caacctaccc aaaccctgca cccccttcaa ggctccacca catttaagtg 36721 ggaattgagg tgattgtgag ggaatcaggg cagaggatgg gatggtgctc taggcagagg 36781 ccatagcatg ggccaggagg tgggaagcag cctgaggcag gcagaagagc attgctgcct 36841 ttggcaaaga ggctttgggg gcagatttag aaagccaggc acagtccttg cagctctgct 36901 gtccagtgtg atcaccactg ggtacaggtg gctatttaaa ttaattaaaa tgaaatgaaa 36961 tgaaatgtaa acttcacttc catagtcaca ctggccatgc atttcaattg ctcagtggct 37021 attgtattag gcagcacaga gaacatttgc atcagtctag aaaggtcttt tggacagggc 37081 tggcctgaag gtctgcgaac tccaggctga gaagcctgga ctttaatctg agggtgggaa 37141 ggttgggagg gggtttgagc aggacagagg ccagggtcag aggtaggttt gagatctgac 37201 ctttgggtaa ccttttattg gacattccaa gccctttccc aacttttgcc tcatcaacca 37261 agggctacca gtctcactcc tgtctctttc tggacctttg tttttgcatc tttaagatga 37321 gggtggttta ttattattat tattattatt attattatta ttagagacag ggtcttgctc 37381 tgttgcccag gctgaagtgc agtggtgcaa tcattgttca ctccagcctc gaactcctgg 37441 gcccaagcga tcctcctgct tcagcctcct gagcagctgg gactataggc atgtaccacc 37501 acacctgata aattttctat tttttgtaga gatgggctct cactatgttg cccaggctgg 37561 cctcgaactt ctggcctcaa gcagtcttcc cgccttggcc tcctaaagtg ctgggattac 37621 aggtgttgag ccacactgcc cagctaaggg tggctttaga cttcatgttc actcattcgt 37681 ttgctcattc acactcccac cgataactca ccatgttagt tttgtactac tgctataaca 37741 aattatcaca aactcagtgg cttaaaataa tagaaattta tgatc // LOCUS HUMG0S24B 3889 bp DNA PRI 09-MAY-1997 DEFINITION Homo sapiens zinc finger transcriptional regulator (GOS24) gene, complete cds. ACCESSION M92844 NID g2072389 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3889) AUTHORS Blum,S., Forsdyke,R.E. and Forsdyke,D.R. TITLE Three human homologs of a murine gene encoding an inhibitor of stem cell proliferation JOURNAL DNA Cell Biol. 9 (8), 589-602 (1990) MEDLINE 91103879 REFERENCE 2 (sites) AUTHORS Taylor,G.A., Lai,W.S., Oakey,R.J., Seldin,M.F., Shows,T.B., Eddy,R.L. Jr. and Blackshear,P.J. TITLE The human TTP protein: sequence, alignment with related proteins, and chromosomal localization of the mouse and human genes JOURNAL Nucleic Acids Res. 19 (12), 3454 (1991) MEDLINE 91288233 REFERENCE 3 (bases 1 to 3889) AUTHORS Heximer,S.P. and Forsdyke,D.R. TITLE A human putative lymphocyte G0/G1 switch gene homologous to a rodent gene encoding a zinc-binding potential transcription factor JOURNAL DNA Cell Biol. 12 (1), 73-88 (1993) MEDLINE 93135830 REFERENCE 4 (bases 1 to 3889) AUTHORS Heximer,S.P., Cristillo,A.D., Russel,L. and Forsdyke,D.R. TITLE RT-PCR analysis of RNA of the CCCH zinc finger protein-encoding gene G0S24 (TIS11/TTP/NUP475) in cultured human blood mononuclear cells JOURNAL Unpublished COMMENT On May 6, 1997 this sequence version replaced gi:183444. FEATURES Location/Qualifiers source 1..3889 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="19q13.1" exon 549..630 /gene="GOS24" /number=1 /evidence=experimental gene join(607..630,1441..2397) /gene="GOS24" CDS join(607..630,1441..2397) /gene="GOS24" /codon_start=1 /product="zinc finger transcriptional regulator" /db_xref="PID:g183445" /translation="MDLTAIYESLLSLSPDVPVPSDHGGTESSPGWGSSGPWSLSPSD SSPSGVTSRLPGRSTSLVEGRSCGWVPPPPGFAPLAPRLGPELSPSPTSPTATSTTPS RYKTELCRTFSESGRCRYGAKCQFAHGLGELRQANRHPKYKTELCHKFYLQGRCPYGS RCHFIHNPSEDLAAPGHPPVLRQSISFSGLPSGRRTSPPPPGLAGPSLSSSSFSPSSS PPPPGDLPLSPSAFSAAPGTPLARRDPTPVCCPSCRRATPISVWGPLGGLVRTPSVQS LGSDPDEYASSGSSLGGSDSPVFEAGVFAPPQPVAAPRRLPIFNRISVSE" exon 1441..3103 /gene="GOS24" /number=2 /evidence=experimental misc_feature 2978..2986 /note="TA-rich conserved element (TARCE)" misc_feature 3163..3168 /note="U-rich RNA polymerase II termination element" misc_feature 3198..3889 /note="CpG island" BASE COUNT 648 a 1304 c 1078 g 850 t 9 others ORIGIN chromosome 19q13.1. 1 tcccaaccct cttctccctc tgaatctgtc tctgggactg tctctgtctc cccgtcttcc 61 ctcccttcct caccctgtct atctctctct gtatgtctct tggtgtgtgt gtctctctcg 121 atgtttttct ctctgcctgt ctgcctgtct gtacccctct gcgtctctcc ccgcccccat 181 ccgtctgtgt cgcacgcgca cccccatcgg gcttctgctc ttgtcaattg cccctggggc 241 cctgccccca cctccgcccc agtttccttc tacaagcctc agtctccagc tttgaaaact 301 gggcaggcgt ccccccatcc gcacccccac cccttcccca cgcattcccc gctcggtcac 361 ggctgtccac cggccaagct caggcgcgtc ctcccagggc cgggcggaag ggaaccagtc 421 cagggccagc caggctgccg ggggcgcgcg tccgggaagc gcccctcctg ccccgccccc 481 ggccccggcc ccggccccgc cccgtgcttg cagtttccta taagtagccg gctctcggtg 541 ccagcctgag cctgacttca gcgctcccac tctcggccga cacccctcat ggccaaccgt 601 tacaccatgg atctgactgc catctacgag gtgagtcccc gccgcacggc atccccggta 661 cctgcatgcc tgagtccgag tccccacctc tctagcgccg caaactccag cccgggacgc 721 ttgcctccct tctccaactg gggctcccta gcgccgcgcc ctccagcctg gggcccctgc 781 ctcccgctca gaccagcttg gtgatttgga ggtgaaaatg gaaccgcgac acccggctct 841 tcgctcaaac atgggtgggg cggcccatgc aagtggaaag tcggagaact tttctcagac 901 cgaggctgcc tggaggcgga agtggccccc atacctggct cacccctagt cgttgctgag 961 ggcgtggttt tgcgcggagg cgtctctggg gctgaagtct cagggtgggg ggatccgact 1021 tctgtctctc cagtccctga ccgtagagac agagaaccct aaaaccgaag caatccggac 1081 ttccaggtca actttgcccg gtttctccag ttgtgaaact gaatcccgac gcgtgggtca 1141 tatccgggga ggacaagaga acccaaaatt gggaaacagt ggtgcgccct gacttcgggg 1201 tccccctctt ggtccagccg gggaagccgg gattcctggg tccctcggga taaggcctcg 1261 gtggtgggta aactcagaac ctccaactct gggttcctgg catccggaac ccaggggttt 1321 ctgcgggcgg gtggggctca ggcggggagc ccacaaaccg gcctggcaag ctctagttcc 1381 ctgcagctgg ggtggggcgt gcctgcattt tcaggtgcct taaccgaccc atttccgcag 1441 agcctcctgt cgctgagccc tgacgtgccc gtgccatccg accatggagg gactgagtcc 1501 agcccaggct ggggctcctc gggaccctgg agcctgagcc cctccgactc cagcccgtct 1561 ggggtcacct cccgcctgcc tggccgctcc accagcctag tggagggccg cagctgtggc 1621 tgggtgcccc caccccctgg cttcgcaccg ctggctcccc gcctgggccc tgagctgtca 1681 ccctcaccca cttcgcccac tgcaacctcc accaccccct cgcgctacaa gactgagcta 1741 tgtcggacct tctcagagag tgggcgctgc cgctacgggg ccaagtgcca gtttgcccat 1801 ggcctgggcg agctgcgcca ggccaatcgc caccccaaat acaagacgga actctgtcac 1861 aagttctacc tccagggccg ctgcccctac ggctctcgct gccacttcat ccacaaccct 1921 agcgaagacc tggcggcccc gggccaccct cctgtgcttc gccagagcat cagcttctcc 1981 ggcctgccct ctggccgccg gacctcacca ccaccaccag gcctggccgg cccttccctg 2041 tcctccagct ccttctcgcc ctccagctcc ccaccaccac ctggggacct tccactgtca 2101 ccctctgcct tctctgctgc ccctggcacc cccctggctc gaagagaccc caccccagtc 2161 tgttgcccct cctgccgaag ggccactcct atcagcgtct gggggccctt gggtggcctg 2221 gttcggaccc cctctgtaca gtccctggga tccgaccctg atgaatatgc cagcagcggc 2281 agcagcctgg ggggctctga ctctcccgtc ttcgaggcgg gagtttttgc accaccccag 2341 cccgtggcag ccccccggcg actccccatc ttcaatcgca tctctgtttc tgagtgacaa 2401 agtgactgcc cggtcagatc agctggatct cagcggggag ccacgtctct tgcactgtgg 2461 tctctgcatg gaccccaggg ctgtggggac ttgggggaca gtaatcaagt aatccccttt 2521 tccagaatgc attaacccac tcccctgacc tcacgctggg gcaggtcccc aagtgtgcaa 2581 gctcagtatt catgatggtg ggggatggag tgtcttccga ggttcttggg ggaaaaaaaa 2641 ttgtagcata tttaagggag gcaatgaacc ctctccccca cctcttccct gcccaaatct 2701 gtctcctaga atcttatgtg ctgtgaataa taggccttca ctgcccctcc agtttttata 2761 gacctgaggt tccagtgtct cctggtaact ggaacctctc ctgaggggga atcctggtgc 2821 tcaaattacc ctccaaaagc aagtagccaa agccgttgcc aaaccccacc cataaatcaa 2881 tgggcccttt atttatgacg actttattta ttctaatatg attttatagt atttatatat 2941 attgggtcgt ctgcttccct tgtatttttc ttcctttttt tgtaatattg aaaacgacga 3001 tataattatt ataagtagac tataatatat ttagtaatat atattattac cttaaaagtc 3061 tatttttgtg ttttgggcat ttttaaataa acaatctgag tgtaagctgg gatcctggct 3121 tcttcgcggt ctagagacag gaatggagag ggagggggtg actttttgga agctgggtgc 3181 agttaactct tcctctccga gccccgcggc gcttacctgg caggccgtga cgtcaccggg 3241 cctggccgtt cacctgaaag ggtgggacca ggtgaggtca ccagatggga accggggagc 3301 cacttcccgg gcgtcggggg cgccgcgctc ccgtcctgct gggctccttg acccagcttc 3361 gggcgggtgc ggtcggggac gggatgtttc cgtccccacg gggccgcggg aggcgggagg 3421 ggccggctgg tgaggacgga atgtgctggt gcgcgcgccc agagcgaacg gtggcggtcg 3481 ctgtgcggtg cagccttggg tagcggaacc cctttcggga ctagaggttc ccggggggct 3541 tcgaaccttc tggatgttgg ggaagcgggt ttagggtcta agaggctagg atctcaacat 3601 ttgggagtca caggttgcat cccctagcgc tttagactct agacccctgg tggctcggag 3661 ttgcagattt ctggccaccc gggaccctgg agcccgggaa ttaccgggtc ttggnatttc 3721 cgaaccttgg aagtccgagg cttcgtacac cgaccagggt cgtcggnccc gcgaggccag 3781 ggcgtgtggg taggggncgc gcgtctagga ggggcccggn ggagncgcgt cttcagacca 3841 tncaaggtca gagtcgtcgg gaacaacccc ggngncccgc ntcccggaa // LOCUS HSAGL1 1138 bp DNA PRI 24-APR-1993 DEFINITION Human alpha-globin germ line gene. ACCESSION V00488 NID g28546 KEYWORDS alpha-globin; germ line; globin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1138) AUTHORS Liebhaber,S.A., Goossens,M.J. and Kan,Y.W. TITLE Cloning and complete nucleotide sequence of human 5'-alpha-globin gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77 (12), 7054-7058 (1980) MEDLINE 81175088 COMMENT KST HSA.ALPGLOBIN.GL [1138]. FEATURES Location/Qualifiers source 1..1138 /organism="Homo sapiens" /db_xref="taxon:9606" prim_transcript 98..929 exon 98..229 /number=1 CDS join(135..229,347..551,692..820) /codon_start=1 /product="alpha globin" /db_xref="PID:g28547" /db_xref="SWISS-PROT:P01922" /translation="MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYF PHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLL SHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR" exon 347..551 /number=2 exon 692..929 /number=3 BASE COUNT 183 a 412 c 350 g 193 t ORIGIN 1 aggccgcgcc ccgggctccg cgccagccaa tgagcgccgc ccggccgggc gtgcccccgc 61 gccccaagca taaaccctgg cgcgctcgcg gcccggcact cttctggtcc ccacagactc 121 agagagaacc caccatggtg ctgtctcctg ccgacaagac caacgtcaag gccgcctggg 181 gtaaggtcgg cgcgcacgct ggcgagtatg gtgcggaggc cctggagagg tgaggctccc 241 tcccctgctc cgacccgggc tcctcgcccg cccggaccca caggccaccc tcaaccgtcc 301 tggccccgga cccaaacccc acccctcact ctgcttctcc ccgcaggatg ttcctgtcct 361 tccccaccac caagacctac ttcccgcact tcgacctgag ccacggctct gcccaagtta 421 agggccacgg caagaaggtg gccgacgcgc tgaccaacgc cgtggcgcac gtggacgaca 481 tgcccaacgc gctgtccgcc ctgagcgacc tgcacgcgca caagcttcgg gtggacccgg 541 tcaacttcaa ggtgagcggc gggccgggag cgatctgggt cgaggggcga gatggcgcct 601 tcctctcagg gcagaggatc acgcgggttg cgggaggtgt agcgcaggcg gcggcgcggc 661 ttgggccgca ctgaccctct tctctgcaca gctcctaagc cactgcctgc tggtgaccct 721 ggccgcccac ctccccgccg agttcacccc tgcggtgcac gcttccctgg acaagttcct 781 ggcttctgtg agcaccgtgc tgacctccaa ataccgttaa gctggagcct cggtagccgt 841 tcctcctgcc cgctgggcct cccaacgggc cctcctcccc tccttgcacc ggcccttcct 901 ggtctttgaa taaagtctga gtgggcggca gcctgtgtgt gcctgggttc tctctgtccc 961 ggaatgtgcc aacaatggag gtgtttacct gtctcagacc aaggacctct ctgcagctgc 1021 atggggctgg ggagggagaa ctgcagggag tatgggaggg gaagctgagg tgggcctgct 1081 caagagaagg tgctgaacca tcccctgtcc tgagaggtgc cagcctgcag gcagtggc // LOCUS HSPGK2G 1911 bp DNA PRI 12-SEP-1993 DEFINITION Human testis-specific PGK-2 gene for phosphoglycerate kinase (ATP:3-phospho-D-glycerate 1-phosphotransferase, EC 2.7.2.3). ACCESSION X05246 Y00261 NID g35437 KEYWORDS phosphoglycerate kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1911) AUTHORS McCarrey,J.R. and Thomas,K. TITLE Human testis-specific PGK gene lacks introns and possesses characteristics of a processed gene JOURNAL Nature 326 (6112), 501-505 (1987) MEDLINE 87173013 FEATURES Location/Qualifiers source 1..1911 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 71..1324 /note="PGK (AA 1-417)" /codon_start=1 /db_xref="PID:g35438" /db_xref="SWISS-PROT:P07205" /translation="MSLSKKLTLDKLDVRGKRVIMRVDFNVPMKKNQITNNQRIKASI PSIKYCLDNGAKAVVLMSHLGRPDGVPMPDKYSLAPVAVELKSLLGKDVLFLKDCVGA EVEKACANPAPGSVILLENLRFHVEEEGKGQDPSGKKIKAEPDKIEAFRASLSKLGDV YVNDAFGTAHRAHSSMVGVNLPHKASGFLMKKELDYFAKALENPVRPFLAILGGAKVA DKIQLIKNMLDKVNEMIIGGGMAYTFLKVLNNMEIGASLFDEEGAKIVKDIMAKAQKN GVRITFPVDFVTGDKFDENAQVGKATVASGISPGWMGLDCGPESNKNHAQVVAQARLI VWNGPLGVFEWDAFAKGTKALMDEIVKATSKGCITVIGGGDTATCCAKWNTEDKVSHV STGRGASLELLEGKILPGVEALSNM" misc_feature 135..136 /note="position of intron I in PGK-1" misc_feature 186..187 /note="position of intron II in PGK-1" misc_feature 342..343 /note="position of intron III in PGK-1" misc_feature 487..488 /note="position of intron IV in PGK-1" misc_feature 591..592 /note="position of intron V in PGK-1" misc_feature 711..712 /note="position of intrion VI in PGK-1" misc_feature 826..827 /note="position of intron VII in PGK-1" misc_feature 1006..1007 /note="position of intron VIII in PGK-1" misc_feature 1184..1185 /note="position of intron IX in PGK-1" misc_feature 1224..1888 /note="polyA rich sequence" misc_feature 1283..1284 /note="position of intron X in PGK-1" misc_feature 1711..1716 /note="polyA signal" BASE COUNT 583 a 367 c 442 g 519 t ORIGIN 1 gcccctcaac agcaagttgg ttcttcagca ttaagatcca ggtgtcagcc tatgtcttta 61 tattgtcaag atgtctcttt ctaagaagtt gactttagac aaactggatg ttagagggaa 121 gcgagtcatc atgagagtag acttcaatgt tcccatgaag aagaaccaga ttacaaacaa 181 ccagaggatc aaggcttcca tcccaagcat caagtactgc ctggacaatg gagccaaggc 241 agtagttctt atgagtcatc taggtcggcc tgatggtgtt cccatgcctg acaaatattc 301 cttagcacct gttgctgttg agctcaaatc cttgctgggc aaggatgttc tgttcctgaa 361 ggactgtgta ggcgcagaag tggagaaagc ctgtgccaac ccagctcctg gttcagtcat 421 cctgctggag aacctgcgct ttcatgtgga ggaagaaggg aagggccaag atccctctgg 481 aaagaagatt aaagctgagc cagataaaat agaagccttc cgagcatcac tttccaagct 541 aggggacgtc tatgtcaatg atgcttttgg cactgcacac cgcgctcata gttccatggt 601 gggagtgaat ctgccccata aagcatccgg attcttgatg aagaaggaac tagattactt 661 tgctaaagcc ttggaaaacc cagtgagacc ctttctggct atacttggtg gagccaaagt 721 ggcagacaag atccaactta tcaaaaatat gctggacaaa gtcaatgaga tgattattgg 781 tggtggaatg gcttatacct tccttaaggt actcaacaac atggagattg gtgcttccct 841 gtttgatgaa gagggagcca agatcgttaa agatatcatg gccaaagcac aaaagaatgg 901 tgtaaggatt acttttcctg ttgattttgt tactggggac aagtttgacg agaacgctca 961 ggttggaaaa gccactgtag catctggcat atctcctggc tggatgggtt tggactgtgg 1021 tcctgagagc aacaagaatc atgctcaagt tgtggctcaa gcaaggctaa ttgtttggaa 1081 tgggccgtta ggagtatttg aatgggatgc ctttgctaag ggaaccaaag ccctcatgga 1141 tgaaattgtg aaagccactt ccaagggctg catcactgtt atagggggtg gagacactgc 1201 tacttgctgt gccaaatgga acactgaaga taaagtcagc catgtcagca ctggacgcgg 1261 tgccagtcta gagcttctgg aaggtaaaat ccttcctgga gtagaggccc tcagcaacat 1321 gtagttaata tagtgttact tccttctgtt ttctgtccat ggcccttaag tcagcttaat 1381 gcttttacat ctcgatgtga cttttgttaa aatctactcc tagatcaaga cctatgtaat 1441 ggacaagcag caggccatca ggaactctta atatcagcac agcaattcat tttagtttgg 1501 tcacgcattt gcctgttcaa gttctcattt gaacttcacc attgtgctat ctagggagga 1561 catattctta agttgcctat taaagaaagt gagctgaaga aactgaatct ttttatttta 1621 gtccaacttt gctattgttt cataatttga aacccaaaag ataaaactta atttgttggg 1681 aaagggtgga atgaaagttg acaaacaaac aataaatatg cccaaataaa ctgagaaaaa 1741 taattacata taaagaactc atgggtacca ttaaagttct gctaagggga aatgttatac 1801 taaataggac caaaaaaaag agaaatgaaa gacacttatt aaaccaagga atttttaaaa 1861 agaaaagaaa ttaaactcaa ggtaagaatt atgtttgtgt gcattatcaa t // LOCUS HSU57623 9170 bp DNA PRI 16-JUN-1996 DEFINITION Human fatty acid binding protein FABP gene, complete cds. ACCESSION U57623 NID g1377853 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9170) AUTHORS Peeters,R.A., Veerkamp,J.H., Geurts van Kessel,A., Kanda,T. and Ono,T. TITLE Cloning of the cDNA encoding human skeletal-muscle fatty-acid-binding protein, its peptide sequence and chromosomal localization JOURNAL Biochem. J. 276 (Pt 1), 203-207 (1991) MEDLINE 91248148 REFERENCE 2 (bases 1 to 9170) AUTHORS Phelan,C., Morgan,K., Baird,S., Korneluk,R., Narod,S. and Pollak,M. TITLE MDG1 paper JOURNAL Genomics (1996) In press REFERENCE 3 (bases 1 to 9170) AUTHORS Baird,S. TITLE Direct Submission JOURNAL Submitted (03-MAY-1996) Stephen D. Baird, C.H.E.O, Genetics, 401 Smyth Rd., Ottawa, Ont., Canada, K1H 8L1 FEATURES Location/Qualifiers source 1..9170 /organism="Homo sapiens" /db_xref="taxon:9606" exon 1248..1351 /number=1 CDS join(1279..1351,4738..4910,6800..6901,8339..8392) /codon_start=1 /product="fatty acid binding protein FABP" /db_xref="PID:g1377854" /translation="MVDAFLGTWKLVDSKNFDDYMKSLGVGFATRQVASMTKPTTIIE KNGDILTLKTHSTFKNTEISFKLGVEFDETTADDRKVKSIVTLDGGKLVHLQKWDGQE TTLVRELIDGKLILTLTHGTAVCTRTYEKEA" intron 1352..4737 /number=1 exon 4738..4910 /number=2 intron 4911..6799 /number=2 exon 6800..6901 /number=3 intron 6902..8338 /number=3 exon 8339..8622 /number=4 BASE COUNT 2313 a 2265 c 2351 g 2241 t ORIGIN 1 atcccctgag ccccggagtt tgaggctgca gtgagctatg atggcgtcac agtactccag 61 cctgggagac acagcgagag actttgtctc taaaaaataa taataaaata aaaagttcaa 121 tgaaacaata cacccaaagc cctcagcatg caataaatag caagacaagg caggtcttat 181 ttttactgaa agtgcttagt aaactataca gtgacaaacc accgcacaac aggctctcga 241 aaggaggcag caaattaccc aaaagtgcag gcggcttgct agtgtgcaca ggccaaagaa 301 agggcggcag gtggggaagg cagccatggg ccttgaagag ctgaccgaat tggcagaatt 361 tctgcaggag gggagctggg aacgacctga gctaaagctc ggagctgtgc gaagaaaccg 421 gaaaagccca gagcacttgc aggggcgggt ggggagctag atggtggggt ggggtgggga 481 cggaggaggg ccagcaggag acattccgca gggaggggca agcacgtgtg aggcgggcgg 541 ggcgcgaagg gtcaggcttt tgctcaaaac aggcagagga caaggtcagc tcagccgcag 601 accgagccgc tggtgactgt ctccgccacc aggcagtgag agtgaaggga gagcgcgacc 661 tctgaagccc gctagactaa gcttgcaatc tgagctccat tcaccccctc ctatttcttg 721 agaccttgtc agttcccctg tgagcctcgg actcacttgt aaaacgagga cagatgcccg 781 tgccagaagt caaccagagc tttccccggc gtgggcacca gcccaagggc gttttgtttt 841 ctagtctcat ctctgctctg acgctaagct caaagaggga ctgggggacg ggaagatatc 901 caccatggat gcgccctaga tctcgggctg gtgtcggctg ttccttctca gattccagag 961 tgtctagagg ccaggaaagg gagaaggtcc taccagcctg gggtagggac tcgggggcca 1021 ggcactggcg ctgacgcagg ctagcagggc gccactggct ggtccccacc cacctcggtg 1081 ggttggggga tgggcgcacc agcccctcct gggtgagccc tagcctgggg cttcctattt 1141 cgggagccgg gggcgtgggc cacgtctcct catgtgatgc gagggctatt taaagcggca 1201 gcccgggcag ggagccgccg tcggagccct tgcacgcctg ctctcttgta gcttctctca 1261 gcctagccca gcatcactat ggtggacgct ttcctgggca cctggaagct agtggacagc 1321 aagaatttcg atgactacat gaagtcactc ggtgagcaag ccgcggggct caggatgttg 1381 gcttggggac tggctggtgg cgtgcctagc cccacgcagc actcctgccg catccctcct 1441 ggttaagact ggggaatagg ggagcgcgga gatggcagcc tggcctagag caggtggggc 1501 ctgttcagag ggggctttgg tggtccaaat ctggttagag accacggtag ggaggtggtg 1561 gaaggaggca gctgtgtggg aggctctttc caggaagagg gatatgtgat ttggaggtag 1621 gaggagggtt tggataaaga acactgatca caggaaaggg agtgtagcca ggggagaaaa 1681 agaacagggg catgggtagt ttagaaattg gaggagactg aacccagaaa gggaatgggg 1741 cagccaggga gtgtacaatg atgtaaacaa gtaggaaata cctaggagga aaaagattag 1801 tggggaaaaa actgtggatc agtgaatcag atatgagaag gacgtaagac aggaacctgc 1861 agtaagcagc aatccccatc tctgcttggt tagggaagag aattcttgct ggagaatgcc 1921 ctttctcacc agccagtctg accttgtcct gcagtctatg tatccaggcc ttcatcactg 1981 tctgtgagcc tcgtggtagg gtggggcaag aggcccatga tcagctgggc ctttcctgca 2041 acccaaggct cacctatctg tgcgaggggt aggcagagaa agccattgga cttctgatgt 2101 gcagtagagg gtcccaaggc aaggtcaaga cctgggaggg aggatcactg gtttaggagg 2161 atgtggagaa ctcctgtggt gttgggatgg agaagaatca ggattcaaag aatctcacag 2221 gtgaggaact tggagattcc cataccatct agttcaacag ggaaactgaa accaggagag 2281 tagaaatgta ttataacaat tccacagcag agccaatatg aaaatctaag gtttctagat 2341 ctgtaaccca gagctcttcc cactacccta caggccctgc gagtgggaag aaaagtagaa 2401 actgcttagc taatgattga cctcagccct tcttctactg ctttgggctt agatggagag 2461 gtcaaagctc tcaacggcct ctaccctatc ttgggcgcta tgcccagtaa ttctaggcag 2521 gcagtcattc ttagaggagc agcccccagc ccccacgaac acagcccagc agctattggg 2581 aagttggaat gcccagattt agttcctcct tccaaagctg ggccagagct gagtcttgaa 2641 ttgagctgca acaactttac cattcttgtt cccttattct gccccgagtt gggtcagcgg 2701 gctggtctcc ctgaagtcct gttatctttc agcagcttat gttaaggcag ccagcattct 2761 catcgtagga atggaaagcc tgggaaaata ccctcctcag ctctcagtaa gtagtgctgg 2821 cttcatttct aagtagaacc cagatctccc tgagtctcct aaattctgtc agctcaatat 2881 tcttagtttc tcttggttca gaccctcact catcccgcag tggtttcctt ttcaaacact 2941 ccatacctct gggtagatcc taagtgaaca gagctcccag tgccgtgaca aggtcctgct 3001 ctgtgcaagg gagtgtgatt ggcctgactc atcctgatac caaggggcaa tgccaagttc 3061 ctcactggcc aagcaagggt gggctgacag cataacagca gaggcagccc ctgcccctcc 3121 tgctgtagac ctagggctct caaggggcaa agaggtcccg tctagtacca gtgaccacag 3181 gcacaactgc tggcctggat tgagtatgtg ctggacagaa tcgcccagtg aaaatagtca 3241 acagttttgg agccgaggtt caaatctatg tcagtagttt attctctttg aatttcgaca 3301 agacacttcg cactcttcat tgtaaactgg ggataatcta cgcttcgagg ctgttacaag 3361 cattaagtaa aacaacccat gtagggcatg tgcagagtac ctagcttcca gcaagcacta 3421 tgtagccagg tacatttgga gactttacac acaccacctc actacactgg gctgcctcct 3481 gcctcacctt tgccttggaa gacagttcaa tgttaagctg ctggggggag agggggcagt 3541 catgattagt tctttgttct ttacttggtt gcaggacact taggactttg cccagtaccc 3601 aaggaagcca tgcttgggtc aggaagagag tctctgtaaa gccttagact gggagtcagg 3661 agacgggttt gagtctaact cattgttgct accgctttag gctcctcctg aatctgcaca 3721 ataggacaaa tacttccttt gtacctaact cctagatcat agataacagg ctttgaaaat 3781 gatgggttgc catgtataag ggacaagagc actaacactt cttagtttca gggtaaaaac 3841 ttccaaagtt ggaaaactcc tatgcctaag gctttggaag ggaaagtcta tgtttctctt 3901 ctttcctcag ccttattcct aaggctttga gagcttttca ggtgccctgg aaggcagcct 3961 tatgctccag ccttgggagg tagtatagct gagcacttaa gcaagctctg gactcagaca 4021 attctgggct tcaatctcag atttgtgacc ctgggcttta cctctgtttt tgtatctgta 4081 acgtggaaac agtcttcaga agaacaggaa gaactaaatg agataacatg tacagttctt 4141 actacacaaa aagctcatag tacttaatag tagctttttt tttttttttt tgagatggaa 4201 tctcactctg ttggctaggc tggagtgcag tggcacaatc tcgactcact gcaacctcca 4261 cctcctaggt tcaagcaatt ctcagcctca gcctcctgag tagctgggat tacaggcaca 4321 taccaccaca tctggctaat tttttgtatt tttagtagag acgggtttca ccatattggc 4381 caggctggtc ttaaactcct ggcctcatgt gatccgcctg ccttggcctc ccaaagtgtg 4441 attacaggcg tgagccacca cacttggccc aatagtagct tattctaatc ccagctctgc 4501 cactgacttg ctatggcact gctgttcctt aagtatctct catctaatgg gatcagttat 4561 ctgtgttcac caaacagaac taagcgcaag actgaatttt aaaattccca tgcaaaggct 4621 ttgaaagata cagtcctcca cttccccata cccaggcctg agagttattc attgagtttc 4681 ttgtacactg cttctctacc ccagctcata tactcataac cttcccccta ccctcaggtg 4741 tgggttttgc taccaggcag gtggccagca tgaccaagcc taccacaatc atcgaaaaga 4801 atggggacat tctcacccta aaaacacaca gcaccttcaa gaacacagag atcagcttta 4861 agttgggggt ggagttcgat gagacaacag cagatgacag gaaggtcaag gtaagtcagg 4921 gaaacagggg tggggaatgg agagtgctga gactctaaaa gagaataggc tggtagtctt 4981 ggctccctgg tattgcaccc tgaggggcag actatcatgg ggaatttaca tgaaacaaga 5041 ttcataaagc ctgtgtagtg ctggaatgcc actgatgcta aatacatgtc agttctgtcc 5101 tcttgttttc ttccctccct tcttgggatt catctattgt ctgcctcgga atgggcagca 5161 cagagccagg atgttcttct gacctcagta tctactccag ctccagctgg gtgaccctgt 5221 gcaaggtatg cagtagctct aggtttcttt ccccttccat agatggagag ttatgtggcc 5281 atggctgtga cctgaagtgc tttaggaatg atgcccagaa gtcagggccc tccactgagt 5341 gaggtcattg tgacctccag cagcaaaaaa ggcagccagg aactagaagc acctactcag 5401 atgccgcttc aacttctaac tcccagacat ggccaatgac cctgacaaac tatttccagt 5461 gttgccagct gacaggcagg aaagagctat gttccgtgat agggcattca ccttgtcatg 5521 aatgtgtttg cagtgtctcc caccaagcct tagcccctcc tcccagggtt ctatcaccct 5581 gcagtggctg tcttggcagc ttgcctcagc cttccaggcc aggcatggga gcgagagaac 5641 ttaagggctt tgacctctat agggtgtccc tatagcagtg ttctatcatg acactatcat 5701 tcagccccat cagctgtttc ctcttcctca tagctgtccc cagaaagaac aggatcacac 5761 aggtggctgg cagcagagct ggggatggtg cccaaagatg gcagtctacc ttggataaag 5821 gtggctgccc caccacctgc tcatacctcc ttggacttgc ctactttctc aaggggcaag 5881 aaccccaatt aaacacaata gccctgtgga atgcctaggg caaaaatatc tactctgagt 5941 aggcaaaaaa aactagggga atgagaacaa ggagtaaggt aaggataaaa aagagcacac 6001 taagagacag gcctcatacc ccttatcacc taaacaatac acagaacctt ctcagattct 6061 cctactgaac caccttgctc atcaggatcc cttagcctgg ccttgtggcc cccaaactcc 6121 taggaaagag agctggaaga gctgccaaat gagaaccagc tgatgtatgt atgctggcag 6181 cacccagagc tgaggaacca cttcaagggc atccagtcac aggactttgt ggttgctgcc 6241 ctcttgttgg ctaaagaggt cacatgatgt ggaccaagaa aaggtgtagg aatacagggc 6301 aggaagtcta attatccaat acttcctatc actaagggtc ttttagacat tatgtggact 6361 aaccacaagg ctggataaag attctcagga ctactcctcc tcctcagtca gtctttccca 6421 gggatagact agtaaatccc acctgtatct gaggggacca ggctacggga atcacctaga 6481 gtacagataa gtgtctgtct tgaaggcttg tggtacttct cagagccagg ctctctggct 6541 ccaccatact gcctgcctct ccctccttgc ctaatatctg aaggcctctt ccccagaaag 6601 gcagtagtgg agcagaggct ggaggtgaac tagatgtctt gcagggatag ctgggaggcg 6661 gattgcctga gctcttgtcc tcacaccatc actagtttgg gtcaaaggct gtgtcctctg 6721 tggcccagtg tccagacccc accctgcccc tcaattcctg actaagatca cagctcaggc 6781 ctctaccctc tttccacagt ccattgtgac actggatgga gggaaacttg ttcacctgca 6841 gaaatgggac gggcaagaga ccacacttgt gcgggagcta attgatggaa aactcatcct 6901 ggtaagatgg gcaactttgg agctatatct gattggttat tactactgct ctttcagcca 6961 agcctgttct aaaaagccaa gtcctcccct gagagctgta gaagctggga caagagagtg 7021 gttgtgggtc agggtggtat caggtgggaa tttttctgtg tagtggcttt ggactcacac 7081 aggccggaac tcaaatctta ccttataggc tacatgactg tgggcaaatc accttttcca 7141 agtgcaactg taaaacgggt attaataata ccaaccttgt agggctgctg ggaagcctgt 7201 aagagacagt gtatgcacag cacaaagcat cactgattga ggaacacagc aggtgctcca 7261 tgtcctttgt ttgctcttcc tgtgtttcta ccttgcctca cctcaggaag aagtagaaaa 7321 cagggccaaa tctgatccca ggccctctag gaggggctcc cattgcctat ctcagcattc 7381 cctttcctct cctccctagg actgcattgt cacttgcagg gacaggctcg tgactggtgg 7441 ggacactgaa tgacagtaca gtcctttctt ccccattcta gtcctacccc attttcatgc 7501 tttctatgtc tggcctactg aaactacttg actactgctt gggtaggaag taccacagcc 7561 aggctggcag atctgttcaa gcttggggac ttcacttgga gaatctagcc ttgactgaat 7621 tccccccaga cccagggaga gcagccaact gtggattctg cctaaccaca gggcctcagg 7681 ttttcaccta ggcatcttca ctgcacacct tcttgggtca gcataacctg ttaactgcat 7741 tcttgtactc atgtgggaca ggggtcccct tgaagtttgg aatgaggtgc ctagctttgg 7801 tggggatgtg atatgcagga ccaaattctc agtggcagct gaactatggt gaggccatgg 7861 gtctggctct atgatgccag accggatagt gggaggtaca gggctctggc cctggcacta 7921 ctctaagtta gggaaggatt ggagttagta cccaaacaca gtcctttcct gagtctctgg 7981 atatttttcc tatttgtcaa ctatatgcca ggcaccatct tagacactaa ggatgaagaa 8041 gccaaatggt ataagggaag gaaaaacact caggtcttga ccaaattact tcctctctaa 8101 aggctcgttt tttccaaatc tctaaaataa gaattacaat gcctgtctta aggatttgct 8161 gtgcatatca gaaaaaaaaa attatgtatg tatacacaca cacacacaca cacacacaca 8221 tacatacttg ccggcactgg taggtctcag tgacaattat caggaggaag ggagggtaga 8281 atgctcgcaa tggtgttcct ggctcccacc ccccatctca ctctgtcttt ccttccagac 8341 actcacccac ggcactgcag tttgcactcg cacttacgag aaagaggcat gacctgactg 8401 cactgttgct gactactact ctgccaatcg gctacccctc gactcagcac cacattgcct 8461 catttcttcc tctgcatttt gtacaaatcc acgaattctt ctggggtcag gtgccactga 8521 ccgggatcca gttccagttc ccatggtgta tgtggttttt tttttttttt tttaactgca 8581 ctcatagggt gctctgaggt caataaagca gagccaaggc cacccagttg ccttttggcc 8641 tttggtaaca taactctggg agtcttggtt tatcctgtgt gtcagagagt gggcagaaat 8701 aacggcctga aggttactga ggaagaagca ctggatggga gactgaaatg gacagtctcg 8761 gagcctgtta atcagctgat caccttacac atttaataat aaaagagctg tacctacacg 8821 ttgcctttac actgcccccc ctccatggtc aaatgaccta gttcagtcag tgatggggct 8881 tccccaggtt tggctattga actgtcactt caggcccatc ctacactgaa agctcttggg 8941 tctggctgtt ctctgtgaaa tgctgtagtc tctccctttc cagaattcag gttcagggca 9001 cagaacccag gcttgtacca tggtggtggg agaaaatgac cactggccaa gaggactgct 9061 gacctgtgca ccaggctagt acttatgact acaaattctt actgcttctc taatcaactc 9121 tgagggaaga gggcatctga tcattacaaa agggagggct tataagtgat // LOCUS HSARYLA 3637 bp DNA PRI 24-APR-1993 DEFINITION Human DNA for arylsulphatase A (EC 3.1.6.1). ACCESSION X52150 NID g28859 KEYWORDS arylsulphatase; lysosomal enzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3637) AUTHORS Kreysing,J. TITLE Direct Submission JOURNAL Submitted (26-MAR-1990) Kreysing J., Georg-August University Biochemie II, Goalerstr 12D, 3400 Goettingen, F R G REFERENCE 2 (bases 1 to 650) AUTHORS Kreysing,J., von Figura,K. and Gieselmann,V. TITLE Structure of the arylsulfatase A gene JOURNAL Eur. J. Biochem. 191 (3), 627-631 (1990) MEDLINE 90361046 COMMENT See for mRNA sequence of arylsulphatase A. Data kindly reviewed (02-NOV-1990) by Hall L. FEATURES Location/Qualifiers source 1..3637 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="G1/1" /cell_type="leukocytes" /clone_lib="EMBL-3" /chromosome="22" misc_feature 191..200 /note="GC-box 1" misc_feature 201..210 /note="GC-box 2" misc_feature 213..222 /note="GC-box 3" misc_feature 240..249 /note="GC-box 4" prim_transcript 256..3356 mRNA join(256..847,997..1237,1352..1570,1645..1814,2127..2251, 2342..2469,2720..2822,2938..3356) exon 256..847 /number=1 CDS join(630..847,997..1237,1352..1570,1645..1814,2127..2251, 2342..2469,2720..2822,2938..3257) /EC_number="3.1.6.1" /codon_start=1 /product="arylsulphatase a" /db_xref="PID:g28860" /db_xref="SWISS-PROT:P15289" /translation="MGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSS TTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPL EEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLT CFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRP FFLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLV IFTADNGPETMRMSRGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSL DLLPTLAALAGAPLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTG KYKAHFFTQGSAHSDTTADPACHASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATP EVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQICCHPGCTPRPACCHCPDPHA" intron 848..996 /number=1 exon 997..1237 /number=2 intron 1238..1351 /number=2 exon 1352..1570 /number=3 intron 1571..1644 /number=3 exon 1645..1814 /number=4 intron 1815..2126 /number=4 exon 2127..2251 /number=5 intron 2252..2341 /number=5 exon 2342..2469 /number=6 intron 2470..2719 /number=6 exon 2720..2822 /number=7 intron 2823..2937 /number=7 exon 2938..3356 /number=8 polyA_signal 3351..3356 BASE COUNT 566 a 1290 c 1107 g 674 t ORIGIN 1 agccgctcct cctctgagaa gctccggacc cgagaggaca ccgacactgc gcagcgccga 61 gcccgcgcgc agcccggacg cctcagccag ggccgaccgc gcagaggaag ctcccagagc 121 ccgtttcaag accgcagcca acagcctcag gcgcacacgg cggcctcgga gcgagcacgc 181 gcagcaacgc ccctcgcccc ggcccgcccc cggccccgcc ccgcaagggt cacaggtcac 241 ggggcggggc cgaggcggaa gcgcccgcag cccggtaccg gctcctcctg ggctccctct 301 agcgccttcc ccccggcccg actgcctggt cagcgccaag tgacttacgc ccccgaccct 361 gagcccggac cgctaggcga ggaggatcag atctccgctc gagaatctga aggtgccctg 421 gtcctggagg agttccgtcc cagccctgcg gtctcccggt actgctcgcc ccggccctct 481 ggagcttcag gaggcggccg tcagggtcgg ggagtatttg ggtccggggt ctcagggaag 541 ggcggcgcct gggtctgcgg tatcggaaag agcctgctgg agccaagtag ccctccctct 601 cttgggacag acccctcggt cccatgtcca tgggggcacc gcggtccctc ctcctggccc 661 tggctgctgg cctggccgtt gcccgtccgc ccaacatcgt gctgatcttt gccgacgacc 721 tcggctatgg ggacctgggc tgctatgggc accccagctc taccactccc aacctggacc 781 agctggcggc gggagggctg cggttcacag acttctacgt gcctgtgtct ctgtgcacac 841 cctctaggta aagagggggc cgcgcctctt ccccgccccg accctccatc cctttcctcc 901 caatggattg caggggggcg ggaaaaacgt ctgtctctct ctctagggaa ggccacattt 961 ctgtctgtct cagggactct gtgacttgtc ccgcagggcc gccctcctga ccggccggct 1021 cccggttcgg atgggcatgt accctggcgt cctggtgccc agctcccggg ggggcctgcc 1081 cctggaggag gtgaccgtgg ccgaagtcct ggctgcccga ggctacctca caggaatggc 1141 cggcaagtgg caccttgggg tggggcctga gggggccttc ctgccccccc atcagggctt 1201 ccatcgattt ctaggcatcc cgtactccca cgaccaggta ggaaccaccc gggccctcag 1261 ccaccctccc acctcccaaa gtcccccagc cccttgactg tcccgcagcc ccacctgcca 1321 gcccagccct cacggcagct gcccgcctca gggcccctgc cagaacctga cctgcttccc 1381 gccggccact ccttgcgacg gtggctgtga ccagggcctg gtccccatcc cactgttggc 1441 caacctgtcc gtggaggcgc agcccccctg gctgcccgga ctagaggccc gctacatggc 1501 tttcgcccat gacctcatgg ccgacgccca gcgccaggat cgccccttct tcctgtacta 1561 tgcctctcac gtaagtgatc ttggcccaac cccctggctg cccgttgacc cctacccagt 1621 gctaactcca gtctttgccc ccagcacacc cactaccctc agttcagtgg gcagagcttt 1681 gcagagcgtt caggccgcgg gccatttggg gactccctga tggagctgga tgcagctgtg 1741 gggaccctga tgacagccat aggggacctg gggctgcttg aagagacgct ggtcatcttc 1801 actgcagaca atgggtatgc cagcagggca gctgggtgct ccggccctgt cacgggccag 1861 ggcctggagg ccttgcagtt cagctgcttg ccaagaacat agtgggtgag ggggtgccag 1921 gagatgctgg ccacgttgca ggggcccaag gtgtagtcag gagacacagg tgcacagaga 1981 gctggtcttg gtaggcctgg gaggtgccgg gctcatgctg ggcacctccg ggcaagcttt 2041 gtgacttaga ggtgtggggc cactggtcac cctcggtggc tcagaggctg tggctccatg 2101 gctcatgagc gcctcctgtg tcccagacct gagaccatgc gtatgtcccg aggcggctgc 2161 tccggtctct tgcggtgtgg aaagggaacg acctacgagg gcggtgtccg agagcctgcc 2221 ttggccttct ggccaggtca tatcgctccc ggtcagtccg caggccctct ccttggaacc 2281 ctggccccac caccccaacc ttgatggcga actgagtgac tgaccagcct cctgccccca 2341 ggcgtgaccc acgagctggc cagctccctg gacctgctgc ctaccctggc agccctggct 2401 ggggccccac tgcccaatgt caccttggat ggctttgacc tcagccccct gctgctgggc 2461 acaggcaagg tagggccggt gacccctgat cccagatcct tggcccctgt cctggccttc 2521 ccctggggtg agtgtggcag tgctgagagt ctgtgcctca gtgcctcctg cactgagtgg 2581 catccaagtg gcgccacctc tcaggttcct gggtgggcaa gaagcggtgc acgtccaggg 2641 cctcccacca gggctggcag cccaggtatg tgcagtgctt gggcctgccc cgccccgtga 2701 cccctgactc tgcccccaga gccctcggca gtctctcttc ttctacccgt cctacccaga 2761 cgaggtccgt ggggtttttg ctgtgcggac tggaaagtac aaggctcact tcttcaccca 2821 gggtaacccc tccccgtgga tccctccccc cgaacctgct gacccctccc cggagcccta 2881 gatccctggc ccctcctctc gcccttgccc tgtgcacaga attggccccc tccccaggct 2941 ctgcccacag tgataccact gcagaccctg cctgccacgc ctccagctct ctgactgctc 3001 atgagccccc gctgctctat gacctgtcca aggaccctgg tgagaactac aacctgctgg 3061 ggggtgtggc cggggccacc ccagaggtgc tgcaagccct gaaacagctt cagctgctca 3121 aggcccagtt agacgcagct gtgaccttcg gccccagcca ggtggcccgg ggcgaggacc 3181 ccgccctgca gatctgctgt catcctggct gcaccccccg cccagcttgc tgccattgcc 3241 cagatcccca tgcctgaggg cccctcggct ggcctgggca tgtgatggct cctcactggg 3301 agttgtgggg gaggctcagg tgtctggagg gggtttgtgc ctgataacgt aataacacca 3361 gtggagactt gcagctgtga caattcgacc aatcctgggg taatgctgtg tgctggtgcc 3421 ggtcccctgt ggtacgaatg aggaaactga ggtgcagaga ggttcaggac ttgtacaaga 3481 tcacccagcc agaaagaggt tgggctggga tttgaaccct ggtgtcgtgg ctctggaagc 3541 tgccctggcg ctccttggtg atctgcgtgg gtctgtgcac acaggcacac gtcagccaca 3601 aggcacatgg acgagcgcac gtgcttgagt gcaggac // LOCUS S63168 1594 bp DNA PRI 23-AUG-1993 DEFINITION CCAAT/enhancer-binding protein delta=transcription factor CRP3 homolog [human, prostate carcinoma cell line LNCaP, Genomic, 1594 nt]. ACCESSION S63168 NID g386449 KEYWORDS . SOURCE human prostate carcinoma cell line LNCaP. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1594) AUTHORS Cleutjens,C.B., van Eekelen,C.C., van Dekken,H., Smit,E.M., Hagemeijer,A., Wagner,M.J., Wells,D.E. and Trapman,J. TITLE The human C/EBP delta (CRP3/CELF) gene: structure and chromosomal localization JOURNAL Genomics 16 (2), 520-523 (1993) MEDLINE 93300531 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 134356] from the original journal article. This sequence comes from Fig. 1B. Map location: 8q11. FEATURES Location/Qualifiers source 1..1594 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 90..1337 gene 130..939 /gene="CCAAT/enhancer-binding protein delta, C/EBP delta" CDS 130..939 /gene="CCAAT/enhancer-binding protein delta, C/EBP delta" /note="transcription factor CRP3 homolog; mouse CRP3/rat CELF homolog. This sequence comes from Fig. 1B; C/EBP delta" /codon_start=1 /product="CCAAT/enhancer-binding protein delta" /db_xref="PID:g386450" /translation="MSAALFSLDGPARGAPWPAEPAPFYEPGRAGKPGRGAEPGALGE PGAAAPAMYDDESAIDFSAYIDSMAAVPTLELCHDELFADLFNSNHKAGGAGPLELLP GGPARPLGPGPAAPRLLKREPDWGDGDAPGSLLPAQVGPCAQTVVSLAAAGQPTPPTS PEPPRSSPRQTPAPGPAREKSAGKRGPDRGSPEYRQRRERNNIAVRKSRDKAKRRNQE MQQKLVELSAENEKLHQRVEQLTRDLAGLRQFFKQLPSPPFLPAAGTADCR" BASE COUNT 293 a 540 c 525 g 236 t ORIGIN 1 cccggggcgc ccccgcggtg ccggagtcgg ggcggggcgt gcacgtcagc cggggctaga 61 aaaggcggcg gggctgggcc cagcgaggtg acagcctcgc ttggacgcag agcccggccc 121 gacgccgcca tgagcgccgc gctcttcagc ctggacggcc cggcgcgcgg cgcgccctgg 181 cctgcggagc ctgcgccctt ctacgaaccg ggccgggcgg gcaagccggg ccgcggggcc 241 gagccagggg ccctaggcga gccaggcgcc gccgcccccg ccatgtacga cgacgagagc 301 gccatcgact tcagcgccta catcgactcc atggccgccg tgcccaccct ggagctgtgc 361 cacgacgagc tcttcgccga cctcttcaac agcaatcaca aggcgggcgg cgcggggccc 421 ctggagcttc ttcccggcgg ccccgcgcgc cccttgggcc cgggccctgc cgctccccgc 481 ctgctcaagc gcgagcccga ctggggcgac ggcgacgcgc ccggctcgct gttgcccgcg 541 caggtgggcc cgtgcgcaca gaccgtggtg agcttggcgg ccgcagggca gcccaccccg 601 cccacgtcgc cggagccgcc gcgcagcagc cccaggcaga cccccgcgcc cggccccgcc 661 cgggagaaga gcgccggcaa gaggggcccg gaccgcggca gccccgagta ccggcagcgg 721 cgcgagcgca acaacatcgc cgtgcgcaag agccgcgaca aggccaagcg gcgcaaccag 781 gagatgcagc agaagttggt ggagctgtcg gctgagaacg agaagctgca ccagcgcgtg 841 gagcagctca cgcgggacct ggccggcctc cggcagttct tcaagcagct gcccagcccg 901 cccttcctgc cggccgccgg gacagcagac tgccggtaac gcgcggccgg ggcgggagag 961 actcagcaac gacccatacc tcagacccga cggcccggag cggagcgcgc cctgccctgg 1021 cgcagccaga gccgccgggt gcccgctgca gtttcttggg acataggagc gcaaagaagc 1081 tacagcctgg acttaccacc actaaactgc gagagaagct aaacgtgttt attttccctt 1141 aaattatttt tgtaatggta gctttttcta catcttactc ctgttgatgc agctaaggta 1201 catttgtaaa aagaaaaaaa accagacttt tcagacaaac cctttgtatt gtagataaga 1261 ggaaaagact gagcatgctc acttttttat attaattttt acagtatttg taagaataaa 1321 gcagcatttg aaatcgcccc tgcttcctat attcgcagtg actcccgccc gcccgccgcc 1381 gccggtcgga ggacccggct cggaagggcg ttccggaccg cagccagcca gcacctaggg 1441 agcccgggcg ccaggtgtgt gtgtgggggg ggcgggggga tgggcgcagc ggcgagctac 1501 tcaggagaga gggtctgtcg cttttaaaac gcattaaagg ctctctcctg gccttattta 1561 acttgcctaa gctaggtgga gcacggctga gctc // LOCUS HSHSP27 2496 bp DNA PRI 28-MAR-1995 DEFINITION Human gene for 27kDa heat shock protein (hsp 27). ACCESSION X03900 NID g32475 KEYWORDS heat shock protein 27. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2496) AUTHORS Hickey,E., Brandon,S.E., Potter,R., Stein,G., Stein,J. and Weber,L.A. TITLE Sequence and organization of genes encoding the human 27 kDa heat shock protein JOURNAL Nucleic Acids Res. 14 (10), 4127-4145 (1986) MEDLINE 86232547 REMARK Erratum:[Nucleic Acids Res 1986 Oct 24;14(20):8230]] COMMENT Data kindly reviewed (26-AUG-1986) by Hickey E. FEATURES Location/Qualifiers source 1..2496 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 18..38 /note="Pelham sequence pot. heat control element" misc_feature 46..59 /note="inverted complement of Pelham sequence" misc_feature 46..58 /note="abbreviated repeat of Pelham sequence" promoter 130..137 /note="pot. CAAT-box (major transcript)" precursor_RNA 159..1829 /note="pot. primary transcript (major transcript)" mRNA 159..612 /note="exon I (major transcript)" misc_RNA 159 /note="pot. CAP site (major transcript)" promoter 181..187 /note="pot. CAAT-box (minor transcript)" misc_RNA 209 /note="pot. CAP site (minor transcript)" mRNA 209..612 /note="exon I (minor transcript)" precursor_RNA 209..1829 /note="pot. primary transcript (minor transcript)" CDS join(250..613,1337..1400,1519..1690) /codon_start=1 /product="hsp 27" /db_xref="PID:g32476" /db_xref="SWISS-PROT:P04792" /translation="MTERRVPFSLLRGPSWDPFRDWYPHSRLFDQAFGLPRLPEEWSQ WLGGSSWPGYVRPLPPAAIESPAVAAPAYSRALSRQLSSGVSEIRHTADRWRVSLDVN HFAPDELTVKTKDGVVEITGKHEERQDEHGYISRCFTRKYTLPPGVDPTQVSSSLSPE GTLTVEAPMPKLATQSNEITIPVTFESRAQLGGRSCKIR" intron 614..1336 /note="intron I" misc_feature 780..952 /note="region of homology to human Alu family" mRNA 1336..1401 /note="exon II" intron 1401..1518 /note="intron II" mRNA 1520..1829 /note="exon III" misc_feature 1803..1809 /note="pot. polyadenylation signal" polyA_site 1829 /note="pot. polyadenylation site" misc_feature 1922..2280 /note="region of homology to human Alu family" BASE COUNT 546 a 757 c 734 g 459 t ORIGIN 1 gaattcattt gcttttcctt aacgagagaa ggttccagat gagggctgaa ccctcttcgc 61 cccgcccacg gcccctgaac gctgggggag gagtgcatgg ggaggggcgg ccctcaaacg 121 ggtcattgcc attaatagag acctcaaaca ccgcctgcta aaaatacccg actggaggag 181 cataaaagcg cagccgagcc cagcgccccg cacttttctg agcagacgtc cagagcagag 241 tcagccagca tgaccgagcg ccgcgtcccc ttctcgctcc tgcggggccc cagctgggac 301 cccttccgcg actggtaccc gcatagccgc ctcttcgacc aggccttcgg gctgccccgg 361 ctgccggagg agtggtcgca gtggttaggc ggcagcagct ggccaggcta cgtgcgcccc 421 ctgccccccg ccgccatcga gagccccgca gtggccgcgc ccgcctacag ccgcgcgctc 481 agccggcaac tcagcagcgg ggtctcggag atccggcaca ctgcggaccg ctggcgcgtg 541 tccctggatg tcaaccactt cgccccggac gagctgacgg tcaagaccaa ggatggcgtg 601 gtggagatca ccggtgagcc cccctgctcc tgcaggggag aggaggaggc tagcagggcg 661 ggcagggccg ggggcgtgcg gttgaaacgg gggtcccggg ggcctgggga gttaaacgtt 721 ggcccagcac cgggaaaaac aggactcctg attcccttgc tcaggaattg ggagtgcggg 781 tcgcttctaa gggcgctttc tgctctgtaa tcccagcgct ttgggaggcc gagacgggag 841 gatcgcttga ggccaggagt tcaagactag cctgggcaac atagcgagac gcgccccccc 901 gccccgaccc cgcgccatta caaaaaaaaa gcaaacaaaa atttttttaa agatcatcga 961 tgaagagaga aaatgcgctt ttctacagag tccccttccc acccacagcc ccatccccag 1021 ataagcgggg agttccctgg cgcggtgcca gtttctagcc gctgagtggg cgtgtgcgcg 1081 gctccaagtg cgcctgcgta ctgctcactc cccagctccg cgccctgctc cgttcctccc 1141 aaaactctga atcgaagaac tttccggaag tttctgagag cccagaccgg cgggcacgcc 1201 cccatcccca accccctctg ttaatcccta ccagcctgca gtcctggctg cttccaagca 1261 ggaggtgggg cctctggcta gcggggccga aaaagtcccc tcccccgcat gtctgatttc 1321 cctcttcccc ccaaaggcaa gcacgaggag cggcaggacg agcatggcta catctcccgg 1381 tgcttcacgc ggaaatacac gtgagtcctg gcgccaggtc ggggtgggtg ggtggcgtgg 1441 gggtggggtc agggaagagg gcacagggac ccacccggtg tgtaatgtaa cgcttgcctt 1501 tcctctctgc acgtccaggc tgccccccgg tgtggacccc acccaagttt cctcctccct 1561 gtcccctgag ggcacactga ccgtggaggc ccccatgccc aagctagcca cgcagtccaa 1621 cgagatcacc atcccagtca ccttcgagtc gcgggcccag cttgggggca gaagctgcaa 1681 aatccgatga gactgccgcc aagtaaagcc ttagcccgga tgcccacccc tgctgccgcc 1741 actggctgtg cctcccccgc cacctgtgtg ttcttttgat acatttatct tctgtttttc 1801 tcaaataaag ttcaaagcaa ccacctgtca ctggcccagg ccctggtgtt tgtggaagga 1861 agcctcaggc acctgccatt tgctggcttt caggagtcat ctttgctcag gcccgtgctg 1921 ggccatgtgg gtacactggt gtaggttgct ggacacaggc tgactcacat ccataaagac 1981 agaggtctta gggccgggcg cagtggctca tacctacaat cccagcactt tggggggttg 2041 aagcaggagg agtgcttgaa gccaagagtt ctagaccagc ctggacaaca tagtaagact 2101 gtctctaaaa aataaaaatt aggcagggtg gtactgcacg cctgtagtcc cagctactca 2161 ggaggctgag gcaggaggat cgcttgagcc cagagttgtg aaggtacagt gagctaacat 2221 cgtgccattg cactccagcc tgggcaacag aacaagatcc tgtctcaaaa caaccaaaag 2281 cccagagaga aagagtgaga ccccatcttt aaaagaaaaa aaaaaaaggt catgattgca 2341 aggtcacgat tgcaattaaa actgtaaggt ggggaaggag gaggaaataa gagaagcacc 2401 tgaggcttga gttctcagga gcacctaggt tgggtcccag gtgaaggggc acagaggtaa 2461 ttgcacctca gagctgatgg gaggattact atgtca // LOCUS HUMROD1X 2841 bp DNA PRI 09-JAN-1995 DEFINITION Human rod outer segment membrane protein 1 (ROM1) gene exons 1-3, complete cds. ACCESSION M96759 NID g292430 KEYWORDS disk morphogenesis; disk rim; peripherin-related protein; rod outer segment membrane protein 1; rod photoreceptor; transmembrane protein. SOURCE Homo sapiens (tissue library: lambda DASH) adult DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2841) AUTHORS Bascom,R.A., Schappert,K. and McInnes,R.R. TITLE Cloning of the human and murine ROM1 genes: genomic organization and sequence conservation JOURNAL Hum. Mol. Genet. 2 (4), 385-391 (1993) MEDLINE 93278386 FEATURES Location/Qualifiers source 1..2841 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_lib="lambda DASH" /map="Unassigned" mRNA join(637..1241,1629..1875,1992..2487) /gene="ROM1" /note="G00-120-350" exon 637..1241 /gene="ROM1" /note="G00-120-350" /number=1 gene join(637..1241,1629..1875,1992..2487) /gene="ROM1" CDS join(652..1241,1629..1875,1992..2210) /gene="ROM1" /codon_start=1 /db_xref="GDB:G00-120-350" /product="rod outer segment membrane protein 1" /db_xref="PID:g292431" /translation="MAPVLPLVLPLQPRIRLAQGLWLLSWLLALAGGVILLCSGHLLV QLRHLGTFLAPSCQFPVLPQAALAAGAVALGTGLVGVGASRASLNAALYPPWRGVLGP LLVAGTAGGGGLLVVALGLALALPGSLDEALEEGLVTALAHYKDTEVPGHCQAKRLVD ELQLRYHCCGRHGYKDWFGVQWVSSRYLDPGDRDVADRIQSNVEGLYLTDGVPFSCCN PHSPRPCLQNRLSDSYAHPLFDPRQPNQNLWAQGCHEVLLEHLQDLAGTLGSMLAVTF LLQALVLLGLRYLQTALEGLGGVIDAGGETQGYLFPSGLKDMLKTAWLQGGVACRPAP EEAPPGEAPPKEDLSEA" intron 1242..1628 /gene="ROM1" /note="G00-120-350" /number=1 exon 1629..1875 /gene="ROM1" /note="G00-120-350" /number=2 intron 1876..1991 /gene="ROM1" /note="G00-120-350" /number=2 exon 1992..2487 /gene="ROM1" /note="G00-120-350" /number=3 BASE COUNT 538 a 871 c 867 g 565 t ORIGIN 1 gcccggggcc gcagtctcca gacccccccg ggccctcgga ctctcccggg gccgctctcg 61 gctcccgggg gtggggtggc agggccgtcc ggtgccacag cgccgcagca caaacaggcg 121 ccggacgcgg agccgccagg aagcgcggga gggggggcgg gcccgagggg gggccgggcc 181 gcttggtaac ccctccctgt ccgggcctcg ccgctcagta cgggggcggg gctagccggc 241 tgaccccctg gcctactccc gccgtccggc tccaggccct tcccggatcc ccgcccccgg 301 attcccaggg gacggggaag gtagcgcccg ccccgatatc tccgcccccc agccccctaa 361 cccctcaggg ttgagcggac caaccccacc acttccgcga ggggcagggg cggggtcaca 421 aaacgggccc tcggcctagg ggcggagttt ctcgtaaggg gcaaggccaa ggcatcttgt 481 attggggctg acaggggggc gggttattag ggctgaggat gggaggtagc tcagggtatt 541 ggggtcaggg tggcattagc ccagctcaag ccgggccggg ctgactcagc atcctgcccc 601 agccagcttc catccctgac acctctgcac tcccttgggc agagatggga gatggcgccg 661 gtgttgcccc tggtgctgcc cctgcagccc cgcatccgcc tggcacaagg gctctggctc 721 ctctcctggc tgctggcgct ggctggtggc gtcatcctcc tctgtagtgg gcacctcctg 781 gtccagctaa ggcaccttgg caccttcctg gctccctcct gtcagttccc tgtcctgccc 841 caggctgccc tggcagcggg cgcggtggct ctgggcacag gactagtggg tgtaggagcc 901 agccgggcaa gtctgaatgc agctctatac cctccctggc gaggggtcct gggcccgctg 961 ctggtggctg gcacggctgg tggggggggg ctcctggtcg tcgccctcgg gctagccctg 1021 gctttgcctg ggagtctgga tgaggcgctg gaggagggcc tggtgactgc cttggctcac 1081 tacaaggaca cagaggtgcc tgggcactgt caggccaaaa ggctggtgga tgagctgcaa 1141 ctgaggtacc actgctgcgg gcgccacggg tacaaggatt ggtttggggt ccagtgggtc 1201 agcagccgtt acctggatcc cggtgaccgg gatgtggctg agtgagtgat ttgcgtctcc 1261 cttcctcctc ctcctcctcc ctggacaggc tccctcctgc tgccttgaat ccccacctcg 1321 ctcagagggg caataagtag aacatagtgg ctgagagact ggttacagct ctgccattta 1381 ctagctgtga aacccagggc atgttaccaa accaccctgg gcccattcct tcacctgtaa 1441 aatggaaata atagtactta tctgatagag ttgttgtgaa gatgtgaatt atgcttggct 1501 ggcacatagt acagcagtca gtaaatgttt tactattctt tttcccttct gaacacctgt 1561 gcccttcagt ccctccccca ggcctctatc tccagacatc cttaacccct ctgtccctcc 1621 ctttgcagcc ggatccagag caatgtagaa ggcctatacc tgactgatgg ggtccctttc 1681 tcctgttgca acccccactc accccggcct tgcctgcaaa accgtctttc agactcctac 1741 gcccaccccc tgttcgatcc ccgacaaccc aaccaaaacc tctgggccca agggtgccat 1801 gaggtgctgc tggagcactt gcaggacttg gcaggcacac tgggtagcat gctggctgtc 1861 accttcctac tgcaggtgag tcagcaaagc atctgacacc tcctcccacc cgggactcct 1921 ccctgcctcc aaccctgggc ctcttggaac cgctgactct ccctgactct ttccccttgc 1981 ttcccccaca ggctctggtg ctccttggcc tgcggtacct gcaaacagca ctggaggggc 2041 ttggaggggt cattgatgcg ggaggagaga cccagggcta tctctttccc agtgggctga 2101 aagatatgct gaaaacagca tggctacagg gaggggttgc ctgcaggcca gcacctgagg 2161 aggccccacc aggagaagca cctcccaagg aggatctatc tgaggcctag aggcctggag 2221 cttggggtga ggaagaggga gggatggaca agtctgaaaa cctcacaact ccttaccaag 2281 gctccaggtt ggggggatcg taggattaga ggggctaagg atagtcagcg agctggactg 2341 gggtaagaaa gaaaaccaga tgtcctaggg cctagccctt gtagtcagaa ccaccaggga 2401 acagcaaaga acagagtgat gggaaagtga catgagaagg cctggaggct gattctgata 2461 tagactcaat aaagtttttg gatggaagca attgcttttt cttgtcaagg ggatgggggc 2521 ctgggagaac tgatttctgt ctgatggagc agctaggact ccaaagtttg gaccctggct 2581 cgacctgtgc agcaacagga gcccacatct gtaaggatca gaaagcaaga acccaatgta 2641 agaagcaaag ggaaaacaag aggcccctcc aggttgagat tctttattct ggaggtagga 2701 agggggtcag catgctcagg tgggaagggt ccagcccagc tcctccagcc cccagtgcat 2761 gcccagcccc aataagttac ccagttactc agctgccctc cctcctgggt ccatctgtcc 2821 ttctgttcca ccctagacag g // LOCUS HUMSSTR3X 1413 bp DNA PRI 13-JAN-1995 DEFINITION Human somatostatin receptor subtype 3 (SSTR3) gene, complete cds. ACCESSION M96738 NID g338498 KEYWORDS somatostatin receptor. SOURCE Homo sapiens (tissue library: Stratagene #946203) male placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1413) AUTHORS Yamada,Y., Reisine,T., Law,S.F., Ihara,Y., Kubota,A., Kagimoto,S., Seino,M., Seino,Y., Bell,G.I. and Seino,S. TITLE Somatostatin receptors, an expanding gene family: cloning and functional characterization of human SSTR3, a protein coupled to adenylyl cyclase JOURNAL Mol. Endocrinol. 6 (12), 2136-2142 (1992) MEDLINE 93149123 FEATURES Location/Qualifiers source 1..1413 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="placenta" /tissue_lib="Stratagene #946203" gene 98..1354 /gene="SSTR3" CDS 98..1354 /gene="SSTR3" /codon_start=1 /product="somatostatin receptor subtype 3" /db_xref="PID:g338499" /translation="MDMLHPSSVSTTSEPENASSAWPPDATLGNVSAGPSPAGLAVSG VLIPLVYLVVCVVGLLGNSLVIYVVLRHTASPSVTNVYILNLALADELFMLGLPFLAA QNALSYWPFGSLMCRLVMAVDGINQFTSIFCLTVMSVDRYLAVVHPTRSARWRTAPVA RTVSAAVWVASAVVVLPVVVFSGVPRGMSTCHMQWPEPAAAWRAGFIIYTAALGFFGP LLVICLCYLLIVVKVRSAGRRVWAPSCQRRRRSERRVTRMVVAVVALFVLCWMPFYVL NIVNVVCPLPEEPAFFGLYFLVVALPYANSCANPILYGFLSYRFKQGFRRVLLRPSRR VRSQEPTVGPPEKTEEEDEEEEDGEESREGGKGKEMNGRVSQITQPGTSGQERPPSRV ASKEQQLLPQEASTGEKSSTMRISYL" BASE COUNT 218 a 464 c 467 g 264 t ORIGIN 1 atgggagggg gcagcacaga gaaagccatt ctctgctgtg accgagctgt ttttccttcc 61 cccaggcaaa tgactgctga ccaccctccc ctcagccatg gacatgcttc atccatcatc 121 ggtgtccacg acctcagaac ctgagaatgc ctcctcggcc tggcccccag atgccaccct 181 gggcaacgtg tcggcgggcc caagcccggc agggctggcc gtcagtggcg ttctgatccc 241 cctggtctac ctggtggtgt gcgtggtggg cctgctgggt aactcgctgg tcatctatgt 301 ggtcctgcgg cacacggcca gcccttcagt caccaacgtc tacatcctca acctggcgct 361 ggccgacgag ctcttcatgc tggggctgcc cttcctggcc gcccagaacg ccctgtccta 421 ctggcccttc ggctccctca tgtgccgcct ggtcatggcg gtggatggca tcaaccagtt 481 caccagcata ttctgcctga ctgtcatgag cgtggaccgc tacctggccg tggtacatcc 541 cacccgctcg gcccgctggc gcacagctcc ggtggcccgc acggtcagcg cggctgtgtg 601 ggtggcctca gccgtggtgg tgctgcccgt ggtggtcttc tcgggagtgc cccgcggcat 661 gagcacctgc cacatgcagt ggcccgagcc ggcggcggcc tggcgagccg gcttcatcat 721 ctacacggcc gcactgggct tcttcgggcc gctgctggtc atctgcctct gctacctgct 781 catcgtggtg aaggtgcgct cagctgggcg ccgggtgtgg gcaccctcgt gccagcggcg 841 ccggcgctcc gaacgcaggg tcacgcgcat ggtggtggcc gtggtggcgc tcttcgtgct 901 ctgctggatg cccttctacg tgctcaacat cgtcaacgtg gtgtgcccac tgcccgagga 961 gcctgccttc tttgggctct acttcctggt ggtggcgctg ccctatgcca acagctgtgc 1021 caaccccatc ctttatggct tcctctccta ccgcttcaag cagggcttcc gcagggtcct 1081 gctgcggccc tcccgccgtg tgcgcagcca ggagcccact gtggggcccc cggagaagac 1141 tgaggaggag gatgaggagg aggaggatgg ggaggagagc agggaggggg gcaaggggaa 1201 ggagatgaac ggccgggtca gccagatcac gcagcctggc accagcgggc aggagcggcc 1261 gcccagcaga gtggccagca aggagcagca gctcctaccc caagaggctt ccactgggga 1321 gaagtccagc acgatgcgca tcagctacct gtaggggcct ggggaaagcc aggatggccc 1381 gaggaagagg cagaagccgt gggtgtgcct agg // LOCUS HSPNMTB 3799 bp DNA PRI 24-APR-1993 DEFINITION Human gene for phenylethanolamine N-methylase (PNMT) (EC 2.1.1.28). ACCESSION X52730 NID g35560 KEYWORDS methylase; phenylethanolamide N-methylase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3799) AUTHORS Nagatsu,T. TITLE Direct Submission JOURNAL Submitted (19-APR-1990) Nagatsu T., Department of Biochemistry, Nagoya University School of Medicine, Nagoya 466, Japan REFERENCE 2 (bases 1 to 3799) AUTHORS Sasaoka,T., Kaneda,N., Kurosawa,Y., Fujita,K. and Nagatsu,T. TITLE Human phenylethanolamine n-methyltranserase gene: existence of two types of mRNA with different transcription initiation sites JOURNAL Neurochem. Int. 15, 555-565 (1989) COMMENT See also for mRNA sequence and for overlapping genomic sequence. Data kindly reviewed (20-AUG-1990) by Nagatsu T. FEATURES Location/Qualifiers source 1..3799 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="ghPNMT1101" /chromosome="17" misc_feature 814..829 /note="pot. glucocorticoid response element" misc_feature 998..1012 /note="pot. glucocorticoid response element" protein_bind 1027..1037 /note="pot. Sp1 binding site" /bound_moiety="Sp1" protein_bind 1395..1404 /note="pot. Sp1 binding site" /bound_moiety="Sp1" protein_bind 1407..1416 /note="pot. Sp1 binding site" /bound_moiety="Sp1" TATA_signal 1640..1648 exon 1670..1894 /number=1 mRNA join(1670..1894,2846..3053,3168..3688) CDS join(1693..1894,2846..3053,3168..3606) /EC_number="2.1.1.28" /codon_start=1 /product="phenylethanolamine n-methyltransferase" /db_xref="PID:g296668" /db_xref="SWISS-PROT:P11086" /translation="MSGADRSPNAGAAPDSAPGQAAVASAYQRFEPRAYLRNNYAPPR GDLCNPNGVGPWKLRCLAQTFATGEVSGRTLIDIGSGPTVYQLLSACSHFEDITMTDF LEVNRQELGRWLQEEPGAFNWSMYSQHACLIEGKGECWQDKERQLRARVKRVLPIDVH QPQPLGAGSPAPLPADALVSAFCLEAVSPDLASFQRALDHITTLLRPGGHLLLIGALE ESWYLAGEARLTVVPVSEEEVREALVRSGYKVRDLRTYIMPAHLQTGVDDVKGVFFAW AQKVGL" intron 1895..2845 /number=1 misc_feature 2243..2597 /note="Alu sequence" misc_feature 2392..2406 /note="pot. glucocorticoid response element" exon 2846..3053 /number=2 intron 3054..3167 /number=2 mRNA 3168..3688 /note="exon 3" polyA_signal 3671..3676 BASE COUNT 733 a 1024 c 1363 g 679 t ORIGIN 1 ctggcactgg gtggtaacca gcaagccagc tggcatccgc atccagggtt tgtttcaatg 61 atgtctcgtg gagaatatgg aggggctggt gccaggactg tccttggctt tgcctcgggg 121 tgtgaacggg gtcagtgacc tctaaaacta acctgcctct cagttctgaa tccagacaga 181 atcaatcctc agctgtgtct cgctccacac cccctgccct ggaagccagg gaaggttgga 241 ggtgctaggg ggtcaggctc ccctctgtga cccctgcagc tgttgtggtg actcatgtcc 301 caacctagct gcctctccca aggagacttt cccctgggac aagggggagg gaatggcatg 361 gaggaggccc acatcaagcg gggccaggaa cccacggtgg caggagctgg gctggtgacc 421 tacccagggc agaagggccc gggactcatc cagaggggaa ggaaggggtc ttcaggaaga 481 ccacggagat gccacaggca gaattggctt cccatctggg agataggtgg ggagaccctg 541 gcattttgac agccagaacc tggggtgctg agcagaatct tcatgcctgg cctggccgcc 601 ttcggaggga agctggaggg ttgggtgcga gaggagtggg gtcagagccc ctacatccgc 661 aggaccccaa atcggctggg ccccaaggcc cggactgcgc tccccggtgg ccccggcggc 721 cctccgcgaa tgcgtcctgc ccctcccctg cccaagccct ctgccctcac ccgggtccgg 781 cgccgccccc gaagtggcgg gaacaacccg aacccgaacc ttctgtcctc gggagccccc 841 agataagcgg ctgggaaccc gcggggcccg caggggaggc ccggctgttc cgcccgctaa 901 gtgcattagc acagctcacc tcccctatcg cgcctgccat cggacgggca gtgccgcgcc 961 ctgctctggg gcccccggag cgaccacagc ggaggccgga acggactgtc ctttctgggg 1021 cggggtgggg agggggtgtc gctggagggc ccggtggcat agcaacggac gagagaggcc 1081 tggaggaggg gcggggaggg ggagttgtgt ggcagttcta agggaagggt gggtgctggg 1141 acgggtgtcc gggagggagg ggagcctggc ggggtctggg gcctcgtcgc ggagggcgct 1201 gcgaggggga aactggggaa agggcctaat tccccagtct ccacctcgaa tcaggaaaga 1261 gaaggggcgg gctgctgggc aaaagaggtg aatggctgcg gggggctgga gaagagagat 1321 gggaggggcc ggccggcggg ggtgaggggg tctaaagatt gtgggggtga ggaactgagg 1381 gtggggggcg cccagaggcg ggactcgggg cggggcaggc gaggcggagg gcgagggctg 1441 cgggagcaag tacggagccg ggggtgtggg ggacgattgc cgctgcagcc gccgccccac 1501 tcacctccgg tgtgtctgca gcccggacac taagggagat ggatgaatgg gtggggagga 1561 tgcggcgcac atggccccgg gcggctcggc ggtcagctgc cgcccccaca gcggaccggt 1621 cggggcgggg gtcgggcggt agaaaaaagg gccgcgaggc gagcggggca ctgggcggac 1681 cgcggcggca gcatgagcgg cgcagaccgt agccccaatg cgggcgcagc ccctgactcg 1741 gccccgggcc aggcggcggt ggcttcggcc taccagcgct tcgagccgcg cgcctacctc 1801 cgcaacaact acgcgccccc tcgcggggac ctgtgcaacc cgaacggcgt cgggccgtgg 1861 aagctgcgct gcttggcgca gaccttcgcc accggtgagc gggggaaact gaggcacgag 1921 ggacaagagg tcgtcgggga gtgaaagcag gcgcagggaa ataaaaagaa ggaaagggag 1981 acagaccagg cgcctaacag atggggacca agaaacaaga gatagctgag aggtgcaaac 2041 agaagagaaa aaggagcaac atcccttagg agaggggcag aggagagaga ggtggagaga 2101 gggggcggag agtgctcaga attgagagct aaggtggggg atgcaggaca gactgaggtg 2161 gagatgcata ggaggaaatg gaggcagatg tgggacaggg gtgagaaact ccaggatttc 2221 ctcgctgagc ctggctggta ggtatagttg ttttctttct ttttctttat tttattttca 2281 tttatttact tatttttatt ttttatttgt tttgagacgg agtttcgctc ttgttgccca 2341 ggctggagta caatggcgcc atctcggctc actgcaacct ccgcctcccc gggttcaagc 2401 gattctcttg cctcagcttc cctagtagct gggattacag gcatgcgccc ccatgcctgg 2461 ctaatttatt tgtattttta gtagagacgg gacttctcca tgttggtcag gctggtctcg 2521 aactcccaac cttaggatcc acccaccccg gcctcccaaa gtgctgggat tacaggtgtg 2581 agccactgcg cccggccagt aggtatagtc ttctagatgt gaaacctgag tctcagagcg 2641 gtgaagttcc cttccgaagg gcagcccatg ttggagctgg gttcagtcta actctggggc 2701 caatgctttt tccagatgga gacacatttg cagaggagaa ggaagaacta gagagaggca 2761 gggagatgca ggggagggaa gggtaaggag gcaggggctg cctgggctgg ctggcaccag 2821 gaccctcttc ctctgccctg cccaggtgaa gtgtccggac gcaccctcat cgacattggt 2881 tcaggcccca ccgtgtacca gctgctcagt gcctgcagcc actttgagga catcaccatg 2941 acagatttcc tggaggtcaa ccgccaggag ctggggcgct ggctgcagga ggagccgggg 3001 gccttcaact ggagcatgta cagccaacat gcctgcctca ttgagggcaa ggggtaagga 3061 ctggggggtg agggttgggg aggaggcttc ccatagagtg gctggttggg gcaacagagg 3121 cctgagcgta gaacagcctt gagccctgcc ttgtgcctcc tgcacaggga atgctggcag 3181 gataaggagc gccagctgcg agccagggtg aaacgggtcc tgcccatcga cgtgcaccag 3241 ccccagcccc tgggtgctgg gagcccagct cccctgcctg ctgacgccct ggtctctgcc 3301 ttctgcttgg aggctgtgag cccagatctt gccagctttc agcgggccct ggaccacatc 3361 accacgctgc tgaggcctgg ggggcacctc ctcctcatcg gggccctgga ggagtcgtgg 3421 tacctggctg gggaggccag gctgacggtg gtgccagtgt ctgaggagga ggtgagggag 3481 gccctggtgc gtagtggcta caaggtccgg gacctccgca cctatatcat gcctgcccac 3541 cttcagacag gcgtagatga tgtcaagggc gtcttcttcg cctgggctca gaaggttggg 3601 ctgtgagggc tgtacctggt gccctgtggc ccccacccac ctggattccc tgttctttga 3661 agtggcacct aataaagaaa taataccctg ccgctgcggt cagtgctgtg tgtggctctc 3721 ctgggaagca gcaagggccc agagatctga gtgtccgggt aggggagaca ttcaccctag 3781 gctttttttc cagaagctt // LOCUS HSIGF2G 8837 bp DNA PRI 27-JAN-1993 DEFINITION Human gene for insulin-like growth factor II. ACCESSION X03562 M13970 M14116 M14117 M14118 NID g33003 KEYWORDS growth factor; hormone; insulin super family; insulin-like growth factor II; signal peptide; somatomedin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8837) AUTHORS Dull,T.J., Gray,A., Hayflick,J.S. and Ullrich,A. TITLE Insulin-like growth factor II precursor gene organization in relation to insulin gene family JOURNAL Nature 310 (5980), 777-781 (1984) MEDLINE 84295593 REFERENCE 2 (bases 1 to 8837) AUTHORS Tadokoro,K., Fujii,H., Inoue,T. and Yamada,M. TITLE Polymerase chain reaction (PCR) for detection of ApaI polymorphism at the insulin like growth factor II gene (IGF2) JOURNAL Nucleic Acids Res. 19 (24), 6967 (1991) MEDLINE 92107706 REFERENCE 3 (bases 1 to 8837) AUTHORS Raizis,A.M., Eccles,M.R. and Reeve,A.E. TITLE Structural analysis of the human insulin-like growth factor-II P3 promoter JOURNAL Biochem. J. 289 (Pt 1), 133-139 (1993) MEDLINE 93143658 COMMENT Data kindly reviewed (24-OCT-1986) by A. Ullrich. FEATURES Location/Qualifiers source 1..8837 /organism="Homo sapiens" /db_xref="taxon:9606" exon <2979..2979 /gene="IGF-II" /number=1 exon <2979..2979 /gene="IGF-II" /number=1 gene 2979..8183 /gene="IGF-II" intron 2980..5690 /gene="IGF-II" /number=1 intron 2980..5640 /gene="IGF-II" /number=1 sig_peptide 5647..5718 /gene="IGF-II" CDS join(5647..5803,7505..7653,7947..8183) /gene="IGF-II" /codon_start=1 /product="put. IGF-II" /db_xref="PID:g33004" /translation="MGIPMGKSMLVLLTFLAFASCCIAAYRPSETLCGGELVDTLQFV CGDRGFYFSRPASRVSRRSRGIVEECCFRSCDLALLETYCATPAKSERDVSTPPTVLP DNFPEIPLGKFFQYDTWKQSTQRLRRGLPALLRARRGHVLAKELEAFREAKRHRPLIA LPTQDPAHGGAPPEMASNRK" exon 5641..5804 /gene="IGF-II" /number=2 exon 5691..5803 /gene="IGF-II" /number=2 mat_peptide join(5719..5804,7506..7653,7947..8180) /gene="IGF-II" /note="(aa 1-156)" /product="put. IGF-II" mat_peptide join(5719..5804,7506..7565) /gene="IGF-II" /note="B-chain, (aa 1-32)" intron 5804..7504 /gene="IGF-II" /number=2 exon 7505..7653 /gene="IGF-II" /number=3 mat_peptide complement(7539..7566) /gene="IGF-II" /note="C-peptide (aa 33-40)" mat_peptide 7540..7605 /gene="IGF-II" /note="A-chain (aa 41-62)" mat_peptide 7606..7620 /gene="IGF-II" /note="D-peptide (aa 63-67)" mat_peptide join(7621..7653,7947..8180) /gene="IGF-II" /note="E-peptide (aa 78-156)" intron 7654..7946 /gene="IGF-II" /number=3 BASE COUNT 1388 a 3037 c 2697 g 1685 t 30 others ORIGIN 1 cccaaccccg cgcacagcgg gcactggttt cgggcctctc tgtctcctac gaagtccgta 61 gagcaactcg gatttgggaa atttctctct agcgttgccc aaacacactt gggtcggccg 121 cgcgccctca ggacgtggac agggagggct tccccgtgtc caggaaagcg accgggcatt 181 gcccccagtc tcccccaaat ttgggcattg tccccgggtc ttccaacgga ctgggcgnng 241 ctcccggaca ctgaggactg gccccggggt ctcgctcacc ttcagcagcg tccaccgcct 301 gccacagagc gttcgatcgc tcgctgcctg agctcctggt gcgcccgcgg acgcagcctc 361 cagcttcgcg gtgagctccc cgccgcgccg atcccctccg cctctgcgcc cctgaccggc 421 tctcggcccg catctgctgc tgtcccgccg gtgctggcgc tcgtccgctg cgccggggag 481 gccggcgtgg ggcgcgggac acggctgcgg acttgcggct gcgctgcgct cgctcctgct 541 gggcgccccg aaatccgcgc cactttcgtt tgctcattgc aaagatctca tttgtgggga 601 aagcggctgg agggtcccaa agtggggcgg gcagggggct ggggcgaggg acgcggagga 661 gaggcgctcc cgccgggcgg taaagtgcct ctagcccgcg ggcctaggac tccgccggga 721 gggcgcgcgg agngcgaagt gattgatggc ggaagcgggg gggcaagggg ggcagggggg 781 cgcgggattc cgccggcgac cccttcccct tggctaggct taggcggcgg ggggctggcg 841 gggtgcggga ttttgtgcgt ggtttttgac ttggtaaaaa tcacagtgct ttcttacatc 901 gttcaaactc tccaggagat ggtttcccca gacccccaaa ttatcgtggt ggcccccgag 961 accgaactcg cgtctatgca agtccaacgc actgaggacg gggtaaccat tatccagata 1021 ttttgggtgg gccgcaaagg cgagctactt agacgcaccc cggtgagctc ggccatgcag 1081 gtaggatttg agctgtgttt cccgccctga tcctctctcc tctggcggcc ggagcctccg 1141 taggctccaa gcctggccca gattcggcgg cgcagccggc cttccgcgcg tccgcaccta 1201 gcgggggctc cggggctccg gcgcggcacc ggggggcgct cgggatctgg ctgaggctcc 1261 aaggcccgcg tggccggctc ctcctgctgg ggcaggtggc ggctgcgcgc cccgcccgag 1321 cccaggggcc ccctcagccg caacaaccag caaggacccc ccgactcagc cccaagccac 1381 ctgcatctgc actcagacgg ggcgcacccg cagtgcagcc tcctggtggg gcgctgggag 1441 cccgcctgcc cctgcctgcc cggagacccc agctcacgag cacaggccgc ccgggcaccc 1501 cagaaacccg ggatggggcc cctgaattct ctaggacggg cattcagcat ggccttggcg 1561 ctctgcggct ccctgccccc cacccagcct cgcccccgcg caccccccag cccctgcgac 1621 cgccgccccc ccccccgggg ccccagggcc ccagcccgca ccccccgccc cgctcttggc 1681 tcgggttgcg ggggcgggcc gggggcgggg cgagggctcc gcgggcgccc attggcgcgg 1741 gcgcgaggcc agcggccccg cgcggccctg ggccgcggct ggcgcgacta taagagccgg 1801 gcgtgggcgc ccgcagttcg cctgctctcc ggcggagctg cgtgaggccc ggccggcccc 1861 ggcccccccc ttccggccgc ccccgcctcc tggcccacgc ctgcccgcgc tctgcccacc 1921 agcgcctcca tcgggcaagg cggccccgcg tcgacgccgc ccgctgcctc gctgctgact 1981 cccgtcccgg gcgccgtccg cggggtcgcg ctccgccggg cctgcggatt ccccgccgcc 2041 tcctcttcat ctacctcaac tccccccatc cccgcttcgc ccgaggaggc ggttcccccc 2101 gcaggcagtc cggctcgcag gccgccggcg ttgtcacccc ccccgcgctc cccctccagc 2161 cctccccccg gcgcgcagcc tcgggccgct cccctttccg cgctgcgtcc cggagcggcc 2221 ccggtgccgc caccgcctgt ccccctcccg aggcccgggc tcgcgacggc agagggctcc 2281 gtcggcccaa accgagctgg gcgcccgcgg tccgggtgca gcctccactc cgccccccag 2341 tcaccgcctc ccccggcccc tcgacgtggc gcccttccct ccgcttctct gtgctccccg 2401 cgcccctctt ggcgtctggc cccggccccc gctctttctc ccgcaacctt cccttcgctc 2461 cctcccgtcc cccccagctc ctagcctccg actccctccc cccctcacgc ccgccctctc 2521 gccttcgccg aaccaaagtg gattaattac acgctttctg tttctctccg tgctgttctc 2581 tcccgctgtg cgcctgcccg cctctcgctg tcctctctcc ccctcgccct ctcttcggcc 2641 cccccctttc acgttcactc tgtctctccc actatctctg cccccctcta tccttgatac 2701 aacagctgac ctcatttccc gatacctttt cccccccgaa aagtacaaca tctggcccgc 2761 cccagcccga agacagcccg tcctccctgg acaatcagac gaattctccc ccccccccca 2821 aaaaaaagcc atccccccgc tctgccccgt cgcacattcg gcccccgcga ctcggccaga 2881 gcggcgctgg cagaggagtg tccggcagga gggccaacgc ccgctgttcg gtttgcgaca 2941 cgcagcaggg aggtgggcgg cagcgtcgcc ggcttccagg taagcggcgt gtgcgggccg 3001 ggccggggcc ggggctgggg cggcgcgggc ttgcggcgac gcccggccct tcctccgccc 3061 gctcccggcc cggggcctgc ggggctcggc ggggcggctg agccgggggg gaggaggagg 3121 aggaggagga ggacggacgg ctgcgggtcc cgttccctgc gcggagcccg cgctaccnnn 3181 nnnnnnnnnn nnnnnnnnnn nnngacgtcc ccgctgaagg gggtcggtct gtgggtgcag 3241 ggggtgccgc ctcacatgtg tgattcgtgc cttgcgggcc ctggcctccg gggtgctggg 3301 taacgaggag gggcgcggag ccgcagaagc ccaccctggt gtcgttgacg ccggtgccag 3361 cgagaccgcg agaggaagac gggggcgggc ggggccagga tggagagggg ccgagttggc 3421 aggagtcatg gcagacgcca cactcgcgac catctccccc acacccctct ggcctctgtc 3481 cgcaacattt ccaaacagga gtcccgggag agggggagag gggctgctgg tctgaggcta 3541 agaagggcag agccttcgac ccggagagag gccgcggccg cctgccccag tggcaacgtt 3601 gaagttttcc atacaacgga ggtcgggaag gagacccccc ccccccttca ctgccctgtg 3661 aagagatgag ccgggggtgc aggatgggag cccatggcac ttcgctacgg gatgtccagg 3721 gctccggttg ggggtgcagg agagaagaga ctggctggga ggagggagag ggcgggagca 3781 aaggcgcggg ggtgtggtca gagggagagg ggtgggggtt aggtggagcc cgggctggga 3841 ggagtcggct cacacataaa actgaggcac tgaccagcct gcaaactgga tattagcttc 3901 tcctgtgaaa gagacttcca gcttcctcct cctcctcttc ctcctcctcc tcctgcccca 3961 gcgagccttc tgctgagctg taggtaacca gggctgtgga gtgaaggacc cccgctgcca 4021 tcccactcca gcctgaggca gggcagcagg gggcacggcc cacgcctggg cctcgggccc 4081 tgcagccgcc agcccgctgc ctctcggaca gcacccccct cccctctttt cctctgcccc 4141 tgcccccacc tggcgtctct gctccctcac ctgctccttc cctttctgtt ccttcccttc 4201 ggccccctcc ttgcccagct caggactttt cctgggccct cacctgctcc gcaccgctgc 4261 atgcttcctg tcctgctttc tgccggtccc ctgacccgga cctccaagcg cagagtggtg 4321 gggcttgttg cggaagcgcg gcgagggcta gagtggccag ctggcggagt gtgctcttag 4381 aatttggaag ggggtggcag agggggcggt gagaggactg gccagggtcc gccatgtcaa 4441 ggagatgacc aaggaggctt tcagatcctc ggcgcagtcg cccactagtc tttagagagg 4501 gcatgcaaag ttgtgcttct gtcccactgc ctgctcagtc gctcacataa tttattgcat 4561 caaaaactcc cctgggtctg cggagcaagg ctggggctgc ccgcctggag ggtaccacct 4621 tctgcaggag cagggccaac ttgctgtggt ggctcccggc ctcccacccc cgagtgggta 4681 acccggccct gtgacctgca gcctgtggag ggggtgtgcc taagactggc ctccccttcc 4741 agattgtagt ctggggaacc tggtgtcgga cttcccaggt ggcctgagct ggtctcttca 4801 gctccacggg gagagtttgg tagcgcaaat agggagatgt tctgggcccc tggccttact 4861 ggttcgattt gaggcctgga aaggaggctc tgggcgtgtg tgtgtgtgtt tgggggtacc 4921 caaggcagac tggagttgga gaactgggtg actgggaaaa caaggtttct agagcatggg 4981 tggcgtggtt gtgttaacca ttggagtcgc ttgacccagg cctggctcag ctgcagactg 5041 gaaaggtgga aaagccaggg ggaggggcgg ggctggccca gcaggactgg cctgctgctt 5101 tgagggcgat ggtcctcctg gaccccccct gctcagctgg gggttgtggg gaggaagggg 5161 ctggtcctcc ttggagcaca tgctctgtag gggtggggct gtctgccatc ttggcggcgc 5221 tggaggcctg agaagtggcg atgtaacgct gggctggccc tgcccccatg gtgtcatagg 5281 acggaggcag gtcgggtgtc cagcctgggc ccctgcagct gtggatgccg ctgagctcct 5341 gcaataatga ccgtgcagat ggtcacccct cgtgtaaaat tactagtgct tcttgcaaat 5401 ggaaggaact gggccttttc tgtgtgcttc tggacgcttc attctgcaca tggccctgcg 5461 ccctcacctc ggcattatga cctgtgtgtt acttttgtaa taaaaataat gtttatagga 5521 aagccgtgct ttcaattttc aactgaattt gtaggttggc aaatttggtt tgggaggggc 5581 acctctggcc tggggcttgg cctggctgcc ccgctcacgc cacttctctc ccgcccccag 5641 acaccaatgg gaatcccaat ggggaagtcg atgctggtgc ttctcacctt cttggccttc 5701 gcctcgtgct gcattgctgc ttaccgcccc agtgagaccc tgtgcggcgg ggagctggtg 5761 gacaccctcc agttcgtctg tggggaccgc ggcttctact tcagtaagta gcagggaggg 5821 gcttcctcag acctggtcag gcccctagag tgaccggtga ggatctccca tcctcaagcc 5881 aggggagcac actcctaggt cagcagccca gccgcttgct ctgagacttt gaccttcccg 5941 ccgcgtttct gagcacgtgc ggtgtcccag ggcatccaca ccagctgcct ttcccatcac 6001 acgcctcctt cgaagggtgg gccagaggtg ccccctagac gtcaggggca tctacagggg 6061 tctccctggg catcagaatt tctgttgggg gccgtgaggc tcctgctcct gaggcaccgc 6121 acgcctagtg cagggcttca ggctctggag gaagagcctg cctttcttcc tgcacctttt 6181 ggacattttg acaagggacg tgcgttcggt gaatgatcag aattaaaatc aataaagtga 6241 tttatataat taaaatcaat aagacaagtg cagttggtgg gtggcagggg tgagcggtgc 6301 atgcgcctcc ttgggcccca aggctgccgt ggggggtgcc cacctgctga cctcaaggac 6361 gcttcagcct ttcctcatgt ttctctcttg gttctccagc ctgggggctg gcaggtgggt 6421 gcatggccca ttgtccttga gaccccaccc ccagataggg gggctgggtg gatgcagagg 6481 caggcatggt gcctgggcat gcctgatggg gcaggggagg ggccgctcct tactggcaga 6541 ggccgcaact tattccacct gacactcacc acgtgacatc tttaccacca ctgcttactc 6601 acgctgtgaa atgggctcac aggatgcaaa tgcacttcaa agcttctctc tgaaaagttc 6661 ctgctgcttg actctggaag cccctgcccg ccctggcctc tcctgtgccc tctctcttgc 6721 ctgccccatt tgggggtagg aagtggcact gcagggcctg gtgccagcca gtccttgccc 6781 agggagaagc ttccctgcac caggctttcc tgagaggagg ggagggccaa gcccccactt 6841 gggggccccc gtgacggggc ctcctgctcc ctcctccggc tgatggcacc tgccctttgg 6901 caccccaagg tggagccccc agcgaccttc cccttccagc tgagcattgc tgtgggggag 6961 agggggaaga cgggaggaaa gaagggagtg gttccatcac gcctcctcag cctcctctcc 7021 tcccgtcttc tcctctcctg cccttgtctc cctgtctcag cagctccagg ggtggtgtgg 7081 gcccctccag cctcccaggt ggtgccaggc cagagtccaa gctcacggac agcagtcctc 7141 ctgtgggggc cctgaactgg gctcacatcc cacacatttt ccaaaccact cccattgtga 7201 gcctttggtc ctggtggtgt ccctctggtt gtgggaccaa gagcttgtgc ccatttttca 7261 tctgaggaag gaggcagcag aagtcacggg ctggtctggg ccccactcac ctcccctctc 7321 acctctcttc ttcctgggac gcctctgcct gccggctctc acttccctcc cctgacccgc 7381 agggtggctg cgnccttcca gggcctggcc tgagggcagg ggtggtttgc tgggggttcg 7441 gcctccgggg gctgggggtc ggtgcggtgc taacacggct ctctctgtgc tgtgggactt 7501 ccaggcaggc ccgcaagccg tgtgagccgt cgcagccgtg gcatcgttga ggagtgctgt 7561 ttccgcagct gtgacctggc cctcctggag acgtactgtg ctacccccgc caagtccgag 7621 agggacgtgt cgacccctcc gaccgtgctt ccggtgaggg tcctgggccc ctttcccact 7681 ctctagagac agagaaatag ggcttcgggc gcccagcgtt tcctgtggcc tctgggacct 7741 cttggccagg gacaaggacc cgtgacttcc ttgcttgctg tgtggcccgg gagcagctca 7801 gacgctggct ccttctgtcc ctctgcccgt ggacattagc tcaagtcact gatcagtcac 7861 aggggtggcc tgtcaggtca ggcgggcggc tcaggcggaa gagcgtggag agcaggcacc 7921 tgctgaccag ccccttcccc tcccaggaca acttccccga gatacccctg ggcaagttct 7981 tccaatatga cacctggaag cagtccaccc agcgcctgcg caggggcctg cctgccctcc 8041 tgcgtgcccg ccggggtcac gtgctcgcca aggagctcga ggcgttcagg gaggccaaac 8101 gtcaccgtcc cctgattgct ctacccaccc aagaccccgc ccacgggggc gcccccccag 8161 agatggccag caatcggaag tgagcaaaac tgccgcaagt ctgcagcccg gcgccaccat 8221 cctgcagcct cctcctgacc acggacgttt ccatcaggtt ccatcccgaa aatctctcgg 8281 ttccacgtcc cctggggctt ctcctgaccc agtccccgtg ccccgcctcc ccgaaacagg 8341 ctactctcct cggccccctc catcgggctg aggaagcaca gcagcatctt caaacatgta 8401 caaaatcgat tggctttaaa cacccttcac ataccctccc cccaaattat ccccaattat 8461 ccccacacat aaaaaatcaa aacattaaac taaccccctt cccccccccc cacaacaacc 8521 ctcttaaaac taattggctt tttagaaaca ccccacaaaa gctcagaaat tggctttaaa 8581 aaaaacaacc accaaaaaaa atcaattggc taaaaaaaaa aagtattaaa aacgaattgg 8641 ctgagaaaca attggcaaaa taaaggaatt tggcactccc cacccccctc tttctcttct 8701 cccttggact ttgagtcaaa ttggcctgga cttgagtccc tgaaccagca aagagaaaag 8761 aagggcccca gaaatcacag gtgggcacgt cgcgtctacc gccatctccc ttctcacggg 8821 aattttcagg gtaaact // LOCUS HUMADAG 36741 bp DNA PRI 04-OCT-1995 DEFINITION Human adenosine deaminase (ADA) gene, complete cds. ACCESSION M13792 NID g178076 KEYWORDS Alu repeat; adenosine deaminase; long terminal repeat (LTR); repetitive sequence. SOURCE Homo sapiens (human). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 36741) AUTHORS Wiginton,D.A., Kaplan,D.J., States,J.C., Akeson,A.L., Perme,C.M., Bilyk,I.J., Vaughn,A.J., Lattier,D.L. and Hutton,J.J. TITLE Complete sequence and structure of the gene for human adenosine deaminase JOURNAL Biochemistry 25 (25), 8234-8244 (1986) MEDLINE 87128922 REFERENCE 2 (sites) AUTHORS Berkvens,T.M., van Ormondt,H., Gerritsen,E.J., Khan,P.M. and van der Eb,A.J. TITLE Identical 3250-bp deletion between two AluI repeats in the ADA genes of unrelated ADA-SCID patients JOURNAL Genomics 7 (4), 486-490 (1990) MEDLINE 90353944 REFERENCE 3 (sites) AUTHORS Gossage,D.L., Norby-Slycord,C.J., Hershfield,M.S. and Markert,M.L. TITLE A homozygous 5 base-pair deletion in exon 10 of the adenosine deaminase (ADA) gene in a child with severe combined immunodeficiency and very low levels of ADA mRNA and protein JOURNAL Hum. Mol. Genet. 2 (9), 1493-1494 (1993) MEDLINE 94061056 COMMENT [2] sites; 3250 bp deletion. [2] describes a patient with severe combined immune deficiency caused by a 3250 base pair deletion in the ADA gene. FEATURES Location/Qualifiers source 1..36741 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20q13.2-qter; 475 bp upstream of HindIII site" /chromosome="20" LTR 1025..1357 /note="THE O family LTR" repeat_region 1362..1672 /note="Alu repeat" LTR 1680..1717 /note="THE O family LTR" repeat_region 2357..2903 /note="Alu repeat" mutation 2455..5706 /note="g-3250 bp-c in [1]; gc in [2] -> immune deficiency dysfunction" prim_transcript 3936..35975 /note="ADA mRNA" gene 4031..35664 /gene="ADA" CDS join(4031..4063,19230..19291,26344..26466,28908..29051, 29823..29938,31176..31303,32425..32496,32573..32674, 32851..32915,34354..34483,35100..35202,35651..35664) /gene="ADA" /codon_start=1 /product="adenosine deaminase" /db_xref="PID:g178077" /translation="MAQTPAFDKPKVELHVHLDGSIKPETILYYGRRRGIALPANTAE GLLNVIGMDKPLTLPDFLAKFDYYMPAIAGCREAIKRIAYEFVEMKAKEGVVYVEVRY SPHLLANSKVEPIPWNQAEGDLTPDEVVALVGQGLQEGERDFGVKARSILCCMRHQPN WSPKVVELCKKYQQQTVVAIDLAGDETIPGSSLLPGHVQAYQEAVKSGIHRTVHAGEV GSAEVVKEAVDILKTERLGHGYHTLEDQALYNRLRQENMHFEICPWSSYLTGAWKPDT EHAVIRLKNDQANYSLNTDDPLIFKSTLDTDYQMTKRDMGFTEEEFKRLNINAAKSSF LPEDEKRELLDLLYKAYGMPPSASAGQNL" exon <4031..4063 /gene="ADA" /note="adenosine deaminase" /number=1 mutation 4052 /gene="ADA" /note="The functional significance of this substitution at cDNA position 117 (AA 8 Asp->Asn) is unknown as it occurs in the same allele as a 5 bp deletion in exon 10 which is associated with very low ADA mRN" /citation=[3] intron 4064..19229 /gene="ADA" /note="ADA intron 1" repeat_region 4907..5227 /note="Alu repeat" repeat_region 5606..5908 /note="Alu repeat" repeat_region 7582..8001 /note="Alu repeat" repeat_region 8179..8484 /note="Alu repeat" repeat_region 10005..10204 /note="Alu repeat" repeat_region 10257..10534 /note="Alu repeat" repeat_region 13452..13777 /note="Alu repeat" repeat_region 14837..15386 /note="Alu repeat" repeat_region 15806..16106 /note="Alu repeat" repeat_region 16913..17224 /note="Alu repeat" repeat_region 18414..18717 /note="Alu repeat" exon 19230..19291 /gene="ADA" /number=2 intron 19292..26343 /gene="ADA" /note="ADA intron 2" repeat_region 19605..19902 /note="Alu repeat" repeat_region 22523..22829 /note="Alu repeat" repeat_region 24481..24773 /note="Alu repeat" repeat_region 25143..25453 /note="Alu repeat" exon 26344..26466 /gene="ADA" /number=3 intron 26467..28907 /gene="ADA" /note="ADA intron 3" repeat_region 26949..27269 /note="Alu repeat" repeat_region 28032..28333 /note="Alu repeat" exon 28908..29051 /gene="ADA" /number=4 intron 29052..29822 /gene="ADA" /note="ADA intron 4" exon 29823..29938 /gene="ADA" /number=5 intron 29939..31175 /gene="ADA" /note="ADA intron 5" exon 31176..31303 /gene="ADA" /number=6 intron 31304..32424 /gene="ADA" /note="ADA intron 6" repeat_region 31460..31867 /note="Alu repeat" exon 32425..32496 /gene="ADA" /number=7 intron 32497..32572 /gene="ADA" /note="ADA intron 7" exon 32573..32674 /gene="ADA" /number=8 intron 32675..32850 /gene="ADA" /note="ADA intron 8" exon 32851..32915 /gene="ADA" /number=9 intron 32916..34353 /gene="ADA" /note="ADA intron 9" exon 34354..34483 /gene="ADA" /number=10 mutation 34462..34468 /gene="ADA" /note="This mutation is found in both alleles of a patient with severe combined immunodeficiency secondary to ADA deficiency. The deletion creates a premature stop codon resulting in very low levels of ADA m" /citation=[3] intron 34484..35099 /gene="ADA" /note="ADA intron 10" exon 35100..35202 /gene="ADA" /number=11 intron 35203..35650 /gene="ADA" /note="ADA intron 11" exon 35651..>35664 /gene="ADA" /note="adenosine deaminase" /number=12 BASE COUNT 8165 a 9716 c 9721 g 9139 t ORIGIN 1 gatctgggta aagggttttc caggtgtcag gatggaagtg actaaggtgc agaggctgga 61 gggctggggc aggtagaagc aagcattcct gttacctact gctgtgtgac aatctccccc 121 taaaacacaa tggcttaaaa taacatccat ttcattacat atctcaatac tataggtcag 181 gaatttgggc tgggcttact tgggtaattc ttctgtccca catggcattg accaaagcct 241 ggttttcagt gggcagctgg gctggatggc ccaacacagc ttcgctaaca tgattgctgt 301 cttcgtaggg atggtggaag cctgggctca gtgggactgt caactggaat ggccatatgt 361 ggactctctt agcatgatgg tctcttctag aagcttgggt tcccagagag aatgttcaag 421 aggccccaaa ggacaccaca aagcttcttt atgaccaagg ctcggaaatc caggaagctt 481 gctcccatca cgctctatta ctccaacaag tcactcaggc cagcccaggt ccaagaggag 541 gaaacctaga ctccatcttg caatgtgaag aattgcaaat aatttgtgtc acccttaagc 601 aaccagcaac tcatctaggt tgattggcat ttcagcaatg tggtgggaag tggtgggact 661 gatgttgaag agggacttga atgtcatgag aggctgggga ggcaataagg tggggagtga 721 agtttctcga gtcagattca aatttaaacc ccagttttgc cacttacaac ccatgagcca 781 agcaggctgt ctctctatct gaacctcagt gtcctcatct gtaaaatgag gagaacacct 841 cctacatctg aggatgactg taaagatgaa atgggatggg tgcttataaa gtgcttccca 901 gtgtacctgg ctccaaacct gtctcagtaa atggcagccc ctattattga acccgagtaa 961 cacagagagc caagaaagga tcttacaaaa aactcccctg gctttgacaa tgtatgagac 1021 ccactgatag ggtttggctt tgtgtcctca cccaaatctc atctagtagc tcccataatt 1081 cctacatgtt gtgggagaga ctcggcggga gataattgaa tcatggggga tggtctttcc 1141 catgctgttc ttgtgatagt aaataagtct cacaagatct gatggtttta aaaatgggag 1201 tttccctgca ggcgctctct ctttgtctac tgccatccat gtaagacgtg acttgctcct 1261 cctttgcctt ctgccatgat tgcaaggcct ccccaccatt gtggaactgt aagtctatta 1321 aagcctcttt cttttgtaaa ttacccagtc tcaggtatgt cttttttttt tttttcatga 1381 gatggagttt cgctcttgtt gcccaggctg gaatgcaatg gtgtaatctt ggctcaccac 1441 aacctccacc tcccaggttc aagcgattct cctgcctcag cctcccgagt agctgggatt 1501 acagtcatac accaccacgc ctggctaatt ttgtattttt tttttttttt ttagtagaga 1561 cggggtttca ccatgttggt caggctggtc tcaaactccc gacctcaggt gatcctcctg 1621 ccttggcctc ccaaagtcct gggattacag gcatgaacca ctgcgcccag gctcgggtat 1681 gtcttcatca gtagcatgaa aataatggac taatacagcc accctctccc tcactcccac 1741 atacaaccaa accccaaatc cagctgattt tacaccctaa atgcagcttg aatatgagtt 1801 tctccacttc ccccactgac atcactatgc cctacccaga ccatggcagt tgcctccttc 1861 ctggtatcct gtcctccctc acccccgctg gccccctgta atgccctccc ctcacagcag 1921 ggagcccagg cttctcaaag tgccctgtgg gtgcgaacca cctgggggtc ctgtttgtat 1981 aaaatacaga ttctacttca gtaggtctgg gatggggtct gaaagtctgc atttgtagtc 2041 agctcccagg tgatgtgggt gctgatgatc cctggatcac actttcagta gctggagaat 2101 attttttcca aataaaaggg tgattttgtc tcgcctccac ttaaaacact ccactgactt 2161 cctaggaatc ccacaccatc gctgggtccc acatccctgg caggattcag ctcccatcag 2221 accttctagc cccttgctct ccactctccc actctctctt tcccccttgt ttatgggttt 2281 gttaatttat ttatgatgaa atgaaatgaa gctaccatcc accccagtac tggaacatta 2341 tcaataacct gtgtgtggcc aggcgtggtg gctcatgcct gtaatcacgc cttgggaagc 2401 cgaggtgggt ggatcatgtg aggtcaggtg ttcgagacca gcctggccaa catggtgaaa 2461 ccccgtctct actacaaatc caaaacttag cagggcacgg tgccacgcgc ctgtaatccc 2521 agctactcgg gacgctgagg ccgagaactg cttaaaatcc aggaggtgga ggttgcagtg 2581 agccgagatt tcgccactgc actccagcct gggcgacaga gcaagagtcc atctcaaaaa 2641 aacaaaaaca aaaacaaaaa aacaaaaaac aaaaattagc caggcgtggt tgtgggcgcc 2701 tataatccca gctactcggg aggctgagac aggaaaatcg cttgaaacgc tgggggtgcg 2761 ggggggcggt ggggaggagg cgggccagag gggcagaggt tgcagtgagc ccagatcgcg 2821 ccacttcact gcagcctccg cgaaagagcg aaactccgtc tcagtaaata aataaataaa 2881 taaataaata aataaataaa taacctgtac ccgcgtgtta tttccctccg tccttacctc 2941 ctcccggctc cttccctttc acctgagata accactcttc tcgtatctat gctcatcttt 3001 cccttgcttt acattttttc caccgatgca tgtgtctaaa catacatact tttggttttg 3061 cttttacaca ttctaaaagt tgcaccattg tatgcagttt tccgcaactt agtttttttc 3121 actcaacatt gtttctgaga cattgtttct gttgttgtct ggctgaagtt cattccgttt 3181 cactgctgtc taacgtttca tggtgtgaat attccggttt atttgcccac tcgcccgtgg 3241 aggggcattt gagggtgttt ccaatgttcc tgttattcgg aatagcgctg gtgtgaacat 3301 tctgcacagg tctctggctg cgcctgggcg ggtttcttaa aggtgaatgc ccaggagggg 3361 actgtctgtg ttctccctcc ctccgagctc cagccttcct cgcctccttt cactcccagc 3421 tccctggagt ctctcacgta gaatgtcctc tccaccccca cccacccctg atgaactcct 3481 gcaggttctg caggccacgg ctggcccccc tcgaaagttc cttaactata caattatggt 3541 gtgtgtttct gcgacgagcg tccgtctatc cggtggaagg cacgccgctc gaggcttgcg 3601 atgctcccgg ggtccccgct tctagcttgg gcctggcgca cagcagcgcc cagactgcag 3661 ggggacgctt gaaagttgct ggaggagccg gggggaaggc agcgcccagc gaggcggctg 3721 gagcgcgcgc ccacaggtgg gtccggtcgg gcgccgcggg gccgtagttt tcgggtcggc 3781 gggcgaggac gccgggtcca gaattccagg aaatgcgcga tccaggccgg cgggcggggc 3841 gggggctccg gcgagagggc gggccccggg aacggcggcg ggcggggcgg gaggcggggc 3901 ccggcccgtt aagaagagcg tggccggccg cggccaccgc tggccccagg gaaagccgag 3961 cggccaccga gccggcagag acccaccgag cggcggcgga gggagcagcg ccggggcgca 4021 cgagggcacc atggcccaga cgcccgcctt cgacaagccc aaagtgagcg cgcgcggggg 4081 ctccggggac gggggtccgg cgcctgggcg gcccgagggg cttagcgggg cccagcccgg 4141 ggcgtccaaa ccctgggaac gaacgggggc tcctgcaggc gagttcttcc ttcggcttag 4201 gccgtggctt gcttgcgggc taatcaggga caatggggca gagaaggtcc agaacccgga 4261 ggcctccaga gtctgcttct gcccctgact tgacccctct gggtctcagt ttcgctgtct 4321 gtcaagtggg catcctagca ccgctgagcg ctgtgtgggc ctgggcaggg acttgaggtc 4381 tctgaagctc agctgtatga tcaggcccga tgtctacgcc ggatagcgac ctagtgctgt 4441 gccccgcgcc tactgagtgc tcagtgaatg gaagcagctt tgtacgccag cgttatggtg 4501 gtgagcgcca aggagctcag gtttgtggat gcgccccggg gaagaaccgt gagccctgcc 4561 agaaagggga gggaggggag cagagcaccc cccttccccc gcgcgggaag aacaggagct 4621 aggtaggccc tgggtttggg gccctagcag ggttcactcg aggccaagcc atggcccact 4681 ggccccaggg gagaatcccc ttgtttctcc gcccaccagc tgtggcgtct tgggactgtt 4741 ggggtcaggg agggtctgga cccccttggc ctgtctcaga gtccgagagg aggggcccag 4801 gagtctgcca agcagggtga gtcagccagt agggtgtgag agtggttggg gaaggagtca 4861 gctgcagtca gcctcaactt acccttctaa gaaataggtg tgagtggccc aggaggttgg 4921 ctcacgcctg taatcccagc actttgtgag gctgaggcgg gaggatcatt tgagtccagg 4981 agtttgagac tagcctggac aacaaaacta gaccccgtct ctccaaaaaa taaaaaaagt 5041 taggggaagt gtgtgtggtg gtgcactccc gtagtcccag ctactcagga ggctgaggcg 5101 ggaggatcgc ttgagcccag gaggttgagg ctgcagtgag gtgtgatggt gccactgacc 5161 ttcagcctgg gagacagagc gagaccctgt ctcaaaaaaa aagagaagaa aaagaaaaga 5221 aaagaaatag gtgtgaatga tgatgacagc tatcacaaaa gtgccggtga gaatccagtg 5281 agtgtgcatg tgtcagtgag ggagacaggc tgtggagagc ccacctacct tctgaggagg 5341 gtgaggcctg gcccccacta ctgatgcccc cagcccaggg aaaatgctca gctactcccc 5401 gtcagaagct ggaacgactg aggtgctgta caagccctcc tacccccacc cctgcctcct 5461 tcacgtctta ctggagctgg ggcccatgat tggcgcctcc cctttgcagt ctttttatta 5521 aatgctctgg gctccctctg cccttgggct ggggacccac tgtaccctga tgtgaatcct 5581 atggcagtag caaagctctt tgattggcgg ggtgcagtgg ctcacgcctg taatcccagc 5641 actttgggag gcaaaggtgg gtggatcatg aggccaggag ttcgagacca gcctggccaa 5701 catggcaaaa ccccatttct actaaaaata caaaaaatta gctgggcatg gtgcgggcgc 5761 ctgtagtccc acgtacgcag aaggctgagg caggagaatg gcataaaccc gggaggtgga 5821 gcttgcagtg agccgagatc tcgccattgc actccagcct gggtgacaga gtgagactct 5881 gtctcaaaaa aaaaaaaaaa aaaaaaaagg ctccttgatt gcgaacatgt tgggagttat 5941 ggagagaaca gcagggccca cttctagagc acttgttgca gacacccatt ggatccttgc 6001 agttcttctg taacagccca tcaagggagg ggctcatatt attatcccca ttttttggcc 6061 ttgctcagtc ctcccatctg attcaagctg gcagatcatt ttccctattg ggacctcagt 6121 gtccacacct ggaggatgga acatcagctg cttatgtggg tgtcccgtgt cctgagtccc 6181 aaggccacaa ggtgatgctt gagagtgaag gtagaatgtt acctgccatg tgtttgaggc 6241 gtgacaaatc ttgtatgatt gtgaggagga acttgtgtga gctggcagga gaagtgggaa 6301 ggagtgtgaa tctcagagcc actgtgacca gagccagctc cctgccctct tgtgggaggg 6361 acagatgaca gttataatta ttagcattac tagctgcagc taatggagtg ttgatgtttc 6421 tgccaggcac cgttctaaac acattatctg cattttttat ttaatccagg cacagagagg 6481 ttaactaggc ccaagatcac acagctagga aatgtccaac tctggggttt gagtccaagg 6541 gaggctggct tcgaaatccc atgcctctaa ccatctttcc taaactacct ctgcagaagc 6601 ctttggggat agaggtgcca gtgccccagg tgcaaacctc ctgagacagg agcctttgct 6661 gtgtccttca gcttctcata cctgccacca gctgaggcct gggacctggt cagctagaag 6721 aaagcagagc agggcagcgc ttttcaaact gcactcaagt ggcctgactt ttaatgttca 6781 cactgtgatt ctgtgtgggt cgggttgggg cctgcgatgc tgcactgctg accagctccc 6841 aggaaatgct aatgtcaacg atccaggaac acactttgct tagcaaggcc ctaggcagct 6901 gccttctgtt gtgcgggacc cctattgact ccaatggata tagcaccagg ttcaagaggc 6961 taccttcttt ggaagaggta gcaaacaaga tacggggttt tactgggggc ttagacacag 7021 ggaagagagt ccagtggcgg cagactgagc agaagaaccg caaccacttg caaatcatgc 7081 agtttatgta gcattttcat ttaacacctt ctcccaacca tctccaccta gtaaccttca 7141 tttaacccaa aacaaagggc ctcggtccct atacccctgt atggtcagtg tcccgtggga 7201 atggggtggg gctcagatgt tcctcataga taacgactgg atctccaggt tggccactct 7261 tggattcctt cgctcagaac tctgaacacc cattcaagtg tgcctgccat gcagggtcat 7321 cgtcagggga tgcccaagtc aagtttgcct gtcgggtgtg cctcccatac ccccacctgg 7381 tttgacttag cacctgctgg gcactggaag aagtgcaaag gggggttgca ggggtggccc 7441 ttatcagcct atgttcacag gtggcaccag gcactcaggc attctgcatc ctggaggcca 7501 gtgctgatca catgcctgtt acaataatca taacaatagc tgtccttgaa gtagtcctgg 7561 gtaccaggtg ccttcagtga ctttttcttc tttgccagaa tctcactctg tcgcccaagc 7621 tggagtgcag tggcaagatt ttgggtccct gcaacctctg cctcctgggt tcatgcgatc 7681 ctcctgcctc agcctcccaa gtagctggga ctacaggcgt gtgccgcagt ctcactctgt 7741 tgcccaggct ggagtgcagt ggtgtgatcc tggctcacta caacctccac ctcccgagtt 7801 caagccattc ttctgcctca gcctccggag tagctgggat tacaggcgtc caccaccacg 7861 cccggctaat ttttgtattt ttagtagaga cagggtttca ccacgttagc cagctggtct 7921 cgaactcctg atctcaggtg atcctcccac cttggcttcc caaagcgctg ggattacagg 7981 tgtgagccac tgtgcccggc tagtaacttt tatctcacgg aatcctctgg acgacttgac 8041 aaggcatggg tcttcatccc catttacaga tgaagaaact gaagcttagg gagtggaggg 8101 acttgccagg gctacacaaa atctgagagc cttgaagctg tagactggca agtgaacagg 8161 tacaggctgg gacagcagtt tctttctttt tttctttttt tagacagagt ttcgctcttg 8221 ttgcccaggc tggagtgcaa tggcacgacc tcggctcact gcaaccttcg cctcccaggt 8281 tcaagtgatt cttctgcctc agcctcccaa gtagctggaa ttacaggcat gcaccaccat 8341 gcccggctaa ttttttgtat ttttagtaga gacggggttt ctccttgttg gccaggctgg 8401 tctcgaactc ccgacttcag gtgatccgcc cacctcagcc tcccaaagtg ccgggattac 8461 aggcatgagc caccgcaccc ggccaaggga cagcagtttc taaactgtcc ctctctgatg 8521 cagaggggaa ttggggctaa atcagcaatg tgccttttct gtctcatatt tgaatgtcta 8581 ctctgcacga ggcgctgtcc tgctttgcat acagtgactc atttaatgtt tatgtcagcc 8641 ctctgaggaa ggtcctgtcc tattattaac ttcacttatt atgaggaaac tgagactcag 8701 agaggggagg gaacttgcca aagtcacaca gctggcaagc agcagagcta gacttgaacc 8761 cagatctgcc tgcactcaag tagaagctgt tcattgcttt gctcatttgc caattccact 8821 ttatgcaaaa aagagggggc agtgtggggg gaagagttag aatcagggtg gcagggtggg 8881 ccagtgcatt agccctgggc ttcagatgta ctggggttga attcctgcct gccgcttagc 8941 agctagggta cctcaggtag acaactcctg aaactcagct tccccctctg taaaatgggg 9001 tgacaaaacc aagatcttgg ggttcttggg gaaactgaca tgctgattgg tttttgtaca 9061 gtgcctggct ggtaacagca ggccctcagg ggtgcgtttc cttcctgggg actggagtgg 9121 gggttgcagt agactctggg aggcctctcc agctgcagaa tctccctcct ccctcctcct 9181 ttttgtcttc ctgacacaaa acccaccagc tgcacttctt tgggcttgca gtggctttca 9241 gttaccagag ccacctgtta aaacaaaaat gtgcctagga agagcctgcc ttacccattt 9301 tgactcacat ggcagttggt ggtggagggg aacaaaggag actgagtttc atcgaagcct 9361 tttgcttcgg aggaggaagg gaggatcaga gagaggaagt ggtctgtgtt cacacaggga 9421 ggcaggggag gccaggcagc ttcccaatcc tgcattcaac ctcagggtgg gcttgacctg 9481 ggtggctggg ggccctgtga tccaggagag acttgtccac ctgctcaggt gtcttgaagg 9541 ggtccctgtg gtaccccctg ggcggggcaa ggtagtagga ccatggtctg gctggggagg 9601 tggagaggag caggctgtgg gcgcagagtg aggttggaat ctgtatttac ccaaggtgtt 9661 gggggtaggc ttgccctcag cccttaatgt tctcaggccc ctgagcagtt gtgggggata 9721 acctctgcac tcctagtgac cagggagcta gaacagcaag gaatttgaac ttggacacca 9781 gctggggtca ggctctctgg gtctgagtcc tgatttccca ctttccagct agaggagctt 9841 gaatgagtca tttaacttca cggtgcctca gtttcccctc tctaaaatga gaattatacc 9901 catacccacc tctcaaacac caagtgcagg cctggctcag agcaggtgct gcagcaatag 9961 ctgccattgg tcagcatcat catcatggtt ggtaatggtc ctactttgac ttttgagaca 10021 gagtctcact ctgtcgccca ggctggagtg cagtggtgca atctcggctc actacaacct 10081 ctgctcccgg gttcaagtga ttcttctgcc tcagtctccc aagtagttgg gattacaggt 10141 gtgcgccacc atgcctggct aatttttgtg tttttagtag agacagggtt tcaccatgtt 10201 ggccataaca atggctgtcc ttgaagtagt cctgggtacc aggtgccttc agtgactttt 10261 tttttttttt tttttttgag ctggagtctt cctctgtcac ccaagctgga gtgcagtggc 10321 acgattttgg ctcactgcaa cctctgcctc ctgggttcat gcgatcctcc tgcctcagcc 10381 tcccaagtag ctgggacttg ggatacactt gcccccgctg gtcctccctt ccacctctgt 10441 gaagaggagg tctcaaactc ctggcctcaa gtgatccacc cacctcagcc tcccaaagtg 10501 ctgggatttc aagagtgagc caccgcacct ggcccctgtt tagatgttag catcagtgac 10561 ccagcacctt gctatgtggc atgcagggag cgtgctgcta gacctccggg tttagagtca 10621 aatagcttcc tggctgtggt gtgcattaga ctttctaact caaggtcctc ccactctctg 10681 agcctcagtc ttgttgcctt taaaacgagt ttaagtgtgc tgagtcccta tgctgtggct 10741 ccacaggaat ttccccaggt ggaagacaca tcttgccttc tgtgaaacct ctcagcagca 10801 gagctgtcag gccccgtcag caggagacac tgtggggact gctcagtccc ttccactgtg 10861 tacctcggag ctggcggagc ctagatgagg ctgagcatag agggcttcct ggaggaagtg 10921 gagctgaaac agtttctcag cccagggctg ctctgtctcc tggcctcaca ctaaaagtca 10981 gttgagaggc catagtggca taagtcactg accctggcac tgcccagctc atcaccaaaa 11041 gcagggctag ggagggaggg gacattcgat tggcagtggg cacctgtggc tcatctgggt 11101 tctggccacg gtgctcaggt tctgtgagct gaccaggcag ccctggctcc tctgcccccg 11161 tgtgggttct gccaggtccc atggggcagg tcagcccctt ccttgttgca gggagagcac 11221 ccagcattgc tgacatggga cagggaaacg aggaaataac ggtgtggtca ttgaacacag 11281 agagcactag gtgctgtgcg aggtgctgag gacacgacat gatgacacag acaaggtccc 11341 ccctctcagc aaacggctca tgagggagac agacatgtta catacatgaa cccaaaaagt 11401 cagacgaaaa caaaacagag cgatgtgttt gggaggcaaa cccaactgcc ggagggcgag 11461 cagttgggaa cgtggaaaca tgagtcagat ctgggagtat ctgtcccagg agtccaagac 11521 ctgggtcctc atggtagctc tgccaccgac acactgagtg accttgggta agtgaaccca 11581 ccgccctgga cctctctggc acgcatctct tgagagcagg gacttagtgc atttcccgag 11641 ggcctccacg gtgcctggca catagtgggg cttagtaaat atttgttggt aactgaggat 11701 gcttcctgtt cacatcagcg ctgggaggat ttcctgctgt tcagacaaat gctgggctgg 11761 ctgtgagtca gccttgcaga gagcaaaggc agtgggaagg ggcgtgagat tcccctctgg 11821 agaggtcagg aggccaggca ctgtctcgac atgagtgcca gggagggggt gtggcctgtg 11881 ggcagggctt gggctgaggc agagggactt gagttccacc ctagctctac caccatcaat 11941 tttgtgtaac tctggacagg ccactgaact tctccgggct tagcctggca agtccatttc 12001 cccatctgta acatgggccg atatgtacat tgcctaggga ttaaatgaga taaagggtct 12061 gaaaacagta ggtagctgct ttatcattat tattatttct gtattattga tgtctgaggc 12121 taggcccaca gaggcagtac agtagagtgg ttaggagctc aagaatcaga ctagggttca 12181 aattctgact ccatcactga ctgttttggg gtacttcttt gaacctcagt ttcttcatca 12241 gtaaaatggg agtgaagtct ctaccttgct ggttgtaagg atgaaataag ataatgcata 12301 tagatggtct agcacatagt agatactcaa aagtttgagg ccactgctga cccttttccc 12361 tgaaaggaga caggagagcg gggtcgccac cccattgtca ttgtcatctg gaataggctg 12421 acagacttcc catggtgtgt tgcagttttc tagaaaattc agtaggaggc ctgcctgagc 12481 ttgagccacc tgtggaggtg cttcctgcct ctgctccaca cctgaaacgc gtctgggcct 12541 cttctcaggc agccgtgaga agggatgagt gctactggtc atggtgggca gctggctctg 12601 ctttccccct tcccagaggc gctcctgcct cctgcccagc tccctgaacc cctagcttct 12661 gcaccccggc actgtctggc ttctgccccg ctgagcaccc actgtctctg acgctgcctt 12721 gagtacttcc cgcatgttat tcaaatccca atcagatctt ccctccccca gtagctggtc 12781 ttctgttctg gcttcctgcc atcctgtcct ccacacagca gccgggaaag gtttttttaa 12841 aggggactct ccgatttaac acacttgggt ggaaaaccct ttgcttcggc ctctgcaatc 12901 tccctgcccc ctctccactt tgccctggcc tcatttctca ccactaacct cactctgcac 12961 tctggccaac tccccgcctg cttcctgatt cagacactaa gcacacgcag ctcccctgcc 13021 tggagccatt ctccctctcc ttctttcttc tccctggaga actccccctt taagtgatct 13081 tttcccaaca cactttctaa attgccccca ccccagtgtg atttttcttt atctcatagc 13141 acttggtctg cttcttatca cagtttgcaa ggctgagttc agaaaggtgt gtttgctcat 13201 tctgaggcag gagaggctac cttgtgctgc tgtggtaaca aacagccccc aggtctgagg 13261 ggtctgcaga gacccaggtt gacctcatac tgcttgtccc tccagggcct ccagtgaggt 13321 ttcggctcct tggatcactc agggccccag gcagatggga agattccact ctgaacattg 13381 ccaattgttg tgccagagta aagcagagct gggaggtggg ctcttgaatt ggcatttaaa 13441 tacttttgcc aggcagggta aggcagctca cgcctgtaat cataacactt tgggaggcct 13501 aggtgggtgg atcacctgag gtcaggagtt caaaaccagc ctggccaaca tggtgaaacc 13561 ctgtctctac taaaagtaca aaaattagcc gggcatggtg gtgggcgcct gtaatcccag 13621 ctacttggga ggctgaggca cgagaatccc ttgaacctgg gaggcagagg ctgcaatgag 13681 ctgagatctt gccactgcac tccagcctgg gcaacagagc cagactccat ctcaaaaaaa 13741 aaaaaacaac aacaacaaat aaataaatga ataaatactt tagccagaag tagccatgca 13801 gacctccccc caccagtccc acccacaagc ggacgtgact accgccccca ttcactgcct 13861 gatcctcctg ttctcagggg ctccaaggcc aggcctggtt tgaccttctg actttctgac 13921 ttcctcctac cttcccagta acctcatgca actcctttca ctcagcctca atcatcccca 13981 tgggtgttta aacttgccca agacatgccc ctttgaaaaa gcctgccatt ctcttgaccc 14041 acatgcacgt cctgccccct ccaaggctgc tagttccttt aggggcaaaa ttgtgaaaga 14101 gtagtctaaa ccttcttcct cttcttacct ccacttcttt cttaccttat tcccatgtgg 14161 attctaccct cactcaggcc tctagaacgg ttcctctacg gcagtggttc ccaatcttga 14221 ctacgtgttt ttttaaaaaa agtcctccac ctgggcctgc caccaaggat ttttctttaa 14281 ttgacctcag atggggttga ggccttggga actggccaga acttcccgtg ctcctaactt 14341 gcagccgggg ttaagaacta ctcctctgaa gcccccagtg cctgcgcttt tagcccgacg 14401 gacaagtttc tgcccttcca tcctgtgacc tccagcaggg cctgaccatg tgagttttct 14461 gtggctgccg tgacaagttg ccacaccctg catggcttca accaacagaa acgtgtgccc 14521 tggcagttct gggggccaga agtccaacat caagatatca tcagagccac atgcccactg 14581 aaggctctcg ggggaatcca ttccttgcct cttctggttg ctggtggctc taggcattcc 14641 ttggcttgtg gctgcatcat tccagtctct gcctctgagg tcacgttgct gcttcctctt 14701 gtgtgtgttt ctcttaaaac tctctgcttc tgtcttataa ggatacatgt gattgcatct 14761 agggcccaac cagataatcc aggataaact cttcctgtca agacatttaa taatcacact 14821 ttgccatata aggtaatttt tttttttttt tgaggtggag ttttgcactt tcacccaggc 14881 tggagtaaag tgatttaatc tcggctcact ggaatctctg cccccaggtt caagcaattc 14941 tcctgcctca gcctcctgag tagctgggat tataggtacc tgccaccatg cccagctaac 15001 ttttgtattt ttagtagaca tggggtttca ccatgttggc caggctggtc tcgaactcct 15061 gacctcaggt gatccacccg ccataagtta atattttttt tttgagaggg agtattgctc 15121 tgttgcccag gctggagtgc tagtggctca atctcggctc actgcaacct ccgcctccca 15181 ggttcaaatg attctcctac ctcagtctcc tgagtagctg ggactacaga tgcatgccac 15241 catgcctggc tgatttttgt atttttaata gagaggggat ttcaccatgt tggccaggct 15301 ggtgttgaac tcctaacctc aagtgatcca cccacctcag cctcccaaag tgttgggatt 15361 acaggcatga accaccacgc ccgacccata taaggtaata tttacaggtt ctggggatta 15421 ggattagcat gtagacagct ttgtgggggc caccattcag cccactatgc taaccctgtg 15481 aaccgttgct cgcttctcct tgacatctga cggcctggcc ttctgcatac cacacaccct 15541 cccacctctc tggccacagt tctgtaggct cagcctcctc cgtaaggcca ttaagtgctt 15601 gtgctggtca aagtttcatc ctaggccttt tccttacctc ccttgatatt ttctccctag 15661 gtgagctcct tcaagcccac agcttctgtg cttacccaca ctcctaccta cattcccagc 15721 ttgggcttct caggccagct ctagactctt gtatcccact gggttcttcc acttaccttt 15781 ggatatctca aaggcatctc cagttggctg ggcacgatgg ttcacacctg taaccccagc 15841 actttgggag gccgaggtgg gcagatcact tgaggtcagg agttcaagac cagcctggcc 15901 aatatggtga aaccccatct ctactaaaaa tacaaaaatt agctgggcat ggtggtgggt 15961 gcctgtagtc ccaactactc gggaggctga ggcaggagaa tcgcttgaac ccgggaggtg 16021 gaggtttccg tgagctgagc tggagccact gcactccagc ctgggcaaca gagtgaaact 16081 ccgtcttaaa aaaacaaaaa acaaaaggtg tctctagtgt aacataacta aaaccaaacc 16141 aatcatgcct ccctcccccg catcctccct cctggaggga gctccaggac ttggtcttct 16201 cttccagagt tctctgtctc aaactgcggg aattgctccc cacccaggcc taacctgaag 16261 tgtgagcctt ggcatctctt tctatccacc tgtttttcct ctatgcacct cacaaccctg 16321 gtccaagcca ccgtcatctt tcaaatggct gcagtagcct ctaactggcc ttggaggagc 16381 catcctcttt ctctaaccag ctgccaaccc tgcaatggcc tctgtgtgct ttccagataa 16441 agcctgactc ctcgtggccc gcacagccct gcctgggtgg tcctatcctg cagcctctcc 16501 agtaccatga accctccctt ctctgaacct ctatttaatc catttcatat accccgtttt 16561 ctcctgccat agggccttgc acatgctgtt ccttctgcct ggaattttct tcctgcctcc 16621 ctccgcaccc ctgccttgtg ttgtgggttc ctcgctatcc tctagctttt cgctcaggct 16681 cattgttggc cctctagatg tattcacttc tcttgtttgt taccctctgt cataggactg 16741 tgttcgtact tcccaaggag tcgtcttggt ttgtgactgt acattttccc atgtgacatt 16801 tgcttaatgc ctctcccact ctggggcctg tacaagcccc aggaacagga cttggaccct 16861 cctgtttaac tctacaatct agcatccagc aggcgcgcag gccttcgttg acttttattt 16921 tattcttatt ttttattttt gagatgcagt ttcgctcttg tcgcccaggc tggagtgcag 16981 tggcgtaatc tcggctcact gcagcctctg cctcccaggt tcaggtgatt ctcctgtctc 17041 agcctcccaa gtagctggga ttacaggtgt gcgccaccac gcctggctaa ttttttgcat 17101 ttttagtaga gatggggttt caccatgttg gccaggctgg tctcaaactc ctggcctcag 17161 gtgatccacc cacctcggcc tcccaaagtg gctggattac aggggtgagc ccccatgccc 17221 agccttcatt gacttttagt tgacaactat ttagcatttg ctatgtgcca agaactccct 17281 gcctactaat gcagttaacc ctcatgaagc ctagaaggaa ggactgccat tctccccact 17341 taacagatga ggatgccgag gcacaggaag tgaagtgact ttctcagggt caagcaggga 17401 gtgagtggag gagccgagat tccagctcta accgcatgat gctctataca gtgtgactcc 17461 ggctctctgg ctgggccctc tccatagccc tgtgagggtt aaggatagaa aacagaggct 17521 cagagagttg aggtcccttg cctgaggtca cacagctggt tggccgttcc ctgggctata 17581 agcttcagta ttcccaatgc tgagcatatt ttgagaaccc gagaaacaga cgtttggctg 17641 ggtgggaact gaactcattt tgtcagggaa ttcaacaact aagttggccc tgagactggg 17701 tgtgaagacc gctctgtccc ctgccagctg gatgacctca ggagagatct gatgactctg 17761 aggtcctgct gataggacct ctggtgtctc tgttccctgc tggcctcccc tgggcctggg 17821 ttgggtttcc tctgcaggag gcagctcatg tatgtgctcc tagacgccct tgggccagca 17881 gctccttggc tgttcctccc tgagccaggg cagccaactt tcttatccag ctctccatgc 17941 tccccacccc agcatgagat gtcagctgag agttttctgg atctccccta gctaggggga 18001 aagcttccat catttggaac aggaacagca ggaacagcaa agtccctttc cccaccatct 18061 cccactgcct gctgtgcttc tcctaacagc tcatggtaaa caccctgact gagcggcagg 18121 ggctgtttcc tttgggctat ccatgtccac ctacactgcc ctttttaatc cttacaattt 18181 ttcttggaca cgggggcata atattccatt gtttttcagt tgaggaaact gaggctcaga 18241 gaggtcaagt gtcttgtctg aggtcacaca gcagaactgg gagtcaagcc agatgggctg 18301 cctccaagga tcctactctt aaactctaga gtactagaaa gatcttccgt tgcctaatat 18361 tgattcctga taggctatgc ttgagtagca tctgcttttg aaaatggagc ctgggtcggt 18421 tgcggtggca catacctgta atcccagcac tttgggaggc tgaggtgggt ggacacctga 18481 ggtcaggagt tcgagactag cctgagcaac atggtgaaac cctgtctcta ctaaaaatac 18541 aaaaattaac tgggtgtggt ggcacctgcc tatagtccca gctactccgg aggctgaggc 18601 acaagaattg cttgaaccca ggaggtggag gttgcagtga gaggagatca cgtcactgca 18661 ctccagcctg ggagacagag cgagactcca tccgtctcaa aaaaaagaaa acgaaaatgg 18721 atcctgaatt ttgaaatatg ctgtgactct tccctagttt gggacatctg ggtcaatccc 18781 ttttgttaaa gtagtttatt tagttggctg agagcgggag ctgcctacgt gacctggagc 18841 acaagctttg gaattgggct tgggttagaa ttccgcctct gccactcacc agctgcgatt 18901 aagaacaaag atactgggtt gggctcctgc ctctattact tgcaatctgt gtggccttgg 18961 atgagatatt taacacctcc gaacctcagt gtcctcaatt gtgaaagaga tcgagataac 19021 agctgaaccc acatcccagg agcggattaa atgagatagt gcagtacaga gtttaccgaa 19081 gtatatgggg tcagcagcca gccagtaaaa tggtggctaa tggttatcat gattaatgtt 19141 aacattaagc tctgaaaggt ccttcgtgaa ctcataggta tttgttctct ctctcccttt 19201 ctctctctct tccccctgcc cccttgcagg tagaactgca tgtccaccta gacggatcca 19261 tcaagcctga aaccatctta tactatggca ggtaagtcca tacagaagag ccctctctcc 19321 ctgggatttg agtggggtcc ccagctccac ccagaggccc ctggggaatt ccagggtcac 19381 tgttccttcc tgtctccctg tgggaatcaa gccagctcca ggccagaagt gggactgtga 19441 ggacatggag gcctcggcac tgagctgcag acccgcagac caactcctga gctttctggg 19501 cctctgagtc ttgtcctcct ggtgtcaggt gagccaggcc tgagcctgct ctccccaccc 19561 acccacatac gtgcatgaag gtagttccca gggctgaatc cgtctttttt tttttctttt 19621 gagatagagt cttgctctgt cgcccaggct ggagtgcagt ggcatgatct cggctcactg 19681 caacctccac ctcctgggtt caagtgattc tcctgcctca gcctcctgag tagctgggat 19741 tacaagcaca tgccaccaca tccagctaat ttttgtattt ttagcggaga tggggtttca 19801 catgttggcc aggctggtct cgaactcctg acctcaagtg atccacccag cttggcctcc 19861 cacagtgctg ggattacagg catgagccac tgtgcctggc tcctgtcttt tgacttaact 19921 gagagcctat atatagcagg tgatgtgctc acatgagatg ccagtacaat ttcttgagca 19981 tctcctagag ctgggctggg ctttatcagc tcattgaatt cctccacgct tggaagagga 20041 ggatacgctc tctgcatttt actgaggagg gaatgggctc agccaagaca gttgtccacg 20101 gtcacacaaa ttaatagcag atcaagagtt gaacccaagg ctgtctgacc cctaaggctt 20161 tactacatca tcagggtcat aacctgctag gagtcacgga aaagtggctc cccaactctg 20221 ggcctaaatc tctgcatctt ccaagtgaga acacacttcc tgcctcagct ctcagagatg 20281 ctagggggcc agagggtccc cctgttcccc agcgaggaag gttcttccct tcctacccag 20341 acctcaaggg ctcacagcag ctcctctctt aggaccagct tttaagggca gggactttaa 20401 aggccagtgg atctggattc aaatttggac atattatctc ctgtctgcga acttggtctc 20461 tatcaactga ggctaagaac aggccctccc tagagagatg acctaggagc taggggctcc 20521 ttgtccaccc agccctgccc ccgcagacct gtgttcctcg gatgtttgca caacactcat 20581 tttgtttgga gctgaaagaa ctcagcctct ctgtcacagt cttgaaattc agctcgggac 20641 ccaaatttga acatttctgc tccataagcc agaatcctgt tattcagagg cctgccctca 20701 tggagagaat gagggatccc gggggttgcc cccaactctc gggagcatct ccaccaactc 20761 cctgagagat ttctggtaag tccactattc tccatctttt cacacttcca gggaccttct 20821 tctgccccag gaagctgcca ttgatttaat tcctatttaa ctgcaaggca taagcacagt 20881 agcacctcct gtgtgccaaa cactccttta agtgcgttac ccgggttaag ttattgaagc 20941 ctcacaacaa tttgtaagat aggaactcta ttgccgtcat ttacagatga ggagactgag 21001 ccgtggtagg tggagtaagg tgcccagtaa gcacagggcg gaggtttgaa cccagatagt 21061 ctgcccccga gtccatggcc ctggccatta ccccctgtca gttagaggtt ttggtaagtg 21121 atgcccgtaa aatgcttagt tcagggccta gcacacatta atgtgctcca taaatgtcac 21181 ttaatgataa tattcttatt aattggagct tatatctcta agtggggtga aacctcttgg 21241 cttatctctg cctggccttt gcccatgtca agccgccaac ttgccacaag gcccctaatg 21301 aggtcgttca gtggggcacc aagatgagat cgaacccagg cactcattaa ggggtcacgg 21361 agggctcatc agctgcagcc aggggctggg agcgccgggt ggggctaaga gaaaggggaa 21421 aggagccgcc gggaggggca ctggtctgat cgtccattcc tcacaccacc tctgggcctt 21481 ggagatggcg tgcggcaggt gccagctgga gcttggcctg aagtcagcag gcaggggact 21541 ggggagtttg tcacactcag atatgggtgt ctgtaaatgc acacaaatat gggctaagaa 21601 tggaaggagg aggggagccc ctggcctgag ccctgctagg cccaattcag tggccctttt 21661 tccagctctg ggactcaggc ctgcctcatt aactgtcctc acccatttct ccttcctcca 21721 gttcccagga ttctggcctt ttcaggggcc tctccaacct ctttctcagt cttgtttata 21781 accctgtcaa ctatttctac agagattctg aaactggctg ctctttcctc cgatcactgc 21841 cctggtctgg gccaccactg cccctccctg gtgctgtggc ctcctgattg gtctcagcca 21901 tctactctgg ccttcctctc tacgggccct gcagtgctgt agttggagca agagccttaa 21961 cccatggtct tcccagctca ttccccagct tccccatctc actcagagtc aaagccaaag 22021 tccacacatg ggccttaaag ttctgcaaag cctgcattgc ctctctgacc tctctaaggc 22081 tccttgctta gtccacactg gatgtttttc aaacatgcca gacctaggaa acagagagtc 22141 tgggttactt gcccaaggtc acacagcctt taagtcacag agctgggatt caaacccaga 22201 ccactgggct tcagagtctg ctctttctca tgacacacaa agtttcattt cttcctctgt 22261 gcacccctac atggaaaata ttatgtttta ctgacaaggg caccaagggc cttagagggg 22321 agcgctcctg cctgggatga tgtggtaaat aggggtggga gatggacttg acctgcaacc 22381 cctgcgctca tcctccctcc ctccctgggc tcctgatggt gggcttcttg tgactgtgtt 22441 gcccaccaag gccggaagag gaccagacag tgccccagca cagcagctgt ggctgaccag 22501 ggagtaggga tcatctaaga acagagcgtg catggtgctc acgcctgtaa tcccagcact 22561 ttgggaggcc aaggcgggtg gatcacctga ggtcaggagt tcaagaccag cgtggccaac 22621 atgggaaacc ccgtgtctac taaacataca aaaaattagc caggcatggt ggtgggcatc 22681 tataatccca gctacttgag aggctgaggc aggagaatca cttgaaccag ggaggtgaag 22741 gttgcagtga gtcgaggtcg tgccattgca ctccagcctg ggcaacaaga gcaagactcc 22801 gtctcaaaaa aacaaaacaa aagaaaaaac agagggtggc cctatgagga gccttcgctt 22861 gtgtgggtgg ccagggacag caagaggtgc cagggcccta ggaacagctc tttcctgctt 22921 caactttggg ctccagatgg gcgctttcca gctcagtctg agcagcttcg ggaagctgtg 22981 tcccatggga gacactggga gtcccctgtg ctctttgtct cctgtcgggc ccccacatta 23041 gctctctggc ctcagctctg gcttccctcc aatttgtttc ccacgcagca gccagaggag 23101 ctttcaaaaa ggtaaattat ttcatgctag tcccctgctt gaaatcctac agtgccttcc 23161 cagtgctttc agccaaagcc ccagtccctt cctaagccca gcctggccct gcctccctgg 23221 tgcatcatct gcacaaatgc ctgctctctg acctccagcc accctgcact tccaatgccc 23281 gcggcttcct gcctgcagct ttagtacaga cccctcccct gcccagaact gcccccaccc 23341 caaggcttct gctgaaatgt cacctcctca gagaggcctt ccctggctgc tctgtctaaa 23401 ctctgtgttg agaagttcct tcttgatggt tgttgaggag ggaggctgga gaagaagaat 23461 caaagaggag aaatagaaag caaaataatt tgttcttggg gacgggctgg tgctgggcac 23521 ggggaggcgc ccgtctctgg tgtgggcagc tgggtagatg gaggagccgt atttggaaat 23581 gtggaaccca ggaagggagt gatctagagg gaggggaaag gtggcgcgag atgcctgcct 23641 ctcaacaggt agccagacac atgggtctgt cttggtcact gctatctgcc cagtgcccag 23701 cacatcacag gccctcagtg gtggtgtgtg ggcatagaga attagaagct gtggacctct 23761 ggatccggag ctgaaaacca ccaaaggaga tgagttggcc tggccaggtg tgtaaaaggc 23821 agagtctgag agagaacgac cagagggcag agccccgcag gtggagtcct gggggctgga 23881 gggagaccat taggagaatc gcacatggct ggcgcagcag gtcccaggca aatgtggcca 23941 ctgggtttgg caatatggga gccagagccc tagtgtcatc tccctgcctt ctacccagca 24001 gttcccagag tgatatcccc aacagtgttt gacaactggt acaggctctt cagcggccac 24061 agttactggg caaggccttg tgagggtgac tttggggcag ctggccagca gtgggagggg 24121 aagcagtctc aggggtacct gaggcactga gctccgacct ccaggtgcca atgccgcacc 24181 agggcaccgt tcccctgcag gctcttacag ggattagggg ctggtaagga gcagtgatta 24241 ggggctgact agcaggctgg tgggcaccag catgacccct tggtggtacc ctctgggcac 24301 tcatggggac ttgggctaac agatggggaa gggagcacat tcagggggct taggaaacat 24361 atttatgtag ggaagcattt taatatttta gtaacagaag ctattaaagg acttacaaac 24421 ttacttacat acactaaaac actatttggt caaacttctg tttctttggc actttcctcc 24481 tttattcttt tttatttttt tgagacaggg tcttgctctg tcacccaagc tggagtgcag 24541 tggtgcaatc ttggcccgca gtagccttga cttccaggct caggtggtcc tcccacctta 24601 gcctcccaag tagctgggac tacaggtgca cgccaccacg cctggtgaat ttttgttttg 24661 aaggggtttc actgtgttgc ccaggctggt ttcaaactcc tgggcttaag tgatccgcca 24721 gccttggctt cccaaagtac tgtgattaca ggtatgagcc actgcacccg gcctcctatt 24781 tttctgcttc tgctttgtgg ataattggat gcttggacct cctgatttaa tcttctaatt 24841 tccttaactg tttactccta tttttcatca tcttgtcttt ttgttctact ttgtggagga 24901 tttcttcact tttagcttcc agttcttttc ttacatcgtg acagttgctg ccgcattctc 24961 ttgtaaattt ccgagggctc gttcttgggt tctgaatgtt ccctcctttc aaggatcttc 25021 tcatctcttt gaggatattc atgtcttttt tgttttggtt cttaggtttt catctgttct 25081 ctgtgctgtt tcctcggagt gcttttgtct attctgttgt tttgtccctc atgttagaag 25141 catttctttt ttttttcttt ttttttttgt gatacagagt cttgctctgt caccaggctg 25201 gagtgcagta gcatgatctc ggctcaccac agcctctgac tccctggttc aagtgattct 25261 cctgcctcag cctcctgagt agctgggatt acaggcacac accaccacac ccaactaatt 25321 tttgtatttt tggtagagac ggggtttcac catgttggcc aggatagtct caatctcctg 25381 acctcatgat cctccgacct tgcctgggag gccaaagtgc tgggattaca ggcgtgagcc 25441 accatgccca gcctagaagc atttcttaat gtctggtgtt ctctggctgt tgtatcttaa 25501 aaaaaaaagg ggggggaaac tgaggctcga ggtgaccttg tgagctggag cagagccggg 25561 atgggatgag gaggcaggag cgtgtgcaga agagagggag cccccctgag ctcgcaccct 25621 gcttcccgtg gctgggaggg gaggccgaga tgcttgggga gaaatggagg ctccaagcca 25681 gaggggctgt ttccagcacg ctcttactga gcgctgctgt agtccagctt ggtgtggcgg 25741 ctgtgggcag ggaggggaga gaggtctgag ctggctggcg gcccactggg cccctcccct 25801 gagcctccac cggccctctc ccagtgcgct gggctgggca agcctctgat gtgccagcca 25861 gatggagggt gaagtcctga tgcctgcccc taccctggga attgtgatgc tgcagttact 25921 gcccctgata acccctgact gggcatagga ccagctggct gagccagctc ctggggctga 25981 ggaggaagcc atgaacttga cctggcactt tccttgtctc caagcatcag tcaaccaagg 26041 atatggaggg ggtgtgtgca tgtgtgcaca catacacaca cacacacaca cacacttcaa 26101 cctgtttatc ccccttgaga tttgctgact tgtgcattgg gggtagaagg tgctggaaaa 26161 attccggtcc tggttctcag tttccccatc tgtccagtgg gagcagctgg actgagagac 26221 gcccatgtct cctgctgtgg tcctgcaagg aggctggcgc tcctgagtct gctccatcct 26281 ggcctgtcag gcctgcctgg atcctgcccc gggttggtcc accactcact gttttgtttc 26341 caggaggaga gggatcgccc tcccagctaa cacagcagag gggctgctga acgtcattgg 26401 catggacaag ccgctcaccc ttccagactt cctggccaag tttgactact acatgcctgc 26461 tatcgcgtga gttgccccca acccacaggt cctagggcag cattgatccc tatgactagg 26521 accaggcctg tccctcagcc tgtgggggcc agagaagttg ctctgaaacc acagctgtct 26581 ttctcaccat tgtgtacact tagtgagtct ctccagtgcc tttaggcctc agttttccct 26641 tctgagatgt gggtgtgatg gactgaaatt gcttcaagtt ctacagagaa atggcagaat 26701 atgggagcta agaacacagg gtcagaggca gtgcagggct tgaacccggg ccatctatct 26761 cctagttcag ggcttcgtgt tgtgagggga ggagaggcct gaatataggg tgggggcggg 26821 gagatgtggg gaagattctc caaaaggctt tttctttttc ttgtcttgag tcgccaggga 26881 acagcactag gtaccgaaaa ggccagaagg ggtatgggcg agtactagag agaaatttcc 26941 atgactgctt tatttattta tttatttatt tatttattta tttattgaga cagagtctca 27001 ctctgttgcc caggctgaag tgcagtggtg cgatctcagc tcactgcaac ctccacctcc 27061 cagtttaagg gattctcctg ctttagcctc ccaagtagct gggatcacag gcacccacca 27121 tcacacccaa ctaatggttt tgtattttta gtagagatgg ggttttacta tgtttgccag 27181 gctggtctcg aattcctgac ctcaggtgat ctgcccgcct cggcctccca aaatgctggg 27241 attacaggcg tgagccactg cgcctggcct ccatcctcat cctgaagatg caagaacttc 27301 tggtgacccc ttctcctgag agtggcctga tctcccctgg gcagggcact ttcttcccac 27361 gctgggctct cccacgactt gtgtgccttc cctcacacat tctagtaacc acttcatttt 27421 cactcttcat ggtgggaact tccagctaag cacagtccac cgttacgtga tcaacacagt 27481 ggccctggca ggccaatttg tgccttgctt ctggaacaaa catgcagtaa taacaacgaa 27541 aatgttttga gcatttgtcc gctctgctcc aagcactgac ccgggtgggg tttatgaagt 27601 ttgactcatt tgtccccgca ataactcctt gacctaggtg tcagagggtg actaaccagg 27661 ggtcacacag cagataagtg tgggcacaag gatccaagtc catgactgta tcccacgtgt 27721 ctcccacatc caggcatccc tctggacttg tccagctgtg tccttttctc tcatttctct 27781 tccctgccag ccttaactcc atcaccaaca aatattgggc tactctgtcc taggcatggt 27841 cctcagctga gaggtcgcag ccatcccaag acagaggggt ccttgccaca tggagactgc 27901 attctagtag ggaatacagc aaactggctg ataagccata tgacacacaa tgttgagtag 27961 tgataaggac ctgggagaaa aagaaagccc aggagaatgg tggaggggcc gttttaagat 28021 aaggcggtct gggccaggta cagtggctca cgcctgtatc cccagcactt tgggaggctg 28081 aggtgggcgg atcatgaggt caggagatcg agaccatcct ggctaacaca gcgaaacgct 28141 gtctctacta aaaatacaaa aaattagccg ggcgtggtgg catgcgcctg taatcccagc 28201 tacttgggag gctgaggcag acgaatcact tgaacccagg aggcagaggc tgcagtgagc 28261 tgagatggcg ccactgcact ccagcctggg cgacagagca agattctgtc tcaaaaaaaa 28321 aaaaaaaaga taaggtggtc agggaaggcc tctctgagga ggtgaagctt cagctggctc 28381 taaaccaggg gagcgggaga gacgcagtgt aggacagtat cggggaagag caggcctgtg 28441 tcttctccgg tggcctcagg gaatgaggga gaaggaaggt gctggggagg ctggcaaggc 28501 tggaggatgc aggcttgtgg gcaggacctg ggagttgcga tgtcactctc cgtggcagga 28561 agctactggg gcttcgaggg gagaagtgat atgctttgat ttaccttctt aaaagattgc 28621 cccaactgct gggtggagaa caggatgaca ggggcaagca tggagacagg gaggccagtt 28681 agagatggcg tgattcaggc caggatgagg ggtgagaact ggtatgcagt tccaaagtag 28741 agctgatagg acttgcccag tgtctggatc ttatccagtg gatgcccaga gcttgggtct 28801 ggggatgaag tgggtttaat ctgccaaggg ttggggatgt catttgctcc tggagctccc 28861 aagggacttg gggaaggttg ttcccaaccc ctttcttccc ttcccagggg ctgccgggag 28921 gctatcaaaa ggatcgccta tgagtttgta gagatgaagg ccaaagaggg cgtggtgtat 28981 gtggaggtgc ggtacagtcc gcacctgctg gccaactcca aagtggagcc aatcccctgg 29041 aaccaggctg agtgagtgat gggcctggaa ggggccatgc tgagggtgtg gctgggaggc 29101 tcagctctga gactggaagg gcgaactgct gggaatccct gacccaagca agaccttgtt 29161 cttgccccca gtctggtcca tggcctcaga aagatgggtt taactctgtc acaagagacg 29221 tggttcccat cctccctttg ccgttatgtt cttaccttgg gcacaagtgt ttggctgtgt 29281 cttgctctgg ccacaggcct gctgtccagg aatgttaacc tgcttagcca cccaggattt 29341 ctgaggggtc tcccttgtca ctgatgctga tcagatctct aaaggcccta aaggtcctgc 29401 tctaacttca taactgaagt gagtctggcc catttctagc cccctgcctg ggcccccatg 29461 gatctctaag tggtatcaca aaaccaccct gccccatttt ctgagccatg attctgatac 29521 atatagaatg tgaacatcat ggcaggccca agcttagcaa tgctgtccat ctgggggtgg 29581 ggagggccat gttgacaccc cacacctccc actaagatct aggagcaccc agctgcttta 29641 agagctagag ggacatgcta gggcctgggg gcatctctgc cagtctttcc tctgaggcag 29701 tgggtcagtg ggggaggagg gtcctcccca aagcctcctc ttcctcctct gtcccagtcc 29761 cagagctgcc ctttaggcct tccttttgcc tcaggcccat ccctactcct ctcctcacac 29821 agaggggacc tcaccccaga cgaggtggtg gccctagtgg gccagggcct gcaggagggg 29881 gagcgagact tcggggtcaa ggcccggtcc atcctgtgct gcatgcgcca ccagcccagt 29941 gagtaggatc accgccctgc ccagggccgc ccgtctcacc ctggccctga cctcctggcc 30001 tagcagtggg gctgtacctg atctcccctg tgccccacag ccccatggtg tccccttgag 30061 cccactggca tgaacttggg gcttcatgaa acaactggag acctcctagg caggctcaga 30121 acttctggag atgttctccc cagggacacc atgcctttat agccaccctg caggaagctc 30181 aacaccaaat aggaacgtaa ctattgaaaa aaaaatctag gctagattct gatcagccca 30241 tagtcctccc tcgagaccca gtggaccagg ccccatcctg tctgggcctg aataggtctg 30301 atttccaaga tttctgaggg gtctcccttg tcactgacgc agatcagatc tctagagttt 30361 gtgcctcatg gtgcacagcc tcactgtgtg atattgggca ggtcacactg ctgctctggt 30421 tatgcaccaa gacacctcag ttgtgcactg tcacaaggag atgatcacac ttacttcatt 30481 cctctaccct caggattagt aagaaccaaa gagctacctg cacgcatttc ctctaatcct 30541 cgcagcagcc tgcaaagcag aactaccatt gcttagtccc atttgacaga tgaggaaact 30601 gaggtggagt gaggtgcagc ctcttgcaag gcacaaaccc tggatttgta tccggggaca 30661 tctagttcca aagcctgtgt tcattcattc tttcttaaac acttcagaat aactttattg 30721 gttaagagta cctaatacat tagcgagata cttcccaata ctagtgtgag ttctatttta 30781 gatgacgtgt taaacggtcc tccgtttcct catctgcgca tgggaataag cctaccatga 30841 gtgttgttgg aaacaccagg tgagagaagg gtccgtgtca tttactgagc tcaggccccg 30901 tccttggtgc tttacacaca tggcctcggc aaagcctggc cgtgaccctg tgcaatagct 30961 ggcagggttc tttctgaaaa gggcggaaac tgaggccata agcagagcag ttttccgcag 31021 ccatgtggtt aggacatagc agttaggatt tgaagacact gagccctgtt ttgtgctggc 31081 ctcccatggg gggtttgggt gggacagcag gcaggtaggc tgggaggtct ctccatggtg 31141 ctggtgacag agcctgggtg ggcatctgcc cacagactgg tcccccaagg tggtggagct 31201 gtgtaagaag taccagcagc agaccgtggt agccattgac ctggctggag atgagaccat 31261 cccaggaagc agcctcttgc ctggacatgt ccaggcctac caggtgggtc ctgtgagaag 31321 gaatggagag gctggccctg ggtgagcttg tctcccaccc atagttggga gaaatcacaa 31381 gaaccaggga ccatggtgtc tcctgagttc tgaagtgtgt ctttgttggg tcttaaggct 31441 tggaactgga atccccctgg gccaggcgtg gtggttcatg cctgtgatcc cagcactttg 31501 ggaggcgagg caggaggatt gcttgagcct aggagtttga gaccagccag ggcaacatag 31561 tgagatccat ctctgcaaat acaaaaaaaa gtagtcaggc atggtggtgc atgcctgtag 31621 tcccagctac ttgggaggct gaggtgggag aattgcttga gtccaggaag tcaaagctgc 31681 agtgagctgt gataatgcga ctgcactcca gcctgggtga cagagggaga ccctgtctca 31741 aaaaaaaaaa aaaggaagaa agaagaaaga gaaaagaaag agaaagaaag agaggaagga 31801 aggaaaaaga ggaagggagg gagggaggaa ggaaggaaag aaggaaggaa gggagagaga 31861 aagaaaagcc tccacttggt gttgggagtc ctgtgctgag cctgcttctg gctgtgattt 31921 gctgtgtgaa cctgggcaac actgtgtctt ctctgggcct ctgtttcttc tattgggatg 31981 actgagttgg agccgacatc tcaaaagtcg cttccagcgt gatgatgaat gggcctcctg 32041 tggagggtgc agcatggtgg agaagtcagg gctctggagt cccactgccc gggctcagag 32101 cttggttcca cacttcctgt ctgaccttgg tcacattact tgaatctcct gagcttcagt 32161 ccttcatcat aaaatgggtg ggataatagt tgtgaatatt agataatgta tacaagtcac 32221 ttcatatact acctgacaca tggtaactgg ctaatgagtg acagctacca cttagataag 32281 gacttggagg gtaaaagacc aggtttcccc atgctgttga agcaggcagc atgactagga 32341 tggttcaatc tccacagcat ggtcaaggca ggctgccggg gccctcccgc tagggcaccc 32401 atgacctggc tctccccctt ccaggaggct gtgaagagcg gcattcaccg tactgtccac 32461 gccggggagg tgggctcggc cgaagtagta aaagaggtga gggcctgggc tggccatggg 32521 gtccctcctc actgcctcct cccatacttg gctctattct gcttctctac aggctgtgga 32581 catactcaag acagagcggc tgggacacgg ctaccacacc ctggaagacc aggcccttta 32641 taacaggctg cggcaggaaa acatgcactt cgaggtaagc gggccaggga gtggggagga 32701 accatccccg gctgtcccaa cttcctgtat agagaggcag aaagcagggc gggtcccagg 32761 aactcgaggg gtggccccag gcccagacat ggggggagga atcagcatgg cctggggcca 32821 tccctgccag ccacacacct gctcttccag atctgcccct ggtccagcta cctcactggt 32881 gcctggaagc cggacacgga gcatgcagtc attcggtgag ctctgttccc ctgggcctgt 32941 tcaattttgt tccaggaagg ccaaagaggg aagaaacttt agggattggg catcagccca 33001 tgccgcgtct tttagatatg aaatctcttc gacaccctgg gaagcaggca ttgccgtcct 33061 catcttacaa atgaggaatc cgaggcccag atgtgctgtg gcttgactgg gattacccag 33121 ctgctaacca gcagagctgg ggccctacag ctcatcagct ggagcagaac gctccattac 33181 tctgagggaa gcttccacac ttccaattct cccaactctg ccccctgggc atcgcatagg 33241 aagcaggagt ccctctggcc agcatgttct ctcttcctga cacctggccc ttgggacccc 33301 tgggcattcc cctgagcgcc atcttgaagc tttccaccgg aggtctgttc caccctgcct 33361 ggctcccatc ctggagtcta accagggtca aggccctcct tccgtcctgt cgccaagcca 33421 caggagcagt atcaggcctt aggaaaaagc cgccttcccc aagacaagga cagcaagaac 33481 tcagggtgac catggtcagg ccagcactta tccatctgcc aggcatatga gaaggggagg 33541 ggcttcggct ctgatgttct gatgacaagg gggtcttggg gcttgcttag ggacacgtgg 33601 cacctgtgga ggttcttgga ggcatgtggg tataccatgg gctggaaaaa gatccaggag 33661 tcatctgcac agatatggtg gctgaaggag aagcagtggc cccaggaggt ggtggagcaa 33721 gaagggccta ggatagaacc cagaaggaca atggtattta agggaccagc aaaagagaca 33781 agtaggagga aagtcaaaag tgtggtgtca cagaaatcca gggaaaaggt ttcaagaaac 33841 agtcaacagt gtgaaattct gctatgcaag tcgattatgg tcagagctag gaaagatcca 33901 ttagatacaa caagatggtg gtcagggatc gtgccaagaa cagcttccat ggtatgttgg 33961 agtagccagc tcccagtggg actgaggaac aagcagggta gggtgcagag gggaaggctg 34021 gagagggtgg cagccggagg gggatgttgc tttcttggct cccaccccca cgcccccacc 34081 ggctgccatt ctgcctggtt cccatgtctg gcccctctgc tgcctttgcc cagctctggt 34141 cttcaggatg ggctggattc tggactttct ggttacatag acttgaacaa gtcacctaag 34201 ttctgaattt atttccccct ctgcacaagg atcagatctt tcagatctgt ttgaggctgc 34261 tgtgaggatc aaaggcgggt gaacgtcaat gtgttctgac tatttatgta agagtaaaag 34321 gaggctgatt ctctcctcct ccctcttctg caggctcaaa aatgaccagg ctaactactc 34381 gctcaacaca gatgacccgc tcatcttcaa gtccaccctg gacactgatt accagatgac 34441 caaacgggac atgggcttta ctgaagagga gtttaaaagg ctggtgagtg ggtgtgagcc 34501 atactggcct tgactcgggt ttgggagtat ggtatctaca ggtccagtcc ggggcctgga 34561 atctttggag agagggagtg agtctgcctc aacagtccaa gacaagccca acctagacac 34621 tttccacaga gaagacatct ttgtgttgac gtcctgacct aggaccaggt ttttgatcct 34681 ttgcttgggt tgagtgcctt taaagaatcc agtgaaagct gtcaaccctc tccccagaaa 34741 ggtgtgtgca gcagctatga agtcttgcac actctcttca ggttgttctt aaatcccagg 34801 ctgaataagt ccattcctgc acgtgtctgc gaggtgtctc tggcccccta catgccaccc 34861 tgtctctcaa aggtttctcc aacttccttc tcacagccct ttttcatgta atgacaaatt 34921 aagaacacga cctcatggtc tctactctgg cacttgctgc cgtgtgacag tggacaaatc 34981 cttccccctc taagcgtatc tgcccatgtt gagtgaagag gatggactat cactacattg 35041 ctaagagctg ccttctttgt tctctggttc catgttgtct gccattctgg cctttccaga 35101 acatcaatgc ggccaaatct agtttcctcc cagaagatga aaagagggag cttctcgacc 35161 tgctctataa agcctatggg atgccacctt cagcctctgc aggtaggttc ctgtctgggc 35221 ttctgggcag ttgcctgtcc tggccccagt gtggctttct gtgggacttc tagcaagatg 35281 cccttccatt cttgggcagc gcatgaatgt gtgatgactc cctggtttct gggccctggc 35341 tgggagcagc gtctcattag atcggtttgt tttctataaa agttcttgag aggctgttct 35401 aaggggagac tttctgaagc ccagtcccaa aggtctgggc agttggggac acctccatgg 35461 ctgcccaaag ccaagggcag ggagaggggc ccaggctgtt ctgctccttt cttcctatgt 35521 ggtcttggca aggcatcttc ttgccatcat aggaaggagt tcctttctgg ttctggtgtt 35581 ctatgatttt tacaacatcc tgggtactac aagttgcctg atctttttgc ttctctgaac 35641 caacgagcag ggcagaacct ctgaagacgc cactcctcca agccttcacc ctgtggagtc 35701 accccaactc tgtggggctg agcaacattt ttacatttat tccttccaag aagaccatga 35761 tctcaatagt cagttactga tgctcctgaa ccctatgtgt ccatttctgc acacacgtat 35821 acctcggcat ggccgcgtca cttctctgat tatgtgccct ggccagggac cagcgccctt 35881 gcacatgggc atggttgaat ctgaaaccct ccttctgtgg caacttgtac tgaaaatctg 35941 gtgctcaata aagaagccca tggctggtgg catgcagcag gtggcatgta atttggtggt 36001 cttgggcggg ccgatgtggg caggatgagc atggagggag ctgggtcagc ctgctcagca 36061 gcagggcctg agcctaaggg tggctgtgaa tgccaggcca gagatcccaa tgctgtgggc 36121 caagaggggt ccagaggctg tcctccttcc agaagaaata aggcttctct ggttgttgct 36181 caaacattcc ctgaactctc agcccctcct aactctaggt tttaaggagt aaagcttcct 36241 tttgggttcc tgaagctggc agttggggtg agagcagatg agatggaaga gggctcatca 36301 gacactggcc ttggagggtg ctggcctctg cagaacgcca gcatcttctc agaatcgtat 36361 gttctagaag cctgggcgaa gtccggctaa ttgtggactt ggggaaaata aggcccaacc 36421 cctgtttttg caaggttaag gagaaataat cttaaaccag tcacacaaat catcggcatt 36481 tatttcctgg gtcctaggtg tcacttatcc tggtggacag ggcagaggtg gtcagatcgt 36541 tttgagccaa aatcccttcc ctaaaaatgg atctgtggag ctccatgagg gaacctcaga 36601 gatgcacaat gacagtttag ctaaaatggc ttaaaaaatg tgaattgatt gtcagctctc 36661 tccatatctg ctgaaaaaag gtttaaaatt tttaaaaagt ttaaaagtgt tttctaaaaa 36721 agggacaagc aggtctggac c // LOCUS HUMSMPD1G 5588 bp DNA PRI 31-AUG-1995 DEFINITION Homo sapiens acid sphingomyelinase (SMPD1) gene, complete cds, ORF's 1-3, complete cds's. ACCESSION M81780 M81781 NID g972768 KEYWORDS SMPD1 gene; acid sphingomyelinase; sphingomyelinase. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Schuchman,E.H., Levran,O., Pereira,L.V. and Desnick,R.J. TITLE Structural organization and complete nucleotide sequence of the gene encoding human acid sphingomyelinase (SMPD1) JOURNAL Genomics 12 (2), 197-205 (1992) MEDLINE 92155708 FEATURES Location/Qualifiers source 1..5588 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11p15.4-p15.1" TATA_signal 219..223 /gene="SMPD1" /note="G00-128-144" gene 219..5488 /gene="SMPD1" promoter 245..254 /gene="SMPD1" /note="CAAT/TATA; G00-128-144" protein_bind 397..402 /gene="SMPD1" /bound_moiety="Sp1" CAAT_signal 408..413 /gene="SMPD1" /note="G00-128-144" protein_bind 508..513 /gene="SMPD1" /bound_moiety="AP1" protein_bind complement(529..534) /gene="SMPD1" /bound_moiety="NF-1" protein_bind complement(651..656) /gene="SMPD1" /bound_moiety="NF-1" protein_bind complement(827..832) /gene="SMPD1" /bound_moiety="Sp1" protein_bind complement(845..850) /gene="SMPD1" /bound_moiety="Sp1" protein_bind complement(856..861) /gene="SMPD1" /bound_moiety="Sp1" protein_bind 962..967 /gene="SMPD1" /bound_moiety="Sp1" exon 1031..1428 /gene="SMPD1" /note="G00-128-144" /number=1 CDS join(1117..1428,1894..2666,3728..3899,4129..4205, 4408..4553,4710..5119) /gene="SMPD1" /EC_number="3.1.4.12" /codon_start=1 /db_xref="GDB:G00-128-144" /product="acid sphingomyelinase" /db_xref="PID:g972769" /translation="MPRYGASLRQSCPRSGREQGQDGTAGAPGLLWMGLALALALALA LALSDSRVLWAPAEAHPLSPQGHPARLHRIVPRLRDVFGWGNLTCPICKGLFTAINLG LKKEPNVARVGSVAIKLCNLLKIAPPAVCQSIVHLFEDDMVEVWRRSVLSPSEACGLL LGSTCGHWDIFSSWNISLPTVPKPPPKPPSPPAPGAPVSRILFLTDLHWDHDYLEGTD PDCADPLCCRRGSGLPPASRPGAGYWGEYSKCDLPLRTLESLLSGLGPAGPFDMVYWT GDIPAHDVWHQTRQDQLRALTTVTALVRKFLGPVPVYPAVGNHESTPVNSFPPPFIEG NHSSRWLYEAMAKAWEPWLPAEALRTLRIGGFYALSPYPGLRLISLNMNFCSRENFWL LINSTDPAGQLQWLVGELQAAEDRGDKVHIIGHIPPGHCLKSWSWNYYRIVARYENTL AAQFFGHTHVDEFEVFYDEETLSRPLAVAFLAPSATTYIGLNPGYRVYQIDGNYSGSS HVVLDHETYILNLTQANIPGAIPHWQLLYRARETYGLPNTLPTAWHNLVYRMRGDMQL FQTFWFLYHKGHPPSEPCGTPCRLATLCAQLSARADSPALCRHLMPDGSLPEAQSLWP RPLFC" intron 1429..1893 /gene="SMPD1" /note="G00-128-144" /number=1 exon 1894..2666 /gene="SMPD1" /note="G00-128-144" /number=2 intron 2667..3727 /gene="SMPD1" /note="G00-128-144" /number=2 repeat_region complement(3010..3302) /rpt_family="Alu" exon 3728..3899 /gene="SMPD1" /note="G00-128-144" /number=3 intron 3900..4128 /gene="SMPD1" /note="G00-128-144" /number=3 exon 4129..4205 /gene="SMPD1" /note="G00-128-144" /number=4 intron 4206..4407 /gene="SMPD1" /note="G00-128-144" /number=4 exon 4408..4553 /gene="SMPD1" /note="G00-128-144" /number=5 intron 4554..4709 /gene="SMPD1" /note="G00-128-144" /number=5 exon 4710..5488 /gene="SMPD1" /note="G00-128-144" /number=6 polyA_signal 5483..5488 /gene="SMPD1" /note="G00-128-144" BASE COUNT 1135 a 1656 c 1495 g 1302 t ORIGIN 1 ccctcttcct tacctagtcc cttgtctaat tgatcgtgta atagtacact tccccagaca 61 agtccacact aatataagaa ccaaaacatc cctccccttc tttttttggc aacaaggaaa 121 actacccaga cctgcgatcc attgccgaaa ccctctctcc agagccctca tccttccggt 181 ctgtgtggaa ttccgaattg aatcattcag tttggtgcta taaaacccat acctaaagac 241 tcgcggttat attatccatg aagaatccca acaacactct taacttctaa taattaatat 301 ttcacggacc caaccacgaa ccggtagtat ttgttgaggt atttacgagc gaaaatgaca 361 gcacctatgt gcctccaccc tctgggttaa cccaagggcg ggtacaataa cccggggtgc 421 cgatatacca gaaatgccga aggatcagag aagtggtaga gattccaaac aaaggagtag 481 acttagtgtc cactcactaa gagtcccact gagttctccc gaccacgtca ccgtctcgta 541 cccttagctc ccttccgcgg aaggacagtc tccgttgtgt gggtcaccct cctaggacgg 601 gcgagacgag agaggatgga ggggaagggc ccgatttgcg accccagcca gaccgtcgat 661 aaggccctta gactcgcgcc taagactgtt tcctctgcag aaggtggctg gtgtagtgta 721 cctcgaggtt tcgtgctcgt gccggagggc cccgacactg gagttccgcc tcggggacca 781 ctggagtccc tctcaggggt gggggcgtcg ggcacggggc cccgtcccgc ccccgtccct 841 ctcccccgcc ttagccccgc cagggccctc gcggggcggg ggaggcggag gcgtcgcaac 901 tggacgtcga cagccgcccg ccaccgagag atcagctgtc agagatcaga ggaagaggaa 961 ggggcggagc tgctttgcgg ccggccggag cagtcagccg actacagaga agggtaatcg 1021 ggtgtccccg gcgccgcccg gggccctgag ggctggctag ggtccaggcc gggggggacg 1081 ggacagacga accagccccg tgtaggaagc gcgacaatgc cccgctacgg agcgtcactc 1141 cgccagagct gccccaggtc cggccgggag cagggacaag acgggaccgc cggagccccc 1201 ggactccttt ggatgggcct ggcgctggcg ctggcgctgg cgctggcgct ggctctgtct 1261 gactctcggg ttctctgggc tccggcagag gctcaccctc tttctcccca aggccatcct 1321 gccaggttac atcgcatagt gccccggctc cgagatgtct ttgggtgggg gaacctcacc 1381 tgcccaatct gcaaaggtct attcaccgcc atcaacctcg ggctgaaggt gagcactgaa 1441 ggggctgcag tggaggaggc cgaaaggagt gctggggctg ggggctgggg ctgatgctgg 1501 tgcgctgggc tcagaatgca tccctgatgg agagggtggc atctacaatc catcactgag 1561 tttgctcccc tttggggaca cccatggcta catgccacca tcaccccatt gtgacctttg 1621 tgaagtaaga aataatgcag acagtgcctg aggaagtcag cttgccaagc aaaggcctca 1681 tgccacaggc cgctgagcta aagaagaagc gatggcctgg tgctgcctga gttacagggc 1741 aatatctgga aggcaaaggt gtgcactgag cttggtgcac tgagtcctgc ccagccccag 1801 tttggaaatg gaggccaagg ggtggtggcc aggggttggc ctggttcctc tgctctgcct 1861 ctgatttctc accatgcgct cctcccactg cagaaggaac ccaatgtggc tcgcgtgggc 1921 tccgtggcca tcaagctgtg caatctgctg aagatagcac cacctgccgt gtgccaatcc 1981 attgtccacc tctttgagga tgacatggtg gaggtgtgga gacgctcagt gctgagccca 2041 tctgaggcct gtggcctgct cctgggctcc acctgtgggc actgggacat tttctcatct 2101 tggaacatct ctttgcctac tgtgccgaag ccgcccccca aaccccctag ccccccagcc 2161 ccaggtgccc ctgtcagccg catcctcttc ctcactgacc tgcactggga tcatgactac 2221 ctggagggca cggaccctga ctgtgcagac ccactgtgct gccgccgggg ttctggcctg 2281 ccgcccgcat cccggccagg tgccggatac tggggcgaat acagcaagtg tgacctgccc 2341 ctgaggaccc tggagagcct gttgagtggg ctgggcccag ccggcccttt tgatatggtg 2401 tactggacag gagacatccc cgcacatgat gtctggcacc agactcgtca ggaccaactg 2461 cgggccctga ccaccgtcac agcacttgtg aggaagttcc tggggccagt gccagtgtac 2521 cctgctgtgg gtaaccatga aagcacacct gtcaatagct tccctccccc cttcattgag 2581 ggcaaccact cctcccgctg gctctatgaa gcgatggcca aggcttggga gccctggctg 2641 cctgccgaag ccctgcgcac cctcaggtac ttatcgtccg tggaaaccca ggaagggaaa 2701 agaaaggtga atgaaagtga agggagaagg gaacctgggg cattgtctct gattgctcta 2761 gcatgagtcc ttagtgctct tcatttggct cccctaatct gactcctcct tccctttcta 2821 ctgttttgcc gcaccaggct tttttttttt tttttttttt agtttagttt ttgtagagac 2881 aagatcttgc tatgttgccc aggctggtct caaacaccta acctcaagca atcctcccgc 2941 ctcggcctcc caaaatgctg ggaccacagg catcagctac tgctcctggc cctccctttt 3001 tttttttttt tttttttttt ttttttgaga tggaatcctg ctctgttgcc caggctggag 3061 tgcagtggca ccatctcagc tcactacagc ctccacctcc tgggttcaag caattctgcc 3121 tcagcctccc aagtacctgg gactacaggt gcacgccacc acacccagct aatttttgta 3181 tttttagtag agatggggtt tcaccatgtt ggccaagatg gtcttgatct cctgacctca 3241 tgatctgccc acctcggcct cccaaagtgc tgggattaca ggcatgaacc actgcaccca 3301 gctttccagc cctccctttc tactcttatc tccagccacc ctccttcaaa ggtctggcag 3361 cataacctct ctatgcccca gctgtgtctt tgctcatgtt ggccctctgg aaatgatttc 3421 cccctttttt ttaagtgctc cagttttttc ccaccttatc catcccatgt catcttccct 3481 ctgtgtggtc cttgcttccc attctagcta actcttatcc ctcccccata ctcctggagc 3541 cctctgccct cagagtcttt tgtgtcacac agacccaata attagaactg tttggtctct 3601 ggctagactg tgagctcctt gcaggtgggg aagatgtcat gtatgctttt accctccacc 3661 caaatgccca gcacaggagg accaggattg gaacaagtgt tgacctctca tgtttacttt 3721 gtttcagaat tggggggttc tatgctcttt ccccataccc cggtctccgc ctcatctctc 3781 tcaatatgaa tttttgttcc cgtgagaact tctggctctt gatcaactcc acggatcccg 3841 caggacagct ccagtggctg gtgggggagc ttcaggctgc tgaggatcga ggagacaaag 3901 tgagggccag tagtgggaac acggtggtgc tgggggacaa gcaggctcct gttgagctgg 3961 agcacctctg ggcacagaag ttttattttc ctggcattcc caacaagtgt tccctgggga 4021 ttcagctcat ggtcactgtt gaaagccttc attcagtccc cctttctcta gccagggctg 4081 cctggacccc tggatgccct gattaccatc cttaattctc cctactaggt gcatataatt 4141 ggccacattc ccccagggca ctgtctgaag agctggagct ggaattatta ccgaattgta 4201 gccaggtagg acggagatga gggtgggaat agggacaggg tgagtgtctg aaggctgaaa 4261 attcccttga gcatctcacc atccctgttg tcccatggag tggggaggct cctcactaga 4321 acaggttgga gaaagagggc atcctatctc cccagatgtc ttcctacccc tccctagaat 4381 cttctgaatg tagtaccttc tggccaggta tgagaacacc ctggctgctc agttctttgg 4441 ccacactcat gtggatgaat ttgaggtctt ctatgatgaa gagactctga gccggccgct 4501 ggctgtagcc ttcctggcac ccagtgcaac tacctacatc ggccttaatc ctggtgagtg 4561 aggcagaagg gagcctccct tatcctggag ttggtgggat aggggaagga ggttggagcc 4621 agagcctgca aagcatgggc aggatgtgtg gcccctccct ggagttaccc ttgctccttg 4681 cccctccagt cagccccaca tccttgcagg ttaccgtgtg taccaaatag atggaaacta 4741 ctccgggagc tctcacgtgg tcctggacca tgagacctac atcctgaatc tgacccaggc 4801 aaacataccg ggagccatac cgcactggca gcttctctac agggctcgag aaacctatgg 4861 gctgcccaac acactgccta ccgcctggca caacctggta tatcgcatgc ggggcgacat 4921 gcaacttttc cagaccttct ggtttctcta ccataagggc cacccaccct cggagccctg 4981 tggcacgccc tgccgtctgg ctactctttg tgcccagctc tctgcccgtg ctgacagccc 5041 tgctctgtgc cgccacctga tgccagatgg gagcctccca gaggcccaga gcctgtggcc 5101 aaggccactg ttttgctagg gccccagggc ccacatttgg gaaagttctt gatgtaggaa 5161 agggtgaaaa agcccaaatg ctgctgtggt tcaaccaggc aagatcatcc ggtgaaagaa 5221 ccagtccctg ggccccaagg atgccgggga aacaggacct tctcctttcc tggagctggt 5281 ttagctggat atgggagggg gtttggctgc ctgtgcccag gagctagact gccttgaggc 5341 tgctgtcctt tcacagccat ggagtagagg cctaagttga cactgccctg ggcagacaag 5401 acaggagctg tcgccccagg cctgtgctgc ccagccagga accctgtact gctgctgcga 5461 cctgatgctg ccagtctgtt aaaataaaga taagagactt ggactccaga cccctgtgtg 5521 actgtcccaa tttcttcttt ccaggcaagc agggcaagga gatctttgga gcaagatcat 5581 aactgagg // LOCUS HUMCOX5B 2593 bp DNA PRI 01-NOV-1994 DEFINITION Homo sapiens cytochrome c oxidase subunit Vb (COX5B) gene, complete cds. ACCESSION M59250 NID g180936 KEYWORDS cytochrome c oxidase subunit Vb. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2593) AUTHORS Lomax,M.I., Hsieh,C.L., Darras,B.T. and Francke,U. TITLE Structure of the human cytochrome c oxidase subunit Vb gene and chromosomal mapping of the coding gene and of seven pseudogenes JOURNAL Genomics 10 (1), 1-9 (1991) MEDLINE 91257815 FEATURES Location/Qualifiers source 1..2593 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2cen-q13" mRNA join(44..146,1023..1096,1297..1396,1949..2144) /gene="COX5B" /note="G00-127-530" gene join(44..146,1023..1096,1297..1396,1949..2061) /gene="COX5B" CDS join(44..146,1023..1096,1297..1396,1949..2061) /partial /gene="COX5B" /codon_start=1 /db_xref="GDB:G00-127-530" /db_xref="PID:g180937" /translation="MASRLLRGAGTLAAQALRARGPSGAAAMRSMASGGGVPTDEEQA TGLEREIMLAAKKGLDPYNVLAPKGASGTREDPNLVPSISNKRIVGCICEEDNTSVVW FWLHKGQAQRCPRCGAHYKLVPQQLAH" exon 44..146 /partial /gene="COX5B" /note="G00-127-530" /number=2 exon 1023..1096 /gene="COX5B" /note="G00-127-530" /number=3 exon 1297..1396 /gene="COX5B" /note="G00-127-530" /number=4 exon 1949..2144 /gene="COX5B" /note="G00-127-530" /number=5 BASE COUNT 615 a 607 c 685 g 686 t ORIGIN Chromosome 2. 1 ctgcagcttg ttcccggaag ttttgctgct agtcgcggac gcaatggctt caaggttact 61 tcgcggagct ggaacgctgg ccgcgcaggc cctgagggct cgcggcccca gtggcgcggc 121 cgcgatgcgc tccatggcat ctggaggtac tcgggtctcc gggcgtgcca gggaccagag 181 tgttgccctc ccagggtggt cccagggcgg caaagcggcg cggctcgtgc agcttctcga 241 ggtcccagtg gccgctttac ggtccccagt gcctcaggct ctgcaggcat ctccctgtaa 301 ttctggaccg ctgctcctgc cgctccccga actcactccg ctgcgaaagt atcctaaacg 361 gaggtgccgg gtgaccttgg gagggaccgg ggctgccacc gggatgggga ggggtccggc 421 ctcccttcaa acctgcgccc acctcaagca gagtgggttc tacatgcttt tagacaaatg 481 tcgacaaatt tgcctcggtg gttggagaaa gaaaagctca taggccgggc gcggtggctc 541 acaactgtaa tcccagcact ttgggaggcc gaggcggaca gatccctgag gtcaggagct 601 caagaccagc ctggccaaca tggtaaaacc ccgtttctac taaaaataca aaaattagcc 661 gggcgtggtg gcgcgcgcct gtagtcccag ctactcggga ggctgaggca ggagaatcgc 721 ttaaacccgg gaggcggagg ttcccgtgag ccaacatcgc gccattgcac tccagcctgg 781 gcaacaagag caaaactccg gctcaaaaaa gaaaaaaaaa gctcccccga gtgctgccgc 841 ttgtgtggat gggtacttgg tggttcttag gggaccatgg atatgagtag cctttaggag 901 cttgtgagcc cgctaaaact tatacagaag tttcggggca ccattttcct tgatcatttc 961 tgtttgtagt ttttctatca gtcatttcag tcagcgtcat aattcacgtt atcttccttt 1021 aggtggtgtt cccactgatg aagagcaggc gactgggttg gagagggaga tcatgctggc 1081 tgcaaagaag ggactggtaa gagaaactcc cttctgtctt ctgtgtaact tatggccttg 1141 gatgtgttca tagtggtctc ctctctggga gtatttgata caggaaactt ggcttgtagg 1201 ttcagtcccc tgagcttctg gaaggtaggg cttatttggc ctaatggttc aattcttgtt 1261 ttttttgttt tgttttgttt tttgtttttt cttcaggacc catacaatgt actggcccca 1321 aagggagctt caggcaccag ggaagaccct aatttagtcc cctccatctc caacaagaga 1381 atagtaggct gcatctgtaa gtacctcacc tctatttttt atccacttgc ttaatatatc 1441 ctacaatagt gtgtaagctg cctcaaatct tcagtgtgtg agtgcatgtt ggtaagtttg 1501 tctaagggtc ttgacactct caagcctcat tatgcctgat agttcatcct tactggaaag 1561 aagcgcagca cagcggtaag actggctcac tgggagtgcg gcatgaagga gtacccaccc 1621 agcaaatatt tattgttata ctgcttctat gccaggcatc attttagaca ctagggatac 1681 ataccagaac ggaccctgct ttgcgttcta cataggcaag ggaattgtta gaatttacag 1741 tgaccttgat acaaggtcag tttactcata ggtgaactca gagcctcaat ttcttcatct 1801 gaagaatggg aggagggctt gaactgatct cttaagatat caaccatagt cttacttgtg 1861 tatcagagat gtctgaggaa aagaaattca tgttgaaagt ctccctttct aggttggatg 1921 accataaacc accttttttc ccttttaggt gaagaggaca ataccagcgt cgtctggttt 1981 tggctgcaca aagggcaggc ccagcgatgc ccccgctgtg gagcccatta caagctggtg 2041 ccccagcagc tggcacactg agcacctgca ctaaattact caaaatgtgc tgtaaagttt 2101 cttctttcca gtaaagacta gccattgcat tggctccttc tcccatagat ggctggtctt 2161 atttcttacc cgtattcttt ggtaggcatg gaatatgctt attttgggaa aagctgtctg 2221 ttaatgctag cttgccatcc acttactgaa agtgtataac cagtgtatag tgcttagatt 2281 aataataaga atagatcgac aacccgtaat gcaatgaatg ggaccacctg gtatgagaga 2341 aaggggtggg ctgaggagtc aggctgacag gacttaaaat attggctcca tcatttggct 2401 ctatctctgt ggacatgttc tctgggggtc aagtacaact acaaaaaggt aagtacctta 2461 caaggtgctt aatatgtaaa gagctgagtg gatagtaggt cctcacagta tcctcataga 2521 gacagggtgg ccttgtgaag agagttttgg ggattttgaa gtcttgaggg acccaggtgc 2581 aaagtaagaa ttc // LOCUS HUMNUCLEO 10942 bp DNA PRI 07-JAN-1995 DEFINITION Human nucleolin gene, complete cds. ACCESSION M60858 J05584 NID g189305 KEYWORDS nucleolin. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10942) AUTHORS Srivastava,M., McBride,O.W., Fleming,P.J., Pollard,H.B. and Burns,A.L. TITLE Genomic organization and chromosomal localization of the human nucleolin gene JOURNAL J. Biol. Chem. 265 (25), 14922-14931 (1990) MEDLINE 90368666 FEATURES Location/Qualifiers source 1..10942 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2q12-qter" exon 1070..1198 /gene="NCL" /note="G00-125-908" /number=1 /product="nucleolin" gene join(1070..1198,2159..2275,3439..3916,4587..4784, 4889..4975,5160..5301,6307..6431,7037..7160,7620..7777, 8292..8415,8652..8785,9279..9405,9792..10006,10140..10499) /gene="NCL" mRNA join(1070..1198,2159..2275,3439..3916,4587..4784, 4889..4975,5160..5301,6307..6431,7037..7160,7620..7777, 8292..8415,8652..8785,9279..9405,9792..10006,10140..10499) /gene="NCL" /note="G00-125-908" /product="nucleolin" CDS join(1181..1198,2159..2275,3439..3916,4587..4784, 4889..4975,5160..5301,6307..6431,7037..7160,7620..7777, 8292..8415,8652..8785,9279..9405,9792..10006,10140..10216) /gene="NCL" /codon_start=1 /db_xref="GDB:G00-125-908" /product="nucleolin" /db_xref="PID:g189306" /translation="MVKLAKAGKNQGDPKKMAPPPKEVEEDSEDEEMSEDEEDDSSGE EVVIPQKKGKKAAATSAKKVVVSPTKKVAVATPAKKAAVTPGKKAAATPAKKTVTPAK AVTTPGKKGATPGKALVATPGKKGAAIPAKGAKNGKNAKKEDSDEEEDDDSEEDEEDD EDEDEDEDEIEPAAMKAAAAAPASEDEDDEDDEDDEDDDDDEEDDSEEEAMETTPAKG KKAAKVVPVKAKNVAEDEDEEEDDEDEDDDDDEDDEDDDDEDDEEEEEEEEEEPVKEA PGKRKKEMAKQKAAPEAKKQKVEGTEPTTAFNLFVGNLNFNKSAPELKTGISDVFAKN DLAVVDVRIGMTRKFGYVDFESAEDLEKALELTGLKVFGNEIKLEKPKGKDSKKERDA RTLLAKNLPYKVTQDELKEVFEDAAEIRLVSKDGKSKGIAYIEFKTEADAEKTFEEKQ GTEIDGRSISLYYTGEKGQNQDYRGGKNSTWSGESKTLVLSNLSYSATEETLQEVFEK ATFIKVPQNQNGKSKGYAFIEFASFEDAKEALNSCNKREIEGRAIRLELQGPRGSPNA RSQPSKTLFVKGLSEDTTEETLKESFDGSVRARIVTDRETGSSKGFGFVDFNSEEDAK EAMEDGEIDGNKVTLDWAKPKGEGGFGGRGGGRGGFGGRGGGRGGRGGFGGRGRGGFG GRGGFRGGRGGGGDHKPQGKKTKFE" exon 2159..2275 /gene="NCL" /note="G00-125-908" /number=2 /product="nucleolin" exon 3439..3916 /gene="NCL" /note="G00-125-908" /number=3 /product="nucleolin" exon 4587..4784 /gene="NCL" /note="G00-125-908" /number=4 /product="nucleolin" exon 4889..4975 /gene="NCL" /note="G00-125-908" /number=5 /product="nucleolin" exon 5160..5301 /gene="NCL" /note="G00-125-908" /number=6 /product="nucleolin" exon 6307..6431 /gene="NCL" /note="G00-125-908" /number=7 /product="nucleolin" exon 7037..7160 /gene="NCL" /note="G00-125-908" /number=8 /product="nucleolin" exon 7620..7777 /gene="NCL" /note="G00-125-908" /number=9 /product="nucleolin" exon 8292..8415 /gene="NCL" /note="G00-125-908" /number=10 /product="nucleolin" exon 8652..8785 /gene="NCL" /note="G00-125-908" /number=11 /product="nucleolin" exon 9279..9405 /gene="NCL" /note="G00-125-908" /number=12 /product="nucleolin" exon 9792..10006 /gene="NCL" /note="G00-125-908" /number=13 /product="nucleolin" exon 10140..10499 /gene="NCL" /note="G00-125-908" /number=14 /product="nucleolin" BASE COUNT 2873 a 2245 c 2723 g 3101 t ORIGIN 1 attctgctgt agacatagag atgatgatca tagctgacta tgatgatgat cccccgcgag 61 cctgaaagag gaaatgctct ggtttgctaa gcccgcgaat cgagtgagac ccacccacaa 121 agctaaccgt ggaagtcact ggcggcctcc ttcgccctgc cagccgggga acccatccgg 181 tggctctcga cctgctcccg ggccatctgg tgacactgac ttcgcagcca ccaccttaat 241 tggcgcattc gacccaaata ataacctggg aacctgtggg cggtctaagg cccggctctg 301 cggtcgccct cccaggcccc tctccctggc cctgtgaggc cagaaagtta cttctccgag 361 gccagttccc catgtctgag aaatatctcc caacttgagg ttctgtgggg taggggaggg 421 ttcgtgactt tctcacagaa aacctcgtac agaccccgcc actgccttta ttaacagctc 481 tcaggagact gcctgcagga ggggggtcgc tccggcccca tgctcgcggg caagcaggga 541 taagctgtgc ctccaaaagg gccaacggga actccgcggt ccctgaactt ccggtgctgg 601 aggactcctc gctccagggc caccaggagc cgcggcgtga gtgcgtgccg gaaccgaggg 661 cggggtctct gaggaactcc aaggctgccc aagcctacgg acccagccac attggcgaac 721 cggagaccgc ccgattccac cacccccgcg ctcccctcac agccggcgcc aaaaacgcca 781 gtcccacgac gcaggccggg acccgcgcgc ccacggccca atcagcgcga ccttgcacaa 841 agcgagcccc gcccccacgg cgccgttgcc agcccctccc cctcccgtgc cgcctcggcc 901 cgcctactcc ccgccccgcg ccgttcacgg ttagaggctc gcgattggct catggggacg 961 gccgcgagct ttggttggtc ggcgcggagt cacgaggcgc cgtcgtcgcc tttccacagg 1021 cgttactggg caggctcagt ctttcgcctc agtctcgagc tctcgctggc ttcgggtgta 1081 cgtgctccgg gatcttcagc acccgcggcc gccatcgccg tcgcttggct tcttctggac 1141 tcatctgcgc cacttgtccg cttcacactc cgccgccatc atggtgaagc tcgcgaaggt 1201 aaacggcctt gagcgcgacg cagacgtgta ggcctgcttc cgaggggcga gcgcggcgcc 1261 gcggggagga gggcctgcgc gcagtcccgg gcgcgttcta gggcgccatg ctgcgggaag 1321 tctcgcgcga ttagtgggga ggtctcgcgc ttctggctac ttggtggcga ggtgaagagc 1381 ttctgcaggt gctgggggag ggggcgctgg gcctcggggt ggagagatga gaccaaactt 1441 ttgcgacgcg tacgagctgg gactgactct gacgcacgtg cccgggagcg tgcctgccac 1501 gtgggccggc gtaggtctgg aatctccaga gggaccgggt gccttgggcc gggaaatggc 1561 ggtatcggcc ctagtcggag tcccggctgc gctcggatgt ctccgccccg gcctggcaag 1621 ccgatacgtg gtgggccccg gaaggtggct ctgccgcgtg ccttttgcgc tgtgtttcgg 1681 gcaagaggtg gtcctgccag gtacccccac gtggccgcac ccgcctcttt aaggggcggg 1741 gtagtgctgg ggaaaggcat aagcttcatg agaaaataag gtagtatttt taagtgcctt 1801 aatgatcttc accgttaatt tgattcaaat aagggtggta gataaagtac cgggatttgt 1861 agtataaaaa cacggttgtg cttaactaag gtaacgggag gagaaatcat ttcctcaggt 1921 tgacttttta ccttagggca ggttttctgt tggtaaagcc tgggaggaaa aatgtgggcg 1981 gttgagaagt agtccctctt gcattgccat caggagtagt ttctatgtta gttgtggtgt 2041 ttggcactat gagaaatgat ctgagacgga gatgatggcg tatgaacact aatggcaaaa 2101 tatgaatggc ctgaaatgtc gaggtggagg tgtaatgatc tatttgtgtc cattttaggc 2161 aggtaaaaat caaggtgacc ccaagaaaat ggctcctcct ccaaaggagg tagaagaaga 2221 tagtgaagat gaggaaatgt cagaagatga agaagatgat agcagtggag aagaggtaat 2281 tttatccaac ttaatgcaga attatgttaa aactacaaaa tggagagtta agacatgaaa 2341 ttggatatct gtggcaaaaa taagatttta tcaggtatgt cttattgtag tggttgagtg 2401 tttcacaagc tcttcattga catgtcaaga tgtcatttgg ctagtatttg aatgtgagtg 2461 ctaagacgag actgggaatt tcttttacat gttcctctgc agggcttgga gtgtgatttg 2521 ttgtgttaaa tcattacatt tttccagttt caacatgtta gctcaccccc acatgtagag 2581 ctgggcattg tattcagagc tgagaataac cttaccagat tcctttccta tcctccgaat 2641 taaaattaat tggtctccat tccatatata tataactgta tcactactgg ttaagtactc 2701 gggtgtagac tgagggctgc cacctctctt tggtaccatt gaccctcttt agccacctcc 2761 tggcctttta tttgcctcca ctataaagac agctgagcac tgaattgtgc tcaggttttc 2821 gttgagaacc tgaatgaaag ttttactctc cacacattgc cttgataaaa ctacgggatt 2881 ttaatgtagc taaatgatga cttttatcaa actaccatgc acactctttg atgtgtgata 2941 gttttgtaag gaatatttat atttagccta ttcatttttt gtctcaggtc ctaagaattg 3001 agcttcactg ggcttggtgg accgcaacca cgagggcccc aatgatttaa taagttaatg 3061 cttggagcct cctatgtgta acgttctgaa taatttacac atagcaattc atgaccttaa 3121 acatgtaagg atgatactat taccattttc agatgagaaa gttggggctt gggaaagtat 3181 gaggtgtaag aattcagagg gtctggttca gaggtatttt cagtgttcaa aagagttcct 3241 tatgtctggg tattcacctt attatagggg ctctgactta agacaacata acagaagcct 3301 ggagttttaa catgtcatat gtgtcatgcg tatgtcttga accagaggca ttgccagagt 3361 ctaacaactc attgggacca tggttatctt tttgggtgtg gggctggact tactggtttg 3421 gttttcattt atctcaaggt cgtcatacct cagaagaaag gcaagaaggc tgctgcaacc 3481 tcagcaaaga aggtggtcgt ttccccaaca aaaaaggttg cagttgccac accagccaag 3541 aaagcagctg tcactccagg caaaaaggca gcagcaacac ctgccaagaa gacagttaca 3601 ccagccaaag cagttaccac acctggcaag aagggagcca caccaggcaa agcattggta 3661 gcaactcctg gtaagaaggg tgctgccatc ccagccaagg gggcaaagaa tggcaagaat 3721 gccaagaagg aagacagtga tgaagaggag gatgatgaca gtgaggagga tgaggaggat 3781 gacgaggacg aggatgagga tgaagatgaa attgaaccag cagcgatgaa agcagcagct 3841 gctgcccctg cctcagagga tgaggacgat gaggatgacg aagatgatga ggatgacgat 3901 gacgatgagg aagatggtaa ggagttgtct tggtagttac tgggcttctg attacaaggt 3961 atcttgagat tctgggatca catattcctt catcgtacaa cctggagatg agattagaat 4021 cttgtgggaa ttctcttggg ttgttgtggt gtgctagact taattaccca tgaatgattt 4081 tgtcctcttg agaaaatttc aatagcacat ctattagtgt tttttataat gtaggatttt 4141 cgtttctaag tgattttttt ttttttttaa atttttttga gatggagctt ttgctgtttc 4201 ccaggcggga gtgcaatggc gcgctatctc ggcgcactgc agcctccatc tcctgggttc 4261 aagcagttct gcctcagcct cccgagtagc gggattacag gtgcccacca ccacacccta 4321 ctaattttgt attttagtag agacgacatt tcaccatgtt ggccaggctg gctctgaact 4381 ttgacctcag gtgatccacc caccttaggc tctcccaaag tgctaggatt acaggtgaga 4441 tatgctgcgc ccggccccaa gtgatctatt cttgccatga ctgttaacta aacatggtga 4501 caggattcga ttttctttac attagatttg aaaaccgatg ttggttttgg gagattgctg 4561 caatttttag gtgacttctc tttcagactc tgaagaagaa gctatggaga ctacaccagc 4621 caaaggaaag aaagctgcaa aagttgttcc tgtgaaagcc aagaacgtgg ctgaggatga 4681 agatgaagaa gaggatgatg aggacgagga tgacgacgac gacgaagatg atgaagatga 4741 tgatgatgaa gatgatgagg aggaggaaga agaggaggag gaaggtactt aaattagatt 4801 ctgacatacg acatgagtta tgtttaaagg aggcacttaa gtgtttgtgg ctactgatgt 4861 gtgatacatt gtttgacatc ttgtccagag cctgtcaaag aagcacctgg aaaacgaaag 4921 aaggaaatgg ccaaacagaa agcagctcct gaagccaaga aacagaaagt ggaaggtaac 4981 ttgcagaatt aggggatatg ggggagataa acagcacaaa tgatgaataa caaagggact 5041 taatactgaa accagatgtt acattgtagt gtgctgatgt gctgtgtata gaaattttgc 5101 tttggaaact aactttttac cacactacaa gtagactgag ttgagctttt tttgtgcagg 5161 cacagaaccg actacggctt tcaatctctt tgttggaaac ctaaacttta acaaatctgc 5221 tcctgaatta aaaactggta tcagcgatgt ttttgctaaa aatgatcttg ctgttgtgga 5281 tgtcagaatt ggtatgacta ggtagctgct tcactgcacg ttacataccg tgggtctgtt 5341 aatttttcct tcccctgtta gcacagttac tttagcctgc cactgttaaa catgaatact 5401 gtaaacactt caaggttagc attagtgaac taagttagaa ttaaactgta gatcccctaa 5461 gttgcaattt ccataatcag tcgtaacttg gtatagcaca gaataatttt tagtaatttt 5521 tttgttgttt ttgttatgta ttgagacgga cgctggcttt tgttcaggct ggagtacagt 5581 ggcgcaatct tggctcactg caacctctgc ctcccgggtt caagcgattc tcctgcctaa 5641 cctcccaagt gactgggata cgggtgccac tcaccatgca tggctaattt ttgttttgta 5701 tttagtatcg atttcaccat gttggtcggc tggttttgaa ctcctgacct caagtgatcc 5761 acccacctcg gcctctcgaa gtgctggtac agcgtcacca ccctgccagt aagttttaat 5821 aatttggtgt taggtgggag aatgcttgaa cctgggaggc agaggttgca gtgagccaag 5881 ttcgcgccac tgtactccag cctgggcaac agattgagac accgtctcaa tttaaaataa 5941 tgtttatttt cttggaagta ccttgaaact attagacctg tctagtcatc atagtgaata 6001 cttttatcca gacaggattc tcctgtatta gtgcttatag gtgttctttt gtcagctgct 6061 actgtgaatt cttataagca atttagctcc atgatgaaga cctcaaacgt gaatgtgcat 6121 gtcatatctt catgctgagc cgtgttctgt agctgcagtt tgcagagcct tgactttgtt 6181 ttgctatact aggggtgctt tttaaaatgt gatctttgtt tgcaccatca catttgtcta 6241 gatacagatt gtgattttga tttgtgtttt cacctgttgt aattttgccc tcctctccac 6301 ctgaaggaaa tttggttatg tggattttga atctgctgaa gacctggaga aagcgttgga 6361 actcactggt ttgaaagtct ttggcaatga aattaaacta gagaaaccaa aaggaaaaga 6421 cagtaagaaa ggtatgtaag gctttatgag ttatgcaatg aactcaggag ctagactgct 6481 agggaaaatg ctttgtaacc catttccctt tggtttcctc ttattttttt taaatcattt 6541 ttttcctttg gtttcctctt aatgtgggaa ttaaatgagc tacagtgttt acaaggtact 6601 tggcactgct tgtcagtgta taggtaaatt cctgagttag gcaagcaaga gcactcttat 6661 acagaacaag aaccattaca tgcacctaaa ttaagctaag gatctttctt cactgaaact 6721 agttaggtcc ctaattactc cctatataca gtgtaatgtt ttgaattggt acattcactt 6781 tttttgttat gcgcgtctac tctaggttga actccagtgt acctaacaga gagtttgaca 6841 tcaaggctgt gacaacatgg agggaccact tgtgtgttga cactgctata tctccatatt 6901 tagcaccgag ccttgtacat ataggatctc aaattatttg ttgatagagc tatgtgtgtt 6961 tttcccctct ttttgttgtt gccccccacc tttggttttt caggccacag agctcatttt 7021 tgttttttta atctagagcg agatgcgaga acacttttgg ctaaaaatct cccttacaaa 7081 gtcactcagg atgaattgaa agaagtgttt gaagatgctg cggagatcag attagtcagc 7141 aaggatggga aaagtaaagg gtatgttctt ctattgaaat gtaagggttt tattaacatt 7201 aatgcacttc ctgctttata aaagaaatat tggtttgatt tccttaggcg tgtaacttgg 7261 acagtttaac ctgtaagttt gtgcctcagt aacccatctg taccatgggg ataatgtact 7321 catagggtga ttttaaaaga caaagctaat acttacaaag aagcaagttt aatgcctatc 7381 ttacataaat actttgtaag tagtagcagt tctttcagtg aggtgaggtt acatgaaaaa 7441 attccaagta tttgtaaaac tagtgggaag taagagggaa gctcgagttt tgattgaaaa 7501 gtggactaaa caagggcatt ttatgtactc agatctgaag caagttctgt gttgctgagg 7561 taaaagcatt tgtgttaata tggttttaaa aaccatgagt tcttctccct ccattgcagg 7621 attgcttata ttgaatttaa gacagaagct gatgcagaga aaacctttga agaaaagcag 7681 ggaacagaga tcgatgggcg atctatttcc ctgtactata ctggagagaa aggtcaaaat 7741 caagactata gaggtggaaa gaatagcact tggagtggta agaaattagg cttgttccaa 7801 ggttttcaga attggttgag ggaactcttc tagtctttgt atttcataag tttataaata 7861 ctttttaatc aaagttactc aaatgtaggt gaagatcaag gacatgatac cccaagtcat 7921 actcttattt ggaatagtaa tttccaatct tgaaatgaga gctctaaatc attttgcatt 7981 ggaatacagt aggcaaatca agcttccttt gtaggcatgt tttatacttt aaatgacttg 8041 accatgtgcg ttttgaactc agatgattct aggaaaacag accagtcatc agcctatgta 8101 agaacaacca gcaggacatt gcaacacgta ctaggtactt aatatgttga gtaacagaaa 8161 tggatttagc ttacgtcatg agtatttgta tataactcaa gcactgaaat tcttagggaa 8221 tagatattac tgttgtgacc gaagctggga cactgtttca gagtcttagg aatgtggctc 8281 tctatttcga ggtgaatcaa aaactctggt tttaagcaac ctctcctaca gtgcaacaga 8341 agaaactctt caggaagtat ttgagaaagc aacttttatc aaagtacccc agaaccaaaa 8401 tggcaaatct aaagggtaag ataatacctt tgtatcatca gttataggcc tatatatgtc 8461 ttagaggtct aaggacgtaa ggtcatgtgt cctgtagaaa aaagctaaat aattttagcc 8521 tagtaaatga gtgtaaaata agtatattta ggtccaacct tgagagaagg gccttggcca 8581 gatcatgtga ccagtggtat agagagcatg tgcctggtaa attactctaa gcattaactg 8641 ttcatcctca ggtatgcatt tatagagttt gcttcattcg aagacgctaa agaagcttta 8701 aattcctgta ataaaaggga aattgagggc agagcaatca ggctggagtt gcaaggaccc 8761 aggggatcac ctaatgccag aagccgtaag ttcacctggt tagggtgctg tggttggggg 8821 tagcactctc ggtgctttgt ttatttttgc acaaattctg tgtttcctgt tcgctactga 8881 gtgaacaata actggatatc gatgactgat tacctgagaa ataattgatg aaatctcaag 8941 aaaattcctc tagatagtca agttctgatc cagctgtcgt caactcagag tagcaagttt 9001 gcccatgatt tcctgcccca tccactgggc cccacctgct tgggttgctt tcccactttc 9061 catagaagac tggggcagga tatcaactat gcaatggcaa ttaaaaaatg taaacccaga 9121 atagccttta ctttaattaa ggactagttg gcttagttgc ttttaactgc tttttcacta 9181 taacaagtat cttggctagt agtcatacta ggcattgtgc aaattcagtg tacgaactgt 9241 gaattcacat aaatcgcaaa tttttttttc cttcccagag ccatccaaaa ctctgtttgt 9301 caaaggcctg tctgaggata ccactgaaga gacattaaag gagtcatttg acggctccgt 9361 tcgggcaagg atagttactg accgggaaac tgggtcctcc aaagggtaag ggaaggaagc 9421 gtgagtgctg cttccacttg aaggggtttt tgttctgtgc agaccttgag tctaatgtgt 9481 cttctcattg agctccttct gtctatcagt ggcagtttat ggattcgcac gagaagaaga 9541 gagaattcac agaactagca ttattttacc ttctgtcttt acagaggtat atttagctgt 9601 attgtgagac attctggggt tcaagctgtc acaccagtta gttttccata gagagctact 9661 ctgctgcact ggtatctttt tcccaaataa acaaggctac ttctgtggga tggctcccca 9721 gcatgtacag ttaacttggg acatgtgtag taggtgcttt ttataatggg caatttcatt 9781 tggtgttcta ggtttggttt tgtagacttc aacagtgagg aggatgccaa ggaggccatg 9841 gaagacggtg aaattgatgg aaataaagtt accttggact gggccaaacc taagggtgaa 9901 ggtggcttcg ggggtcgtgg tggaggcaga ggcggctttg gaggacgagg tggtggtaga 9961 ggaggccgag gaggatttgg tggcagaggc cggggaggct ttggaggtaa ggcacgcaga 10021 gataatgaca ccacatagca tgtgctcttc agaccctgtg ccctgtcacg gttcctaatc 10081 actggggagg aggagctttg tacccattct tttaacagtg tcttgccttc ctcctgtagg 10141 gcgaggaggc ttccgaggag gcagaggagg aggaggtgac cacaagccac aaggaaagaa 10201 gacgaagttt gaatagcttc tgtccctctg ctttcccttt tccatttgaa agaaaggact 10261 ctggggtttt tactgttacc tgatcaatga cagagccttc tgaggacatt ccaagacagt 10321 atacagtcct gtggtctcct tggaaatccg tctagttaac atttcaaggg caataccgtg 10381 ttggttttga ctggatattc atataaactt tttaaagagt tgagtgatag agctaaccct 10441 tatctgtaag ttttgaattt atattgtttc atcccatgta caaaaccatt ttttcctaca 10501 aatagtttgg gttttgttgt tgttactttt ttttttgttt ttgttttttt tttttttgcg 10561 ttcgtggggt tgtaaaagaa aagaaagcag aatgttttat catggttttt gcttcaccgc 10621 tttaggacaa attaaaagtc aactctggtg ccagacgtgt tacttcctaa agagtgtttc 10681 ccctggaatc tcactggaga gcatggcaaa gccagctctg ccacttgctt cacccatccc 10741 aatggaaatg gcttagtgcg tgtttccagt atcccagccc taactaactt ggttgaaatg 10801 ctggtgaggg gacctgctcc tgcagccctg gtgctgactt gaaggctgct gcagcttctc 10861 ctacttttag caggtctcga ggattatgtc tgaagaccac tctggaaaga ggtcgaggaa 10921 cagattagtc aggtttccta gg // LOCUS S45332 8647 bp DNA PRI 23-DEC-1992 DEFINITION erythropoietin receptor [human, placental, Genomic, 8647 nt]. ACCESSION S45332 NID g255496 KEYWORDS . SOURCE human placental. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8647) AUTHORS Noguchi,C.T., Bae,K.S., Chin,K., Wada,Y., Schechter,A.N. and Hankins,W.D. TITLE Cloning of the human erythropoietin receptor gene JOURNAL Blood 78 (10), 2548-2556 (1991) MEDLINE 92399733 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 113293] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..8647 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1916..8128 /gene="erythropoietin receptor, Epo receptor" CDS join(1916..2030,2887..3022,4010..4185,4266..4423, 4907..5060,5145..5232,7334..7421,7517..8128) /gene="erythropoietin receptor, Epo receptor" /note="This sequence comes from Fig. 2; Epo receptor" /codon_start=1 /product="erythropoietin receptor" /db_xref="PID:g255497" /translation="MDHLGASLWPQVGSLCLLLAGAAWAPPPNLPDPKFESKAALLAA RGPEELLCFTERLEDLVCFWEEAASAGVGPGNYSFSYQLEDEPWKLCRLHQAPTARGA VRFWCSLPTADTSSFVPLELRVTAASGAPRYHRVIHINEVVLLDAPVGLVARLADESG HVVLRWLPPPETPMTSHIRYEVDVSAGNGAGSVQRVEILEGRTECVLSNLRGRTRYTF AVRARMAEPSFGGFWSAWSEPVSLLTPSDLDPLILTLSLILVVILVLLTVLALLSHRR ALKQKIWPGIPSPESEFEGLFTTHKGNFQLWLYQNDGCLWWSPCTPFTEDPPASLEVL SERCWGTMQAVEPGTDDEGPLLEPVGSEHAQDTYLVLDKWLLPRNPPSEDLPGPGGSV DIVAMDEGSEASSCSSALASKPSPEGASAASFEYTILDPSSQLLRPWTLCPELPPTPP HLKYLYLVVSDSGISTDYSSGDSQGAQGGLSDGPYSNPYENSLIPAAEPLPPSYVACS " BASE COUNT 1855 a 2411 c 2432 g 1949 t ORIGIN 1 ggatccaccc acctcggcct cccaaagtgc tgggattaca ggcatgagca ctgtgcatgg 61 actatttatt tatttttttg aaacagagtt tcaatcttgt tgcacagcct ggagtgcaat 121 ggtgtgatct cagctcactg caacctctgc cttctggttt caagcaattc tcctgcctca 181 gcctcctgag tagctgggat tacaggcacc caccaccacg ctcgaatata tatatatatt 241 ttttgagacg gagtccgctc tgtcaccagg ctggagtgca gtggccaaat atcggctcac 301 tgaaacctcc ggctcctggg ttcaagcgat tctcctgcag cctcccaagt agctgggatt 361 acaggcatgc agcaccacgc ccatctaatt tttgtatttt tggtagagat ggggttttac 421 catgttggcc aggatggtct tgatctcttg acctcgtgat ctgcccacct cggcctccca 481 aagtgctggg attacaggcg tgacgaccgc gcccggccta cgcctggcta atttttgtat 541 ttttagtaga gacgtggttt cgccatgttg cccaggctgg tctcgaactc ctgacctcat 601 gatccgcctg tctcggcctc ccaaagtgtt gggattacaa gtatgagcca ccgcgccact 661 agccaatttt ttttattttt tgagatgcag tctcactctg ttgcccaggc tggagttgca 721 gtggcatgat cttggctcac tgcaatcttc atctcccaga ctgaagcagt tctcatgcct 781 cagcctcctg agtagctggg attacagcac acgccaccac acctggctaa tttttgtatt 841 tttagtagag atgggatttc accatgttgg ccaggctggt ctcaaactcc tgacctcaag 901 tgatttgccc acgtcggcct cccaaagtgc tgggattata ggcgtgagcc accgcccagc 961 ccaagagaat aaaaatgtgg gtggtaaaaa tttttttccc aaaaattcgt aaatgaaaat 1021 ctcacatatt atgcatactg cccaggagca tggcctagca ctgtgcaaac actcaactgc 1081 tggtcgttgc aaggattatt attggccggc ttcagtggct tgctggtatt cccagcacat 1141 tgggagatgg aggctggagg attgcttaag tccgggattt caagaccagc ctggacaaca 1201 tagtgggatc ccatctctac aaagaatttt aaaaattagc caggtgcagt gggaagattg 1261 cttcagtcca gaggctgcag tgagctatga ttgtgccact gcactccagc ctgggtgaca 1321 gagcaacacc ctgagacaga gagagagagg gggaaggagg gaaggaggga aggaaggaag 1381 gaaggaagga aggaaggaag gaaggaagga aggaaggaaa ggagagagag agagagagag 1441 agagagagag agagagaaaa taatttttat ttatttccag gctgggaaga gatgctgatt 1501 tctgcgataa aatcagtagg tacatttttt ggaatgttcg ctatgtgcca ggctagattt 1561 tacagatgag aagtctgaag ctcaggtaag gtaagtcacc tgtccagggc cacaaagaaa 1621 aaaaaaacgt gtgtctgaag ccagaacggg agctgttgcg cccaactccc tcccctgccc 1681 ccaagcggcc tctgggctcg ggaagggccc ctgcctcctc ccgccaggca cttatctcta 1741 cccaggctga gtgctggccc cgcccctcgg ggatctgcca cttagaggcg cctggtcggg 1801 aagggcctgg tcagctgcgt ccggcggagg cagctgctga cccagctgtg gactgtgccg 1861 ggggtggggg acggaggggc aggagccctg ggctccccgt ggcgggggct gtatcatgga 1921 ccacctcggg gcgtccctct ggccccaggt cggctccctt tgtctcctgc tcgctggggc 1981 cgcctgggcg cccccgccta acctcccgga ccccaagttc gagagcaaag gtaaggatga 2041 gctgcgtgtg gacccctacg ctggagcctg caggaccatg ctggggcctg aactcccagc 2101 ctaggtcctg ggggccatgc tgtttctgga cttcctgacc gggtcctggg ggccaagctg 2161 gcatctgaac ccttagactg ggtcctggat gggtgggggg cggggtgggg tatgttagga 2221 tccaagactc ctgatcgcgt cccgggcaag agctagagtg ggcttaacat tcccgtttta 2281 ccttttcagg gagtctggga catgctaaat cctaaggggg ctgacttggt gctaaggtcc 2341 ctggggggtg gggaccaagc cgatccctag gggagggagg gtaaagcccg ggtccgagtt 2401 agagggccaa gccacaggct actgtaaaca cggtttgtgt gagggcgcca gatcacttgc 2461 ccggcccggt ggagggaggg aggcgggggg cacggttggc gctatcggtt ggcggggagc 2521 ctgccggggc cgataggggg cccgcctctc cgcacacacc cccagccgcg cgcgtgtcct 2581 aggctggggc ggggctggca gtcccgagct cgaggtcttg aacgccgcgc ccagctcagc 2641 tggccgctgg gtgggcaggt gtgcgccagt ggtgcacggc gggggacagt aaggcgagaa 2701 acttgcccct gggaattagg ggggcaccac ctctgcggac ccctccaagg gacccgcttg 2761 ggaagatggc agggcggggc ttttttctta tcgggtccgc ccaggctgcg ggagggaaga 2821 ggagggggct gtctcccgag gatagagctc agacccccat gcccttcctt tgtcgcccct 2881 ccccagcggc cttgctggcg gcccgggggc ccgaagagct tctgtgcttc accgagcggt 2941 tggaggactt ggtgtgtttc tgggaggaag cggcgagcgc tggggtgggc ccgggcaact 3001 acagcttctc ctaccagctc gagtgagtcc gatccggcgg gtgcctccaa gggcggaggg 3061 agggggtggg gcagagctcc ctggaggtcg tagcctcgta tgtcccctgc tgtttgaggc 3121 ccgacggcgc ctccagtcgt ggtcactgga gggaaacctg cgggtccagg gctggcacgc 3181 ctctatgggc cggggcgcga acactcccgc gatcaccgct ggaacgcgac cccaaacatc 3241 aggctgggat aacaacgcct ccaaatcgag ggtaaggcgt tactacgtcg gggctgggac 3301 gccttctcga ggtagtatcc aaaaggaggc cagcagtgct catgcctgta atcccaactc 3361 tttggaaggt cgagcggaag aaccgcttga gcccaggtgt tcaagaccag cctgggcaac 3421 acagcgagat ccccgtctct taaaaaaaaa ttagactggg cgcggctgca cgcctgtaat 3481 cccagcactt tgggaggctg aggcgggcgg atcacctgag gtcgggagtt tgagagccag 3541 cctggccaac atggagaaac tctatctcta ctaaaaatac aaaattagcc gggcgtggtg 3601 gcgcatgcct gtgatcccag ctactcggga ggctgaggca ggagaatcgc ttgaacccgg 3661 gaggcggagg ttgcggtgag ccgaggtagc gccattgcac tccagcctgg gcaacaagag 3721 cgaaactccg tctcaaaaaa aaaaaaaata aaagccaggc gtggcgcgtg cctgtggtct 3781 caactacttg ggaagctgag gtgggaggat cccttaagcc ccagaatttg aggctgcagt 3841 gagccatgat cgcgccactg cactccagcc tgggcgacga aggaacacct tgtcacacac 3901 acacacaagg ctagaccttg tgtcacacat acacactgcc ccccacaggc cgggcaatgc 3961 caactccccg gtcccccctc ccaacctgct cccttccctg ggcgcatagg gatgagccat 4021 ggaagctgtg tcgcctgcac caggctccca cggctcgtgg tgcggtgcgc ttctggtgtt 4081 cgctgcctac agccgacacg tcgagcttcg tgcccctaga gttgcgcgtc acagcagcct 4141 ccggcgctcc gcgatatcac cgtgtcatcc acatcaatga agtaggtaag tgctctggga 4201 atggaggagt ggtcggagga gagggtctca gtcctcgccc acctgaccaa cccccatgcc 4261 tgcagtgctc ctagacgccc ccgtggggct ggtggcgcgg ttggctgacg agagcggcca 4321 cgtagtgttg cgctggctcc cgccgcctga gacacccatg acgtctcaca tccgctacga 4381 ggtggacgtc tcggccggca acggcgcagg gagcgtacag agggtgaggc cagcccctac 4441 ggcccagccc ccaaagctcc actgactacg gcccagccac gcctctcgag gtcgcgcccg 4501 gtgccgcttt cagggccggt ccgtaacatc ccacatccca ttaccctggt gctgaagacc 4561 gttccacgcc cacagacaca gccccctttc ctaatgtcct cgcaagcctg ttgaacccca 4621 acttcttctc cctccggccc gtaaccctag acccctttag cgcccgggtc cctctacgag 4681 tgctagccca gatattaaat tgcccgggtc ccgccctttc gtaccagaga ctctctctct 4741 gattggccct gagctttctt gggctcctcc ccctactctt attggtccca ttgcaattct 4801 agggcaccgt tttcctttcc cctgattggc tcagttccac cagggcccgc ccccacgtca 4861 tctatttttg tctgctacgc gtccctcgcc ctgattccgc ccccaggtgg agatcctgga 4921 gggccgcacc gagtgtgtgc tgagcaacct gcggggccgg acgcgctaca ccttcgccgt 4981 ccgcgcgcgt atggctgagc cgagcttcgg cggcttctgg agcgcctggt cggagcctgt 5041 gtcgctgctg acgcctagcg gtgaggcccc aggcgggggt gtaggaggag ccagggcgaa 5101 tcacggggca agcccaccgc cctgacctcc tccccgcctc ttagacctgg accccctcat 5161 cctgacgctc tccctcatcc tcgtggtcat cctggtgctg ctgaccgtgc tcgcgctgct 5221 ctcccaccgc cggtgagctc cccatttggg cgctgggccc agactcctcc ccgccaacgg 5281 tcctctttca ctatggaaac ctaggctcag agagagacac gcacttgccc aaggtcacgc 5341 agtaaggatt cacatcagtg gcagggctgg gatgcatgcc agactagacc cagactcttc 5401 gttaacattt tctgctcttg gggactttca cctgattttc cttctacatc aggggctgcc 5461 atttcttggg tccctttgtt agttcctttc cccagtgtca tcacctttgt aaaatcaact 5521 agatggattt agtgaaagaa tttaagaccc tgaatgcctc cgcacccctg cggtcaagct 5581 tctcagacac tatgatcaga ctagccgttc tgaggtattt gtaattccaa gcacacacta 5641 ggtggtttca cacccccaag cttttgccca tgctgttccc tctgcctgga atgcccttcc 5701 tgccttgtct gctaagcaat cttctagtcg tctttcatgg ccctgttcat ttacttggtt 5761 ggaaaataca aacagagtgc caaacatgtg ccaggcactg gagagagaat ggagaacaag 5821 ctagaccctg accacaagtc cctgaccttg tggatctcaa gtcaacaaac aagggaccca 5881 agaaatattt gatgacaaat tgtaatgagt gatatcacag aaacaaacag aatgtggtga 5941 catgacagga tggtcaggga aggctccagg aggaggtgac atcagagtgg aaacctgaag 6001 attggaagga agcagccgct tgaaaagtgg ggagaagaaa cagcaagtgc aaaggccctg 6061 aggtgggaat gagattggaa cgttcagcca gcttcaagaa ttgccacatg catggcctgg 6121 catggtggct cacgcctgta atcccagcac tttgggatgc cgaggcaggc agatcacctg 6181 aggttgggag ttcgcgacca gcctgaccaa catggagaaa ccccacctct actaaaaata 6241 caaaactagc caagcgtggt ggcacatgcc tgtaatcccc gctactcggg aggctgaggc 6301 aggagaatca cttgaacctg ggaggtggag gttgcgggtg agccgagatc gtgccatcgc 6361 attccagcct gggcaataag agtgaaactc cgtctcaaaa aaaaaaaaaa aattgccaca 6421 tggctagagt ggtatgtaag ggggtgtggc agatattgag atgagggagg tgacaggggt 6481 catataacgc agggccttct gcagggtggt ggggaggagt ttggaatttt tttttttttt 6541 gagacagagt cactcttgtc gcccaagctg tagtgcagtg cagcagtctt ggctcactgc 6601 aactctgcct cccaggttca agtgattctc ctgcctcaac cgcctgagta gctgagatta 6661 caggcgtgca tgcccggcta attttgtagt tttagtagag acggggttcc accatgttgg 6721 ccaggctggt ctcaaactcc tgacctcagg tgatctgctc acatcagcct ctcaaagtgc 6781 tgggattata ggcatgagcc accgtgcctg gcttggattt tatcctaaat gcctctctca 6841 ttaccccaga aggtaacata atatttatct atgaagtgac atcatggacc tcctggaaaa 6901 atctgggcca gggttttggg ttttttaatt tattttattt tatttttttt agagatgggg 6961 gtctcactat gtttcctagg ctggtcttga actcctgggt tcaaatgatc ctcccacctc 7021 agcctcccaa agtactggga ttatagtgct ggtgtaaacc actgcacctg gccatggcca 7081 ggattaaagg gagaatgacc aaggtatatt gaactcctat gcacccttca ataccctgtt 7141 ccatttaccc ttttgtaggg ccttgctgat gcttcagcca aaacccctgt cccctggccc 7201 tgatgtactc ctctgcctcc attgtgatca cagggaccaa gtgtatctgt gcctctatga 7261 ctgggagtgg agggggaatt ggtgagtatt caatgagtca tatctatgta actatttata 7321 ttggcttcaa cagggctctg aagcagaaga tctggcctgg catcccgagc ccagagagcg 7381 agtttgaagg cctcttcacc acccacaagg gtaacttcca ggtaggtggc ctggttgtcc 7441 cctcagtgcc tgggcttccc tgcttcttgc agccaaactg caggcctctc tgagcaggtt 7501 ggtgctattt cttcagctgt ggctgtacca gaatgatggc tgcctgtggt ggagcccctg 7561 cacccccttc acggaggacc cacctgcttc cctggaagtc ctctcagagc gctgctgggg 7621 gacgatgcag gcagtggagc cggggacaga tgatgagggc cccctgctgg agccagtggg 7681 cagtgagcat gcccaggata cctatctggt gctggacaaa tggttgctgc cccggaaccc 7741 gcccagtgag gacctcccag ggcctggtgg cagtgtggac atagtggcca tggatgaagg 7801 ctcagaagca tcctcctgct catctgcttt ggcctcgaag cccagcccag agggagcctc 7861 tgctgccagc tttgagtaca ctatcctgga ccccagctcc cagctcttgc gtccatggac 7921 actgtgccct gagctgcccc ctaccccacc ccacctaaag tacctgtacc ttgtggtatc 7981 tgactctggc atctcaactg actacagctc aggggactcc cagggagccc aagggggctt 8041 atccgatggc ccctactcca acccttatga gaacagcctt atcccagccg ctgagcctct 8101 gccccccagc tatgtggctt gctcttagga caccaggctg cagatgatca gggatccaat 8161 atgactcaga gaaccagtgc agactcaaga cttatggaac agggatggcg aggcctctct 8221 caggagcagg ggcattgctg attttgtctg cccaatccat cctgctcagg aaaccacaac 8281 cttgcagtat ttttaaatat gtatagtttt tttttgtatc tatatatata tatacacata 8341 tgtatgtaag tttttctacc atgatttcta caaacaccct ttaagtccca tcttcccctg 8401 ggcataggcc atagggatag aagttaaagt tcttgagctt attcagaagc tggatctgca 8461 atctgaatgc tactcataac ataacaaaat agtatgttaa acagctctta aatcttactg 8521 gcttaccaca ttaaatgatt tctctctcct aactcagctc aaatgggcag ccatccatgg 8581 atgagtcaga ggttcagact cttccagtct gtagctctac cttctcttag ggtacttaga 8641 tggatcc // LOCUS HUMINSPR 3943 bp DNA PRI 06-JAN-1995 DEFINITION Human alpha-type insulin gene and 5' flanking polymorphic region. ACCESSION M10039 NID g186437 KEYWORDS insulin. SOURCE Human (30 year old Caucasian male) lymphocyte DNA, clone lambda-HI-3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 134 to 2096) AUTHORS Owerbach,D. and Aagaard,L. TITLE Analysis of a 1963-bp polymorphic region flanking the human insulin gene JOURNAL Gene 32 (3), 475-479 (1984) MEDLINE 85155512 REFERENCE 2 (bases 1 to 3943) AUTHORS Owerbach,D. JOURNAL Unpublished (1985) REFERENCE 3 (sites) AUTHORS Cao,G.-J., Jiang,P., Feng,X.-L., Gu,X.-R. and Machatt,M.A. TITLE The mouse Col2a-1 gene is highly conserved and is linked to Int-1 o n Chromosome 15 JOURNAL Nature 1, 23-36 (1991) COMMENT Draft entry and sequence in computer-readable form for [1],[2] kindly provided by D.Owerbach, 15-OCT-1985. The nucleotide sequence of a long polymorphic region (positions 134-2096) located 365 bp upstream of the human insulin gene is composed of 139 repeating sequences whose consensus structure is related to 'acaggggtgtgggg'. Expansion in the number of repeating sequences appears to have taken place through duplication and triplication of blocks of 8-10 repeats. However, ancestral polymorphic regions containing additions or deletions of 50 bp or more were not detected in two previous generations. The region 168-258 bp upstream from the transcription start site, containing essential control elements for efficient cell-specific expression, are the same. Thus linkage-disequilibrium between sequences in this control region and specific polymorphic regions is probably not the explanation for the disease association between the long polymorphic regions and atherosclerosis. FEATURES Location/Qualifiers source 1..3943 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11p15.5" source 1..1013 /organism="Mus musculus" /germline gene 1..23 /gene="S" exon 1..23 /partial /gene="S" /note="f = pseudouridine" /citation=[3] /evidence=experimental /product="transfer RNA-Lys ligase" prim_transcript 2461..3891 /note="ins mRNA" intron 2503..2681 /note="ins mRNA intron A" exon <2699..2885 /gene="INS" /note="insulin, (first expressed exon); G00-119-349" /number=2 gene 2699..2885 /gene="INS" CDS join(2699..2885,3673..3818) /partial /note="insulin" /codon_start=1 /db_xref="PID:g386829" /translation="MALWMRLLPLLALLALWGPDPAAAFVNQHLCGSHLVEALYLVCG ERGFFYTPKTRREAEDLQVGQVELGGGPGAGSLQPLALEGSLQKRGIVEQCCTSICSL YQLENYCN" intron 2886..3672 /note="ins cds intron B" exon 3673..>3818 /note="insulin" /number=3 BASE COUNT 628 a 868 c 1825 g 622 t ORIGIN Chromosome 11p15.5; PvuII site. 1 ctggggctgc tgtcctaagg cagggtggga actaggcagc cagcagggag gggacccctc 61 cctcactccc actctcccac ccccaccacc ttggcccatc catggcggca tcttgggcca 121 tccgggactg gggacagggg tcctggggac aggggtccgg ggacagggtc ctggggacag 181 gggtgtgagg acaggggtcc tggggacagg ggtgtgggga caggggtgtg aggacagggg 241 tcccggggac aggggtgtgg ggacaggggt gtggggatag gggtgtgggg acaggggtgt 301 ggggacaggg gtgtggggac aggggtctgg ggacaggggt gtggggatag gggtgtgggg 361 acaggggtgt ggggacaggg gtgtggggac aggggtctgg ggacaggggt gtggggacag 421 gggtccgggg acaggggtgt ggggacaggg gtgtggggac aggggtgtgg ggacaggggt 481 cccggggaca ggggtgtggg gacaggggtc tggggacagg ggtgtgggga taggggtgtg 541 gggacagggg tgtggggaca ggggtgtggg gacaggggtc tggggacagg ggtgtgggga 601 caggggtctg gggacagggg tgtggggaca ggggtcccgg ggacaggggt gtggggacag 661 gggtctgggg acaggggtgt ggggataggg gtgtggggac aggggtgtgg ggacaggggt 721 gtggggacag gggtctgggg acaggggtgt ggggacaggg gtgtggggac aggggtgtgg 781 ggacaggggt ccggggacag gggtgtgggg acaggggtct ggggacaggg gtgtggggac 841 aggggtgtgg ggacaggggt gtggggacag gggtctgggg acaggggtgt ggggacaggg 901 gtctggggac aggggtgtgg ggacaggggt gtggggacag gggtgtgggg acaggggtgt 961 ggggacaggg gtccggggac aggggtctgg ggacaggggt gtggggacag gggtgtgggg 1021 acaggggtgt ggggacaggg gtcccgggga caggggtgtg gggacagggg tctggggaca 1081 ggggtgtggg gataggggtg tgtggacagg ggtgtgggga taggggtgtg gggacagggg 1141 tcccggggac aggggtgtgg ggacaggggt gtggggatag gggtgtgggg acaggggtcc 1201 cggggacagg ggtgtgggga caggggtctg gggacagggg tgtggggaca ggggtgtggg 1261 gacaggggtc ccggggacag gggtgtgggg acaggggtct ggggacaggg gtgtggggat 1321 aggggtgtgg ggacaggggt gtggggatag gggtgtgggg acaggggtgt ggggacaggg 1381 gtcctgggga caggggtgtg gggacagggg tgtggggaca ggggtgtggg gacaggggtg 1441 tggggacagg ggtcccgggg acaggggtgt ggggacaggg gtgtggggac aggggtgtgg 1501 ggacaggggt ccggggacag gggtgtgggg acaggggtgt ggggacaggg ctgtggggac 1561 aggggtgtgg ggacaggggt cctggggaca ggggtctggg gacaggggtg tggggacagg 1621 ggtgtgggga caggggtccg gggacagggg tgtggggaca ggggtccggg gacaggggtg 1681 tggggacagg ggtgtgggga caggggtgtg gggacagggg tgtggggaca ggggtcctgg 1741 ggacaggggt ctggggacag gggtgtgggg acaggggtgt ggggacaggg gtcccgggga 1801 caggggtgtg gggacagggg tgtggggaca ggggtgtggg gacaggggtg tggggacagg 1861 ggtgtgggga caggggtccc ggggacaggg gtgtggggac aggggtgtgg ggacaggggt 1921 cctggggaca ggggtctggg gataggggtg tggggacagg ggtctgggga caggggtgtg 1981 gggacagggg tctggggata ggggtgtggg gacaggggtg tggggacagg ggtgtgggga 2041 caggggtgtg gggacagggg tgtggggaca ggggtcctgg ggacaggggt ctggggacag 2101 cagcgcaaag agccccgccc tgcagcctcc agctctcctg gtctaatgtg gaaagtggcc 2161 caggtgaggg ctttgctctc ctggagacat ttgcccccag ctgtgagcag ggacaggtct 2221 ggccaccggg cccctggtta agactctaat gacccgctgg tcctgaggaa gaggtgctga 2281 cgaccaagga gatcttccca cagacccagc accagggaaa tggtccggaa attgcagcct 2341 cagcccccag ccatctgccg acccccccac cccaggccct aatgggccag gcggcagggg 2401 ttgacaggta ggggagatgg gctctgagac tataaagcca gcgggggccc agcagccctc 2461 agccctccag gacaggctgc atcagaagag gccatcaagc aggtctgttc caagggcctt 2521 tgcgtcaggt gggctcaggg ttccagggtg gctggacccc aggccccagc tctgcagcag 2581 ggaggacgtg gctgggctcg tgaagcatgt gggggtgagc ccaggggccc caaggcaggg 2641 cacctggcct tcagcctgcc tcagccctgc ctgtctccca gatcactgtc cttctgccat 2701 ggccctgtgg atgcgcctcc tgcccctgct ggcgctgctg gccctctggg gacctgaccc 2761 agccgcagcc tttgtgaacc aacacctgtg cggctcacac ctggtggaag ctctctacct 2821 agtgtgcggg gaacgaggct tcttctacac acccaagacc cgccgggagg cagaggacct 2881 gcagggtgag ccaaccgccc attgctgccc ctggccgccc ccagccaccc cctgctcctg 2941 gcgctcccac ccagcatggg cagaaggggg caggaggctg ccacccagca gggggtcagg 3001 tgcacttttt taaaaagaag ttctcttggt cacgtcctaa aagtgaccag ctccctgtgg 3061 cccagtcaga atctcagcct gaggacggtg ttggcttcgg cagccccgag atacatcaga 3121 gggtgggcac gctcctccct ccactcgccc ctcaaacaaa tgccccgcag cccatttctc 3181 caccctcatt tgatgaccgc agattcaagt gttttgttaa gtaaagtcct gggtgacctg 3241 gggtcacagg gtgccccacg ctgcctgcct ctgggcgaac accccatcac gcccggagga 3301 gggcgtggct gcctgcctga gtgggccaga cccctgtcgc caggcctcac ggcagctcca 3361 tagtcaggag atggggaaga tgctggggac aggccctggg gagaagtact gggatcacct 3421 gttcaggctc ccactgtgac gctgccccgg ggcgggggaa ggaggtggga catgtgggcg 3481 ttggggcctg taggtccaca cccactgtgg gtgaccctcc ctctaacctg ggtccagccc 3541 ggctggagat gggtgggagt gtgacctagg gctggcgggc aggcgggcac tgtgtctccc 3601 tgactgtgtc ctcctgtgtc cctctgcctc gccgctgttc cggaacctgc tctgcgcggc 3661 acgtcctggc agtggggcag gtggagctgg gcgggggccc tggtgcaggc agcctgcagc 3721 ccttggccct ggaggggtcc ctgcagaagc gtggcattgt ggaacaatgc tgtaccagca 3781 tctgctccct ctaccagctg gagaactact gcaactagac gcagcccgca ggcagcccca 3841 cacccgccgc ctcctgcacc gagagagatg gaataaagcc cttgaaccag ccctgctgtg 3901 ccgtctgtgt gtcttggggg ccctgggcca agccccactt ccc // LOCUS HUMHST 6616 bp DNA PRI 22-AUG-1995 DEFINITION Human transforming protein (hst) gene, complete cds. ACCESSION J02986 M16338 NID g184430 KEYWORDS transforming protein. SOURCE Homo sapiens (clone: pLBS6.2) DNA; and Homo sapiens (clone: lambda-CT361-b3) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 6182 to 6616; 2313 to 2890; 3508 to 3611) AUTHORS Taira,M., Yoshida,T., Miyagawa,K., Sakamoto,H., Terada,M. and Sugimura,T. TITLE cDNA sequence of human transforming gene hst and identification of the coding sequence required for transforming activity JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (9), 2980-2984 (1987) MEDLINE 87204251 REFERENCE 2 (bases 1 to 6181) AUTHORS Yoshida,T., Miyagawa,K., Odagiri,H., Sakamoto,H., Little,P.F., Terada,M. and Sugimura,T. TITLE Genomic sequence of hst, a transforming gene encoding a protein homologous to fibroblast growth factors and the int-2-encoded protein [published erratum appears in Proc Natl Acad Sci U S A 1988 Mar;85(6):1967] JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (20), 7305-7309 (1987) MEDLINE 88041096 COMMENT Draft entry and printed copy of sequence for [1],[2] kindly provided by H.Sakamoto, 08/06/87. No polyadenylation site was found. FEATURES Location/Qualifiers source 1..6616 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pLBS6.2" /clone="lambda-CT361-b3" /cell_line="T361-2nd-1 stomach cancer" /map="11q13.3" exon 2313..2890 /note="transforming protein" /number=1 variation 2444 /note="c in DNA, t in cDNA" /replace="t" variation 2474 /note="c in DNA, t in cDNA" /replace="t" gene 2551..4326 /gene="FGF4" CDS join(2551..2890,3508..3611,4150..4326) /gene="FGF4" /codon_start=1 /db_xref="GDB:G00-120-066" /product="transforming protein" /db_xref="PID:g386788" /translation="MSGPGTAAVALLPAVLLALLAPWAGRGGAAAPTAPNGTLEAELE RRWESLVALSLARLPVAAQPKEAAVQSGAGDYLLGIKRLRRLYCNVGIGFHLQALPDG RIGGAHADTRDSLLELSPVERGVVSIFGVASRFFVAMSSKGKLYGSPFFTDECTFKEI LLPNNYNAYESYKYPGMFIALSKNGKTKKGNRVSPTMKVTHFLPRL" intron 2891..3507 /gene="FGF4" /note="hst intron A" exon 3508..3611 /gene="FGF4" /number=2 intron 3612..4149 /gene="FGF4" /note="hst intron B" exon 4150..>6181 /note="transforming protein" /number=3 variation 4383 /note="g in DNA, a in cDNA" /replace="a" CDS 4685..5149 /note="ORF; putative" /codon_start=1 /db_xref="PID:g567023" /translation="MAWSISVWKGPPEAWAASSGSSQAWLSASILRGPLPLCPVAINR DISVYFGYRKCGVEVLAATLFLDLPRLTFELSCSQSSSIYQMGETLGQLYKLLFAFFG SATAPIAVTIGEKTKLFHKFHGEESISHWKARNGQDSVFAITNKTLVMKNNL" BASE COUNT 1404 a 1833 c 1984 g 1395 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccagaa ggcatccccg agtggctact ccaatggagt ggcttctcca ttcaggcaaa 61 cctgaatggg ataagtcatt ggcaggaaga tctggggccg ggggtcatcc agtgggaagg 121 ggagagatga cgcggtcagc atggcgggaa cacaggagca gaaaggaagc aggtgggaag 181 ccaggtcaag ggccaggggc acggaaaggg gtcagatgca gataagtgag tgcttcctgg 241 tgcatccttc atccgcaatt catccttacc tgtgcttttg ttgcctccat tgcacagctg 301 aggaggccag ggcctgcgga ggttgagagt gtgctcaggg agcccccgga gcaaagtgga 361 agccagattc cagatcagtt ctgctgggaa ttcccagctc ccaaaagccc tgctggctgt 421 cagtccccag tcaccacaag cacctatcct gtgtgggtgg gcctgcagtt ctgggagata 481 tatcagctgc ctgcagcgtc ctttgctgaa ctcacagcaa ataggagaga cagggagggg 541 tccttgggaa gccctaaatt gagcttgctg tgggagtcct gggaagaaag gagcctcatc 601 ctatcaaaag ccggggggaa gacatcagag tccctctgct caggtcagct ggcacaggtg 661 ggtctccagg cctgggtctc acttccccag agggtgtgtt cgggtggccc caggctgagg 721 gaggaaagcc cacctcccat gtcattttgc aaatggggag tcagggacct agagatggaa 781 agacaacaca gcaagtgagg gatgggttct aggtcccctg caccctgcac cctgcaccct 841 ggccaacgat gtctatttgg caccagatct gcaggctcat ctgggggacc ccaggaccca 901 gaggcagccg ggttgcatct cgaagctgtg agctgcagcc caggaaggtc caggtctggg 961 tggcgctgcc caagcaggct gcaggcccaa ggaggaacaa agatcctctc aaggggtgcg 1021 gagctgaggt tccggtcctg ccaaagccac ttgatgaccc ccaagtgccc ccctttctgc 1081 acctcagaga agagccctca agcctcccag gtcccctcca ggggcacgaa taagccccag 1141 cagggttctg aaggggtccc aggaatctcc ctgtggggat gcggtggagg tggaggaggc 1201 tgcggtggcc tggggacatc tctggtcaca ggtgctggtg gtatgagaga tggggtaggc 1261 accaagcccc ctgcagctgt ggctaggcgg gcctgcagga agggccaggc aggctcctca 1321 gggaccacaa agaacagggg ttttcacacc taggtgggcc tgcatctagc taggccagtc 1381 cccatcaggc cataatgggc acagtgggag gtagaaccat gagtgagaga ggggaggctt 1441 ccagaggcct ggcctgggtc cctgctagat tgagggctct ggctatggta catggatatt 1501 tctgctgtgg aatcaaagga gcaggggatg ctgaatatcc cctctggccc tatgccctgc 1561 tacctgtcct ttcacggaag ggtgtgtgtg tagggggtgc aggaccaggc ctccctgggt 1621 gcatctctgc caccttgccc tttggctcag gtggacctcc accaggtatt cagaactcca 1681 gcccagaaac gcgccaagcc tgtggggcca agacctaggg ggtgggggtg gcctccctcc 1741 cgcctgtagc caaagggtcc tcccttgccc agccaggccc cggtgtcgct tactgctctt 1801 atccacccct ccttcccagg ccggtcctca aggccccagc aaaggaacca agttcccgtg 1861 agcctccgaa aggcgaaggg caggcagcag ccgctggctt ctgcgcccac taggagcttc 1921 ggatgcccga gttagggctg cgccaaggcg gccggagcag agagggagac ggggacgggg 1981 acaggcaggg acaaagtgca agaggcaaaa ctggctgaaa agcagaagtg taggagccgc 2041 caaggggcgg gacgaacagg tccgtgggcc gggcggagcc aagggtgggg gccggggtcc 2101 ctccaggtgg cactcgcggc gctagtcccc agcctcctcc cttcccccgg ccctgattgg 2161 caggcggcct gcgaccagcc gcgaacgcca cagcgccccg ggcgcccagg agaacgcgaa 2221 cggccccccg cgggagcggg cgagtaggag ggggcgccgg gctatatata tagcggctcg 2281 gcctcgggcg ggcctggcgc tcagggaggc gcgcactgct cctcagagtc ccagctccag 2341 ccgcgcgctt tccgcccggc tcgccgctcc atgcagccgg ggtagagccc ggcgcccggg 2401 ggccccgtcg cttgcctccc gcacctcctc ggttgcgcac tcccgcccga ggtcggccgt 2461 gcgctcccgc gggccgccac aggcgcagct ctgcccccca gcttcccggg cgcactgacc 2521 gcctgaccga cgcacggccc tcgggccggg atgtcggggc ccgggacggc cgcggtagcg 2581 ctgctcccgg cggtcctgct ggccttgctg gcgccctggg cgggccgagg gggcgccgcc 2641 gcacccactg cacccaacgg cacgctggag gccgagctgg agcgccgctg ggagagcctg 2701 gtggcgctct cgttggcgcg cctgccggtg gcagcgcagc ccaaggaggc ggccgtccag 2761 agcggcgccg gcgactacct gctgggcatc aagcggctgc ggcggctcta ctgcaacgtg 2821 ggcatcggct tccacctcca ggcgctcccc gacggccgca tcggcggcgc gcacgcggac 2881 acccgcgaca gtgagtggcg cggccaggcg cgaaggggcg ggggcggggg gcaacggccg 2941 ccgggccaac ccgctcagtc acactctgag accctcggcg ggcacctgct cgggggcccc 3001 gggaaccggg gcggactcgg gctccggtcc cttctgacgc ggggctgggg acgcagacac 3061 tcttggctcc ggcagcccag cgcaacccct gaggtcgggc gccgcctccc gccttcagaa 3121 actcgggctc cgagcgccga attccagcgc cttcgcccgt gggcacaggg cgcgcggtgc 3181 agccacaggg ggcccgagac acgcgccccg gcctggccca ggctggggaa ccgctggggt 3241 cgggctcgcg tctgaaggtc cgggactggg tgcggccgcc gggggtcccc tacacaggca 3301 agctaatctg agctagcgca ggcttgggct ccggaggccc tagagggcag cttgggctct 3361 ggaggccctt gggggcggct gcgccgggaa ccctggccct ttatccccaa ccccacccca 3421 gaaatagggt ccccggaggc gaacaagccg aggggcggag tgggccaggg atcacctgcc 3481 ccgcaatgac ctgcgccccg cccccaggcc tgctggagct ctcgcccgtg gagcggggcg 3541 tggtgagcat cttcggcgtg gccagccggt tcttcgtggc catgagcagc aagggcaagc 3601 tctatggctc ggtgagtacc gcaggggtct ggctaggcac ctagttggga acagcggaca 3661 tggctagcag gctcgtggct tctccagccc cacctgtgcc tgggtcttgg aggggtggca 3721 gggtcaccag gtcacgggac cggcaggcct ccccagacaa aggaagcagc cccaaggcag 3781 gaacaatgag gttcctgcca tccctgagtg ggcccctccc agaccgagga aagggcgcta 3841 ttgagagccc ttcccttctc tagtccagag gggtaggtct cagtgttgga actgcgggct 3901 tgaggctgga cacgcaggga atgaattctc tggctgctag gtgcagggca ggtggtgaga 3961 gcaccagctg ttgtgggctg gccatgtccc cttctcaccc tgtgtgggtc ttgacacctt 4021 aactgctcag cagagacatc tcagcccagg gtggggggtg ggacagaagg gggttctgac 4081 ccctggcttc aggctgggta ccttgcccaa gaggtgcccc agccctgaca ctgccctgct 4141 ttgctgcagc ccttcttcac cgatgagtgc acgttcaagg agattctcct tcccaacaac 4201 tacaacgcct acgagtccta caagtacccc ggcatgttca tcgccctgag caagaatggg 4261 aagaccaaga aggggaaccg agtgtcgccc accatgaagg tcacccactt cctccccagg 4321 ctgtgaccct ccagaggacc cttgcctcag cctcgggaag cccctgggag ggcagtgccg 4381 agggtcacct tggtgcactt tcttcggatg aagagtttaa tgcaagagta ggtgtaagat 4441 atttaaatta attatttaaa tgtgtatata ttgccaccaa attatttata gttctgcggg 4501 tgtgtttttt aattttctgg ggggaaaaaa agacaaaaca aaaaaccaac tctgactttt 4561 ctggtgcaac agtggagaat cttaccattg gatttcttta acttgtcaaa agttgtcacg 4621 agtgtgctgc tattctgtgt tttaaaaaaa ggtgacattg gattccgatg tcatcccctg 4681 tagtatggcg tggagcatct ctgtctggaa aggcccgcct gaggcttggg cagccagttc 4741 agggagctcc caggcttggc tctcggctag catcctcaga ggcccactcc ctttgtgccc 4801 tgttgctatt aatcgggaca tatcggttta cttcgggtac agaaagtgcg gtgttgaagt 4861 cctcgctgcc actctgtttt tagatctgcc aagactgacc tttgaacttt cctgtagtca 4921 atcttcctcg atctaccaga tgggagagac ccttggacaa ctttataaac tcctgtttgc 4981 cttttttgga tcagcgacag cccccatcgc tgtgactatt ggggaaaaga cgaagctctt 5041 tcataaattc catggagagg aatcaatatc ccactggaag gctagaaatg gacaagatag 5101 tgtatttgca atcacaaaca aaaccctagt gatgaaaaat aatttgtgat ggcagatgct 5161 tctgatggtg tgatagaata tgtttttgaa aacaaaccat cgaacccccc gccccacccc 5221 caaaacgggc ttccctgtgt ttagggagct ttgggctaga actagctacg atttttaggt 5281 gaaatgtcct tgtaattgta caaagcactt ggtgcagtgt ttgcgtggag cagcctgctg 5341 ctttctgatg cattccctgt ttaagtgcgt ttaacatcta cctcacaagc cctgaaaccc 5401 caggcaaaac ccacagaaag ctcatacccg gtgcaggagt ttgccatccc aagtggcttt 5461 ttttccatat gtagccaaaa aggattgcag atagcgtcgg tgcgtcccat tcgaaccttg 5521 tcacgtttga gctatcttta ccctgtgatt tacttttagt aagggtgatc atggtgaaaa 5581 tatttgcaga cagctgttac agtacactat atggtcacca agtaacctta tatttttctt 5641 tatatatttt acaaatgtaa cccctgtcat tgaagcaacc gtggaagagg cagggtcggt 5701 gatgtttaaa aaaagttccg aggtgatggc aaacatttaa ttttaatgaa tgacttttta 5761 gagtttatac aaaatgacct tagcttgcta ccagaaatgc tccgaatgtt tcgtcaagac 5821 tttaatactc tcctaggatg tttctgaact gtctcccgaa ttaactttat gggagtctac 5881 agacagcaag actggaaaat ctgattggag tttttgtctt tcacattcct tttgaaaact 5941 ctttgttcga atgcaaatca tcgacttaaa atactattct taaccaaggc ctggaagaaa 6001 gaagacactt gcaaagccgc taagacagga ccacacatct taaactgctg ttcctaccat 6061 gcactaaact gtttttaagt tttaaaccac accctaggct ccaggagtgt tcaggaaaga 6121 tggtgtttgt aggtctccat gctgtttggc gttggggggt gtggagggat catccgtcga 6181 ctttctgaat tttaatgtat tcacttagta acaaaccatg attgtcttaa atgccttaaa 6241 ttattatgag atttcttgtc tcagagccca atcagattgt caggaattaa catgtgttag 6301 gtttgatcac ccttgaccac ttcttataga tatttcttca acaaatcatg tgtgatgcct 6361 gtaggaacac aactgtacct ttaaaatatt gttttcatat tgctgtgatg gggattcgag 6421 gttcctgtat gtgccactgt tttcagaatc tgtagtttta tacaggtgcc gaccctcgtt 6481 gtgatgtatg tgctgtgcac attgacatgc tgaccgacaa tgataagcgt ttatcgtgta 6541 taaaaagaca ccactggact ggatgtacac aactgggaaa ggaattaaaa gctattaaaa 6601 ttgtgccttg aaatgc // LOCUS HUMPLPSPC 3409 bp DNA PRI 17-AUG-1998 DEFINITION Human pulmonary surfactant protein C (SP-C) and pulmonary surfactant protein C1 (SP-C1) genes, complete cds. ACCESSION J03890 NID g190089 KEYWORDS alternative splicing; pulmonary surfactant-associated protein SP-C. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3409) AUTHORS Glasser,S.W., Korfhagen,T.R., Perme,C.M., Pilot-Matias,T.J., Kister,S.E. and Whitsett,J.A. TITLE Two SP-C genes encoding human pulmonary surfactant proteolipid JOURNAL J. Biol. Chem. 263 (21), 10326-10331 (1988) MEDLINE 88273133 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by S.W.Glasser, 18-MAY-1988 There are also insertions in the clone lambda-VG524, though what these are is not indicated. They occur at positions 519, 870, 987, 1076, 1146, 2021 (2), 2054, 2058, 2383, 2875, 2915, and 3306 (2). FEATURES Location/Qualifiers source 1..3409 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="VG519" /chromosome="8" variation 524..526 /note="gtc in lambda-VG519; gc in lambda-VG524" /replace="gc" variation 541..543 /note="ctc in lambda-VG519; cc in lambda-VG524" /replace="cc" mRNA join(591..657,1356..1514,1860..1982,2206..2316,2651..3237) /gene="SP-C1" /note="G00-120-373" mRNA join(591..657,1356..1514,1860..1982,2206..2316,2669..3237) /gene="SP-C1" exon 591..657 /gene="SP-C1" /note="G00-120-373" /number=1 gene 591..3237 /note="SFTP2" /gene="SP-C1" CDS join(616..657,1356..1514,1860..1982,2206..2316,2651..2809) /gene="SP-C1" /codon_start=1 /db_xref="GDB:G00-120-373" /product="pulmonary surfactant protein SP-C" /db_xref="PID:g387029" /translation="MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVV LIVVVIVGALLMGLHMSQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLV VYDYQQLLIAYKPAPGTCCYIMKIAPESIPSLEALTRKVHNFQMECSLQAKPAVPTSK LGQAEGRDAGSAPSGGDPAFLGMAVNTLCGEVPLYYI" intron 658..1355 /gene="SP-C1" /note="G00-120-373" /number=1 variation 658..660 /gene="SP-C1" /note="gtg in lambda-VG519; gg in lambda-VG524" /replace="gg" variation 683..685 /gene="SP-C1" /note="tat in lambda-VG519; tt in lambda-VG524" /replace="tt" variation 873 /gene="SP-C1" /note="g in lambda-VG519; a in lambda-VG524" /replace="a" exon 1356..1514 /gene="SP-C1" /note="G00-120-373" /number=2 intron 1515..1859 /gene="SP-C1" /note="G00-120-373" /number=2 exon 1860..1982 /gene="SP-C1" /note="G00-120-373" /number=3 intron 1983..2205 /gene="SP-C1" /note="G00-120-373" /number=3 variation 1992..1994 /gene="SP-C1" /note="cca in lambda-VG519; ca in lambda-VG524" /replace="ca" variation 2073..2075 /gene="SP-C1" /note="cct in lambda-VG519; ct in lambda-VG524" /replace="ct" variation 2083..2085 /gene="SP-C1" /note="act in lambda-VG519; at in lambda-VG524" /replace="at" variation 2088..2090 /gene="SP-C1" /note="ttg in lambda-VG519; tg in lambda-VG524" /replace="tg" variation 2097..2099 /gene="SP-C1" /note="cca in lambda-VG519; ca in lambda-VG524" /replace="ca" variation 2109..2112 /gene="SP-C1" /note="agtc in lambda-VG519; aac in lambda-VG524" /replace="aac" variation 2172..2174 /gene="SP-C1" /note="gga in lambda-VG519; ga in lambda-VG524" /replace="ga" variation 2177..2179 /gene="SP-C1" /note="ccg in lambda-VG519; cg in lambda-VG524" /replace="cg" exon 2206..2316 /gene="SP-C1" /note="G00-120-373" /number=4 intron 2317..2668 /gene="SP-C1" /note="G00-120-373" /number=4 intron 2317..2650 /gene="SP-C1" /number=4 variation 2370 /gene="SP-C1" /note="g in lambda-VG519; a in lambda-VG524" /replace="a" variation 2544 /gene="SP-C1" /note="c in lambda-VG519; t in lambda-VG524" /replace="t" variation 2644 /gene="SP-C1" /note="a in lambda-VG519; g in lambda-VG524" /replace="g" exon 2651..3237 /gene="SP-C1" /note="G00-120-373" /number=5 variation 2810 /gene="SP-C1" /note="g in lambda-VG519; t in lambda-VG524" /replace="t" variation 2813 /gene="SP-C1" /note="c in lambda-VG519; g in lambda-VG524" /replace="g" variation 2868 /gene="SP-C1" /note="g in lambda-VG519; c in lambda-VG524" /replace="c" variation 2875 /gene="SP-C1" /note="t in lambda-VG519; a in lambda-VG524" /replace="a" variation 2879 /gene="SP-C1" /note="c in lambda-VG519; t in lambda-VG524" /replace="t" variation 2881..2882 /gene="SP-C1" /note="gc in lambda-VG519; cg in lambda-VG524" /replace="cg" variation 2892..2894 /gene="SP-C1" /note="ggc in lambda-VG519; gc in lambda-VG524" /replace="gc" variation 2908 /gene="SP-C1" /note="t in lambda-VG519; c in lambda-VG524" /replace="c" variation 3024 /gene="SP-C1" /note="t in lambda-VG519; c in lambda-VG524" /replace="c" variation 3130 /gene="SP-C1" /note="a in lambda-VG519; g in lambda-VG524" /replace="g" variation 3181 /gene="SP-C1" /note="a in lambda-VG519; g in lambda-VG524" /replace="g" variation 3292 /note="c in lambda-VG519; g in lambda-VG524" /replace="g" variation 3316..3317 /note="gc in lambda-VG519; cg in lambda-VG524" /replace="cg" variation 3326 /note="g in lambda-VG519; t in lambda-VG524" /replace="t" BASE COUNT 758 a 938 c 1042 g 671 t ORIGIN 1 ggtaccagat atgtgggagg aggcaaggta agggaaagag tacttgaagt tggaactggt 61 ccttgcaggg aaatgcacat ttatgaaacc ccgaaaactg atgtcaaagc acctcctgcc 121 ttgggcagtc ctctcagagt ctacaggtgc tgcctccaga accctcttcc tggagcgcat 181 ccctatgtat ctagaaattc tgctgggaaa tatgatggtc agacccttgg ccacctgaaa 241 gttcagggtg gtagaagaaa aaggaaagcc acagggcagc aggggcaggt gcagcaagga 301 aggcaggcac gccaggaaga cacccatggg tagaagtgca gatggcccga gggcacagtt 361 tgctcaactc acccaggttt gctcttgctg gggccaagag gactcatgtg ccagggccaa 421 gggctctggg ggctctcaca gggggcttat ctgggcttcg gttctggagg gccaggaaca 481 aacaggcttc aaagcaaggg cttggctggc acacaggggc ttggtccttc acctctgtcc 541 ctctcctacg gacacatata agaccctggt cacacctggg agaggaggag aggagagcat 601 agcacctgca gcaagatgga tgtgggcagc aaagaggtcc tgatggagag cccgccggtg 661 agtgtggttg cgtgtgtgta tgtatgtgcg cgcgcacatg tgtgtgatgg ccctgcctcc 721 tctatcctcc ctggcctgtt tccttatcca gatccattca ctcaactaac ctaggactgt 781 gataagtcag gatggggaca ccaagaccac taagccaggg acccttgggg agctgtttgt 841 ggccaagagc cactataggg gtccgtagaa ctggagtgcg cgtagacagc cctgagtcag 901 aagccatgag aaacttcaga agtcagggga cacttctcag agaaaaacca catacgagct 961 ggagccagaa taaggaggag ctcgcccggt ggagaaggag gaaggcattc caggaaggag 1021 ggagactctg tatcaccgca tggaggtgat cacttgggga gagagagggg ctgaccatgg 1081 ctgggggaag cagcagggag agacaggtga agcaggctct cttgggtccc tcaaaactag 1141 accctgcttc taagcttcta tgtatctatg ggtttgttag aatccaggcc acctcctcca 1201 agaagccttc tctgatctcc tcagcccttc cctgtccatc catcgcatcg gctgtccagc 1261 ctaggagccg tgggagggtg ttcagcttgt atagggagaa gaggggacag cctcatgacc 1321 tcatgcctgt ctccttgcct gccccaccgt gtcaggacta ctccgcagct ccccggggcc 1381 gatttggcat tccctgctgc ccagtgcacc tgaaacgcct tcttatcgtg gtggtggtgg 1441 tggtcctcat cgtcgtggtg attgtgggag ccctgctcat gggtctccac atgagccaga 1501 aacacacgga gatggtgaga ggtgtgggat gcacagcagt gggcacagga catgccagac 1561 agaggggcta ggtgggatgg gcgataggaa actgtccaag gggagtggag gggaggaggc 1621 aaggggcaca gctagaagga aagaggcacg aaccaggcag caacccagct caggcttttc 1681 cacaaggccc ctgcccgcga caggacagcc agctccctcc agcacctggt tccactcagc 1741 ctccctgaac tcttgggaaa gagggaagcg catttgagta cagaggcctg agtatgggga 1801 tgggtaccac tggctgagta ggaaagggga agaccaggtg gctccatgcc tttccccagg 1861 ttctggagat gagcattggg gcgccggaag cccagcaacg cctggccctg agtgagcacc 1921 tggttaccac tgccaccttc tccatcggct ccactggcct cgtggtgtat gactaccagc 1981 aggtgggtat gccagacctc ctgacctgga ccaatgacaa ctgggctctg ctagagcgcc 2041 cagctggcca ctttcattcc acatccatct ctcctctctc agactttttg ctgagcccag 2101 attctagtag tctcccgtgc ccaacctaga gggaggtggc taaggacctg ggtcagggag 2161 agagcagggc aggaccccga atgatctcca gcattctgtg cctagctgct gatcgcctac 2221 aagccagccc ctggcacctg ctgctacatc atgaagatag ctccagagag catccccagt 2281 cttgaggctc tcactagaaa agtccacaac ttccaggtgt gtgtgtgtgg gtgaaaagag 2341 tgggctgtct ccctcccagg ctgctggagg agtgtccgaa tggtggctat ttgtcacctg 2401 taaagcactg ttcctcattg gctgccagct gactgcccct ctcctattcc cctgcacgac 2461 tcctttcctt cccaccccac tgccaagctg ctgggctcag ctgagtccac tcactacctg 2521 gtggcttctg actctagcac agcccctctt tactgatgag aaaactgagg ctcagagaga 2581 ttgcctgata tacctgaagt cccacaataa gggctgcaca tgggatagaa actcacttcc 2641 tacattccag atggaatgct ctctgcaggc caagcccgca gtgcctacgt ctaagctggg 2701 ccaggcagag gggcgagatg caggctcagc accctccgga ggggacccgg ccttcctggg 2761 catggccgtg aacaccctgt gtggcgaggt gccgctctac tacatctagg acgcctccgg 2821 tgagcaggtg tgatcccagg gcccctgatc agcagcggag gagcgctggc cacctgcccg 2881 gctgtggagg aggctcgctg accaggctgg ggcgtccact gaagcggggt catccaggca 2941 actcggggga ggggaagctc acagaccggt acttcccact cccctgaatt ctctctgtcc 3001 atcctcaaca ttcctttgct tcatagggtc agtggaagcc ccaacggaaa ggaaacgccc 3061 cgggcaaagg gtcttttgca gcttttgcag acgggcaaga agctgcttct gcccacaccg 3121 cagggacaaa ccctggagaa atgggagctt ggggagagga tgggagtggg cagaggtggc 3181 acccaggggc ccgggaactc ctgccacaac agaataaagc agcctgattt gaaaagcaaa 3241 gggtctgctt ctgtcttcct gcagggcgca gtcctcgctg gcggggccgg ccaagaaggg 3301 aagggccttg ggagagcaaa gtggggtttc cattcgccct ctgtcccagg gcgctggcac 3361 tgtccacctc ggcggggaga ggggctcgca gggagcatcc acgggcttt // LOCUS HSCOSEG 3521 bp DNA PRI 15-JAN-1992 DEFINITION H.sapiens coseg gene for vasopressin-neurophysin precursor. ACCESSION X62890 NID g30137 KEYWORDS neurophysin; vasopressin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3521) AUTHORS Schmale,H. TITLE Direct Submission JOURNAL Submitted (28-OCT-1991) H. Schmale, Universitaet Hamburg, Universitaetskrankenhaus Eppendorf, Inst. fuer Zellbiol. u. klin. Neurobiol., Martinistr. 52, 2000 Hamburg 20, FRG REFERENCE 2 (bases 1 to 3521) AUTHORS Bahnsen,U., Oosting,P., Swaab,D.F., Nahke,P., Richter,D. and Schmale,H. TITLE A missense mutation in the vasopressin-neurophysin precursor gene cosegregates with human autosomal dominant neurohypophyseal diabetes insipidus JOURNAL EMBO J. 11 (1), 19-23 (1992) MEDLINE 92155158 COMMENT See also x62891. FEATURES Location/Qualifiers source 1..3521 /organism="Homo sapiens" /isolate="Diabetes insipidus patient IV-3" /db_xref="taxon:9606" /tissue_type="blood" /cell_type="leukocyte" /clone_lib="subgenomic plasmid library" /clone="IV-3-2" /chromosome="20" TATA_signal 1061..1066 gene 1139..3243 /gene="coseg" CDS join(1139..1258,2635..2836,3011..3183) /gene="coseg" /codon_start=1 /product="vasopressin-neurophysin precursor" /db_xref="PID:g30138" /db_xref="SWISS-PROT:P01185" /translation="MPDTMLPACFLGLLAFSSACYFQNCPRGGKRAMSDLELRQCLPC GPGGKGRCFGPSICCADELGCFVGTAEALRCQEENYLPSPCQSGQKACGSGGRCAAFG VCCNDESCVTEPECREGFHRRARASDRSNATQLDGPAGALLLRLVQLAGAPEPFEPAQ PDAY" exon 1139..1258 /gene="coseg" /number=1 mRNA join(1139..1258,2635..2836,3011..3243) /gene="coseg" intron 1259..2634 /gene="coseg" /number=1 exon 2635..2836 /gene="coseg" /number=2 allele 2657 /gene="coseg" /note="nucleotide different from mutant allele in ADNDI patient" intron 2837..3010 /gene="coseg" /number=2 exon 3011..3243 /gene="coseg" /number=3 3'UTR 3184..3243 /gene="coseg" BASE COUNT 592 a 1158 c 1167 g 604 t ORIGIN 1 ggatccgggc ctgaacccca gaccagcggg cattctggtg gcccagggag agaggccatg 61 tcctctttag acctgccacc ttgggattag ggaccaaaag tggctttctc aggctgggtc 121 catttaggta tggctcctcc tgcaccttcc cccaggccca gacacccccc acccccggct 181 ctcccacccc aggcctgaga tgtacctgcc ctggttgcca catactaagg tcctggtggg 241 ggtaggaatt gggacaaact gttgtcaggt ttcctggggc tccccccgcc tttgaacctt 301 acttggcatg gagttctttc tccccatagt agctctccac actcatgttt ctacctggga 361 gggggtgacg ggtcatgtgc caatgtgtcc ccaaggcccc cagtttcagg gcctgagtcc 421 ccatgccctg ctctatgggg gtgcgggggt gtcatgtctt gcactttccc cagcaacctg 481 ccacccccct accctgtgag aacaagcccc tctcattctg tgtcccttga ctccacgacg 541 gcggctgcct tgtccacgag gcagggattc tcatctctgg tgcagcccct gtcctcacca 601 ctctgggctt gtgctgttct tggtccctta tcttgttgtc acagggctaa ccctccccac 661 acacctggct gtcccctgcc ccacagctcc taggccaggg cctgtccctg cccctaacac 721 gcagcccctc ggttcctctt accctcttct tggaccctac gccatttctg ggtaatcaga 781 aagctcagag atggtctcca ggtgaccccc cagtctgtcc ctctccctga atccagagcg 841 ctgcagtcac agtagaggca attgctgtca ttccgcggcc catgacagcc tggcggcccg 901 tacccctccc ccatgatccc ctgcacagac aggcccacgt gtgtccccag atgcctgaat 961 cactgctgac ggctggggac ctggcggccg tgggctcctg gggagccact ggggaggggg 1021 tggcggccgc gtctcgcctc cacgggaaca cctgcggaca taaataggca gccagcagag 1081 gcagcagcac agagccacca agcagtgctg catacggggt ccacctgtgt gcaccaggat 1141 gcctgacacc atgctgcccg cctgcttcct cggcctactg gccttctcct ccgcgtgcta 1201 cttccagaac tgcccgaggg gcggcaagag ggccatgtcc gacctggagc tgagacaggt 1261 acttcccact gtgggccatc tcagggctgc catagcgggc agtgctgaca ccctgggtca 1321 ggggctagga aagagggaag tcatgggtgg tggtagcctt taggggaagt tcgggggagg 1381 aagagggagg catggcatgg ctgggcagag gagccaatgg ggtgggccag aggggaccag 1441 gctttggagg aggctgggag aggctgaagg cgctcctggt cactgtcgcc atccagacag 1501 ggatgcagga aaatgaggga tgcttccccg gtgactgggc ttggggctgg atagggagaa 1561 cggggcatca tggcctcccc tgtgcccatg gcgttcttgc atctggactg gctggggcag 1621 cagaggctcc atcctaccta gcattggagg ctttcctcat ccagccccag cctcccagcc 1681 acaggcgccc aggcccccac acagaagatg gccactggtc tgagcgcgct tgagtggggc 1741 atcctgtggg gaagttctgc tgggaacctg gcctaattct atagtgctgg acgtttcctc 1801 catttccagc agagctgaag gaaatccaat cacgatgtgc atgcaattct gtccaggctc 1861 aatgatgagc ccttgagcaa attagaccac accaggctca gctaaaagtc taatgcgcta 1921 tccattgcgc cagagaaccg gctgttgagc agatgagagt ggccgctcgg caacccccgc 1981 agcctctctt cctcctgcta ggctccttta gggtcctgag gcacctgggt gtccgtgctc 2041 gcctctaggt ctcaggcccc tgccacccac ctgataggtc ataggtggct gagcaggggt 2101 cagggctcca gctgaggccg acaagcttgg cgggggccag gggcaaggca agagaggaga 2161 caggaaatgg gaagggccgg ggttctggat gggtagggcc tctccgcatg gtgtagtggg 2221 gaagggggtg ggcccgggct caagccgcag cagggcgagg aggaaggagg aagggtctgg 2281 agtggtggag ggtggggcag ctgcaacagt ggcgcccacc agcgatgacc ccgaggctcg 2341 aggaagggct ccccacgctg tagtccacgg gagacccgtc cctagctgag ggtgaggacg 2401 ctgagggctg tcaccgagag gtcatccaag aaaccaaggt gccgagcaga tctggacgcc 2461 ccgcccgtga ccgcggtcga ggcccagtgg cgcccgagcg tgcctgcagc cgcagccccg 2521 gtgtcccgcc cgcactccga gccctggacc ccagcatccc cgcctcgctg cgttcccctc 2581 caacccctcg actcccggct cccctcctcc cgctcacccc gcccgtcccc gcagtgcctc 2641 ccctgcggcc ccgggggcaa aggccgctgc ttcgggccca gcatctgctg cgcggacgag 2701 ctgggctgct tcgtgggcac ggctgaggcg ctgcgctgcc aggaggagaa ctacctgccg 2761 tcgccctgcc agtccggcca gaaggcgtgc gggagcgggg gccgctgcgc cgccttcggc 2821 gtttgctgca acgacggtgc gcggcggggg cgggcctggg gctggggggg gcgcagaccg 2881 cttgggtggg ggggacgcgg gcctgcggcg gggtgggggc tgcgtcgggc ccggcaggga 2941 gggtgtgggc cccccgcacc ccgagctgcg cccgccccag ggcgcccgtg ctcacacgtc 3001 ctcccggcag agagctgcgt gaccgagccc gagtgccgcg agggctttca ccgccgcgcc 3061 cgcgccagcg accggagcaa cgccacgcag ctggacgggc cggccggggc cttgctgctg 3121 cggctggtgc agctggccgg ggcgcccgag cccttcgagc ccgcccagcc cgacgcctac 3181 tgagccccgc gctcgcccca ccggcgcgct cttcgcgccc gcccctgcag cacggacaat 3241 aaacctccgc caatgcacgg cctcgcgtct gtctcagtct ctggcgggaa gagggaaggg 3301 gagagaggtg ggagcgcgga cccccgccac cacgcccacc ggccagtccc cggacctgag 3361 gtcgtgggca gatccacccc agagaagcaa caggtcccgt agaggaagcg atctgggacc 3421 cgcagaggtg tcgctagacc gagggacagg gcgaattggg aggcagggga gggggagacc 3481 agaggccgag agtggccttg gagggggtgg gttgaggatc c // LOCUS HSB3A 3683 bp DNA PRI 18-MAY-1993 DEFINITION H.sapiens gene for beta-3-adrenergic receptor. ACCESSION X72861 NID g298094 KEYWORDS beta-3-adrenergic-receptor; promoter; transmembrane receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3683) AUTHORS Emorine,L.J. TITLE Direct Submission JOURNAL Submitted (05-MAR-1993) L.J. Emorine, Inst. Cochin de Genetique Moleculaire, CNRS UPR 0415, 22 rue Mechain, 75014, Paris, FRANCE REFERENCE 2 (bases 1 to 661; 1817 to 3683) AUTHORS van Spronsen,A., Nahmias,C., Krief,S., Briend-Sutren,M.M., Strosberg,A.D. and Emorine,L.J. TITLE The promoter and intron/exon structure of the human and mouse beta 3-adrenergic-receptor genes JOURNAL Eur. J. Biochem. 213 (3), 1117-1124 (1993) MEDLINE 93279311 REFERENCE 3 (bases 1 to 3683) AUTHORS Emorine,L.J., Marullo,S., Briend-Sutren,M.M., Patey,G., Tate,K., Delavier-Klutchko,C. and Strosberg,A.D. TITLE Molecular characterization of the human beta 3-adrenergic receptor JOURNAL Science 245 (4922), 1118-1121 (1989) MEDLINE 89368947 COMMENT Related sequences: M29932 & M62473. FEATURES Location/Qualifiers source 1..3683 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="8p11.1-8p12" /chromosome="8" protein_bind 15..22 /note="CRE element" /bound_moiety="receptor" protein_bind 119..126 /note="CRE element" /bound_moiety="receptor" protein_bind 254..259 /note="GRE element" /bound_moiety="glucocorticoid receptor" protein_bind 264..270 /note="AP-1 site" /bound_moiety="AP-1" CAAT_signal complement(365..369) CAAT_signal 386..390 mRNA join(441..1842,2868..>3525) exon 441..1842 /number=1 misc_feature 441..492 /note="CAP site" CDS join(638..1842,2868..2889) /codon_start=1 /product="beta-3-adrenergic receptor" /db_xref="PID:g298095" /db_xref="SWISS-PROT:P13945" /translation="MAPWPHENSSLAPWPDLPTLAPNTANTSGLPGVPWEAALAGALL ALAVLATVGGNLLVIVAIAWTPRLQTMTNVFVTSLAAADLVMGLLVVPPAATLALTGH WPLGATGCELWTSVDVLCVTASIETLCALAVDRYLAVTNPLRYGALVTKRCARTAVVL VWVVSAAVSFAPIMSQWWRVGADAEAQRCHSNPRCCAFASNMPYVLLSSSVSFYLPLL VMLFVYARVFVVATRQLRLLRGELGRFPPEESPPAPSRSLAPAPVGTCAPPEGVPACG RRPARLLPLREHRALCTLGLIMGTFTLCWLPFFLANVLRALGGPSLVPGPAFLALNWL GYANSAFNPLIYCRSPDFRSAFRRLLCRCGRRLPPEPCAAARPALFPSGVPAARSSPA QPRLCQRLDGASWGVS" intron 1843..2867 /number=1 exon 2868..>3525 /number=2 polyA_signal 3520..3525 terminator 3567..3576 BASE COUNT 604 a 1147 c 1060 g 872 t ORIGIN 1 agatctcacc aagctgaggt cttgggagag gagatactgg ctgagcccta ttacttaatt 61 taaaatacct taggggaggc cacccaagtg gatgcggggc tcctgtgaat cctttgcttg 121 actccagcgg gttacctttg cctctgatac ataaagggtg gggatgggag cgctctcctc 181 tctccttccc ctgccttgct gtgggaactt ctgggaaagg aggtgcaggg ctccaggaag 241 ccagtgccca gggagtgcta tgctgagtcc aggagcctgg ccacggcagg ggtggacaga 301 tggtggcaga ggaaccacgg tgtcccttcc tccagattta gctaaaggaa acgtggagca 361 tcccattggc catcctcccc actctccaat tcggctccag aggcccctcc agactatagg 421 cagctgcccc tttaagcgtc gctactcctc ccccaagagc ggtggcaccg agggagttgg 481 ggtgggggga ggctgagcgc tctggctggg acagctagag aagatggccc aggctgggga 541 agtcgctctc atgccttgct gtcccctccc ctgagccagg tgatttggga gaccccctcc 601 ttccttcttt ccctaccgcc ccacgcgcga cccggggatg gctccgtggc ctcacgagaa 661 cagctctctt gccccatggc cggacctccc caccctggcg cccaataccg ccaacaccag 721 tgggctgcca ggggttccgt gggaggcggc cctagccggg gccctgctgg cgctggcggt 781 gctggccacc gtgggaggca acctgctggt catcgtggcc atcgcctgga ctccgagact 841 ccagaccatg accaacgtgt tcgtgacttc gctggccgca gccgacctgg tgatgggact 901 cctggtggtg ccgccggcgg ccaccttggc gctgactggc cactggccgt tgggcgccac 961 tggctgcgag ctgtggacct cggtggacgt gctgtgtgtg accgccagca tcgaaaccct 1021 gtgcgccctg gccgtggacc gctacctggc tgtgaccaac ccgctgcgtt acggcgcact 1081 ggtcaccaag cgctgcgccc ggacagctgt ggtcctggtg tgggtcgtgt cggccgcggt 1141 gtcgtttgcg cccatcatga gccagtggtg gcgcgtaggg gccgacgccg aggcgcagcg 1201 ctgccactcc aacccgcgct gctgtgcctt cgcctccaac atgccctacg tgctgctgtc 1261 ctcctccgtc tccttctacc ttcctcttct cgtgatgctc ttcgtctacg cgcgggtttt 1321 cgtggtggct acgcgccagc tgcgcttgct gcgcggggag ctgggccgct ttccgcccga 1381 ggagtctccg ccggcgccgt cgcgctctct ggccccggcc ccggtgggga cgtgcgctcc 1441 gcccgaaggg gtgcccgcct gcggccggcg gcccgcgcgc ctcctgcctc tccgggaaca 1501 ccgggccctg tgcaccttgg gtctcatcat gggcaccttc actctctgct ggttgccctt 1561 ctttctggcc aacgtgctgc gcgccctggg gggcccctct ctagtcccgg gcccggcttt 1621 ccttgccctg aactggctag gttatgccaa ttctgccttc aacccgctca tctactgccg 1681 cagcccggac tttcgcagcg ccttccgccg tcttctgtgc cgctgcggcc gtcgcctgcc 1741 tccggagccc tgcgccgccg cccgcccggc cctcttcccc tcgggcgttc ctgcggcccg 1801 gagcagccca gcgcagccca ggctttgcca acggctcgac gggtaggtaa ccggggcaga 1861 gggaccggcg gctcagggtc gggaagcatg cgatgtgtcc gtgggtcaac tttttgagtg 1921 tggagtttat taagagaagg tgggatggct ttgcttggag agaaaaggga acgaggagta 1981 gcgaaccaaa atgggaccca gggtcctttt ctttccggat ccagtcacta gggtagaagc 2041 aaaggagggc gagcgggccg tcgttcctca cccaaggacc caaggtgcgc caccggaaag 2101 cgctgcggtg tcccgaggac tctcgcctcg cctggtcggc tttagggatt tttttttttt 2161 ttaaatagag acagggtttc gtctctgtcg cccacgcggg aatgcagtgg tgcgatctca 2221 gctcactgca gtcttgaact cctggctcct gggctcaagc gatcctccca cctcagcctc 2281 ctgagtatct gggactacag gcgagcccca ccaatcccag ctatttttaa aatttcttgt 2341 agagatgggg tcttgctatg ttgcccaggc ttgtcttgaa cttctggcct caagtgatcc 2401 ttctgcctca gccttccaaa gcattaggat tacaggccgg agccagggcg ccgggtcggc 2461 tctagttttg gttttccagc tcagttcttt gcccccctcc cccgatttct tgccatcact 2521 agacctggct cggacttgaa ggcagggcta gtgccccccc acccgccccc caagccctcg 2581 gcctcagttc tgggttttct caaaggtttg acagctgtgg aggtgagaat ccacttccgg 2641 tatgaagtac agttgtgagt gaggagcctg tgagtgcaga tgtgtgccct cccgctccct 2701 gggctgggtt ggagtaggga tggggtgggg cgtgtgtggc tgggtggtgc cctggcgttt 2761 ttgtgtaact aaatatgcgt tccagggtct ctgatctctg tcattcccct cagtgcacct 2821 gttgctcctt tcaccccagg gtctattatc tccacttttt ttcccagggc ttcttgggga 2881 gtttcttagg cctgaaggac aagaagcaac aactctgttg atcagaacct gtggaaaacc 2941 tctggcctct gttcagaatg agtcccatgg gattccccgg ctgtgacact ctaccctcca 3001 gaacctgacg actgggccat gtgacccaag gagggatcct taccaagtgg gttttcacca 3061 tcctcttgct ctctgtctga gagatgtttt ctaaacccca gccttgaact tcactcctcc 3121 ctcagtggta gtgtccaggt gccgtggagc agcaggctgg ctttggtagg ggcacccatc 3181 acccggcttg cctgtgcagt cagtgagtgc ttagggcaaa gagagctccc ctggttccat 3241 tccttctgcc acccaaaccc tgatgagacc ttagtgttct ccaggctctg tggcccaggc 3301 tgagagcagc agggtagaaa agaccaagat ttggggtttt atctctggtt cccttattac 3361 tgctctcaag cagtggcctc tctcacttta gccatggaat ggctccgatc tacctcacag 3421 cagtgtcaga aggacttcgc cagggttttg ggagctccag ggttcataag aaggtgaacc 3481 attagaacag atcccttctt ttccttttgc aatcagataa ataaatatca ctgaatgcag 3541 ttcatcctcg gccccctttc cctccgtttg ttttcttttc ataatccact tactcccttc 3601 ccttctactc tgctggcttt tgacagaggc gtaaattagg cctaatcctc actcttttct 3661 tcctaatgtt catcaaagaa aaa // LOCUS HUMLORI 3321 bp DNA PRI 06-OCT-1992 DEFINITION Human loricrin gene exons 1 and 2, complete cds. ACCESSION M94077 NID g187186 KEYWORDS loricrin. SOURCE Homo sapiens (library: EMBL3 from Dr. Gonzales, NCI, NIH) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3321) AUTHORS Yoneda,K., Hohl,D., McBride,O.W., Wang,M.G., Cehrs,K.U, Idler,W.W. and Steinert,P.M. TITLE The human loricrin gene JOURNAL J. Biol. Chem. 267, 18060-18066 (1992) MEDLINE 92388173 FEATURES Location/Qualifiers source 1..3321 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="EMBL3 from Dr. Gonzales, NCI, NIH" CAAT_signal 235..237 /note="first putative CAAT site" CAAT_signal 248..250 /note="second putative CAAT site" TATA_signal 339..346 exon 378..416 /number=1 intron 417..1604 exon 1605..2814 /number=2 CDS 1628..2578 /codon_start=1 /product="loricrin" /db_xref="PID:g187187" /translation="MSYQKKQPTPQPPVDCVKTSGGGGGGGGTGGGGCGFFGGGGSGG GSSGSGCGYSGGGGYSGGGCGGGSSGGGGGGGIGGCGGGSGGSVKYSGGGGSSGGGSG CFSSGGGGSGCFSSGGGGSSGGGSGCFSSGGGGSSGGGSGCFSSGGGGFSGQAVQCQS YGGVSSGGSSGGGSGCFSSGGGGGSVCGYSGGGSGGGSGCGGGSSGGSGSGYVSSQQV TQTSCAPQPSYGGGSSGGGGSGGSGCFSSGGGGGSSGCGGGSSGIGSGCIISGGGSVC GGGSSGGGGGGSSVGGSGSGKGVPICHQTQQKQAPTWPSK" polyA_signal 2791..2796 BASE COUNT 687 a 895 c 1008 g 731 t ORIGIN 1 agatcttcag tttgactctc ttagggcacc tcaagactct gggacctatt cctcaagcac 61 agccctgtgg tcacctggta tgcgtcctgt gccagaggtt ttggaaacaa tgtctgccat 121 ccactctgac tgggtgaccc cactgatgag cctgccacac tgttgcatca gagaaggggc 181 cagtcacaca ccaggctgcc catctcaaga atgccaaaac cttcatgaat gggccatcct 241 gtgcctgcat cacagggagg tggggccgac agccacgggt cacgtaactg aggccaaaca 301 caagaagctg gcctggatca atgagtcagg gagagctcta tatataacct caggagatca 361 gtcgtcctca cattgccagc atcttctctc ctcactcacc cttcctggtg ctttgggtaa 421 gtgtggttct actgactctc tcattttccc agctggtctt gcccaggcct gactagatta 481 gatggaccag ggcctctttc ccctttggga gctataggac ctctgccttc ccaaaagcac 541 tcacatttag agggcggtca ggaaaggagc aggggatgag ctgctgccat gcagatggtg 601 tttctaggtc ttctggccag aatgtaaact ccacaaagac aagactatct cctgcctctc 661 tggcacccgc atagggcagg catggtgccg ggcacagaag gactctgcag aggctgtcca 721 aggcagcctg tgcacaggct gagcagacct tgtgaacctg tcaggaggag aggctgagcc 781 actctcaaga gagtagggag aatgatccaa aaaagttgca gatgggaggg atttcaggtg 841 ccacagaaat ccaattgctg ttttacagag ttagaagttc tgagagggaa atacaacttc 901 ctactagtca aagtcccttc taccaaactt ggacttgaga aaatcatgga agagatggct 961 aatagcttct ggtcggggat gttggtccaa aggaccacca gggtccttcc cctgtttcca 1021 ggcagggcca cagcagagcc tgtcttttct agtgactagc ccttggttca tggcctagct 1081 cagtgggaca gaccaagagt ttgaactaag agcttctgca gaggatggag tgcaacagcc 1141 ctcaatagaa tgaagtccga caaacctctc tctgttgtgc tgggaatggg taaaatctcc 1201 tatcctaggt agaggtttgt ggtagttttc tatttgcact cagagaatca gtgttgagat 1261 tggaagaggc ctgaaagatg aagtgttcaa accctttcat ttaacaggaa atgagaaaga 1321 ggcgggcgtg tgggtgatct gccctcaaat cacacaacag tggtcagagg cagagctggg 1381 agaaagaatg ggcactgcta actgggctgt ttaaagcata atgaaggctt tctgcagggg 1441 aatgaggaac tcaacttttc caaaagcagc aatcataaga aacccgctga ggctctggca 1501 cctgaaggag ccctgggagg atggcgatgt tgcctatgga tgcagctgcc tccgtaagta 1561 tcctcttgca gctggtccag cgatgctctc ttctcccctt ccaggctctc cttccttctc 1621 agacaagatg tcttatcaga aaaagcagcc cacccctcag cccccagtgg actgcgtgaa 1681 gacctctggc ggcggtggcg gtggcggcgg cacgggcggt ggtggctgcg gcttcttcgg 1741 cggcggcggc tcagggggcg gtagcagcgg ttctggctgc ggctactccg gcggcggtgg 1801 ctactctggc ggcggctgcg gcgggggctc ctccggcggc gggggcgggg gcggcattgg 1861 aggctgcgga gggggctccg gtgggagcgt caagtactcc ggaggcggcg gctcctccgg 1921 cgggggctct ggctgtttct ccagcggtgg gggcggctcc ggctgcttct cctccggtgg 1981 cggcggctcc tccgggggcg gctccggctg cttctccagc ggtgggggcg gctcctccgg 2041 gggcggctcc ggctgcttct cctccggcgg cggcggcttc tcgggccagg cggtccagtg 2101 ccagagctac ggaggcgtct ctagcggcgg ctcctccggg ggcggctccg gctgcttctc 2161 cagcggcggg ggcggcggct ctgtctgcgg ctactctggc ggcggctctg gcggcggctc 2221 tggctgcggc ggaggctcct ctggcggcag cggctccggc tacgtctcct cgcagcaggt 2281 cactcagacc tcgtgcgcgc cccagccgag ttacggaggg gggtcgtccg gcggcggcgg 2341 cagcggcgga agcggctgct tctccagcgg cgggggcggc gggagctccg gctgcggcgg 2401 cggctcctcc gggattggca gcggctgcat catcagtggc gggggctccg tctgcggagg 2461 tggttcctct ggaggcggcg gcggcggctc ctccgtgggt ggctccggga gtggcaaggg 2521 cgtcccgatc tgccaccaga cccagcagaa gcaggcgcct acctggccgt ccaaatagat 2581 cccccagggt accacggagg cgaaggagtt ggaggtgttt tccaggggca ccgatgggct 2641 tagagctctc atgatgctac ccgaggtttg caaatccttc atgtcttaac ctacctggaa 2701 gaagccattg agctctccgg ctgcatctag ttctgctgtt tagcctcttt ggtttctgta 2761 caactacctc ccaaccccag tgcctcagtc aataaatttg caaattcatg agaatcttta 2821 ggtctccaag agtatttgta gtgttcaagt tcatccattc cattccattc ttctcctttc 2881 ctgcattcgc tcattcacac ttacattcat tttacaaatg cacgcactct attggcggaa 2941 aggtgagtgg atagatggga gtcaccagga gagggaggaa tggctggcaa acgacggtca 3001 aacatatgaa cattctagtg tacggcttaa aatcatgacg tgatcccaaa acacaaagat 3061 cctgaggaga gacttcagta aacacctccc tgtggattga gtagctactg tgggtctggt 3121 tactgaactc tgttttttaa agacaggctg ccctggaaag acgctgtctc tcacagtcca 3181 aagccaattc tgcaaggtct gaaagctggg tggctgcaca ttaggagcac cagctcatgc 3241 tactcagact cacggagaaa taaaaagcat atcagatgtt tacgggcccc tagggagagc 3301 aagaccactt tagtaggtac c // LOCUS HSU37055 9980 bp DNA PRI 16-MAY-1996 DEFINITION Human hepatocyte growth factor-like protein gene, complete cds. ACCESSION U37055 M74179 NID g1311660 KEYWORDS hepatocyte growth factor-like protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9980) AUTHORS Waltz,S.E., Gould,F.K., Air,E.L., McDowell,S.A. and Degen,S.J. TITLE Hepatocyte nuclear factor-4 is responsible for the liver-specific expression of the gene coding for hepatocyte growth factor-like protein JOURNAL J. Biol. Chem. 271 (15), 9024-9032 (1996) MEDLINE 96224125 REFERENCE 2 (bases 3882 to 9980) AUTHORS Han,S., Stuart,L.A. and Degen,S.J. TITLE Characterization of the DNF15S2 locus on human chromosome 3: identification of a gene coding for four kringle domains with homology to hepatocyte growth factor JOURNAL Biochemistry 30 (40), 9768-9780 (1991) MEDLINE 92002016 REFERENCE 3 (bases 1 to 9980) AUTHORS Degen,S.J.F. TITLE Direct Submission JOURNAL Submitted (26-SEP-1995) Sandra J. F. Degen, Division of Developmental Biology, Children's Hospital Research Foundation, 3333 Burnet Avenue, Cincinnati, OH 45229-3039, USA COMMENT On May 10, 1996 this sequence version replaced gi:183978. FEATURES Location/Qualifiers source 1..9980 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3p21" /clone_lib="EMBL-3 SP6/T7 from Clontech" /chromosome="3" /tissue_type="Placenta" promoter 1..3881 /note="5' flanking sequence" repeat_region 8..133 /note="repetitive sequence" repeat_region 241..495 /note="repetitive sequence" repeat_region 542..843 /rpt_family="Alu" source 3882..9980 /organism="Homo sapiens" /map="3p21" /chromosome="3" mRNA join(4078..4205,4903..5050,5131..5243,5321..5435, 5513..5649,5729..5849,5994..6112,6315..6483,6604..6734, 6832..6934,7062..7198,7288..7323,7412..7532,7708..7785, 7913..8059,8141..8247,8343..8482,8602..8843) /product="hepatocyte growth factor-like protein" exon 4078..4205 /number=1 CDS join(4154..4205,4903..5050,5131..5243,5321..5435, 5513..5649,5729..5849,5994..6112,6315..6483,6604..6734, 6832..6934,7062..7198,7288..7323,7412..7532,7708..7785, 7913..8059,8141..8247,8343..8482,8602..8763) /codon_start=1 /product="hepatocyte growth factor-like protein" /db_xref="PID:g1311661" /translation="MGWLPLLLLLTQCLGVPGQRSPLNDFQVLRGTELQHLLHAVVPG PWQEDVADAEECAGRCGPLMDCRAFHYNVSSHGCQLLPWTQHSPHTRLRRSGRCDLFQ KKDYVRTCIMNNGVGYRGTMATTVGGLPCQAWSHKFPNDHKYTPTLRNGLEENFCRNP DGDPGGPWCYTTDPAVRFQSCGIKSCREAACVWCNGEEYRGAVDRTESGRECQRWDLQ HPHQHPFEPGKFLDQGLDDNYCRNPDGSERPWCYTTDPQIEREFCDLPRCGSEAQPRQ EATTVSCFRGKGEGYRGTANTTTAGVPCQRWDAQIPHQHRFTPEKYACKDLRENFCRN PDGSEAPWCFTLRPGMRAAFCYQIRRCTDDVRPQDCYHGAGEQYRGTVSKTRKGVQCQ RWSAETPHKPQFTFTSEPHAQLEENFCRNPDGDSHGPWCYTMDPRTPFDYCALRRCAD DQPPSILDPPDQVQFEKCGKRVDRLDQRRSKLRVVGGHPGNSPWTVSLRNRQGQHFCG GSLVKEQWILTARQCFSSCHMPLTGYEVWLGTLFQNPQHGEPSLQRVPVAKMVCGPSG SQLVLLKLERSVTLNQRVALICLPPEWYVVPPGTKCEIAGWGETKGTGNDTVLNVALL NVISNQECNIKHRGRVRESEMCTEGLLAPVGACEGDYGGPLACFTHNCWVLEGIIIPN RVCARSRWPAVFTRVSVFVDWIHKVMRLG" exon 4903..5050 /number=2 exon 5131..5243 /number=3 exon 5321..5435 /number=4 exon 5513..5649 /number=5 exon 5729..5849 /number=6 exon 5994..6112 /number=7 exon 6315..6483 /number=8 exon 6604..6734 /number=9 exon 6832..6934 /number=10 exon 7062..7198 /number=11 exon 7288..7323 /number=12 exon 7412..7532 /number=13 exon 7708..7785 /number=14 exon 7913..8059 /number=15 exon 8141..8247 /number=16 exon 8343..8482 /number=17 exon 8602..8843 /number=18 polyA_signal 8814..8819 BASE COUNT 2162 a 2924 c 2808 g 2086 t ORIGIN 1 ggatcctgta atatgcaaca acatggatgg aactggaggt cattatgtta aatgaaataa 61 gccaggcaga aaaagacaaa cattgcatgt tatcacttat ttgtgacagc taaaaattaa 121 aacaattgaa ctcatagagg caaagtactg gatggttacc agaggcttgg aagggtagta 181 cagggcttgt ggggagatgg ggatggctaa tgcgtacaaa aaaattgaat taatgagacc 241 tatctggtag cagaacagag tgactatagt aaataataat tgtacattta aaaataacta 301 aaagagtata actagattgt ttataagaca aaagataaat gcttgagggg atggataccc 361 catttaccat gatgtgatta ttacgcattg catgcctgta tcaaagtatc tcatgtaccc 421 tataaatata tatacctact acatacccac aaaaattaaa aattaaaaat gtaacacaaa 481 gaaagaaaca gaaaagaatc acatttgaat agtcatgtta aaataacaag atagcaggtg 541 cttttccttc tttttttaga gacagggtct tgctctgtca cccaggctac agtgcagtgg 601 catgcagagg ccaactcact gcagcctcaa cctcctaggc tcaagtgatc ctcccacatc 661 agcctcccta acagctggga ctacaggtgc acaccaccat gcccaggtaa tttctgtatt 721 ttattgtaga gatggggtct tgccatgttg ctcaggctgg tcttgaactc ctggactcaa 781 gcaatctacc caccttggcc tcccaaagtg ctgggattac aggtgtgaac cactgtgcct 841 ggccctctaa gtttctgcta aaatgtactt gtaactttta tactgactac acttaaaagt 901 attaaacttg taaagaaaag gccctggagg agtttgttct ttacaaggtc agtcagaagc 961 tacaaggggc caggctgggg ccaaggtgag ggcaggtaca tgcaggggca gatgctgggt 1021 atggagtagg tgccaggtgc tagctgggca tgaatgtgca cccgctccct gacctctgct 1081 ccctgactct cagaaggaac tagcactgcc tggagacagg aagaagcaga gtacggaagg 1141 cagactctct ggccagagat ctcgcagcca ggagcacagt gagaacagag cctaagccta 1201 tctgtccaga gcccaggctc cctttcccta caactgcagc ctccttattc tggtctggcc 1261 cagcagagtg aggaggggcc acccacccag ccatacctgc tggtggctgc tcggggtgca 1321 tgttcagaag aggaaaagat gcggttcagg tagtcattca gcagcttctc ctgcacaatg 1381 cctgagccac aagtcagaag ggagaaaagg gccacatgat cagtgcagct cctggcccct 1441 gctggcccca gagacttctg gtcctaggcc agccacaggt gcagcaaggc ctgaggcctt 1501 tgtcccttcc ttctcctccc aacaagagga gcctccttcc ttccctgcca gctcttacct 1561 gtgaccctgg atttctcagc atctgaggtc agcctatagc tcttgcggga gaaagacatg 1621 ccggccccct tggatgccat ccttcatctc tcatgggcag ccagtggtcc tgggggctga 1681 gccagacaca gagcctgtct actcagggcc agccaggggc ccccacacct gacctcagtc 1741 ctgccatggc aggaaggctt gggagcaggt gaggccaggg gcatgcacac taacacatag 1801 ggcaggggaa tgcaggggac tcccagcttc ctacacaggg agggtagcca gggtctaggc 1861 ccatgtaaca gggaccactt cttgttctcc agtgtccctg tgacccagca ctagggactg 1921 ggcagagagc aggtactaga tatatttttg ccgaaccata acaaaatgtc tcagctctgt 1981 ggaaatcaga ccaaaatgtc caaagtgagg tgtccttggg tagagcccct gtatccatct 2041 cagttgtaag ccatactgag tccagctcac aaccagaagc gacaatccta cacattcccc 2101 caagtgcaga ccaccactgg tctggtcatc tctagccata agtcccactt actccccaca 2161 aggctggagc ctctttcctc agccctgggt tcctcccttc ttttatccat ggtcctgata 2221 gtttcaccat tcccactcac acatgtggcc tcccacctca tccaagcacc acccttgccc 2281 agccctcatg gcaaactcca ttggtcctcc tagtgcatct atccctcaca tcatgttgct 2341 gctgcagctg ctccagaacc tgcactgagt ccctggtgct ggcctcactg atgtgtaggc 2401 gtgctgtcca acacagcagc cgtcatccac atgaggttac ttaaatttaa agttttaaaa 2461 attaacaatt cgatttctca gttgccttag ccacatttca agagttcaat gaccacatgt 2521 ggctaatggt tactctgttg gacagcacaa aggtagaacg ttcccaccat catataaagt 2581 tctattagac agcgcaggtc tagagtttcg ctctgcagca ctgccttagc cagcactccc 2641 tactctgcag ttcagtctct tatctcaccc caaacagacc ctctactccc gccacaaacc 2701 cttgtgggtc tgcaccacgc ctgctcctcg ggaaggtgat tttcccgacc tcgggccctc 2761 ggtgggcgtc catgatcggt gccccttcac taccagagta gttgaagatc tgattaaact 2821 cagtcattca tttgatgagc agaccttcca gacacccgga gtgggctccg ggagtcagtt 2881 gctgaggacc caggtgagag gcggaccttg tcccgcccgg gactccctgt cacggctatc 2941 ccgtccaacc ccgccggggt atccggactc agaactgaac ctactccgtc tgggagccca 3001 aggacggcga gggcgggcat ggctccccca agtcgccccc gcctggcccc aagctccaca 3061 ggacctgcca gaccagcttt ccgctcggag tttcgcaccc agatccctgc gctgcaggtg 3121 ccctcgcact ggactagccc ggaccgcgaa gggaatagcc ggaacccgcc agctcagagc 3181 cctgccgccc ctcactcacc cggagcctcg gccgcagcga cccggctcac aacatccgcc 3241 caacctcttg gctacggtgt ccgttcaggg ccaaatcacg cgcacgcttg cgcgctgagc 3301 ctccagccgc gcacgcgcac tggcccgcgc ccagcctccg ctaggggacc ccctccatgg 3361 cttcccaccg gttgttccag gcctcagctt cgccgaaagc ctcaccacct ccgacctccg 3421 cctgccctgg ggatgctccc agccctcgct gcggcagaac gcgacatgct aaccggaatc 3481 cctaggccgc ctgtctccta cccatactta gaggccccgc tcagacggtc cttaaaacgt 3541 ctgaaaggcc gttcctgcca gagtccctgc tacctgttac ctccacccct atttagtcct 3601 agtggacagc ctcgctcacc ttccctggga tgacacttct ggcggctgag atgagcgagc 3661 ctctctggct ctgccgccgg gtgtgggctg acctgcctac agctggggcc tgataaggca 3721 gcagcaaaag ggtggagggg aggcagtgtt gaagctgggg caagtaattt tccccaattt 3781 acagggaaaa accgaaattc agaaaagttt aatgtcaccc aggggctgga gcccagacct 3841 ctggcagctc tcactttcac aatgcccttg ggctgactag gctgcagagg ggtttcaccc 3901 caaccccagg gcacctcaag tgtccccacc aaaccttcct aacacctgtc cactaagctg 3961 tactaggccc ttgcaactga cctatgggac cctgaggcct ggcccctcat ggctcctgtc 4021 accaggtctc aggtcagggt ccagcaggcc ctgagctgac gtgtggagcc agagccaccc 4081 aatcccgtag acaggtttca caacttcccg gatggggctg tggtgggtca cagtgcagcc 4141 tccagccaga aggatggggt ggctcccact cctgctgctt ctgactcaat gcttaggggt 4201 ccctggtaag tgcccccaac cctgatcccc atctgccttc aggagggggt tggccccatt 4261 ctcctattct aggatgagaa aaaagtcggg agcagaggct cagtgggcat ggggcagtga 4321 ccttgccctc ttgagcacag ctgggaagcc ctaggaacac atagacattg cccacttagg 4381 cctctattag cacgtctgct ctagcactga agcagtgtca ggaccacaca gatgcacgca 4441 cacagcaggc agtgacccct cctgagcctg atctacccct ctaacctagc atatgccttt 4501 gtgcaggtga gagcccagat ttggagtctg aatgcctagc cagggccctt ggctgggtaa 4561 tgtgatggct ctgagcctta gcattctcat ttgagagatg aggtggggca agcttcatca 4621 cccactgctc tcacagagcg tatgtgttag atctgagccc ggtgcctggg ccactaaaca 4681 gaggcaccgg tgataactac caagtctggg cctgcttccc aggggaaatt tttttcacaa 4741 gtatctgtgc agggggctag actggccctt gaaagtgcat acagggtcca tcccagaagc 4801 ttgtagcttt gatcccctga atgaacaaag tgtggacatg ccaatacaca ttactgacat 4861 gtatgcccac ctgacctgca cccactcatg cctactctgc agggcagcgc tcgccattga 4921 atgacttcca agtgctccgg ggcacagagc tacagcacct gctacatgcg gtggtgcccg 4981 ggccttggca ggaggatgtg gcagatgctg aagagtgtgc tggtcgctgt gggcccttaa 5041 tggactgccg gtgagtggcc actgggctag ataagactgg gggcagggaa gcctgggctg 5101 tggcgttacc ctgtgccttc ttctctccag ggccttccac tacaacgtga gcagccatgg 5161 ttgccaactg ctgccatgga ctcaacactc gccccacacg aggctgcggc gttctgggcg 5221 ctgtgacctc ttccagaaga aaggcaagtg ggggtggaga ggggcagggt gggagacagg 5281 ggacctcagc ccaagttgat cttctgtctc ttgctcccag actacgtacg gacctgcatc 5341 atgaacaatg gggttgggta ccggggcacc atggccacga ccgtgggtgg cctgccctgc 5401 caggcttgga gccacaagtt cccgaatgat cacaagtgag acaaacacct tccctccgtc 5461 ccggcctggg gcttccccca gcacacacta tagtgatgct ctgggccctc aggtacacgc 5521 ccactctccg gaatggcctg gaagagaact tctgccgtaa ccctgatggc gaccccggag 5581 gtccttggtg ctacacaaca gaccctgctg tgcgcttcca gagctgcggc atcaaatcct 5641 gccgggaggg taagcggcgc cgggtcaagc tgggagagtg gagacaagcc cacgtccatc 5701 cacgaaccca ctggctcttt gtctccagcc gcgtgtgtct ggtgcaatgg cgaggaatac 5761 cgcggcgcgg tagaccgcac ggagtcaggg cgcgagtgcc agcgctggga tcttcagcac 5821 ccgcaccagc accccttcga gccgggcaag tacgcgtagg cggtatcggc gtcctggggg 5881 ccgggctagg gaaggtccag gactccaggg gcagggctcc gtgtagggca attgggcggg 5941 gccagataag ccagagtccc agggtcttgt tcacgcccca ttaccgcccc caggttcctc 6001 gaccaaggtc tggacgacaa ctattgccgg aatcctgacg gctccgagcg gccatggtgc 6061 tacactacgg atccgcagat cgagcgagag ttctgtgacc tcccccgctg cggtaggcgg 6121 cggggaccag gcctgggagg gtacctggga accttgggga ggggcgtggc ttggccgggg 6181 aggtaagagg ggctgggcgt gacctgagag cataccccgt ggagtaccgt acacctggga 6241 aaggcgggtt tggtcccagc cccagaggga tctcagctct cgctcggggc ccgacctatc 6301 tcggtccatc taagggtccg aggcacagcc ccgccaagag gccacaactg tcagctgctt 6361 ccgcgggaag ggtgagggct accggggcac agccaatacc accactgcgg gcgtaccttg 6421 ccagcgttgg gacgcgcaaa tccctcatca gcaccgattt acgccagaaa aatacgcgtg 6481 caagtgaggt gggggggggg ggcgggcgtt gggacgtgct gctgcgggtg agacgggagg 6541 aaggtagtca cgggctcaag gctggaggct ggcgggctag ggctgagtgg agcgcctgct 6601 tagagacctt cgggagaact tctgccggaa ccccgacggc tcagaggcgc cctggtgctt 6661 cacactgcgg cccggcatgc gcgcggcctt ttgctaccag atccggcgtt gtacagacga 6721 cgtgcggccc cagggtgagg cccaagcttg ggggctacag agccgggctg gaagctggaa 6781 ccggaggccg gggcgaggtc tcggcctgat ggctgcccgc acccgccaca gactgctacc 6841 acggcgcagg ggagcagtac cgcggcacgg tcagcaagac ccgcaagggt gtccagtgcc 6901 agcgctggtc cgctgagacg ccgcacaagc cgcagtgagt ccctggtgct cccggccccg 6961 ccagggccct aaccctgggg cggcatgctt tggtgtctgg gaccagagcc tggaaatggt 7021 tgagactacc ctgccacgat tttgctcccg cttccgccta ggttcacgtt tacctccgaa 7081 ccgcatgcac aactggagga gaacttctgc cggaacccag atggggatag ccatgggccc 7141 tggtgctaca cgatggaccc aaggacccca ttcgactact gtgccctgcg acgctgcggt 7201 gagcactagt gacgcttccc ccatgaccct gcctcagccc ccacccaaag gctggctccc 7261 ttaaccccag tgaactttgt ctttcagctg atgaccagcc gccatcaatc ctggaccccc 7321 caggttagga gttgggccag ttatgggtca ggccctttag cccacgacat ccacacagtc 7381 tgggtttcat ccagcccacc ccatcctaca gaccaggtgc agtttgagaa gtgtggcaag 7441 agggtggatc ggctggatca gcggcgttcc aagctgcgcg tggttggggg ccatccgggc 7501 aactcaccct ggacagtcag cttgcggaat cggtgaggca caactgcctg tctcccacag 7561 agaggagctg aggttgtgtc ctctgtggtt atccactggg gctgggaatc tatccgtgcc 7621 ccttgagagg tcctagccaa gaagatggca ggtcttacga atctgtccca ggagtctgtt 7681 acctgtccta attccccact cctctaggca gggccagcat ttctgcgggg ggtctctagt 7741 gaaggagcag tggatactga ctgcccggca gtgcttctcc tcctggtgag cctcccttgt 7801 gtttggggac ccagtctcat cccaccttcc cccttcccca ggcaagctaa caagtgagcc 7861 ttggggcaat ggactgagag tcacaaatga cctagcagag cttctctccc agccatatgc 7921 ctctcacggg ctatgaggta tggttgggca ccctgttcca gaacccacag catggagagc 7981 caagcctaca gcgggtccca gtagccaaga tggtgtgtgg gccctcaggc tcccagcttg 8041 tcctgctcaa gctggagagg tatgtggaca acctgggagg gtgtgaggtg gggctgggcc 8101 ttgtggcctc agaccctgag tgcccccatt cttgctaaag atctgtgacc ctgaaccagc 8161 gtgtggccct gatctgcctg ccccctgaat ggtatgtggt gcctccaggg accaagtgtg 8221 agattgcagg ctggggtgag accaaaggta agagcacagt gcacaggact gctggtggcc 8281 aggaggccag ccctggatct tcctgcagga ccctctccct ctccccattc ccctcactgc 8341 aggtacgggt aatgacacag tcctaaatgt ggccttgctg aatgtcatct ccaaccagga 8401 gtgtaacatc aagcaccgag gacgtgtgcg ggagagtgag atgtgcactg agggactgtt 8461 ggcccctgtg ggggcctgtg aggttggtgg cagggcctgg gcagccctgg aagtatgggg 8521 ggctagaaat gaactatttt atcatgaagc aggctagtca ttgctgtggc ccggggcctc 8581 atcagttctc ctacctgcca gggtgactac gggggcccac ttgcctgctt tacccacaac 8641 tgctgggtcc tggaaggaat tataatcccc aaccgagtat gcgcaaggtc ccgctggcca 8701 gctgtcttca cgcgtgtctc tgtgtttgtg gactggattc acaaggtcat gagactgggt 8761 taggcccagc cttgatgcca tatgccttgg ggaggacaaa acttcttgtc agacataaag 8821 ccatgtttcc tctttatgcc tgtacagatg cttcttagcc tttgcttcca ggaaatgtgt 8881 cagtgactcc ttgctagggc tcgggtggct tgagcccagc acaccctggg ctaggtgatc 8941 tgtccagcct aggggcttcc ccaaccaagg caatgtccct gggactactt ttgcccatgg 9001 gtgccgtgga aagacagggc ctcacactag tcctccagac atactcttgg gaagggtggt 9061 acagagtagt tgctaatgga aggggctgca gcagggaagc taggctggta cagagtcctg 9121 gttgccagga caggcagagg ctaagcctct cactgttccc tcccttctca cactggaggc 9181 agatgaagcc cttgtggctg ccacacccag aacctagggt ctctgcaccc cagagtggga 9241 ggtggggttg gggatggttt ggtacaaagt accagcagga accaggctct gtgtcctaat 9301 ttattatgac tacatagccc acattcctct gcccatgcat ccgtggagtc cagagcccag 9361 aaagcctcct gctgccctgc cagaccgttg agctcctcaa gaggaagtgt ggcacaggct 9421 gatcagctca tgcagaatgg cagggcttca gctgcccaag tgtgtgcgta gccagagcac 9481 agcattcatg aagctgtctg actccacctc cacctctgat aatgcgtggg tgcttttggg 9541 atagagcagg agcctgtagg gattagtcag caacatttaa ggttggaggg tcctcctgtg 9601 ctcacctgcc caccagctgc cagggccttc atgctgcact caccgaacag gcacattccg 9661 ggtcttgagg gcacggtaat actccatgcc ctgcttgaag ggcacacgcc ggtcctcctg 9721 gcccaacatc agtaacagtg gtgtcttcac ctgggtgttt ggggaagagt ggggagctgt 9781 gttgagctgg gccctggatt ctggatggat gggcagcaca cagggcaagc agggggctgc 9841 atacctgagg gatgtatctg atgggcgatt tgtccagcat ctcagcccac acgctgaggt 9901 ctggcaggca gtcactgctg aaaggaaagc cagcctccac cacgcacctg caagacaccg 9961 agctgttgca gccccaggaa // LOCUS HSH11 1034 bp DNA PRI 19-APR-1993 DEFINITION H.sapiens H1.1 gene for histone H1. ACCESSION X57130 NID g31966 KEYWORDS H1.1 gene; histone H1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1034) AUTHORS Kardalinou,E. TITLE Direct Submission JOURNAL Submitted (19-DEC-1990) Kardalinou E., Zentrum Biochemie, Abteilung Molecularbiologie, Humboldallee 23, D-3400 Goettinger, FRG REFERENCE 2 (bases 1 to 1034) AUTHORS Eick,S., Nicolai,M., Mumberg,D. and Doenecke,D. TITLE Human H1 histones: conserved and varied sequence elements in two H1 subtype genes JOURNAL Eur. J. Cell Biol. 49 (1), 110-115 (1989) MEDLINE 89338424 FEATURES Location/Qualifiers source 1..1034 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="genomic DNA in EMBL3" CAAT_signal 59..63 TATA_signal 82..86 gene 168..815 /gene="H1.1" CDS 168..815 /gene="H1.1" /codon_start=1 /product="histone H1" /db_xref="PID:g296288" /translation="MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVS ELIVQAASSSKERGGVSLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGT GASGSFKLNKKASSVETKPGASKVATKTKATGASKKLKKATGASKKSVKTPKKAKKPA ATRKSSKNPKKPKTVKPKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK" terminator 849..864 /note="histone mRNA terminator" BASE COUNT 317 a 268 c 256 g 193 t ORIGIN 1 cagctgcgcg aaacacatcg gcagcgcgca gggcgcggag gggcgggact gacgggcacc 61 aatcacggcg cagtcccacc ctataaatag gctgcgttgg ggcctttttt tcgcatcctg 121 cttcgtcagg tttataccac tttatttggt gtgctgtgtt agtcaccatg tctgaaacag 181 tgcctcccgc ccccgccgct tctgctgctc ctgagaaacc tttagctggc aagaaggcaa 241 agaaacctgc taaggctgca gcagcctcca agaaaaaacc cgctggccct tccgtgtcag 301 agctgatcgt gcaggctgct tcctcctcta aggagcgtgg tggtgtgtcg ttggcagctc 361 ttaaaaaggc gctggcggcc gcaggctacg acgtggagaa gaacaacagc cgcattaagc 421 tgggcattaa gagcctagta agcaagggaa cgttggtgca gacaaagggt accggagcct 481 cgggttcctt caagctcaac aagaaggcgt cctccgtgga aaccaagccc ggcgcctcaa 541 aggtggctac aaaaactaag gcaacgggtg catctaaaaa gctcaaaaag gccacggggg 601 ctagcaaaaa gagcgtcaag actccgaaaa aggctaaaaa gcctgcggca acaaggaaat 661 cctccaagaa tccaaaaaaa cccaaaactg taaagcccaa gaaagtagct aaaagccctg 721 ctaaagctaa ggctgtaaaa cccaaggcgg ccaaggctag ggtgacgaag ccaaagactg 781 ccaaacccaa gaaagcggca cccaagaaaa agtaaattca gttagaagtt tcttctagta 841 acccaaacgg ctcttttaag agccacctac gcatttcagg aaaagagctg tagtacacag 901 atgaaatccc ccaagcaaat gcaacacgcc ctcaattata ttagaatcac ttggagagtc 961 gatagaactt taacatagcc tcatctagta agaatttact actcaatcta tcaaagatag 1021 caaggattga attc // LOCUS HSU66875 1569 bp DNA PRI 31-MAY-1997 DEFINITION Homo sapiens cytochrome oxidase subunit VIa heart isoform precursor (COX6AH) gene, complete cds. ACCESSION U66875 NID g2138177 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1569) AUTHORS Bachman,N.J., Riggs,P.K., Siddiqui,N., Makris,G.J., Womack,J.E. and Lomax,M.I. TITLE Structure of the human gene (COX6A2) for the heart/muscle isoform of cytochrome c oxidase subunit VIa and its chromosomal location in humans, mice, and cattle JOURNAL Genomics 42 (1), 146-151 (1997) MEDLINE 97321054 REFERENCE 2 (bases 1 to 1569) AUTHORS Lomax,M.I. TITLE Direct Submission JOURNAL Submitted (14-AUG-1996) Lomax M.I., The University of Michigan, Anatomy and Cell Biology, 1335 East Catherine, Ann Arbor, MI 48109-0616, USA FEATURES Location/Qualifiers source 1..1569 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p" repeat_unit 296..414 /rpt_type=dispersed /rpt_family="Alu" 5'UTR 727..801 /gene="COX6AH" mRNA join(727..874,976..1112,1281..1392) /gene="COX6AH" exon 727..874 /gene="COX6AH" /number=1 gene 727..1392 /gene="COX6AH" transit_peptide 802..837 /gene="COX6AH" CDS join(802..874,976..1112,1281..1364) /gene="COX6AH" /EC_number="1.9.3.1" /codon_start=1 /product="cytochrome oxidase subunit VIa heart isoform precursor" /db_xref="PID:g2138178" /translation="MALPLRPLTRGLASAAKGGHGGAGARTWRLLTFVLALPSVALCT FNSYLHSGHRPRPEFRPYQHLRIRTKPYPWGDGNHTLFHNSHVNPLPTGYEHP" mat_peptide join(838..874,976..1112,1281..1361) /gene="COX6AH" /EC_number="1.9.3.1" /product="cytochrome oxidase subunit VIa heart isoform" intron 875..975 /gene="COX6AH" /number=1 exon 976..1112 /gene="COX6AH" /number=2 intron 1113..1280 /gene="COX6AH" /number=2 exon 1281..1392 /gene="COX6AH" /number=3 3'UTR 1365..1392 /gene="COX6AH" BASE COUNT 345 a 472 c 450 g 302 t ORIGIN 1 aagcttttgg catccagggt ccagcagtgt gagctgtggg ggtgtcactg cagctggcca 61 acggaatgac ttgtttatga ctgtgcagac tgtaggcatc tctggctata tcagacatag 121 taagctacct aagattctca attacttatc ttctaaaact agtaagagtg agttctgttc 181 tttgcaaaga actgtgacta atacatatac tgaactactg ttacctgcca aaatccaggt 241 tgacacattc ctaccttaca taaacttcta taaaaatgat gtaagggccg tgtagtggct 301 catgcctgta atcccagcac tgtgggaggc tgaggcggga ggatcacgtg agctcaagag 361 ttccacacca gcctggacaa tatagtgaga tccccgtctc cagtatttaa aaaatctttt 421 aaggtacgaa ttattataca ttttacagct gaaaacactg agactcaaaa agacacttaa 481 ttcattctac aaatatttat taaagcccaa ctgtgtgcca gccaccaggc atagaatagg 541 gaacaaaata gtttccctgc aactatataa aaagaatggc atattgtact taaatgcctc 601 cttgccaaaa taagaggcac aggacaacag ctgtctggag atgacctcaa gccagtcagg 661 ccccatttta aatatagaaa cccctaagaa tagccgccag tgctccagac tcaacaggtg 721 attggcccag agaggggagg tgaccccagg ccccaggaaa gggagcgagg acagcgctgg 781 ttcccggctc cccgcaccat catggctttg cctctgaggc ccctgacccg gggcttggcc 841 agcgctgcca aaggaggcca cggaggagca ggaggtgagt ggggaacggg cggatccggg 901 ggctccccta ccctgcccac ctgttcacag gcccgccgcc ccaggaccgc cgcgctcacc 961 ccgctccgtc cgcagctcgt acctggcgtc tgctgacctt cgtgctggcg ctgcccagcg 1021 tggccctctg caccttcaac tcctatctcc actcgggcca ccgcccgcgc cccgagttcc 1081 gtccctacca acacctccgc atccgcacca aggtacgcgg gacgggcgcg cgggcggcac 1141 gggggtgctg cgggggcggg ggggggtgct gcgggggcgg gggggggtgc tgcggggggg 1201 gggtgcgcgg ggcgcgcggg gcggggcgcg gactccggac tcacggctca cgcgagctcc 1261 ctcccccgct ctctccacag ccctacccct ggggggacgg caaccacact ctgttccaca 1321 atagccacgt gaaccctctg cccacgggct acgaacaccc ctgaggcccc ggacgccccc 1381 ggacacaata aaggtgtgaa gcttcgagtc tgcggctctg tggggagggc ggggccacgg 1441 ggagcgcgcc cagaggcgcc cgctccgcgc atgcgcccag cttggtaagc gctcctgctc 1501 cgcgcatgcg cctagcgcgg tgggcgctcc acgtgctttt cccacgccgc tgatacggag 1561 gtgctcgag // LOCUS HSINT2 11608 bp DNA PRI 25-JUN-1997 DEFINITION Human int-2 proto-oncogene. ACCESSION X14445 NID g33937 KEYWORDS growth factor; int-2 gene; proto-oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 11608) AUTHORS Dickson,C. TITLE Direct Submission JOURNAL Submitted (27-FEB-1989) Dickson C., Imperial Cancer Research Fund, P O Box 123, Lincolns Inn Fields, London, WC2A 3PX, ENGLAND REFERENCE 2 (bases 1 to 11608) AUTHORS Brookes,S., Smith,R., Casey,G., Dickson,C. and Peters,G. TITLE Sequence organization of the human int-2 gene and its expression in teratocarcinoma cells JOURNAL Oncogene 4 (4), 429-436 (1989) MEDLINE 89239468 COMMENT tissue=placenta; library=cosmid; clone=C1; Data kindly reviewed (04-Jul-1989) by Smith R. FEATURES Location/Qualifiers source 1..11608 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="q13" misc_feature 445 /note="pot. transcription start site" misc_feature 479 /note="pot. alt. transcription start site" misc_feature 507 /note="pot. alt. transcription start site" misc_feature 527 /note="pot. alt. transcription start site" CDS join(936..1155,3445..3548,9197..9592) /codon_start=1 /product="int-2 preprotein" /db_xref="PID:g312409" /db_xref="SWISS-PROT:P11487" /translation="MGLIWLLLLSLLEPGWPAAGPGARLRRDAGGRGGVYEHLGGAPR RRKLYCATKYHLQLHPSGRVNGSLENSAYSILEITAVEVGIVAIRGLFSGRYLAMNKR GRLYASEHYSAECEFVERIHELGYNTYASRLYRTVSSTPGARRQPSAERLWYVSVNGK GRPRRGFKTRRTQKSSLFLPRVLDHRDHEMVRQLQSGLPRPPGKGVQPRRRRQKQSPD NLEPSHVQASRLGSQLEASAH" sig_peptide 936..986 exon <936..1155 /number=1 mat_peptide join(987..1155,3445..3548,9197..9589) /product="int-2 protein" intron 1152..3444 /number=1 exon 3445..3548 /number=2 intron 3549..9196 /number=2 exon 9197..9929 /number=3 polyA_signal 9911..9916 /note="pot. polyA signal" polyA_site 9929 /note="polyA site" BASE COUNT 2166 a 3822 c 3181 g 2439 t ORIGIN 1 ggatcctccg aggcctgcca gacgagagct aaccccacca ctccgggtgc ccagttggtg 61 tgcgcaaaag gagatagcat ctccagaccc ggcctgcccc gcgccttgca aggctgagga 121 cagcgccact gctcctgcag gaagcgccgg gcgcagacac aaacccggag ctccccacgc 181 gtgcccgcgc cccggagcct ccccgccgcc gccctccgcg gtccccttca ttatgcgccg 241 cctttaatgg gcgtttgtca gctcgacttc cccgcaagtt gttttcacgg acatcagtca 301 tcggcggcgg ccccattgtg cagggggatt gatggggagc gggagggggt gacagggcct 361 ggggcggcct cctcaaggcc tcggtctata attttcggag gcataattgg tctgggggag 421 ggggcggggg aggggcgggt aggggacctt tcagagccag gagggctttc gggggcgtgg 481 ggcgcgctgc ggagcggagc cgcggctcga cggcggtgcg ctggcggcga gtgtatgcag 541 acggcgcccg gcccgaaccc cgagccccgc ggggctcccc acccgccggc ctcccgcccc 601 tcccgcgcct ccgcctgggg accacgtcgg ccttttgttg gcgaaccgtc ctttctttca 661 gcgctttgcg cagcaacgga aatttcattg ctcctgggtg gaaattaaag ggactcgcgt 721 tccctctctc cctctccctc tcccactctc cctctctttc tctctctcgc ccacccttcc 781 cccttcttcc cccacctttc ccgcgaagcc ggagtcagca tctccaggcg cgggatcccg 841 ctccgagcac ctcgcagctg tccggctgcc gccccttcca tgggcgccgc gctcgcctgc 901 agccgccgcc gccgcggggc gggcgcgatg ccacgatggg cctaatctgg ctgctactgc 961 tcagcctgct ggagcccggc tggcccgcag cgggccctgg ggcgcggttg cggcgcgatg 1021 cgggcggccg tggcggcgtc tacgagcacc ttggcggggc gccccggcgc cgcaagctct 1081 actgcgccac gaagtaccac ctccagctgc acccgagcgg ccgcgtcaac ggcagcctgg 1141 agaacagcgc ctacagtgag tgccggacgc tgcggggccc cgggggaagc ggcgccggga 1201 ggggtcgggc ccgggagaag gcgcgctgcg gggccccgcg ggggaggcgg cgccggggag 1261 gggtctggcc cgggagaagg catggtgcgc ccggggtgtc cgggaaaaga ccgtctgcct 1321 cccgctgcca gaaggggaat gccagggtgc cctcctcaac ctacacgtcc gggagaagag 1381 cgcgcgctgg ggttcgagca gaactcgggg gcatgggcgt tcacgtccga agagggcgcc 1441 ggcggctgtc agagtccgtc cccggtccgg cccgctggtc tgaggggacg cgagctaggc 1501 gaccccgggg ctccaggctc tgctgctttg ggcagctctg acaagtttaa gccctttgaa 1561 tgtgtcggag aaaaggagcc ggacaaggca tttatcatct tctttattat tttgacgact 1621 tctcctcctc ctgccgaccc ctggacgccc cgccgtcccc tccgccctcc ccgctcgcgg 1681 ctgcccaggt ggccgccccc gctgctgcct ctgcgcggag gactcttgcc ctgcggagct 1741 cggttccctg gccgcggccg ccaccgacag ttttccccgc gctggaatct gcacctcccc 1801 cgcctccgcc ctggccgcac agcgacagga gggtggaggc cctcgccgtc gggacgtgcg 1861 gatcacgagc gcggaacggt gtccgcccgg gcctgggggt gcagacacac acactccggc 1921 ccccgcacgc aggacccgcg gcccgggctg cgctcaccgc gggagtctgc cggactacac 1981 ggttgggctc ctctggtcac agggacctga ggcccgcgcc gagccctttg aggagcagga 2041 tagcggagct cagggcccgg gaggaccgcg tggggagcgt gggagggcag tcaagaccac 2101 agctgtcctc gggtgctccg cgcggcgctc ggctccggcc ccgggcgcag gaagcggttc 2161 cgccgcttga aggtggcggc ggcggcctca gcaaaccgcg gcttcctcca ggaaagccgc 2221 agccctgaga gggcgtcctg gggacatgcg cctccggagc cgcacggtgg gcaccagctg 2281 tcaccagggg gtccgagtgc gcggaattcg tctcactaag acactccggt tctctccaaa 2341 gccaggctcc ccctcggagt ctcacagcat ccaaacttct tggtgttggc tgctcacggg 2401 gaggggaggg cgcgcgcccg cagccgcccc tgtcctgcgt cgagactcgt gcttcgctgg 2461 tccccggtca ggcaccgccg atgccgccca gcctgcgcac tgggaaggcg ggaggctcgc 2521 agcctgcacc acagcacccc tgggctggag caaaagcccg gtggtgaccg cgtctgtgct 2581 cggaccgcgt gccaggaggg cgctcctgag aggcatgcct tggagagggt gcaggcagtc 2641 gccccaatcc cctcgcagcc tctattggga gacaatgacc ccaacccgtc ctttgatgta 2701 gtcccccgcc cccagccccc accacagcac tcgtgtatcc agaaggaaag gcgggagggg 2761 agatttaaac tttcttatcc ctggggagtg ggtgagaccc gtccagcttc cctctgcggg 2821 ctccagggtc taaactggct tcctgccctt ccaggtcccc cggcgagtac aatgatctcc 2881 cgccaggtct attgccagca tgatttgggt ggcaggcacc ctggctgtgt tattgccagt 2941 ggtttatatt atcagcctgg ctcatcaggg cccagctttc ccgctggcag gcacaggcat 3001 tggggaccgt gcagtcccag agctcagact cccatggctg ccgccagggc ttgatttccc 3061 gctcaagcac atgtgcaggg gcctgacagt cacccagcca tagtggtccc cacttgcttg 3121 tctgtcggat gatgatggca gctggcaagg tgttgtgagg ctcgaatgaa agccagggtg 3181 caggcagcgt cagcaggtgg gaatgttaat ttcattgtta ccctcccaag tttgaatgaa 3241 gagaagcctt ccctgatgtt cccttctcta cctcctgtcc ggtgacttct cccttcctga 3301 acttccactc actccccgac ctgtggggtg tgtgctaccc tccccagccc tgagcccggg 3361 gtgtggcttt gggcagggcc agctgggcct cagccgcccc ctccgtgggg gcggcgggag 3421 tgaggcacct ctcatttctt ctaggtattt tggagataac ggcagtggag gtgggcattg 3481 tggccatcag gggtctcttc tccgggcggt acctggccat gaacaagagg ggacgactct 3541 atgcttcggt gagtccaggc tgtcacgtgg gtgggcgctg acggagtagc ggtctggcct 3601 gcacatcaag ccaggggacg ggggatgtgg gcagtagaat gctttgccaa gggtacatgg 3661 gatccaagct aggaccagac cctggcccag ggctccacgc ccagtgatct ttggctgtgt 3721 ctgttagagg ctgctccgcg agtgacctgg agcctggccc tgggggacct gggatctgga 3781 gaatgccagg ttagcaaagg tttcccccat ccttgtcatc actaccaccc tcccttgaga 3841 gtagctgaga gcagggaagt gattgttctc caaatttcca ccaaaataaa tggacccaag 3901 acaacctggg agtgttcctt ggggcagcca gagctctcct gggagcatgg agaggagggc 3961 gctggcccag ctccaccggc tctctccaag ggtggaaagg cgagagctga agcctaggag 4021 gggaaggggg atggcgccca tgtcccgggc acacaggagc cttggagccc tggcttggga 4081 cccagtctat ctactgcctg ggcctccgtg taaccaaaaa gggtctcctt attctgtggg 4141 tatggcagcg cccctcttca taaggggtag gggtggggaa tacacagagt aaaatacatg 4201 tcacagggaa atttgctacc tccaactagt cattacatgc aatttggctg atacttcctt 4261 gggcaatgag aggttttcca tccatgagta gcctggctga cgcggcccaa ggacaatctc 4321 cctgcagtga gctctctgct cagtcctgct cacagaggac acatcccgca gctccctctc 4381 gcagaagctg atgatttcat cacagatttt tagccgtttt gctaaaggaa ggtccagaaa 4441 gccgggatgc gccccttcat tttctctggt ccagaggcta ctccctcctt cctcccatcc 4501 actcacccat ccacccatcc acccattcac ccatccactc atccacccat ccacccatcc 4561 atccatccac tcatccaccc atccacccat ccatccatcc acctatccat ccatccatcc 4621 acccatccac ccatccttcc atttacccat ccaccatcca cctatccatc catccatcca 4681 cccatccacc catccaccca tttacccatc cacccatcca cccatccacc tatccatcca 4741 tccatccatc cacctatcca cccatccact catccactca cccatccacc tatccaccca 4801 cccactcacc catccatcca cccacccact cacccatcca tccatacacc tatccatctg 4861 acattcagtc cattcatcag tcagtgtcct ggaagctgtg tctgggagca ggggccctca 4921 tggtgccagc ttctgccata ggggagccag gatttggaga gacagaataa agcatgacat 4981 gagggggtcc cagtggttca gagcccacct ggggcttccg tgcctgagaa ctcacagcct 5041 ggctttgaga agggtggaca gaggctccta gccccaccag ggatgcttag ccaagcagtc 5101 tggctggagg gtgtggaact ttccagagtg cccagtggag aagttcccat tgccctgcag 5161 acaacctggg aggcatagta gcctggatgc caactccgtc acctgccacc tgccacctga 5221 taccacagtc ctgcgtgtgg ccagcttgcc taggagtgct gtcccaccac ggggctccct 5281 ggactttgcc ggcagcatgc ttgccctcag gccagctcat ctcactgtgg gccagtgagc 5341 actgcaccct gacactcctg gtctggccct gacctccctt ctgaggtata gaccaccctc 5401 ccccaacatc ctgctgactc agacactcca accgaactca gcagctcccc gactccccat 5461 cttaatcatt gccccaccag ccactcactc tcacccagag tggactggga gtcatggtag 5521 acccctcctg ctattcagtc catccctagt cctcagacca cgcccctgca tctcttcagc 5581 ccacacacct cacctccctc tgcctcatcc ccgagagcac gccttaaaag gcaggctcat 5641 cacttcccac ctggatcact gctccagcct ctcctacctc cagtccatct gtttgtcaag 5701 ccatccatcc atcttccttc cttctttccc cttcctttat tctcttcacc catctgtcca 5761 tacacctctc cttccttcct tctttccttt ctctcatcct tctttccatc catccttcca 5821 accgttcatc tgtccatctg cccattcaca tgtccatccg tccatgctcc ctccttcctt 5881 ccatcctttc acccatccac ccatccaccc atccacccat ccaccaatcc atccagcaat 5941 ccacccatcc atccagccat ccacccatcc atccgtccac ctatccatcc atccacccat 6001 ccatccaacc agccatccac ccatccatcc atccatccat ccatccatcc atccacccac 6061 ccacccatcc acccatccat ccatccagcc aaccagccat ccacccatcc acccatccac 6121 ccatccatcc atccatccat ccatccatcc atccatccat ccacccatcc acccatccat 6181 ccatccagcc atccacccat ccatccatcc acctatccat ccatccacct atccatccat 6241 ccacccatcc atccatccag ccatccaccc atccatccat ccacccatcc acccatccat 6301 ccatccatcc atcaatccat ccatccatcc acccatccat ccatccatcc acccacccat 6361 ctgacattca gtccttcgtt caccagtcag tgtcctgggc acttaccatg tgttgcactc 6421 tgctagtgca tgttaaaggg gcttcctaaa aaagcaaatc agcccagttc atggcttctg 6481 aggactccca tgatgcgcag gcatccccca ggcaccttag ctccacccac agagcagcag 6541 cccaggtgcc atctgctccc aggggggccg tcatccagag gttgagcctc ctccagcttc 6601 cggaagcccc ctcaactctc caagccatct gtactgctca ggccacacct tcactgcctg 6661 gaacgccctc ctccagaaaa tgcccatgag aaagtgcccg ccccgccctt acagctccct 6721 ttagaagtca cctcctcaac aaagcagctt cggagtcggc ttcctcccca cccttcatgg 6781 ggtggaagcg gccctggggg aggggcctgt ggaaccaggt ctggtggggc ggcagtgtca 6841 ggacacacat tggaaagtgt tgtcacagcg agtcccaact gcagtagctc tggagtctat 6901 ggccccggcc cctgagccaa caccttctgc cctggtcatc ttggccaccc caggtgccct 6961 accacgtgtc agcaattgat ctacactgcc cccaatctcc cacctcggga gagcctgagc 7021 cccctgccac ctgaggctca caggcacctg tccttcagtg acccacccct caagtggccc 7081 cccaaagaga aagcctattt cctggcctct ctggccctga gaggactcac ctgctgcagg 7141 atcgagggac ctgggagaag cttgggtcct gccctgagct tctcacagtc tgccatggga 7201 gccagacact aacatgttgt acccagtgtg tggggtgggg aagtcggccc tgagcccacc 7261 cgctcattcc cagatagtcg ctgagcccca ggctatgtgg gggtcaggca cagatcagac 7321 actgttctat cccctgtaat gtagggcctg gcaccacccc tgggcctcag tttccccatc 7381 tataaacatg gagaccggtc ttggatagta acatccagac tgtggcccag ggggcactgg 7441 gcctccaaga gggctggaga gcaggggggc ttggggttat ggggagaggc agcccctacc 7501 accaggtcaa gccttccacc cttccctgct ctgggctttc atgtcgctga aaaacaggat 7561 ggggtgcttg aagcatcttg ttccctgagg tcctgcgagg tcagatgctg ctcggtccag 7621 tctgggagcc tctggaggcc atttccactc tccctctctc ccacgggaca ccacacccca 7681 gatggggaca gaccatgagg gcctccgact cctcctgcgg ccagtcccca ggaggaaaga 7741 gagatcctac tgtctgcctt gcacctgctg cttccacctc ccaccaccct ttctggtttg 7801 gagggagcag ctcctgtcag tcatcccctg agggaggtgg ccctaggcat taccacttgc 7861 ctgtagctgg gactgaagcc tagtgggtct gcataaggca tacccacctc ttccctacgc 7921 cacagactaa gaagacccca tgaggcagcc cttgagggaa actcccacgg ccaggccttg 7981 gggatgttgg tgtaagctcc ttcagttcag agcttcactg tgcctttgag agggcagggt 8041 cccttctgcc cttcccccac tgcaccctgg gcctgacaag ggtcccatta atctctgatg 8101 acaaagaggt gtcctctctg tcctgcttgt ggtgggacac cccagcttct gctctcatcc 8161 taagaacagc agtatctgtg gcactgattg atcacgtgcc tcatgccagg ccccagcccc 8221 tgctctgtgt tgtaacctct tcaacctgca aggcgagatc ctccagctta agagagtgtt 8281 gctcaggggt caatgacccc agctggggcc cctgagcgca gcgtttaatg gagagtctgg 8341 gctcatcaaa cctggctgtg tccccttccc ttcctgcttt ttgtgttctt tcttcttttt 8401 acttttctgc aatttctttt tttttctttt cttttttttt ttttttgagt cagagttttg 8461 ctcttgtcgc ccagactgga gtgcagtggc acaatctctg ctcactgcaa cctctgcctc 8521 ccgggttcaa gtgattctcc tgcctaagcc tcccaagtag ctgggattgt gggcgcacgc 8581 caccacaccc ggctaatttt tttgtatttt tagtagagac ggggttttgc catgttgccc 8641 aggctggtct cgaactcctg acctcaggtg attcacctgc ctcggcctcc caaactgctg 8701 ggattacagg catgagccac cgtgcccagc ctctgcaatt tctttaaaag agatctgggt 8761 gttgtcatct tcctttgtcc aaatgcccag tccttgctga cctccacact gccagcagac 8821 tgccagggca cctgggttgg ccggcctggt ctctgctcca cagacaaccc tacacatccc 8881 tttgctgtgt cagcgccctc agatggggga cagaggctgg tggaacccag agagtaggag 8941 ctgggagttg tctggcacag ctttgagtga aggtgacctt taggcagaag gccaagttca 9001 ccaggcacac aggggaggaa ggacatgctg ggcagagggc agggcctagg caaaggtgtg 9061 actggctgag agtgcgctgg ggtttggcac tggaccgaac agcctcacag gaggggaggg 9121 aggcatcagg caaccctggg ccctgacgct gccgcagtct ccccggggca ctgaccatga 9181 tatctcatcc ccgcaggagc actacagcgc cgagtgcgag tttgtggagc ggatccacga 9241 gctgggctat aatacgtatg cctcccggct gtaccggacg gtgtctagta cgcctggggc 9301 ccgccggcag cccagcgccg agagactgtg gtacgtgtct gtgaacggca agggccggcc 9361 ccgcaggggc ttcaagaccc gccgcacaca gaagtcctcc ctgttcctgc cccgcgtgct 9421 ggaccacagg gaccacgaga tggtgcggca gctacagagt gggctgccca gaccccctgg 9481 taagggggtc cagccccgac ggcggcggca gaagcagagc ccggataacc tggagccctc 9541 tcacgttcag gcttcgagac tgggctccca gctggaggcc agtgcgcact agctgggcct 9601 ggtggccacc gccagagctc ctggcgacat cttggcgtgg cagcctcttg actctgactc 9661 tcctccttga gcccttgccc ctgcgtcccg cgtctgggtt ctcagctatt tccagagcca 9721 gctcaaatca gggtccagtg ggaactgaag agggcccaag tcggagctcg gagggggctg 9781 cctgcaatgc agggcatttg tgggtctgtg tggcaggaag ccggcaggga agggcctgag 9841 tgccagccct ggcagactga ggagcctccc aggagcagcg gggcagtgtg gggctttgtg 9901 tcatcacaac attaaagtat tttattctac tctgtcgttt ggtagaccgt gatgcaggct 9961 gaggagcgct tgccgccttt actggaacgt gcttgcttcc agcacagcag aatccgcgct 10021 ggcatcagcc tgcgtcagct gctgctttaa gaggaagacg gcattcccag aaatcgggct 10081 aaaggtgcat ttcagttccc tggttttaga aagttacgtt tttttggatg gttggaaaca 10141 agcaaaggca tgtttgtgca tgtgtgtgca tcgtgtgtgt gtgcgtggag agaaggatcg 10201 tgtatttctg aaagcgtgag tgtgcatgtg ggtatgtgtg atcttgtgtt agtggtacct 10261 gtgtgaggac atgtatgtgt gtgtgtgttt ctgggtgtgt ctgaatgtgt gatgtgtgtg 10321 tgtctgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtggaga gagagagaga gagtttactt 10381 tctttgaaaa ctctaaaaag cctctccctc tggaagctgt gtgcttctcc agggaccctt 10441 tagagcaact gtgtcaggtc aggcagcaca gaaacttcct ttatccttac aacctgctct 10501 tggggcccgt gcaccctgtc tttacctaga aggtgaggct cagagaagca attactcagt 10561 ggccggcccc tgccttggac taggtgcctc ctcacacctg ttccccaaca atggcatggg 10621 tggaatcacc tgggccggcc caggtgagag ccagcatggg cagtgtacta acctctcctg 10681 gcacttggca ggatgggcag ggtccaggtg aggaggctct cctgagcctg ggactgtgag 10741 gaccatcgct ctctgttccc atgccctccc aggggtcaga gagcccagac tcagagagcc 10801 cagggtcaga gaacttaggg tcagagaggc cagagtcaga gagctgagac tcagagagcc 10861 cagggtcaga gtgtccaggg ccagagagct tgtggtcaca gagcccagac tcagagagcc 10921 cagactcaga gccacctgat tggttagtgc agactcgcca aacccacagg gaggctgggc 10981 tcctccctgg cacgtgtgca acacaagtga aaatctcggt gcctccttca gcccccagcg 11041 catgtcagat ttcccggaat ggctcccctg cagctgcgaa cattcctggc agtcaacagg 11101 agcagcacgc agctgagctc tgctgtgggt tttgttgttt ctctagagtg agatggggca 11161 ggggctgcca tcactccctc cttgcagatg atgaccctga gtcctggcaa ggggaacttg 11221 cccggggctg tgtcaacaca ggggaagcag cagtactcag tgctgcagga tcaacagatg 11281 gtccctgatg aaggcgtagg agacactggg ggctcttgtt taacatgtaa aacagctttg 11341 acaagagaat gtggattttt cgcagctgat ggctgtgcca tggtcacctt cttccccaca 11401 ccagagtcca agggacttca ttttgtgtgt gtgtttgggg ggtcatgggc tgaattatgt 11461 ctcctcccca gagttcatct gttgaagtcc taacccctag taactcagca tgtgacctta 11521 tttggaatag ggtcattaca gatgcaactg gtgaagatga ggtaacatag gagtagaatg 11581 acccctgaat ccattgtgac caggatcc // LOCUS HSBGL3 2052 bp DNA PRI 07-OCT-1996 DEFINITION Human germ line gene for beta-globin. ACCESSION V00499 NID g29440 KEYWORDS beta-globin; germ line; globin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2052) AUTHORS Lawn,R.M., Efstratiadis,A., O'Connell,C. and Maniatis,T. TITLE The nucleotide sequence of the human beta-globin gene JOURNAL Cell 21 (3), 647-651 (1980) MEDLINE 81064667 COMMENT KST HSA.BETGLOBIN.1.GL. FEATURES Location/Qualifiers source 1..2052 /organism="Homo sapiens" /db_xref="taxon:9606" prim_transcript 104..1709 exon 104..245 /number=1 CDS join(154..245,376..598,1449..1577) /codon_start=1 /product="beta globin" /db_xref="PID:g29441" /db_xref="SWISS-PROT:P02023" /translation="MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFE SFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPE NFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH" intron 246..375 /number=1 exon 376..598 /number=2 intron 599..1448 /number=2 exon 1449..1709 /number=3 BASE COUNT 533 a 414 c 421 g 684 t ORIGIN 1 ccctgtggag ccacacccta gggttggcca atctactccc aggagcaggg agggcaggag 61 ccagggctgg gcataaaagt cagggcagag ccatctattg cttacatttg cttctgacac 121 aactgtgttc actagcaacc tcaaacagac accatggtgc acctgactcc tgaggagaag 181 tctgccgtta ctgccctgtg gggcaaggtg aacgtggatg aagttggtgg tgaggccctg 241 ggcaggttgg tatcaaggtt acaagacagg tttaaggaga ccaatagaaa ctgggcatgt 301 ggagacagag aagactcttg ggtttctgat aggcactgac tctctctgcc tattggtcta 361 ttttcccacc cttaggctgc tggtggtcta cccttggacc cagaggttct ttgagtcctt 421 tggggatctg tccactcctg atgctgttat gggcaaccct aaggtgaagg ctcatggcaa 481 gaaagtgctc ggtgccttta gtgatggcct ggctcacctg gacaacctca agggcacctt 541 tgccacactg agtgagctgc actgtgacaa gctgcacgtg gatcctgaga acttcagggt 601 gagtctatgg gacccttgat gttttctttc cccttctttt ctatggttaa gttcatgtca 661 taggaagggg agaagtaaca gggtacagtt tagaatggga aacagacgaa tgattgcatc 721 agtgtggaag tctcaggatc gttttagttt cttttatttg ctgttcataa caattgtttt 781 cttttgttta attcttgctt tctttttttt tcttctccgc aatttttact attatactta 841 atgccttaac attgtgtata acaaaaggaa atatctctga gatacattaa gtaacttaaa 901 aaaaaacttt acacagtctg cctagtacat tactatttgg aatatatgtg tgcttatttg 961 catattcata atctccctac tttattttct tttattttta attgatacat aatcattata 1021 catatttatg ggttaaagtg taatgtttta atatgtgtac acatattgac caaatcaggg 1081 taattttgca tttgtaattt taaaaaatgc tttcttcttt taatatactt ttttgtttat 1141 cttatttcta atactttccc taatctcttt ctttcagggc aataatgata caatgtatca 1201 tgcctctttg caccattcta aagaataaca gtgataattt ctgggttaag gcaatagcaa 1261 tatttctgca tataaatatt tctgcatata aattgtaact gatgtaagag gtttcatatt 1321 gctaatagca gctacaatcc agctaccatt ctgcttttat tttatggttg ggataaggct 1381 ggattattct gagtccaagc taggcccttt tgctaatcat gttcatacct cttatcttcc 1441 tcccacagct cctgggcaac gtgctggtct gtgtgctggc ccatcacttt ggcaaagaat 1501 tcaccccacc agtgcaggct gcctatcaga aagtggtggc tggtgtggct aatgccctgg 1561 cccacaagta tcactaagct cgctttcttg ctgtccaatt tctattaaag gttcctttgt 1621 tccctaagtc caactactaa actgggggat attatgaagg gccttgagca tctggattct 1681 gcctaataaa aaacatttat tttcattgca atgatgtatt taaattattt ctgaatattt 1741 tactaaaaag ggaatgtggg aggtcagtgc atttaaaaca taaagaaatg atgagctgtt 1801 caaaccttgg gaaaatacac tatatcttaa actccatgaa agaaggtgag gctgcaacca 1861 gctaatgcac attggcaaca gcccctgatg cctatgcctt attcatccct cagaaaagga 1921 ttcttgtaga ggcttgattt gcaggttaaa gttttgctat gctgtatttt acattactta 1981 ttgttttagc tgtcctcatg aatgtctttt cactacccat ttgcttatcc tgcatctctc 2041 tcagccttga ct // LOCUS HUMALIFA 7614 bp DNA PRI 31-OCT-1994 DEFINITION Human leukemia inhibitory factor (LIF) gene, complete cds. ACCESSION M63420 J05436 NID g178414 KEYWORDS glycoprotein; leukemia inhibitory factor. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7614) AUTHORS Stahl,J., Gearing,D.P., Willson,T.A., Brown,M.A., King,J.A. and Gough,N.M. TITLE Structural organization of the genes for murine and human leukemia inhibitory factor. Evolutionary conservation of coding and non-coding regions JOURNAL J. Biol. Chem. 265 (15), 8833-8841 (1990) MEDLINE 90256813 FEATURES Location/Qualifiers source 1..7614 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q11.2-q13.1" mRNA join(657..739,2471..2649,3343..6826) /gene="LIF" /note="G00-120-152" exon 657..739 /gene="LIF" /note="G00-120-152" /number=1 gene join(657..739,2471..2649,3343..6826) /gene="LIF" CDS join(721..739,2471..2649,3343..3753) /gene="LIF" /codon_start=1 /db_xref="GDB:G00-120-152" /product="leukemia inhibitory factor" /db_xref="PID:g178415" /translation="MKVLAAGVVPLLLVLHWKHGAGSPLPITPVNATCAIRHPCHNNL MNQIRSQLAQLNGSANALFILYYTAQGEPFPNNLDKLCGPNVTDFPPFHANGTEKAKL VELYRIVVYLGTSLGNITRDQKILNPSALSLHSKLNATADILRGLLSNVLCRLCSKYH VGHVDVTYGPDTSGKDVFQKKKLGCQLLGKYKQIIAVLAQAF" sig_peptide join(721..739,2471..2490) /gene="LIF" /note="G00-120-152" exon 2471..2649 /gene="LIF" /note="G00-120-152" /number=2 mat_peptide join(2491..2649,3343..3750) /gene="LIF" /note="G00-120-152" /product="leukemia inhibitory factor" exon 3343..6826 /gene="LIF" /note="G00-120-152" /number=3 BASE COUNT 1512 a 2159 c 2297 g 1646 t ORIGIN 1 aggcctgacc ctctggggcc ctgggcaacg tgttcccctc ggagcccctt ggggcccagt 61 ggcaatgtgc agattggagg aggctacctc tggggtggct tccagcgccc atgttcagtt 121 ccactttatg accgtctaaa gtccacgccg gccccggccc ctcctcctgg agcctccttc 181 tcagccaagc cctagccctc ctcctcccac tgccccccta aagcccccca accagggggc 241 acaggggcca gtctccatct gctagtccag actcgctacc tccccaaacc caggtgagtc 301 agggccgtct ggggcgccca ctgctgggac ccctgctgac tcggccaggg gccccctcct 361 ggcgatgcca tcttcagaca actcccggga caagccaggc aggaaaacca cagggcgttt 421 tgtcgaaggc ttcattataa ttttatcaat caaattctta gaagagggaa aaagtctgtt 481 ctccccaccc tcccccctca ctcgtccccc cccttcactc tcactttctt ccattcataa 541 tttcctatga tgcacctcaa acaacttcct ggactgggga tcccggctaa atatagctgt 601 ttctgtctta caacacaggc tccagtatat aaatcaggca aattccccat ttgagcatga 661 acctctgaaa actgccggca tctgaggttt cctccaaggc cctctgaagt gcagcccata 721 atgaaggtct tggcggcagg taaatacacc cgccccgcgc cggcttcgcg tccccgctgc 781 ggggcgcggc ggcaacttgg ggcgcttggc agcgcgagcc ggacgcccac ccgccgcaga 841 cacacgaaca cttggggcgc ccgcgcagcc accggaggcg ctgggtggcg gcccggagcg 901 agcgcgggca catggtccca agcaccgcac gccccgcggc aggttggcgg cgagggaggg 961 ggcgccgagc cttcctcctc ctcctcttct tccccctctc cttctcctcc tcctcctcct 1021 cgctccggac actcgctggc tgctcttccc tctgcgcttt ctcccagatt cactctccct 1081 ctctcttttt cttttctttc tttctttccg ctttctcctt tccaaccgcg gtcccgggct 1141 gctcccaggg aggggcgcgg gcggcgagca gcttgcaaac tccggcctgg gacgggagca 1201 ggtgccgcct ccatctgctg gtgtctggaa gcgtgtggtc tgcgctaggt gagatatagg 1261 ggtgtggccc ccctcccgca gccaccccgg ggcctcagca ctgccttggg atgctgggac 1321 gaacagggga cccaggagaa gtgaacttga ggaggcccct gtccccgctg ttccagcatg 1381 ggggatccgg gggaggtctc cagcctcact cgcccgctga ccccggcccc caccatcttc 1441 aggtgccctt ctgcagagcc ctggaaaagc gtctaagggg gcctgggggg agtcgggaga 1501 acaggccccc ctgggggagg ggcaaaccag gacatgtcgg gacagctccc agctctcctg 1561 tcacctttca ctttccttcc tccccgccca cccacctgcc tatgaccttt tgccttttct 1621 ctctccattt cctctccctc cctgagccgg tgtgtgtgtg ttaggaggga ggggaacccc 1681 tgggacaagg gacagcggat gagtcgggag aaggcatgga gtcaggggct gtccggagct 1741 gggggaacag agagttttga atgatgattt ggggatggag agtggggaca gccaggcaga 1801 aatggggtga gcttgagtga gataggggac actagggaag gaagagatag aggatgatgg 1861 gggcgggggc agaattgggg gcaattggcc cagggagcca gaatcaagtg gcaggtttgg 1921 gagggagatg agggtcaacc aggactcctc cccactcccc catgccccgt ggcccccatg 1981 ggtgcctggc ttcggagatt tgggctgcaa tgggccagtg agtgggaagc gctgctgagg 2041 aacctgggcc accacgggag gtgggaagag aggggtctcc tttctcctgg tgcctgctgg 2101 cctggggcct ggtgccttcc agccagaggg ccagggggcc ttaggacctt tgccttcctg 2161 agggaaaggg tgggatggct gggagtctct cctggaccct gcaccctttg ggtggaaatg 2221 gcttgtgtct tgccctatct ttacgtcatc ccagaagaga gcagggagaa ctaaggtaga 2281 gaaagggaga cagagaaaca cacgcagtga cagagtgaag actaggcccc aggaagacag 2341 ctgcaggtgg tgagggggaa ccaggagtcc tctcccatgc ggccattgtt ggacccccat 2401 ccggtgtgcc atgaccccag gccacccttt cctgcctttc tactcatggc ttcttcctga 2461 ctgtccccag gagttgtgcc cctgctgttg gttctgcact ggaaacatgg ggcggggagc 2521 cccctcccca tcacccctgt caacgccacc tgtgccatac gccacccatg tcacaacaac 2581 ctcatgaacc agatcaggag ccaactggca cagctcaatg gcagtgccaa tgccctcttt 2641 attctctatg taagttaccc ctgggatact gacaggagat ggcagggagg gggcttgtaa 2701 atatcattag gggctgtcct gatctgggtt gaggggacct tttggggctg gaaggagaga 2761 atggggagag ggcttgatta aaccaccccc agactcctgc cacttcctgc ccaagcttcc 2821 ccagggaagc ttccccaggg tgcccagtta gcaaggggag aactgagtgc aaaggtgggg 2881 acctggcact tcttatcttg tgattgtcct gctgcaggga gcgagggatg gaggggaaat 2941 gggcgtgagg caccagggag atgcggttga gaggcagtgg gctgtgggtg ctgggcatgg 3001 aggggcgtcc cggaacattg tgagtgcagg gatggaagta cttgtgtgtg gtgccccagc 3061 tagggctaga caccgagttt tcccttctgt ccccttaggg tggtgatgat gatgatgatg 3121 ataatgatga ctgcgtgcat ggctcagtct ttgatcttta gcaagggcac tcacattaca 3181 attagttttg gctctcatga caattccaga tgcttacagg gcaaggagtt gggtcctcat 3241 gcgctagatg gggaaacaga cgcaagagct tgcccaaagg gttggcggca gggctgggac 3301 actgacccct gactcccacg tcacctccct tctgcccctc agtacacagc ccagggggag 3361 ccgttcccca acaacctgga caagctatgt ggccccaacg tgacggactt cccgcccttc 3421 cacgccaacg gcacggagaa ggccaagctg gtggagctgt accgcatagt cgtgtacctt 3481 ggcacctccc tgggcaacat cacccgggac cagaagatcc tcaaccccag tgccctcagc 3541 ctccacagca agctcaacgc caccgccgac atcctgcgag gcctccttag caacgtgctg 3601 tgccgcctgt gcagcaagta ccacgtgggc catgtggacg tgacctacgg ccctgacacc 3661 tcgggtaagg atgtcttcca gaagaagaag ctgggctgtc aactcctggg gaagtataag 3721 cagatcatcg ccgtgttggc ccaggccttc tagcaggagg tcttgaagtg tgctgtgaac 3781 cgagggatct caggagttgg gtccagatgt gggggcctgt ccaagggtgg ctgggcccag 3841 ggcatcgcta aacccaaatg ggggctgctg gctgaccccg agggtgcctg gccagtccac 3901 tccactctgg gctgggctgt gatgaagctg agcagagtgg aaacttccat agggagggag 3961 ctagaagaag gtgccccttc ctctgggaga ttgtggactg gggagcgtgg gctggacttc 4021 tgcctctact tgtccctttg gccccttgct cactttgtgc agtgaacaaa ctacacaagt 4081 catctacaag agccctgacc acagggtgag acagcagggc ccaggggagt ggaccagccc 4141 ccagcaaatt atcaccatct gtgcctttgc tgccccttag gttgggactt aggtgggcca 4201 gaggggctag gatcccaaag gactccttgt cccctagaag tttgatgagt ggaagataga 4261 gaggggcctc tgggatggaa ggctgtcttc ttttgaggat gatcagagaa cttgggcata 4321 ggaacaatct ggcagaagtt tccagaagga ggtcacttgg cattcaggct cttggggagg 4381 cagagaagcc accttcaggc ctgggaagga agacactggg aggaggagag gcctggaaag 4441 ctttggtagg ttcttcgttc tcttccccgt gatcttccct gcagcctggg atggccaggg 4501 tctgatggct ggacctgcag caggggtttg tggaggtggg tagggcaggg gcaggttgct 4561 aagtcaggtg cagaggttct gagggaccca ggctcttcct ctgggtaaag gtctgtaaga 4621 aggggctggg gtagctcaga gtagcagctc acatctgagg ccctgggagg tcttgtgagg 4681 tcacacagag gtacttgagg gggactggag gccgtctctg gtccccaggg caagggaaca 4741 gcagaactta gggtcagggt ctcagggaac cctgagctcc aagcgtgctg tgcgtctgac 4801 ctggcatgat ttctatttat tatgatatcc tatttatatt aacttattgg tgctttcagt 4861 ggccaagtta attccccttt ccctggtccc tactcaacaa aatatgatga tggctcccga 4921 cacaagcgcc agggccaggg cttagcaggg cctggtctgg aagtcgacaa tgttacaagt 4981 ggaataagcc ttacgggtga agctcagaga agggtcggat ctgagagaat ggggaggcct 5041 gagtgggagt ggggggcctt gctccacccc catcccctac tgtgacttgc tttagcgtgt 5101 cagggtccag gctgcagggg ctgggccaat ttgtggagag gccgggtgcc tttctgtctt 5161 gcttccaggg ggctggttca cactgttctt gggcgcccca gcattgtgtt gtgaggcgca 5221 ctgttcctgg cagatattgt gccccctgga gcagtgggca agacagtcct tgtggcccac 5281 cctgtccttg tttctgtgtc cccatgctgc ctctgaaata gcgccctgga acaaccctgc 5341 ccctgcaccc agcatgctcc gacacagcag ggaagctcct cctgtggccc ggacacccat 5401 agacggtgcg gggggcctgg ctgggccaga ccccaggaag gtggggtaga ctggggggat 5461 cagctgccca ttgctcccaa gaggaggaga gggaggctgc agacgcctgg gactcagacc 5521 aggaagctgt gggccctcct gctccacccc catcccactc ccacccatgt ctgggctccc 5581 aggcagggaa cccgatctct tcctttgtgc tggggccagg cgagtggaga aacgccctcc 5641 agtctgagag caggggaggg aaggaggcag cagagttggg gcagctgctc agagcagtgt 5701 tctggcttct tctcaaaccc tgagcgggct gccggcctcc aagttcctcc gacaagatga 5761 tggtactaat tatggtactt ttcactcact ttgcaccttt ccctgtcgct ctctaagcac 5821 tttacctgga tggcgcgtgg gcagtgtgca ggcaggtcct gaggcctggg gttggggtgg 5881 agggtgcggc ccggagttgt ccatctgtcc atcccaacag caagacgagg atgtggctgt 5941 tgagatgtgg gccacactca cccttgtcca ggatgcaggg actgccttct ccttcctgct 6001 tcatccggct tagcttgggg ctggctgcat tcccccagga tggcttcgag aaagacaaac 6061 ttgtctggaa accagagttg ctgattccac ccggggggcc cggctgactc gcccatcacc 6121 tcatctccct gtggacttgg gagctctgtg ccaggcccac cttgcggccc tggctctgag 6181 tcgctctccc acccagcctg gacttggccc catgggaccc atcctcagtg ctccctccag 6241 atcccgtccg gcagcttggc gtccaccctg cacagcatca ctgaatcaca gagcctttgc 6301 gtgaaacagc tctgccaggc cgggagctgg gtttctcttc cctttttatc tgctggtgtg 6361 gaccacacct gggcctggcc ggaggaagag agagtttacc aagagagatg tctccgggcc 6421 cttatttatt atttaaacat ttttttaaaa agcactgcta gtttacttgt ctctcctccc 6481 catcgtcccc atcgtcctcc ttgtccctga cttggggcac ttccaccctg acccagccag 6541 tccagctctg ccttgccggc tctccagagt agacatagtg tgtggggttg gagctctggc 6601 acccggggag gtagcatttc cctgcagatg gtacagatgt tcctgcctta gagtcatctc 6661 tagttcccca cctcaatccc ggcatccagc cttcagtccc gcccacgtgc tagctccgtg 6721 ggcccaccgt gcggccttag aggtttccct ccttcctttc cactgaaaag cacatggcct 6781 tgggtgacaa attcctcttt gatgaatgta ccctgtgggg atgtttcata ctgacagatt 6841 atttttattt attcaatgtc atatttaaaa tatttatttt ttataccaaa tgaatacttt 6901 tttttttaag aaaaaaaaga gaaatgaata aagaatctac tcttggctgg ctctccggag 6961 tgtactgatg tggggagatg ggctggaagg gctgggactg tccctgtcct gggcaccagc 7021 caagtgggac tcagcgaagg gtggaggagg gtgggggagg ggcacctggc ataggtgggg 7081 gcagttaggt ggtattttgg ccaaggcaga acaaggtggg tggtgtctag atcatggggt 7141 gcccccaagg agagagatgg attgctcaga ggtaaagggg gtgctgggca cggtgggtca 7201 ctcctgtaat cctagcactt tgggaggctg aggcaggtgg atcatttgag cccaggaatt 7261 cgagaccagc ctggccaaca tggtgaaacc ctgtctacaa aatatacaaa cagccagatg 7321 ctgtggcgtc cgcctgtggt cccagctact cggggtgctg aggtgggagg atcccttgat 7381 cccaggaggt ggaggctgcg gtgagccatg attgcgccac tgcactgcag cctgggtgac 7441 agaggaagac cctgtctcaa aaaaaaaaaa aaaaaaaaaa aagaagtaaa cggggtgccg 7501 tgtgatatct cagctcattc cctcccaact cctcacttaa cctcatggga tctccaggag 7561 ttgccatccc cacataccag aggaagaaat cgaggctcag agccatgaaa ccac // LOCUS HUMFABP 5204 bp DNA PRI 08-NOV-1994 DEFINITION Human, intestinal fatty acid binding protein gene, complete cds, and an Alu repetitive element. ACCESSION M18079 J03465 NID g182351 KEYWORDS Alu repeat; fatty acid binding protein. SOURCE Human DNA (library of T.Maniatis), clone lambda-HIFABP. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5204) AUTHORS Sweetser,D.A., Birkenmeier,E.H., Klisak,I.J., Zollman,S., Sparkes,R.S., Mohandas,T., Lusis,A.J. and Gordon,J.I. TITLE The human and rodent intestinal fatty acid binding protein genes. A comparative analysis of their structure, expression, and linkage relationships JOURNAL J. Biol. Chem. 262 (33), 16060-16071 (1987) MEDLINE 88058967 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Sweetser, 19-JAN-1988. FEATURES Location/Qualifiers source 1..5204 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q28-q31" prim_transcript 1028..>4393 /note="FABPI mRNA (alt.) and introns" prim_transcript 1053..>4393 /note="FABPI mRNA (alt.) and introns" exon <1089..1155 /gene="FABP2" /note="fatty acid binding protein; G00-119-127" /number=1 gene 1089..1155 /gene="FABP2" CDS join(1089..1155,2350..2522,3546..3653,4098..4148) /note="fatty acid binding protein" /codon_start=1 /db_xref="PID:g182352" /translation="MAFDSTWKVDRSENYDKFMEKMGVNIVKRKLAAHDNLKLTITQE GNKFTVKESSAFRNIEVVFELGVTFNYNLADGTELRGTWSLEGNKLIGKFKRTDNGNE LNTVREIIGDELVQTYVYEGVEAKRIFKKD" intron 1156..2349 /note="FABPI intron A" exon 2350..2522 /number=2 intron 2523..3545 /note="FABPI intron B" exon 3546..3653 /number=3 intron 3654..4097 /note="FABPI intron C" exon 4098..>4148 /note="fatty acid binding protein" /number=4 repeat_region 4466..4742 /note="Alu repeat" repeat_region 4466..4472 /note="5' direct repeat" repeat_region 4736..4742 /note="3' direct repeat" BASE COUNT 1770 a 867 c 836 g 1731 t ORIGIN 156 bp upstream of HindIII site; chromosome 4q28-q31. 1 gtaatatctt gggcaagccc tagagcttct ttcctgaccc ttagttaata agatgttatc 61 tggtcacatt cagtcacaat aatagactca ttttagtaat aaacatctta agactagtaa 121 ttaaaactct ttacttcaca ccaagtttcc tccccaagct tggcctgttc ctggctggca 181 gcctgaagta gggaaaggag agatatggtg accttttctt tgtacctttc tagctaccct 241 ctataccctg accccacata cataattgag ctgtggcttc tgactctact gggtttgggg 301 atgagaggca gtgagagtaa aatgaaggag tggttttaat taatggcaca gctaaaactg 361 gattttgttc tctctgcaca tggcagatgt ttaaagctca ttctttcttt tatgcaagtt 421 tttacaccat ccagcctcat ttgtacctct tgaatttttg ctcagtggcc tatcaccatt 481 caggatcaag acaaaaatca atgagcactt attgtgtgtc atgcacccta caaagtgcca 541 ggatatttat ccaaactcct ggcaatgcta aacacaatgc aaaaagacat attagaaaac 601 gaatcttatt aactttagct tttcaactgt atttcatcat aaagtcttac tttacaagat 661 aattgctgtt gtgaaaaagg gaaaggtcat ggtctcattt cccagatgtt atttgatata 721 tgctataaat tatattacct ccaacatagt ctgcactttg aacttagaaa aacaatcttc 781 agacggcatg cattctaatt cttgaaataa gtatgcccac aaactgtagt ttaagacaga 841 ataggtatgc ttctcatgtt ttaattcagt tgaatttcag aagatctcag gaatgtacag 901 aacgagaatt aagaattaat aagaataaga attaattaat tgcttgacat agagtagtta 961 ggtgatttcc tgaactttaa gcttccacat cacagtatga agttggttca agataagaaa 1021 tataataaat tctcgcccaa ggacagacct gaatctctag ctgcctagag gctgactcaa 1081 ctgaaatcat ggcgtttgac agcacttgga aggtagaccg gagtgaaaac tatgacaagt 1141 tcatggaaaa aatgggtaaa gactttattt ctttgtggct cattctttgc tttcttacaa 1201 acatttttct ttctaactcc taaatctcta ggagattaca gatagcttac agatagctcc 1261 tgatgtggta gagagggatc cagaagatgt tcagaggagg gaaaccatat tttcccttct 1321 tacattagga agaatccact atctcactaa tggaagaaaa gattctttga gtgctgttct 1381 ctgaaacaca ccaaaaagat ccagaaatgt ttccttcact ctttaactga aaaatgactt 1441 tttttgttgt ttacagtaag aaaatggcag cgtgtaatga taacttccag atctgaaaat 1501 gttaaattct aggagatgga aaaacaaaga ccatataaga aagtaatgga aaaagttctc 1561 ttaaaattta tagctctgaa taagttagat ttaattctga tttcttctaa cttaaaaaag 1621 ttttggaata atcttgagaa gctgtgtagt tttctccagg gcgtttaatt taactgattt 1681 ataatttgat accaatactc tggcagccca tatactatac aagataggca aacaaatttg 1741 tgtcattccc ctaaaagaaa aatctgcatc aattatagct tacagtttag gaactctaag 1801 tttaaattta taaaagttgt agattcttat agtgattttg gcttaatatt tgctaatttt 1861 ctcatttttg tgtcagaaag aaatgccaca agaagcaaat agaactataa agttcaaaat 1921 gttaaagcca ctaagaaaaa caaaggggca tttaagaaaa aagaatactg tatatgtgga 1981 attaaagatg tgcttcctta taaatatatg aatatacatt ttaatccttc atttaatatt 2041 tctagaattt gatttactta acactgaaat gaacagtttg ttaatcttat taaggttgct 2101 cagctctaag attctataat tctgtactct acttaatttt tctcaagtta tggaaaaaca 2161 actttaatca gttctcttga tcggattgaa cctgaacttc tatagaagca atctgaatgt 2221 tcttgtgcaa aggcaatgct accgagtttt cttcccaccc tcaaaataaa caaacaaaac 2281 ataacttgga aaaataaaca cttcctatgg gatttgactt tattttctcc attgtcttac 2341 cttttacagg tgttaatata gtgaaaagga agcttgcagc tcatgacaat ttgaagctga 2401 caattacaca agaaggaaat aaattcacag tcaaagaatc aagcgctttt cgaaacattg 2461 aagttgtttt tgaacttggt gtcaccttta attacaacct agcagacgga actgaactca 2521 gggtaagaat tttttttttt atgagcaatg cattcttgat ttttctaccc aatattaaaa 2581 tgatttctgc tctatttcat tggatggttt aattaatgca ggtctccttc actaactgaa 2641 gaagccaatg aagtttgtct acattatata ttacacaaat tggcagggta tttaaatatg 2701 cttttatttt tatacgcatc tgtgaagaat ctgaattgaa cagtaagaat tagaaaacta 2761 tcttttgaat gactgaatat agacctattc ataaagaaat ttaaaactgt gtttttaaac 2821 agtacagcaa aagaagcctt tagagttaat atgtaactta actgtaacat gttgaaataa 2881 taaaagaaat gaatagatga acaaatgagt gagttaccaa atggaaagat ttgatgtatt 2941 gtaggtcatt gggagtgtac cttttcatgt ttaagataac acattttagg aagtcatcat 3001 tttcaacaaa ttttttaaaa acttttttta gcctcaacat ttttctattt aaattacatg 3061 tttgtaatga caatttaact actgaatgtt ttatcgtaag ttatgtcttt ccttaattag 3121 taccacaatc acacaaatta aaacaagcac aggttattaa catctccgtg aaactaattt 3181 taaccatgac tatatttctg gacacgtaac atgaaagatt cagaaagaag tgctgctcat 3241 ctgccttaaa attcagcgta tggaaattat tgaagagaac aagcataatg gttatcaaca 3301 catactctgt agcccaatgg cctaggttca atcctcactc tgtgacttta ggtgaatcac 3361 tgtgccattt tacagtctcc tcttctgcaa agtagagata gtagtatcag tttcataggg 3421 tcaccatgaa gattaaatga aaaagtgtgt ctacagaact cagaacagtg cctgacatgt 3481 gtaagaccct aataaatgcc attattatta ttattattat tattattatt attattatta 3541 tgtaggggac ctggagcctt gagggaaata aacttattgg aaaattcaaa cggacagaca 3601 atggaaacga actgaatact gtccgagaaa ttataggtga tgaactagtc caggtgagtt 3661 gtcaaattta tagctatttt caaaaggcaa aaattactac aaaacaataa tttttgtcac 3721 tgctgagcca gatcttcagt aaactgacta cttcttttct cataaatctt actgatttta 3781 aaaatattgt atagctattt tctgatgcct atttactaaa gacaacttat atatgtcaaa 3841 taatcaatgc ctattttaac tgaaaatata aatgactaca aaccaacatg tgttttaaaa 3901 tggctgtatc ccatatctgt ataaatcttg ctatcaagta caagaaaaaa ttgtataaac 3961 tcatactcat ataatatata tgaatatata atataaaaat agtataaact catatagtat 4021 aaaactataa tactactttt tcttaactta gatgtaaacc ttaaagataa attcttctgt 4081 ttgttaacac ctttcagact tatgtgtatg aaggagtaga agccaaaagg atctttaaaa 4141 aggattgagc attattcttg gcgcacagtc caaaatacaa attggacaga agatctatat 4201 tgtaccagaa ctgtttattt caccccatca agtataaggt tactgattga ttggtccttt 4261 tataaacatt ggtatatttc cattcatgcc aaagcaaaag aagtaaaagc taattaggat 4321 ttaatttgtt ttatattctc taagatatat atttactaaa agaatttgtg acattttaaa 4381 aaacaaaaat aaatattgca tccatgttgc tttatatgta gccttgcctt ttaaaagaaa 4441 aagtatgtga atatgaattg acagattgtt ttcgtagaga gagggtctta ctctttcact 4501 caggctggaa tgcagtggag agatcatagc tcactgtaac ctcaaactcc tggactcatg 4561 caatcttcct gcctcaggct tctgagtagc taggactatg ggtacattcc acagtgccca 4621 gctaattttt gttttgtttt ctttttattt tttttagaga tggggtcttg ctatattgcc 4681 caggctggtc ttgaacccct ggcctcaagc aatcctcctg cctcagcctc tcaagttgtt 4741 tttttcttta catttgataa actaaaagca taggctgcat atgagtcttt aacatcttga 4801 actggttgtg aataattttc tggcactggt tgtaagtaat atctattatt ataaaaataa 4861 tatatgctca accagaaaac ttagaaataa gaaacacaaa tgtaaaataa gtatttccat 4921 aactcataat ccagagataa ttgccattct gattttgata gatatcctct cagctctctt 4981 ccctgggggc agatatttcc caatacatac cactttgaat aggatgatag gaaataaatg 5041 atgtactaca ttaaattaaa ttattgtatt acatttttgt acacatcagt cattcccagg 5101 cttggctgaa aatcaggatc atctgagaaa cttaaacaat ttctgcattc ttaatctcca 5161 ctgttattct attatatcag aatcgctaat agaaccaaga attc // LOCUS HUMLYTOXBB 6305 bp DNA PRI 14-MAY-1996 DEFINITION Homo sapiens lymphotoxin-beta gene, complete cds. ACCESSION L11016 NID g292278 KEYWORDS lymphotoxin-beta. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6305) AUTHORS Browning,J.L., Ngam-ek,A., Lawton,P., DeMarinis,J., Tizard,R., Chow,E.P., Hession,C., O'Brine-Greco,B., Foley,S.F. and Ware,C.F. TITLE Lymphotoxin beta, a novel member of the TNF family that forms a heteromeric complex with lymphotoxin on the cell surface JOURNAL Cell 72 (6), 847-856 (1993) MEDLINE 93208881 FEATURES Location/Qualifiers source 1..6305 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MANN DR7 homozygous cell typing line" mRNA join(2764..2933,3330..3375,3559..3630,4026..4631) /product="lymphotoxin-beta" exon 2764..2933 /number=1 /product="lymphotoxin-beta" CDS join(2772..2933,3330..3375,3559..3630,4026..4480) /note="The 3' end of this gene lies 2.23 kb from the 3' end of the tumor necrosis factor gene" /codon_start=1 /product="lymphotoxin-beta" /db_xref="PID:g292279" /translation="MGALGLEGRGGRLQGRGSLLLAVAGATSLVTLLLAVPITVLAVL ALVPQDQGGLVTETADPGAQAQQGLGFQKLPEEEPETDLSPGLPAAHLIGAPLKGQGL GWETTKEQAFLTSGTQFSDAEGLALPQDGLYYLYCLVGYRGRAPPGGGDPQGRSVTLR SSLYRAGGAYGPGTPELLLEGAETVTPVLDPARRQGYGPLWYTSVGFGGLVQLRRGER VYVNISHPDMVDFARGKTFFGAVMVG" intron 2934..3329 /number=1 exon 3330..3375 /number=2 /product="lymphotoxin-beta" intron 3376..3558 /number=2 exon 3559..3630 /number=3 /product="lymphotoxin-beta" intron 3631..4025 /number=3 exon 4026..4631 /number=4 /product="lymphotoxin-beta" BASE COUNT 1302 a 1730 c 1618 g 1655 t ORIGIN 1 gaattcctgg gctcagaggt cctcccacct tagccttctg agtagctagg actacagaca 61 ccagctacca catgaggctt tgtagaaatg gggtcttact atgttgccca ggctgatttt 121 gaactcctgg tctcaagcaa tctttccacc ttagccttcc aaagtgctgg aattacagga 181 gtgggccact gcacctggct ctattaacat tttttatttg ctttatcaca tatttatcaa 241 tccatctcac ttttaaatat cttttaaaat tacaaatatc agtacatttt acatctaaac 301 ccttcagaag cttaacattg actggagttc agtatttatt tccccatttc ttttctggcc 361 tgaggaaggc aaattttaca tacaaatctc aagtcagtac tctttttttt ttttgagacg 421 gagtcttgct ctgttgccca ggctggagtc cagtggtgtg atcttggctc actgcaacct 481 ctgccttctg ggtacaagcg attctcctgt ctcagcctcc caagtagctg ggactacagg 541 tttgtgccac catatccagc taatttttgt atttttaatg gagaaggggt ttcaccatgt 601 tggccaggct ggtctcaaac tcttgacctc aagtgatcca cctgccttgg tctcctaaag 661 tgctgggatt ataggtgtga gccatctcgc ctggcctaat actgttttgt ttgtttgttt 721 ttgtttttaa gacagagtct tgttcttgtc acccaggctg gagtgcaatg gcatgatttc 781 ggctcactgc aacttccgcc tcctgggttc aagtgattct cctgcctcag cctcccaagt 841 agctggaatt aaaggtgcct accaccacgc cccgctaatt tttatatttt tagtagagat 901 ggggtttcac catgttgatc aggctgctct cgagctcctt acctcagatg atccaccttc 961 cttggcctcc caaagtgctg gtattatagg caagagccac tgcgcccagc cccagtattc 1021 agtttttaaa ctgtcttgtt atcaaggctc tggagccaga tgcctgggtt caaattctgg 1081 ttctgccact gactctgtga gctccataag tttcttaacc tctctgtacc tcagtttcct 1141 cttagggttt ttgtcaggat tataattatt ggctgggcat gatggctcat gcttgtaatc 1201 ccagcacttt aggaggccaa cacgggcaga tcacgtgagt ccaggagttt gagcccagcc 1261 tgggcaatgt ggcaaaaatc catctctaca aaaaatgcaa aaattagctg ggcatggtgg 1321 catgtgccta tagtcccagc tattcaggag gctgaggtag gtgaatccat agatcctggg 1381 aggtcaaggc tgcagtgagc catgatcctg ccattgcatt ccagtctggg tgacatagcg 1441 agaccctgtc tcaaaaaaaa aaattattaa agtgtgtaaa tcagtggcat aaacatgtta 1501 agtgcatttt gtgggtcagc tatattatta ttagtattac ggaaacacat agagatgtta 1561 ccaagaaggg gagatgattg gagccacttc cagcttcctt ggacctggtc tttcttccct 1621 tgactctttt tttttttttt tttttttttt tgagagagag agtctcagcc tgttgcccag 1681 gctggagtgc aatggtgcaa tcttggctca ctgcaacctc tgcctcccag gtttaagtga 1741 ttctcctgcc tcagcctcct gtgtagctgg aattacaggc gcgtgccacc acgcccggct 1801 aactttttgt atctttagta gagacagggt ttcaccatgt tggccaggct ggtctcgaac 1861 tcctgacctc aagtgatcca cctgcctcag cctcccaaag tgttgggatt acaggtgtaa 1921 gccactaccc cggctactcc cttgactctt aaccactcat gctgcctaca tctaccattc 1981 atgtggtcct tgctgctttg ttttggttat tcctgcattt atttgtcctt ttattcattt 2041 atgtataaac atttagtaag cacctactaa tggatagggc tcattgtaga cttggaagct 2101 ctctgagggt gggagtatgc ctcgtccatc tgtctttact ttttgtagca agggaggtaa 2161 agctccattt ccatccctcc ttagtgagtc agtagtcagt ggtgaggcta aggcttacct 2221 ctccctttct cactcagcac agggggctgg agatgagcaa gggaacggga ggaggtcagc 2281 ccagtatggg aatcagttct tctcagggaa cccagacatc catccctcaa gattccagtc 2341 cttgtcctag tccggccctt gacctcagag acgggatcag ctcttcctcc agcacctacc 2401 ttgagggtat agaagaatgc aaaccacatt ggaaacctgg agatctgtgt tctcatttca 2461 gctctgctga ctggcttcct gcaagctacc ttccctccct gggcctcagt ttctctctct 2521 gctgagccag aagatgtcta aagacccctt tggttccacc ctgagagcct gtctccctaa 2581 cctcaacttc ttccccagtt cagagaaccc aggcatccag ctgccccacc ccagctctgg 2641 gtaaacagga agctgggtga ggggagcagg ggtgtgcgga aagtcccagc caggtgtgca 2701 ggtctacagg gagggggtgg gcccgtccct gaggtatgaa agccccctgc tctggctctg 2761 gttcagtctc aatgggggca ctggggctgg agggcagggg tgggaggctc caggggaggg 2821 gttccctcct gctagctgtg gcaggagcca cttctctggt gaccttgttg ctggcggtgc 2881 ctatcactgt cctggctgtg ctggccttag tgccccagga tcagggagga ctggtgagtg 2941 gctgcaacag gccctggtgg agagttgtat cttgcggatg cttggctccc tctggttgtg 3001 cctgtggtct tttgccccct ctggctcagc tggctcggct gtccctggtg gggatgtctt 3061 gtctctttgc tgactctctt tccatgttcc tgtgatgttg tgcttgtgtc ccgacataag 3121 ccccttgtgt ctcctctcct cttcccgagg tacatctgtt tctccgccca agtacctatg 3181 ccttgcttgt tctcccttct aaggaggtgt gtgttgggga tggtgctggt aggagaaacc 3241 ccaggcctgc agcttgggtc cactttcaga ggggtagggg tgacatgagc tgaatctgaa 3301 ctctgggcac tgtgacccca cccaaccagg taacggagac ggccgacccc ggggcacagg 3361 cccagcaagg actgggtaag agcagactgt ctctccttcc ccgcttcaga ccctcagggg 3421 ctcccagctc cctgctgcgt ccccagatac ctcttcctct aggaatccag gctccccatc 3481 cctgcgccct gttctctcaa gggtagcctg catgggtggc tgccctgccc ccaatcgtgg 3541 actctttgcc ccttccaggg tttcagaagc tgccagagga ggagccagaa acagatctca 3601 gccccgggct cccagctgcc cacctcatag gtaaggacct ccaagacctg aataagagtg 3661 taaataatcc gaaggttcca gttctgctcg cccagagtcc ttcggctcca tgattccagt 3721 gctcggtttc ccacccgctt cacgaccttt tgtcgctcgt gcccactctt acgctcgtcc 3781 ccgcagtgta gtttcttctt ccctccggtg caagcaaaag ccggcctgga ggtccccact 3841 acagcgttct gcaccccaca tccgtgttcc ctcggccccc aactcgcact catcccagaa 3901 acagcaccat ccctcctccc ccggcccggc tcggctcccg caggggctaa aagccgccac 3961 ttccccagaa gtcccaagcc tttaggatcg cattcccaag agcgcgtcgg cccgtgtctc 4021 cgcaggcgct ccgctgaagg ggcaggggct aggctgggag acgacgaagg aacaggcgtt 4081 tctgacgagc gggacgcagt tctcggacgc cgaggggctg gcgctcccgc aggacggcct 4141 ctattacctc tactgtctcg tcggctaccg gggccgggcg ccccctggcg gcggggaccc 4201 ccagggccgc tcggtcacgc tgcgcagctc tctgtaccgg gcggggggcg cctacgggcc 4261 gggcactccc gagctgctgc tcgagggcgc cgagacggtg actccagtgc tggacccggc 4321 caggagacaa gggtacgggc ctctctggta cacgagcgtg gggttcggcg gcctggtgca 4381 gctccggagg ggcgagaggg tgtacgtcaa catcagtcac cccgatatgg tggacttcgc 4441 gagagggaag accttctttg gggccgtgat ggtggggtga gggaatatga gtgcgtggtg 4501 cgagtgcgtg aatattgggg gcccggacgc ccaggacccc atggcagtgg gaaaaatgta 4561 ggagactgtt tggaaattga ttttgaacct gatgaaaata aagaatggaa agcttcagtg 4621 ctgccgataa agatgctgag ttgcgacaca cgtcttaatt cagggtgggt gcacgggtgc 4681 gggttaaata ttctcagtac tcttctggtt gcttgaaaca attcatcaca acacagtgta 4741 tggcctttgc tcctagggat gatggtctgc ctgtcccacc ccctccctgc ctctgaatgg 4801 ccaggcccca ccattagccc agttggaggg tgggaggaag ggggacttct caaactccga 4861 agcttctcta ggcatcctga ttttcagggc cacatggtcc caaccagact ctgcaccata 4921 ctcttttctc ttgggtaccc cccaacagtg agaggggtca ttacagagcc cagcaagcac 4981 cactcagaaa ggcccagcag cagagtaagc ccctatcatg acagaggaat gaagcctgga 5041 ggggccccgc acttctcccc ctagagctgc ctgaaggcct ctctgtctcc tacccgacag 5101 tcaactcttc tcctccaagg agcttaattc aaggctcatg gggtctgaag ggaggaggct 5161 gaaggagaaa gaaggggaga atattagaga gagatgggga tggcaggaag gagcctgtgg 5221 tgcctgaaaa caccaggaag ttctggggag gaggaaaaac cgatgcccca cttagggtgt 5281 cccatttagg gtgagacgga aaatcctcac ctttttttca cactttaggt cccccttccc 5341 aaaagtgagt aagtgtgggt gcttctggga tgagtaacag tgtcccccat tacttcatgg 5401 ctgactttca gccacaggct ggaggaggca gagggtgacc caaggcccta tctaggtcac 5461 cccaatgggt caccctaccc cctcagccta ccacatggtt ttctcctgcc tggcacccca 5521 gggctggagg taaagcctaa tttccgaact cagtgggggc tcccagtcta ggggggctca 5581 atttccgtct ccatatttgt ttttggaatt attatttttt tgagacaggg tctcgttctg 5641 tcacccagac gggggtacag tggcatgatc atagcttact gtaacctcaa actcctgggc 5701 ttgagtgatc ctcctgcctc agcctcctga ggagctagga ttacaggcat gcaccactac 5761 acctgactaa tctttaattt tttttctaga aacaaggtct tgctatgttg cacaggctgg 5821 tcttgaacta gtgggctcaa gtggtcctcc cacctcagcc tcccaaagtg ttgggataac 5881 aggcatgagc cactgcgccc cacccttatt tgtctttgac tctctccaga agagccttca 5941 tccagggagg gggtgctttt ctctttccgg attacccacc tctcacctct cccctccttc 6001 accacaaaga ccagtgggac caagccggca tgtgagtcct tcacccacat cttattccta 6061 tgtttcattc ttttttaaaa aatagagaca ggatctcact atgttgccca ggttgctctg 6121 gaactcctgg gttcaagcga tcctctcacc ttggccttgc aaagtggtag gattacaggt 6181 gcatgccacc acgtccggca gttcggttcc ttgttcttta ttgtcctcag tctcttcgat 6241 ttcacccact gagagaatgg aaggggatag aacagctgga aactggttga aggaagccag 6301 aattc // LOCUS HUMLYL1B 4569 bp DNA PRI 18-MAR-1996 DEFINITION Human LYL-1 protein gene, complete cds. ACCESSION M22638 NID g187266 KEYWORDS LYL-1. SOURCE Homo sapiens placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 2213 to 2572; 2660 to 2751; 3555 to 4256) AUTHORS Mellentin,J.D., Smith,S.D. and Cleary,M.L. TITLE lyl-1, a novel gene altered by chromosomal translocation in T cell leukemia, codes for a protein with a helix-loop-helix DNA binding motif JOURNAL Cell 58 (1), 77-83 (1989) MEDLINE 89324062 REFERENCE 2 (bases 1 to 4569) AUTHORS Mellentin,J.D. TITLE Direct Submission JOURNAL Submitted (28-SEP-1989) Stanford Univ.Med.Cntr, Dept.of Pathology, Palo Alto, CA 94305, USA COMMENT . FEATURES Location/Qualifiers source 1..4569 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="19p13.2" exon 567..911 /gene="LYL1" /note="G00-120-158" /number=1 gene 567..4261 /gene="LYL1" intron 912..2218 /gene="LYL1" /note="G00-120-158" /number=1 exon 2219..2572 /gene="LYL1" /note="first expressed exon; G00-120-158" /number=2 CDS join(2277..2572,2660..2751,3555..3970) /gene="LYL1" /codon_start=1 /db_xref="PID:g386861" /db_xref="GDB:G00-120-158" /translation="MTEKAEMVCAPSPAPAPPPKPASPGPPQVEEVGHRGGSSPPRLP PGVPVISLGHSRPPGVAMPTTELGTLRPPLLQLSTLGTAPPTLALHYHPHPFLNSVYI GPAGPFSIFPSSRLKRRPSHCELDLAEGHQPQKVARRVFTNSRERWRQQNVNGAFAEL RKLLPTHPPDRKLSKNEVLRLAMKYIGFLVRLLRDQAAALAAGPTPPGPRKRPVHRVP DDGPRRGSGRRAEAAARSQPAPPADPDGSPGGAARPIKMEQTALSPEVR" intron 2573..2659 /gene="LYL1" /note="G00-120-158" /number=2 exon 2660..2751 /gene="LYL1" /note="G00-120-158" /number=3 intron 2752..3554 /gene="LYL1" /note="G00-120-158" /number=3 exon 3555..4261 /gene="LYL1" /note="G00-120-158" /number=4 BASE COUNT 936 a 1408 c 1430 g 795 t ORIGIN 234 bp upsteam of ApaI site; chromosome 19p13.1-13.2. 1 aaatccaact tcccacccaa ccctacagcc atggctcttc ctctcacctc ctgggcttgg 61 aaatgtgtgg atactgtgct gagggcttgg gacacctgcc caggctggac acccaaggcc 121 agactgttag aactgttccc tttgggtgcc ccactcccgg gggggtggtg gagaggaggg 181 aagggcctct gtaaaaagag gaaccaggct gggtgaggag aggaggtggg ggcccagggt 241 cacggcccca acgggccgga agggctctcc ctcaccccca tcagcacaga ttcgggcagg 301 gagtgtgcaa agccaaaggg cagatcggac ataagaggag cagcttccct gcctcagttt 361 accccctgcg tgcatccagg cacctccctc cccggctgcc ctggggcccg ggtccccaga 421 ggcccggggt ccctacagtg gctgggggag cggaagcgaa gccctgaggg gtggcggacc 481 gagtcctgcc agcgccggct gggggcggcc gggaggccgc gctgctaggt ccccgcccgc 541 tggtttcctc cggggtcagc caggctcctt atcaggcgcg ccaggcagcc tggcccttat 601 ctgcactggg ccagcatcct ccgcccgtgc gccgccaggg gtgagaggga ggaaaccggg 661 gccgggggcg gggagaaggc gggccggccc gggagccgct cactttccct gggggggacc 721 tacgcggaga cctcggctat cctggccttc cgaggcccac gaggaggcgc ggcccaacgc 781 cggggcctgg agcattgagg ccggaccctc gcgagacagc agagcctggc ctgacgctgg 841 aaaccacacc ctggcccaga ctgccagccc tgacgggaca gagccagggc actcaccagg 901 ctgcaagaac agtgctgggg taagagggga gcgggggatc ccgggcctgg gacccagcct 961 gcattccttt gttcattcct tcattcattc attcaccagc agggacccac tagtgagggg 1021 ccaggcctgc ttccccaggg cctagctgag gaagacaggg cagaggggcc aacagtctca 1081 caccttgctg ggacatcctg gactctggaa ccaagagcaa acagggatgt caaaacagta 1141 tgcaaaactg tggatgatcg cggggcattg tggtgcatgc ctgtaatccc agcactttgg 1201 gaggctgagg caggaggatc acttgagccc aggagttcca gaccaccctg gacaacagag 1261 tgagaacctg cctctacaaa aaaaattttt ttttttaaat tagccgggca tggtggcaca 1321 tgcctctact ctcagctact caggaggctg agacaggagg atggctcgag cccaggaatt 1381 tgaggctgca gtgcactatg attgcaccac tgcactccag cctggacaac agagggagac 1441 cctgtctcta aaaaaataaa caaattttaa aaagcgtgga tgctgatgag gatgggggct 1501 tccaagccag agggagggtt ggggcatgac tggactggac tgggcagtgg ggtcagtgtt 1561 tgggggtcta gggtcagcat ttgaggtcat agtgtcagta gtggggtcac agggttgcta 1621 tgggggatgt ttgcagtggg agtgggggtc actcaagcaa ttatggagca ctttcaggga 1681 cagcaatggg aagaaccgca gggtcattct tgaggtgcag agggtcagcc tgggtggggg 1741 ctgggggcct tccgtgaagg gtcagagtcc atcttggagg gattccaagc tcacatggca 1801 tcaatgagtt aatgaggagt ctctgggttg gatctttaga tctgaggata acagggttaa 1861 tcttggggtt tctggggatc ccaggatcag ttaggggctc cctgtcttgt tccctctcca 1921 ggctgcagaa ttcctcagag gctgggcaac agcgcccctc ctgggtcaca aagagctcag 1981 ggacagtgcg ctgcctagca cctggggggc gcctcctact ctaacgtagg acccccctcc 2041 cgtgtcctgg aagtttctgg gcctccccgc catctgcctt tgcctactga aacttctccc 2101 ctcctccttt ccccctcccc ctccctcctt cccatttgag gggtttcttg agctaagcag 2161 gtgggagcgg ggcacctagt cctcctcccc actggctgcc ttctttccca caggtgagta 2221 cccccacgtc ggggtccatg tgcccgcctc aggcacaggc agaggtgggc cccaccatga 2281 ctgagaaggc agagatggtg tgtgccccca gcccagcgcc tgccccaccc cctaagcctg 2341 cctcgcctgg gcccccgcag gtggaggagg tgggccaccg aggaggctcc tcgcccccca 2401 ggctgccacc tggtgtacca gtgatcagcc tgggccacag caggccccca ggggtagcca 2461 tgcccaccac agagctgggc actctgcggc ccccgctgct gcaactctcc accctgggaa 2521 ctgccccgcc cactttggcc ctgcactacc accctcaccc cttcctcaac aggtagtggg 2581 gatctggggt ggggggcagt ggggattggg ggccagggtc cttgcccaca aggacttagt 2641 gacccacgac cccttacagt gtctacattg ggccagcagg accttttagc atcttcccta 2701 gcagccggtt gaagcggaga ccaagccact gtgagctgga cctggctgag ggtgagtgtg 2761 ggtttgtgtg ccttgtgggt ttgtatgcct gtatgtgcac ttgtgggtgc acagaaggcc 2821 tggccgtctc tgtgtggggc tcatgtctgt cctctactcc acctcgggga gacgtgcttc 2881 cagccaacac agagaacagc ccacataggt tctacccatg tgcaaaatcg cagtcccata 2941 gccagcccca cacaaccacc gccacccaac aggccaagca tgtagccgca gtcacggcaa 3001 aacagagcat gccacaccag ggcatgggag ccacaaaatg acagctccac tggaatgtgc 3061 agccacacaa ccacagccac tcgtgaaaca gccacatggc aacatcacac ataaccacag 3121 ccatgagaca taacaagagt ctcacacagt catacaagac acaggacaca gacagtcata 3181 atgagaggac ctctcagacc catgagtaca cccagccagc cgtacttggg cacaatcaga 3241 atgagggccc cacggacagc ttcccagacc aaacaaacac aaggaaatct ttctttaggg 3301 aatctcagtc attgacataa aggtgcccat agtcacagat acagcaggcc cttgtccgta 3361 ggccgggcct gtgatgattc ttgtctggat ggctcttggg aggggtgggg agagctgccc 3421 gtggacccct tggtggagaa agccccagcc catggcctgg gtttaagttc ccacgagggt 3481 gctgcgtgtc tagcgggagg gcaggagggg ccgccctttg cgcatggcac caacccggcc 3541 ctccttgtcc acagggcacc agccccagaa ggtggcccgg cgcgtgttca ccaacagccg 3601 ggagcgctgg cggcagcaga acgttaacgg cgccttcgcc gagctgagga agctgctgcc 3661 gacgcacccg cccgaccgga agctgagcaa gaacgaggtg ctccgcctag ccatgaagta 3721 catcggcttc ctggtgcggc tgctgcgcga ccaagccgca gctctggccg caggccccac 3781 ccctcccggg cctcgcaaac ggccggtgca ccgggtccca gacgacggcc cccgccgggg 3841 atccggacgc agggccgagg cggcagcgcg ctcgcagccc gcgcccccgg ccgaccccga 3901 cggcagcccc ggtggagcgg cccggcccat caagatggag caaaccgctt tgagcccaga 3961 ggtgcggtga ccgcacgcgg cagcacctct gagccggagg gcaccaggga ctcggcccag 4021 ggccgtcaag gaaagggcag tggacgtgct gcgcatgttc gggagcgaac tcccccgaag 4081 aaggaccagt gaagacgtca ggggcaaggt ctcgggggtc cggaagggtg atcatcgacc 4141 cccaagggac ccgcagaccc ttaaaaaaat cacccacaac cctctggaag tggccttgcc 4201 cggtcccctt cccaggggcg aggtcggcaa agcaacatgg cagagcagtc ataggaccca 4261 agtggtgcct cattttttcc cgggctgggg tcgcgggggg aggccaggag gggctgggga 4321 ggcttcgctt ttctcaccgc ccctccggag acccaggggc agacgctcgt cgacggctct 4381 gctgccctcc gggccttgga cagaggccca gaatccaagt cggggccgca ccctaccgac 4441 cccgacccag tcccgcacgg tgcgttaaag ggtcaggcgc tctcgctttg tttttcttta 4501 tttctttatt tcacatacac attagccttc aatggagaag ccgagagagt caggcaaaga 4561 tggataaca // LOCUS HUMANFA 2710 bp DNA PRI 01-NOV-1994 DEFINITION Human atrial natriuretic factor (PND) gene, complete cds. ACCESSION K02043 NID g178629 KEYWORDS atrial natriuretic factor; hormone; natriuretic factor; preprocardiodilatin; pronatriodilatin. SOURCE Human DNA (genomic library of Lawn et al.), clones pHGRB1 and lambda-hPND13; and cDNA to atrial mRNA, clones phANP1 and phANP82 (see comment). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 480 to 692; 815 to 1141; 2235 to 2537) AUTHORS Oikawa,S., Imai,M., Ueno,A., Tanaka,S., Noguchi,T., Nakazato,H., Kangawa,K., Fukuda,A. and Matsuo,H. TITLE Cloning and sequence analysis of cDNA encoding a precursor for human atrial natriuretic polypeptide JOURNAL Nature 309 (5970), 724-726 (1984) MEDLINE 84219799 REFERENCE 2 (bases 1 to 1839; 2124 to 2548) AUTHORS Nemer,M., Chamberland,M., Sirois,D., Argentin,S., Drouin,J., Dixon,R.A., Zivin,R.A. and Condra,J.H. TITLE Gene structure of human cardiac hormone precursor, pronatriodilatin JOURNAL Nature 312 (5995), 654-656 (1984) MEDLINE 85061626 REFERENCE 3 (bases 1 to 2583) AUTHORS Greenberg,B.D., Bencen,G.H., Seilhamer,J.J., Lewicki,J.A. and Fiddes,J.C. TITLE Nucleotide sequence of the gene encoding human atrial natriuretic factor precursor JOURNAL Nature 312 (5995), 656-658 (1984) MEDLINE 85061627 REFERENCE 4 (bases 2105 to 2710; 7 to 1792) AUTHORS Seidman,C.E., Bloch,K.D., Klein,K.A., Smith,J.A. and Seidman,J.G. TITLE Nucleotide sequences of the human and mouse atrial natriuretic factor genes JOURNAL Science 226 (4679), 1206-1209 (1984) MEDLINE 85065766 REFERENCE 5 (bases 1 to 1839; 2124 to 2548) AUTHORS Drouin,J. JOURNAL Unpublished (1985) COMMENT [2] revised by [5]. [5] revises [2]. A potential enhancer sequence is located at positions 203-213. A TATA box is present at positions 442-447. A potential polyadenylation signal is present at positions 2509-2514. A potential glucocorticoid receptor binding site is present at positions 1283-1297. Two Alu repeats are present within intron B [4]. Another human PND allele has been sequenced (see separate entry). The sequence in [4] is somewhat mislabelled in figure 2: within intron B some of the human sequence is labelled as mouse. A revision of the sequence in [2] was kindly sent by J. Drouin. The individual revisions are not annotated in the FEATURES table. Complete source information: Human DNA [4] (genomic library of Lawn et al. [2]), clones pHGRB1 [3] and lambda-hPND13 [2],[5]; and cDNA to atrial mRNA, clones phANP1 and phANP82 [1]. FEATURES Location/Qualifiers source 1..2710 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p36" prim_transcript 473..2537 /note="PND mRNA [1],[2]" gene join(570..692,815..1141,2235..2240) /gene="PND" CDS join(570..692,815..1141,2235..2240) /gene="PND" /codon_start=1 /db_xref="GDB:G00-118-727" /product="natriodilatin" /db_xref="PID:g178630" /translation="MSSFSTTTVSFLLLLAFQLLGQTRANPMYNAVSNADLMDFKNLL DHLEEKMPLEDEVVPPQVLSEPNEEAGAALSPLPEVPPWTGEVSPAQRDGGALGRGPW DSSDRSALLKSKLRALLTAPRSLRRSSCFGGRMDRIGAQSGLGCNSFRY" exon <570..692 /gene="PND" /note="prepronatriodilatin; G00-118-727" /number=1 sig_peptide 570..644 /gene="PND" /note="prepronatriodilatin signal peptide" intron 693..814 /gene="PND" /note="G00-118-727" /number=1 exon 815..1141 /gene="PND" /note="G00-118-727" /number=2 conflict 886 /gene="PND" /citation=[4] /replace="" mat_peptide join(1061..1141,2235..2237) /gene="PND" /note="G00-118-727" /product="atrial natriuretic peptide" intron 1142..2234 /gene="PND" /note="G00-118-727" /number=2 conflict 1331..1334 /citation=[4] /replace="" conflict 1349..1356 /citation=[4] /replace="" conflict 2106 /citation=[4] /replace="" conflict 2111 /citation=[4] /replace="" conflict 2119 /citation=[4] /replace="" conflict 2131 /citation=[4] /replace="" BASE COUNT 667 a 665 c 768 g 610 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccattt gtctcgggct gctggctgcc tgccatttcc tcctctccac ccttatttgg 61 aggccctgac agctgagcca caaacaaacc aggggagctg ggcaccagca agcgtcaccc 121 tctgtttccc cgcacggtac cagcgtcgag gagaaagaat cctgaggcac ggcggtgaga 181 taaccaagga ctctttttta ctcttctcac acctttgaag tgggagcctc ttgagtcaaa 241 tcagtaagaa tgcggctctt gcagctgagg gtctgggggg ctgttggggc tgcccaaggc 301 agagaggggc tgtgacaagc cctgcggatg ataactttaa aagggcatct cctgctggct 361 tctcacttgg cagctttatc actgcaagtg acagaatggg gagggttctg tctctcctgc 421 gtgcttggag agctgggggg ctataaaaag aggcggcact gggcagctgg gagacaggga 481 cagacgtagg ccaagagagg ggaaccagag aggaaccaga ggggagagac agagcagcaa 541 gcagtggatt gctccttgac gacgccagca tgagctcctt ctccaccacc accgtgagct 601 tcctcctttt actggcattc cagctcctag gtcagaccag agctaatccc atgtacaatg 661 ccgtgtccaa cgcagacctg atggatttca aggtagggcc aggaaagcgg gtgcagtctg 721 gggccagggg gctttctgat gctgtgctca ctcctcttga tttcctccaa gtcagtgagg 781 tttatccctt tccctgtatt ttccttttct aaagaatttg ctggaccatt tggaagaaaa 841 gatgccttta gaagatgagg tcgtgccccc acaagtgctc agtgagccga atgaagaagc 901 gggggctgct ctcagccccc tccctgaggt gcctccctgg accggggaag tcagcccagc 961 ccagagagat ggaggtgccc tcgggcgggg cccctgggac tcctctgatc gatctgccct 1021 cctaaaaagc aagctgaggg cgctgctcac tgcccctcgg agcctgcgga gatccagctg 1081 cttcgggggc aggatggaca ggattggagc ccagagcgga ctgggctgta acagcttccg 1141 ggtaagagga actggggatg gaaatgggat gggatggaca ctactgggag acaccttcag 1201 caggaaaggg accaatgcag aagctcattc cctctcaagt ttctgcccca acacccagag 1261 tgccccatgg gtgtcaggac atgccatcta ttgtccttag ctagtctgct gagaaaatgc 1321 ttaaaaaaaa aagggggggg gctgggcacg gtcgtcacgc ctgtaatccc agcactttgg 1381 gaggccaggc agcggatcat gaggtcaaga gatcaagact atcctggcca acatggtgaa 1441 accccagctc tactaaaaat acaaaaatta gctgggtgtg tggcgggcac ctgtactctc 1501 agctacttgg gaggctgagg caggagaatc acttgaaccc aggaggcaga ggttgcagtg 1561 agcagagatc acgccactgc agtccagcct aggtgataga gcgagactgt ctcaaaaaaa 1621 aaaaaaaaag gccaggcgcg gtggctcacg cctgtaatcc cagcgctttg ggaggccaag 1681 gcgggtggat cacgaggtca ggagatggag accatcctgg ctaacacggt gaaaccccgt 1741 ctctactaaa aatacaaaaa attagccagg cgtggtggca ggcgcctgta agtcctagct 1801 actccggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt gcagtgagca 1861 gagatggcac cactgcactc cagcctgggc gacagagcaa gactccgtct caaaaaaaaa 1921 aaaaaaaaaa gcaactgcca ctagcactgg gaaattaaaa tattcataga gccaagttat 1981 ctttgcatgg ctgattagca gttcatattc ctccccagaa ttgcaagatc ctgaagggct 2041 taagtgaaat ttactctgat gagtaacttg cttatcaatt catgaagctc agagggtcat 2101 caggctgggg tgggggccgg tgggaagcag gtggtcagta atcaagttca gaggatgggc 2161 acactcatac atgaagctga cttttccagg acagccaggt caccaagcca gatatgtctg 2221 tgttctcttt gcagtactga agataacagc cagggaggac aagcagggct gggcctaggg 2281 acagactgca agaggctcct gtcccctggg gtctctgctg catttgtgtc atcttgttgc 2341 catggagttg tgatcatccc atctaagctg cagcttcctg tcaacacttc tcacatctta 2401 tgctaactgt agataaagtg gtttgatggt gacttcctcg cctctcccac cccatgcatt 2461 aaattttaag gtagaacctc acctgttact gaaagtggtt tgaaagtgaa taaacttcag 2521 caccatggac agaagacaaa tgcctgcgtt ggtgtgcttt ctttcttctt gggaagagaa 2581 ttcaggccga tattccttgt cgttttactc tttgtcagag gaaagaatgc tgagtttttc 2641 ttcttccttt catttcaccc tccttttttg gtaggtggtt gggaggccta attcatctag 2701 tgggtttttt // LOCUS HUMG0S19A 4102 bp DNA PRI 07-JAN-1991 DEFINITION Human homologue-1 of gene encoding alpha subunit of murine cytokine (MIP1/SCI), complete cds. ACCESSION M23178 M32337 NID g182846 KEYWORDS cytokine; macrophage inflammatory protein. SOURCE Human lymphocyte DNA, clone LG0S1907. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4102) AUTHORS Blum,S., Forsdyke,R.E. and Forsdyke,D.R. TITLE Three human homologs of a murine gene encoding an inhibitor of stem cell proliferation JOURNAL DNA Cell Biol. 9, 589-602 (1990) MEDLINE 91103879 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.R.Forsdyke, 30-JUN-1989 and 23-FEB-1990. The G0S19 genes are members of the 'small inducible' family of genes. The G0S19-1 product is homologous to the alpha subunit of the murine cytokine MIP1. FEATURES Location/Qualifiers source 1..4102 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 1298..1304 /note="CK-2 element" misc_feature 1326..1335 /note="serum-response element (put.); putative" misc_feature 1481..1488 /note="AP1-binding element (put.); putative" misc_feature 1710..1717 /note="AP1-binding element (put.); putative" TATA_signal 1967..1972 prim_transcript 1998..3880 /note="G0S19-1 mRNA and introns" CDS join(2081..2153,2842..2956,3377..3467) /note="G0S19-1 peptide precursor" /codon_start=1 /db_xref="PID:g182847" /translation="MQVSTAALAVLLCTMALCNQFSASLAADTPTACCFSYTSRQIPQ NFIADYFETSSQCSKPGVIFLTKRSRQVCADPSEEWVQKYVSDLELSA" sig_peptide 2081..2140 /note="G0S19-1 peptide signal peptide (put.); putative" exon <2081..2153 /note="G0S19-1 peptide precursor" /number=1 mat_peptide 2141..2153 /note="G0S19-1 peptide" intron 2154..2841 /note="G0S19-1, intron A" mat_peptide 2842..2956 /note="G0S19-1 peptide" exon 2842..2956 /number=2 intron 2957..3376 /note="G0S19-1, intron B" exon 3377..>3467 /note="G0S19-1 peptide precursor" /number=3 mat_peptide 3377..3464 /note="G0S19-1 peptide" BASE COUNT 1143 a 945 c 903 g 1111 t ORIGIN EcoRI site. 1 gaattccaaa ggcatggtcg cacttggctt ctgtcctctg ttattctcca gcatcaaatg 61 tatcaactct aacccctttg gggggaatac aaggcctgtc ctggtttggt cccaatttag 121 ctttatcatc catattcacc cccactgctc tgcagctcca ctgaagcacc ccctctttcc 181 tctgaaccca caatgtcaca ctcaggactc tgcctcagct gggcactcat ctatagatgc 241 ctaaatcccg ggcagttatc cagacacaac taaagttcca tcccttccat gaagccttcc 301 ccaaccctct ggtggaaggt cacttcttcc cctcgtggga ttctgagctt tcatttcttt 361 ttctactagg agtcctagca ctttcggcta aatgctacaa ttacctgttc atacactcta 421 cctgccccca cgagatcagg ggcatctcag aaacaaagat cattaaaacc aactaaatct 481 atttctcatt ataaaatgag gtatgctgat tgattgtgaa agaataaaat aacaaagtat 541 ggaaaagaaa aaaaagcata taatctggct gagaaggtag agacccttcc acaccactga 601 aattatgtat tgaaaagaat aagtaaaaaa ctgcttcaat ttggcatgat ttatgtaagt 661 atagtatagg atccttaaaa tggttcaaag aaatgggaaa tcaagacttc attttggcca 721 aaaccattga acagaaactt cagcatattt atcaataatt tctttcagat taaacaactg 781 acaacaacct atttttcaac cagtgatgtt ggaaatgttt ttttaaaaat tagtttataa 841 atttgtgggc tgaccaagaa ggtaataaag tctaactaag taaaatgaga aaaattcaga 901 aaaagaaaaa aataagaaaa taaatcaccc agggacctat cacacaaata taagaactat 961 tcattcttta aggcatgtat ttccaagcct ttgtattttt ttccatgctt agggttggca 1021 aggaatatat atatatttgt acaaatatat atgtgtatat gtacaaatac atgtatatat 1081 agtacaaata tatatatata tttgtacaat tcttcagact ttgtagaatt tgtataatgt 1141 cgtatcttgc tttttttaac cactgatgtt ataagcatat ttatgccact tcattcattt 1201 tagagactta ataataaatg atctagtgga taatttatca ttccctgatg gagaaaaatt 1261 tagctttgtt tattttagag ttataaacga tgctgggtca ggtatcttta tgtttgaaga 1321 tggctccata tttgggttgt ttccacagaa ctctttccta gaaatgcttt ttctaggtta 1381 atggctacag atatttctag gcacctgaca tattgacacc cacctctaaa gtatttttat 1441 gatccacaac tagcgtttaa cacagcgccc tagtcactac atgactaata aatagacaaa 1501 tgactgaaac atgacctcat gctttctatt cctccagctt tcattcagtt ctttgcctct 1561 gggaggagga agggttgtgc agccctccac agcatcagcc catcaaccct atccctgtgg 1621 ttatagcagc tgaggaagca gaattgcagc tctgtgggaa ggaatggggc tggagagttc 1681 atgcacagac cagttcttat gagaagggac tgactaagaa tagccttggg ttgacatata 1741 cccctcttca cactcacagg agaaaccatt tccctatgaa actataacaa gtcatgagtt 1801 gagagctgag agttagagaa tagctcaaag atgctattct tggatatcct gagcccctgt 1861 ggtcaccagg gaccctgagt tgtgcaactt agcatgacag catcactacg cttaaaaatt 1921 tccctcctca cccccagatt ccatttcccc atccgccagg gctgcctata aagaggagag 1981 ctggtttcag acttcagaag gacacgggca gcagacagtg gtcagtcctt tcttggctct 2041 gctgacactc gagcccacat tccgtcacct gctcagaatc atgcaggtct ccactgctgc 2101 ccttgctgtc ctcctctgca ccatggctct ctgcaaccag ttctctgcat cacgtgagtc 2161 tgagtttcgt tgtgggtatc accactctct ggccatggtt agaccacatc aatcttttct 2221 tgtggcctaa aagcccccaa gagaaaagag aacttcttaa agggctgcca aacatcttgg 2281 tctttctctt taagactttt atttttatct ctagaagggg tcttagcccc ctagtctcca 2341 ggtatgagaa tctaggcagg ggcaggggag ttacagtccc ttttacagat agaaaaacag 2401 ggttcgaaac gaatcagtta gcaagaggca gaatccaggg ctgcttactt cccagtgggg 2461 tatgttgttc actctccagc tcactctagg tctcccagga gctctgtccc ttggatgtct 2521 tatgagagat gtccaaggct tctcttgggt tggggtatga cttcttgaac cagacaaaat 2581 tccctgaaga gaactgagat aagagaacag tccgttcagg tatctggatc acacagagaa 2641 acagagaacc cactatgaag agtcaaggag aaagaaggat acagacagaa acaaagagac 2701 atttctcagc aaaaatgccc aaatgccttc cagtcacttg gtctgagcaa gcctgccttc 2761 ctcaactgct cggggatcag aagctgcctg gccttttctt ctgagctgtg actcgggctc 2821 attctcttcc tttctccaca gttgctgctg acacgccgac cgcctgctgc ttcagctaca 2881 cctcccggca gattccacag aatttcatag ctgactactt tgagacgagc agccagtgct 2941 ccaagcccgg tgtcatgtaa gtgccagtct tcctgctcac ctctatggag gtagggaggg 3001 tcagggttgg ggcagagaca ggccagaagg ctatcctgga aaggcccagc cttcaggagc 3061 ctatcgggga tacaggacgc agggctccga ggtgtgacct gacttggagc tggagtgagg 3121 catgtgttac agagtcagga agggctgccc cagcccagag gaaagggaca ggaagaagga 3181 ggcagcggga cactctgagg gccaccccta ctgagtcact gagagaagct ctctagacag 3241 agataggcag ggggcccctg aaagaggagc aagccctgag ctgcccagga cagagagcag 3301 aatggtgggg ccatggtggg cccaggattc ccctgctgga ttccccagtg cttaactctt 3361 cctcccttct ccacagcttc ctaaccaagc gaagccggca ggtctgtgct gaccccagtg 3421 aggagtgggt ccagaaatat gtcagcgacc tggagctgag tgcctgaggg gtccagaagc 3481 ttcgaggccc agcgacctcg gtgggcccag tggggaggag caggagcctg agccttggga 3541 acatgcgtgt gacctccaca gctacctctt ctatggactg gttgttgcca aacagccaca 3601 ctgtgggact cttcttaact taaattttaa tttatttata ctatttagtt tttgtaattt 3661 attttcgatt tcacagtgtg tttgtgattg tttgctctga gagttcccct gtcccctccc 3721 ccttccctca caccgcgtct ggtgacaacc gagtggctgt catcagcctg tgtaggcagt 3781 catggcacca aagccaccag actgacaaat gtgtatcgga tgcttttgtt cagggctgtg 3841 atcggcctgg ggaaataata aagatgctct tttaaaaggt aaaccagtat tgagtttggt 3901 tttgtttttc tggcaaatca aaatcactgg ttaagaggaa tcataggcaa agattaggaa 3961 gaggtgaaat ggagggaaat tgggagagat ggggagggct accacagagt tatccacttt 4021 acaacggaga cacagttctg gaacattgaa actacgaata tgttataact caaatcataa 4081 catgcatgct ctaggagaat tc // LOCUS HUMALPI 5291 bp DNA PRI 29-APR-1996 DEFINITION Human intestinal alkaline phosphatase (ALPI) gene, complete cds. ACCESSION J03930 NID g178441 KEYWORDS alkaline phosphatase. SOURCE Homo sapiens (clone: Ch40[Bam5,Bg5].) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5291) AUTHORS Henthorn,P.S., Raducha,M., Kadesch,T., Weiss,M.J. and Harris,H. TITLE Sequence and characterization of the human intestinal alkaline phosphatase gene JOURNAL J. Biol. Chem. 263 (24), 12011-12019 (1988) MEDLINE 88298885 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by P.S.Henthorn, 20-JUN-1988. FEATURES Location/Qualifiers source 1..5291 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Ch40[Bam5,Bg5]." /cell_line="563" /cell_type="fibroblast" /map="2q34-q37.1" exon 246..362 /gene="ALPI" /note="G00-119-671" /number=1 gene 246..4128 /gene="ALPI" CDS join(296..362,445..561,676..791,995..1169,1246..1418, 1661..1795,1891..1963,2094..2228,2313..2504,2728..2844, 2956..3242) /gene="ALPI" /EC_number="3.1.3.1" /note="precursor" /codon_start=1 /db_xref="GDB:G00-119-671" /product="alkaline phosphatase" /db_xref="PID:g178442" /translation="MQGPWVLLLLGLRLQLSLGVIPAEEENPAFWNRQAAEALDAAKK LQPIQKVAKNLILFLGDGLGVPTVTATRILKGQKNGKLGPETPLAMDRFPYLALSKTY NVDRQVPDSAATATAYLCGVKANFQTIGLSAAARFNQCNTTRGNEVISVMNRAKQAGK SVGVVTTTRVQHASPAGTYAHTVNRNWYSDADMPASARQEGCQDIATQLISNMDIDVI LGGGRKYMFPMGTPDPEYPADASQNGIRLDGKNLVQEWLAKHQGAWYVWNRTELMQAS LDQSVTHLMGLFEPGDTKYEIHRDPTLDPSLMEMTEAALRLLSRNPRGFYLFVEGGRI DHGHHEGVAYQALTEAVMFDDAIERAGQLTSEEDTLTLVTADHSHVFSFGGYTLRGSS IFGLAPSKAQDSKAYTSILYGNGPGYVFNSGVRPDVNESESGSPDYQQQAAVPLSSET HGGEDVAVFARGPQAHLVHGVQEQSFVAHVMAFAACLEPYTACDLAPPACTTDAAHPV AASLPLLAGTLLLLGASAAP" sig_peptide 296..352 /gene="ALPI" /note="G00-119-671" mat_peptide join(353..362,445..561,676..791,995..1169,1246..1418, 1661..1795,1891..1963,2094..2228,2313..2504,2728..2844, 2956..3239) /gene="ALPI" /EC_number="3.1.3.1" /note="G00-119-671" /product="alkaline phosphatase" intron 363..444 /gene="ALPI" /note="G00-119-671" /number=1 exon 445..561 /gene="ALPI" /note="G00-119-671" /number=2 intron 562..675 /gene="ALPI" /note="G00-119-671" /number=2 exon 676..791 /gene="ALPI" /note="G00-119-671" /number=3 intron 792..994 /gene="ALPI" /note="G00-119-671" /number=2 exon 995..1169 /gene="ALPI" /note="G00-119-671" /number=4 intron 1170..1245 /gene="ALPI" /note="G00-119-671" /number=4 exon 1246..1418 /gene="ALPI" /note="G00-119-671" /number=5 intron 1419..1660 /gene="ALPI" /note="G00-119-671" /number=5 exon 1661..1795 /gene="ALPI" /note="G00-119-671" /number=6 intron 1796..1890 /gene="ALPI" /note="G00-119-671" /number=6 exon 1891..1963 /gene="ALPI" /note="G00-119-671" /number=7 intron 1964..2093 /gene="ALPI" /note="G00-119-671" /number=7 exon 2094..2228 /gene="ALPI" /note="G00-119-671" /number=8 intron 2229..2312 /gene="ALPI" /note="G00-119-671" /number=8 exon 2313..2504 /gene="ALPI" /note="G00-119-671" /number=9 intron 2505..2727 /gene="ALPI" /note="G00-119-671" /number=9 exon 2728..2844 /gene="ALPI" /note="G00-119-671" /number=10 intron 2845..2955 /gene="ALPI" /note="G00-119-671" /number=10 exon 2956..4128 /gene="ALPI" /note="G00-119-671" /number=11 polyA_signal 4105..4110 /gene="ALPI" /note="G00-119-671" BASE COUNT 1177 a 1520 c 1552 g 1042 t ORIGIN 118 bp upstream of HindIII site; chromosome 2q34-q37. 1 cctaggctgt gtgtttccag tctcacctct cttcacacct tgaatgaggt gaatgaagga 61 gtggcaacgc gtctcccaca agacactgtg agccacaccc agtcccttcc cttcagcaag 121 cttggcttca ggtcacagga ctgggcgggg tcaagatgga caccaggggt gtggggaggg 181 acgtggagca tttacagcca ggggcaaagt cctcccctga tttaaaccca ggcagcctgc 241 gctgcagccg gttcctggtg tccccacttc gcctccctcc tgctgccccc aagacatgca 301 ggggccctgg gtgctgctgc tgctgggcct gaggctacag ctctccctgg gcgtcatccc 361 aggtaatgag gctccccaag ctgttccaca cacagggcac cccctcagcc aggctgacct 421 gatctctact ctccccctgg ccagctgagg aggagaaccc ggccttctgg aaccgccagg 481 cagctgaggc cctggatgct gccaagaagc tgcagcccat ccagaaggtc gccaagaacc 541 tcatcctctt cctgggcgat ggtgagtgag caaggcctgt ccagccccgt agtcctcaca 601 gccccggcac ccgggacctt cagtggttcc aggacaaccc tggggcccag gactcacaca 661 tttctgctcc ttcagggttg ggggtgccca cggtgacagc caccaggatc ctaaaggggc 721 agaagaatgg caaactgggg cctgagacgc ccctggccat ggaccgcttc ccatacctgg 781 ctctgtccaa ggtaagggct gggccacctc agagtcctcc aagcagagga gagggatcaa 841 ggatatggag tgtggcagga gggagggagc caggacagct ggggcctaag ttaggagctg 901 ggagcagtta ggatcccaga ggaccagaac caggtccttg gttggggtct gggtgtccgc 961 cccgaagtag agctcagggt gtctccgttc gcagacatac aatgtggaca gacaggtgcc 1021 agacagcgca gccacagcca cggcctacct gtgcggggtc aaggccaact tccagaccat 1081 cggcttgagt gcagccgccc gctttaacca gtgcaacacg acacgcggca atgaggtcat 1141 ctccgtgatg aaccgggcca agcaagcagg tgagctgggg cccgctgtgg ggtcaggacc 1201 aggcccaaga tctcggtcac cgatcctgac ctctgtcacc ctcaggaaag tcagtaggag 1261 tggtgaccac cacacgggtg cagcacgcct cgccagccgg cacctacgca cacacagtga 1321 accgcaactg gtactcagat gctgacatgc ctgcctcagc ccgccaggag gggtgccagg 1381 acatcgccac tcagctcatc tccaacatgg acattgacgt gcgacccccg ggccaagggc 1441 tggggctggg cagaggggaa ggtggcacag gctcagatcc aggcaaccaa aagcctgatc 1501 tgggtcagca ggttctggag gtggagttgg ggatgtagaa tgtgcaatac aggctgggcc 1561 attcccacag ccctggggag gggagccagg ggctatgcat gaggaggggg cacggggcca 1621 gccaggcccc caaaccacct gccccatcca ttgtcctcag gtgatccttg gcggaggccg 1681 caagtacatg tttcccatgg ggaccccaga ccctgagtac ccagctgatg ccagccagaa 1741 tggaatcagg ctggacggga agaacctggt gcaggaatgg ctggcaaagc accaggtgat 1801 gggggctggt gggtgtggga ggcacggcag ggggaggcca agtgtgtggg tctcagggct 1861 gtgggctgaa gcctggctct gtccctgcag ggtgcctggt atgtgtggaa ccgcactgag 1921 ctcatgcagg cgtccctgga ccagtctgtg acccatctca tgggtaatga cccccttcct 1981 gccctggcat tcctcagaca acctcagagg gtgccatccg agcctgtgtg cccatttgcc 2041 agcaccctcc cgctcacagc ctgccaatca ccaccaagct ccttgtccca caggcctctt 2101 tgagcccgga gacacgaaat atgagatcca ccgagacccc acactggacc cctccctgat 2161 ggagatgaca gaggctgccc tgcgcctgct gagcaggaac ccccgcggct tctacctctt 2221 tgtggagggt gcgtggtggc ccctggggag tggaggaagg cggggcgcgg cagggcaggt 2281 tcaagcatca cccccctctg gccttcctgc aggcggccgc atcgaccatg gtcatcatga 2341 gggtgtggct taccaggcac tcactgaggc ggtcatgttc gacgacgcca ttgagagggc 2401 gggccagctc accagcgagg aggacacgct gaccctcgtc accgctgacc actcccatgt 2461 cttctccttt ggtggctaca ccttgcgagg gagctccatc ttcggtaggc ctggggagag 2521 tggcaggtgc tgctgcatca attatgaggg tgaagtttga gcctcagttt cctcctctgt 2581 caaaagtgtg caatgctggc accagcccta tagggatctt gtgaggaccg agcccccgaa 2641 caggcaaaaa gtggcggtgc ctggcacata ggaggcactc ccacagctgt ggtcagctca 2701 actacaggga cccgcatctc cctacagggt tggcccccag caaggctcag gacagcaaag 2761 cctacacgtc catcctgtac ggcaatggcc cgggctacgt gttcaactca ggcgtgcgac 2821 cagacgtgaa tgagagcgag agcggtgagt gaggctgaat ggcccgtgca gggggaccag 2881 ggtgccaggg atgggggcat tcgcgggagg gggacgccgc ctgcctgccc tgaagtgcac 2941 tcaccctcct accagggagc cccgattacc agcagcaggc ggcggtgccc ctgtcgtccg 3001 agacccacgg aggcgaagac gtggcggtgt ttgcgcgcgg cccgcaggcg cacctggtgc 3061 atggtgtgca ggagcagagc ttcgtagcgc atgtcatggc cttcgctgcc tgtctggagc 3121 cctacacggc ctgcgacctg gcgcctcccg cctgcaccac cgacgccgcg cacccagttg 3181 ccgcgtcgct gccactgctg gccgggaccc tgctgctgct gggggcgtcc gctgctccct 3241 gagtgcccca ctccggagtt atcctgctcc ccacctccgg gcgtcctgcc ctgttccccg 3301 tcctgagccg ccacttccag cgaacacaca caggtgtcct gccgttggac cttcacctcc 3361 tagagataaa ccagcctcag ctggcgcagc ggggcccttc ttccctccgc atccccttca 3421 gggagcagga gcccagggcg ccctgggagc tgagcctggg acttccagga cctcccctca 3481 ggttgttctc tgattcttcc tcccaacccc agagactgca gatttgtgcc atgcggctgc 3541 ctgcacccca gacaataaag ggaccaaaac cacccaaccc ccaccctgcc tctatcctaa 3601 ggaagaccaa gcaggcctgg acccagagac gtcccccatc gtgggacacg acacacccag 3661 accgcgtgcc ccaccgtctt agcttcaatc ctggcagcac ctggtagacc caaggacttg 3721 ggtggatcag gacacctgaa gaagagaagc ttccggcaac cctgcaaccc acccaaggag 3781 gctactggat cggggattcc caggggggct ttgacacagt cctctgctgt ctccccacta 3841 ggatcattcc acacccctgc acctgaccaa gggaccaatg aggcagaggc ttgccccaag 3901 tcacagccac tcagatgctt cctgcccccc agtgcccatt ccaggtcacc agatccaagg 3961 agcgcttgag gagctctggg tacagggcag caacccagag cccatgggcc ctcccgggac 4021 atctggatgc tgggcataga tttctcaaca aggaagactc ccctgcctcc tcaaggtctc 4081 cattctccta ggagacaaag caataataaa aggtgttaga caatgtaatg ccagtactac 4141 ttcctaggag aaaaatcatg agtgagtgtg ggcacagtat ctggagaggt ggataacgca 4201 ggccaggagg tactgctgag gggcagatga ttgagcaaga gacttgaaca gagtgggggc 4261 ttgagcaagg cagcacagca gtgcaaacgc cctggggcag tgtcagcagg tgctctggga 4321 ggccaagggc tggatcagag gggtgggggt gggtgggcag agtggggaaa gcctgagggg 4381 tcaggagagt ggggtgtgca tgggggactg tgaagtctgg ttagaggggt gtggttggag 4441 gtctttgagg agggctgtga cctgccctgg ttgggaaata agcactctgg ctgctgccag 4501 gagaagggtc tggtcttttg ggcagagggt gggggtggtg gcaggctcag gtgaaagctg 4561 gggaaggagc tgactccagg tgtttctgac ctccctctga aagtattctg gagcgcccat 4621 cccaatacag ccatacttag tgagtacaca cctgctccaa gagaacattg aaaagaataa 4681 aggtgaaatc aaccacattt tccagcaaat tttgcagtat tacaaattta tttgtacatt 4741 tacaaaggtg caaaaaagca tcttgctttt gcaagaaata gtaacatcat tcaatatgct 4801 ttcttattta ctaaaacctt gaaataaaat tgtaaaacat cagtttgaag gcctgactct 4861 cagggtagtt cttttttaat tctgggtttt agtagctgtc acaaaaatat tggaggacca 4921 tgatcccact tgtgaatagc cataggactc cagcctggga agcatagcga aaatctgtgt 4981 ctaaaaaatg aaataaaagg atgaatttta tggtatgtaa attatatcta aattttaaaa 5041 aacagattcg aatatataat ctgctttcaa gtttttttaa atgtgtaggg atcagggttt 5101 tatcagtcaa atacattttt taccacaaaa ttcacatgtc aatgaaaaca ttctcaaact 5161 ttggttctaa aaaatgtttt ctttggcatg agttttcatt ccaagatgat tactttctca 5221 ttttttcatt gaaaggacat ctttaccttg aaggagcaga tgcaagaaaa gtacaattat 5281 ttttcaagct t // LOCUS HUMFPR1A 6931 bp DNA PRI 18-MAR-1994 DEFINITION Human N-formyl peptide receptor (FPR1) gene, complete cds and Alu repeats. ACCESSION L10820 NID g182739 KEYWORDS G protein; G protein coupled receptor; N-formyl peptide; N-formyl peptide receptor; formyl peptide receptor; peptide receptor; pertussis toxin; phagocytosis; plasma membrane; transmembrane domain; transmembrane receptor. SOURCE Homo sapiens (library: Lambda FIX) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Boulay,F., Tardif,M., Brouchon,L. and Vignais,P. TITLE The human N-formylpeptide receptor. Characterization of two cDNA isolates and evidence for a new subfamily of G-Protein-Coupled receptors JOURNAL Biochemistry 29, 11123-11133 (1990) MEDLINE 91105045 REFERENCE 2 (sites) AUTHORS Perez,H.D. TITLE Cloning of the gene coding for a human receptor for formylpeptides JOURNAL Unpublished (1992) REFERENCE 3 (bases 1 to 6931) AUTHORS Murphy,P.M., Tiffany,H.L., McDermott,D. and Ahuja,S.K. TITLE Sequence and organization of the human N-formyl peptide receptor-encoding gene JOURNAL Gene 133 (2), 285-290 (1993) MEDLINE 94040825 FEATURES Location/Qualifiers source 1..6931 /organism="Homo sapiens" /db_xref="taxon:9606" gene 456..6599 /gene="FPR1" TATA_signal 456..461 /gene="FPR1" /note="putative" exon 479..552 /gene="FPR1" /number=1 /evidence=experimental intron 553..2111 /gene="FPR1" /number=1 /evidence=experimental repeat_unit 683..966 /gene="FPR1" /rpt_family="Alu" exon 2112..2162 /gene="FPR1" /number=2 /evidence=experimental intron 2163..5369 /gene="FPR1" /number=2 /evidence=experimental repeat_unit 3603..3894 /gene="FPR1" /rpt_family="Alu" exon 5370..6599 /gene="FPR1" /number=3 /evidence=experimental CDS 5381..6433 /gene="FPR1" /note="putative" /codon_start=1 /product="N-formyl peptide receptor" /db_xref="PID:g182740" /translation="METNSSLPTNISGGTPAVSAGYLFLDIITYLVFAVTFVLGVLGN GLVIWVAGFRMTHTVTTISYLNLAVADFCFTSTLPFFMVRKAMGGHWPFGWFLCKFVF TIVDINLFGSVFLIALIALDRCVCVLHPVWTQNHRTVSLAKKVIIGPWVMALLLTLPV IIRVTTVPGKTGTVACTFNFSPWTNDPKERIKVAVAMLTVRGIIRFIIGFSAPMSIVA VSYGLIATKIHKQGLIKSSRPLRVLSFVAAAFFLCWSPYQVVALIATVRIRELLQGMY KEIGIAVDVTSALAFFNSCLNPMLYVFMGQDFRERLIHALPASLERALTEDSTQTSDT ATNSTLPSAEVALQAK" variation 5681..5683 /gene="FPR1" variation 5954..5956 /gene="FPR1" polyA_signal 6581..6586 /gene="FPR1" /note="putative" repeat_unit 6917..>6931 /rpt_family="Alu" BASE COUNT 1830 a 1542 c 1680 g 1879 t ORIGIN 1 ttatggggtt aatcttggtg gtgtgcatgg gtgtggacgc gctgtcctgc caactgtctc 61 aacttccccc actcccttac ctctctctgt gtttctggtc tccatccctc atgacttctt 121 ctcttccttt cattgcctcc ctctgattct tctcaccaca gtgcttgctg ctttctttac 181 cttgaccctt ggagggagca ggggcccgga cacttggatt tcttggccct tgttgttgag 241 agcactgaac ctctgcatcc acagagactg aggctgagaa atacagtcag gtacatgagt 301 ttctaaacag gcccagccac tgtcctaatg ccattaaagc agacagtata ttggtgtatt 361 cttggggcca tcaaaaatca gaagaagctc agacttccta tttcctgcta cccagctggt 421 ttcagttcct ttacccctcc tcctgttcct tggtgtatgt tttgctgcaa tcattagagc 481 ctgagtcact ctccccagga gacccagacc tagaactacc cagagcaaga ccacagctgg 541 tgaacagtcc aggtaagaaa catattctgg gggactgggt agcttgacat agacacactt 601 aatacactgg tgaacacaat agactctttg tttgaagaag gatgtggagt tgaagaataa 661 tttttttaaa gaggttctaa acaggccggg cacgtgactc atgcttgtaa tcccagcact 721 ttgggaggcc aaggcaggtg gataacttga ggtcaggagt tcgagaccag cctggccaac 781 atgttgaaac ctcacttcta ttaaaaatac aaaaattagc caggcgtggt ggtgggcgtc 841 tgtaattcaa gctacttggg aggctgaggc aggagaattg cttgaatctg ggaggtggag 901 gttgcagtga gctcagattg caccactgta ctccagccta ggtgacagag tgagactcca 961 tctcaaataa ataagtaaat aaataaataa acaaataggc tctagacttt attggggtcc 1021 tgatgggaat gatggaatat agagacacta tgaggctgca ggagagagaa gggataagtg 1081 aaggagggaa gttcttgagt aaatggggga atgagtttta tgggaccagt agagggtaat 1141 tcttggatgg aaacacttgt tttctggtta cagagatgag ggcagagaac agtgatgcag 1201 gagccagcag gaggagaatc tggtggagca agaggcaagg gcatcagctt gagggcttga 1261 gtcggggtgg gaaaggatgt ttggtttgaa gctttcagga gaaagcaaaa gagccctaac 1321 gactttatga tgggtcatgg ggaaatgagt gtaatacaga agcagtcacc tttcggatca 1381 gtgatgagtc tccacgtgaa gacactggtt ctgcatggtg cataggaccc agtagcggaa 1441 tacagccctg ccctctcatg aatctgcaga cattggctgt catttctcat caagctccta 1501 gtttcctctt ccgtcacaca gggatgatga tgtataatca atgctgagtt tgttgagatc 1561 agaggcaaga aagttgagga tgtggtacaa tgattatcaa gcataaagtc cttcatcgat 1621 ggcagccacc aaggcctgtg ttcattgcaa atcacaggat ttttcccttg aaccataatt 1681 ggacaaaatt gcacaggttt tcctctctgg tatgttccag ctcattattg cagaatcctc 1741 atcattcatt gcaacaatct tctgaactgg ttgccttaga gattcttgag gacaaagagt 1801 cctcagggta ttttggtggt tggatgggtg gtaggggagg gagacatctc tgagttgttt 1861 ctgagattgt agagtatcaa agttcattag aaatgtgaga cacatatgta gtttcaaagt 1921 gtcaagtagg cactttaaaa aggtgaagag aaacatgcga acttaatttt aataaaatat 1981 ctaacctaaa cttaaatagc caaattatta tcattttaac atgtagtcaa tataaaaact 2041 attaatgaga tggtcacaat atttttttca tactaagtct ttgaaatctg atgggtattt 2101 tactcctaca gcctgtctcc agttggacta gccacaattc aagtgcttga aaaccacatg 2161 tggtgagtga ctagtgcatt agacagctct tctgtgccag tagcgtggca gggcagaggc 2221 agaaagagac ttttgctgga attcgacagc ctgggctgtt gggtgacttt aaggaattaa 2281 ctgaaatgct ctaagtcgca ctttcctcgt ctgcaaaatg ggtaccataa agacagtgac 2341 cctcatagta acaattttga ggaaggctgc tgcaagcatt agaagagaag accaggccgg 2401 gcgcagtggc tcacacctgt aatcctagca ctttgggagg ctgaggcagg tggactgcct 2461 gagctcagga gtttgagacc agcctgggca acatggtgaa accccatgtc tactaaaata 2521 caattaaaaa gaaaaattag ccaggcacgg cggcgtgcac ctgaaatccc agctactcag 2581 gaggctgaga caggagaatc gcttgaacct gggaggcgga ggttgcagtg agctcagatc 2641 acgccactgc actccagcct gggtgataga gagagactcc atctcaaaat aaataaataa 2701 ataaaataaa ataaattaaa aaaaagagag aaggccagct ctcagcaaga acctgacatg 2761 cactttgcat acctgatcta atgaaatgtc atatccaata catatttgga ctattatttc 2821 tatcacggtc ttctacatcc ccaagagtgc tgagcccatc caatggatgg agaagcacct 2881 cccatccatc tggccaaggt aaccgcagag acgagagtgg aaacagagtc tgttgttctg 2941 acttcttcta gcacagaatc cattggatat aatacttgat tatacgtcgt ttagtatcac 3001 tcacggatcc atttgaccta tgttggaaaa ctaaccaagt actatgcctt gcttcccacg 3061 atgagagagg aaagggagat ttcatgacct taaagagaga gagcgagcag aatttggagt 3121 ggagaagttg catatctttt tttttttttg aaacggagtc tcgctctgtc gcccaggctg 3181 gagtgcaatg gcgcagtctc tgctcactgc aacctccgcc tcccacgttc aagtgattct 3241 cctgcctcag cctctggagt agctgggact aaaagcaccc accgccatgc ctggctaatt 3301 tttgtatttt tagtagagat ggggcttctc catgttggcc aggctggtct cgaactgcta 3361 acctcatgat ccacccgcct cggcctccca aggtactggg attacaggcg tgagccactg 3421 cgcctggctg agaagttgca taccttatag ttcttatgca gtgttatggg tgagagaaag 3481 tcttagtcca cgtttgtgtg acactaggtc actgaacgtg tcgcccactg attttctgat 3541 atcttatgct cttatttttc ctttattaaa ttgatggtaa aatacaaaaa gttaaccatt 3601 ttaggctggg cacagtggct cacgcctata atctcagcac tttgggacgc cgaggcgggc 3661 agatcacctg aggtcaagga gtttgagacc aggctggcca acatggtgaa accccatctc 3721 tacaaaaata taaaaattag ctgggcatgg tggcaagtgc ctgtaattcc agctactcag 3781 gaggctgagg caggagaatc acttaaacgc gggaggcgga ggttgcagtg agccaagatc 3841 gcaccactgc actccagcct gggtgacaga acgagactgc gtctcaaaca acaataaaaa 3901 agttacccat ttaaatgtac aattcaatgg cattaagtac atctacagtg tgttgcaacc 3961 ataaccatta tctagttcca gaacttttta atcacaccaa gcaaaaactg catattcatt 4021 aagcaacaat tccccacttc ctagtttccc cagtccctgg caatcactaa tctgcattcg 4081 gtctctgtga gtttggctac tctggatatt tcatacaagt ggaatcatac aaaatgtgac 4141 ttgtggtgtc ttccttcctt ctctccttcc ttcctccttc tttccttcct ttcttttttc 4201 tttcatttta gagacagggt cttgctttgt tgcccaggct ggagtgcagt ggctcaatca 4261 tagtacacta ttaaccttga attcctggct caagtgagcc tcccatctca cccttctgag 4321 tagctaggac tacaggtgtg tgccaccaca tccagctaat ttaatttaat tttgttgttg 4381 ttgttgttag agatgaggat cttgctatgt ggctcaggct gatcatgaac tcctgagctc 4441 aagcaaccct cctacctcag cttcccaaag ctctgagatg acaggcgtga accaccacat 4501 ccagccctct tttgtttaga gtaatgtttt caaggttcag ccacgttgta gcatatatca 4561 gtgcttcatt gttttttatg gcagaataat attccattgg gtggataaac cacattttgt 4621 ttatccattc atctgttgat ggacatctgg gttgtttcta ccttttggct atagggagga 4681 gtgtctctat gaacattttt gtttgaatac atgttttctc tattcatgta ttcatgaatc 4741 tttggcccca tattctcctc ttggaggtca aggaccctat cttccttagc catttaatct 4801 ccctggtagg ggcttgcatg ttagcaatta tgaggatgtg tcataagatg caatcttggc 4861 cacacgcggt agctcacgtc tgtaatccca gcactttggg aggctgaggt gggtggatca 4921 cctggggtca gcagttcaag accagcctgg ccaacatggc aaaaccccac ctctactaaa 4981 tatacgaaaa ttagccagga gtggggtggt gcatacctgt agtctcagct actcaggatg 5041 ctgaggcagg ggaattgctt gaacccggga gtggaggttg cagtgagaga tcacgccact 5101 gcactccagc ctgggtgaca gagcaatatt ccatctcaaa aaaaaaaaaa aagatgtaat 5161 cttgcccaac aggtacaata aaagattgcc aatgtataga tgtgggcatg tgtaagggac 5221 agtggtgagg atgttccggc tgttgtggga gctctagggg aaggtgagaa agctaggagt 5281 gggaaaactg gcaggggcag aaatgagggg gtggggaaat ggtacggagg ttcataactg 5341 aggaaatgac cacgactgca ctatttcagg agcagacaag atggagacaa attcctctct 5401 ccccacgaac atctctggag ggacacctgc tgtatctgct ggctatctct tcctggatat 5461 catcacttat ctggtatttg cagtcacctt tgtcctcggg gtcctgggca acgggcttgt 5521 gatctgggtg gctggattcc ggatgacaca cacagtcacc accatcagtt acctgaacct 5581 ggccgtggct gacttctgtt tcacctccac tttgccattc ttcatggtca ggaaggccat 5641 gggaggacat tggcctttcg gctggttcct gtgcaaattc gtctttacca tagtggacat 5701 caacttgttc ggaagtgtct tcctgatcgc cctcattgct ctggaccgct gtgtttgcgt 5761 cctgcatcca gtctggaccc agaaccaccg caccgtgagc ctggccaaga aggtgatcat 5821 tgggccctgg gtgatggctc tgctcctcac attgccagtt atcattcgtg tgactacagt 5881 acctggtaaa acggggacag tagcctgcac ttttaacttt tcgccctgga ccaacgaccc 5941 taaagagagg ataaaggtgg ccgttgccat gttgacggtg agaggcatca tccggttcat 6001 cattggcttc agcgcaccca tgtccatcgt tgctgtcagt tatgggctta ttgccaccaa 6061 gatccacaag caaggcttga ttaagtccag tcgtccctta cgggtcctct cctttgtcgc 6121 agcagccttt tttctctgct ggtccccata tcaggtggtg gcccttatag ccacagtcag 6181 aatccgtgag ttattgcaag gcatgtacaa agaaattggt attgcagtgg atgtgacaag 6241 tgccctggcc ttcttcaaca gctgcctcaa ccccatgctc tatgtcttca tgggccagga 6301 cttccgggag aggctgatcc acgcccttcc cgccagtctg gagagggccc tgaccgagga 6361 ctcaacccaa accagtgaca cagctaccaa ttctacttta ccttctgcag aggtggcgtt 6421 acaggcaaag tgaggaggga gctgggggac actttcgagc tcccagctcc agcttcgtct 6481 caccttgagt taggctgagc acaggcattt cctgcttatt ttaggattac ccactcatca 6541 gaaaaaaaaa aaaagccttt gtgtcccctg atttggggag aataaacaga tatgagttta 6601 ttattgactt cttttttgat tttggacctc agcctcgggt ggtcagggtg ggaaatgata 6661 ggaagaagct gtcatctgca tcctagtttg cctgaaatga acccaaataa tacccattat 6721 tattagtcct gaattatgag tagtgaatga tacccatcat tctggcatca tgatgagtag 6781 tgtccacttc cattctgaaa agtgccctgc tgtgaaaaat aaattatata gtcatcctag 6841 gtaaatgaag gaggagggag aagtgtgaaa gagtatggct taaatcagac aagatataca 6901 agaagatact ttatataggg caggagcggt g // LOCUS HSSPRO 5296 bp DNA PRI 20-MAY-1992 DEFINITION Human S-protein gene, complete cds. ACCESSION X05006 NID g36572 KEYWORDS Alu repetitive sequence; S-protein; vitronectin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5296) AUTHORS Jenne,D. and Stanley,K.K. TITLE Nucleotide sequence and organization of the human S-protein gene: repeating peptide motifs in the 'pexin' family and a model for their evolution JOURNAL Biochemistry 26 (21), 6735-6742 (1987) MEDLINE 88107592 COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by K.Stanley, 14-JUL-1987. No pre-print was sent. There is a polyadenylation signal at positions 4591 to 4596. FEATURES Location/Qualifiers source 1..5296 /organism="Homo sapiens" /db_xref="taxon:9606" repeat_region 220..376 /note="78 bp repeat" prim_transcript 1636..>4591 /note="S-protein mRNA and introns" CDS join(1701..1764,1857..1976,2052..2396,2475..2614, 2871..3027,3225..3377,3837..4181,4415..4527) /note="S-protein" /codon_start=1 /db_xref="PID:g36573" /db_xref="SWISS-PROT:P04004" /translation="MAPLRPLLILALLAWVALADQESCKGRCTEGFNVDKKCQCDELC SYYQSCCTDYTAECKPQVTRGDVFTMPEDEYTVYDDGEEKNNATVHEQVGGPSLTSDL QAQSKGNPEQTPVLKPEEEAPAPEVGASKPEGIDSRPETLHPGRPQPPAEEELCSGKP FDAFTDLKNGSLFAFRGQYCYELDEKAVRPGYPKLIRDVWGIEGPIDAAFTRINCQGK TYLFKGXQYWRFEDGVLDPDYPRNISDGFDGIPDNVDAALALPAHSYSGRERVYFFKG KXYWEYQFQHQPSQEECEGSSLSAVFEHFAMMQRDSWEDIFELLFWGRTSAGTRQPQF ISRDWHGVPGQVDAAMAGRIYISGMAPRPSLAKKQRFRHRNRKGYRSQRGHSRGRNQN SRRPSRAMWLSLFSSEESNLGANNYDDYRMDWLVPATCEPIQSVFFFSGDKYYRVNLR TRRVDTVDPPYPRSIAQYWLGCPAPGHL" intron 1765..1856 /note="S-protein intron A" intron 1977..2051 /note="S-protein intron B" intron 2397..2474 /note="S-protein intron C" intron 2615..2870 /note="S-protein intron D" intron 3028..3224 /note="S-protein intron E" intron 3378..3836 /note="S-protein intron F" intron 4182..4414 /note="S-protein intron G" repeat_region 5166..5296 /note="partial Alu repeat" BASE COUNT 1143 a 1519 c 1532 g 1100 t 2 others ORIGIN 1 ggtaccaggc aagggtgccg gtcagaactt tgggggagga gtgtcggcag gatcggggga 61 gcggttggca aaggtgatgc aggcccagat gcagggagtg ggcgaaagga gagagcgagc 121 aggcgaggga tggctccgga cgccgtaggg tagaggaggc ggcggctgga acaaagcgtg 181 gcctgggcac gggctcggct gagctgggag atgctcggca gcagccacgc cccggaggag 241 cagcggtccc cgggccgacg ccccgcctct gggtcccgcg gggctcctcc agcctcctgc 301 agccacgccc cggaggagca gcggtccccg ggccgacgcc ccgcctctgg gtcccgcggg 361 gctcctccag cctcctgcgc agccccaccc ccaagccgtc cccagccacg cccacgcccc 421 tggccagcct tccctagccc ctcagtcagt cgcagccacc tccccctgcg ctggacttgg 481 aggaaggggg tggagaatga gaggacgctc tccgttctcc gccccgcctc cccgccccac 541 gtgacatcgg tacgtgggag caacaccgct ggatgccccc tccctactaa atggttaaaa 601 caatcatcat aactcttaag gcccttttat ggctgtttct tagcgaaacc tcacaacatg 661 cctatggggt gggaattgat gttccattcc acagctgact ccactgaagc ccatgaggaa 721 gaaattggcc caaatcacag tcccctctga ttacagaaca gaaaaagtca gctaaaatct 781 caggatcact tttttcaggt gatgggagga tttcgaagtt ctgtggacac ctgaaattgg 841 gcacaaatca ggtgttcatg ccagtgggca ttttccaggt agagggatca tatttctctc 901 gagagtctaa aagtgtgttg aacaagccca ctctttacag atggggagac tgagcctggg 961 gacagggagt ggcctgctca gaaaagactc agaaattaaa tccagtccag tgggttgata 1021 tttacccaaa tttccagcct ggggagattg atgcacccaa gagaagaacc cagaaatgaa 1081 actttgttct tttatgctaa aaaataaaat tccccagagt gcttacaatc tctcctccca 1141 ctcccttttt cctgccctaa ataaataatg gcgaatgagc acccagccag ggatgtgtct 1201 gatcaaacaa tcatggatca atagctatgt ttggagaagg aatttgtggc tgctccagct 1261 actgggcatt ttgtctggtc cagttcatgt aatctcccaa caccccatga agcaaggctt 1321 tgttaatcct attttattga aaatgaacta agactcagag agataaagct gttgcccaat 1381 gagccttctt tctgccctcc agatccacgg tgctaattcc ccttccgatg acctaatgat 1441 tctgagcttg gcaaaggtct tatctcccag ctcgcccagg cccagtgttc caggaatgtg 1501 acctttgctg cagcagccgc tggagggggc agaggggatg ggctggaggt tgagcaaaca 1561 gagcagcaga aaaggcagtt cctcttctcc agtgccctcc ttccctgtct ctgcctctcc 1621 ctcccttcct caggcatcag agcggagact tcagggagac cagagcccag cttgccaggc 1681 actgagctag aagccctgcc atggcacccc tgagacccct tctcatactg gccctgctgg 1741 catgggttgc tctggctgac caaggtacag gggatgttgg tggccatctg ggtcaatgta 1801 gggagggcga gggtggtctg ggcttggtgg caccgactga cactccttcc tcatagagtc 1861 atgcaagggc cgctgcactg agggcttcaa cgtggacaag aagtgccagt gtgacgagct 1921 ctgctcttac taccagagct gctgcacaga ctatacggct gagtgcaagc cccaaggtgt 1981 gttcagagcc caggtgggtg ggctggggtg ccccctgctg ctggagactc actaccatcc 2041 actctctgca gtgactcgcg gggatgtgtt cactatgccg gaggatgagt acacggtcta 2101 tgacgatggc gaggagaaaa acaatgccac tgtccatgaa caggtggggg gcccctccct 2161 gacctctgac ctccaggccc agtccaaagg gaatcctgag cagacacctg ttctgaaacc 2221 tgaggaagag gcccctgcgc ctgaggtggg cgcctctaag cctgagggga tagactcaag 2281 gcctgagacc cttcatccag ggagacctca gcccccagca gaggaggagc tgtgcagtgg 2341 gaagcccttc gacgccttca ccgacctcaa gaacggttcc ctctttgcct tccgaggtga 2401 atccagggca ggtactgggg atgcgggtct gccccaggag cgtccctgct ctcacaccat 2461 ctcctccact ctagggcagt actgctatga actggacgaa aaggcagtga ggcctgggta 2521 ccccaagctc atccgagatg tctggggcat cgagggcccc atcgatgccg ccttcacccg 2581 catcaactgt caggggaaga cctacctctt caaggtgcca ggggctgtgg gccagggtag 2641 aaagcatcta gggagggttt gagagctatt gctcccaggg acagggtgga cagggaagct 2701 ggacccaggg ccctgcagga cctggtggga gctctgtgag cacagggcag ccccaacagt 2761 ccaggtcctg ggcagtgaac ctggacctgg aacggctgct agggcaaggg actctgcctc 2821 tgtgcccagc cagcggctcc ataccccttt tcactttccc cacctcttag ggtartcagt 2881 actggcgctt tgaggatggt gtcctggacc ctgattaccc ccgaaatatc tctgacggct 2941 tcgatggcat cccggacaac gtggatgcag ccttggccct ccctgcccat agctacagtg 3001 gccgggagcg ggtctacttc ttcaagggta ctcagggggt ggtgggagac tgagcaggca 3061 gtggagcagt cttggattcc tttcacactt cactggggac aggcctcagc atgtgcccac 3121 ccctgacccc cacctcatgc tgggagatcc taacttcaac agcctctggg atctccagtc 3181 ttgccctggc ccagccctcc taatgcccac catcccgtcc tcagggaaas agtactggga 3241 gtaccagttc cagcaccagc ccagtcagga ggagtgtgaa ggcagctccc tgtcggctgt 3301 gtttgaacac tttgccatga tgcagcggga cagctgggag gacatcttcg agcttctctt 3361 ctggggcaga acctctggta tggagagagg gcaagtcttg cttctccctc aaaagggctg 3421 aaaccccttg gtattggtag agccaggccg gctggagggg ctgtggttgt ggagctatcg 3481 atcaaagtct gtttgctcag gccagacttt gcttctgttg accttttggg gaaagctcag 3541 ctctacctgg accccacacc ttggactttg cctagcacag ctgagagcac agccagcaga 3601 gggaggggct gtggctgagg agtttagggg gcctgggggg gtcgggtcga gacaccatca 3661 tatggtggag ggaaagcaca gggggaaggg aattggactg agagtcaaag gcctggctct 3721 gccattcgct gctgtgtgtc tttgggcaag ctgcagcaga tgaactctaa tggccccgct 3781 ggaaggggca agattcggac ccccaagacc tctcattcac cccttccctg ccagagctgg 3841 taccagacag ccccagttca ttagccggga ctggcacggt gtgccagggc aagtggacgc 3901 agccatggct ggccgcatct acatctcagg catggcaccc cgcccctcct tggccaagaa 3961 acaaaggttt aggcatcgca accgcaaagg ctaccgttca caacgaggcc acagccgtgg 4021 ccgcaaccag aactcccgcc ggccatcccg cgccatgtgg ctgtccttgt tctccagtga 4081 ggagagcaac ttgggagcca acaactatga tgactacagg atggactggc ttgtgcctgc 4141 cacctgtgaa cccatccaga gtgtcttctt cttctctgga ggtaggagcc gctgccaccc 4201 ctgaagctgg tctagcttgg gttttccttg ctgtccctgg tgcacaaggg ctgaacgcag 4261 cctggaagta gtgaccacag aaagccaggc cagaagtcct tagctgcatc atggatgttc 4321 actttccctt ctgggagaga ccctagactt ctcaagggaa gagtgggcag ggccaggctg 4381 ggcctcacgc atcctctgct ttcctctctt ccagacaagt actaccgagt caatcttcgc 4441 acacggcgag tggacactgt ggaccctccc tacccacgct ccatcgctca gtactggctg 4501 ggctgcccag ctcctggcca tctgtaggag tcagagccca catggccggg ccctctgtag 4561 ctccctcctc ccatctcctt cccccagccc aataaaggtc ccttagcccc gagtttaaaa 4621 ttattgttct gactggggta gacacatccc tgtctgggta aaggagagga agctggactg 4681 tcagaatggg ttggtgaggg ggacagagaa gggacagaga agggcctcgc cgtgtccaac 4741 ccatgttggg ctcaggacct ctctgtgctc aggacctctc tgtgaaccgg tttggggctg 4801 gggaaggcct cctgagagct acccttcctc ttgagctaag cagcccccag caacccacca 4861 gcagctaagc agacacttga actcctctga tgatgacaaa gttaccaggc cagcctccgt 4921 gggtgaggct tgctttgccc cttggagttg actagggtct tggaggagag aggggcagag 4981 ctggtgggtc tgctagcttt tgtccatcct gatctgaaat cctagtagct ctgactcctg 5041 agcacttggg gaaccagtca ctgtgcttcc ccaaacccac ccctagggtg agcctttctt 5101 gaggtctctg tctagggtct aggatgggga gcagcaagac caggagagat agtgaagggc 5161 agaaactgaa aggaaattca aaagcacaca cttcttgtgg aaaagtggcc caggagccca 5221 ggagtgtgaa gctgcagtga gctattattg taccactgca ctccaacctg gggaatagag 5281 tgagactcta tctcta // LOCUS HSGCSFG 2960 bp DNA PRI 24-APR-1993 DEFINITION Human gene for granulocyte colony-stimulating factor (G-CSF). ACCESSION X03656 NID g31687 KEYWORDS colony stimulating factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2960) AUTHORS Nagata,S., Tsuchiya,M., Asano,S., Yamamoto,O., Hirata,Y., Kubota,N., Oheda,M., Nomura,H. and Yamazaki,T. TITLE The chromosomal gene structure and two mRNAs for human granulocyte colony-stimulating factor JOURNAL EMBO J. 5 (3), 575-581 (1986) MEDLINE 86220137 COMMENT Data kindly reviewed (19-JUN-1986) by S. Nagata. FEATURES Location/Qualifiers source 1..2960 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 296..301 exon 329..403 /number=1 mRNA join(329..403,580..743,1122..1229,1374..1520,1685..2702) prim_transcript 329..2702 CDS join(364..403,580..743,1122..1229,1374..1520,1685..1849) /codon_start=1 /product="G-CSF protein" /db_xref="PID:g296647" /db_xref="SWISS-PROT:P09919" /translation="MAGPATQSPMKLMALQLLLWHSALWTVQEATPLGPASSLPQSFL LKCLEQVRKIQGDGAALQEKLVSECATYKLCHPEELVLLGHSLGIPWAPLSSCPSQAL QLAGCLSQLHSGLFLYQGLLQALEGISPELGPTLDTLQLDVADFATTIWQQMEELGMA PALQPTQGAMPAFASAFQRRAGGVLVASHLQSFLEVSYRVLRHLAQP" sig_peptide join(364..403,580..629) intron 404..579 /number=1 exon 580..743 /number=2 mat_peptide join(630..743,1122..1229,1374..1520,1685..1846) /product="mature G-CSF protein" intron 744..1121 /number=2 exon 1122..1229 /number=3 intron 1230..1373 /number=3 exon 1374..1520 /number=4 intron 1521..1684 /number=4 exon 1685..2702 /number=5 polyA_signal 2685..2690 polyA_site 2702 BASE COUNT 599 a 839 c 917 g 605 t ORIGIN 1 ctgccgcttc caggcgtcta tcagcggctc agcctttgtt cagctgttct gttcaaacac 61 tctggggcca ttcaggcctg ggtggggcag cgggaggaag ggagtttgag gggggcaagg 121 cgacgtcaaa ggaggatcag agattccaca atttcacaaa actttcgcaa acagcttttt 181 gttccaaccc ccctgcattg tcttggacac caaatttgca taaatcctgg gaagttatta 241 ctaagcctta gtcgtggccc caggtaattt cctcccaggc ctccatgggg ttatgtataa 301 agggccccct agagctgggc cccaaaacag cccggagcct gcagcccagc cccacccaga 361 cccatggctg gacctgccac ccagagcccc atgaagctga tgggtgagtg tcttggccca 421 ggatgggaga gccgcctgcc ctggcatggg agggaggctg gtgtgacaga ggggctgggg 481 atccccgttc tgggaatggg gattaaaggc acccagtgtc cccgagaggg cctcaggtgg 541 tagggaacag catgtctcct gagcccgctc tgtccccagc cctgcagctg ctgctgtggc 601 acagtgcact ctggacagtg caggaagcca cccccctggg ccctgccagc tccctgcccc 661 agagcttcct gctcaagtgc ttagagcaag tgaggaagat ccagggcgat ggcgcagcgc 721 tccaggagaa gctggtgagt gaggtgggtg agagggctgt ggagggaagc ccggtgggga 781 gagctaaggg ggatggaact gcagggccaa catcctctgg aagggacatg ggagaatatt 841 aggagcagtg gagctgggga aggctgggaa gggacttggg gaggaggacc ttggtgggga 901 cagtgctcgg gagggctggc tgggatggga gtggaggcat cacattcagg agaaagggca 961 agggcccctg tgagatcaga gagtgggggt gcagggcaga gaggaactga acagcctggc 1021 aggacatgga gggaggggaa agaccagaga gtcggggagg acccgggaag gagcggcgac 1081 ccggccacgg cgagtctcac tcagcatcct tccatcccca gtgtgccacc tacaagctgt 1141 gccaccccga ggagctggtg ctgctcggac actctctggg catcccctgg gctcccctga 1201 gcagctgccc cagccaggcc ctgcagctgg tgagtgtcag gaaaggataa ggctaatgag 1261 gagggggaag gagaggagga acacccatgg gctcccccat gtctccaggt tccaagctgg 1321 gggcctgacg tatctcaggc agcaccccct aactcttccg ctctgtctca caggcaggct 1381 gcttgagcca actccatagc ggccttttcc tctaccaggg gctcctgcag gccctggaag 1441 ggatctcccc cgagttgggt cccaccttgg acacactgca gctggacgtc gccgactttg 1501 ccaccaccat ctggcagcag gtgagccttg ttgggcaggg tggccaaggt cgtgctggca 1561 ttctgggcac cacagccggg cctgtgtatg ggccctgtcc atgctgtcag cccccagcat 1621 ttcctcattt gtaataacgc ccactcagaa gggcccaacc actgatcaca gctttccccc 1681 acagatggaa gaactgggaa tggcccctgc cctgcagccc acccagggtg ccatgccggc 1741 cttcgcctct gctttccagc gccgggcagg aggggtcctg gttgcctccc atctgcagag 1801 cttcctggag gtgtcgtacc gcgttctacg ccaccttgcc cagccctgag ccaagccctc 1861 cccatcccat gtatttatct ctatttaata tttatgtcta tttaagcctc atatttaaag 1921 acagggaaga gcagaacgga gccccaggcc tctgtgtcct tccctgcatt tctgagtttc 1981 attctcctgc ctgtagcagt gagaaaaagc tcctgtcctc ccatcccctg gactgggagg 2041 tagataggta aataccaagt atttattact atgactgctc cccagccctg gctctgcaat 2101 gggcactggg atgagccgct gtgagcccct ggtcctgagg gtccccacct gggacccttg 2161 agagtatcag gtctcccacg tgggagacaa gaaatccctg tttaatattt aaacagcagt 2221 gttccccatc tgggtccttg cacccctcac tctggcctca gccgactgca cagcggcccc 2281 tgcatcccct tggctgtgag gcccctggac aagcagaggt ggccagagct gggaggcatg 2341 gccctggggt cccacgaatt tgctggggaa tctcgttttt cttcttaaga cttttgggac 2401 atggtttgac tcccgaacat caccgacgtg tctcctgttt ttctgggtgg cctcgggaca 2461 cctgccctgc ccccacgagg gtcaggactg tgactctttt tagggccagg caggtgcctg 2521 gacatttgcc ttgctggatg gggactgggg atgtgggagg gagcagacag gaggaatcat 2581 gtcaggcctg tgtgtgaaag gaagctccac tgtcaccctc cacctcttca ccccccactc 2641 accagtgtcc cctccactgt cacattgtaa ctgaacttca ggataataaa gtgtttgcct 2701 ccagtcacgt ccttcctcct tcttgagtcc agctggtgcc tggccagggg ctggggaggt 2761 ggctgaaggg tgggagaggc cagagggagg tcggggagga ggtctgggga ggaggtccag 2821 ggaggaggag gaaagttctc aagttcgtct gacattcatt ccgttagcac atatttatct 2881 gagcacctac tctgtgcaga cgctgggcta agtgctgggg acacagcagg gaacaaggca 2941 gacatggaat ctgcactcga // LOCUS HUMTNFBA 2140 bp DNA PRI 14-JAN-1995 DEFINITION Human tumor necrosis factor-beta (TNFB) gene, complete cds. ACCESSION M55913 NID g339742 KEYWORDS tumor necrosis factor-beta. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2140) AUTHORS Abraham,L.J., Du,D.C., Zahedi,K., Dawkins,R.L. and Whitehead,A.S. TITLE Haplotypic polymorphisms of the TNFB gene JOURNAL Immunogenetics 33 (1), 50-53 (1991) MEDLINE 91139175 FEATURES Location/Qualifiers source 1..2140 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="R/612337" /cell_type="B-lymphocyte" /germline /map="6p21.3" exon 78..239 /gene="TNFB" /note="G00-120-442" /number=1 allele 87 /standard_name="8.1 haplotypic polymorphism" /note="8.1 ancestral haplotype is A1, B8, Bfs, C4AQ0, C4B1, DR3. Polymorphism is within the untranslated exon 1." /label=TNF8-1-1 /replace="a" allele 329 /gene="TNFB8.1" /standard_name="8.1 haplotypic polymorphism" /note="8.1 ancestral haplotype is A1, B8, Bfs, C4AQ0, C4B1, DR3. Polymorphism is within intron 1." /label=TNF8-1-2 /replace="g" exon 528..634 /gene="TNFB" /note="G00-120-442" /number=2 gene join(536..634,721..826,1074..1486) /gene="TNFB" CDS join(536..634,721..826,1074..1486) /gene="TNFB" /codon_start=1 /db_xref="GDB:G00-120-442" /product="tumor necrosis factor-beta" /db_xref="PID:g339743" /translation="MTPPERLFLPRVCGTTLHLLLLGLLLVLLPGAQGLPGVGLTPSA AQTARQHPKMHLAHSTLKPAAHLIGDPSKQNSLLWRANTDRAFLQDGFSLSNNSLLVP TSGIYFVYSQVVFSGKAYSPKAPSSPLYLAHEVQLFSSQYPFHVPLLSSQKMVYPGLQ EPWLHSMYHGAAFQLTQGDQLSTHTDGIPHLVLSPSTVFFGAFAL" exon 721..826 /gene="TNFB" /note="G00-120-442" /number=3 allele 800 /gene="TNFB" /standard_name="8.1 ancestral haplotypic polymorphism" /note="8.1 ancestral haplotype is A1, B8, C4AQ0, C4B1, DR3. Polymorphism is within exon 3." /label=TNF8-1-3 /replace="a" exon 1074..1486 /gene="TNFB" /note="G00-120-442" /number=4 BASE COUNT 414 a 735 c 495 g 496 t ORIGIN chromosome 6. 1 ccgacctaga acccgcccgc tgcctgccac gctgccactg ccgcttcctc tataaaggga 61 cctgagcgtc cgggcccagg ggctccgcac agcaggtgag gctctcctgc cccatctcct 121 tgggctgccc gtgcttcgtg ctttggacta ccgccccgca gtgtcctgcc ctctgcctgg 181 gcctcggtcc ctcctgcacc tgctgcctgg atccccggcc tgcctgggcc tgggccttgg 241 tgggtttggt tttggtttcc ttctctgtct ctgactctcc atctgtcagt ctcattgtct 301 ctgtcacaca ttctctgttt ctgccatgat tcctctctgt tcccttcctg tctctctctg 361 tctccctctg ctcaccttgg ggtttctctg actgcatctt gtccccttct ctgtcgatct 421 ctctctcggg ggtcgggggg tgctgtctcc cagggcggga ggtctgtctt ccgccgcgtg 481 ccccgccccg ctcactgtct ctctctctct ctctctttct ctgcaggttc tccccatgac 541 accacctgaa cgtctcttcc tcccaagggt gtgtggcacc accctacacc tcctccttct 601 ggggctgctg ctggttctgc tgcctggggc ccaggtgagg cagcaggaga atgggggctg 661 ctggggtggc tcagccaaac cttgagccct agagcccccc tcaactctgt tctcccctag 721 gggctccctg gtgttggcct cacaccttca gctgcccaga ctgcccgtca gcaccccaag 781 atgcatcttg cccacagcac cctcaaacct gctgctcacc tcattggtaa acatccacct 841 gacctcccag acatgtcccc accagctctc ctcctacccc tgcctcagga acccaagcat 901 ccacccctct cccccaactt cccccacgct aaaaaaaaca gagggagccc actcctatgc 961 ctccccctgc catcccccag gaactcagtt gttcagtgcc cacttcctca gggattgaga 1021 cctctgatcc agacccctga tctcccaccc ccatccccta tggctcttcc taggagaccc 1081 cagcaagcag aactcactgc tctggagagc aaacacggac cgtgccttcc tccaggatgg 1141 tttctccttg agcaacaatt ctctcctggt ccccaccagt ggcatctact tcgtctactc 1201 ccaggtggtc ttctctggga aagcctactc tcccaaggcc ccctcctccc cactctacct 1261 ggcccatgag gtccagctct tctcctccca gtaccccttc catgtgcctc tcctcagctc 1321 ccagaagatg gtgtatccag ggctgcagga accctggctg cactcgatgt accacggggc 1381 tgcgttccag ctcacccagg gagaccagct atccacccac acagatggca tcccccacct 1441 agtcctcagc cctagtactg tcttctttgg agccttcgct ctgtagaact tggaaaaatc 1501 cagaaagaaa aaataattga tttcaagacc ttctccccat tctgcctcca ttctgaccat 1561 ttcaggggtc gtcaccacct ctcctttggc cattccaaca gctcaagtct tccctgatca 1621 agtcaccgga gctttcaaag aaggaattct aggcatccca ggggaccaca cctccctgaa 1681 ccatccctga tgtctgtctg gctgaggatt tcaagcctgc ctaggaattc ccagcccaaa 1741 gctgttggtc ttgtccacca gctaggtggg gcctagatcc acacacagag gaagagcagg 1801 cacatggagg agcttggggg atgactagag gcagggaggg gactatttat gaaggcaaaa 1861 aaattaaatt atttatttat ggaggatgga gagaggggaa taatagaaga acatccaagg 1921 agaaacagag acaggcccaa gagatgaaga gtgagagggc atgcgcacaa ggctgaccaa 1981 gagagaaaga agtaggcatg agggatcaca gggccccaga aggcagggaa aggctctgaa 2041 agccagctgc cgaccagagc cccacacgga ggcatctgca ccctcgatga agcccaataa 2101 acctcttttc tctgaaatgc tgtctgcttg tgtgtgtgtg // LOCUS HSU16720 8868 bp DNA PRI 28-OCT-1995 DEFINITION Human interleukin 10 (IL10) gene, complete cds. ACCESSION U16720 NID g1041812 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8868) AUTHORS Sanjanwala,B. and de Waal-Malefyt,R. TITLE The Structure of the Human IL-10 gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 8868) AUTHORS Sanjanwala,B. TITLE Direct Submission JOURNAL Submitted (28-OCT-1994) Bharati Sanjanwala, Human Immunology, DNAX Institution, 901 California Ave., Palo Alto, CA 94035, USA FEATURES Location/Qualifiers source 1..8868 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" repeat_region 1144..1447 /rpt_type=dispersed /rpt_family="Alu" mRNA join(<4057..4221,5088..5147,5438..5590,6601..6666, 7742..8868) gene join(4057..4221,5088..5147,5438..5590,6601..6666, 7742..7834) /gene="IL10" CDS join(4057..4221,5088..5147,5438..5590,6601..6666, 7742..7834) /gene="IL10" /codon_start=1 /product="interleukin 10" /db_xref="PID:g1041813" /translation="MHSSALLCCLVLLTGVRASPGQGTQSENSCTHFPGNLPNMLRDL RDAFSRVKTFFQMKDQLDNLLLKESLLEDFKGYLGCQALSEMIQFYLEEVMPQAENQD PDIKAHVNSLGENLKTLRLRLRRCHRFLPCENKSKAVEQVKNAFNKLQEKGIYKAMSE FDIFINYIEAYMTMKIRN" repeat_unit 8427..8440 repeat_region 8441..8741 /rpt_type=dispersed /rpt_family="Alu" repeat_unit 8742..8755 BASE COUNT 2425 a 2137 c 2075 g 2231 t ORIGIN 1 ccctccaaaa tctatttgca taagcacaca cacacacaca cacacacaca ccccagcagt 61 tcttgcctgc ccagattcct ctgcagctaa agtgatgaaa cttactgggc ggagcttcct 121 aaaaagatta ttagggtctc ctgggttggt gtgcctttaa acctttggac tttaccacct 181 cctatctctc ctatctcctt gcaacaaagg ttaggagaac aagaatgcac aaaaaacggg 241 tcctggatga catctgagtg cctgctttgg gcttcttgat gagtgagaca gaaaataaaa 301 tacaaccccc tcttttaaaa gccatgctta ctcaggtttt ccttcatttg cagctaaata 361 cagaaatgag agaatatttt ggagcaggga tggaagaaga gaggtattcc ccttcccaca 421 accttctgat ttcccactac atcccccact ggaaaaattc atttaaaatc agtataataa 481 gcatttgatt agatgcctac tatgcatctg ggcttgaggg caaactggac tcagcctttt 541 ggcctcaaga agctcacagt gtgagagtgg catttgtgtc ctcttaaatt cacaggacta 601 aattgtccca ggctacattc tatccatcca taggtgcctg ccttctcact tccctctctt 661 catggctctt gccttgtagg aaaatccaaa cccaaatgtg gtgacatgtg agtgttggca 721 ttcatgtctc agacatgacc tatgggcttg ggacttttcc ccgtgtaccc cacgtgactt 781 ttcacgatga acaggtatct ccaaaaactt cgagaaatag gagtcctgtt tgtgtgttct 841 tgttgctttg tcaatatata gagagcacag ggtcatctta taattctaaa aatgttcatt 901 atctatctct tcgacagaaa tactatgaga catacttgat taggagaagc cgttatctcc 961 atatgctaaa tgaggacttg caccagggaa cttgcccatg gttctctcca accacttaaa 1021 ttctgaaatt ttgaaatgag agtggacagt aatttcaaat caatggggaa agaatcaaat 1081 cttcagcaaa tggcttgaga taattagcta cacatttcag aacaaataaa gaagtcagat 1141 ccgggccggg cacagtggct catgctgtaa tctcagcact ctgggaggcc aaggcgggcg 1201 gatcataagg tcaggagatc gagaccatcc tggttaacac agtgaaaccc cgtctctaat 1261 aaaaatacaa aagaaaataa aaaaacttag ccgggcgtgg tgccagcgcc tgtagtccca 1321 gctactcggg agcgtgaggc aggagaatgg cttgaactcg ggaggcagag cttgcagtga 1381 gctgagatca tgccactgca ctccagcctg ggcaacagag cgagactctg tctcaaaaaa 1441 aaaaaaagct agtcagatcc taacctcaac cctatttaac agattataga tgaagaaggt 1501 acaaatggct tttacatacc tcccttctcc ctgacatttt gtatgtgtgt gtgtgtgtat 1561 ttacacacac atctcatata aggaaattga agggaggctg cctgcatccc tgagtcactc 1621 tccctctcct tctgaatgct tacctgtgcc cagaccacct ccttagcctc gcaccctcca 1681 ggcttacagg gcactcttct atgcccatcc caagtatagc tgataccttc caagggccag 1741 acttggtgct aagtaccaag tacgcaaaga ttaataaaac aatgtcctgt ttcagggagc 1801 tcaaagctga ttcggcaggg catggtgtgt acatgaatga taaccacgta gggttgcagg 1861 tttcctagtg aggtaagcac aaggcaagat gggaaacaaa ggaaggaggg gttcacagcc 1921 tcacccagag tccagaaccc ctggcctgcc tggtgcccat gctgagtcca cttctggaac 1981 acccagctca gagagggggt tagacctgca ggctaacaca gacacagccc agaaaaccca 2041 ggagccgagg gggaaggaga aaggtgcaag aaggggaaac ccaggtcctg gtccccttct 2101 ctctgcttcc tggcagcaga actcagacag aacccttaag ccagtctaag tctggcagga 2161 ccagtaagtt ctgagttagc tccatactag tttctagcag gctctttctc acttcctgat 2221 tcttaggttt ctacattgac actccctgaa gagttgggaa gagacaccac agtcccctga 2281 ccctgatcca taggtcacac agcagggaca tccacagggt gacgtgggcc ctctcatccc 2341 tccctcccac tcacttcacg ctggctgggc cccaaggtgt ttgcacccct tgcagtgagt 2401 gaccttctct agtgcagcaa gctcagaacc tgctgccact ggagttgtcc cattgctgat 2461 gcagaaaggt gaagaactag cagaacactg gaaatgccct ccatctgggt ccatggctac 2521 ttaagctcaa tgctccctgg caggcaggag gacaggtgct attgccctgt tgggacagat 2581 gaaaaacaga cacagggagg atgagtgatt tgccctgact atagagtggc agggccaagc 2641 agagcccagg cctcctgcac ctaggtcaat gttcctccca gttacagtct aaactggaat 2701 gcaggcaaag cccctgtgga aggggaaggt gaaggctcaa tcaaaggatc cccagagact 2761 ttccagatat ctgaagaagt cctgatgtca ctgccccggt ccttccccag gtagagcaac 2821 actcctcgct gcaacccaac tggctcccct taccttctac acacacacac acacacacac 2881 acacacacac acacacacac acacaaatcc aagacaacac tactaaggct tctttgggag 2941 ggggaagtag ggataggtaa gaggaaagta agggacctcc tatccagcct ccatggaatc 3001 ctgacttctt ttccttgtta tttcaacttc ttccacccca tcttttaaac tttagactcc 3061 agccacagaa gcttacaact aaaagaaact ctaaggccaa tttaatccaa ggtttcattc 3121 tatgtgctgg agatggtgta cagtagggtg aggaaaccaa attctcagtt agcactggtg 3181 tacccttgta caggtgatgt aacatctctg tgcctcagtt tgctcactat aaaatagaga 3241 cggtaggggt catggtgagc actacctgac tagcatataa gaagctttca gcaagtgcag 3301 actactctta cccacttccc ccaagcacag ttggggtggg ggacagctga agaggtggaa 3361 acatgtgcct gagaatccta atgaaatcgg ggtaaaggag cctggaacac atcctgtgac 3421 cccgcctgtc ctgtaggaag ccagtctctg gaaagtaaaa tggaagggct gcttgggaac 3481 tttgaggata tttagcccac cccctcattt ttacttgggg aaactaaggc ccagagacct 3541 aaggtgactg cctaagttag caaggagaag tcttgggtat tcatcccagg ttggggggac 3601 ccaattattt ctcaatccca ttgtattctg gaatgggcaa tttgtccacg tcactgtgac 3661 ctaggaacac gcgaatgaga acccacagct gagggcctct gcggacagaa cagctgttct 3721 ccccaggaaa tcaacttttt ttaattgaga agctaaaaaa ttattctaag agaggtagcc 3781 catcctaaaa atagctgtaa tgcagaagtt catgttcaac caatcatttt tgcttacgat 3841 gcaaaaattg aaaactaagt ttattagaga ggttagagaa ggaggagctc taagcagaaa 3901 aaatcctgtg ccgggaaacc ttgattgtgg ctttttaatg aatgaagagg cctccctgag 3961 cttacaatat aaaaggggga cagagaggtg aaggtctaca catcaggggg ttgctcttgc 4021 aaaaccaaac cacaagacag acttgcaaaa gaaggcatgc acagctcagc actgctctgt 4081 tgcctggtcc tcctgactgg ggtgagggcc agcccaggcc agggcaccca gtctgagaac 4141 agctgcaccc acttcccagg caacctgcct aacatgcttc gagatctccg agatgccttc 4201 agcagagtga agactttctt tgtgagtatg attccttcct gtcctttctc tcttcctggg 4261 actgcctgaa ctagacattc tcctggaact ataagaaccc tcctcctgcg cctccacctc 4321 catccccaac acctattccc ccaaacttaa attcttaaga agaaatccta gatcaagcca 4381 tgggttggtc agttaagcta agccagatag atacagtaaa tgtcaggaca cacctgcctt 4441 ataaagtaaa tgcgttcttt ctcgtgctga gaaacttata acgcactcct gctgcgcgcc 4501 tatatcattt attggctagg agaagtaaag aaaggtctga tgtcgaggtg aagatgctcc 4561 ccagtccttg cagcaaggga aatttaaatt gcctctgctt agagcgtttc cagcctgaaa 4621 gaccagtggt ttagggaagc actctaccat gagggaaacc tgcattagaa ggagcttctt 4681 aaatccctgg gatctttcca agctaaactg agtgtctaca gtggggagaa agaaaagcag 4741 agaacaggac atgaggggct caaggccccg aagggttgac ataggtgtcc cttaaagcct 4801 aatgtacgtc cgcagaaaga agaccaggac tgagtcaagc ttctgctttc ccttgaaaat 4861 caggccagat ttttaaaata acttgactct agaggaggag gactgattta agtgatcgtg 4921 tcccatactg ttgaatcctc tgtttttaaa ctcccctttt gtattatatt tggccagagc 4981 caatttgtat taaaaaaaaa aaaatctcta aatgaaaggg catcaaaaat accgcatttc 5041 agttatttcc ccaaacctaa agttcattct cctttttctt cctgcagcaa atgaaggatc 5101 agctggacaa cttgttgtta aaggagtcct tgctggagga ctttaaggtg agagcagggg 5161 cgggggtgct gggggagtgt gcagcatgat taagggaagg gaggctctgc ttcctgattg 5221 tgcagggaat tgggtttgtt tccttggctt gaaaggagaa gtgggaagat gttaactcag 5281 cacatcagca gcagagggtt tacaaagggc tcagtcttcg ggggaggctt ctggtaagga 5341 ggatcgcatg aacaagctgt cctcttaagc tagttgcagc agccctcctc ccagccacct 5401 ccgccaatct ctcactcacc ttcggctcct gccccagggt tacctgggtt gccaagcctt 5461 gtctgagatg atccagtttt acctggagga ggtgatgccc caagctgaga accaagaccc 5521 agacatcaag gcgcatgtga actccctggg ggagaacctg aagaccctca ggctgaggct 5581 acggcgctgt gtaagtagca gatcagttct ttcccttgca gctgccccca aaataccatc 5641 tcctacagac cagcagggac actcacatcc acagacacag caaagagaca cagctgcaag 5701 cgatcgtgta aatgaggaaa gactcctgag tcatagtctc ttctcatttc tctttgagca 5761 ggcgttgggg gtggctgcta ggcatttaca tgtgaaattt gcaaacagct tcctgttatt 5821 tgtgagtcat ttgtgggtta ttaactactc ccctctctct tcataaaagg agcccagagc 5881 ttcagtcagg cctccactgc ctctttgtac tagacctggg cggggagcta aggttcccaa 5941 agcagaggga aacatcattc acctctttta atctcaatgt ttgaaagcaa agctctaaga 6001 agggcccaat tgactgacag gatttcccct ggcattttag aagggacaag ggggctattc 6061 atccccaggc tagtgtctat gagtaattcc tccaggaatt tatttctcca actgaaatga 6121 tgccgtcact actaatggtt tcccctgttc tgtcaccaat attggaaaat cagttggtgt 6181 ctatttgtag gacaaggcta tgtgaagggt ttggtcccag tagcttccct cctcagatgc 6241 ttagttagtg ttcctccggt ggctgtgact gacggggggg agaacaggag agagaggcag 6301 aaaaggacag gctgaagaat gcctcgctca gcactgcagg agatactgta gagttctggg 6361 ggaggaagga atcccaagac cctgggttgt catccaagcc ttgcaaacat cttggagtga 6421 gtcctggaga aatacattta actcccaggg ccatggaagc agggctcagt tctctctccc 6481 agctgtgagg cgaggatttg gataaatctg gcctcctcat gatgcaccag cttgtcccta 6541 agcgtgatgg acatggagct ggaagccagg atcaccaaca ctttctcttt tcttccacag 6601 catcgatttc ttccctgtga aaacaagagc aaggccgtgg agcaggtgaa gaatgccttt 6661 aataaggtag agagggtctc agagcacaac ccatgcccac tccccaaccc caaagcatgg 6721 aaggtggtgg gactcaatag gccccattct tcattgagag agtgtgggaa cctacaatgg 6781 tatgacctct cagccattag gagctgctgc cttgattgta tttgttttct gttaagttgt 6841 ctttgggggt tctaaatgac tgctcgcttg cctttgcagg cttgcgggtc agggctggcc 6901 gcccaggtga acacagatga gctgcatgct ggggagagtg acaaaggaaa cagaaagtac 6961 agaaagtagc ttgttgggaa tctagtctga acccacacgt gcaggaagct ggcacattaa 7021 atgtgcacat tacaaataca cctgggggtg cagcccagat ctcccctagg acctcagaat 7081 gagcaggaag ctggattgct cacttaacct ggagttggtt caagcccgct ttccatctgc 7141 ccttcgcacc tgcggaggtg cctgagaatg tcagtttccc aaacgaaatg gggtttcaca 7201 cttccaactg tgcgtgaact ttttcagtct gatttcccag aaaccgtgcg gcctatgtcc 7261 tcctcgtggg ctggggacag acactgcaca gagtgccaac atcagggggt gtgaatttct 7321 catagtaggt cagggcggca gggagggcct gctcagtgtg ttggtgggag aacacagaca 7381 tttaaaaggc tccctcctct cctctcaccg tcttgctttc gaagcgcttc ctctaatgtc 7441 ttttcatcaa actctgcata atcatcatgt gaatacgtga cctttaaaat tgttgaaaag 7501 gcatcatttt gaagacagtg ctttgcaaaa tgaatgctac cccaattgct agggggaggc 7561 ctggaggaga tgaaaggtca atgcacagcc tttcccaagg cagctaggcc tatcctctgg 7621 tttacttccc agcgtgaggg agaacaagca acctctgcac tcaaggtcat gcccatccat 7681 gagcatgagg gaggggagcc tatttagtcc ccagaaagga ttttaactgt atgtttctta 7741 gctccaagag aaaggcatct acaaagccat gagtgagttt gacatcttca tcaactacat 7801 agaagcctac atgacaatga agatacgaaa ctgagacatc agggtggcga ctctatagac 7861 tctaggacat aaattagagg tctccaaaat cggatctggg gctctgggat agctgaccca 7921 gccccttgag aaaccttatt gtacctctct tatagaatat ttattacctc tgatacctca 7981 acccccattt ctatttattt actgagcttc tctgtgaacg atttagaaag aagcccaata 8041 ttataatttt tttcaatatt tattattttc acctgttttt aagctgtttc catagggtga 8101 cacactatgg tatttgagtg ttttaagata aattataagt tacataaggg aggaaaaaaa 8161 atgttctttg gggagccaac agaagcttcc attccaagcc tgaccacgct ttctagctgt 8221 tgagctgttt tccctgacct ccctctaatt tatcttgtct ctgggcttgg ggcttcctaa 8281 ctgctacaaa tactcttagg aagagaaacc agggagcccc tttgatgatt aattcacctt 8341 ccagtgtctc ggagggattc ccctaacctc attccccaac cacttcattc ttgaaagctg 8401 tggccagctt gttatttata acaacctaaa tttggttcta ggccgggcgc ggtggctcac 8461 gcctgtaatc ccagcacttt gggaggctga ggcgggtgga tcacttgagg tcaggagttc 8521 ctaaccagcc tggtcaacat ggtgaaaccc cgtctctact aaaaatacaa aaattagccg 8581 ggcatggtgg cgcgcacctg taatcccagc tacttgggag gctgaggcaa gagaattgct 8641 tgaacccagg agatggaagt tgcagtgagc tgatatcatg cccctgtact ccagcctggg 8701 tgacagagca agactctgtc tcaaaaaaat aaaaataaaa ataaatttgg ttctaataga 8761 actcagtttt aactagaatt tattcaattc ctctgggaat gttacattgt ttgtctgtct 8821 tcatagcaga ttttaatttt gaataaataa atgtatctta ttcacatc // LOCUS HUMCP21OH 4042 bp DNA PRI 01-NOV-1994 DEFINITION Human 21-hydroxylase B gene, complete cds. ACCESSION M26856 X05448 NID g180963 KEYWORDS 21-hydroxylase. SOURCE Human whole blood, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4042) AUTHORS Rodrigues,N.R., Dunham,I., Yu,C.Y., Carroll,M.C., Porter,R.R. and Campbell,R.D. TITLE Molecular characterization of the HLA-linked steroid 21-hydroxylase B gene from an individual with congenital adrenal hyperplasia JOURNAL EMBO J. 6 (6), 1653-1661 (1987) MEDLINE 87275858 FEATURES Location/Qualifiers source 1..4042 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6p21.3" gene 745..946 /gene="CYP21" CDS join(745..946,1044..1133,1416..1570,1678..1779,1868..1969, 2071..2157,2327..2527,2728..2906,2990..3093,3191..3456) /note="21-hydroxylase B" /codon_start=1 /db_xref="PID:g180964" /translation="MLLLGLLLLLPLLAGARLLWNWWKLRSLHLPPLAPGFLHLLQPD LPIYLLGLTQKFGPIYRLHLGLQDVVVLNSKRTIEEAMVKKWADFAGRPEPLTYKLVS RNYPDLSLGDYSLLWKAHKKLTRSALLLGIRDSMEPVVEQLTQEFCERMRAQPGTPVA IEEEFSLLTCSIICYLTFGDKIKDDNLMPAYYKCIQEVLKTWSHWSIQIVDVIPFLRF FPNPGLRRLKQAIEKRDHIVEMQLRQHKESLVAGQWRDMMDYMLQGVAQPSMEEGSGQ LLEGHVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPGASSSR VPYKDRARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGA HLDETVWERPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAF TLLPSGDALPSLQPLPHCSVILKMQPFQVRLQPRGMGAHSPGQNQ" exon <745..946 /gene="CYP21" /note="21-hydroxylase B; G00-120-605" /number=1 intron 947..1043 /note="CYP21P intron A" exon 1044..1133 /number=2 intron 1134..1415 /note="CYP21P intron B" exon 1416..1570 /number=3 intron 1571..1677 /note="CYP21P intron C" exon 1678..1779 /number=4 intron 1780..1867 /note="CYP21P intron D" exon 1868..1969 /number=5 intron 1970..2070 /note="CYP21P intron E" exon 2071..2157 /number=6 intron 2158..2326 /note="CYP21P intron F" exon 2327..2527 /number=7 intron 2528..2727 /note="CYP21P intron G" exon 2728..2906 /number=8 intron 2907..2989 /note="CYP21P intron H" exon 2990..3093 /number=9 intron 3094..3190 /note="CYP21P intron K" exon 3191..>3456 /note="21-hydroxylase B" /number=10 polyA_signal 3942..3947 BASE COUNT 779 a 1261 c 1154 g 848 t ORIGIN Chromosome 6p21.3. 1 tcgacagcta gatttccagg ctggaatcct gccctccaca acatgcgaac aatacccgtg 61 ttgcatatag agcatggctg tgaagagttg agtgagtgcc cacaaagcac ttagagcagt 121 gtctggtaca tgctattact ccgcagcggg aaaccacttc ctcctttgtc ttctgggcac 181 ttttgtgagt gaaaggaggc actaataaca atcacactgg gatacctgta tatactggaa 241 tgccccaggc aaaccaggct taaactgtat tactctatct gtagcttaaa ctaacaaaca 301 acccacacaa atcacatttt gttcttcagg cgattcagga aggcctatta ggcagggact 361 gccattttct ctctgagaca aacatcatgc cagtaaactg gcccacggtg gggtggcaga 421 gggagagggc ccaggtgggg gcggacacta ttgcctgcac aggtgatgtg gaaccagaaa 481 gctgactctg gatgcaggaa aaaggtcagg gttgcatttc ccttccttgc ttcttgatgg 541 gtgatcaatt tttttgaaat acggacgtcc caaggccaat gagactggtg tcattccaga 601 aaagggccac tctgtgggcg ggtcggtggg agggtacctg aaggtggggt caagggaggc 661 cccaaaacag tctacacagc aggagggatg gctggggctc ttgagctata agtggcacct 721 cagggccctg acgggcgtct cgccatgctg ctcctgggcc tgctgctgct gctgcccctg 781 ctggctggcg cccgcctgct gtggaactgg tggaagctcc ggagcctcca cctcccgcct 841 cttgccccgg gcttcttgca cttgctgcag cccgacctcc caatctatct gcttggcctg 901 actcagaaat tcgggcccat ctacaggctc caccttgggc tgcaaggtga gaggctgatc 961 tcgctctggc cctcaccata ggagggggcg gaggtgacgg agagggtcct ctctccgctg 1021 acgctgcttt ggctgtctcc cagatgtggt ggtgctgaac tccaagagga ccattgagga 1081 agccatggtc aaaaagtggg cagactttgc tggcagacct gagccactta cctgtaaggg 1141 ctgggggcat tttttctttc ttaaaaaaat ttttttttaa gagatgggtt cttgctatgc 1201 tgcccaggct ggtcttaaat tcctagtctc aaatgatcct cccacctcag cctcaagtgt 1261 gagccacctt tggggcatcc ccaatccagg tccctggaag ctcttggggg ggcatatctg 1321 gtggggagaa agcaggggtt ggggaggccg aagaaggtca ggccctcagc tgccttcatc 1381 agttcccacc ctccagcccc cacctcctcc tgcagacaag ctggtgtcta ggaactaccc 1441 ggacctgtcc ttgggagact actccctgct ctggaaagcc cacaagaagc tcacccgctc 1501 agccctgctg ctgggcatcc gtgactccat ggagccagtg gtggagcagc tgacccagga 1561 gttctgtgag gtaaggctgg gctcctgagg ccacctcggg tcagccttgc ctctcacagt 1621 agcccccgcc ctgcccgctg cacagcggcc tgctgaactc acactgtttc tccacagcgc 1681 atgagagccc agcccggcac ccctgtggcc attgaggagg aattctctct cctcacctgc 1741 agcatcatct gttacctcac cttcggagac aagatcaagg tgcctcacag cccctcaggc 1801 ccacccccag cccctccctg agcctctcct tgtcctgaac tgaaagtact ccctcctttt 1861 ctggcaggac gacaacttaa tgcctgccta ttacaaatgt atccaggagg tgttaaaaac 1921 ctggagccac tggtccatcc aaattgtgga cgtgattccc tttctcaggg tgaggacctg 1981 gagcctagac acccctgggt tgtaggggag aggctggggt ggagggagag gctccttccc 2041 acagctgcat tctcatgctt cctgccgcag ttcttcccca atccaggtct ccggaggctg 2101 aagcaggcca tagagaagag ggatcacatc gtggagatgc agctgaggca gcacaaggtg 2161 gggactgtac gtggacggcc tcccctcggc ccacagccag tgatgctacc ggcctcagca 2221 ttgctatgag gcgggttctt ttgcataccc cagttatggg cctgttgcca ctctgtactc 2281 ctctccccag gccagccgct cagcccgctc ctttcaccct ctgcaggaga gcctcgtggc 2341 aggccagtgg agggacatga tggactacat gctccaaggg gtggcgcagc cgagcatgga 2401 agagggctct ggacagctcc tggaagggca cgtgcacatg gctgcagtgg acctcctgat 2461 cggtggcact gagaccacag caaacaccct ctcctgggcc gtggtttttt tgcttcacca 2521 ccctgaggtg cgtcctgggg acaagcaaaa ggctccttcc cagcaacctg gccagggcgg 2581 tgggcaccct cactcagctc tgagcactgt gcggctgggg ctgtgcttgc ctcaccggca 2641 ctcaggctca ctgggttgct gagggagcgg ctggaggctg ggcagctgtg ggctgctggg 2701 gcaggactcc acccgatcat tccccagatt cagcagcgac tgcaggagga gctagaccac 2761 gaactgggcc ctggtgcctc cagctcccgg gtcccctaca aggaccgtgc acggctgccc 2821 ttgctcaatg ccaccatcgc cgaggtgctg cgcctgcggc ccgttgtgcc cttagccttg 2881 ccccaccgca ccacacggcc cagcaggtga ctcccgaggg ttggggatga gtgaggaaag 2941 cccgagccca gggaggtcct ggccagcctc taactccagc ccccttcagc atctccggct 3001 acgacatccc tgagggcaca gtcatcattc cgaacctcca aggcgcccac ctggatgaga 3061 cggtctggga gaggccacat gagttctggc ctggtatgtg gggggccggg ggcctgccgt 3121 gaaaatgtgg tggaggctgg tccccgctgc cgctgaacgc ctccccaccc acctgtccac 3181 ccgcccgcag atcgcttcct ggagccaggc aagaactcca gagctctggc cttcggctgc 3241 ggtgcccgcg tgtgcctggg cgagccgctg gcgcgcctgg agctcttcgt ggtgctgacc 3301 cgactgctgc aggccttcac gctgctgccc tccggggacg ccctgccctc cctgcagccc 3361 ctgccccact gcagtgtcat cctcaagatg cagcctttcc aagtgcggct gcagccccgg 3421 gggatggggg cccacagccc gggccagaac cagtgatggg gcaggaccga tgccagccgg 3481 gtacctcagt ttctccttta ttgctcctgt acgaacccct cccctccccc ctgtaaacac 3541 agtgctgcga gatcgctggc agagaaggct tcctccagcg gctgggtggt gaaggaccct 3601 ggctcttctc tcggggcgac ccctcagtgc tcggcagtca tactggggtg cgagagaggt 3661 gggcagcagc tcagcctccc cccgctgggg agcgaaagtt tcttggtctc agcttcattt 3721 ccgtgaaggg caccgagaac tcgaagccct tccagtggta ccagctcact ccctgggaaa 3781 ggggttgtca agagagagtc aaagccggat gtcccatctg ctcttcccgt tccccttaag 3841 gaggtagctc ccagcactca accaacctcc ccgcagagct cccttcctga ccctccgctg 3901 cagaggattg aggcttaatt ctgagctggc cctttccagc caataaatca actccagctc 3961 cctctgcgag gctggcatga ttgttccatt tcacccagcc actcagtccc ttgcctgtta 4021 cactgtgggg ctgaaaccta gg // LOCUS HUMMIS 3100 bp DNA PRI 03-MAY-1996 DEFINITION Human Mullerian inhibiting substance gene, complete cds. ACCESSION K03474 NID g188560 KEYWORDS Mullerian inhibiting substance; antigrowth protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3100) AUTHORS Cate,R.L., Mattaliano,R.J., Hession,C., Tizard,R., Farber,N.M., Cheung,A., Ninfa,E.G., Frey,A.Z., Gash,D.J., Chow,E.P., Fisher,R.A., Bertonis,J.M., Torres,G., Wallner,B.P., Ramachandran,K.L., Ragin,R.C., Manganaro,T.F., MacLaughlin,D.T. and Donahoe,P.K. TITLE Isolation of the bovine and human genes for Mullerian inhibiting substance and expression of the human gene in animal cells JOURNAL Cell 45 (5), 685-698 (1986) MEDLINE 86218082 REFERENCE 2 (bases 1 to 3100) AUTHORS Cate,R.L. TITLE Direct Submission JOURNAL Submitted (16-FEB-1987) Richard L. Cate, Molecular Genetics, Biogen, Inc., 14 Cambridge Center, Cambridge, MA 02142, USA COMMENT The precise 3' boundary of the signal peptide has not been established and could be at position 265. FEATURES Location/Qualifiers source 1..3100 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="chmis33" /map="174 bp upstream of AflII site" prim_transcript 201..2944 /note="MIS mRNA" CDS join(211..622,1215..1357,1530..1638,1727..1886,1977..2835) /codon_start=1 /product="prepro-Mullerian inhibiting substance" /db_xref="PID:g386953" /translation="MRDLPLTSLALVLSALGALLGTEALRAEEPAVGTSGLIFREDLD WPPGIPQEPLCLVALGGDSNGSSSPLRVVGALSAYEQAFLGAVQRARWGPRDLATFGV CNTGDRQAALPSLRRLGAWLRDPGGQRLVVLHLEEVTWEPTPSLRFQEPPPGGAGPPE LALLVLYPGPGPEVTVTRAGLPGAQSLCPSRDTRYLVLAVDRPAGAWRGSGLALTLQP RGEDSRLSTARLQALLFGDDHRCFTRMTPALLLLPRSEPAPLPAHGQLDTVPFPPPRP SAELEESPPSADPFLETLTRLVRALRVPPARASAPRLALDPDALAGFPQGLVNLSDPA ALERLLDGEEPLLLLLRPTAATTGDPAPLHDPTSAPWATALARRVAAELQAAAAELRS LPGLPPATAPLLARLLALCPGGPGGLGDPLRALLLLKALQGLRVEWRGRDPRGPGRAQ RSAGATAADGPCALRELSVDLRAERSVLIPETYQANNCQGVCGWPQSDRNPRYGNHVV LLLKMQARGAALARPPCCVPTAYAGKLLISLSEERISAHHVPNMVATECGCR" exon <211..622 /note="prepro-Mullerian inhibiting substance" /number=1 sig_peptide 211..261 /note="Mullerian inhibiting substance signal peptide (see comment)" mat_peptide join(286..622,1215..1357,1530..1638,1727..1886,1977..2832) /note="Mullerian inhibiting substance" intron 623..1214 /note="MIS cds intron A" exon 1215..1357 /number=2 intron 1358..1529 /note="MIS cds intron B" exon 1530..1638 /number=3 intron 1639..1726 /note="MIS cds intron C" exon 1727..1886 /number=4 intron 1887..1976 /note="MIS cds intron D" exon 1977..>2835 /note="prepro-Mullerian inhibiting substance" /number=5 BASE COUNT 433 a 1140 c 1063 g 464 t ORIGIN 1 cacatcaggc ccagctctat cactggggag ggagataggc tgccagggac agaaagggct 61 ctttgagaag gccactctgc ctggagtggg ggcgccgggc actgtccccc aaggtcgcgg 121 cagaggagat aggggtctgt cctgcacaaa caccccacct tccactcggc tcacttaagg 181 caggcagccc agcccctggc agcacccacg atgcgggacc tgcctctcac cagcctggcc 241 ctagtgctgt ctgccctggg ggctctgctg gggactgagg ccctcagagc agaggagcca 301 gctgtgggca ccagtggcct catcttccga gaagacttgg actggcctcc aggcatccca 361 caagagcctc tgtgcctggt ggcactgggc ggggacagca atggcagcag ctcccccctg 421 cgggtggtgg gggctctaag cgcctatgag caggccttcc tgggggccgt gcagagggcc 481 cgctggggcc cccgagacct ggccaccttc ggggtctgca acaccggtga caggcaggct 541 gccttgccct ctctacggcg gctgggggcc tggctgcggg accctggggg gcagcgcctg 601 gtggtcctac acctggagga aggtatgtgg ggcccagccc caagcttggc accgccgtct 661 tccttcaggt gggccgggtc ctcctaggga agatcagggg ctggcagagc ccccaccctg 721 ggcagggagg ctgtggtctt gttcctagga ctgggttgcg ggtccgtggc ctggaaggtg 781 ggcaccacac tctgtcctgt ccccgaagcc cagctcttag acttgcccct gcctcggtgc 841 cagggagaga gctgctgcct tctccccacc cctgaagacg acgcagggct cggggccagt 901 ggaacccttc ttcccacagc cccagcctgt tctcagggcc gctggcctaa gatactccct 961 gcggggaagg ggcttcatcg ggcaccccaa cccagagacc ccagggcggc agccccaccc 1021 acagcctcag acgcagcccc tgcctgcccc tgccgtcacc gctccctggc tgcaggaagg 1081 cagctaagag gggcaccctt gtcccccgct tgaggtcccc tgcacagtgg ccagagcggc 1141 agggacagat cccaaagatt cccggggggt gtggccttca atggctcagg cgtcccctgc 1201 tgtcccggct gcagtgacct gggagccaac accctcgctg aggttccagg agcccccgcc 1261 tggaggagct ggccccccag agctggcgct gctggtgctg taccctgggc ctggccctga 1321 ggtcactgtg acgagggctg ggctgccggg tgcccaggta ccagggagtt gcatggggca 1381 gtgcccgggc cgtggcgggg ggcatgaatt tgttgcaggg tctgcagtac tgagaacagc 1441 gtagaaccag tggcgatggg aggaagggga ccggtagagc ggggctgggt aagcctccat 1501 ccagccgggc tgagccctgg tctccgcaga gcctctgccc ctcccgagac acccgctacc 1561 tggtgttagc ggtggaccgc cctgcggggg cctggcgcgg ctccgggctg gccttgaccc 1621 tgcagccccg cggagagggt aggtccgcgt ggagagggac ggggagccgg gtcgactgcc 1681 cccgggcccc cagcccctga gccagccgcg tgcccaccca ccgcagactc ccggctgagt 1741 accgcccggc tgcaggcact gctgttcggc gacgaccacc gctgcttcac acggatgacc 1801 ccggccctgc tcctgctgcc gcggtccgag cccgcgccgc tgcctgcgca cggccagctg 1861 gacaccgtgc ccttcccgcc gcccaggtgc gcgcaggcac cgggacacgg ggcaggagcg 1921 ggcgggggcg gcgtggcctc gtggccgctc tcaactcctc caattgcggg ttccaggcca 1981 tccgcggaac tcgaggagtc gccacccagc gcagacccct tcctggagac gctcacgcgc 2041 ctggtgcggg cgctgcgggt ccccccggcc cgggcctccg cgccgcgcct ggccctggat 2101 ccggacgcgc tggccggctt cccgcagggc ctagtcaacc tgtcggaccc cgcggcgctg 2161 gagcgcctac tcgacggcga ggagccgctg ctgctgctgc tgaggcccac tgcggccacc 2221 accggggatc ctgcgcccct gcacgacccc acgtcggcgc cgtgggccac ggccctggcg 2281 cgccgcgtgg ctgctgaact gcaagcggcg gctgccgagc tgcgaagcct cccgggtctg 2341 cctccggcca cagccccgct gctggcgcgc ctgctcgcgc tctgcccagg aggccccggc 2401 ggcctcggcg atcccctgcg agcgctgctg ctcctgaagg cgctgcaggg cctgcgcgtg 2461 gagtggcgcg ggcgggatcc gcgcgggccg ggtcgggcac agcgcagcgc gggggccacc 2521 gccgccgacg ggccgtgcgc gctgcgcgag ctcagcgtag acctccgcgc cgagcgctcc 2581 gtactcatcc ccgagaccta ccaggccaac aattgccagg gcgtgtgcgg ctggcctcag 2641 tccgaccgca acccgcgcta cggcaaccac gtggtgctgc tgctgaagat gcaggcccgt 2701 ggggccgccc tggcgcgccc accctgctgc gtgcccaccg cctacgcggg caagctgctc 2761 atcagcctgt cggaggaacg catcagcgcg caccacgtgc ccaacatggt ggccaccgag 2821 tgtggctgcc ggtgacccct gcgccgcgcg gactcctgcc ccgagggtcc ggacgcgccc 2881 cagctcgcgc cccttcccat atttattcgg accccaagca tcgccccaat aaagaccagc 2941 aagcaaccgg ctggggtgtc cgtgcgtgtt agggggcccg tgggacctcc cttgccgtct 3001 ctcctcgcgc acggcccggg tccgccctgt agcgctcgct gtctctcccc tgcctgaagc 3061 gccccaccac cgtctttcag gccccggact tggtgccggg // LOCUS HUMAPOE4 5515 bp DNA PRI 09-NOV-1994 DEFINITION Human apolipoprotein E (epsilon-4 allele) gene, complete cds. ACCESSION M10065 J03053 J03054 NID g178852 KEYWORDS Alu repeat; allelic variation; apolipoprotein; apolipoprotein E; lipoprotein; repeat region; very low density lipoprotein. SOURCE Human DNA [2], [1]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5515) AUTHORS Das,H.K., McPherson,J., Bruns,G.A., Karathanasis,S.K. and Breslow,J.L. TITLE Isolation, characterization, and mapping to chromosome 19 of the human apolipoprotein E gene JOURNAL J. Biol. Chem. 260 (10), 6240-6247 (1985) MEDLINE 85207610 REFERENCE 2 (bases 196 to 5269) AUTHORS Paik,Y.K., Chang,D.J., Reardon,C.A., Davies,G.E., Mahley,R.W. and Taylor,J.M. TITLE Nucleotide sequence and structure of the human apolipoprotein E gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82 (10), 3445-3449 (1985) MEDLINE 85216517 REFERENCE 3 (bases 1 to 5515) AUTHORS Emi,M., Wu,L.L., Robertson,M.A., Myers,R.L., Hegele,R.A., Williams,R.R., White,R. and Lalouel,J.M. TITLE Genotyping and sequence analysis of apolipoprotein E isoforms JOURNAL Genomics 3 (4), 373-379 (1988) MEDLINE 89212602 COMMENT [3] two allelic variations. Draft entry and computer-readable sequence for [3] kindly provided by M.Emi, 19-AUG-1988. Apolipoprotein E is a constituent of the human very low density lipoprotein in the plasma. There are at least six distinct phenotypes derived from the single E gene on chromosome 19; next to the epsilon-3 allele (see separate entry), the epsilon-4 allele, represented by the sequence below, is most common, the product difference being arginine in place of cysteine at residue 112 [2]. The gene structure of apo E is similar to that of other apo genes: presence of the 66-bp repeats in the fourth exon (starting at base 3782 below) makes the E gene highly similar to the A-I gene (see separate entry) as argued by [1]. A potential TATA box is found at positions 1014-1018, and a potential polyadenylation signal at 4616-4621. [2] and [1] had slight differences in the boundary positions for the Alu repeats and their flanks; the boundary positions indicated in [1] have been used in the FEATURES table below. Draft entries and clean copies were kindly supplied by J.M. Taylor, Gladstone Laboratories, San Francisco, and by J.P. Levine, Rockefeller University, New York. FEATURES Location/Qualifiers source 1..5515 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19q13.2" conflict 202..203 /citation=[2] /replace="" repeat_region complement(300..345) /note="direct repeat flanking Alu repeat 3' copy [1]" repeat_region complement(346..635) /note="Alu repeat [1]" repeat_region complement(636..680) /note="direct repeat flanking Alu repeat 5' copy [1]" conflict 963..967 /citation=[2] /replace="" exon 1047..1090 /note="apo E mRNA [2],[1]" /number=1 intron 1091..1847 /note="apo E mRNA intron 1 [2],[1]" conflict 1362..1363 /citation=[2] /replace="" conflict 1789..1793 /citation=[2] /replace="" sig_peptide join(1871..1913,3007..3017) exon 1871..1913 /partial /gene="APOE" /note="preapolipoprotein E; G00-119-691" /number=2 gene 1871..1913 /gene="APOE" CDS join(1871..1913,3007..3199,3781..4498) /note="precursor" /codon_start=1 /product="apolipoprotein E" /db_xref="PID:g178853" /translation="MKVLWAALLVTFLAGCQAKVEQAVETEPEPELRQQTEWQSGQRW ELALGRFWDYLRWVQTLSEQVQEELLSSQVTQELRALMDETMKELKAYKSELEEQLTP VAEETRARLSKELQAAQARLGADMEDVRGRLVQYRGEVQAMLGQSTEELRVRLASHLR KLRKRLLRDADDLQKRLAVYQAGAREGAERGLSAIRERLGPLVEQGRVRAATVGSLAG QPLQERAQAWGERLRARMEEMGSRTRDRLDEVKEQVAEVRAKLEEQAQQIRLQAEAFQ ARLKSWFEPLVEDMQRQWAGLVEKVQAAVGTSAAPVPSDNH" intron 1914..3006 /note="apo E cds intron 2 [2],[1]" repeat_region complement(2092..2104) /note="direct repeat flanking Alu repeat 3' copy [1]" repeat_region complement(2105..2429) /note="Alu repeat [1]" repeat_region complement(2430..2442) /note="direct repeat flanking Alu repeat 5' copy [1]" repeat_region 2520..2526 /note="direct repeat flanking Alu repeat 5' copy [1]" repeat_region 2527..2886 /note="Alu repeat [1]" conflict 2568..2569 /citation=[2] /replace="" repeat_region 2887..2893 /note="direct repeat flanking Alu repeat 3' copy [1]" conflict 2947..2948 /citation=[2] /replace="" conflict 2983..2984 /citation=[2] /replace="" exon 3007..3199 /number=2 mat_peptide join(3018..3199,3781..4495) /product="apolipoprotein E" allele 3182 /note="g in epsilon-4; a in epsilon-3 [2]" allele 3191 /note="g in epsilon-4; a in epsilon-3 [2]" intron 3200..3780 /note="apo E cds intron 3 [2],[1]" conflict 3568..3569 /citation=[2] /replace="" exon 3781..4640 /partial /note="preapolipoprotein E" /number=3 allele 3817..3819 /note="cta in [2] and 1 allele [3]; agc in other allele [3]" allele 3844..3846 /note="gac in [2] and 1 allele [3]; tgt in other allele [3]" allele 3932 /note="c in epsilon-4; t in epsilon-3 [2] (arg; cys)" allele 4342 /note="g in epsilon-4; a in epsilon-3 [2]" conflict 4708..4709 /citation=[2] /replace="" repeat_region complement(4761..4768) /note="direct repeat flanking Alu repeat 3' copy [1]" repeat_region complement(4769..5048) /note="Alu repeat [1]" repeat_region complement(5049..5056) /note="direct repeat flanking Alu repeat 5' copy [1]" BASE COUNT 1042 a 1667 c 1600 g 1206 t ORIGIN 201 bp upstream of BanII site on chromosome 19q12-q13.2. 1 ggaacttgat gctcagagag gacaagtcat ttgcccaagg tcacacagct ggcaactggc 61 agacgagatt cacgccctgg caatttgact ccagaatcct aaccttaacc cagaagcacg 121 gcttcaagcc ctggaaacca caatacctgt ggcagccagg gggaggtgct ggaatctcat 181 ttcacatgtg gggagggggc tcctgtgctc aaggtcacaa ccaaagagga agctgtgatt 241 aaaacccagg tcccatttgc aaagcctcga cttttagcag gtgcatcata ctgttcccac 301 ccctcccatc ccacttctgt ccagccgcct agccccactt tctttttttt ctttttttga 361 gacagtctcc ctcttgctga ggctggagtg cagtggcgag atctcggctc actgtaacct 421 ccgcctcccg ggttcaagcg attctcctgc ctcagcctcc caagtagcta ggattacagg 481 cgcccgccac cacgcctggc taacttttgt atttttagta gagatggggt ttcaccatgt 541 tggccaggct ggtctcaaac tcctgacctt aagtgattcg cccactgtgg cctcccaaag 601 tgctgggatt acaggcgtga gctaccgccc ccagcccctc ccatcccact tctgtccagc 661 cccctagccc tactttcttt ctgggatcca ggagtccaga tccccagccc cctctccaga 721 ttacattcat ccaggcacag gaaaggacag ggtcaggaaa ggaggactct gggcggcagc 781 ctccacattc cccttccacg cttggccccc agaatggagg agggtgtctg tattactggg 841 cgaggtgtcc tcccttcctg gggactgtgg ggggtggtca aaagacctct atgccccacc 901 tccttcctcc ctctgccctg ctgtgcctgg ggcaggggga gaacagccca cctcgtgact 961 gggctgccca gcccgcccta tccctggggg agggggcggg acagggggag ccctataatt 1021 ggacaagtct gggatccttg agtcctactc agccccagcg gaggtgaagg acgtccttcc 1081 ccaggagccg gtgagaagcg cagtcggggg cacggggatg agctcagggg cctctagaaa 1141 gagctgggac cctgggaagc cctggcctcc aggtagtctc aggagagcta ctcggggtcg 1201 ggcttgggga gaggaggagc gggggtgagg caagcagcag gggactggac ctgggaaggg 1261 ctgggcagca gagacgaccc gacccgctag aaggtggggt ggggagagca gctggactgg 1321 gatgtaagcc atagcaggac tccacgagtt gtcactatca ttatcgagca cctactgggt 1381 gtccccagtg tcctcagatc tccataactg gggagccagg ggcagcgaca cggtagctag 1441 ccgtcgattg gagaacttta aaatgaggac tgaattagct cataaatgga acacggcgct 1501 taactgtgag gttggagctt agaatgtgaa gggagaatga ggaatgcgag actgggactg 1561 agatggaacc ggcggtgggg agggggtggg gggatggaat ttgaaccccg ggagaggaag 1621 atggaatttt ctatggaggc cgacctgggg atggggagat aagagaagac caggagggag 1681 ttaaataggg aatgggttgg gggcggcttg gtaaatgtgc tgggattagg ctgttgcaga 1741 taatgcaaca aggcttggaa ggctaacctg gggtgaggcc gggttggggg cgctgggggt 1801 gggaggagtc ctcactggcg gttgattgac agtttctcct tccccagact ggccaatcac 1861 aggcaggaag atgaaggttc tgtgggctgc gttgctggtc acattcctgg caggtatggg 1921 ggcggggctt gctcggttcc ccccgctcct ccccctctca tcctcacctc aacctcctgg 1981 ccccattcag acagaccctg ggccccctct tctgaggctt ctgtgctgct tcctggctct 2041 gaacagcgat ttgacgctct ctgggcctcg gtttccccca tccttgagat aggagttaga 2101 agttgttttg ttgttgttgt ttgttgttgt tgttttgttt ttttgagatg aagtctcgct 2161 ctgtcgccca ggctggagtg cagtggcggg atctcggctc actgcaagct ccgcctccca 2221 ggtccacgcc attctcctgc ctcagcctcc caagtagctg ggactacagg cacatgccac 2281 cacacccgac taactttttt gtattttcag tagagacggg gtttcaccat gttggccagg 2341 ctggtctgga actcctgacc tcaggtgatc tgcccgtttc gatctcccaa agtgctggga 2401 ttacaggcgt gagccaccgc acctggctgg gagttagagg tttctaatgc attgcaggca 2461 gatagtgaat accagacacg gggcagctgt gatctttatt ctccatcacc cccacacagc 2521 cctgcctggg gcacacaagg acactcaata catgcttttc cgctgggccg gtggctcacc 2581 cctgtaatcc cagcactttg ggaggccaag gtgggaggat cacttgagcc caggagttca 2641 acaccagcct gggcaacata gtgagaccct gtctctacta aaaatacaaa aattagccag 2701 gcatggtgcc acacacctgt gctctcagct actcaggagg ctgaggcagg aggatcgctt 2761 gagcccagaa ggtcaaggtt gcagtgaacc atgttcaggc cgctgcactc cagcctgggt 2821 gacagagcaa gaccctgttt ataaatacat aatgctttcc aagtgattaa accgactccc 2881 ccctcaccct gcccaccatg gctccaaaga agcatttgtg gagcaccttc tgtgtgcccc 2941 taggtagcta gatgcctgga cggggtcaga aggaccctga cccgaccttg aacttgttcc 3001 acacaggatg ccaggccaag gtggagcaag cggtggagac agagccggag cccgagctgc 3061 gccagcagac cgagtggcag agcggccagc gctgggaact ggcactgggt cgcttttggg 3121 attacctgcg ctgggtgcag acactgtctg agcaggtgca ggaggagctg ctcagctccc 3181 aggtcaccca ggaactgagg tgagtgtccc catcctggcc cttgaccctc ctggtgggcg 3241 gctatacctc cccaggtcca ggtttcattc tgcccctgtc gctaagtctt ggggggcctg 3301 ggtctctgct ggttctagct tcctcttccc atttctgact cctggcttta gctctctgga 3361 attctctctc tcagctttgt ctctctctct tcccttctga ctcagtctct cacactcgtc 3421 ctggctctgt ctctgtcctt ccctagctct tttatataga gacagagaga tggggtctca 3481 ctgtgttgcc caggctggtc ttgaacttct gggctcaagc gatcctcccg cctcggcctc 3541 ccaaagtgct gggattagag gcatgagcac cttgcccggc ctcctagctc cttcttcgtc 3601 tctgcctctg ccctctgcat ctgctctctg catctgtctc tgtctccttc tctcggcctc 3661 tgccccgttc cttctctccc tcttgggtct ctctggctca tccccatctc gcccgcccca 3721 tcccagccct tctcccccgc ctccccactg tgcgacaccc tcccgccctc tcggccgcag 3781 ggcgctgatg gacgagacca tgaaggagtt gaaggcctac aaatcggaac tggaggaaca 3841 actgaccccg gtggcggagg agacgcgggc acggctgtcc aaggagctgc aggcggcgca 3901 ggcccggctg ggcgcggaca tggaggacgt gcgcggccgc ctggtgcagt accgcggcga 3961 ggtgcaggcc atgctcggcc agagcaccga ggagctgcgg gtgcgcctcg cctcccacct 4021 gcgcaagctg cgtaagcggc tcctccgcga tgccgatgac ctgcagaagc gcctggcagt 4081 gtaccaggcc ggggcccgcg agggcgccga gcgcggcctc agcgccatcc gcgagcgcct 4141 ggggcccctg gtggaacagg gccgcgtgcg ggccgccact gtgggctccc tggccggcca 4201 gccgctacag gagcgggccc aggcctgggg cgagcggctg cgcgcgcgga tggaggagat 4261 gggcagccgg acccgcgacc gcctggacga ggtgaaggag caggtggcgg aggtgcgcgc 4321 caagctggag gagcaggccc agcagatacg cctgcaggcc gaggccttcc aggcccgcct 4381 caagagctgg ttcgagcccc tggtggaaga catgcagcgc cagtgggccg ggctggtgga 4441 gaaggtgcag gctgccgtgg gcaccagcgc cgcccctgtg cccagcgaca atcactgaac 4501 gccgaagcct gcagccatgc gaccccacgc caccccgtgc ctcctgcctc cgcgcagcct 4561 gcagcgggag accctgtccc cgccccagcc gtcctcctgg ggtggaccct agtttaataa 4621 agattcacca agtttcacgc atctgctggc ctccccctgt gatttcctct aagccccagc 4681 ctcagtttct ctttctgccc acatactgcc acacaattct cagccccctc ctctccatct 4741 gtgtctgtgt gtatctttct ctctgccctt tttttttttt tagacggagt ctggctctgt 4801 cacccaggct agagtgcagt ggcacgatct tggctcactg caacctctgc ctcttgggtt 4861 caagcgattc tgctgcctca gtagctggga ttacaggctc acaccaccac acccggctaa 4921 tttttgtatt tttagtagag acgagctttc accatgttgg ccaggcaggt ctcaaactcc 4981 tgaccaagtg atccacccgc cggcctccca aagtgctgag attacaggcc tgagccacca 5041 tgcccggcct ctgcccctct ttctttttta gggggcaggg aaaggtctca ccctgtcacc 5101 cgccatcaca gctcactgca gcctccacct cctggactca agtgataagt gatcctcccg 5161 cctcagcctt tccagtagct gagactacag gcgcatacca ctaggattaa tttggggggg 5221 ggtggtgtgt gtggagatgg ggtctggctt tgttggccag gctgatgtgg aattcctggg 5281 ctcaagcgat actcccacct tggcctcctg agtagctgag actactggct agcaccacca 5341 cacccagctt tttattatta tttgtagaga caaggtctca atatgttgcc caggctagtc 5401 tcaaacccct ggctcaagag atcctccgcc atcggcctcc caaagtgctg ggattccagg 5461 catgggctcc gagcggcctg cccaacttaa taatattgtt cctagagttg cactc // LOCUS HUMREGB 4251 bp DNA PRI 15-SEP-1990 DEFINITION Human regenerating protein (reg) gene, complete cds. ACCESSION J05412 NID g190980 KEYWORDS pancreatic stone protein; pancreatic thread protein; regenerating protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4251) AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H. TITLE Complete nucleotide sequence of the human reg gene and its expression in normal and tumoral tissues: The reg protein, pancreatic stone protein, and pancreatic thread protein are one and the same product of the gene JOURNAL J. Biol. Chem. 265, 7432-7439 (1990) MEDLINE 90237042 COMMENT Draft entry and printed sequence for [1] kindly submitted by H.Okamoto, 23-FEB-1990. FEATURES Location/Qualifiers source 1..4251 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 1169..1174 prim_transcript 1196..4116 /note="reg mRNA and introns" intron 1224..1524 /note="reg intron A" CDS join(1571..1634,2270..2388,2696..2833,3549..3660, 3856..3923) /note="regenerating protein (reg)" /codon_start=1 /db_xref="PID:g190981" /translation="MAQTSSYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYR SYCYYFNEDRETWVDADLYCQNMNSGNLVSVLTQAEGAFVASLIKESGTDDFNVWIAL HDPKKNRRWHWSSGSLVSYKSWGIGAPSSVNPGYCVSLTSSTGFQKWKDVPCEDKFSF VCKFKN" exon <1571..1634 /note="regenerating protein (reg), (first expressed exon)" /number=2 intron 1635..2269 /note="reg intron B" exon 2270..2388 /note="regenerating protein" /number=3 intron 2389..2695 /note="reg intron C" exon 2696..2833 /note="regenerating protein" /number=4 intron 2834..3548 /note="reg intron D" exon 3549..3660 /note="regenerating protein" /number=5 intron 3661..3855 /note="reg intron E" exon 3856..>3923 /note="regenerating protein" BASE COUNT 1161 a 927 c 869 g 1294 t ORIGIN 1 gaattcctgg gctcaagtga tcctctcatg tcagtctccc aaagtgctgg gatgacaggc 61 ttgagccacc acaccaggcc catcatcagt ttttatataa agaaaaaaaa accttaaaat 121 tgttaggcaa atactatgac aaattgtaat atatattctt acatttcaga tttttatttt 181 ttaaactgta taagaattga ttaataaata aaatttagta ttaatctgtc ttttaaaacc 241 atatataaag tttatcaaat agcttataac ttcttgcaac tgaatttttg tattcaatgt 301 tatggctttg atactagtcc aagttgaaat atagatatct actttattcg atttaaattc 361 tgtttagtat tttattatat tttgttaatc catttgtccc aattcatata cttatctctc 421 tttctgtgaa tattcaggtt agttttttct tcctaatttt gcattctgat tggcttttat 481 tccctgaatt ataaatgact attctatgat gattctggta aatactcaat ttcaccacac 541 aatctttgac ttcatactaa caaacagttg acttcaaatg gacaatttca atgaaggctg 601 acttcatatt tagctccttt aagcttcctt aggcatcagc tctctacaat tctcacattg 661 agaatatgtg tattttgtta gctcaaacct tgttagacat gttaaatgtt tagaaatata 721 aatttaacct accccttgag gtaggtcttg agaggtttgt gagcctaaaa agacatggag 781 gaaccactta ttgccacaag cacattgttc taaattattt ggaatcagtt aattcttccc 841 catctcctac ccatgcctga caccaaagag gagcctctaa atttacaggg aatacaagga 901 agtctactgt tctctgctcc tctctgggtt attagggcac atgggagccc tcagttgttt 961 tctgctgagc aagagcaaag tccaccttgg acttagacag cttgccaaat tttttgccag 1021 aaggggacct gagttgtgac cactcccagt gtgtgccggg aaaaggctca tactggtgcc 1081 agaatctctt actgtcaatg ctcccaaaac tcaccgcttg cccccacccc ttttgcttaa 1141 atgacgtggt tcttatctca gatcctgata taaagctcct acagctacct ggcctgagaa 1201 gccaactcag actcagccaa caggtaagtg ggcattacag gagaagggcg tctctaacat 1261 gcactgtaga tctaaaatct tcgggaagat acagcatgag tttctgtcca agaggtttta 1321 gctgtaagga agcctcagtg ggatccaaag ttgtttttca gttactgagt ctgtataatc 1381 cccactctca agagaaacat ttgaaggtgt gggtgtctca gaggaccttc ctggtctcag 1441 aaattctgag aggaggtttt aaggaaggta ataggtgctt tgctctccat ctctcagaac 1501 ccccttctct gtgttctcct atagagattg ttgatttgcc tcttaagcaa gagattcatt 1561 gcagctcagc atggctcaga ccagctcata cttcatgctg atctcctgcc tgatgtttct 1621 gtctcagagc caaggtaaga tctcttttcc accaaccaac tctttctagc cctgaagact 1681 tcactctatc cccaagcata cgggtctact tgaaaaaaaa aaaaaagcag agtcactgtt 1741 aagggttgtt ttgtggtgtt tagtgatctt tattgcttat ctcttcacat ttatatacat 1801 ccacacctca ttaaggagtt ggagctagaa tttaaaatga ccccttataa gcaactgctg 1861 cagctggcat gagtttatct gattaaattt atacgtgatg gtggatttgg ggatgtctgt 1921 gtgtagacag tcactaatgg ggtggagaac tgaagagagc cttgtgttca gggaaaccaa 1981 gtcaggcttg agaaagtaga aggctgagtc cttcaaggta gaagagcctg agctccagac 2041 ataaaaggga aactggagac ttgtttcttt ggcctattca ttctgttttt tttcccctga 2101 tcaaagaaac caaagacaga agatgtagga tgcaggagca atagtgagca gtcatcccat 2161 aatagactgg attcttctgt ttctataaag gaacctcaga agctcttacc tcaccttcaa 2221 gccttttcct taccctgaga gcctccttta attgtctctt ctttttcagg ccaagaggcc 2281 cagacagagt tgccccaggc ccggatcagc tgcccagaag gcaccaatgc ctatcgctcc 2341 tactgctact actttaatga agaccgcgag acctgggttg atgcagatgt gagtgaggag 2401 agcagtgtgg gaagggagac tcatgaaggg aggggaagct gccactctcc agtgtgttca 2461 gtggctgcaa tgagatgaga ctgaacccct tgctatacta tcatcagccc caaactttcc 2521 aatctacttt atcccattat tcagcacatt cccagcacaa agaacctggt ggtcagtgac 2581 agcatcatca cggacattac tctgctgtcc tttttctgac ccgtcctctt ggaggactca 2641 gtatatccgt cacaacttcc tcctccactg agtgctccat tttcttctgc aacagctcta 2701 ttgccagaac atgaattcgg gcaacctggt gtctgtgctc acccaggccg agggtgcctt 2761 tgtggcctca ctgattaagg agagtggcac tgatgacttc aatgtctgga ttgccctcca 2821 tgaccccaaa aaggtaggct gcagccttct ttatctccta atgatcaggt ttgagaagta 2881 agaaggaggt tcaagttctg gtctcttaag taccagcttt tatcgctttc cagaaatcag 2941 gctgtttaca gatcctctaa tgtcctgtgt agcaaggtgc actgtagatg attggagata 3001 taagtggaag gctgaatttc ctaggtgttc ttgtcattca tgaataaact tattctgttt 3061 tcagtcaaca aagcatcttt atgcaccaac ttcttaccta ttttgttact gtcagagtca 3121 caagagagac tagattgccg actatataag aaaggagact tgtggtaaaa atctgctgct 3181 gtactgctgg catttgggaa cctggtagta tactaaataa tataatatat caacaactaa 3241 tggtcagcca atgctatgct ggatatgagg gtcctgggcc acaaagacaa aaaatcagga 3301 accacttttt aagtgagata ctttgggtct ctgtcaaatt cataacactt atttcttggt 3361 ggaatacagt taatgagttg gacagttcag gaaagaagtt tagagcaata gcaaaggaaa 3421 ggaaacaata tttagcaagg tttattcttc ctttgtgtct tagcatgttt ctgagtgtgc 3481 acacaggccc agtgattcca tgtatttttg agtgaccact gcctctgttc tggcccttcc 3541 ccatctagaa ccgccgctgg cactggagca gtgggtccct ggtctcctac aagtcctggg 3601 gcattggagc cccaagcagt gttaatcctg gctactgtgt gagcctgacc tcaagcacag 3661 gtgagaggca gagaatccat ccacctgttt ctgttctctc ctgcttagct ccagggatgg 3721 aactgggact gggatagagg aaaggtgaac tcctcattaa ggaaatggat gtttggtttt 3781 tgtcctgagt cctaaagcca ggagggtcat actctttcgg gtctcccagt tgtaactctt 3841 ctcattgact tataggattc cagaaatgga aggatgtgcc ttgtgaagac aagttctcct 3901 ttgtctgcaa gttcaaaaac tagaggcagc tggaaaatac atgtctagaa ctgatccagc 3961 aattacaacg gagtcaaaaa ttaaaccgga ccatctctcc aactcaactc aacctggaca 4021 ctctcttctc tgctgagttt gccttgttaa tcttcaatag ttttacctac cccagtcttt 4081 ggaaccctaa ataataaaaa taaacatgtt tccactattg tgctgtctta ctgtgtctgc 4141 tatttccaca gctgatgcct gggtggttga gatgagagtg attacaacaa agcttgctct 4201 ggcctatcca cttcttaaaa gtccatccgc ataccatgca tattggaatt c // LOCUS HUMPROLA 1404 bp DNA PRI 19-MAY-1995 DEFINITION Human cathepsin L gene, complete cds. ACCESSION M20496 NID g809235 KEYWORDS cathepsin L; collagenolytic lysosomal enzyme; elastinolytic lysosomal enzyme. SOURCE Human kidney, cDNA to mRNA, clone SL12.1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1404) AUTHORS Joseph,L.J., Chang,L.C., Stamenkovich,D. and Sukhatme,V.P. TITLE Complete nucleotide and deduced amino acid sequences of human and murine preprocathepsin L. An abundant transcript induced by transformation of fibroblasts JOURNAL J. Clin. Invest. 81 (5), 1621-1629 (1988) MEDLINE 88213715 COMMENT On May 19, 1995 this sequence version replaced gi:190417. FEATURES Location/Qualifiers source 1..1404 /organism="Homo sapiens" /db_xref="taxon:9606" /map="9q21-q22" sig_peptide 134..184 /gene="CTSL" /note="cathepsin L signal peptide; G00-119-824" CDS 134..1135 /gene="CTSL" /note="preprocathepsin L precursor" /codon_start=1 /db_xref="GDB:G00-119-824" /db_xref="PID:g190418" /translation="MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNE EGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRK GKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYS VANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDH GVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV" gene 134..1135 /gene="CTSL" mat_peptide 473..1132 /gene="CTSL" /note="cathepsin L; G00-119-824" BASE COUNT 384 a 266 c 374 g 380 t ORIGIN 421 bp upstream of EcoRI site. 1 acctccacgt gccctgtttt tctggaggca catccttggc ctcttccaca gtccttgggt 61 aaatgcttgg gagaataatt taaatatttt tattctacca tggtggccct aatttttcag 121 ggggcagtaa gatatgaatc ctacactcat ccttgctgcc ttttgcctgg gaattgcctc 181 agctactcta acatttgatc acagtttaga ggcacagtgg accaagtgga aggcgatgca 241 caacagatta tacggcatga atgaagaagg atggaggaga gcagtgtggg agaagaacat 301 gaagatgatt gaactgcaca atcaggaata cagggaaggg aaacacagct tcacaatggc 361 catgaacgcc tttggagaca tgaccagtga agaattcagg caggtgatga atggctttca 421 aaaccgtaag cccaggaagg ggaaagtgtt ccaggaacct ctgttttatg aggcccccag 481 atctgtggat tggagagaga aaggctacgt gactcctgtg aagaatcagg gtcagtgtgg 541 ttcttgttgg gcttttagtg ctactggtgc tcttgaagga cagatgttcc ggaaaactgg 601 gaggcttatc tcactgagtg agcagaatct ggtagactgc tctgggcctc aaggcaatga 661 aggctgcaat ggtggcctaa tggattatgc tttccagtat gttcaggata atggaggcct 721 ggactctgag gaatcctatc catatgaggc aacagaagaa tcctgtaagt acaatcccaa 781 gtattctgtt gctaatgaca ccggctttgt ggacatccct aagcaggaga aggccctgat 841 gaaggcagtt gcaactgtgg ggcccatttc tgttgctatt gatgcaggtc atgagtcctt 901 cctgttctat aaagaaggca tttattttga gccagactgt agcagtgaag acatggatca 961 tggtgtgctg gtggttggct acggatttga aagcacagaa tcagataaca ataaatattg 1021 gctggtgaag aacagctggg gtgaagaatg gggcatgggt ggctacgtaa agatggccaa 1081 agaccggaga aaccattgtg gaattgcctc agcagccagc taccccactg tgtgagctgt 1141 ggacggtgat gaggaaggac ttgactgggg atggcgcatg catgggagga attcttcagt 1201 ctaccagccc ccgctgtgtc ggatacacac tcgaatcatt gaagatccga gtgtgatttg 1261 aattctgtga tattttcaca ctggtaaatg ttacctctat tttaattact gctataaata 1321 ggtttatatt attgattcac ttactgactt tgcattttcg tttttaaaag gatgtataaa 1381 tttttacctg tttaaataaa atcg // LOCUS HSU29874 6155 bp DNA PRI 29-FEB-1996 DEFINITION Human Flt3 ligand gene and Flt3 ligand alternatively spliced isoform gene, complete cds. ACCESSION U29874 NID g1072036 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6155) AUTHORS Lyman,S.D., Stocking,K., Davison,B., Fletcher,F., Johnson,L. and Escobar,S. TITLE Structural analysis of human and murine flt3 ligand genomic loci JOURNAL Oncogene 11 (6), 1165-1172 (1995) MEDLINE 96032581 REFERENCE 2 (bases 1 to 6155) AUTHORS Lyman,S.D. TITLE Direct Submission JOURNAL Submitted (21-JUN-1995) Stewart D. Lyman, Molecular Biology, Immunex Corporation, 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..6155 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19q13.3" /chromosome="19" mRNA join(<1..112,1131..1241,1586..1639,1864..2007,4305..4443, 5085..5180,5693..5871,5947..6015) /product="Flt3 ligand" mRNA join(<1..112,1131..1241,1586..1639,1864..2007,4305..4443, 5693..5871,5947..6015) /product="Flt3 ligand" CDS join(80..112,1131..1241,1586..1639,1864..2007,4305..4443, 5693..5871,5947..5994) /codon_start=1 /product="Flt3 ligand" /db_xref="PID:g1072037" /translation="MTVLAPAWSPTTYLLLLLLLSSGLSGTQDCSFQHSPISSDFAVK IRELSDYLLQDYPVTVASNLQDEELCGGLWRLVLAQRWMERLKTVAGSKMQGLLERVN TEIHFVTKCAFQPPPSCLRFVQTNISRLLQETSEQLVALKPWITRQNFSRCLELQCQP DSSTLPPPWSPRPLEATAPTAPQPPLLLLLLLPVGLLLLAAAWCLHWQRTRRRTPRPG EQVPPVPSPQDLLLVEH" intron 6016..>6155 BASE COUNT 1447 a 1600 c 1907 g 1201 t ORIGIN 1 ggggcatgag ggtccgagac ttgttcttct gtcccttcca agacccggcg acaggaggca 61 tgaggggccc ccggccgaaa tgacagtgct ggcgccagcc tggagcccaa cagtgcgtaa 121 accccaggga caagatcagg ggagagggga ggcacaatgt caggatgggg cagagatgag 181 gggagatgga cgggagaaca gatggacaga tgacgaggaa ataggagggg agatggacag 241 atgtgagggg agatggatag gagaggagac ggacagagga gggggagatg gacagaggat 301 ggggagatgg acagaggagg gggagataga ggagagggag attgagagga gtgggagatg 361 gacaggaggg gggagattga cagaggaggg ggagatggac agaggagagg gcagatgaac 421 agaggagagg gagataaaga ggagggaggt ggacaggagg agggagatgg acagagaagg 481 gggagatgga cagaggtggg ggagatggac aggagggaag atggagagga gggggtatgg 541 acagaggaga ggggagatgg actggagggg gagatggtca gaggagaggg agatgaagag 601 gagggggagg tggacaggag gagggagacg gacagaggag aggaagagag acagaggtgg 661 ggggagatgg acagaggagg gggagatgga cagaaaaggg gggagatgga cagaggagag 721 ggaaggtgga cagagccaag aacaaatgaa gaggacgtgg accaagatca agagagaagc 781 aggcaacagt ggtgtagaag gcagagggag ggacagagct ggaggaaccc gggcgaggaa 841 accagacgtg aagatgaggt ggttggagag agaccagcag agtgggggag atggggagag 901 agaggggtgg ggcagagggg gatgcaaact ggacagcatt ggaccagagg cagagagaaa 961 ccgggaaaga caggcagaga tgggcccatg tctccagaaa gtgtggaaga ggcagaagga 1021 cacccaggga aggaggagcg gtgaagacag aacagtacag gtgggaagcc cggaggaggg 1081 ggctgtgtgt ggaacagcag agggctcccc cagcacccgc tcccctgcag acctatctcc 1141 tcctgctgct gctgctgagc tcgggactca gtgggaccca ggactgctcc ttccaacaca 1201 gccccatctc ctccgacttc gctgtcaaaa tccgtgagct ggtgagcggc gctgccccgg 1261 accccctcat gtgatccccc ttccccccac tttttttttt taagtagaga tggggtctct 1321 ctccctgtgt ttcccagggt gggcttgaac tcctgggatc aagagatcca cccaccttgg 1381 tctcccaaag tgctggaatt acaggagtga gacactgtgc ttaggggtgt catccctttt 1441 taagggcaag gttctgtggc ttcttctggg ctccccctct cttggtcttg tccctctctc 1501 tctggatctc tgctgccacc tctgggtccc cacagttctg tttctcgctg ttttcagcca 1561 ggcctgatcc tgttttctcc cgcagtctga ctacctgctt caagattacc cagtcaccgt 1621 ggcctccaac ctgcaggacg taagtcatgt tgggagggac ctgggatgga ggtggggacc 1681 acagactcaa gatgctccac cgaggcgagt ggataaccag gccctcccct ccccaaaccc 1741 aggaatcaga gtcctcagcc cctcctccct cagacccagg agccccggcc cagcccctcc 1801 tccctcagac ccagcagccc cgtgcccagc tcctccctca gacccgtggg ttctcccctc 1861 taggaggagc tctgcggggg cctctggcgg ctggtcctgg cacagcgctg gatggagcgg 1921 ctcaagactg tcgctgggtc caagatgcaa ggcttgctgg agcgcgtgaa cacggagata 1981 cactttgtca ccaaatgtgc ctttcaggtc agccctcaac ttaggggaca agtgagggga 2041 gggagatgtc ttcctacgaa ttagaagtaa agctccacta ggccttattg gcgatttgga 2101 ccatagccac ccaacgaagg tagagcgaga agtgccaccc tggagagccc tgttcctaca 2161 gaacaacacg tccccaggca ccggtgatgg ggagcagtct ggtcccattc tggggccccg 2221 gtttcctagg ccatgatgaa gggtgccact gaggggttct tcccccaaaa aaaaacaggg 2281 cagagaaggg gtctctaaac tgaggaggcc gggcgtggtg gctcactcct gtaatcctag 2341 cactttggga ggctgaggtg ggcggatcac ttgaggtcag gagtttgaga ccagcctggc 2401 caacatgatg aaatcccggc tgtactaaaa atacaaaaat tagccgggca tggtggctca 2461 ggaggctgag gcacaggaat cgcttgaacc cgggagccag aggctgcagt gagccgagat 2521 catgccactg cactccagcc tgggagacag agtgaaactg tctcaaaaac aaaacaaaca 2581 aacaaaaacc tctctctgag ggctgggtgc agtggctcac acctgtaatc ccagcacttt 2641 cggaggccaa ggtgggagga ttatttgagc cctggtgttc aagatcagtc cgggtatcac 2701 agtgaaacct catctctgaa caaaaataaa aataaattag ccaggatggt ggtgcccacc 2761 tgtggtccca gactactcag aaggctgagg tgggagggat cacttgagac ctggaggtcg 2821 acgctgctat gagttctgat tgggacactg cactccagct tgagcgacag agcaagacct 2881 gtctcaaaac atagaatagg ccgtgtatgg tggctcacga ctgtaatccc agcactttgg 2941 gaggctgagg cgggtggatt gcctgagctc aggagttcga gaccagcctc ggcaacgtga 3001 tgaaaaccat ctctactaaa atacaaaaac aaaattagcc aggcatggtg gtgggcacct 3061 gtagtcccag ctacttggga gactgaggca ggagaattgc ttgaacccag gaggcagagg 3121 ttgcagtgag ccgagatcac accactgccc tccagcctgg gcgacagagc aagactccat 3181 ctccaaaaaa ataaaataaa ataaaaggct gggtacagtg gctcacgcct gtaatcccag 3241 cactttggga gcccgaggcg ggcagatcac gaggtcagga gtttgagacc accctggcca 3301 atgtggtgaa accccgtctc tactaaaaat acaaaaatta gctgggcatg gtggcgcgcg 3361 cctgtagtcc cagctactca ggaggctgag gcagaattcc ttgaacccgg gaggtagaag 3421 ttgcagtaag ccgagatcgt gccactgtac tccagccagg gtgacagagc aagactctgt 3481 ccccaaaaaa taaataaata ataaagtaac tttgggaggc cgaggcgggc gactcacctg 3541 aggtcagagt ccgagaccag cctggacaac atagagaaac cccgtttcta ctaaaaatat 3601 aaataaataa ataaataagt aagtaggctg gggacggagg ttcacgcctg ttttcccaca 3661 ctttgggggg ccgaggcggg cggatcacaa ggtcaggaga tcaagaccat cctgggtaac 3721 acagtgaaac cccatctcta ctaaaaatac aaacaattag ccgggggtgg tggcaggggc 3781 ctgtagtccc aactacctgg gaggttgagg caggtggttg gggcaaccca ggaggtggag 3841 cttgcagaga gccgcgattg cgccactgtc ctccagcctg ggcaacagag cgacacggtg 3901 gctcatgcct acaatcccag cactttggga ggccaaggtg ggcagatcac ctgaggtcag 3961 gagttcgaga ccagcctggc caacatggag aaaccccgtc tctactacaa atacaaaaaa 4021 tagctgggca tggtggtggg cacctatagt cccaaccact gaggaggctg aggcaggaga 4081 atcacttgaa cctgggaggc agaggttgca tttcaccact ccagcctggg caacagagtg 4141 agactctatc tcaaaaaaaa attaattaat taaataaata aactctgaga gccagagctc 4201 actgggccct gttgctgggc atcgccagca gggaagggcc tttggcctgc ggaggggcgg 4261 tggggggatg acgtggtggt gacgtctccc tcccctgctc ccagcccccc cccagctgtc 4321 ttcgcttcgt ccagaccaac atctcccgcc tcctgcagga gacctccgag cagctggtgg 4381 cgctgaagcc ctggatcact cgccagaact tctcccggtg cctggagctg cagtgtcagc 4441 ccggtaaagg ttccaggcac ccccactcct tcccctcctg tcctcacggc cgctcctcct 4501 ctctgcacag tgcatcccag accccatctt tctcatattg gttgtgacaa gggcaagctt 4561 attcctcttt ctggagctca gtttaccaat tttttttttt tttttttgag acggagtctc 4621 gcactgtcgc ccaggctgga gtgcagtggc gtgatcttgg ctcactgcaa gctccgcctc 4681 tcgggttcat gccattctcc tgcctcagcc tcccaagtag ctgggaccac aggcgcccgc 4741 caccacaccc ggctaaattt ttgtattttt agtagagacg gggttttacc gagttaaacc 4801 aggatggtct cgatctcctg accttgtgat ccacctgcct tggcctccca aagtgctggg 4861 attacaggcg tgagccacag tgcccagcct accttttttt tttttttttt tttgagatgg 4921 agtcttgctc tgacccccag gctggagtgc aggggtgcta tctcggctca ctgcaagctc 4981 tgcctcctgg gttcacgcca ttctcttgcc tcagcctccc cagcagctgg gactacaggc 5041 gcctgccacc tcaacgcggc taattttttt ttttgtattt ttagtagaga cggtgtttca 5101 ccgtgtcagc caggatggtc tcgatctcct gacctcgtga tctgcccgcc tcggcctccc 5161 aaagtgctag gattacagat gtgagccacc gcgcccagcc tattgttttt tttttctaag 5221 tcggagtctt gctctgtcgc ccaggctgga gtgcagtggc gcaatcttgg ctcgctgcaa 5281 cctccatctc cagggttcaa gcgaatcttc tgcctcagcc ttccgagtac ctgggattac 5341 agatgcgcat accatgcctg gctagttttt gtgtttttag tagagatggg ttttcaccat 5401 gttggccagg ctggtcttga actcctgacc tcaagtgatc cacccacctt ggcctcccaa 5461 agtgctggga ttataggcgt gagccaccgc gctcagccta ttcactcatt taatttgtga 5521 cagtctgatg aggtaggtac aatgattatc ctagttttac agatgaccaa actgaggcac 5581 agagaggcca agcagcccat ccaaggtcac acagccagtg gcagccaggc ctctctttcc 5641 ttccttaccc cagcccttct ccttggtcac ccagcctcct ctttctcccc agactcctca 5701 accctgccac ccccatggag tccccggccc ctggaggcca cagccccgac agccccgcag 5761 ccccctctgc tcctcctact gctgctgccc gtgggcctcc tgctgctggc cgctgcctgg 5821 tgcctgcact ggcagaggac gcggcggagg acaccccgcc ctggggagca ggtgagcagg 5881 ctgggaagag ggggtgaggg ggccgagagg gtggcccact tgtggctgac actttggggt 5941 ccacaggtgc cccccgtccc cagtccccag gacctgctgc ttgtggagca ctgacctggc 6001 caaggcctca tcctggtgac tccttcctgg gctatggggc ctggactttg tgtctgcagg 6061 ttgggagggt cactaggagg ccatggaagg gtggtgagca gtggaggggc aggggcagcc 6121 caggtgcaga aagacccctc tggggccagg cgcgg // LOCUS HSA6693 3448 bp DNA PRI 13-JUN-1998 DEFINITION Homo sapiens UHS KerA gene. ACCESSION AJ006693 NID g3228238 KEYWORDS UHS Ker A gene; ultra high sulfer keratin. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3448) AUTHORS Perez,C. and Egly,J.M. TITLE Genomic organization and promoter characterization of two human UHS keratin genes JOURNAL Unpublished REFERENCE 2 (bases 1 to 3448) AUTHORS Perez,C. TITLE Direct Submission JOURNAL Submitted (10-JUN-1998) Perez C., Jean-Marc EGLY, IGBMC, 1 Rue Laurent Fries, F-67404 Illkirch Cedex, FRANCE COMMENT Related sequences: X55293, X55294. FEATURES Location/Qualifiers source 1..3448 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="placenta" TATA_signal 770..775 /gene="UHS KerA" gene 770..1367 /gene="UHS KerA" CDS 858..1367 /gene="UHS KerA" /codon_start=1 /product="ultra high sulfer keratin" /db_xref="PID:e1299394" /db_xref="PID:g3228239" /translation="MGCCGCSGGCGSSCGGCDSSCGSCGSGCRGCGPSCCAPVYCCKP VCCCVPACSCSSCGKRGCGSCGGSKGGCGSCGCSQCSCCKPCCCSSGCGSSCCQCSCC KPYCSQCSCCKPCCSSSGRGSSCCQSSCCKPCCSSSGCGSSCCQSSCCKPCCSQSRCC VPVCYQCKI" BASE COUNT 961 a 866 c 827 g 794 t ORIGIN 1 aagctttgtt ttcctccaag agaagaacct gtccagacaa tagtttcaaa gcagcgggaa 61 gctctgctca cgtgtcccca aggaccatgc tgtgtgaaat tcccttctgt aatcacagag 121 cccattgccc tggcctattc ctggtccagg agtagggagg aggtacacag aggatgcctc 181 cttccccact gctgagaacc ctgccatcct cagccacagt tgccacagag aagataccac 241 atccctgggg gaatcagcag gaatcaggta gagagtggca ctgctctggg gagggagggc 301 gtctcacagc atcaaacgtc aaaaacccac aacattgacc cagtcctgcc aagacggaac 361 cctgcatgag catgggggat ggggagttgg ggtgttgcaa aagacgcaat acatgaatga 421 tctcaggtaa ttctcaggca acccccggag gctggtgttg ctagcacccc tctgcaggag 481 aagaagctgg ggctcgggag ctgactggat ctgctcaaag gcccaggaag aataagagtt 541 aggaactggg acagaccttg aggaagctgc acttcctcct gaggtgagcc agcgttggag 601 ctgtttttcc tttcagtatg aattccacaa ggaaatcatc tcaggaggaa gggcttatac 661 ttggatccag aaaatatcaa catagccaaa gaaaaacaat caagacatac ctccaggagc 721 tgtgtaacag caaccggaaa gagaaacaat ggtgtgttcc tatgtgggat ataaagagcc 781 ggggctcagg gggctccaca cctgcacctc cttctcacct gctcctctac ctgctccacc 841 ctcaatccac cagaaccatg ggctgctgtg gctgctccgg aggctgtggc tccagctgtg 901 gaggctgtga ctccagctgt gggagctgtg gctctggctg caggggctgt ggccccagct 961 gctgtgcacc cgtctactgc tgcaagcccg tgtgctgctg tgttccagcc tgttcctgct 1021 ctagctgtgg caagcggggc tgtggctcct gtgggggctc caagggaggc tgtggttctt 1081 gtggctgctc ccagtgcagt tgctgcaagc cctgctgttg ctcttcaggc tgtgggtcat 1141 cctgctgcca gtgcagctgc tgcaagccct actgctccca gtgcagctgc tgtaagccct 1201 gttgctcctc ctcgggtcgt gggtcatcct gctgccaatc cagctgctgc aagccctgct 1261 gctcatcctc aggctgtggg tcatcctgct gccagtccag ctgctgcaag ccctgctgct 1321 cccagtccag atgctgtgtc cctgtgtgct accagtgcaa gatctgaggc tctagtggga 1381 aacctcaggt agctcccgaa gatctgtgct ttccaacaag tgactaccct tgaagcacat 1441 ccccttctgg atctgaaaag agcccttggc tcagggcgtc tttttccagc ccctgaggaa 1501 aaggaatgaa ccactccctg cccattccct ataagaatat cccaagaccc aggcaatttt 1561 gcccctcttt cccacatgcc cccatatgtc tgagccaaac tgcactgggg gctgccctca 1621 tgccaagcaa gagcctggaa ttccccttct tgataattcc atgggagaca gcaaaccctt 1681 ctttcctttg cctgccagga gcttcacgac atttgcagat ggatgtcctg caacccaaat 1741 gatcacatgt atctatggaa atccaaaatg catctgggtg cagcactaaa taaattctcc 1801 atccctcagc cttggtctca ctgactcttt tcttccagcc tctgtctcat tggcaaactg 1861 gccacatgtc cttcccctcc tccctgatag cttattgctc ccactgctgt aacagtgtgc 1921 caggcccagc tcccattcca gaaacagttg ttgaactgat taatgaagga accactagca 1981 gtatgtatga atgaatgaag atgaagaatg aatgagtgaa tgagtgagta tctctcatta 2041 gtacacaggg agtcccagct gtatctcagt gggattcagt ctgtcttggt tggaatttgg 2101 actcctactt cttccccatg ggaggatatg tttgggagag aaggagaact ttgccctcag 2161 tgcctaggaa gggatgtaat gggtgctctc tgggtccagc cagtccccag tttgtgggtc 2221 aagccaggag agggggaagc gagactagaa tgagctgtgt ccctgagatg ctctgtagga 2281 caacactgga aactgtgctg cttcaaggat ccaagacggt gtggctgaac acaggctgaa 2341 gtgcaccctc catctctggg ctcagagtga ggaggaatcc aagtgtccac aggcttccca 2401 gctttggttt ggcacaggga ggaacagaag ggacttttct cagcctgata aaggccatct 2461 acccacagtg aacacggtgc ttaactatga aaacctggat gttttcccct aggatcagga 2521 acaggaaaag aatgtccaca cccaccactt ctatgcaaca tttcactgaa gctatagcca 2581 gaataacata acaggaaaag aaaataaagc catccaggta agaaaaaagg aagcaaaact 2641 atttattcac agaaaacctc atcttgtata cagaagatgt gagggacaca cacacacaca 2701 cacacacaca actattagag ccaataaccg agctcagcag agtgcagaac acaaggacta 2761 tacacaatag tcagctgtgt ttctttacaa tagcaaaaag caagcaaaaa ggttgtaagg 2821 aaacaataaa attcctggga acaaatgcaa ccaaaaaagt acaagacttg gacacttaac 2881 actacaaaac accattggaa gaagttaagg aggaccaaag taatggggga aaaaatccta 2941 tgttcatgta ttggaagacc taatattgtg aagattgaaa cactccaaaa aaaaaattgt 3001 tctaccaatt taatgcaatt ccgatcaaca tctcagattt cttttttcaa aaattgacaa 3061 gttgatccta aaatttatgt ggacattcaa gggaccctaa atagctaaaa caatcttgaa 3121 aaagaaaagc aaagtttgag gactcacgtt ttccaatttc taaatgggct gcaaagctac 3181 agtcatcaag gcagcatggc tcttgcataa ggatagaaag atggatcaat ggggtaggtt 3241 tgagactctt gaaataaagc ctcacagttg tggtcaatgg attttcacag gtactataca 3301 attccatggg aaaagaggaa tgttttcaac aaacaatgct gctaagacaa ctggatgtcc 3361 acatgcaaaa gagtgaattt gaatcttgac ctcataccat ataccaaaca aacaaaaaaa 3421 aaatagaaaa aatagatccg cggccgcc // LOCUS HUMIL2RGA 4038 bp DNA PRI 18-OCT-1993 DEFINITION Human (IL2RG) gene, complete cds with repeats. ACCESSION L19546 NID g349631 KEYWORDS Alu repeat; Alu-like repeat; interleukin 2 receptor gamma chain. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4038) AUTHORS Puck,J.M., Deschenes,S.M., Porter,J.C., Dutra,A.S., Brown,C.J., Willard,H.F. and Henthorn,P.S. TITLE The interleukin-2 receptor gamma chain maps to Xq13.1 and is mutated in X-linked severe combined immunodeficiency, SCIDX1 JOURNAL Hum. Mol. Genet. 2 (8), 1099-1104 (1993) MEDLINE 94004847 FEATURES Location/Qualifiers source 1..4038 /organism="Homo sapiens" /db_xref="taxon:9606" exon 74..202 /number=1 CDS join(88..202,581..734,943..1127,1336..1475,2239..2401, 2934..3030,3283..3352,3708..3893) /codon_start=1 /product="interleukin-2 receptor gamma subunit" /db_xref="PID:g349632" /translation="MLKPSLPFTSLLFLQLPLLGVGLNTTILTPNGNEDTTADFFLTT MPTDSLSVSTLPLPEVQCFVFNVEYMNCTWNSSSEPQPTNLTLHYWYKNSDNDKVQKC SHYLFSEEITSGCQLQKKEIHLYQTFVVQLQDPREPRRQATQMLKLQNLVIPWAPENL TLHKLSESQLELNWNNRFLNHCLEHLVQYRTDWDHSWTEQSVDYRHKFSLPSVDGQKR YTFRVRSRFNPLCGSAQHWSEWSHPIHWGSNTSKENPFLFALEAVVISVGSMGLIISL LCVYFWLERTMPRIPTLKNLEDLVTEYHGNFSAWSGVSKGLAESLQPDYSERLCLVSE IPPKGGALGEGPGASPCNQHSPYWAPPCYTLKPET" intron 203..580 /number=1 mutation 366..369 /note="deletion" repeat_region 394..511 /rpt_family="Alu-like" exon 581..734 /number=2 intron 735..942 /number=2 exon 943..1127 /number=3 intron 1128..1335 /number=3 exon 1336..1475 /number=4 intron 1476..2238 /number=4 repeat_unit 1643..1655 /note="flanking" repeat_region 1656..1976 /rpt_family="Alu" repeat_region 1977..1989 /note="flanking" exon 2239..2401 /number=5 intron 2402..2933 /number=5 exon 2934..3030 /number=6 intron 3031..3282 /number=6 exon 3283..3352 /number=7 intron 3353..3707 /number=7 exon 3708..4038 /number=8 BASE COUNT 982 a 995 c 1030 g 1031 t ORIGIN 1 aaacgtgtgg gtggggaggg gtagtgggtg agggacccag gttcctgaca cagacagact 61 acacccaggg aatgaagagc aagcgccatg ttgaagccat cattaccatt cacatccctc 121 ttattcctgc agctgcccct gctgggagtg gggctgaaca cgacaattct gacgcccaat 181 gggaatgaag acaccacagc tggtgggaaa tctgggactg gagggggctg gtgagaaggg 241 tggctgtggg aaggggccgt acagagatct ggtgcctgcc actggccatt acaatcatgt 301 gggcagaatt gaaaagtgga gtgggaaggg caagggggag ggttccctgc ctcacgctac 361 ttcttctttc tttctttctt gtttgtttgt ttctttcttt cttttgaggc agggtctcac 421 tatgttgcct aggctggtct caaactcctg gcctctagtg atcctcctgc ctcagccttt 481 caaagcacca ggattacaga catgagccac cgtgcttggc ctcctccttc tgaccatcat 541 ttctctttcc ctccctgcct tcattttctc cccaatctag atttcttcct gaccactatg 601 cccactgact ccctcagtgt ttccactctg cccctcccag aggttcagtg ttttgtgttc 661 aatgtcgagt acatgaattg cacttggaac agcagctctg agccccagcc taccaacctc 721 actctgcatt attggtatga gaagggacga gggggagggg atgaagaaga ggtgggttgg 781 atcagagacc aagagagagg gtagcaagtc tcccaggtac cccactgttt tctcctgggg 841 taagtcataa gtcggttgag gggagatgag gctaggctct ggatatctgc agtacccaga 901 ttggccccac tgttcctctt ccttccaacc tttctcctct aggtacaaga actcggataa 961 tgataaagtc cagaagtgca gccactatct attctctgaa gaaatcactt ctggctgtca 1021 gttgcaaaaa aaggagatcc acctctacca aacatttgtt gttcagctcc aggacccacg 1081 ggaacccagg agacaggcca cacagatgct aaaactgcag aatctgggta atttggaaag 1141 aaagggtcaa gagaccaggg atactgtggg acattggagt ctacagagta gtgttctttt 1201 atcataaggg tacatgggca gaaaagagga ggtaggggat catgatggga agggaggagg 1261 tattaggggc actaccttca ggatcctgac ttgtctaggc caggggaatg accacatatg 1321 cacacatatc tccagtgatc ccctgggctc cagagaacct aacacttcac aaactgagtg 1381 aatcccagct agaactgaac tggaacaaca gattcttgaa ccactgtttg gagcacttgg 1441 tgcagtaccg gactgactgg gaccacagct ggactgtgag tgactaggga cgtgaatgta 1501 gcagctaagg ccaagaaagt agggctaaag gattcaacca gacagataga aggacctaat 1561 atcaagctcc tgttctctgc ctcccagctt ctctgctcac cccctaccct ccctcctcca 1621 actcctttcc cccctatttt ctccagtgag ttttcttttt ttcttttctt ttctttcttt 1681 ctttcttttt tttttttttg agacagagcc tcactctgtt gcccaggctt gagtgcagtg 1741 gggcgatctt gggctcactg cgacctctgt ctccctggtt caagtgattc tcctgcttca 1801 gcctcccaag tagctgggag catgcaccaa ccatgcctgg ctaatttttg tatttttagt 1861 aaagacaggg ttttgccatg ttggtcaggc tggtcttgaa ctcctgacct caggtgatct 1921 gcccacctcg gcctcccaaa gtgctggatt acaggcgtga gccaccattc ctgacaccag 1981 tgagttttca ttagggattc cctacccata ctcttcctga taccagatag acaagtaaac 2041 aaaaggaagc cattaagggg atccagaggg gaggcattag attcaagtca gtgaagggag 2101 cagtgtggct tgagtagtca agagatgaga gagaaactgg gcagtagcac agatgacact 2161 ggtgggtgtt caggagtatg ttttaattct cccttctctc atagacaccc actttccctc 2221 atcctctttc tcctcaagga acaatcagtg gattatagac ataagttctc cttgcctagt 2281 gtggatgggc agaaacgcta cacgtttcgt gttcggagcc gctttaaccc actctgtgga 2341 agtgctcagc attggagtga atggagccac ccaatccact gggggagcaa tacttcaaaa 2401 ggtaaaatgg gcccacatga cccaatccat gagcccaaca ccccagcctt tctaacacca 2461 ctgtcttttg ctccacttcc ctgtcactaa agcccctaaa cttggtgccc catctctcca 2521 cactgtctaa ccccaacctc tagaaatcaa ggtttttctg tgtagggttg ggttagcgtg 2581 ttgttagagt aggggagtgg attgagaagg aggctgaggg gtactcaagg gggctataga 2641 atgtatagga tttccctgaa gcattcctag agagcctgca aggtgaagat ggctttggaa 2701 ccagctggat ctaggctgtg ccacatacta cctctttggc cttggccaca tccctaaact 2761 cttggattct gtttcctaag atgtaagatg gaggtaattg ttcctgcctc acaggagctg 2821 ttgtgaggat taaacagaga gtatgtcttt agcgcggtgc ctggcaacag tgcctggcat 2881 gtagtagggg cacaacaaat ataaggtcca ctttgctttt cttttttcta tagagaatcc 2941 tttcctgttt gcattggaag ccgtggttat ctctgttggc tccatgggat tgattatcag 3001 ccttctctgt gtgtatttct ggctggaacg gtgagatttg gagaagccca gaaaaatgag 3061 gggaacggta gctgacaata gcagaggagg gttttgcagg gtctttagga gtaaaggatg 3121 agacagtaag taatgagaga ttacccaaga gggtttggtg atggaaggaa gccacaggca 3181 cagagaacac agaatcactt tatttcatat gggacaactg ggagaagggt gataaaaaag 3241 ctttaaccta tgtgctcctg ctccctcttt ctcccctgtc aggacgatgc cccgaattcc 3301 caccctgaag aacctagagg atcttgttac tgaataccac gggaactttt cggtgagaac 3361 gctgtcataa gcatgctgca gtctatcaac tgccaactgc ctgccagcaa gacagacaga 3421 gtgtgggggt gggggcagag aggagaggga aggaggccct gcactaactg tcaggatgtg 3481 gccgaccaaa tggggcatgg actatacaga gagagacaca cacagaagtg cagattatag 3541 attgaatgag gcagatggca actggtattg ggggcccagg agcctgtgta tcccttctgt 3601 aatcaattac agtggttgca gacatcatga gtactccttt ggcacagagc tcggtctttt 3661 acttcctgcc cctaattgac ccctgacctg gacatatctg tctttaggcc tggagtggtg 3721 tgtctaaggg actggctgag agtctgcagc cagactacag tgaacgactc tgcctcgtca 3781 gtgagattcc cccaaaagga ggggcccttg gggaggggcc tggggcctcc ccatgcaacc 3841 agcatagccc ctactgggcc cccccatgtt acaccctaaa gcctgaaacc tgaaccccaa 3901 tcctctgaca gaagaacccc agggtcctgt agccctaagt ggtactaact ttccttcatt 3961 caacccacct gcgtctcata ctcacctcac cccactgtgg ctgatttgga attttgtgcc 4021 cccatgtaag cacccctt // LOCUS HUMCRPGA 2480 bp DNA PRI 01-NOV-1994 DEFINITION Human C-reactive protein gene, complete cds. ACCESSION M11725 NID g181067 KEYWORDS C-reactive protein. SOURCE Human fetus liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2480) AUTHORS Lei,K.J., Liu,T., Zon,G., Soravia,E., Liu,T.Y. and Goldman,N.D. TITLE Genomic DNA sequence for human C-reactive protein JOURNAL J. Biol. Chem. 260 (24), 13377-13383 (1985) MEDLINE 86033784 FEATURES Location/Qualifiers source 1..2480 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q21-q23" gene 230..290 /gene="CRP" CDS join(230..290,569..1182) /note="C-reactive protein" /codon_start=1 /db_xref="PID:g181068" /translation="MEKLLCFLVLTSLSHAFGQTDMSRKAFVFPKESDTSYVSLKAPL TKPLKAFTVCLHFYTELSSTRGYSIFSYATKRQDNEILIFWSKDIGYSFTVGGSEILF EVPEVTVAPVHICTSWESASGIVEFWVDGKPRVRKSLKKGYTVGAEASIILGQEQDSF GGNFEGSQSLVGDIGNVNMWDFVLSPDEINTIYLGGPFSPNVLNWRALKYEVQGEVFT KPQLWP" exon <230..290 /gene="CRP" /note="C-reactive protein; G00-119-071" /number=1 intron 291..568 /note="intron A" exon 569..>1182 /note="C-reactive protein" /number=2 BASE COUNT 621 a 574 c 620 g 665 t ORIGIN Chromosome 1q12-q23. 1 tttgcttccc ctcttcccga agctctgaca cctgccccaa caagcaatgt tggaaaatta 61 tttacatagt ggcgcaaact cccttactgc tttggatata aatccaggca ggaggaggta 121 gctctaaggc aagagatctg ggacttctag cccctgaact ttcagccgaa tacatctttt 181 ccaaaggagt gaattcaggc ccttgtatca ctggcagcag gacgtgacca tggagaagct 241 gttgtgtttc ttggtcttga ccagcctctc tcatgctttt ggccagacag gtaagggcca 301 ccccaggcta tgggagagtt ttgatctgag gtatgggggt ggggtctaag actgcatgaa 361 cagtctcaaa aaaaaaaaaa aaagactgta tgaacagaac agtggagcat ccttcatggt 421 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgg tgtgtaactg gagaaggggt cagtctgttt 481 ctcaatctta aattctatac gtaagtgagg ggatagatct gtgtgatctg agaaacctct 541 cacatttgct tgtttttctg gctcacagac atgtcgagga aggcttttgt gtttcccaaa 601 gagtcggata cttcctatgt atccctcaaa gcaccgttaa cgaagcctct caaagccttc 661 actgtgtgcc tccacttcta cacggaactg tcctcgaccc gtgggtacag tattttctcg 721 tatgccacca agagacaaga caatgagatt ctcatatttt ggtctaagga tataggatac 781 agttttacag tgggtgggtc tgaaatatta ttcgaggttc ctgaagtcac agtagctcca 841 gtacacattt gtacaagctg ggagtccgcc tcagggatcg tggagttctg ggtagatggg 901 aagcccaggg tgaggaagag tctgaagaag ggatacactg tgggggcaga agcaagcatc 961 atcttggggc aggagcagga ttccttcggt gggaactttg aaggaagcca gtccctggtg 1021 ggagacattg gaaatgtgaa catgtgggac tttgtgctgt caccagatga gattaacacc 1081 atctatcttg gcgggccctt cagtcctaat gtcctgaact ggcgggcact gaagtatgaa 1141 gtgcaaggcg aagtgttcac caaaccccag ctgtggccct gaggccagct gtgggtcctg 1201 aaggtacctc ccggtttttt acaccgcatg ggccccacgt ctctgtctct ggtacctccc 1261 gcttttttac actgcatggt tcccacgtct ctgtctctgg gcctttgttc ccctatatgc 1321 attgaggcct gctccaccct cctcagcgcc tgagaatgga ggtaaagtgt ctggtctggg 1381 agctcgttaa ctatgctggg aaatggtcca aaagaatcag aatttgaggt gttttgtttt 1441 catttttatt tcaagttgga cagatcttgg agataatttc ttacctcaca tagatgagaa 1501 aactaacacc cagaaaggag aaatgatgtt ataaaaaact cataaggcaa gagctgagaa 1561 ggaagcgctg atcttctatt taattcccca cccatgaccc ccagaaagca ggagcattgc 1621 ccacattcac agggctcttc agtctcagaa tcaggacact ggccaggtgt ctggtttggg 1681 tccagagtgc tcatcatcat gtcatagaac tgctgggccc aggtctcctg aaatgggaag 1741 cccagcaata ccacgcagtc cctccacttt ctcaaagcac actggaaagg ccattagaat 1801 tgccccagca gagcagatct gctttttttc cagagcaaaa tgaagcacta ggtataaata 1861 tgttgttact gccaagaact taaatgactg gtttttgttt gcttgcagtg ctttcttaat 1921 tttatggctc ttctgggaaa ctcctcccct tttccacacg aaccttgtgg ggctgtgaat 1981 tctttcttca tccccgcatt cccaatatac ccaggccaca agagtggacg tgaaccacag 2041 ggtgtcctgt cagaggagcc catctcccat ctccccagct ccctatctgg aggatagttg 2101 gatagttacg tgttcctagc aggaccaact acagtcttcc caaggattga gttatggact 2161 ttgggagtga gacatcttct tgctgctgga tttccaagct gagaggacgt gaacctggga 2221 ccaccagtag ccatcttgtt tgccacatgg agagagactg tgaggacaga agccaaactg 2281 gaagtggagg agccaaggga ttgacaaaca acagagcctt gaccacgtgg agtctctgaa 2341 tcagccttgt ctggaaccag atctacacct ggactgccca ggtctataag ccaataaagc 2401 ccctgtttac ttgagtgagt ccaagctgtt ttctgatagt tgctttagaa gttgtgacta 2461 acttctctat gacctttgaa // LOCUS HSBCDIFFI 3230 bp DNA PRI 30-MAR-1992 DEFINITION H.sapiens gene for B cell differentiation factor I. ACCESSION X12706 NID g29392 KEYWORDS B-cell differentiation factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3230) AUTHORS Honjo,T., Takatu,K. and Severinson,E. JOURNAL Unpublished COMMENT see X12705 for ph.IL-5-30 cDNA sequence; extent of mRNA is given according ph.IL-5-30 cDNA; Data kindly supplied by Derwent Biotechnology Abstracts: Patent (EP_0_261_625, 20.09.86_JP_223284/86), T. Honjo. FEATURES Location/Qualifiers source 1..3230 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Charon 4A" /clone="lambda 12, lambda 22, lambda 38" sig_peptide 553..609 /note="B cell differentiation factor I" CDS join(553..696,905..937,1883..2011,2118..2216) /codon_start=1 /product="B cell differentiation factor I" /db_xref="PID:g29393" /db_xref="SWISS-PROT:P05113" /translation="MRMLLHLSLLALGAAYVYAIPTEIPTSALVKETLALLSTHRTLL IANETLRIPVPVHKNHQLCTEEIFQGIGTLESQTVQGGTVERLFKNLSLIKKYIDGQK KKCGEERRRVNQFLDYLQEFLGVMNTEWIIES" mRNA join(553..696,905..937,1883..2011,2118..2216) /note="B cell differentiation factor I" exon 553..696 /number=1 mat_peptide join(610..696,905..937,1883..2011,2118..2213) /product="B cell differentiation factor I" intron 697..904 /number=1 exon 905..937 /number=2 intron 938..1882 /number=2 exon 1883..2011 /number=3 intron 2012..2117 /number=3 exon 2118..2216 /number=4 polyA_site 2583 BASE COUNT 1027 a 545 c 622 g 1036 t ORIGIN 1 atcctaatca agaccccagt gaacagaact cgaccctgcc aaggcttggc atttccattt 61 caatcactgt cttcccacca gtattttcaa tttcttttaa gacagattaa tctagccaca 121 gtcatagtag aacatagccg atcttgaaaa aaaacattcc caatatttat gtattttagc 181 ataaaattct gtttagtggt ctaccttata ctttgttttg cacacatctt ttaagaggaa 241 gttaattttc tgattttaag aaatgcaaat gtggggcaat gatgtattaa cccaaagatt 301 ccttccgtaa tagaaaatgt ttttaaaggg gggaaacagg gatttttatt attaaaagat 361 aaaagtaaat ttatttttta agatataagg cattggaaac atttagtttc acgatatgcc 421 attattaggc attctctatc tgattgttag aaattattca tttcctcaaa gacagacaat 481 aaattgactg gggacgcagt cttgtactat gcactttctt tgccaaaggc aaacgcagaa 541 cgtttcagag ccatgaggat gcttctgcat ttgagtttgc tagctcttgg agctgcctac 601 gtgtatgcca tccccacaga aattcccaca agtgcattgg tgaaagagac cttggcactg 661 ctttctactc atcgaactct gctgatagcc aatgaggtaa ttttctttat gattcctaca 721 gtctgtaaag tgcataggta atcatttgtg atggttcctt tactatatat agagatctgt 781 tataaataat aagattctga gcacattagt acatgggtga taactacatc accagcaaac 841 attctgttaa aagttatgaa tgctggtgtg ctgtaaaaat gattgtattt cctttcctct 901 ccagactctg aggattcctg ttcctgtaca taaaaatgta agttaaatta tgattcagta 961 aaatgatggc atgaataagt aaatttcctg ttttaagctg taaatcatta gttatcattg 1021 gaactattta attttctata ttttgttttc atatgggtgg ctgtgaatgt ctgtacttat 1081 aaatatgagg aatgactttt tatcaagtag aatcctttaa acaagtggat taggctcttt 1141 ggtgatgttg ttagtttgcc ttcccaaaga gcatcgtgtc aggattcttt ccagaaggat 1201 tccacactga gtgagaggtg cgtgctagtc tccgtgcagt tctgactctt tctcactcta 1261 acgtgtttct gaaagtatta gcaactcaga attatatttt tagaaccatg atcagtagac 1321 attaaaatat ataacaaatg ccctatatta ataattctgc atacttaaat aattatgact 1381 atatgatggt gtgtatgcat tgaatatgcc tggtcatatt aaaatgtaaa atatatagtt 1441 tattagtcta aatagaataa aactaccagc tagaactgta gaaacacatt gatatgagtt 1501 taatgtataa tgcattacac ttccaaaaca tttttttcca gttacataat taagttatat 1561 cctttataaa actcctcagt aatcatataa gcttcatcta ctttttgaaa attttatctt 1621 aatatgtggt ggtttgttgc ctagaaaaca aacaaaaaac tctttggaga agggaactca 1681 tgtaaatacc acaaaacaaa gcctaacttt gtggaccaaa attgttttaa taattatttt 1741 ttaattgatg aattaaaaag tatatatatt tattgtgtac aatatgatgt tttgaagtat 1801 gtatacattg cagaatggac aatggaccaa atttttatac cttgtcttga ttatttgcat 1861 tttaaaaatt ttcctcattt agcaccaact gtgcactgaa gaaatctttc agggaatagg 1921 cacactggag agtcaaactg tgcaaggggg tactgtggaa agactattca aaaacttgtc 1981 cttaataaag aaatacattg acggccaaaa agtaagttac acacattcaa tggaagctat 2041 atttgtcctg gctgtgccta tttctatgga attgacagtt tcctgtaata cctattgtca 2101 tttttctttt ttcacagaaa aagtgtggag aagaaagacg gagagtaaac caattcctag 2161 actacctgca agagtttctt ggtgtaatga acaccgagtg gataatagaa agttgagact 2221 aaactggttt gttgcagcca aagattttgg aggagaagga cattttactg cagtgagaat 2281 gagggccaag aaagagtcag gccttaattt tcaatataat ttaacttcag agggaaagta 2341 aatatttcag gcatactgac actttgccag aaagcataaa attcttaaaa tatatttcag 2401 atatcagaat cattgaagta ttttcctcca ggcaaaattg atatactttt ttcttattta 2461 acttaacatt ctgtaaaatg tctgttaact taatagtatt tatgaaatgg ttaagaattt 2521 ggtaaattag tatttattta atgttatgtt gtgttctaat aaaacaaaaa tagacaactg 2581 ttcaatttgc tgctggcctc tgtccttagc aatttgaagt tagcacagtc cattgagtac 2641 atgcccagtt tggaggaagg gtctgagcac atgtggctga gcatccccat ttctctggag 2701 aagtctcaag gttgcaaggc acaccagagg tggaagtgat ctagcaggac ttagtgggga 2761 tgtggggagc agggacacag gcaggaggtg aacctggttt tctctctaca gtatatccag 2821 aacctgggat ggtcgaaggg taaatggtag ggaataaatg aatgaatgtc gtttccaaga 2881 tgattgtaga actaaaatga gttgtaagct cccctggaag aagggatgtg gaacctgtaa 2941 ctaggttcct gcccagcctg tgagaagaat ttggcagatc atctcattgc cagtatagag 3001 aggaagccag aaaccctctc tgccaaggcc tgcaggggtt cttaccacct gaccctgcac 3061 cataacaaaa ggacagagag acatggtagg gcagtcccat tagaaagact gagttccgta 3121 ttcccggggc agggcagcac caggccgcac aacatccatt ctgcctgctt atggctatca 3181 gtagcatcac tagagattct tctgtttgag aaaacttctc tcaaggatcc // LOCUS HUMTHY1A 2806 bp DNA PRI 14-JAN-1995 DEFINITION Human Thy-1 glycoprotein gene, complete cds. ACCESSION M11749 NID g339682 KEYWORDS Thy-1 glycoprotein. SOURCE Human B-lymphoblastoid cell line LG2 DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2806) AUTHORS Seki,T., Spurr,N., Obata,F., Goyert,S., Goodfellow,P. and Silver,J. TITLE The human Thy-1 gene: structure and chromosomal location JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82 (19), 6657-6661 (1985) MEDLINE 86016759 FEATURES Location/Qualifiers source 1..2806 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q22.3-q23" gene 27..63 /gene="THY1" exon <27..63 /gene="THY1" /note="Thy-1; G00-119-614" CDS join(27..63,547..882,1410..1522) /note="Thy-1" /codon_start=1 /db_xref="PID:g339683" /translation="MNLAISIALLLTVLQVSRGQKVTSLTACLVDQSLRLDCRHENTS SSPIQYEFSLTRETKKHVLFGTVGVPEHTYRSRTNFTSKYHMKVLYLSAFTSKDEGTY TCALHHSGHSPPISSQNVTVLRDKLVKCEGISLLAQNTSWLLLLLLSLSLLQATDFMS L" intron 64..546 /note="intron A" exon 547..882 /number=2 intron 883..1409 /note="intron B" exon 1410..>1522 /note="Thy-1" /number=3 BASE COUNT 612 a 851 c 754 g 589 t ORIGIN Chromosome 11q22.3. 1 ggatccagga ctgagatccc agaaccatga acctggccat cagcatcgct ctcctgctaa 61 caggtacccg gcatggggca ggactggggc tccaggcgcc ctggcttcct tccctccaga 121 gaagcagctt ctccctcaca gtctcagaaa agcgcaggtg acaaagagag ggctcttttt 181 catcctgaag tcagccgatc caccgcgctg atattctgac ggcctgaggt ggtttttgga 241 aacacagttt gctgagccct ccttcacact attgaactag aatccccaac tgagaaccca 301 ggaaccagca tcaactccct aagatctcct gtccttgaaa cacattgata ggatccaagg 361 ctcaagcaga gtggggaggg aggctggggt ctgcaaagga gaagtgggat ccctggggtg 421 gggaaaggca ctcagagagc agaccccggt cccctcccta gccaggccca tctctccact 481 tcaggtgggt gggaggcccc tgtgccgcag gcccctccag tttgaaggag gcactgctgg 541 tgccagtctt gcaggtctcc cgagggcaga aggtgaccag cctaacggcc tgcctagtgg 601 accagagcct tcgtctggac tgccgccatg agaataccag cagttcaccc atccagtacg 661 agttcagcct gacccgtgag acaaagaagc acgtgctctt tggcactgtg ggggtgcctg 721 agcacacata ccgctcccga accaacttca ccagcaaata ccacatgaag gtcctctact 781 tatccgcctt cactagcaag gacgagggca cctacacgtg tgcactccac cactctggcc 841 attccccacc catctcctcc cagaacgtca cagtgctcag aggtgagaca agcccctaac 901 aaggtcaagt gagctgggag agccaggctc ggggacagca ggcagttccc ttggctggac 961 tagagaggag aatagcccca taacgctctc accctctccc aactgctgcc tggtcaactg 1021 gggaaccatt gccttcggtg tgaatggggt gaagagctca gggccagaca ggcagagcag 1081 tgtggttcca ccagaactgt gggcaaggcc tttggcccct aatcttcctt ctcccagcgg 1141 gaaacaggga tgacaccacc tccctcagcc agttttcttg tcatgatgtt tagtaaggtt 1201 ttcataagat gatatgtgtg caagagatca gtaatctgca aatgggaaag atggctggtt 1261 ctgtgagacc aggctgttcc tggtcccagc taagacattg cagtacccac ctcccaaagg 1321 gagtacaccc ttgctttggg cctgtgcctg cctgagtcct gatccgtctt ccttcctacc 1381 ctgcccccgg cccccttctc tttctgcaga caaactggtc aagtgtgagg gcatcagcct 1441 gctggctcag aacacctcgt ggctgctgct gctcctgctg tccctctccc tcctccaggc 1501 cacggatttc atgtccctgt gactggtggg gcccatggag gagacaggaa gcctcaagtt 1561 ccagtgcaga gatcctactt ctctgagtca gctgaccccc tccccccaat ccctcaaacc 1621 ttgaggagaa gtggggaccc cacccctcat caggagttcc agtgctgcat gcgattatct 1681 acccacgtcc acgcggccac ctcaccctct ccgcacacct ctggctgtct ttttgtactt 1741 tttgttccag agctgcttct gtctggttta tttaggtttt atccttcctt ttctttgaga 1801 gttcgtgaag agggaagcca ggattgggga cctgatggag agtgagagca tgtgaggggt 1861 agtgggatgg tggggtacca gccactggag gggtcatcct tgcccatcgg gaccagaaac 1921 ctgggagaga cttggatgag gagtggttgg gctgtgctgg gcctagcacg gacatggtct 1981 gtcctgacag cactcctcgg caggcatggc tggtgcctga agaccccaga tgtgagggca 2041 ccaccaagaa tttgtggcct accttgtgag ggagagaact gaggatctcc agcattctca 2101 gccacaacca aaaaaaaata aaaagggcag ccctccttac cactgtggaa gtccctcaga 2161 ggccttgggg catgacccag tgaagatgca ggtttgacca ggaaagcagc gctagtggag 2221 ggttggagaa ggaggtaaag gatgagggtt catcatccct ccctgcctaa ggaagctaaa 2281 agcatggccc tgctgcccct ccctgcctcc acccacagtg gagagggcta caaaggagga 2341 caagaccctc tcaggctgtc ccaagctccc aagagcttcc agagctctga cccacagcct 2401 ccaagtcagg tggggtggag tcccagagct gcacagggtt tggcccaagt ttctaaggga 2461 ggcacttcct cccctcgccc atcagtgcca gcccctgctg gctggtgcct gagcccctca 2521 gacagccccc tgccccgcag gcctgccttc tcagggactt ctgcggggcc tgaggcaagc 2581 catggagtga gacccaggag ccggacactt ctcaggaaat ggcttttccc aacccccagc 2641 ccccacccgg tggttcttcc tgttctgtga ctgtgtatag tgccaccaca gcttatggca 2701 tctcattgag gacaaagaaa actgcacaat aaaaccaagc ctctggaatc tgtcctcgtg 2761 tccacctggc cttcgctcct ccagcagtgc ctgcctgccc ccgctt // LOCUS HSUPA 7258 bp DNA PRI 07-FEB-1997 DEFINITION H.sapiens uPA gene. ACCESSION X02419 NID g37601 KEYWORDS plasminogen activator; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7258) AUTHORS Riccio,A., Grimaldi,G., Verde,P., Sebastio,G., Boast,S. and Blasi,F. TITLE The human urokinase-plasminogen activator gene and its promoter JOURNAL Nucleic Acids Res. 13 (8), 2759-2771 (1985) MEDLINE 85215647 COMMENT Direct repeat 1 is the hexanucleotide sequence GGCGG, previously found at similar regions in several viral and eukaryotic promoters and known to be essential for promoter activity. (McKnight et al (1984) Cell, 37, 253-262). FEATURES Location/Qualifiers source 1..7258 /organism="Homo sapiens" /db_xref="taxon:9606" gene 720..7188 /gene="uPA" CAAT_signal 720..722 /gene="uPA" repeat_region 739..744 /note="repeat 1" /rpt_type=DIRECT repeat_region 754..759 /note="repeat 1" /rpt_type=DIRECT repeat_region 765..770 /note="repeat a" /rpt_type=DIRECT TATA_signal 775..781 /gene="uPA" mRNA join(802..889,1196..1283,1701..1728,1875..1982,2586..2760, 2954..3045,3203..3422,3644..3792,4458..4598,4945..5093, 6083..7188) /gene="uPA" exon 802..889 /gene="uPA" /number=1 intron 890..1195 /gene="uPA" /number=1 exon 1196..1283 /gene="uPA" /number=2 CDS join(1227..1283,1701..1728,1875..1982,2586..2760, 2954..3045,3203..3422,3644..3792,4458..4598,4945..5093, 6083..6259) /gene="uPA" /EC_number="3.4.21.73" /codon_start=1 /product="urokinase-plasminogen activator" /db_xref="PID:e300604" /db_xref="PID:g1834524" /translation="MRALLARLLLCVLVVSDSKGSNELHQVPSNCDCLNGGTCVSNKY FSNIHWCNCPKKFGGQHCEIDKSKTCYEGNGHFYRGKASTDTMGRPCLPWNSATVLQQ TYHAHRSDALQLGLGKHNYCRNPDNRRRPWCYVQVGLKPLVQECMVHDCADGKKPSSP PEELKFQCGQKTLRPRFKIIGGEFTTIENQPWFAAIYRRHRGGSVTYVCGGSLMSPCW VISATHCFIDYPKKEDYIVYLGRSRLNSNTQGEMKFEVENLILHKDYSADTLAHHNDI ALLKIRSKEGRCAQPSRTIQTICLPSMYNDPQFGTSCEITGFGKENSTDYLYPEQLKM TVVKLISHRECQQPHYYGSEVTTKMLCAADPQWKTDSCQGDSGGPLVCSLQGRMTLTG IVSWGRGCALKDKPGVYTRVSHFLPWIRSHTKEENGLAL" intron 1284..1700 /gene="uPA" /number=2 exon 1701..1728 /gene="uPA" /number=3 intron 1729..1874 /gene="uPA" /number=3 exon 1875..1982 /gene="uPA" /number=4 intron 1983..2585 /gene="uPA" /number=4 exon 2586..2760 /gene="uPA" /number=5 intron 2761..2953 /gene="uPA" /number=5 exon 2954..3045 /gene="uPA" /number=6 intron 3046..3202 /gene="uPA" /number=6 exon 3203..3422 /gene="uPA" /number=7 intron 3423..3643 /gene="uPA" /number=7 exon 3644..3792 /gene="uPA" /number=8 intron 3793..4457 /gene="uPA" /number=8 exon 4458..4598 /gene="uPA" /number=9 intron 4599..4944 /gene="uPA" /number=9 exon 4945..5093 /gene="uPA" /number=10 intron 5094..6082 /gene="uPA" /number=10 exon 6083..7188 /gene="uPA" /number=11 polyA_signal 7155..7160 /gene="uPA" polyA_signal 7168..7173 /gene="uPA" polyA_site 7188 /gene="uPA" BASE COUNT 1680 a 1858 c 2089 g 1631 t ORIGIN 1 ttcaatagga agcaccaaca gtttatgccc taggactttg ttcccacaat cctgtaacat 61 catatcacga cacctaaccc aatccttatc aagccctgtc aaaaacggac tttaaaccaa 121 gctgcaaatt ttcagtaatc tggccttgcc tttccccctc tgatagcacc atcaaacaaa 181 cccccttact gccgaaagca ataagcccgg ctttgttcca tccactggtt gtgttggtga 241 tatctgggga ctgccactga acagacgcac agagggagcc cctacaggca ggggtttttc 301 tgtctgtgct tcttgggaga gtatgtctcg tacatttgtc gcgtgatgaa gacttcacag 361 ctccatccag cgaccagact cacagctcca tccagctgcg gcaagggggt ctgaggcagt 421 cttaggcaag ttggggccca gcgggagaag ttgcagaaga actgattaga ggacccagga 481 ggcttcagag ctgggctgag gtagagagtc tcctgtgcgc cttctctcct ctctgcaatt 541 cggggactcc ttgcactggg gcaggccccg gcaggtgcat gggaggaagc acggagaatt 601 tacaagcctc tcgattcctc agtccagacg ctgttgggtc ccctccgctg gagatcgcgc 661 ttcccccaaa tctttgtgag cgttgcggaa gcacgcgggg tccgggtcgc tgagcgctgc 721 aagacagggg agggagccgg gcgggagagg gaggggcggc gccggggcgg gccctgatat 781 agagcaggcg ccgcgggtcg cagcacagtc ggagaccgca gcccggagcc cgggccaggg 841 tccacctgtc cccgcagcgc cggctcgcgc cctcctgccg cagccaccgg tgagtgccgc 901 ggtcctgaga tccccgggcc ggatgcgcgg cggccccagc tcccgagcgt ctgcctgccc 961 cgccctgggc tgcccgggct ccctgggctc cccggcggct gcacggagtc aaggcgcccc 1021 gtcccgggcg tcccccgcgg gtgccgatcc aggctgcccg gagtccggag cccatagagg 1081 agagagacag ctggggagcc tggtcaccgc gggcatctcc cctgcgctgc agtcgcccgc 1141 ctggcctgcc ttcccgttcc tccgcctctt gccctgactt ctccttcctt tgcagagccg 1201 ccgtctagcg ccccgacctc gccaccatga gagccctgct ggcgcgcctg cttctctgcg 1261 tcctggtcgt gagcgactcc aaagtgagtg cgctcttgct ttgactgatg ctgcccaagg 1321 acctctgatc agcaccaggg gagaggaggg gctgctcagg gagctggggt ctccggattc 1381 catccacagc agggccagac tctccccagg aaatgggaca gggtggcagc ggaggcttga 1441 gaaccacggg ggttggcact ggctggcaag ggaggaagag ggccaccggg actgccccag 1501 cctgcgggca tctggtagat gaagcttaat ccatttctcc tggctggaaa ccatggtctt 1561 ccatttgaga actagatacg aacagggtga ggcgagaggg agagggaaga gtgggttttg 1621 ggattggggc cagtttaccc tcaccctgga tccctggagc atgggacctt tgatgaagcc 1681 tcctcccgaa tctcttccag ggcagcaatg aacttcatca agttccatgt gagtatccac 1741 ccctacaaca gttggctgca cagacaagtt gggaaggctt caggggacac tcccctccct 1801 gccctctgct gcagcgtgcg ccacccctta ccacttccac tccccctcgc ttaccccacc 1861 tttgttctct ccagcgaact gtgactgtct aaatggagga acatgtgtgt ccaacaagta 1921 cttctccaac attcactggt gcaactgccc aaagaaattc ggagggcagc actgtgaaat 1981 aggtatgggg atctccactg caactgggag agaaatttgg ggacagggag ggatgggtgg 2041 gaggcaagag caggcaggag ttaggagctg gaggtagggt gggtgacatc ttcatcccta 2101 tgtgacaagc ataaacacac acacacgctc acgaaacagt ggccacacaa atgtgaggtg 2161 gggttggaag gagaccctgt ccagtcttct ggcaggtctg aaacgacatc tttaaaatgt 2221 ccgttggcag ccgggcatgg tggctcacgc ttgtaatccc agcattttga gaggtcaagt 2281 ttgagtggat catttaggtc aggagttcaa gaccagcctg gacaacatgg tgtaaccctg 2341 cctctactaa aaatgcaaaa atcagcctgg catggtggtg gatgcctgta gtcccagcta 2401 cttgggaggc tgaggcagga gaattgcttg aacatgggag gccagatctc agtgagctga 2461 gatcacacca ctgcactcca actgggcgac agagcaagac tccatctcaa aaaaaaaaaa 2521 aaataaaagt tagttggaat gttcttctct ttctcatatt ctctcatcct cctgtcccct 2581 tgtagataag tcaaaaacct gctatgaggg gaatggtcac ttttaccgag gaaaggccag 2641 cactgacacc atgggccggc cctgcctgcc ctggaactct gccactgtcc ttcagcaaac 2701 gtaccatgcc cacagatctg atgctcttca gctgggcctg gggaaacata attactgcag 2761 gtgaggtggg ggcaacaagg accaaaagcc ctccctacag cttcccagaa accttgttac 2821 catccccttc tcccagaggg ctggccatag cacaagagaa gtgcggcctc tggttgagtc 2881 ttccctgagg ggaggaggca gggaaggccc tctgggttgg aatgacatcc cctatctttc 2941 tgtgttgtgc caggaaccca gacaaccgga ggcgaccctg gtgctatgtg caggtgggcc 3001 taaagccgct tgtccaagag tgcatggtgc atgactgcgc agatggtgag catcactgac 3061 ctgctgatga caggtgggtg gaaggggaca aacttacatg tccccttatt ccatcacagg 3121 aggactgagg aggtgggggg tgcccgagag ggatgctttc tcctacctgc ctccctaaga 3181 catccctctg tttgtcctcc aggaaaaaag ccctcctctc ctccagaaga attaaaattt 3241 cagtgtggcc aaaagactct gaggccccgc tttaagatta ttgggggaga attcaccacc 3301 atcgagaacc agccctggtt tgcggccatc tacaggaggc accggggggg ctctgtcacc 3361 tacgtgtgtg gaggcagcct catgagccct tgctgggtga tcagcgccac acactgcttc 3421 atgtacggcc ctgggtttct cctcttcgac tcttctgccc caccccaagc acatcccttt 3481 ctccttccca gcaaagtgtt ccgcctcatt tctccctcat ctgcccctgt ccatgcgccc 3541 atggccttgg ggacaagtcg tgctttgagg cctctaggga gggaaggaag aagtggcatg 3601 atttcatggg actaagctgt ttgatgggta tcttcttcca cagtgattac ccaaagaagg 3661 aggactacat cgtctacctg ggtcgctcaa ggcttaactc caacacgcaa ggggagatga 3721 agtttgaggt ggaaaacctc atcctacaca aggactacag cgctgacacg cttgctcacc 3781 acaacgacat tggtgagggg gaacgcccgc gactactgtg gccataatgg cttggggaga 3841 gtgggaccca gggagagact ggagctgagt tgaagctgcc ggtggggcag gggtggggcg 3901 agggaccttg aagcctcgat atacatgaca aaggatggca gggaagagtt ccatgaagtc 3961 tgaggggcct ggtgctcctc tggagagacc ctgaatttcc ccaacaagta gccctcttgc 4021 gagtggaaac agccctgtgg gtatatggct tgggctggga aggccctgtt tatatgaatt 4081 agaaaaagac acaccttcct ttgtgggatg cagcctctgt ctgtgctagg atatagaact 4141 tggagaatgg agccttggga tggattccag cctaactacc tcagggggat cctctagagt 4201 gcagctggga gtttttgcag aaacgacctg tacagctgta tgcagtggct ctggccatcc 4261 aagccttttt caacacctgg aacaaagccc ttggggcatg gggcagggga ggtttccagg 4321 tgataagcga ccagcagacc tccctggatg actgacctag ggataggcat agctacttcc 4381 tcggcacttg gaggggacag atggggaccg cctaaccagt agtgatcttt ctcctctgac 4441 cctctgtcct cccccagcct tgctgaagat ccgttccaag gagggcaggt gtgcgcagcc 4501 atcccggact atacagacca tctgcctgcc ctcgatgtat aacgatcccc agtttggcac 4561 aagctgtgag atcactggct ttggaaaaga gaattctagt aagtgacaat tgcgactgac 4621 ttagaaggtc ctgaggagtg ttttgacctg aaaatgagcc cagtgtgatc aagggaagac 4681 tgcagagtta gaggtgggag cactgaggcg gtggcagatg ggtccaggga tggatgaaga 4741 gtgttgttta gggagcgatg ggctgcaaag gtaaatagat ggtaggggct ataggtggag 4801 gtaaatggct cagatttgca tggagagaga ataatgggcc tctccctggg tgatgatact 4861 ttatggtgtc ccctctctgg cgagacgtcc cacgtggagg cagataaatc ttgatgcaaa 4921 cgcctccctg ttttctccac ctagccgact atctctatcc ggagcagctg aaaatgactg 4981 ttgtgaagct gatttcccac cgggagtgtc agcagcccca ctactacggc tctgaagtca 5041 ccaccaaaat gctgtgtgct gctgacccac agtggaaaac agattcctgc caggtgagtg 5101 ttccaagcat ctctctccac ctcttccata tctccccaga gctcctgggc ttgttccagc 5161 cagcttaagg gtgtctctct ctagccaaag ccctaagtag ccagaatcag gagctcaggt 5221 ctttgagggt ttaaaccagt ccttatgtgt ttgccagaca ttaccaaaaa aatcccagct 5281 ctgcgctagt cacttcagac tgggggcacg agatcctaga aagaggaaac agtaaaagac 5341 aatgtaactc agtgcccagg gtgtgttgtg aactataaat gatcaggtgt tcaggagagg 5401 gaggtgagtg ccaacctgag ggtcagggag gggaggcttt aaaggaaatg tgacttgata 5461 ggcatttgaa gaggcagagg gaagaaagga aggtgtttca gttgaaagat acaaaactga 5521 gaaggaggct ggcatattcc gggtggggag gagaactagg gtctgggagt gtggatggaa 5581 tagtggcaga tgacagggct tttaaagcca agcaggggat tttccaactt cgatgtggta 5641 gaaatggggc tgcgtcaggc acagtggctc atgcctgtaa tcccagcatt gggctaggcc 5701 gtagtcgatg gatcattgag gccagagttg agaccggcct ggaccaacat ggtgaaaccc 5761 tgtgtctact aaaaaatgca aaaaaaaaaa ttagccaggt gtggtggtgc ctgcctgtaa 5821 tcccagctaa tcaggaggct gagacatgga atcgcttgag cacaggaggc aagtttgacg 5881 tgagctgaga tcacgtcatt gcacgccagc ctgggcgaca gagcgagatt ctgtcctccc 5941 gccgaaaaaa gaaagaaaat gggaagtcgc taaggacttt gactgggaaa ctcttccctc 6001 tctctggtat ggttgggtga tgggatcaga aatcccctcc tcacttctct agggctcatc 6061 ttttgtatct ttggcgtcac agggagactc agggggaccc ctcgtctgtt ccctccaagg 6121 ccgcatgact ttgactggaa ttgtgagctg gggccgtgga tgtgccctga aggacaagcc 6181 aggcgtctac acgagagtct cacacttctt accctggatc cgcagtcaca ccaaggaaga 6241 gaatggcctg gccctctgag ggtccccagg gaggaaacgg gcaccacccg ctttcttgct 6301 ggttgtcatt tttgcagtag agtcatctcc atcagctgta agaagagact gggaagatag 6361 gctctgcaca gatggatttg cctgtgccac ccaccagggt gaacgacaat agctttaccc 6421 tcaggcatag gcctgggtgc tggctgccca gacccctctg gccaggatgg aggggtggtc 6481 ctgactcaac atgttactga ccagcaactt gtctttttct ggactgaagc ctgcaggagt 6541 taaaaagggc agggcatctc ctgtgcatgg gtgaagggag agccagctcc cccgacggtg 6601 ggcatttgtg aggcccatgg ttgagaaatg aataatttcc caattaggaa gtgtaacagc 6661 tgaggtctct tgagggagct tagccaatgt gggagcagcg gtttggggag cagagacact 6721 aacgacttca gggcagggct ctgatattcc atgaatgtat caggaaatat atatgtgtgt 6781 gtatgtttgc acacttgtgt gtgggctgtg agtgtaagtg tgagtaagag ctggtgtctg 6841 attgttaagt ctaaatattt ccttaaactg tgtggactgt gatgccacac agagtggtct 6901 ttctggagag gttataggtc actcctgggg cctcttgggt cccccacgtg acagtgcctg 6961 ggaatgtact tattctgcag catgacctgt gaccagcact gtctcagttt cactttcaca 7021 tagatgtccc tttcttggcc agttatccct tccttttagc ctagttcatc caatcctcac 7081 tgggtggggt gaggaccact ccttacactg aatatttata tttcactatt tttatttata 7141 tttttgtaat tttaaataaa agtgatcaat aaaatgtgat ttttctgatg acaaatctcc 7201 ctggtgcttg tatgggaagg agttggagta cataaaaagg agaaaataac aaaggtgg // LOCUS HUMSAP01 1394 bp DNA PRI 11-MAR-1998 DEFINITION Homo sapiens gene for serum amyloid P component, complete cds. ACCESSION D00097 NID g220067 KEYWORDS SAP; serum amyloid P component. SOURCE Homo sapiens placenta DNA, clone:Lm hSAP-8. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 822) AUTHORS Mantzouranis,E.C., Dowton,S.B., Whitehead,A.S., Edge,M.D., Bruns,G.A. and Colten,H.R. TITLE Human serum amyloid P component. cDNA isolation, complete sequence of pre-serum amyloid P component, and localization of the gene to chromosome 1 JOURNAL J. Biol. Chem. 260 (12), 7752-7756 (1985) MEDLINE 85207828 REFERENCE 2 (bases 1 to 1394) AUTHORS Ohnishi,S., Maeda,S., Shimada,K. and Arao,T. TITLE Isolation and characterization of the complete complementary and genomic DNA sequences of human serum amyloid P component JOURNAL J. Biochem. 100 (4), 849-858 (1986) MEDLINE 87137351 COMMENT In [2], they isolated the human SAP cDNA and genomic DNA clones, elucidated the nucleotide sequences, and assigned the cap site of the SAP gene which was not done in [1]. In addition, they compared the genomic DNA sequence of human SAP with that of the reported human CRP. FEATURES Location/Qualifiers source 1..1394 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Lm hSAP-8" /tissue_type="placenta" TATA_signal 143..149 exon 172..331 /number=1 prim_transcript 172..1210 /note="SAP mRNA and intron" CDS join(268..331,447..1054) /codon_start=1 /product="serum amyloid P component" /db_xref="PID:d1000504" /db_xref="PID:g220068" /translation="MNKPLLWISVLTSLLEAFAHTDLSGKVFVFPRESVTDHVNLITP LEKPLQNFTLCFRAYSDLSRAYSLFSYNTQGRDNELLVYKERVGEYSLYIGRHKVTSK VIEKFPAPVHICVSWESSSGIAEFWINGTPLVKKGLRQGYFVEAQPKIVLGQEQDSYG GKFDRSQSFVGEIGDLYMWDSVLPPENILSAYQGTPLPANILDWQALNYEIRGYVIIK PLVWV" sig_peptide 268..324 mat_peptide join(325..331,447..1051) /product="serum amyloid P component" intron 332..446 /number=1 exon 447..1210 /number=2 conflict 565 /citation=[1] /replace="c" conflict 682 /citation=[1] /replace="t" conflict 683 /citation=[1] /replace="c" conflict 685 /citation=[1] /replace="a" conflict 814 /citation=[1] /replace="a" conflict 943 /citation=[1] /replace="c" polyA_signal 1182..1187 /note="putative" polyA_site 1210 BASE COUNT 375 a 312 c 302 g 405 t ORIGIN 1 agccatcact tgtctctaat aaataactcc cattgatttt ccagctcagg gctcaccact 61 ccttaccgta agcgcaggag gagactggaa aatcactcac atattattgg tgctcttcct 121 cccccatcct cacccaaggt gcatataaac cctgaataac ctgaagtcta agggcatgaa 181 tatcagacgc tagggggaca gccactgtgt tgtctgctac cctcatcctg gtcactgctt 241 ctgctataac agccctaggc caggaatatg aacaagccgc tgctttggat ctctgtcctc 301 accagcctcc tggaagcctt tgctcacaca ggtaaggagg tgaaggaatg gtcaagaatc 361 ataaagtgag aaaataggtt gaagctgaga tatcttttcc ctgcatttat actgaaggtc 421 attatctttc tttctttatc ccgcagacct cagtgggaag gtgtttgtat ttcctagaga 481 atctgttact gatcatgtaa acttgatcac accgctggag aagcctctac agaactttac 541 cttgtgtttt cgagcctata gtgatctctc tcgtgcctac agcctcttct cctacaatac 601 ccaaggcagg gataatgagc tactagttta taaagaaaga gttggagagt atagtctata 661 cattggaaga cacaaagtta catccaaagt tatcgaaaag ttcccggctc cagtgcacat 721 ctgtgtgagc tgggagtcct catcaggtat tgctgaattt tggatcaatg ggacaccttt 781 ggtgaaaaag ggtctgcgac agggttactt tgtggaagct cagcccaaga ttgtcctggg 841 gcaggaacag gattcctatg ggggcaagtt tgataggagc cagtcctttg tgggagagat 901 tggggatttg tacatgtggg actctgtgct gcccccagaa aatatcctgt ctgcctatca 961 gggtacccct ctccctgcca atatcctgga ctggcaggct ctgaactatg aaatcagagg 1021 atatgtcatc atcaaaccct tggtgtgggt ctgaggtctt gactcaacga gagcacttga 1081 aaatgaaatg actgtctaag agatctggtc aaagcaactg gatactagat cttacatctg 1141 cagtctttct tctttgaatt tcctatctgt atgtctgcct aattaaaaaa atatatattg 1201 tattatgcta cctgcatttg tttagtgctt gtcatagtcc catatcttta tcttatgtct 1261 actacttatc tatctactaa ttggtgtttc attggtaatt ggtgtttcat tatcctgaaa 1321 actccaattg ccaagtacgg ggaggaaaac ctgtaagtaa ctagaaagat atatcgcaaa 1381 gccagagcac tcaa // LOCUS HUMPAP 4497 bp DNA PRI 07-JAN-1995 DEFINITION Homo sapiens pancreatits-associated protein (PAP) gene, complete cds. ACCESSION L15533 NID g482908 KEYWORDS pancreatitis-associated protein. SOURCE Homo sapiens adult blood DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4497) AUTHORS Dusetti,N.J., Frigerio,J.M., Fox,M.F., Swallow,D.M., Dagorn,J.C. and Iovanna,J.L. TITLE Molecular cloning, genomic organization, and chromosomal localization of the human pancreatitis-associated protein (PAP) gene JOURNAL Genomics 19 (1), 108-114 (1994) MEDLINE 94245143 FEATURES Location/Qualifiers source 1..4497 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /dev_stage="adult" /tissue_type="blood" /map="2" promoter 1..986 /gene="PAP" mRNA join(986..1011,1300..1409,1970..2088,2276..2413, 3040..3166,3445..3731) /gene="PAP" exon 986..1011 /gene="PAP" /number=1 gene join(986..1011,1300..1409,1970..2088,2276..2413, 3040..3166,3445..3731) /gene="PAP" intron 1012..1299 /gene="PAP" /number=1 exon 1300..1409 /gene="PAP" /number=2 CDS join(1334..1409,1970..2088,2276..2413,3040..3166, 3445..3512) /gene="PAP" /codon_start=1 /product="pancreatitis-associated protein" /db_xref="PID:g482909" /translation="MLPPMALPSVSWMLLSCLMLLSQVQGEEPQRELPSARIRCPKGS KAYGSHCYALFLSPKSWTDADLACQKRPSGNLVSVLSGAEGSFVSSLVKSIGNSYSYV WIGLHDPTQGTEPNGEGWEWSSSDVMNYFAWERNPSTISSPGHCASLSRSTAFLRWKD YNCNVRLPYVCKFTD" intron 1410..1969 /gene="PAP" /number=2 exon 1970..2088 /gene="PAP" /number=3 intron 2089..2275 /gene="PAP" /number=3 exon 2276..2413 /gene="PAP" /number=4 intron 2414..3039 /gene="PAP" /number=4 exon 3040..3166 /gene="PAP" /number=5 intron 3167..3444 /gene="PAP" /number=5 exon 3445..3731 /gene="PAP" /number=6 BASE COUNT 1125 a 1121 c 1020 g 1231 t ORIGIN 1 ctgcagcctt gaactcctgg gttcaactga aggtcctcct acctcagcct gctgagtagc 61 taggaccaca agcacacacc accgcaactg gcttaaatta aaatataaat tgtagagata 121 gggtcttaat gtgttgccca ggctgctctt gaactccttg cttcaggtga tcctcccacc 181 tcagcctctc aaagtgctgg gattatagac ctgagccaca gcacctggcc aactgaccta 241 tgattttaca caatggctgc tcttcccttc tttaactatt attcattctt ctttgatcct 301 cattatttga ctgtagtcct tcttatgtct tgttttcctt cattacctct tattctatca 361 cattgccatt gtcattctcc actggggaag ctctttcttg ctgaagactg gaaagacaag 421 tccattcacc tgattttctg taagattgtg gctcatgtat tgacttgtca gacaattctg 481 aagtttcatc aaaattagct atcatgcttg cataatggcc ctgaaccctc actcctacac 541 ttagcttcag taccatctat gtcctcaact gtccatgata cttataattc ccgtaaatct 601 tcacttaaca cctaacattt atttaatctt actaggcaag gtaataagaa atacataggt 661 ttgcctccag aagtgggttc ttaagaaacc caccagagga actcctcttt cagatgtcca 721 cattagaaga tttcatatca catttggtgc cacaggcctt tgacaaggag gatgcagagg 781 aaaaagcaaa cttcacctct tcctagggaa agtgttggcc tgccaacagg aaagaggcaa 841 catctgggaa aatccccagt ctttgccagg aagagtccat gccaacccca ccccatgacc 901 cctgtcctgc ctactcattg tcactcttca ctccaatgtc cctcccccag atcctctata 961 aaatcccact ctttcctgac cagacaaacc ataccatatc ccaccagaga ggtaagtggg 1021 agctgagaga agatgagacc cagggaggag ctactgcaca tgacacagga gaatacatgg 1081 gagggtccct tcctcaggga gcacaggaac tctgagactc agcaagggtg tcctgggagg 1141 gctcggggat gggagagtac acagattcac aactcattca gaactgtaga agatgatgga 1201 tgtgaccaag atcactttag tcctagggga ctagagaagg aaaatgacat gaggcagtgg 1261 ggtatctgtg tgttctccca ctgaccacgc tttctttagt gactcctgat tgcctcctca 1321 agtcgcagac actatgctgc ctcccatggc cctgcccagt gtatcttgga tgctgctttc 1381 ctgcctcatg ctgctgtctc aggttcaagg tgagattgct ttgcctctag cactgggttc 1441 cctatgaatc ctcagagcta acaagaggag gaaggctcct gtgtgtcatg tgaggtaatg 1501 acgtggtgtc taatgaacct gcctgcagtt cttgcatcat ctctccttcc ttcaggttaa 1561 cttgcagtgg gaggctccat ggtggtccac taacagtgga atgagatggc ttccatttag 1621 tcagtggact ctaatataca ctggtgggaa agtggactct aatatacact ggagggtcag 1681 taatgagatg tggggaggga caatgattgg aggacccaat gtagagacag cccagagtga 1741 ggagagtatt gaatggttga ataaggggaa agggtaataa gagactggat ggtgctccat 1801 ttactatggc tattttgaga taaagaattt ctgaaaacat aagggaagat gaaggggtgt 1861 caggaatgtg gtcttcctcc ccaaggacat tcctaggtat tccccaaggt catctcccac 1921 cccaagcccc actcttcatt ttaccctccc ctctcttctt ccacctcagg tgaagaaccc 1981 cagagggaac tgccctctgc acggatccgc tgtcccaaag gctccaaggc ctatggctcc 2041 cactgctatg ccttgttttt gtcaccaaaa tcctggacag atgcagatgt gagtggttag 2101 atgtggtgtt ggaggtgacc ggtctcaggg ggaggagggt ctccattcag gagagttcct 2161 tgggaatgag gatgaacacg tttatctttc acacagtcct cctcccacct acctttgccc 2221 tgccctccct cagcaggtct caggctccct ctcattctct ttgttgccct caaagctggc 2281 ctgccagaag cggccctctg gaaacctggt gtctgtgctc agtggggctg agggatcctt 2341 cgtgtcctcc ctggtgaaga gcattggtaa cagctactca tacgtctgga ttgggctcca 2401 tgaccccaca caggtgccag tatatcctcc ctctctgtta cctctcaagg tgctattgtt 2461 gcccaggccc actccctgtc ccctgtgcct gcccaggaag tacttcaggg agcactggag 2521 ctcagattct ggggaatatt tggggggaaa gggaaggcca tgaagcatct gaagatctga 2581 gttctgtgga ggtctctatc tttcagataa aatcaatctg ccttcctcag gcgtattaca 2641 taattctcat atgaggctgg gttaacaatt ctctgagctt catggagtct ttgcctacta 2701 ttctgaagga actcttaatg aagataggat caatttttgt ccccatacag aactgacatt 2761 acttttgagg ttcacaagct aatcacaaat gctacatcaa ttattgttct gcaaataata 2821 tattaccttg agttgttcca aaggtcttat gtttattggc tggaattttc caatagcaat 2881 gaggagtcaa ggaagagttt cctactcacc ggcagcatct ggaatagcag accaactttc 2941 ctcatgctgg ggagcaaatc aggtgttgca gctaaggggc catgcaagaa gagctgcaat 3001 ggccattccc ttcacctggc tacctcctct actctacagg gcaccgagcc caatggagaa 3061 ggttgggagt ggagtagcag tgatgtgatg aattactttg catgggagag aaatccctcc 3121 accatctcaa gccccggcca ctgtgcgagc ctgtcgagaa gcacaggtaa gaaacagagg 3181 agctgcctct tcccagtgtt ttccatctca tcccccattc ctgggtctga ccttcaggaa 3241 atcttcctga gctagaaaat acaatgttag tgtgtcttct cttatctcct ctcttctcca 3301 ctttctttga atctctctcc tggattggga cactggtgaa ggtgagggag aggctttaac 3361 ttctaggcta aaacctggga tgccccttca ttggattcac aagcttcctc agccccattc 3421 catttatgtc ttctgtctct ccagcatttc tgaggtggaa agattataac tgtaatgtga 3481 ggttacccta tgtctgcaag ttcactgact agtgcaggag ggaagtcagc agcctgtgtt 3541 tggtgtgcaa ctcatcatgg gcatgagacc agtgtgagga ctcaccctgg aagagaatat 3601 tcgcttaatt cccccaacct gaccacctca ttcttatctt tcttctgttt cttcctcccc 3661 gctgtcattt cagtctcttc attttgtcat acggcctaag gctttaaaga gcaataaaat 3721 ttttagtctg cacttgtttg tcttgtatat gccagtgtca tagccatact ctgagaagga 3781 caaagtgttt gagtggagga aactttatgg gtcttgcttc ttccctattc acccaggcct 3841 ctagggaaaa tgatgaagtg tgcatcccta ccagtgtgtt atgatgaggg tgtgggtcct 3901 gctcatgtag gatttgtgtt gtggagagat gaggacattt ctctcccgcg tacttactgc 3961 cctcccattc ccgtagccca aacctgacag tgtgacatga acagattagg aggctctgat 4021 ggtgcttaga atagtacttc tcagagaatg gcatcagcag gatggtagat aggactttcc 4081 agctcttgaa ccttcacaga aacattcatt tgaactacta cccattaaaa tggaaatacc 4141 ttcacaagag ctaacaatcc caagtgagtg attaaagcat ctgaatgttg caaaaaataa 4201 gaagggatgc atcgaagagg gtagaaagaa gacttttaca ttatttatat cacccctcca 4261 tcaatctcag taagcacagc atggagagac attccctaaa cttggggaaa gagagtgaaa 4321 taagcacttg agttttccat ggaccctaac actaggtttg cctcagtaag acccagtggc 4381 ctctgactcc aggcagacac ccttggactt agactccagg ctgccttgat gccaggccag 4441 gctctgtggc cccaggctct gtgaccccag gctccaggtc agcccccatg actgcag // LOCUS HUMIL1B 7824 bp DNA PRI 09-AUG-1995 DEFINITION Human interleukin 1-beta (IL1B) gene, complete cds. ACCESSION M15840 NID g186281 KEYWORDS Alu repeat; interleukin 1-beta. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7824) AUTHORS Bensi,G., Raugei,G., Palla,E., Carinci,V., Tornese Buonamassa,D. and Melli,M. TITLE Human interleukin-1 beta gene JOURNAL Gene 52 (1), 95-101 (1987) MEDLINE 87248099 REFERENCE 2 (bases 1 to 7824) AUTHORS Bensi,G. TITLE Direct Submission JOURNAL Submitted (26-MAY-1987) G. Bensi, Sclavo Research Center, Siena, Italy FEATURES Location/Qualifiers source 1..7824 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Lambda-hilb[4,8]" /map="2q13-q21" prim_transcript 374..7380 /gene="IL1B" /note="IL1b mRNA and introns; G00-120-094" gene 374..7380 /gene="IL1B" intron 446..908 /gene="IL1B" /note="IL1b intron A; G00-120-094" CDS join(924..970,1536..1587,3576..3777,4323..4487,5723..5853, 6570..6782) /gene="IL1B" /codon_start=1 /db_xref="GDB:G00-120-094" /product="interleukin 1-beta" /db_xref="PID:g386816" /translation="MAEVPELASEMMAYYSGNEDDLFFEADGPKQMKCSFQDLDLCPL DGGIQLRISDHHYSKGFRQAASVVVAMDKLRKMLVPCPQTFQENDLSTFFPFIFEEEP IFFDTWDNEAYVHDAPVRSLNCTLRDSQQKSLVMSGPYELKALHLQGQDMEQQVVFSM SFVQGEESNDKIPVALGLKEKNLYLSCVLKDDKPTLQLESVDPKNYPKKKMEKRFVFN KIEINNKLEFESAQFPNWYISTSQAENMPVFLGGTKGGQDITDFTMQFVSS" exon <924..970 /gene="IL1B" /note="interleukin-1 beta, (first expressed exon); G00-120-094" /number=2 intron 971..1535 /gene="IL1B" /note="IL1b intron B; G00-120-094" exon 1536..1587 /gene="IL1B" /note="interleukin-1 beta; G00-120-094" /number=3 intron 1588..3575 /gene="IL1B" /note="IL1b intron C; G00-120-094" repeat_region 3110..3429 /note="Alu repeat copy A; G00-120-094" exon 3576..3777 /gene="IL1B" /note="interleukin-1 beta; G00-120-094" /number=4 intron 3778..4322 /gene="IL1B" /note="IL1b intron D; G00-120-094" exon 4323..4487 /gene="IL1B" /note="interleukin-1 beta; G00-120-094" /number=5 intron 4488..5722 /gene="IL1B" /note="IL1b intron E; G00-120-094" exon 5723..5853 /gene="IL1B" /note="interleukin-1 beta; G00-120-094" /number=6 intron 5854..6569 /gene="IL1B" /note="IL1b intron F; G00-120-094" exon 6570..>6782 /gene="IL1B" /note="interleukin-1 beta; G00-120-094" /number=7 repeat_region 7280..7379 /note="Alu repeat copy B; G00-120-094" BASE COUNT 2099 a 1905 c 1624 g 2195 t 1 others ORIGIN 242 bp upstream of HindIII site; chromosome 2q13-q21. 1 aaagtatgtg catgtataaa tctgtgtgtc ttccactttg tcccacatat actaaattta 61 aacattcttc taacgtggga aaatccagta ttttaatgtg gacatcaact gcacaacgat 121 tgtcaggaaa acaatgcata tttgcactgg tgatacattt gcaaaatctg tcatagtttg 181 ctactccttg cccttccatg aaccagagaa ttatctcagt ttattagtcc cctcccctaa 241 gaagcttcca ccaatactct tttccccttt cctttaactt gattgtgaaa tcaggtattc 301 aacagagaaa tttctcagcc tcctacttct gcttttgaaa gccataaaaa cagcgaggga 361 gaaactggca gataccaaac ctcttcgagg cacaaggcac aacaggctgc tctgggattc 421 tcttcagcca atcttcattg ctcaagtatg actttaatct tccttacaac taggtgctaa 481 gggagtctct ctgtctctct gcctctttgt gtgtatgcat attctctctc tctctctctt 541 tctttctctg tctctccctc tccttccctc tctgcctccc tctctcagct ttttgcaaaa 601 agccaggtgt aatataatgc ttatgactcg ggaaatattc tgggaatgga tactgcttat 661 ctaacagctg acaccctaaa ggttagtgtc aaagcctctg ctccagctct cctagcctaa 721 tacattgcta gttggggttt ggtttagcaa atgcttttct ctagacccaa aggacttctc 781 tttcacacat tcattcattt actcagagat catttctttg catgactgcc atgcactgga 841 tgctgagaga aatcacacat gaacgtagcc gtcatgggga agtcactcat tttctccttt 901 ttacacaggt gtctgaagca gccatggcag aagtacctga gctcgccagt gaaatgatgg 961 cttattacag gtcagtggag acgctgagac cagtaacatg agcaggtctc ctctttcaag 1021 agtagagtgt tatctgtgct tggagaccag atttttcccc taaattgcct ctttcagtgg 1081 caaacagggt gccaagtaaa tctgatttaa agactacttt cccattacaa gtccctccag 1141 ccttgggacc tggaggctat ccagatgtgt tgttgcaagg gcttcctgca gaggcaaatg 1201 gggagaaaag actccaagcc cacaatacaa ggaatccctt tgcaaagtgt ggcttggagg 1261 gagagggaga gctcagattt tagctgactc tgctgggcta gaggttaggc ctcaagatcc 1321 aacagggagc ncccagggtg cccacctgcc aggcctagaa tctgccttct ggactgttct 1381 gcgcatatca ctgtgaaact tgccaggtgt ttcaggcagc tttgagaggc aggctgtttg 1441 cagtttctta tgaacagtca agtcttgtac acagggaagg aaaaataaac ctgtttagaa 1501 gacataattg agacatgtcc ctgtttttat tacagtggca atgaggatga cttgttcttt 1561 gaagctgatg gccctaaaca gatgaaggta agactatggg tttaactccc aacccaagga 1621 agggctctaa cacagggaaa gctcaaagaa gggagttctg ggccactttg atgccatggt 1681 attttgtttt agaaagactt taacctcttc cagtgagaca caggctgcac cacttgctga 1741 cctggccact tggtcatcat atcaccacag tcactcacta acgttggtgg tggtggccac 1801 acttggtggt gacaggggag gagtagtgat aatgtttccc atttcatagt aggaagacaa 1861 ccaagtcttc aacataaatt tgattatcct tttaagagat ggattcagcc tatgccaatc 1921 acttgagtta aactctgaaa ccaagagatg atcttgagaa ctaacatatg tctacccctt 1981 ttgagtagaa tagttttttg ctacctgggg tgaagcttat aacaacaaga catagatgat 2041 ataaacaaaa agatcaattg agacttgaaa gaaaaccatt cacttgctgt ttgaccttga 2101 caagtcattt tacccgcttt ggacctcatc tgaaaaataa agggctgagc tggatgatct 2161 ctgagattcc agcatcctgc aacctccagt tctgaaatat tttcagttgt agctaagggc 2221 atttgggcag caaatggtca tttttcagac tcatccttac aaagagccat gttatattcc 2281 tgctgtccct tctgttttat atgatggctc agtagccttc ctagtggccc agccatcagc 2341 ctagctaggt cagttgtgca ggttggaggc agccactttt ctctggcttt attttattcc 2401 agtttgtgat agcctcccct agcctcataa tccagtcctc aatcttgtta aaaacatatt 2461 tctttagaag ttttaagact ggcataactt gttggctgca gctgtgggag gagcccattg 2521 gcttgtctgc ctggcctttg cccccattgc ctcttccagc agcttggcgc tgctccaggc 2581 aggaaattct ctcctgctca actttctttt gtgcacttac aggtctcttt aactgtcttt 2641 caagcctttg aaccattatc actgccttaa ggcaacctca gtgaagcctt aatacggagc 2701 ttctctgaat aagaggaaag tggtaacatt tcacaaaaag tactctcaca ggatttgcag 2761 aatgcctatg agacagtgtt atgaaaaagg aaaaaaaaga acagtgtaga aaaattgaat 2821 acttgctgag tgagcatagg tgaatggaaa atgttatggt catctgcatg aaaaagcaaa 2881 tcatagtgtg acagcattag ggatacaaaa agatatagag aaggtataca tgtatggtgt 2941 aggtggggca tgtacaaaaa agatgaacaa agtagaaatg ggatttattc taaagaatag 3001 cctgtaaggt gtcagaaagc ccacattcta gtcttgagtc tgtttctaac ctgctgtgtg 3061 cccttgagta cacacttaac ctcttgagct tcagagaggg ataatctttt tattttattt 3121 tattttattt tgttttgttt tgttttgttt tgttttatga gacagagtct cactctgttg 3181 cccaggctgg agtgcagtgg tacaatcttg gcttactgca tcctccacct cctgagttca 3241 agcgattctc cttcctcagt ctcctgaata gctaggatta caggtgcacc ccaccacacc 3301 cagctaattt ttgtattttt agtagagaag gggtttcgcc atgttggcca ggctggtttt 3361 gaagtcctga cctaaatgat tcatccacct cggcttccca aagtgctggg attacaggca 3421 tgagccacca cgcctggccc agagagggat gatctttaga agctcgggat tctttcaagc 3481 cctttcctcc tctctgagct ttctactctc tgatgtcaaa gcatggttcc tggcaggacc 3541 acctcaccag gctccctccc tcgctctctc cgcagtgctc cttccaggac ctggacctct 3601 gccctctgga tggcggcatc cagctacgaa tctccgacca ccactacagc aagggcttca 3661 ggcaggccgc gtcagttgtt gtggccatgg acaagctgag gaagatgctg gttccctgcc 3721 cacagacctt ccaggagaat gacctgagca ccttctttcc cttcatcttt gaagaaggta 3781 gttagccaag agcaggcagt agatctccac ttgtgtcctc ttggaagtca tcaagcccca 3841 gccaactcaa ttcccccaga gccaaagccc tttaaaggta gaaggcccag cggggagaca 3901 aaacaaagaa ggctggaaac caaagcaatc atctctttag tggaaactat tcttaaagaa 3961 gatcttgatg gctactgaca tttgcaactc cctcactctt tctcaggggc ctttcactta 4021 cattgtcacc agaggttcgt aacctccctg tgggctagtg ttatgaccat caccatttta 4081 cctaagtagc tctgttgctc ggccacagtg agcagtaata gacctgaagc tggaacccat 4141 gtctaatagt gtcaggtcca tgttcttagc caccccactc ccagcttcat ccctactggt 4201 gttgtcatca gactttgacc gtatatgctc agtgtcctcc aagaaatcaa attttgccgc 4261 ctcgcctcac gaggcctgcc cttctgattt tatacctaaa caacatgtgc tccacatttc 4321 agaacctatc ttcttcgaca catgggataa cgaggcttat gtgcacgatg cacctgtacg 4381 atcactgaac tgcacgctcc gggactcaca gcaaaaaagc ttggtgatgt ctggtccata 4441 tgaactgaaa gctctccacc tccagggaca ggatatggag caacaaggta aatggaaaca 4501 tcctggtttc cctgcctggc ctcctggcag cttgctaatt ctccatgttt taaacaaagt 4561 agaaagttaa tttaaggcaa atgatcaaca caagtgaaaa aaaatattaa aaaggaatat 4621 acaaactttg gtcctagaaa tggcacattt gattgcactg gccagtgcat ttgttaacag 4681 gagtgtgacc ctgagaaatt agacggtcaa gcactcccag gaccatgtcc acccaagtct 4741 cttgggcata gtgcaatgtc aattcttcca caatatcccc tcatttgatg gacatggcct 4801 aactgcctgt gggttctctc ttcctgttgt tgaggctgaa acaagagtgc tggagcgata 4861 atgtgtccat cccctcccca gtcttccccc cttgccccaa cagtccgtcc cacccaatgc 4921 aggtggttct tgtagggaaa ttttaccgcc cagcaggaac ttatatctct ccgctgtaac 4981 gggcaaaagt ttcaagtgcg gtgaacccat cattagctgt ggtgatctgc ctggcatcgt 5041 gccacagtag ccaaagcctc tgcacaggag tgtgggcaac taaggctgct gactttgaag 5101 gacagcctca ctcaggggga agctatttgc tctcagccag gccaagaaaa tcctgtttct 5161 ttggaatcgg gtagtaagag tgatcccagg gcctccaatt gacactgctg tgactgagga 5221 agatcaaaat gagtgtctct ctttggagcc actttcccag ctcagcctct cctctcccag 5281 tttcttccca tgggctactc tctgttcctg aaacagttct ggtgcctgat ttctggcaga 5341 agtacagctt cacctctttc ctttccttcc acattgatca agttgttccg ctcctgtgga 5401 tgggcacatt gccagccagt gacacaatgg cttccttcct tccttccttc agcatttaaa 5461 atgtagaccc tctttcattc tccgttccta ctgctatgag gctctgagaa acctcaggcc 5521 tttgagggga aaccctaaat caacaaaatg accctgctat tgtctgtgag aagtcaagtt 5581 atcctgtgtc ttaggccaag gaacctcact gtgggttccc acagaggcta ccaaattaca 5641 tgtatcctac tcatggggcc taggggttgg ggtgaccctg cactgctgtg tccctaacca 5701 caagaccccc ttctttcttc agtggtgttc tccatgtcct ttgtacaagg agaagaaagt 5761 aatgacaaaa tacctgtggc cttgggcctc aaggaaaaga atctgtacct gtcctgcgtg 5821 ttgaaagatg ataagcccac tctacagctg gaggtaagtg aatgctatgg aatgaagccc 5881 ttctcagcct cctgctacca cttattccca gacaaccacc ttctccccgc ccccatccct 5941 aggaaaagct gggaacaggt ctatttgaca attttgcatt aatgtaaata aatttaacat 6001 aatttttaac tgcgtgcaac cttcaatcct gctgcagaaa attaaatcat tttgccgatg 6061 ttattatgtc ctaccatagt tacaacccca acagattata tattgttagg gctgctctca 6121 tttgatagac accttgggaa atagatgact taaagggtcc cattatcacg tccactccac 6181 tcccaaaatc accaccacta tcacctccag ctttctcagc aaaagcttca tttccaagtt 6241 gatgtcattc taggaccata aggaaaaata caataaaaag cccctggaaa ctaggtactt 6301 caagaagctc tagcttaatt ttcacccccc aaaaaaaaaa aattctcacc tacattatgc 6361 tcctcagcat ttggcactaa gttttagaaa agaagagggc tcttttaaat aaattcacac 6421 agaaagttgg gcccagttac aactcaggag tctggctcct gatcatgtga cctgctcgtc 6481 agtttccttt ctggccaacc caaagaacat ctttcccata gcatctttgt cccttgcccc 6541 acaaaaattc ttctttctct ttcgtgcaga gtgtagatcc caaaaattac ccaaagaaga 6601 agatggaaaa gcgatttgtc ttcaacaaga tagaaatcaa taacaagctg gaatttgagt 6661 ctgcccagtt ccccaactgg tacatcagca cctctcaagc agaaaacatg cccgtcttcc 6721 tgggagggac caaaggcggc caggatataa ctgacttcac catgcaattt gtgtcttcct 6781 aaagagagct gtacccagag agtcctgtgc tgaatgtgga ctcaatccct agggctggca 6841 gaaagggaac agaaaggttt ttgagtacgg ctatagcctg gactttcctg ttgtctacac 6901 caatgcccaa ctgcctgcct tagggtagtg ctaagaggat ctcctgtcca tcagccagga 6961 cagtcagctc tctcctttca gggccaatcc ccagcccttt tgttgagcca ggcctctctc 7021 acctctccta ctcacttaaa gcccgcctga cagaaaccac ggccacattt ggttctaaga 7081 aaccctctgt cattcgctcc cacattctga tgagcaaccg cttccctatt tatttattta 7141 tttgtttgtt tgttttattc attggtctaa tttattcaaa gggggcaaga agtagcagtg 7201 tctgtaaaag agcctagttt ttaatagcta tggaatcaat tcaatttgga ctggtgtgct 7261 ctctttaaat caagtccttt aattaagact gaaaatatat aagctcagat tatttaaatg 7321 ggaatattta taaatgagca aatatcatac tgttcaatgg ttctgaaata aacttcactg 7381 aagaaaaaaa aagggtcttt cctgatcatt gacttgtctt ggatttgaca ctgaacagta 7441 aagacaaaca gggctgtgag agttcttggg ggactaaagc ccacctcctc attgctgagt 7501 gctgcaaagt cacctagaaa tatcccttgg ccaccgaaga ctatcctcct cacccatccc 7561 ctttatttct gttgttcaac agaaggatat tcagtgcaca tctggaacag gatcagctga 7621 agcactgcag ggagtcagga ctggtagtaa cagctaccag tgatttatct atcaatgcac 7681 caaacatctg ttgagcaagc gctatgtacg aggagctggg agtacagaga tgagaacagt 7741 cacaagtccc tcctcagata ggagaggcag ctagttataa gcagaaacaa ggtaacatga 7801 caagtagagt aagataaaga acaa // LOCUS HUMCAPG 3734 bp DNA PRI 31-OCT-1994 DEFINITION Human cathepsin G gene, complete cds. ACCESSION J04990 NID g179914 KEYWORDS cathepsin G; serine protease. SOURCE Human lung fibroblast DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3734) AUTHORS Hohn,P.A., Popescu,N.C., Hanson,R.D., Salvesen,G. and Ley,T.J. TITLE Genomic organization and chromosomal localization of the human cathepsin G gene JOURNAL J. Biol. Chem. 264 (23), 13412-13419 (1989) MEDLINE 89340411 COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by T.J.Ley, 13-JUN-1989. FEATURES Location/Qualifiers source 1..3734 /organism="Homo sapiens" /db_xref="taxon:9606" /map="14q11.2" CAAT_signal 253..256 TATA_signal 293..297 prim_transcript 322..3020 /note="cathepsin G mRNA and introns" gene join(350..404,1161..1308,1763..1898,2074..2328,2763..2936) /gene="CTSG" CDS join(350..404,1161..1308,1763..1898,2074..2328,2763..2936) /gene="CTSG" /note="cathepsin G" /codon_start=1 /db_xref="GDB:G00-119-822" /db_xref="PID:g179915" /translation="MQPLLLLLAFLLPTGAEAGEIIGGRESRPHSRPYMAYLQIQSPA GQSRCGGFLVREDFVLTAAHCWGSNINVTLGAHNIQRRENTQQHITARRAIRHPQYNQ RTIQNDIMLLQLSRRVRRNRNVNPVALPRAQEGLRPGTLCTVAGWGRVSMRRGTDTLR EVQLRVQRDRQCLRIFGSYDPRRQICVGDRRERKAAFKGDSGGPLLCNNVAHGIVSYG KSSGVPPEVFTRVSSFLPWIRTTMRSFKLLDQMETPL" exon <350..404 /gene="CTSG" /note="cathepsin G" /number=1 intron 405..1160 /note="cathepsin G Intron A" exon 1161..1308 /gene="CTSG" /number=2 intron 1309..1762 /note="cathepsin G Intron B" exon 1763..1898 /gene="CTSG" /number=3 intron 1899..2073 /note="cathepsin G Intron C" exon 2074..2328 /gene="CTSG" /number=4 intron 2329..2762 /note="cathepsin G Intron D" exon 2763..>2936 /gene="CTSG" /note="cathepsin G" /number=5 polyA_signal 2990..2995 BASE COUNT 990 a 960 c 959 g 825 t ORIGIN Chromosome 14q11.2. 1 cttgctttgc tggagtattc tggtaatttg atgggttgag ggttctggac acaatgcccc 61 aagccccttc cttgttgtgc tgggttccta tttctgctct cggcactgac ttagcagctg 121 ctcaagagct cactatgttg gcttggatta cacggtctca cccacatctc cggcagtttg 181 tgggcaaacc tcctgagcag ccttgggtga tgaaaccttt catggtagca ggagaatggg 241 actgtgaatt ctcaatcccc tgtccccacc ccttccttcc tctctcaggg ccttaaagtc 301 taggaggagg aagcacagca gcaactgact gggcagcctt tcaggaaaga tgcagccact 361 cctgcttctg ctggcctttc tcctacccac tggggctgag gcaggtgagt gaccatcccc 421 accctcagag gcctgacctc atcccataga ttcttgagcc aaattgcctt ggtatatcct 481 aattctgtac tgttgagcaa gttatttgaa tttgtgtttc ctcatctata aaatgagaat 541 aatattaata ccgatcttgc agagttgcca tgagagttaa ataagttaga gtatttaaat 601 gtcttggaat tgcccgcaca ctataagtgc tataaaaaca tgctttgtgt aaataatttg 661 gcagcatgtg tcagacccta cctaggaggt aagaatacag caataacagt accatcagct 721 catgtctaga tttttaaaca ccagtcccac gtggtcttga attggactca gagggctctg 781 ggaagctcca tgaggataaa agtataaggg aacttcagga acaatcctgt acttacagca 841 aagcattctc ctcaatacct gaggctgaag ctggccttgc ctggaacaag ggttgttctc 901 cctcttttgg agaggaggag ggaggtgagg cctaggatgg ggaaaagggc tcctttcaag 961 acagcagtgt ttcctgtaga accctggagc cccctcccaa tctgctgccc catagactcc 1021 aagcctcagc accatctcct ccctctcctg caccctctct cctgccgtcc ccatcttcca 1081 gcctttctgg agccaccaat ctggtaccca cattgcaggt tcagcaagca tagagctaag 1141 tgccaaatgc ttccttccag gggagatcat cggaggccgg gagagcaggc cccactcccg 1201 cccctacatg gcgtatcttc agatccagag tccagcaggt cagagcagat gtggagggtt 1261 cctggtgcga gaagactttg tgctgacagc agctcattgc tggggaaggt gaggagctaa 1321 ggaacttcct ggccagccag gaacacagcc ctgcggagct cttcggtgga agagccatct 1381 gaaagaagag ttgtagcaat gaaagggtga aagaaagacc aagtgagtct ttgcgggagg 1441 gaacaggcca gtgtaaatga ggaggaaagg aggataagat caaaaagagc aagaggaaga 1501 gatggaagac acatattggg gctcaaaata taaactcagg ctatttatca acttaatctg 1561 gggaagtaaa cctgaaggca agtaccaccc tgtcatccct agctcagagc tgctgagaaa 1621 gaggatacag ctgagcccca gggccctccc atcccctcga ttctggttag ctgcagtctt 1681 gccctccccg tgctgtctgc ctaccctgca gagctggtgg accatagctc ctgcagccca 1741 gacctacctc ttgcttttgc agcaatataa atgtcaccct gggcgcccac aatatccaga 1801 gacgggaaaa cacccagcaa cacatcactg cgcgcagagc catccgccac cctcaatata 1861 atcagcggac catccagaat gacatcatgt tattgcaggt accacctacc tggccctctg 1921 gctccttcct agtgtgtccg gggacaatgg aggaggaagt gagggcaagg ctccggggtg 1981 gcggggaggg catgggatgt gtactgcacc agcgaccccc gagccttggc tggaggcccc 2041 agctgagcgg gaacgcctac attcttcctc cagctgagca gaagagtcag acggaatcga 2101 aacgtgaacc cagtggctct gcctagagcc caggagggac tgagacccgg gacgctgtgc 2161 actgtggccg gctggggcag ggtcagcatg aggaggggaa cagatacact ccgagaggtg 2221 cagctgagag tgcagaggga taggcagtgc ctccgcatct tcggttccta cgacccccga 2281 aggcagattt gtgtggggga ccggcgggaa cggaaggctg ccttcaaggt aaggcatggg 2341 cattggccaa cacaccccgg gagagagggg cccgtgcaga gccaggcagt gcgaacagat 2401 tccatcccca cagcctcagc ctggcagcca gaccagggtg ggctggggat tgttttcccc 2461 atcaacctgg tctctggggg aataggagga agacccacaa cacatacata ggcaacattc 2521 tcctggagaa gggagaggta ccttgactca gattgggctg gagacagtaa ttaaggcaga 2581 gctgaagtcc agcgaccgaa aagatccaga ggcttggctc ctgtacccca ccgatcttcc 2641 atctcacaca cacccagcaa ttgaaggggc ccacccaccc ctgccttccc tgagagcccg 2701 gagctcaggg aagcaggagc agggaggcct gtctcagtct cccttctcct ctctacctac 2761 agggggattc cggaggcccc ctgctgtgta acaatgtggc ccacggcatc gtctcctatg 2821 gaaagtcgtc aggggttcct ccagaagtct tcaccagggt ctcaagtttc ctgccctgga 2881 taaggacaac aatgagaagc ttcaaactgc tggatcagat ggagaccccc ctgtgactga 2941 ctcttcttct cggggacaca ggccagctcc acagtgttgc cagagcctta ataaacgtcc 3001 acagagtata aataaccaat tcctcatttg ttcattaaac gtcattcagt acttagtttg 3061 tttggattgc tacaacaaaa tagcacaaat tgggtggctt ataaataaca aatttatttc 3121 tcacaggtct agaggctaag aagtctaaga tcaagtcact agcagattca gtgtctaatt 3181 agggcccatt ttctggttca cagacaacca tcctctccct gtgtccacat atggcaaaag 3241 gggcaaggga attctctgat gtctctttta caagggacct agtctcattc aaagagctca 3301 gcttttacga cctaatcaca tcccaaaggc cccacctaat gccatcacga cattggggat 3361 taggtctggg aaacataggg aaagagtgtc tctacacaaa aattttaaaa ttagccaggc 3421 atggtggcat gtgtctatag tcccagctac ttgggaggct aaagtggaag gattagttga 3481 acccacgagg ttgaggcttc agtgaaccat gcactccagc ctgagcgaca gagcaagaca 3541 ccattccaag aaagaaaaaa aaaaagactg gcaggccaaa aagacagaac tgaaattcca 3601 aaaaaaaaga cctactttag tgtatgaaaa aggtggcatc tcaaatcact gggaaacaat 3661 ggaatttttg aataaatagc attagaacca acctagatag atatttggag gggatggaag 3721 gtataattgg atcc // LOCUS HUMCTLA1 4505 bp DNA PRI 23-MAY-1995 DEFINITION Human cytotoxic T-lymphocyte-associated serine esterase 1 (CTLA1) gene, complete cds. ACCESSION M38193 NID g825611 KEYWORDS cytotoxic T-lymphocyte-associated serine esterase 1. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4505) AUTHORS Caputo,A., Sauer,D.E. and Rowe,P.B. TITLE Nucleotide sequence and genomic organization of a human T lymphocyte serine protease gene JOURNAL J. Immunol. 145 (2), 737-744 (1990) MEDLINE 90308320 COMMENT On May 23, 1995 this sequence version replaced gi:306681. FEATURES Location/Qualifiers source 1..4505 /organism="Homo sapiens" /db_xref="taxon:9606" /map="14q11.2" mRNA join(877..964,2008..2155,2611..2746,2952..3212,3856..4112) /gene="CTLA1" /note="G00-120-744" exon 877..964 /gene="CTLA1" /note="G00-120-744" /number=1 gene join(877..964,2008..2155,2611..2746,2952..3212,3856..4112) /gene="CTLA1" CDS join(910..964,2008..2155,2611..2746,2952..3212,3856..3999) /gene="CTLA1" /codon_start=1 /db_xref="GDB:G00-120-744" /product="cytotoxic T-lymphocyte-associated serine esterase 1" /db_xref="PID:g306682" /translation="MQPILLLLAFLLLPRADAGEIIGGHEAKPHSRPYMAYLMIWDQK SLKRCGGFLIQDDFVLTAAHCWGSSINVTLGAHNIKEQEPTQQFIPVKRPIPHPAYNP KNFSNDIMLLQLERKAKRTRAVQPLRLPSNKAQVKPGQTCSVAGWGQTAPLGKHSHTL QEVKMTVQEDRKCESDLRHYYDSTIELCVGDPEIKKTSFKGDSGGPLVCNKVAQGIVS YGRNNGMPPRACTKVSSFVHWIKKTMKRY" exon 2008..2155 /gene="CTLA1" /note="G00-120-744" /number=2 exon 2611..2746 /gene="CTLA1" /note="G00-120-744" /number=3 exon 2952..3212 /gene="CTLA1" /note="G00-120-744" /number=4 exon 3856..4112 /gene="CTLA1" /note="G00-120-744" /number=5 BASE COUNT 1127 a 1257 c 1067 g 1054 t ORIGIN 1 tttggctgcc tggcatgctt cctcacttca tatggtatca gcaatttagc accacaaacg 61 tcctttagag aaccagccct ttctcattct tggttctagt ggcttgagta gactgacccc 121 agcctaccca aagtggattt gactcctagc aattcattaa tctagcccaa taaaatgtca 181 agtacaggac ttttattgaa agcattcaga aaagaggtgg actctcacac taaacatttg 241 taactaaata agggatgtta gaaattctct agaaaggaag ctatgataat aaatgggttg 301 ctagatgggt ctagtagatg gtggccatgc tttgttactg ccttgtgtat tgtgctacca 361 tagccctccc caaactgtac tctggctcct ggcatttccg tctcttcaac cagatggtca 421 gctctctaag tgaaggagac acatctccaa catgcttggt tctagcacaa cagaagggct 481 caaacacata cctgctaaag aaactatcct gatggattta gcagcatggc catgaggcat 541 tggcggttct atcactggga actcaggttt ctggtgctcc agtacctcta ctggctgata 601 ccacatccta cagttcactt cataggcttg ggttcctgct ctgggctgaa taggtggtcc 661 actctgagtc atcagctgtg ggtgatgatg tggtcactgc atgattctca cacaagcacc 721 cagaggacgt catcaggcag aggcagtggg ggtgggcagc atttacagaa aatctgtgat 781 gagacaccac aaaaccagag gggaacatga agtcactgag cctgctccac ctctttcctc 841 tcccaagagc taaaaagaga gcaaggagga aacaacagca gctccaacca gggcagcctt 901 cctgagaaga tgcaaccaat cctgcttctg ctggccttcc tcctgctgcc cagggcagat 961 gcaggtgagt gaccgtctcc accctcgggg gcccaacccc atcccacagg tctcctgccc 1021 tttctccaca ttcctgatcc atctatctac caggaatgtt ctgaactcca gctcccattc 1081 taccaagacc ccccaagtgt gatgctggat aagctatcag caggaatggc agagcagcag 1141 gccattctca agaagagcca gtgggtacta tcccttcccc agagcccacc tttgtcacct 1201 ggagagtagg actttcctag aagtaaatgg cagaggatgg gaaactagaa aagagaaata 1261 ttaaattatt ctagagtagg cctggcttct gtttctggga taagacaggt gcttctctca 1321 ctgtctactt aggagagaaa cccagagctc agctgacagc agaattggta caatcactgt 1381 cctcagaaca ctgttaatgt gtttgctcag tcccattctc caactctgct tttcttccct 1441 ggcctttggt ggctcccctc tttccaagga tgaggcacta cggcaggccc cagcttccct 1501 gctttctaga attccaccag cactgctcta ccagccctca tccagaggct aactggagcc 1561 agtccatcat gcagccatga acatttactg ggcacccact acatgtcagg ctctaggaaa 1621 caggatatga cagtatctag atccctccac ttacaccctg gccattagaa agcagcacta 1681 tcctagacac cacaggactc ataagggtct tggaaactca cctgaaacaa aacaaagtca 1741 ggagaggaat gatcaggagc ctctgggatt tcactgtccc taagacaggt atgctcgcct 1801 tcaactacat atggaagaaa gatttacaga ccaaagtctg ctgttcttcc ctttttcaga 1861 gcaggaaatt gaagcccctt cctccaggcc actcccaact ccaggctatc ccaggctccc 1921 aaatgcccag gagttctgga gccactaagc aggtgcccac ccagcagatt ccatgggtgc 1981 ccacaagcag acagactttt ccttcagggg agatcatcgg gggacatgag gccaagcccc 2041 actcccgccc ctacatggct tatcttatga tctgggatca gaagtctctg aagaggtgcg 2101 gtggcttcct gatacaagac gacttcgtgc tgacagctgc tcactgttgg ggaaggtgag 2161 gagcagaaaa cagcccacac cctcctggaa acactccaca gagacccctg ccttcttccc 2221 aaggagctcc ctgggctcct gtgaacacac atgccaggag gtctccttag agggtgagaa 2281 aagggcagtt aagtttgtgg agagagggga aggttggttc cagaggtgct gctgaagtaa 2341 gaaacagcag agtgaccaag cctgccatat ttagaactgg gggcatactt tggcatagaa 2401 tacaaactga agcaattcca cctgtgtttc tagggggaac cgaaccctga gaaacctggt 2461 gcaattacca gaattccaat tcctggggac cgactgtccc cttaatttcc cctcagctgc 2521 agccctgccc cagctgtcac ctgctcttca ctgtctctgg gctgtatacc ctgtgactcc 2581 acccccatcc tcactctgct ctctgtgcag ctccataaat gtcaccttgg gggcccacaa 2641 tatcaaagaa caggagccga cccagcagtt tatccctgtg aaaagaccca tcccccatcc 2701 agcctataat cctaagaact tctccaacga catcatgcta ctgcaggtga ggcacactcc 2761 tgccactctt gctcttcttg gtccagttgg ttccactccc cctggaatgc cggcccttcc 2821 ctcctttcca tcctggcctc ttggttagtt cctatgcctc agaggagagg gaagattgtg 2881 cagccccatc actgtgtcgg ggcccagaag ttcgttggct gacctggact ttcttgcctc 2941 ttccccacca gctggagaga aaggccaagc ggaccagagc tgtgcagccc ctcaggctac 3001 ctagcaacaa ggcccaggtg aagccagggc agacatgcag tgtggccggc tgggggcaga 3061 cggcccccct gggaaaacac tcacacacac tacaagaggt gaagatgaca gtgcaggaag 3121 atcgaaagtg cgaatctgac ttacgccatt attacgacag taccattgag ttgtgcgtgg 3181 gggacccaga gattaaaaag acttccttta aggtaagact atgcacctgc ctggattggc 3241 tcttgggaga aagatgtttg gggaatatct gagacctgga gactcaagta gtgggggact 3301 ccttcaccca ctagactgtg atatttctct ctggaaagag aagaggggac tagactgagc 3361 tggggagaaa ttagggcctc tgcaaactta ccaggaggct tatggtggat ggtgcttctt 3421 tggaaggatg aatttgcaac actccaccca ctccaggtca cagatattag gaaactgtgc 3481 ccactggggg tgcagtaatt ataaccaggt gtgtcttcag aggctggtac ccaacgtggt 3541 taatgggctg gtcctccatg gtggacatca gccctccttg cccacttctg ggtccttaaa 3601 cagccaacgg tcccacatac ctccgatctc aggatctggg ggacatgacg gaggctggcc 3661 cctgggatga ggtgaagcag taacaatgtc cagggccaga gcttggcagc tgggggccac 3721 cagcggcctg ccctgccctc tggtctccca catgtaggct gtgcaagttg gccttttcta 3781 aaagggggct tgagatggaa gagagggcag gacccggagg agcatcagct cagtccttcc 3841 actctctatt cacaggggga ctctggaggc cctcttgtgt gtaacaaggt ggcccagggc 3901 attgtctcct atggacgaaa caatggcatg cctccacgag cctgcaccaa agtctcaagc 3961 tttgtacact ggataaagaa aaccatgaaa cgctactaac tacaggaagc aaactaagcc 4021 cccgctgtaa tgaaacacct tctctggagc caagtccaga tttacactgg gagaggtgcc 4081 agcaactgaa taaatacctc ttagctgagt ggaaaagctg gtttcttgtt tattcattga 4141 ccctcattct caggcaccac atctgcgcta tgcaggccaa tgacacaatt ttgctgtttt 4201 ctgctttctc ctctcccctc accccttgcc acctccccaa acccccacat gaagctgata 4261 ctcagctcct tcctatccac accagtttct ccagggcctg cccttctgcc aaggctgaag 4321 ctgagcacca tcaggagaca acatggacca ctttggtcct ggggctttgg gtaaacttct 4381 tacctccttc tccagtgtta catgacagag aaaaaaggga taataccatg ggacctaact 4441 cctcatcccc cactggggct cctcattctc ccctgggctt agtttctcta ccctcctctg 4501 agctc // LOCUS HSLACTG 3310 bp DNA PRI 24-APR-1993 DEFINITION Human alpha-lactalbumin gene. ACCESSION X05153 NID g34212 KEYWORDS alpha-lactalbumin; Alu repetitive sequence; lactalbumin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3310) AUTHORS Hall,L., Emery,D.C., Davies,M.S., Parker,D. and Craig,R.K. TITLE Organization and sequence of the human alpha-lactalbumin gene JOURNAL Biochem. J. 242 (3), 735-742 (1987) MEDLINE 87241386 COMMENT Data kindly reviewed (01-OCT-1987) by HALL L. FEATURES Location/Qualifiers source 1..3310 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 596..625 /note="pot.regulatory sequence" TATA_signal 706..712 prim_transcript 736..3097 exon 736..894 /number=1 mRNA join(736..894,1542..1700,2190..2265,2765..3097) sig_peptide 762..818 CDS join(762..894,1542..1700,2190..2265,2765..2825) /codon_start=1 /product="alpha-lactalbumin precursor" /db_xref="PID:g296662" /db_xref="SWISS-PROT:P00709" /translation="MRFFVPLFLVGILFPAILAKQFTKCELSQLLKDIDGYGGIALPE LICTMFHTSGYDTQAIVENNESTEYGLFQISNKLWCKSSQVPQSRNICDISCDKFLDD DITDDIMCAKKILDIKGIDYWLAHKALCTEKLEQWLCEKL" intron 895..1541 /number=1 repeat_region complement(1069..1351) /note="Alu repetitive sequence" exon 1542..1700 /number=2 intron 1701..2189 /number=2 exon 2190..2265 /number=3 intron 2266..2764 /number=3 exon 2765..3097 /number=4 polyA_signal 3075..3080 polyA_site 3098 BASE COUNT 770 a 821 c 687 g 1032 t ORIGIN 1 gagctcctgg gctcaagtga tccaccagac tcggcctccc aaaatgccgg gattacaggt 61 gtgagccact gtgcctggcc tagatgcttt catacaggct tttcaattat gcattttcct 121 taagtaggaa gtcttaagat ccaagttata tcggattgtt gtagtctacg ttcccatatt 181 ctattcctat ttctgagcct tcagtcatga gctaccatat taaagaacta attctgggcc 241 ttgttacatg gctggattgg ttggacaagt gccagctctg atcctgggac tgtggcatgt 301 gatgacatac accccctctc cacattctgc atgtctctag gggggaaggg ggaagctcgg 361 tatagaacct ttattgtatt ttctgattgc ctcacttctt atattgcccc catgcccttc 421 tttgttcctc aagtaaccag agacagtgct tcccagaacc aaccctacaa gaaacaaagg 481 gctaaacaaa gccaaatggg aagcaggatc atggtttgaa ctctttctgg ccagagaaca 541 atacctgcta tggactagat actgggagag ggaaaggaaa agtagggtga attatggaag 601 gaagctggca ggctcagcgt ttctgtcttg gcatgaccag tctctcttca ttctcttcct 661 agatgtaggg cttggtacca gagcccctga ggctttctgc atgaatataa ataaatgaaa 721 ctgagtgatg cttccatttc aggttcttgg gggtagccaa aatgaggttc tttgtccctc 781 tgttcctggt gggcatcctg ttccctgcca tcctggccaa gcaattcaca aaatgtgagc 841 tgtcccagct gctgaaagac atagatggtt atggaggcat cgctttgcct gaatgtgagt 901 tccctgcctc tgtgtttcat ccattcctca tacgcttctc tcctccatcc cctctttctt 961 ccacttcgcc cctccacttt tacttaatta tctaatcatc ctcttttctg ctcatttgca 1021 tactctttta tttcatgtat gtatatatgt atgtatttat ttatttttga ggtggagttt 1081 cgctcttgtt gcccagactg gagtgcaatg gtgtaatctc ggctcactgc aacctccgcc 1141 tcctcggttc aagtgattct cctgcctcag cctcccaagt agctggaatt acaggcaccc 1201 accaccatgc ctggctaatt ttgtattttt tgtagagaca gggtttcacc atgttggcca 1261 ggctggtctc aaacttctga cctcaggtga tccgccctcc tcagcctccc aaagtgttgg 1321 gattacaagc gtgagccatc atgcctggcc ccatttattt tcctatcctt tctttctctt 1381 attgtctgat ttttttttgg aattctccat ctcatcaaga aactctgagc tttgccatct 1441 ttggagattg gctggaaagc atttttgtct gagaattaca gttcctcctt tatgcagatc 1501 ctgtacatct ctgtggtatc tctttctcat ctttccctca gtgatctgta ccatgtttca 1561 caccagtggt tatgacacac aagccatagt tgaaaacaat gaaagcacgg aatatggact 1621 cttccagatc agtaataagc tttggtgcaa gagcagccag gtccctcagt caaggaacat 1681 ctgtgacatc tcctgtgaca gtgagtagcc cctataaccc tctttctctg tttttctgag 1741 gcctgccctt gggataatct cctttttagt gccaagcaga cctcaggctt cattgccttg 1801 gctgggctct ataaaaattg tgggacttga attggcagta ctgagtaaga agctgtttgg 1861 atttttcatg gtcatcaaat ccccagacag ttccttgagg ttcagtggta gacaatcgga 1921 gctgtctgag agtcttggaa tctgattgtc tgcattttca gggtaagtca gttgatgaag 1981 ctgatgattc ctccagagat atcccaggga aatgaaggaa gtccctaccc agggttagac 2041 attaccacat tggtcctttc atatagaaag acaacaggca caagccttga gtttagagaa 2101 cccactggat ccaggggtta ggggaactca gtgcctttct gggtaatact tgtcagctgt 2161 ctcaatcctt tccctgtaac tcctgccaga gttcctggat gatgacatta ctgatgacat 2221 aatgtgtgcc aagaagatcc tggatattaa aggaattgac tactggtgaa tccttattct 2281 attttctatt tccccatcct ccttctcctt accccattag cccagcaccc ctttcctctt 2341 accctatctc ttggtcattt aatctagaat acagtgtctg aaacaaagct tacctagaga 2401 ctcaggtttc tgttattaag cctctctcgc tccgctcctt ggtagcaatt ttcctaataa 2461 ggggttgcct aatggagggc tcagacccag gcctcctttc acttagactt ggacatctaa 2521 ttccacttgt ttagttctat gccctaaagc aagctgttgg taacattgca tctctttttt 2581 aaccctacaa ttttcttgga tattttttat ggactgtatt ccacttgatg gcttgtgtcg 2641 cttgacatca ggccaggaat gtctttctgt aattctcgtc cacgctcttc cacttcagcc 2701 ctcctgggaa tgaatgtaaa gattcagtca gctaactcac cttgtccccc ttctccatta 2761 tcaggttggc ccataaagcc ctctgcactg agaagctgga acagtggctt tgtgagaagt 2821 tgtgagtgtc tgctgtcctt ggcacccctg cccactccac actcctggaa tacctcttcc 2881 ctaatgccac ctcagtttgt ttctttctgt tcccccaaag cttatctgtc tctgagcctt 2941 gggccctgta gtgacatcac cgaattcttg aagactattt tccagggatg cctgagtggt 3001 gcactgagct ctagaccctt actcagtgcc ttcgatggca ctttcactac agcacagatt 3061 tcacctctgt cttgaataaa ggtcccactt tgaagtcact ggctgtaatt tttttccccc 3121 tggagggaag gggaagaaat aggatgagta ggtggacact gaagccatag gtcatagcca 3181 ccttccatct ctactgaaga agaagtaggc tgaatttaca atagaaaggt gaaggttact 3241 gtctgtacca actcaatgca acaaactttt attgatcacc taatctattc aaggaactgt 3301 agacggatcc // LOCUS HUMOSTP 10881 bp DNA PRI 30-MAY-1996 DEFINITION Human DNA for osteopontin, complete cds. ACCESSION D14813 NID g506341 KEYWORDS hOP; osteopontin. SOURCE Homo sapiens liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10881) AUTHORS Hijiya,N., Setoguchi,M., Matsuura,K., Higuchi,Y., Akizuki,S. and Yamamoto,S. TITLE Cloning and characterization of the human osteopontin gene and its promoter JOURNAL Biochem. J. 303 (Pt 1), 255-262 (1994) MEDLINE 95031968 REFERENCE 2 (bases 1 to 10881) AUTHORS Yamamoto,S. TITLE Direct Submission JOURNAL Submitted (26-MAR-1993) to the DDBJ/EMBL/GenBank databases. Shunsuke Yamamoto, Oita Medical University, Department of Pathology; Hasama-machi, Oita 879-55, Japan (Tel:0975-49-4411(ex.2690), Fax:0975-86-5699) COMMENT Submitted (26-MAR-1993) to DDBJ by: Shunsuke Yamamoto Department of Pathology Oita Medical University 1-1 Idaigaoka, Hasama-machi Oita Gun, Oita 879-56 Japan Phone: 0975-49-4411 x2690 Fax: 0975-49-4217. FEATURES Location/Qualifiers source 1..10881 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" protein_bind 111..115 /bound_moiety="TCF-1" protein_bind 177..185 /bound_moiety="NF-IL6" /function="lipopolysaccharide, IL-1, IL-6 inducible" protein_bind 209..214 /bound_moiety="CF-1" protein_bind 222..226 /bound_moiety="TCF-1" protein_bind 223..229 /bound_moiety="E2A" protein_bind 318..326 /bound_moiety="NF-IL6" /function="lipopolysaccharide, IL-1, IL-6 inducible" misc_feature 376..390 /note="calcitriole response element" /function="calcitriole inducible" protein_bind 396..401 /bound_moiety="GATA-1" protein_bind 468..472 /bound_moiety="TCF-1" protein_bind 490..497 /bound_moiety="NF-IL6" /function="lypopolysaccharide, IL-1, IL-6 inducible" protein_bind 601..605 /bound_moiety="TCF-1" protein_bind 652..656 /bound_moiety="GATA-1" protein_bind 831..835 /bound_moiety="TCF-1" protein_bind 847..851 /bound_moiety="TCF-1" protein_bind 850..855 /bound_moiety="PEA3" /function="TPA, EGF, serum, oncogenes inducible" protein_bind 898..903 /bound_moiety="Myb" protein_bind 999..1004 /bound_moiety="IRF-1" protein_bind 1016..1021 /bound_moiety="GAT-1" protein_bind 1050..1054 /bound_moiety="TCF-1" protein_bind 1171..1175 /bound_moiety="TCF-1" protein_bind 1261..1269 /bound_moiety="NF-IL6" /function="lipopolysaccharide, IL-1, IL-6 inducible" protein_bind 1486..1490 /bound_moiety="CTCF" protein_bind 1550..1558 /bound_moiety="Sp-1" protein_bind 1567..1588 /bound_moiety="VDRE" /function="VD3 inducible" protein_bind 1579..1584 /bound_moiety="PPAR" /function="peroxisome proliferators" protein_bind 1592..1596 /bound_moiety="TCF-1" protein_bind 1632..1636 /bound_moiety="CTCF" protein_bind 1679..1686 /bound_moiety="E2BP" protein_bind 1787..1791 /bound_moiety="TCF-1" protein_bind 1902..1907 /bound_moiety="GATA-1" protein_bind 2031..2038 /bound_moiety="AP-2" /function="TPA, cAMP, retinoic acid inducible" protein_bind 2036..2041 /bound_moiety="SIF" /function="sis/PDGF inducible" protein_bind 2047..2051 /bound_moiety="TCF-1" protein_bind 2084..2089 /bound_moiety="myb" protein_bind 2146..2152 /bound_moiety="E4TF1" /function="ras, raf inducible" protein_bind 2158..2164 /bound_moiety="E2A" protein_bind 2163..2171 /bound_moiety="Sp-1" protein_bind 2190..2196 /bound_moiety="AP-1" /function="TPA inducible" protein_bind 2222..2228 /bound_moiety="Ets-1" /function="c-Ha-ras, v-src, v-mos inducible" TATA_signal 2241..2246 exon 2268..2357 /number=1 intron 2358..3440 /number=1 exon 3441..3508 /number=2 CDS join(3455..3508,3618..3656,6598..6678,6944..6985, 8025..8348,9041..9445) /codon_start=1 /product="osteopontin" /db_xref="PID:d1004065" /db_xref="PID:g506342" /translation="MRIAVICFCLLGITCAIPVKQADSGSSEEKQLYNKYPDAVATWL NPDPSQKQNLLAPQNAVSSEETNDFKQETLPSKSNESHDHMDDMDDEDDDDHVDSQDS IDSNDSDDVDDTDDSHQSDESHHSDESDELVTDFPTDLPATEVFTPVVPTVDTYDGRG DSVVYGLRSKSKKFRRPDIQYPDATDEDITSHMESEELNGAYKAIPVAQDLNAPSDWD SRGKDSYETSQLDDQSAETHSHKQSRLYKRKANDESNEHSDVIDSQELSKVSREFHSH EFHSHEDMLVVDPKSKEEDKHLKFRISHELDSASSEVN" intron 3509..3617 /number=2 exon 3618..3656 /number=3 intron 3657..6597 /number=3 exon 6598..6678 /number=4 intron 6679..6943 /number=4 exon 6944..6985 /number=5 intron 6986..8024 /number=5 exon 8025..8348 /number=6 intron 8349..9040 /number=6 exon 9041..9869 /number=7 BASE COUNT 3466 a 1998 c 1944 g 3455 t 18 others ORIGIN Chromosome 4q. 1 gaattcacaa gccttttctc tgagagaggc cttgggacta ggaacttttt gaatgagtgt 61 agaagtcggg aaggagacaa tagtgtcaac ttgggattgc ctaaggcaac aacagagcaa 121 aacaagaacg ctttggttct ctgggtctct gtccctgatt gcatagcggg tcattgttgg 181 gaaatatttc ctcacctggc attccaagaa atggtgagct ccacagctgt atatagtcct 241 gtcattaaat acaggagtgt tctatcccgc tggaattaag aaaattggta gaaccagatt 301 gtggtctgaa atcttttttc agaaatgctg ccatcgtgtg gcactgcgga gctatgacca 361 gaagagtcct gtaaagggtc gtatggttca tctcaagatg gctgggctcc agcataatct 421 attcctataa ttaattctag cttcatattg aatcattccc gtgggcacag agtaaactac 481 agtaaatcct gtggaaattt tgttgttttt agaattttcg gacttccctc cactaaattg 541 acaacatgac acgcttatgc gngtatgttt aaaggaaaaa aatagttttt agaagcagaa 601 aaaagaagtc tattttgcaa ctttataatc tgtgtgcttn ctattttata gagatagtcg 661 tcatcttact tattaaaatg ggtgcttatt acctacaaac caatcatatc aattcatctg 721 gaatacatcc aatttaaggg agacatattt ccccctacca aatgttcatg aaacctatga 781 attagctata cactatcact gcaagacatt atttaatcta tatttatatt aaaagtaata 841 tttggcaaaa ggaagctgac actttaggac taataaaaac cacaattact tttgcagcaa 901 cctaataata aataggacca tttatttttc atctcaatta cacacaagtc ttaacaataa 961 aggtgtaagg taaataaata gtgcaatctg catttcacaa ctgagaagca aatgaagata 1021 agtaatctca aggcaatatt aaatatttta aaaggaccca gagctctgct atccctgaat 1081 tctgctctaa tattcggact ttccctgtaa ttttctttca ttcagacacc ttttaaatac 1141 ctagtaaagt gttttttaat acagaaattt ttaaaaatgt ttttcttttt aagtggccta 1201 ctttacatac cttgggagaa aaactagaaa aaaagatgat tccaaaatcg aatctgttcc 1261 tttagaaatg tgcaaaattt ccttattgat gcatacaatt taaagatctt acgtctactc 1321 tcattttaat aacctgttct tttaaaggac attacaattc gtgactgcct gcccctctta 1381 aaaatttcat aatagttaac acacatatag tccttaagat acgcagagca tttgcatcta 1441 atatgtgcta agcattgcta gtttaacata ctaattcatt taaacccctc aaaaacccca 1501 tgacctaggt aatagtattg catttcatgg atgagggaac aaggataggt aggctgggcg 1561 atttgcccaa ggttgcacag gtcagcagtg acacagcgga attcagaacc acggtctggc 1621 tcctgaagca gccctctcaa gcagtcatcc ttctctcagt cagaaactgc tttacttctg 1681 caacatctag aataaattac cattcttcta tttcatatag aattttatat tttaatgtca 1741 ctagtgccat ttgtctaagt aacaagctac tgcatactcg aaatcacaaa gctaagcttg 1801 agtagtaaag gacagaggca agttttctga actccttgca ggcttgaaca atagccttct 1861 ggctcttcaa taagtacaat catacaggca agagtggttg cagatattac ctttatgtta 1921 cttaaaccga aagaaacaaa aatccattgt atttaatttt acattaatgt ttttccctac 1981 tttctccctt tttcatggga tccctaagtg ctcttcctgg atgctgaatg cccatcccgt 2041 aaatgaaaaa gctagttaat gatattgtac ataagtaatg ttttaactgt agattgtgtg 2101 tgtgcgtttt tggttttttt ttgttttaac cacaaaacca gagggggaag tgtgggagca 2161 ggtgggctgg gcagtggcag aaaacctcat gacacaatct ctccgcctcc ctgtgttggt 2221 ggaggatgtc tgcagcagca tttaaattct gggagggctt ggttgtcagc agcagcagga 2281 ggaggcagag cacagcatcg tcgggaccag actcgtctca ggccagttgc agccttctca 2341 gccaaacgcc gaccaaggta cagcttcagt ttgctactgg gttgtgcatt cagctgaatt 2401 tcatggggaa gtccaaattc taaggaaaaa tatttttaat tgtaatgctg ttaaacagac 2461 ttaaattttc tagccttttt aataagcaga ttagatacat tgcaggtctc ctgtggaaca 2521 aaggtgtcta gatattttga atgccaatca aatttaaaac ttaaaaatac ttccactggg 2581 tcctcaaaag aacggaaacc accgatgcta atcagaaaat agtaaaatta aattcacctt 2641 tggaataatt atacctatat aattttcagt ggggtactgt ncaggaattt aaaagaaaag 2701 ggatctttta tgctaattaa accaattaca atgctatttt ttaaatgatg tatctcactt 2761 ttaaggggaa gaaaaccctt nctgaatatg ccactgctaa atttagctgt taaaatattc 2821 accaagatac ctgtatgaca ctgtgtaggc ttattattac aaatagaaaa gctgttggct 2881 attttcaatg ttttcctttg aatttcaaat ttttagaaca tcttacttaa ataacaaatt 2941 tcagagatag tttgatttca cctaagtagc acctacttga taattaagct aaaagtcaga 3001 tttaaagtac atgttggaaa aatggataaa gcaaattttt ttcatttttt tctgtgagtt 3061 ttttcttctc taaaaaatat tcccatacta gcttattaat ataattaagt tactgttgat 3121 ctgtttgtag gtttagagag ctagatatat aaggtagtaa tggtataatt tctggaactc 3181 taaattttaa agttgaataa atacagactt gcaaaatttc tctttccctt gcctaatagt 3241 gaaagatgga taataggtgg caatataaat attaacttga aagactgtaa tactaaaaag 3301 aaaaggcatc tctaagaagt agaaaagatt ctatagaaaa tatattttat ttgtgatcat 3361 tttgtaatgt ggtagtataa aaaggtatca ctgttgtaac ctatgaagat gtcagctatt 3421 ccttatgaaa tattttgcag gaaaactcac taccatgaga attgcagtga tttgcttttg 3481 cctcctaggc atcacctgtg ccataccagt gagtacagtt gcatcttaaa gaaaattcct 3541 gaaaataact gaattgtgtg cttccatgtg ctaggaggac attcttgtaa tctttcttca 3601 tcttttctgt ttctaaggtt aaacaggctg attctggaag ttctgaggaa aagcaggtaa 3661 gcatctttta tgtttttata tagttaatca tttactcaat tatggcgaga ggtgcaagaa 3721 acgtatttgc tgcgtattta cttatcttct cagtcaaatc cattggttta caagtattga 3781 ttgactgcct gctatgaatc taggccagta ccaagcacag tatagttttt aataaatata 3841 agtttataaa accaacccag atattttaaa tataataata tctaggcatg tatgatgagt 3901 tatcgcatgt aagataagtt atatgaagtt gtgtgacttt ttttccatta gtccacatac 3961 tgatctaaaa gcagaaaatt ccagcttttg ctttgtttag tggattgcta agtttaaaat 4021 tcacattgga tattagtcag aactgtttgt atgaccataa tattcacaat attgtctgag 4081 atattagctg agaagcccat tgtgaaaaga aagtctatgt gtgctgtttg tatctattgt 4141 gattgtcagc tgatgttaga tcacattttc taaccaaaca taagaccaac caaactcttt 4201 attataatta tttgaccagc actaaagatg tacctacccc tccacaacag atgaaactgt 4261 gccagccaaa caacaaatgg gcattgtccc cagaagcttg gacaaaaagg cacacagagt 4321 tcaattccag ttgaacagaa taaaggccaa aatagagctg ccttgggggt cactgcaatt 4381 acactgctta atgaagacat taaaagaagt atnctgtgtn cgtttgtgtg tggaggggtg 4441 tgtgtgtctg tttttcaact gatttgaaaa tacaggtgtt gaatcctaat aataaaccag 4501 aaaaattaac atctccagag aagatagagg tcatactatt tgaggcaaga attagcgtct 4561 ttttaataaa cgaaaatatg gcaaagatgc atnttagaag gcacgtggag ctataacaat 4621 ttaagaaata cgtgaagagc tcaaggctca gccttctaga atcccagaaa cttaaagcta 4681 gtaaaaaatt ggggaagtct ctaaggatat atgcctgaaa atacacactg gttatctgtg 4741 agtgttagga ttactgggtg gtttttagtc tatcattttg cttaccttta ttttcttcat 4801 attagttttt aaaaattata aatgaaactt atacatcctt nctctctgag cctgtattac 4861 atgtgtcatg agaatagata gatagatatg aaaaagtgaa gagaaaaact ctgaactcat 4921 ctggtctcac tgtttttgcg ccttcttttt tttttttttt tttttttttt tgagacggag 4981 tctcgctccg tcgcccaggc tggagtgcag tggtgtgatc tcggctcact gcaagctccg 5041 cctcccaggt tcaccccatt ctcctgagta gctgggacta caggcgcccg ccaccacgcc 5101 cggataattt tttgtatttt tagtagagac ggggtttcac cgtgttagcc aggatggtct 5161 ccatctcctg acctcgtgat ccaccctcct tggcctccca aagtgctggg attacaggcg 5221 tagcactgcg cncggctgtn ttttcatctt cttaaagcaa ggaacccctt ctttcagcaa 5281 aacctttcgg agaagcccaa tactaagctc ctctggttag agccagccat gagagaaact 5341 ccaagtactt ctgactggtt ctctctctac tcatccaccc cttaggtggc tgcagaagga 5401 actctgtgca acccccagag ttctcattct cagtgacagg gaaatgtaat gattggccct 5461 ggatgattca gcagatcaga tgatacttac tcagagcaat ttccactcct ttgcagtagc 5521 atattatcag tattttccag ataaataact tggctaaaga aaaatccatt tcatttacat 5581 ctttggcacc ttacagcaat agaacttttg tgcaatgatt ttaatattat atttctacat 5641 tggctgataa gatacatatg gctattgagc actcaaaatg tgggctagtg caactgagga 5701 actgaatttt tatcttcttt tttttttttt ttttttgaga tggagtcttg ctctgtcacc 5761 cagactggag tgcagtggcg caatcttggc tcactgcaag ctctgcctcc tgggttcacg 5821 ccattctctt gcctcagcct ccccagtagc tgggggtaca ggtgcctgcc acgcccggct 5881 attttttttt atttttattt tttttagtag aaacggggtt tcactgtgtt agccaggatg 5941 ttctcgatct cctgacctcg tgatccgcct gcctcggcct cccaaagtgc tgggattaca 6001 ggggtgagcc accgtgccta gccatttcat tttaattaac ttaaatttaa atagctccat 6061 gtggttagag gatactgaat tagcacagtc ttagagagtt ccttcttgtt ccatggactg 6121 gacacaatga agattaacag taattaaggt cacttctggt ttagatgtgc ttnatctgag 6181 aggaaaattc agccagcaaa catacaaaaa gaaagcacag tgtgaagttc ggtgttaaga 6241 gctagtntgc ctgcgtttga accctgcctg gctctgccat ttcctaccac ttaactgcac 6301 tgtggctgag ttttctgatc tgtaaggtgg gaataataat gatacctatc tcatagggga 6361 atgaaaggat caaatgagtt catatttgta aagcaatttg aaagagtgcc tagcccacag 6421 taagtgctac ataagagttt gttaaatgaa tctgcaaaaa aaaaaaaaat tacaaaaagg 6481 tacctaaggg tccgggtgac tatatncttc catcaagact agtgaagaat ggttgttttt 6541 tccattcatc cctacatttc tttttttaat aatgataaac atgcaacttt tttgtagctt 6601 tacaacaaat acccagatgc tgtggccaca tggctaaacc ctgacccatc tcagaagcag 6661 aatctcctag ccccacaggt atttttaaac ttctcataat taaactacag tgatgaaaca 6721 tagccacacn caggccattt gggctgctca gatgaatcct gcctgcctgc tggcaaactg 6781 tgcttaggac attgactgat ctgccatgtt ggcttctctc tgtgttaagc catccacaga 6841 tgaggctgaa aaataaaaac tgctttggat taaaaaggtt aacttttgaa taaaaaagct 6901 aggcatgtgt gatgcgcact aacacgtgcc attccttctt cagaatgctg tgtcctctga 6961 agaaaccaat gactttaaac aagaggtaag ttctcatttt caatcagagg cccatcatgc 7021 cttgaagaga tgaaagaagg cattgcctgg attctcttct gatgaaattt cattagcaag 7081 ttttccagct aattggcagt ctaaaacttg ctcataaata aaacatgtat ttactaaata 7141 tcagaaatac taggtttcct cggataagtt tagcattaca gaagatgttt attaatgcct 7201 gttatttgaa acattaatct gcttgcaatt tatttaaggt atttgtagat atctaatatc 7261 taataagcat ctaattaatg catatcaaag ctaagatttt gcctttagga aagttttctt 7321 tcctaataaa atagtttatt tgacaactat tctttttatt aggatcattc atatatttgc 7381 taagcaaaga gtaaatttat tttccttaag attcaatttg aatatactaa gaatattaaa 7441 gcaagttaga taaattaccc aatatatttg tcaatttgaa atttgataga cattagttgt 7501 ttaattcaat gggcagtttt gagctgcagt ttatacacac atgcataaca gagtcacctt 7561 tcaattatcc atgttaatag gaaagtggtt atagatttta gtacacacat taaaatatgg 7621 atactcttct cttttgataa atctcatttc aaataaaaaa accagtctca taattatgta 7681 tctgtatcta ttacatcatt gaatttagta aataatgttt aatatgtata aggaaaaaca 7741 atgttattga catgaagatt atactcacat atttggcttg aaaatatcta taaaaataat 7801 ttctgttgca aagtaagaaa tgttcttcag aatgttatta atccctgtgt taaaagagaa 7861 attggaagat gctcacttta gctcctaaaa gccatggtat gtactgtgaa tgcaaagatt 7921 ctgaaactaa ataaaaagaa agatagtaaa agactaatgt gctataaagg ctaagggaaa 7981 ataaaaaccc atatattaat tttcccggcc atcttaattt tcagaccctt ccaagtaagt 8041 ccaacgaaag ccatgaccac atggatgata tggatgatga agatgatgat gaccatgtgg 8101 acagccagga ctccattgac tcgaacgact ctgatgatgt agatgacact gatgattctc 8161 accagtctga tgagtctcac cattctgatg aatctgatga actggtcact gattttccca 8221 cggacctgcc agcaaccgaa gttttcactc cagttgtccc cacagtagac acatatgatg 8281 gccgaggtga tagtgtggtt tatggactga ggtcaaaatc taagaagttt cgcagacctg 8341 acatccaggt aaatccttta acagacacac ctgatggttc tgactagcgc tcaagtctag 8401 gaaaccacag tttgnatatt cattcattca ttcatccatt cattcatcca ttcagcaaga 8461 attcattcat attctacttt atgaccattg aatacaatct ttttctgctt ggcggttttg 8521 taagtctaca taattctctc tagatttgat tctcaaacac aattctactt tttgaaatcc 8581 tggatcactt attttcagat taaaataaat ggaaaacacc aattatttaa aaaaaataat 8641 ggtcatgttt tgaagttaaa tacctaagag gaattgtagt tgcaaattac actgaatcct 8701 tagtcacaga gaatctggat ttgacatagg gttgccgttt actattctct ttacttttta 8761 actaacaatt cacttcctct ttatgtaggt ttcaatataa tgaaacctac ctcataggtt 8821 tcattacata tgtaagtgat gtagttatta aactaaatga gatgacatat gtgaaaggcc 8881 ttggtaaagt actatacaaa gtaacatgct agtattattt cagccagatt tagacaattt 8941 ttagtataag atgacctaaa agctagagag tggaaaagga ttaccatatt cccatcccta 9001 gccgttcata taattattct tcatttgtgc cgtgattcag taccctgatg ctacagacga 9061 ggacatcacc tcacacatgg aaagcgagga gttgaatggt gcatacaagg ccatccccgt 9121 tgcccaggac ctgaacgcgc cttctgattg ggacagccgt gggaaggaca gttatgaaac 9181 gagtcagctg gatgaccaga gtgctgaaac ccacagccac aagcagtcca gattatataa 9241 gcggaaagct aatgatgaga gcaatgagca ttccgatgtg attgatagtc aggaactttc 9301 caaagtcagc cgtgaattcc acagccatga atttcacagc catgaagata tgctggttgt 9361 agaccccaaa agtaaggaag aagataaaca cctgaaattt cgtatttctc atgaattaga 9421 tagtgcatct tctgaggtca attaaaagga gaaaaaatac aatttctcac tttgcattta 9481 gtcaaaagaa aaaatgcttt atagcaaaat gaaagagaac atgaaatgct tctttctcag 9541 tttattggtt gaatgtgtat ctatttgagt ctggaaataa ctaatgtgtt tgataattag 9601 tttagtttgt ggcttcatgg aaactccctg taaactaaaa gcttcagggt tatgtctatg 9661 ttcattctat agaagaaatg caaactatca ctgtatttta atatttgtta ttctctcatg 9721 aatagaaatt tatgtagaag caaacaaaat acttttaccc acttaaaaag agaatataac 9781 attttatgtc actataatct tttgtttttt aagttagtgt atattttgtt gtgattatct 9841 ttttgtggtg tgaataaatc ttttatgttg aatgtaataa gaatttggtg gtgtcaattg 9901 cttatttgtt ttcccacggt tgtccagcaa ttaataaaac ataacctttt ttactgccta 9961 tataatgttt ttaaaggttt attttggttt caattgatac ataataagtg tacatattta 10021 tggggtacgg tgtgatgttt tgttacatat atacattgta taattatcaa agggtaatta 10081 tcatatccat cacctgaaac acttgtcatt tatttgtgct gagaacattc aatcctcttt 10141 tctagctatt ttgaagtata caatacatta ttattgacta tagccaagct actttgcaat 10201 agaatactag aatttattcc tcctagctaa ctgtaacttt gtacccattg actaacctcc 10261 cctcatccac cttcccactc tcccagccgc tggtaatcac tattctactc tctacttcta 10321 tgaggtcaac ttttctagat nccacatatg agtgagatca tgcagtactc ttccttctgt 10381 gcttggctta tttaacttaa catcctctac cttcgcctat gttgtcaaaa ataccaagag 10441 aaaacatgca caaactatac atctaacaag gaattaaaat ccagaataca taaggaactc 10501 aaacaactta atatcaaaaa aaaaagaaaa aaaaagacaa ctcaaataat ccaatttaaa 10561 atgggcacaa atctgaatag acatttctca aaagaagaca tgcaaatggc caacaggtat 10621 acagaaaaat gctcaacatc actaatcacc aggaaaatgc aaatcacaac cacaatgaga 10681 tatcatccca cccaagctaa aatggcttnt atcaaagaga caaaaaataa cagacacagg 10741 ccaggattcg gggaaagaag gacactcgta cnctggtgag aactgtaaat tagtacagcc 10801 actatgaaaa actgtatgga gacttctcaa aaaaacaaaa atagaactac catattattt 10861 agcaatccca ctgctgagca t // LOCUS HSCD14G 1570 bp DNA PRI 23-JUN-1993 DEFINITION Human gene for CD14 differentiation antigen. ACCESSION X06882 NID g29736 KEYWORDS antigen; CD14 antigen; monocyte differentiation antigen; surface antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1570) AUTHORS Goyert,S.M. TITLE Direct Submission JOURNAL Submitted (22-FEB-1988) Goyert S.M., Hospital for Joint Diseases, 301 E. 17th St., Cellular and Molecular Biology Unit, New York, NY 10003 REFERENCE 2 (bases 1 to 1570) AUTHORS Ferrero,E. and Goyert,S.M. TITLE Nucleotide sequence of the gene encoding the monocyte differentiation antigen, CD14 JOURNAL Nucleic Acids Res. 16 (9), 4173 (1988) MEDLINE 88234022 COMMENT the authors also sequenced corresponding cDNA Data kindly reviewed (28-MAR-1988) by Goyert S.M. FEATURES Location/Qualifiers source 1..1570 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /clone_lib="(lambda)gtWes" /map="long arm of chromosome 5, 5q23-q31" exon 51..158 /number=1 CDS join(156..158,247..1371) /codon_start=1 /product="cd14 protein precursor" /db_xref="PID:g312399" /db_xref="SWISS-PROT:P08571" /translation="MERASCLLLLLLPLVHVSATTPEPCELDDEDFRCVCNFSEPQPD WSEAFQCVSAVEVEIHAGGLNLEPFLKRVDADADPRQYADTVKALRVRRLTVGAAQVP AQLLVGALRVLAYSRLKELTLEDLKITGTMPPLPLEATGLALSSLRLRNVSWATGRSW LAELQQWLKPGLKVLSIAQAHSPAFSYEQVRAFPALTSLDLSDNPGLGERGLMAALCP HKFPAIQNLALRNTGMETPTGVCAALAAAGVQPHSLDLSHNSLRATVNPSAPRCMWSS ALNSLNLSFAGLEQVPKGLPAKLRVLDLSCNRLNRAPQPDELPEVDNLTLDGNPFLVP GTALPHEGSMNSGVVPACARSTLSVGVSGTLVLLQGARGFA" sig_peptide join(156..158,247..300) intron 159..246 /number=1 exon 247..1494 /number=2 mat_peptide 301..1368 /product="cd14 protein" polyA_signal 1477..1482 BASE COUNT 314 a 486 c 453 g 317 t ORIGIN 1 cagaatgaca tcccaggatt acataaactg tcagaggcag ccgaagagtt cacaagtgtg 61 aagcctggaa gccggcgggt gccgctgtgt aggaaagaag ctaaagcact tccagagcct 121 gtccggagct cagaggttcg gaagacttat cgaccatggt gagtgtaggg tcttggggtc 181 gaacgcgtgc cactcgggag ccacaggggt tggatggggc ctcctagacc tctgctctct 241 ccccaggagc gcgcgtcctg cttgttgctg ctgctgctgc cgctggtgca cgtctctgcg 301 accacgccag aaccttgtga gctggacgat gaagatttcc gctgcgtctg caacttctcc 361 gaacctcagc ccgactggtc cgaagccttc cagtgtgtgt ctgcagtaga ggtggagatc 421 catgccggcg gtctcaacct agagccgttt ctaaagcgcg tcgatgcgga cgccgacccg 481 cggcagtatg ctgacacggt caaggctctc cgcgtgcggc ggctcacagt gggagccgca 541 caggttcctg ctcagctact ggtaggcgcc ctgcgtgtgc tagcgtactc ccgcctcaag 601 gaactgacgc tcgaggacct aaagataacc ggcaccatgc ctccgctgcc tctggaagcc 661 acaggacttg cactttccag cttgcgccta cgcaacgtgt cgtgggcgac agggcgttct 721 tggctcgccg agctgcagca gtggctcaag ccaggcctca aggtactgag cattgcccaa 781 gcacactcgc ctgccttttc ctacgaacag gttcgcgcct tcccggccct taccagccta 841 gacctgtctg acaatcctgg actgggcgaa cgcggactga tggcggctct ctgtccccac 901 aagttcccgg ccatccagaa tctagcgctg cgcaacacag gaatggagac gcccacaggc 961 gtgtgcgccg cactggcggc ggcaggtgtg cagccccaca gcctagacct cagccacaac 1021 tcgctgcgcg ccaccgtaaa ccctagcgct ccgagatgca tgtggtccag cgccctgaac 1081 tccctcaatc tgtcgttcgc tgggctggaa caggtgccta aaggactgcc agccaagctc 1141 agagtgctcg atctcagctg caacagactg aacagggcgc cgcagcctga cgagctgccc 1201 gaggtggata acctgacact ggacgggaat cccttcctgg tccctggaac tgccctcccc 1261 cacgagggct caatgaactc cggcgtggtc ccagcctgtg cacgttcgac cctgtcggtg 1321 ggggtgtcgg gaaccctggt gctgctccaa ggggcccggg gctttgccta agatccaaga 1381 cagaataatg aatggactca aactgccttg gcttcagggg agtcccgtca ggacgttgag 1441 gacttttcga ccaattcaac cctttgcccc acctttatta aaatcttaaa caacggttcc 1501 gtgtcattca tttaacagac ctttattgga tgtctgctat gtgctgggca cagtactgga 1561 tggggaattc // LOCUS HSGAPIGNA 2609 bp DNA PRI 10-AUG-1996 DEFINITION H.sapiens gap-I gene. ACCESSION X74322 NID g436804 KEYWORDS gap-I gene; guanylate cyclase activator protein; guanylin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2609) AUTHORS Hill,O. TITLE Direct Submission JOURNAL Submitted (22-JUL-1993) O. Hill, Niedersaechsisches Inst. f. Peptid-, Forschung, Feodor Lynen Strasse 31, 30625 Hannover, FRG REMARK sequence revised by author (07-OCT-1993) REFERENCE 2 (bases 1 to 2609) AUTHORS Hill,O., Kuhn,M., Zucht,H.D., Cetin,Y., Kulaksiz,H., Adermann,K., Klock,G., Rechkemmer,G., Forssmann,W.G. and Magert,H.J. TITLE Analysis of the human guanylin gene and the processing and cellular localization of the peptide JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (6), 2046-2050 (1995) MEDLINE 95199289 FEATURES Location/Qualifiers source 1..2609 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="946203" CDS join(400..474,1502..1709,2142..2206) exon 394..474 /gene="gap-I" /number=1 mRNA join(394..474,1502..1709,2142..2421) /gene="gap-I" gene 394..2421 /gene="gap-I" misc_feature join(409..474,1502..1709,2142..2159) /gene="gap-I" /note="putative cds" /product="guanylate cyclase activating protein" intron 475..1501 /gene="gap-I" /number=1 exon 1502..1709 /gene="gap-I" /number=2 intron 1710..2141 /gene="gap-I" /number=2 exon 2142..2421 /gene="gap-I" /number=3 BASE COUNT 515 a 780 c 738 g 576 t ORIGIN 1 gagctcagcc cccagtgcag ccccgtcacc tcatcctaat caccccaccc taatcgcctt 61 gcgtttcact gtcgggctgg gcccccctcc tcaagggcag ggtctgccgg ccccacacag 121 cagcacaggg cagtcgtgag tgaatgtgcc ttgttggtga atgaatgcag gacccagccc 181 ggctgtgggt caaagtaggt gctcagggag gggccggggc cccccctgcc tggctcttat 241 ctcctagggc tgtctcgagc cttatctgat aaggcctgac aggtgagcag atgagactga 301 cagaggcttc caggttactc agtaacctgc cctctttaaa agtcccgccg cttccccctg 361 gcatccagaa cagccacccc tctctcgggc actgctgcca tgaatgcctt cctgctcttc 421 gcactgtgcc tccttggggc ctgggccgcc ttggcaggag gggtcaccgt gcaggtgagt 481 cctgtgtccc taccccggcc cagctggctc cctgccccta gtgcccaggt ctggtcccct 541 gagagagtac ctcctctttg gggctggatt gagggtgtgt gtcccaggct tggccttctt 601 cgaggagctc ctgctcagtc cagccacaca cacacctttc ccttcaactt tggaggagaa 661 agaaacccag tggagggaag aggaataacc atatggggaa ggcggggagt tggccaattt 721 actgagtgtc ctatgtgccg gctccttccc cagcccctca ctgtggctcc actatcaacc 781 tggaagaaag ggatcattag tcccatttta cagaggggaa aactgagcct cacaaaggtt 841 ctgtaacagg ccatgggcac atggctgtgg gaagtgcagg agctggattt aaatccagac 901 cttactcctc agtgggtccc aggaaccaca tggacaccct ggaggtcctg acagatgggt 961 atgcctacca gtgccaatgc tggcgacagg ccaagctggc acctagcggg ggccctgatc 1021 tctatcaaag ccagattccg ggaggccttt tattgcttct caaaccaaaa ccaggctgca 1081 ggctggagct gggggcccag taatcagccg caccgctctc aggcagggga aagacagcct 1141 gggcttggcc acgtggaact acgatgcctg ggtgctggtc ctttgcctct ccctaaaaat 1201 cagcacactg gctcctactc attaggcacc tatggtgttc ctggagcttc acacccattt 1261 tctccattcc ccatacaaca tgcagaattt gcgggttcca gagtgtccag catcacggag 1321 ccaagccagg acgtggggcc tgaactctgc ccctcacagt gccccactgt ggatcgcctg 1381 gctcccatat gccacgaaca ctgcctgtga gcaccactgg gctggaccgg ggccagcacg 1441 gggggcacag ccaggaagca ggctgcagcc tggcctcatg ctctgggctt ctctcttcca 1501 ggatggaaat ttctcctttt ctctggagtc agtgaagaag ctcaaagacc tccaggagcc 1561 ccaggagccc agggttggga aactcaggaa ctttgcaccc atccctggtg aacctgtggt 1621 tcccatcctc tgtagcaacc cgaactttcc agaagaactc aagcctctct gcaaggagcc 1681 caatgcccag gagatacttc agaggctggg tgagcagcct ccttcctctc tgattggtgg 1741 ggctcgggaa tactgtggtg gcacttgagg aagagatgtg ctctgattgg tgggatttgg 1801 aggaagggct cagctctgac tggtggaact agggagcttg gctctgattg gtgggtctgt 1861 tccatgccca cttcaattcc aagttgtaac tgctgaggtt tgggaactaa gaccattcat 1921 ctggtcttag tttcctctct ttctggtgga atctctgctt tgattggttg gtttgagtgg 1981 tctgtgtgcc ctggtttggg gcgaaaagcc cagtaggtca gagtacgggc ctgctgagat 2041 tctcctgggg ctgggtggtg acgtacaagc tctgggccaa gggcggcagg ggcctcgcga 2101 ggaaatgctg gttctcacga ggctccatct ctgttgtgca gaggaaatcg ctgaggaccc 2161 gggcacatgt gaaatctgtg cctacgctgc ctgtaccgga tgctaggggg gcttgcccac 2221 tgcctgcctc ccctccgcag cagggaagct cttttctcct gcagaaaggg ccacccatga 2281 tactccactc ccagcagctc aacctaccct ggtccagtcg ggaggagcag cccggggagg 2341 aactgggtga ctggaggcct cgccccaaca ctgtccttcc ctgccacttc aacccccagc 2401 taataaacca gattccagag tactctgggt gttgccttcc ttccttcctt catcccctag 2461 tctatcctgc cctggaagct cgggatgtaa tgtactaagc gaaggacccc actctccctg 2521 ctaagctgct cactgtccct ggggctgcca caacacagag gcgtggagag gatctggatc 2581 tccaaaccac caaggcagcg ggtggtggg // LOCUS HSBGPG 1675 bp DNA PRI 24-APR-1993 DEFINITION Human gene for bone gla protein (BGP). ACCESSION X04143 NID g29449 KEYWORDS bone gla protein; osteocalcin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1675) AUTHORS Celeste,A.J., Rosen,V., Buecker,J.L., Kriz,R., Wang,E.A. and Wozney,J.M. TITLE Isolation of the human gene for bone gla protein utilizing mouse and rat cDNA clones JOURNAL EMBO J. 5 (8), 1885-1890 (1986) MEDLINE 87004555 FEATURES Location/Qualifiers source 1..1675 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 420..425 exon <496..559 /number=1 CDS join(496..559,817..849,1026..1095,1297..1426) /codon_start=1 /product="BGP" /db_xref="PID:g29450" /db_xref="SWISS-PROT:P02818" /translation="MRALTLLALLALAALCIAGQAGAKPSGAESSKAFVSKQEGSEVV KRPRRYLYQWLGAPVPYPDPLEPRREVCELNPDCDELADHIGFQEAYRRFYGPV" intron 560..816 /number=1 exon 817..849 /number=2 intron 850..1025 /number=2 exon 1026..1095 /number=3 intron 1096..1296 /number=3 exon 1297..>1426 /number=4 polyA_signal 1562..1567 BASE COUNT 302 a 524 c 519 g 330 t ORIGIN 1 acggggctga cagtagaaat cacaggctgt gagacagctg gagcccagct ctgcttgaac 61 ctattttagg tctctgatcc ccgcttcctc tttagactcc cctagagctc agccagtgct 121 caacctgagg ctgggggtct ctgaggaaga gtgagttgga gctgaggggt ctggggctgt 181 cccctgagag aggggccaga ggcagtgtca agagccgggc agtctgattg tggctcaccc 241 tccatcactc ccaggggccc ctggcccagc agccgcagct cccaaccaca atatcctctg 301 gggtttggcc tacggagctg gggcggatga cccccaaata gccctggcag attcccccta 361 gacccgcccg caccatggtc aggcatgccc ctcctcatcg ctgggcacag cccagagggt 421 ataaacagtg ctggaggctg gcggggcagg ccagctgagt cctgagcagc agcccagcgc 481 agccaccgag acaccatgag agccctcaca ctcctcgccc tattggccct ggccgcactt 541 tgcatcgctg gccaggcagg tgagtgcccc cacctcccct caggccgcat tgcagtgggg 601 gctgagagga ggaagcacca tggcccacct cttctcaccc ctttggctgg cagtcccttt 661 gcagtctaac caccttgttg caggctcaat ccatttgccc cagctctgcc cttgcagagg 721 gagaggaggg aagagcaagc tgcccgagac gcaggggaag gaggatgagg gccctgggga 781 tgagctgggg tgaaccaggc tccctttcct ttgcaggtgc gaagcccagc ggtgcagagt 841 ccagcaaagg tgcaggtatg aggatggacc tgatgggttc ctggaccctc ccctctcacc 901 ctggtccctc agtctcattc ccccactcct gccacctcct gtctggccat caggaaggcc 961 agcctgctcc ccacctgatc ctcccaaacc cagagccacc tgatgcctgc ccctctgctc 1021 cacagccttt gtgtccaagc aggagggcag cgaggtagtg aagagaccca ggcgctacct 1081 gtatcaatgg ctggggtgag agaaaaggca gagctgggcc aaggccctgc ctctccggga 1141 tggtctgtgg gggagctgca gcagggagtg gcctctctgg gttgtggtgg gggtacaggc 1201 agcctgccct ggtgggcacc ctggagcccc atgtgtaggg agaggaggga tgggcatttt 1261 gcacgggggc tgatgccacc acgtcgggtg tctcagagcc ccagtcccct acccggatcc 1321 cctggagccc aggagggagg tgtgtgagct caatccggac tgtgacgagt tggctgacca 1381 catcggcttt caggaggcct atcggcgctt ctacggcccg gtctagggtg tcgctctgct 1441 ggcctggccg gcaaccccag ttctgctcct ctccaggcac ccttctttcc tcttcccctt 1501 gcccttgccc tgacctccca gccctatgga tgtggggtcc ccatcatccc agctgctccc 1561 aaataaactc cagaagagga atctgtgggc ctgtgagtct gtccagttta tggagtgtgg 1621 gagggaggtg tcaggaggat gggggtgagg aggttttacc ttcttcagtt ctaga // LOCUS HSAPOAIA 2209 bp DNA PRI 03-NOV-1994 DEFINITION Human fetal gene for apolipoprotein AI precursor. ACCESSION X01038 NID g28769 KEYWORDS apolipoprotein; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2209) AUTHORS Seilhamer,J.J., Protter,A.A., Frossard,P. and Levy-Wilson,B. TITLE Isolation and DNA sequence of full-length cDNA and of the entire gene for human apolipoprotein AI--discovery of a new genetic polymorphism in the apo AI gene JOURNAL DNA 3 (4), 309-317 (1984) MEDLINE 85026665 FEATURES Location/Qualifiers source 1..2209 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 8..13 mRNA join(39..56,254..316,504..660,1249..1907) exon 39..56 /number=1 intron 57..253 /number=1 exon 254..316 /number=2 sig_peptide join(274..316,504..514) CDS join(274..316,504..660,1249..1852) /codon_start=1 /product="apolipoprotein AI precursor" /db_xref="PID:g296635" /db_xref="SWISS-PROT:P02647" /translation="MKAAVLTLAVLFLTGSQARHFWQQDEPPQSPWDRVKDLATVYVD VLKDSGRDYVSQFEGSALGKQLNLKLLDNWDSVTSTFSKLREQLGPVTQEFWDNLEKE TEGLRQEMSKDLEEVKAKVQPYLDDFQKKWQEEMELYRQKVEPLRAELQEGARQKLHE LQEKLSPLGEEMRDRARAHVDALRTHLAPYSDELRQRLAARLEALKENGGARLAEYHA KATEHLSTLSEKAKPALEDLRQGLLPVLESFKVSFLSALEEYTKKLNTQ" intron 317..503 /number=2 exon 504..660 /number=3 mat_peptide join(515..660,1249..1849) /product="apolipoprotein AI" intron 661..1248 /number=3 exon 1249..1907 /number=4 BASE COUNT 450 a 688 c 685 g 384 t 2 others ORIGIN 1 ctgcagacat aaataggccc tgcaagagct ggctgcttag agactgcgag aaggaggtgc 61 gtcctgctgc ctgccccggt cactctggct ccccagctca aggttcaggc cttgccccag 121 gccgggcctc tgggtacctg aggtcttctc ccgctctgtg cccttctcct cacctggctg 181 caatgagtgg gggagcacgg ggcttctgca tgctgaaggc accccactca gccaggccct 241 tcttctcctc caggtccccc acggcccttc aggatgaaag ctgcggtgct gaccttggcc 301 gtgctcttcc tgacgggtag gtgtccccta acctagggag ccaaccatcg gggggccttc 361 tccctaaatc cccgtggccc accctcctgg gcagaggcag caggtttctc actggccccc 421 tctcccccac ctccaagctt ggcctttcgg ctcagatctc agcccacagc tggcctgatc 481 tgggtctccc ctcccaccct cagggagcca ggctcggcat ttctggcagc aagatgaacc 541 cccccagagc ccctgggatc gagtgaagga cctggccact gtgtacgtgg atgtgctcaa 601 agacagcggc agagactatg tgtcccagtt tgaaggctcc gccttgggaa aacagctaaa 661 gtaaggaccc agcctggggt tgagggcagg ggcagggggc agaggcctgt gggatgatgt 721 tgaagccaga ctggccgagt cctcacctaa tatctgatga gctgggcccc acagatggtc 781 tggatggaga aaccggaatg gatctccagg cagggtcaca gcccatgtcc cctgcaaagg 841 acagaccagg gctgcccgat gcgtgatcac agagccacat tgtgcctgca agtgtagcaa 901 gcccctttcc cttcttcacc acctcctctg ctcctgccca gcaagactgt gggctgtctt 961 cggagaggag aatgcgctgg aggcatagaa gcgaggtcct tcaagggccc actttggaga 1021 ccaacgtaac tgggcaccag tcccagctct gtctcctttt tagctcctct ctgtgcctcg 1081 gtccagctgc acaacggggc atggcctggc ggggcagggg tgttggttga gagtgtactg 1141 gaaatgctag gccactgcac ctccgcggac aggtgtcacc cagggctcac ccctgatagg 1201 ctggggcgct gggaggccag ccctcaaccc ttctgtctca ccctccagcc taaagctcct 1261 tgacaactgg gacagcgtga cctccacctt cagcaagctg cgcgaacagc tcggccctgt 1321 gacccaggag ttctgggata acctggaaaa ggagacagag ggcctgaggc aggagatgag 1381 caaggatctg gaggaggtga aggccaaggt gcagccctac ctggacgact tccagaagaa 1441 gtggcaggag gagatggagc tctaccgcca gaaggtggag ccgctgcgcg cagagctcca 1501 agagggcgcg cgccagaagc tgcacgagct gcaagagaag ctgagcccac tgggcgagga 1561 gatgcgcgac cgcgcgcgcg cccatgtgga cgcgctgcgc acgcatctgg ccccctacag 1621 cgacgagctg cgccagcgct tggccgcgcg ccttgaggct ctcaaggaga acggcggcgc 1681 cagactggcc gagtaccacg ccaaggccac cgagcatctg agcacgctca gcgagaaggc 1741 caagcccgcg ctcgaggacc tccgccaagg cctgctgccc gtgctggaga gcttcaaggt 1801 cagcttcctg agcgctctcg aggagtacac taagaagctc aacacccagt gaggcgcccg 1861 ccgccgcccc ccttcccggt gctcagaata aacgtttcca aagtgggaag cagcttcttt 1921 cttttgggag aatagagggg ggtgcgggga catccggggg agcccgggag gggcctttgg 1981 ccctggagca gggacttcct gccggatctc aacaactccg tgcccagact ggacgtctta 2041 gggccaagat cgacgttgga ggacctgctg gacgcntggc tgcttacgag tgagggagta 2101 gagtctgcct tagcaaggct caagtagaaa ggaagtcaca gcggacnagg caaagccaca 2161 gacaatccaa ggccaggtgc cctgaaaggg gctcaaacaa ggcctgcag //