J02342. Rous sarcoma virus LOCUS ALRCG 9625 bp ss-RNA linear VRL 04-MAR-1996 DEFINITION Rous sarcoma virus (Prague strain, subgroup C) cDNA to genomic RNA, complete genome. ACCESSION J02342 J02021 J02343 VERSION J02342.1 GI:210171 KEYWORDS c-myc proto-oncogene; complete genome; env protein; gag protein; long terminal repeat; origin of replication; pol protein; polyprotein; protein kinase; src oncogene. SOURCE Rous sarcoma virus ORGANISM Rous sarcoma virus Viruses; Retroid viruses; Retroviridae; Alpharetrovirus. REFERENCE 1 (bases 234 to 351) AUTHORS Haseltine,W.A., Maxam,A.M. and Gilbert,W. TITLE Rous sarcoma virus genome is terminally redundant: the 5' sequence JOURNAL Proc. Natl. Acad. Sci. U.S.A. 74 (3), 989-993 (1977) MEDLINE 77149036 PUBMED 66683 REFERENCE 2 (bases 234 to 351) AUTHORS Shine,J., Czernilofsky,A.P., Friedrich,R., Bishop,J.M. and Goodman,H.M. TITLE Nucleotide sequence at the 5' terminus of the avian sarcoma virus genome JOURNAL Proc. Natl. Acad. Sci. U.S.A. 74 (4), 1473-1477 (1977) MEDLINE 77172915 PUBMED 67601 REFERENCE 3 (bases 1 to 374; 9273 to 9625) AUTHORS Katz,R.A., Omer,C.A., Weis,J.H., Mitsialis,S.A., Faras,A.J. and Guntaka,R.V. TITLE Restriction endonuclease and nucleotide sequence analyses of molecularly cloned unintegrated avian tumor virus DNA: structure of large terminal repeats in circle junctions JOURNAL J. Virol. 42 (1), 346-351 (1982) MEDLINE 82217008 PUBMED 6283156 REFERENCE 4 (bases 234 to 9545) AUTHORS Schwartz,D.E., Tizard,R. and Gilbert,W. TITLE Nucleotide sequence of Rous sarcoma virus JOURNAL Cell 32 (3), 853-869 (1983) MEDLINE 83155662 PUBMED 6299578 REFERENCE 5 (sites) AUTHORS Broome,S. and Gilbert,W. TITLE Rous sarcoma virus encodes a transcriptional activator JOURNAL Cell 40 (3), 537-546 (1985) MEDLINE 85124605 PUBMED 2982497 COMMENT Original source text: Rous sarcoma virus (strain Prague) (clone: pATV-[6,8,9]) cDNA to genomic RNA. [5] sites; transcriptional activator protein and mRNA. Proviral RSV has the following structure: 5'LTR-gag-pol-env-src-3'LTR. The single plus stranded 35S virion RNA is identical, with the exception of lacking the 5' U-3 and 3' U-5 segments. Two identical copies of this 35S RNA, associated as a 70S RNA complex, are present in each virion. After viral infection, the 35S RNA is reverse transcribed in the cytoplasm into linear double stranded DNA with a complete LTR at each end. After migrating to the nucleus, some of these molecules become circularized. The double stranded DNA integrates into the host DNA by an unknown mechanism. Positions 335-352 are complementary to the 3' stem of host-encoded Trp-tRNA. Trp-tRNA binds to virion RNA at this site and serves as a primer for DNA synthesis by reverse transcription. The integrated proviral DNA is transcribed to produce 35S RNAs with sequence identical to the virion RNA. The 35S RNAs can be translated directly or processed by the cellular RNA splicing machinery to produce mRNAs encoding additional viral proteins. In order of length the viral mRNAs (and their products) are: 35Sa (gag-Pr76) mRNA, 35Sb (gag-pol-Pr180) mRNA, 35Sc (trn-act) mRNA, 28S (env-Pr95) mRNA, and 21S (src-p60) mRNA. The 35Sa (gag-Pr76) mRNA is apparently full-length and identical to virion RNA. The 35Sc (trn-act), 28S (env-Pr95), and 21S (src-p60) mRNAs all begin with the same 5' exon, but have varying lengths of intronic sequence (beginning at the splice-donor site following position 630) removed to produce the mature mRNAs. The mechanism for production of the polyprotein precursor gag-pol-Pr-180 remains uncertain. The reading frame of gag is not the same as that of pol, so merely suppressing the amber stop codon of gag does not give a gag-pol read-through product, and no acceptable RNA splicing sites are apparent. It is known that the gag proteins, including P12, are cleaved from the gag-pol-Pr180 polyprotein as well as from the gag-Pr76 polyprotein. gag-pol-Pr180 is tentatively annotated in the features as two exons with undetermined intron boundaries. The pol-derived portion of gag-pol-Pr180 is processed to yield the reverse transcriptase beta subunit, which in turn is processed to yield the reverse transcriptase alpha subunit and p32 (tentatively identified as a DNA endonuclease). [4] reports the dimer linkage site to be at position 756-781. The src gene is believed to have been obtained from avian DNA when an ALV-like virus recombined with host DNA. Homology to the c-src gene of chicken begins at position 7271. A direct repeat of about 100 bp is present near either end of exon 2 of the 21S (src) mRNA. The repeats include positions 7130-7222 and 9024-9123. [4] also sequenced 95% of the genome of Prague C RSV using cDNA to viral RNA. There are numerous conflicts between the sequence obtained from DNA and that obtained from cDNA. The sequence reported here is the DNA sequence. [4] contains an in-depth discussion of the proteins encoded by gag, gag-pol, env, and src mRNAs. [4] points out 12 p19 binding sites that may influence splicing, RNA packing, and 35S RNA dimer formation. [5] found that the target for the action of the transcriptional activator protein lies between 111 and 620 nucleotides upstream of the cap site. Complete source information: Rous sarcoma virus (Prague subgroup C): cDNA to viral RNA [2],[1], [4]; unintegrated DNA, clones pATV-6 [3], pATV-8 [3],[4],[5], and pATV-9 [3]. FEATURES Location/Qualifiers source 1..9625 /organism="Rous sarcoma virus" /mol_type="genomic RNA" /strain="Prague" /db_xref="taxon:11886" /clone="pATV-[6,8,9]" LTR 1..334 /note="5' LTR" mRNA 234..9545 /product="35Sa (gag), 35Sb (gag-pol), 35Sc (trn-act), 28S (env), 21S (src) mRNA" repeat_region 234..254 /note="5' terminal repeat" conflict 279 /citation=[4] /replace="" misc_binding 335..352 /bound_moiety="Trp-tRNA stem" conflict 335 /citation=[2] /replace="" CDS join(613..630,5311..7098) /note="env-Pr95 polyprotein precursor" /codon_start=1 /protein_id="AAB59934.1" /db_xref="GI:210174" /translation="MEAVIKAFLTGYPGKTSKKDSKEKPLATSKKDPEKTPLLPTRVN YILIIGVLVLCEVTGVRADVHLLEQPGNLWITWANRTGQTDFCLSTQSATSPFQTCLI GIPSPISEGDFKGYVSDTNCSTVGTDRLVLSASITGGPDNSTTLTYRKVSCLLLKLNV SMWDEPPELQLLGSQSLPNVTNITQVSGVAGGCVYFAPRATGLFLGWSKQGLSRFLLR HPFTSTSNSTEPFTVVTADRHNLFMGSEYCGAYGYRFWEIYNCSQTRNTYRCGDVGGT GLPETWCRGKGGIWVNQSKEINETEPFSFTANCTGSNLGNVSGCCGEPITILPLGAWI DSTQGSFTKPKALPPAIFLICGDRAWQGIPSRPVGGPCYLGKLTMLAPNHTDILKILA NSSRTGIRRKRSVSHLDDTCSDEVQLWGPTARIFASILAPGVAAAQALREIERLACWS VKQANLTTSLLGDLLDDVTSIRHAVLQNRAAIDFLLLAHGHGCEDVAGMCCFNLSDHS ESIQKKFQLMKKHVNKIGVDSDPIGSWLRGIFGGIGEWAVHLLKGLLLGLVVILLLLV CLPCLLQFVSSSIRKMINSSINYHTEYRKMQGGAV" mat_peptide 5479..7095 /product="glycoprotein-85" mat_peptide 6502..7095 /product="glycoprotein-37" CDS join(613..2343,2736..5423) /note="gag-pol-Pr180 polyprotein precursor" /codon_start=1 /protein_id="AAB59933.1" /db_xref="GI:210173" /translation="MEAVIKVISSACKTYCGKTSPSKKEIGAMLSLLQKEGLLMSPSD LYSPGSWDPITAALSQRAMILGKSGELKTWGLVLGALKAAREEQVTSEQAKFWLGLGG GRVSPPGPECIEKPATERRIDKGEEVGETTVQRDAKMAPEETATPKTVGTSCYHCGTA IGCNCATASAPPPPYVGSGLYPSLAGVGEQQGQGGDTPPGAEQSRAEPGHAGQAPGPA LTDWARVREELASTGPPVVAMPVVIKTEGPAWTPLEPKLITRLADTVRTKGLRSPITM AEVEALMSSPLLPHDVTNLMRVILGPAPYALWMDAWGVQLQTVIAAATRDPRHPANGQ GRGERTNLNRLKGLADGMVGNPQGQAALLRPGELVAITASALQAFREVARLAEPAGPW ADIMQGPSESFVDFANRLIKAVEGSDLPPSARAPVIIDCFRQKSQPDIQQLIRTAPST LTTPGEIIKYVLDRQKTAPLTDQGIAAAMSSAIQPLIMAVVNRERDGQTGSGGRARGL CYTCGSPGHYQAQCPKKRKSGNSRERCQLCNGMGHNAKQCRKRDGNQGQRPGKGLSSG PWPGPEPPAVSTVALHLAIPLKWKPDHTPVWIDQWPLPEGKLVALTQLVEKELQLGHI EPSLSCWNTPVFVIRKASGSYRLLHDLRAVNAKLVPFGAVQQGAPVLSALPRGWPLMV LDLKDCFFSIPLAEQDREAFAFTLPSVNNQAPARRFQWKVLPQGMTCSPTICQLVVGQ VLEPLRLKHPSLCMLHYMDDLLLAASSHDGLEAAGEEVISTLERAGFTISPDKVQREP GVQYLGYKLGSTYVAPVGLVAEPRIATLWDVQKLVGSLQWLRPALGIPPRLMGPFYEQ LRGSDPNEAREWNLDMKMAWREIVRLSTTAALERWDPALPLEGAVARCEQGAIGVLGQ GLSTHPRPCLWLFSTQPTKAFTAWLEVLTLLITKLRASAVRTFGKEVDILLLPACFRE DLPLPEGILLALKGFAGKIRSSDTPSIFDIARPLHVSLKVRVTDHPVPGPTVFTDASS STHKGVVVWREGPRWEIKEIADLGASVQQLEARAVAMALLLWPTTPTNVVTDSAFVAK MLLKMGQEGVPSTAAAFILEDALSQRSAMAAVLHVRSHSEVPGFFTEGNDVADSQATF QAYPLREAKDLHTALHIGPRALSKACNISMQQAREVVQTCPHCNSAPALEAGVNPRGL GPLQIWQTDFTLEPRMAPRSWLAVTVDTASSAIVVTQHGRVTSVAVQHHWATAIAVLG RPKAIKTDNGSCFTSKSTREWLARWGIAHTTGIPGNSQGQAMVERANRLLKDRIRVLA EGDGFMKRIPTSKQGELLAKAMYALNHFERGENTKTPIQKHWRPTVLTEGPPVKIRIE TGEWEKGWNVLVWGRGYAAVKNRDTDKVIWVPSRKVKPDITQKDEVTKKDEASPLFAG ISDWIPWEDEQEGLQGETASNKQERPGEDTLAANES" mat_peptide 2736..5420 /product="reverse transcriptase beta subunit" mat_peptide 2736..2786 /product="reverse transcriptase alpha subunit" CDS 613..2718 /note="gag-Pr76 polyprotein precursor" /codon_start=1 /protein_id="AAB59932.1" /db_xref="GI:210175" /translation="MEAVIKVISSACKTYCGKTSPSKKEIGAMLSLLQKEGLLMSPSD LYSPGSWDPITAALSQRAMILGKSGELKTWGLVLGALKAAREEQVTSEQAKFWLGLGG GRVSPPGPECIEKPATERRIDKGEEVGETTVQRDAKMAPEETATPKTVGTSCYHCGTA IGCNCATASAPPPPYVGSGLYPSLAGVGEQQGQGGDTPPGAEQSRAEPGHAGQAPGPA LTDWARVREELASTGPPVVAMPVVIKTEGPAWTPLEPKLITRLADTVRTKGLRSPITM AEVEALMSSPLLPHDVTNLMRVILGPAPYALWMDAWGVQLQTVIAAATRDPRHPANGQ GRGERTNLNRLKGLADGMVGNPQGQAALLRPGELVAITASALQAFREVARLAEPAGPW ADIMQGPSESFVDFANRLIKAVEGSDLPPSARAPVIIDCFRQKSQPDIQQLIRTAPST LTTPGEIIKYVLDRQKTAPLTDQGIAAAMSSAIQPLIMAVVNRERDGQTGSGGRARGL CYTCGSPGHYQAQCPKKRKSGNSRERCQLCNGMGHNAKQCRKRDGNQGQRPGKGLSSG PWPGPEPPAVSLAMTMEHKDRPLVRVILTNTGSHPVKQRSVYITALLDSGADITIISE EDWPTDWPVMEAANPQIHGIGGGIPMRKSRDMIELGVINRDGSLERPLLLFPAVAMVR GSILGRDCLQGLGLRLTNL" mat_peptide 613..1137 /product="P19 protein" mat_peptide 1144..1329 /product="P10 protein" mat_peptide 1330..2049 /product="P27 protein" mat_peptide 2077..2343 /product="P12 protein" mat_peptide 2344..2715 /product="P15 protein" mat_peptide 2344..2346 /product="P15 protein" exon <613..2343 /note="gag-pol-Pr180 polyprotein precursor" /number=1 exon 613..2343 /note="gag-pol-Pr180 polyprotein precursor; putative" /number=1 CDS join(613..630,914..1270) /note="transcriptional activator protein" /codon_start=1 /protein_id="AAB59931.1" /db_xref="GI:210172" /translation="MEAVIKGEGGSLPQVRSASRNQQRSGESTKGRKWEKQLCSEMRR WRRRKRPHLKPLAHPAIIAEQLLAVIAPQPRLLLLLMWGVVCILPWRGWESSRARGVT HLRGRNSQGRSQGMRVRLLGRP" exon <613..630 /note="env-Pr95 polyprotein precursor" /number=1 intron 631..7286 /note="21S (src) intron A" intron 631..5310 /note="env-Pr95 intron A" intron 631..913 /note="trn-act intron A" exon 914..>1270 /note="transcriptional activator protein, [5]" /number=2 exon 2736..5423 /note="gag-pol-Pr180 polyprotein precursor; putative" /number=2 exon 2736..>5423 /note="gag-pol-Pr180 polyprotein precursor" /number=2 exon 5311..>7098 /note="env-Pr95 polyprotein precursor" /number=2 CDS 7362..8942 /note="src-p60 phosphoprotein" /codon_start=1 /protein_id="AAB59935.1" /db_xref="GI:210176" /translation="MGSSKSKPKDPSQRRHSLEPPDSTHHGGFPASQTPDETAAPDAH RNPSRSFGTVATEPKLFWGFNTSDTVTSPQRAGALAGGVTTFVALYDYESWTETDLSF KKGERLQIVNNTEGDWWLAHSLTTGQTGYIPSNYVAPSDSIQAEEWYFGKITRRESER LLLNPENPRGTFLVRKSETAKGAYCLSVSDFDNAKGPNVKHYKIYKLYSGGFYITSRT QFGSLQQLVAYYSKHADGLCHRLANVCPTSKPQTQGLAKDAWEIPRESLRLEAKLGQG CFGEVWMGTWNDTTRVAIKTLKPGTMSPEAFLQEAQVMKKLRHEKLVQLYAVVSEEPI YIVIEYMSKGSLLDFLKGEMGKYLRLPQLVDMAAQIASGMAYVERMNYVHRDLRAANI LVGENLVCKVADFGLARLIEDNEYTARQGAKFPIKWTAPEAALYGRFTIKSDVWSFGI LLTELTTKGRVPYPGMVNREVLDQVERGYRMPCPPECPESLHDLMCQCWRKDPEERPT FKYLQAQLLPACVLEVAE" LTR 9291..9625 /note="3' LTR" repeat_region 9525..9545 /note="3' terminal repeat" ORIGIN 1 atgtagtctt atgcaatact cctgtagtct tgcaacatgc ttatgtaacg atgagttagc 61 aatatgcctt acaaggaaag aaaaggcacc gtgcatgccg attggtggta gtaaggtggt 121 acgatcgtgc cttattagga aggtatcaga cgggtctaac atggattgga cgaaccactg 181 aattccgcat cgcagagata ttgtatttaa gtgcctagct cgatacaata aacgccattt 241 taccattcac cacattggtg tgcacctggg ttgatggccg gaccgtcgat tccctaacga 301 ttgcgaacac ctgaatgaag cagaaggctt catttggtga ccccgacgtg atagttaggg 361 aatagtggtc ggccacagac ggcgtggcga tcctgccctc atccgtctcg cttattcggg 421 gagcggacga tgaccctagt agagggggct gcggcttagg agggcagaag ctgagtggcg 481 tcggagggag ctctactgca gggagcccag ataccctacc gagaactcag agagtcgttg 541 gaagacggga aggaagcccg acgactgagc agtccacccc aggcgtgatt ctggtcgccc 601 ggtggatcaa gcatggaagc cgtcataaag gtgatttcgt ccgcgtgtaa aacctattgc 661 gggaaaacct ctccttctaa gaaggaaata ggggccatgt tgtccctctt acaaaaggaa 721 gggttgctta tgtctccctc agacttatat tccccggggt cctgggatcc cattaccgcg 781 gcgctatccc agcgggctat gatacttggg aaatcgggag agttaaaaac ctggggattg 841 gttttggggg cattgaaggc ggctcgagag gaacaggtta catctgagca agcaaagttt 901 tggttgggat tagggggagg gagggtctct cccccaggtc cggagtgcat cgagaaacca 961 gcaacggagc ggcgaatcga caaaggggag gaagtgggag aaacaactgt gcagcgagat 1021 gcgaagatgg cgccggagga aacggccaca cctaaaaccg ttggcacatc ctgctatcat 1081 tgcggaacag ctattggctg taattgcgcc acagcctcgg ctcctcctcc tccttatgtg 1141 gggagtggtt tgtatccttc cctggcgggg gtgggagagc agcagggcca ggggggtgac 1201 acacctccgg gggcggaaca gtcaagggcg gagccagggc atgcgggtca ggctcctggg 1261 ccggccctga ctgactgggc aagggtcagg gaggagcttg cgagtactgg tccgcccgtg 1321 gtggccatgc ctgtagtgat taagacagag ggacccgctt ggacccctct ggagccaaaa 1381 ttgatcacaa gactggctga tacggtcagg accaagggct tacgatcccc gattactatg 1441 gcagaagtgg aagcgcttat gtcctccccg ctgctgccgc atgacgtcac gaatctaatg 1501 agagttattt tagggcctgc cccatatgcc ttatggatgg acgcttgggg agtccaactc 1561 cagacagtta tagcggcagc cactcgcgac ccccgacacc cagcgaacgg tcaagggcgg 1621 ggggaacgga ctaatttgaa tcgcttaaag ggcttagctg atgggatggt gggcaaccca 1681 cagggtcagg ccgcattatt aagaccgggg gaattggttg ctattacggc gtcggctctc 1741 caggcgttta gagaggttgc ccggctggcg gaacctgcag gtccatgggc ggacatcatg 1801 cagggaccat ctgagtcctt tgttgatttt gccaatcggc ttataaaggc ggttgagggg 1861 tcagatctcc cgccttccgc gcgggctccg gtgatcattg actgctttag gcagaagtca 1921 cagccagata ttcagcagct tatacggaca gcaccctcca cgctgaccac cccaggagag 1981 ataattaaat atgtgctaga caggcagaag actgcccctc ttacggatca aggcatagcc 2041 gcggccatgt cgtctgctat ccagccctta attatggcag tagtcaatag agagagggat 2101 ggacaaactg ggtcgggtgg tcgtgcccga gggctctgct acacttgtgg atccccggga 2161 cattatcagg cgcagtgccc gaaaaaacgg aagtcaggaa acagccgtga gcgatgtcag 2221 ttgtgtaacg ggatgggaca caacgctaaa cagtgtagga agcgggatgg caaccagggc 2281 caacgcccag gaaaaggtct ctcttcgggg ccgtggcccg gccctgagcc acctgccgtc 2341 tcgttagcga tgacaatgga acataaagat cgccccttgg ttagggtcat tctgactaac 2401 actgggagtc atccggtcaa acagcgttcg gtgtatatca ccgcgctgtt ggactctgga 2461 gcggacatca ctattatttc agaggaggat tggcccaccg attggccagt gatggaggcc 2521 gcgaacccgc agatccatgg gataggaggg ggaattccca tgcgaaaatc tcgtgacatg 2581 atagagttgg gggttattaa ccgagacggg tctttggagc gacccctgct cctcttcccc 2641 gcagtagcta tggttagagg gagtatccta ggaagagatt gtctgcaggg cctagggctc 2701 cgcttgacaa atttataggg agggccactg ttctcactgt tgcgctacat ctggctattc 2761 cgctcaaatg gaagccagac cacacgcctg tgtggattga ccagtggccc ctccctgaag 2821 gtaaacttgt agcgctaacg caattagtgg aaaaagaatt acagttagga catatagaac 2881 cttcacttag ttgttggaac acacctgtct tcgtgatccg gaaggcttcc gggtcttacc 2941 gcttactgca tgatttgcgc gctgttaacg ccaagcttgt tccttttggg gccgtccaac 3001 agggggcgcc agttctctcc gcgctcccgc gtggctggcc cctgatggtc ttagacctca 3061 aggattgctt cttttctatc cctcttgcgg aacaagatcg cgaagctttt gcatttacgc 3121 tcccctctgt gaataaccag gcccccgctc gaagattcca atggaaggtc ttgccccaag 3181 ggatgacctg ttctcccact atctgtcagt tggtagtggg tcaggtactt gagcccttgc 3241 gactcaagca cccatctctg tgcatgttgc attatatgga tgatcttttg ctagccgcct 3301 caagtcacga tgggttggaa gcggcagggg aggaggttat cagtacattg gaaagagccg 3361 ggttcactat ttcgcctgat aaggtccaga gggagcccgg agtacaatat cttgggtaca 3421 agttaggcag tacgtatgta gcacccgtag gcctggtagc agaacccagg atagccacct 3481 tgtgggatgt tcaaaagctg gtggggtcac ttcagtggct tcgcccagcg ttaggaatcc 3541 cgccacgact gatgggcccc ttctatgagc agttacgagg gtcagatcct aacgaggcga 3601 gggaatggaa tctagacatg aaaatggcct ggagagagat cgtacggctt agcaccactg 3661 ctgccttgga acgatgggac cctgccctgc ctctggaagg agcggtcgct agatgtgaac 3721 agggggcaat aggggttttg ggacagggac tgtccacaca cccaaggcca tgcttgtggt 3781 tattctccac ccaacccacc aaggcgttta ctgcttggtt agaagtgctc acccttttga 3841 ttactaagct acgtgcttcg gcagtgcgaa cctttggcaa ggaggtcgat atcctcctgt 3901 tgcctgcatg ctttcgggag gaccttccgc tcccagaggg gatcctgtta gcccttaagg 3961 ggtttgcagg aaaaatcagg agtagtgaca cgccatctat ttttgacatt gcgcgtccac 4021 tgcatgtttc tctgaaagtg agggttaccg accaccctgt gccgggaccc actgtcttta 4081 ctgacgcctc ctcaagcacc cataaggggg tggtagtctg gagggagggc ccaaggtggg 4141 agataaaaga aatagctgat ttgggggcaa gtgtacaaca actggaagca cgcgctgtgg 4201 ccatggcact tctgctgtgg ccgacaacgc ccactaatgt agtgactgac tccgcgtttg 4261 ttgcgaaaat gttactcaag atgggacagg agggagtccc gtctacagcg gcggctttta 4321 ttttagagga tgcgttaagc caaaggtcag ccatggccgc cgttctccac gtgcggagtc 4381 attctgaagt gccagggttt ttcacagaag gaaatgacgt ggcagatagc caagccacct 4441 tccaagcgta tcccttgaga gaggctaaag atcttcatac cgctctccat attggacccc 4501 gcgcgctatc caaagcgtgt aatatatcta tgcagcaggc tagggaggtt gttcagacct 4561 gcccgcattg taattcagcc cctgcgttgg aggccggagt aaaccctagg ggtttgggac 4621 ccctacagat atggcagaca gactttacgc ttgagcctag aatggccccc cgttcctggc 4681 tcgctgttac tgtggatacc gcctcatcag cgatagtcgt aactcagcat ggccgtgtca 4741 catcggttgc tgtacaacat cattgggcca cggctatcgc cgttttggga agaccaaagg 4801 ccataaaaac agataatggg tcctgcttca cgtctaaatc cacgcgagag tggctcgcga 4861 gatgggggat agcacacacc accgggattc cgggtaattc ccagggtcaa gctatggtag 4921 agcgggccaa ccggctcctg aaagatagga tccgtgtgct tgcggagggg gacggcttta 4981 tgaaaagaat ccccaccagc aaacaggggg aactattagc caaggcaatg tatgccctca 5041 atcactttga gcgtggtgaa aacacgaaaa caccgataca aaaacactgg agacctaccg 5101 ttcttacaga aggacccccg gttaaaatac gaatagagac aggggagtgg gaaaaaggat 5161 ggaacgtgct ggtctgggga cgaggttatg ccgctgtgaa aaacagggac actgataagg 5221 ttatttgggt accctctcga aaagttaaac cggacatcac ccaaaaggat gaggtgacta 5281 agaaagatga ggcgagccct ctttttgcag gcatttctga ctggataccc tgggaagacg 5341 agcaagaagg actccaagga gaaaccgcta gcaacaagca agaaagaccc ggagaagaca 5401 cccttgctgc caacgagagt taattatatt ctcattattg gtgtcctggt cttgtgtgag 5461 gttacggggg taagagctga tgttcactta ctcgagcagc cagggaacct ttggattaca 5521 tgggccaacc gtacaggcca aacggatttc tgcctctcta cacagtcagc cacctcccct 5581 tttcaaacat gtttgatagg tatcccgtct cctatttccg aaggtgattt taagggatat 5641 gtttctgata caaattgctc cactgtggga actgaccggt tagtcttgtc agccagcatt 5701 accggcggcc ctgacaacag caccaccctc acttatcgaa aggtttcatg cctgctgtta 5761 aagctgaacg tctccatgtg ggatgagcca cctgaactgc agctgctagg ttcccagtct 5821 ctccctaacg ttactaacat tactcaggtc tctggcgtgg ccgggggatg tgtatatttc 5881 gccccaaggg ccactggcct gtttttaggt tggtctaaac aaggtctctc gcggttcctc 5941 ctccgtcacc cctttacctc cacctctaac tccacggaac cgttcacggt ggtgacagcg 6001 gatagacaca atctttttat ggggagtgag tactgtggtg catatggcta cagattttgg 6061 gaaatatata actgctcaca gactaggaat acttaccgct gtggagacgt gggaggtact 6121 ggcctccctg aaacctggtg cagaggaaaa ggaggtatat gggttaatca atcaaaggaa 6181 attaatgaga cagagccgtt cagttttact gcgaactgta ctggcagtaa tttgggtaat 6241 gtcagcggat gttgcggaga accaatcacg attctcccac taggggcatg gatcgacagt 6301 acgcaaggta gtttcactaa accaaaagcg ctaccacccg caattttcct catttgtggg 6361 gatcgcgcat ggcaaggaat tcccagtcgt ccggtagggg gcccctgcta tttaggcaag 6421 cttaccatgt tagcacccaa ccatacagat attctcaaaa tacttgctaa ttcgtcgcgg 6481 acaggtataa gacgtaaacg aagcgtctca cacctggatg atacatgctc agatgaagta 6541 cagctttggg gtcctacagc aagaatcttt gcatctatct tagccccggg ggtagcagct 6601 gcgcaagcct taagagaaat tgagagacta gcctgttggt ccgttaaaca ggctaacttg 6661 acaacatcac tcctcgggga cttattggat gatgtcacga gtattcgaca cgcggtcctg 6721 cagaaccgag cggctattga cttcttgctt ctagctcacg gccatggctg tgaggacgtt 6781 gccggaatgt gttgtttcaa tctgagtgat cacagtgaat ctatacagaa gaagttccag 6841 ctaatgaaga aacatgtcaa taagatcggc gtggacagcg acccaatcgg aagttggctg 6901 cgagggatat tcgggggaat aggggaatgg gccgttcatc tgctaaaagg actgcttttg 6961 gggcttgtag ttattttatt gctactggtg tgcctgcctt gccttttaca atttgtgtct 7021 agtagtattc gaaagatgat taatagttca atcaactatc atactgaata caggaagatg 7081 cagggcggag cagtctagag ctcagttata ataatcctgc gaatcgggct gtaacggggc 7141 aaggcttgac cgaggggact ataacatgta taggcgaaaa gcggggtctc ggttgtaacg 7201 cgcttaggaa gtcccctcga ggtatggcag atatgctctt gcataggggg aaaaaatgta 7261 gtcttaatat tgtctgtgtg ctgcaggagc taagctgact ctgctggtgg cctcgcgtac 7321 cactgtggcc aggcggtagc tgggacgtgc agccgaccac catggggagc agcaagagca 7381 agcctaagga ccccagccag cgccggcaca gcctggagcc acccgacagc acccaccacg 7441 ggggattccc agcctcgcag acccccgacg agacagcagc ccccgacgca caccgcaacc 7501 ccagccgctc cttcgggacc gtggccaccg agcccaagct cttctggggc ttcaacactt 7561 ctgacaccgt cacgtcgccg cagcgtgccg gggcactggc tggcggcgtc accactttcg 7621 tggctctcta cgactacgag tcctggactg aaacggactt gtccttcaag aaaggagaac 7681 gcctgcagat tgtcaacaac acggaaggtg actggtggct ggctcattcc ctcactacag 7741 gacagacggg ctacatcccc agtaactatg tcgcgccctc agactccatc caggctgaag 7801 agtggtactt tgggaagatc actcgtcggg agtccgagcg gctgctgctt aaccccgaaa 7861 acccccgggg aaccttcttg gtccggaaga gcgagacggc aaagggtgcc tattgcctct 7921 ccgtttctga ctttgacaac gccaaggggc ccaatgtgaa gcactacaag atctacaagc 7981 tgtacagcgg cggcttctac atcacctcac gcacacagtt cggcagccta cagcagctgg 8041 tggcctacta ctccaaacat gctgatggct tgtgccaccg cctggccaac gtctgcccca 8101 cgtccaagcc ccagacccag ggactcgcca aggacgcgtg ggaaatcccc cgggagtcgc 8161 tacggctgga ggcgaagctg gggcagggct gctttggaga ggtctggatg gggacctgga 8221 acgacaccac cagagtggcc ataaagactc tgaagcccgg caccatgtcc ccggaggcct 8281 tcctgcagga agcccaagtg atgaagaagc tccggcatga gaagctggtt cagctgtacg 8341 cagtggtgtc ggaagagccc atctacatcg tcattgagta catgagcaag gggagcctcc 8401 tggatttcct gaagggagag atgggcaagt acctgcggct gccacagctc gtcgatatgg 8461 ctgctcagat tgcatccggc atggcctatg tggagagaat gaactacgtg caccgagacc 8521 tgcgggcggc caacatcctg gtgggggaga acctggtgtg caaggtggct gacttcgggc 8581 tggcacgcct catcgaggac aacgagtaca cagcacggca aggtgccaag ttccccatca 8641 agtggacagc ccccgaggca gccctctatg gccggttcac catcaagtcg gatgtctggt 8701 ccttcggcat cctgctgact gagctgacca ccaagggccg ggtgccatac ccagggatgg 8761 tcaacaggga ggtgctggac caggtggaga ggggctaccg catgccctgc ccgcccgagt 8821 gccccgagtc gctgcatgac ctcatgtgcc agtgctggcg gaaggaccct gaggagcggc 8881 ccacctttaa gtacctgcag gcccagctgc tccctgcttg tgtgttggag gtcgctgagt 8941 aagtacgagg cgtgacctac aattgctcaa ataatgcttc tgtagaaatt gtttagcatt 9001 aggcgtcctg cgttgctccg cgatgtacgg gtcaggtata atgtgcagtt tgactgaggg 9061 gaccatgatg tgtataggcg tcaagcgggg cttcggttgt acgcggatag gaatcccctc 9121 aggacaattc tgcttggaat atgatggcgt cttccctgtt ttgcccttag actattcgag 9181 ttgcctctgt ggattagggc tggaggcagc acggatagtc tgatggccaa ataaggcagg 9241 caagacagct atttgtaact gcgaaatacg cttttgcata gggaggggga aatgtagtct 9301 tatgcaatac tcctgtagtc ttgcaacatg cttatgtaac gatgagttag caatatgcct 9361 tacaaggaaa gaaaaggcac cgtgcatgcc gattggtggt agtaaggtgg tacgatcgtg 9421 ccttattagg aaggtatcag acgggtctaa catggattgg acgaaccact gaattccgca 9481 tcgcagagat attgtattta agtgcctagc tcgatacaat aaacgccatt ttaccattca 9541 ccacattggt gtgcacctgg gttgatggcc ggaccgtcga ttccctaacg attgcgaaca 9601 cctgaatgaa gcagaaggct tcatt //