Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP034962 | Escherichia coli strain WCHEC020032 plasmid p4_020032, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP034961 | Escherichia coli strain WCHEC020032 plasmid p3_020032, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP034963 | Escherichia coli strain WCHEC020032 plasmid pCMY42_020032, complete sequence | 0 crisprs | RT | 0 | 0 | 1 | 0 |
CP034966 | Escherichia coli strain WCHEC020032 chromosome, complete genome | 10 crisprs | RT,csa3,PD-DExK,cas5,cas6e,cas1,cas2,cas3,DEDDh,c2c9_V-U4,DinG | 1 | 24 | 10 | 0 |
CP034959 | Escherichia coli strain WCHEC020032 plasmid p1_020032, complete sequence | 0 crisprs | NA | 0 | 0 | 1 | 0 |
CP034965 | Escherichia coli strain WCHEC020032 plasmid pNDM5_020032, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP034964 | Escherichia coli strain WCHEC020032 plasmid pCTXM3_020032, complete sequence | 0 crisprs | NA | 0 | 0 | 1 | 0 |
CP034960 | Escherichia coli strain WCHEC020032 plasmid p2_020032, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
0 : 90524
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP034959|0:90524|DBSCAN-SWA
Protein sequences of DBSCAN-SWA_1 >CP034959|0:90524|57358_57568_+|QAS87842.1|DBSCAN-SWA MAFIPPTIDDVRHCSNALSVDPAETDAARAIAEHYSKISNQEYRITQDDLDDLTDTIEYLMATNQPDSQ >CP034959|0:90524|73560_73806_-|QAS87862.1|DBSCAN-SWA MAERVDDAELSMNQLEALKDMAIDNIRKQAQVVSQVFTGKCRYCNEPIESGIYCDAECAQWHREEQAAKQRKYGMRPAGFD >CP034959|0:90524|73952_74330_+|QAS87863.1|DBSCAN-SWA MATSITTTQSTRQYPLSRYDDRNIADPILRAELRKEVMLMCESNDKNLTIYYVLPDEQYRPDLLAYRMWGIAELRWVVTLAAGLEDESQGMTVGKKLKLPPATWIREMIRHFQYDGQVIGTLSIA >CP034959|0:90524|80023_81388_-|QAS87871.1|DBSCAN-SWA MSASPLESMPNSLSAEQAVLGGLMLDNCRWDEVADRIVADDFYTSAHREIFSEMERLLSHGKPIDLITLAEALEQNGKLERAGGFAYLAEMSKNTPSAANICAYADIVRERAVVREMISVANEIAEAGYAQDGRGSNELLDMAERRVFEIAEKRQKSGSGPKDIASILDATVSRIEELFQRPHDGVTGLDTGFTDLNKKTAGLQPSDLIIVAARPSMGKTTFAMNLVENAAVRNYKPVLVFSLEMPSHQLMMRSLASLARVDQTRIRTGQLNDEDWARVSGAMGILLDKQNIFIDDSSALTPTELRSRARRVYKENGGLSMIMIDYLQLMRVPELQDNRTLEIAEISRSLKALAKELQVPVVALSQLNRSLEQRADKRPVNSDLRESGAIEQDADLIMFLYRDEVYHPDSEMKGIAEVIIGKQRNGPIGTVRLAFNGQYSRFDNYAGADWQEDY >CP034959|0:90524|12900_13935_-|QAS87801.1|DBSCAN-SWA MKTPLVTRNEIVEAIALHTACMPTREIPGAIANYFMITRRFYTRTDKAVINRLLIAEIRDYLIEQGRLRYATVAAEMRKEAHRMTGNNLNVEKPAPVASATPAPALNVIPNTGDTIDSQTLLKMVNEARKLCSEKPVRNNDFIARVKDELEGETYEIFVGQKNGAEIDIITMTYKQALRVAARESKAVRRSLIDKLEELQQANSPTPSIPQTLPEALRLAAELAEQKMQLEQQLVAAAPKVDFADRVSVANGILIGNFAKVVGLKQNALFSWLRQNGILMAFGARKNVPRQQYINAGYFTVKEVVLDDENGYQIRLTPQLTGKGQQWLTRKLLDAGLLKPVAIG >CP034959|0:90524|89531_89954_+|QAS87880.1|DBSCAN-SWA MSFFSTLKTALSLKEKLAATGVLVLICALVGAGFAWERHQLKQAMEKIGSLDQAIKERDKSIMDLNQTIETMNKAEQHFHSQEVKNESEQAKYADRQMERKAEVQKQLVAAGNVRQRIPADTQRLLRESISEFNADADKG >CP034959|0:90524|82655_82847_+|QAS87882.1|DBSCAN-SWA MNPLQENALTCYVLRFVELTAGDRAAPDWTYFSVMLMPENQTVMVGAELRHRVVASPAKRALL >CP034959|0:90524|2631_3633_+|QAS87790.1|DBSCAN-SWA MSKKNRPTIGRTLNPSILSGFDSSSSSGDRVEQVFKLSTGRQATFIEEVIPPNQVESDTFVDQHNNGRDQASLTPKSLKSIRSTIKHQQFYPAIGVRRATGKIEILDGSRRRASAILENVGLRVLVTDQEINVQEAQNLAKDVQTALQHSIREIGLRLMRMKNDGMSQKDIAAKEGLSQAKVTRALQAASAPEELVALFPVQSELTFSDYKTLCAVGDEMGNKNLEFDQLIQNISPEINDILSIEEMAEDEVKNKILRLITKEASLLTDKGSKDKSVVTELWKFEDKDRFARKRVKGRAFSYEFNRLSKELQEELDRMIGHILRKSLDKKPKP >CP034959|0:90524|16485_17097_+|QAS87806.1|tail|DBSCAN-SWA MGLNVASVKSYVSSALTTTLFGSGVGEREVGKLTSIIMNKMLFAQGWQFSVEVDGLEGADFFAKDITYHDYSIEYETIKIGGGNILQPTERSPGQITMMVRDTVDGLVLDWFKTAKSRVINPDGTGNIPSKYLLNVRIYRLLSSGLTKLENEMTVFPVTTGDVTYARDQVTEFKSFPMTFALHSTFNQSSSSLASLLGFSFSL >CP034959|0:90524|72979_73558_-|QAS87861.1|DBSCAN-SWA MLRFTEEEFQAFSERRNKGRSRPKTKKDPFLSLAPVKEVSPHAKALAALAKNPDLRDGNCEHFEQVFIFDYFERKHPDIYELLHATPNGGKRSKSTAGKMKAEGQKKGYPDMSLDKACGIYHGMRIELKEPNGKAPTKEQIAWMRRLREEGYYVVLAYGAEQAITAILEYISLKKGEAIEHVLNGDKWLYAA >CP034959|0:90524|77062_78079_+|QAS87867.1|tail|DBSCAN-SWA MATKTTTAPETDSKRTQLFLQSVSIGQNEIPREMIVGCTYVEPGELSGPQLMLMVRDSTAYVVNKLGVKFGTILTVSLGDPEGHGGILFSEEFFVLKAPRKDDTVLIYAFSNPVRLLKVPSTSAQYFVDKPPSAVVSSLAPGLKVNADSFRKTSTYHLNVGEKPTKVLQEIARDTGSMCWASRGTINFKSMEKMANAAPSLTYESANPNTSGFTISQFNILNADYEYQRRHNYRMASYDMTKGVVYSGNQEDPIKFTSNPDPTALANYNKFILPRLDMLVEGNAALTPGTTLKIVVHNTAGDGELDESIPDKMIVMSVTHFEDRFRFVSRAQLGVVNG >CP034959|0:90524|7224_8040_+|QAS87793.1|DBSCAN-SWA MSKNFFQSGAFLGNGLSRFALNSDPVQLMESARASAEPPTDPVINNNPEPAAQTNDNVPSAPAPEQILEGKDGKEWTVEQAHQMILEAANRSAMQNALSDAADAVFSWADSGDLTFDSLDGFVQAIAGISDDDDSEVTEEQDDAYNEAWANVADFLAACGVDDDLIEALADDEDDDAAADVGASIAGLDSDDRDELEAAFVVAGTSDEMLTEAFKKVVRNGEIKLIRKRLRKKRLTAAQKSALKKARRKAQTGAAKLARKKSMKLRRKRLG >CP034959|0:90524|31595_32198_+|QAS87818.1|DBSCAN-SWA MTKNKYATVDFDQVNEKGLKSLIAAINKTGVTVIEVDSSNRATTKDGVKVKTAKLVLNDGQILAIQVNDTGDISSVKLNGKAIPNAQSPDIKTLGTVMGQAARKNSAKFQKSLIAKAKRVANPVDKKPAVKSNFQRLQEAKQRNAQVVAAYKSAQNSVSFNQQQITDLRAKLDKETGRLNNEKARNGELKRRLKQLKAGN >CP034959|0:90524|24961_29662_+|QAS87813.1|DBSCAN-SWA MNDVTVVTSVTYPSPESLALVADVQYHEPYLSAALNRKFRGIVDPGFYAGFFPKPGGGMNLLITSVDGDKTAGAASVNIGEFYQVTIQQRKDISLALSAGKKYAIVLKGRYLLGEDSYQVNTASHIHAAEFVARTYTDSYQLGDGELLVCTVNIPAGVSAITKEMIDVSDRIDLAIGIEISDSVTSTRSDVAASSLAVKKAYDLAKSKYTAQDASTTQKGLVQLSSATNSDSETMAATPKAVKSVKELADTKAPIESPSLTGTPTAPTAAQGTNSTQIANTAFVKAAITALINGAPGTLDTLKEIAAAINNDPNFSTTINNALALKAPLASPALTGIPTAPTAAQGTNNTQIATTAYVRAAISALVGSSPEALDTLNELAAALGNDPNFATTMTNALAGKQPLDATLTALAGLATGANKLPYFTGTDTVSQTDLTSVGRDILAKTSTLAVIQYLGLREIGTSGEKIPLLSTANTWSSQQTFKGKTAFSAAATFSAGIAGAIEPEKIGDQTVDLNNLTISSDVGAIKYYYCPTFGGGANITNKPDGVNGNFLLRVESTRKVSASDYANMQTLISNDTKRIYVRFVVNGSWAAWSQVVVSGWGQDVSVKSLSAVALSGSLTGNASTATKLQTARTIGGVSFDGSANIDLPGVNKAGNQSTTGNAATATKLQTARTINGVKFDGSANISIPTITSRGRVTALTDTTQGAATGLQMYEAYNNSYPTAYGNVLHMKGASAAGEGELLIGWSGTSGAHAPVFIRSRRDHTDAAWSAWAQVYTSRDSIPGVNATGNQNTTGNAATATKLQTARTIGGVSFDGTANINLPGVNVAGNQNTSGNAATATKLQTARTINGVSFDGSKNIELTPRSIGTINSITMSFSGGAGWFKLATVTMPQASSVVYISLIGSSGYNVNSPMQAGISELVLRAGNGNPKGLTGALWRRTSVGFTNFAWVNTSGDTYDVYVEIGNFATGVNIQWDYTSNASVTIHTSPSYTANKPTGLTDGTVYVIYSSHIKPTATDVGALPITGGNLNGGLTATGEIISKSANGLRIAYGNYGFFIRNDGSNTYFMLTNSGNSLGTYNNLRPLIINNANGTVTIGNGLNVTGGINGSLNGNAATATKLQTARTIGGVSFDGSANIDLPGVNKAGNQSTTGNAATATKLQTARTIGGVSFDGSANIDLPGVNKTGNQSTTGNAATATKLLTARTINGVSFDGSANISLSPANIGCPASPTGWLKTGNNGESITTAQLVTLLQNNGAFNTKAWFARCAWSYATSASIPDSETGCGIIPLAGAVIEVFSNNTDNYTIRITTATTTSVSGALTNAEFIYVFNVSGSTSYSPGWRRAYNTKNKPTTTDLGLSDESGYVGRLISTRVFTSSGTYIPTPGTKRLRVTITGGGGGGGGCKATSNNETFFGAGGGAGGTIISIMTPTQNSYPVTIGAGGAGGVSATNGTRGGNSVFASLIAPGGAGGGKVGVTNTNGGNGGVPSTGDIRITGGDGGDGQSGNISVSGEGGTSHWGGGGRAGAGGGVIGKAYGSGGGGAYDAGYSGTSMTGGKGASGICIIEEFA >CP034959|0:90524|39471_40029_+|QAS87825.1|DBSCAN-SWA MKGKTAAGGGAICAIAVMITIVMGNGNVRTNQAGLELIGNAEGCRRDPYMCPAGVWTDGIGNTHGVTPGVRKTDQQIAADWEKNILIAERCINQHFRGKDMPDNAFSAMTSAAFNMGCNSLRTYYSKARGMRVETSIHKWAQKGEWVNMCNHLPDFVNSNGVPLRGLKIRREKERQLCLTGLVNE >CP034959|0:90524|38814_39303_-|QAS87824.1|DBSCAN-SWA MAQRGVNKVILIGTLGQDPEIRYIPNGGAVGRLSIATNESWRDKQTGQQKEQTEWHRVVLFGKLAEIASEYLRKGSQVYIEGKLKTRKWTDDAGVERYTTEIIVSQGGTMQMIGARRDDSQSSNGWGQSNQPQNHQQYSGGGKPQSNANNEPPMDFDDDIPF >CP034959|0:90524|65514_67371_-|QAS87852.1|DBSCAN-SWA MSNIKYRKDIDGLRAVAILPVLLFHVGYSGFSGGYVGVDIFFVISGYLITKILINDINNGTYSLLTFYERRIRRIIPALTCVILFVLIASPLFLAPDNYSFLPKEIIGTLLFASNIVSFLKSGYFSTDAEQRPLLHTWSLGIEEQFYIISPIVLFLSFKYFKSRVGLILSLFAIVSFAFSVILTKNHPTASFYLIPTRYWELSFGALAAAGVFKKAKGRRQNEVLSILGLLLILFSIFTFTSKTVFPGYAALLPVLGATLIILNAEDTLVGKMLALKPLVFIGVISYSLYLWHWPLVVFSHDKYIIDLNLSREMLVVLSILIAWFSTRFIEAPFRNKQSYDRTRIFKYSSVAYSLLFLTSLAIWPLKGWTDRLSDEKAYILSSTKDYSPVRDKCHFSSGVPETTQYCILGVKDIEPSLFVWGDSHGAEISYALSKLTSVYTATYSACPPVVGFTSTERPECQAHNMRVLDFILNNKKIKNVVLAANYNKYEGDEKYSGFVKGFENTVKKLTDGGKRVTVLDQIPSPGVNVPNDLANAKFIVNKSFAYDDTTFKKIEFENGVNIFHFEKYLCDRDSCSMMYENYPVLFDDNHLSLTVAKIMAPHIYEMISDNRKENERF >CP034959|0:90524|87957_88242_-|QAS87877.1|DBSCAN-SWA MQMELISRKEFDSRVTSGELDNLQAIKVKEGFCLIGNQSGTNRVFMLRRTDLKPFVWKNEIGPSSYAQTRGCLNLAFFYKNELSVVDIQGLQHV >CP034959|0:90524|88515_88695_+|QAS87878.1|DBSCAN-SWA MLVDGKQYAQHTDGNSTHGGQAISTRAWFTVNGKGIVCVGDPVSCGSTVASGDGLVQVS >CP034959|0:90524|82024_82459_-|QAS87874.1|DBSCAN-SWA MFGKLFGKKVASAKVELKKVENRDLMEAIIGGCLLVSAADGEIEKEETAKLDQLVRSNPRLSHFGNEITATITRFTEQLEAGFRVGRMNILREIEDIKNDPKEAEEVFVNMLTIAEADGEIEPAEHKVLEEVGRRLGLRVEDYL >CP034959|0:90524|57678_58530_+|QAS87843.1|DBSCAN-SWA MINYVYGEQLYQEFVSFRDLFLKKAVARAQHVDAASDGRPVRPVVVLPFKETDSIQAEIDKWTLMARELEQYPDLNIPKTILYPVPNILRGVRKVTTYQTEAVNSVNMTAGRIIHLIDKDIRIQKSAGINEHSAKYIENLEATKELMKQYPEDEKFRMRVHGFSETMLRVHYISSSPNYNDGKSVSYHVPLCGVFICDETLRDGIIINGEFEKAKFSLYDSIEPIICDRWPQAKIYRLADIENVKKQIAITREEKKVKSAASVTRSRKTKKGQPVNSNPESAQ >CP034959|0:90524|10188_10893_-|QAS87798.1|DBSCAN-SWA MLNRRTFNVFCDESCHLLNDHNKVMVLGALWCPGSITKKIARDIKELKLKHNLKPDFEIKWTKVSASKVEFYLDVVDYFFSNPALRFRGVVVPDKEQLDHARFHQDHNTFYYKMFFYVLKNIIESNNTYNIYLDIKDTLGIEKIEKLRGVLHNDRYDYNHESINRIQHIRSHEVQQLQLTDLFIGALGYVHRGMNSNAGKIQVINRIKSHTNRELLKSTLPTESKFNIFVWEAR >CP034959|0:90524|50663_51173_-|QAS87832.1|DBSCAN-SWA MNIYIACALTHVPREIFHEYSNWIHSLAKGLSQNNNVKYALINSDPELSKRPESNKSRLCYIWDRDMVEKSDVIIAECSFPSTGLGIELQIAEQKNIPVIICYKDYGINKTKTIEYVNPDETTHNLQVGEGFISHMVLGLPNILDVILCKDIDNTCRKLKILLDMINHN >CP034959|0:90524|72153_72816_-|QAS87859.1|DBSCAN-SWA MLAWRLNLQLRMNIPPRATRTVFCVGSGPSLTREDCAAIEKTGCSIIAVNNSWQMFDDIYALYAGDLSWWKQYGSTIPGGRFRKVTANLAAAKSFSLEYRRYCGPAEGVNSGAQAISLAAESGAEVVVLVGYDCSLQNGLHWHGAHPQALRNPTQVSISKWQQQFLDTRKKHADLHILNASRSSAIQCFPRINLEAVIALLSSAVAQAPQTLLRRAECRL >CP034959|0:90524|70955_71633_-|QAS87857.1|DBSCAN-SWA MMAPTIYHRIDGTKYRNVWVVCDLHGCYTRLMSELHRVDFDPAQDLLISVGDLIDRGTENVECLELLQMPWFRAVMGNHELLMLDALSPDGNVNNWLMNGGQWFFMLDADQEILARALVELVRRLPYIIELNTGQETIVIAHADYPDNEYQFGKEVPLFNVVWARERISDSMDDIGGEISGADRFIFGHTPVKSPKTFWNQHYIDTGAVFCGNLTLMKVKGDGAA >CP034959|0:90524|14772_15285_+|QAS87803.1|DBSCAN-SWA MSKKYTLCALVVSAILLSGCQSSGADYAADVYDTAQLNSKQETKTVNIISVLPAKVKVDNKANKEAAQTFGAVLGAVAGGVAGYNVKGTSTLGAVAGGTGGAALGAAAGSLVSDKTIVEGVSLTYKEGTKVFTSTQVGKACQFTTGLAVLISTKDNETRIQPNATCPEKK >CP034959|0:90524|30490_30772_+|QAS87815.1|DBSCAN-SWA MFDLTIFFITILGGVHSFLNGVREKRYEASCRQLMAECIAAVLAGFIGMYFAEYKGMDESLQNCVTIICSINNKLILEKSQRIIDSYLNRNAS >CP034959|0:90524|90131_90524_+|QAS87881.1|DBSCAN-SWA MSDKVTVKQTINKATSIYKIEHITVGKPGSEQYRHAFELADQLGLKHPDCIEHVFPTYADEQCTHVLTEEDFFSTEEREGVDRCIGVICSSVSYELFPNVHEDGGIGYQFLYEGDELKCYEHGLLIESIE >CP034959|0:90524|58555_60040_-|QAS87844.1|terminase|DBSCAN-SWA MARSCVTDPRWRELVALYRYDWIAAADVLFGKTPTWQQDEIIESTQQDGSWTSVTSGHGTGKSDMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFPWLSKYFILTETSFFEVTGKGVWTILIKSCRPGNEEALAGEHADHLLYIIDEASGVSDKAFSVITGALTGKDNRILLLSQPTRPSGYFYDSHHRLAIRPGNPDGLFTAIILNSEESPLVDAKFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVKIAKGWGWVACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFAECNPERFPNITIAIDGDGLGKSTADLMYERYGITVQRIRWGKKMHSREDKSLYFDMRAFANIQAAEAVKSGRMRLDKGAATIEEASKIPVGINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLANYVPQDEVLSVEDEAQVDEALAWLNE >CP034959|0:90524|43118_49886_+|QAS87829.1|DBSCAN-SWA MNKLSMGVFRCSSVSEILKYIRAITSHRAPIKYGVEKVEGKSYDRLRREANQKAIDLLNSLVDGATLTDEQRQILAGYTGEGGIGGSVSEYYTPKPIAEGVWEIMKLYGADVGNTLEPSAGTGVFNETKPVGTVMTATEISSVSGRINQLLHPEDSVQISPFEQLAVSTPNDSFDHVVGNVPFGGRDNTRNIDKPYAEETDMGSYFMLRMLDKIKPGGFMCVIVPPSIVSGSNMKRLRLRLSRKAEFLGAHRLPTGTFDANGTSTVVDVVLMRKHPAEMAEKIPLVDESTLESANVLWPTFISGKWFEKDGRRFVHGTQEKGFQGRIEVRADGQIDNQALKAKLIHRFESRIDWSLLDMAEPSPTADVVDEGEMRLINGVWQKYAGGRWIEADAGKELKIEVASYGADSWEALQRNLTTTEGRLGMTFTQMANVRDKYTTSISDDMVQLVDWINSQPEKYRERLYRGAMIGRMLIEYQDMKAAGHSAEQIEQQRLSLVSRLQAEIDRFGNPGRGPIAKLSGSGARAWFAFRGAIKLDGTISDELTGKLVTHDSSASYDSTSYQDTLRYLYSDLTRDPIQLDDFRLAFTGELPASDDELLNLLASTPGIAVSPYGGIVPFARATSGDINEIVAPKQEFLATLPDGPVKNNVLNQLAAIEEKRIKTPAENIRFKLNSRWFDRSVILEFLQENGYPDLRYVQSVQLEGDEMVSDTYHGGDGLFVGHRYGVVQRKDKETGEIRYEWDRKSGENATGFPAQLEKYLNGARIGGKDSATANGYREQMALLEDQFNKWIKTHDRYDELVAKYNDVFNSNIPYEHSGDPLGLKGLSGKRQPFDYQNSEVRRLSEDGRGILGFGTGLGKTTTALALEAFNYENGRSTRTAYVVPKSVLENWYYEAKEFLSEEAFSNYLFVGLDVLMDGDQIRQVPVLDENGKPVLGTDGTPVMRDALKLADEATITARMNAIPHSNYRAVVFTKEQYARIPLRDDTVDEHAQDMLYDFVAAGRVASAMDSDSHRKEAARRRVLSEYSDTGTEKAEKYPYFEDMGFDSVIADEGHNYRNSYKNGREASQLAYLPTSAVAQSARDMAIKNAYLMKKNGGRGPVLLTATPVVNTPIDAYNMLSHVLPKEYWQNMGIYGPDDFVKFFGKTRLETVQKISGEVEEKMALMGFENLDALRGIFHRWVTLKTAEDVKDTVEIPELDEHQQDAPLTEEQLAAYEELRQQAEAAAKANNGVTTSVNEDGVIEHEKARPIFSIIRDMDRVCTDMDLYYRRITYRFLPEYADAVQQLADSLPKQATSEDDDSDDSITQQSQYSLIDKGEFIQLQVPEAFEQEVNKRLARFGIDEQTVTHPVTPKYAKLIATLKEFFPEGKQIIFTDEKTQHQKLKRIICNALNLEPSKVGILNAQTVAEAGKTGKKLKAVKPPKELPDEPTDAQIAKYNEQMALYDAYIAQQNEMSLGGLEKIAADFQEGRTPIIICNKKAEVGINLHRGTTDIHHLTLPWTPASIAQRNGRGARVGSNRASVRVHYYCGKGSFDEYRLKTLKRKAGWISDILRSDKSEMENADANDMIEMQMYTAKDDGERLAMMQVQMDKAKAAKRARQKEQATIDLQNYIKAQHAAGEDVEVLTAELERSKAELEKTTAEVAKFKQAAMAKAADNADWKARWGSVHHTDRMLLAQYRASLKSAIQRKANISQAISRYEKLLNRTQKAATDIKRLRPLVEDAINKGILDVDPDLVNHANEFLVIGDRSWRVGQYYDCAGDIVRIKSLDFDSQRADVEIIFTLRGTKSGNWDVKTLDKQVDVTPDEDAVMQKISGGVSIAGINDIISCDDFYRFQQRGMIKITDSYGVQTTESGYSIDFVGTYTDPLKHAVYPDRRDGALKSSIAKWVLGMMSEGNNRQIRSAETFLVELFGSNYGDVIASYGDTLSPEAIQEKIADAIAKMPEKTSQGATRNGDSELEVTNAIFGTHEFRASDYEITTAQFGTIGIYSNKAEIKQAMDAANARIAAEREANLNHAVAALTQSWVTAIREAATTGKITPAIADVVNAGSKFMDAYQMDAVQLPSAYGQLSYRMTYNLVSMFSDLAILGLVDLNEVTPELLSMRKNHVEILQRINTVLAGRTDEEKQADADRINLALGNITEEEIAARNEKQEELSSIQGDATSIAQSLGLNYRVSTADLKMMYAPKFAAGEVFGLQEASGMKGVLFRAKDAIKAKFGARWLPAKAKNSDFPGNWWIIETKHNVADVLAVIQQYA >CP034959|0:90524|81378_81594_+|QAS87872.1|DBSCAN-SWA MQTLALHPGVLNVRYNRHTQITPNDMSHHQKIRINAPDSSRLSHRIARMVNPFTRETTNGGLHIPATWFVA >CP034959|0:90524|88703_89492_+|QAS87879.1|DBSCAN-SWA MLEKDYQLSAYKKLAAAGGMKTPGAITSARNSANTAKLLAEELTGLILDTIVYPDTITSYVSTIRTTTTGLTNIGELTTKHADLLAGYADLSMLLQLDIGWDVYCRANEREVSELPISIAIGDVNITKSLEDAVNALNTSSLVAAMGEINQTLNTGSGSSSGSGSGGGTATPPPALTEEQIESLKVATEQFGVVFNQTTAPTTALLQQYERANESANVAITAYNHAIGTALAEASANKVSTASAVAALVPDSVLDELNKAAQ >CP034959|0:90524|34115_34481_+|QAS87820.1|DBSCAN-SWA MTLSAIELMDLSDKLDSLMSKAATASGMELLDISDEIDQIMQQMGYGASGGGSGEEKQPSVHDGVPKLVADFLADKFVDQSTDAFIGTLQDLSQYVGTYIDLDQVKQHTAAWIAANIKEAA >CP034959|0:90524|61859_62903_-|QAS87847.1|DBSCAN-SWA MKAVITPFVQKELGLATFKVDQEVRKLVEAGRKFIMEPVPRELIEHMEDGLVVTEQTMATNEALQPFFNSDELFRRIGGIDALVAWLRRKEGQCQAADRSWCDNHIVHAERDNSAVLLCWHHDNHYRMRGFNELKETLHNNRVNWILDVARQEMGLSNSHDLSIQELCWWAFMRNMMHLMPEEVCRISINKMKATPQDSGPLKEADIRPYDDRATAYVQMMEERAAPMRAKVCPVDVDSDPGMAHFKIPKLQSLKLPEYMDFVASRPCCGCGAAGAGAHITPYIVRHSRLCAHDIYAIPLCQSCQRDIERDRDNWEKTHGRLAMHQRLFFDYALGVGVITSHSSSVR >CP034959|0:90524|68266_68461_-|QAS87853.1|DBSCAN-SWA MTPILPYTADDVLELAKIALAPCKDGNAMTLIDFLVKELSKGAGWPDGKDFWVEPPIGRTCIFQ >CP034959|0:90524|75560_76289_+|QAS87865.1|tail|DBSCAN-SWA MAGFFDDMFEDTEPSQQVTGDNLPDTESDPDIPGEGSELIEEEDIDAEIETDGVNVGNIVDPVEDNHLPNLDHGLLSDSGVRHRYQGHAVFNNLVRMDWLKAIKLDPDSFDAVLYRAIPYRNKNAPETAPEIIEPNQRIYDYQDPELITALDCPDEMDAFYALYDGSDNTGISDSALILRLAAVNVPVGSMLEWLEQLSDGTTIRRFWYIHKIFNYGTARVGSLFYCVPSRAFEGNFIGDSE >CP034959|0:90524|41332_43042_-|QAS87828.1|portal|DBSCAN-SWA MADNKITLSSVRKALAGVFKDNGERDNILLSALAVHGGSGYLFSRAGAPVQLSGFLGGKPGDSGMAGDGLVDGSRFIFDEVQLPEDRLQRYPLLEEMAVYSTIATALNIHITHALSFDKKTGQTFSIVPVHNGNDSDYDAAQALCGELMNDIGRTINKEVAGWAFIMSVFGVAYVRPYAKEGIGITSFECSYYTLPSFIKEFEVSGNLAGFSGDYLKDASGKMVFADPWAIIPMKIPYWRPKSNLMPVHTGHKAFSLLDNPEERTPIETQNYGTSLLEYAYEPYMNLRSAIRSLKATRFNASKIDRIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANNMPTVTNTLLPIMGDGKGQMTIDTQTIQADINGIEDILTYMRQLAAALGLDYTLLGWADQMSGGLGEGGFLRTAIQAAMRASWIQQGVEEFIQRAIDIHLAFKYGKVYPEGDRPYKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFSDVLEIDEKISEALVNELKAKSEDDDHLMDSIIKTPPQELAQILESVFKEGNEND >CP034959|0:90524|50356_50605_+|QAS87831.1|DBSCAN-SWA MKKRYYTVKHGTLRALQEFADKHNVEVRREGGSKALRMYRPDGKWRTVVDFKTNSVPQGVRDRAFEEWEQIIIDNALLLNAD >CP034959|0:90524|15908_16475_+|QAS87805.1|DBSCAN-SWA MTPRQLLEDVKSRFTPLIADEPALLESLLRKALGTYQDRAGHIKRIRFTDQTCKSLACPADFLALVSVTDHTGDLVYSDVYDGNIELEDTHRAVYPLNVSYLANLRDMDLDNGEVPPEIIGLLSDYLEVLIAIPNTDRLRRISIAGKLDASNLSDENTLYQRKLDLEEKMSATRAIIPGIVLFSSMLK >CP034959|0:90524|3858_5565_+|QAS87791.1|DBSCAN-SWA MSNLREYQNRIADIAKRSKAVLGWASTAQFGTDNQFIKDDAARAASILEAARKDPIFAGISDNATAQIATAWASALADYAATHKSMPRPEILASCHQTLENCLIESTRNSMDATNKAMLESVAAEMMSVSDGVMRLPLFLAMILPVQLGAATADACTFIPVTRDKSEIYEIFNVAGSSFGSYAIGDVLDMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDVEGQNVPIRKGRTNIYVNRIKSVVDNGSGTLLHTFNNKAGEQITVTCSLNYNVGQIALSFSKAPDKGTEIAIEVEINIEAAPELIPLINHEMKSYTLFPNQFVIAAEHTVQAAYEAQREFGLDLGSLQFRTLKEYLSHEQDMLRLRIMIWRTLATDSFDIALPANQSFDVWATIIRGKFQTVYRGIIERIKSSGAMGMYAGADAASFFKQLPKDFFQPAEDYIQTPYVHYIGTLFGNVKVFEVPEGICTNLTADGIQFSPMDVLCYVRDENPGKAGFVTGDAVPAVPFQHPTTPALVNRTTLWGSAINDMHPRNGADYFTRVTLTMAKNGGINFLTGNMIDAGDSE >CP034959|0:90524|9261_9558_+|QAS87796.1|DBSCAN-SWA MEIDFSYSPETIERRFEIIGCITISDEHYWVLYDANTWLCALAECQPSLCVGEGAFRHKVLATLEVNTLRYWCVEILSDNKELHLLLLNKCASLRRKA >CP034959|0:90524|56226_57243_-|QAS87840.1|transposase|DBSCAN-SWA MFVIWSHGTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMVEVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMRLFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQGTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSGLTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVDWLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRAKVEHPFRIIKRQFGFVKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH >CP034959|0:90524|79721_80024_-|QAS87870.1|DBSCAN-SWA MQRKLTKRNKNWLSDMLKKANRNHMYLNDWLSIKGNLSDAKMIDRHVARYGVSLVLEKAELVFSEYYSIPQISSKGKICGYVLKHKSKLDELLVREKETQ >CP034959|0:90524|63494_63716_-|QAS87850.1|DBSCAN-SWA MQSINFRTARGNLSEVLNNVEAGEEVEITRRGREPAVIVSKATFEAYKKAALDAEFASLFDTLDSTNKELVNR >CP034959|0:90524|78071_78704_+|QAS87868.1|plate|DBSCAN-SWA MGSLTGKYRAVVVSVDDPKGLMRTQIRVVGMMDGLPDASLPWAEAILSNANTFSPFLPGDKVWVEFPYNGDSRWPLIIGYAQDASGGAPNVPPEASGQGEGYVSPEVEGAPAQPSTSAKKDFISSRNGLMEVRTAGGAWAVTHLKSGTTIGFNEAGELYAISQGPAFISSAGNLDIKSGADVALKAGGSMAIEASGNLSIKAAQVSVDKA >CP034959|0:90524|17111_17993_+|QAS87807.1|DBSCAN-SWA MLLPLFPLPSRPTELIQFRQPNIADAMRFNSITPEEQEQQTTAYLKALLAEPAKHDPLTWTAQDRITALWWIFTGSRETPVETFTYTCKHCGKEHYYDCDMNALAEDIQVLEVEPFIDDIEVSVEGVPYQWRIVPLDGWAMEMLEMRRAALPPEDDAEFKEAIVDLRFWEFAYQCELYNDVSGTREDQAERRYETIKRMAIDTEFMKLAAHIRLAHEKLEHGLPCYIDKGEMRLRLPPHKCPNQDKKESTEGAYTRLWVPFRATDFIPQVGIEKLSDLSVQPGFVWGYTDSGR >CP034959|0:90524|9724_10189_-|QAS87797.1|DBSCAN-SWA MLQMPDLLYFNGSWQEYIDDVYDVVREDILISNITFKGLPVRLRYSPEYDGKEFGFWHLVSEGKKEEERIPDLERCKRIRWIAHMIRNYNHCDISCWSERRGPTEEWVIWNECENYVVVLSARRDYWLLKTAYVVTYDSKIRTLKQSRKRALGT >CP034959|0:90524|40320_41340_-|QAS87827.1|head|DBSCAN-SWA MTDVLKTVTDRFCLYSNARKGRQNGRQYVLSAVKAMLESKETQEGLRLGELFGYYGHGRRQLTGKLEVPETSVIMVEGRPVVIDNVPACRTVAISVDDNGIVTHTQEILNTEPGKIVAAMIESRAGGWSWATGGRESGKIAVTTSFHGVDYVTTPNYISLDHPASAGMFESADSKSLLAESLAAHGYSDESVQAVISHYGKMAELEMMVEATERTAELETALLESQGRHLEAMAKIADAEARIALLEETAGIRDDVLAAMQDELDNLPIFVSAAQKDAFRLKEPGDAKIVATLFESLIKVGARNLPVTKKIKEVPQASNVQAPRETSIISFNNSINPFK >CP034959|0:90524|53134_53695_-|QAS87835.1|DBSCAN-SWA MKTIEQKIEQHRKWQKAARERAIARQREKLADPAWRESQYQKMRNTIDRRIAKQKERPPASKTRKSAVKIKSRGLKGRTPTAEERRIANALGALPCIACYMHGVISEEVSLHHISGRTAPGCHKKQLPLCRWHHQHAAPAEVREKYPWLVPVHADGVVGGKKEFTLLNKSEMELLADAYEMANIMH >CP034959|0:90524|63114_63495_-|QAS87849.1|DBSCAN-SWA MRHISPEELIALHDANISRYGGLPGMSDPGRAEAIIGRVQARVAYEEITDLFEVSATYLVATARGHIFNDANKRTALNSALLFLRRNGVQVFDSPELADLTVGAATGEISVSSVADTLRRLYGSAE >CP034959|0:90524|13931_14153_-|QAS87802.1|DBSCAN-SWA MVNANPCARQEFIWRFYSCKKHHYHFVIAATEDEARSQLPDGPCIFTARFSTNSRNSLSYWSLPFSADVQGGL >CP034959|0:90524|62930_63110_-|QAS87848.1|DBSCAN-SWA MARKYNKLSREALKMLLDGVSRRKVKQYLVGKQIGARTAIAVLCRQEMVVLKQRMPGSR >CP034959|0:90524|40164_40341_+|QAS87826.1|holin|DBSCAN-SWA MPDIFEHGREIDAAERNRFRLSTPRGAQIYGTQAKSIIVNENPGTRRGYFWWLFKRID >CP034959|0:90524|63788_64178_-|QAS87851.1|DBSCAN-SWA MGFPSPAADYVESRISLDQQLIRHPSATYFMRAADSHHREGILQGALLVVDSSLTPVDGSLLVCAMEGEYRIKRYRKYPRQHLEDLSTGKKEALPVDDDGYTGSNAVFGVITHVINDARSGEFDDCPVI >CP034959|0:90524|0_861_+|QAS87788.1|DBSCAN-SWA MNQSFISDILYADIESKAKELTVNSNNTVQPVALMRLGVFVPKPSKSKGESKEIDATKAFSQLEIAKAEGYDDIKITGPRLDMDTDFKTWIGVIYAFSKYGLSSNTIQLSFQEFAKACGFPSKRLDAKLRLTIHESLGRLRNKGIAFKRGKDAKGGYQTGLLKVGRFDADLDLIELEADSKLWELFQLDYRVLLQHHALRALPKKEAAQAIYTFIESLPQNPLPLSFARIRERLALQSAVGEQNRIIKKAIEQLKTIGYLDCSIEKKGRESFVIVHSRNPKLKLPE >CP034959|0:90524|32199_34119_+|QAS87819.1|DBSCAN-SWA MEQFNINKGVTIKPGLDVLPPPVTDDEYRALMAGEDRYLMTESNTLEEIEATFFYDTPIHWCATDLLEAISSTRLQLHRTMQAFVRALNQKLNGTGISAGSDKTGDVAQNGARAIGGAEIGRARNVNGLPVLPAIIPLSDGQTISILFHSPTAENRITNSDTLVAFQFLLNKKDVTHTVAPMSGRDMTLAQVTMKLANLAEKNSAKFQRAQKKKKALVDEITQLQADSDQKEDAMSDLADQVAAVEGQKADLEQKINAVASEADSLYQENERLQAEIDQLNRTGGRDTIAPAGMTGGHSRALTDRLASIKNRMHMDGEATLSNGASMKQFIGDGEGYIQLTDPDGSVYMIKAKSIQGVDMADAIGKLFKAYKAGNVSEYLVQPEEHKPENVEPEPAEDTGSSSPEPEVSVGAYRYALQMRPAAPGAIPEGNKAILPRPDEGDPYYEYARYGIATYDTPLSDQQMSEYDLKLLPREDSFDFLAKTLTNGPFGKYAQKALELATSSPDEFRVMLKTQFQKTFPNIAFPGGAGIEKMVQSMINALQAEVGEITQPEPVPAQPDETVSEADAEANKAIEYLNSVMDMQSTDMAEIRNARGNVREAIAALQAAGRFEENEELVNGAARHLADLLVAIQKAGVAA >CP034959|0:90524|37470_37788_+|QAS87822.1|DBSCAN-SWA MQIKIAAPLGGDAIIEFDDNEEVSGRLSIISGDITEDMIAEAIAGANPNSYMGFVNTLDAPASDVLRTLHLYAGWFVDWPAVEGGDEDDDDDDDDFGDHVDQIVY >CP034959|0:90524|57297_57393_+|QAS87841.1|DBSCAN-SWA MGETVGQSFRLCVVNTVRHNYGFHSTNHRRR >CP034959|0:90524|11061_11907_-|QAS87799.1|DBSCAN-SWA MLAKVTFLSCITMSDFTFSGYELACFVTHSGLSRSAGHILSQCANLAATTSEYFIHKPHRLIAAETGYSQSTVVRAFREAVNKGILSVEIVIGDHHERRANLYRFTPSFLAFAQQAKNALIESKLKISSAATKVKAVLAKTLALFNFLSTPPCQNDTPSPCQDDVAIKNKKSQIKKTKRSVSGGAGTTRLKNLTSWIAEAKAKADNLRLSKKRAQKHEFKQKVEAASRKYAFLKNKRSPDIGGISNFDNLPHCMTVNEALNAVLAKNKDNEQWGIPAGFRG >CP034959|0:90524|87059_87965_-|QAS87876.1|DBSCAN-SWA MFKHWKNITIYKLSREADLTDLEDKKKMILFTPCGSQDMAKFGFVSPFGDNSEVIAMHGNGFILVEAKRETKILPPPVIQRAIQEKIEKLEQEQARKLKKTEKDSLKDEVLHSLLPRAFSKFSVIQAIYDGSTKRIYINASARQAEDMLALMRKSLGSLPVVPLSVENPIELTLTDWVRDGSAPQGFQMGDAAELKAVLEDGGIARVKKQDLGSDEISTHLEAGKLVTKLALDWQNRIKFTLDHNFSLTSVKFADELLEQNSDIDSEDVAQRLDADFFLLTSEISCLVDALVNALGGEAKQ >CP034959|0:90524|11936_12737_-|QAS87800.1|DBSCAN-SWA MQQTFNADMNISNLHQNVDPSTTLPVICGVEITTDCAGRYNLNALHRASGLGAHKAPAQWLRTLSAKQLIEELEKETMQNCIVSFEGRGGGTFAHELLAVEYAGWISPAFRLKVNQTFIDYRAGRLQPAIPQSLPEALRLAADLAEQKQRLEQKMLMDAPKVEFAERVATASGVLIGNYAKVLGLGQNYLFTWLRDNGILIATGERRNVPKQEYISRGYFTLKETVIDTSNGSRISFTTRITGKGQQWLMKRLLDAGVLVPVAATR >CP034959|0:90524|55966_56080_+|QAS87839.1|DBSCAN-SWA MCLQRCRAAQKEMSKQGNLIQRHYSYALLILHTAQCC >CP034959|0:90524|51172_52213_-|QAS87833.1|DBSCAN-SWA MMNIKPLKIRNKVMKPHKALLNPDLTARNVLTYSHWDFVELWLKRNEKDEALFYWEQAKVFNQAANGLPNQSAPLLHYYSFMNAVKALLSSRNINFKQHHGVARGENTAQIDNLSDIKVKIKNEGILPSFSKYIDGTSHEGVYNIKNLFSGLPYIHRTYCLTYDVTDDIFIPLIDAEFVLNESDNSLFFTAILSKDFRFDTVFDILPPAFELFEREKYKIKSVESLLDFNENQSSNMPQLTEFHQKIRRDLYYINGAETLWYLKRKNNQSEHTSVSISPLVITLAAMHRLSELCRYDPLKLRQLLNEKENWIIAEFIQQSPSQFIDAISSEITGHQFLIPNVRSAS >CP034959|0:90524|1418_2615_+|QAS87789.1|DBSCAN-SWA MSDSSQLHKVAQRANRMLNVLTEQVQLQKDELHANEFYQVYAKAALAKLPLLTRANVDYAVSEMEEKGYVFDKRPAGSSMKYAMSIQNIIDIYEHRGVPKYRDRYSEAYVIFISNLKGGVSKTVSTVSLAHAMRAHPHLLMEDLRILVIDLDPQSSATMFLSHKHSIGIVNATSAQAMLQNVSREELLEEFIVPSVVPGVDVMPASIDDAFIASDWRELCNEHLPGQNIHAVLKENVIDKLKSDYDFILVDSGPHLDAFLKNALASANILFTPLPPATVDFHSSLKYVARLPELVKLISDEGCECQLATNIGFMSKLSNKADHKYCHSLAKEVFGGDMLDVVLPRLDGFERCGESFDTVISANPATYVGSADALKNARIAAEDFAKAVFDRIEFIRSN >CP034959|0:90524|8668_9178_+|QAS87795.1|plate|DBSCAN-SWA MGHNNTKGNRKFIKGRYTANAAKGERLVSSEFQLTFAGHEDISVLVRTSQIPEMTREDVEDYGPNGVKFNQHGPIRNSGEIQVQCVETIEGDILQFIKDRIAAKDYVDITMAATPESKSSGVNAVTKAATTIEMLDCKIYSDAIDFSTEDVTAAVRPSLRIVYNWIEWD >CP034959|0:90524|24515_24950_+|QAS87812.1|tail|DBSCAN-SWA MSDVSTNLYKSQLLDYYYQRRAESSINKGSRFLISKAVFGTSSLVTKKGDGTYEIGELPKAFDLAELTSKFCTINLVPTYSGGIITVRMDLDQSQLQKGKNYPFNTLVVLDNENKPIAIICVQEDSLYVGKTYTAVMAINTTTA >CP034959|0:90524|18074_21815_+|QAS87808.1|DBSCAN-SWA MERKNANIDDIIRTVETASAKELEELAGIREAVEDLKGERVATVDPVSRSVSALNRTIENSRPDFVTNAPSVDPIVDAIKRLNLGDVSRIREDKVTNREQQAAPTAHNPPNRRREAITEDVKAQRLETVKLARDLKGERVATVDPVSRSVSALNRTIENSRPDFVANVPSVDPIVDAMKRLNLGDVSRVVQEGIAQQEQQAKSTTPKGKKRRRKAIPEDIKAQRTEAAEHAREMFDQKGGAQKSQNQRDARGRFIGKLGSKAATEDARAERAEKARRKEDDERLNAESGLLKKLSKVAEGIGNPSETRAVDALGYAVAGPLWAAGKELGGISKEVGGSLNGARKSIADVIRGNDDNSRRKGFFRRKSQNSADVVQVNTQKRTVQELQEQTSEIKEGNDKILSALDQIAKNTGKKKGGLLSKLFSLLGKGAGGVASLLMGRGMLKKAGALAFGALGAKKLVGMLRGGGKKTIAHEGGDLAARAAGKLGLKAVGKGALRAIPLVGTVAGGIYDAVTGWNDTEAQRRAFGLKSGQDPSFQQKAAYTLANVLDMGGLVSGISSAIGEVLKSLGFEDIGNMLQSFSTESIAQAIDSGITNLETYISNLGDTISTKFEDYTAKIGDAVSAWFSDTSNKLLEKLDAIKDFFTVDNLKQVFSDAIDSAIDFIKNPGKHIKEAAGNIWDGVKNLPGKALDAAVDAVKNTPAAMIVSKIPNPIGEANAKEITPELKAPVNSEANAKEIAPELKAPVNSHQETSDSKTESDAKQTNIATRVINAALDTAKDSNKTVKQTANQIINANAVETGNSALQKIDKAIGQNSSSSSSLNTTGTRNDIQKAADTYNNGNLYVKVGSLGAEGKANLDKLAPYFAELENKYGLPEGTLYAIAATESGGNPYAKSQTGALGMFQFTGIAREETGLAEGESFDPVKSAEAAALLMSKYLKQANGDLNEAITAYNAGFGTINKWKKGTGDLSKENREYAIKVNTHRARYLGGEIYTPGAGAQGGAQYGVRGPLPDNAVIDQSTGLAFTPGDSPFEKGGLVDKIGNAVGVNDLVNKFMNGRGMRREVVQGTLEERARGKGTATAAGNVYVDTPMPVEEVRPVASNSSYFDQLGAQMGIDGLFDKLRNSPGMRKNNATEPASTSKVTTAANDLQQPTGRMQIDGQVISDLGGSGAKPTMQLADNTVSLDGETKRLFAQMTSLLARIEEHTKDSAKGQGTVVKVSTPQPGVMRTVPLSIDDPLMNDYARVD >CP034959|0:90524|15288_15828_+|QAS87804.1|DBSCAN-SWA MNKIILFLIFSTFSVGTALANSLQSQIAAIAQAENEGRAKEQQAEDARKELIRQQAQAERIRREKAASAAAAREKQRVAAENERRAKREAELANDKKRDQAYEDELRKLQLESMKLELQAKAARVQRENDFIEQELKERAAKTDVIQSEADANRNISTGSKDLLQSEGKAREKKASSWW >CP034959|0:90524|76275_77061_+|QAS87866.1|plate|DBSCAN-SWA MILNNQGWLLAIFKKKGLTPTGKLEFATIDGIDSALAQALNEAFDSQVVSFNDRINQSFREFLKRTPRDRITLGTFSDVKEWLSSFEADRAGRKDTASAGPVNKLAMPLVNLSRSPAFSIYEGELCRDNYDEGHVTNENDEIEALVSTIPFSLEYSLWIASDEKESLGMVTTALAFWLRMYASLGQASFTHTANVGGYEIPVTCYIEGQKSIAFQDLTTGTTDNRLFAVGLNLTVVAELPILAYMQQTTGTITVKAKILEE >CP034959|0:90524|5625_7215_+|QAS87792.1|DBSCAN-SWA MSQYSIQQSLGNASSVAVSPINADATLSTGVALNSSLWAGIGVFARGKPFTVLAVTESNYEDVLGEPLKPSSGSQFEPIRHVYEAIQQTSGYVVRAVPDDAKFPIIMFDESGEPAYSALPYGSEIELDSGEAFAIYVDDGDPCISPTRELTIETATADSASNERFILKLTQTTSLGVVTTLETHTVSLAEEAKDDMGRLCYLPTALEARSKYLRAVVNEDLISTAKVTNKKSLAFTGGTNGDQSKISTAAYLRAVKVLNNAPYMYTAVLGLGCYDNAAITALGKICADRLIDGFFDVKPTLRYAEALTAVEGTGLLGTDYVSCSVYHYPFSCKDKWTQSRVVFGLSGAAYAAKARGVKKNSDVGGWHYSPAGEERAVIARASIQPLYPEDTPDEEAMVKGRLNKVSVGTSGQMIIDDALTCCTQDNYLHFQHVPSLMNAISRFFVQLARQMKHSPDGITAAGLTKGMTKLLDRFVASGALVAPRDPDADGTEPYVLKVTQAEFDKWEVVWACCPTGVARRIQGVPLLIK >CP034959|0:90524|71629_72256_-|QAS87858.1|DBSCAN-SWA MFPKNKFRGSDRVIIVGSGPSAANFVAPRGVPIIAVNGAIDWLNRASYFFTLDPSPDNMRRVGRGRRRRGVCYCMALPDVKEREVRDGILCFRRVAERGTEPKNTNSPEWWAWRWSAHFGLCEDENEIASGNSAYGALNLAFHIGFKHVALVGVDATQELRVHSGGTPKNLSHLPLLFQSAREQIDVVSCGKMGGIPQMTLKEWLKNT >CP034959|0:90524|37817_38612_-|QAS87823.1|DBSCAN-SWA MTAQNTKTIQYRLRNGQSVEVTINNDGVPGEKVSISDLAIEKTIMCHLGFTEEVSKKHGVAIWRTMDTGMRRFITARTPGMTMMDLIQIAPLFECEPLDVFSNPAICQQLYGEMKLAVTPIVLHEGSLAGVWKVERISSYMPFHIHVNGVITGENQPVSVTKSDLNRAILEASCRVIGLGKQSYVSFPAGPEGPAEILIMDADLLWQIQFLIGKSIIRAEELDQYITCTMTDEVKSVAIANARNLCRAALTELQENTTEEVESD >CP034959|0:90524|21814_22171_+|QAS87809.1|DBSCAN-SWA MANNNEIDPLLTLELSGVKTYESQEEAWGARLYEWLNTYQGEVYGDPSWGNVLPQFKHEPTNLSHVQIAVEAMLLQKLTVDLPDIPISGLSVAEGDAFDKLKISIRIRDITITQDVVL >CP034959|0:90524|53941_54253_-|QAS87836.1|DBSCAN-SWA MRKNFNIDGKYVVLSVSTNIQSPAVIVTVKLSDRMPDIDSISVAFPVRSMRSAEHFVMNATEEEARRGFAKVMSEFGEFLGHVDKALSISSARSKALTASMMK >CP034959|0:90524|29661_30072_+|QAS87814.1|tail|DBSCAN-SWA MNTSYAVIENGMVVNVIVWDGEAEFTVPDNQQLINISDISEQPGIGWVYSDGGFTAPPTPERSHDELVADAEQKKKSLIDTAMASISLIQLKLQAGRNLTQAETTRLNAVLDYIDAVTATDTSTAPDVNWPAFPEA >CP034959|0:90524|34493_37481_+|QAS87821.1|DBSCAN-SWA MSLSDQVVMATSIETLIELLKNLPNYGRVSYVVTAKGDEVKTAFDIVDASALLVSNTLDGKINPDYPQELQPRDRTRASNLLQVNQISKDLRPAQLTDSGLSSHGAPIIGEDNAVESGNGRTMGIIKAYQDGNADRYREYLIDHATEFGIRPEKVESMAAPVLVRRRLTKVDRVQFAKDSNISDLQEMAASEKAFVDADSITPAMMALFNPSESGDLLSRSNDAFIRGFMTQVGATQAAGLVTEDGRPTRQLVDRIQNAIFAKAYKDARLVRMVAEEPDPDMRNVLTALNAAANDFVQMQALSGEAHKQAVTTIVDGIETADSLDKKALAALKDAVDLVRQSKESGQHITDVIAQGDMFSETAPEVKALALFIVANNRSAKRMATAFKLMAQRINDELQHQGQALGDMFGGGDVSLQDILRQVSQELENEGMQGISGGLFESVSGGSYNGVAPYTSLLLHRASGIKDIIHLIRLLSRTDPQDEQLVQVLAHFVRMPVADVKKWCRLFGISNSLLRGLLNHASSLGRDGFDEIAQAIKNGDMPPAIDWFSIRPTRVKAFLSAAHTASPLAEMVQRLSLIFTDHTALGDLTLDEMKEASIQWADQQNEVNSDFLPAFRKAVSKADDARGILKAFKALQSRVNKHVGDIDGVTAEGRDILKEHGITPEFIDEIRTDMQREVVSSLQIVARALADANPKSAAIVNQVIGDIEASEGMGALKLFLSRAFNPNGNILPGIIGEAKKYVSEEELEHLDQLLKRFSYNPQTRWQMNQRSMGSVHEKVLSAMNSAIANSSVSEEKALEWADSFITEEVEEARAGQNGGINLRKELADIYRLTGGKISTLSKVVHHQGRAYANLNGVVAVNLNDENARALWHELGHHLEYSNPGLLEKARSFLKANVEGGKLSFVNIGGRGKPEWCFRSRLSNIYMAKVYPPVSVSNSGKIRQKSPTISKTSATEVFSMALQLYHDKEAAAASLMNGDGLLELLLGVAKELNNAD >CP034959|0:90524|30839_31169_+|QAS87816.1|holin|DBSCAN-SWA MLDTQELAPVAIAFLLSVIGGIGTFLMDVRDGRQSGNLLGLVTEIFVAVTAGAVAYLLGQHEGWELSITYLMVTIASNNGHEVISGMKRVNIDSILNVLTSLVKKGGGK >CP034959|0:90524|81860_82025_-|QAS87873.1|DBSCAN-SWA MASKARIAIAIGFLLLSVLVDFTSTILSVLSDGALVAVAVTLVWPILKTASKDQ >CP034959|0:90524|72757_72913_-|QAS87860.1|DBSCAN-SWA MSSKVNYESLASVMPRNEQETDAVVDPVIAEMNARLEAEFAAENEHTTQGD >CP034959|0:90524|54303_55335_-|QAS87837.1|DBSCAN-SWA MSNLLTVHQNLPALPVDATSDEVRKNLMDMFRDRQAFSEHTWKMLLSVCRSWAAWCKLNNRKWFPAEPEDVRDYLLYLQARGLAVKTIQQHLGQLNMLHRRSGLPRPSDSNAVSLVMRRIRKENVDAGERAKQALAFERTDFDQVRSLIENSDRCQDIRNLAFLGIAYNTLLRIAEIARIRVKDISRTDGGRMLIHIGRTKTLVSTAGVEKALSLGVTKLVERWISVSGVADDPNNYLFCRVRKNGVASPSSTSQLSTRALEGIFEATHRLIYGAKDDSGQRYLAWSGHSARVGAARDMARAGVSIPEIMQAGGWTNVNIVMNYIRNLDSETGAMVRLLEDGD >CP034959|0:90524|78750_79725_-|QAS87869.1|DBSCAN-SWA MNILIIGRKFEAISDVKTYTEMWAYNLACAFSEAGVTLQYHRPYSPGVESPEDYVEAVLTAALSCSAKAILAPGLRYFTTVPREIGVQLRRRFTGWVAQVYDGSMLDSAPVDITFTVRDDTWRYLDNPGRLERHNRFNKHVGWAANQELFHLETKTDDVLRIFVDHAAFDVSGFDHSLSILMNLQRLTVPYEAKTLTDDGLVTIEPGNISVTPYRRTPVPATEFAAELRKSDIFIVTHPESLGLTVLEAAMCGALILTPPDCLPPDRLELVNHMVIKSRIDWDEVIARVDRVKNAEKVQCHTWSAIAEKMLETFVTQKPSRGNG >CP034959|0:90524|8075_8657_+|QAS87794.1|DBSCAN-SWA MAPIPYGVYSQADGVSPFLKVTLTNSQYQVTGYISQGAAMNMAQNWEAPFTGMSMGSVAGAFSGFAQVGTETTSVARWNSLMVWEGGTPPTFTLPVTFIALFDPFTEVSGAIAALSAMISPELKDASIGGRIPERVTLNIGRRINIIDVAIQDISFDLDAPRDSNGHFLKNTVNLQLTGSSIYNSSDIVRAFQ >CP034959|0:90524|68693_69956_-|QAS87854.1|DBSCAN-SWA MSTSAQKQSIENVSIPDVLNAGIPAIIQNIRAAQRRVSCDDLTARFFDNAVQSAEMLHAQLIDVYNAEADSHNSLVDAAENMQLDLGLKGKEIEELQLEIEHLKRQQQDAIDDATHDANQRADNAERISIELETKLNEMTAMVELRNSQISTLKSQYKEIMKLDPFNLEKRYNKAKSERQELRKQVADLNQQLKKTIKDASEARVAFANKKAEVTALVNENAKFATLKKEMYGITERRFPASKLHPTLGQISFFPRLLAYGISSPKEFNNERPYIVSKLDFAYQFCCDMGYAIDIRINEWLMPNFQPLAIFREFQPEGWVEFFHELICKEMESRRPELVRRVEWAQEVMLADAELPFEPEFIDDLATKGLHTLFDVVTRRHEQLVVELGLEETAARRLLDVCYARSDAWEKENGGTIYVR >CP034959|0:90524|31165_31609_+|QAS87817.1|lysis|DBSCAN-SWA MIGWGVCVLALALADRCLLKRKDITHLELGDVEIKPGFIRVPFKYRSKFPFLRGATVRYWIRDVQKPTTVIEGEQRCLTSAEQGENSEWLYIPTEYMGIGDRLWHFNVMVTHGDSFINPLYRIFPVTQQIRRSYVINLAQDVSDDEK >CP034959|0:90524|74339_75557_+|QAS87864.1|tail|DBSCAN-SWA MPTEYARDNLGRYQTDGLSAKDFNKVFDLIRKQQRQNRRNARRTLTPRIMGMRNRELEAFLSLGKKKDGTYFTPEDIRSFNTSRQAHKTKFKSTVPGITYAQLVAQSTSIDIKRANNKVSDGTGIKAATFLGLKHNLALISVNASDESVHQHHRVRIRFEEWDKAVEDIAEDGAKKARIAADLCKGRVSFDCDCGRHQYWYRYMATAGNYAVAPPKEYAFPKIRNPDLTGVACKHVLHAMTRFQSPTWHKAIIIALEKAAEQVAFGDDKRKTTTYFKGELAKSLARNRTTTTDQAKAAREYELYLKSQDALGKKLRAKDSATDNVRRLLKKARTTANRKNAELKASRVREAQARAEADALKKALQTQANNLIKFFMSQGIDKAAATAQARSILETQINEARKRKG >CP034959|0:90524|52303_52945_-|QAS87834.1|DBSCAN-SWA MDKKICVVSMSVGKPASMTAAWINNELIMAERTSYPERRRDMELQLLRELQEKEEKGFIVLVEEENSFITGRVGQRVRLRDPFMNGRPVLIEAMQIYKELERQKAIKLPRKESGKYILHQSIFDSEHDKKGDEFFNINWSEITTEHVLSLLCCFATEYNNVASADYIRAMAGEVEARQEPSLLSPLINIIRGTQCLADKRVPQGLLTGKENYL >CP034959|0:90524|23600_24437_+|QAS87811.1|tail|DBSCAN-SWA MQRSWFNNRLTSAKQKSLLYKSLADLVQSMMDTFVDPWLERITNRKSIFSMSKEDLETRTNELGQFFTIRTSNSSSVPMLLQQRLDEIHFKGTERPINQTIYREFNGISVLWDPIYAPVDLERHPYGTVLIPESTLETTGGTFGEMFLTSRGMISIPINDLARTMGITGTIDQSAITEEILRKFNQFVKPLLPLHIVFDGLTLYLSVVVNEQADMITLNEISDTEKAYCWFETSDTTSLTGVTSISAPITATPGGTIVKATPTFDRTRADDLLLDSDA >CP034959|0:90524|69957_70176_-|QAS87855.1|DBSCAN-SWA MWPFRRKYHYWLIAFVTPTGGIRHVITRYRNKRLTLARILQAAIGEGLDTNCVVLPPSYLGKMTEAQANTEL >CP034959|0:90524|84021_87063_-|QAS87875.1|DBSCAN-SWA MKELCYGSVCSGIEAASIAWEPLGMRPVWFAEIESFPSAVLALRWPHVANLGDMTKLAKKVLAGEIESPDVLVGGTPCFTAGHMVLCKNGYKPIEDVCPGDYVVSHLGRLQQVKRVGSKIANTGLLNAVGQPLDIRTTNDHPFLAVRWKAQNTRKNGTYFKRELLSEPEWRAACDMPGYQWCALTNFNIASPDICSRFLSEEQAMYLAGAYVGDGYIRRWRGKSKKAVVFGINCQKLRKFHCRIPENIFSVASEIRGSIKVTLNDTCYANWLNEHFGELSHAKRIPAWVMSHPLRHVFLQGYLDTDGTPSGKAGFRINSVSPALAWGVAGLSQTCGYVSSVSFIEVEPKKVIEDRVVNQRNYYQVTICPQKLSRKSRLAHGMLLRTVKEFKSVGLDTVYNIEVEGDHSYILNGAVVHNCQAFSIAGLRGGLDDERGALTLKYVELANAIDDKRAESFLKPAVIVWENVPGVLSSADNAFGCFLAGLAGEDAPFEPGDRPESGKSNAFWRWDGKTGCHAPKWPQCGCIYGPQRKVAWRILDAQYFGVAQRRRRVFVVASARTDLDPATVLFEFEGVRRNIAPSRGEGKETTRYTSNIAIRSCDDTNIVAMAHGQGGAEIKTDNSAPTLTCNHEAPIVLLGDGRIRRLTPVECERLQGFPDGHTLIPTEKRKKVNSDELAYLHNHYPDLSEEEAAMLAADGPRYKAIGNSMAIPVMRWIGERITKAACRQNEGRETKERKVKPAAEFERSIFKWAGGKFGVLEQIFRYLPEGKRLIEPFVGGGAVFMNAGYQENLLNDVNADLINFYKTLQREAHSLITLAHRFFLDYNTQEGFLAVRNAFNKQVYDDLHRAAAFLFLNRHCFNGLTRYNQAGEFNVGYGKYKTPYFPLQEMEAFLGAEGRSEFVCGDFAAVIEGAGEGDVIFCDPPYEPLPNTEGFTNYSGHDFKFEEQKRLVSLLTDAHRRGAKVLITNSGAPNIRELYQDSGFRVEPLFARRSVSCKGDTRGVAHDVIAILL >CP034959|0:90524|55342_55564_-|QAS87838.1|DBSCAN-SWA MGYSAAKVSTHLELEKNRGYWRAKGFDRDSCQLSLSRGEEKIERTRGRWRFYDENHKQVKAEPILYTLLKTII >CP034959|0:90524|22167_23601_+|QAS87810.1|DBSCAN-SWA MSKTTPTKDSIRAEFEELVEKDSFWSKFVGSQFVSMLTLFITQIVYRCFQYADAALAEGFISTATRRSSILAAAETNSYVGTKPTPSSGMIEITATSEDAPAVIPKNMPLISDDQYPYMTMDVCRLVDGTGTVEVAQLEIQEVTYTVTAAKEFLEVVLSKALTAVCYKLEVFVTTDGKTTQWSSSTMFRLAGSKSQVYVEFYKPSEQLGVRFGDGLIGQIPPEGSTITLKVWCTNGDITLVAGQNLTPVDSAANLANLISVKTTTPITAGTDAETTEITRNRAQYYLAYDDQVVWGGDYTYFLVRNIPGLSWVKAWGEGQQEKLDGAYNVQNINKIFISGWHPNKSQSELEEMILTAFKKVPNELNKKFSYKEVRKLPFKITITGRISASLTIENVTDELKSALETKFGRDSNFFDPNGVGKYILIKKKDVWAFIETLGYFRDFYLEFVEWNESNGFYDFVYLDTENSTFNISYEEE >CP034959|0:90524|61318_61771_-|QAS87846.1|DBSCAN-SWA MLLNWQGRHFMEINHSRITSYEIADYMIRTKSLLSAKELAAILEKEYPHLDVDKRDVYLRLKAIAVSKYSSVLIDDSTRPRRFQIHSLNPEFFRRSRAPRRFDEKLQNELYMTQDEKERREHQPWVMARQLFNKVARQHRHYGNATSARI >CP034959|0:90524|70257_70959_-|QAS87856.1|DBSCAN-SWA MKIALVLRSGGDYNASDVQWLVNQLPKDYEIICLTDLKCLHVPGVKVIPLINQWQKCRGWWAKIELFRPDITDDLFYLDLDTVIAGDIRLILENPPTSFTMLRDFYHPQYRGSGALWIPNSVKAHIWSSFWQDPEGWISRCVTTECWGDQGFLRKVMGDDTPAFQDLYPGWFVSYKADVVEPGSKYASARYSRGNGALPKDCRIIFFHGKPRPREVSEDWLPLISSFFERESE >CP034959|0:90524|49919_50360_+|QAS87830.1|DBSCAN-SWA MATLSDTIKPNKTYLEAVLRTALLGKTEDEYVDFFLSGLRGRLLKNPRLYRSYGPYWPEIKKLLLERGYGNFGRLVDRDVRKIYRYDRPALTLIAATLYSQERFDNGQIYSAWHLLPVPEEVDDQDYEFESYDLEVEALAQAGEKT >CP034959|0:90524|60039_61233_-|QAS87845.1|terminase|DBSCAN-SWA MTWDDHKKNFARLARDGGYTIAQYAAEFNLNPNTARRYLRAFKEDTRTTDSRKPNKPVRKPLKSMIIDHSNDQHAGDHISAEIAEKQRVNAVVSAAVENAKRQNKRINDRSDDHDVITRAHRTLRDRLERDTLDDDGERFEFEAGDYLIDNVEARKAARAMLRRSGADVLETTLLEKSLSHLLMLENARDTCIRLVQEMRDQQKDDDEGTPPEYRIASMLNSCSAQISSLINTIYSIRNNYRKESREAEKHALSMGQAGIVKLAYERKRENNWSVLEAAEFIEAHGGKVPPLMLEQIKADLRAPKTNTDDEENQTASGAPSLEDLDKIARERAASRRADAALWIEHRREEIADIVDTGGYGDVDAEGISNEAWLEQDLDEDEEEDEEVTRKLYGDDD |
95 | Escherichia_phage(62.22%) | plate,holin,terminase,transposase,head,tail,lysis,portal | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
5858 : 13593
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP034963|5858:13593|DBSCAN-SWA CTCACAGAATAACCTCTTTCACTAACGCTTCGATTTCTTGCTGGGCTTTATCATTTTGCGTTTCAAAAATTGTCATTGATTCGGCCCAAGCATCTCGATGTGCTTTACGATCACATACCCGGGAGTTTGCCAGATGCATTTCAGGGTAATCCTTAAGTACTTCAGCTGCCTCATTGGCTTCATTAACAAACATATTTGTGGGCGTCATGTTTAAAACGAGATATCCCCGAACATCCTCATTAAAATCTTTTGCTGCTGTAAAAACGGAGCATGTATGAGGCACAACATCCAGGTCCATCTGAGATGGGCGCAAAGGAGAAATGAATACATCTGCGGCCATCAATCCACTACGCATTTCTGCGCTATCACGGCCAGCACAGTCTGCAATAACAAAATCATAGGATTTTTCGTGTTCCTTTAACATGGCTTTGATGTTACCTGTCGCGCCTGTAACCGGAATATGAGGGAGATCTTCAGGGCGATTGTTGTACCAGGTTAATACTGATTTCTGGTCATCTGCGTCGACAATAATCACTCTATTGCCTAGTGACATCAGATACGCAGCTATAGATACCGCCAGTGTACTTTTACCTACCCCACCCTTTTGACTACCTAATACTATTATCATTATAGCCTCATATTAAGCTGTTAAGTTAACAACTTAATATTGCCATCTGAACGCGAACCATGCAATTATTTTTGTTATGTTAACAACTTAACAAGCTGTTAAGTATTTGATTTATAGATTAATTAAGATAACAAGATAACAAGATAACAAGATAACAATATTATGGGTGGATTACATATTCATAACTGGGCTACTGGCAAATCTGTCCATCTGGTGGTATACGCAGGAGAAAGCATTTCCCGTTTCATTTGCCACTCAGGGGCGGTTCCTCGCCCCGCAAACCACACTGTCCCTTTTCCCGACTGGTTCAGTTCATCGAGTGTTTTCATTAACTTTTCACTGTTTTTACGGGGCTGGATTTCATCAAATAATCCCGGCTGTGCTATACCCGATGGTGTGAAATCAGCCAGCATGACTCCTGCCTTCATATAGCGGTACCCTTCACGCCAGACATGGTTTAAGGCTCTGCATGCGGCGGCAATAATGTCCCGGCTGTCCTGTGTGGGCAATGGAAGCTTTTCCACAGCGGCATTGCTGTAACAGGGTTCTTTTACTGCAAAGGGGGATGTCCGTACAAATGTCGTCACCTGCCGGCAATACTGACGCTCCCCACGTAGTTTCTCTGCGGCCCGCTCTGCATACTGAACAACAGCCTGGTGCATGGCATCTTTGTCTGTGATTCGTTCACCAAAACTGCGACTACAGACAATCTGCTGTTTTGCCGGTGGTGCTTCTTCCAGGGATATGCAGGACTCGCCGTTGAGTTCGCGTACCGTACGCTCAAGAATGACGCTGAAGTTTTTCCGGATGAATGCCGTGTTAGCCTGCGCCAGCTGCAGTGCTGTGTTAATACCCAGCGCATTCAGCTTTTCCGTCAGTCTGCGTCCTACTCCCCAGACCTCACCAACTGGCTGCAGCCCCAGTAGCTTCAAGGTCCGATTACGGTTTTCTGCCGTCAGCGCGACCACTCCGGAAAACTGTGGCCATTGCTTTGTTGCCCACTGTGCACTTTTAGCCAGCGTTTTTGTAGGCGCAATGCCCACCCCCATGGTGAGTCCTGTCCAGCTCTTTACCTGTTCCCTGAGCTGATGACCAAAAAAATCCGGAGAGATGCAATGATTTATCCCCCGCAAATCAATAAACATTTCATCAATTGAGTAGGGCTCAACTGCGGGAGAAAGCGACTCCAGAACAGCCATGACCCGTTGGCTCATGCTGTGGTACAGCGCATAATTGCTGGAAAATACATGTATTTTCTTCTCCAGGCGCATCTGTCTCACCTGAAACCAGGGCTGCCCCATTTTGATGCCAAGGGCTTTTGCCTCCGGGCTGCGCGCGATCACACAGCCATCGTTATTGCTGAGTACGATGACCGGTTCGTTGCGAAGGTCCGGGCGGAAAACTTTTTCACATGAGGCGTAGAAACTGTTGATATCAGCCAGTGCAAACATCAGCGTAACTCCCGCGTCCTGTGTATCACGTGAGTGACAACACCAAAAATACAGATGTTTTCCGGATACAGTGTGCGGAACTCCGGGCTGTCTGAAACCGGCTCCAGTGCCGGGCGTGGGCGCAACAGCAGTCGTTTGACGGTGAACTCACCGTCGATCTCAGCGATAACGATGTCCCCGTGTTGTGGTTTTTCGGCCCTGTCCACTACCAGCAGATCACCATTCTGCACGCCAGCCTGGTTCATCGATTCACCGCTGGCGCGCAGAAAGAAGGTGGCTGCAGGTCTGCTGATGCAATAGCTGTTCAGATCCAGTTCCTGCTCAGCATAATCAGTGGCAGGCGAAGGAAAACCGGCCTGGCAACGATCGGCAAACTGGGTTTGTTGAAGTCGAGATCTATGTGGGAAACAGGAAACCAATAACGCAGAACGATTGAAGCGTTATTCCCGTAATAGCACAAGAAATCCGCCCTACGTTGATTGTGACCACTCTAACGGGCCAGCTATTGATGAAATAGTTAAACGATCGAAAGACATCCTAAGTACATCGCATCTCCACACCGTGAAACTACGGGACTACCTAAACTGGATTTATACCCAGCATGGTAGCGGAGTCTCCATAGTACTCAAGGTCGGCCTGTAATGGGCTACGGCAGTCCGCATTATACGGAAGCAGCAAAGGTATAAATCAACCGGAAGGGAGATGAAAGCCTCACAGTGCTGCGGGGAAGGGAGACAGTTGTTAGATACATCAATCTCTAAGAGGAGGGTTAGCCAATGCTGCCTGATGAACTTGAGAAAAGATTAGATGGAATTGAGGATGCTTCAAAAAAAGGGTATCCAGTACGTAATCTATACAAAATTGCGTGTGAGCCCGGACTGTGGCTGCAAGCCTACGTGAATATACGGGCCAATGCAGGTGCACTGACTCAAGGTGTATCAACAGATACCATTGATGGCTTCTCTATCGAAAGAGCAGAGAAGCTGGCAAAAACATTGCGGAGTAGAGAATACGTAGCGAAACCTGTACGCAGAGTTCAGATCCCAAAAAAGGACGGAAAAACCAGACCGCTGGGGATTCCAGGTGGAGACGATAAACTGGCGCAAGAAGTAGCAAGAATTATTCTGGAAAGGGTTTATGAACCAGTATTCTCTGAACACTCTCATGGATTCAGACCAAAACGTTCGTGTGAAACTGCGCTCAGATCTATTCGTCCAGTCTGGAACGGAGTTAAATGGATCGTGGATGTTGATATCAAAGGTTTCTTTGACAATATCCCCCACAAACTATTGCTGAATACACTGGCAGAGAAAATTAAGGATAAGAATTTCCTGAAATTAATCGAACAGTGGCTGAAGGCAGGATATGTGGACAATTGGAAATACCATCGCTCATATTCTGGCACGCCACAAGGTGGGATAATTTCTCCCTTACTGGCAAATATATACCTCGACAAACTGGACCGTTTTGTAGAACAGAACTTAATTCCTGCATATACAAGCGGTACAAAACGAAGAGCGAATCCAGATATGAACAGGCTGGCCCATAAGATTCATAAACTACGAAAGAAAGTAGACGGTATGGCAACTGATTCTGAATCAGAAAAAGAAGCCATAAAGAAAGAAATTGAACGACTGCTTGAAGAAAAACGTTCTATACCATCTCAGGTCATGAACGATCCAAACTTCAGGCGTATGTACTATGTTCGATATGCTGATGACTTCGTAATTGGTGTCATCGGCTCGAAGAAAGATGCTGAACATATATCAAGACAGGTCAGAAACTTTATTACTACTTCACTGGGATTGGAAGTTAATGAAGCAAAGACCAGGATCCGCCATATCTCAGAAGGAGTTAATTTTTTAGGCTATGAAATCAGACAGGCAGATGCAAAGAAATTGCTGAAGCAGAAAATGCAAGGGCGGCACGCTCTCCGGAGATCCACCACAGGAATTGTGCAGCTCTTCGTTCCTGACAATATAGCCGCTAAATTTTGTCACCAAAAGAAATATGGCTGTTATGAGAATGTGAAGGCTGTACATCGTTCAAGTCTACAAAATCTCAGTGAAGCTGAAATTGTTCTGACTTTCAACGCGGAAATGCGCGGATTGGCGAACTACTACAGTCTGGCGCTGGACATGAAATATAAATTATCAAAATTATATTTCATATGGCAAATAAGCCTTTTTAAGACCCTAGCGAATAAACGACGTAGTTCAGTGAATAAAGTCGCCAAATCGCTACGGCAAAATAATGGAGATCTGGCTATTACCGTTCAGGCCAAAAATGGATCCCGAAAGATTGAGGTTTTCAAACTCAAACATGTCAATCGCAACCGAACAACGATTGTCGATACGGAACCGAGAACCGCACATATCACAACCCGCACAACAGAGATCATGCGTCGATTAGGGGCAAGGATCTGCGAGTATTGTGCGAAGACTGGGCGCTGCGAAGTACATCATGTCAGAAAGCTAAAAGACCTGAAGAAAGGTCGTAAGGCCGGATATAAACCCAGCCTGTGGCAACTAATGATGATCGCACGCCGCCGGAAAACGATGATTTTGTGCGCAAGTTGCCACGACAAGTTACATAGCGGAAAACTACCCGATCTGAGACAGGAAAAGGGAAGAATACTCCTGCAACCACCTGTCTCAAGTGGGGATTCTGCTAAATGAGTTTATCCCCAATAAGCTAGCAATGGAGAGCCGTATGCATTGAAAGGTGCACGTACGGTTCGGAGGGGGGGTAACGGTGATCTAACTCAAGATAAGCAGGACATATTCGCCATTTATCTTGAAAGACCACACGTACCCTACCCTACCGTCTGGGTAATGATCCGGAAGTCCGTTACATCCCCAAACCGGATGTGCGGGTTATCCGCAATCTGATTACGGAAAGTGAAGTGGCGGTGGCGGGGAACAGTAAATTCCGCTTCGTGGGGGCTGATGCCTTCTCGCCAGACGAACTGCGCACCGATTTGTTCAGCGACGACGAGGGTGGCTATGTGGACTGCGTGGCGCTCGATGCCGCCCTGCTGGAAAAACTCCAGGCTGTCGCAGAGTTCCTTCGGGAAGCCGAAGGCTGGGAATGGTGCGCCGGACGCATGGAGCCTGTCGGTGAGTGCCGTGAGGATGCCGGAACATACCGCTGTCTGCCGGAGCCGGAAGCGGTGCTGACCGAGGCGGAAGAAGAACGTCTGAACGAACTGATGGCACGTTACGACGCGCTGGAAAACCAGTGTGAGGAATCTGACCTGCTGGAAGCAGAAATGAAGCTGATGCGCTGCATGGCGAAGGTCAGAGCGTGGACGCCGGAGATGCGTGCCGGAGGTGGAGTGGTGGTGTCCTGGCGTTATGGCAACGTGTGTGTCCAGCGTGGTGTGCAGTTGCGCAGTGAAGATGACGCGACTGATGACGCTGACCGCACGGAACAGGTGCAGGAGAAAGCGTCAGTGGAGGCAATCAGTCTGCCGTTGCTGACGAAAATGTCTTCCGAACGCACGCTGGCAGTCCAGGCAGCACTGATGCAGCAACCGGACAAATCTCTTGCACTGTTGGCATGGACGCTCTGCCTGAATGTGTTTGACAGCGGAGCGTACAGTAAACCAGCACAAATCAGCCTGGAATGTAAACATTATTCGCTGACCAGCGATGCGCCGTCAGGGAAGGAAGGTGCCGCATTCCTGGCGCTGATGGCAGAAAAAGCCCGTCTTGCGGCCCTGTTACCGGAGGGATGGTCACGGGACATGACAGAGCAGAAATACAGCCGCCAGCGGGAACGTGAGGCAGAACGACGGGAACTGGAATACCAGACATGTTTTGCTCAGGCGCAGATTGACCTTGCGTTTCATACTCCCGCCACGGTCGGAAGCTGGTTGTCCCGCTGGTCTGGTGTTGTTGAGGAGCATGATCTGGAAACGATTTTCTGGGGGTGGTGCGGGCGTTTTCCATCACTGTCATCATTTGACCGGTTTTTCTGGCAGGAGGAACCACTCTGGCGGCTGATTTTTGAAGCCGGTGAGGCCGGTCGTGGTGCACCGGTACAGGTACGTGCACTTGAGCAGTGGATGATCCCGAACAAGCTGGAGAACGTAATATGATGAAATCAGACGAAAAATACCAGGTTCCCGCGTGGATGCGACCTCTGTTGCCGTTGCTCTGCAACACCGGGGGGAACGATCCGGAAGAACTGCTGAATGATACAGAAACCACTGCCAGTGCGAATGTTGTCCGTTATGTACTGATAGTTGCTGTGCGGTCGCAGGTTGATCTGCTGCAGCTTCTGTACAGGAAAGGGCTGTTGCGCACAGAGATACCAGGTGGCTTTTCACCGGAAGAAGCGCAGGCACTCCTGGATAATCTGGTGCGCAGCCATATCAGCAAGGCGCTGTCGGGCGAGCGAATGGCAGCCCGTGACAGAAATGCCGATCTGGCCTGGATTGGACAGCAACTGGTCGATGCCGCCTGGTTTGTCCGTGCCACACTGGAAGCGCATGGGATGAGCGTCGGAAATGAGAGTCCCCCTGCTCCGCCGGAGACAATGCCGGACATACAGACACGGGAACTGGTGATGTTGATCAAGCGACTGGCATCATCGCTGAAAGCGGTGAAACCGGACAGTTGTGTGGTGCGTGAAGCGCAGGACTGGCTGCGCGACAGAAAACTTGTGGATATCACAGATATTCTCCGGTGAACCAGGCATGTATATTTTACGCAGGGACACGCCGATCATGATCGTTTCATATCAGGGGTGGTCTGGAGGTCATGCGGCGTGTCCTCTGCACTCGCCGGAATAAGGAAGTCGCCGGCGGCTCCGCTTTTACCCGGCCATGCGGGGCATGGCCTTGTGGGTTTTCAGCTCTGTGGCCTCAGCGTCGTGTGCGGGCTGTGCCGTGCCTCCATCTTAGCCGGGCTGGCAGGGATGCAAGGGTACGCTTCGCCGCTGCGGTCACCCGGTCCCTCCTTCCCGTCTGCCGTGATTTTCCGGTCGTTCGTCCGCTCAGGTTTCGCGGCCTCACCCTGCAACTCCCGTCAGCCGTGTGCGTCAGGCTGCGGCTTCCCTTGCGCCCCTGCATCCCCGCCTTATCGCCCGGCTTTTATGGAGGCACGGCACCGCCCGGTGTCGCAGAACATGTGACTATGGAGGATTCGGGAATGTCTGTTGTTGCACCTGCTGTATACGTTGGAACCTGGCACAAATACAACTGTGGAAGCATCGCCGGACGCTGGTTTGACCTGACCACGTTTGATGATGAGCGCGACTTTTTCGCCGCCTGCCGTGCTCTTCACCAGGATGAAGCCGATCCTGAACTGATGTTTCAGGATTATGAGGGATTCCCGGGGAATATGGCCTCTGAATGTCATATCAACTGGGCCTGGGTTGAAGGCTTCCGCCGGGCACGGGATGAAGGCTGCGAAGAGGCTTATCGTCTCTGGGTGGATGATACCGGTGAGACGGATTTTGACACCTTCCGCGATGCCTGGTGGGGCGAGGCTGACAGTGAGGAGGCTTTTGCGGTTGAGTTCGCCAGTGATACCGGCCTGCTGGCTGACGTGCCGGAGACGGTGGCGCTCTATTTTGACTATGAGGCGTATGCGCGGGATTTATTCCTGGACTCCTTCACCTTTATTGACGGTCATGTGTTCCGTCGGTGA
Protein sequences of DBSCAN-SWA_1 >CP034963|5858:13593|7935_8355_-|QAS87903.1|DBSCAN-SWA MVSCFPHRSRLQQTQFADRCQAGFPSPATDYAEQELDLNSYCISRPAATFFLRASGESMNQAGVQNGDLLVVDRAEKPQHGDIVIAEIDGEFTVKRLLLRPRPALEPVSDSPEFRTLYPENICIFGVVTHVIHRTRELR >CP034963|5858:13593|13092_13593_+|QAS87906.1|DBSCAN-SWA MSVVAPAVYVGTWHKYNCGSIAGRWFDLTTFDDERDFFAACRALHQDEADPELMFQDYEGFPGNMASECHINWAWVEGFRRARDEGCEEAYRLWVDDTGETDFDTFRDAWWGEADSEEAFAVEFASDTGLLADVPETVALYFDYEAYARDLFLDSFTFIDGHVFRR >CP034963|5858:13593|12034_12631_+|QAS87905.1|DBSCAN-SWA MMKSDEKYQVPAWMRPLLPLLCNTGGNDPEELLNDTETTASANVVRYVLIVAVRSQVDLLQLLYRKGLLRTEIPGGFSPEEAQALLDNLVRSHISKALSGERMAARDRNADLAWIGQQLVDAAWFVRATLEAHGMSVGNESPPAPPETMPDIQTRELVMLIKRLASSLKAVKPDSCVVREAQDWLRDRKLVDITDILR >CP034963|5858:13593|5858_6485_-|QAS87901.1|DBSCAN-SWA MIIVLGSQKGGVGKSTLAVSIAAYLMSLGNRVIIVDADDQKSVLTWYNNRPEDLPHIPVTGATGNIKAMLKEHEKSYDFVIADCAGRDSAEMRSGLMAADVFISPLRPSQMDLDVVPHTCSVFTAAKDFNEDVRGYLVLNMTPTNMFVNEANEAAEVLKDYPEMHLANSRVCDRKAHRDAWAESMTIFETQNDKAQQEIEALVKEVIL >CP034963|5858:13593|8712_10614_+|QAS87904.1|DBSCAN-SWA MLPDELEKRLDGIEDASKKGYPVRNLYKIACEPGLWLQAYVNIRANAGALTQGVSTDTIDGFSIERAEKLAKTLRSREYVAKPVRRVQIPKKDGKTRPLGIPGGDDKLAQEVARIILERVYEPVFSEHSHGFRPKRSCETALRSIRPVWNGVKWIVDVDIKGFFDNIPHKLLLNTLAEKIKDKNFLKLIEQWLKAGYVDNWKYHRSYSGTPQGGIISPLLANIYLDKLDRFVEQNLIPAYTSGTKRRANPDMNRLAHKIHKLRKKVDGMATDSESEKEAIKKEIERLLEEKRSIPSQVMNDPNFRRMYYVRYADDFVIGVIGSKKDAEHISRQVRNFITTSLGLEVNEAKTRIRHISEGVNFLGYEIRQADAKKLLKQKMQGRHALRRSTTGIVQLFVPDNIAAKFCHQKKYGCYENVKAVHRSSLQNLSEAEIVLTFNAEMRGLANYYSLALDMKYKLSKLYFIWQISLFKTLANKRRSSVNKVAKSLRQNNGDLAITVQAKNGSRKIEVFKLKHVNRNRTTIVDTEPRTAHITTRTTEIMRRLGARICEYCAKTGRCEVHHVRKLKDLKKGRKAGYKPSLWQLMMIARRRKTMILCASCHDKLHSGKLPDLRQEKGRILLQPPVSSGDSAK >CP034963|5858:13593|6664_7936_-|QAS87902.1|DBSCAN-SWA MFALADINSFYASCEKVFRPDLRNEPVIVLSNNDGCVIARSPEAKALGIKMGQPWFQVRQMRLEKKIHVFSSNYALYHSMSQRVMAVLESLSPAVEPYSIDEMFIDLRGINHCISPDFFGHQLREQVKSWTGLTMGVGIAPTKTLAKSAQWATKQWPQFSGVVALTAENRNRTLKLLGLQPVGEVWGVGRRLTEKLNALGINTALQLAQANTAFIRKNFSVILERTVRELNGESCISLEEAPPAKQQIVCSRSFGERITDKDAMHQAVVQYAERAAEKLRGERQYCRQVTTFVRTSPFAVKEPCYSNAAVEKLPLPTQDSRDIIAAACRALNHVWREGYRYMKAGVMLADFTPSGIAQPGLFDEIQPRKNSEKLMKTLDELNQSGKGTVWFAGRGTAPEWQMKREMLSPAYTTRWTDLPVAQL |
6 | Burkholderia_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1798 : 30239
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP034964|1798:30239|DBSCAN-SWA AATGATTCAAACACGTAATCAATATCTGCAGTTTATGCTGGTTATGCTGGCTGCATGGGGCATTAGTTGGGGAGCCAGATTTGTCATGGAGCAGGCCGTTCTGCTTTATGGATCAGGAAAAAACTATTTGTTCTTCAGTCATGGTACTGTTCTGATGTACCTGCTGTGTGTTTTCCTGGTATACCGCCGTTGGATAGCTCCGCTACCGGTCGTTGGTCAGCTGCGCAACGTTGGCGTACCGTGGCTGGTCGGTGCGATGGCCGTGGTGTATGTCGGTGTATTTCTGCTCGGTAAGGCGCTGGCTCTGCCTGCTGAGCCATTTATGACGAAACTTTTTGCCGATAAGTCCATACCTGACGTGATCCTGACGTTGCTGACCATCTTTATCCTTGCCCCGTTGAATGAGGAAACGCTGTTCCGGGGGATTATGCTGAACGTCTTCCGTTCACGGTACTGCTGGACGATGTGGCTGGGGGCGCTGATAACGTCGTTGTTGTTCGTCGCCGCGCACAGCCAGTATCAGAACCTGCTGACACTGGCAGAACTGTTCCTGGTGGGGTTGATTACATCAGTGGCCAGGATCAGAAGTGGTGGCCTGCTGCTGCCGGTATTGCTGCATATGGAAGCAACCACGCTGGGTTTACTGTTTGGTTGAAAGTTATATTTTTATTAAACATTGTGCGTTAAAGCCTGGTGTGTTTTTTTAGTGGATGTTATATTTAAATATAACTTTTATGGAGGTGAAGAATGCATACCACCCGACTGAAGAGGGTTGGCGGCTCAGTTATGCTGACCGTCCCACCGGCACTGCTGAATGCGCTGTCTCTGGGCACAGATAATGAAGTTGGCATGGTCATTGATAATGGCCGGCTGATTGTTGAGCCGTACAGACGCCCGCAATATTCACTGGCTGAGCTACTGGCACAGTGTGATCCGAATGCTGAAATATCAGCTGAAGAACGAGAATGGCTGGATGCACCGGCGACTGGTCAGGAGGAAATCTGACATGGAAAGAGGGGAAATCTGGCTTGTCTCGCTTGATCCTACCGCAGGTCATGAGCAGCAGGGAACGCGGCCGGTGCTGATTGTCACACCGGCGGCCTTTAATCGCGTGACCCGCCTGCCTGTTGTTGTGCCCGTAACCAGCGGAGGCAATTTTGCCCGCACTGCCGGCTTTGCGGTGTCGTTGGATGGTGTTGGCATACGTACCACAGGTGTTGTACGTTGCGATCAACCCCGGACAATTGATATGAAAGCACGGGGCGGAAAACGACTCGAACGGGTTCCGGAGACTATCATGAACGAAGTTCTTGGCCGCCTGTCCACTATTCTGACTTGAACATGGGGTTTGAGGGGCAACTGGATGAAAACGTACGATTTAGGGCACTAAAACCGCTGTTGTCCCACCATTCTGGTGATTCCCAAACGTTATTTGGCTAAAAAGTAGTTTTGATGTGGTTTATTTTCCGAATTCCAAGCGCAGCCCTACTTTCTTGTGCGTATTTGCGTGTTTTGCGCAGTTTTGAAGTTCCGGTGATGCTGCCAACTTACTGATTTAGTGTATGATGGTGTTTTTGAGGTGCTCCAGTGGCTTCTGTTTCTATCAGCTGTCCCTCCTGTTCAGCTACTGACGGGGTGGTGCGTAACGGCAAAAGCACCGCCGGACATCAGCGCTATCTCTGCTCTCACTGCCGTAAAACATGGCAACTGCAGTTCACTTACACCGCTTCTCAACCCGGTACGCACCAGAAAATCATTGATATGGCCATGAATGGCGTTGGATGCCGGGCAACCGCCCGCATTATGGGCGTTGGCCTCAACACGATTTTCCGCCATTTAAAAAACTCAGGCCGCAGTCGGTAACCTCGCGCATACAGCCGGGCAGTGACGTCATCGTCTGCGCGGAAATGGACGAACAGTGGGGATACGTCGGGGCTAAATCGCGCCAGCGCTGGCTGTTTTACGCGTATGACAGGCTCCGGAAGACGGTTGTTGCGCACGTATTGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGCAACGTTTTCTGCCTCTGACGCCTCTTTTAATGGTCTCAGATGTCCTTTGGTCACCAGTTCTGCCAGCGTGAAGGAATAATGGCCGAGCATATTGATATGTCCGTGGCAAAGCGGGGAGAGGCGTGCGATATCTTCATCATTCAGTGTTTCACCCTGCGCCCGGAGATGATCCAGGGCTGCCTGCATATAAATAGTGTTCCATAACACGACGGCGTTAGTGACCAGCCCCAGTGTGCCCAGTTGATCTTCCTGACCGTCGGTATATCGTTTTCTTATCTCACCTTTTTGACCGTGACAGATGGCTCTGGCAACGGCATGGCGACTTTCTCCCCGATTAAGCTGGGTCAGAATGCGCCGGCGGTAATCTTCATCATCAATATAATTAAGCAGATACAGCGTTTTGTTGATGCGCCCCACTTCAATGATTGCCTGAGTCAGTCCGGAAGGACGTTCACTTTTCAGCAATGAACGGACCAGCACTGAAGCCTGTACTTTGCCCAGCTTCAGGGAGCCAGCGGTCCGGATCATTTCGTCCCACTGAAGGACTATTTTTGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCAAATTTTGTATAATAGGAATTGAAGTTAAATTAGATGCTAAAAATTTGTAATTAAGAAGGAGGGATTCGTCATGTTGGTATTCCAAATGCGTTATCAAATGCGTTATGTAGATAAAACATCTACTGTTTTGAAACAGACTAAAAAAAGTGATTACGCAGATAAATAAATACGTTAGATTAATTCCTACCAGTGACTAATCTTATGACTTTTTAAACAGATAACTAAAATTACAAACAAATCGTTTAACTTCTGTATTTGTTTATAGATGTAATCACTTCAGGAGTAAATTACATGAACAAAAATATAAAATATTCTCAAAACTTTTTAACGAGTGAAAAAGTACTCAACCAAATAATAAAACAATTGAATTTAAAAGAAACCGATACCGTTTACGAAATTGGAACAGGTAAAGGGCATTTAACGACGAAACTGGCTAAAATAAGTAAACAGGTAACGTCTATTGAATTAGACAGTCATCTATTCAACTTATCGTCAGAAAAATTAAAACTGAATACTCGTGTCACTTTAATTCACCAAGATATTCTACAGTTTCAATTCCCTAACAAACAGAGGTATAAAATTGTTGGGAATATTCCTTACCATTTAAGCACACAAATTATTAAAAAAGTGGTTTTTGAAAGCCATGCGTCTGACATCTATCTGATTGTTGAAGAAGGATTCTACAAGCGTACCTTGGATATTCACCGAACACTAGGGTTGCTCTTGCACACTCAAGTCTCGATTCAGCAATTGCTTAAGCTGCCAGCGGAATGCTTTCATCCTAAACCAAGAGTAAACAGTGTCTTAATAAAACTTACCCGCCATACCACAGATGTTCCAGATAAATATTGGAAGCTATATACGTACTTTGTTTCAAAATGGGTCAATCGAGAATATCGTCAACTGTTTACTAAAAATCAGTTTCATCAAGCAATGAAACACGCCAAAGTAAACAATTTAAGTACCGTTACTTATGAGCAAGTATTGTCTATTTTTAATAGTTATCTATTATTTAACGGGAGGAAATAATTCTATGAGTCGCTTTTGTAAATTTGGAAAGTTACACGTTACTAAAGGGAATGTAGATAAATTATTAGGTATACTACTGACAGCTTCCAAGGAGCTAAAGAGGTCCCTAGACTAGCAAGAAGTACACAAGAAGCCTTAAAGATTATAGAAAAGCTTTTCCATGAAATGCTTAACATTATTTAATGTTAACATGTGGCTTTGTGGTAACTACCACGAGTGGTAACTTGTAACATGAAAAAAGTTACAAGTTACAAGTGGTTTATTTACGAACATAAAGGCTAGGAAAGCACTTGATAAGTATATCAAGTGCTTTTTTTCCGTTCGATTTTACTATCCAAGATACCGCTTGACGTACCTAATATTTTAGTGTGCTAAGGGCTTTGCTTTTTCTGGTTATAGTTCGTTCAAAAAAGACCGCAAGGGTAAATGCACGATCATAGATCGCCCTTGCAGCTTTTTAATCTCTCCACTTTTTTATAACCAAGCAAACCACCAAGCACACTGAAAAAATAATTAGGAACTAAAGCGAAGTGCGTATCTTGGATAAATCTTCACTCGATTTTAAAAGATTAAGCACCCAGCCAAAGCGCCTAAAAAAATAATTTTTTTAAAGAAAGGAAGTAATTTAAATGACTACAATTTTAAGCGACTACAATAAATCTATCCTAATTGAATTACTAAACACCAATCGCCAAATTATCGTCGTTCATGGCGACGATATTGATGATTATTACACAGATTCTTGCTATGACATTATCTTTATTGATTACTATGATGATGATGATTTCACTTATAACGGACAGGAAATATGGCGTGGCGATAGCATTTATCTTGTAAAAAGTTATCAAGATGAAATTCAAAACCGTATAGAAAAAGGACAAGAGCTGATATTTGTTAATAAAGAAAATGATATATCATTTTCAGAATCTATTTTAACTTATTACCATCTATTTGATTATGCTGGTAAAACGTTTGAAATAGACGGTCAAATCATTGATGAACAAACAAAAGTTTTTGCCGCACCCGAGCGCCGCTGCGCATGTCGCACGTGCTGGAAGGTGCCAGCCAGGAAGACCTGAACCTTTACCGCGCGGAAGTGGAGCGCGACCAGGCCTATGGCAACTGGCGCGATTTCTTCCTGAACAAGAAGGGCAGCGTCACATCCGTGAGCGGTGATGCCAACCTCGACCAGATCGCGGACGTGTCGTACCTGCTGGATACCTTCTTTGCCAAGGTCACCCGCACCGCGTTGCAGAACGCCGCCTCCATCGCGGGCCTGATGATCACCACCGAAGCCATGGTGGCCGAGGCCCCGAAGAAGGACGAGCCGGCGATGCCGCCGGGCGGCGGCATGGGTGGCATGGGCGGCATGGATTTCTGATGCGGTTGGCCCGGTCGTCAGGGAACCGGACCGCGCCAGCGCGGTCCGATCCCGGCAACGACCCGACATCAAGGCCCCAAGGACGGGGCCGGAGCCCGGCAGCGATGCCGGGCTTTTTGTTGTGCCCGCGCCGCGGCAATGTCTGACGCGAAGATCAGAACGCACCGATACGAACGTGCGAACACAGGTGCAACCCTGAGCAGCCGTCCCCGCACCGGAGCGCTGCGTGCCGCGCCTCGCCACATCCCGGCGGCAAGCCGCGGGATGCGCGCCACTGCCGTCCGCCCACACCGGTTCGCGGTACGCGCGCCACGCGCCCGAGCGCACGCTGCTGTACGCGTTGGTAGAGGCGCACTACCCGGACTTCATTGCACGGATCGAAGCGGAGGGCCGCTCGCTGCCCGGGTATGTCCGCGAGGCGTTCGATGCCTACCTGCGTTGCGGCGTGCTCGAGCACGGCTTCCTGCGGGTGGTGTGCGAGCACTGCCGTGCAGAGAGGCTGGTGGCCTTCTCCTGCAAGAAGCGCGGGTTCTGCCCGAGTTGCGGCGCGCGACGCATGGCCGAGAGTGCGCGGCACCTGGTCGAGGAGGTGTTCGGCCCGCGGCCTGTGCGGCAATGGGTGCTGAGCTTTCCGTACCCCTTGCGTTTCCTGTTCGCCAGCAAGCCAGAAGCCATTGGCCCGGTGCTGGGCATCGTGCAGCGCGTGATCGCCGGCTGGTTGGCCGATCAAGCCGGCATCGACCGCGCCAGCGCCCAGTGCGGCGCGGTGACGCTGATCCAGCGTTTCGGCAGCGCGCTGAACCTGAACATCCACTTCCACATGCTGTGGCTCGACGGCGTGTACGTGGAAGCCACCGAGCTGCCGCGGCGCGAACTGCGCCTGCACCGCGCCCGTGCGCCCACCACCGCGCAGTTGACCCAGCTGGCAGCTGCCATCGCGCACCGGGTGTGTCGGCACCTGACGCGCAAAGGCTGGCTCGAAGGGGAGGGCGAATCGGCCTTCCTGGCAGACAGCGCTGCAGGCGACGACAGCATGGATGGGCTGCGGATGAGTTCGATCACCTACCGCATCGCCACCGGCCGCGACGCTGGCTGCAAGGTCGTCACGCTGCAAACGCTGCCCGGTGACGCCGGTTCGCTGGAGGGCGAAGCCGGCAAGGTCGGCGGCTTCTCACTGCATGCCGGCGTGGCGGCCGAAGCACACGAAAGCCACAAGCTGGAAAAGCTGTGCCGCTACATCACGCGCCCGGCGATCAGCGAGAAGCGGCTGTCGATAGCGCTCCAGGGCAGGGTGCGTTACCAGCTCAAGACCCCGTGGCGCAATGGCACCACGCATGTGGAATGGGATCCGGTGGATTTCATCGCCAAGCTGGCGGCGCTGGTCCCGCCACCGCGCGCCCACCTGACCCGCTTCCACGGGGTGTTTGCCCCGAACGCCGCCCTGCGCGCACAGCTGACGCCATCGGGGCGCGGCAGGCGGCATGACGCCGCTGTGGAGCCGGCGGACGCAAGCGCGAACGACGCGCCGCGCAGCCCCGAGGAGAAGCGCCGTTCGATGAGCTGGGCGCAACGCCTCAAGCGGGTCTTTTCCATCGACGTCACCGCCTGCGTCCACTGCGGTGGCACCGTGCGGATCGTCGCCAGCATCGAGGAACCTGCCGCCATCCGCGCCATCCTTGGCCACTTCGTGAAGCAGGGCGCGCGGGAAGAAGCGCACTACAGGCCCGCAGCGCGCGCACCGCCAGTGCAAGCCGCGTGACGATCTGCCGGCTGCACAGCCGACGGCGAAACCGGAATCCGAGCCGATGCGGCCACGATCCGCAGGGCGGCGCTCGGCCCGCTGTCGGGAATCAGCGAAGCATGGCTGCTGACAACGCCGCTGCGTGGCCCCGCGATGCCGAAATCCCACTCACAGACGTCCGATCCGTGCCCAAAACGGGGCTTGCGCGACCGCCGCCTACCCAGCAGACTGCCCGAAAAGGGCGTTTGAACTTCCTATACGCTGATAGTGCATTATCTTAAAATTTTGGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCAACGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCGGACTTTTCAAGTCTCGGAAGGTTTCTTCAATCTGCATTCGCTTCGAATAGATATTAACAAGTTGTTTGGGTGTTCGAATTTCAACAGGTAAGTTAGTTGCTAGAACCCATGGCTCCTTTGCCGACGCTGAGTAGATTTTAGGTGACGGGTGGTGACAATGAGTCCGTGTCGAGCGCTGATTTTTTCGGCCTTTAGAGCGAGATTTATACAATAGAATTTGGCATGAGATTGGATTGCTTTTAGTCAGCCTCTTATAGCCTAAAGTCTTTGAGTGACTAGATGACATATCATGTAAGTTGCTGATAGGTTTCCAGTTTTCCGCTCCTAGGTCTGCATATTGTACTTTTCCTCTTACTCGACTTAACCAGTACCAACCCAGCTTCTCAACGGATTTATACCATGGCACTTTAAAGCCAGCATCACTGACAATGAGCGGTGTGGTGTTACTCGGTAGAATGCTCGCAAGGTCGGCTAGAAATTGGTCATGAGCTTTCTTTGAACATTGCTCTGAAAGCGGGAACGCTTTCTCATAAAGAGTAACAGAACGACCGTGTAGTGCGACTGAAGCTCGCAATACCATAAGTCGTTTTTGCTCACGAATATCAGACCAGTCAACAAGTACAATGGGCATCGTATTGCCCGAACAGATAAAGCTAGCATGCCAACGGTATACAGCGAGTCGCTCTTTGTGGAGGTGACGATTACCTAACAATCGGTCGATTCGTTTGATGTTATGTTTTGTTCTCGCTTTGGTTGGCAGGTTACGGCCAAGTTCGGTAAGAGTGAGAGTTTTACAGTCAAGTAATGCGTGGCAAGCCAACGTTAAGCTGTTGAGTCGTTTTAAGTGTAATTCGGGGCAGAATTGGTAAAGAGAGTCGTGTAAAATATCGAGTTCGCACATCTTGTTGTCTGATTATTGATTTTTCGCGAAACCATTTGATCATATGACAAGATGTGTATCCACCTTAACTTAATGATTTTTACCAAAATCATTAGGGGATTCATCAGCGTATAGTGTTTTGCAGTTTAGAGGAGATATCGCGATGCATACGCGGAAGGCAATAACGGAGGCGCTTCAAAAACTCGGAGTCCAAACCGGTGACCTCTTGATGGTGCATGCCTCACTTAAAGCGATTGGTCCGGTCGAAGGAGGAGCGGAGACGGTCGTTGCCGCGTTACGCTCCGCGGTTGGGCCGACTGGCACTGTGATGGGATACGCGTCGTGGGACCGATCACCCTACGAGGAGACTCTGAATGGCGCTCGGCTGGATGACGAAGCCCGCCGTACCTGGCTGCCGTTCGATCCCGCAACAGCCGGGACTTACCGTGGGTTCGGCCTGCTGAATCAATTTCTGGTTCAAGCCCCCGGCGCGCGGCGCAGCGCGCACCCCGATGCATCGATGGTCGCGGTTGGTCCGCTGGCTGAAACGCTGACGGAGCCTCACGAACTCGGTCACGCCTTGGGGGAAGGATCGCCCGTCGAGCGGTTCGTTCGCCTTGGCGGGAAGGCCCTGCTGTTGGGTGCGCCGCTAAACTCCGTTACCGCATTGCACTACGCCGAGGCGGTTGCCGATATCCCCAACAAACGGTGGGTGACGTATGAGATGCCGATGCTTGGAAGAGACGGTGAAGTCGCCTGGAAAACGGCATCGGATTACGATTCAAACGGCATTCTCGATTGCTTTGCTATCGAAGGAAAGCCGGATGCGGTTGAAACTATAGCAAATGCTTACGTGAAGCTCGGTCGCCATCGAGAAGGTGTCGTGGGCTTTGCTCAGTGCTACCTGTTCGACGCGCAGGACATCGTGACGTTCGGCGTCACCTATCTTGAGAAGCATTTCGGAACCACTCCGATCGTGCCTCCGCACGAGGCCGTCGAGCGCTCTTGCGAGCCTTCAGGTTAGAGGCCGTCGACAATGATAATCTGGATCAACGGACCTTTCGGCGCCGGAAAGACGACGCTCGCTAAGCGGCTGCGCGATCGGCGTTCCAAATCGCTGATCTTTGACCCCGAGGAAATCGGGTTCGTGGTGAAAGAAACGGTCCCCATGCCAGCGAGCGGAGACTATCAGGATCTCCCCTTGTGGAGGGGACTTACGATCGCGGCGGTCAGGGAGATTCGAAGGAATTACTCGCAGGACATCATCATCCCAATGACGCTCGTGCACCCGGACTATCTGACTGAGATACTCGACGGGGTAAGGCGGATCGACGATCAGCTGCTGCACATCTTTCTGACGCTCAACGAGGACCTATTGCGTCACCGGATCGCGAACCAGACCATGCATCCTGACCCGAATCGAAATGCGGAGATTCGAGAGTGGCGATTAGCGAATGTCGCCCGATGCTTGGCCGCAAGGGAACGGCTTCCATGCACAACCCGTGTTCTCGATAGTGGTGCACACACCAGCGATGAACTCGCAGCGATGGTGCTCGACGGAATCGATGGGCGCACCTGATCGCCTTCGACGCCTGCGCAAAGCGTAGCGCGAGGGTGGCGGGCTCACGACCAAACGCCCAGAGGTCGATCATCGCAGGGATGTTTGGCTTTGTGGTGCGGACGACGGGACTCGAACCCGTACTCTCACAGAGAAGCAGATTTTCGTACCACCTCGACTTTCGCCGCCGTCTGATGACGTTCGTGGTCTGGACTGTCCCTTCGCCATTGCCCGAAGGCTTTAGGCGCCGCCCGTCCAGTCTCTACACCTTCCCCCGAAGGGGCTTGGCTCGGGATTGGCTTAGGGTATTGCCCGTTAGCGTTCCCCGACTTTGAGCGGTTCTACTCCGCGGATTTCCCCGCGGGCACTCCAATTTTAAAGTCTGCTGCGTCTACCGATTTCGCCACGTCCGCCTTTTTTCGCCGTTCCTAGCGCTCGTGCGATGCACCTATGTTGCACCTAGCGCCGAATCGTTCTTCGTCATCCTGAAAAACCACGTCTCCTAAAGCCTTGCATAGCTTATCTTTTCTCCACCACGAACTTTTTTGTGGGATGGTAGAAAAAAAGACTTTTTAAGTCCGCTGGCTTGCCAGGCCTTGTTAGCTTGTACGGTCATGGTTATCGGGTAAAGAATATTGACGGCATCGCTGGTGTCGGTGGCTGAAAAGCCGGCTCCCATCAGGGCAATAGCCATTTCAGATGCAGGCGTACAGGGCAATGGTCAACAGCTACAGCCTGTCTGACGATTCCGGCGTCATGGCTGCGGCGGCTATCACGCATTTTTTGTTCGGTCAGGCGGTGTTTTCGTACCTCAATGGTTGGAGCGTGTTGATCGGACCTGGTACAGGTTTGGACAGCACGGGCTGCAAATACGCAAGGGATTTAATGGGCCTGGTGGCGTTCACGGCTTTTATCGTGACGTTTCTGTTCAGGGGCTACTCATAATCTCGTGGCTCGGCGGTTCCCGGCACACCATGACAGTAAGGAAGGACCCTGTGTCTCAACTCTCCCAGCTTCGAAGCCCCGCCGCCGTGCAGGCTGCCATCGATGAGTTCGTGCAACTGGGCCGCACGAAATTCCTGGCGCGCCACGGCTACGGCAAGTCCCGCGACTTCCTGGTACGTGATCCGAAGACCGGCACCGATTGCGATTCCAAGGCCATCGCCGGTGTGGCCTTCGGCAAGCAATTTCCCGAGCAGGGCCCGCTCACTGCTGACAGCTTCTCCGGTGGCGAGGCGACCGTCGTTCCGGCGCTGACGCGGCTCGGGTTTCGCATCATTCGCATCGGCGAAGACTGGTCCGAAGAAGAGGTCCTGGCCACGGTCGAAGACTATTTCGACATGCTGCGTGCCGAGGCGGCTGGGGAGCCGTACAACAAGTCCGAGCACAACCAGGCACTGCGCCAACTGCTGAACGGTCGCAGCAAGTCTTCAGTCGAGCTCAAGCACCAGAACATTAGCGCCGTACTCGATGCCCTGGGCCTGCCCTATATCAACGGCTACAAGCCACGCGGCAACAGCCAACTGCTGCTGCGTAAATCCGTACACGCCTACGTTCTGGAACATCAGCAGACGGTCGGCGCTCTTGTCGATGCCCTGGAGGAGGTAAAACTTCCGGGTGACAAAACCTACCGAGCGGCTTTGGTAGAACCACCCGCCCGTGAAGTGCTTGTGCGTACCCCGGCATCTCTACGGCAACGCCTACCGCGAAAGTTCGATTATGCCGCTCGCGATGAAGCCAACCGCAAGCTGGGCCGGGCAGGGGAGCAGTGGGTGATTGGCTACGAACAGCAACGCCTGACCGAGCTCGGCCACCCAGAGCTTTTTCAGCGGCTGGATTGGGTGTCCGACACCCAGGGAGACGGTGCGGGGTTCGACATCCTGTCGTTCGAAGAGGACGCCCATGAGCGCTTCATCGAGGTGAAAACCACCAATGGCGGGGTAGGCTCGTCTTTCTTGGTCAGCCACAACGAACTCGAATTCTCCAAGGAGGCGGGCGATCAATTCCATCTGTATCGCGTGTTCCAGTTTCGGGACGGTCCGCGCCTGTTCACGCTACCCGGCGACCTCAGCCAACATGTGCATCTCAAGCCGACGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCGGATTGAATATAACCGACGTGACTGTTATATTTAGGTGGCTAAACCCGTCAAGCCCTCAGGAGTGAATCATGACCGTAGTCACGACCGCCGATACCTCCCAACTGTACGCACTTGCAGCCCGACATGGGCTCAAGCTCCATGGCCCGCTGACTGTCAATGAGCTTGGGCTCGACTATAGGATCGTGATCGCCACCGTCGACGATGGACGTCGGTGGGTGCTGCGCATCCCGCGCCGAGCCGAGGTAAGCGCGAAGGTCGAACCAGAGGCGCGGGTGCTGGCAATGCTCAAGAATCGCCTGCCGTTCGCGGTGCCGGACTGGCGCGTGGCCAACGCCGAGCTCGTTGCCTATCCCATGCTCGAAGACTCGACTGCGATGGTCATCCAGCCTGGTTCGTCCACGCCCGACTGGGTCGTGCCGCAGGACTCGGAGGTCTTCGCGGAGAGCTTCGCGACCGCGCTCGCCGCCCTGCATGCCGTCCCCATTTCCGCCGCCGTGGATGCGGGGATGCTCATCCGTACACCGACGCAGGCCCGTCAGAAGGTGGCCGACGACGTTGACCGCGTCCGACGCGAGTTCGTGGTGAACGACAAGCGCCTCCACCGGTGGCAGCGCTGGCTCGACGACGATTCGTCGTGGCCAGATTTCTCCGTGGTGGTGCATGGCGATCTCTACGTGGGCCATGTGCTCATCGACAACACGGAGCGCGTCAGCGGGATGATCGACTGGAGCGAGGCCCGCGTTGATGACCCTGCCATCGACATGGCCGCGCACCTTATGGTCTTTGGTGAAGAGGGGCTCGCGAAGCTCCTCCTCACGTATGAAGCGGCCGGTGGCCGGGTGTGGCCGCGGCTCGCCCACCACATCGCGGAGCGCCTTGCGTTCGGGGCGGTCACCTACGCACTCTTCGCCCTCGACTCGGGTAACGAAGAGTACCTCGCTGCGGCGAAGGCGCAGCTCGCCGCAGCGGAATGAGCGAACGTCGATATAGCCCGCTCGCGACGCTGTTCGCGGCGACCTTTCTCTTCCGGATCGGCAACGCGGTGGCGGCCCTCGCGCTTCCATGGTTCGTCCTGTCTCATACAAAGAGCGCGGCCTGGGCGGGCGCCACGGCCGCTAGCAGCGTCATCGCGACCATCATCGGCGCGTGGGTTGGTGGTGGCCTCGTCGATCGGTTCGGGCGCGCGCCCGTCGCATTGATCTCGGGTGTGGTGGGCGGCGTGGCCATGGCGAGCATCCCACTGCTCGATGCCGTTGGCGCCCTCTCGAACACTGGGCTGATCGCTTGCGTGGTGCTCGGTGCCGCGTTCGACGCACCCGGTATGGCCGCGCAGGACAGTGAGCTGCCCAAACTCGGCCACGTCGCCGGGCTCTCCGTTGAGCGCGTCTCGTCACTGAAAGCGGTGATCGGGAACGTCGCGATTCTAGGTGGCCCGGCCCTTGGGGGGGCCGCAATCGGCCTGCTTGGCGCTGCGCCAACGCTCGGGCTGACGGCGTTCTGCTCCGTCCTTGCAGGTCTGCTCGGCGCGTGGGTGCTTCCCGCGCGTGCCGCTCGGACGATGACCACGACGGCGACTCTCTCCATGCGCGCCGGCGTCGCTTTTCTCTGGAGCGAACCCCTGCTGCGCCCTCTCTTTGGTATAGTGATGATCTTCGTGGGCATCGTTGGCGCCAACGGCAGCGTCATCATGCCTGCGCTGTTTGTAGATGCAGGACGCCAAGTAGCAGAGCTCGGGCTGTTCTCCTCAATGATGGGGGCTGGTGGTCTCCTTGGCATTGCCATTCATGCGTCGGTCGGCGCCCGGATATCAGCGCAGAACTGGCTGGCGGTGGCATTTTGTGGCTCTGCGGTGGGCTCGCTTCTGCTTTCACAGTTGCCAGGCGTGCCGGTGCTGATGTTGTTGGGCGCGCTCGTGGGACTGCTGACCGGCTCAGTCTCTCCCATTCTCAACGCTGCCATCTACAACCGCACGCCGCCAGAACTTCTCGGCCGGGTACTCGGCACGGTCTCGGCGGTGATGCTGTCAGCCTCGCCCATGGTTATGCTTGCGGCCGGCGCGTTTGTCGACCTTGCTGGTCCGCTCCCTGGCCTCGTTGTATCGGCCGTGTTTGCGGGGCTCGTGGCTCTACTCTCGCTCCGTCTTCAATTTGCTACAATGGCGGCGGCAGCCACAGCCTCCGCCCCAACCCATACAGAAGGTGAACACTGATGCCCCGCCCCAAGCTCAAGTCCGATGACGAGGTACTCGAGGCCGCCACCGTAGTGCTGAAGCGTTGCGGTCCCATAGAGTTCACGCTCAGCGGAGTAGCAAAGGAGGTGGGGCTCTCCCGCGCAGCGTTAATCCAGCGCTTCACCAACCGCGATACGCTGCTGGTGAGGATGATGGAGCGCGGCGTCGAGCAGGTGCGGCATTACCTGAATGCGATACCGATAGGCGCAGGGCCGCAAGGGCTCTGGGAATTTTTGCAGGTGCTCGTTCGGAGCATGAACACTCGCAACGACTTCTCGGTGAACTATCTCATCTCCTGGTACGAGCTCCAGGTGCCGGAGCTACGCACGCTTGCGATCCAGCGGAACCGCGCGGTGGTGGAGGGGATCCGCAAGCGACTGCCCCCAGGTGCTCCTGCGGCAGCTGAGTTGCTCCTGCACTCGGTCATCGCTGGCGCGACGATGCAGTGGGCCGTCGATCCGGATGGTGAGCTAGCTGATCATGTGCTGGCTCAGATCGCTGCCATCCTGTGTTTAATGTTTCCCGAACACGACGATTTCCAACTCCTCCAGGCACATGCGTAAACGGAGGTGTGCAGAGTCCCTGCGGCAGGCGACGAACACGACCGTCGTCGATTAGTACCGGTACGGTCGGTGGTATCGAAGTCTTGATCACCACTCAGGTCTACGGCTTACAAATGGTGACCATCCCGATACTTGCGTCAGAGCACCGGGCCGATTCTTTGACAGTGAATCACTCCCGTAAGGTTGTGCCGGTGTGGGTGTCCCGGGTCGAGACGATACTCCGCCAATGCGCCCAGCAAACAACCTGGCCATCGCAGGTGGTGGGGAGCGGTGTGGCGGATGAGTTGGACAAGTTGGTGTAGCAGCACGAGCACGGCGAGATAACATCGCAGGAGTTCGACATGCTCAAGAGACAGCTGATTGCGAATCGCGATGCAGATTCATAACCCGATTGCGGGTTGGCTTCACTCCACCATCACCGAGCAGACTAGCACGGCGGGCTCTGTTGCAAAGATTGGCGGCAGTCAGAGGTAGGCTGTCGCTCTGCGCCGATCAGGCGGCTGCTGCGAAATGGTGGTTGAGCATGCCCATGGCCTCCGTCAGCGCCGAGGGCCCAATGCCAAAAGCTCTCTCCACAAGGCGCACCTCGCCCCTGATGCCGGGCTGCAGGCACCAGGGGCGAGCCTGTCCTTTGCGCAGGGCTCGCATGACTTCGAATCCCTTGATCGTGGCATAGGCCGTGGGGATCGATTTGAAACCGCGCACCGGCTTGATCAGTATCTTGAGCTTTCCGTGATCGGCCTCGATCACGTTATTGAGATACTTCACCTGCCGGTGGGCCGTCTCCCGGTCCAGCTTTCCTTCGCGCTTCAATTCGGTGATCGCTGCACCATAGCTCGGCGCTTTGTCGGTATTGAGCGTGGCAGGCTTTTCCCAGTGCTTCAGGCCTCGCAGGGCCTTGCCCAGGAACCGCTTCGCTGCCTTGGCGCTGCGGGTCGGCGACAGGTAGAAATCGATCGTGTCGCCCCGCTTGTCGACTGCCCGGTACAGGTAGGTCCACTTGCCCCGCACCTTGACGTAGGTTTCATCCAGGCGCCAGCTCGGATCAAAGCCACGCCGCCAGAACCAGCGCAGCCGCTTCTCCATCTCCGGGGCGTAGCACTGGACCCAGCGATAGATCGTCGTATGGTCGACCGAAATGCCGCGTTCCGCCAGCATTTCCTCAAGGTCGCGATAGCTGATCGGATAGCGACAATACCAGCGCACCGCCCACAGGATCACATCACCCTGGAAATGGCGCCACTTGAAATCCGTCATCGTTCCGTCCGTCCAATCTCCGCCAAGCATGCTCAAGCTTCACGATTTTTGCAACAGAGCCCACACGAGTATTGAGCATAGTCGAGATTGGTGCAGATCACTTCTGATATTGAACTGTCAGGAGCTGGCTGCACAACAGCCATTACGCCCAATCAACTGGTGCAGTCGTCTTCTGAAAATGACAATCCAGTTAGGGTATAGCTCAACCTGACATAGAAGCAAAAACTCAACCACCTTCTACCAACTCTCCGAACAGCTCCTTGACCTTTGTTTTCGCATCAGCAAGTGCAGTTCTGCCTTGTTCAGTGATGTCATAAACACGCCGTTCACGTCGCCCGGTGCGTTCGTGGCGTGAGGTCAGATAGCCTTTTTTTTCCAGGCCGTGCAGCATCGGGTACACGGTGCCAGCGCTCATCTCGTAGCCGTGTCGGCGTAGCTCTTCGATGATCCCCAGCCCAAAGACAGGTTCCTCGGCTGCATGGTGAAGGATGTGCAGGCGGATCAAACCGCCGTAGAGGTCTTTGTCAGTCATTTTTTGTGCCTCACAGAGCGACGCTCAACAGCCACCCAGCTGCACCGCTACCGAGGACAACCAGCCACGGCGGGAGCTTCCAGAACATAAGTGCGACAAGGGCAACTAATGCCAAGCCGAAGTCTTGCGGCTGAAAGATGGCGCTAGTCCATACAGGCTGATACAGCGCGGCCAGCAGCAAGCCGACTACAGCGGCATTGATCCCGGCCAGCGCAGCTTGGATGCCTGTATTGCGGCGCAAACGCTCCCAAAATGGCATTGATCCGACGACCAGCAAGAACGAGGGCGCGAAGATAGCCAGCAGACACACAATGCCGCCGATCCAGCCCGACGGGGCGGTGTTCATCGAGGCACCAAGAAACGCGGCGAACGTGAACAAAGGGCCGGGCACCGCTTGAGCTGCCCCGTACCCCGCGAGAAAGGATTCATTGTTGACCCAGCCGGAGGGCACCACTTCGGCTTGCAGTAATGGCAGCACAACGTGACCACCGCCGAACACCAGTGATCCGACACGATAGAAGGAATCCACCATTGCCATGGTTTGACTTGGCATCAGTTCGGCCAACACCGGCAGGCCAATCAGCAAGACAAAGAACAGCGAGAGCCAAAGCACGCCGGCCCGGTGACTGACCGTGATAGGTAGGGGGTCATGCTCAACAACTTTCGCTGGCTTGAACAATAACCGGCCTGCGATGCCTGCGATAGCAATCACGCCAACCTGTCCCCACGCGGACGGCACAAGTAAAACGACGCAGGTAGCAATTGCCATGATGGTGACTCGCAGCCCATCCGTGCATAGGTTACGCGCCATGCCCCATACTGCTTGAGCGACCACGGCCACAGCCACCACTTTTAAGCCATGCAACGCGCCCTGCGAGACGTAATCGCCATAGCTGGAGATGCCGAGCGCAAAAAGGATCAAGGCTATGGCAGACGGCAGCGTGAAGCCAGCCCAAGCAGCCAGCGCCCCGCTGTATCCAGCCCGAGACAGTCCTACCGCTATGCCGACCTGGCTGCTTGCAGGCCCTGGCAAGAACTGACAAAGCGCGACCAAGTCAGCATAGCTCCGTTCGGAGAGCCAGCGCCGCCGTGTGACAAATTCGGCGCGGAAGTAGCCCAAGTGCGCAATGGGGCCGCCAAAAGATGTCAATCCAAGCCGCAGAAAAATAAGAAAGACCGACCATGGTCTGCTGTCATCGGTAGGGTTATTCGTCATACTTTCGCCTTCATGATCTGCAACGAGTTGATCAATAATAAGCGAAATTCGATAACGAAATTCGATATAAATCTAGAAAAAAATACCTCTATGTGTACTACGCAGTTTTAGCTGTGGCTTTCACAGGAGCACGCTTACTTACGGCTTAGCGTGCTTTATTTTCCGTTTTCTGAGGCGATCCCTAGGAGCTCGGATCTCAGGACGAAGGTCTCCGCGAATGTCCGGTCGATCCGCGCGACGTCCCAGGCGGGCGTTCCCTTGGCGGACATCCACGCCGCAGCGTCGTGCATCAGCCGCACAACCTCGTCGATATCACCCGAGCAGGCGACCCGAACGTTCGGAGGCTCCTCGCTGTCCATTCGCTCCCCTGGCGCGGTATGAACCGCCGCCTCATAGTGCAGTTTGATCCTGACGAGCCCAGCATGTCTGCGCCCACCTTCGCGGAACCTGACCAGGGTCCGCTAGCGGGCGGCCGGAAGGTGAATGCTAGGCATGATCTAACCCTCGGTCTCTGGCGTCGCGACTGCGAAATTTCGCGAGGGTTTCCGAGAAGGTGATTGCGCTTCGCAGATCTCCAGGCGCGTGGGTGCGGACGTAGTCAGCGCCATTGCCGATCGCGTGAAGTTCCGCCGCAAGGCTCGCTGGACCCAGATCCTTTACAGGAAGGCCAACGGTGGCGCCCAAGAAGGATTTCCGCGACACCGAGACCAATAGCGGAAGCCCCAACGCCGACTTCAGCTTTTGAAGGTTCGACAGCACGTGCAGCGATGTTTCCGGTGCGGGGCTCAAGAAAAATCCCATCCCCGGATCGAGGATGAGCCGGTCGGCAGCGACCCCGCTCCGTCGCAAGGCGGAAACCCGCGCCTCGAAGAACCGCACAATCTCGTCGAGCGCGTCTTCGGGTCGAAGGTGACCGGTGCGGGTGGCGATGCCATCCCGCTGCGCTGAGTGCATAACCACCAGCCTGCAGTCCGCCTCAGCAATATCGGGATAGAGCGCAGGGTCAGGAAATCCTTGGATATCGTTCAGGTAGCCCACGCCGCGCTTGAGCGCATAGCGCTGGGTTTCCGGTTGGAAGCTGTCGATTGAAACACGGTGCATCTGATCGGACAGGGCGTCTAAGAGCGGCGCAATACGTCTGATCTCATCGGCCGGCGATACAGGCCTCGCGTCCGGATGGCTGGCGGCCGGTCCGACATCCACGACGTCTGATCCGACTCGCAGCATTTCGATCGCCGCGGTGACAGCGCCGGCGGGGTCTAGCCGCCGGCTCTCATCGAAGAAGGAGTCCTCGGTGAGATTCAGAATGCCGAACACCGTCACCATGGCGTCGGCCTCCGCAGCGACTTCCACGATGGGGATCGGGCGAGCAAAAAGGCAGCAATTATGAGCCCCATACCTACAAAGCCCCACGCATCAAGCTTTTGCCCATGAAGCAACCAGGCAATGGCTGTAATTATGACGACGCCGAGTCCCGACCAGACTGCATAAGCAACACCGACAGGGATGGATTTCAGAACCAGAGAAAGAAAATAAAATGCGATGCCATAACCGATTATGACAACGGCGGAAGGGGCAAGCTTAGTAAAGCCCTCGCTAGATTTTAATGCGGATGTTGCGATTACTTCGCCAACTATTGCGATAACAAGAAAAAGCCAGCCTTTCATGATATATCTCCCAATTTGTGTAGGGCTTATTATGCACGCTTAAAAATAATAAAAGCAGACTTGACCTGATAGTTTGGCTGTGAGCAATTATGTGCTTAGTGCATCTAACGCATAGTTGAGCGGCGGGCGCAGCCCGTCCGCTTGAACGCCGAGTTAGGCATCAGATGCCCTCGGCGCGGGTCGATGCACTTTTCGCACATGCCGCTCAACGCAAGATTCTCTCAATCGTTGCTTTGGCATATCGAACGAACGCGGCCGTCTCTTCGACGCGCATTGCTAGGTCGTCGTCCTCGCTACCCAGGTACGCCGCGCGTGCCTTGCAGATGAGGGGCCGATGCTCGGCAGGCAAACGCTCCGATACCCATGCGGCAGCAACGTCCTTAGGAGCAATGAGACCAGTTGAAGCGCTGTACCAAATGCGAGCAAGAGCAAGAACGACGTTCCGCTCGTCACCCTTCCAATCCGACTCTGCATTCCACTGGGCAATAGTGTCGAAAAGCGCCTTGGAGAAATGCTCCTTCGGCACCGGCTCGAAAAACGTGGCTGCGGATGGGCCTAGAAGCGCAAGGCTGTGTTGCCTCGCCTTGGTCAGCAAAATCGCAAGATCGTGATCCAGAACGGCAGGCTCGAACGTTCCGGAAAGGATGTCGTGGCGGAGCCACTCACCGAACTGAAGCTCACGCCGCGCCGGATAGCGCCAAGGCACTACTTCGCTTCGAGCGACAACAGTTAGCTCCAGCGGTCGCCATGTTCCGCCATCGCCTGGCGGTGATGAGACTTTCAGCAAATCGAGCATTAGCGCCTGCCGGAGCGAATCGTTAGGTGCGGCGCTGACGGTCACGAGCAAGTCTATGTCGCTGTCCGGCTTCAGCCCTCCATCGATCGCAGATCCGAACAGGTGGATTGTGTCCAGTGTCGCAGCCAGATGGCGCTCGATCACCGCGCGAGCGTGGGACAGCTGCTTGAAAACTTGTGCAGGGAAAAATTCACCCATGATGCCTAACGTTAAGTTCAGCGGCAGCTTTTAAGTTGCGGCTTTGTGGAATACTTTTGCGCAGCAAAACCACAAAGACGCGACTTAAAAGCTGTCCAAGGAGCGAAGCGACTGGTGCTGCAACGCATTGTTAGCCTTTTTTCCAAATCTGGTATGTATAATTTATATTAGACATAAAAAACTGTTCAAAAACCAAATTGAAATTCTCAGGCATTATAGGGAATTTGATATCACCTTCGACTTCAACGTGAACAGTAGACAAATGAATTATATCTGCTTTTTCAATAAGGCTATTATAGATTTGACCCCCGCCAGAGACATATACATGATCTGTAACTTTTGATAGCTCTTTCAAAGCATTTTCTATTGAAGGAAAAACTAGGACGTTTTCATTTGAGCTTGAAATTCCGTTCTTTGACACTACTGCATATTTGCGATTTGGAAGAACACCCATAGAGTCAAATGTTTTTCTTCCGACAAGGAGCCATTGATTATATGTGAGCGCTTTAAAGAGTAGTTGCTCACCTTTTACTGACCACGGGATATCAGGACCACTACCGATTACGCCATTTTCTGACACTGCAGAAATCAATGATATTTTCAATTTAACTCCCTTAATGGCTAACTTTGTTTTAGGGCGACTGCCCTGCTGCGTAACATCGTTGCTGCTCCATAACATCAAACATCGACCCACGGCGTAACGCGCTTGCTGCTTGGATGCCCGAGGCATAGACTGTACAAAAAAACAGTCATAACAAGCCATGAAAACCGCCACTGCGCCGTTACCACCGCTGCGTTCGGTCAAGGTTCTGGACCAGTTGCGTGAGCGCATACGCTACTTGCATTACAGTTTACGAACCGAACAGGCTTATGTCCACTGGGTTCGTGCCTTCATCCGTTTCCACGGTGTGCGTCACCCGGCAACCTTGGGCAGCAGCGAAGTCGAGGCATTTCTGTCCTGGCTGGCGAACGAGCGCAAGGTTTCGGTCTCCACGCATCGTCAGGCATTGGCGGCCTTGCTGTTCTTCTACGGCAAGGTGCTGTGCACGGATCTGCCCTGGCTTCAGGAGATCGGAAGACCTCGGCCGTCGCGGCGCTTGCCGGTGGTGCTGACCCCGGATGAAGTGGTTCGCATCCTCGGTTTTCTGGAAGGCGAGCATCGTTTGTTCGCCCAGCTTCTGTATGGAACGGGCATGCGGATCAGTGAGGGTTTGCAACTGCGGGTCAAGGATCTGGATTTCGATCACGGCACGATCATCGTGCGGGAGGGCAAGGGCTCCAAGGATCGGGCCTTGATGTTACCCGAGAGCTTGGCACCCAGCCTGCGCGAGCAGCTGTCGCGTGCACGGGCATGGTGGCTGAAGGACCAGGCCGAGGGCCGCAGCGGCGTTGCGCTTCCCGACGCCCTTGAGCGGAAGTATCCGCGCGCCGGGCATTCCTGGCGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCCGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGTGCGGTATTATCCCGTGTTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCTGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGCAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATACTGAAATAATTGGGGGGTGATTACCACCGTAAATTTTTCCCAGACGGTAAAAAGAATTACAGACTTTTATCAACTAAAAAAGTGGCCTGAATATCAGCACTACCGTCAATACCATTTCAGTAAATCAGGATAGGGAAAAATTGGCAAACCCAGTTCTTCACTTGCTCTGCGGGTTCTGTCACCCTGACGTCATACGGTAATCCCCCCCAATTCATTAATCGACAAGGTCATCCTGATGAAACAAATTACCATGAGCGATATGCAGCAACAGAGCGCAGCGGCTGTGCAATCGCCTCGGCTCCGCGCGCACCGTAATTTCCATCCAGAATTAAGCGATTCGGTCCAACGTCTGGCTATTGCCATGGAACCTGGGACCTACGTGCGCCCGCACCGACACCCTCACACCTTCGAGCTACTGTTGCCATTAAGGGGTCGTTTCGTGGTGCTGAATTTTGACGATCGGGGTACCGTCACCCATCGGGCGATATTGGGGGAAACCTGTACGGTGCTGGAGATGGCCGCAGGAACCTGGCATGCCGTGCTGTCGCTGGATACCGGTGGCATAATTTTTGAAGTAAAACACGGTGGCTATCAACCCGTGGCTGCCGATGACTATGCGCACTGGGCTCCAGCGGAAGGAGAACCAGGAACCACGGAGCTTATGGCCTGGTATGCGCAAGCGCAGGTGGGCGACAGCACTTTTGCCGTCTAAGGCGATAAACAAAAACGGAATGAGTTTCCCCATTCCGTTTCCGCTATTACAAACCGTCGGTGACGATTTTAGCCGCCGACGCTAATACATCGCGACGGCTTTCTGCCTTAGGTTGAGGCTGGGTGAAGTAAGTGACCAGAATCAGCGGCGCACGATCTTTTGGCCAGATCACCGCGATATCGTTGGTGGTGCCATAGTCACCGCTGCCGGTTTTATCCCCCACAACCCAGGAAGCAGGCAGTCCAGCCTGAATGCTCGCTGCACCGGTGGTATTGCCTTTCATCCATGTCACCAGCTGCGCCCGTTGGCTGTCGCCCAATGCTTTACCCAGCGTCAGATTCCGCAGAGTTTGCGCCATTGCCCGAGGTGAAGTGGTATCACGCGGATCGCCCGGAATGGCGGTGTTTAACGTCGGCTCGGTACGGTCGAGACGGAACGTTTCGTCTCCCAGCTGTCGGGCGAACGCGGTGACGCTAGCCGGGCCGCCAACGTGAGCAATCAGCTTATTCATCGCCACGTTATCGCTGTACTGTAGCGCGGCCGCGCTAAGCTCAGCCAGTGACATCGTCCCATTGACGTGCTTTTCCGCAATCGGATTATAGTTAACAAGGTCAGATTTTTTGATCTCAACTCGCTGATTTAACAGATTCGGTTCGCTTTCACTTTTCTTCAGCACCGCGGCCGCGGCCATCACTTTACTGGTGCTGCACATCGCAAAGCGCTCATCAGCACGATAAAGTATTTGCGAATTATCTGCTGTGTTAATCAATGCCACACCCAGTCTGCCTCCCGACTGCCGCTCTAATTCGGCAAGTTTTTGCTGTACGTCCGCCGTTTGCGCATACAGCGGCACACTTCCTAACAACAGCGTGACGGTTGCCGTCGCCATCAGCGTGAACTGGCGCAGTGATTTTTTAACCATGGGATTCCTTATTCTGGAAGAGACGAAATAACAACAACATGAATAGTCAATATTTTACCTGAAGCGAGCCACAACGCGTCCGATTTTATGCTTCCGAAAGGCAAATACGGACGTCGCCAGAATGAAACCTAAATTCCACGTGTGTTTTTTATTAGCTTCAAAAATCACTATTTCACGAAGAATTTAGACTGCTTCTCACGCATTGTAACATTATTTACAACCACCTTTCAATCATTTTTGATAAATCATTGATTTCATCTTTGCTGCAATGATACTTAATAAACTCTGCAAGTTATCCACAGAGCAACACTCAATTTTATTGATGATATTCTTATTATACCAGACATTTTTCATACACTCCCTTGTACGGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGTTGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCAT
Protein sequences of DBSCAN-SWA_1 >CP034964|1798:30239|24582_25056_-|QAS88042.1|DBSCAN-SWA MKISLISAVSENGVIGSGPDIPWSVKGEQLLFKALTYNQWLLVGRKTFDSMGVLPNRKYAVVSKNGISSSNENVLVFPSIENALKELSKVTDHVYVSGGGQIYNSLIEKADIIHLSTVHVEVEGDIKFPIMPENFNLVFEQFFMSNINYTYQIWKKG >CP034964|1798:30239|16963_18202_+|QAS87957.1|DBSCAN-SWA MSERRYSPLATLFAATFLFRIGNAVAALALPWFVLSHTKSAAWAGATAASSVIATIIGAWVGGGLVDRFGRAPVALISGVVGGVAMASIPLLDAVGALSNTGLIACVVLGAAFDAPGMAAQDSELPKLGHVAGLSVERVSSLKAVIGNVAILGGPALGGAAIGLLGAAPTLGLTAFCSVLAGLLGAWVLPARAARTMTTTATLSMRAGVAFLWSEPLLRPLFGIVMIFVGIVGANGSVIMPALFVDAGRQVAELGLFSSMMGAGGLLGIAIHASVGARISAQNWLAVAFCGSAVGSLLLSQLPGVPVLMLLGALVGLLTGSVSPILNAAIYNRTPPELLGRVLGTVSAVMLSASPMVMLAAGAFVDLAGPLPGLVVSAVFAGLVALLSLRLQFATMAAAATASAPTHTEGEH >CP034964|1798:30239|12554_13097_+|QAS87951.1|DBSCAN-SWA MIIWINGPFGAGKTTLAKRLRDRRSKSLIFDPEEIGFVVKETVPMPASGDYQDLPLWRGLTIAAVREIRRNYSQDIIIPMTLVHPDYLTEILDGVRRIDDQLLHIFLTLNEDLLRHRIANQTMHPDPNRNAEIREWRLANVARCLAARERLPCTTRVLDSGAHTSDELAAMVLDGIDGRT >CP034964|1798:30239|5506_6244_+|QAS87944.1|DBSCAN-SWA MNKNIKYSQNFLTSEKVLNQIIKQLNLKETDTVYEIGTGKGHLTTKLAKISKQVTSIELDSHLFNLSSEKLKLNTRVTLIHQDILQFQFPNKQRYKIVGNIPYHLSTQIIKKVVFESHASDIYLIVEEGFYKRTLDIHRTLGLLLHTQVSIQQLLKLPAECFHPKPRVNSVLIKLTRHTTDVPDKYWKLYTYFVSKWVNREYRQLFTKNQFHQAMKHAKVNNLSTVTYEQVLSIFNSYLLFNGRK >CP034964|1798:30239|6248_6359_+|QAS87945.1|DBSCAN-SWA MSRFCKFGKLHVTKGNVDKLLGILLTASKELKRSLD >CP034964|1798:30239|23110_23458_-|QAS87967.1|DBSCAN-SWA MKGWLFLVIAIVGEVIATSALKSSEGFTKLAPSAVVIIGYGIAFYFLSLVLKSIPVGVAYAVWSGLGVVIITAIAWLLHGQKLDAWGFVGMGLIIAAFLLARSPSWKSLRRPTPW >CP034964|1798:30239|13578_13770_-|QAS87952.1|DBSCAN-SWA MAIALMGAGFSATDTSDAVNILYPITMTVQANKAWQASGLKKSFFLPSHKKVRGGEKISYARL >CP034964|1798:30239|6873_7323_+|QAS87946.1|DBSCAN-SWA MTTILSDYNKSILIELLNTNRQIIVVHGDDIDDYYTDSCYDIIFIDYYDDDDFTYNGQEIWRGDSIYLVKSYQDEIQNRIEKGQELIFVNKENDISFSESILTYYHLFDYAGKTFEIDGQIIDEQTKVFAAPERRCACRTCWKVPARKT >CP034964|1798:30239|11681_12542_+|QAS87950.1|DBSCAN-SWA MHTRKAITEALQKLGVQTGDLLMVHASLKAIGPVEGGAETVVAALRSAVGPTGTVMGYASWDRSPYEETLNGARLDDEARRTWLPFDPATAGTYRGFGLLNQFLVQAPGARRSAHPDASMVAVGPLAETLTEPHELGHALGEGSPVERFVRLGGKALLLGAPLNSVTALHYAEAVADIPNKRWVTYEMPMLGRDGEVAWKTASDYDSNGILDCFAIEGKPDAVETIANAYVKLGRHREGVVGFAQCYLFDAQDIVTFGVTYLEKHFGTTPIVPPHEAVERSCEPSG >CP034964|1798:30239|22277_23117_-|QAS87966.1|DBSCAN-SWA MVTVFGILNLTEDSFFDESRRLDPAGAVTAAIEMLRVGSDVVDVGPAASHPDARPVSPADEIRRIAPLLDALSDQMHRVSIDSFQPETQRYALKRGVGYLNDIQGFPDPALYPDIAEADCRLVVMHSAQRDGIATRTGHLRPEDALDEIVRFFEARVSALRRSGVAADRLILDPGMGFFLSPAPETSLHVLSNLQKLKSALGLPLLVSVSRKSFLGATVGLPVKDLGPASLAAELHAIGNGADYVRTHAPGDLRSAITFSETLAKFRSRDARDRGLDHA >CP034964|1798:30239|29534_30239_-|QAS87972.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIINATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034964|1798:30239|15235_15940_+|QAS87955.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034964|1798:30239|22168_22348_+|QAS87965.1|DBSCAN-SWA MNRRLIVQFDPDEPSMSAPTFAEPDQGPLAGGRKVNARHDLTLGLWRRDCEISRGFPRR >CP034964|1798:30239|20585_21791_-|QAS87963.1|DBSCAN-SWA MTNNPTDDSRPWSVFLIFLRLGLTSFGGPIAHLGYFRAEFVTRRRWLSERSYADLVALCQFLPGPASSQVGIAVGLSRAGYSGALAAWAGFTLPSAIALILFALGISSYGDYVSQGALHGLKVVAVAVVAQAVWGMARNLCTDGLRVTIMAIATCVVLLVPSAWGQVGVIAIAGIAGRLLFKPAKVVEHDPLPITVSHRAGVLWLSLFFVLLIGLPVLAELMPSQTMAMVDSFYRVGSLVFGGGHVVLPLLQAEVVPSGWVNNESFLAGYGAAQAVPGPLFTFAAFLGASMNTAPSGWIGGIVCLLAIFAPSFLLVVGSMPFWERLRRNTGIQAALAGINAAVVGLLLAALYQPVWTSAIFQPQDFGLALVALVALMFWKLPPWLVVLGSGAAGWLLSVAL >CP034964|1798:30239|16061_16967_+|QAS87956.1|DBSCAN-SWA MTVVTTADTSQLYALAARHGLKLHGPLTVNELGLDYRIVIATVDDGRRWVLRIPRRAEVSAKVEPEARVLAMLKNRLPFAVPDWRVANAELVAYPMLEDSTAMVIQPGSSTPDWVVPQDSEVFAESFATALAALHAVPISAAVDAGMLIRTPTQARQKVADDVDRVRREFVVNDKRLHRWQRWLDDDSSWPDFSVVVHGDLYVGHVLIDNTERVSGMIDWSEARVDDPAIDMAAHLMVFGEEGLAKLLLTYEAAGGRVWPRLAHHIAERLAFGAVTYALFALDSGNEEYLAAAKAQLAAAE >CP034964|1798:30239|9880_10585_+|QAS87949.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIINATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034964|1798:30239|20269_20575_-|QAS87962.1|DBSCAN-SWA MTDKDLYGGLIRLHILHHAAEEPVFGLGIIEELRRHGYEMSAGTVYPMLHGLEKKGYLTSRHERTGRRERRVYDITEQGRTALADAKTKVKELFGELVEGG >CP034964|1798:30239|7851_9384_+|QAS87947.1|transposase|DBSCAN-SWA MPRLATSRRQAAGCAPLPSAHTGSRYARHAPERTLLYALVEAHYPDFIARIEAEGRSLPGYVREAFDAYLRCGVLEHGFLRVVCEHCRAERLVAFSCKKRGFCPSCGARRMAESARHLVEEVFGPRPVRQWVLSFPYPLRFLFASKPEAIGPVLGIVQRVIAGWLADQAGIDRASAQCGAVTLIQRFGSALNLNIHFHMLWLDGVYVEATELPRRELRLHRARAPTTAQLTQLAAAIAHRVCRHLTRKGWLEGEGESAFLADSAAGDDSMDGLRMSSITYRIATGRDAGCKVVTLQTLPGDAGSLEGEAGKVGGFSLHAGVAAEAHESHKLEKLCRYITRPAISEKRLSIALQGRVRYQLKTPWRNGTTHVEWDPVDFIAKLAALVPPPRAHLTRFHGVFAPNAALRAQLTPSGRGRRHDAAVEPADASANDAPRSPEEKRRSMSWAQRLKRVFSIDVTACVHCGGTVRIVASIEEPAAIRAILGHFVKQGAREEAHYRPAARAPPVQAA >CP034964|1798:30239|18201_18786_+|QAS87958.1|DBSCAN-SWA MPRPKLKSDDEVLEAATVVLKRCGPIEFTLSGVAKEVGLSRAALIQRFTNRDTLLVRMMERGVEQVRHYLNAIPIGAGPQGLWEFLQVLVRSMNTRNDFSVNYLISWYELQVPELRTLAIQRNRAVVEGIRKRLPPGAPAAAELLLHSVIAGATMQWAVDPDGELADHVLAQIAAILCLMFPEHDDFQLLQAHA >CP034964|1798:30239|21946_22150_-|QAS87964.1|DBSCAN-SWA MDSEEPPNVRVACSGDIDEVVRLMHDAAAWMSAKGTPAWDVARIDRTFAETFVLRSELLGIASENGK >CP034964|1798:30239|20071_20254_+|QAS87961.1|DBSCAN-SWA MLKLHDFCNRAHTSIEHSRDWCRSLLILNCQELAAQQPLRPINWCSRLLKMTIQLGYSST >CP034964|1798:30239|23663_24452_-|QAS87968.1|DBSCAN-SWA MGEFFPAQVFKQLSHARAVIERHLAATLDTIHLFGSAIDGGLKPDSDIDLLVTVSAAPNDSLRQALMLDLLKVSSPPGDGGTWRPLELTVVARSEVVPWRYPARRELQFGEWLRHDILSGTFEPAVLDHDLAILLTKARQHSLALLGPSAATFFEPVPKEHFSKALFDTIAQWNAESDWKGDERNVVLALARIWYSASTGLIAPKDVAAAWVSERLPAEHRPLICKARAAYLGSEDDDLAMRVEETAAFVRYAKATIERILR >CP034964|1798:30239|2803_3136_+|QAS87941.1|DBSCAN-SWA MERGEIWLVSLDPTAGHEQQGTRPVLIVTPAAFNRVTRLPVVVPVTSGGNFARTAGFAVSLDGVGIRTTGVVRCDQPRTIDMKARGGKRLERVPETIMNEVLGRLSTILT >CP034964|1798:30239|5285_5381_+|QAS87943.1|DBSCAN-SWA MLVFQMRYQMRYVDKTSTVLKQTKKSDYADK >CP034964|1798:30239|13775_14021_+|QAS87953.1|DBSCAN-SWA MQAYRAMVNSYSLSDDSGVMAAAAITHFLFGQAVFSYLNGWSVLIGPGTGLDSTGCKYARDLMGLVAFTAFIVTFLFRGYS >CP034964|1798:30239|1798_2452_+|QAS87939.1|protease|DBSCAN-SWA MIQTRNQYLQFMLVMLAAWGISWGARFVMEQAVLLYGSGKNYLFFSHGTVLMYLLCVFLVYRRWIAPLPVVGQLRNVGVPWLVGAMAVVYVGVFLLGKALALPAEPFMTKLFADKSIPDVILTLLTIFILAPLNEETLFRGIMLNVFRSRYCWTMWLGALITSLLFVAAHSQYQNLLTLAELFLVGLITSVARIRSGGLLLPVLLHMEATTLGLLFG >CP034964|1798:30239|14071_15199_+|QAS87954.1|DBSCAN-SWA MSQLSQLRSPAAVQAAIDEFVQLGRTKFLARHGYGKSRDFLVRDPKTGTDCDSKAIAGVAFGKQFPEQGPLTADSFSGGEATVVPALTRLGFRIIRIGEDWSEEEVLATVEDYFDMLRAEAAGEPYNKSEHNQALRQLLNGRSKSSVELKHQNISAVLDALGLPYINGYKPRGNSQLLLRKSVHAYVLEHQQTVGALVDALEEVKLPGDKTYRAALVEPPAREVLVRTPASLRQRLPRKFDYAARDEANRKLGRAGEQWVIGYEQQRLTELGHPELFQRLDWVSDTQGDGAGFDILSFEEDAHERFIEVKTTNGGVGSSFLVSHNELEFSKEAGDQFHLYRVFQFRDGPRLFTLPGDLSQHVHLKPTGTVANSRW >CP034964|1798:30239|9706_9904_-|QAS87948.1|DBSCAN-SWA MPAFEWVHVQLHQQKGMISLSPPTICNSAIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034964|1798:30239|27714_28191_+|QAS87970.1|DBSCAN-SWA MKQITMSDMQQQSAAAVQSPRLRAHRNFHPELSDSVQRLAIAMEPGTYVRPHRHPHTFELLLPLRGRFVVLNFDDRGTVTHRAILGETCTVLEMAAGTWHAVLSLDTGGIIFEVKHGGYQPVAADDYAHWAPAEGEPGTTELMAWYAQAQVGDSTFAV >CP034964|1798:30239|19278_20043_-|QAS87960.1|transposase|DBSCAN-SWA MTDFKWRHFQGDVILWAVRWYCRYPISYRDLEEMLAERGISVDHTTIYRWVQCYAPEMEKRLRWFWRRGFDPSWRLDETYVKVRGKWTYLYRAVDKRGDTIDFYLSPTRSAKAAKRFLGKALRGLKHWEKPATLNTDKAPSYGAAITELKREGKLDRETAHRQVKYLNNVIEADHGKLKILIKPVRGFKSIPTAYATIKGFEVMRALRKGQARPWCLQPGIRGEVRLVERAFGIGPSALTEAMGMLNHHFAAAA >CP034964|1798:30239|25958_26663_+|QAS87969.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034964|1798:30239|2544_2802_+|QAS87940.1|DBSCAN-SWA MHTTRLKRVGGSVMLTVPPALLNALSLGTDNEVGMVIDNGRLIVEPYRRPQYSLAELLAQCDPNAEISAEEREWLDAPATGQEEI >CP034964|1798:30239|18731_19088_+|QAS87959.1|DBSCAN-SWA MFNVSRTRRFPTPPGTCVNGGVQSPCGRRRTRPSSISTGTVGGIEVLITTQVYGLQMVTIPILASEHRADSLTVNHSRKVVPVWVSRVETILRQCAQQTTWPSQVVGSGVADELDKLV >CP034964|1798:30239|4446_5151_-|QAS87942.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034964|1798:30239|28237_29113_-|QAS87971.1|DBSCAN-SWA MVKKSLRQFTLMATATVTLLLGSVPLYAQTADVQQKLAELERQSGGRLGVALINTADNSQILYRADERFAMCSTSKVMAAAAVLKKSESEPNLLNQRVEIKKSDLVNYNPIAEKHVNGTMSLAELSAAALQYSDNVAMNKLIAHVGGPASVTAFARQLGDETFRLDRTEPTLNTAIPGDPRDTTSPRAMAQTLRNLTLGKALGDSQRAQLVTWMKGNTTGAASIQAGLPASWVVGDKTGSGDYGTTNDIAVIWPKDRAPLILVTYFTQPQPKAESRRDVLASAAKIVTDGL |
35 | Escherichia_phage(58.33%) | protease,transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034966_1 | 619513-619652 | Orphan |
NA
Consensus repeat of CP034966_1
|
1 spacers
spacers of CP034966_1
>1.1|619562|42|CP034966|CRISPRCasFinder ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT |
CRISPR arrays and Neighbor proteins around CP034966_1
The CRISPR arrays of CP034966_1 >merge|CP034966|1|619513-619652|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGTTTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA >CP034966|1|1|619513-619652|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT TTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA
>CP034966.1|QAS88669.1|618420_619461_+|permease MTGQSSSQAATPIQWWKPALFFLVVIAGLWYVKWEPYYGKAFTAAETHSIGKSILAQADANPWQAALDYAMIYFLAVWKAAVLGVILGSLIQVLIPRDWLLRTLGQSRFRGTLLGTLFSLPGMMCTCCAAPVAAGMRRQQVSMGGALAFWMGNPVLNPATLVFMGFVLSWGFAAIRLVAGLVMVLLIATLVQKWVRETPQTQAPVEIDIPEAQGGFFSRWGRALWTLFWSTIPVYILAVLVLGAARVWLFPHADGTVDNSLMWVVAMAVAGCLFVIPTAAEIPIVQTMMLAGMGTAPALALLMTLPAVSLPSLIMLRKAFPAKALWLTGAMVAVSGVIVGGLALLF >CP034966.1|QAS88668.1|617712_618348_+|NAD-dependent-epimerase/dehydratase-family-protein MSQVLITGATGLVGGHLLRMLINEPKVNAIAAPTRRPLGDMPGVFNPHDPQLTDALAQVTDPIDIVFCCLGTTRREAGSKEAFIHADYTLVVDTALTGRRLGAQHMLVVSAMGANAHSPFFYNRVKGEMEEALIAQNWPKLTIARPSMLLGDRSKQRMNETLFAPLFRLLPGNWKSIDARDVARVMLAESMRPEHEGVTILSSSELRKRAE >CP034966.1|QAS88667.1|617066_617585_-|type-1-glutamine-amidotransferase MSKKIAVLITDEFEDSEFTSPADEFRKAGHEVITIEKQAGKTVKGKKGEASVTIDKSIDEVTPAEFDALLLPGGHSPDYLRGDNRFVTFTRDFVNSGKPVFAICHGPQLLISADVIRGRKLTAVKPIIIDVKNAGAEFYDQEVVVDKDQLVTSRTPDDLPAFNREALRLLGA >CP034966.1|QAS88666.1|616643_617087_+|hypothetical-protein METLIAISRWLAKQHVVTWCVQQEGELWCANAFYLFDAQKVAFYILTEEKTRHAQMSGPQAAVAGTVNGQPKTVALIRGVQFKGEIRRLEGEESDLARKAYNRRFPVARMLSAPVWEIRLDEIKFTDNTLGFGKKMIWLRDSGTEQA >CP034966.1|QAS88665.1|616290_616593_-|DNA-damage-response-exodeoxyribonuclease-YhbQ MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAFSAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD >CP034966.1|QAS88664.1|615800_616304_+|N-acetyltransferase MLIRVEIPIDAPGIDALLRRSFESDAEAKLVHDLREDGFLTLGLVATDDEGQVIGYVAFSPVDVQGEDLQWVGMAPLAVDEKYRGQGLARQLVYEGLDSLNEFGYAAVVTLGDPALYSRFGFELAAHHDLRCRWPGTESAFQVHRLADDALNGVTGLVEYHEHFNRF >CP034966.1|QAS88663.1|615282_615807_+|SCP2-domain-containing-protein MLDKLRSRIVHLGPSLLSVPVKLTPFALKRQVLEQVLSWQFRQALDDGELEFLEGRWLSIHVRDIDLQWFTSVVNGKLVVSQNAQADVSFSADASDLLMIAARKQDPDTLFFQRRLVIEGDTELGLYVKNLMDAIELEQMPKALRMMLLQLADFVEAGMKNAPETKQTSVGEPC >CP034966.1|QAS88662.1|614078_615074_-|U32-family-peptidase MELLCPAGNLPALKAAIENGADAVYIGLKDDTNARHFAGLNFTEKKLQEAVSFVHQHRRKLHIAINTFAHPDGYARWQRAVDMAAQLGADALILADLAMLEYAAERYPHIERHVSVQASATNEEAINFYHRHFDVARVVLPRVLSIHQVKQLARVTPVPLEVFAFGSLCIMSEGRCYLSSYLTGESPNTVGACSPARFVRWQQTPQGLESRLNEVLIDRYQDGENAGYPTLCKGRYLVDGERYHALEEPTSLNTLELLPELMAANIASVKIEGRQRSPAYVSQVAKVWRQAIDRCKADPQNFVPQSAWMETLGSMSEGTQTTLGAYHRKWQ >CP034966.1|QAS88661.1|613191_614070_-|U32-family-peptidase MKYSLGPVLWYWPKETLEEFYQQAATSSADVIYLGEAVCSKRRATKVGDWLEMAKSLAGSGKQIVLSTLALVQASSELGELKRYVENGEFLIEASDLGVVNMCAERKLPFVAGHALNCYNAVTLKILLKQGMMRWCMPVELSRDWLVNLLNQCDELGIRNQFEVEVLSYGHLPLAYSARCFTARSEDRPKDECETCCIKYPNGRNVLSQENQQVFVLNGIQTMSGYVYNLGNELASMQGLVDVVRLSPQGTDTFAMLDAFRANENGAAPLPLTANSDCNGYWRRLAGLELQA >CP034966.1|QAS88660.1|611978_612986_-|LLM-class-flavin-dependent-oxidoreductase MTDKTIAFSLLDLAPIPEGSSAREAFSHSLDLARLAEKRGYHRYWLAEHHNMTGIASAATSVLIGYLAANTTTLHLGSGGVMLPNHSPLVIAEQFGTLNTLYPGRIDLGLGRAPGSDQRTMMALRRHMSGDIDNFPRDVAELVDWFDARDPNPNVRPVPGYGEKIPVWLLGSSLYSAQLAAQLGLPFAFASHFAPDMLFQALHLYRSNFKPSARLEKPYAMVCINIIAADSNRDAEFLFTSMQQAFVKLRRGETGQLPPPIQNMDQFWSPSEQYGVQQALSMSLVGDKAKVRHGLQSILRETDADEIMVNGQIFDHQARLHSFELAMDVKEELLG >CP034966.1|QAS88670.1|619665_620241_-|divisome-associated-lipoprotein-YraP MKALSPIAVLISALLLQGCVAAAVVGTAAVGTKAATDPRSVGTQVDDGTLEVRVNSALSKDEQIKKEARINVTAYQGKVLLVGQSPNAELSARAKQIAMGVDGANEVYNEIRQGQPIGLGEASNDTWITTKVRSQLLTSDLVKSSNVKVTTENGEVFLMGLVTEREAKAAADIASRVSGVKRVTTAFTFIK >CP034966.1|QAS88671.1|620250_620841_-|DnaA-initiator-associating-protein-DiaA MQERIKACFTESIQTQIAAAEALPDAISRAAMTLVQSLLNGNKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIANDRLHDEVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVALTGYDGGELAGLLGPQDVEIRIPSHRSARIQEMHMLTVNCLCDLIDNTLFPHQDD >CP034966.1|QAS88672.1|620860_621256_-|YraN-family-protein MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS >CP034966.1|QAS88673.1|621213_623250_-|penicillin-binding-protein-activator MVPSTFSRLKAARCLPVVLAALIFAGCGTHTPDQSTAYMQGTAQADSAFYLQQMQQSSDDTRINWQLLAIRALVKEGKTGQAVELFNQLPQELNDSQRREKTLLAVEIKLAQKDFAGAQNLLAKITPADLEQNQQARYWQAKIDASQGRPSIDLLRALIAQEPLLGAKEKQQNIDATWQALSSMTQEQANTLVINADENILQGWLDLQRVWFDNRNDPDMMKAGIADWQKRYPNNPGAKMLPTQLVNVKAFKPASTNKIALLLPLNGQAAVFGRTIQQGFEAAKNIGTQPVAAQVAAAPAADVAEQPQPQTVDGVASPAQASVSDLTGEQPAAQPVPVSAPATSTAAVSAPANPSAELKIYDTSSQPLSQILSQVQQDGASIVVGPLLKNNVEELLKSNTPLNVLALNQPENIENRVNICYFALSPEDEARDAARHIRDQGKQAPLVLIPRSSLGDRVANAFAQEWQKLGGGTVLQQKFGSTSELRAGVNGGSGIALTGSPITPRATTDSGMTTNNPTLQTTPTDDQFTNNGGRVDAVYIVATPGEIAFIKPMIAMRNGSQSGATLYASSRSAQGTAGPDFRLEMEGLQYSEIPMLAGGNLPLMQQALSAVNNDYSLARMYAMGVDAWSLANHFSQMRQVQGFEINGNTGSLTANPDCVINRKLSWLQYQQGQVVPAS >CP034966.1|QAS88674.1|623314_624175_+|16S-rRNA-(cytidine(1402)-2'-O)-methyltransferase MKQHQSADNSQGQLYIVPTPIGNLADITQRALEVLQAVDLIAAEDTRHTGLLLQHFGINARLFALHDHNEQQKAETLLAKLQEGQNIALVSDAGTPLINDPGYHLVRTCREAGIRVVPLPGPCAAITALSAAGLPSDRFCYEGFLPAKSKGRRDALKAIEAEPRTLIFYESTHRLLDSLEDIVAVLGESRYVVLARELTKTWETIHGAPVGELLAWVKEDENRRKGEMVLIVEGHKAQEEDLPADALRTLALLQAELPLKKAAALAAEIHGVKKNALYKYALEQQG >CP034966.1|QAS88675.1|624217_625309_-|fimbrial-protein MKRAPLITGLLLISTSCAYASSGGCGADSTSGATNYSSVVDDVTVNQTDNVTGREFTSATLSSTNWQYACSCSAGKAVKLVYMVSPVLTTTGHQAGYYKLNDSLDIKTTLKANDIPGLVTDQTVSVNTRFTQIKSNTVYSAATQTGVCQGDTSRYGPVNIGANTTFTLYVTKPFLGSMTIPKTDIAVIKGAWVDGMGSPSTGDFHDLVKLSIQGNLTAPQSCKINQGDVIKVNFGFINGQKFTTRNAMPDGFTPVDFDITYDCGDTSKIKNSLQMRIDGTTGVVDQYNLVARRRSSDNAPDVGIRIENLGGGVANIPFQNGILPVDPSGHGTVNMRAWPVNLVGGELETGKFQGTATITVIVR >CP034966.1|QAS88676.1|625319_627692_-|fimbrial-biogenesis-outer-membrane-usher-protein MLETTKSGMQTTDLSRFSKKYAQLPGTYQVDIWLNKKKVSQKKITFTANAEQLLQPQFTVEQLRELGIKVDEIPALAEKDDDSVINSLEQIIPGTAAEFDFNHQRLNLSIPQIALYRDARGYVSPSRWDDGIPTLFTNYSFTGSDNRYRQGNRSQRQYLNMQNGANFGPWRLRNYSTWTRNDQTSSWNTISSYLQRDIKALKSQLLLGESATSGSIFSSYTFTGVQLASDDNMLPNSQRGFAPTVRGIANSSAIVTIRQNGYVIYQSNVPAGAFEINDLYPSSNSGDLEVTIEESDGTQRRFIQPYSSLPMMQRPGHLKYSATAGRYRADANSDSKEPEFAEATAIYGLNNTFTLYSGLLGSEDYYALGIGIGGTLGALGALSMDINRADTQFDNQHSFHGYQWRTQYIKDIPETNTNIAVSYYRYTNDGYFSFDEANTRNWDYNSRQKSEIQFNISQTIFDGVSLYASGSQQDYWGNNEKNRNISVGVSGQQWGIGYSLNYQYSRYTDQNNDRALSLNLSIPLERWLPRSRVSYQMTSQKDRPTQHEMRLDGSLLDDGRLSYSLEQSLDDDNNHNSSVNASYRSPYGTFSAGYSYGNDSSQYNYGVTGGVVIHPHGVTLSQYLGNAFALIDANGASGVRIQNYPGIATDPFGYAVVPYLTTYQENRLSVDTTQLPDNVDLEQTTQFVVPNRGAMVAARFNANIGYRVLVTVSDRNGKPLPFGALASNDETGQQSIVDEGGILYLSGISSKSQSWTVRWGNQADQQCQFAFSTPDSEPTTSVLQGTAQCH >CP034966.1|QAS88677.1|628642_629338_-|molecular-chaperone MSKRTFAVIITLLCSFCIGQALAGGIVLQRTRVIYDASRKEAALPVANKGAETPYLLQSWVDNIDGTSRAPFIITPPLFRLEAGDDSSLRIIKTADNLPENKESLFYINVRAIPAKKKSDNVNANELTLVFKTRIKMFYRPAHLKGRVNDAWKSLEFKRSDHSLNIYNPTEYYVVFAGLAVDKTDLTSKIEYIAPGEHKQLPLPASGGKNVKWAAINDYGGSSGTETRPLQ >CP034966.1|QAS88678.1|629417_630002_-|type-1-fimbrial-protein MNKVTKTAIAGLLALFAGNAAATDGEIVFDGEILKSACEINDSDKKIEVALGHYNAEQFRSVGDRSPKIPFTIPLVNCPVTGWEHDNGNVEASFRLWLETRDNGTVPNFPNLAKVGSFAGTAATGVGIRIDDAESGNLMPLNAMGNDNTVYQIPADSAGIVNVDLIAYYVSTVEASEITPGEADAVVNVTLDYR >CP034966.1|QAS88679.1|630402_631158_-|galactosamine-6-phosphate-isomerase MERGTASGGASLLKEFHPVQTLQQVENYTALSERASEYLLAVIRSKPDAVICLATGATPLLTYHYLVEKIHQQQVDVSQLTFVKLDEWVDLPLTMPGTCETFLQQHIVQPLGLREDQLISFRSEEINETECERVTNLIARKGGLDLCVLGLGKNGHLGLNEPGESLQPACHISQLDARTQQHEMLKTAGRPVTRGITLGLKDILNAREVLLLVTGEGKQDATERFLTAKVSTAIPASFLWLHSNFICLINT |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034966_2 | 649468-649719 | Orphan |
NA
Consensus repeat of CP034966_2
|
2 spacers
spacers of CP034966_2
>2.1|649522|64|CP034966|PILER-CR AGAACCCGGCTTATCGGTCAGTTTCACCTGATTTACGTAAAAACCCGCTTCGGCGGGTTTTTGC >2.2|649640|59|CP034966|PILER-CR GCGGCTTATCGGTCAGTTTCACCTGGTTTACGTAAAAAACCGCTTCGGCGGGTTTTTGC |
CRISPR arrays and Neighbor proteins around CP034966_2
The CRISPR arrays of CP034966_2 >merge|CP034966|2|649468-649719|PILER-CR AGATGAATGACTGTCCACGACAGAACCCGGCTTATCGGTCAGTTTCACCTGATTTACGTAAAAACCCGCTTCGGCGGGTTTTTGCTTTTGGAGGAGCGGAAAGATGAATGACTGTCCACGACGCTATACCCAAAAGAAAGCGGCTTATCGGTCAGTTTCACCTGGTTTACGTAAAAAACCGCTTCGGCGGGTTTTTGCTTTTGGAGGGGCAGAAAGATGAATGACTGTCCACGACGCTATACCCAAAAGAAA >CP034966|2|1|649468-649719|PILER-CR AGATGAATGACTGTCCACGACAGAACCCGGCTTATCGGTCAGTTTCACCTGATT TACGTAAAAACCCGCTTCGGCGGGTTTTTGCTTTTGGAGGAGCGGAAAGATGAATGACTGTCCA CGACGCTATACCCAAAAGAAAGCGGCTTATCGGTCAGTTTCACCTGGTTTACGT AAAAAACCGCTTCGGCGGGTTTTTGCTTTTGGAGGGGCAGAAAGATGAATGACTGTCCA CGACGCTATACCCAAAAGAAA
>CP034966.1|QAS88698.1|647962_649108_+|glycerate-2-kinase MKIVIAPDSYKESLSASEVAQAIEKGFREIFPDAQYVSIPVADGGEGTVEAMIAATQGSERHAWVTGPLGEKVNASWGISGDGKTAFIEMAAASGLELVPAEKRDPLVTTSRGTGELILQALESGATNIIIGIGGSATNDGGAGMVQALGAKLCDANGNEIGFGGGSLNTLNDIDISGLDPRLKDCVIRVACDVTNPLVGDNGASRIFGPQKGASEAMIVELDNNLSHYADVIKKALHVDVKDVPGAGAAGGMGAALMAFLGAELKSGIEIVTTALNLEEHIHDCTLVITGEGRIDSQSIHGKVPIGVANVAKKYHKPVIGIAGSLTDDVGVVHQHGIDAVFSVLTSIGTLDEAFRGAYDNICRASRNIAATLAIGMRNAG >CP034966.1|QAS88697.1|646975_647866_+|2-hydroxy-3-oxopropionate-reductase MTMKVGFIGLGIMGKPMSKNLLKAGYSLVVADRNPEAIADVIAAGAETASTAKAIAEQCDVIITMLPNSPHVKEVALGENGIIEGAKPGTVLIDMSSIAPLASREISEALKAKGIDMLDAPVSGGEPKAIDGTLSVMVGGDKAIFDKYYDLMKAMAGSVVHTGEIGAGNVTKLANQVIVALNIAAMSEALTLATKAGVNPDLVYQAIRGGLAGSTVLDAKAPMVMDRNFKPGFRIDLHIKDLANALDTSHGVGAQLPLTAAVMEMMQALRADGLGTADHSALACYYEKLAKVEVTR >CP034966.1|QAS88696.1|646175_646946_+|5-keto-4-deoxy-D-glucarate-aldolase MNNDVFPNKFKAALAAKQVQIGCWSALSNPISTEVLGLAGFDWLVLDGEHAPNDISTFIPQLMALKGSASAPVVRVPTNEPVIIKRLLDIGFYNFLIPFVETKEEAEQAVASTRYPPEGIRGVSVSHRANMFGTVADYFAQSNKNITILVQIESQQGVDNIDAIAATEGVDGIFVGPSDLAAALGHLGNASHPDVQKAIQHIFNRASAHGKPSGILAPIEADARRYLEWGATFVAVGSDLGVFRSATQKLADTFKK >CP034966.1|QAS88695.1|644825_646160_+|MFS-transporter MILDTVDVKKKGVHTRYLILLIIFIVTAVNYADRATLSIAGTEVAKELQLSAVSMGYIFSAFGWAYLLMQIPGGWLLDKFGSKKVYTYSLFFWSLFTFLQGFVDMFPLAWAGISMFFMRFMLGFSEAPSFPANARIVAAWFPTKERGTASAIFNSAQYFSLALFSPLLGWLTFAWGWEHVFTVMGVIGFVLTALWIKLIHNPTDHPRMSAEELKFISENGAVVDMDHKKPGSAAASGPKLHYIKQLLSNRMMLGVFFGQYFINTITWFFLTWFPIYLVQEKGMSILKVGLVASIPALCGFAGGVLGGVFSDYLIKRGLSLTLARKLPIVLGMLLASTIILCNYTNNTTLVVMLMALAFFGKGFGALGWSVISDTAPKEIVGLCGGVFNVFGNVASIVTPLVIGYLVSELHSFNAALIFVGCSALMAMVCYLFVVGDIKRMELQK >CP034966.1|QAS88694.1|642879_644451_-|galactarate-dehydratase MANIEIRQETPTAFYIKVHDTDNVAIIVNDNGLKAGTRFPDGLELIEHIPQGHKVALLDIPANGEIIRYGEVIGYAVRAIPRGSWIDESMVVLPEAPPLHTLPLATKVPEPLPPLEGYTFEGYRNADGSVGTKNLLGITTSVHCVAGVVDYVVKIIERDLLPKYPNVDGVVGLNHLYGCGVAINAPAAVVPIRTIHNISLNPNFGGEVMVIGLGCEKLQPERLLTGTDDVQAIPVESASIVSLQDEKHVGFQSMVEDILQVAERHLQKLNQRQRETCPASELVVGMQCGGSDAFSGVTANPAVGYASDLLVRCGATVMFSEVTEVRDAIHLLTPRAVNEEVGKRLLEEMEWYDNYLNIGKTDRSANPSPGNKKGGLANVVEKALGSIAKSGKSAIVEVLSPGQRPTKRGLIYAATPASDFVCGTQQVASGITVQVFTTGRGTPYGLMAVPVIKMATRTELANRWFDLMDINAGTIATGEETIEEVGWKLFHFILDVASGKKKTFSDQWGLHNQLAVFNPAPVT >CP034966.1|QAS88693.1|642395_642731_-|type-II-toxin-antitoxin-system-PrlF-family-antitoxin MPANARSHAVLTTESKVTIRGQTTIPAPVREALKLKPGQDSIHYEILPGGQVFMCRLGDEQEDHTMNAFLRFLDADIQNNPQKTRPFNIQQGKKLVAGMDVNIDDEIGDDE >CP034966.1|QAS88692.1|641931_642396_-|type-II-toxin-antitoxin-system-YhaV-family-toxin MDFPQRVNGWALYAHPCFQETYDALVAEVEALKGKDPENYQRKAATKLLAVVHKVIEEHITVNPSSPAFRHGKSLGSGKNKDWSRVKFGAGRYRLFFRYSEKEKVIILGWMNDENTLRTYGKKTDAYTVFSKMLKRGHPPADWESLTQETEENH >CP034966.1|QAS88691.1|641067_641877_+|DeoR-family-transcriptional-regulator MSNTDASGEKRVTGTSERREQIIQRLRQQGSVQVNDLSALYGVSTVTIRNDLAFLEKQGIAVRAYGGALICDSTTPSVEPSVEDKSALNTAMKRSVAKAAVELIQPGHRVILDSGTTTFEIARLMRKHTDVIAMTNGMNVANALLEAEGVELLMTGGHLRRQSQSFYGDQAEQSLQNYHFDMLFLGVDAIDLERGVSTHNEDEARLNRRMCEVAERIIVVTDSSKFNRSSLHKIIDTQRIDMIIVDEGIPADSLEGLRKAGVEVILVGE >CP034966.1|QAS88690.1|639538_640819_-|tagatose-bisphosphate-aldolase-subunit-KbaZ MKHLTEMVRQHKAGKTNAIYAVCSAHPLVLEAAIRYASANQTPLLIEATSNQVDQFGGYTGMTPADFRGFVCQLADSLNFPQDALILGGDHLGPNRWQNLPAAQAMANADDLIKSYVAAGFKKIHLDCSMSCQDDPIPLTDDIVAERAARLAKVAEETCLEHFGEADLEYVIGTEVPVPGGAHETLSELAVTTPDAARATLEAHRHAFEKQGLNAIWPRIIALVVQPGVEFDHTNVIDYQPAKASALSQMVENYETLIFEAHSTDYQTPQSLRQLVIDHFAILKVGPALTFALREALFSLAAIEEELVPAKACSGLRQVLEDVMLDRPEYWQSHYHGDGNARRLARGYSYSDRVRYYWPDSQIDDAFAHLVRNLADSPIPLPLISQYLPLQYVKVRSGELQPTPRELIINHIQDILAQYHTACEGQ >CP034966.1|QAS88689.1|639042_639516_-|PTS-N-acetylgalactosamine-transporter-subunit-IIB MPNIVLSRIDERLIHGQVGVQWVGFAGANLVLVANDEVAEDPVQQNLMEMVLAEGIAVRFWTLQKVIDNIHRAADRQKILLVCKTPADFLTLVKGGVPVNRINVGNMHYANGKQQIAKTVSVDAGDIAAFNDLKAAGVECFVQGVPTEPAVDLFKLL >CP034966.1|QAS88699.1|650017_651205_-|YhaC-family-protein MFPVSSIGNDISSDLVRRKMNDLPESPIVNNLEALAPGIEKLKQTSIQMVTLLNALQPGGKCIITGDFQKELAYLQNVILYNDSSLRMDFFGYNALIIQRSDNTCELTINEPLKNQEISTGNINVNFPLKDIYNEIRRLNVVFSCGTGGIVDLSSLDLRNIDLELYDFTDKHMANAILNPFKLDDTDFTNANMFQVNFVSSKQNTTISWDYLLKITPVLTSISDMYSEEKIKLVESCLNELGDITEEQLKIMRFAIIESIPRATLTDQLENELTKEIYKNSSKINNYLNRIKLPEMKGFSSEKIDYYIDIIIKDYESVKENAYLIDPKINYNTDLNIEDSSSEEFLSDNTLEKDENSPDNCFEVVKYNTYEAYNSENLYFTREEYTYDYDLLNAI >CP034966.1|QAS88700.1|651226_651766_-|hypothetical-protein MKGFPIAHIFHPSIPPMHAVVNNHNRNIDYWTVKRKFAEIVSTNDVNKIYSISNELRRVLSAITALNFYQGDVPSVMIRIQPENMSPFIIDISTGEHDDYIIQTLDVGTFAPFGEQCTCSAVNKKELECIKETISKYCAKFTRKEAILTPPAHFNKTSITSDCWQILFFSPDHFNNDFY >CP034966.1|QAS88701.1|652021_652366_-|DNA-binding-transcriptional-activator-TdcR MTGITIFYGDNIIRYVVNIKKGLRPYFKQLPDNYQAKFELNLMSKFSNFIINKPFSAINTAARHIFSRYLLENKHLFYQYFKISNTGIDHLEQLINVNFFSSDRTSFCECNRFP >CP034966.1|QAS88702.1|652554_653493_+|transcriptional-regulator-TdcA MSTILLPKTQHLVVFQEVIRSGSIGSAAKELGLTQPAVSKIINDIEDYFGVELVVRKNTGVTLTPAGQLLLSRSESITREMKNMVNEISGMSSEAVVEVSFGFPSLIGFTFMSGMINKFKEVFPKAQVSMYEAQLSSFLPAIRDGRLDFAIGTLSAEMKLQDLHVEPLFESEFVLVASKSRTCTGTTTLESLKNEQWVLPQTNMGYYSELLTTLQRNGISIENIVKTDSVVTIYNLVLNADFLTVIPCDMTSPFGSNQFITIPVEETLPVAQYAAVWSKNYRIKKAASVLVELAKEYSSYNGCRRRQLIEVG >CP034966.1|QAS88703.1|653591_654581_+|bifunctional-threonine-ammonia-lyase/L-serine-ammonia-lyase-TdcB MHITYDLPVAIDDIIEAKQRLAGRIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSLTDAEKRKGVVACSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVEMEGRIFIPPYDDPKVIAGQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAVAIKSINPTIRVIGVQSENVHGMAASFHSGEITTHRTTGTLADGCDVSRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVVTEGAGALACAALLSGKLDQYIQNRKTVSIISGGNIDLSRVSQITGFVDA >CP034966.1|QAS88704.1|654602_655934_+|threonine/serine-transporter-TdcC MSTSDSIVSSQTKQSSWRKSDTTWTLGLFGTAIGAGVLFFPIRAGFGGLIPILLMLVLAYPIAFYCHRALARLCLSGSNPSGNITETVEEHFGKTGGVVITFLYFFAICPLLWIYGVTITNTFMTFWENQLGFAPLNRGFVALFLLLLMAFVIWFGKDLMVKVMSYLVWPFIASLVLISLSLIPYWNSAVIDQVDLGSLSLTGHDGILITVWLGISIMVFSFNFSPIVSSFVVSKREEYEKDFGRDFTERKCSQIISRASMLMVAVVMFFAFSCLFTLSPANMAEAKAQNIPVLSYLANHFASMTGTKTTFAITLEYAASIIALVAIFKSFFGHYLGTLEGLNGLILKFGYKGDKTKVSLGKLNTISMIFIMGSTWVVAYANPNILDLIEAMGAPIIASLLCLLPMYAIRKAPSLAKYRGRLDNVFVTVIGLLTILNIVYKLF >CP034966.1|QAS88705.1|655959_657168_+|propionate-kinase MNEFPVVLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVNGGEPAPLAHHSYEGALKAIAFELEKRNLNDSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLHNYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTSHRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERAQLAIKTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVEIDTEMNNRSNSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGKVNAPAEFA >CP034966.1|QAS88706.1|657201_659496_+|2-ketobutyrate-formate-lyase/pyruvate-formate-lyase MKVDIDTSDKLYADAWLGFKGTDWKNEINVRDFIQHNYTPYEGDESFLAEATPATTELWEKVMEGIRIENATHAPVDFDTNIATTITAHDAGYINQPLEKIVGLQTDAPLKRALHPFGGINMIKSSFHAYGREMDSEFEYLFTDLRKTHNQGVFDVYSPDMLRCRKSGVLTGLPDGYGRGRIIGDYRRVALYGISYLVRERELQFADLQSRLEKGEDLEATIRLREELAEHRHALLQIQEMAAKYGFDISRPAQNAQEAVQWLYFAYLAAVKSQNGGAMSLGRTASFLDIYIERDFKAGVLNEQQAQELIDHFIMKIRMVRFLRTPEFDSLFSGDPIWATEVIGGMGLDGRTLVTKNSFRYLHTLHTMGPAPEPNLTILWSEELPIAFKKYAAQVSIVTSSLQYENDDLMRTDFNSDDYAIACCVSPMVIGKQMQFFGARANLAKTLLYAINGGVDEKLKIQVGPKTAPLMDDVLDYDKVMDSLDHFMDWLAVQYISALNIIHYMHDKYSYEASLMALHDRDVYRTMACGIAGLSVATDSLSAIKYARVKPIRDENGLAVDFEIDGEYPQYGNNDERVDSIACDLVERFMKKIKALPTYRNAVPTQSILTITSNVVYGQKTGNTPDGRRAGTPFAPGANPMHGRDRKGAVASLTSVAKLPFTYAKDGISYTFSIVPAALGKEDPVRKTNLVGLLDGYFHHEADVEGGQHLNVNVMNREMLLDAIEHPEKYPNLTIRVSGYAVRFNALTREQQQDVISRTFTQAL >CP034966.1|QAS88707.1|659509_659899_+|enamine/imine-deaminase MKKIIETQRAPGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITDLNDFATINEVYKQFFDEHQATYPTRSYVQVARLPKDVKLEIEAIAVRSA >CP034966.1|QAS88708.1|659970_661335_+|L-serine-ammonia-lyase MISAFDIFKIGIGPSSSHTVGPMNAGKSFIDRLESSGLLTATSHIVVDLYGSLSLTGKGHATDVAIIMGLAGNSPQDVVIDEIPAFIELVTRSGRLPVASGAHIVDFPVAKNIIFHPEMLPRHENGMRITAWKGQEALLSKTYYSVGGGFIVEEEHFGLSHDVETSVPYDFHSAGELLKMCDYNGLSISGLMMHNELALRSKAEIDAGFARIWQVMHDGIERGMNTEGVLPGPLNVPRRAVALRRQLVSSDNISNDPMNVIDWINMYALAVSEENAAGGRVVTAPTNGACGIIPAVLAYYDKFRRPVNERSIARYFLAAGAIGALYKMNASISGAEVGCQGEIGVACSMAAAGLTELLGGSPAQVCNAAEIAMEHNLGLTCDPVAGQVQIPCIERNAINAVKAVNAARMAMRRTSAPRVSLDKVIETMYETGKDMNDKYRETSRGGLAIKVVCG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034966_3 | 664300-664417 | Orphan |
NA
Consensus repeat of CP034966_3
|
1 spacers
spacers of CP034966_3
>3.1|664340|38|CP034966|CRISPRCasFinder GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA |
CRISPR arrays and Neighbor proteins around CP034966_3
The CRISPR arrays of CP034966_3 >merge|CP034966|3|664300-664417|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGGGTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGATGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG >CP034966|3|2|664300-664417|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGG GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA TGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG
>CP034966.1|QAS88710.1|662968_664279_+|serine-dehydratase-subunit-alpha-family-protein MFDSTLNPLWQRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPGTGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFSRAKVWNGEKWACVTIVGGHTNIVHIETHNGVVFTQQACVAEGEQESPLTVLSRTTLAEILKFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVIRTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAIYIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEIMASKAR >CP034966.1|QAS88709.1|661609_662941_+|HAAAP-family-serine/threonine-permease MEIASNKGVIADASTPAGRAGMSESEWREAIKFDSTDTGWVIMSIGMAIGAGIVFLPVQVGLMGLWVFLLSSVIGYPAMYLFQRLFINTLAESPECKDYPSVISGYLGKNWGILLGALYFVMLVIWMFVYSTAITNDSASYLHTFGVTEGLLSDSPFYGLVLICILVAISSRGEKLLFKISTGMVLTKLLVVAALGVSMVGMWHLYNVGSLPPLGLLVKNAIITLPFTLTSILFIQTLSPMVISYRSREKSIEVARHKALRAMNIAFGILFVTVFFYAVSFTLAMGHDEAVKAYEQNISALAIAAQFISGDGAAWVKVVSVILNIFAVMTAFFGVYLGFREATQGIVMNILRRKMPAEKINENLVQRGIMIFAILLAWSAIVLNAPVLSFTSICSPIFGMVGCLIPAWLVYKVPALHKYKGMSLYLIIVTGLLLCVSPFLAFS >CP034966.1|QAS88708.1|659970_661335_+|L-serine-ammonia-lyase MISAFDIFKIGIGPSSSHTVGPMNAGKSFIDRLESSGLLTATSHIVVDLYGSLSLTGKGHATDVAIIMGLAGNSPQDVVIDEIPAFIELVTRSGRLPVASGAHIVDFPVAKNIIFHPEMLPRHENGMRITAWKGQEALLSKTYYSVGGGFIVEEEHFGLSHDVETSVPYDFHSAGELLKMCDYNGLSISGLMMHNELALRSKAEIDAGFARIWQVMHDGIERGMNTEGVLPGPLNVPRRAVALRRQLVSSDNISNDPMNVIDWINMYALAVSEENAAGGRVVTAPTNGACGIIPAVLAYYDKFRRPVNERSIARYFLAAGAIGALYKMNASISGAEVGCQGEIGVACSMAAAGLTELLGGSPAQVCNAAEIAMEHNLGLTCDPVAGQVQIPCIERNAINAVKAVNAARMAMRRTSAPRVSLDKVIETMYETGKDMNDKYRETSRGGLAIKVVCG >CP034966.1|QAS88707.1|659509_659899_+|enamine/imine-deaminase MKKIIETQRAPGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITDLNDFATINEVYKQFFDEHQATYPTRSYVQVARLPKDVKLEIEAIAVRSA >CP034966.1|QAS88706.1|657201_659496_+|2-ketobutyrate-formate-lyase/pyruvate-formate-lyase MKVDIDTSDKLYADAWLGFKGTDWKNEINVRDFIQHNYTPYEGDESFLAEATPATTELWEKVMEGIRIENATHAPVDFDTNIATTITAHDAGYINQPLEKIVGLQTDAPLKRALHPFGGINMIKSSFHAYGREMDSEFEYLFTDLRKTHNQGVFDVYSPDMLRCRKSGVLTGLPDGYGRGRIIGDYRRVALYGISYLVRERELQFADLQSRLEKGEDLEATIRLREELAEHRHALLQIQEMAAKYGFDISRPAQNAQEAVQWLYFAYLAAVKSQNGGAMSLGRTASFLDIYIERDFKAGVLNEQQAQELIDHFIMKIRMVRFLRTPEFDSLFSGDPIWATEVIGGMGLDGRTLVTKNSFRYLHTLHTMGPAPEPNLTILWSEELPIAFKKYAAQVSIVTSSLQYENDDLMRTDFNSDDYAIACCVSPMVIGKQMQFFGARANLAKTLLYAINGGVDEKLKIQVGPKTAPLMDDVLDYDKVMDSLDHFMDWLAVQYISALNIIHYMHDKYSYEASLMALHDRDVYRTMACGIAGLSVATDSLSAIKYARVKPIRDENGLAVDFEIDGEYPQYGNNDERVDSIACDLVERFMKKIKALPTYRNAVPTQSILTITSNVVYGQKTGNTPDGRRAGTPFAPGANPMHGRDRKGAVASLTSVAKLPFTYAKDGISYTFSIVPAALGKEDPVRKTNLVGLLDGYFHHEADVEGGQHLNVNVMNREMLLDAIEHPEKYPNLTIRVSGYAVRFNALTREQQQDVISRTFTQAL >CP034966.1|QAS88705.1|655959_657168_+|propionate-kinase MNEFPVVLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVNGGEPAPLAHHSYEGALKAIAFELEKRNLNDSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLHNYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTSHRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERAQLAIKTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVEIDTEMNNRSNSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGKVNAPAEFA >CP034966.1|QAS88704.1|654602_655934_+|threonine/serine-transporter-TdcC MSTSDSIVSSQTKQSSWRKSDTTWTLGLFGTAIGAGVLFFPIRAGFGGLIPILLMLVLAYPIAFYCHRALARLCLSGSNPSGNITETVEEHFGKTGGVVITFLYFFAICPLLWIYGVTITNTFMTFWENQLGFAPLNRGFVALFLLLLMAFVIWFGKDLMVKVMSYLVWPFIASLVLISLSLIPYWNSAVIDQVDLGSLSLTGHDGILITVWLGISIMVFSFNFSPIVSSFVVSKREEYEKDFGRDFTERKCSQIISRASMLMVAVVMFFAFSCLFTLSPANMAEAKAQNIPVLSYLANHFASMTGTKTTFAITLEYAASIIALVAIFKSFFGHYLGTLEGLNGLILKFGYKGDKTKVSLGKLNTISMIFIMGSTWVVAYANPNILDLIEAMGAPIIASLLCLLPMYAIRKAPSLAKYRGRLDNVFVTVIGLLTILNIVYKLF >CP034966.1|QAS88703.1|653591_654581_+|bifunctional-threonine-ammonia-lyase/L-serine-ammonia-lyase-TdcB MHITYDLPVAIDDIIEAKQRLAGRIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSLTDAEKRKGVVACSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVEMEGRIFIPPYDDPKVIAGQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAVAIKSINPTIRVIGVQSENVHGMAASFHSGEITTHRTTGTLADGCDVSRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVVTEGAGALACAALLSGKLDQYIQNRKTVSIISGGNIDLSRVSQITGFVDA >CP034966.1|QAS88702.1|652554_653493_+|transcriptional-regulator-TdcA MSTILLPKTQHLVVFQEVIRSGSIGSAAKELGLTQPAVSKIINDIEDYFGVELVVRKNTGVTLTPAGQLLLSRSESITREMKNMVNEISGMSSEAVVEVSFGFPSLIGFTFMSGMINKFKEVFPKAQVSMYEAQLSSFLPAIRDGRLDFAIGTLSAEMKLQDLHVEPLFESEFVLVASKSRTCTGTTTLESLKNEQWVLPQTNMGYYSELLTTLQRNGISIENIVKTDSVVTIYNLVLNADFLTVIPCDMTSPFGSNQFITIPVEETLPVAQYAAVWSKNYRIKKAASVLVELAKEYSSYNGCRRRQLIEVG >CP034966.1|QAS88701.1|652021_652366_-|DNA-binding-transcriptional-activator-TdcR MTGITIFYGDNIIRYVVNIKKGLRPYFKQLPDNYQAKFELNLMSKFSNFIINKPFSAINTAARHIFSRYLLENKHLFYQYFKISNTGIDHLEQLINVNFFSSDRTSFCECNRFP >CP034966.1|QAS88711.1|664490_664655_-|hypothetical-protein MSKKSAKKRQPVKPVVAKEPARTAKNFGYEEMLSELEAIVADAETRLAEDEATA >CP034966.1|QAS88712.1|664677_665379_-|pirin-family-protein MITTRTARQCGQADYGWLQARYTFSFGHYFDPKLLGYASLRVLNQEVLAPGAAFQPRTYPKVDILNVILDGEAEYRDSEGNHVQASAGEALLLSTQPGVSYSEHNLSKDKPLTRMQLWLDACPQRENPLIQKLALNMGKQQLIASPEGTMGSLQLRQQVWLHHIVLDKGESANFQLHGPRAYLQSIHGKFHALTHHEEKAALTCGDGAFIRDEANITLVADSPLRALLIDLPV >CP034966.1|QAS88713.1|665483_666380_+|LysR-family-transcriptional-regulator MAKERALTLEALRVMDAIDRRGSFAAAADELGRVPSALSYTMQKLEEELDVVLFDRSGHRTKFTNVGRMLLERGRVLLEAADKLTTDAEALARGWETHLTIVTEALVPTPAFFPLIDKLAAKANTQLAIITEVLAGAWERLEQGRADIVIAPDMHFRSSSEINSRKLYTLMNVYVAAPDHPIHQEPEPLSEVTRVKYRGIAVADTARERPVLTVQLLDKQPRLTVSTIEDKRQALLAGLGVATMPYPMVEKDIAEGRLRVVSPESTSEIDIIMAWRRDSMGEAKSWCLREIPKLFSGK >CP034966.1|QAS88714.1|666430_666787_-|DUF805-domain-containing-protein MQWYLAVLKNYVGFSGRARRKEYWMFTLINAIVGAIINVIQLILGLEFPFLSLIYLAATIIPVIALCVRRLHDTDRSGAWALLYLVPIIGWLVLFVFACLEGNSGSNRYGNDPKFGSN >CP034966.1|QAS88715.1|667028_667394_-|DUF805-domain-containing-protein MDWYLKVLKNYVGFRGRARRKEYWMFILVNIIFTFVLGLLDKMLGWQRAGGEGILTTIYGILVFLPWWAVQFRRLHDTDRSAWWALLFLIPFIGWLIIIVFNCQAGTPGENRFGPDPKLEP >CP034966.1|QAS88716.1|667686_668673_-|glutathione-S-transferase-family-protein MGQLIDGVWHDTWYDTKSTGGKFQRSASAFRNWLTADGAPGPTGTGGFIAEKDRYHLYVSLACPWAHRTLIMRKLKGLEPFISVSVVNPLMLENGWTFDDSFPGATGDTLYQHEFLYQLYLHADPHYSGRVTVPVLWDKKNHTIVSNESAEIIRMFNTAFDALGAKAGDYYPPALQTKIDELNGWIYDTVNNGVYKAGFATSQQAYDEAVAKVFESLARLEQILGQHRYLTGNQLTEADIRLWTTLVRFDPVYVTHFKCDKHRISNYLNLYGFLRDIYQMPGIAETVNFDHIRNHYFRSHKTINPTGIISIGPWQDLDEPHGRDVRFG >CP034966.1|QAS88717.1|668742_669225_-|DoxX-family-protein MILSIDSNDANTAPLHKKTISSLSGAVESMMKKLEDVGVLVARILMPILFITAGWGKITGYAGTQQYMEAMGVPGFMLPLVILLEFGGGLAILFGFLTRTTALFTAGFTLLTAFLFHSNFAEGVNSLMFMKNLTISGGFLLLAITGPGAYSIDRLLNKKW >CP034966.1|QAS88718.1|669320_669620_-|hypothetical-protein MSSKVERERRKAQLLSQIQQQRLDLSASRREWLEATGAYDRRWNMLLSLRSWALVGSSVMAIWTIRHPNMLVRWARRGFGVWSAWRLVKTTLKQQQLRG >CP034966.1|QAS88719.1|669609_670014_-|hypothetical-protein MADTHHAQGPGKSVLGIGQRIVSIMVEMVETRLRLAVVELEEEKANLFQLLLMLGLTMLFAAFGLMSLMVLIIWAVDPQYRLNAMIATTVVLLLLALIGGIWTLRKSRKSTLLRHTRHELANDRQLLEEESREQ >CP034966.1|QAS88720.1|670016_670322_-|DUF883-domain-containing-protein MSKEHTTEHLRAELKSLSDTLEEVLSSSGEKSKEELSKIRSKAEQALKQSRYRLGETGDAIAKQTRVAAARADEYVRENPWTGVGIGAAIGVVLGVLLSRR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034966_4 | 1037750-1038266 | Orphan |
I-E
Consensus repeat of CP034966_4
|
8 spacers
spacers of CP034966_4
>4.1|1037779|32|CP034966|PILER-CR,CRISPRCasFinder,CRT TCCACGCTGTAACGGCCATCATTAAGTTTAGT >4.2|1037840|32|CP034966|PILER-CR,CRISPRCasFinder,CRT GCTGATGGTCTGGGAGTGTCCATCGGGCAACT >4.3|1037901|32|CP034966|PILER-CR,CRISPRCasFinder,CRT GAAGTAGGCCTGACAGTGATTGAACGCATACT >4.4|1037962|32|CP034966|PILER-CR,CRISPRCasFinder,CRT AGTTGGGGCGGCGCAATAACGAGACGATACGC >4.5|1038023|32|CP034966|PILER-CR,CRISPRCasFinder,CRT GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT >4.6|1038084|32|CP034966|PILER-CR,CRISPRCasFinder,CRT TCAACGCGCTCAGACGTTGCGTGAGTGAACCA >4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT AAATATCCAGGGCTGGGCTGGAGGCAGACGGC >4.8|1038206|32|CP034966|PILER-CR,CRISPRCasFinder,CRT CCCGGAATGCATTCTGAAGGTTTGCTGTATAT |
CRISPR arrays and Neighbor proteins around CP034966_4
The CRISPR arrays of CP034966_4 >merge|CP034966|4|1037750-1038266|PILER-CR,CRISPRCasFinder,CRT GAGTTCCCCGCGCCAGCGGGGATAAACCGTCCACGCTGTAACGGCCATCATTAAGTTTAGTGAGTTCCCCGCGCCAGCGGGGATAAACCGGCTGATGGTCTGGGAGTGTCCATCGGGCAACTGAGTTCCCCGCGCCAGCGGGGATAAACCGGAAGTAGGCCTGACAGTGATTGAACGCATACTGAGTTCCCCGCGCCAGCGGGGATAAACCGAGTTGGGGCGGCGCAATAACGAGACGATACGCGAGTTCCCCGCGCCAGCGGGGATAAACCGGGGAGTGGCACTTCTGGGGTAGCGGCGGCCCTGAGTTCCCCGCGCCAGCGGGGATAAACCGTCAACGCGCTCAGACGTTGCGTGAGTGAACCAGAGTTCCCCGCGCCAGCGGGGATAAACCGAAATATCCAGGGCTGGGCTGGAGGCAGACGGCGAGTTCCCCGCGCCAGCGGGGATAAACCGCCCGGAATGCATTCTGAAGGTTTGCTGTATATGAGTTCCCCGCGCCAGCGGGGATAAACCA >CP034966|4|2|1037750-1038266|PILER-CR GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA >CP034966|4|3|1037750-1038266|CRISPRCasFinder GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA >CP034966|4|1|1037750-1038266|CRT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA
>CP034966.1|QAS89036.1|1036738_1037410_+|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAVISRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >CP034966.1|QAS89035.1|1036459_1036600_-|hypothetical-protein MSEENKENGFNHVKTFTKIIFIFSVLVFNDNESKITDAAVNLFIQI >CP034966.1|QAS89034.1|1035573_1036446_-|YgcG-family-protein MRYFILMFTFVCSFVAAQPTIVPQLQQQVTDLTSSLNSQEKKELTHKLESIFNNTQVQIAVLIVPTTKDETIEQYATRVFDNWRLGDAKRNDGILIIVAWSDRTVRIKVGYGLEEKVTDALAGDIIRSNMIPAFKQQKLAQGLELAINALNNQLTSQHQYPTNPSESESASSSDHYYFAIFWVFAVMFFPFWFFHQCSNFCRACKSGVCISAIYLLDLFLFSDKIFSIAVFSFFFTFTIFMVFTCLCVLQKRASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASGRW >CP034966.1|QAS89033.1|1034215_1035514_+|phosphopyruvate-hydratase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >CP034966.1|QAS89032.1|1032490_1034128_+|CTP-synthase-(glutamine-hydrolyzing) MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK >CP034966.1|QAS89031.1|1031471_1032263_+|nucleoside-triphosphate-pyrophosphohydrolase MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADSSAENSSEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >CP034966.1|QAS89030.1|1031065_1031401_+|endoribonuclease-MazF MVSRYVPDMGDLIWVDFDPTKGSEQAGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG >CP034966.1|QAS89029.1|1030817_1031066_+|MazF-MazE-toxin-antitoxin-system-antitoxin-MazE MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKEVW >CP034966.1|QAS89028.1|1028505_1030740_+|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS >CP034966.1|QAS89027.1|1027156_1028458_+|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKTLFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHDVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILAPQLEALLPKVRACLGSLQAMRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK >CP034966.1|QAS89037.1|1038903_1040382_-|sugar-kinase MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFAGHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIPRHMLFDVQMPGTVLGHITPQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDAKAQDLSPEDLLNKKASCVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNYFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDKMVRVKDIFMPVESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNADSIQSWSNA >CP034966.1|QAS89038.1|1040408_1041686_-|MFS-transporter MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQVAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSIIGLLALTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGMVIVFTFLLFQKIRTADSAPAMASSK >CP034966.1|QAS92250.1|1042004_1042790_+|SDR-family-oxidoreductase MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVDITAEGAPQKIIAASCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASPASNYVNGHLLVVDGGYLVR >CP034966.1|QAS89039.1|1042859_1044314_+|FAD-binding-oxidoreductase MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYILEDMKTGFNILREVMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKATGEGIAEIVARYPQCQRVDSKLIETWFNNLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >CP034966.1|QAS89040.1|1044407_1045745_+|MFS-transporter MNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLASIAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >CP034966.1|QAS89041.1|1045722_1046502_+|electron-transfer-flavoprotein-subunit-beta/FixA-family-protein MNILLAFKAEPDAGMLAEKEWQAAAQGKSGPDISLLRSLLGADEQAAAALLLAQRKNGTPMSLTALSMGDERALHWLRYLMALGFEEAVLLETAADLRFAPEFVARHIAEWQHQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDALFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAEMPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWRDYLRQRMQP >CP034966.1|QAS89042.1|1046498_1047359_+|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MNIAIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSVRKSHWGNALTATLQTEKRPLCLSLARQAGAAKNATLPSGMQQLIIVPGALPDWLVSTEDLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKVVLEALVTNIHADCQ >CP034966.1|QAS89043.1|1047506_1048082_-|glycerol-3-phosphate-responsive-antiterminator MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >CP034966.1|QAS89044.1|1048098_1048359_-|ferredoxin-family-protein MSVARNLWRVADAPHIVPADSVERQTAERLISACPAGLFSLTPEGDLRIDYRSCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >CP034966.1|QAS89045.1|1048349_1049621_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034966_5 | 1060651-1061229 | Unclear |
I-E
Consensus repeat of CP034966_5
|
9 spacers
spacers of CP034966_5
>5.1|1060681|31|CP034966|CRISPRCasFinder TTGCCCGCGCAATTCCGGGAGCATCCGCAAT >5.2|1060742|31|CP034966|CRISPRCasFinder ACGGACAAAATATATATTGATTTGCGAATTA >5.3|1060803|31|CP034966|CRISPRCasFinder GTAAAGAAACTGCCGACAAATCCCTGTTCGT >5.4|1060864|31|CP034966|CRISPRCasFinder CCCGTCACCGACGCGCAGTGGCGCTACCGTG >5.5|1060925|31|CP034966|CRISPRCasFinder GGATCTAACGCGCTGTAAAAATTCCGTGCTT >5.6|1060986|31|CP034966|CRISPRCasFinder TGCGGATTACCGGCAAAACATGGGAGCAAAC >5.7|1061047|31|CP034966|CRISPRCasFinder CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT >5.8|1061108|31|CP034966|CRISPRCasFinder GTTTACCGCCCCGCAGAGGCGCTGGCAGATC >5.9|1061169|31|CP034966|CRISPRCasFinder GGATGACCTGTCGCTAAAACTCGCCGCGTAC >5.10|1060680|33|CP034966|PILER-CR GTTGCCCGCGCAATTCCGGGAGCATCCGCAATT >5.11|1060741|33|CP034966|PILER-CR GACGGACAAAATATATATTGATTTGCGAATTAT >5.12|1060802|33|CP034966|PILER-CR GGTAAAGAAACTGCCGACAAATCCCTGTTCGTT >5.13|1060863|33|CP034966|PILER-CR GCCCGTCACCGACGCGCAGTGGCGCTACCGTGA >5.14|1060924|33|CP034966|PILER-CR GGGATCTAACGCGCTGTAAAAATTCCGTGCTTT >5.15|1060985|33|CP034966|PILER-CR ATGCGGATTACCGGCAAAACATGGGAGCAAACC >5.16|1061046|33|CP034966|PILER-CR GCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA >5.17|1061107|33|CP034966|PILER-CR GGTTTACCGCCCCGCAGAGGCGCTGGCAGATCC >5.18|1061168|33|CP034966|PILER-CR GGGATGACCTGTCGCTAAAACTCGCCGCGTACA >5.19|1060681|32|CP034966|CRT TTGCCCGCGCAATTCCGGGAGCATCCGCAATT >5.20|1060742|32|CP034966|CRT ACGGACAAAATATATATTGATTTGCGAATTAT >5.21|1060803|32|CP034966|CRT GTAAAGAAACTGCCGACAAATCCCTGTTCGTT >5.22|1060864|32|CP034966|CRT CCCGTCACCGACGCGCAGTGGCGCTACCGTGA >5.23|1060925|32|CP034966|CRT GGATCTAACGCGCTGTAAAAATTCCGTGCTTT >5.24|1060986|32|CP034966|CRT TGCGGATTACCGGCAAAACATGGGAGCAAACC >5.25|1061047|32|CP034966|CRT CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA >5.26|1061108|32|CP034966|CRT GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC >5.27|1061169|32|CP034966|CRT GGATGACCTGTCGCTAAAACTCGCCGCGTACA |
cas2,cas1,cas6e,cas5 |
CRISPR arrays and Neighbor proteins around CP034966_5
The CRISPR arrays of CP034966_5 >merge|CP034966|5|1060651-1061229|CRISPRCasFinder,PILER-CR,CRT TGTGTTCCCCGCGCCAGCGGGGATAAACCGTTGCCCGCGCAATTCCGGGAGCATCCGCAATTGTGTTCCCCGCGCCAGCGGGGATAAACCGACGGACAAAATATATATTGATTTGCGAATTATGTGTTCCCCGCGCCAGCGGGGATAAACCGGTAAAGAAACTGCCGACAAATCCCTGTTCGTTGTGTTCCCCGCGCCAGCGGGGATAAACCGCCCGTCACCGACGCGCAGTGGCGCTACCGTGAGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATCTAACGCGCTGTAAAAATTCCGTGCTTTGTGTTCCCCGCGCCAGCGGGGATAAACCATGCGGATTACCGGCAAAACATGGGAGCAAACCGTGTTCCCCGCGCCAGCGGGGATAAACCGCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTAGTGTTCCCCGCGCCAGCGGGGATAAACCGGTTTACCGCCCCGCAGAGGCGCTGGCAGATCCGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATGACCTGTCGCTAAAACTCGCCGCGTACAGTGTTCCCCGCGCCAGCGGGGATAAACCG >CP034966|5|4|1060651-1061229|CRISPRCasFinder TGTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAAT TGTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTA TGTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGT TGTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTG AGTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTT TGTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAAC CGTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT AGTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATC CGTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTAC AGTGTTCCCCGCGCCAGCGGGGATAAACCG >CP034966|5|3|1060652-1061228|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACC GTTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACC GACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACC GGTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACC GCCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACC GGGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACC ATGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACC GCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACC GGTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACC GGGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACC >CP034966|5|2|1060652-1061229|CRT GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACCG
>CP034966.1|QAS89054.1|1060261_1060555_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAIWLLEVRAGVYVGDTSKRIREMIWQQITQLAGCGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVDNQ >CP034966.1|QAS89053.1|1059341_1060265_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLSPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLAATVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALTEDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRQTYALLAKQYGVKWNGRKYDPKDWEKGDVVNRCISAATSCLYGISEAAVLAAGYAPAIGFIHSGKPLSFVYDIADIIKFDSVVPKAFEIAARQPAEPDKEVRLACRDIFRSTKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPETLGDSGHRGRGG >CP034966.1|QAS89052.1|1058694_1059345_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQERPAESDTFTIECRSFAPELRTGQQLCFNLRANPTICKSGKRHDLLMEAKRQVRGQAEGSDVWLHQQQAALDWLAAQGERSGFTLLDTSVDAYRQQQLRRENSRQLIQFSSVDYTGMLTVTDPGLFLQRLSQGYGKSRAFGCGLMLIKPGAEA >CP034966.1|QAS89051.1|1057966_1058713_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLIFQLHGPMASWGVDAPGEVRHTHELPSRSALLGLLAAGVGIRRDDTERLNAFNRHYSLVVCASRNPRWARDYHTIQMPKEVRKARYFSRREELSDPDLLSAIISRRDYYTDAWWMVAVATTADAPYSLEQLQDGLRHPVFPLYLGRKSHPLALPLAPLLLEGNACDALCNAYQQYQDHFHKLKVSLPKLQDECWWEGEHDGLVASKILRRRDVPLNRQQWLFGERTINQGPWLSKEEPCTSQE >CP034966.1|QAS89050.1|1054963_1055116_+|type-I-toxin-antitoxin-system-Hok-family-toxin MLTKYALVAIIVLCCTVLGFTLMVGDSLCELSIRERGMEFKAVLAYESKK >CP034966.1|QAS89049.1|1053964_1054699_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIHPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMLEEETRFFGLKRECGLHEG >CP034966.1|QAS89048.1|1052178_1053891_+|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit MSEKHPGPLVVEGKLTDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKIAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD >CP034966.1|QAS89047.1|1050379_1052179_+|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTQVPPSALLPLNPEQLVRLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAEMPGITIISASQTGNARRVAEALRDDLLAAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGAVNEIHTSPYSKDAPLVASLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVEGKTLPLNEALQWHFELTVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPIVDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQAEVENEVHVTVGVVRYDVEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAPGKNWLFFGNPHFTEDFLYQVEWQRYVKDGVLTRIDLAWSRDQKEKVYVQDKLREQGAELWRWINDGAHIYVCGDANRMAKDVEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY >CP034966.1|QAS89046.1|1049698_1050064_-|6-carboxytetrahydropterin-synthase-QueD MMSTTLFKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIIDFAELKAAFKPTYERLDHHYLNDIPGLENPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCIYRGE >CP034966.1|QAS89045.1|1048349_1049621_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL >CP034966.1|QAS89055.1|1061310_1062348_-|aminopeptidase MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEIFDKAGIAVLSVEATNWNLGNKDGYQQRAKTAAFPAGNSWHDVRLDNQQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >CP034966.1|QAS89056.1|1062599_1063508_+|sulfate-adenylyltransferase-subunit-2 MDQIRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >CP034966.1|QAS89057.1|1063509_1064937_+|sulfate-adenylyltransferase-subunit-CysN MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEKTFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMAWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFAGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >CP034966.1|QAS89058.1|1064936_1065542_+|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVQQRELHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >CP034966.1|QAS89059.1|1065591_1065915_+|DUF3561-family-protein MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >CP034966.1|QAS89060.1|1066108_1066420_+|cell-division-protein-FtsB MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNDDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR >CP034966.1|QAS89061.1|1066438_1067149_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSRFAQLPLANHPQITVVDGGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >CP034966.1|QAS89062.1|1067148_1067628_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYERGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >CP034966.1|QAS89063.1|1067624_1068674_+|tRNA-pseudouridine(13)-synthase-TruD MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >CP034966.1|QAS89064.1|1068654_1069416_+|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034966_6 | 1562277-1562394 | Orphan |
NA
Consensus repeat of CP034966_6
|
1 spacers
spacers of CP034966_6
>6.1|1562308|56|CP034966|CRISPRCasFinder TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA |
CRISPR arrays and Neighbor proteins around CP034966_6
The CRISPR arrays of CP034966_6 >merge|CP034966|6|1562277-1562394|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGCTGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAACCGAGCCGTAGGCCGGATAAGGCGTTTACGC >CP034966|6|5|1562277-1562394|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGC TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA CCGAGCCGTAGGCCGGATAAGGCGTTTACGC
>CP034966.1|QAS89497.1|1561049_1562180_-|ribonucleoside-diphosphate-reductase-1-subunit-beta MAYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEALHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAVGLDLPFQTRSNPIPWINTWLVSDNVQVAPQEVEVSSYLVGQIDSEVDTDDLSNFQL >CP034966.1|QAS89496.1|1560795_1561050_-|ferredoxin-like-diferric-tyrosyl-radical-cofactor-maintenance-protein-YfaE MARVTLRITGTQLLCQDEHPSLLAALESHNVAVEYQCREGYCGSCRTRLVAGQVDWIAEPLAFIQPGEILPCCCRAKGDIEIEM >CP034966.1|QAS89495.1|1560091_1560742_+|lipopolysaccharide-kinase-InaA MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMTHHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRALLVTEDMAGFISIADWYAQHAVSPYSDEVRQAMLKAVALAFKKMHSINRQHGCCYVRHIYVKTEGKAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEPIPKADWEQVKAYYYAM >CP034966.1|QAS92261.1|1557357_1557672_-|hypothetical-protein MTNKLGGELIDIADKKLAPLINDSFSYTRDFFAYSKQENNIFTFDNSKFVDPKEKEGLMIQHSNGQLVITGKYCPEGVQTAFTQEQYDKLIRYINIFFTFPKCE >CP034966.1|QAS89494.1|1556062_1557139_+|glycerophosphodiester-phosphodiesterase MKLKLKNLSMAIMMSTIVMGSSAMAADSNEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDHLVVLHDHYLDRVTDVADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTFEEEIEFVQGLNHSTGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQCFDADELKRIKNELEPKMGMDLNLVQLIAYTDWNETQQKQPDGSWVNYSYDWMFKPGAMKQVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKLPEYTTDVNQLYDVLYNKAGVNGLFTDFPDKAVKFLNKE >CP034966.1|QAS89493.1|1554699_1556058_+|glycerol-3-phosphate-transporter MLSIFKPAPHKARLPAAEIDPTYRRLRWQIFLGIFFGYAAYYLVRKNFALAMPYLVEQGFSRGDLGFALSGISIAYGFSKFIMGSVSDRSNPRVFLPAGLILAAAVMLFMGFVPWATSSIAVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLFLLGMAWFNDWHAALYMPAFCAILVALFAFAMMRDTPQSCGLPPIEEYKNDYPDDYNEKAEQELTAKQIFMQYVLPNKLLWYIAIANVFVYLLRYGILDWSPTYLKEVKHFALDKSSWAYFLYEYAGIPGTLLCGWMSDKVFRGNRGATGVFFMTLVTIATIVYWMNPAGNPTVDMICMIVIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGSVAASAIVGYTVDFFGWDGGFMVMIGGSILAVILLIVVMIGEKRRHEQLLQKRNGG >CP034966.1|QAS89492.1|1552798_1554427_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-A MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRNHGLLHSGARYAVTDAESARECISENQILKRIARHCVEPTNGLFITLPEDDLSFQATFIRACEEAGISAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDPFRLTAANMLDAKEHGAVILTAHEVTGLIREGATVCGVRVRNHLTGETQALHAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRINQHVINRCRKPSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAPVMAKTRILRAYSGVRPLVASDDDPSGRNVSRGIVLLDHAERDGLDGFITITGGKLMTYRLMAEWATDAVCRKLGNTRPCTTADLALPGSQDPAEVTLRKVISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVENLNVNSLLDLRRRTRVGMGTCQGELCACRAAGLLQRFNVTTSAQSIEQLSTFLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL >CP034966.1|QAS89491.1|1551549_1552809_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-B MRFDTVIMGGGLAGLLCGLQLQKHGLRCAIVTRGQSALHFSSGSLDLLSHLPDGQPVADIHSGLESLRQQAPAHPYSLLGPQRVLDLACQAQALIAESGAQLQGSVELAHQRITPLGTLRSTWLSSPEVPVWPLPAKKICVVGISGLMDFQAHLAAASLRELDLSVETAEIELPELDVLRNNATEFRAVNIARFLDNEENWPLLLDALIPVANTCEMILMPACFGLADDKLWRWLNEKLPCSLMLLPTLPPSVLGIRLQNQLQRQFVRQGGVWMPGDEVKKVTCKNGVVNEIWTRNHADIPLRPRFAVLASGSFFSGGLVAERNGIREPILGLDVLQTATRGEWYKGDFFAPQPWQQFGVTTDETLRPSQAGQTIENLFAIGSVLGGFDPIAQGCGGGVCAVSALHAAQQIAQRAGGQQ >CP034966.1|QAS89490.1|1550362_1551553_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-C MNDTSFENCIKCTVCTTACPVSRVNPGYPGPKQAGPDGERLRLKDGALYDEALKYCINCKRCEVACPSDVKIGDIIQRARAKYDTTRPSLRNFVLSHTDLMGSVSTPFAPIVNTATSLKPVRQLLDAALKIDHRRTLPKYSFGTFRRWYRSVAAQQAQYKDQVAFFHGCFVNYNHPQLGKDLIKVLNAMGTGVQLLSKEKCCGVPLIANGFTAKARKQAITNVESIREAVGVKGIPVIATSSTCTFALRDEYPEVLNVDNKGLRDHIELATRWLWRKLDEGKTLPLKPLPLKVVYHTPCHMEKMGWTLYTLELLRKIPGLELTVLDSQCCGIAGTYGFKKENYPTSQAIGAPLFRQIEESGADLVVTDCETCKWQIEMSTSLRCEHPITLLAQALA >CP034966.1|QAS89489.1|1549270_1550170_-|ISNCY-family-transposase MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAYYSDVLWSVETSDGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDSVPLVVPLLFYHGETSPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQKHIRDRDLIGMVDRITTLLVKGFTNDSQLQTLFNYLLQCGDTSRFTRFIEEIAKRSPLQKERLMTIAERLRQEGHQIGWQEGMHEQAIKIALRMLEQGFEREIVLATTQLTDADIPNCH >CP034966.1|QAS89498.1|1562413_1564699_-|ribonucleoside-diphosphate-reductase-1-subunit-alpha MNQNLLVTKRDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADLISRDAPDYQYLAARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFKQMDTFIDHDRDMTFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAACLFSNYPRETRLQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINATSSAIVKYVSQRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMWHLEVESLLVLKNNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFFADQEEFERLYTKYEKDDSIRKQRVKAVELFSLMMQERASTGRIYIQNVDHCNTHSPFDPAIAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELEELAILAVRALDALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQYYLLKASNELAKEQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTHGLRNSTLSALMPSETSSQISNATNGIEPPRGYVSIKASKDGILRQVVPDYEHLHDAYELLWEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLTAYKFGVKTLYYQNTRDGAEDAQDDLVPSIQDDGCESGACKI >CP034966.1|QAS89499.1|1565394_1569147_+|AIDA-I-family-autotransporter-YfaL MRIIFLRKEYLSLLPSMIASLFSANGVAAVTDSCQGYDVKASCQASRQSLSGITQDWSIADGQWLVFSDMTNNASGGAVFLQQGAEFSLLPENETGMTLFANNTVTGEYNNGGAIFAKENSTLNLTDVIFSGNVAGGYGGAIYSSGTNDTGAVDLRVTNAMFRNNIANDGKGGAIYTINNDVYLSDVIFDNNQAYTSTSYSDGDGGAIDVTDNNSDSKHPSGYTIVNNTAFTNNTAEGYGGAIYTNSVTAPYLIDISVDDSYSQNGGVLVDENNSAAGYGDGPSSAAGGFMYLGLSEVTFDIADGKTLVIGNTENDGAVDSIAGTGLITKTGSGDLVLNADNNDFTGEMQIENGEVTLGRSNSLMNVGDTHCQDDPQDCYGLTIGSIDQYQNQAELNVGSTQQTFVHALTGFQNGTLNIDAGGNVTVNQGSFAGIIEGAGQLTIAQNGSYVLAGAQSMALTGDIVVDDGAVLSLEGDAADLTALQDDPQSIVLNGGVLDLSDFSTWQSGTSYNDGLEVSGSSGTVIGSQDVVDLAGGDNLHIGGDGKDGVYVVVDASDGQVSLANNNSYLGTTQIASGTLMVSDNSQLGDTHYNRQVIFTDKQQESVMEITSDVDTRSDAAGHGRDIEMRADGEVAVDAGVDTQWGALMADSSGQHQDEGSTLTKTGAGTLELTASGTTQSAVRVEEGTLKGDVADILPYASSLWVGDGATFVTGADQDIQSIDAISSGTIDISDGTVLRLTGQDTSVALNASLFNGDGTLVNATDGVTLTGELNTNLETDSLTYLSNVTVNGNLTNTSGAVSLQNGVAGDTLTVNGDYTGGGTLLLDSELNGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKMVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAKVGGYLNNLRAANQAFMMERRDHAGGDGQTLNLRVIGGDYHYTAAGQLAQHEDTSTVQLSGDLFSGRWGTDGEWMLGIVGGYSDNQGDSRSNMTGTRADNQNHGYAVGLTSSWFQHGNQKQGAWLDSWLQYAWFSNDVSEQEDGTDHYHSSGIIASLEAGYQWLPGRGVVIEPQAQVIYQGVQQDDFTAANRARVSQSQGDDIQTRLGLHSEWRTAVHVIPTLDLNYYHDPHSTEIEEDGSTISDDAVKQRGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW >CP034966.1|QAS92262.1|1569274_1569997_-|bifunctional-2-polyprenyl-6-hydroxyphenol-methylase/3-demethylubiquinol-3-O-methyltransferase-UbiG MNAEKSPENHNVDHEEIAKFEAVASRWWDLEGEFKPLHRINPLRLGYIAERAGGLFGKKVLDVGCGGGILAESMAREGATVTGLDMGFEPLQVAKLHALESGIQVDYVQETVEKHAAKHAGQYDVVTCMEMLEHVPDPQSVVRACAQLVKPGGDVFFSTLNRNGKSWLMAVVGAEYILRMVPKGTHDVKKFIKPAELLGWVDQTSLKERHITGLHYNPITNSFKLGPGVDVNYMLHTQNK >CP034966.1|QAS89500.1|1570143_1572771_+|DNA-topoisomerase-(ATP-hydrolyzing)-subunit-A MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDLAVYNTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAPTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITDAGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE >CP034966.1|QAS89501.1|1572919_1574608_+|DUF2138-domain-containing-protein MSGEKKAKGWRFYGLVGFGAIALLSAGVWALQYAGSGPEKTLSPLVVHNNLQIDLNEPDLFLDSDSLSQLPKDLLTIPFLHDVLSEDFVFYYQNHADRLGIEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETVPVYQLRYNGNNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIAGDLLSGKKRWQASFGLEERTAEKTPVRQRIVVSARWLGFGYQRLMPSFAGVRFEMGNDGWHSFVALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQENDKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGAHESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYFFRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYINPQGIAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLAQMEPGAAWQWLPITWQPL >CP034966.1|QAS89502.1|1574604_1575228_+|DUF1175-domain-containing-protein MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYRLNFLAR >CP034966.1|QAS92263.1|1575371_1579766_+|alpha-2-macroglobulin-family-protein MRLEAPGRDYRRYQMEEYGGVDVRLYRIPDPMAFLRQQKNLHRIVVQPQYLGDGLNNTLTWLWDNWYGKSRRVMQRTFSSQSRQNVTQALPELQLGNAIIKPSRYVQNNQFSPLKKYPLVEQFRYPLWQAKPFEPQQGVKLEGASSNFISPQPGNIYIPLGQQEPGLYLVEAMVGGYRATTVVFVSDTVALSKVSGNELLVWTAGKKQGEAKPGSEILWTDGLGVMTRGVTDDSGTLQLQHISPERSYILGKDAEGGVFVSENFFYESEIYNTRLYIFTDRPLYRAGDRVDVKVMGREFHDPLHSSPIVSAPAKLSVLDANGSLLQTVDVTLDARNGGQGSFRLPENAVAGGYELRLAYRNQVYSSSFRVANYIKPHFEIGLALAKKEFKTGEAVSGKLQLLYPDGEPVKNARVQLSLRAQQLSMVGNDLRYAGRFPVSLEGSETVSDASGHVALNLPAADKPSRYLLTVSASDGAAYRVTTTKEILIERGLAHYSLSTAAQYSNSGESVVFRYAALESSKQVPVTYEWLRLEDRTSHSGELPSGGKSFTVNFAKPGNYNLTLRDKDGLILAGLSHAVSGKGSTAHTGTVDIVADKTLYQPGETAKMLITFPEPIDEALLTLERDRVEQQSLLSHPANWLTLQRLNDTQYEARVPVSNSFAPNITFSVLYTRNGQYSFQNAGIKVAVPQLDIRVKTDKTHYQPGELVNVELTSSLKGKPVSAQLTVGVVDEMIYALQPEIAPNIGKFFYPLGRNNVRTSSSLSFISYDQALSSEPVAPGATNRSERRVKMLERPRREEVDTAAWMPSLTTDKQGKAYFTFLMPDSLTRWRITARGMNGDGLVGQGRAYLRSEKNLYMKWSMPTVYRVGDKPAAGLFIFSQQDNEPVALVTKFAGAEMRQTLTLHKGANYISLTQNIQQSGLLSAELQQNGQVQDSISTKLSFVDNSWPVEQQKNVMLGGGDNALMLPEQASNIRLQSSETPQEIFRNNLDALVDEPWGGVINTGSRLIPLSLAWRSLADHQSAAANDIRQMIQVNRLRLMQLAGPGARFTWWGEDGNGDAFLTAWAWYADWQASQAIGVTQQPEYWQHMLDSYAEQADNMPLLHRALVLAWAQEMNLPCKTLLKGLDEAIARRGTKTEDFSEEDTRDINDSLILDTPESPLADAVANVLTMTLLKKAQLKSTVMPQVQQYAWDKAANSNQPLAHTVVLLNSGGDATQAAAILSGLTAEQSTIERALAMNWLAKYMATMPPVVLPAPAGAWAKHKLTGGGEYWRWVGQGVPDILSFGDELSPQNVQVRWREPAKTAQQSNIPVTVERQLYRLITGEEEMSFTLQPVTSNEIDSDALYLDEITLTSEQDAVLRYGQVEVPLPPGADVERTTWGISVNKPNAAKQQGQLLEIARNEMGELAYMVPVKELTGTVTFRHLLRFSQKGQFVLPPARYMRSYAPAQQSVAAGSEWTRMQVK >CP034966.1|QAS89503.1|1579766_1581416_+|DUF2300-domain-containing-protein MNWRRIVWLLALVTLPTLAEEPPLQLALRGAQHDQLYKLSSSGVTNVSTLPDTLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESITRDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLTTLKPETSVTVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWFADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQVASGQCVEVELFARYPLKKITAEKSTTAVKPGVLNGRYRVTFTNGNHITFVSHGETTLLSEKGKLKLQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLTIPDSSATQRVSASPATTGARTMAAWTQDLIYAGDPVHYHGSRATEGTLSWRQATAQAGQGERYDQILAFAYPDNSLSRWGAPRSTCQLLPKAKAWLAKKMPQWRRILQAETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRLDLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD >CP034966.1|QAS89504.1|1581420_1582197_+|DUF2135-domain-containing-protein MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPVEGEDASFSQSINYPASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDGSFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWDTDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPVHGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGELTLVKSFDW >CP034966.1|QAS89505.1|1582270_1583455_-|acetyl-CoA-C-acetyltransferase MKNCVIVSAVRTAIGSFNGSLASTSAIDLGATVIKAAIERAKIDSQHVDEVIMGNVLQAGLGQNPARQALLKSGLAETVCGFTVNKVCGSGLKSVALAAQAIQAGQAQSIVAGGMENMSLAPYLLDAKARSGYRLGDGQVYDVILRDGLMCATHGYHMGITAENVAKEYGITREMQDELALHSQRKAAAAIESGAFTAEIVPVNVVTRKKTFVFSQDEFPKANSTAEALGALRPAFDKAGTVTAGNASGINDGAAALVIMEESAALAAGLTPLARIKSYASGGVPPALMGMGPVPATQKALQLAGLQLADIDLIEANEAFAAQFLAVGKTLGFDPEKVNVNGGAIALGHPIGASGARILVTLLHAMQARDKTLGLATLCIGGGQGIAMVIERLN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034966_7 | 2156781-2156904 | Orphan |
NA
Consensus repeat of CP034966_7
|
1 spacers
spacers of CP034966_7
>7.1|2156824|38|CP034966|CRISPRCasFinder CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA |
CRISPR arrays and Neighbor proteins around CP034966_7
The CRISPR arrays of CP034966_7 >merge|CP034966|7|2156781-2156904|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA >CP034966|7|6|2156781-2156904|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>CP034966.1|QAS90011.1|2156340_2156646_-|monooxygenase MATLLQLHFAFNGPFGDAMAEQLKPLAESINQEPGFLWKVWTESEKNHEAGGIYLFTDEKSALAYLEKHTARLKNLGVEEVVAKVFDVNEPLSQINQAKLA >CP034966.1|QAS90010.1|2154610_2156215_-|FAD-NAD(P)-binding-protein MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENSKMMLANIASIEIPPINCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRILLGEYFRDQFLRLVDQARQQKFAVAVYESCQVTDLQITNAGVMLATNQDLPSETFDLVVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTSLSGLDAAMAVAIQHGSFIEDDKQHVVFNRDNASEKLNITLMSRTGILPEADFYCPIPYEPLHIVTDQALNAEIQKGEEGLLDRVFRLIVEEIKFADPDWSQRIALESLNVDSFAQAWFAERKQRDPFDWAEKNLQEVERNKREKHTVPWRYVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLALREAGIIHILALGEDYEMEINESRTVLKTEDNSYSFDVFIDARGQRPLKVKDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEDIRGRVAFGALPWLMHDQPFVQGLTACAEIGEAMARAVVKPASRARRRLSFD >CP034966.1|QAS90009.1|2153786_2154599_+|hypothetical-protein MIITRADLREWRIGAVMYRWFLRHFPRGGSYADIHHALIEEGYTDWAESLVEYAWKKWLADENFAHQEVSSMQKLATDPGEIPFCSQFARSDDHARIGCCEDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSRISSAGDSSRIANTGMRVRVCTLGERCHVASNGDLAQIASFGANARIANSGDNVHIIASGENSTVVSTGVVDSIILGPGGSAALAYHDGERVRFAVAIEGENNIRAGVRYRLNEQHQFVEC >CP034966.1|QAS90008.1|2152997_2153783_+|thiosulfate-reductase-cytochrome-B-subunit MNPSQHAEQFQSQLANYVPQFTPEFWPVWLIIAGVLLVGMWLVLGLHALLRARGVKKSVTDYGEKIYLYCKAVRLWHWSNALLFVLLLASGLINHFALVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGGNGHHYRIRRQGWLERAAKQTRFYLFGIMQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGLLCLYPQAVGDVFPGVRYWLLQAHFALAFISLFFIFGHLYLCTTGRTPHETFKSMVDGYHRH >CP034966.1|QAS90007.1|2152332_2153001_+|4Fe-4S-dicluster-domain-containing-protein MSFTRRKFVLGMGTVIFFTGSASSLLANTRQEKEVRYAMIHDESRCNGCNICARACRKTNHVPAQGSRLSIAHIPVTDNDNETQYHFFRQSCQHCEDAPCIDVCPTGASWRDEQGIVRVEKSQCIGCSYCIGACPYQVRYLNPVTKVADKCDFCAESRLAKGFPPICVSACPEHALIFGREDSPEIQAWLQQNKYYQYQLPGAGKPHLYRRFGQHLIKKENV >CP034966.1|QAS90006.1|2151621_2152269_+|YdhW-family-putative-oxidoreductase-system-protein MGEMNHRDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHSAAGQLVARAVFLSPPYSVAEEELSVLLESIKQNGDYADIACMTGSQDDYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAPYYFQEAQIEAAIAAMDVAPEYADIRQVESSTAVLYLFSERFMTYGKAYGLCEWFEVEQFQNP >CP034966.1|QAS90005.1|2149515_2151618_+|aldehyde-ferredoxin-oxidoreductase MANGWTGNILRVNLTTGNITLEDSSKFKSFVGGMGFGYKIMYDEVPPGTKPFDEANKLVFATGPLTGSGAPCSSRVNITSLSTFTKGNLVVDAHMGGFFAAQMKFAGYDVIIIEGKAKSPVWLKIKDDKVSLEKADFLWGKGTRATTEEICRLTSPETCVAAIGQAGENLVPLSGMLNSRNHSGGAGTGAIMGSKNLKAIAVEGTKGVNIADRQEMKRLNDYMMTELIGANNNHVVPSTPQSWAEYSDPKSRWTARKELFWGAAEGGPIETGEIPPGNQNTVGFRTYKSVFDLGPAAEKYTVKMSGCHSCPIRCMTQMNIPRVKEFGVPSTGGNTCVANFVHTTIFPNGPKDFEDKDDGRVIGNLVGLNLFDDYGLWCNYGQLHRDFTYCYSKGVFKRVLPAEEYAEIHWDQLEAGDVNFIKDFYYRLAHRVGELSHLADGSYAIAERWNLGEEYWGYAKNKLWSPFGYPVHHANEASAQVGSIVNCMFNRDCMTHTHINFIGSGLPLKLQREVAKELFGSEDAYDETKNYTPINDAKIKYAKWSLLRVCLHNAVTLCNWVWPMTVSPLKSRNYRGDLALEAKFFKAITGEEMTQEKLDLAAERIFTLHRAYTVKLMQTKDMRNEHDLICSWVFDKDPQIPVFTEGTDKMDRDDMHASLTMFYKEMGWDPQLGCPTRETLQRLGLEDIAADLAAHNLLPV >CP034966.1|QAS90004.1|2148868_2149495_+|ferredoxin-like-protein MNPVDRPLLDIGLTRLEFLRISGKGLAGLTIAPALLSLLGCKQEDIDSGTVGLINTPKGVLVTQRARCTGCHRCEISCTNFNDGSVGTFFSRIKIHRNYFFGDNGVGSGGGLYGDLNYTADTCRQCKEPQCMNVCPIGAITWQQKEGCITVDHKRCIGCSACTTACPWMMATVNTESKKSSKCVLCGECANACPTGALKIIEWKDITV >CP034966.1|QAS90003.1|2148203_2148413_+|fumarate-hydratase-FumD MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRIQRSELEKQAMETVINALVK >CP034966.1|QAS90002.1|2146235_2147648_-|pyruvate-kinase-I MKKTKIVCTIGPKTESEEMLAKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKTAAILLDTKGPEIRTMKLEGGNDVSLKAGQTFTFTTDKSVIGNSEMVAVTYEGFTTDLSVGNTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNLPGVSIALPALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVIEIREHLKAHGGENIHIISKIENQEGLNNFDEILEASDGIMVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATQMLDSMIKNPRPTRAEAGDVANAILDGTDAVMLSGESAKGKYPLEAVSIMATICERTDRVMNSRLEFNNDNRKLRITEAVCRGAVETAEKLDAPLIVVATQGGKSARAVRKYFPDATILALTTNEKTAHQLVLSKGVVPQLVKEITSTDDFYRLGKELALQSGLAHKGDVVVMVSGALVPSGTTNTASVHVL >CP034966.1|QAS90012.1|2157218_2158475_+|hypothetical-protein MGSDAKNLMSDGNVQIVKTGEVIGATQLTEGELIVEAGGRAENTVVTGAGWLKVATGGIAKCTQYGNNGTLSVSDGAIATDIVQSEGGAISLSTLATVNGRHPEGEFSVDQGYACGLLLENGGNLRVLEGHRAEKIILDQEGGLLVNGTTSAVVVDEGGELLVYPGGEASNCEINQGGVFMLAGKASDTLLAGGTMNNLGGEDSDTIVENGSIYRLGTDGLQLYSSGKTQNLSVNVGGRAEVHAGTLENAVIQGGTVILLSPTSADENFVVEEDRAPVELTGSVALLDGASMIIGYGADLQQSTITVQQGGVLILDGSTVKGDGVTFIVGNINLNGGKLWLITGAATHVQLKVKRLRGEGAICLQTSAKEISPDFINVKGEVTGDIHVEITDASRQTLCNALKLQPDEDGIGATLQPA >CP034966.1|QAS90013.1|2158515_2159889_-|multidrug-efflux-MATE-transporter-MdtK MQKYISEARLLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGRRERIAHQVRQGFWLAGFVSVLIMLVLWNAGYIIRSMENIDPALADKAVGYLRALLWGAPGYLFFQVARNQCEGLAKTKPGMVMGFIGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFLAMVSYIKRARSMRDIRNEKGTAKPDPAVMKRLIQLGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLMVGVCMATLTAIFTVSLREQIALLYNDNPEVVTLAAHLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFYITFTAYWVLGLPSGYILALTDLVVEPMGPAGFWIGFIIGLTSAAIMMMLRMRFLQRLPSVIILQRASR >CP034966.1|QAS90014.1|2160103_2160745_+|riboflavin-synthase MFTGIVQGTVKLVSIDEKPNFRTHVVELPDHMLDGLETGASVAHNGCCLTVTEINGNHVSFDLMKETLRITNLGDLKVGDWVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQIWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARENAMNQPGTEA >CP034966.1|QAS90015.1|2160784_2161933_-|cyclopropane-fatty-acyl-phospholipid-synthase MSSSCIEEVSVPDDNWYRIANELLSRAGIAINGSAPADIRVKNPDFFKRVLQEGSLGLGESYMDGWWECDRLDMFFSKVLRAGLENQLPHHFKDTLRIASARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPFMQYSCAYWKDADNLESAQQAKLKMICEKLQLKPGMRVLDIGCGWGGLAHYMASNYDVSVVGVTISAEQQKMAQERCEGLDVTILLQDYRDLNDQFDRIVSVGMFEHVGPKNYDTYFAVVDRNLKPEGIFLLHTIGSKKTDLNVDPWINKYIFPNGCLPSVRQIAQSSEPHFVMEDWHNFGADYDTTLMAWYERFLAAWPEIADNYSERFKRMFTYYLNACAGAFRARDIQLWQVVFSRGVENGLRVAR >CP034966.1|QAS90016.1|2162223_2163435_-|Bcr/CflA-family-multidrug-efflux-MFS-transporter MQPGKRFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPASAVSASLSLFLAGFAAAQLLWGPLSDRYGRKPVLLIGLTIFALGSLGMLWVENAATLLVLRFVQAVGVCAAAVIWQALVTDYYPSQKVNRIFATIMPLVGLSPALAPLLGSWLLVHFSWQAIFATLFAITVVLILPIFWLKPTTKARNNSQDGLTFTDLLRSKTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVIGLSYVPQTIAFLIGGYGCRAALQKWQGKQLLPWLLVLFAVSVIATWAAGFISHVSLVEILIPFCVMAIANGAIYPIVIAQALRPFPHATGRAAALQNTLQLGLCFLASLVVSWLISISTPLLTTTSVMLSTVVLVALGYMMQRCEEVGCQNHGNAEVAHSESH >CP034966.1|QAS90017.1|2163547_2164480_+|LysR-family-transcriptional-regulator MWSEYSLEVVDAVARNGSFSAAAQELHRVPSAVSYTVRQLEEWLAVPLFERRHRDVELTAAGAWFLKEGRSVVKKMQITRQQCQQIANGWRGQLAIAVDNIVRPERTRQMIVDFYRHFDDVELLVFQEVFNGVWDALSDGRVELAIGATRAIPVGGRYAFRDMGMLSWSCVVASHHPLALMDGPFSDDTLRNWPSLVREDTSRTLPKRITWLLDNQKRVVVPDWESSATCISAGLCIGMVPTHFAKPWLNEGKWVALELENPFPDSACCLTWQQNDMSPALTWLLEYLGDSETLNKEWLREPEETPATGD >CP034966.1|QAS90018.1|2164476_2165502_-|HTH-type-transcriptional-repressor-PurR MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVIDNAFEGGYMAGRYLIERGHREIGVIPGPLERNTGAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRPTAVFCGGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREEPQSIEVHPRLIERRSVADGPFRDYRR >CP034966.1|QAS90019.1|2165800_2165890_+|YnhF-family-membrane-protein MSTDLKFSLVTTIIVLGLIVAVGLTAALH >CP034966.1|QAS90020.1|2166055_2167225_+|MFS-transporter MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRSALIFLMAIFTLGNVLSAIAPDYMTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTLANIGGVPAATWLGETIGWRMSFLATAGLGVISMVSLFFSLPKGGAGARPEVKKELAVLMRPQVLSALLTTVLGAGAMFTLYTYISPVLQSITHATPVFVTAMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEFGAAISMVVWGAATFAVVPPLQMRVMRVASEAPGLSSSVNIGAFNLGNALGAAAGGAVISAGLGYSFVPVMGAIVAGLALLLVFMSARKQPETVCVANS >CP034966.1|QAS90021.1|2167370_2167952_-|superoxide-dismutase-[Fe] MSFELPALPYAKDALAPHISAETIEYHYGKHHQTYVTNLNNLIKGTAFEGKSLEEIIRSSEGGVFNNAAQVWNHTFYWNCLAPNAGGEPTGKVAEAIAASFGSFADFKAQFTDAAIKNFGSGWTWLVKNSDGKLAIVSTSNAGTPLTTDATPLLTVDVWEHAYYIDYRNARPGYLEHFWALVNWEFVAKNLAA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034966_10 | 3128945-3129089 | Orphan |
NA
Consensus repeat of CP034966_10
|
1 spacers
spacers of CP034966_10
>10.1|3128997|41|CP034966|CRISPRCasFinder TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC |
CRISPR arrays and Neighbor proteins around CP034966_10
The CRISPR arrays of CP034966_10 >merge|CP034966|10|3128945-3129089|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGCTGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTCGTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC >CP034966|10|8|3128945-3129089|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC
>CP034966.1|QAS90890.1|3127595_3128879_+|putative-acyl-CoA-thioester-hydrolase MNTFSVSRLALALAFGVTLTACSSTPPDQRPSDQTAPGTSSRPILSAKEAQNFDAQHYFASLTPGAAAWNPSPITLPAQPDFVVGPAGTQGVTHTTIQAAVDAAIIKRTNKRQYIAVMPGEYQGTVYVPAAPGGITLYGTGEKPIDVKIGLSLDGGMSPADWRHDVNPRGKYMPGKPAWYMYDSCQSKRSDSIGVLCSAVFWSQNNGLQLQNLTIENTLGDSVDAGNHPAVALRTDGDQVQINNVNILGRQNTFFVTNSGVQNRLETNRQPRTLVTNSYIEGDVDIVSGRGAVVFDNTEFRVVNSRTQQEAYVFAPATLSNIYYGFLAVNSRFNAFGDGVAQLGRSLDVDANTNGQVVIRDSAINEGFNTAKPWADAVISNRPFAGNTGSVDDNDEIQRNLNDTNYNRMWEYNNRGVGSKVVAEAKK >CP034966.1|QAS90889.1|3126390_3127461_+|integrase MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >CP034966.1|QAS90888.1|3126194_3126413_+|excisionase MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >CP034966.1|QAS90887.1|3125987_3126155_+|hypothetical-protein MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >CP034966.1|QAS90886.1|3125869_3126055_-|hypothetical-protein MFSASITLLNGSPFHSPVTRKCIYHLHKTKPAVASSDKRNPRQCEDAVHCCYTLFCSQRKR >CP034966.1|QAS90885.1|3125142_3125745_-|hypothetical-protein MSYFLRKKWMVNLSGSGKILWALNMKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >CP034966.1|QAS90884.1|3124710_3124932_+|TraR/DksA-family-transcriptional-regulator MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >CP034966.1|QAS90883.1|3124330_3124612_+|cell-division-protein-ZapA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >CP034966.1|QAS90882.1|3124128_3124320_+|DUF1382-family-protein MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV >CP034966.1|QAS90881.1|3123973_3124156_+|DUF1317-family-protein MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >CP034966.1|QAS90891.1|3129112_3131374_-|hydratase MIKLSEKGVFLASNNEIIAEEHFTGEIKKEEAQKGTIAWSILSSHNTSGNMDKLKIKFDSLASHDITFVGIVQTAKASGMERFPLPYVLTNCHNSLCAVGGTINGDDHVFGLSAAQRYGGIFVPPHIAVIHQYMREMMAGGGKMILGSDSHTRYGALGTMAVGEGGGELVKQLLNDTWDIDYPGVVAVHLTGKPAPYVGPQDVALAIIGAVFKNGYVKNKVMEFVGPGVSALSTDFRNSVDVMTTETTCLSSVWQTDEEVHNWLALHGRGQDYCQLNPQPMAYYDGCISVDLSAIKPMIALPFHPSNVYKIDTLNQNLTDILREIEIESERVAHGKAKLSLLDKVENGRLKVQQGIIAGCSGGNYENVIAAANALRGQSCGNDTFSLAVYPSSQPVFMDLAQKGVVADLIGAGAIIRTAFCGPCFGAGDTPINNGLSIRHTTRNFPNREGSKPANGQMSAVALMDARSIAATAANGGYLTSASELDCWDNVPEYAFDVTPYKNRVYQGFVKGATQQPLIYGPNIKDWPELGALTDNIVLKVCSKILDEVTTTDELIPSGETSSYRSNPIGLAEFTLSRRDPGYVGRSKATAELENQRLAGNVSELTEVFARIKQIAGQEHIDPLQTEIGSMVYAVKPGDGSAREQAASCQRVIGGLANIAEEYATKRYRSNVINWGMLPLQMAEVPTFEVGDYIYIPGIKAALDNPGTTFKGYVIHEDAPVTEITLYMGSLTAEEREIIKAGSLINFNKNRQM >CP034966.1|QAS90892.1|3131556_3132990_-|anion-permease MNKKSLWKLILILAIPCIIGFMPAPAGLSELAWVLFGIYLAAIVGLVIKPFPEPVVLLIAVAASMVVVGNLSDGAFKTTAVLSGYSSGTTWLVFSAFTLSAAFVTTGLGKRIAYLLIGKIGNTTLGLGYVTVFLDLVLAPATPSNTARAGGIVLPIINSVAVALGSEPEKSPRRVGHYLMMSIYMVTKTTSYMFFTAMAGNILALKMINDILHLQISWGGWALAAGLPGIIMLLVTPLVIYTMYPPEIKKVDNKTIAKAGLAELGPMKIREKMLLGVFVLALLGWIFSKSLGVDESTVAIVVMATMLLLGIVTWEDVVKNKGGWNTLIWYGGIIGLSSLLSKVKFFEWLAEVFKNNLAFDGHGNVAFFVIIFLSIIVRYFFASGSAYIVAMLPVFAMLANVSGAPLMLTALALLFSNSYGGMVTHYGGAAGPVIFGVGYNDIKSWWLVGAVLTILTFLVHITLGVWWWNMLIGWNML >CP034966.1|QAS90893.1|3133065_3134118_-|4-oxalomesaconate-tautomerase MKKIPCVMMRGGTSRGAFLLAEHLPEDQTQRDKILMAIMGSGNDLEIDGIGGGNPLTSKVAIISRSSDLRADVDYLFAQVIVHEQRVDTTPNCGNMLSGVGAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQTPNGVVEYEGSARIDGVPGTAAPVALTFLNAAGTKTGKVFPTDNQIDYFDDVPVTCIDMAMPVVIIPAEYLGKTGYELPAELDADKALLARIESIRLQAGKAMGLGDVSNMVIPKPVLISPAQKGGAINVRYFMPHSCHRALAITGAIAISSSCALEGTVTRQIVPSVGYGNINIEHPSGALDVHLSNEGQDATTLRASVIRTTRKIFSGEVYLP >CP034966.1|QAS90894.1|3134301_3135255_+|LysR-family-transcriptional-regulator MKHELSSMKAFVILAESSSFNNAAKLLNITQPALTRRIKKMEEDLHIQLFERTTRKVTLTKAGKRLLPEARELIKKFDETLFNIRDMNAYHRGMVTLACIPTAVFYFLPLAIGKFNELYPNIKVRILEQGTNNCMESVLCNESDFGINMNNVTNSSIDFTPLVNEPFVLACRRDHPLAKKQLVEWQELVGYKMIGVRSSSGNRLLIEQQLADKPWKLDWFYEVRHLSTSLGLVEAGLGISALPGLAMPHAPYSSIIGIPLVEPVIRRTLGIIRRKDAVLSPAAERFFALLINLWTDDKDNLWTNIVERQRHALQEIG >CP034966.1|QAS90895.1|3135295_3136291_-|6-phosphogluconolactonase MKQTVYIASPESQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSPDKRYLYVGVRPEFRVLAYSIAPDDGALTFAAESALPGSPTHISTDHQGQFVFVGSYNAGNVSVTRLEDGLPVGVVDVVEGLDGCHSANISPDNRTLWVPALKQDRICLFTVSDDGHLVAQDPAEVTTVEGAGPRHMVFHPNEQYAYCVNELNSSVDVWELKDPHGNIECVQTLDMMPENFSDTRWAADIHITPDGRHLYACDRTASLITVFSVSEDGSVLSKEGFQPTETQPRGFNVDHSGKYLIAAGQKSHHISVYEIVGEQGLLHEKGRYAVGQGPMWVVVNAH >CP034966.1|QAS90896.1|3136445_3137264_+|pyridoxal-phosphatase MTTRVIALDLDGTLLTPKKTLLPSSIEALARAREAGYQLIIVTGRHHVAIHPFYQALALDTPAICCNGTYLYDYHAKTVLEADPMPVNKALQLIEMLNEHHIHGLMYVDDAMVYEHPTGHVIRTSNWAQTLPPEQRPTFTQVASLAETAQQVNAVWKFALTHDDLPQLQHFGKHVEHELGLECEWSWHDQVDIARGGNSKGKRLTKWVEAQGWSMENVVAFGDNFNDISMLEAAGTGVAMGNADDAVKARANIVIGDNTTDSIAQFIYSHLI >CP034966.1|QAS90897.1|3137264_3138323_-|molybdenum-ABC-transporter-ATP-binding-protein-ModC MLELNFSQTLGNHCLTINETLPANGITAIFGVSGAGKTSLINAISGLTRPQKGRIVLNGRVLNDAEKGICLTPEKRRVGYVFQDARLFPHYKVRGNLRYGMSKSMVDQFDKLVALLGIEPLLDRLPGSLSGGEKQRVAIGRALLTAPELLLLDEPLASLDIPRKRELLPYLQRLTREINIPMLYVSHSLDEILHLADRVMVLENGQVKAFGALEEVWGSSVMNPWLPKEQQSSILKVTVLEHHPHYAMTALALGDQHLWVNKLDEPLQAALRIRIQASDVSLVLQPPQQTSIRNVLRAKVVNSYDDNGQVEVELEVGGKTLWARISPWARDELAIKPGLWLYAQIKSVSITA >CP034966.1|QAS90898.1|3138325_3139015_-|molybdenum-ABC-transporter-permease MILTDPEWQAVLLSLKVSSLAVLFSLPFGIFFAWLLVRCTFPGKALLDSVLHLPLVLPPVVVGYLLLVSMGRRGFIGERLYDWFGITFAFSWRGAVLAAAVMSFPLMVRAIRLALEGVDVKLEQAARTLGAGRWRVFFTITLPLTLPGIIVGTVLAFARSLGEFGATITFVSNIPGETRTIPSAMYTLIQTPGGESGAARLCIISIALAMISLLISEWLARISRERAGR >CP034966.1|QAS90899.1|3139014_3139788_-|molybdate-ABC-transporter-substrate-binding-protein MARKWLNLFAGAALSFAVAGNALADEGKITVFAAASLTNAMQDIATQYKKEKGVDVVSSFASSSTLARQIEAGAPADLFISADQKWMDYAVDKKAIDTATRQTLLGNSLVVVAPKASEQKDFTIDSKTNWTSLLNGGRLAVGDPEHVPAGIYAKEALQKLGAWDTLSPKLAPAEDVRGALALVERNEAPLGIVYGSDAVASKGVKVVAIFPEDSHKKVEYPVAVVEGHNNATVKAFYDYLKGPQAAEIFKRYGFTTK >CP034966.1|QAS90900.1|3139954_3140104_-|multidrug-efflux-pump-associated-protein,-AcrZ-family MLELLKSLVFAVIMVPVVMAIILGLIYGLGEVFNIFSGVGKKDQPGQNH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034966_11 | 3648788-3648941 | Orphan |
NA
Consensus repeat of CP034966_11
|
1 spacers
spacers of CP034966_11
>11.1|3648841|48|CP034966|CRISPRCasFinder TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA |
CRISPR arrays and Neighbor proteins around CP034966_11
The CRISPR arrays of CP034966_11 >merge|CP034966|11|3648788-3648941|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCGTCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAACGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG >CP034966|11|9|3648788-3648941|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCG TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA CGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG
>CP034966.1|QAS91344.1|3646913_3648653_+|flagellar-type-III-secretion-system-protein-FlhA MLSRSDLLTLLTINFIVVTKGAERISEVSARFTLDAMPGKQMAIDADLNAGLINQAQAQTRRKDVASEADFYGAMDGASKFVRGDAIAGMMILAINLIGGVCIGIFKYNLSADAAFQQYVLMTIGDGLVAQIPSLLLSTAAAIIVTRISDNGDITHDVRHQLLASPSVLYTATGIMFVLAVVPGMPHLPFLLFSALLGFTGWRMSKRPQAAEAEEKSLETLTRTITETSEQQVSWETIPLIEPISLSLGYKLVALVDKAQGNPLTQRIRGVRQVISDGNGVLLPEIRIRENFRLKPSQYAIFINGIKADEADIPADKLMALPSSETYGEIDGVLGNDPAYGMPVTWIQPAQKAKALNMGYQVIDSASVIATHVNKIVRSYIPDLFSYDDITQLHNRLSSMAPRLAEDLSAALNYSQLLKVYRALLTEGVSLRDIVTIATVLVASSAVTKDHILLAADVRLALRRSITHPFVRKQELTVYTLNNELENLLTNVVNQAQQGGKVMLDSVPVDPNMLNQFQSTMPQVKEQMKAAGKDPVLLVPPQLRPLLARYARLFAPGLHVLSYNEVPDELELKIMGALM >CP034966.1|QAS91343.1|3646183_3646954_-|putative-lateral-flagellar-export/assembly-protein-LafU MIVNSVSKSERESIIAALHGQSIFSGGGLSPLNKISPSHPPKPATVAVPEETEKKARDVNEKTALLKKKSATELGELATSINTIARDAHMEANLEMEIVPQGLRVLIKDDQNRNMFECGSAQIMPFFKTLLVELAPVFDSLDNKIIITGHTDAMAYKNNIYNNWNLSGDRALSARRVLEEAGMPEDKVMQVSAMADQMLLDAKNPQSAGNRRIEIMVLTKSASDTLYQYFGQHGDKVVQPLVQKLDKQQVLSQRMR >CP034966.1|QAS91342.1|3645057_3646113_-|DNA-polymerase-IV MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARKFGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPLSLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELHLTASAGVAPVKFLAKIASDMNKPNGQFVITPAEVPAFLQTLPLAKIPGVGKVSAAKLEAMGLRTCGDVQKCDLVILLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMAEDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQEHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLGL >CP034966.1|QAS91341.1|3644608_3645061_-|GNAT-family-N-acetyltransferase MNNIQIRNYQPGDFQQLCAIFIRAVMMTASQHYSPQQIAAWAQIDESRWKEKLAKSQVRVAVINAQPVGFISRIERHIDMLFVDPEYTRRGVASALLKPLIKSESELTVDASITAKPFFERYGFQIVKQQHVECRGAWFTNFYMRYKPQH >CP034966.1|QAS91340.1|3644035_3644302_-|hypothetical-protein MEWYMGKYIRPLSDAVFTIASDDLWIESLAIQQLHTTANLPNMQRVVGMPDLHPGRGYPIGAAFFSVGRFYPARRRGNGAGNRNGPLL >CP034966.1|QAS91339.1|3642221_3643679_+|cytosol-nonspecific-dipeptidase MSELSQLSPQPLWDIFAKICSIPHPSYHEEQLAEYIVGWAKEKGFHVERDQVGNILIRKPATAGMENRKPVVLQAHLDMVPQKNNDTVHDFTKDPIQPYIDGEWVKARGTTLGADNGIGMASALAVLADENVVHGPLEVLLTMTEEAGMDGAFGLQSNWLQADILINTDSEEEGEIYMGCAGGIDFTSNLHLDREAVPAGFETFKLTLKGLKGGHSGGEIHVGLGNANKLLVRFLAGHAEELDLRLIDFNGGTLRNAIPREAFATIAVAADKVDALKSLVNTYQDILKNELAEKEKNLALLLDSVANDKAALIAKSRDTFIRLLNATPNGVIRNSDVAKGVVETSLNVGVVTMTDNNVEIHCLIRSLIDSGKDYVVSMLDSLGKLAGAKTEAKGAYPGWQPDANSPVMHLVRETYQRLFNKTPNIQIIHAGLECGLFKKPYPEMDMVSIGPTITGPHSPDEQVHIKSVGHYWTLLTELLKEIPAK >CP034966.1|QAS91338.1|3641502_3641961_-|xanthine-phosphoribosyltransferase MSEKYIVTWDMLQIHARKLASRLMPSEQWKGIIAVSRGGLVPGALLARELGIRHVDTVCISSYDHDNQRELKVLKRAEGDGEGFIVIDDLVDTGGTAVAIREMYPKAHFVTIFAKPAGRPLVDNYVVDIPQDTWIEQPWDMGVVFVPPISGR >CP034966.1|QAS91337.1|3640166_3641411_-|esterase-FrsA MTQANLSETLFKPRFKHPETSTLVRRFNHGAQPPVQSALDGKTIPHWYRMINRLMWIWRGIDPREILDVQARIVMSDAERTDDDLYDTVIGYRGGNWIYEWATQAMVWQQKACAEEDPQLSGRHWLHAATLYNIAAYPHLKGDDLAEQAQALSNRAYEEAAQRLPGTMRQMEFTVPGGAPITGFLHMPKGDGPFPTVLMCGGLDAMQTDYYSLYERYFAPRGIAMLTIDMPSVGFSSKWKLTQDSSLLHQHVLKALPNVPWVDHTRVAAFGFRFGANVAVRLAYLESPRLKAVACLGPVVHTLLSDFKCQQQVPEMYLDVLASRLGMHDASDDALRVELNRYSLKVQGLLGRRCPTPMLSGYWKNDPFSPEEDSRLITSSSADGKLLEIPFNPVYRNFDKGLQEITGWIEKRLC >CP034966.1|QAS91336.1|3639707_3640109_-|sigma-factor-binding-protein-Crl MTLPSGHPKSRLIKKFTALGPYIREGKCEDNRFFFDCLAVCVNVKPAPEVREFWGWWMELEAQESRFTYSYQFGLFDKAGDWKSVPVKDTEVVERLEHTLREFHEKLRELLTTLNLKLEPADDFRDEPVKLTA >CP034966.1|QAS91335.1|3638613_3639669_+|phosphoporin-PhoE MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNDSKDGDQSYIRFGFKGETQINDQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNENRDVKKQNGDGFGTSLTYDFGGSDFAISGAYTNSDRTNEQNLQSRGTGKRAEAWATGLKYDANNIYLATFYSETRKMTPITGGFANKTQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGIGDEDLVNYIDVGATYYFNKNMSAFVDYKINQLDSDNKLNINNDDIVAVGMTYQF >CP034966.1|QAS91345.1|3648970_3649468_-|transposase MSEYRRYYIKGGTWFFTVNLRNRRSQLLTTQYQMLRHAIIKVKRDRPFEINAWVVLPEHMHCIWTLPEGDDDFSSRWREIKKQFTHACGLKNIWQPRFWEHAIRNTKDYRHHVDYIYINPVKHGWVKQVSDWPFSTFHRDVARGLYPIDWAGDVTDINAGERIIL >CP034966.1|QAS91346.1|3649643_3650402_-|peptidoglycan-endopeptidase MSFMSSFLLGRFLHPGVFSLCVLLPLFASATTSHISFSYAARQRMQNRARLLKQYQTHLKKQASYIVEGNAESRRALRQHNREQIKQHPEWFPAPLKASDRRWQALAENNHFLSSDHLHNITEVAIHRLEQQLGKPYVWGGTRPDQGFDCSGLVFYAYNKILEAKLPRTANEMYHYHRATIVANNDLRRGDLLFFHIHSREIADHMGVYLGDGQFIESPRTGENIRVSRLAEPFWQDHFLGARRILTEETIL >CP034966.1|QAS91347.1|3650693_3651434_+|murein-L,D-transpeptidase MRKIALILAMLLIPCVSFAGLLGSSSSTTPVSKEYKQQLMGSPVYIQIFKEERTLDLYVKMGEQYQLLDSYKICKYSGGLGPKQRQGDFKSPEGFYSVQRNQLKPDSRYYKAINIGFPNAYDRAHGYEGKYLMIHGDCVSIGCYAMTNQGIDEIFQFVTGALVFGQPSVQVSIYPFRMTDANMKRHKYSNFKDFWEQLKPGYDYFEQTRKPPTVSVVNGRYVVSKPLSHEVVQPQLASNYTLPEAK >CP034966.1|QAS91348.1|3651404_3652172_-|class-II-glutamine-amidotransferase MCELLGMSANVPTDICFSFTGLVQRGGGTGPHKDGWGITFYEGKGCRTFKDPQPSFNSPIAKLVQDYPIKSCSVVAHIRQANRGEVALENTHPFTRELWGRNWTYAHNGQLTGYKSLETGNFRPVGETDSEKAFCWLLHKLTQRYPRTPGNMAAVFKYIASLADELRQKGVFNMLLSDGRYVMAYCSTNLHWITRRAPFGVATLLDQDVEIDFSSQTTPNDVVTVIATQPLTGNETWQKIMPGEWRLFCLGERVV >CP034966.1|QAS91349.1|3652377_3652956_-|D-sedoheptulose-7-phosphate-isomerase MYQDLIRNELNEAAETLANFLKDDANIHAIQRAAVLLADSFKAGGKVLSCGNGGSHCDAMHFAEELTGRYRENRPGYPAIAISDVSHISCVGNDFGFNDIFSRYVEAVGREGDVLLGISTSGNSANVIKAIAAAREKGMKVITLTGKDGGKMAGTADIEIRVPHFGYADRIQEIHIKVIHILIQLIEKEMVK >CP034966.1|QAS91350.1|3653195_3655640_+|acyl-CoA-dehydrogenase MMILSILATVVLLGALFYHRVSLFISSLILLAWTAALGVAGLWSAWVLVPLAIILVPFNFAPMRKSMISAPVFRGFRKVMPPMSRTEKEAIDAGTTWWEGDLFQGKPDWKKLHNYPQPRLTAEEQAFLDGPVEEACRMANDFQITHELADLPPELWAYLKEHRFFAMIIKKEYGGLEFSAYAQSRVLQKLSGVSGILAITVGVPNSLGPGELLQHYGTDEQKNHYLPRLARGQEIPCFALTSPEAGSDAGAIPDTGIVCMGEWQGQQVLGMRLTWNKRYITLAPIATVLGLAFKLSDPEKLLGGAEDLGITCALIPTTTPGVEIGRRHFPLNVPFQNGPTRGKDVFVPIDYIIGGPKMAGQGWRMLVECLSVGRGITLPSNSTGGVKSVALATGAYAHIRRQFKISIGKMEGIEEPLARIAGNAYVMDAAASLITYGIMLGEKPAVLSAIVKYHCTHRGQQSIIDAMDITGGKGIMLGQSNFLARAYQGAPIAITVEGANILTRSMMIFGQGAIRCHPYVLEEMEAAKNNDVNAFDKLLFKHIGHVGSNKVRSFWLGLTRGLTSSTPTGDATKRYYQHLNRLSANLALLSDVSMAVLGGSLKRRERISARLGDILSQLYLASAVLKRYDDEGRNEADLPLVHWGVQDALYQAEQAMDDLLQNFPNRVVAGLLNVVIFPTGRHYLAPSDKLDHKVAKILQVPNATRSRIGRGQYLTPSEHNPVGLLEEALVDVIAADPIHQRICKELGKNLPFTRLDELAHNALAKGLIDKDEAAILVKAEESRLCSINVDDFDPEELATKPVKLPEKVRKVEAA >CP034966.1|QAS91351.1|3655682_3656156_-|C-lysozyme-inhibitor MGRISSGGMMFKAITTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQMVQGHKLPAWVMKGGTYTPAQTVTLGDETYQVMSACKPHDCGSQRIAVMWSEKSNQMTGLFSTIDEKTSQEKLTWLNVNDALSIDGKTVLFAALTGSLENHPDGFNFK >CP034966.1|QAS91352.1|3656309_3657080_+|2-oxoglutaramate-amidase MPGLKITLLQQPLVWMDGPANLRHFDRQLEGITGRDVIVLPEMFTSGFAMEAAASSLAQNDVVNWMTAKAQQCNALIAGSVALQTESGSVNRFLLVEPGGTVHFYDKRHLFRMADEHLHYKAGNARVIVEWRGWRILPLVCYDLRFPVWSRNLNDYDLAIYVANWPAPRSLHWQALLTARAIENQAYVAGCNRVGSDGNGCHYRGDSRVINPQGEIIATADAHQATRIDAELSMVALREYREKFPAWQDADEFRLR >CP034966.1|QAS91353.1|3658518_3658968_-|hypothetical-protein MMKYLMVLLSLFSGSVLGMGRVNELCGIDSVKTIEIINLPSYVTTLVPLSKEGLNEIYRYKVVVNEISDLYAGKIIDLLQMKYFRKEKYNNIRWGVSIISKGNNKCEIYFDAFGECGSVNGINVCFEKNEMIGWIKKEIPLLSQKIGGL >CP034966.1|QAS91354.1|3658979_3662039_-|RHS-repeat-protein MTSPLNSEGRYTEGEGGLKRVVKKEHADGSITRSEYDEAGRLKAQTDAAGRRTEYSLHMASGAVTAVTGPDGRTVRYGYNSQRQVTSVTYPDGLRSSREYDEKGRLTAETSRSGETTRYSYDDPASELPTGIQDATGSTKQMAWSRYGQLLAFTDCSGYTTRYEYDRYGQQIAVHREEGISTYSSYNPRGQLVSQKDAQGREIRYEYSAAGDLTATISPDGKRSTIEYDKRGRPVSVTEGGLTRSMGYDAAGRITVLTNENGSQSTFRYDPVDRLTEQRGFDGRTQRYHYDLTGKLTQSEDEGLITLWHYDASDRITHRTVNGDPAEQWQYDEHGWLTTLSHTCEGHRVSVHYGYDDKGRLTGERQTVENPETGEMLWEHETGHAYSEQGLATRQEPDGLPPVEWLTYGSGYLAGMKLGGTPLVEYTRDRLHRETARSFGGAGSTAGYEQATAYTLTGQLQSRHLNLPQLDCDYTWNDNGQLVRISGPQECREYRYSGTGRLTGVHTTAANLDIDIPYATDPAGNRLPDPELHPDSTLTAWPDNRIAEDAHYVYRYDEYGRLAEKTDRIPEGVIRMHDERTHHYHYDSQHRLVFYTRIQHGEPQVESRYLYDPLGRRTGKRVWRRERDLTGWMSLSRKPEETWYGWDGDRLTTVQTQQTRIQTVYQPGSFTPLLRIETENGEQAKARHRSLAEVLQEDTGVTLPAELAVMLGRLERELRQGSVSEESQQWLAQCGLTAEQMAAQLEAEYIPERKLHLYHCDHRGLPLALISPEGETAWQGEYDEWGNLLGEESAQHLQQSLRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLRGEWNLYKYPLNPVRFIDSLGLKFHVNGDPSDFNQAVEYLKQDSQMKETIDFLSSSEETINIEYIEGTNVRFNSNNMAIYWNSRASLFCSTELNSKSQSPALGLGHEFAHAQYYLLDKENFMALLSRTDKKYENKEEARVITIIESRAAKTLGECTRGAHSGLPFYRVDGPLQTMKITGTPE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP034966_12 | 3838853-3838968 | Orphan |
NA
Consensus repeat of CP034966_12
|
1 spacers
spacers of CP034966_12
>12.1|3838884|54|CP034966|CRISPRCasFinder TGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTC |
CRISPR arrays and Neighbor proteins around CP034966_12
The CRISPR arrays of CP034966_12 >merge|CP034966|12|3838853-3838968|CRISPRCasFinder AACGCCTGATGCGACGCTGACGCGTCTTATCTGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTCAACGCCTGATGCGACGCTGGCGCGTCTTATC >CP034966|12|10|3838853-3838968|CRISPRCasFinder AACGCCTGATGCGACGCTGACGCGTCTTATC TGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTC AACGCCTGATGCGACGCTGGCGCGTCTTATC
>CP034966.1|QAS91498.1|3837330_3838833_+|L-arabinose-isomerase MTIFDNYEVWFVIGSQHLYGPETLRQVTQHAEHVVNALNTEAKLPCKLVLKPLGTTPDEITAICRDANYDDRCAGLVVWLHTFSPAKMWINGLTMLNKPLLQFHTQFNAALPWDSIDMDFMNLNQTAHGGREFGFIGARMRQQHAVVTGHWQDKQAHERIGSWMRQAVSKQDTRHLKVCRFGDNMREVAVTDGDKVAAQIKFGFSVNTWAVGDLVQVVNSISDGDVNALVDEYESCYTMTPATQIHGEKRQNVLEAARIELGMKRFLEQGGFHAFTTTFEDLHGLKQLPGLAVQRLMQQGYGFAGEGDWKTAALLRIMKVMSTGLQGGTSFMEDYTYHFEKGNDLVLGSHMLEVCPSIAVEEKPILDVQHLGIGGKDDPARLIFNTQTGPAIVASLIDLGDRYRLLVNCIDTVKTPHSLPKLPVANALWKAQPDLPTASEAWILAGGAHHTVFSHALNLNDMRQFAEMHDIEITVIDNDTRLPAFKDALRWNEVYYGFRR >CP034966.1|QAS91497.1|3835619_3837320_+|ribulokinase MAIAIGLDFGSDSVRALAVDCASGEEIATSVEWYPRWQKGQFCDAPNNQFRHHPRDYIESMEAALKTVLAELSVEQRAAVVGIGVDTTGSTPAPIDADGNVLALRPEFAENPNAMFVLWKDHTAVEEAEEITRLCHAPGNVDYSRYIGGIYSSEWFWAKILHVTRQDSAVAQSAASWIELCDWVPALLSGTTGPQDIRRGRCSAGHKSLWHESWGGLPPASFFDELDPILNRHLPSPLFTDTWTADIPVGTLCPEWAQRLGLPESVVISGGAFDCHMGAVGAGAQPNALVKVIGTSTCDILIADKQSVGERAVKGICGQVDGSVVPGFIGLEAGQSAFGDIYAWFGRVLGWPLEQLAAQHPELKAQINASQKQLLPALTEAWAKNPSLDHLPVVLDWFNGRRTPNANQRLKGVITDLNLATDAPLLFGGLIAATAFGARAIMECFTDQGIAVNNVMALGGIARKNQVIMQACCDVLNRPLQIVASDQCCALGAAIFAAVAAKVHADIPSAQQKMASAVEKTLQPCSEQAQRFEQLYRRYQQWAMSAEQHYLPTSAPAQAAQAVPTL >CP034966.1|QAS92362.1|3834402_3835281_-|arabinose-operon-transcriptional-regulator-AraC MAEAQNDPLLPGYSFNAHLVAGLTPIEANGYLDFFIDRPLGMKGYILNLTIRGQGVVKNQGREFVCRPGDILLFPPGEIHHYGRHPEAREWYHQWVYFRPRAYWHEWLNWPSIFANTGFFRPDEAHQPHFSDLFGQIINAGQGEGRYSELLAINLLEQLLLRRMEAINESLHPPMDNRVREACQYISDHLADSNFDIASVAQHVCLSPSRLSHLFRQQLGISVLSWREDQRISQAKLLLSTTRMPIATVGRNVGFDDQLYFSRVFKKCTGASPSEFRAGCEEKVNDVAVKLS >CP034966.1|QAS91496.1|3833552_3834317_-|DedA-family-protein MQALLEHFITQSTVYSLMAVVLVAFLESLALVGLILPGTVLMAGLGALIGSGELSFWHAWLAGIVGCLLGDWISFWLGWRFKKPLHRWSFLKKNKALLDKTEHALHQHSMFTILVGRFVGPTRPLVPMVAGMLDLPVAKFITPNIIGCLLWPPFYFLPGILAGAAIDIPAGMQSGEFKWLLLATAVFLWVGGWLCWRLWRSGKATDRLSHYLSRGRLLWLTPLISAIGVVALVVLIRHPLMPVYIDILRKVVGG >CP034966.1|QAS91495.1|3832740_3833439_+|thiamine-ABC-transporter-ATP-binding-protein-ThiQ MLKLTDITWLYHHLPMRFSLTVERGEQVAILGPSGAGKSTLLNLIAGFLTPASGSLTIDGVDHTTTPPSRRPVSMLFQENNLFSHLTVAQNIGLGLNPGLKLNAAQQEKMHAIARQMGIDNLMARLPGELSGGQRQRVALARCLVREQPILLLDEPFSALDPALRQEMLTLVSTSCQQQKMTLLMVSHSVEDAARIATRSVVVADGRIAWQGKTNELLSGKASASALLGITG >CP034966.1|QAS91494.1|3831146_3832757_+|thiamine/thiamine-pyrophosphate-ABC-transporter-permease-ThiP MATRRQPLIPGWLIPGVSAATLVVAVALAAFLALWWNAPQGNWVAVWQDSYLWHVVRFSFWQAFLSALLSVVPAIFLARALYRRRFPGRLALLRLCAMTLILPVLVAVFGILSVYGRQGWLASLCQSLGLEWTFSPYGLQGILLAHVFFNLPMASRLLLQALENIPGEQRQLAAQLGMRGWHFFRFVEWPWLRRQIPPVAALIFMLCFASFATVLSLGGGPQATTIELAIYQALSYDYDPARAAMLALIQMVCCLGLVLLSQRLSKAIAPGTTLLQGWRDPDDRLHSRICDTVLIVLALLLLLPPLLAVIVDGVNRQLPEVLAQPVLWQALWTSLRIALAAGVLCVVLTMMLLWSSRELRARQKMLAGQALEMSGMLILAMPGIVLATGFFLLLNNTIGLPQSADGIVIFTNALMAIPYALKVLENPMRDITARYSMLCQSLGIEGWSRLKVVELRALKRPLAQALAFACVLSIGDFGVVALFGNDDFRTLPFYLYQQIGSYRSQDGAVTALILLLLCFLLFTVIEKLPGRNVKTD >CP034966.1|QAS91493.1|3830187_3831171_+|thiamine-ABC-transporter-substrate-binding-subunit MLKKCLPLLLLCTAPVFAKPVLIVYTYDSFAADWGPGPKIKKAFEADCNCELKLVALEDGVSLLNRLRMEGKNSKADVVLGLDNNLLDAASKTGLFAKSGVAADAVNVPGGWNNDTFVPFDYGYFAFVYDKNKLKNPPQSLKELVESDQNWRVIYQDPRTSTPGLGLLLWMQKVYGDDAPQAWQKLAKKTVTVTKGWSEAYGLFLKGESDLVLSYTTSPAYHILEEKKDNYAAANFSEGHYLQVEVAARTAASKQPELAQKFLQFMVSPAFQNAIPTGNWMYPVANVTLPAGFEQLTKPATTLEFTPAEVAAQRQAWISEWQRAVSR >CP034966.1|QAS91492.1|3828368_3830024_+|HTH-type-transcriptional-regulator-SgrR MPSARLQQQFIRLWQCCEGKSQDTTLNELAALLSCSRRHMRTLLNTMQDRGWLTWEAEVGRGKRSRLTFLYTGLALQQQRAEDLLEQDRIDQLVQLVGDKATVRQMLVSHLGRSFRQGRHILRVLYYRPLRNLLPGSALRRSETHIARQIFSSLTRINEENGELEADIAHHWQQISPLHWRFFLRPGVHFHHGRELEMDDVIASLKRINTLPLYSHIADIVSPTPWTLDIHLTQPDRWLPLLLGQVPAMILPREWETLSNFASHPIGTGPYAVIRNTTNQLKIQAFDDFFGYRALIDEVNVWVLPEIADEPAGGLMLKGPQGEEKEIESRLEEGCYYLLFDSRTHRGANQQVRDWVSYVLSPTNLVYFAEEQYQQLWFPAYGLLPRWHHARTIKSEKPAGLESLTLTFYQDHSEHRVIAGIMQQILASHQVTLEIKEISYDQWHEGEIESDIWLNSANFTLPLDFSLFAHLCEVPLLQHCIPIDWQADAARWRNGEMNLANWCQQLVASKAMVPLIHHWLIIQGQRSMRGLRMNTLGWFDFKSAWFAPPDP >CP034966.1|QAS91491.1|3828148_3828280_-|glucose-uptake-inhibitor-SgrT MRQFYQHYFTATAKLCWLRWLSVPQRLTMLEGLMQWDDRNSES >CP034966.1|QAS91490.1|3826868_3828047_-|MFS-transporter MIWIMTMARRMNGVYAAFMLVAFMMGVAGALQAPTLSLFLSREVGAQPFWIGLFYTVNAIAGIGVSLWLAKRSDSQGDRRKLIIFCCLMAIGNALLFAFNRHYLTLITCGVLLASLANTAMPQLFALAREYADNSAREVVMFSSVMRAQLSLAWVIGPPLAFMLALNYGFTVMFSIAAGIFTLSLVLIAFMLPSVARVELPSENALSMQGGWQDSNVRMLFVASTLMWTCNTMYIIDMPLWISSELGLPDKLAGFLMGTAAGLEIPAMILAGYYVKRYGKRRMMVIAVAAGVLFYTGLIFFHSRMALMTLQLFNAVFIGIVAGIGMLWFQDLMPGRAGAATTLFTNSISTGVILAGVIQGAIAQSWGHFAVYWVIAVISVVALFLTAKVKDV >CP034966.1|QAS91499.1|3839032_3839728_+|L-ribulose-5-phosphate-4-epimerase MLEDLKRLVLEANLALPKHNLVTLTWGNVSAVDRERGVFVIKPSGVDYSVMTADDMVVVSIATGEVVEGTKKPSSDTPTHRLLYQAFPSIGGIVHTHSRHATIWAQAGQSIPATGTTHADYFYGTIPCTRKMTDAEINGEYEWETGNVIVETFEKQGIDAAQMPGVLVHSHGPFAWGKNAEDAVHNAIVLEEVAYMGIFCRQLAPQLPDMQQTLLDKHYLRKHGAKAYYGQ >CP034966.1|QAS91500.1|3839802_3842154_+|DNA-polymerase-II MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQVPRAQHILQGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGGVTVYEADVRPPERYLMERFITSPVWVEGDIRNGAIVNARLKPHPDYRPPLKWVSIDIETTRHGELYCIGLEGCGQRIVYMLGPENGDASALDFELEYVASRPQLLEKLNAWFANYDPDVIIGWNVVQFDLRMLQKHAERYRIPLRLGRDNSELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQELLGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMPFLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHASPGGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHSTEGFLDAWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALKIIMNAFYGVLGTTACRFFDPRLASSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTFVWLKGAHSEEEAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCRFLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQELYLRIFRNEPYQEYIRETIDKLMAGELDARLVYRKRLRRPLSEYQRNVPPHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYEHYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF >CP034966.1|QAS91501.1|3842318_3845225_+|RNA-polymerase-associated-protein-RapA MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPVTRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREVFLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQRTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAAERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQLVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIEQLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYRPVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSARQELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKVSGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTSHRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWFAEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRIGQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDLINYLASPDQTEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSNGGEKAQALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPDFPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSSTISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGNNLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSARALIDAARNEADEKLSAELSRLEALRAVNPNIRDDELTAIESNRQQVMESLDQAGWRLDALRLIVVTHQ >CP034966.1|QAS91502.1|3845236_3845896_+|bifunctional-tRNA-pseudouridine(32)-synthase/23S-rRNA-pseudouridine(746)-synthase-RluA MGMENYNPPQEPWLVILYQDDHIMVVNKPSGLLSVPGRLEEHKDSVMTRIQRDYPQAESVHRLDMATSGVIVVALTKAAERELKRQFREREPKKQYVARVWGHPSPAEGLVDLPLICDWPNRPKQKVCYETGKPAQTEYEVVEYAADNTARVVLKPITGRSHQLRVHMLALGHPILGDRFYASPEARAMAPRLLLHAEMLTITHPAYGNSMTFKAPADF >CP034966.1|QAS91503.1|3846012_3846828_-|co-chaperone-DjlA MQYWGKIIGVAVALLMGGGFWGVVLGLLIGHMFDKARSRKMAWFANQRERQALFFATTFEVMGHLTKSKGRVTEADIHIASQLMDRMNLHGASRTAAQNAFRVGKSDNYPLREKMRQFRSVCFGRFDLIRMFLEIQIQAAFADGSLHPNERAVLYVIAEELGISRAQFDQFLRMMQGGAQFGGGYQQQTGGGNWQQAQRGPTLEDACNVLGVKPTDDATTIKRAYRKLMSEHHPDKLVAKGLPPEMMEMAKQKAQEIQQAYELIKQQKGFK >CP034966.1|QAS91504.1|3847082_3849437_+|LPS-assembly-protein-LptD MKKRIPTLLATMIATALYSQQGLAADLASQCMLGVPSYDRPLVQGDTNDLPVTINADHAKGDYPDDAVFTGSVDIMQGNSRLQADEVQLHQKEAPGQPEPVRTVDALGNVHYDDNQVILKGPKGWANLNTKDTNVWEGDYQMVGRQGRGKADLMKQRGENRYTILDNGSFTSCLPGSDTWSVVGSEIIHDREEQVAEIWNARFKVGPVPIFYSPYLQLPVGDKRRSGFLIPNAKYTTTNYFEFYLPYYWNIAPNMDATITPHYMHRRGNIMWENEFRYLSQAGAGLMELDYLPSDKVYEDEHPNDDSSRRWLFYWNHSGVMDQVWRFNVDYTKVSDPSYFNDFDNKYGSSTDGYATQKFSVGYAVQNFNATVSTKQFQVFSEQNTSSYSAEPQLDVNYYQNDVGPFDTRIYGQAVHFVNTRDDMPEATRVHLEPTINLPLSNNWGSINTEAKLLATHYQQTNLDWYNSRNTTKLDESVNRVMPQFKVDGKMVFERDMEMLAPGYTQTLEPRAQYLYVPYRDQSDIYNYDSSLLQSDYSGLFRDRTYGGLDRIASANQVTTGVTSRIYDDAAVERFNISVGQIYYFTESRTGDDNITWENDDKTGSLVWAGDTYWRISERWGLRGGIQYDTRLDNVATSNSSIEYRRDEDRLVQLNYRYASPEYIQATLPKYYSTAEQYKNGISQVGAVASWPIADRWSIVGAYYYDTNANKQADSMLGVQYSSCCYAIRVGYERKLNGWDNDKQHAVYDNAIGFNIELRGLSSNYGLGTQEMLRSNILPYQNSL >CP034966.1|QAS91505.1|3849489_3850776_+|peptidylprolyl-isomerase-SurA MKNWKTLLLGIAMIANTSFAAPQVVDKVAAVVNNGVVLESDVDGLMQSVKLNAAQARQQLPDDATLRHQIMERLIMDQIILQMGQKMGVKISDEQLDQAIANIAKQNNMTLDQMRSRLAYDGLNYNTYRNQIRKEMIISEVRNNEVRRRITILPQEVESLAQQVGNQNDASTELNLSHILIPLPENPTSDQVNEAESQARAIVDQARNGADFGKLAIAHSADQQALNGGQMGWGRIQELPGIFAQALSTAKKGDIVGPIRSGVGFHILKVNDLRGESKNISVTEVHARHILLKPSPIMTDEQARVKLEQIAADIKSGKTTFAAAAKEFSQDPGSANQGGDLGWATADIFDPAFRDALTRLNKGQMSAPVHSSFGWHLIELLDTRNVDKTDAAQKDRAYRMLMNRKFSEEAASWMQEQRASAYVKILSN >CP034966.1|QAS91506.1|3850775_3851765_+|4-hydroxythreonine-4-phosphate-dehydrogenase-PdxA MVKTQRVVITPGEPAGIGPDLVVQLAQREWPVELVVCADATLLTDRAAMLGLPLTLRTYSPNSPAQPQTAGTLTLLPVALRESVTAGQLAVENGHYVVETLARACDGCLNGEFAALITGPVHKGVINDAGIPFTGHTEFFEERSQAKKVVMMLATEELRVALATTHLPLRDIADAITPALLHEVIAILHHDLRTKFGIAEPRILVCGLNPHAGEGGHMGTEEIDTIIPLLDELRAQGMKLNGPLPADTLFQPKYLDNADAVLAMYHDQGLPVLKYQGFGRGVNITLGLPFIRTSVDHGTALELAGRGEADVGSFITALNLAIKMIVNTQ >CP034966.1|QAS91507.1|3851761_3852583_+|16S-rRNA-(adenine(1518)-N(6)/adenine(1519)-N(6))--dimethyltransferase-RsmA MNNRVHQGHLARKRFGQNFLNDQFVIDSIVSAINPQKGQAMVEIGPGLAALTEPVGERLDQLTVIELDRDLAARLQTHPFLGPKLTIYQQDAMTFNFGELAEKMGQPLRVFGNLPYNISTPLMFHLFSYTDAIADMHFMLQKEVVNRLVAGPNSKAYGRLSVMAQYYCNVIPVLEVPPSAFTPPPKVDSAVVRLVPHATMPHPVKDVRVLSRITTEAFNQRRKTIRNSLGNLFSVEVLTGMGIDPAMRAENISVAQYCQMANYLAENAPLQES >CP034966.1|QAS91508.1|3852585_3852963_+|Co2+/Mg2+-efflux-protein-ApaG MINSPRVCIQVQSVYIEAQSSPDNERYVFAYTVTIRNLGRAPVQLLGRYWLITNGNGRETEVQGEGVVGVQPLIAPGEEYQYTSGAIIETPLGTMQGHYEMIDENGVPFSIDIPVFRLAVPTLIH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CP034966_9 | 9.2|3117613|26|CP034966|PILER-CR | 3117613-3117638 | 26 | CP034966.1 | 2687848-2687873 | 0 | 1.0 |
1. spacer 9.2|3117613|26|CP034966|PILER-CR matches to position: 2687848-2687873, mismatch: 0, identity: 1.0
atttacctctttcaggtaaactttat CRISPR spacer atttacctctttcaggtaaactttat Protospacer **************************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP034966_8 | 8.1|2823350|40|CP034966|CRISPRCasFinder | 2823350-2823389 | 40 | NZ_CP041417 | Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence | 47951-47990 | 0 | 1.0 |
CP034966_9 | 9.1|3117566|25|CP034966|PILER-CR | 3117566-3117590 | 25 | NZ_AP023208 | Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence | 44572-44596 | 0 | 1.0 |
CP034966_9 | 9.2|3117613|26|CP034966|PILER-CR | 3117613-3117638 | 26 | NC_049946 | Escherichia virus Lambda_4A7 genome assembly, chromosome: 1 | 37604-37629 | 0 | 1.0 |
CP034966_9 | 9.2|3117613|26|CP034966|PILER-CR | 3117613-3117638 | 26 | LR595861 | Escherichia virus Lambda_4C10 genome assembly, chromosome: 1 | 37887-37912 | 0 | 1.0 |
CP034966_9 | 9.2|3117613|26|CP034966|PILER-CR | 3117613-3117638 | 26 | NZ_AP023208 | Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence | 44524-44549 | 1 | 0.962 |
CP034966_7 | 7.1|2156824|38|CP034966|CRISPRCasFinder | 2156824-2156861 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 113727-113764 | 2 | 0.947 |
CP034966_9 | 9.1|3117566|25|CP034966|PILER-CR | 3117566-3117590 | 25 | NC_049946 | Escherichia virus Lambda_4A7 genome assembly, chromosome: 1 | 37652-37676 | 2 | 0.92 |
CP034966_9 | 9.1|3117566|25|CP034966|PILER-CR | 3117566-3117590 | 25 | LR595861 | Escherichia virus Lambda_4C10 genome assembly, chromosome: 1 | 37935-37959 | 2 | 0.92 |
CP034966_11 | 11.1|3648841|48|CP034966|CRISPRCasFinder | 3648841-3648888 | 48 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 4089-4136 | 3 | 0.938 |
CP034966_11 | 11.1|3648841|48|CP034966|CRISPRCasFinder | 3648841-3648888 | 48 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 4088-4135 | 3 | 0.938 |
CP034966_11 | 11.1|3648841|48|CP034966|CRISPRCasFinder | 3648841-3648888 | 48 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 4088-4135 | 3 | 0.938 |
CP034966_11 | 11.1|3648841|48|CP034966|CRISPRCasFinder | 3648841-3648888 | 48 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 4088-4135 | 3 | 0.938 |
CP034966_9 | 9.2|3117613|26|CP034966|PILER-CR | 3117613-3117638 | 26 | NZ_CP015341 | Lactobacillus brevis strain 100D8 plasmid unnamed3, complete sequence | 35661-35686 | 4 | 0.846 |
CP034966_1 | 1.1|619562|42|CP034966|CRISPRCasFinder | 619562-619603 | 42 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 30214-30255 | 7 | 0.833 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_MG299151 | Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_KY471628 | Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence | 45716-45747 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_MG299131 | Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_KY471629 | Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence | 45716-45747 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_MG299133 | Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_MG299128 | Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_MG299147 | Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NC_018995 | Escherichia coli plasmid pHUSEC41-1, complete sequence | 29015-29046 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_CP053235 | Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence | 78292-78323 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_CP005999 | Escherichia coli B7A plasmid pEB1, complete sequence | 39563-39594 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | KU932021 | Escherichia coli plasmid pEC3I, complete sequence | 51902-51933 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_CP024154 | Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence | 18560-18591 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NC_011754 | Escherichia coli ED1a plasmid pECOED, complete sequence | 49240-49271 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_CP015141 | Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence | 81434-81465 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_LR213460 | Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3 | 28916-28947 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_MH287044 | Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence | 36182-36213 | 7 | 0.781 |
CP034966_4 | 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038145-1038176 | 32 | NZ_MH618673 | Escherichia coli strain 838B plasmid p838B-R, complete sequence | 32230-32261 | 7 | 0.781 |
CP034966_5 | 5.1|1060681|31|CP034966|CRISPRCasFinder | 1060681-1060711 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62712 | 7 | 0.774 |
CP034966_5 | 5.1|1060681|31|CP034966|CRISPRCasFinder | 1060681-1060711 | 31 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222136 | 7 | 0.774 |
CP034966_5 | 5.1|1060681|31|CP034966|CRISPRCasFinder | 1060681-1060711 | 31 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467672-2467702 | 7 | 0.774 |
CP034966_5 | 5.4|1060864|31|CP034966|CRISPRCasFinder | 1060864-1060894 | 31 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17977-18007 | 7 | 0.774 |
CP034966_5 | 5.7|1061047|31|CP034966|CRISPRCasFinder | 1061047-1061077 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530641-530671 | 7 | 0.774 |
CP034966_1 | 1.1|619562|42|CP034966|CRISPRCasFinder | 619562-619603 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24899-24940 | 8 | 0.81 |
CP034966_4 | 4.6|1038084|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1038084-1038115 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1417960-1417991 | 8 | 0.75 |
CP034966_5 | 5.4|1060864|31|CP034966|CRISPRCasFinder | 1060864-1060894 | 31 | NZ_CP017753 | Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence | 97498-97528 | 8 | 0.742 |
CP034966_5 | 5.7|1061047|31|CP034966|CRISPRCasFinder | 1061047-1061077 | 31 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14983 | 8 | 0.742 |
CP034966_5 | 5.7|1061047|31|CP034966|CRISPRCasFinder | 1061047-1061077 | 31 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15013 | 8 | 0.742 |
CP034966_5 | 5.7|1061047|31|CP034966|CRISPRCasFinder | 1061047-1061077 | 31 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3484 | 8 | 0.742 |
CP034966_5 | 5.7|1061047|31|CP034966|CRISPRCasFinder | 1061047-1061077 | 31 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148992-149022 | 8 | 0.742 |
CP034966_5 | 5.10|1060680|33|CP034966|PILER-CR | 1060680-1060712 | 33 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62681-62713 | 8 | 0.758 |
CP034966_5 | 5.16|1061046|33|CP034966|PILER-CR | 1061046-1061078 | 33 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530640-530672 | 8 | 0.758 |
CP034966_5 | 5.19|1060681|32|CP034966|CRT | 1060681-1060712 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62713 | 8 | 0.75 |
CP034966_5 | 5.19|1060681|32|CP034966|CRT | 1060681-1060712 | 32 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222137 | 8 | 0.75 |
CP034966_5 | 5.19|1060681|32|CP034966|CRT | 1060681-1060712 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467671-2467702 | 8 | 0.75 |
CP034966_5 | 5.19|1060681|32|CP034966|CRT | 1060681-1060712 | 32 | NC_008759 | Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence | 12670-12701 | 8 | 0.75 |
CP034966_5 | 5.22|1060864|32|CP034966|CRT | 1060864-1060895 | 32 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17977-18008 | 8 | 0.75 |
CP034966_5 | 5.22|1060864|32|CP034966|CRT | 1060864-1060895 | 32 | NZ_CP017753 | Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence | 97497-97528 | 8 | 0.75 |
CP034966_5 | 5.25|1061047|32|CP034966|CRT | 1061047-1061078 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148991-149022 | 8 | 0.75 |
CP034966_5 | 5.25|1061047|32|CP034966|CRT | 1061047-1061078 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530640-530671 | 8 | 0.75 |
CP034966_5 | 5.26|1061108|32|CP034966|CRT | 1061108-1061139 | 32 | NZ_CP006991 | Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence | 532343-532374 | 8 | 0.75 |
CP034966_1 | 1.1|619562|42|CP034966|CRISPRCasFinder | 619562-619603 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24786-24827 | 9 | 0.786 |
CP034966_5 | 5.1|1060681|31|CP034966|CRISPRCasFinder | 1060681-1060711 | 31 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86182-86212 | 9 | 0.71 |
CP034966_5 | 5.2|1060742|31|CP034966|CRISPRCasFinder | 1060742-1060772 | 31 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244716 | 9 | 0.71 |
CP034966_5 | 5.2|1060742|31|CP034966|CRISPRCasFinder | 1060742-1060772 | 31 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78566 | 9 | 0.71 |
CP034966_5 | 5.4|1060864|31|CP034966|CRISPRCasFinder | 1060864-1060894 | 31 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 405875-405905 | 9 | 0.71 |
CP034966_5 | 5.4|1060864|31|CP034966|CRISPRCasFinder | 1060864-1060894 | 31 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2248363-2248393 | 9 | 0.71 |
CP034966_5 | 5.8|1061108|31|CP034966|CRISPRCasFinder | 1061108-1061138 | 31 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35770 | 9 | 0.71 |
CP034966_5 | 5.10|1060680|33|CP034966|PILER-CR | 1060680-1060712 | 33 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86181-86213 | 9 | 0.727 |
CP034966_5 | 5.13|1060863|33|CP034966|PILER-CR | 1060863-1060895 | 33 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17976-18008 | 9 | 0.727 |
CP034966_5 | 5.22|1060864|32|CP034966|CRT | 1060864-1060895 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 405875-405906 | 9 | 0.719 |
CP034966_5 | 5.25|1061047|32|CP034966|CRT | 1061047-1061078 | 32 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14984 | 9 | 0.719 |
CP034966_5 | 5.25|1061047|32|CP034966|CRT | 1061047-1061078 | 32 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15014 | 9 | 0.719 |
CP034966_5 | 5.25|1061047|32|CP034966|CRT | 1061047-1061078 | 32 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3485 | 9 | 0.719 |
CP034966_5 | 5.26|1061108|32|CP034966|CRT | 1061108-1061139 | 32 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35771 | 9 | 0.719 |
CP034966_4 | 4.1|1037779|32|CP034966|PILER-CR,CRISPRCasFinder,CRT | 1037779-1037810 | 32 | NZ_CP030933 | Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence | 51062-51093 | 10 | 0.688 |
CP034966_5 | 5.11|1060741|33|CP034966|PILER-CR | 1060741-1060773 | 33 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78535-78567 | 10 | 0.697 |
CP034966_5 | 5.16|1061046|33|CP034966|PILER-CR | 1061046-1061078 | 33 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14952-14984 | 10 | 0.697 |
CP034966_5 | 5.16|1061046|33|CP034966|PILER-CR | 1061046-1061078 | 33 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14982-15014 | 10 | 0.697 |
CP034966_5 | 5.17|1061107|33|CP034966|PILER-CR | 1061107-1061139 | 33 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35739-35771 | 10 | 0.697 |
CP034966_5 | 5.19|1060681|32|CP034966|CRT | 1060681-1060712 | 32 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86181-86212 | 10 | 0.688 |
CP034966_5 | 5.20|1060742|32|CP034966|CRT | 1060742-1060773 | 32 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244717 | 10 | 0.688 |
CP034966_5 | 5.20|1060742|32|CP034966|CRT | 1060742-1060773 | 32 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78567 | 10 | 0.688 |
CP034966_5 | 5.22|1060864|32|CP034966|CRT | 1060864-1060895 | 32 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2248362-2248393 | 10 | 0.688 |
1. spacer 8.1|2823350|40|CP034966|CRISPRCasFinder matches to NZ_CP041417 (Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence) position: , mismatch: 0, identity: 1.0
gcgctgcgggtcattcttgaaattacccccgctgtgctgt CRISPR spacer gcgctgcgggtcattcttgaaattacccccgctgtgctgt Protospacer ****************************************
2. spacer 9.1|3117566|25|CP034966|PILER-CR matches to NZ_AP023208 (Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence) position: , mismatch: 0, identity: 1.0
gtgcctttacctgatttgggtaaac CRISPR spacer gtgcctttacctgatttgggtaaac Protospacer *************************
3. spacer 9.2|3117613|26|CP034966|PILER-CR matches to NC_049946 (Escherichia virus Lambda_4A7 genome assembly, chromosome: 1) position: , mismatch: 0, identity: 1.0
atttacctctttcaggtaaactttat CRISPR spacer atttacctctttcaggtaaactttat Protospacer **************************
4. spacer 9.2|3117613|26|CP034966|PILER-CR matches to LR595861 (Escherichia virus Lambda_4C10 genome assembly, chromosome: 1) position: , mismatch: 0, identity: 1.0
atttacctctttcaggtaaactttat CRISPR spacer atttacctctttcaggtaaactttat Protospacer **************************
5. spacer 9.2|3117613|26|CP034966|PILER-CR matches to NZ_AP023208 (Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence) position: , mismatch: 1, identity: 0.962
atttacctctttcaggtaaactttat CRISPR spacer atttacctctttaaggtaaactttat Protospacer ************ *************
6. spacer 7.1|2156824|38|CP034966|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 2, identity: 0.947
cggacgcaggatggtgcgttcaattggactcgaaccaa CRISPR spacer cagacgcagaatggtgcgttcaattggactcgaaccaa Protospacer *.*******.****************************
7. spacer 9.1|3117566|25|CP034966|PILER-CR matches to NC_049946 (Escherichia virus Lambda_4A7 genome assembly, chromosome: 1) position: , mismatch: 2, identity: 0.92
gtgcctttacctgatttgggtaaac CRISPR spacer acgcctttacctgatttgggtaaac Protospacer ..***********************
8. spacer 9.1|3117566|25|CP034966|PILER-CR matches to LR595861 (Escherichia virus Lambda_4C10 genome assembly, chromosome: 1) position: , mismatch: 2, identity: 0.92
gtgcctttacctgatttgggtaaac CRISPR spacer acgcctttacctgatttgggtaaac Protospacer ..***********************
9. spacer 11.1|3648841|48|CP034966|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
10. spacer 11.1|3648841|48|CP034966|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
11. spacer 11.1|3648841|48|CP034966|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
12. spacer 11.1|3648841|48|CP034966|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
13. spacer 9.2|3117613|26|CP034966|PILER-CR matches to NZ_CP015341 (Lactobacillus brevis strain 100D8 plasmid unnamed3, complete sequence) position: , mismatch: 4, identity: 0.846
atttacctctttcaggtaaactttat CRISPR spacer aggtatctcattcaggtaaactttat Protospacer * **.*** ****************
14. spacer 1.1|619562|42|CP034966|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 7, identity: 0.833
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer acaaatgccggatgcggcgtaaacgccttatctggcctacgc Protospacer ***. *.****************.*********.******.
15. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299151 (Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
16. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KY471628 (Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
17. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299131 (Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
18. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KY471629 (Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
19. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299133 (Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
20. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299128 (Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
21. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299147 (Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
22. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NC_018995 (Escherichia coli plasmid pHUSEC41-1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
23. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP053235 (Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
24. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP005999 (Escherichia coli B7A plasmid pEB1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
25. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to KU932021 (Escherichia coli plasmid pEC3I, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
26. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP024154 (Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
27. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NC_011754 (Escherichia coli ED1a plasmid pECOED, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
28. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015141 (Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
29. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LR213460 (Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
30. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MH287044 (Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
31. spacer 4.7|1038145|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MH618673 (Escherichia coli strain 838B plasmid p838B-R, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
32. spacer 5.1|1060681|31|CP034966|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer tccctatcgcaatgccggcagcatccgcaat Protospacer *. *. ****** **** ************
33. spacer 5.1|1060681|31|CP034966|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
34. spacer 5.1|1060681|31|CP034966|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
35. spacer 5.4|1060864|31|CP034966|CRISPRCasFinder matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.774
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer agcgtcaccgacgcgcagggccgctaccaac Protospacer **************** * *******.
36. spacer 5.7|1061047|31|CP034966|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.774
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ccgaacaggtggcgaagcaggtgatgggcca Protospacer ******.* **************.. ***
37. spacer 1.1|619562|42|CP034966|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 8, identity: 0.81
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer attgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer *. * ******************.*******.*******.
38. spacer 4.6|1038084|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
tcaacgcgctcagacgttgcgtgagtgaacca CRISPR spacer acaacgcggtcggacgttgcgtgattaccccg Protospacer ******* **.************ *. **.
39. spacer 5.4|1060864|31|CP034966|CRISPRCasFinder matches to NZ_CP017753 (Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacgtcaccgacgcgcagtcgcgcttcttca Protospacer ***************** ***** *. ..
40. spacer 5.7|1061047|31|CP034966|CRISPRCasFinder matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
41. spacer 5.7|1061047|31|CP034966|CRISPRCasFinder matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
42. spacer 5.7|1061047|31|CP034966|CRISPRCasFinder matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ttgcgcagctggcgcagcaggtggctgccga Protospacer ..* .*.******* ************ **
43. spacer 5.7|1061047|31|CP034966|CRISPRCasFinder matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer gggtacggctggcgaaggaggcggctgcgga Protospacer * ************* ***.***** *
44. spacer 5.10|1060680|33|CP034966|PILER-CR matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.758
gttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer gtccctatcgcaatgccggcagcatccgcaatc Protospacer **. *. ****** **** ************.
45. spacer 5.16|1061046|33|CP034966|PILER-CR matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.758
gccgaacggctggcgaagcaggtggctggcgta CRISPR spacer gccgaacaggtggcgaagcaggtgatgggccag Protospacer *******.* **************.. *** .
46. spacer 5.19|1060681|32|CP034966|CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer tccctatcgcaatgccggcagcatccgcaatc Protospacer *. *. ****** **** ************.
47. spacer 5.19|1060681|32|CP034966|CRT matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
48. spacer 5.19|1060681|32|CP034966|CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
49. spacer 5.19|1060681|32|CP034966|CRT matches to NC_008759 (Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcg-----caattccgggagcatccgcaatt CRISPR spacer -----cgtgaaactcatttccgggagcatccgcattt Protospacer **.* ** ***************** **
50. spacer 5.22|1060864|32|CP034966|CRT matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer agcgtcaccgacgcgcagggccgctaccaact Protospacer **************** * *******.
51. spacer 5.22|1060864|32|CP034966|CRT matches to NZ_CP017753 (Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcaccgacgcgcagtcgcgcttcttcaa Protospacer ***************** ***** *. ..*
52. spacer 5.25|1061047|32|CP034966|CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer gggtacggctggcgaaggaggcggctgcggaa Protospacer * ************* ***.***** * *
53. spacer 5.25|1061047|32|CP034966|CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ccgaacaggtggcgaagcaggtgatgggccag Protospacer ******.* **************.. *** .
54. spacer 5.26|1061108|32|CP034966|CRT matches to NZ_CP006991 (Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence) position: , mismatch: 8, identity: 0.75
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer catcatcctcccgcagatgcgctggccgatcc Protospacer *.*.* .******** ******** *****
55. spacer 1.1|619562|42|CP034966|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 9, identity: 0.786
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer gttgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer .. * ******************.*******.*******.
56. spacer 5.1|1060681|31|CP034966|CRISPRCasFinder matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 9, identity: 0.71
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer gctaccgcgcaattcgaggagcatccgctgg Protospacer . *********** .*********** .
57. spacer 5.2|1060742|31|CP034966|CRISPRCasFinder matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer tgaggcaaaatatagattgatttccgaaaat Protospacer .*.********* ******** ****
58. spacer 5.2|1060742|31|CP034966|CRISPRCasFinder matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer acggaaaaattatatattgattttacttctg Protospacer ***** *** ************* .*.
59. spacer 5.4|1060864|31|CP034966|CRISPRCasFinder matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.71
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacgtcactgacgcgcagtcgcgcttcttca Protospacer ******.********** ***** *. ..
60. spacer 5.4|1060864|31|CP034966|CRISPRCasFinder matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 9, identity: 0.71
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacatcaccgacgcccagtggcgcgacgtcc Protospacer *.********** ********* ** .
61. spacer 5.8|1061108|31|CP034966|CRISPRCasFinder matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.71
gtttaccgccccgcagaggcgctggcagatc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcga Protospacer ******.*** *************
62. spacer 5.10|1060680|33|CP034966|PILER-CR matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 9, identity: 0.727
-gttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer cgcta-ccgcgcaattcgaggagcatccgctggg Protospacer *.*. *********** .*********** .
63. spacer 5.13|1060863|33|CP034966|PILER-CR matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.727
gcccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer cagcgtcaccgacgcgcagggccgctaccaact Protospacer **************** * *******.
64. spacer 5.22|1060864|32|CP034966|CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcactgacgcgcagtcgcgcttcttcaa Protospacer ******.********** ***** *. ..*
65. spacer 5.25|1061047|32|CP034966|CRT matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
66. spacer 5.25|1061047|32|CP034966|CRT matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
67. spacer 5.25|1061047|32|CP034966|CRT matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ttgcgcagctggcgcagcaggtggctgccgag Protospacer ..* .*.******* ************ ** .
68. spacer 5.26|1061108|32|CP034966|CRT matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.719
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcgac Protospacer ******.*** ************* *
69. spacer 4.1|1037779|32|CP034966|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030933 (Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence) position: , mismatch: 10, identity: 0.688
tccacgctgtaacggccatcattaagtttagt CRISPR spacer ccgctgctgtgacgcccatcattaagttactc Protospacer .* .*****.*** ************* .
70. spacer 5.11|1060741|33|CP034966|PILER-CR matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 10, identity: 0.697
gacggacaaaatatatattgatttgcgaattat CRISPR spacer gacggaaaaattatatattgattttacttctgg Protospacer ****** *** ************* .*.
71. spacer 5.16|1061046|33|CP034966|PILER-CR matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 10, identity: 0.697
gccgaacggctggcgaagcaggtggctggcgta CRISPR spacer cagcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
72. spacer 5.16|1061046|33|CP034966|PILER-CR matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 10, identity: 0.697
gccgaacggctggcgaagcaggtggctggcgta CRISPR spacer cagcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
73. spacer 5.17|1061107|33|CP034966|PILER-CR matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 10, identity: 0.697
ggtttaccgccccgcagaggcgctggcagatcc CRISPR spacer ccgagaccgcctcgccgaggcgctggcagcgac Protospacer ******.*** ************* *
74. spacer 5.19|1060681|32|CP034966|CRT matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 10, identity: 0.688
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer gctaccgcgcaattcgaggagcatccgctggg Protospacer . *********** .*********** .
75. spacer 5.20|1060742|32|CP034966|CRT matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer tgaggcaaaatatagattgatttccgaaaata Protospacer .*.********* ******** ****
76. spacer 5.20|1060742|32|CP034966|CRT matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer acggaaaaattatatattgattttacttctgg Protospacer ***** *** ************* .*.
77. spacer 5.22|1060864|32|CP034966|CRT matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 10, identity: 0.688
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacatcaccgacgcccagtggcgcgacgtccc Protospacer *.********** ********* ** .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
502880 : 547183
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP034966|502880:547183|DBSCAN-SWA TATGGCCGAAAATAAACTCTCAGATGTTCAATTGCGTGCGTTAACCCGCAAAGTCATCGAAAAACCTTTTGATGTTGCTGACGGTGGCAGCCTTTCTGTTCGCGTTACACCATCAAGGCGCAAAATCCTTGATGGCGAGAAGAGACCGGCCAATAACATCCAATGGTTGTTCCGTTATAAGAATAAGCATAAAAGCGCCAATACTTTGACGCTAGTGCTGGGCAAGTATCCTGCCTTATCTCTTGCTGAGGCGAGGGTGAAACGCGACCAATGCAAACGGTGGCTTGCTCATAATCTAGATCCTAAAGACCAGTTTGACCAAGAGTCAGCAAAAACCATGAAACCTGTCACTATCAGGGAAGCATTGGAACGCTGGATTAATGAGTACGCGGCGGAAAACCGGGTCAACTTTGAAAGGCATAGACGCCAATTCGAAAGACATATTTATCCTTTCATCGGCGAATTGCCACTTGAACAGTGCTCTAAAGCAAGATGGATGGAGACATTTAGCAGAATCAAAAAAGTCGCGCCGGTAGCAAGCGGCTACATATTAGGTAACTGCAAGCAGGCGTTGATTCACAACCGCAGGCTGCACGATGTTTACAGTGATGCGCTGGAGGATATCAAAGTTACTGATGTTGGAAGAAAGCAGGCAAAACGTGATCGTGTTTTAACTTCTGAAGAACTTAGTGGCCTTTGGAATGCGCTAAAAAGCGAATATGCGTTTCTACCTTACTACACCGCGCTTTTAAAATTGCTAATAGTGTTTGGTGCAAGAACACAAGAAATTCGTCTTTCGACATGGTCGGAATGGAACGTGAAAGAATGGATTTGGACAGTTCCGCGAGAACATAGCAAGGGTGGGGAGAAAATCATCAGACCCGTGCCTGAATCTATACAGCCATTTATTTTACAGCTACGCCAAAATCATCTTGATAGCGGAATGCTGCTTGGTGAAATGAAGAAACCCGAAGCTGTCAGCCAGTGGGGGCGTATCTTATACAGACGATTAGAGCATTCAACGCCGTGGACGTTGCATGACCTGCGCCGCACTTTTGCCACCAGTCTAAATAACTTGGGGATAGCCCCGCATGTGGTTGAACAGTTGCTGGGCCACTCAATGCCAGGAGTGATGGCGGTGTACAACCGTAGCCAGTACATCCCGGAAAAACGGTTAGCCTTGGACAAATGGATAAAATATTTAGAAAAAATAAACCTTTTAAAATCAGATAATTAAAAATTGATGATTATATAGAGTTATGTAGCGTTATTTGCAGTTAAGTTTATGAATTTAAAGGGTATTTACTGCTGATACAATTTGAGTATTATTGTCATGTAGTAATAAAAAGCCCCCAGCAGAACTGAGGGCTTGAATAAGGCGGGGATCGGTTCACGTCGCCAAACAGAAAACCGAACCCCTTAATACCAAAAAGCCGGTGAAGGCTGTAATGGTCGCGTTCTTTGTTAACGCAACATAATCATAAATCATTACAGACCACATATCCACCGGCCATGTTTTGAGGGCCGTAAGATGGAATTACTGACTGCTAAGCAAGTCACAACACTAACAACATTATCCCGCATGACTATCTGGCGCTATATCCAGGCTGGCACGTTTCCTAATTGCATAAAATTAGGGCCAAAACGCGTGGCCTGGCGGCGCAAAGATGTTATGGCGTGGCTTGAAAGCCGCCAGCCAGTACAGCGCTGAGGTGGCTTATGAAGAAAACAACCGTAAATCAGGCTATCAGTCGCTATAACAGCCTGTTGCGTAACCCACAGCGCCAGTTAACCGTTGGCGAACTTACCGCCCAGCGTATGGCGGCAGCACAATTATTGTTGCAAGCATGCATTCGCGAAGGTGTAAACCGCCCGTGGACTATTGTTTCACGTCACGCCGCTATGGCTGATAGCCTGGTGCCGTTTCGAATCAGCGATTCTGAATCATGGGCGATGTATCTGGAGTTAAAACGCGGGGTGCGATATGAGAAACGCGCCTAACGTTAAGCTATTACCTAAAGACCCTTTTACGGAAGCAATTATTTTTGCCGGTTCTGACGCTTGGAGCCATGCCAAGGGCTGGGAAGAAGGAATGGGTAAACAAGTTGCCGGTGACACTACTCCTCCTGTCTATCTTGGGCCAAGACAACTGGATGATTTAGATAATCTACGAATTATCGACGATGGTCGCCGGTCTGCGCGGGTATATCTGGCTGGTGATATTAAGCAACTTCAAATCAGCACTATTGCTGACAAGCTGGCAGTGGCTGGAGTTAAGGAAGCCCGTTTATTTAAAGGCATTACTGACCGTGAACCGGAAAAATGGAACATGGACAGGCTTAGAGACGCCGCCCTACGCGGTGAAAGCCTGGTGGGCATGCTGCGCAAAGAAGAACGAGCAACCGCCGCAGCAACTGTAAAAGACATTCACGATATGGCTCCTGTATTAATGACAACCTGGCCAAATATTGGAAGCAATGGCCAGCCTTTAAACACACGCCCTAATGTTGAACGTCTATTAGATAACTACAGCATTTCTGCAAAATATAATGAGATTAGCAAGGACGTTGAGGTTTTAGTACCTGGCGTTAGTGGCGGTTCAGATGCTCGCGATAACTGCGCTGTGAGTGAAATTTTAAGCCTCGCCGCACTGAACCGGTTACCAATGAGTAATATTGAAGGTCATATCAAAACTATCGCTGTGCGCAATACTTATAATCCTGTCAGGGATTTCATCAATCAACGTGAATGGGATGGACGAAGCCGATTTGCAGACCTGCTGAATACTATCAGCACGCCAGATGACTACAGTCGCGATCTGCTGGCCATGCTGGTGCGTCGTTGGTTGCTGTCTGCCGTGGCGGCTGCTTACTTAGAAGCGGGTTTCTGGTCCAAAGGCGTGCTGGTTTTCCAGGGCGAGCAGTCACTGGGTAAAACAGCATGGTTTAAAGCGCTTCTGCCCCCTGACAATCGCAACCTGGTTAAGGTGGGAGCAACTATTGACCCCGGCAATAAAGACAGCGTGGCCAGCGCAATTAGTCACTGGCTTGTTGAGCTAGGCGAATTGGATGCGACGTTTCGCAAGGCGGATATTGCCAAGCTCAAAGCATTTATCAGCCAGGACCGTGATGACCTGCGCCGCCCTTATGACCGGCTTGAATCGAAGTACCAGCGCCGCACCGTGTTTTTTGCGTCAGTGAACCCGAAACACTTCCTGGCCGATGACACCGGGAACGTTCGCTGGTGGACCATTCCTGTAACTGCCGTTAACTACGAGCATGGCATAGACACACAACAATTATGGGCCGAAGTTCTTTCCTGGTTTGAAGCAGGTGAACGCTGGTGGCTGGACCGTGATGAAGAAGCCATGCTGGAAGTCGTGAACGAACAGCACGGGCAAACTGACCCGATTGAAGAAATGATATTGGCGCGTTTTGATTGGAACAGTGATCGCCTGGCTGCATATATAGAAATGACTGCAACGGATGTTCTGTTGTCTATTGGGCAGGATCGGCCAACAAAATCCCAGGCAACGCAGTGCGGTAACATTCTGCGAAAACTGACAGGAAGGGATGCACGTAGAACAAGTAAAGGGCGTTTTTACCTAATGCCACCGAAAGTTTTTGGACAGCAAGACCATCGCTATTTTGATGGGCCTAACCACCAGTTTTGAGGTAATACTATGAGTTATCGCGCATATATCATTGCTTTTGACCCGGAACACGGCACCTACACCGAAACCGAAATTATGGAGGGTTTCGCCACTGAACAGGAAGCGGTAGACCATGCTCGCAATAGGCTGCCAGAGGTGCAGCAGGAACTGGCCAAGTTGGGTGAAAACCTGCTATGCAGCTACCGCATAAGGGTAGTGGATTCCGCTGAAATCTTGCCGTTCATCCGATCATAACATCACAACAGTTGTTGCCCATTACCCTGGCGAATGCCGGGGCTTTTTTTATTTTCAAAGTGACGCCTAGGTGTCATTAATTTATAAGGTGTCATGGTAGGTGTCATTGTTCTCAAGCCATAACCCGCAAGGGTTGCGACATTACTAATGACACCTGTGACACCTATGACAGCATTTTTAAAGAAAATAACGACCACTGTAATAATGTAACAGTGGTCGTTATACTTTATAGAAATGGGTGTCATGGTGTCACAGGTGTCATTCATTACAGTATCTTGTTTGTATTTTTATTTAATAATCAGCGTGTTTCCCGCTATGACACCTAATGACACCTTTCCAACATAACGCCGTCATGGTGACATTAGGTGTCATCGCCATCTGCTGCCAAAAAAGTCAAGGTACTTCCGGAGGGGGTACAGCCTGCGGGGGCATTGGTCCCGCGTAATTTGCGTTGTGCGTGCCCATTCAAAACAGGCTTTATAATTATCCTTTGACAAATCAATAACTTACTCAAGTATTATCAACTATGTACAACAAACCGCATCAAAACTGGAAGCAAAGCGAACACAAAAAACAACTGTTTTAGAAAATTTAATGCATTAAAAACAGATGGTTATTTTATATAAACAAACCGAACACCATAGCCACAATGTGATGACAAACCCCAAGAAGAAACGACCACAGCCATAAGTAACAATCGCTCCATATTCAAGACAAAACCGGAGCGACACAAATGACAGTTTTATCTCAACTTATCAACGACGCGTTGAACCACCAGGGCGGCCACATTCGCACTGATGGCAACGGTGACATTACTCAATTCATCCCTGTGGAACCTGCAGCAGTTCGCTACCTGTCCATGAATGACGTGCTGACCCCGAAAGGCGGCAAGCAGACCACCTTCAACGCTGTTCTGGCTGAACGTTCACTGGTGGCACAGGCAGGCGCTGAAATCATCCGCATCCCTGGCCCTATGACCACGCCTCCGACTGGCAACACCGGCGCTACTGTTGGCCGTGAAGTTGCTGACCGCTTCGTTGTCGTTCGCCCTGGTGCGTTCGCCAAAGTGTCTGACGGCGAAGAAATCACCATGAGCGGCCTGCCTTACCTGGTATCAGCGTTCAACCACAACAACGCACCAGCTTACGGCGTTGGCTACACCCTGAGCCGCAAGCAACTGAAACACGACTTCGCAGACGATACCACGCTGGTAGTGGTCAACACCGCCATTGAGCGCGGCATAGCTGACCTTGCTGACTTCGTACTGCTGAACCACCTGGAAAGCGCAGCGGAAACCCTTACCAGCCCGTCATTCACAGCAGTGGCACAGAAGATTGCAGCCAAAAACCTGCGCTTTGACGAAGTGAAGGCCATCGTTGGCGGCGAATGCACCGGCCTGGAACTGCAAGACGGCATTTTGCGCGCCTATGGCGTGCGCGCTGAAATCAGCGGCCAGACTTCCAGCACAATCATTGGCGCTTTCGGCAATGCAGCCGTTGCCCTAGATGATGAAATCCGCGTCACCGCCCGCCGCGTGCTGAATGGTGCCGTTGAAATCGTTGTGTGGGTGAACGCATCCGCACTGGTGCCTGACTCAACCGTATTCTGGCAGGCTTAATTATGTGGTGGGGCAGCAAGAAAGTGGCAGCGGAGCGTGCTGCCCTGCCTGGTGAAATCCTGGCAGCCATTGAAGGCGTGGCGATGTATACCCGCCACGGTGACGAAAACAACCCGCGCATCGTAGTGCAGCCGGTTGGTTGGTCCGGGTTCCTCTACAGCGACGAAGCCGCAGAAAAGTGGATTCGCAGAGCTTACCCGGAACTGACGCAATACCAGATTGAACGTGCTGTGAATTACCTGGCCTCACTGGTGCGCAGCCATCACCGTGACAGCCGATCGGAAGCACAGCGTGAGCGTTGGATGACAAGGTTTTAAGCCATGAAAAGAGAAACACGCCAGCACGTAACAGTACGCATAGCCCCTGACTTGGTGCGCCTGGTTGAAAAAGAGCAAAACAAACTGGAACGCAGAACGGGCATCCGCCCTTCACGCAGCCAGGTGATTGAAAAGTTTTTGAGTATCGGGCTGAACGGCAAATAAATTGCATAAGGGTTGCCGGTTACCCTTACCCCAATCAACCGGCACTAATTAAGTAATTCATCGTGTCCTCAACGATGGATGCTGGAACCAGGTTGGTGGTCATCCTGGAGGTGAAAAACCACAATAACCCTCACATCATTCTTAGCGGGCAAGACTATGAAAAATCACGACCTGTACGAATCCATGAAAGAAGAAGCGCGCTTAATCCTTAGCGAAACCAAACGCTTCCAAATCATTTCTGAGAAGAACCCTATCAGCTACGAAGCAGCCGACCTGCTGGCAAGCCTTCGCAACCGTTTAAACGCATTCGAACACATGACCGATGAAGCGGGCCGCATCATGCGTGACGAAGCGGAAAACAACGCATCCACCCTGGAGCGCAAGAAATGGAAGCCATCCAGCCGCCTCAGCGTGGGTGAGCGCGCCAAGCGCCAGGCGGTGAAGCGCGGGGCTATCCTTGTGCGGAGTGCATAAGAATGGCTACTGACAACAATATTTATGATGCTGTTCTGAACATCATCGGCGTCGCTGACAAGATGACGAACAACAACGCAAGCCAACTGACCGACGCGCAGAAGGTAGCACTAAAAAACCTTAGTAACGCGATGAAATCCTGGAAAGGTATCACGTTTGGAATGCGTGATATCACAGCCAGTGACAAACCAATTGAAAGTAAAACTGCATAAGGGGCAACCATGCTGGACCGTATCCATACAAAATCGGAAGGAATGAACCCCTGGCCTGAACTGGTGGATGCGGTTCTGCCAAAAGTTCTGGCGTGCAACACAGCGGAGGAAATGCAGGAACAGCGAGACTACCGCACCGAATGCGCCCTGCAACTGGCTGCTGCCGCCTTGCGGGCTGCTGGTGAACACACCCTGGCGGAAAATTTACTTAATTAATTCTTTGAGAGCAAAATTATTATGGCCAATAAGAAGCTGAATGCCACAATAACCATTGGCGGAGCCGTTTCCGGCAGCCTCCGTGGTGCGTTTGGTACGGTGGAGAAATCCACCATAAAGATTGGCGCAGCGATTAAGCACATGTCGCGCGAACAGCGCCAGCTTAACGACGCTATGAAGCGTTACGGGCAGGATGGCACAATGGTCGGGCGCATGAAGGAGCGTTACCAGGCCATTGTTGGCCAAGTGGAACGCCTTCGCGCTGCGCAAGAGCGTTTAAACAGAGTGCAACGCGCCAGTGCGGAAAACCTGGCGAAGCAGGCACAGCTACGCGGGCAGATTATCGATACCGTTATGGCTGGCGCTGCCGTTGCCGCACCCATCAAGATTGCTGCCGATCGTGAACAGCACGCCATTGGCATTGCCAAACAACTGGACGGCGCACGCGATGCAGCTGGCAACCTGACGGCGAAGTTTTGGGAAATGCGCAAGGCCGTTGCTGACCTTGGGCACGAAATTCCACTGGCCACAAACGACCTGTTTGACATGGCCACAGCAGGGCTGCGCATGGGCGTAGCTGGCGACCAAATCCTGGGGTTCACCCGGAACGTGGCCAAACTGGCCAGCGCCTTGGAACTGAACCCGGAAGAAGTCGCAGACAATATGGGCAAAATCCAGAATGTTTATCGACTTACTCAAGCTGAGCTTGTGCGCCTTGGTGATGCTATTAATTACCTAGACGACCAAAACACCGTTAAAGGCGGTGAGCTTATCGACTTCCTACAGCGTGTTGGTGGTTCCGCCTCCCTCGCCAAACTCACTGCCAACGATATGGCAGCAATCGGCACCACGCTGATTTCTATGGGTGAAAGCGCAGACACTGCTGCAACGTCAGTGAAGGCGCTTGTTTCCAAGATGACGCTGGGCAGCAAATCCACGAAATCTGCGCGCGAAGCGTTCAAGGAATTGGGTTATTCCGCGCAGGAAGTGGCGAAATCCATGCAACTGGATAGCGTCAAGACCATCGAAAACTTCATGAAGACGGTGAACAAGCTGCCAGATTACAAGAAATCCGGCATCCTGGTGTCCATCTTCGGGCAGGAATACGTTGGTTCTATCTCCAAACTGGCGGCCAACATGGACAAGTTTGGCGAGGCGTTGCACCAGGCTAATGGCGAAATGTCCAAAGGCAGTGTGCAGAAGGAATTCCAGAACACCGTCAACACCACCAACGCGCAGATGGTTATCCTTCGTAACCGCACAGCTGACGTGGCTGACAATATCGGCACCGTCCTGCTACCGACCGTCAATGACGCAGCCAAAGGCATTGGCCGCATCACAACTGTTGTGGCGAAATTCGCAGCGGAACACCCCGGCCTGACCAAAGCCGTTGTGGGCACAGCGATGGCGCTAACTTCCCTGCGAGTGGCCACCCTGGCGGCGCAGTTCGGCTTCACGTTCCTGAAAGGTGGCGTGTTGCAGATGTCCAGCCTGTTTGCCCGCATGAACGCATCTGCCGTTCTGGCCAGCACTCGTGGCCTGCCTGCGGTGGCTACCGGCATTCGTGCCGTTGGTGCGGCGTTCATATCCACTGGCGTGGGTGCTGCCGTAGCTGGCCTGGCGCTGGCAGGTTATCAGATTTACCGCCATTGGGCTGGTGTGAAGGCTTTCTTTGGCGGAGTGGGTGAAGGCATCAAATCCGGGCTGGAACCGCTTAGCGACGCTCTGAGCCGCCTTTATGAACGCATGGGGCCGCTGAAACCTGTCATTGATGGTGTGGGTGCTGCGGTTAAGACGGTTTTCAACTGGTTCACGTCATTGACCGAGCCGGTCAAGTACAGTGGCGAGGAACTGGATAAGGCCGGTAAAATGGGGATGACGTTCGGTAAGACCCTGGCGGCTGGCATCGAGCTTGTGACCGCACCGATCACGTTCCTGATTGATAAGATTCTGTGGGTGTCTGACAACATCGGCAACCTGACCAACAAGGCACTGGAGTTTAAAAACGCCGTGTCTGATGTTGCTGGTGGTGCGTGGCAGAAAACCAAAGATTTCTTTACCGCACCATTCCGCGACGATGTACAGCAGCCAGCAGCAACCACTGCCTCTGGTGCTGCCCTGCCGTCCCCTGCACTGGCCAACCGCTCTGGTGGCACTACAGTGCATAGCAACGACCAGTACCACATCAGCGTGAAGGCAGAACCTGGTATGAATGAAGATGCATTAGCCCGGAAGGTGATACAGCAACTACGACAGCAACAAGCAGTAAGGCAGCGAAGCATGATGATAGATGGCGCGCAAACACCATAAGGGTTGTCAGTACGGTTGCAACAAAAAGTAAAGGGGCTGAAATAGCCCCTTTAATTTCATGCCGTTACAGCTTGACTTTAGTTCATGCCGTATTTTTTCAATTTTTTACGCAGCGTACCACGGTTGATGCCCATCATCAGCGCAGCACGGGTCTGGTTACCACGGGTGTATTGCATCACCATGTCCAACAGGGGCTGTTCTACTTCAGCCAGTACCAGCTCATAGAGGTCATTCACATCCTGACCATTCAGTTGAGCAAAATAGTTCTTCAGTGCCTGTTTAACCGAGTCACGCAGGGGTTTTTGGGTTACCTGATCCTGAGAGTTAACGGTAGAAACGGTCAGTACGTCAGAATTTACGCGTTGTTCGAACATAGTTCTGTCAGCTCTTTATTTCTGTTTACGCAAAATTTTCGAAGTATGCCTCCAACGCCTCCAGCTGTTCGCTGGCATCCTCAATGGCGTTGAATGTGCGCCGAAACTGGTCATTTGGAGCGTGTTCCTGGAGATACCAGGAAACGTGTTTACGTGCAATTCGGTACCCTTTTGCCGGACCATAAAAGTCATGCAGTTCCCGAACGTGCGCGCAAAGCAAGCGCTTAACCTCTGCCAAAGGCAGCGGGGGCAGCAACTCCCCAGTGTCCAGATAATGCTGGATTTCCCGAAAGATCCAGGGTCTTCCCTGAGCTGCGCGGCCTATCATCAGGGCATCCGCCCCTGTATAGTCGAGCACAGCTCTGGCTTTAAGCGGGTCAGTAATGTCGCCATTCGCGATAACCGGAATGGAAACTTTCTGCTTAACTGCCCGAATACTGTCGTACTCAGCTTCTCCATTGAACAAACAGGCGCGTGTACGGCCATGAATGGTCAGAGCCTGAATGCCACAGTCTTCAGCCAGTTGGGCAATCTCTTCGCAGTTACGGTGTTCCGGTGCCCAGCCGGTGCGAATCTTCAGGGTAACAGGAACGTCCACTGCATTGACGACCTCGGTAAGGATCGATTTAACGACATCCGGGTACTGCAAAAGGGCTGAACCTGCGAGCTTGCGATTCACTTTTTTAGCCGGGCAACCCATATTGATATCAATAATCTGGGCACCGCTTTCCACGTTAATACGTGCTGCATCTGCCATTTCTTTCGGATCGCTACCAGCAATTTGCACGGTGCGAATACCGGGTTCATCAATGTGCACCATCCGTAAACGAGATTTGTCGCTTTCCCAAACCTGTGGGTTAGAAGACATCATCTCGGATACTGTCAATCCGGCTCCCATCTCGTAGCACAACGTCCGAAAAGGTCTGTCTGTAATGCCAGCCATGGGCGCTGCGATCAGGCGATTTCTGAGCTGATATTGTCCGATGCGCATGAGTTAAGAAATGACCATACTGTGACTGCAAGGCGGCGTATATTACGCATTTTTTGCACGAGATGAAAGGCCAAACTTTGACCAATCCTCTGAGATGGATCAAAGAATTGCATTTAAAATGAGCGTGGTGCGATAATTACTCATAAAAATCATCATATTAGAAAATAGTGACTAAAATTTACACTCAAAGAAATTTGAGTAAGTTCTCAATTTTTCTTTATGAATGAAAATTTTGGCACGCAAATTTGCGTAAATAATCGGCAAATTTACGTGCCTTTTGTGAGCTTGCTCGCACTTCGCCCCGCGTCACCCTACGGCGATGCGAAGGTTAATTCTTACGACCGGTAATGCGGCACCACTCTTCTTTTTCCACGACCGGGTCCAGTGCGAAGCTGTCGACATAAGCTTCACAAACGCTCTCTGCCTGGCTTGCCAGAATACCGGAAAGGCCCAGCAAACCGCCTGAAACCGGCAGGACGCTGATTAACGGTGCCAGTTCACGTAATGGGCCTGCAAGGATGTTAGCGACCACCACGTCGGCTTTCATTTCTTCTGGCTGATCTTTCGGTAAGTAGAGTTCCAGACGGTCAGAAACGCCATTACGTTCGGCGTTGTCGCGGCTGGCCTGAATCGCCTGCGGATCGATATCAATACCAATGGCTTTTGCTGCACCCAGTTTCAGCGCCGCGATCGCCAGAATGCCGGAACCACAGCCAAAGTCGATGACTGTTTTACCGGTTAAATCGAGGCTGTCGAGCCATTGCAGGCACAGAGAGGTGGTTGGATGGGTACCCGTACCAAACGCCAGCCCTGGATCTAACATCACGTTGACGGCGTTTTCGTCCGGCACATCACGCCAGCTAGGGCAGATCCACAGTCGTTCACCAAAGCGCATCGGGTGGAAATTATCCATCCATTCGCGCTCCCAGTCTTTATCTTCTAGTTGTTCGATTTTATGCGCGAAGCCTGCGCCGAGCAGCGGATGGTTTTCCAGAATCGCCACCACGTCGTTCATATCGGTTTCAGCGTCGAACAGACCAATCACATCGGTGTCGCCCCACAGGCGCGTTTCGCCCGGCAGCGGCTCAAATACTGGCGTATCGTGGGTATCCTGAAAAGTGATAGAAACGGCACCCGCTTCCATCAGCGCATCGCTAAGATCTTCCGCGTTCGCGCCGGTGGTGTTCAGTTTCAGTTGGATCCAAGGCATGGCAAAACTCTTTATTTATCAGTAGTCAAAACGGTAGCTTGCGGGACGGATGTACCGAAACGGTTTCCGACCAGGAAAGCCACCAAACTTAGTAGTAACGAGGGCACGATAGGGTGGAAGCCCAGGTACTGAATATTCAGCGTCGCGAGTACGGCATACAGCACGCCGCCAACGATCATCGCACTTAGCGCGCCTTTGGCGTTGGCGCGTTCCCAGTAAAGACCCAGCACCAGCGGCCACAGGAAAACGGCTTCCAGCCCACCGAAGGCCAGCAAATTCAGCCAGATGATCATTTCTGGCGGCTTCCAGGCGGCAAGCAGCAGCAACGCGCCGAGAACTAACGTAATTACCGCCGACATCCGCTTCAGACGCGTCTCGTTTTGCATTTGATCCGGACGGATATTCAGATAGAGATCTTTAATGATCGTAGCGGAACTTTGTAGCAATTGGGCGTTAATTGTCGACATGATCGCAGCCATAGGTGCAGCCAGAAAGATCCCGGCAGCAAACGGTGGCAGCACTTTTACCATTAACGTCGGGATCACCAGATCCGGTACGGTGAGATCGGGGATCACTGCACGACCTAACGCTCCGGCCAGGTGCATACCGAACATCAGGATTGCGACTACAATCGTACCGATGATGATCCCCCGGTGTACTGCTTTGCTGTCTTTATAAGAGATACAGCGCACCGCAGTATGCGGCAGGCCAATCACGCCAAAGCACACCAGTACCCAGAACGACGTCATAAAGGCAGGCGACAGAATATCGTCAGCGCCTTGTGGCGTAACCAGTTGCGGATCGATGGTTTGCAAGGTCTGTACTGCGTTGCTTAAGCCGCCAGCGGCATGTACCACGCCAATAAGCAGCACAACGGTGCCAATCAGCATCACAAGCCCTTGCATGGTGTCGTTCAGCACGCTGGCGCGAAAGCCGCCAAAGGCGGTATACAACGCAATGCTGATACCGAAAATCAGCAGCCCGGTTTCATAAGGAATACCCGCCGCGGTTTCCAGCAGGCGCGCACCGCCGATAAACTGCACGGTCATTGCCCCAACGAACGCAACCAGCAAACTCAAACTCGCCAGCCACACCAGAAGACGACTCTGGTAGCGGGCAAACAGCATATCGTTCAGCGTCACTGCATTGTAGCGGCGCGCAAGAATCGCGAACTTCTTGCCGAGAATACCGAGCGAAAGCCAGACTGCAGGAAGCTGAATCATCGCCAGCAATACCCAACCCAGGCCGTATTTATAAGCAGCTCCAGGCCCGCCGATAAACGAACTGGCACTGATATAGGTCGCGGTGAGCGTCATCGCCAGCACAATACCGCCCATAGAGCGGCTGCCGAGGAAATACTCATTAAGGAAGGTGCCGGTGCTCCGTTTACGCATCGCATAAACCGAGATACCGAACACCACCACCAGATAGGCGACCAGCGGTAGAATTACTTCAAGCTGCATCGTCATCCTCCAGTGGGATATCGCGATAGATAAATTTCACCATCGCCCAGCACAGTCCAATAAACAGTAGCGGCGTCAGGATGCAGGCTATCTCAAACCAGCGCGGAAAGCCGGTAAAACCGGGGGCAACGCCAGGTAAGTAAGCGGCTACTAACCAAACTGCCAGATACAAAAGGGTCAGCCCCAGCGCCCAGCGCGCCTCTTTATGGGCCTGAACAAAACGAGTGTCCATTTTTTGTCCCTGATGGGTGAAGAAAGCGAGGATTGTACCTTATGGGGGTTGTCGATCCCCAGTAATAAAAAAGGCCGGAAAATCCGGCCTTTTGACGCTTTTGCAGTCTTATTTTTCCTGAAGACCGAGTTTTTTCTCCAGATAGTGGATGTTAGTGCCACCATGCTGGAAGTTCTCGTCATTCATGATGCGGATCTGCAGATCAACATTGGTTTTGATACCGTCGATGATCAGCTCCTGCAGCGCATTCTTCATGCGGGCAATCGCCACGTCACGGTTTTCACCGTAGCAAATCAGCTTACCGATCATTGAGTCATAGTACGGCGGTACGGTGTAGCCCGCGTAGATATGAGACTCCCAACGTACGCCAAAACCGCCAGGTGCGTGGAAACGGGTGATTTTGCCTGGGCTTGGCAGGAAGGTGTTCGGATCTTCGGCGTTGATACGACATTCCACCGCATGGCCGCGAACGTGAACTTCTTCTTGCTTGATCGACAGCGGTTGACCGGCAGCGATACGCAGCTGTTCTTTGATCAGGTCAACGCCGGTGATCATTTCGGTAACCGGGTGTTCTACCTGAATACGGGTGTTCATTTCGATGAAATAGAACTCGCCGTTTTCGAACAGGAACTCGAAAGTACCTGCACCGCGATAGCCGATATCAACACACGCTTTAGCGCAACGTTCGCCGATGTAGCGACGCAGTTCCGGGGTAATGCCCGGTGCTGGCGCTTCTTCGACCACTTTCTGGTGGCGGCGCTGCATGGAGCAGTCACGTTCCGCCAGATAGATAGCGTTGCCCTGACCGTCAGCCAGTACCTGAATCTCGACGTGGCGAGGATTTTCCAAGTATTTCTCCATGTAAACCATATCGTTGCTGAAAGCAGCTTTCGCTTCCGCACGGGTCATGGAGATGGATTGTGCCAGTTCAGCGTCGCCGCGCACTACGCGCATACCGCGACCGCCGCCGCCGCCGGAGGCTTTGATAATCACCGGATAACCAATGCGTTTAGCAATGGCGCGGTTTTTATCCATATCGTCGCCCAGCGGGCCGTCAGAACCAGGTACGCAAGGAACGCCCGCTTTTTTCATCGCCGCGATTGCGGATACTTTGTCGCCCATCAGGCGAATGGTTTCTGCTTTCGGGCCGATGAAGATAAAGCCGGAGCGTTCAACCTGCTCGGCAAAGTTGGCGTTCTCGGAGAGGAAGCCGTAACCCGGATGGATTGCTACTGCGCCGGTGATTTCAGCGGCGCTGATGATTGCCGGGATGTTCAGATAACTTTTTACTGACGGAGCAGGGCCAATACAGACCGTTTCATCTGCCAGTAATACGTGTTTTAGATCGCGATCCGCGCTGGAGTGCACAGCGACAGTCTTGATGCCCAGTTCTTTACAGGCACGAAGAATACGCAATGCAATCTCGCCGCGGTTGGCAATAACAATTTTATCCAGCATGTTCGCCTCGTTACTCGATGACGACCAGCGGCTCGTCAAATTCTACCGGTTGTCCACTTTCGACCAGAATTGCTTTCACGGTACCGGATTTGTCCGCTTCGATCTGGTTCATCATTTTCATGGCTTCAACGATGCACAGGGTATCGCCCACGTTGACTTTCTGACCCACTTCGATGAACGCTTTTGCGTCCGGGCTTGGGGTGCGGTAGAAAGTACCAACCATCGGGGAACGTACGATGTGACCACTGATTTCCGCTGCTGCTGGCGCTTCCATGGAAGGAACGGTCGCCGGAGCGGCTGCGTTAGATTGAGCTGGCTGCTGCATCATTGGTGCAGCGTAAGCTTGTTGCATCACAGGGAAACTTGCGGCAGGAGCTGCACGGCTAATGCGTACTGACTCTTCGCCTTCAGAAATTTCCAGTTCGGAGATGCCTGATTCTTCAACCAGCTCGATCAGTTTTTTAATCTTACGAATATCCATGAGTGGGTTCCGTACTCTTTGTTTAGTGTGATTGTGACAGGCGTTTCACCGCCGTCTGTAAAGCGTATGCCCCGACAGCTGTATGCATAGCGATAAATTCCAGCAGGCCGGGACGCCGGTCTATTTTGCCCCAGAGTGTCAATGTTAGACTTGACGGACATTGTGCAGCGATCTGCTGCTATCCTCCGGCAAAAAACAAAATATACCTTCGTTGCGCCAATGTCACCTTTTCCGCAGCGCAAAAACTGCGTCAGGGAGGACGAGGTCGCACATTATAACGATTTCGTAGCAATTGGCAGCTAAATACTGGTCTTATCAGGGAAGATAATCAACAGCTAACATGTAAATAACCTTCAACACCGTGTAATTTGCAACAAGCCGCACAATTCACGAAATTAGCGCCATCATCGACGGAACTTCTTATAACGTAAGGTGAAAAACTGTAAAGCCAGCCCTGTGAAGATGAGGGGCTGCAATGAGATAATCTTCACAGACCACAAATAATGTAGGGATGCCAGGATCGCGACAAGATAGACGACATTACGCAGCGGTTGCTGAGGATTACCCAATTTTCACTGCATCGTCTGGATAGACGTGAATGCTAAAGCACGCAAAATCAACCGACTGACAATGCTCAGCATTAAATAGGTTTGAGTGGTTAATTTGTAGTCAAATAAAATCAGATTATTCACCCCATGGTACCAGAAATAAGTAAGGGGCTAAGCGTAGCGTCGCCCAGGCAAAGCACCATAAACCTAACAGGCGACGGAGGCGTATCAATAATAGCTGTTTAGTGGAGTATCAATGCGCATCAATTTGTGGTCAGTCGAAATCATGACTTTACAGTTTGGATGATGTTGTATAACTTTCTGACCAAGGTGTTCGACGAAAGCTGTCCGGCTAACCGCTTTTCCCGTTAACCGATTGAATAATGATTTCAAAATGAGTTCCCTCTCTTATTATTCCCTGCTAAATGGTTAGTTAACCTTCACCAGCGTGCGACCCTGGATCTGGTTATTAATGATGGCCTCGGCAAAGTTCGGTGCCTCTGACAGAGATATCTCTTTTGCCGCCTGGGTATAGAATGATTCCGGTAAATCGGCGACCAGTCGCTGCCAGGCTTGTGCGCGGCGTTCTGGTGGCGTCATTACTGAATCCACCCCTTGCAAACGGACATTACGCAGAATAAATGGCATGACCGTGGTTGGCAGAGTAAAACCACCCGCCAGACCACAGGCAGCCACGCAGCCGCCGTAATTCATTTGCGCCAGCACTTTTGCCAGCACTTTGTCGCCAACAGTGTCAATTGCCCCAGCCCAGACTTGTTTTTCCAGAGGACGGGATTCGGCAAACTCATCACGAGGGAGAATACGACTGGCACCTAAACTTTTCAGATATTCATGGGTACTTTCGCGACCGGAAACGGCAACGACCTGATAACCCAACTTATGCAGCAGCGCCACGGCGGTACTGCCGACGCCACCACTGGCACCCGTCACGACAATCTCCCCGTCCTGCGGGCGAACACCGGCATCTTCCAGCGCCATCACACACAGCATGGCGGTAAAACCGGCAGTACCGATAATCATTGCTTTACGCGCGTCCAGCCCTTGCGGCATGGCAACCAGCCAGTCACCTTTCACTCGCGCCTGCTCCGCCAGCCCACCCCAGTGGTTTTCACCAACGCCCCAGCCAGTGAGTAACATCTCCTGACCGGCATGAAAACGCGGATCTTCGCTGGTGCGTACAGTTCCGGCAAAATCGATCCCAGGAATCATCGGAAAATTACGGATGATTTTTCCCTTACCGGTAATCGCCAGCGCATCTTTATAGTTCAGGCTCGACCAGTGAACATCAACCGTGACATCGCCCTCCGGCAGGCGACTTTCGTCAAGTGTCTGTACCGATGCGAGAGTTTTGCCGTCCTGCTGTTCTAAAAGTAACGCCTGCATAAGTGGTCCTCATGTGCATGATGGATTGGAAAATAATTTCTGAAGACTATACTCGCTAATGGAAAAGCAAAGCCGATGAAGCGCAATAAATTGCGAGATAAATCTGATTTGCTAGTATGCCCGCTTCCTCACTATCGGAGTTAACACAAGGATGAGATTAACGACGAAATTTTCGGCCTTTGTTACGCTGCTCACCGGGTTAACAATTTTTGTGACTTTGCTGGGCTGTTCGCTAAGTTTCTACAACGCAATTCAGTATAAGTTTAGTCATCGCGTTCAGGCGGTGGCGACGGCGATTGATACCCACCTTGTGTCGAATGACTTCAGCACATTAAGGCCACAAATTACCGAATTAATGATGTCGGCAGATATCGTTCGTGTAGACCTGCTCCATGGTGATAAACAGGTTTATACCCTGGCCAGAAATGGTAGTTATCGTCCGGTTGGCTCCAGCGATCTGTTTCGCGAACTGAGTGTTCCGTTGATAAAGCATCCGGGGATGTCGCTGCGTCTGGTTTATCAGGATCCGATGGGCAACTATTTCCATTCTTTAATGACTACCGCGCCGCTCACGGGGGCGATTGGCTTTATCATTGTTATGCTCTTCCTGGCGGTACGCTGGTTACAACGGCAACTTGCCGGGCAAGAATTGCTGGAAACCCGGGCTACTCGTATCTTAAACGGTGAGCGTGGCTCTAATGTGTTGGGAACCATCTATGAATGGCCGCCCAGAACCAGCAGTGCGCTAGATACGCTGCTTCGTGAAATTCAGAACGCACGCGAACAACACAGCCGTCTTGATACGCTGATCCGCTCTTATGCCGCCCAGGACGTGAAAACCGGCCTCAATAACCGACTCTTTTTCGATAATCAGTTAGCAACGTTACTGGAAGATCAGGAGAAAGTAGGTACCCACGGGATCGTGATGATGATTCGTCTGCCGGATTTCAATATGTTGAGCGATACTTGGGGGCACAGCCAGGTTGAAGAACAGTTCTTCACTCTGACGAATCTGCTGTCGACATTTATGATGCGCTACCCTGGCGCACTGCTGGCGCGTTACCACCGCAGTGATTTTGCTGCGCTGTTACCGCACCGGACGTTAAAAGAGGCAGAGAGCATCGCCGGTCAGTTAATCAAAGCCGTCGATACCTTGCCGAACAATAAAATGCTCGATCGCGACGATATGATCCACATTGGTATCTGCGCCTGGCGTAGTGGTCAGGATACCGAGCAGGTAATGGAACATGCAGAGTCTGCCACGCGTAATGCGGGATTGCAGGGCGGCAATAGCTGGGCTATTTACGATGACTCGTTGCCTGAAAAAGGACGCGGTAATGTTCGCTGGCGTACGCTTATCGAGCAAATGCTCAGTCGCGGCGGCCCGCGCCTTTATCAAAAACCGGCGGTTACTCGCGAAGGTCAGGTTCATCATCGCGAACTCATGTGCCGCATCTTCGATGGTAATGAAGAGGTTAGCTCGGCGGAGTATATGCCGATGGTCTTGCAGTTTGGCTTATCGGAAGAGTATGACCGTCTGCAAATCAGCCGTCTTATTCCACTATTGCGTTACTGGCCAGAGGAAAATCTGGCGATTCAGGTTACCGTTGAGTCGCTGATTCGCCCGCGTTTTCAGCGTTGGCTGCGCGATACGTTAATGCAATGTGAAAAATCACAACGAAAACGCATAATTATTGAACTTGCAGAGGCCGATGTAGGTCAACATATCAGTCGTTTACAACCTGTTATTCGTTTAGTAAATGCTTTAGGGGTACGGGTAGCCGTCAACCAGGCTGGTTTGACGCTGGTAAGCACCAGTTGGATCAAAGAACTTAATGTTGAGTTACTCAAGCTCCATCCGGGGCTGGTCAGAAACATTGAGAAGCGAACGGAGAACCAGCTGCTGGTTCAAAGCCTGGTGGAAGCCTGCTCCGGGACCAGCACCCAGGTTTACGCCACCGGTGTGCGTTCGCGAAGCGAGTGGCAGACCCTGATTCAGCGCGGTGTTACAGGCGGGCAAGGGGATTTTTTCGCGTCCTCACAGCCACTTGATACTAACGTGAAAAAATATTCACAAAGATACTCGGTTTAACCTGCCGTTTAATCCGTTTTCACGTAGAATAACGCGCGCTGCGTCTCATGGGAGTGTGCTTGTCTGCTCGCCAGATTGTTGCAGCACATATGCAGATGAATGACCTTACGCGGTTGCAAACAGGCGAGGAATGCTGCTGATGCATTAAGCCTTTCTGGACTCAGGCAGAGATTTGTAACAAAGGAAACGAACTGCACTAATTTTCACCGTAGCAGATGATTTTTGCGCCTTGTCGCTGCTGCGTGTGGTTGGTAAAGTAAGCGGATTTTCTTTTCCGCCCCAGCTTTCAGGATTATCCCTTAGTATGTTGAAAAAATTTCGTGGCATGTTTTCCAATGACTTGTCCATTGACCTGGGTACTGCGAATACCCTCATTTATGTAAAAGGACAAGGCATCGTATTGAATGAGCCTTCCGTGGTGGCCATTCGTCAGGATCGTGCCGGTTCACCGAAAAGCGTAGCTGCAGTAGGTCATGACGCGAAGCAGATGCTGGGCCGTACGCCGGGCAATATTGCTGCCATTCGCCCAATGAAAGACGGCGTTATCGCCGACTTCTTCGTGACTGAAAAAATGCTCCAGCACTTCATCAAACAAGTGCACAGCAACAGCTTTATGCGTCCAAGCCCGCGCGTTCTGGTTTGTGTACCGGTTGGCGCGACCCAGGTTGAACGCCGCGCAATTCGTGAATCCGCGCAGGGCGCTGGTGCCCGTGAAGTCTTCCTGATTGAAGAACCGATGGCTGCCGCAATTGGTGCTGGCCTGCCGGTTTCTGAAGCGACCGGTTCTATGGTGGTTGATATCGGTGGTGGTACCACTGAAGTTGCTGTTATCTCCTTGAACGGCGTGGTTTACTCCTCTTCCGTGCGCATTGGTGGTGATCGTTTCGACGAAGCTATCATCAACTATGTGCGTCGTAATTACGGTTCTCTGATCGGTGAAGCCACCGCAGAACGTATCAAGCACGAAATCGGTTCGGCTTACCCGGGCGATGAAGTCCGTGAAATCGAAGTTCGTGGCCGTAACCTGGCAGAAGGTGTTCCACGCGGTTTTACCCTGAACTCCAACGAAATCCTCGAAGCACTGCAGGAACCGCTGACCGGTATTGTGAGCGCGGTAATGGTTGCACTGGAACAGTGCCCGCCGGAACTGGCTTCCGACATCTCCGAGCGCGGCATGGTGCTCACCGGTGGTGGCGCACTGCTGCGTAACCTTGACCGTTTGTTAATGGAAGAAACCGGCATTCCAGTCGTTGTTGCTGAAGACCCGCTGACCTGTGTGGCGCGCGGTGGCGGCAAAGCGCTGGAAATGATCGACATGCACGGCGGCGACCTGTTCAGCGAAGAGTAATCGGATGCAGGCAGGGGAAATGTCTGTTTACCCTGCCTGGTCTGATACGAGAATACGCATAACTTATGAAGCCAATTTTTAGCCGTGGCCCGTCGCTACAGATTCGCCTTATTCTGGCGGTGCTGGTGGCGCTCGGCATTATTATTGCCGACAGTCGCCTGGGGACGTTCAGTCAAATCCGTACTTATATGGATACCGCCGTCAGTCCTTTCTACTTTGTTTCCAATGCTCCTCGTGAATTGCTGGATGGCGTATCGCAGACGCTGGCCTCGCGTGACCAACTAGAACTTGAAAACCGGGCGTTACGTCAGGAACTGTTGCTGAAAAACAGTGAACTGCTGATGCTTGGACAATACAAACAGGAGAACGCGCGTCTGCGCGAGCTGCTGGGTTCCCCGCTGCGTCAGGATGAGCAGAAAATGGTGACTCAGGTTATCTCCACGGTTAACGATCCTTATAGCGATCAAGTTGTTATCGATAAAGGTAGCGTTAATGGCGTTTATGAAGGCCAGCCGGTCATCAGCGACAAAGGTGTTGTTGGTCAGGTGGTGGCCGTCGCTAAACTGACCAGTCGCGTGCTGCTGATTTGTGATGCGACCCACGCGCTGCCAATCCAGGTGCTGCGCAACGATATCCGCGTAATTGCAGCCGGTAACGGTTGTACGGATGATTTGCAGCTTGAGCATCTGCCAGCGAATACGGATATTCGTGTTGGTGATGTGCTGGTGACCTCCGGTCTGGGCGGTCGTTTCCCGGAAGGCTATCCGGTCGCGGTTGTCTCTTCCGTAAAACTCGATACCCAGCGCGCTTATACTGTGATTCAGGCGCGTCCGACTGCAGGACTGCAACGTTTGCGTTATCTGCTGCTGCTGTGGGGCGCGGATCGTAACGGCGCTAACCCGATGACGCCGGAAGAGGTGCATCGTGTTGCTAATGAACGTCTGATGCAGATGATGCCGCAGGTATTGCCTTCGCCAGACGCGATGGGGCCAAAGTTACCTGAACCGGCAACGGGGATCGCTCAGCCGACTCCGCAGCAACCGACGACAGGAAATGCAGCTACTGCGCCTGCTGCGCCCACACAGCCTGCTGCTAATCGCTCTCCACAAAGGGCTACGCCGCCGCAAAGTGGTGCTCAACCGCCTGCGCGTGCGCCGGGAGGGCAATAGTGGCGAGCTATCGTAGCCAGGGACGTTGGGTAATCTGGCTCTCTTTCCTCATTGCGCTGTTGCTGCAAATCATGCCCTGGCCGGATAACCTGATTGTTTTCCGGCCAAACTGGGTGTTACTCATCTTGTTGTATTGGATCCTGGCCTTGCCTCATCGCGTAAATGTGGGCACAGGTTTTGTGATGGGTGCCATACTGGATCTGATCAGCGGCTCGACGCTTGGCGTACGCGTATTGGCGATGAGCATCATTGCTTACCTGGTGGCGCTGAAATACCAGCTTTTCCGCAACCTCGCATTATGGCAGCAGGCGCTGGTCGTCATGTTGCTTTCGCTGGTGGTGGATATTATTGTTTTCTGGGCAGAGTTTTTAGTGATTAACGTCTCTTTCAGACCGGAAGTGTTCTGGAGTAGTGTAGTCAATGGGGTGCTCTGGCCGTGGATTTTCTTGCTGATGCGTAAAGTCCGTCAGCAGTTTGCAGTGCAATAAAGGTTTCTATGACTTCTCTGTATTTAGCTTCCGGTTCTCCGCGTCGTCAGGAGTTACTTGCGCAACTTGGCGTGACCTTTGAACGTATTGTTACGGGCATTGAGGAGCAGCGTCAGCCGCAGGAGAGCGCGCAGCAGTATGTTGTGCGTCTGGCGCGCGAGAAAGCACAGGCGGGTGTCGCGCAAACGGCGCAGGATCTCCCGGTGCTGGGTGCGGATACTATCGTTATCCTGAATGGAGAAGTGCTGGAGAAACCGCGCGACGCAGAGCATGCGGCGCAGATGTTGCGCAAATTATCGGGTCAGACCCATCAGGTGATGACGGCAGTGGCGTTGGCCGACAGCCAGCACATTCTCGATTGCCTGGTGGTCACCGATGTGACTTTCAGAACGTTAACAGACGAAGACATCGCGGGCTATGTCGCCAGCGGTGAACCGTTAGATAAAGCAGGTGCATACGGTATTCAGGGGCTGGGTGGCTGTTTTGTCAGGAAGATAAATGGCAGCTATCACGCCGTAGTCGGCTTACCGCTGGTTGAAACGTATGAATTATTAAGTAATTTTAACGCACTGCGTGAGAAAAGGGATAAACATGACGGCTGAATTGTTAGTAAACGTAACGCCTTCGGAAACGCGAGTGGCGTATATTGATGGCGGTATTCTGCAGGAAATTCATATTGAACGTGAGGCGCGACGCGGAATAGTAGGCAATATCTACAAGGGTCGTGTAAGTCGTGTACTTCCGGGTATGCAGGCGGCTTTTGTAGATATTGGGCTGGATAAAGCCGCGTTTCTTCATGCATCCGATATCATGCCGCACACCGAATGTGTGGCGGGTGAAGAACAAAAGCAATTCACGGTGCGCGACATCTCGGAACTGGTTCGTCAGGGGCAAGATCTGATGGTGCAGGTGGTGAAAGATCCGCTTGGCACTAAAGGTGCGCGCCTGACCACCGATATCACGCTGCCTTCTCGCTATCTGGTGTTTATGCCAGGGGCTTCTCACGTTGGGGTTTCCCAACGTATTGAAAGCGAATCAGAACGTGAACGCCTGAAAAAAGTGGTCGCAGAGTATTGCGACGAGCAGGGCGGGTTTATCATCCGTACCGCAGCGGAAGGGGTTGGCGAGGCTGAACTGGCCTCCGATGCCGCTTATCTGAAACGCGTCTGGACCAAAGTTATGGAGCGTAAAAAACGCCCGCAGACCCGTTATCAGCTGTACGGCGAACTGGCGCTGGCGCAGCGTGTTCTGCGTGATTTCGCCGACGCCGAACTGGACCGCATTCGCGTTGACTCACGCCTGACTTACGAAGCATTGCTGGAGTTTACCTCGGAGTACATTCCCGAGATGACCAGCAAGCTGGAGCATTACACCGGACGCCAGCCGATTTTCGATCTCTTTGATGTCGAAAACGAAATCCAGCGAGCGCTGGAACGCAAAGTAGAACTGAAATCCGGTGGTTATCTGATTATCGACCAGACCGAAGCGATGACCACCGTGGACATCAATACCGGAGCGTTTGTCGGTCATCGCAATCTTGACGACACCATTTTCAATACCAATATTGAAGCGACGCAGGCTATCGCTCGCCAGTTACGGTTGCGCAATCTGGGCGGGATTATCATTATTGATTTCATCGATATGAATAATGAAGATCACCGCCGCCGAGTGCTGCACTCGCTGGAGCAGGCGTTGAGCAAAGACCGGGTGAAAACCAGCGTTAATGGTTTTTCGGCGCTGGGGCTGGTGGAGATGACGCGTAAACGCACCCGCGAAAGCATTGAGCACGTACTGTGTAACGAATGCCCAACCTGCCACGGTCGCGGAACGGTGAAAACCGTGGAAACGGTATGCTATGAAATCATGCGCGAGATTGTTCGTGTCCACCATGCTTACGACTCCGACCGTTTCCTGGTCTATGCTTCTCCGGCAGTAGCTGAAGCCTTGAAAGGCGAAGAGTCACACTCGCTGGCGGAAGTGGAAATTTTCGTTGGCAAACAGGTTAAAGTACAAATTGAACCGCTCTATAACCAGGAGCAGTTTGACGTCGTAATGATGTAAACAGATGCTGGGCCGCCATCCGGCAAAGGGTTTTTGAGTCACATTTTTAGCAGACAAGGAGTGACGGGTGAGGCGATTGCCGGGGATTTTACTGCTTACTGGAGCCGCGCTCGTTGTGATCGCTGCCCTGCTGGTTAGCGGCCTGCGTATTGCTTTACCGCATCTTGACGCCTGGCGTCCGGAAATCCTCAACAAAATAGAATCCGCGACTGGCATGCCGGTAGAAGCCAGTCAGCTCTCAGCCAGCTGGCAGAATTTTGGCCCGACGCTTGAAGCACACGACATCCGTGCAGAACTAAAAGATGGCGGCGAATTTTCGGTTAAACGCGTTACTCTGGCGCTGGATGTCTGGCAGAGCTTGTTACATATGCGCTGGCAGTTTCGCGACCTCACTTTCTGGCAGCTGCGCTTTCGCACCAACACTCCTATCACCAGCGGTGGTGGTAATGATAGCCTGGAAGCCAGTCACATCAGCGATCTGTTTCTTCGTCAATTTGACCATTTCGATCTCCGCGACAGTGAAGTCAGTTTTCTGACGCCATCCGGTCAGCGCGCCGAGCTGGCGATCCCACAACTCACCTGGCTGAACGATCCACGTCGACACCGTGCGGAAGGCCTGGTAAGCCTCTCCAGCCTTACCGGACAGCACGGCGTGATGCAGGTGCGTATGGATTTGCGCGATGATGAGGGGTTGTTAAGCAATGGTCGCGTCTGGCTACAGGCGGATGACATCGACCTGAAGCCGTGGCTCGGTAAATGGATGCAGGACAATATTGCTCTGGAAACGGCACAGTTCTCCCTTGAAGGCTGGATGACGATCGACAAAGGCGATGTAACCGGCGGTGACGTCTGGCTGAAACAGGGCGGTGCCAGCTGGTTGGGCGAGAAGCAAACGCATACGCTGTCGGTGGATAATCTGACCGCGCATATTACGCGTGAAAATCCGGGCTGGCAGTTCTCTATTCCCGATACACGGATCACGATGGACGGCAAACCCTGGCCGAGCGGAGCATTGACGCTGGCCTGGATACCGGAACAGGACGTTGGCGGCAAAGACAATAAACGCAGTGACGAACTCCGGATTCGCGCCAGTAATCTGGAGCTGGCAGGCCTGGAGGGCGTACGCCCGCTGGCCGCGAAACTTTCACCTGCACTGGGTGATGTTTGGCGCTCCACACAACCGAGCGGCAAGATTAACACTCTGGCGCTGGATATCCCGCTTCAGGCGGCAGACAAGACCCGTTTTCAGGCATCGTGGAGCGATCTGGCCTGGAAGCAATGGAAATTATTACCGGGTGCGGAACACTTCTCCGGGACGCTTTCCGGCAGCGTTGAAAATGGTTTGCTTACCGCGTCGATGAAGCAGGCAAAGATGCCTTACGAAACGGTATTCCGTGCGCCACTGGAAATCGCCGACGGCCAGGCAACTATAAGCTGGCTGAACAATGACAAAGGTTTCCAGCTGGATGGGCGTAATATTGACGTTAAAGCCAAAGCCGTCCATGCGCGCGGCGGTTTTCGTTACCTGCAACCTGCTAACGATGAACCCTGGCTGGGTATTCTGGCTGGCATCAGTACCGATGATGGTTCACAAGCCTGGCGCTATTTCCCGGAAAACTTGATGGGTAAAGACCTGGTTGATTATTTAAGTGGTGCGATTCAGGGCGGTGAAGCGGATAACGCGACGCTGGTTTATGGTGGCAATCCGCAACTCTTCCCCTATAAACACAACGAAGGTCAGTTTGAAGTGCTGGTGCCGCTGCGCAACGCGAAGTTTGCCTTCCAGCCGGACTGGCCTGCATTAACTAACCTTGATATTGAACTGGACTTTATTAACGACGGTTTATGGATGAAAACCGATGGCGTTAATCTGGGCGGCGTGCGCGCGAGTAATCTCACCGCAGTGATCCCTGACTACTCAAAAGAAAAACTGCTGATTGACGCTGACATTAAAGGTCCGGGTAAAGCCGTAGGCCCTTACTTTGATGAGACACCGCTGAAAGATTCCCTGGGTGCGACCCTGCAAGAACTCCAGCTCGACGGCGATGTGAATGCTCGCTTACATCTTGATATCCCGCTGAACGGCGAGCTGGTAACCGCGAAAGGTGAAGTGACGCTGCGTAATAACAGTCTGTTTATCAAACCACTCGACAGCACCCTGAAAAATTTGAGCGGTAAATTCAGCTTTATCAATGGCGATCTGCAAAGTGAACCACTGACAGCAAGCTGGTTTAATCAGCCGTTGAACGTGGATTTTTCCACCAAAGAAGGGGCAAAAGCCTACCAGGTTGCGGTGAATCTCAACGGTAACTGGCAACCGGCGAAAACCGGCGTTTTGCCTGAAGCGGTGAACGAAGCATTAAGTGGCAGCGTGGCGTGGGATGGTAAAGTGGGCATTGATCTGCCTTATCATGCTGGCGCGACCTATAACGTAGAGCTGAACGGCGATCTGAAGAATGTGAGTAGTCACTTACCTTCACCGTTAGCCAAACCTGCGGGTGAACCACTAGCGGTAAACGTTAAGGTTGATGGCAATCTCAACAGCTTTGAATTAACCGGACAGGCTGGTGCGGATAACCATTTCAATAGCCGCTGGTTGCTTGGTCAAAAGCTGACGCTCGATCGTGCTATTTGGGCGGCAGACAGTAAAACGCTCCCGCCGTTGCCGGAACAAAGTGGCGTTGAACTCAATATGCCGCCGATGAATGGTGCCGAGTGGCTGGCCCTGTTCCAGAAAGGCGCTGCGGAGAGTGTCGGTGGTGCAGCGAGTTTCCCACAACACATAACGTTACGTACGCCTATGTTGTCACTGGGAAATCAGCAATGGAATAACCTGAGTATTGTTTCGCAACCGACGGCAAATGGCACCCTGGTTGAGGCGCAAGGGCGTGAAATCAACGCCACGCTAGCGATGCGTAATAACGCGCCGTGGCTGGCGAATATCAAATATCTTTATTACAACCCGAGCGTGGCGAAAACTCGTGGTGATTCAACGCCGTCATCACCTTTCCCGACAACGGAGCGCATTAACTTCCGTGGCTGGCCGGACGCCCAAATACGATGCACAGAGTGCTGGTTCTGGGGGCAAAAATTCGGTCGCATTGACAGTGATATCACCATTTCTGGCGATACGTTAACGCTGACCAATGGACTGATTGATACTGGTTTCTCGCGGCTTACTGCCGATGGTGAATGGGTTAATAATCCGGGGAATGAACGTACCTCGCTGAAAGGAAAACTGCGCGGGCAGAAAATTGATGCCGCCGCAGAATTTTTTGGTGTCACGACGCCCATACGCCAGTCGTCATTTAATGTGGATTACGATTTACACTGGCGCAAAGCACCCTGGCAGCCAGATGAAGCGACGTTGAATGGCATCATTCATACTCAACTGGGTAAAGGCGAAATTACCGAAATCAATACCGGACATGCCGGGCAATTGCTGCGCTTATTGAGCGTAGATGCCCTGATGCGTAAGCTGCGTTTTGATTTCAGAGACACTTTTGGCGAAGGGTTTTATTTTGACTCCATTCGCAGCACCGCGTGGATTAAAGACGGCGTTATGCACACCGACGACACGCTGGTGGATGGCCTGGAGGCGGATATCGCCATGAAAGGGTCGGTAAATCTGGTACGTCGCGACCTGAATATGGAAGCGGTTGTCGCACCAGAGATTTCTGCGACGGTGGGCGTGGCTGCGGCTTTTGCGGTTAACCCCATTGTTGGCGCGGCAGTGTTTGCCGCCAGTAAAGTGCTGGGGCCGCTGTGGAGCAAAGTCTCCATTTTGCGCTATCACATTTCGGGTCCGCTGGACGATCCGCAAATCAACGAAGTGTTGCGCCAACCGCGTAAAGAAAAAGCGCAATGATTTGACGAGGGCGCGTAATTGCCCCAATCTCATATGATAATCGTTGCCAAAGGCCAACGAGCCAGAACATAACCGTAGGTCGGATAGGGCGTTCACGCCGCATCCGGCAGCCGTAAAAAATCCTCTACTGCAGTAACTAACGAGTAGCAAAAACGATGAGTCTTAACCTGGTAAGTGAACAATTGCTAGCGGCGAACGGCCTGAAACATCAGGACTTGTTCGCGATCCTCGGTCAACTGGCCGAACGTCGCCTTGATTATGGCGATCTCTATTTTCAGTCGAGCTATCACGAATCCTGGGTTTTAGAAGACCGCATTATTAAAGATGGTTCTTACAACATCGATCAGGGCGTTGGTGTGCGTGCAATCAGTGGTGAAAAAACCGGATTTGCTTACGCTGACCAAATCAGCCTGCTGGCGCTGGAACAGAGTGCGCAAGCGGCGCGCACCATCGTCCGTGATAGTGGTGATGGCAAAGTACAGACGCTGGGCGCGGTAGAGCATAGCCCGTTGTATACCTCGGTAGATCCGCTGCAAAGCATGAGCCGTGAAGAGAAGCTGGATATCCTGCGTCGCGTCGATAAGGTTGCCCGCGAAGCGGACAAGCGCGTACAAGAAGTGACTGCCAGCCTCAGCGGCGTTTATGAATTAATTCTGGTTGCGGCCACCGACGGCACGCTGGCGGCGGATGTTCGTCCGCTGGTGCGTCTTTCCGTGAGCGTTCTGGTCGAAGAAGATGGCAAACGCGAACGCGGTGCCAGTGGCGGCGGCGGTCGTTTTGGTTATGAGTTCTTCCTTGCCGATCTTGACGGCGAAGTCCGTGCGGATGCATGGGCAAAAGAAGCAGTGCGTATGGCGCTGGTCAATCTTTCTGCCGTTGCTGCACCAGCGGGCACCATGCCGGTAGTACTTGGCGCAGGTTGGCCGGGCGTGCTGTTGCATGAAGCGGTAGGTCACGGTCTGGAAGGCGACTTCAACCGCCGCGGCACTTCAGTATTTAGTGGGCAGGTCGGGGAGCTGGTGGCTTCAGAACTGTGTACCGTGGTTGATGACGGCACGATGGTCGATCGCCGTGGTTCGGTGGCGATTGATGACGAAGGTACGCCAGGCCAGTACAACGTGCTGATTGAGAACGGCATTCTGAAAGGCTACATGCAGGATAAACTCAACGCGCGTTTGATGGGGATGACGCCGACTGGCAACGGTCGCCGTGAATCCTACGCCCATCTGCCCATGCCGCGTATGACCAACACCTATATGCTGCCGGGTAAATCGACCCCGCAGGAAATTATTGAATCCGTTGAGTACGGTATCTATGCACCGAACTTTGGTGGCGGTCAGGTGGATATTACATCCGGCAAATTCGTTTTCTCCACTTCAGAAGCGTATCTGATTGAAAACGGTAAAGTAACGAAGCCGGTGAAAGGCGCGACGTTGATTGGTTCCGGTATCGAAACCATGCAGCAGATTTCGATGGTTGGCAACGACCTGAAACTGGATAACGGCGTGGGTGTCTGCGGTAAAGAAGGGCAAAGTTTGCCGGTTGGCGTGGGCCAGCCAACGCTGAAAGTTGATAACCTGACTGTTGGCGGTACTGCGTAATAATTTCATTATTTTCAGAAGGATAATTAATCTTTCTACGTGCACGAACGGTCCCCTCGCTCCTCTGGGGTTAGGGTGAGGGGAACCCGTTGGCACAGGTTTGTAGAACGTAACAGTACAATATGAATTACTTCTCTTTCCCGCGCCCGTGCATCTCCTGAAACAATTTACCGACCTCAACAAAATAATCCGTCAGCGAGTTGATCACGACCTGTACCTTCAGCGGCAGCTTATCTTTTTCGGTATATAACGCATAAACCGGGCGTGGATCTGACTGGTAACGTGGTAGCAGGATCTCCAGCTCCCCGCGATTGATCTCGTTGATCACCCACATCAGCGGCACGTAGGCGATCCCAGCGCCTGCAGTCAGCCAGCGCACCAGCGTCATCGGATCATTAGTCACAAATCTCCCCTGCGGGATCAGGCGAGTCGAGATCCCTTCCGGTGCGATCAGTTCAAATTCATTGTCGGGCCGCACGCTGTATTCAAGCCATGAATGGCTACTTAAATCGGCGGGTTTTTCCGGTATGCCGTATTGTGTGAGATAGCTTTTTGCGGCGCACACCACCATTGGCATCGCGCCCAGACGGCGGGAAAACAGGCTGGAATCCTGCAACGCGCCGACGCGGATCACCACATCCAGACCGTCGGCAATCAGGTCGGGGGCAGGAATTCCGGTAACCAGATTGACGCTCAAACCTGGGTATTCTTTCAGCATTTTTGCTGTCAGCCCGGCGAGAACATTTTGTGCCATAGTTGAAGAACAGCCAATGCGCAGCGTCCCGATGGGGGTGTTATTGAAGGCATACAGTTGCTCATGAACATCCTGCACTTCATGAAGCATACGACGGCAGCCCTGGTAGTAAATTCTACCGGCTTCGGTCAGGCCAATGCTGCGTGTGCTACGGTTTAACAGCTTTACCTGCAACTCATCTTCCAGTTTTGACACCGTCTGACTGATGGACGAAACGCTCATCTGTAGCTGTCTGGCGGCGGCGGTAAAAGAGCCAAATTCAACTACTTTGGCAAACACCGACATGCGTTTTAGTCGTTCCATTATTCACTCTGACTTAAAAGTGATTTAGATCACATAATATAGATAACAGCATAACAGTTACGCTAATATATTAAATATCAATCTACAGCAATGTTGCTCTCGCCCGGCTTTCCATGCCCTTCTCTGCGGCGACAGATGCTGAAAATAATAACGCCTGCTCTCTCTTTACAACCAAGGTCAACATGAGTCTGTTTCCCGTTATCGTGGTGTTTGGGCTGTCCTTCCCACCGATATTTTTTGAATTGCTTTTATCACTGGCGATTTTCTGGCTGGTGCGCCGGGTACTTGTGCCAACAGGTATCTACGACTTTGTCTGGCATCCGGCGTTGTTCAACACCGCGCTCTATTGCTGCTTGTTTTATTTGATATCGCGACTGTTCGTTTGAGGTTGAAGTGAAAACACTAATAAGAAAATTCTCCCGTACGGCCATCACGGTCGTATTAGTCATTCTGGCCTTCATCGCAATTTTTAATGCCTGGGTCTATTACACCGAATCCCCCTGGACGCGTGACGCGCGCTTTAGCGCTGACGTCGTTGCGATCGCGCCGGACGTTTCTGGACTCATTACCCAGGTGAATGTTCATGATAACCAGCTGGTGAAAAAAGGACAGATACTGTTCACCATCGACCAGCCGCGCTATCAAAAGGCGCTTGAGGAAGCGCAAGCCGATGTTGCTTATTATCAGGTACTGGCACAGGAGAAACGCCAGGAGGCCGGACGTCGTAACCGTCTCGGTGTGCAGGCGATGTCTCGCGAAGAGATCGACCAGGCTAACAACGTACTACAAACGGTTCTGCATCAGTTAGCGAAAGCGCAGGCGACCCGCGATCTGGCAAAACTGGATCTTGAACGCACGGTGATCCGCGCGCCAGCAGATGGCTGGGTGACCAACCTCAACGTCTATACCGGTGAGTTTATTACTCGAGGATCAACGGCGGTTGCGCTGGTGAAACAGAACTCCTTCTATGTACTGGCCTATATGGAAGAAACTAAGCTGGAAGGGGTGCGTCCGGGGTATCGTGCAGAGATCACGCCGCTTGGCAGTAACAAAGTGCTGAAAGGGACTGTTGATAGTGTTGCCGCAGGGGTCACCAACGCCAGCAGCACGCGTGACGACAAAGGGATGGCGACTATAGACTCTAACCTTGAATGGGTGCGTCTTGCGCAACGTGTTCCGGTTCGTATTCGTCTCGACAACCAGCAAGAGAACATCTGGCCTGCGGGCACCACTGCTACAGTGGTGGTCACTGGCAAACAAGATCGCGACGAAAGCCAGGATTCGTTCTTCCGTAAAATGGCCCATCGCCTGCGTGAGTTTGGTTAATCACGATGGGTATTTTCTCCATTGCTAACCAACATATTCGCTTTGCGGTAAAACTGGCGACCGCCATTGTACTGGCGCTGTTTGTTGGCTTTCACTTCCAGCTGGAAACGCCACGCTGGGCGGTACTGACAGCGGCGATTGTTGCCGCCGGTCCGGCCTTTGCTGCGGGAGGTGAACCGTATTCTGGCGCTATTCGTTATCGTGGCTTTTTGCGCATCATCGGCACATTTATTGGCTGTATTGCCGGACTGGTGATCATCATTGCGATGATCCGCGCACCATTATTGATGATTCTGGTGTGCTGTATCTGGGCCGGTTTTTGTACCTGGATATCCTCGCTGGTACGAATAGAAAACTCGTACGCGTGGGGGCTGGCCGGTTATACCGCGCTGATCATTGTGATCACCATTCAGCCGGAACCATTGCTTACGCCGCAGTTTGCCGTCGAACGTTGTAGCGAGATCGTTATCGGTATTGTGTGTGCGATTATGGCGGATTTGCTCTTTTCTCCGCGATCGATCAAACAAGAAGTGGATCGAGAGCTGGAAAGTTTGCTGGTCGCGCAATATCAATTAATGCAACTCTGTATCAAGCATGGCGATGGTGAAGTTGTCGATAAAGCCTGGGGCGACCTGGTGCGACGCACCACGGCGCTACAAGGCATGCGCAGCAACCTGAATATGGAATCTTCCCGCTGGGCGCGGGCCAATCGACGTTTAAAAGCGATCAATACGCTATCGCTGACGCTGATTACCCAATCCTGCGAAACTTATCTTATTCAGAATACGCGCCCGGAATTGATCACTGATACTTTCCGCGAATTTTTTGACACGCCGGTAGAAACCGCGCAGGACGTCCACAAGCAGCTCAAACGCCTGCGGAGAGTTATCGCCTGGACCGGGGAACGGGAAACGCCTGTCACCATTTATAGCTGGGTCGCGGCGGCAACGCGTTATCAGCTTCTCAAGCGCGGCGTTATCAGTAACACAAAAATCAACGCCACCGAAGAAGAGATCCTGCAAGGCGAACCGGAAGTCAAAGTAGAGTCAGCCGAACGTCATCATGCGATGGTTAACTTCTGGCGAACCACACTTTCCTGCATTCTGGGCACGCTTTTCTGGCTGTGGACGGGCTGGACTTCCGGCAGTGGTGCAATGGTGATGATTGCGGTAGTGACGTCACTGGCAATGCGTTTGCCGAATCCACGCATGGTGGCGATCGACTTTATCTACGGGACGCTGGCCGCGCTGCCGTTAGGGCTGCTCTACTTTTTGGTGATTATCCCTAATACCCAACAGAGCATGTTGCTGCTGTGTATTAGCCTGGCAGTGCTGGGATTCTTCCTCGGAATAGAAGTACAGAAACGGCGACTGGGCTCGATGGGGGCGCTGGCCAGCACCATAAATATTATCGTGCTGGATAACCCGATGACTTTCCATTTCAGTCAGTTTCTCGACAGCGCATTAGGGCAAATCGTCGGCTGTGTGCTCGCGTTCACCGTTATTTTGCTGGTGCGGGATAAATCGCGCGACAGGACTGGACGTGTACTGCTTAATCAGTTTGTTTCTGCCGCTGTTTCCGCGATGACTACCAATGTGGCACGTCGTAAAGAGAACCACCTCCCGGCACTTTATCAGCAGCTGTTTTTGCTGATGAATAAGTTCCCAGGGGATTTGCCGAAATTTCGCCTGGCGCTGACGATGATTATCGCGCACCAGCGCCTGCGTGATGCGCCGATCCCGGTTAACGAGGATTTATCGGCGTTTCACCGACAAATGCGCCGCACAGCAGACCATGTGATATCTGCCCGTAGCGATGATAAACGTCGTCGGTACTTTGATCAGTTGCTGGAAGAACTGGAAATCTACCAGGAAAAGCTACGCATCTGGCAAGCGCCACCGCAGGTGACGGAACCGGTTCATCGGCTGGCGGGGATGCTCCATAAGTATCAACATGCGTTGACCGATAGTTAAGTCAAAACCGACGCCAAAAGCGTCGGTTTTTTCATGGCTATACTTAGCGATGAACGGCAGAACTCGCCGCGAAACGTGACGGTGGCAACAGATGAATATTTATACCTTTGATTTTGATGAGATTGAGAGTCAGGAGGATTTTTATCGCGACTTCAGCCAAGCCTTTGGTCTGGCGAAAGATAAGGTACGCGATCTCGACTCACTATGGGATGTGTTAATGAACGATGTCCTGCCGCTACCACTTGAGATTGAATTTGTTCATCTGGGAGAGAAAACGCGTCGCCGTTTTGGCGCGTTAATATTGCTGTTTGATGAGGCAGAGGAAGAGCTGGAAGGGCATTTGCGTTTTAATGTTCGTCATTAGCGCAAAAAAAAGCCCCCGAACCGGGGGCAATATCGTCGGACAAGACGATGAGGGTTTATTTGTACAGTTCAGCCGTAGCGTGCCAGGTGTCACCGCTACGAGCTTCAGTAATCTGGTAGGCCGTTGCGCCTTTCTCTTCCGCTTTTTTGTTCAGCATTTCACGTATATCCATTGGCGAAGACGCCACACCACTTACGGATACGGTCCCGATTGCTTCACGGTTTTGTGCTTGTGCAGCATCAATGGAGTCGGCAGCGAATGCACCGAAAGAGAGAACAGAAAGTACGCTTAATGCAGCAACAGTGGTTTTGATTTTCATGATTTTTACCTCGACATAATCTTTTTGCTGGGTCTTTGTTTCGTGACCCTTATCACAAAATCAAGTATACACTAATTATTGAACTAATTAATACCACGCTAACTGTTTCTGTTGATACTAAGTGTTAAAAATGTGAATGTTAATAACAAAAAAGAATTAAAAATACGGTTATTTATCGTTAATTCTAGTGTTTTCAATTAGATAATGAATTTTGCATAATGTGCTTTTTTTACCTTATGTATTCATTGTGTGAATGACATGTCGCAGTAAAACGCACTATTCGTTAAATGTATCGCAGGAGAAAGCAGGGGGTTGAGAGGGATAAGCAACATTTTCCCCGCCGCCAGAAGCGACGGGGCAGAGATTAAAGCTCCTGGTCGAACAGCTCTAAAATCGCTTCGTACAGGTCTTTGACGGTGAAACCGTTAGCAGGGGTGGTAAAGATGGTGTCATCGCCAGCGATGGTGCCCAGAATACCTTCTGCTTTGCCCAGTGAGTCCAGCAGGCGAGCAATTAACTGCGCCGCGCCAGGGCTGGTATGAATCACGACAACTGCATCGTTGTAGTCGATATCCAGCACCAGATTTTTCAATGGACTGGAGGTGGTTGGTACACCCAGTTCAGCTGGCAGGCAGTAAACCATTTCCATTTTGGCATTGCGTGTACGTACAGCACCAAACTTGGTCAACATCCGCGAGACTTTAGACTGATTAATATTGTCAAAGCCTTGCTCCTGCAACGCGGCGACGATTTCGCCCTGGGAGCTAAATTTCTCTTCTTTAAGTAATGCTTTAAATGCTTTAACTAGTTCTTCTTGCTTAGCCGAGCTTCGCATAAGTCACCCGATATGGTGGTTGATACAACATTATTGTGCATACAGATGAATTTTTATGCAAACAGTCAGCCCTGAAGAAGGCTGAAATAATGTTATGAAAGAGCGGGATTTTATCAAATTTCGTTATTGAGAAACATGCCTGCGTCACGGCATGCAAATTCTGCTTAAAAGTAAATTAATTGTTATCAAATTGATGTTGTTTTGGCTGAACGGTAGGGTATATTGTCACCACCTGTTGGAATGTTGCGCTAATGCATAAGCGACTGTTAATTACGTAAGTTAGGTTCCTGATTACGGCAATTAAATGCATAAACGCTAAACTTGCGTGACTACACATTCTTGAGATGTGGTCATTGTAAACGGCAATTTTGTGGATTAAGGTCGCGGCAGCGGAGCAACATATCTTAGTTTATCAATATAATAAGGAGTTTAGGATGAAAGTCGCAGTCCTCGGCGCTGCTGGCGGTATTGGCCAGGCGCTTGCACTACTGTTAAAAACCCAACTGCCTTCAGGTTCAGAACTCTCTCTGTATGATATCGCTCCAGTGACTCCCGGTGTGGCCGTCGATCTGAGCCATATCCCTACTGCTGTGAAAATCAAAGGTTTTTCTGGTGAAGATGCGACTCCGGCGCTGGAAGGCGCAGATGTCGTTCTTATCTCTGCAGGTGTAGCGCGTAAACCGGGTATGGATCGTTCCGACCTGTTTAACGTTAACGCCGGCATCGTGAAAAACCTGGTACAGCAAGTTGCGAAAACCTGCCCGAAAGCGTGCATTGGTATTATCACTAACCCGGTTAACACCACAGTTGCGATTGCTGCTGAAGTGCTGAAAAAAGCCGGTGTTTATGACAAAAACAAACTGTTCGGCGTTACCACGCTGGATATCATTCGTTCCAACACTTTTGTTGCGGAACTGAAAGGCAAACAGCCAGGCGAAGTTGAAGTGCCGGTTATTGGCGGTCACTCTGGTGTTACCATTCTGCCGCTGCTGTCACAGGTTCCTGGCGTTAGTTTTACCGAGCAGGAAGTGGCTGATCTGACCAAACGTATCCAGAACGCGGGTACTGAGGTGGTTGAAGCGAAAGCCGGTGGCGGGTCTGCAACCCTGTCTATGGGCCAGGCAGCTGCACGTTTTGGTCTGTCTCTGGTTCGTGCACTGCAGGGCGAACAAGGCGTTGTCGAATGTGCCTACGTTGAAGGCGACGGTCAGTACGCCCGTTTCTTCTCTCAACCGCTGCTGCTGGGTAAAAACGGCGTGGAAGAGCGTAAATCTATCGGTACCCTGAGCGCATTTGAACAGAACGCGCTGGAAGGTATGCTGGATACGCTCAAGAAAGATATCGCCCTGGGCGAAGAGTTCGTTAATAAGTAATTGATTAGCGAATAATAAAAAACCGGAGCACAGACTCCGGTTTTTGTTTTGAGCACTCGACTTAATTGGTTGCCGGATATTCCTGAATGGTGACCTGCAGCGTTAACTGCTTATCATCACGCATCACTACAACCGGGATCACCGAACCAGGGCGAATTTCCGCCACCTGATCCATCGTCTCCAGAGCAGAGATGGCCGGTTTGTTATCCACCGAAATAATCAGATCGTTGACCTGAATACCCGCATTCGCCGCCGGGCCGTCAGGTGACACTTCATTAACCACGATCCCTTGCAGTTGATCTATACCACCGCCCTGCGCGTGCAGTGGTGCGATCTCGCGTCCGCCGATACCAATGTAGCCGCGGATCACGCGACCATCGCGGATCAGCTTATCCATAATTTTGGTTGCTAACTGGAAAGGAATCGCAAAGCCGATACCTTCCGGCGTTTCGCCATCGTTACTCTTATCAAACGACAGCGTGTTAATGCCCATCAGTTCGCCCAGCGAGTTCACCAGCGCGCCGCCAGAGTTACCGTGGTTAATGGAAGCATCGGTTTGTAGGAAGTTTTGCCGCCCGGTCGGGTTCAGACCGATTCGACCCGTGGCACTAATAATCCCCTGGGTAATGGTCTGCCCGAGGTTGTACGGGTTACCGATCGCCAGTACTACGTCGCCAATGTGCGGTACGCGACGTGCATTGATCGGAATGGTAGGTAAACCGCCAGTGGCATTAATTTTAAGTACCGCCAGATCGGTTAGAGAGTCAGATCCCACCAGCAATGCTTCAAATACACGACCATCCTGTAAGGCGACGATGATCTGATCGGCGTCGTTGATGACGTGTTTATTGGTGATGATATAACCGCGTTGATCCATGATTACACCGGATCCCAGGGTGCGGATCTCAAGCTGGTTGTGAGAGTTGGTGTTCAAACCACGGTTGTAAACGTTAACCACCGCTGGCGCGGCGCGGCGAACCGCCAGATTATAGCTGGCAGGCGTCTCATCGGTACTGTCAAATTGCGGAGTGGAAAGCGGGTTAAGGCTGCGCAGCGAAGGCATGGCAACCAGCAGAATAGCGCCGACAATTAATCCAATCGCAACGGAACGTAAGAGCTTCACAAACATGATGGAGGCGTCATTAAAAAAGGGAACGGCAGCAGCATACCACGAGTTAACCGGACATCACACGTAAGCCTGATGCCCGGTTTACGACATTAACGCATCAGCAGATAGATGCTTTCATTGCCGCGTACAATTTGCAGGGCGATGATGGCCGGTTTTGCCGCCAGCACTTTACGCATTTCAGCAATCGAGTTCACCCGATCGCGGTTGACGCCAATGATCACATCGTCTTTTTGCAAGCCAGCCTGAGCAGCTGGGCTTCCTTTGACAACTTCATCGATCTTAATACCTTTGCCGCCATCTTTTAGCTGACCATCGCTCAACGTTGCACCTTCCAGCGCTGGCGTGATCATTTCAGCGCTGGCCGACGAAGAGGTGCTGGTATCGAGCGTCACTTCTACTTCCAGTGGCTTGCCGTTACGCAGCAGGCCAAGCTTCACTTTCGTGCCCGGCTCGGTGGTCGCGATACGAGAGCGCAACTCAGCAAAGCTATTCAGCGGTTTGCCGTTGAGGCTGGTAATAATATCGCCCGCTTTGACGCCCGCTTTCGCTGAGCCAGAACCTGGCAACACTTCGCTGACAAACGCGCCACGCTGCACGTCAAGGTTGAAGGCTTTGGCGATATCGGCACTCATCTCGGTGCCTTTGATGCCTAACAAACCGCGTTTGATTTCACCAAAGTCGATAAGCTGCTGCGCCAGTGTTCGCGCCATATTACTGGGGATGGCAAATCCAATCCCGACGCTCCCGCCGCCAGGCGCAAGGATTGCAGTGTTGATGCCAATTAACTCACCGTTAAGGTTTAACAGTGCACCGCCGGAGTTACCGCGGTTAATGGAAGCATCTGTCTGGATAAAGTTTTCCAGACCTTCAAGATTCAACCCGCTGCGTCCTAATGCGGAAACAATGCCAGAGGTGGCGGTTTGCCCAAGGCCAAATGGGTTACCGACCGCTACGGCAAAATCACCGACGCGCAATTTATCGGAGTCGGCAATAGCGATTTGCGTTAATTTGCTCGGGTTTTGAATTTGTAACAGGGCGATATCGCTCTGGTCATCGCTACCAATCAGTTTTGCATCAAACTCGCGCCCATCATTGAGCTGAATACTGATTTTCTGTGCCTGATTAATCACATGGTTGTTGGTCAGCACATAGCCTTTACTGGCGTTGATGATGACACCGGAACCTAAACCTTCGAAGGGTTGTGCAGGTTGATCCGGTAAATCATCACCAAAAAACTTTTTGAATTCTTCCGGGATTTTCTGTCCCTGACTGGCCGTTCCTTCCACCCGTACGCTCACCACTGCCGGAAGCACTTTTTCCAGCATTGGAGCCAGACTGGGGAGAGGGGCCTGATCGGCAACCTGGCCTGGAATCGACGCGACGGCCTGAAATGACGCCGAGAGAGTTAACCCGACACTTAACGCTAATGCACTCAACAGCTGGGTTTGTTTTTTCATTATTCCTGCTCTCGTACCTGAATGATAAGAAAAGAGATTCAAAACAGTTTGATGTTATTGAATTTTCAGACGGCTAACAATGAGACACCAGATTAAATGATAGTCGGGGAGGGAGAGAAGAGGAGGGCGCAATGGCTGCGCCCGAAAAATAAATTAGTCGCGCTTCGCGCCAGTACGCAGCAGGCCGGATGCGCCTTCAGAATAGTCGCGAGGCATCTGCACCGGTGCCTGATCGTTGCTGGCTTCAGACTCTGCCAGACGATTACGGAACGGGTTTGCTTCAGCAGACAGTTCCGGCAGCAGGCTGCTGGAGCTTTTCGCCATGTGCTGATACAGCTGGCGATAGTCGTGCGCCATGGTATCCAGTAATTCCGCGCTGCGGGCAAAGTGGCTAACCAGCTCTTCGCGATACTCGTCCAGTTCAGCTTTATTCTTTTCCAGTTCGTACTGCAACGCCTGTTGCTGGCGTAGTTTACGATTACCAAAACGCATGGCCACAGCACCAATAATGATGCCGACGACTAACCCAATTAGCGCATATTCCCAGGTCATGAACATCTCCCGTTGTCTTTTGTTTCCGTAGGGTGTTGGCTTCAGGCTCCATGCCTGCGGCTGATTATGCCACTATAACCGTTAATTCCACAGAAGTGGAATCCCGACTGCATATCGCGTAGTGTAGAACGGCCTTTTTTTCGTCAACCGTGAACAACGGCGCACCGATTATTCAAGGAATAACAATAAGATCATGCAAAGCGTTACCCCAACATCGCAATACCTGAAGGCGCTCAATGAAGGCAGCCATCAACCCGACGACGTTCAAAAAGAGGCCGTCAGCCGCCTGGAAATTATTTATCAGGAACTCATCAATAGCACGCCACCAGCCCCCAGGACGAGTGGGCTAATGGCGCGGGTCGGTAAGCTGTGGGGTAAACGCGAAGACACAAAGCATACGCCAGTGCGTGGCTTATATATGTGGGGCGGTGTAGGACGCGGGAAAACCTGGCTGATGGACCTTTTCTATCAAAGCCTGCCGGGAGAGCGGAAACAGCGCCTGCACTTTCACCGTTTTATGATGCGGGTGCATGAAGAGCTAACTGCCTTACAGGGGCAGACCGATCCGCTGGAAATTATTGCCGATCGCTTTAAAGCCGAAACTGACGTGCTCTGTTTTGACGAATTTTTTGTTTCTGATATTACCGACGCCATGCTACTTGGCGGTCTGATGAAAGCCCTGTTCGCCCGCGGCATTACCCTGGTAGCGACGTCAAATATTCCGCCGGATGAACTTTATCGAAATGGCCTGCAACGTGCGCGTTTTCTGCCTGCAATCGATGCCATTAAACAGCATTGTGATGTAATGAACGTGGACGCTGGTGTTGATTATCGACTGCGTACACTCACTCAGGCGCATCTGTGGCTTTCGCCACTCAACGATGAAACCCGGGCGCAGATGGATAAACTATGGTTGGCGCTGGCGGGGGCGAAACGAGAAAATTCACCGACGTTAGAAATCAACCATCGGCCATTGGCGACAATGGGCGTCGAGAACCAGACGCTGGCGGTCTCTTTTACTACGCTGTGCGTCGACGCCCGCAGTCAGCATGACTATATTGCGCTCTCACGTCTCTTTCATACGGTCATGTTGTTTGATGTACCAGTTATGACGCGGTTGATGGAGAGCGAAGCGCGGCGCTTTATTGCGCTGGTGGATGAGTTTTACGAGCGCCATGTCAAATTAGTGGTGAGTGCAGAAGTGCCGCTGTATGAAATTTATCAGGGCGATCGGCTGAAGTTTGAGTTCCAGCGTTGCCTGTCACGTCTGCAAGAGATGCAAAGCGAAGAGTATCTGAAGCGCGAGCATTTGGCGGGTTAAAACCTGTCACAAATCACAAAAAGGGGTCGATCTTTGACCCCGACTTCTCTATAATCCTGCGACCCCACGTTACAAGAAAGTTTTTTTCCCAAAACTTTTTGTGTGCTGGCATAGGCTATTCGAAGGGGTAGGTTTGCCGGACTTTGTCGTGTGAACCTCAACAATTGAAGACGTTTGGGTGTTCACCAACGTGTAACTATTTATTGGGTAAGCTTTTAATGAAAACTTTTACAGCTAAACCAGAAACCGTAAAACGCGACTGGTATGTTGTTGACGCGACCGGTAAAACTCTGGGCCGTCTGGCTACTGAACTGGCTCGTCGCCTGCGCGGTAAGCACAAAGCGGAATACACTCCGCACGTAGATACCGGTGATTACATCATCGTTCTGAACGCTGACAAAGTTGCTGTAACCGGCAACAAGCGTACTGACAAAGTGTACTATCACCACACCGGCCACATCGGTGGTATCAAACAAGCGACCTTTGAAGAGATGATTGCTCGCCGTCCTGAGCGTGTGATTGAAATCGCGGTTAAAGGCATGTTGCCAAAAGGCCCGCTGGGTCGTGCTATGTTCCGTAAACTGAAAGTTTACGCGGGTAACGAGCACAACCACGCGGCACAGCAACCGCAAGTTCTTGACATCTAATCGGGATTATAGGCAATGGCTGAAAATCAATACTACGGCACTGGTCGCCGCAAAAGTTCCGCAGCTCGCGTTTTCATCAAACCGGGCAACGGTAAAATCGTAATCAACCAACGTTCTCTGGAACAGTACTTCGGTCGTGAAACTGCTCGCATGGTAGTTCGTCAGCCGCTGGAACTGGTCGACATGGTTGAGAAACTGGACCTGTACATCACCGTTAAAGGTGGTGGTATCTCTGGTCAGGCTGGTGCGATCCGTCACGGTATCACCCGCGCTCTGATGGAATACGACGAGTCCCTGCGTTCTGAACTGCGTAAAGCTGGCTTCGTTACTCGTGACGCTCGTCAGGTTGAACGTAAGAAAGTCGGTCTGCGTAAAGCACGTCGTCGTCCGCAGTTCTCCAAACGTTAATTGGCTTCTGCTCCGGCAGAAAACAATTTGCGAAAAAACCCGCTTCGGCGGGTTTTTTTATGGTTAAATTCTGAATCAGCGTAAAAATTGGAAAGTTGCTTTTTGCTACCGTCTGACAGACAGGCAAAACAAAAACCACATCGCCAATAAGGGACTAAGTCAACTATTTCAGACTAAAGCGCATCTCTTTTTCCCCATTTCCGGCATCGACTCACCACAAAGGTCGCAAAATCTGGTAAACTATCATCCAATTTTCTGCCCAAATGTCGGGTATTGCTCATTTTTTGTTTGATTTTCGAACAAAGAGAGTCGTTCCTTACTGGGTAACACAACTTCTGACTGGCCACCTGGTGGCTGGTAGCAGTAAAAATTCTGACTATACCTGGAGGTTTTCATGGCTGTCGCTGCCAACAAACGTTCGGTAATGACGCTGTTTTCCGGTCCTACTGACATCTATAGCCATCAGGTCCGCATTGTGCTGGCTGAGAAAGGTGTAAGTTTCGAGATCGAACACGTGGAAAAGGACAATCCGCCTCAGGATCTGATTGACCTCAACCCGAATCAGAGCGTTCCGACCCTGGTGGATCGTGAGCTGACCCTGTGGGAATCTCGCATCATTATGGAATATCTGGATGAGCGTTTCCCGCATCCGCCACTGATGCCTGTTTACCCGGTAGCTCGCGGTGAAAGCCGTCTGTACATGCATCGTATCGAAAAAGACTGGTACACGCTGATGAACACCATCATCAACGGTTCAGCTTCTGAAGCAGATGCCGCACGTAAGCAACTGCGCGAAGAACTGCTGGCGATTGCGCCGGTCTTCGGTCAGAAGCCGTACTTCCTGAGCGATGAGTTCAGCCTGGTCGATTGCTATCTTGCTCCGCTGCTGTGGCGTCTGCCGCAACTGGGCATCGAGTTCAGCGGCCCGGGTGCGAAAGAGCTGAAAGGCTATATGACCCGCGTCTTTGAGCGTGACTCTTTCCTTGCTTCTTTAACTGAAGCAGAACGTGAAATGCGTCTGGGCCGGAGTTAATCTGTATGGATTTGTCACAGCTAACACCACGTCGTCCCTATCTGCTGCGTGCATTCTATGAGTGGTTGCTGGATAACCAGCTCACGCCGCACCTGGTGGTGGATGTGACGCTCCCTGGCGTGCAGGTTCCTATGGAATATGCGCGTGACGGGCAAATCGTACTCAACATTGCGCCGCGTGCTGTCGGCAATCTGGAACTGGCGAATGATGAGGTGCGCTTTAACGCGCGCTTTGGCGGCATTCCGCGTCAGGTTTCTGTGCCGCTGGCTGCCGTGCTGGCTATCTACGCCCGTGAAAATGGCGCAGGCACAATGTTTGAGCCTGAAGCTGCCTACGATGAAGATACCAGCATCATGAATGATGAAGAGGCATCGGCAGACAACGAAACCGTTATGTCGGTTATTGATGGCGACAAGCCAGATCACGATGATGACACTCATCCTGACGATGAACCTCCGCAGCCACCACGCGGTGGTCGACCGGCATTACGCGTTGTGAAGTAA
Protein sequences of DBSCAN-SWA_1 >CP034966|502880:547183|539194_540133_+|QAS88594.1|DBSCAN-SWA MKVAVLGAAGGIGQALALLLKTQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSGEDATPALEGADVVLISAGVARKPGMDRSDLFNVNAGIVKNLVQQVAKTCPKACIGIITNPVNTTVAIAAEVLKKAGVYDKNKLFGVTTLDIIRSNTFVAELKGKQPGEVEVPVIGGHSGVTILPLLSQVPGVSFTEQEVADLTKRIQNAGTEVVEAKAGGGSATLSMGQAAARFGLSLVRALQGEQGVVECAYVEGDGQYARFFSQPLLLGKNGVEERKSIGTLSAFEQNALEGMLDTLKKDIALGEEFVNK >CP034966|502880:547183|517836_518307_-|QAS88576.1|DBSCAN-SWA MDIRKIKKLIELVEESGISELEISEGEESVRISRAAPAASFPVMQQAYAAPMMQQPAQSNAAAPATVPSMEAPAAAEISGHIVRSPMVGTFYRTPSPDAKAFIEVGQKVNVGDTLCIVEAMKMMNQIEADKSGTVKAILVESGQPVEFDEPLVVIE >CP034966|502880:547183|520410_522351_+|QAS88578.1|DBSCAN-SWA MRLTTKFSAFVTLLTGLTIFVTLLGCSLSFYNAIQYKFSHRVQAVATAIDTHLVSNDFSTLRPQITELMMSADIVRVDLLHGDKQVYTLARNGSYRPVGSSDLFRELSVPLIKHPGMSLRLVYQDPMGNYFHSLMTTAPLTGAIGFIIVMLFLAVRWLQRQLAGQELLETRATRILNGERGSNVLGTIYEWPPRTSSALDTLLREIQNAREQHSRLDTLIRSYAAQDVKTGLNNRLFFDNQLATLLEDQEKVGTHGIVMMIRLPDFNMLSDTWGHSQVEEQFFTLTNLLSTFMMRYPGALLARYHRSDFAALLPHRTLKEAESIAGQLIKAVDTLPNNKMLDRDDMIHIGICAWRSGQDTEQVMEHAESATRNAGLQGGNSWAIYDDSLPEKGRGNVRWRTLIEQMLSRGGPRLYQKPAVTREGQVHHRELMCRIFDGNEEVSSAEYMPMVLQFGLSEEYDRLQISRLIPLLRYWPEENLAIQVTVESLIRPRFQRWLRDTLMQCEKSQRKRIIIELAEADVGQHISRLQPVIRLVNALGVRVAVNQAGLTLVSTSWIKELNVELLKLHPGLVRNIEKRTENQLLVQSLVEACSGTSTQVYATGVRSRSEWQTLIQRGVTGGQGDFFASSQPLDTNVKKYSQRYSV >CP034966|502880:547183|531440_532886_+|QAS88586.1|protease|DBSCAN-SWA MSLNLVSEQLLAANGLKHQDLFAILGQLAERRLDYGDLYFQSSYHESWVLEDRIIKDGSYNIDQGVGVRAISGEKTGFAYADQISLLALEQSAQAARTIVRDSGDGKVQTLGAVEHSPLYTSVDPLQSMSREEKLDILRRVDKVAREADKRVQEVTASLSGVYELILVAATDGTLAADVRPLVRLSVSVLVEEDGKRERGASGGGGRFGYEFFLADLDGEVRADAWAKEAVRMALVNLSAVAAPAGTMPVVLGAGWPGVLLHEAVGHGLEGDFNRRGTSVFSGQVGELVASELCTVVDDGTMVDRRGSVAIDDEGTPGQYNVLIENGILKGYMQDKLNARLMGMTPTGNGRRESYAHLPMPRMTNTYMLPGKSTPQEIIESVEYGIYAPNFGGGQVDITSGKFVFSTSEAYLIENGKVTKPVKGATLIGSGIETMQQISMVGNDLKLDNGVGVCGKEGQSLPVGVGQPTLKVDNLTVGGTA >CP034966|502880:547183|509809_512098_+|QAS88569.1|tail|DBSCAN-SWA MANKKLNATITIGGAVSGSLRGAFGTVEKSTIKIGAAIKHMSREQRQLNDAMKRYGQDGTMVGRMKERYQAIVGQVERLRAAQERLNRVQRASAENLAKQAQLRGQIIDTVMAGAAVAAPIKIAADREQHAIGIAKQLDGARDAAGNLTAKFWEMRKAVADLGHEIPLATNDLFDMATAGLRMGVAGDQILGFTRNVAKLASALELNPEEVADNMGKIQNVYRLTQAELVRLGDAINYLDDQNTVKGGELIDFLQRVGGSASLAKLTANDMAAIGTTLISMGESADTAATSVKALVSKMTLGSKSTKSAREAFKELGYSAQEVAKSMQLDSVKTIENFMKTVNKLPDYKKSGILVSIFGQEYVGSISKLAANMDKFGEALHQANGEMSKGSVQKEFQNTVNTTNAQMVILRNRTADVADNIGTVLLPTVNDAAKGIGRITTVVAKFAAEHPGLTKAVVGTAMALTSLRVATLAAQFGFTFLKGGVLQMSSLFARMNASAVLASTRGLPAVATGIRAVGAAFISTGVGAAVAGLALAGYQIYRHWAGVKAFFGGVGEGIKSGLEPLSDALSRLYERMGPLKPVIDGVGAAVKTVFNWFTSLTEPVKYSGEELDKAGKMGMTFGKTLAAGIELVTAPITFLIDKILWVSDNIGNLTNKALEFKNAVSDVAGGAWQKTKDFFTAPFRDDVQQPAATTASGAALPSPALANRSGGTTVHSNDQYHISVKAEPGMNEDALARKVIQQLRQQQAVRQRSMMIDGAQTP >CP034966|502880:547183|506788_507052_-|QAS88563.1|DBSCAN-SWA MNDTCDTMTPISIKYNDHCYIITVVVIFFKNAVIGVTGVISNVATLAGYGLRTMTPTMTPYKLMTPRRHFENKKSPGIRQGNGQQLL >CP034966|502880:547183|509581_509788_+|QAS88568.1|DBSCAN-SWA MLDRIHTKSEGMNPWPELVDAVLPKVLACNTAEEMQEQRDYRTECALQLAAAALRAAGEHTLAENLLN >CP034966|502880:547183|504971_506552_+|QAS92229.1|DBSCAN-SWA MGKQVAGDTTPPVYLGPRQLDDLDNLRIIDDGRRSARVYLAGDIKQLQISTIADKLAVAGVKEARLFKGITDREPEKWNMDRLRDAALRGESLVGMLRKEERATAAATVKDIHDMAPVLMTTWPNIGSNGQPLNTRPNVERLLDNYSISAKYNEISKDVEVLVPGVSGGSDARDNCAVSEILSLAALNRLPMSNIEGHIKTIAVRNTYNPVRDFINQREWDGRSRFADLLNTISTPDDYSRDLLAMLVRRWLLSAVAAAYLEAGFWSKGVLVFQGEQSLGKTAWFKALLPPDNRNLVKVGATIDPGNKDSVASAISHWLVELGELDATFRKADIAKLKAFISQDRDDLRRPYDRLESKYQRRTVFFASVNPKHFLADDTGNVRWWTIPVTAVNYEHGIDTQQLWAEVLSWFEAGERWWLDRDEEAMLEVVNEQHGQTDPIEEMILARFDWNSDRLAAYIEMTATDVLLSIGQDRPTKSQATQCGNILRKLTGRDARRTSKGRFYLMPPKVFGQQDHRYFDGPNHQF >CP034966|502880:547183|543464_544592_+|QAS88598.1|DBSCAN-SWA MQSVTPTSQYLKALNEGSHQPDDVQKEAVSRLEIIYQELINSTPPAPRTSGLMARVGKLWGKREDTKHTPVRGLYMWGGVGRGKTWLMDLFYQSLPGERKQRLHFHRFMMRVHEELTALQGQTDPLEIIADRFKAETDVLCFDEFFVSDITDAMLLGGLMKALFARGITLVATSNIPPDELYRNGLQRARFLPAIDAIKQHCDVMNVDAGVDYRLRTLTQAHLWLSPLNDETRAQMDKLWLALAGAKRENSPTLEINHRPLATMGVENQTLAVSFTTLCVDARSQHDYIALSRLFHTVMLFDVPVMTRLMESEARRFIALVDEFYERHVKLVVSAEVPLYEIYQGDRLKFEFQRCLSRLQEMQSEEYLKREHLAG >CP034966|502880:547183|519284_520259_-|QAS88577.1|DBSCAN-SWA MQALLLEQQDGKTLASVQTLDESRLPEGDVTVDVHWSSLNYKDALAITGKGKIIRNFPMIPGIDFAGTVRTSEDPRFHAGQEMLLTGWGVGENHWGGLAEQARVKGDWLVAMPQGLDARKAMIIGTAGFTAMLCVMALEDAGVRPQDGEIVVTGASGGVGSTAVALLHKLGYQVVAVSGRESTHEYLKSLGASRILPRDEFAESRPLEKQVWAGAIDTVGDKVLAKVLAQMNYGGCVAACGLAGGFTLPTTVMPFILRNVRLQGVDSVMTPPERRAQAWQRLVADLPESFYTQAAKEISLSEAPNFAEAIINNQIQGRTLVKVN >CP034966|502880:547183|537661_537925_-|QAS88592.1|DBSCAN-SWA MKIKTTVAALSVLSVLSFGAFAADSIDAAQAQNREAIGTVSVSGVASSPMDIREMLNKKAEEKGATAYQITEARSGDTWHATAELYK >CP034966|502880:547183|546685_547183_+|QAS88603.1|protease|DBSCAN-SWA MDLSQLTPRRPYLLRAFYEWLLDNQLTPHLVVDVTLPGVQVPMEYARDGQIVLNIAPRAVGNLELANDEVRFNARFGGIPRQVSVPLAAVLAIYARENGAGTMFEPEAAYDEDTSIMNDEEASADNETVMSVIDGDKPDHDDDTHPDDEPPQPPRGGRPALRVVK >CP034966|502880:547183|537333_537606_+|QAS88591.1|DBSCAN-SWA MNIYTFDFDEIESQEDFYRDFSQAFGLAKDKVRDLDSLWDVLMNDVLPLPLEIEFVHLGEKTRRRFGALILLFDEAEEELEGHLRFNVRH >CP034966|502880:547183|527484_531285_+|QAS88585.1|DBSCAN-SWA MRRLPGILLLTGAALVVIAALLVSGLRIALPHLDAWRPEILNKIESATGMPVEASQLSASWQNFGPTLEAHDIRAELKDGGEFSVKRVTLALDVWQSLLHMRWQFRDLTFWQLRFRTNTPITSGGGNDSLEASHISDLFLRQFDHFDLRDSEVSFLTPSGQRAELAIPQLTWLNDPRRHRAEGLVSLSSLTGQHGVMQVRMDLRDDEGLLSNGRVWLQADDIDLKPWLGKWMQDNIALETAQFSLEGWMTIDKGDVTGGDVWLKQGGASWLGEKQTHTLSVDNLTAHITRENPGWQFSIPDTRITMDGKPWPSGALTLAWIPEQDVGGKDNKRSDELRIRASNLELAGLEGVRPLAAKLSPALGDVWRSTQPSGKINTLALDIPLQAADKTRFQASWSDLAWKQWKLLPGAEHFSGTLSGSVENGLLTASMKQAKMPYETVFRAPLEIADGQATISWLNNDKGFQLDGRNIDVKAKAVHARGGFRYLQPANDEPWLGILAGISTDDGSQAWRYFPENLMGKDLVDYLSGAIQGGEADNATLVYGGNPQLFPYKHNEGQFEVLVPLRNAKFAFQPDWPALTNLDIELDFINDGLWMKTDGVNLGGVRASNLTAVIPDYSKEKLLIDADIKGPGKAVGPYFDETPLKDSLGATLQELQLDGDVNARLHLDIPLNGELVTAKGEVTLRNNSLFIKPLDSTLKNLSGKFSFINGDLQSEPLTASWFNQPLNVDFSTKEGAKAYQVAVNLNGNWQPAKTGVLPEAVNEALSGSVAWDGKVGIDLPYHAGATYNVELNGDLKNVSSHLPSPLAKPAGEPLAVNVKVDGNLNSFELTGQAGADNHFNSRWLLGQKLTLDRAIWAADSKTLPPLPEQSGVELNMPPMNGAEWLALFQKGAAESVGGAASFPQHITLRTPMLSLGNQQWNNLSIVSQPTANGTLVEAQGREINATLAMRNNAPWLANIKYLYYNPSVAKTRGDSTPSSPFPTTERINFRGWPDAQIRCTECWFWGQKFGRIDSDITISGDTLTLTNGLIDTGFSRLTADGEWVNNPGNERTSLKGKLRGQKIDAAAEFFGVTTPIRQSSFNVDYDLHWRKAPWQPDEATLNGIIHTQLGKGEITEINTGHAGQLLRLLSVDALMRKLRFDFRDTFGEGFYFDSIRSTAWIKDGVMHTDDTLVDGLEADIAMKGSVNLVRRDLNMEAVVAPEISATVGVAAAFAVNPIVGAAVFAASKVLGPLWSKVSILRYHISGPLDDPQINEVLRQPRKEKAQ >CP034966|502880:547183|540194_541262_-|QAS88595.1|protease|DBSCAN-SWA MFVKLLRSVAIGLIVGAILLVAMPSLRSLNPLSTPQFDSTDETPASYNLAVRRAAPAVVNVYNRGLNTNSHNQLEIRTLGSGVIMDQRGYIITNKHVINDADQIIVALQDGRVFEALLVGSDSLTDLAVLKINATGGLPTIPINARRVPHIGDVVLAIGNPYNLGQTITQGIISATGRIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINTLSFDKSNDGETPEGIGFAIPFQLATKIMDKLIRDGRVIRGYIGIGGREIAPLHAQGGGIDQLQGIVVNEVSPDGPAANAGIQVNDLIISVDNKPAISALETMDQVAEIRPGSVIPVVVMRDDKQLTLQVTIQEYPATN >CP034966|502880:547183|502880_504116_+|QAS88559.1|integrase|DBSCAN-SWA MAENKLSDVQLRALTRKVIEKPFDVADGGSLSVRVTPSRRKILDGEKRPANNIQWLFRYKNKHKSANTLTLVLGKYPALSLAEARVKRDQCKRWLAHNLDPKDQFDQESAKTMKPVTIREALERWINEYAAENRVNFERHRRQFERHIYPFIGELPLEQCSKARWMETFSRIKKVAPVASGYILGNCKQALIHNRRLHDVYSDALEDIKVTDVGRKQAKRDRVLTSEELSGLWNALKSEYAFLPYYTALLKLLIVFGARTQEIRLSTWSEWNVKEWIWTVPREHSKGGEKIIRPVPESIQPFILQLRQNHLDSGMLLGEMKKPEAVSQWGRILYRRLEHSTPWTLHDLRRTFATSLNNLGIAPHVVEQLLGHSMPGVMAVYNRSQYIPEKRLALDKWIKYLEKINLLKSDN >CP034966|502880:547183|512497_513463_-|QAS88571.1|tRNA|DBSCAN-SWA MRIGQYQLRNRLIAAPMAGITDRPFRTLCYEMGAGLTVSEMMSSNPQVWESDKSRLRMVHIDEPGIRTVQIAGSDPKEMADAARINVESGAQIIDINMGCPAKKVNRKLAGSALLQYPDVVKSILTEVVNAVDVPVTLKIRTGWAPEHRNCEEIAQLAEDCGIQALTIHGRTRACLFNGEAEYDSIRAVKQKVSIPVIANGDITDPLKARAVLDYTGADALMIGRAAQGRPWIFREIQHYLDTGELLPPLPLAEVKRLLCAHVRELHDFYGPAKGYRIARKHVSWYLQEHAPNDQFRRTFNAIEDASEQLEALEAYFENFA >CP034966|502880:547183|522655_523699_+|QAS88580.1|DBSCAN-SWA MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAKQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE >CP034966|502880:547183|509362_509572_+|QAS88567.1|DBSCAN-SWA MATDNNIYDAVLNIIGVADKMTNNNASQLTDAQKVALKNLSNAMKSWKGITFGMRDITASDKPIESKTA >CP034966|502880:547183|504598_504880_+|QAS88561.1|DBSCAN-SWA MKKTTVNQAISRYNSLLRNPQRQLTVGELTAQRMAAAQLLLQACIREGVNRPWTIVSRHAAMADSLVPFRISDSESWAMYLELKRGVRYEKRA >CP034966|502880:547183|545254_545647_+|QAS88600.1|DBSCAN-SWA MAENQYYGTGRRKSSAARVFIKPGNGKIVINQRSLEQYFGRETARMVVRQPLELVDMVEKLDLYITVKGGGISGQAGAIRHGITRALMEYDESLRSELRKAGFVTRDARQVERKKVGLRKARRRPQFSKR >CP034966|502880:547183|541351_542719_-|QAS88596.1|protease|DBSCAN-SWA MKKQTQLLSALALSVGLTLSASFQAVASIPGQVADQAPLPSLAPMLEKVLPAVVSVRVEGTASQGQKIPEEFKKFFGDDLPDQPAQPFEGLGSGVIINASKGYVLTNNHVINQAQKISIQLNDGREFDAKLIGSDDQSDIALLQIQNPSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIVSALGRSGLNLEGLENFIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSVGIGFAIPSNMARTLAQQLIDFGEIKRGLLGIKGTEMSADIAKAFNLDVQRGAFVSEVLPGSGSAKAGVKAGDIITSLNGKPLNSFAELRSRIATTEPGTKVKLGLLRNGKPLEVEVTLDTSTSSSASAEMITPALEGATLSDGQLKDGGKGIKIDEVVKGSPAAQAGLQKDDVIIGVNRDRVNSIAEMRKVLAAKPAIIALQIVRGNESIYLLMR >CP034966|502880:547183|546041_546680_+|QAS88602.1|DBSCAN-SWA MAVAANKRSVMTLFSGPTDIYSHQVRIVLAEKGVSFEIEHVEKDNPPQDLIDLNPNQSVPTLVDRELTLWESRIIMEYLDERFPHPPLMPVYPVARGESRLYMHRIEKDWYTLMNTIINGSASEADAARKQLREELLAIAPVFGQKPYFLSDEFSLVDCYLAPLLWRLPQLGIEFSGPGAKELKGYMTRVFERDSFLASLTEAEREMRLGRS >CP034966|502880:547183|514684_516136_-|QAS88573.1|DBSCAN-SWA MQLEVILPLVAYLVVVFGISVYAMRKRSTGTFLNEYFLGSRSMGGIVLAMTLTATYISASSFIGGPGAAYKYGLGWVLLAMIQLPAVWLSLGILGKKFAILARRYNAVTLNDMLFARYQSRLLVWLASLSLLVAFVGAMTVQFIGGARLLETAAGIPYETGLLIFGISIALYTAFGGFRASVLNDTMQGLVMLIGTVVLLIGVVHAAGGLSNAVQTLQTIDPQLVTPQGADDILSPAFMTSFWVLVCFGVIGLPHTAVRCISYKDSKAVHRGIIIGTIVVAILMFGMHLAGALGRAVIPDLTVPDLVIPTLMVKVLPPFAAGIFLAAPMAAIMSTINAQLLQSSATIIKDLYLNIRPDQMQNETRLKRMSAVITLVLGALLLLAAWKPPEMIIWLNLLAFGGLEAVFLWPLVLGLYWERANAKGALSAMIVGGVLYAVLATLNIQYLGFHPIVPSLLLSLVAFLVGNRFGTSVPQATVLTTDK >CP034966|502880:547183|525364_525958_+|QAS88583.1|DBSCAN-SWA MTSLYLASGSPRRQELLAQLGVTFERIVTGIEEQRQPQESAQQYVVRLAREKAQAGVAQTAQDLPVLGADTIVILNGEVLEKPRDAEHAAQMLRKLSGQTHQVMTAVALADSQHILDCLVVTDVTFRTLTDEDIAGYVASGEPLDKAGAYGIQGLGGCFVRKINGSYHAVVGLPLVETYELLSNFNALREKRDKHDG >CP034966|502880:547183|522359_522548_-|QAS88579.1|DBSCAN-SWA MQFVSFVTNLCLSPERLNASAAFLACLQPRKVIHLHMCCNNLASRQAHSHETQRALFYVKTD >CP034966|502880:547183|525947_527417_+|QAS88584.1|DBSCAN-SWA MTAELLVNVTPSETRVAYIDGGILQEIHIEREARRGIVGNIYKGRVSRVLPGMQAAFVDIGLDKAAFLHASDIMPHTECVAGEEQKQFTVRDISELVRQGQDLMVQVVKDPLGTKGARLTTDITLPSRYLVFMPGASHVGVSQRIESESERERLKKVVAEYCDEQGGFIIRTAAEGVGEAELASDAAYLKRVWTKVMERKKRPQTRYQLYGELALAQRVLRDFADAELDRIRVDSRLTYEALLEFTSEYIPEMTSKLEHYTGRQPIFDLFDVENEIQRALERKVELKSGGYLIIDQTEAMTTVDINTGAFVGHRNLDDTIFNTNIEATQAIARQLRLRNLGGIIIIDFIDMNNEDHRRRVLHSLEQALSKDRVKTSVNGFSALGLVEMTRKRTRESIEHVLCNECPTCHGRGTVKTVETVCYEIMREIVRVHHAYDSDRFLVYASPAVAEALKGEESHSLAEVEIFVGKQVKVQIEPLYNQEQFDVVMM >CP034966|502880:547183|542872_543271_-|QAS88597.1|DBSCAN-SWA MTWEYALIGLVVGIIIGAVAMRFGNRKLRQQQALQYELEKNKAELDEYREELVSHFARSAELLDTMAHDYRQLYQHMAKSSSSLLPELSAEANPFRNRLAESEASNDQAPVQMPRDYSEGASGLLRTGAKRD >CP034966|502880:547183|509042_509360_+|QAS88566.1|DBSCAN-SWA MKNHDLYESMKEEARLILSETKRFQIISEKNPISYEAADLLASLRNRLNAFEHMTDEAGRIMRDEAENNASTLERKKWKPSSRLSVGERAKRQAVKRGAILVRSA >CP034966|502880:547183|507519_508404_+|QAS88564.1|DBSCAN-SWA MTVLSQLINDALNHQGGHIRTDGNGDITQFIPVEPAAVRYLSMNDVLTPKGGKQTTFNAVLAERSLVAQAGAEIIRIPGPMTTPPTGNTGATVGREVADRFVVVRPGAFAKVSDGEEITMSGLPYLVSAFNHNNAPAYGVGYTLSRKQLKHDFADDTTLVVVNTAIERGIADLADFVLLNHLESAAETLTSPSFTAVAQKIAAKNLRFDEVKAIVGGECTGLELQDGILRAYGVRAEISGQTSSTIIGAFGNAAVALDDEIRVTARRVLNGAVEIVVWVNASALVPDSTVFWQA >CP034966|502880:547183|535274_537242_+|QAS88590.1|DBSCAN-SWA MGIFSIANQHIRFAVKLATAIVLALFVGFHFQLETPRWAVLTAAIVAAGPAFAAGGEPYSGAIRYRGFLRIIGTFIGCIAGLVIIIAMIRAPLLMILVCCIWAGFCTWISSLVRIENSYAWGLAGYTALIIVITIQPEPLLTPQFAVERCSEIVIGIVCAIMADLLFSPRSIKQEVDRELESLLVAQYQLMQLCIKHGDGEVVDKAWGDLVRRTTALQGMRSNLNMESSRWARANRRLKAINTLSLTLITQSCETYLIQNTRPELITDTFREFFDTPVETAQDVHKQLKRLRRVIAWTGERETPVTIYSWVAAATRYQLLKRGVISNTKINATEEEILQGEPEVKVESAERHHAMVNFWRTTLSCILGTLFWLWTGWTSGSGAMVMIAVVTSLAMRLPNPRMVAIDFIYGTLAALPLGLLYFLVIIPNTQQSMLLLCISLAVLGFFLGIEVQKRRLGSMGALASTINIIVLDNPMTFHFSQFLDSALGQIVGCVLAFTVILLVRDKSRDRTGRVLLNQFVSAAVSAMTTNVARRKENHLPALYQQLFLLMNKFPGDLPKFRLALTMIIAHQRLRDAPIPVNEDLSAFHRQMRRTADHVISARSDDKRRRYFDQLLEELEIYQEKLRIWQAPPQVTEPVHRLAGMLHKYQHALTDS >CP034966|502880:547183|504410_504590_+|QAS88560.1|DBSCAN-SWA MELLTAKQVTTLTTLSRMTIWRYIQAGTFPNCIKLGPKRVAWRRKDVMAWLESRQPVQR >CP034966|502880:547183|544810_545239_+|QAS88599.1|DBSCAN-SWA MKTFTAKPETVKRDWYVVDATGKTLGRLATELARRLRGKHKAEYTPHVDTGDYIIVLNADKVAVTGNKRTDKVYYHHTGHIGGIKQATFEEMIARRPERVIEIAVKGMLPKGPLGRAMFRKLKVYAGNEHNHAAQQPQVLDI >CP034966|502880:547183|516125_516368_-|QAS88574.1|DBSCAN-SWA MDTRFVQAHKEARWALGLTLLYLAVWLVAAYLPGVAPGFTGFPRWFEIACILTPLLFIGLCWAMVKFIYRDIPLEDDDAA >CP034966|502880:547183|516476_517826_-|QAS88575.1|DBSCAN-SWA MLDKIVIANRGEIALRILRACKELGIKTVAVHSSADRDLKHVLLADETVCIGPAPSVKSYLNIPAIISAAEITGAVAIHPGYGFLSENANFAEQVERSGFIFIGPKAETIRLMGDKVSAIAAMKKAGVPCVPGSDGPLGDDMDKNRAIAKRIGYPVIIKASGGGGGRGMRVVRGDAELAQSISMTRAEAKAAFSNDMVYMEKYLENPRHVEIQVLADGQGNAIYLAERDCSMQRRHQKVVEEAPAPGITPELRRYIGERCAKACVDIGYRGAGTFEFLFENGEFYFIEMNTRIQVEHPVTEMITGVDLIKEQLRIAAGQPLSIKQEEVHVRGHAVECRINAEDPNTFLPSPGKITRFHAPGGFGVRWESHIYAGYTVPPYYDSMIGKLICYGENRDVAIARMKNALQELIIDGIKTNVDLQIRIMNDENFQHGGTNIHYLEKKLGLQEK >CP034966|502880:547183|512175_512472_-|QAS88570.1|DBSCAN-SWA MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQPLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN >CP034966|502880:547183|534336_535269_+|QAS88589.1|DBSCAN-SWA MKTLIRKFSRTAITVVLVILAFIAIFNAWVYYTESPWTRDARFSADVVAIAPDVSGLITQVNVHDNQLVKKGQILFTIDQPRYQKALEEAQADVAYYQVLAQEKRQEAGRRNRLGVQAMSREEIDQANNVLQTVLHQLAKAQATRDLAKLDLERTVIRAPADGWVTNLNVYTGEFITRGSTAVALVKQNSFYVLAYMEETKLEGVRPGYRAEITPLGSNKVLKGTVDSVAAGVTNASSTRDDKGMATIDSNLEWVRLAQRVPVRIRLDNQQENIWPAGTTATVVVTGKQDRDESQDSFFRKMAHRLREFG >CP034966|502880:547183|545564_545786_-|QAS88601.1|DBSCAN-SWA MVFVLPVCQTVAKSNFPIFTLIQNLTIKKPAEAGFFANCFLPEQKPINVWRTADDDVLYADRLSYVQPDERHE >CP034966|502880:547183|524867_525356_+|QAS88582.1|DBSCAN-SWA MASYRSQGRWVIWLSFLIALLLQIMPWPDNLIVFRPNWVLLILLYWILALPHRVNVGTGFVMGAILDLISGSTLGVRVLAMSIIAYLVALKYQLFRNLALWQQALVVMLLSLVVDIIVFWAEFLVINVSFRPEVFWSSVVNGVLWPWIFLLMRKVRQQFAVQ >CP034966|502880:547183|513791_514673_-|QAS88572.1|DBSCAN-SWA MPWIQLKLNTTGANAEDLSDALMEAGAVSITFQDTHDTPVFEPLPGETRLWGDTDVIGLFDAETDMNDVVAILENHPLLGAGFAHKIEQLEDKDWEREWMDNFHPMRFGERLWICPSWRDVPDENAVNVMLDPGLAFGTGTHPTTSLCLQWLDSLDLTGKTVIDFGCGSGILAIAALKLGAAKAIGIDIDPQAIQASRDNAERNGVSDRLELYLPKDQPEEMKADVVVANILAGPLRELAPLISVLPVSGGLLGLSGILASQAESVCEAYVDSFALDPVVEKEEWCRITGRKN >CP034966|502880:547183|534125_534329_+|QAS88588.1|DBSCAN-SWA MSLFPVIVVFGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLISRLFV >CP034966|502880:547183|508406_508721_+|QAS88565.1|DBSCAN-SWA MWWGSKKVAAERAALPGEILAAIEGVAMYTRHGDENNPRIVVQPVGWSGFLYSDEAAEKWIRRAYPELTQYQIERAVNYLASLVRSHHRDSRSEAQRERWMTRF >CP034966|502880:547183|506561_506786_+|QAS88562.1|DBSCAN-SWA MSYRAYIIAFDPEHGTYTETEIMEGFATEQEAVDHARNRLPEVQQELAKLGENLLCSYRIRVVDSAEILPFIRS >CP034966|502880:547183|538289_538760_-|QAS88593.1|DBSCAN-SWA MRSSAKQEELVKAFKALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELGVPTTSSPLKNLVLDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEGILGTIAGDDTIFTTPANGFTVKDLYEAILELFDQEL >CP034966|502880:547183|523764_524868_+|QAS88581.1|DBSCAN-SWA MKPIFSRGPSLQIRLILAVLVALGIIIADSRLGTFSQIRTYMDTAVSPFYFVSNAPRELLDGVSQTLASRDQLELENRALRQELLLKNSELLMLGQYKQENARLRELLGSPLRQDEQKMVTQVISTVNDPYSDQVVIDKGSVNGVYEGQPVISDKGVVGQVVAVAKLTSRVLLICDATHALPIQVLRNDIRVIAAGNGCTDDLQLEHLPANTDIRVGDVLVTSGLGGRFPEGYPVAVVSSVKLDTQRAYTVIQARPTAGLQRLRYLLLLWGADRNGANPMTPEEVHRVANERLMQMMPQVLPSPDAMGPKLPEPATGIAQPTPQQPTTGNAATAPAAPTQPAANRSPQRATPPQSGAQPPARAPGGQ >CP034966|502880:547183|533013_533943_-|QAS88587.1|DBSCAN-SWA MERLKRMSVFAKVVEFGSFTAAARQLQMSVSSISQTVSKLEDELQVKLLNRSTRSIGLTEAGRIYYQGCRRMLHEVQDVHEQLYAFNNTPIGTLRIGCSSTMAQNVLAGLTAKMLKEYPGLSVNLVTGIPAPDLIADGLDVVIRVGALQDSSLFSRRLGAMPMVVCAAKSYLTQYGIPEKPADLSSHSWLEYSVRPDNEFELIAPEGISTRLIPQGRFVTNDPMTLVRWLTAGAGIAYVPLMWVINEINRGELEILLPRYQSDPRPVYALYTEKDKLPLKVQVVINSLTDYFVEVGKLFQEMHGRGKEK |
46 | Escherichia_phage(28.57%) | protease,tRNA,tail,integrase | attL 502697:502713|attR 512176:512192 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1068654 : 1081837
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP034966|1068654:1081837|DBSCAN-SWA TATGCGCATATTGCTGAGTAATGATGACGGGGTACATGCACCCGGTATACAAACGCTGGCGAAAGCCTTGCGTGAGTTTGCTGACGTTCAGGTGGTCGCCCCCGATCGTAACCGCAGCGGCGCTTCAAATTCTCTGACACTGGAATCCTCCCTGCGCACGTTTACCTTTGAAAATGGTGATATTGCTGTGCAAATGGGAACCCCGACCGATTGCGTCTATCTTGGCGTGAATGCTCTGATGCGTCCGCGCCCGGACATTGTTGTGTCCGGAATTAACGCCGGGCCGAATCTGGGGGATGATGTTATTTATTCCGGTACGGTAGCCGCCGCGATGGAAGGCCGTCATTTAGGTTTTCCGGCGCTTGCCGTCTCGCTTGACGGGCATAAACATTACGACACTGCCGCGGCGGTAATCTGTTCAATTTTGCGCGCACTGTGTAAAGAGCCGCTGCGCACCGGGCGTATTCTTAATATTAACGTTCCGGATTTACCCTTGGATCAAATCAAAGGTATTCGCGTGACGCGCTGCGGTACACGACATCCGGCAGATCAGGTGATCCCGCAGCAAGATCCGCGCGGCAATACGCTGTACTGGATTGGCCCGCCGGGCGGTAAATGTGATGCTGGTCCGGGGACCGATTTTGCTGCGGTAGATGAGGGCTATGTCTCCATCACGCCGCTGCATGTGGATTTAACTGCGCATAGCGCGCAAGATGTGGTTTCAGACTGGTTAAACAGCGTGGGAGTTGGCACGCAATGGTAAGCAGACGCGTACAAGCACTTCTGGATCAATTACGTGCGCAAGGTATTCAGGATGAGCAGGTGCTGAATGCACTTGCCGCCGTGCCGCGTGAAAAATTCGTTGATGAAGCGTTTGAACAAAAAGCCTGGGACAATATCGCTTTGCCGATAGGTCAGGGGCAGACAATTTCGCAGCCATATATGGTGGCGCGAATGACCGAATTACTCGAGCTGACGCCGCAGTCGCGGGTGCTGGAAATTGGCACCGGTTCGGGATATCAAACGGCAATCCTGGCGCATCTTGTCCAGCATGTTTGCTCGGTTGAACGGATTAAAGGCTTGCAGTGGCAGGCACGTCGCCGCCTGAAAAATCTTGATTTACATAATGTTTCAACCCGTCATGGCGATGGATGGCAAGGTTGGCAGGCACGTGCGCCGTTTGACGCTATCATTGTTACGGCGGCACCGCCGGAAATTCCAACTGCGCTAATGACGCAGCTGGACGAAGGCGGGATTCTCGTCTTACCCGTAGGGGAGGAGCACCAGTATTTGAAACGGGTGCGTCGTCGGGGAGGCGAATTTATTATCGATACCGTGGAGGCCGTGCGCTTTGTCCCTTTAGTGAAGGGTGAGCTGGCTTAAAACGTGAGGAAATACCTGGATTTTTCCTGGTTATTTTGCCGCAGGTCAGCGTATCGTGAACATCTTTTCCAGTGTTCAGTAGGGTGCCTTGCACGGTAATTATGTCACTGGTTATTAACCAATTTTTCCTGGGGGATAAATGAGCGCGGGAAGCCCAAAATTCACCGTTCGCCGCATTGCGGCTTTGTCACTGGTTTCGCTATGGCTGGCAGGCTGTTCTGACACTTCAAATCCACCGGCACCGGTCAGCTCCGTTAATGGCAATGCGCCTGCAAATACTAATTCTGGTATGTTGATTACGCCGCCGCCGAAAATGGGGACGACGTCTACAGCGCAGCAACCGCAAATTCAGCCGGTGCAGCAGCCACAAATTCAGGCTACTCAACAACCGCAAATCCAGCCAGTGCAGCCAGTAGCTCAGCAGCCGGTACAGATGGAAAACGGACGCATCGTCTATAACCGTCAGTATGGGAACATTCCGAAAGGCAGTTATAGCGGCAGTACCTATACCGTGAAAAAAGGCGACACACTTTTCTATATCGCCTGGATTACTGGCAACGATTTCCGTGACCTTGCTCAGCGCAACAATATTCAGGCACCATACGCGCTGAACGTTGGTCAGACCTTGCAGGTGGGTAATGCTTCCGGTACGCCAATCACTGGCGGAAATGCCATTACCCAGGCCGACGCAGCAGAGCAAGGAGTTGTGATCAAGCCTGCACAAAATTCCACCGTTGCTGTTGCGTCGCAACCGACAATTACGTATTCTGAGTCTTCGGGTGAACAGAGTGCTAACAAAATGTTGCCGAACAACAAGCCAACTGCGACCACGGTCACAGCGCCTGTAACGGTACCAACAGCAAGCACAACCGAGCCGACTGTCAGCAGTACATCAACCAGTACGCCTATCTCCACCTGGCGCTGGCCGACTGAGGGCAAAGTGATCGAAACCTTTGGCGCTTCTGAGGGGGGCAACAAGGGGATTGATATCGCAGGCAGCAAAGGACAGGCAATTATCGCGACCGCAGATGGCCGCGTTGTTTATGCTGGTAACGCGCTGCGCGGCTACGGTAATCTGATTATCATCAAACATAATGATGATTACCTGAGTGCCTACGCCCATAACGACACAATGCTGGTCCGGGAACAACAAGAAGTTAAGGCGGGGCAAAAAATAGCGACCATGGGTAGCACCGGAACCAGTTCAACACGCTTGCATTTTGAAATTCGTTACAAGGGGAAATCCGTAAACCCGCTGCGTTATTTGCCGCAGCGATAAATCGACGGAACCAGGCTTTTGCTTGAATGTTCCGTCAAGGGATCACGGGTAGGAGCCACCTTATGAGTCAGAATACGCTGAAAGTTCATGATTTAAATGAAGATGCGGAATTTGATGAGAACGGAGTTGAGGTTTTTGACGAAAAGGCCTTAGTAGAAGAGGAACCCAGTGATAACGATTTGGCCGAAGAGGAACTGTTATCGCAGGGAGCCACACAGCGTGTGTTGGACGCGACTCAGCTTTACCTTGGTGAGATTGGTTATTCACCACTGTTAACGGCCGAAGAAGAAGTTTATTTTGCGCGTCGCGCACTGCGTGGAGATGTCGCCTCTCGCCGCCGGATGATCGAGAGTAACTTGCGTCTGGTGGTAAAAATTGCCCGCCGTTATGGCAATCGTGGTCTGGCGTTGCTGGACCTTATCGAAGAGGGCAACCTGGGGCTGATCCGCGCGGTAGAGAAGTTTGACCCGGAACGTGGTTTCCGCTTCTCAACATACGCAACCTGGTGGATTCGCCAGACGATTGAACGGGCGATTATGAACCAAACCCGTACTATTCGTTTGCCGATTCACATCGTAAAGGAGCTGAACGTTTACCTGCGAACCGCACGTGAGTTGTCCCATAAGCTGGACCATGAACCAAGTGCGGAAGAGATCGCAGAGCAACTGGATAAGCCAGTTGATGACGTCAGCCGTATGCTTCGTCTTAACGAGCGCATTACCTCGGTAGACACCCCGCTGGGTGGTGATTCCGAAAAAGCGTTGCTGGACATCCTGGCCGATGAAAAAGAGAACGGTCCGGAAGATACCACGCAAGATGACGATATGAAGCAGAGCATCGTCAAATGGCTGTTCGAGCTGAACGCCAAACAGCGTGAAGTGCTGGCACGTCGATTCGGTTTGCTGGGGTACGAAGCGGCAACACTGGAAGATGTAGGTCGTGAAATTGGCCTCACCCGTGAACGTGTTCGCCAGATTCAGGTTGAAGGCCTGCGCCGTTTGCGTGAAATCCTGCAAACGCAGGGGCTGAATATCGAAGCGCTGTTCCGCGAGTAAGTAAGCATCTGTCAGAAAGGCCAGTCTCAAGCGAGGCTGGCCTTTTCTGTGCACAATAAAAGGTCCGATGCCCATCGGACCTTTTTATTAAGGTCAAATTACCGCCCATACGCACCAGGTAATTAAGAATCCGGTAAAACCGAGAATGGTCGTTAACACTGTCCAGGTTTTCAGACCGTCTGCTACCGACAACCCCAGATATTTGGTCACAATCCAGAACCCTGAGTCATTAATATGTGACGCGCCAAGCCCACCAAAGCAGGCTGCCAGCGTCACCAATACGCACTGAATCGGATTCAATCCCATCACCGCTTCTGAGAGTAACCCGCCGGTTGTCAGGATTGCTACGGTTGCTGACCCCTGCGATGCACGCAGAGCCAGTGAAATAATAAATGCGGCTGGTAACAGAGGCAGGTCAATCATTTGTAGCATATTGGCAAGGGCTTTGCCGACGCCCGATTCCACCAGCACTTTGCCAAATACCCCTCCAGCACCAGTAACCAAAATCACTACCGCCGCAGTGGGAAGGGCTGAGCCCATAATGTCGCTGGTGTGTTGTAAGCTCCAGCCGCGACGTAAAGCCAATAACCAGAATGCCAGCACCAGCGCAATCATTAGAGCTACCATTGGTGAGCCGATCAGCTGTAGCGTACCAAGCAGGGGATGCGAAGGCGGCATCAGTGTTGCGGAAACCGTACCCGCCATGATAATCGCGATAGGAATAACAATTAGCGAGGTGACCAGCGCGACGCCCGGTGGATTTATTTTATCGCTTAATTTTGTCGCGCCTTCCTCACTGGCCGGAGCCAGTTGCATCTGTTCCAGTACTTCCACCGACATCGCATATTGGTGCTTATTGATTATTTTCGCTGCAAAGTAGCCAACAATCCCTACGGGAATAGAAATCGCAATACCGATGATGGTTAGCCAGCCGATGTCTGCGTGGAGTAACCCCGCTGCGGCGACAGGGCCTGGATGCGGCGGTACCGCCACATGTACAGTTAGCATGATCCCAGCGACAGGCAGGCCAAATTTGAGTGGCGATATTTTGGCAACCTTGGCAAAACCGTAAATGATTGGCGCAAGAATAATAAAGCCGACATCAAAGAAGACGGGAATACCGAGGAAGAACGCTGCCAGAGTCAGCGCAGCGATAGTTCGTTTGTCACCTAACTTGCGACTGAAATAATTAGCCAGTGACTCTGCACCACCAGAGTGTTCGATCATACGCCCCAGCATAGCGCCCAGACCAATAATAATAGTGACGGAACCAAGCACACCGCCCATCCCGGCGATCATCACTTTACCCACTTCGCCCGCCGGTATACCCGCCGCAAGTGCGACTAACAGGCTGACGAGGAGCAGAGCAACGAATGGTTGTACCTTTGCCTTGATGACCAGCAGCAACAGCATGATTACGCCAGTTAACGCAATGCATAACAATGTAATTGTGGACATGGGAAACCCTGTCTGAAAGTTATAGTTAACCTACCCCATCCGTAGATGGGGGGATGTATGGGTACGTTGTAATTAGGGATTTAACGAATTAGCGCCAGGCGTCAAACCAGCCAAGCCCTTCTTCGGTGAGGCCACGAGGTTTATATTCACAACCGATCCAGCCCTGATATCCCACCTCATCGAACAGGCGGAACAGCCACGGATAGTTGATTTCTCCATCGTCCGGTTCATGTCGATCAGGTAGTCCGGCAATTTGTACGTGCGCATATTTCCCGGCGTAGTCGCGGATTAAATGCGTCAGGTTGCCATCTACTTTTTGCGCATGAAAAGTATCTAGTTGAATAAACACGTTATCTCGCGCAACCTCTTCAACAATAGCCAGTGCCTGATACTGGCTGGAGAAGAGATAATGAGGCTTAACGTCGGGGCTGAGTGCTTCAACTAATATTCGCTTGCCGTGTAGCGCAAAGCGGTCGGCAGCGTAGCGGAGATTATCGATAAATACTGCCCGGTACCGTTCAGCGTCTTCGCCAGCGGGCACGACGCCTGCCATCACATGGACTTGTTCACAATTGAGCGCCAATGCATATTCCAGTGCCAGGTCGATGTCTGCGTGTGCTTCGTGCTCACGTCCGGGAAGGGCGGATAATCCCCATTCCCCCGCATTAATATCTCCGGGAGCGGTATTGAACAGCGCCAGTGTCAGATGGTTTTGCTCCAGTTGCTTTTGGATTTGCAGGGTGGAGTAGTCATAGGGAAACAGAAATTCCACAGCATCGAACCCGGCTTTTCGCGCTGCGGCGAAGCGTTCAATAAAAGGCACTTCGGTGAACATCATGGATAAATTAGCTGCAAAACGAGGCATTGCATTAACTCCTTAATTCCGCAATTTCACCTGCGGTCAGATAACGGATCGGGCGGTCACCGAGAATAAAAATCAGCTTTGCCGTTTCCTCCAGCTCTTCCATATTGTTGGCGGCTTCTTGCAGGCTTTCACCGCAAACCACTGGGCCATGATTTGCCAGTAAAAAAGCCTGATTGTCTGCTGCCAGTTCCGCCAGATCCTGTGCGATGCGTTTATCGCCCGGTCGGTAATAAGGCACCAGCGGGACATTTCCCATCCGCATCACCACGTATGGTGTGAACGGACGAATAACGTTGCTGCTGTCCAGCCCTTGCAGGCAGGAAAGCGCCGTCGACCATGTGCTGTGCAAATGCACCACCGCTTTACAGCGCGGATTGTTGCGATACAGCGCCAGATGAAAGAGCACCTCTTTCGAGGGTTTGTCACCACTTAACCATTCGCCATCCGCGGCGACTTTGGAAAGCCGCTGCGGATCGAGATTGCCCAGGCATGAACCTGTCGGTGTCGCCAGTAAATTCCCGTCAGGTAAAAGCAGCGACAGATTGCCAGCCGAACCGGTTGCATAGCCGCGCTGAAAGAATGAACTGGCAATCCGCGTCATCTCCTCTCGCAAAGACTGCTCTACTTTTGCGAAATCGCTCATGATAAAAACTCTCTTTGGGCTCGTGAAAAAAAGGCTTCATCACCGAAGTTGCCAGATTTAAGGGCGAGTGAGACAGGCTTATCCAGTGCGTTTACCCACGGCACGCCGGGGGAAATGGTTGGGCCAATATGAAACCCTTTTATGCCCAGGCTCTGTGTGACTACGCCGGAGGTCTCACCGCCTGCGACAATAAAGCGTGTCACGCCTTCCGCTGCTAACCGCGCCGCTAGTTGAGAAAACAGAGTTTCTACTGCCTGACTGGCTTTTTGTGCACCGTATTGCTGTTGAATTGCTGCCAATGCGTCAGTGCTGGCGGTGGCAAAAACCAGTGGAGCAAGTACACTTTCCTGGCCCAGAACCCACTCTGCCAGTTCGTGTGCATAAGCGGCCAGAGTTTCGGTTGAGAGGCAGCGTGCCACATCAACTTCACGGGCTGGTGCAATTTGACGGTAATGTGCCACCTGGCGGTTGGTCATTTGAGAGCATGAACCGGAGAGCACTACGCCGCGCCCAGCGAGCGGATGCCCTGCTTCGCGAGCCTGGTTACCGTTTTCTTGCGCCCACTGCCGGGCCAGGCCAATCGCCAGACCAGAACCGCCCGTTACCAGTGGGGCATCGCGCAAGGCTTCTCCCTGAATTTCCAGATGGTGTTCGGTCAGCGCATCAAGCACCGCGTAGCGGTAGCCCTCTTGCTGTAAGCGAGCCAGCTCTTGACGAACGGCATCCACACCTTGTTCGAAAACATGTGCCGAAACGACGCCGCAGCGCCCTGTGGATTGCGCTTCAACCAGACGGGGAAGATAGCTGTCGGTCATGGGATTTACCGGGTGATGGCGCATCCCGGATTCGGCCAGCAGTTGATTCATTACGAACAAATACCCCTGATAAACCGTACGTCCGTTGACCGGCAGGGCCGGAGAGAAGACCGTAAACGGCGTGTCGAGAGCATCCATTAATGCATCGGTAACCGGGCCGATATTACCTTTCGCCGTACTGTCGAAAGTAGAGCAGTATTTGAAATAGATCTGTTTGCAACCTTGCTGTTGCAACCAGCTCAGAGCCGCCAGCGATTGCTGTGTGGCTTCAACCACCGGACAGGAGCGCGTTTTCAGGCTGATCACCAGTGCGTCGATTGCTTCCGGCATTTTACCTGTTGGAACACCGTTAATTTGTACCGTTGGTAGACCGTTTTCCACCAGAAAACTGGCGATATCCGTCGCGCCGGTAAAATCATCGGCGATAACGCCAATCTTGATCATGATTTCGCTCCCGGTAGAGTGATGCCAGAGAAAATCTTGATAACTGCGCTATCGTCTTCTTTCCCGTAACCTGCGTTACTGGCGCTGGTGAACATATTCAATGCTGTTGAGGCCAATGGCAGCGGGAAGTGCAGGGCTTTGGCTGTATCGGCAACCAGACCAAGATCCTTAACAAAAATATCGACGGCTGAATGCGGGGTGTAATCGCCATCCACCACATGACGCATCCGGTTTTCGAACATCCAGGAATTTCCGGCGGCATTGGTCACGACGTCATACATCACATCCAGCGGGATCCCCGCACGGGCTGCAAGTGCCATCGCTTCGGCTCCGGCAGCAATATGTACGCCCGCTAACAACTGGTGAATAATTTTTACGGTCGAACCTAGTCCCGGTTCTGCACCTATGCGATAAACTTTTCCGGCAACGGCTTCCAGCACGGGTGCCAGTCGTTCAAAGGCAATATCGCTACCGGAGGCCATGACAGTCATTTCACCGTTAGCGGCTTTTACTGCACCACCAGAAACTGGCGCATCCAGCATTTCCAGATCGAATCCAGCCAGAGCGGTAGCAATTTCTTGCGCATCAGCACTAGCGATAGTGGAAGAAACCATTACTGCCGTACCGGGTTTCAGATGTTGTGCAACGCCTGTTTCACCAAACAGCACCTGTTTAACCTGGGCCGCATTGACCACCAGCACCAGCAGAGCGTCCAGTTTTTCGGCAAACGTCGCGGCGTTATCAGAAACCCCGCAAGCACCTGCCTCTTTCAACGTAGCGCAGGCATTGCTGTTCAGGTCTGCGCCCCAGGTAGAAAGACCTGCGCGGACACATGACAGTGCTGCTCCCATTCCCATTGACCCTAAGCCAACGATACCGACATGAAACTCAGATCCCGTTTTCATCTGCTCTCCTTGTTAATTTAAGTGATATTTTGTTTGATATTGTGAATATAAGCGCTGGAAGATAACGATATGGTGAGCTGATTCACATAAATTAACATTGTGTGTTATTTTATGTGAACTAAGCGTTAGTTGCGCCGCGCACGTTTCGCAGGCAAATAGCGTAGAATGTCAGCAGGACAAAGGAAGGAGCAAAAGTTGATACCCGTAGAGCGTCGCCAAATCATCCTTGAGATGGTAGCTGAAAAAGGCATTGTCAGTATTGCTGAACTAACGGACAGAATGAATGTGTCACATATGACCATTCGTCGGGATTTACAAAAACTGGAGCAGCAGGGAGCCGTTGTGCTGGTGTCCGGAGGCGTCCAGTCTCCGGGACGCGTGGCGCATGAACCTTCTCATCAGGTAAAAACTGCGCTGGCAATGACGCAAAAAGCGGCTATTGGCAAGCTGGCGGCAAGTCTTGTTCAGCCGGGAAGCTGTATCTATCTGGATGCGGGAACGACCACGTTAGCGATAGCACAGCATCTGATTCACATGGAGTCACTGACTGTGGTCACAAACGATTTCGTTATTGCGAACTACTTGCTCGACAACAGTAATTGCACAATTATTCACACTGGCGGTGCAGTGTGTCGGGAAAACCGTTCCTGTGTCGGGGAAGCCGCTGCGACCATGCTGCGCAGCCTGATGATTGATCAGGCTTTTATTTCTGCATCGTCATGGAGTGTGCGGGGGATTTCTACGCCAGCGGAAGATAAAGTCACGGTGAAACGGGCGATTGCCAGTGCCAGCCGCCAGCGAGTTTTGGTCTGTGATGCGACGAAATATGGTCAGGTGGCGACATGGCTGGCGTTACCATTAAGCGAGTTTGATCAGATTATCACAGACGACGGTTTGCCGGAGAGTGCCAGTCGCGCGCTGGCGAAGCAGGATCTCTCTTTGCTGGTAGCGAAAAATGAATAATGGCCTGCAATAACATTTGGTTACTCATGCTTCACAGAAGAAGCATGAGACTACTTTATTTTATAAAATGACAGCCGCCCGCTTTTCGGCGTGCCGGTATCAATATAAATCTGGTTAGCGAACGTCTGAATGTTATCAAACATCATATGTCCAAATATAAAATAATCAGCGCCGTTTATTTGTTGTAACTCGCCATTAAGCGATTTCTGCACACGATCAACAGGCCAGAGTAATTCGCTCTCCGCTATTTCTTTACCAAAGAGATATTCACTCCCCGGATAATCTGCATGTGCGATGACATATTTTATGTTGTCGTTAGTGATTTCAATAATATGTGGAAGGTGATGGAATTTCAGCAACAGATCTATTGCCTCTTGTTGCTCTGAATCATTTAAATCGAAAAACCAGTCACCACCGCTGGCAAGCCACATATTGCCATCGCCAGTTTCGAATGCATCCAACGCCATCGCTTCGTGGTTGCCTTTAACCGACGTAAACCAGGGTTGGTTTAGCAGGCGCAGCACGTTAAGACTCTTCGGCCCACGATCAATATTATCGCCTGTGGAAATAAGTAAGTCGGTTTCAGGGTAAAAAGAGAGTTGATGTAAACGGGATTGTAATAACTGATATTCACCATGAATATCACCAACGACCCATATATGGCGATAGTGATGGGCATTGATTTTTTGATAGCGTGTAGATGGCATGGTTTTACCCTGTAAAATAAGCTTTCCTATTATACAGGGTATTTTTATTTGATTCGTCAGTTGTCGTTAATATTCCCGATAGCAAAAGACTATCGGGAATTGTTATTACACCAGACTCTTCAAGCGATAAATCCACTCCAGCGCCTGACGCGGAGTCAGTGAATCCGGGTCGAGATTTTCCAGTGCTTCGACCGCAGGCGAAGTTTCTTCTGGCACTGACAGCAAAGACATCTGCGTACCATCCACTTGCGTCGCGGCGGCGTTCGGCGAAATGCTTTCCAGCTCACGCAGTTTTTGCCGTGCGCGCTTAATCACCTCTTTTGGCACGCCCGCCAGAGCTGCAACCGCCAGGCCGTAGCTTTTGCTCGCCGCGCCATCCTGCACGCTATGCATAAAGGCAATGGTGTCGCCGTGCTCCAGCGCATCGAGATGCACGTTGGCGACGCCTTCCATTTTCTCCGGTAACTGGGTCAGCTCGAAATAGTGGGTAGCAAATAACGTCAATGCCTTAATCTTATTCGCCAGATTTTCCGCGCACGCCCACGCCAGCGACAGACCATCGTAGGTGGACGTTCCACGCCCGATCTCATCCATTAACACCAGACTGTATTCGGTGGCGTTATGTAAAATATTGGCGGTTTCAGTCATCTCCACCATAAAGGTTGAGCGCCCGGAAGCCAGGTCATCTGCCGCGCCTACGCGGGTAAAGATGCGATCGATAGGTCCAATCTCGACTTTTTGTGCCGGTACATAGCTGCCGATGTAGGCCATCAGCGCAATCAGTGCGGTCTGGCGCATATAGGTACTTTTACCGCCCATGTTCGGGCCGGTAATAATCAACATACGGCGCTGCGGCGACAGATTCAGCGGGTTAGCGATAAATGGCTCATTCAGTACTTGTTCAACTACCGGATGGCGACCTTCGGTAATGCGAATGCCCGGTTTATCAATGAAGGTCGGGCAGGTGTAGTTCAGGGTATAGGCCCGTTCCGCCAGGTTAACCAGCACGTCGAGTTCCGCCAGCGCGCTCGCGCTCTGTTGCAACGCTTCCAGATGCGGCAACAGCAGGTCGAACAGCTCTTCATAAAGCTGTTTTTCCAGTGCCAGTGCTTTGCCTTTTGAGGTGAGAACTTTATCTTCGTACTCTTTTAGCTCTGGAATGATGTAGCGCTCGGCGTTTTTCAGCGTCTGGCGACGCATGTAGTTGATGGGTGCCAGATGGCTTTGCCCACGGCTGATTTGAATGTAGTAGCCGTGCACCGCATTAAAGCCAACTTTCAGCGTGTCCAGGCCGGTACGTTCACGCTCGCGGACTTCCAGACGCTCCAGATAATCGGTCGCGCCGTCAGCCAGCGCGCGCCACTCATCCAGCTCTTCGTTATAGCCCGATGCGATAACACCACCGTCGCGTACCAGCACCGGCGGTGTGTCGATGATTGCTCGCTCCAGCAGATCGCGCAGCTCGGCAAACTCGCCCATCTTCTCACGTAGCGCCTGTACCGGTGCACTATCGACAGTTTCTAACTGCGCACGCAGCTCCGGCAGTTGCTGGAAAGCGTGGCGCATACGGGCCAGATCGCGTGGGCGAGCAGTTCGTAAAGCCAGACGTGCCAGAATACGTTCCAGGTCGCCGACCTGACGCAGTACCGGCTGCAACTCGGCGGTGAAATCCTGCAATGCGCCAATAGTTTGCTGGCGCTCAAGCAACACGCGGGTATCGCGCACTGGCATATGCAGCCAGCGTTTCAGCATACGACTACCCATCGGCGTGACGGTGCAGTCGAGCACAGAAGCCAGCGTATTTTCCGCACCGCCCGCCAGGTTCTGAGTAATTTCCAGGTTACGACGTGTCGCGGCATCCATAATGATGCTGTCCTGCTCACGTTCCATGGTGATAGAACGAATATGCGGCAGGGTCGTGCGTTGGGTATCTTTCGCATACTGCAACAGACAACCGGCAGCACAAAGTCCGCGCGGCGCGTTCTCGACGCCAAAACCGACCAGATCGCGGGTGCCAAATTGCAGATTCAACTGCTGGCGCGCGGTGTCGATTTCAAACTCCCACAGCGGGCGACGGCGCAGGCCGCGACGGCCTTCAATCAGCGACATCTCGGCGAAATCTTCTGCATACAGCAGTTCCGCCGGATTAGTGCGTTGCAGCTCTGCCGCCATCGTTTCGCGGTCGGCCGGTTCGCTCAGACGAAAACGCCCGGAGCTGATATCCAGCGTCGCGTAGCCGAAACCTTTGCTGTCCTGCCAGATAGCCGCCAGCAGGTTGTCCTGACGCTCCTGTAACAGGGCTTCATCGCTGATGGTGCCTGGCGTAACGATACGCACAACTTTGCGCTCAACCGGACCTTTGCTGGTCGCCGGATCGCCAATTTGTTCGCAGATGGCAACGGACTCTCCCTGATTCACCAGTTTGGCGAGATAGTTTTCCACCGCATGGTAGGGAATCCCCGCCATCGGGATCGGCTCTCCCGCCGAAGCACCGCGTTTGGTCAGTGAAATATCCAGCAGTTGCGACGCGCGTTTTGCGTCGTCATAAAACAGTTCATAAAAATCACCCATCCGGTAAAACAGCAGGATCTCGGGATGCTGGGCTTTCAGCCTGAGATACTGCTGCATCATGGGCGTATGGGCGTCGAAATTTTCTATTGCACTCAT
Protein sequences of DBSCAN-SWA_2 >CP034966|1068654:1081837|1078513_1079170_-|QAS89074.1|DBSCAN-SWA MPSTRYQKINAHHYRHIWVVGDIHGEYQLLQSRLHQLSFYPETDLLISTGDNIDRGPKSLNVLRLLNQPWFTSVKGNHEAMALDAFETGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDNIKYVIAHADYPGSEYLFGKEIAESELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGTPKSGRLSFYKIK >CP034966|1068654:1081837|1079275_1081837_-|QAS89075.1|DBSCAN-SWA MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV >CP034966|1068654:1081837|1072463_1073828_-|QAS89068.1|DBSCAN-SWA MSTITLLCIALTGVIMLLLLVIKAKVQPFVALLLVSLLVALAAGIPAGEVGKVMIAGMGGVLGSVTIIIGLGAMLGRMIEHSGGAESLANYFSRKLGDKRTIAALTLAAFFLGIPVFFDVGFIILAPIIYGFAKVAKISPLKFGLPVAGIMLTVHVAVPPHPGPVAAAGLLHADIGWLTIIGIAISIPVGIVGYFAAKIINKHQYAMSVEVLEQMQLAPASEEGATKLSDKINPPGVALVTSLIVIPIAIIMAGTVSATLMPPSHPLLGTLQLIGSPMVALMIALVLAFWLLALRRGWSLQHTSDIMGSALPTAAVVILVTGAGGVFGKVLVESGVGKALANMLQMIDLPLLPAAFIISLALRASQGSATVAILTTGGLLSEAVMGLNPIQCVLVTLAACFGGLGASHINDSGFWIVTKYLGLSVADGLKTWTVLTTILGFTGFLITWCVWAVI >CP034966|1068654:1081837|1074697_1075336_-|QAS89070.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS >CP034966|1068654:1081837|1076591_1077500_-|QAS89072.1|DBSCAN-SWA MKTGSEFHVGIVGLGSMGMGAALSCVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNAAQVKQVLFGETGVAQHLKPGTAVMVSSTIASADAQEIATALAGFDLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGAEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS >CP034966|1068654:1081837|1073916_1074693_-|QAS89069.1|DBSCAN-SWA MPRFAANLSMMFTEVPFIERFAAARKAGFDAVEFLFPYDYSTLQIQKQLEQNHLTLALFNTAPGDINAGEWGLSALPGREHEAHADIDLALEYALALNCEQVHVMAGVVPAGEDAERYRAVFIDNLRYAADRFALHGKRILVEALSPDVKPHYLFSSQYQALAIVEEVARDNVFIQLDTFHAQKVDGNLTHLIRDYAGKYAHVQIAGLPDRHEPDDGEINYPWLFRLFDEVGYQGWIGCEYKPRGLTEEGLGWFDAWR >CP034966|1068654:1081837|1077695_1078463_+|QAS89073.1|DBSCAN-SWA MIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGSCIYLDAGTTTLAIAQHLIHMESLTVVTNDFVIANYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSVRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALAKQDLSLLVAKNE >CP034966|1068654:1081837|1068654_1069416_+|QAS89064.1|DBSCAN-SWA MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW >CP034966|1068654:1081837|1075332_1076595_-|QAS89071.1|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQAREAGHPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSTETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPGVPWVNALDKPVSLALKSGNFGDEAFFSRAQREFLS >CP034966|1068654:1081837|1071377_1072370_+|QAS89067.1|DBSCAN-SWA MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFARRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMNQTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILADEKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRERVRQIQVEGLRRLREILQTQGLNIEALFRE >CP034966|1068654:1081837|1069409_1070036_+|QAS89065.1|DBSCAN-SWA MVSRRVQALLDQLRAQGIQDEQVLNALAAVPREKFVDEAFEQKAWDNIALPIGQGQTISQPYMVARMTELLELTPQSRVLEIGTGSGYQTAILAHLVQHVCSVERIKGLQWQARRRLKNLDLHNVSTRHGDGWQGWQARAPFDAIIVTAAPPEIPTALMTQLDEGGILVLPVGEEHQYLKRVRRRGGEFIIDTVEAVRFVPLVKGELA >CP034966|1068654:1081837|1070175_1071315_+|QAS89066.1|DBSCAN-SWA MSAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMGTTSTAQQPQIQPVQQPQIQATQQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSGSTYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVGQTLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPTATTVTAPVTVPTASTTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRVVYAGNALRGYGNLIIIKHNDDYLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLHFEIRYKGKSVNPLRYLPQR |
12 | Escherichia_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1934259 : 1960503
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP034966|1934259:1960503|DBSCAN-SWA ATCATGCAGCGTCCTCTGCTTTTAATGCCACCAGTGAACCAAAGTTAAAGCACTGGAACCACAGCTCGCTATGCTCAAAACCGGCGTTATGCAGGCGTGCTTTATGTGTTTCCACGGAATCGGTCAGCATCACGTTTTCCAGCATGCTGCGCTTCTGGCTGATCTCCAGTTCGCTGTAACCGTTGGCACGTTTAAAGTCGTGGTGCATGTTGAACAACAGTTCACCAACTTTGGCATCTTCGAAACTGAATTTTTCCGAAAGCACCAGCGCGCCGCCCGGGTTCAGTCCTTGATAAATTTTATCCAGTAATGCCTGGCGCTCGGAAGGATCCAGGAATTGCAGGGTAAAATTCAGCACCACCATCGATGCGTTTTCAATGGCGATATCGCGAATATCACCTTCAATGACGTCTACTGGCGTAGGGGCTTTATAGGCGTCAATATGACGACGGCAGCGTTCAATCATCGCCGGGGAGTTGTCGATGGCAATAATTTTGCAATTATCATGATGAATGTTGCGACGCACCGAGAGCGTCGCCGCGCCCAGAGAACAACCCAGATCGTAAACCTGCGTACCAGGTTGAACGAAGCGCTCGGCTAACATACCAATCATGGAAATAATATTGGAATAGCCGGGAACGGAACGCTGGATCATATCCGGGAAGACTTCAGCTACCCGTTCATCAAAGGTCCAGTCGCCCAGTCTGGCGATAGGGGCAGAAAATAGCGTGTCGCGGTGAGACATAACGTAAAAATCCGGGAAAAAGAAAGTGGCGTATTGTGCGCTAACGCAGGGAGAAAACCAACTCCCAGGGCATATACCAAAGATTCGCCAGCACCATCAGCAACAGCGCACACCAGGTGGCGCTCATGCCGGAACGTCGCCAGCGGAACAGACGGTGATGAAAACCGTAATAATGCATCAGACGACCAGCAAGCAAAACGATGCCGCAAATATGCACCATCCAGGTTTCTGCGCCATTCATTTCCATAAACAGCATCAACACAATCGCGATGGGAATATATTCCACCGCGTTACCATGAATGCGAATAGCGCTTTGCAGTTCGCTAAAACCGCCGTCGCCATAGGCAACGCGGTACTGCATTCGCAGGCGAACGACATCAAAAGAGAACTTCATTAATAACAACGCACTTAAAACGGCATACAGCGCGCTTACCATACAAACTCCCTTTAAAATGGCCGATTGCACCTGTCTATGATAGGTGGCAAATTCAGAAAAGAGAAGATTGCTGTGGAATAGCCGGAACTGCTCCGATCTCTGGAAGCGTTTTACGTAAGTCTTCCCACAGGGTATGTACCAGTTCCGGGGCCTGGGCAATATCTGGCGTATGTAAAAAAAGATAAGGCGTAGTGGTCTGATGCCACTGCGCTAATTTCTGTAACCAGACCTGAAATAATTCCCGGTTTTGCGTCATATCATCACTACCGATAAAACGAATCAGCGGATTTGTCGCCGTCAGTACAGCATGTACCGGAACTTTAGGTTTTTTTCGTTGAGCGTCGCGAATAGCTTCACTGTGTGGACATGCTGCATGAACCGGGCGGCTGTCTAAAATCACCCGATTAACGCCGCGCTGATGTAAACCGCGATTAAGCGTTTGTTCCTCTTCCCCTTTGGCGAAAAACTGTGGATGGCGGACTTCCACACCATAATTAAATTCGCCAGGGAGAGAATCGAGAAAATTCCAAAGCGCAGGCAGCTCCCGTGGGCCGAATGTGGCAGGCAGTTGCAGCCAGTATTGCCCAATGCGCGGAGCCAACGGTGACATGCGGGTCAAAAATTCAGTCACTAAATCATCGCAATGCCGTAATGCTGCCTGATGCGAAATGGTCGCCGGAAACTTAAAACAGAAGCGGAAGTCATCTGTGGTCTGCTCACGCCAGCGCAGGACAACCTCGGGTTTCGGCAGGGCGTAAAGCGTGGTGTTGCCCTCCACGCAGTTAAAGTGGCGGGCATGACGCGCTAACTTGCTGATTTTTATCATTCTCATATTCCTATCACCATGCCATGGGGCATGGATGGGGCAAAGTCCGATAATTTTTGGTTCAACATGGCAATCTGATCGCTGCTGCTGTCGGCCATCCAGGCGCCGTAAACATTGAAAACCATCTGGGCGCTGGCGTGTCCCATCTGGCTGGCAATGAAACTCGGATTGGCCCCGGCAGACAGCGACCAGCAGGCGTAAGTGTGTCTCGACTGATACGCCTTCCGGTGCCTTAAACCTGACCGTTTAAGCGCCGCATCCCATGAGTCACCCACTGAATCGGCTTTGTACAGGTAACCCACGCTGCCACTTTTTTTAACCAACTGAGGATTGAACACAAATGTACAGTCGTGAATAACCGTTCGGCCATACTCCCGCAATGCCACCTCAATCTGATACTGCCTTCCCAGCCTGGTCATTTCTGCCTGGTTCCTCAAAGCGTCGATGGCAGGCTTGATCAGGTGCACGACCCTGTCGGTACCAGCTTCGGTTTTTGGTAGAGTGAAATCACCGAGTTTCGTATAATTCCGGCGTATGGTCATCGTTCCAGCTTTCAGATCTATATCTTCCCATGCGAGGGAGACCAGCTCACCGTGACGTAATCCTGTGTATACCGCAATTGACCACAGGTTTTTCGTTTGCTGATGCTTGCAGGCATCAATGAAGCGAACGAATTCATCACGTGTGAGCGGATCTGGTTCTATCCTAGCCCTCTTTAGCGGCTTGATGCCGTTAAACGGGTTTTCACTCACATAGCCATTATCAACAGCGAACTGAAACATACCCGCGATCGTGGTCATGTAGTAGTTTACCGTCACCACACTCAACCCTTTATCCCCCGCCAGCATATCCTTCCTGATATAGAGAAGCTCTTCTCTTGTCACAGACGAAACCAGCTTATTCGCGCCAACCCTAGGCAGCATGCTTCTCACCACCGATTCATACCTGTTTATGGCATTAGCGCAGATCTCCATCCGTTTAAGCTCAAGCCATTTTTCAGACAGATCTTTTACGGTGATATCTTTCTTCCCGATGCCGAAAGTTTTCAGATTTGGCGAGTTGGGGAATTGGGCCGCATAGTCAAAGGTCCCCATGCGGATAGCGAAACAAACTGACGTTCTCAGCTCCCCGGCCACCTTCCTGTTTTTAGCGGTGTCAGGGACACCGAGGTTTTCCCTGACACGCTTACCTTTAAAAATGAACCATATGCGGAGTGACTTTCCGTGGTTCTCAACGCCCGTTGGGTATGATTCTTTACTCATTTATCCCTCCCGACGTCCAGGAGCGTTGCAAGTTTACCTGTTTCATACCGCCCGATCACCCAATGGTTGCTTTTGCGCCTGAATCCATGCGTCTACCGCTTTGCGGTTGTACATGCATTCGCTGGTTGGCTTTGGGTCACCTTCAGGGGAAACGTGCTTATACTCACGCCCAAGCAGCCAGGATGATTTGCGGGCCCGTGTAATGGTTCCACGTTTCATCCCTGTGACTGCCATCAGCAAGTCCTCTGAAACCCATTCGTTTGGCTCGATCTGGATTATTGTCTGCATGCATCACCTCTGCCGCACTCTTTAACTGATATAGAATTCCCAGCTACTGGCGACGGAATTCATAACCTTAATCGCCATTTCAGCCGTTTCCCTGCTGTCGTAACACTGGAAATACATCTCCTGCCCAGTACGTTTAAGCTTCATCATTACCCACATACATCACCTCAGGTGCTTACCACGTTCTTCAAACTCTTCTTGGCAATCAGCACAGCGCTGGCATCCCGCCACCAGTTCCCGGCGCCGAGCCGCTATCTCTTCCCCGCAGTCGCGGCAGTGAGTAGCCGAAACCGCCGAATGGTTGATGCGACATTTCGCAATGGCGGCTTCCCGCTGGAGTTCCGCTAACTCGTTGGCCTGATCGATGATTTCTGCGCTCATGCTGCACCGCCTTGCCAGTCGACCAAAAATGAACAGTCTTTTTTATGCTCGTTACAAGACCAGATCACTTCGTCGTCACCACGGAAAACATTTACTTCCACCGTGGTTTTAAACTTCGCAACTGCACCGCACTTGCATTTGGCAGAGGTATTTTTGCTTTTGGCGGCAACACTGCCGACTCTTGGGTATTTGCTCATGATTCCACTCCATACCGCTCATTCATGCGGCCAATAACACTGACAAATTTCACCAGGCTGACACCCATCGGCTTTACCTTCTCGTAGTGCTTGCGAAGGATGGGGGGCATACAGCGTTCCACTTCGGTTAGCCATTCTGGGATGGGCATACATCATGGTTAAAACAGGCCAGTGGATTACCAAAAATGCTCTGAGGCAGTGGGACAAGCGTCGTAAGGAATCTCGCCGCCAGAAAGCTGTGAATGAGTTTTATGACGCCTTTGAGCTTAACAGCCTGGAACCTGGCTCTACCGTTCGCCTGGCTACTAAAGGCGACCTGACAATCATGATGTTCCGCAGCGAGGAGGCCGACAAATGATTACCGGGACTGCTAACTATGACGATGTGGCAGAAGTCCGCTGCAATTTGTGCGGCGGTTATTACAAAGCCGACGATCCGGAAAGTCACGAATGTGAGGATGCAGCATGACTGATATCACCGAACTGGCGCTGATTGCCAAAATCAAAAAACAGACCGAAAACTTTGACACTGTAGTGCTGAAAGAGTGGGAAGCCCTCGCGCTGGTAGAGGCGCTGGAGAAGGCGCAGAAGCGCATCGCCGAACTGGATAGAAAAAACTGCGAGCTTGATTCGCTTACTCAGCGATGGGCTGTTGAGCGCGCTGAAAATGCAGACCATATCGCCGAGCTGGAGTCCCACCTAGAGTCTGCTGACAAACTGCAGGACAGTGCATTCCGTAGCGGATTGCAACACGGGTTTAGTCTCGGACAGACAGATGACCAAAAAGGTTATGAGCAGAGCATGGCCGCTTATAGCCCCGACGCTGGCATCAAGGTGGAGGATGAGTGATGGCTACCGTAACAGAAATTCGAGCAAAGCTGCGTGCCGGTGAGGTAGTGATTATGCCGGAAAATTCTGTATTCCTGTTCATGCGTGAATGCGAGCGCCATCCAGAAGGTGATGAGTGCTACAAAATTGAACCCCACTCTTATGGGTACTCAAAAGTTTTCGACCCCAAGCGAGCGAACGGAGCCAACCAATGACCAAATCAACCATAACCAGAGAGCGAATATCCGGGTTTATATTGCCTGGGAGTCATTAATCAATCTGGCGTGGAGTGCGATTCGTGGCTAAATCCCCCGCCGAACGCAAAGCCGCGCAGGAGGCAAAGATCTGTTTGGCTGGAAGTGAGGGCGTGGTAATGTGTTTACCTATTCTTCATTTGCTTAGGTGGCGTGAAAATGCTGTATACAGTGACGTTCAATGAGACAGAGCGGAAGGAAGATATTGAACTTAGCGCGGATGTCAGGGCGGGTGATTTGCTGTCATTAACGCTCGATGGCGTGAAAGATGACTACACGGTGATGACAGTAGGCGGTCCTATCATTGGCAATACCTGTGTCCCATCAATAATCCGGGTAAAAAAAGCTCAGAAATAATAATGTATAATCCCCTCCACAGCAGAGGGGATTTTTATGACAAACAAAAAAATGACACCTGCCGAAAAGCTCAAAGCTTCGCGGAAGCGGTATAAAAAAATCTGGCTTCAACTGGATATCGCTAACGCCAAGCGATTTGGCGAGAAAGAAGTGCTTTCCGTTGACACCTACAGATCGCCCTATGAAACGCGCAAGAAAAGAGGGAGGACAGCAGATTGACAACATCATCCTGGAACATAGCAGCCAAATCGAAAGACGAGCAGGACAAAGTCAACGTTGACCTCGCCGCGTCCGGCGTCGCTTACAAAGAACGCCTGAACATGCCGGTTGTCGCCGAAGTGGTGGCCAGACAACAGCCTGAACACCTGCGGGACTACTTCATGGAGCGCGTCCGCTACTACCGCGAGCAGAGCATCCAGCTTCCCCGCGCATCCGATCCGCGCTATCTGGAAATGGCTGAGCAGAACGCCAAGAAATAGCGATTTCCTCGCCTATGCTCATTTTGCTTTTATCCCCGGGAGGGGCGATAATTACTTAGTCAGTCTGGACAACTGACAACTTTACCCCGGCGCCAAGTGGGGACACATGGCGCAAACACTGCAATTTGAGAAAAGTTATCAAAACGTACTGATTCCCGCAGAGCCGGGAACCAGCGAATACCTGCAACTTATCCCCGTAGGGCAACTGCTTTGCGGTGAGTTCCGCAAGCCCAGGAATTACGCATTCCACAAGAAGTTCTTCAAACTTCTGACTCTCGGGTATCACTACTGGACACCTTCCGGTGGTCTCATTGAGCCCGCGGAGCGTACCCTCATATCCGGTTTTATCGACTTTCTCTCATCCGACCTCGATCAGCGCGCTGCACTCCAGAACGCCGCGGAGATGTATCTCTCCTCGGTCGGTATATCCCGTTCCCGCGATATGGCGCTGCTGAAACACTTCGAATCCTTCCGCGAGTGGGCAACCATTCAGGCTGGCTTTTATGACGAATACCAGATGCCTGACGGCAGCCGTCGTCGTGTCGCAAAGTCGATCTCCTTCGCCAGCATGGACGACAGCCAGTTTAACGGCGTCTACAAATCAGTGCTGAATGTGCTTTGGAACTACATTCTGCGTCGCAAGTTCCACTCGCCGTCTGAGGCTGAAAACGCCGCCAGTCAGCTGCTGAGCTTTGCGGGGTGATGGCGATGAAATACTCATGGTTCCAGCATCCCGACTGCACAACCGAGCAGGCCGAACAGTTAGTGTCCAGATATCAGGCGCGAGGCATTGTCACAGAGAAAAGCCTTAACGCGGATTATCTGAGTTGGACGGTCAGCGCCCGGCTGCCGGTTTGTGTTCGCCCGGAGCATACTCCGCGATCACTTCGTCAACGTATATGGGGGTGAGCATGGCCAATCTTCGTAAAGCAGCTCGCGGTCGTGAATGCCAGGTTCGCATCCCGGGCGTCTGCAACGGCAACCCTGAAACCACGGTATTGGCCCATATCCGCATTGCTGGATTGTGCGGGACGGGGATTAAGCCGCCTGATCTGCTCGCCGCTATCGCCTGTTCATCCTGTCACGATGAAATAGACCGCCGCACGCGCCTGGTAGATGCGGAGTATGCGAAGGAGTGCGCGCTGGAGGGAATGGCCAGAACGCAGGTTATCTGGATGAAAGAGGGGTTGATAAAAGCATGAACCAATATCGCATTTCATTACCCTGGCCGCCAAGCAACAACCGCTACTACCGGCACAACCGGGGACGCACACACATCAGCGCGGAAGGGCAGGCATACCGCGACAGCGTCGCCAGAATCATCAAAGACTCGATGCTTGATATCGGCCTGGCCACGCCATTGAAAATCCGTATTGAGTGTCACATGCCGGATCGCCGGCGCCGTGACCTGGACAATCTGCAAAAGGCAGCATTCGACGCCCTGACGAAATCGGGTTTCTGGCTCGATGACCAGCAGGTTGACTACTACAGCGTAAAGAGAATGCCTGTCGTCAAAGGTGGGCGGCTTGAGCTAACCATTACCGAAATGGAGTCCGCATGAGCCGTGACGTTATCGAACGCATCCGCGACCGCTGGCAAAAGCTTCGCCTCCTGCGTAGCCGCGGCACCGTGCTGGTCGACTACAAAATATTACGCAATTTCGTCCGTATCTATAAGCGCCTGGGAGAAGCAGCATGACAGCTCAATACTTGGAATTTGTTCGCCAGCAGCTGATAGTGGCCACCGCCGATCTGAGTGGTGCGACGAAAGGGCAACTGGTAGCTCTTGCAGAGAACGCGCAATTTACCGCTACGGCGCGCAGCCGTGGCCGGAAGAAGGTATATAGCGAGGTGAAGCAAAAAATGGTTAACCCGGATGGACCGCCGATGAGCGGCAGCCAGTCCCGCGCTAAGGGTTCCTCAATCGCTCTCGTTCTGCCCGTTGAATACTCGACAGCCAGCTGGCGTCGCGCCCTCCTGTCGCTGGAAGACCACCAGAAATCCTGGTTGCTGTGGAACTACAGCGACAATATCCGCTGGGAGCACCAGGAGACGATAACCCGCTGGGCATGGGAGCAATTCAGCGACAAGCTGGCCGGTGTGCGCATTGCAAAGAAAACAGTCGATCGCCTCTGTCAACTTATCTGGCTGGCCGCACAGGACGTCAAAGCCGAGCTGGCAGGACGGGAGACGTATGAATACCAGTCGCTGGCGGAGCTGGTTGGTGTAGCAAAGTCCACATGGACAGAAACCTACCTCCCTCATTGGCTGGCGCTGCGCAGCAGTTTTGTGAAGCTTGATAGCGACGCTCTCATGGCGGTAACGCGATCACGTTCACAACAAAAGGCGTCAAATTTAGATGTAAGTCTTGCAAAACCGAACTGAAAGGCATATATTTCATGTAAATCTGATATCGTCGCCATAGCTTCGATTGTCGACACATAAAGAATTCAAGCCCGAGGTTAACGCCTTGGGCTTTTTCATTTCAGGGTCAGAAGCACAGCGGTTGTGCGTTCGGCTGTTAACCGAATGGTAGAAGGTTCGAATCCTTCCTGTCCCGCCAAATAATGGCCTGACCTGATGACGGGCTCATAATCCAATCCATCAGGGGCGCTGCTGCAACAGCGTCACAGGCCGCCAGACCAAGCCTGGGTATTTTCGGTCATCACCGACATTGCTATTACCCTCATACTTATTGCCTGCCTAACCGCAGGCTTTTTTATTTTCAGGGTCGCGGGAATCACCCTCGACGCTTTGTTGGTAAATCAGCCCGACGGCCCTGACCTTCTCACACACAGCTTCCCGATCTTTCATCGGAGGCGGTAACTATGGCTAAGCGTATGCAAGACAAAGAGAGCATTGCCGGGATGTCCTGGCTGGTTCTGCTGATCATTGCTGGTTGGGGCGGCCTTGTCCGATTCCTGATGGACGTAAAGCAGGGCAAAGCAAAATGGAGCTGGATAAATGCTTTTGCGCAGATTGTGGTTTCGGCTTTTACCGGGGTTATTGGTGGGCTCATCAGCATTGAAGGTGGCCTGAGTATTTACATGATACTGGCCACTGCCGGTATCAGTGGTGCTATGGGTTCCGTAGCGCTCACGTATTTCTGGGAACGAATCACCGGAGTGAAAGCACAATGAAAGCAGACCAGACTATCGAGGGGATCCTCGGCAAAGAGGGCGGTTATGTCGATCATCCGTCGGATAAAGGCGGGCCGACCCGCTGGGGCATCACGCAGACCACAGCTCGAGCACATGGTTACACCGGTGATATGAGAAACCTGCCCAGGGAAACAGCAAAGCAAATTCTGCTCAGCGATTACTGGACCGGCCCCCGATTCGATCAGGTGGCAAGTTTATCTACGTTACTGGCAGATGAGCTTTGCGACACTGGCGTGAACATGGGGCCATCGGTTGCAATTAAGTTTTTCCAGCGCTGGCTCACTGCCCTTAACATGCGTGGGAAGTTGTATCCCGATCTGATCCCGGATGGCGCCATTGGCCCCCGAACCATCACTGCGCTTAAGGGATATCTTTCAGCCCGCGGGAAAGAGGGGGAACAGGTTCTGTTACGTGCGCTGAACTGCAGCCAGGGCGCCAGATACCTCGAACTGGCGGAGGGCCGCGAAGCCAACGAGGAGTTCCTCTACGGCTGGGTTAAGGAGCGCGTGCTATGAAGATGATCATCTTCGCTTTGCTCGTGGTGGTGGCTGTGCTCGTTCTGTTACTGCTGCGCAAATATACCCGGCTGGAGTTCGTTGCCCATGCCAGCCTGCTGCTGAAAACGTGGTCTGTAAAGCTGGGGGCTATCGGTGCGCTGGTTGGCATGTGGGCGCAGTCGTTCCCGGATGCTGCGCTGCACGCCTGGGCGATGCTGCCGCCGGATATCAAAAACATTCTGCCTCCAAACATTGTTGCGTTGATTAGCCCTGCGCTGGTGGTGCTGGCGGTGCTTTCGCAATACGTACGCCAGCCAGCATTGAAAGCTAAGGCCGAAGAACTGAAGGAGCCGCAGCAGTGAGCTTCGAAATTATTTCTGGTCTGGTGGTCGTCATCCTGGGCGCTATAGCTGGCGCGTTCGGCATTGGTCATGCTCGCGGGACCAGTAAGGCGCAAGCCAAAGCCGATCAGCAGCGTACCGAAGAGAATGCCGCCGCCACCGTCGCCGCAGCAGAACGTAAGGCAGAAGTCACGAAGGAGGCAAGCGATGTACAGCAAACCGTTAGCCATATGCCTGATGACGATGTTGATCGGGAGCTGCGCGAAAAGTTTACCCGCCCCGGTAGTCGTTGATACGGCCTGCAGCTGGGTGCGGATCATCTACCTGACCGACCACGACATCGACGTGCTGGATAAGCAGACCAAGCGCGACATCCTGGCGCATAACAAAGCTTGGCAGGCGAACTGCCAGAAACCAACAGCCGTCACCATTCCAAAATGATAAGCATGACAACTGGCATTCAATTATTGCCTATGATGATGCTCTTTGATTTAGTTGCTGATATAATCCCCCTAAATGATTTCAGGAGGGTTCTGCGTTGTCCGCGATAAACCACATAAAAAACCCGTTAACGATTATCGGTATTTTTGCGGGGATAGTAGAGGTCTCTGCAAACCTCGTATTACCTTTTCTTAATGATTCCCAACAAAGTACATATCTTTGGTTTTTAATGTTTTTCCCGGCAGGGCTTGTAATAGTGTTTTTTTTGACGCTGAATTTTAATCATGTTGCACTTTATGCTCCAAGCGATTATAGCAACGATAAAGGATTCATGCAGGCCAATGGTAAAATGATCGGTAATGATGTCAAGGATACTGATGCAACGCAAGGCTTTGAATTAACATGAATAAATTTATTTCAAGCACTTTTAATAATACTGTCGTCGAACTGGATGGCAACCATTTCGAAAAGTGCGTTTTTGAAAATTGCGAAATTGTATATAAAGGATTGCAGCCTTTTAATTTAATTAATTGTAATTTTATTGCATGTAAATGGAAATTGGAAGGTGCAGCGTCAAACACAATTAACTTTTTAAAGGTTATGTATAAAGATATGGGTGAGTTTGGAAAGAAAATGGTAGAAGCCACTTTCGAAAACATAAAAAAATAGTTTCATCCATTTACTTTCAGTTTATGAGATACAGCCCTGCAATCGCGGGGCTTTTTTATGCGCATCGCTCGCGCACATCAAAGAAAGTATTTCAGCGGTGAGCTTGGGCAAACCGTTAACTTTCGGCGGCTCTGCCGTGCGACAGGCTCACATCTAAAAGGAAAACAGCATGAAAAAGTTTTTAATCGACATGATTGATAAGGGCCTCTATTACGCGTTGCTTGCGGCGCTGGCTTTTGGGTCGGTCACAGGGCAAAGCAACGTCCTTAACGTGGCTGCTGCTGCGTTTTGGGTGGTGGTATTTCTTGGTGGGGTGGTTGGCATTATCACAATATTTCTCGCACATGGTGCAGAGCATATAACTGATGAGAAATCACGTCAGTCTGTGCTGGAGTCGCTGAGGAAGATTGTCCGGCGTAAGAACGTCATTGCTCGATGGTGGGGCTGGTTCTGCATGATGGCGACCATCGCTCTACTTGCCTACGGCGGCTGGGTATTTACCGCAGTGTGTTATGCGCTTTCATCCCTGTTTGTGCGGTTCTGCATTTCTCTGGCCAGAGACAAAGTAGAAAAACAGACGGTCGGCGTTTTAGCTTGATGGCATTACAGAGCCACTTCAAGAGGTGGCTCGATAATGTCAAGGCGAGGACAAAATTATGGCAAAACCGGACTGGGAGGCCATCGAATCGGCATACCGGGCCGGAGTCCTTAGTCTCCGTGATATAGGCGATAAATACGGCGTTACTGAAGGGGCTATCAGGAAGAGGGCTAAAAAATTTGACTGGGTACGCAATAGCGGTACGCAGGTACGCAAAAATGGTACGCAAAGTGGTACGCAAAAGAGTAAGGCGCGTACCAGCGAAAAGCCTGCCAGCGCTGGCCGCACGCAAAAAAGTACGCAACCAAAAGCCGAGCCTCCACCAGATACGAAACCGATACGCGGGGTGCGTACCGATCCGCCGACCAACCCATTCCAACCCGGCAACCAGCAGGCGTTAAAGCATGGTGGTTACGCCCGGCGTCTTCTGCTTAAAGATGAGGTCATTGAAGACGCGAAAGCGTTGACACTCGAAGACGAATTATTTCGCCTTCGGGCTAACAACCTTGTCGCCGCAGAGAATATTGGCCGGTGGTTGACCAAGCTGGATGATGCTGAAGGGGACCAGGAAAGAAAGGTGTTGATGGAAAATATCAGCGCCGCCGAGAAGGCGATGATGCGCAATACCGTTCGTATTGAGTCTATCGTCGGCACGCTTGCGACGGTAGGCAAAATATTTGCTGATACAGACTATCGCAAGGCTGCTACTGATAAGGTGTCGCTGGAGGCCGATCGTCTTCGCCGTGATGCAGGTATTGATGATGGCAACGGAGAGCGTGACCTCAATGACTTCTACTCTGACATCCAAACCGACGCTGAATCCGGTCCTGCGTAGCTTCTGGACGACGCAGGCGCGTAACAAAGTGCTTTATGGTGGCCGGTCATCGTCAAAATCGTGGGATGCCGCTGGCATAGCCATATTTCTGTCGAATAAATACAGCCTGCGCTTTTGCTGTGCACGTCAGATCCAGAACAAAATTGAAGAGTCGGTATATACCCTGCTCAAAATTCAGATTGACCGCTTTGGCCTGCGGCATCGTTTCCGCATTCTGAACAACAAAATCATTAACCGGGTGACCGGGTCTGAATTCGTCTTTTATGGGCTCTGGCGCAACATTGAAGAGATTAAGTCTCTGGAGGGTATCAGCGTTCTGTGGCTTGAAGAGGCCCACGCGCTGACGGAATACCAGTGGAAGATACTGGAGCCAACCATCCGTAAAGAGGGATCAGAGTGCTGGTTTATCTTTAACCCCGGGCTGGTGACTGATTTCGTGTGGCGTAACTTTGTGGTCGATCCGCCAGAAGATACGCTGATACGCAAAATCAACTACGATGAAAACCCCTTTTTGTCCGACACCATGCTGAAGGTTATCGACGCCGCTAAACGCCGGGATCCGGATGGGTTTAAGCATGTCTACGAAGGCGTGCCAGAGTCGGATGATGATGCGGCCATTATCAAGCTGTCATGGATTGAGGCGGCCGTTGATGCCCATAAAGTCCTTAATTTCGAGCCGAGCGGGCGCAAGCGTATTGGCTTCGACGTTGCTGATAGCGGCGCCGATAAGTGCGCTAACGTCTATCGTCACGGCTCCGTCGTGTATTGGGCGGATGAGTGGAAGGCGAAAGAAGACGAATTGCTGAAGAGCTGCCAGCGTACGTATCAGGCGGCACTGGAGCGTGATGCTGATATTGTCTACGACTCAATCGGCGTTGGGGCATCTGCTGGCGCGAAATTCTCAGAAATTAATGAGGATCGTAAGCGCGAAAACATGAATGCATCCCGCATCAATTATCAGCGATTCAATGCTGGCGCTGGTGTGAATGAGCCGGACTACGAATATATTGGCATCCCGAACAAGGATTTTTTCGCCAACCTCAAAGCGCAAGCCTGGTGGCTGGTGGCGGATCGCTTCCGTAACACCTTCAACGCGGTAAAGAACGGCGAGCAGTACCCGGTAGATGAGCTGATAAGCATCGACTCATCTTGTCCGCTGCTGGAAAAGCTCAAGCTGGAACTTACCACCCCGCACCGTGATTTTGACAAAAACGGTCGCGTGATGGTGGAAAGCAAGAAAGACCTCGCCAAGCGTGACGTACCATCGCCGAACGTGGCCGACGCGTTCATCATGGCGTTTGCTCCAACCGATACGGCAATGGATATCTGGGAAGCGCTGGGAAACAGCTAAATACCTGGAAATAACCGTTTCACGCAAAATTCACGCTATTCATTTTTCGACCCTGTTTATGCATGTTTTATTCACGCGCTTTTAGCCACTTAACCCAGATAAATAAGCCTTTGGCGGACATTTCATGAACTGAATTTCCGCACAACATAAGTTAAATCGGGTCATTTTTTAACAAATTATCCTATCCGCCACGAGTACCAAAAAAGCCGGAGAATAGTCACCATGGCGAAGAAAACAGGACGAGTCGCCACGGCGGATTCGTACGATAACTTTGTTGCCCGTGTCGGAATGCAGCAGCCTAACCAGCATGCCGCATCGACCTACAGGGCGAACTATACCAGCCGCAACCGCCTGCTCATCGAGTGGGCTTATCGTTCCTCCTGGATTATTGGTGCCGCAGTCGATTCGAAAGCGGACGATATGACCAAAAAGGGCGTGCGGATCACCAGTGAGATTGACCCGAAACGTCGTGGCATTCTGGAATCACGGTTCGATGAGCTTCAGCTTTGGGATTGCATCAACGAGACGCTGAAATGGTCCCGGCTGTATGGCGGGGCGGTGGCGCTGATTCTGATTGAAGGTCAGGCACCGCTGACGCCGCTGGTGCTGGATAAGGTTGGCAAGGGCAGCTTTAAAGGTCTGGCTGTACTTGACCGCTGGATGATTAACCCACAGCTCACCAGGCGCATTAAGGCGCTTGGCCCCAATCTCGGCAAGCCAGAATTCTATGACATCGTGACGACGGCGCAGGGGCTTCCTGCGTGGACCGTTCACCACAGCCGACTGATTCGCATGGATGGTGTGAAACTGCCCTACCAGCAGAAAATCACCGAAAACGAGTGGGGCATGTCCATTGTCGAGCGCATCTTCGATCGCCTGACTTCCTACGATAGCACCAGCGTCGGCGCCGCCCAGCTTGCCTACAAGGCACATCTGCGAACGGCAAAGATTAAAAAGCTGCGTGAAATTATCGCCATGGGCGGTAAACCATTCGAAGCGCTGGTTAAACAAATGGAGTTAGTTCGCCAGTACCAGACGAACGAGGGTATGTCCCTGTTTGATTCGGAGGACGAATTTGAAACACATTCCTATTCTTTCGCGGGCCTGTCTGACCTGCTTAGCGAGTTTAAAGAGGATATCGCGGGTGCTGTTGGCATTCCTCTTGTCCGTCTGTTCCGCCAGTCACCGAAGGGTTTTTCAACCGGTGACGCTGACCTCGCGAACTACTACGACGACGTTGGAACGCTTCAGGAGCGAGATTTACGGCCTCACATCCGCCTGTTATTCGATGTACTGCATCGCTCAGAGTTTGGCGAGCCGTTGCCGCAAGATTTCACCTTTGAGTTTAACCCCCTGTGGCAGATGAGCGACACCGATCGCTCCACGGTGGCAACCAACACAACTACCGCTCTTGCAACCGCGGTGCGTGATTTGGGCATGTCGCCGGCTGCTGCTCTGACTGATTTGCGCGAGCTGTCTGACGTTACCGGCATCGGTGCTTCAATTAGCGATGAGGATATCCAGAATGCGGCGAAACAGTGGCAGGAGACTGAATCTGAAACCAGCCCTCCGCCGCCGATCGGAGGTCCAGTATCAGAAAAGCCTACTGGCGATAGTCGACCAGATAAATCAAATCGTCACGGGTTCCTACGATGGTTCACAGGCAAGCGCTGAGAGCATTGCTAAATCGCTTGTTGACTACTCCGGGGTGATCGACGACTGGGCCGAAATGGTCGGTCGAAAGATGTTTGCCCAGGTGGAGCGTGAAGAGTGGAATCAGTGGCGCTCTGTTTCGGAAGAAATATCCGCTGGTCTGCGTGACGTGATTAGTAACACTCCTGTCGGCATGGTGGCACAAGACACCGTTTACCGGCAGATTCGCTATATGAAATCTTTGCCATTAGAGGCGGCCGGACGTGTCAGGGAGATTCAGGAGCGTGCGATAAAGGCTGTCATCCATGGTGAGCGTCCAGATCAGCTTTACGAGATGATCATGCAATCCGGTGACGTGGCGGCCAGCAGGGCGCGGATGATAGCCCGCACTGAGATAGGCCGCGCAACTACCGCATTAACTCAGGCTCGGGCGCTGTCAGTTGGCTCTGAGGGGTACTGGTGGCGCATCAAGGGGGCTGGTACCAGGCCATCGCACCGAGGAATGAAAGATAAATTTGTGCGCTGGGATAACCCGCCAACGCTTGACGGCATGACCGGACACGCCGGATGCCTGCCGAACTGCGATTGCTGGCCAGAAGTGCAGATCCCTGAGTTTAGAAAATAATGCATTAGTAACAACTCGATAGGGATAGCAAATGAAGCAAAACATACATAAGCTTAACGGGTTAGAATTCACCCATGAACGCGATTTTATCAACGGGCAATGGGTATATTCTTGGTATTTTAGACCCCTAGAGCAATCTGAATGGTGCCCATTCTCGCTGCCAACGGGGAAAACAAGAAAATCTGATATTGAGAATTTCTTAAAAAATTGTGAAGAAGCTACCAATTTCTATTTAGAGTGGTTAAGAAATGCCTCTGATGTGGAAGGTGCTGAACGCTATTTATTATCCGCAAAACAAGCTTGGGAAAGGATATCTAGCCCGGACTGGGGAGGGCGTGGAAATAATCCAAACAAGGATGCCAGGAGGGTGCAACAAGCAAGAGAAACTCTCGAATCGGCAAAAGTAAAGCTTGAGAAAGCCAAGATATTACGAGAAAGATTAAATAGTAACTAACAACGAGGCCGCTATTCAGCGGCCTTTTCTTTGCCCGCCATTCAGCAGGTAACCCATGAAATATTTCTTTAAAACCCGCCTGGGTAATACTCGCTTTCAACTTACTGATGGGTCAGTCCTGTTTAAGGACGTCCCGATCGCAAGGACTGGTGAGCTGGAGTACGACGCCACAGAGCGGCCTGAGCTTGTCCCAAACGACAGAGGGAAGGTCATCGTACGCCGGACGCCAGAAGAGGTGTTCAGCGAGCGAGCCATGGCGTCATTCGAAGGAATGGCAGTCACTATCGGCCATCCGCGAGATTTTGACGGGCAGATCATCTTTGTTACCCCTGATAACTGGCGCCAGCTGGCTCACGGGCACATCCAGAACGTACGACGTGGCACGGACGATAAAACCGATCTGCTGCTGGCTGATGTCATCGTCAAAACCCCGGAAGCCCTGCAGGCAATTGATGATGGTGATGACGAGGTCAGCTGCGGGTACGACGCCGATTACGAACAGATTTCACCTGGTCTCGCAAAGCAATCTGCGATTACCGCTAACCATCTGGCCCTTGTCCCTAACGGGCGGGCCGGTTTCCGTTGTGCAATAGGGGATTCTATGCCAAGCACTACTAAAAACTGGTTTACCCGGCTCCTGAAGGCCCGTAAAACCGGGGACGCTGCCGAAATGGCAAGTCTCATTGATAACCCGCCTGATGATGTCACGGGCGATAACGACGTATCGACCTCTATGACACCTGGCGGAGTGGTCATCAACCTTGCGCCGCAAAATCCGCTTCCCGGCCCGGCATTGCCTGGAACCGGCGATGCCGAGGAAGAAATTCCTGCATGGGGTAAGGCGCTGATTGAGGCGGTGGCCAAACTCACGCCTGCGGCAACTGCTCCTGGAACCGGCGATGCCGAGGACGAAGAGGAGAAAAAGGAAGAAGAGGGTAAGGTTACCGGCGACGCCGCTTACCGTGCCGATCTGATTCAGCCAGGCATCCAGTTGCCAGAAAAGGCGAAGCCGACAGCATTCAAGCGTCAGGTGCTCGCCTCTGCCGATCAATCTCTGGTGCGCTCTATTGTCGGTGATGCCGATATCAGCAAGCTGAAAAAAGCCACGGTGGATATGGCTTTCACAGCTGTTTCTGAGCTGGCTAAAAACCGCAATACCAAAACCGTCGACAGCCTGCAAACGCAGACTGCCACCACTGTTAAAACCATTGCCGGTATGAATCAGGCCGCGCAGGAATTCTGGTCTAAACGAGGCTAACCAATGGGTAATACATTTCTTTACCGGATGCCTGCGGGCATCGCCGGGGCAATTTCTCGTCCGCAGGATCTGACGGTTGAACCTCAACTGCTGGACTCCTCCAACCTTTTTCCCGCTTACGGCCTTGGCGGCAAGATTTCCTCCGGGAAATTTGTGCCAATCGCTGCGAGCGATACAGCGTCGGTGCTGGTGGGCATTTACGTTCGTCCGTATCCGACCGCCAGCCAGCCGGATAAAGTCCAGCAGGTAGGCAGCGGTAAAAACTTCACCGGCGATTGCCTGGTCCGTGGTTACGTCACGGTAAACATCGGCGCGGATGCATCCAGCGTTGCGCTGCATGGCCCGGTCTACATGCGAGTGGCCACACCATCCGCCTCAAGCCCTCTCGGCGCGTTCCTTGCCGCCGCTGATGGCTCGAATACCGTCCAGATCACTAACGCTTACTTCAATGGCCCTGGCGACACCAGCGGCAACATTGAGCTGGCCTTCAATATTTAAGGAAATCGCAAATGCCAATGACATTTGACCAGGCGACAGTCGACGGCACTGGTGCCTTTCTTGTCCATGAGCTGGAGCGTCTCGATCAGACACTGAATCTGCCGCTGGTGAATTTCACCTGGTCGCGCGATATCCAGTTGCGTGAAGACGTGTCTATTGCTGACGAGATCAGCTCTTTCACTAACACCACCTTTGCCGCTGCCGGTACACCGAACGCCAACGGTAAAAACTGGCTGAGCAAAATCCCTACCGCGCTGGCTGGCGTTAACGTCGACATCGCAAAAACTGGCTTCCCGCTTACCCTTTGGGGTATGGAGCTGGGATGGACCGTTCCCGAATTGCAGGCAGCTGCGCAGGTTGGTCGCCCGATCGACACACAGAAGTACGACGGCATGCAGCTGAAGTGGAACATGGACACAGACGAGCAGGTTTATATCGGCGATTCCGGTCTGGACGTTAAAGGCCTGCTGAACCTGACGCAGGTAACGCCGACCAACGCCGCGAAGACCTGGGCGACCTCCACCGCTGACGAAATCCGGGCGAGCATTAATGCCGGGTTGAGTGCTGCGTGGGCCAACTCGGCTTACTCCATGGTACCGACGGACCTGCTGATCCCGCCGGAGCAGTTCTCTCTGCTGGCAAGCACCATCGTATCCAGCGCTGGTAACCAGTCCCTGCTGACCTATCTGGAAACCAACACCATCGCATACCACCAGAACGGGCGTCCTCTGAACATCCGTCCGGTGAAATGGGCGAAAGGTCGTGGCGTGTCGAACTCTGATCGCATGATGTTCTACACCAACGACAAGAAATACGTTCGCTTCCCGATGGTTCCGCTGATGAGCGTGCCGATCCAGTATCGCGGCCTGTATCAGCTCGTAACCTATTACGGCAAGCTCGGTGCAGTAGAGCCGGTTTATCCGGAAACTCTGGCCTACGTCGACGGCATCTAACCTGCGGCGGCCCGAAAGGGCCGCTCATGAGGACTTGCAATGAAAAAGATTTACGTACTCTCCCCGTTTAACTTCAACGACGGCAAAGAGCAAAAGCATTTCCCGGTTGGCTTCCACGACGTTGATGACACGGTTGCTGATCACTGGTTCGTAAAAGCGCACTGTTCTCCGGATGGCGAAGCGCCAGCGGTCGCAGAAGACCCGCGCATTGCTGAGCTGGAAGCAAAAATCGCCGAGAAAGACGCGCGTATTGCTGAACTCGAAGCGCAATTGCCGGAGACTACCAATAATGGCAAGAAATCAAAGTCTGCCGACGCCTGAGCAGTTCAGGGCAACCTTTCCGCAGTTCGCTGACGATACAAAGTACCCCACGCCAATGATCCAGACTCGACTGAATCTTGCTGATGCCATGTTGAGTGAGTCGCGCTTTGGCGTGGATATCTTTCCCTACATCGTCGGGCTGTATGTTGCGCACTACATGTACCTTTACGCTGCCGATATGCGTGGTATGGCTGTGGGTACTGCTGGTGGTGTAAATAGCGGCATACAGACCGCGAAATCAGTGGATAAGGTTTCAGCCAGTTATGACGCAAGCGCAACCCTGGACCCTAATGCCGGTTTCTGGAACAACACCCGTTACGGATCGGAGTTCTGGGAATACCTGATGATGTTTGGTGCCGGAGCGGTTCAACTGGGGACGCCGGAATGAAAAGCGGGCTCACAATTCGGGAAGACAATTACAGTGTCGTTCTGGATGCGCTGAAACAACTGTCAGGCACTGATGTGCTGGTTGGTATCCCGGCAGGTCCTCCGCGTGATGATGCGCCGCTGAGCAACGCTGAGCTGGGGTATCTCCAGTCCACCGGGGCAACCGTAGAGATAGACGGGGAGACCGTTACTCTGCCGCCAAGACCATTTCTGGACATGGGCATTGAGGATTCCCGGGATAAAACGACCGAGCGTTTAAAGCTGGCCGCTCAGTCTGCGCTTGAAGGTAAGGCAGATGTGGCGTCGATGCATCTTGAAGCCGCAGGCCAGATTGCGCGTGATGCCTCAAAGGCTGTCATTGAGGCAGGCGATCGTCTGACCCCACTATCTGAAAAGACCATCAAGAAGCGCAGAGAAATGAAACCGCCCATCCTCGGCGATAAGCCGTTACGTGCCCGCGGATTCCTTTTCAGAGCGATTCAGTATGTCGTGAGGAAAAAATAATGCCGTTTCTCGATGTGACTGATGTTCTGCTTGATCCGGACTTTGTCGACCTGTCCCTGGTGTGTTATCGACAGGTGCAGACGGTGGACGAAGATAATTTTCCGACCAATACCGCGCAGGCTATTCCGTTCTCTGGTGTCGTAACCGTCGATCGCTCGCTTGAGGCTAAGCGTATGGCCGCCGGACAGAACATCAACGGGGCCATTCTCATCGTGACGCAGTTCAGGCTTACTCAGGGGCAACCCGGATTAGATGCCGATATCGTAACCTACCGCGGGCGAGATTATCGTGTGACGTTTGTCGACCCGTATACAGCGTACGGTGCCGGGTTCGTTCAGGCGCATTGCGAGCTGCTGGAATTTGACGGGGGAACGCCGATTGAGTAACGACAGCACAACGGCGGGATATCTGACCCCCGTCGGTGATTCACCGCCCTACGATGAGGATCTGGAACGGCTAATCAGCCGCTGGATACGGGGTGTGACAGGGCTGGCTGCCACGCTGGTTTACCCACGCTGGACTGACCCGCAAAAGCAGATACCCAAAAACGGCACCACCTGGTGCGCGTTCGGTATCACCGGCATTCAGGAGGACTTCAACCCGGCGTACGTGCAGGGCGAAGAGAACACCGAACAGTGGTCGCATGAGACCGTGAGCCTGATCTTGTGCTTCTATGGCCCGCAGGGGCTGGCAATGGCCACGCGCTTTCGTGACGGTCTGCTGGTCTCGCAGAACAATGACGAGCTCAACCGCTCAGGCCTGACATTTCTGCAGCATGGGCGGATCCTCAATCTGCCCGAACTCATCAATAACCAGTGGGTGCGCCGGTACGATATCAGCGTTGACCTGCGCCGCAAAATCATCCGCCAGTACGGCATTCAATCGCTGGTCGACGCGCCAGTGCAATTTTTTGGAGATTAAAACATGGCACAGGGCTTACCTGTTTCCAATGTCGTTAACGTTGACGTCATCATGTCACCGGTAGCGGCAACGGGGCGAAACTTCGGTGCGCTCCTCATTCTGGGAACCTCTACCGTTATTCCGGTTACCGAGCGCATTCGCCAGTATTCGGCCATTGAAGATAAACCAAAGTTCTTTGCACACGTTGACCTGTCCACCAGGCCACTGAGTGATGTTTCCGATGCAATGTCACGGCTAATACCCGATTTTGATATTGATACAGCCGTAGGCGTGCAACTCGACGTTGTGGGCGAATGGGTTGGTCGTTCCCGTCGCGTGGCTACACCGGTAACCGGGATTTATTTTTCGTGGGACACCGAGCGGGTTGGCTGGGATCAGGGGGTTTGGCAGGGACCATATGACCCAAACGACGGTTTTATCGATCTAAGCGATGAAATATATCGGCTAATGCTGAAGGTGAAAGTGGCGATAAACAACTGGGATGGTCAGAACGATTCGCTGCCTTCAATTCTTGATGCCGCCCTTGCCGGGTCTGGAATCCGCATGGCTATTGTCGACAACCAGGATATGTCGATTTCTATCTGGATACTCGGTGACCCATCGGTAGCTCTAAGTGAAATAGACCGGTTAATTCTGGATAGCGCCGTCAATAAAGGCCCCTTTATCGCATTACCGGCAGGTTACGTACCATCGCGCTATGACATTAACCCGATTGACCAGGTTAACAGCGAGCTATGGTGGGCGATTCAGAACGGTTATATGACGGTTAAAGCCGCAGGGGTTCGTGTCCGTGAAATAGAGACCGTCAGTGATGGTTATCAGTTTTTTGGCTTCGATATCGAAAATGACTATATCGCTGGCTTCGACCGCGGGTCATGGGGAGAGAGATTTTAATGGCGACTAACGATTTTAAACCCTTCGCTACTGGTAGCGGGGCAAACGTATTATCACAAGCTGATTATGACGCACTGTCGGCCAGAACAACTGGCTTTTTAAGTGGCAAAGCGTCTTCGGCCCAGGTCAACAAAGCTTTAAGGCAGGCATCAACTATTGCGGCGGTTGTGGCGCAGTTTATTTCTGATAACAGCGGCGATGACACCCTTGATAACGGTAATTTACCTACCTTACTGGCTAGTCTCGAAAGCGCCCTCCTCAAGTCCTCCCCTGGGCGGTTACAGAATATCGTTAGCTTTACCGCAAATGGGACCTACACCCCATCGCCAGGAACCAAACATGTAAAAGTTATTGTTACTAGCGGTGGTGGCGGTGGTGGTGGTTGTCAGGGAACTTCAGGATCTGAATCAGTTTCAGGTGGCGGCGGCGGTGCTGGTGGTACGGCCATTGGTTATTTTGCTGTAACTGAATCCAGCTATGCAGTCACTGTAGGTGCTGGTGGTTCTGCTGGCGTTGGTGCTGTCCAGGGTGGAACAGGCGGAACCTCAATCATTAACGGCATTAGCGGATTGGGCGGAGATGGTGGTCAGAAATCAGGGATCACTACGCTGGCTGGTGGGAAAGGCGGTATTTCTATTGGTGGCTCGGTTAACCTTCCCGGAGGTTACGGCACTGATGGGCAAAATGGCTCCCTGATTATCCCCGGCAATGGCGGCTCGTCATATTGGGGCGGCGGTGGTCGTGGGGGCGCACGCGGGGGCGTGGCAGGGGATTGTTATGGTGCCGGTGGTGGTGGCGCATATGATGCTGCCATGTCTGGCAACTCCTACAATGGCGGACAGGGGAAAGCAGGGATTGTATATATTGAGGAATATTCTTAATGGTAAGCAAGTATGCAGTCCTGAAAGAATGCGTGGTAGAAAACATTATTGTTGCTGATGATAATTACTCTCCTGATGATTTCGAAGTCGTAAAATATAGCGACGAAACATTTTGCCAGCCAGGCATGTTGTACAATAAAGAGGATGGTTTATTTTATGATGATAAATAATCATCAAGAATAAATAACATTATCTACATATCTACACATTACCAATCAACCGGCTTATGCCGGTTTTTTATTGGGGCGACCATGAGTGAATACGATACCGGCAATCCTGTGCCGTCTGCATCAATACCTGATGCATGGGATAATATGCAGACTATTGACAGGTTCGCTAATAGTAGTGATGAAACTATTACCACGCGTACAGGTAAGCAGTTAGACACTCTGCATGGCATCAATGTAAAGTCTGATAACCAACTTAATGAACAACAAGATACCTTTGAATTATCTCAATCCGAAAGGGAATCCTCTTTCGAAGAAAAATCAAATGAATTTGAATCGCGCTTCTCCTCTCAGTTATCGGCGCAGGAATCAACATTTTCAGAATCTCAATCTGATAAAGAAAACCGCTTTCAGCAGTTCCTGAATACTTCAGGATACGTGTTCCTTGGCGATTATCAGGACGGCCCATTCCAGTTTAGTGCCCGTAACCAGTACATCCGTTACGACAATGAGTATTACCGCCTGAATGCTGCTACTGACGTCGGCTTTACGACCACCGGAACCGATGCGACCAGCTTTGCGAACGACGTTACTCACTTCGTTCTGATGGATGGTGATACGCTTCGCCAAGACCTGGGTTCAGGCGAAGGAGCAATGAAGGTTTACCGGAACGCCTCACCTCTGGCCAGAATCATTCGCTCCTCGATTTTTGAATACCTTACTGAAGCTGATCAGCAGGCGTTGCTCACAATTCCTGGCGTTAATGTTATCGCTGACTACGCGTTAAAAAAAGCTATAGCTGATGGAGTGATGGTACTGGATATTCCGTGGAATGTCGGTGCGTTAAATTTCGGGCTTGACCCCGCAATGCTTCCATTAGGTTTTCAATTTATAGGGTGGGGTTGCCGACGCCCATATACAATTGATGACGATAACAGTTTTCTGAATTGCGGAGTCGTCATCCGCGTAGCAGCTGGTGCAAGTTTTCCATTTTATTCAACAGGCAGGCATGTATTCCGGGATATTGTTTTTGATGGCCGAGATAAAACAACGTACCTTTTTTATTCGCCAAATACTGCAACCCAGTTCAACGGCACCCGACTTGAGGGGTGCGGATTTTATCGGTTTGCGATCGGGATTGGCTGGGCTTCAGGAGGAACAGCCAGGTACATCGGAACAATGAAAGCATATTTCTGCTCAATATCCGGAAACGGGGATGGAGTCAGGAATTTAATAGACTCCATGATGTTTGGTTGCACAATCAATGCTAATGATCGAGGAGTGGCCCTTACCGGTGGGGCAAACAATAACTTTTTTGGAGGATGCCGGAACGAATGGAACACCGGCGATAACTGGTATGCGTACCAGTCGGTGGAGAACCAGATTTTCGGCGAACTGTGCGACAGGGCCGGAAGGGGAGGTGTGGTCGCCGGGGCGAAATCCTCATGGATTTTAAACGGCGTTAACGTCCGGCGCAGTGGTGCTAATCAACCCGTGGGTAATGACTATTCCGCAAACTTTATTATTATTGATGACGGTAAAATTGAACTCTCAGGGGTAAGAACTGGTGTCGGTGCGAATGACAGCGGTGACGGAGGGACAATCTCGCCATCCTACAACGTATCGGCTCTTGGCTCTGGCGGGGGGACCTTGCAGGTTTCCGGAAGTGACATGACTGGTTTTGTTACTTCAGCAATTAACCAGAAGGCGACCACGTTAAATAAGTCGATAACCGGCAACCTTGGTATGGACGACGATGTAAATATTGGTATGACCCAGGTTGTTAAAGGCAGGCGAATTATTGGTTCACAGTCATCAGGTACGTTAGCAGGCTCTGCGGGCGCAACGTTATCGCTGACCAAGACCAACATATTCCAGAATTCTTTCGATACATATATTACCCGTTCAATCCTAATTGAATGTCGAATTGGTAGCCAGTCATTTGGTGACGATATTAAAATTCCCGTCAGATTCAGAAGGGAGAATCTTTATTATCTGGATATCCTGACCTCGGGAATTGTTGCCAGCTCTGCACGCATTGGGCTTTCAGGGACTGGCGTAACGGTATCATTGTCCATTAACAGCTCAACCGGTCTGGTTACTGTTCAGTTGACAAATGTTGATGGTCTGGAAAGAACCGTTAATGTATCAATGTTGCCCTCAATGTAGGAGTGAAAATGGAAGACGAAACAGAACTGACCGAACCGCCATTTGAAACCTGGTTCAGAGACGTGGTTGAACTGGTTAAAAATGGCGGACATTCAATGGATATTGTTGCCTATAAGGGGGAATGGGTTGATTATTTTTCGGAAGGATTAACACCAGAAGATGCGTTTATAAAAAGAATGGCTCTTTAAATAACCGACCGAAAATAAATGTCAGGGAGTAACGTATTGATAATTTTCGACAAAACGGCAGAGCATAGTGAAAACCGCCTGTGGATGATGGCCAACATGGGGAGGATTCTGTAATGGGCTTTCCATCACCCGCGACGGATTACACGGAACAGCGATTAACGGTTAACTCGATCTGCAATGTCGGTCCAAATACGCGCCTCTTCGAGCAATCTGGCGGTTACGTTGTGCTGGATGTCTCCCTGAAACCAAAGCAGGCCAGTCAGGTTCTGATCCAGCACGGCGGCGGGACGGAGCTTGCCACGCTGAGAGGAAGGTCGCTGATTACCGAAGATGGTGAAGCGATCGAGGGCGAGGCCCTGGACGATGTTACTGTCATCGGTGTAGTGACGTTTACTATCTGCGATGTGCGCCAGGACAATGCGGTTGTTTAG
Protein sequences of DBSCAN-SWA_3 >CP034966|1934259:1960503|1936267_1937527_-|QAS89788.1|DBSCAN-SWA MSKESYPTGVENHGKSLRIWFIFKGKRVRENLGVPDTAKNRKVAGELRTSVCFAIRMGTFDYAAQFPNSPNLKTFGIGKKDITVKDLSEKWLELKRMEICANAINRYESVVRSMLPRVGANKLVSSVTREELLYIRKDMLAGDKGLSVVTVNYYMTTIAGMFQFAVDNGYVSENPFNGIKPLKRARIEPDPLTRDEFVRFIDACKHQQTKNLWSIAVYTGLRHGELVSLAWEDIDLKAGTMTIRRNYTKLGDFTLPKTEAGTDRVVHLIKPAIDALRNQAEMTRLGRQYQIEVALREYGRTVIHDCTFVFNPQLVKKSGSVGYLYKADSVGDSWDAALKRSGLRHRKAYQSRHTYACWSLSAGANPSFIASQMGHASAQMVFNVYGAWMADSSSDQIAMLNQKLSDFAPSMPHGMVIGI >CP034966|1934259:1960503|1943442_1943982_+|QAS89803.1|DBSCAN-SWA MKADQTIEGILGKEGGYVDHPSDKGGPTRWGITQTTARAHGYTGDMRNLPRETAKQILLSDYWTGPRFDQVASLSTLLADELCDTGVNMGPSVAIKFFQRWLTALNMRGKLYPDLIPDGAIGPRTITALKGYLSARGKEGEQVLLRALNCSQGARYLELAEGREANEEFLYGWVKERVL >CP034966|1934259:1960503|1941505_1941868_+|QAS89800.1|DBSCAN-SWA MNQYRISLPWPPSNNRYYRHNRGRTHISAEGQAYRDSVARIIKDSMLDIGLATPLKIRIECHMPDRRRRDLDNLQKAAFDALTKSGFWLDDQQVDYYSVKRMPVVKGGRLELTITEMESA >CP034966|1934259:1960503|1945148_1945415_+|QAS89807.1|DBSCAN-SWA MNKFISSTFNNTVVELDGNHFEKCVFENCEIVYKGLQPFNLINCNFIACKWKLEGAASNTINFLKVMYKDMGEFGKKMVEATFENIKK >CP034966|1934259:1960503|1945584_1946013_+|QAS89808.1|DBSCAN-SWA MKKFLIDMIDKGLYYALLAALAFGSVTGQSNVLNVAAAAFWVVVFLGGVVGIITIFLAHGAEHITDEKSRQSVLESLRKIVRRKNVIARWWGWFCMMATIALLAYGGWVFTAVCYALSSLFVRFCISLARDKVEKQTVGVLA >CP034966|1934259:1960503|1953679_1953961_+|QAS89817.1|DBSCAN-SWA MKKIYVLSPFNFNDGKEQKHFPVGFHDVDDTVADHWFVKAHCSPDGEAPAVAEDPRIAELEAKIAEKDARIAELEAQLPETTNNGKKSKSADA >CP034966|1934259:1960503|1941864_1942005_+|QAS89801.1|DBSCAN-SWA MSRDVIERIRDRWQKLRLLRSRGTVLVDYKILRNFVRIYKRLGEAA >CP034966|1934259:1960503|1948421_1949873_+|QAS89811.1|DBSCAN-SWA MAKKTGRVATADSYDNFVARVGMQQPNQHAASTYRANYTSRNRLLIEWAYRSSWIIGAAVDSKADDMTKKGVRITSEIDPKRRGILESRFDELQLWDCINETLKWSRLYGGAVALILIEGQAPLTPLVLDKVGKGSFKGLAVLDRWMINPQLTRRIKALGPNLGKPEFYDIVTTAQGLPAWTVHHSRLIRMDGVKLPYQQKITENEWGMSIVERIFDRLTSYDSTSVGAAQLAYKAHLRTAKIKKLREIIAMGGKPFEALVKQMELVRQYQTNEGMSLFDSEDEFETHSYSFAGLSDLLSEFKEDIAGAVGIPLVRLFRQSPKGFSTGDADLANYYDDVGTLQERDLRPHIRLLFDVLHRSEFGEPLPQDFTFEFNPLWQMSDTDRSTVATNTTTALATAVRDLGMSPAAALTDLRELSDVTGIGASISDEDIQNAAKQWQETESETSPPPPIGGPVSEKPTGDSRPDKSNRHGFLRWFTGKR >CP034966|1934259:1960503|1952192_1952687_+|QAS89815.1|DBSCAN-SWA MGNTFLYRMPAGIAGAISRPQDLTVEPQLLDSSNLFPAYGLGGKISSGKFVPIAASDTASVLVGIYVRPYPTASQPDKVQQVGSGKNFTGDCLVRGYVTVNIGADASSVALHGPVYMRVATPSASSPLGAFLAAADGSNTVQITNAYFNGPGDTSGNIELAFNI >CP034966|1934259:1960503|1935043_1935439_-|QAS89786.1|DBSCAN-SWA MVSALYAVLSALLLMKFSFDVVRLRMQYRVAYGDGGFSELQSAIRIHGNAVEYIPIAIVLMLFMEMNGAETWMVHICGIVLLAGRLMHYYGFHHRLFRWRRSGMSATWCALLLMVLANLWYMPWELVFSLR >CP034966|1934259:1960503|1955908_1956667_+|QAS92282.1|DBSCAN-SWA MRQYSAIEDKPKFFAHVDLSTRPLSDVSDAMSRLIPDFDIDTAVGVQLDVVGEWVGRSRRVATPVTGIYFSWDTERVGWDQGVWQGPYDPNDGFIDLSDEIYRLMLKVKVAINNWDGQNDSLPSILDAALAGSGIRMAIVDNQDMSISIWILGDPSVALSEIDRLILDSAVNKGPFIALPAGYVPSRYDINPIDQVNSELWWAIQNGYMTVKAAGVRVREIETVSDGYQFFGFDIENDYIAGFDRGSWGERF >CP034966|1934259:1960503|1946071_1946848_+|QAS89809.1|DBSCAN-SWA MAKPDWEAIESAYRAGVLSLRDIGDKYGVTEGAIRKRAKKFDWVRNSGTQVRKNGTQSGTQKSKARTSEKPASAGRTQKSTQPKAEPPPDTKPIRGVRTDPPTNPFQPGNQQALKHGGYARRLLLKDEVIEDAKALTLEDELFRLRANNLVAAENIGRWLTKLDDAEGDQERKVLMENISAAEKAMMRNTVRIESIVGTLATVGKIFADTDYRKAATDKVSLEADRLRRDAGIDDGNGERDLNDFYSDIQTDAESGPA >CP034966|1934259:1960503|1943146_1943446_+|QAS92279.1|holin|DBSCAN-SWA MQDKESIAGMSWLVLLIIAGWGGLVRFLMDVKQGKAKWSWINAFAQIVVSAFTGVIGGLISIEGGLSIYMILATAGISGAMGSVALTYFWERITGVKAQ >CP034966|1934259:1960503|1939864_1940047_+|QAS89795.1|DBSCAN-SWA MTNKKMTPAEKLKASRKRYKKIWLQLDIANAKRFGEKEVLSVDTYRSPYETRKKRGRTAD >CP034966|1934259:1960503|1941009_1941216_+|QAS89798.1|DBSCAN-SWA MAMKYSWFQHPDCTTEQAEQLVSRYQARGIVTEKSLNADYLSWTVSARLPVCVRPEHTPRSLRQRIWG >CP034966|1934259:1960503|1950986_1952189_+|QAS89814.1|DBSCAN-SWA MKYFFKTRLGNTRFQLTDGSVLFKDVPIARTGELEYDATERPELVPNDRGKVIVRRTPEEVFSERAMASFEGMAVTIGHPRDFDGQIIFVTPDNWRQLAHGHIQNVRRGTDDKTDLLLADVIVKTPEALQAIDDGDDEVSCGYDADYEQISPGLAKQSAITANHLALVPNGRAGFRCAIGDSMPSTTKNWFTRLLKARKTGDAAEMASLIDNPPDDVTGDNDVSTSMTPGGVVINLAPQNPLPGPALPGTGDAEEEIPAWGKALIEAVAKLTPAATAPGTGDAEDEEEKKEEEGKVTGDAAYRADLIQPGIQLPEKAKPTAFKRQVLASADQSLVRSIVGDADISKLKKATVDMAFTAVSELAKNRNTKTVDSLQTQTATTVKTIAGMNQAAQEFWSKRG >CP034966|1934259:1960503|1937569_1937815_-|QAS89789.1|DBSCAN-SWA MQTIIQIEPNEWVSEDLLMAVTGMKRGTITRARKSSWLLGREYKHVSPEGDPKPTSECMYNRKAVDAWIQAQKQPLGDRAV >CP034966|1934259:1960503|1957799_1959884_+|QAS89822.1|DBSCAN-SWA MSEYDTGNPVPSASIPDAWDNMQTIDRFANSSDETITTRTGKQLDTLHGINVKSDNQLNEQQDTFELSQSERESSFEEKSNEFESRFSSQLSAQESTFSESQSDKENRFQQFLNTSGYVFLGDYQDGPFQFSARNQYIRYDNEYYRLNAATDVGFTTTGTDATSFANDVTHFVLMDGDTLRQDLGSGEGAMKVYRNASPLARIIRSSIFEYLTEADQQALLTIPGVNVIADYALKKAIADGVMVLDIPWNVGALNFGLDPAMLPLGFQFIGWGCRRPYTIDDDNSFLNCGVVIRVAAGASFPFYSTGRHVFRDIVFDGRDKTTYLFYSPNTATQFNGTRLEGCGFYRFAIGIGWASGGTARYIGTMKAYFCSISGNGDGVRNLIDSMMFGCTINANDRGVALTGGANNNFFGGCRNEWNTGDNWYAYQSVENQIFGELCDRAGRGGVVAGAKSSWILNGVNVRRSGANQPVGNDYSANFIIIDDGKIELSGVRTGVGANDSGDGGTISPSYNVSALGSGGGTLQVSGSDMTGFVTSAINQKATTLNKSITGNLGMDDDVNIGMTQVVKGRRIIGSQSSGTLAGSAGATLSLTKTNIFQNSFDTYITRSILIECRIGSQSFGDDIKIPVRFRRENLYYLDILTSGIVASSARIGLSGTGVTVSLSINSSTGLVTVQLTNVDGLERTVNVSMLPSM >CP034966|1934259:1960503|1954345_1954852_+|QAS89819.1|DBSCAN-SWA MKSGLTIREDNYSVVLDALKQLSGTDVLVGIPAGPPRDDAPLSNAELGYLQSTGATVEIDGETVTLPPRPFLDMGIEDSRDKTTERLKLAAQSALEGKADVASMHLEAAGQIARDASKAVIEAGDRLTPLSEKTIKKRREMKPPILGDKPLRARGFLFRAIQYVVRKK >CP034966|1934259:1960503|1944548_1944746_+|QAS92280.1|DBSCAN-SWA MLIGSCAKSLPAPVVVDTACSWVRIIYLTDHDIDVLDKQTKRDILAHNKAWQANCQKPTAVTIPK >CP034966|1934259:1960503|1952698_1953640_+|QAS89816.1|DBSCAN-SWA MPMTFDQATVDGTGAFLVHELERLDQTLNLPLVNFTWSRDIQLREDVSIADEISSFTNTTFAAAGTPNANGKNWLSKIPTALAGVNVDIAKTGFPLTLWGMELGWTVPELQAAAQVGRPIDTQKYDGMQLKWNMDTDEQVYIGDSGLDVKGLLNLTQVTPTNAAKTWATSTADEIRASINAGLSAAWANSAYSMVPTDLLIPPEQFSLLASTIVSSAGNQSLLTYLETNTIAYHQNGRPLNIRPVKWAKGRGVSNSDRMMFYTNDKKYVRFPMVPLMSVPIQYRGLYQLVTYYGKLGAVEPVYPETLAYVDGI >CP034966|1934259:1960503|1959892_1960072_+|QAS89823.1|DBSCAN-SWA MEDETELTEPPFETWFRDVVELVKNGGHSMDIVAYKGEWVDYFSEGLTPEDAFIKRMAL >CP034966|1934259:1960503|1944843_1945152_+|QAS89806.1|DBSCAN-SWA MSAINHIKNPLTIIGIFAGIVEVSANLVLPFLNDSQQSTYLWFLMFFPAGLVIVFFLTLNFNHVALYAPSDYSNDKGFMQANGKMIGNDVKDTDATQGFELT >CP034966|1934259:1960503|1960185_1960503_+|QAS89824.1|DBSCAN-SWA MGFPSPATDYTEQRLTVNSICNVGPNTRLFEQSGGYVVLDVSLKPKQASQVLIQHGGGTELATLRGRSLITEDGEAIEGEALDDVTVIGVVTFTICDVRQDNAVV >CP034966|1934259:1960503|1942001_1942691_+|QAS89802.1|DBSCAN-SWA MTAQYLEFVRQQLIVATADLSGATKGQLVALAENAQFTATARSRGRKKVYSEVKQKMVNPDGPPMSGSQSRAKGSSIALVLPVEYSTASWRRALLSLEDHQKSWLLWNYSDNIRWEHQETITRWAWEQFSDKLAGVRIAKKTVDRLCQLIWLAAQDVKAELAGRETYEYQSLAELVGVAKSTWTETYLPHWLALRSSFVKLDSDALMAVTRSRSQQKASNLDVSLAKPN >CP034966|1934259:1960503|1938544_1938748_+|QAS89792.1|DBSCAN-SWA MVKTGQWITKNALRQWDKRRKESRRQKAVNEFYDAFELNSLEPGSTVRLATKGDLTIMMFRSEEADK >CP034966|1934259:1960503|1946798_1948199_+|QAS89810.1|terminase|DBSCAN-SWA MTSTLTSKPTLNPVLRSFWTTQARNKVLYGGRSSSKSWDAAGIAIFLSNKYSLRFCCARQIQNKIEESVYTLLKIQIDRFGLRHRFRILNNKIINRVTGSEFVFYGLWRNIEEIKSLEGISVLWLEEAHALTEYQWKILEPTIRKEGSECWFIFNPGLVTDFVWRNFVVDPPEDTLIRKINYDENPFLSDTMLKVIDAAKRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKVLNFEPSGRKRIGFDVADSGADKCANVYRHGSVVYWADEWKAKEDELLKSCQRTYQAALERDADIVYDSIGVGASAGAKFSEINEDRKRENMNASRINYQRFNAGAGVNEPDYEYIGIPNKDFFANLKAQAWWLVADRFRNTFNAVKNGEQYPVDELISIDSSCPLLEKLKLELTTPHRDFDKNGRVMVESKKDLAKRDVPSPNVADAFIMAFAPTDTAMDIWEALGNS >CP034966|1934259:1960503|1938854_1939238_+|QAS89793.1|DBSCAN-SWA MTDITELALIAKIKKQTENFDTVVLKEWEALALVEALEKAQKRIAELDRKNCELDSLTQRWAVERAENADHIAELESHLESADKLQDSAFRSGLQHGFSLGQTDDQKGYEQSMAAYSPDAGIKVEDE >CP034966|1934259:1960503|1949928_1950477_+|QAS89812.1|DBSCAN-SWA MVGRKMFAQVEREEWNQWRSVSEEISAGLRDVISNTPVGMVAQDTVYRQIRYMKSLPLEAAGRVREIQERAIKAVIHGERPDQLYEMIMQSGDVAASRARMIARTEIGRATTALTQARALSVGSEGYWWRIKGAGTRPSHRGMKDKFVRWDNPPTLDGMTGHAGCLPNCDCWPEVQIPEFRK >CP034966|1934259:1960503|1956666_1957548_+|QAS89821.1|DBSCAN-SWA MATNDFKPFATGSGANVLSQADYDALSARTTGFLSGKASSAQVNKALRQASTIAAVVAQFISDNSGDDTLDNGNLPTLLASLESALLKSSPGRLQNIVSFTANGTYTPSPGTKHVKVIVTSGGGGGGGCQGTSGSESVSGGGGGAGGTAIGYFAVTESSYAVTVGAGGSAGVGAVQGGTGGTSIINGISGLGGDGGQKSGITTLAGGKGGISIGGSVNLPGGYGTDGQNGSLIIPGNGGSSYWGGGGRGGARGGVAGDCYGAGGGGAYDAAMSGNSYNGGQGKAGIVYIEEYS >CP034966|1934259:1960503|1944322_1944598_+|QAS89805.1|DBSCAN-SWA MSFEIISGLVVVILGAIAGAFGIGHARGTSKAQAKADQQRTEENAAATVAAAERKAEVTKEASDVQQTVSHMPDDDVDRELREKFTRPGSR >CP034966|1934259:1960503|1935491_1936271_-|QAS89787.1|DBSCAN-SWA MRMIKISKLARHARHFNCVEGNTTLYALPKPEVVLRWREQTTDDFRFCFKFPATISHQAALRHCDDLVTEFLTRMSPLAPRIGQYWLQLPATFGPRELPALWNFLDSLPGEFNYGVEVRHPQFFAKGEEEQTLNRGLHQRGVNRVILDSRPVHAACPHSEAIRDAQRKKPKVPVHAVLTATNPLIRFIGSDDMTQNRELFQVWLQKLAQWHQTTTPYLFLHTPDIAQAPELVHTLWEDLRKTLPEIGAVPAIPQQSSLF >CP034966|1934259:1960503|1943978_1944326_+|QAS89804.1|DBSCAN-SWA MKMIIFALLVVVAVLVLLLLRKYTRLEFVAHASLLLKTWSVKLGAIGALVGMWAQSFPDAALHAWAMLPPDIKNILPPNIVALISPALVVLAVLSQYVRQPALKAKAEELKEPQQ >CP034966|1934259:1960503|1934259_1935003_-|QAS89785.1|DBSCAN-SWA MSHRDTLFSAPIARLGDWTFDERVAEVFPDMIQRSVPGYSNIISMIGMLAERFVQPGTQVYDLGCSLGAATLSVRRNIHHDNCKIIAIDNSPAMIERCRRHIDAYKAPTPVDVIEGDIRDIAIENASMVVLNFTLQFLDPSERQALLDKIYQGLNPGGALVLSEKFSFEDAKVGELLFNMHHDFKRANGYSELEISQKRSMLENVMLTDSVETHKARLHNAGFEHSELWFQCFNFGSLVALKAEDAA >CP034966|1934259:1960503|1939630_1939828_+|QAS89794.1|DBSCAN-SWA MLYTVTFNETERKEDIELSADVRAGDLLSLTLDGVKDDYTVMTVGGPIIGNTCVPSIIRVKKAQK >CP034966|1934259:1960503|1940043_1940307_+|QAS89796.1|DBSCAN-SWA MTTSSWNIAAKSKDEQDKVNVDLAASGVAYKERLNMPVVAEVVARQQPEHLRDYFMERVRYYREQSIQLPRASDPRYLEMAEQNAKK >CP034966|1934259:1960503|1950508_1950931_+|QAS89813.1|DBSCAN-SWA MKQNIHKLNGLEFTHERDFINGQWVYSWYFRPLEQSEWCPFSLPTGKTRKSDIENFLKNCEEATNFYLEWLRNASDVEGAERYLLSAKQAWERISSPDWGGRGNNPNKDARRVQQARETLESAKVKLEKAKILRERLNSN >CP034966|1934259:1960503|1937974_1938193_-|QAS89790.1|DBSCAN-SWA MSAEIIDQANELAELQREAAIAKCRINHSAVSATHCRDCGEEIAARRRELVAGCQRCADCQEEFEERGKHLR >CP034966|1934259:1960503|1953929_1954349_+|QAS89818.1|DBSCAN-SWA MARNQSLPTPEQFRATFPQFADDTKYPTPMIQTRLNLADAMLSESRFGVDIFPYIVGLYVAHYMYLYAADMRGMAVGTAGGVNSGIQTAKSVDKVSASYDASATLDPNAGFWNNTRYGSEFWEYLMMFGAGAVQLGTPE >CP034966|1934259:1960503|1955332_1955773_+|QAS92281.1|DBSCAN-SWA MTGLAATLVYPRWTDPQKQIPKNGTTWCAFGITGIQEDFNPAYVQGEENTEQWSHETVSLILCFYGPQGLAMATRFRDGLLVSQNNDELNRSGLTFLQHGRILNLPELINNQWVRRYDISVDLRRKIIRQYGIQSLVDAPVQFFGD >CP034966|1934259:1960503|1941218_1941509_+|QAS89799.1|DBSCAN-SWA MANLRKAARGRECQVRIPGVCNGNPETTVLAHIRIAGLCGTGIKPPDLLAAIACSSCHDEIDRRTRLVDAEYAKECALEGMARTQVIWMKEGLIKA >CP034966|1934259:1960503|1954851_1955238_+|QAS89820.1|head,tail|DBSCAN-SWA MPFLDVTDVLLDPDFVDLSLVCYRQVQTVDEDNFPTNTAQAIPFSGVVTVDRSLEAKRMAAGQNINGAILIVTQFRLTQGQPGLDADIVTYRGRDYRVTFVDPYTAYGAGFVQAHCELLEFDGGTPIE >CP034966|1934259:1960503|1938189_1938390_-|QAS89791.1|DBSCAN-SWA MSKYPRVGSVAAKSKNTSAKCKCGAVAKFKTTVEVNVFRGDDEVIWSCNEHKKDCSFLVDWQGGAA >CP034966|1934259:1960503|1940413_1941010_+|QAS89797.1|DBSCAN-SWA MAQTLQFEKSYQNVLIPAEPGTSEYLQLIPVGQLLCGEFRKPRNYAFHKKFFKLLTLGYHYWTPSGGLIEPAERTLISGFIDFLSSDLDQRAALQNAAEMYLSSVGISRSRDMALLKHFESFREWATIQAGFYDEYQMPDGSRRRVAKSISFASMDDSQFNGVYKSVLNVLWNYILRRKFHSPSEAENAASQLLSFAG |
44 | Klebsiella_phage(21.21%) | terminase,tail,holin,head | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2243763 : 2272366
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP034966|2243763:2272366|DBSCAN-SWA GTTATATTTTTTCGATCTCGACCAGATTAGTGTGCTGCGGGTTTCCCTTCGCCAGTGGTGAAGGGCGCAGAGTGGTTAGCGTATTCACACAGCCGCCATGGTCGATTTTATCGCCAGACATATTGGCCTCGTGCCAGGCTCCCTGGCCCATAGCGCTAACCCCAGGGAGAATGCGTGGTGTCACTTTGGCTGGTAGCCGAACTTCGCCACGATGGTTAAACACCCGCACCATATCGCCGTTGGCAATCCCACGTTTCTGCGCATCTATAGGGTTGATCCACACCTCCTGACGGCAGGCAGCCTTCAGGAGATCAATATTGCCGTAGGTCGAGTGAGTACGGGATTTGTAATGGAAACCAAACAGTTGAAGTGGGAAGGTTCTACGTTCAGGGGAGTCCCAGCCTTCAAAGGTAGAGGCATAAACTGGCAGTGGGCTTATCACTTCATCTTTTTCCAGTTCCCAGGTACGGGCAATTTCCGCCAGCTTGCTGGAATAAATTTCAATCTTACCGGAAGGTGTTTTAAGTGGATTTGCCTCGGGGTCGTCACGAAATGCTTTGTAGGCGACAAAATGACCATTGGGATCTTTACGCTTATAGATACCCATTTTTTTCAGTTCGTCGTAAGACGGTAACGCCGGATCTTTGGCAAGCATTTTGGCGTACAGATGTTGTAACCATTGTTCCTGCGTGCGACCTTCAGTGAACTTTTGATAGACGTCAGGTCCAAGACGTTTCGCGACTTCACTCAGGATCCAGTAAATCGGTTTACGTTCGAATTTTTCGCTGGTGACTGGCTGGAGGAAAATGAGATATCCCATGTTACCGGCGTAGTCGTTAGGAATAATATCTTCCTGCTCAACGGTCATCAGGTCTGGCAGCAGAATGTCGGCATATTTTGCCGATGAGGTCATAAAGTTTTCGATGACCACAATCATTTCGCATTTCGATTCGTCTTGCAGAATTTCATGCGTTTTGTTGATGTCAGAATGCTGATTAACGAGGGTATTTCCCGCGTAGTTCCAGATGAACTTAATGGGCACATCCAGTTTATCTTTGCCGCGGACGCCGTCGCGGATTGCCGTCATTTGCGGACCATGATCGATAGCATCTGTCCAGCTGAAGCAGGAGATTGACGTTTTGACCGGATTATCCAGCACCGGCAGGCGTTCTATGGTAATGGTATAGGTCGATTCACGCGCGCCACTATTTCCGCCGCTGATGCCGACATTGCCCGTCAAAATAGGTAACATAGCAATAGCGCGTGCAGTCAGTTCGCCGTTTGCCTGGCGTTGCGGCCCCCAGCCCTGGCAGATATAAGCGGGTTTTGCTGTGCCAATTTCACGCGCCAGTTTGATGATACGGTCTACCGGGATACCGGTAATTTGCGAAGCCCACTGCGGCGTTTTCGCTGTGTTATCATCACCTTCACCAAGAATATAGGCTTTATAGTGACCATTTTTGGGTGCATCTGCGGGTAAGGTTTTTTCGTCATAGCCGACGCAGTATTTATCGAGAAAAGGTTGATCAACGAGATTTTCGTTAATCAATACCCAGGCAATACCCGCAACCAGCGCGGCATCGGTGCCCGGGCGAATAGGGAGCCATTCATCTTCACGACCAGCAGCCGTATCGGTATATCGCGGATCGATAACAATCATTTTGGCGTTCGATTTCTCGCGCGCTTTTTCAAGAAGATAAGTGATGCCACCGCCGCTCATGCGGGTTTCTGCCGGGTTGTTACCAAACATCACGACCAGCTTGCTGTTTTCAATATCCGTGGTGCTGTTGCCATCATTACTGCCGTAGGTGTAGGGCATGGCACAGGAAATTTGCGCGGTGCTGTAGGAGCCATACTGATTGAGTGAACCGCCGTAGCAGTTCATCAGGCGTTTGACCGCCGAGGCTGATGGCGAAGAGCGGGTCATATTGCCGCCAACAATCCCCGAAGAGTACTGAATATATACAGCCTCATTGCCATATTGTTCGACGGTTTTTTTCAGGCTACTGGCGATAGTATCCAGGGCTTCATCCCAGCTAATCCGTTCGAATTTGCCTTCACCGCGTGTACCCACGCGTTTCATTGGGTAATTCAAGCGATCGGGATGATTAATACGCCGGCGGATGGAGCGACCGCGCAAACAGGCGCGTACCTGATGGTTGCCGTACTCATCGCTGCCGGTATTGTCAGTTTCCACCCAGGTCACTTCATTATCTTTAACATGTAGACGAAGTGCACAGCGGCTACCACAGTTGACGGAACAGGCACCCCAGATCACTTTTTCGCTGGCCTGTTGTACCGTTGCCGCTGCACTGCGCAGGGTGAACGGTAAAGAAAAACCGCCTGCAGCCAGCGCCAGAGAACCTATCGCGGTTGATTTAACGAGTGTTCTGCGGCTGATGCCCACCATTCGGTCATTTTTGGACATAACTCACTCCCTGTTCTTTATCGTTATATAAATGTTTATATATTGAATATTTAGCGCGCTAACAATAGAGGGAGTCTACCCATTTTGGGTTAAGAATTATTAATCCATATCAATAGAAGGGTATGAGTAATAAGGTGGGATTATGTTGTATGTTCAAATCGCCGGATGTGTCGTATCCGGCGTTCAGTCGATAATGTATTACTGCGGTTCGGCAGGCGCGCCATCCTGGGTAGACTGCGCGGGAGCAGAGACGTTACCGCTGGTGGTGCGGGTATAGAGAATTTTATGCGTATCATTAGCGCAATGGCCGACGACCTGGGAATCAGGCTGATCAACCTGGTCATTGGGTACAATACTTAACGTGAAGCTGCTTTCGGGTACGCCATTATTGATAATGCGCTGTGATATATCGCTCTGTATGCGCTCACAGGATCCCGGCGCGGCGAGTACCACGGGTGAGGCGAGGGCGAGCAGAAGCGCGGCACAGCAGGTTGAGAGTTTCATCATAAGCTCCTTACGCGAAGATAACTTCTTTAAGCATAGCATTTAACGTGTAAAGTACTGTATTTGCTACTATGATTGAGAATCATCTCTACTCTCTGGTGACTGTTGTGAAATACAAATTACTACCATGCTTACTCGCGATATTCCTCACAGGATGTGACCGCACAGAGGTAACACTTTCATTTACCCCTGAGATGGCCAGTTTCTCTAATGAATTCGATTTTGATCCGCTGCGTGGTCCGGTAAAAGATTTCACTCAGACATTAATGGATGAGCAAGGTGAAGTGACGAAACGTGTTTCTGGGACTTTGTCGGAAGAAGGCTGTTTTGATTCACTCGAATTACTGGATCTGGAAAATAATACCGTGGTCGCTCTGGTACTGGACGCCAATTATTACCGTGATGCCGAGACGCTGGAGAAGAGAGTACGTTTACAGGGAAAATGCCAGCTAGCAGAATTACCTTCTGCCGGGGTGAGTTGGGAAACCGATGATAATGGCTTCGTGATTAAAGCCAGCAGCAAACAAATGCAGATGGAATATCGCTATGATGATCAGGGTTATCCGCTGGGTAAAACCACGAAAAGTAACGACAAAACATTATCTGTCAGCGCCACGCCATCAACGGATCCGATCAAAAAATTAGATTACACAGCGGTTACTTTACTGAATAATCAACGGGTTGGTAATGTAAAACAGAGCTGTGAATATGACAGTCACGCTAATCCGGTGGACTGTCAGCTAATCATTGTTGATGAAGGAGTAAAACCCGCCGTCGAACGGGTTTACACCATCAAAAATACGATCGATTATTATTAATGCTATTGTGCGGTCGGCTTCAGGAGAGTCTGACCCGGTGTTTTGTGCTCTGCCAGATACTGATGCTGGAATATACACATGCGAATGGCATTACGATATTGACCATTAATAAAGAACTCGTGCATCAATTCACCTTCAACCGAAAAGCCAAGCTTGCGGTAAATGTGAATCGCTTTTTCATTCTCTTTATCAACGATCAGATACAGCTTATAGAGATTGAGAACGGTAAAGCCATAGTCCATTGCTAATTTGGCGGCACGGGTTGCCAGACCTTTCCCCTGATACTCCGGGGAGATAATTATCTGAAATTCTGCGCGGCGATGAACATGGTTAATTTCCACCAGCTCCACCAGACCGGCTTTTTCGCCGTCACATTCCACCACAAAGCGCCGTTCGCTCTGATCGTGAATATGCTTATCATACAGATCAGAGAGTTCAACAAAGGCTTCGTAGGGTTCCTCAAACCAGTAACGCATCACACTGGCGTTGTTGTCGAGTTGATGTACATAGCGTAAATCTTCACGCTCCAGCGGGCGTAGCTTAACACTGTGGGCGCTTGGCATAACGTGTCCTTACATTCCTTAAATCAATAACAGGTTAGGGGGTAATAACGCGGCCAGTTCGACGGTCCAGGCAGCGCAAAGTATTGGGCTCCCAGTAGGCATTGATGTTGGCGCTTTGCTCACATTTATCGCGGTTATCAAAAGCGGCGTCGGCTTTATCCCACTCTTTTTCAGTGCGTTTATTCACTTTCTGGCGCAGATTGCGCGTGTCATTCCATTGCTCTTTTTCCATAGCGGCGTGCTGGCGGCTTTGTGCACTGTCGCCAGACTCAATCACCAGTTTGTTAGTTTCGGCATGAACAGTTGTGCTCAATGCCAGTGCGCAAGGCAGCAGAATAGCGAGCAGGCCGATTCGTTTGCTGAGAGTGATTTTCATAATTCATTCCCTGTATGAATGATTAAAGGTGATTCTACACCATCCACTGCGGACGCAAAACGTACCAGGAGGGTGTTTATATTGATGATATTATGTCGCCCTATAACTATACATGATGTCAATAAGAGACAAAGATGATTAAAACAACGTTACTATTTTTTGCTACTGCGCTGTGTGAAATTATTGGATGCTTTCTGCCCTGGTTGTGGTTAAAACGAAACGCCAGTATCTGGCTGTTGCTTCCGGCGGGGATTTCACTGGCGCTGTTTGTCTGGTTGTTAACGTTGCATCCAGCGGCGAGTGGGCGTGTTTACGCGGCTTATGGTGGCGTTTATGTCTGCACGGCGTTGATGTGGCTGCGCGTTGTGGATGGCGTGAAACTGACTCTTTATGACTGGACGGGTGCGTTGATTGCGCTTTGCGGCATGTTGATCATTGTTGCGGGCTGGGGGCGCACGTAGGAACATAAATCCATTTTATCAATAAGATAAGAGGAAGTGTCAGCTGACAAAAGGTATTCTATTTCATCTTTTGTCAACCATTCACAGCGCAAATATACGCCTTTTTTTGTGATCACTCCGGCTTTTTTCGATCTTTATACTTGTATGGTAGTAGCTCAGTTGCGTAGATTTCATGCATCACGACAAGCGATGCAAGGAATCGAACATGAAGATCGTAAAGGCTGAAGTTTTTGTTACCTGTCCGGGGCGTAATTTCGTCACATTAAAAATCACCACTGAGGACGGTATTACGGGCCTTGGGGATGCCACCCTCAATGGACGTGAGCTTTCCGTGGCCTCTTATTTGCAGGATCACCTTTGTCCGCAGCTTATTGGTCGCGATGCGCACCGTATCGAAGATATCTGGCAGTTTTTCTATAAAGGTGCTTACTGGCGTCGCGGTCCGGTTACGATGTCGGCCATTTCAGCGGTTGATATGGCGCTGTGGGATATTAAAGCCAAAGCTGCCAACATGCCGCTTTACCAGTTACTCGGCGGCGCGTCTCGTGAAGGGGTGATGGTTTATTGCCATACCACCGGTCACAGTATTGATGAAGCTCTGGATGATTATGCCCGTCATCAGGAGCTGGGATTCAAAGCCATCCGCGTGCAGTGCGGAATCCCTGGTATGAAAACCACCTACGGCATGTCGAAAGGTAAAGGTCTGGCTTATGAACCCGCAACCAAAGGACAGTGGCCGGAAGAGCAGCTGTGGTCGACGGAGAAATACCTCGATTTCATGCCGAAATTGTTTGACGCGGTACGTAACAAGTTTGGTTTTAATGAACATTTGCTGCATGACATGCACCATCGCTTAACGCCTATTGAAGCGGCGCGCTTTGGTAAAAGCATTGAAGATTATCGCATGTTCTGGATGGAAGACCCGACGCCTGCAGAAAACCAGGAATGCTTCCGTCTCATTCGCCAACATACCGTCACACCCATCGCAGTGGGTGAAGTCTTCAACAGCATCTGGGACTGCAAACAACTGATTGAAGAGCAACTCATCGATTATATCCGCACCACGCTGACCCATGCAGGCGGAATTACCGGTATGCGCCGGATTGCCGATTTTGCTTCGCTGTATCAGGTACGTACTGGCTCACACGGTCCTTCCGATTTGTCACCAGTCTGCATGGCTGCGGCGCTGCACTTTGATCTGTGGGTCCCCAATTTCGGTGTCCAGGAATACATGGGTTATTCCGAACAAATGCTCGAAGTCTTCCCGCACAACTGGACTTTCGATAACGGCTATATGCATCCGGGAGACAAACCGGGTCTTGGCATCGAATTCGATGAAAAGCTGGCGGCGAAATATCCCTATGAACCTGCTTATCTACCAGTCGCACGTCTGGAAGATGGCACGCTGTGGAACTGGTAAGGAGTAAGATAATGAAAAGCATATTAATTGAAAAACCGAATCAACTGGCGATTGTCGAACGTGAAATACCCACCCCGTCAGCGGGTGAAGTACGAGTAAAAGTGAAACTTGCCGGAATTTGTGGTTCAGATAGCCATATTTATCGTGGGCATAATCCTTTTGCGAAATATCCGCGCGTCATTGGACATGAATTCTTTGGCGTCATTGATGCGGTGGGTGACGGCGTGGAAAGCGCCAGAGTCGGTGAACGTGTTGCTGTCGATCCGGTGGTCAGCTGTGGGCATTGCTATCCGTGCTCTATAGGTAAACCGAACGTTTGTACGACACTGGCTGTTTTAGGTGTGCACGCTGACGGTGGTTTCAGTGAATATGCCGTGGTTCCGGCAAAAAATGCGTGGAAAATTCCTGAAGCAGTGGCCGATCAATATGCGGTAATGATCGAACCTTTTACCATTGCGGCTAACGTAACCGGACATGGTCAACCGACTGAAAATGATACCGTTCTGGTTTATGGTGCCGGTCCAATCGGCCTGACGATCGTTCAGGTATTAAAAGGCGTCTATAACGTTAAAAATGTGATTGTTGCCGATCGCATTGATGAACGACTGGAAAAAGCGAAAGAGAGCGGGGCAGACTGGGCGATCAATAACAGCCAGACACCGCTTGGCGAGATTTTCGCTGAAAAAGGCATCAAGCCGACATTAATTATCGATGCGGCTTGTCATCCTTCTATCCTGAAAGAGGCCGTAACGCTGGCTTCTCCAGCGGCACGTATTGTATTGATGGGCTTCTCCAGTGAACCGTCTGAAGTGATTCAGCAAGGAATTACCGGAAAAGAACTCTCTATTTTCTCTTCACGCTTAAATGCAAATAAATTTCCGGTTGTTATCGACTGGTTAAGTAAAGGGTTAATTAAACCAGAAAAATTAATTACCCATACGTTTGATTTCCAGCATGTTGCTGATGCCATTAGTTTATTTGAACAGGATCAAAAACATTGCTGCAAAGTCTTACTCACTTTTTCTGAATAATACCAATAACGGCGAGTAAGTAGTACGCATCTTACCTCTTTTTTAGAGATAACCATTATGACAATAGAAAAACATGAAAGAAGCACTAAGGATTTGGTGAAAGCAGCAGTATCGGGATGGCTGGGCACTGCGCTTGAATTTATGGATTTCAAGAGTCATGCGTGTTAACTATTTGATAAATATTAAATTAATTTTTCATTGCTTCGTTATGGGGCATGGTTGGGGCAAACTCGCTTAACTGTGTATTTAACAAAGCTACCTGTGCATTATTGTTTTCAGACATCCATTTTCCGTATACCTGAAATACCATTTGCGCATCTGCATGGCCCATCTGGTTTGCTATAAATGCCGGGTTAGCACCAGCTGTCAGCGACCAGCAGGCATAAGTATGTCTCGACTGATATGATTTTCGATGGCGGAGTCCGGCACGTTTTATCGCTGCGTCCCACATCTGCCTTATTGAGTCAACGGTAAAATGGTCACCATAATTTTTTACTCTCGCTGACACTTCAGGTTGAAAAACAAAGGTGCATTTTTGTTTTTCTGTTCTGCCATACTCTCTGAGGTGAACATCAATGATATGCTCTTTGCTCAGTCTCGTTAATGTCATCTGACTCCGGAGAGCGTCGATTGCTGGCTTAATAAGATGAATGACCCGATTGGTTCCCGCCTGTGTTTTTGGTACCGTGAAACGGTCTTTTGCTAAATTTCTCCTGATCATCATTGTTCCATTTTTCAGATCTATGTCCTCCCATCCAAGTGCACACAGCTCACCAGGGCGAACGCCAGTATAAACAGAAACACACCATAAATTTTTTGCTTGCTGATTTCTGCACGCATCGATAAGACGGATAAATTCTTCCCGCGAAAGAGGATCCGGAATGGTTCTTGATTCCTTTAATGGCGAGATCCCCTTAAACGGATTATCTGCCAGGTAACCGTTATCAACACCAAACTGGAACACGGCGTTAAGATTTGTCATGTAATTATTTACAGTTACAGCCGATCTCCCTGGTTGTGTAACAATATAGTTACTTTTGGGGATCTGGTATCCAGTCAGTAACTCTTTACGAACCTCCAGTAATTTTTCTTTATTAATCGATGAGGCAAGATTTTTTTCACCGATTATGCTCAGGATATTTTTGATGACGGCACGGTATGTGTTGAGTGATGTTTTGGCGACTTCAGTTTCTTTCAGTGCCAGAAATTTTTCAGCCAGTTCTTTTATGGTTAAATCTTGTCGGGCCTCACCAAATTTTTCCAGATTGCGTGAGGAGGGAAACTGTTTTGCATAGTCGAAAACACCAGTTTTTATTGCGTAACAAACAGAGGAGCGTAGTTCACCTGCAACGCGCCTGTTTTTTGCTGTGTCAGGAACCCCCAGGTTTTCCCTGACTCTTACGCCTTTATAAACAAACCAGATACGTAATTTCCCTCCATGGTTTTCCACGCCTGTCGGATATTTCATTTCAACTTCTCTCATTAGTTAGTGTGGCTTTTAGTCAAGTAAGATGACGTCTTGGTCTCGCTGATGCCTGGCGCTCAATCCAGCGATCAATTTCTTCCAGGTTGTAAAAGCATGGACTGTTATCCCATGGCATACCGTCATGAGCGACATGCTTATATTCCCTTCCTTCCATAAACGATTTTTCCCGGGCCTTTTTTAACGTACCTTTTTTTATTCCTTTCAGCGCAATTAACTGCTCTTCGGATACCCATTTGCCGGGAGAGACAATCATGATTACTTCGCTCATCGATTTCTTTATCTCTTACATTAGACGAGCGCCGGTTGCAGAATACCAGTCACAACCGGCGACAGTTGAACATTAAGAATCAGCCTGACTCGGGATCAGTTTTTGCCAGATAACTGAAACGTATTTTGCCTGGTAACGGGCGTCATCAAGTGCATTATGGCGCTCACCTTCGAATTGAATAGCCGTTCTGGCATCGAAGTCTATGACTTTCCCCAGCTCAACGATTGTGCGTACATCGCGATCGTTGTAGTAACGCCACGGGCAGGGGATCCCCTGCCGTTCGTATGAACGGCGCAAAATCGTGTTGTCGAAGTTGGCTCCATTCCCCCAAACCTGAACAAAAAATTCACCGGAGTTTTCGTCGATAAATTCCCGCAATTGTAACAGTGCATCATCTAACGGGATTTCATCGGTCATAATGGCAGATTGCGCTTCGCGTGATTGCTTAAGCCACCATTTAATGGTGTCCCGATCAATGACTCCGCCAGCAGTTTCCAGATCGATAGTCTTACTAAATTCCGGTCCCATATCTCCGGTTTGCGGATCGAAAAATATTGCACCTATTGAGGTGATCGGGGCATCAGGATTTTTTCCCATGGTTTCAAGGTCGATCATTAGATGGTCACACGTCCTGCTGGTGGATGTGATTTCGTGATGACCGTTCACCTTAATTGGGTGATCTGCCGTCTCGCCAGTTTCATTATCGCTATTGTGATGCTGATTGCCGCCAGTGTTCTCCTTGTGTGGATGTTCAGCGCCTTCCATTTCCTCCGGATCATCTTCCTGAACTTCAACCTGATACTCTTCATCGAATGTTTCTTGGTATGTTGCGTCGCCCATCACCGCGCCACAATCAGGGCAGTTGCCGCCGCCGGTCTGACCGCAGGCGGTGCAGACTTTTTCCACTTCCTGTTGCGCTACTGGTTCAGGCTGTTTCGTTTCTGGCTCGTTTTGTAACGCATTTGGACTGTTTTGTTCCGCTTTTTGGTAGTTCCGTTCCGATTCATGCTGGTTCTGGTTCACAGAATCGCGGGTCTGGATCCCCTTAACCCATTTCGGATCATTCGGGTCACTAATCCCTTCAACAAATTCACCACGTGATGCAGCAAGCAACTTATCAGCGTCAGGCTGGCTGATATTGGCTGCCTGCATAATTTTGTTTACTTCGTCAGCGGTAACTTTTATCGGCTCTGGTTGTTCTGAATCTTCAGCGGTATCTACATTTTGCGGTAAGCCCGTGTATGTGCCATTTTTTCGGGCAAAATATTCTTCTTTTGTGATTTCAGTGGCGCCAGCAGCCAGTGCCTTATCCAGACCAGAAAGTTTGTTTGCGCGACCGTATTTTTCTCCGTCCTTATCTGCGAAGAGGAAATAGAACGGCCCCTCACGCTCTACAGATGGTTCAGCTTCCGGCGCGGTTTCATTTTTTGGGATATCAGATACCTCAGTTTCCACTGCATCAGTTTGTGTTTCTGATGACTGGAGAACATCAACAGTGCCCAGGTCTGTTTCTTCATTCTCAAACACGCCCTTTGTCGTCAGGTATTCGCAGATATATTTGTTCAGTGCTACGGGATCTTTGTGAATGTCGATCGGACGCTCACGGACAAGGCCAAAAATAGTTTGGCGGTCGTAGCGAAGGGCATCAGGCTGTTTGCGCATTGATGCCGAGATACGCTTCCAGTCTTCGCGGTCGTTGTCGATAACTTCTTTTTTTGCCCAGCGATGGATGCTGCCGTCAATGTTTCCGGCATCCACATCACCAGGCCAGAGAGCGTAGGCCAGTTCGTCATCCAGTGTTTTCCATGTCTGCTTGTATTCGCGATGAATGGCAGCAATGACCGGGCTGATTTTTCCTGTTGAATTTTCAGTGTACTGTTGATTGGCTCTGGCGCGTGCGAGATCAACAACAGACGTGTATTTTCCGGTTTCCTTGCGTTCACCTTCGCGACGTTTTTTCCAGATGCGCATCTCTGCCTGAATTTCGGGCCATTTAGCACCAGGAATACATTTATGCTTAACCCACCCAATGGCGTGCAACTTAAGCTCCGGATACATGGCGTTAACTTCTGGCATTTTCATCAACGCTTCAACGATATGTCCGTCGAATGTTGCCATGTCTTCCTGCAACAATTCCTGTGCGCTAATAACCATATCAACGGTGATGTTTTCACATGTGTCGAACTTAACCATGACAGCGTTCTGTACTTCAGGGGCCAGCTTGTCAAAAGTGACGTTCATCGGATCGGATTCAGTCTCAACCGGGACAAAAGAAGCAGACTCCTCATCCCAGCGGTTTTCCTGCATATATTCAGCATCCCATGAATCGAGGGCAGGGCGGGGTATGCCAGGTTTATCCTCGCAGACAATAAATTTATAAGCGCAGTCCTGAGCAGCCGGATAATGTTCCAGGAATTGCCAGTGAAATTTTGCTCGAGCACGGCGTTCGTCGCCAGCTTCAATGGCTGTGGCTACAGCCACAGCGCCTTCTTCCCTTGTTGCCAGTTCGTCAGGAATAGCGGCGCAAATAAAGACTTTACTCATTTTGTTTTAACCTCATGACAGATTTAAGGATGAACAAATCCCTGCCATTGCTGGCATATAAGAATGAAACCGGATATTTATTACGGAACTGTTTTAAAGACCTGCCGGGATTTCGATATTATCCTGGTTAATAACTTTATCGACCGGGTAACAGTTACCGGGAATTTTCTGTTCGGTTGCTGCAGTCATACACTCCTGCATTGTCCTGTGAACACTGACTGCAATATCAACTGGCACTCCGGAAACAAGAAAAACTGTCAGAACAAGCGCAAATGCTGAATTCATTGTGCACATCCTTTTGGCATCAGACGTAAACGAGCCAGCATTGAAACAATGCATATTTTATTTAATAGCTCCCGTTCTTGTTTTCTCTTGTTAATGGCATCTTCAGTAAATACAGGGTTACTGATAGTGACACCAATTTCAAAACAACCTTCAGACGTATTAACGTTTGGTAATAACGTTTTCATTATCGCGTCCTCAACAATGAATTTTGTGATGCAGTGCCTGGTGCCTCCAGGTGACGTTAACCAGTTAACAATTAACGTCGGATATCCGGATTAGTGATTTCAGGTTGTATCGTGAGATCAGTGATGGAAAAAGTATTACGTACATGATCGCCGGGTTAAATAAAGAATATGGCGATGTGGTGGAATCCGGACTGCTTTTTGCAGATCCTGCCGTTGTAGATCGTGAAACTGACGAACTTATAGAAAAAGCAATTGCTTTCAAGCTTGCGTATCGACAGCAATACCAACAAAAAGCTGGATGGAATTATGAGTCTTCTTTTTGCTGAACGCCCACTGGTTATAAACACGCAGCTGGCAATGAAAATTGGCTTAAACGAAGCCATTGTTTTGCAACAACTGCACTACTGGTTGAGAGATACCAACTCCGGCATGGAATGTGATGGTGTTCGCTGGATTTATAACACAACGGAACAATGGCTGGAACAGTTCCCATTCTGGTCAGAGTCAACGTTAAAGCGCGCGTTTGCAAGTCTGAAAACGCTGGGGCTTTTGCGTTGTGAAAAGCTCAATAAATCAAAGCGCGATATGACCAATTTCTACACGATCAACTACGGGAGCGAGCTTTTAGATGGTGGCAAATTGAGCGAATCCATCGGTTTAAAATGCGCCGCTCCATCAGGTCAAAATGACACGATGGAAGAGGTCAAAATGAAACGCTCCATTGGTTCAAAACGACTCAATGTCATCGGGTCAAAATGGCCTGATGATCTTACAGAGAATACAACAGAGATTACTACAGAGAATAAAAAGACTTCTCGTCCGGAAGCTTCGCAACCGGACCCGCAGACGGTTGAACAGGATTTTTTAACCCGACACCCTGACGCGGTTGTGTTCAGTGCGAAAAAACGCCAGTGGGGCAGCCAGGAAGATTTGGCGTGTGCGCAGTGGATCTGGGGGCGAATCGTGAGTCTTTACGAGCAGGCCGCCAGCGATGATGGCGAGATTTCGCGACCGAAAGAACCCAACTGGACCGCATGGGCCAACGACGTGCGCACAATGCGGATGCTGGATGGCAGAACTCACAGACAAATTTGTGAAATGTTTGGTCGGGTGCAGCGGGATCCATTCTGGGTAAAAAATATCATGAGTCCGTCAAAGCTTCGCGAAAAATGGGATGAACTGGTTATCCGCCTGGGGCGTTCGTCTGTACAGCGTTGTGTGAATCATATTTCTGAGCCGGATACCGAAATTCCGCCGGGCTTCAGGGGGTAAGTGTTAATTTCTGGTCATGAGGTAATTTTCAGGAGGGCTTGTGGCAAAAGTTTTTACACAAGAAGAGCGGGAAAAAATTAAAGGGCAGGTTGTTGAACTCGTACGCCGGAGTGGGCGTGAGACGTTACGGCAACTGGAAGTCAAGACAGGTGCGACAAGATATCTGATGAGCGTTCTCGCAAGAGAGCTGGTTGCCAGCGGCGATGTATACAACTCTGGTTACGGGTTATTCCCGTCTGAACAGGCGCGTAAGGACTGGCAAAATGCCCGTAAAAAGCTCTCAAGGGCAAAGCTGAAGGAACCATCTGCGGTTGATCCGGACCTTATCTGGTCATTACCTGACGGAGAAATACGTCGTTACGACAGGCGTCATAATATGATTTGTACTGAGTGTCGTAAAAGCGAAGTTATGCAGCGCATATTGTCGTTTTATCAGGGGGATGTCCGGTATTTATTGAAGTGACGAGATTAAAGTGCATTAGTTCAGATGCAAATTGACATTTTGTGGCACAGGGTAGAGCTAGCGTGGTTGTCCGCTTTGTGCCAAGAGCGGACTTTGCAAAATGGGGGTTATTTCAATCAAAACGTAACGTCACAACCAGCCGACGCTCTCTCGCCATTTATAATTAGTAACTTTATCATTTTCGCTTATTTTTTTAGATATAGAGCGCGGCTCTCTTCCTAGATACTCAGATATTTCTATCGGGGACAAATCAAAATCAACAAGCATGACTCTAAGTTTTTCCATTTCCTTTAAAGTCCAAGGCTTGCCATAATTTTCATAAAGAGATACCTTATGCTCCCTGATAGTTCGCTTTCTCTGAGTTTCACTTTCCCTCTGAAAAATCTCGCTTTTAAATTGTGAACGGAACTCTTTACAGAAGTTGTTATCAGTTTTTTTGTTATATAGATTGCTAAAAATAAGCTGGGATGCCGGGTCTAAATCAGGTGTGTTAATAATGAAAACTTTGACCTTTTCCATATAGGGATATTCAATTTTACCCAATATGCTTAATTGCTTTATATTAAAAAAACCTCTCAAATCAAAAGATTTAATGAGCTTTGATTGGATTACACTTTCAATCCTGCTTGCATTTAAATAACAGTACTTTGCCATTGGAAGGGCCCATACTACCAGTTGAGGAATATATTTTTCTAATGCAATGCTCTGCCAATCAGTATCAAAGGTTTGGCTTTTTGCTAGCATGTTTTTCGGCAAATCAGAATACATTGTGGTAGATGCCCATATCTTATAATCATTCGCCAAGGCTTGATAGTATTTTGTGTGGTTATGAATTTTATAAGCTGACATGAAGCGATAAACGTCTTCGTCGTGACCAGCGTCGTAAATTGTTCGATTTCCTCTAAGATAACCGTCGTAATGTTCGGTTATCCTTCTACCTACATTACAACTTACCCCAACGTAAACCACACGACTGAAAAGTCCTTTATGGACAATAAGATAAACTCCGCTACAGCCAGATTTCCTGGCCTCTGATAGAGAACCTAAAAATCTCCATTCCATAATTAAATCCATAATTATTGCTACTGTTTTTGTTTATCATTATTTTCGTGAAACTTCAACAATTTTATCCAAAAGCTAAGGGCAAAGACTTATATAATTATACTTGTCATCGTTAGCGATTATATAGAGTAGTGGCGCTGACCTGCTCCCTGGTGATTCACACAGAATGCTGTTAGTAATGTCCGTTCCTCGCTCTCAGCGGACCTTCAGCTCAGTGATATCGTCCGCTCTGTGCAAAGAGCGGACGTTGGTATGCAAGAGCCCTCCAAAAGTTGATGGTTGGTTTGCAGGGGGGCTTAAAGAAACTGCACTTATCAAGTTGAAGTTCTGTATTCAGCGAAATCGTAGCACTCTGACGATAAGTAACTCCGGTACTCCGCTCTTCGATGAAGAACCAGAGTAATCCCCCCGAAAAACCAGCGCATCAAAATTGGATCTTCAGCGGTAGCTTATCGGCTATCGGAAGTACAGGTGTGGATTCGTGGTGAATTGCTTTGATAATAAACGATTAATACGGAAAAACGCATTAATCATTTATTAGCTTTTAGTAAACCACAATTTATTCCGTTTTACATATCATAGTAGTCGATTGGAGAATATAGTTTCTGGGAATGTACTCTTCAAAGTGTTCGTCCTTTTTAAATACATGAACTACATTTGGGAATAATTGATAGTCAACAGGGTGTATAGCGTTTGGATTATGGTACATGTACATGGCTGTACACCATGGTTCTTGATAGTTAGGGTCACTTACATCGGCTGAAAATGGATGTGGGGCTGCATCCTGATCAGTTTTAACACCACTGACGTACACTTTGAATCCACTCGCCTCTACACCTGCAAGAATTCCCATCCGGTTAAACTTAGGTATGGTTGCTTGAGTAGTGAGTAAAACGGCAGAAACATAATTATTTTGTTCTGAGCCAAAAAAGTTCGACTTGATACTTCTATTTTCATCTGTATGTCTTTCAATAGAAATGCCTGACTCAATATCAATCCCGTACAAATAGCTATGCAAGGCTTCGCTTGAGAAGGCCATGGACATTCTTTTTGAATAATCCTGCATTGCTATGACAAATGGTTTGTTCTTTGTATGGTTGAGTTCCCAGTAATGAACTTTCTCTGGCTCAGGGCAATGCCGGACTTTTTTTAATAAACTTCTTGCAAACTTAAAAGGCATGACATTTAGAACATGTTTTCTTAATTCATCCATCTGTTCATCGTTAATGACTTTTCTTTCAAGAGGGGCTTCTGCTTCAGCAATGCTTACAGCCTCTACAGCAATTTCCACTCCAAATTTAGATAGCAGAAAATCTGGTTGATTGTATTCTCTATTCATTTCAAAGTCGAGTTCATAAAATACAGCGTTCAAATATAATTCAAATAACCTTGAATTAAATGCATCACTTTGAAAATCCCTTATAAATATTCCATCAGGATCTTTGAACCAGTATGCAAGTTCCTCAAGAACAATATATGCAGGGAAATGAAGAGGGTCTTCGAGGAGCATTTTTATATAAACATTCCTTTTTTTCGCTGGGACCTTACTCAAGAATAATGAAAAAGGTTTGGTTGATTCATCGCCTTGCATGAATGTACCATTTTGGTGCTGCGCCAGCATCTTTGGTATGTCATCGTTCAAATTATTAAGCAAGACATCCATTGAATCAAATGAAGCCAAGACGTTTATTGCTCTGAATTTTTTATCTAAATCCCGACCTAAGACTATTGCGTTAAAATCTTTATCAATATTGCATATGATTATTGTGGATAACAATGTTATCCCATTCCCCTCATATTTAAACCAGCGTATCTCCTCAGAAAATGTCTTAAGGTAAGGTGAGCGACCGTAAAAATAAATATCAAATTGTTCTTTGCTGATCTCACTGAAGTGTAATCCTGCGTTCATACCAATTCCTTTTCAATGAATAATTGGCCTTTAGGAGTGATTCCCTTTGTCTTTAATTCAGTTCTAACTAGTTCTTTAATCCAATAGCCTAAGCTCATCATGCAGTTGGATCATAAGACAACGCCCTATAGTGCTCGTGATACTATAGGGCATCTGACCACACTGTTAACTGGAGTAACGACTATGGCAGGAATACAGCATAACCAAACTCACCCCAAACTTACATAGCGCTTTCTGGCCGTGAGCATAACAAGGTCCACTCCTCGCTCATAAGGGACAACCATACTCAAATCTCCCACATTGCAGGAGATTTGAGTATGAACACGTCACCGTGGAACAAAGACCGTATCATAGGCCAAAAAAGACCACTTCAGATATCTCATATCTGGGGTATCCGAATCCGACTTGAACTGGAAGGTAAAACTCGCGATTTAGCTCTGTTCAACATGGCCCTGGATAGTAAGCTTCGAGGCTGTGATCTGGTCAAACTCAAAGTATCTGATGTTGCATATGGTGGCTCTGTTTCAAGCAGAGCAACGGTGTTGCAACAGAAAACCGGTAGCCCTGTTCAATTTGAGATAACCAAAGGGACAAGAGAAGCTGTTGCTGCATTGATACAGCTTAGCAATTTGCACAGTAAAGACTTCTTGTTTCGGTCTAGGGTCGGAACTAACCAGCACATTTCAACCCGGCAATACAACCGAATCTTTCATGGGGGGGTAGAAAAGCTTGGTCTCGAAGATTCGCTTTACAGCACACATTCCATGAGAAGAACAAAACCTTACCTGATCTACAAGAAAACCAAGAATCTCCGGGTGATCCAACTTCTGTTGGGTCATAAGAAACTGGAAAGCACAGTCCGTTATCTGGGCATTGAAGTCGATGATGCGTTAGAGATTTCTGAATCGATTGAAGTCTAAGGTTGTCAGGGCTGCAACAGCAGCCCTGTGCCATAAGCGGAAGTATTTAACAACTATCAGTGTTGTTCAACAGATAAAGGGGCACTTGATTTTTTCTGTTCTCAGGAAATGATAAAAGCGCGTCGGTTCAAGCCTGCTTAACGGGAGTTTGTTAATCCTGTTGCCGTGACGTTTTGACACCATTATGATGGGGAGACACTTAATGTATGAAGGTTCCGCCACTTATACCTGTCCAACAACTGCCTCGGATGTTTCTTTGTATGAATAAGTGGTAATGAGTAGTGAATCGCTAACAGTCACCCGAACAATCGGTGCCTGCAATTAATTCTATATTCTAAACGAGGGGGAGATTATTACACATGAAATTTAAGGACAAGAACCTTAAGGCTCTCGCGGAATGTATCATAGGAGATAATAAGGCATTTCTGTATCGTTCAAGCAGTCACATCACTGAATTTTTCCAGGACTGCGGCATGGATGTTACTCATGACGGATCCACTCGGTGGAAATGGACGGCCCAGAGGCTTGAAGAACTTCTTTATGAGCCACAGTCAAAGCCACATACTTTGCCGGAAAGGTTTGTTCATGTGCTCAGAACTTTAATGTTAAAAGAAGATGCAATGGATGACGATCCAGGAAGATTAAAGGCGCTTGAAGAACTGAACAAGCCTTTGATGCGGGAAGGCTATGAGGCATTCTATGGTGACGATCGCCTTTTGTATATACGCCATACCGATACCAAAACGGTTTCAGTCAGTAATAACCCTCATCGGCCCTTAACGCCTCACGAAGTAGAATGCAGAAGGTTACTGACCGCGTTTCTTGATACCTGCTCAGAAGATGAGTTAATAGAAGATATTCTCCTTCCTTTATTCCGGCAACTTGGTTTTCACCGGATAACAGCAGTGGGACATAAAGATAAAGCGCTGGAATACGGGAAAGACATCTGGATGAAGTTCACACTGCCAACTCAGCATGTTCTTTATTTCGGCATTCAGGCAAAAAAAGGTAAGTTGGATGCGTCCGGTGCCAGCAAATCTACGAATTCAAACGTGGCAGAAATCTTCAACCAGGTACTGATGATGCTTGGCCATGAAATATTTGACCCAGAAACAAATAGAAAGGTGCTGGTAGATCATGCCTTTATCGTTGCTGGCGGAGAAATTACTAAACAGGCGAGGAACTGGCTGGGCGGGAAACTTGATGCCAGCAAAAGAAGCCAGATAATATTTATGGACCGGGAAGACATTCTTAATTTATATACTGTAAGTAATGTACCTCTGCCAACAGGTGCTCTCATCTCTGATGATGCCGTTAAGAACGATGATATTCCTTTCTAATCAGAAGTACGTCTTTTTCTGAAAGAATACGTGATAGGTAGCCACACCACACCTTTAGTGACCCCTTAATCTGGTAATATAACAGCCCGTATGAATGTCCGCGGCATCGCGGGCTGAAATTTATTAAAAATACTTATTCATCAAGCTGGAGTAGTTTGCCGAGTAACTGTAAACGCCCAACTTAACCGGACCATTCACTTTTAGATTGCTACCAGCAAACCAACTTCCGTTTCTCGCTCAAAGCGGACTAGAAGGTTAGCTTGCGTCGGACTTGGCGTATTTAAAGAAGTGCTGGTGGTAACTGGTTGTTGTGTTCCATTTCTACAAAACAAAATCACAGAAACTATACCCAATAGTTATATTGAATCAATGATGAGACAGCCTCATATTTATCAGAACTGGTGTACGTCCAATACAGGAGGTTGTCGTGCTGGTTCTCAAATATGCGCTAGCTATTGCGGCTGTAATGGCAATTTATTGTCTTGCTATTGTTCTTACGGATCGCCTTTCTGATTGATTTTATATTGGCGAGGTGACGGGAGTTAAGTAGAATTGCTGCGGGTGCTTGAGGCTATCTGCCTCAGGCATGAACACCAAAGGCAGATAGAGAAAAGCCCCAGTTAACATTACGCGTCCTGCAAGACGTTTAACATTAATCTGAGGCTCAATCTATGAACGGCAAATCTAGGTTAGCCTCTTACGTGCCGAAAGGCAAGGAGAAGCAGGCTATGAAGCAGCAAAAGGCGATGTTAATCGCCCTGATCGTCATCTGTTTAACCGTCATAATGACGGCACTGGTAACGAGGAAAGACCTCTGCGAGGTACGAATCCGAACCGGCCAGACGGAGGTCGCTGTCTTCACAGCTTACGAACCTGAGGAGTAAGAGACCAGGCGGGGGAGAATCCCTCGCCACCTCTGATGTGTCAGGCATCCTCAACGCACCCGCACTTAACCCGCTTCGGCGGGTTTTTGTTTTTATTTTCAACGCGTTTGAAGTTCTGGACGGTGCCGGAATAGAATCAAAAATACTTAAGTAGCGCGCAGGGATAAGAGGGATGGTCCCTTAAAGGGGAGAGCTAATTATCCGGAAGGATTCTGATGATGAACATCGAAGAACTGCGTAAAATTTTTTGTGAAGATGGCCTCTATGCTGTGTGCGTTGAAAATGGAAATCTTGTTAGTCATTACCGCATTATGTGTTTGCGAAAGAATGGGGCTGCGTTAATTAATTTTGTGGATGGTCGAGTGACAGACGGATTTATCTTGCGCGAAGGTGAGTTTGTCACTTCATTACAGGCACTGAAAGAGATTGGAATAAAAGCAGGCTTTTCAGCTTTTGCAGAAGAATAAACTCATCTACAATCTTGCGCGGGGCTGAACTCCCGCTGAGTAACACCGTGCCACCGGAGAAAACCGATGGCACGCAACGTAAAATATTACAATTCTGATAATTCGCCCGTTCTTGCCTGCACGCACGAGCGGTATTCTCACGCATTCAAGTCTGAATGGTTCCAGCACCCTCCATGCACTGAAGAGCAGGCTGAATGGATAATTCAGTGTTACCGCAGGCGCGGATACGAGGTTAAGAAAGCTCTTAGTCTCGACTACCGTCACTGGATAATCTCAGTCAGATTGCCTTACTCCGAACGCCCACCGCGTCCGTCCCGTACATTCCAGCAACGCATCTGGAGGTAACGTGCGGGTATTACTTCGACCTGTTCTGGTACCGGAACTCGGGCTGGTGGTCGTTAAGCCGGGCCGTGAATCCATGCCGGTATTCCACAATACCCGGGTACTGGTGGAGCCGGAACCGAAAAGCATGCGTAATCTGCCGTCCGGGGTCGTTCCTGCCGTTCGCCAGCCGCTGGTGGAAGACAAAACATTGCTGCCGTTTTTCAGTAACGCACGGGTAATTCGTGCTGCTGGTGGTGCTGGTGCATTGTCTGACTGGCTGTTGCGCCATATTAAATCCTGCCAGTGGCCACACGGCGATTATCATCACAGCGAAACCGTCATTCACCGTTATGGTACCGGCGCAATGGTGTTGTGCTGGCACTGCGACAACCAGCTGCGTGACCAGACATCCGAATCACTCGAGCAACTTGCTCATCAAAACCTGTCAGCATGGATGATTGACGTCATCGGTCACGCAATAAGCGGTACGCAGGAGCGTGAATTATCTTTGGCTGAATTATCCTGGTGGGCGGTCCGCAATCAGGTGGCGGACGCGCTACCGGAAGCGGTATTACGTGGTTCGCTGGGGTTGCGTGCGGAAAAAATCCGCTCAATGTACCGTGAAAGCGACATCGTACCGGGAGAGCAGACCGCCAACAGCATACTGAAACAGCGCACAAAAAATCTTGCGCCGCTGCCTCACGCCCACCAGCAACAGAACCCACCACAGGAAAAGACGGTGGTCAGCATTGCCGTTGATCCTGAGTCTCCGGAATCTTTCATGAAACGACCTAAACGTCGCCGCTGGGTTAACGAGAAATACACACGCTGGGTGAAGACACAGCCGTGTGCGTGTTGTGGTAAGCCAGCCGACGATCCCCATCACCTGATTGGTCATGGTCAGGGCGGAATGGGGACAAAATCTCACGATATTTTCACGCTACCGCTGTGTCGGGAGCATCACAACGAGCTTCATGCGGATCCTCTGGCGTTCGAAGAAAAGCATGGTTCTCAGGTTGATTTAATTTTTCGTTTTCTTGATCACGCCTTTGCAACTGGCGTGCTTGGGTAAAAGAGGTGACTGATGCTCATAGATTTGGTTTTACCTTACCCGCCGACGGTGAACACTTACTGGCGACGCCGTGGCAGCACATATTTTATCTCGGAGGAGGGAAAGCGTTATCGCCGGGCTGTGGCGCTTATTGTTCGCCAGCAGCGGCTGAAATTAAGCCTGTCCGGAAGGCTGGCGATAAAGGTGATTGCAGAGCCACCGGATAAGCGTCGTCGCGACCTGGACAATATCCTGAAAGCACCGCTGGATGCGCTGACGCATGCGGGAGTGTTAATGGACGATGAGCAGTTTGATGAAATCAATATCGTTCGTGGTCAGCCAGTATCTGGTGGACGTCTGGGGGTGAAGATTTACCCCATAATGCATGAAGAGCAGGTCAAAAAATGAAACTGGAAGATTTACCGAAATACTACTCCCCAAAATCCCCTGGCCTGACCGATGCATCGGCCTCAACGTCAAAAGATGCGCTGAGTATCACTGATGTGATGGCCGCGCAGGGCATGACACAGAATCGGGCTGAGATGGGTTTTTCTGCGTTCCTGGGGAAAATGGGCATCAGTATGAATGACAGGGCGCGGGCAACAGAATTACTGGCAGATTATGCACTCAGTCGGTGCGATCGTGTGGCGGCGTTGAGAAAGCTTCCGGCAGAAATAAAACCGGTAGTGATGCGCATTATGGCTTCGTACGCTTTTGAGGATTATGCCCGCAGCGCAGCGAGTAAAAAGCAGTGCCCTTGTTGCTATGGGGAAAAATTTATTGAAAGCGTAGTTTTTACAAACAAGGTCCAGTATCCGGATGGTAAGCCGCCGGTATGGGCAAAGTGTACGAAAGGTGTGTATCCGTCTTACTGGGAAGAATGGAAAAAAGTCAGGGAGGTGGTAAAAGTTGCCTGTCCGGAGTGTGGCGGAAAGGGTGAGGTTTCCACCGCCTGTAAGGATTGCCGTGGGCGTGGTGTCGCCATTCATCGTGAAGAGTCGGTAAAACGTGGTATGCCTGTTATCAGAGACTGCCAGCGTTGTGGTGGTCGTGGCTATGAAAGACTACCATCAACGGAGGCATTTAATGCTATATGCGAGGTGACAAACCAGATAACACGCGCGTCATGGGAAAAAACAGTTAAGAAATTCTATGATGCGCTGGTGACCCGGTTTGATATTGAAGAAGCATGGGCTGAGCGGCAGTTAAAAAAGGTAACTAGGTAACAAGGTTGATTTTTCCGGAATCTGTGGTAAATTCGTCATAACGATGGGCGTTTTATGCCTGACGTTAGAAGAGTTTCTACAACCCGCCGCCGAGCGGGTTTTTTATTGCGGAATTAATTATGGACCGTTATTATTCTGCTCCCGGCCCTTTAGCTCAGTGGTGAGAGCGAGCGACTCATAATCGACAGGTCGCTGGGTCAAATCCAGCAAGGGCCACCAACCGTCACCAGTTCATCAGGAAAGAGCGTCAACCCTTTAAGTTGAGTGTGCGAGGTTCGAGTCCCCGGTGGCGGTCCAGTGCCGACTTCGCTCAGTAGGTAGAGCAACTGACTTGTAATCAGTAGGTCACCAGTTCGATTCCGGTAGTCGGCACCATATGCGGGCATCGCATAATGGCTATTACCTCAGCCTTCCAAGCTGATGATGCGGGTTCGATTCCCGCTGCCCGCTCCAGTTAGAGTCTTTCAGTCTGCGATGATGGGAAATCCCGGAGTGACTGAAAGACGTTTAAGTTATGAATGATCGCTTTTTTTTGCAAAATTGCTGTGCAGAAATACTAACCTTCGGGCAGGCGATCATTCATAAGCACTCTGCTTTTATTCCGATTAACTGTGGGTGGTTTGTTGGATAGAGTGCTTTCCTTACTGTATATATTGTTTCGCCCGCTTTTGCGGGCTTTTCTTTTCAAATCCCTTTCATTTCTCAGTGTAAAACTACGCCATCCGTTATTTGCGGAGGTGAGGCTATGAAATCCATGGACAAAATTTCAACGGGCATTGCCTATGGCACCTCCGCAGGCAGTGCTGGCTACTGGTTTTTACAGCTGCTCGATAAAGTCACGCCCTCACAGTGGGCAGCAATAGGTGTGCTGGGTATCCGCACCGGCGGGCGCGTGCTGGCGGTAAACAGCCAGACCCGGACGCTGACGCTCGACCGTGAAATCACGCTGCCATCTTCCGGCACCACGCTGATAAGCCTGGTTGACGGGCAGGGGAGTCCGGTCAGCGTGGAGGTTCAGTCCGTCACCGACGGCGTGAAGGTGAAAGTGAGCCGTGTTCCTGACGGCGTTGCTGAATACAGCGTATGGGGGCTGAAGCTGCCGACGCTGCGCCAGCGCCTGTTCCGCTGCGTGAGTATCCGTGAGAACGACGACGGCACGTATGCCATCACCGCCGTGCAGCATGTACCGGAAAAAGAGGCCATCGTGGATAACGGGGCGCACTTTGACGGCGACCAGAGCGGCACGGTAAATGGTGTCACGCCGCCAGCGGTGCAGCACCTGACCGCAGAAGTCACCGCAGACAGCGGGGAATACCAGGTGCTGGCCCGCTGGGACACGCCGAAGGTGGTGAAGGGCGTGAGCTTCCTGCTTCGCCTGACCGTGGCAGCGGATGACGGCCGTGAGCGGCTGGTCAGCACGGCCCGGACGACGGAAACCACTTACCGCTTCACACAACTGGCTCTGGGGAACTACAGGCTGACAGTCCGGGCAGTAAATGCGCGGGGGCAGCAGGGCGATCCGGCGTCGGTATCGTTCCGGATTGCCGCACCGGCAGCGCCGTCGCGGATTGAGCTGACGCCGGGCTATTTTCAGATAACCGCCACGCCGCATCTTGCGGTTTATGACCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAAAACGGATTGCGGATATCAGGCAGGTTGAAACCAGCGCGCGTTATCTTGGTACGGCACTGTACTGGATAGCCGCCAGTATCAATATCAAACCGGGCCATGATTATTATTTTTACGTTCGCAGTGTGAACACCGTTGGCAAATCGGCATTCGTGGAGGCTGTCGGTCAGCCGAGTGATGATGCATCAGGCTATCTGGATTTTTTCAAAGGCGAGATAGGGAAAACCCATCTGGCTCAGGAGCTGTGGACGCAGATTGATAACGGTCAGCTTGCGCCTGACCTGACTGAAATCAGGACGTCCATAACGGATGTCAGCAATGAAATAACACAGACCGTCAATAAGAAACTGGAAGACCAGAGTGCAGCGATCCAGCAGATACAGAAGGTTCAGGTTGATACAAATAATAACCTGAACAGCATGTGGGCAGTGAAGCTGCAGCAGATGCAGGACGGACGCCTTTATATTGCGGGTATCGGTGCCGGTATTGAGAACACCCCTGACGGCATGCAGAGTCAGGTGCTGCTGGCAGCAGACAGGATTGCGATGATTAATCCTGCGAATGGCAACACAAAGCCGATGTTTGTTGGTCAGGGCGATCAGATATTCATGAATGAAGTGTTCCTGAAACGCCTGACGGCCCCCACCATTACCAGCGGCGGTAATCCTCCGGCATTTTCCCTGACATCAGACGGGAGACTGACGGCGAAAAATGCGGATATCAGTGGCAGTGTGAATGCGAACTCAGGAACGCTCAACAATGTCACGATTAACCAGAACTGTACGATTAAGGGCATGCTGGAGGCGACCCAGGTCAGAGGAGATTTCGTTAAAGCTGTATCAAAAGCCTTCCCGAAAAAAGTCGGTACGTGGGGTAACACGGAAACACCAAACGGTACGGTTACAGTAACCATCAGCGATGATCATAACTTTGACCGCCAGATTATTATTCCGCCCATTATTTTTAACGGTATAGCGTATGACGATCCGGGGAGCGGAAATAACCCAGGAGGCACGCGATACACGGGTTATGGTTTTGAAGTTCGCAAAAACGGCGTATTAATCGCATCCAGAGAAACTAAAGGGGCCATTCCCGGTAGTTACAGTGCAGTTATTGATATGCCGAGTGGCAGGGGAAGCGTCACTCTGGAGTTTAAGATTTTCCAGAAAGGCAATCAGGGGGCAGGCAATATCACCGACTGTACGGTGATTGTGACCAAAAAAGCGGCTTCCGGCATCAGTATTCGTTGAAATATTTATAACCCCAATAAAGGGCGTCAGGAATGACGCCTTTTTTATTGCAGAAAAGCGAGAGGTAATTATGCGTAAAGTTTGTGCAGCAATTTTGTCCGCAGCCATTTGTCTGGCCGTATCCGGTGCGCCTGCATGGGCGTCTGAACATCAGTCCACGCTGAGCGCGGGGTATCTTCATGCCCGGACCAACGTTCCCGGCAGTGATGATCTGAACGGGATTAACGTGAAATATCGTTATGAGTTTACGGATACGCTGGGGCTGGTGACGTCATTCAGCTATGCAGGAGACAAGAATCGCCAGCTGACCCGTTACAGCGATACCCGCTGGCATGAAGATTCCGTGCGTAACCGCTGGTTCAGCGTGATGGCGGGGCCGTCTGTGCGCGTGAATGAATGGTTCAGCGCGTATGCGATGGTACTGGTGGAAGAACTATCGAGCAAGCACGTGCGAACTTGCGGGTAATGTATGAGCAAAAAGCTGGCCTTGCTAATACTGACCTAAACACCCTTACCGGTGAATATTCTGGTTTCTATCAACAACCAACGAGCGCTTACGCAACAGAAGAGTTAAATTACCCAATCGGTCTGGCGGGCGCTTTAATAGTGCTCCAAACGAGAGCCAACACTGCTTCTTCCTGCGTTCAGGTGTACCACCCTTATAATAATCCGGGAATTACTTATAGACGAATATATGAAGGAGGTAGCGGTACCTGGTCTGAATGGAAGAGAGATGTATCAACAGAAAGGGTTGAAGAGGGAAAAGAAACAACTTACGTATATTCTACGTATTCTTCAGGCGCACCACGCTTACAGGTTTCCAAATCTGGTTTGTGGGGTTGTCATAATGGCACTGGCTGGTTGCCATTAGCTGTTGGGCAAGGAGGTACAGGTGCGACAACAGTAGAAGATGCGCGAAACAACTTAAGTCTTGGCGAAAGTAGCGCAGTTAAATTTAAAAACCTTACTTTAACCGAAGCGCTCGACACGACATTAGGACTGCTTACAAAAACAGGACGAGACTGGAACACGCAGCATACTGATAACATTAATAAATTTATACCAATTGCAGGCAGTACAAACGGCCCGGCAGGCTCTATGGTTCTTGGCGGCATTCATGTTCAATTTAGTAAAAATTATGCTGTGCAGTTCGGAGGCCGCAATTCCGGTTTTTGGGGAAGAACAATTGAAAATGGAACGACACAGGAATGGAAGAAATTACTAACAGTAGACGATCTCAATTCATCTACCGATCTTGCTGTCAGGTCATTAACCACATCTAACCCGGTAAAATCTGGCGGAGGGCGAATTGATGTCCTTGGAAGCACGTCAGACTATAGCAAAATGGATTGCTTTGTACGTGGGTTTGATAGCACCGGTAATTCTCTCGTGTGGGCGTTGGGTTCATCAGTCGGCGTAAGTAAGATGCTATCGCTAAAAAATTTCTTTAGCGGAGCTGAGATACTGTTAAATGGTAATGACGGCGCGGTTCAACTCAAAACAGGTGCTGTTAACGGGGCTACAGCGCAGACGCTCACTATCAACAAGAATGAGGTTAACTCAACCGTTGATTTAACCCTTACAAAGCAATCAGGGACTGGTAATCGTTTTGTTTTACAGAACTCAGGTAATGCAGAACTACCGTTTTCTGTCAGGGTGTGGGGTTCCAGTACTCGACAAAACGTTTTTGAGGTTGGAACGTCTGCTGCGTATCTGTTTTATGCGCAAAAAACGTCAGCAGGCCAGTTGTTTGATGTAAATGGCGCTATTAATTGCACAACGCTGAATCAGTCATCAGACCGCGACCTTAAAGACGATATTCTCGTTATCAGCGACGCGACGAAAGCAATCCGTAAAATGAACGGATACACCTACACGCTCAAGGAAAACGGGATGCCTTATGCTGGCGTTATTGCACAGGAAGTAATGGAGGCGATACCAGAAGCTGTGGGATCGTTTACTCATTATGGTGAAGAGTTGCAAGGTCCGACCGTTGACGGCAACGAGCTACGCGAAGAAACTCGCTATCTTAATGTTGACTACTCCGCCGTGACGGGTTTACTTGTTCAGGTCGCCCGTGAAACAGATGATCGCGTTACCGCGCTGGAAGAGGAAAACACAACGCTACGTCAAAATCTGGCAACAGCAGGCACCCGGATCAGCACTCTGGAAAATCAGGTAAGCGAACTGGTTGCACTTGTCCGGCAGTTAACAGGAAGCGAACATTGATATCCTTCAAGCCCTGAAGGAGGCTGTTCCTGGTACGTTCAGACTGTTGTTGAGCTGGAAATCGCAACGGAGGAAGAAACTTCGTTGCTGGAAGTCTGGAAGAAGTATCGGGTGTTGCTGAACCGTGTTAATACAACAACTGCACCGGATATTGAATGGCCAGTAGCACCTATAGGGTAA
Protein sequences of DBSCAN-SWA_4 >CP034966|2243763:2272366|2257443_2257866_+|QAS90103.1|DBSCAN-SWA MAKVFTQEEREKIKGQVVELVRRSGRETLRQLEVKTGATRYLMSVLARELVASGDVYNSGYGLFPSEQARKDWQNARKKLSRAKLKEPSAVDPDLIWSLPDGEIRRYDRRHNMICTECRKSEVMQRILSFYQGDVRYLLK >CP034966|2243763:2272366|2266227_2267049_+|QAS90114.1|DBSCAN-SWA MKLEDLPKYYSPKSPGLTDASASTSKDALSITDVMAAQGMTQNRAEMGFSAFLGKMGISMNDRARATELLADYALSRCDRVAALRKLPAEIKPVVMRIMASYAFEDYARSAASKKQCPCCYGEKFIESVVFTNKVQYPDGKPPVWAKCTKGVYPSYWEEWKKVREVVKVACPECGGKGEVSTACKDCRGRGVAIHREESVKRGMPVIRDCQRCGGRGYERLPSTEAFNAICEVTNQITRASWEKTVKKFYDALVTRFDIEEAWAERQLKKVTR >CP034966|2243763:2272366|2249117_2250332_+|QAS90095.1|DBSCAN-SWA MKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQCGIPGMKTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFNEHLLHDMHHRLTPIEAARFGKSIEDYRMFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYEPAYLPVARLEDGTLWNW >CP034966|2243763:2272366|2246801_2247512_+|QAS90091.1|DBSCAN-SWA MKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY >CP034966|2243763:2272366|2263616_2263724_-|QAS90108.1|DBSCAN-SWA MLTGAFLYLPLVFMPEADSLKHPQQFYLTPVTSPI >CP034966|2243763:2272366|2243763_2246190_-|QAS90090.1|DBSCAN-SWA MSKNDRMVGISRRTLVKSTAIGSLALAAGGFSLPFTLRSAAATVQQASEKVIWGACSVNCGSRCALRLHVKDNEVTWVETDNTGSDEYGNHQVRACLRGRSIRRRINHPDRLNYPMKRVGTRGEGKFERISWDEALDTIASSLKKTVEQYGNEAVYIQYSSGIVGGNMTRSSPSASAVKRLMNCYGGSLNQYGSYSTAQISCAMPYTYGSNDGNSTTDIENSKLVVMFGNNPAETRMSGGGITYLLEKAREKSNAKMIVIDPRYTDTAAGREDEWLPIRPGTDAALVAGIAWVLINENLVDQPFLDKYCVGYDEKTLPADAPKNGHYKAYILGEGDDNTAKTPQWASQITGIPVDRIIKLAREIGTAKPAYICQGWGPQRQANGELTARAIAMLPILTGNVGISGGNSGARESTYTITIERLPVLDNPVKTSISCFSWTDAIDHGPQMTAIRDGVRGKDKLDVPIKFIWNYAGNTLVNQHSDINKTHEILQDESKCEMIVVIENFMTSSAKYADILLPDLMTVEQEDIIPNDYAGNMGYLIFLQPVTSEKFERKPIYWILSEVAKRLGPDVYQKFTEGRTQEQWLQHLYAKMLAKDPALPSYDELKKMGIYKRKDPNGHFVAYKAFRDDPEANPLKTPSGKIEIYSSKLAEIARTWELEKDEVISPLPVYASTFEGWDSPERRTFPLQLFGFHYKSRTHSTYGNIDLLKAACRQEVWINPIDAQKRGIANGDMVRVFNHRGEVRLPAKVTPRILPGVSAMGQGAWHEANMSGDKIDHGGCVNTLTTLRPSPLAKGNPQHTNLVEIEKI >CP034966|2243763:2272366|2267794_2269957_+|QAS90115.1|DBSCAN-SWA MKSMDKISTGIAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGIRTGGRVLAVNSQTRTLTLDREITLPSSGTTLISLVDGQGSPVSVEVQSVTDGVKVKVSRVPDGVAEYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFLLRLTVAADDGRERLVSTARTTETTYRFTQLALGNYRLTVRAVNARGQQGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSEKRIADIRQVETSARYLGTALYWIAASINIKPGHDYYFYVRSVNTVGKSAFVEAVGQPSDDASGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLTEIRTSITDVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQDGRLYIAGIGAGIENTPDGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQIFMNEVFLKRLTAPTITSGGNPPAFSLTSDGRLTAKNADISGSVNANSGTLNNVTINQNCTIKGMLEATQVRGDFVKAVSKAFPKKVGTWGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYDDPGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGRGSVTLEFKIFQKGNQGAGNITDCTVIVTKKAASGISIR >CP034966|2243763:2272366|2253189_2255661_-|QAS90099.1|DBSCAN-SWA MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHREYKQTWKTLDDELAYALWPGDVDAGNIDGSIHRWAKKEVIDNDREDWKRISASMRKQPDALRYDRQTIFGLVRERPIDIHKDPVALNKYICEYLTTKGVFENEETDLGTVDVLQSSETQTDAVETEVSDIPKNETAPEAEPSVEREGPFYFLFADKDGEKYGRANKLSGLDKALAAGATEITKEEYFARKNGTYTGLPQNVDTAEDSEQPEPIKVTADEVNKIMQAANISQPDADKLLAASRGEFVEGISDPNDPKWVKGIQTRDSVNQNQHESERNYQKAEQNSPNALQNEPETKQPEPVAQQEVEKVCTACGQTGGGNCPDCGAVMGDATYQETFDEEYQVEVQEDDPEEMEGAEHPHKENTGGNQHHNSDNETGETADHPIKVNGHHEITSTSRTCDHLMIDLETMGKNPDAPITSIGAIFFDPQTGDMGPEFSKTIDLETAGGVIDRDTIKWWLKQSREAQSAIMTDEIPLDDALLQLREFIDENSGEFFVQVWGNGANFDNTILRRSYERQGIPCPWRYYNDRDVRTIVELGKVIDFDARTAIQFEGERHNALDDARYQAKYVSVIWQKLIPSQADS >CP034966|2243763:2272366|2252865_2253102_-|QAS92296.1|DBSCAN-SWA MIVSPGKWVSEEQLIALKGIKKGTLKKAREKSFMEGREYKHVAHDGMPWDNSPCFYNLEEIDRWIERQASARPRRHLT >CP034966|2243763:2272366|2259487_2260837_-|QAS90105.1|DBSCAN-SWA MNAGLHFSEISKEQFDIYFYGRSPYLKTFSEEIRWFKYEGNGITLLSTIIICNIDKDFNAIVLGRDLDKKFRAINVLASFDSMDVLLNNLNDDIPKMLAQHQNGTFMQGDESTKPFSLFLSKVPAKKRNVYIKMLLEDPLHFPAYIVLEELAYWFKDPDGIFIRDFQSDAFNSRLFELYLNAVFYELDFEMNREYNQPDFLLSKFGVEIAVEAVSIAEAEAPLERKVINDEQMDELRKHVLNVMPFKFARSLLKKVRHCPEPEKVHYWELNHTKNKPFVIAMQDYSKRMSMAFSSEALHSYLYGIDIESGISIERHTDENRSIKSNFFGSEQNNYVSAVLLTTQATIPKFNRMGILAGVEASGFKVYVSGVKTDQDAAPHPFSADVSDPNYQEPWCTAMYMYHNPNAIHPVDYQLFPNVVHVFKKDEHFEEYIPRNYILQSTTMICKTE >CP034966|2243763:2272366|2262116_2263097_+|QAS90107.1|DBSCAN-SWA MKFKDKNLKALAECIIGDNKAFLYRSSSHITEFFQDCGMDVTHDGSTRWKWTAQRLEELLYEPQSKPHTLPERFVHVLRTLMLKEDAMDDDPGRLKALEELNKPLMREGYEAFYGDDRLLYIRHTDTKTVSVSNNPHRPLTPHEVECRRLLTAFLDTCSEDELIEDILLPLFRQLGFHRITAVGHKDKALEYGKDIWMKFTLPTQHVLYFGIQAKKGKLDASGASKSTNSNVAEIFNQVLMMLGHEIFDPETNRKVLVDHAFIVAGGEITKQARNWLGGKLDASKRSQIIFMDREDILNLYTVSNVPLPTGALISDDAVKNDDIPF >CP034966|2243763:2272366|2257995_2258940_-|QAS90104.1|DBSCAN-SWA MDLIMEWRFLGSLSEARKSGCSGVYLIVHKGLFSRVVYVGVSCNVGRRITEHYDGYLRGNRTIYDAGHDEDVYRFMSAYKIHNHTKYYQALANDYKIWASTTMYSDLPKNMLAKSQTFDTDWQSIALEKYIPQLVVWALPMAKYCYLNASRIESVIQSKLIKSFDLRGFFNIKQLSILGKIEYPYMEKVKVFIINTPDLDPASQLIFSNLYNKKTDNNFCKEFRSQFKSEIFQRESETQRKRTIREHKVSLYENYGKPWTLKEMEKLRVMLVDFDLSPIEISEYLGREPRSISKKISENDKVTNYKWRESVGWL >CP034966|2243763:2272366|2265856_2266231_+|QAS90113.1|DBSCAN-SWA MLIDLVLPYPPTVNTYWRRRGSTYFISEEGKRYRRAVALIVRQQRLKLSLSGRLAIKVIAEPPDKRRRDLDNILKAPLDALTHAGVLMDDEQFDEINIVRGQPVSGGRLGVKIYPIMHEEQVKK >CP034966|2243763:2272366|2246388_2246694_-|QAS92295.1|DBSCAN-SWA MKLSTCCAALLLALASPVVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTRTTSGNVSAPAQSTQDGAPAEPQ >CP034966|2243763:2272366|2250343_2251363_+|QAS90096.1|DBSCAN-SWA MKSILIEKPNQLAIVEREIPTPSAGEVRVKVKLAGICGSDSHIYRGHNPFAKYPRVIGHEFFGVIDAVGDGVESARVGERVAVDPVVSCGHCYPCSIGKPNVCTTLAVLGVHADGGFSEYAVVPAKNAWKIPEAVADQYAVMIEPFTIAANVTGHGQPTENDTVLVYGAGPIGLTIVQVLKGVYNVKNVIVADRIDERLEKAKESGADWAINNSQTPLGEIFAEKGIKPTLIIDAACHPSILKEAVTLASPAARIVLMGFSSEPSEVIQQGITGKELSIFSSRLNANKFPVVIDWLSKGLIKPEKLITHTFDFQHVADAISLFEQDQKHCCKVLLTFSE >CP034966|2243763:2272366|2251550_2252831_-|QAS90098.1|DBSCAN-SWA MKYPTGVENHGGKLRIWFVYKGVRVRENLGVPDTAKNRRVAGELRSSVCYAIKTGVFDYAKQFPSSRNLEKFGEARQDLTIKELAEKFLALKETEVAKTSLNTYRAVIKNILSIIGEKNLASSINKEKLLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNAVFQFGVDNGYLADNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAKNLWCVSVYTGVRPGELCALGWEDIDLKNGTMMIRRNLAKDRFTVPKTQAGTNRVIHLIKPAIDALRSQMTLTRLSKEHIIDVHLREYGRTEKQKCTFVFQPEVSARVKNYGDHFTVDSIRQMWDAAIKRAGLRHRKSYQSRHTYACWSLTAGANPAFIANQMGHADAQMVFQVYGKWMSENNNAQVALLNTQLSEFAPTMPHNEAMKN >CP034966|2243763:2272366|2248109_2248451_-|QAS90093.1|DBSCAN-SWA MKITLSKRIGLLAILLPCALALSTTVHAETNKLVIESGDSAQSRQHAAMEKEQWNDTRNLRQKVNKRTEKEWDKADAAFDNRDKCEQSANINAYWEPNTLRCLDRRTGRVITP >CP034966|2243763:2272366|2247514_2248075_-|QAS90092.1|DBSCAN-SWA MPSAHSVKLRPLEREDLRYVHQLDNNASVMRYWFEEPYEAFVELSDLYDKHIHDQSERRFVVECDGEKAGLVELVEINHVHRRAEFQIIISPEYQGKGLATRAAKLAMDYGFTVLNLYKLYLIVDKENEKAIHIYRKLGFSVEGELMHEFFINGQYRNAIRMCIFQHQYLAEHKTPGQTLLKPTAQ >CP034966|2243763:2272366|2248585_2248912_+|QAS90094.1|DBSCAN-SWA MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLTLHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLTLYDWTGALIALCGMLIIVAGWGRT >CP034966|2243763:2272366|2256437_2257403_+|QAS92297.1|DBSCAN-SWA MSLLFAERPLVINTQLAMKIGLNEAIVLQQLHYWLRDTNSGMECDGVRWIYNTTEQWLEQFPFWSESTLKRAFASLKTLGLLRCEKLNKSKRDMTNFYTINYGSELLDGGKLSESIGLKCAAPSGQNDTMEEVKMKRSIGSKRLNVIGSKWPDDLTENTTEITTENKKTSRPEASQPDPQTVEQDFLTRHPDAVVFSAKKRQWGSQEDLACAQWIWGRIVSLYEQAASDDGEISRPKEPNWTAWANDVRTMRMLDGRTHRQICEMFGRVQRDPFWVKNIMSPSKLREKWDELVIRLGRSSVQRCVNHISEPDTEIPPGFRG >CP034966|2243763:2272366|2255942_2256131_-|QAS90101.1|DBSCAN-SWA MKTLLPNVNTSEGCFEIGVTISNPVFTEDAINKRKQERELLNKICIVSMLARLRLMPKGCAQ >CP034966|2243763:2272366|2256214_2256457_+|QAS90102.1|DBSCAN-SWA MRISDFRLYREISDGKSITYMIAGLNKEYGDVVESGLLFADPAVVDRETDELIEKAIAFKLAYRQQYQQKAGWNYESSFC >CP034966|2243763:2272366|2270788_2272186_+|QAS92299.1|DBSCAN-SWA MWGCHNGTGWLPLAVGQGGTGATTVEDARNNLSLGESSAVKFKNLTLTEALDTTLGLLTKTGRDWNTQHTDNINKFIPIAGSTNGPAGSMVLGGIHVQFSKNYAVQFGGRNSGFWGRTIENGTTQEWKKLLTVDDLNSSTDLAVRSLTTSNPVKSGGGRIDVLGSTSDYSKMDCFVRGFDSTGNSLVWALGSSVGVSKMLSLKNFFSGAEILLNGNDGAVQLKTGAVNGATAQTLTINKNEVNSTVDLTLTKQSGTGNRFVLQNSGNAELPFSVRVWGSSTRQNVFEVGTSAAYLFYAQKTSAGQLFDVNGAINCTTLNQSSDRDLKDDILVISDATKAIRKMNGYTYTLKENGMPYAGVIAQEVMEAIPEAVGSFTHYGEELQGPTVDGNELREETRYLNVDYSAVTGLLVQVARETDDRVTALEEENTTLRQNLATAGTRISTLENQVSELVALVRQLTGSEH >CP034966|2243763:2272366|2272240_2272366_+|QAS92300.1|tail|DBSCAN-SWA MEIATEEETSLLEVWKKYRVLLNRVNTTTAPDIEWPVAPIG >CP034966|2243763:2272366|2264196_2264448_+|QAS90110.1|DBSCAN-SWA MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDGRVTDGFILREGEFVTSLQALKEIGIKAGFSAFAEE >CP034966|2243763:2272366|2263768_2263981_+|QAS90109.1|DBSCAN-SWA MNGKSRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIMTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >CP034966|2243763:2272366|2261154_2261757_+|QAS90106.1|integrase|DBSCAN-SWA MNTSPWNKDRIIGQKRPLQISHIWGIRIRLELEGKTRDLALFNMALDSKLRGCDLVKLKVSDVAYGGSVSSRATVLQQKTGSPVQFEITKGTREAVAALIQLSNLHSKDFLFRSRVGTNQHISTRQYNRIFHGGVEKLGLEDSLYSTHSMRRTKPYLIYKKTKNLRVIQLLLGHKKLESTVRYLGIEVDDALEISESIEV >CP034966|2243763:2272366|2255754_2255946_-|QAS90100.1|DBSCAN-SWA MNSAFALVLTVFLVSGVPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVINQDNIEIPAGL >CP034966|2243763:2272366|2263301_2263610_+|QAS92298.1|DBSCAN-SWA MATSKPTSVSRSKRTRRLACVGLGVFKEVLVVTGCCVPFLQNKITETIPNSYIESMMRQPHIYQNWCTSNTGGCRAGSQICASYCGCNGNLLSCYCSYGSPF >CP034966|2243763:2272366|2264794_2265844_+|QAS90112.1|DBSCAN-SWA MRVLLRPVLVPELGLVVVKPGRESMPVFHNTRVLVEPEPKSMRNLPSGVVPAVRQPLVEDKTLLPFFSNARVIRAAGGAGALSDWLLRHIKSCQWPHGDYHHSETVIHRYGTGAMVLCWHCDNQLRDQTSESLEQLAHQNLSAWMIDVIGHAISGTQERELSLAELSWWAVRNQVADALPEAVLRGSLGLRAEKIRSMYRESDIVPGEQTANSILKQRTKNLAPLPHAHQQQNPPQEKTVVSIAVDPESPESFMKRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKSHDIFTLPLCREHHNELHADPLAFEEKHGSQVDLIFRFLDHAFATGVLG >CP034966|2243763:2272366|2264514_2264793_+|QAS90111.1|DBSCAN-SWA MARNVKYYNSDNSPVLACTHERYSHAFKSEWFQHPPCTEEQAEWIIQCYRRRGYEVKKALSLDYRHWIISVRLPYSERPPRPSRTFQQRIWR >CP034966|2243763:2272366|2251420_2251531_+|QAS90097.1|DBSCAN-SWA MTIEKHERSTKDLVKAAVSGWLGTALEFMDFKSHAC |
32 | Escherichia_phage(25.0%) | tail,integrase | attL 2244828:2244842|attR 2268606:2268620 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2684012 : 2696750
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP034966|2684012:2696750|DBSCAN-SWA GTCATTTTGCTAATAACACTTTTCTGGCTTCATCGACTTTATTTGGAGCATCGAACCAATATTCGGCGAGCAAAGGGCAGATCTCTGTATCAACGACTTGTTCATACCAAGCCTGGGCATCGTTGATTTTTTGGCCGATAGCTGGGGTGACATAGCTGTGACCGATACAAAACTGTGGTCCTAAGGTAACATCTTTTGCCAGCATATCGTTAAGTACCGTCAGTCGAGATTTGATGAATGCTAACATGTCAAAATCAATTGCGTAATTATAATTTACCCAGTTTCTCCACGCGTCATTAAAAGCTGGCTTTAGATCGATAAACGCGAAACGACGGCGCAGCGCTAGGTCAAGTAGTGCGAGTGAGCGGTCTGCAATATTCATCGTACCGATGATATATAGATTCTCAGGAATGTATATTTTTTCATCATCATTTTTAGGATAGGAAAGAGATAATGCCTCAGTCGGCGTGCGTTTATCTGCTTCCATCAACGTGAGTGTTTCACCGAAGATTTGCGCCGGATTGCCACGATTAATCTCTTCAATTATCACCACATATTTCGAGGTAGGATTATTGACTGCAGTTTTGATTGCATTTACAAAAGGTCCATCAATTAGCGTCAATTGCCCTTCTTTACCTGGACGCCAGCCGCGAATAAAGTCTTCGTAAGAGAGGTTCGGGTGAAATTGCACCGCGCTAATACGCTCAGGTGCTTTTTCTCCCATCAAGCAGTACGCCAGACGTCGCGCTAACCAGGTTTTTCCAGTTCCGGGTGGTCCTTGTAATATCAGGTTTTTCTTGTCGATCAGGCGCTGAAGTGTGAGTTGGATCTTAGCCTCTTCCAAGAAACAGCCATCCTGCACCAGATGACTGATGTCATAAGGAACGTGAGTGAGTTTTGGCAACGGCGCACTCTCTTCGACAGTTTCCTCAGTTATCGTCTGGGATTCATTTTTCTCAAAATTCAGGTATTCATAATTTCCGGACTCAAGGGACTTTAACTCATCATCATTGCATAGTTTCTGTAGATAAAAGTAGATTGAGCTTGTAACTGTTTTGCTCTCAGGATGCTCTACCTTGAATACGTCCAAGTAGTTTTCTTCCAACTCCGAAGCAGTGAAATAAGGGCCATTTTTTTTCAGGCATAACGCCTTAATTTTATTCAGTAAAGAGGCTTTCCACGTTCTATTTGCCACCTCATCTTTAGACTGGCTAAGATCTGTATTCCACGCTGATAAAGAAAGTTCTGGGAAAGAATGAACCGGATAGTTTGGTTGGGTAAAGACCTCGTTCAGCGCCCGCATAAGACTCAGGTAACTTTGTCCACTACAGCGACCTTTTGCCCCGTTTTTAATGATCTTAATGTTTAACACTGTCTGAATGTAATACTGCGACTGGCTATCTAAAGTAGGGTAGAACCAAGGACGGGTCCAGTACAACCCCATGGTGAGATTCCAACCGACATTCATTACTGTAGAAGCAATGTCATATGCGGCAGTGAAGTCTGCAGAGTTAGTATTCTGGTTATCCGCAAAGGTCATTGCCTGCGAGAACATTTCCCACAAGCATTCAATGTCATTAGGGTCGCGTGACTTTTCATAACCAAAGAACCAAGATTTTTGGTTATTCAACAGCGGGATTCCGGCAAAGGAGTCAGGAATCGGTTCGTTCACGCCCAACAAATTCGCTAGCTTGGCAGCAATAATTTTGCGATTGCTGTCGGTCAAGTTACGATTGAACAAGCCCATAGTAGTAAACGGACAAATGTCTTTTAAGGGAAAGATCTCTCCCATAATAGATTTGTCCTGCAGATGGGACATTCCTTCCACACCTGAGGCAATTAGATGAATACCTTTGACTAATTCATCTCTACGATTTCGCCAAGTCAGTAACGCGTTGGCAAAAGCCTCATAAAAACTAGCCCAAGCAAATTTGCCATCATGTTCTGCTGTATCCACGGGAACTCCATTATACTTGTTGAGCAATGATAATTTATCTGTGTGAGTTTTAACATATTACTATCAGTTATAGAAAAATTTAACTACCCGATACAGAGAGCGGCATGCTGAATTTGACCTGACTTGCTTCCAACTAATTAAAATCAACTTATTTATCAATTGGTTATTTTGGCGCATAGCGGTCATCAAGGGAATATCGCGTTGTCATAAGGTGTCGAGGCTCGGAGGTTCAAATCCTCTCATGCAAAAAATAAATAAAATTAATGACGGTTGGAAATTATTCAATACATACACTCTCGAAAGTGCATCAGCCAACCGCAGCACGTCTTGCATACGGCGTGTCTGCAGTTTTATATAATCCTGGCTGGAAACCTCTTATACAAAGTAGATACACCAATATCATAGATGATCGCCACCTTCTGACGCGGGACTTTTGATGCAATTAATCGCCCGGCCTGCGCCCATTGTTCTGATGTAAGTTTAGGACGACGTCCACCAATTCCTCCCTGTGCGCGAGCATCTTCCAGTCCAACTTTTGTCAGTCATCAATCAGTTCACGCTTCATTTCAACCAGGGCTCCATCACATGGAAAAATAAAAAAAACCATCGATGTTGACGTGTCAATGTTACCTGTCAGGCTACGGAAATCCCCCCCTTACCCGCAGCTACTCGGTAAGTATAATAAGATGTTTCCTGCTCCTTCCTAACCTGTCCAGTTTCCAGACAATTAAAGTATCACCTTATATAAGAGACTGCAGAGCGCACTTTAACCTAGGTCGTTCTGAAATAGTTGCAGATTCGATTAATCAACGATTATACTCCCCGGCACTCCAGAGGATCTGGTAAACATAAAGTCAGAGCTTGGGTATACTGGCACACAAATGGCAGATCTTGCAGGTGCAGCCAGTCATAGCCAGTGGCGAAAATACACGAGTGGTTCTGAGCCCCGCGCCATGTCATCACATATCTTGTTTTTTTATTGCTACCAATCTGACTTTGAGTACTAATGAGCTAGATAGAATTGTTGGTAATGACTCCAACTTACTGATAGTGTTTTATGTTCAGATAATGCCCGATGACCTTGTCATGCAGCTCCACCGATTTTGAGAACGACAGTGACTTCCGTCCCAGCCTTGCCAGATGTTGTCTCAGATTCAGGTTATGTCGCTCAATGCGCTGAGTGTAACGCTTGCTGATAACGTGCAGCTTTCCCTTCAGGCGGGATTCATACAGCGGCCAGCCATCCGTCATCCATACCACGACCTCAAAGGCCGACAGCAGGCTCAGAAGACACTCCAGTGTGGCCAGAGTGCGTTCACCGAAGACGTGCGCCACAACCGTCCTCCGTATCCTGTCATACGCGTAAAACAGCCAGCGCTGACGTGATTTAGCACCGACGTAGCCCCACTGTTCGTCCATTTCAGCGCAGACAATCACATCACTGCCCGGTTGTATGCGCGAGGTTACCGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAACCGTGTTGAGGCCAACGCCCATAATGCGTGCACTGGCGCGACATCCGACGCCATTCATGGCCATATCAATGATTTTCTGGTGCGTACCGGGCTGAGAGGCGGTGTAAGTGAACTGTAGTTGCCATGTTTTACGGCAATGAGAGCAGAGATAGCGCTGATGCCCGGCAGTGCTTTTGCCGTTACGCACCACGCCTTCAGTAGCGGAGCAGGAAGGACATCTGATGGAAATGGAAGCCACGCAAGCACCTTAAAATCACCATCATACACTAAATCAGTAAGTTGGCAGCATTACCTTGTTAGTTTGTCTTTTGTGTTTACCTGATTCGGGTAAACGCCTTTACCTGATTTAGGTAAACTTTTCTTACCTGATTCAGGTAAATTTACCTCTTTCAGGTAAACTTTATTTTTCTTACCCGATTCGGGTAATGTTGACCATTCACTGACCACATTATTGATGCCGATATTCCGCCCGCTCTGAATAAAAATCCCACGCTTTACCAGAACACTTTTTGCAGCAGAACACTTGTGCGGCAATATCCCGGTTAATTCGGAAAGTTGCTCGTTGCTAACCCAATCCAGTTTTTTATTAAAGCCATATGTTTTGCGCATGACAGCCAGAAAGACCAGAAGCTGGTGCTGTGTTAATCCGGCCAGCATCACAGCTTCCAGCAACTCATTTGCAATGCGCGTATAACCATCATCGAGATCTGCCACGCGCCGCTCCTTTTGTGCCACATCCGGCACAGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCCGCAGACCGACTTTCGCCATTTTTGAACCTGTCATATTGCCCCCAGCATGGTGGTGACCATCGCCATCAATGGACCAGCCAGATCCGGGTCCACTCGAAACATCGACACAATGCCTTCACTCATCTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACCGCCTGCTTTGCCTCACAGAGTTCCTTTTCCATTTCAGCCAGCCGAGCCATGAAGCTATCCTGCTCAACCAGGTGGCCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATAGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAGGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTAGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACCGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATTGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTGAAACAGAAAGCCTCTGAGCAGAAGGTGGCTGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGAGATGATGCGTGGAACAAATTACGACTCGGCGTCATCACGGCTTCAGAAGTTCACAATGTGATAGCAAAACCCCGCTCCGGTAAAAAGTGGCCTGACATGAAAATGTCCTACTTTCACACCCTGCTGGCTGAGATTTGCACCGGTGTGGCTCCGGAAGTTAACGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAATGACGCCAGAGCCCTGTTTGAGTTTACTTCCGGCGTGAATGTTACTGAATCCCCGATCATCTATCGCGACGAAAGTATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAGCTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAGTTCCGGCTCGGTGGTTTCGAGGCCATAAAGTCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACACGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGTATGAAGCGTGAAGGACTGCATTATGTCGTGGTTGAGCGGGATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGCGCGATCACTTTCGTCTACTCCATTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCACTGAAAGCACAGCGGCTGGCTGAGAAGATAAATAATAAACAGGAGGATATATGAGTCAGGTTGGTAATCATTCATTCGAATTTCCGGCATCGCAAGGTGTACAGGGTGGTACTGTTACACTCTTCCTTACCATACCAGGAAGATCGCTGGCTCGTTTCCTCGCTTCAGATAATTACGGCCATACACTGGAACGCTCTCAGCGAGAAATTAATCCAAATCGAGTACGAAAATTTTTAAATTATCTCACTAACGCAGACTCAAGAAATGAGTCTTTTATCATTCCCCCTCTCGTAGGTAACTGTGATTCGAATATAGAATTTGTACCGTTTGGCAACACAAATGTTGGTATAGCCAGAATTCCCCTCGACGCCGAAATAAAACTTTTTGATGGCCAACATCGTGCAGCTGGCATTGAGATATTTTGCCGAAGTTCCCCATCAACGCTCATGGTTCCCATGATGCTTACAATGAATCTGCCGCTAAAAACCCGGCAGCAGTTCTTTTCGGACATAAATAACAACGTTTCTAAGCCATCAGCGACCATCAATATGGCGTATAACGGCCGGGATGATATTGCTCAGGGAATGATATCCTTCCTGACCCAACATACTGTATTTGCCGATATAACCGATTTTGAACACAACGTAGTGCCATTAAAAAGTAATATGTGGGTGAGTTTCAAGGCACTCACTGATGCAACGTCAAAGTTTGCTAGGAACGGCAATCAACAACTTGAAATGGGATATATAGAATCTGTCTGGGAGGCATGGATTACACTAACTCAGATTGACTCAATCCGACATGGTGTACACCACGCTACGTACAAGCGCGATTATATTCAGTTCCATGGAGTAATGATTAACGCTTTCGGTTTTGCGGTTCAACAGATGATGGTTAATCATTCCATCGCAGAAATAACTTCTATGATCGAAAAACTCTGTGCAACTACCAGCTCTGCAGAAAGAGAGGATTTTTTTCTGATGGATAACTGGGCGGGGATCTGCACGAAAGCCAGCCAGGAAAAACTATCGGTTATTGCCAATGTGGCAGCGCAGAAAGCAGCAGCAAACAGACTGATACAAGCTTTTACCAAAGGAAGTCTGGAAACAACTTAATGAATCAACATTGTCTCATATCAGCATGCTGTACGGCGTCTTTAAGGAACGGTGAGCATGAAAAACAAAATCATCATGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATCATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGCTAGATGAACGAAGACGCCTGGCCGTTCAGGGTTGTCGGACTTGTGCAAGTTGCCAGGAGGAGATCGAACTTAAGAACAAACAATGGGGACTGTGATGGCCTCAAAGCAGCAAATTTCAACATCGTCCAACTGAGGTGTAAAAATGTTCAGAATCATTTTTCCTAACACCTGGTACGTCGACCACCACGGCACTCCCTGCAAAATCCTGCGTTCTACCCACAACAAAGTTCACTACATCCGAAAAGGCAGAACATGTATCGCCAGCATGTTCCGCTTTAATCATGACTTTGAACCTGTGAATAAAGCTGATGCAGATCGGATAGCAGAAGAGATCGAAACGGCAGAACACATTAAGAAGTTACGTGCCATACGCAGGAAATAGAAAAATTGATAAATTCAATACTGCATTTCTCAGCATTAAATTTATCTCTATGACCAGTCAAGAGATGTACCTGCCATGAGCTTAATATCATGTCAGATATATCGGTCACAAACTCCCTCAGCAGCTAAGAGGAGGACAAATGTCTCGACTAATCACTTTACAGGACTGGGCTAAAGAAGAATTTGGGGACTTAGCACCAAGTGAGCGAGTTCTGAAAAAATACGCGCAAGGGAAAATGATGGCCCCACCCGCTATAAAAGTTGGTCGCTACTGGATGATTGACCGAAATTCCCGTTTTGTAGGAACGCTGGCAGAACCGCAACTCCCAATAAACGCAAACCCAAAACTCCAACGGATAATCGCTGATGGCTGCTAGACCCCGATCTCACAAAATCTCTATACCCAATTTATATTGTCACGAACGGTGCAATAGTGATCCACACCCAACGCCTGAAATCAGATCCAGGGGGTAATCTGCTCTCCTGATTCAGGAGAGTTTATGGTCACTTTTGAGACAGTTATGGAAATTAAAATCCTGCACAAGCAGGGAATGAGTAGCCGGGCGATTGCCAGAGAACTGGGGATCTCCCGCAATACCGTTAAACGTTATTTGCAGGCAAAATCTGAGCCGCCAAAATATACGCCGCGACCTGCTGTTGCTTCACTCCTGGATGAATACCGGGATTATATTCGTCAACGCATCGCCGATGCTCATCCTTACAAAATCCCGGCAACGGTAATCGCTCGCGAGATCAGAGACCAGGGATATCGTGGCGGAATGACCATTCTCAGGGGATTCATTCGTTCTCTCTCGGTTCCTCAGGAGCAGGAGCCTGCCGTTCGGTTCGAAACTGAACCCGGACGACAGATGCAGGTTGACTGGGGCACTATGCGTAATGGCCGCTCACCGCTTCACGTGTTCGTTGCTGTTCTCGGATACAGCCGAATGTTGTACATCGAATTCACTGACAATATGCGTTATGACACGCTGGAGACCTGCCATCGTAATGCGTTCCGCTTCTTTGGTGGTGTGCCGCGCGAAGTGTTGTATGACAATATGAAAACTGTGGTTCTGCAACGTGACGCATATCAGACCGGTCAGCACCGGTTCCATCCTTCGCTGTGGCAGTTCGGCAAGGAGATGGGCTTCTCTCCCCGACTGTGTCGCCCCTTCAGGGCACAGACTAAAGGTAAGGTGGAACGGATGGTGCAGTACACCCGTAACAGTTTTTACATTCCACTAATGACTCGCCTGCGCCCGATGGGGATCACTGTCGATGTTGAAACAGCCAACCGCCACGGTCTGCGCTGGCTGCACGATGTCGCTAACCAACGAAAGCATGAAACAATCCAGGCCCGTCCCTGCGATCGCTGGCTCGAAGAGCAGCAGTCCATGCTGGCACTGCCTCCGGAGAAAAAAGAGTATGACGTGCATCCTGGTGAAAATCTGGTGAACTTCGACAAACACCCCCTGCATCATCCACTCTCCATCTACGACTCATTCTGCAGAGGAGTGGCGTGATGATGGAACTGCAACATCAACGACTGATGGCGCTCGCCGGGCAGTTGCAACTGGAAAGCCTTATAAGCGCAGCGCCTGCGCTGTCACAACAGGCAGTAGACCAGGAATGGAGTTATATGGACTTCCTGGAGCATCTGCTTCATGAAGAAAAACTGGCACGTCATCAACGTAAACAGGCGATGTATACCCGAATGGCAGCCTTCCCGGCGGTGAAAACGTTCGAAGAGTATGACTTCACATTCGCCACCGGAGCACCGCAGAAGCAACTCCAGTCGTTACGCTCACTCAGCTTCATAGAACGTAATGAAAATATCGTATTACTGGGGCCATCAGGTGTGGGGAAAACCCATCTGGCAATAGCGATGGGCTATGAAGCAGTCCGTGCAGGTATCAAAGTTCGCTTCACAACAGCAGCAGATCTGTTACTTCAGTTATCTACGGCACAACGTCAGGGCCGTTATAAAACGACGCTTCAGCGTGGAGTAATGGCCCCCCGCCTGCTCATCATTGATGAAATAGGCTATCTGCCGTTCAGTCAGGAAGAAGCAAAGCTGTTCTTCCAGGTCATCGCTAAACGTTACGAAAAGAGCGCAATGATCCTGACATCCAATCTGCCGTTCGGGCAGTGGGATCAAACGTTCGCCGGTGATGCAGCACTGACCTCAGCGATGCTGGACCGTATCTTACACCACTCACATGTCGTTCAAATCAAAGGAGAAAGCTATCGACTCAGACAGAAACGAAAGGCCGGGGTTATAGCAGAAGCTAATCCTGAGTAAAACGGTGGATCAATATTGGGCCGTTGGTGGAGATATAAGTGGATCACTTTTCATCCGTCGTTGACATTATATTGCAAATTAGATAAGCGAACCGGAAAGGTATATTGGCAATACAAACATCCACTATCCGGTCGTTTTCATAGCTTAGGAACTGATGAGAATGAAGCAAAACAAGTTGCTACTGAAGCAAATACCATTATTGCTGAACAACGTACCAGACAAATATTAAGCGTCAATGAGCGTCTGGAAAGAATGAAAGGCAGGCGCTCAGACATTACGGTGACAGAATGGCTTGATAAATATATTTCTATCCAGGAGGACAGGCTGCAACATAATGAACTAAGACCCAACTCCTATCGGCAAAAAGGCAAACCCATTCGTCTTTTCCGTGAGCATTGTGGAATGCAACACCTCAAGGATATTACCGCACTTGATATTGCCGAAATAATTGATGCTGTAAAGGCTGAAGGTCATAACAGGATGGCGCAAGTCGTGAGAATGGTGTTGATCGACGTCTTCAAAGAAGCACAACACGCAGGACATGTTCCGCCAGGATTTAACCCAGCGCAGGCAACAAAACAACCGCGAAATCGAGTAAACCGCCAAAGATTGTCACTGCCCGAATGGCAGGCAATATTTGAAAGCGTAAGCAGACGGCAGCCCTATTTAAAATGCGGCATGCTACTTGCTCTTGTTACTGGACAACGTTTAGGCGATATCTGCAATTTGAAATTCTCTGATATATGGGACGACATGTTGCACATTACTCAGGAAAAAACCGGTTCAAAACTTGCTATTCCGCTTAACCTGAAATGCGATGCTCTGAATATTACCCTTCGTGAAGTTATATCTCAGTGCAGGGATGCTGTTGTTAGTAAATATCTGGTCCATTACCGTCACACTACCTCTCAAGCAAACAGAGGAGACCAGGTTTCTGCAAATACTCTGACAACGGCTTTTAAAAAGGCCAGGGAAAAATGTGGCATAAAATGGGAGCAAGGAACTGCGCCCACATTTCATGAGCAGCGATCTCTGTCAGAACGGTTATATCGGGAACAGGGTCTGGATACGCAAAAGTTGTTAGGCCATAAATCCAGAAAAATGACCGACCGATACAATGATGATCGTGGTAAAGACTGGGTTATCGTAGATATCAAAACAGCATAGAAAATAGCCAGTTTTGGGGAAGGGTTTTGGGGAAAGTTTTGGGGAAGATTTTACATCATCATAAAACAACGGGCGTATAACACGCCCGTTTCAATATTTAACACATGTAGAGATTACATGTTCTTGATGATCGCATCACCAAACTCTGAACATTTCAACAGTTTAGCGCCTTCCATCAGACGTTCGAAGTCATAGGTTACGGTCTTCGCATTGATTGCGCCTTCCATACCTTTAACAATCAGGTCTGCGGCTTCGGTCCAACCCATGTGGCGTAACATCATCTCAGCAGAGAGAATAATAGAGCCTGGGTTTACTTTGTCCTGACCGGCATATTTCGGCGCAGTACCGTGGGTGGCTTCAAACAGGGCGCATTCGTCACCGATGTTTGCACCTGGGGCGATACCGATACCGCCAACCTGCGCTGCCAGGGCGTCAGAAATGTAGTCACCGTTCAGGTTCATACAGGCGATAACATCATATTCAGCCGGACGCAGCAGGATCTGTTGCAGGAATGCATCAGCAATCACGTCTTTAATGACGATCTCTTTGCCGGTGTTCGGGTTTTTAACTTTCAGCCACGGGCCGCCGTCGATCAGTTCACCGCCAAACTCTTCACGCGCCAGCTGGTAGCCCCAGTCTTTAAACGCTCCTTCGGTGAACTTCATGATGTTGCCTTTGTGCACCAGAGTCACAGAGTCACGATCGTTAGCAATTGCGTATTCGATCGCTGCACGAACCAGACGTTTGGTGCCTTCTTCCGAACACGGCTTAATACCGATACCACAATGTTCCGGGAAGCGAATTTTCTTCACCCCCATCTCTTCACGCAGGAATTTAATCACTTTCTCGGCGTCGGCAGAGTCTGCTTTCCATTCGATACCCGCATAAATGTCTTCCGAGTTTTCACGGAAGATAACCATATCGGTCAGTTCAGGGTGTTTAACCGGGCTTGGAGTGCCCTGATAGTAACGTACCGGACGCAGGCAGATGTAGAGATCCAGTTCCTGGCGCAGGGCAACGTTCAGAGAGCGAATACCGCCACCAACAGGAGTGGTCAGCGGACCTTTAATGGCAACGCGATATTCACGAATCAGATCAAGGGTTTCAGCAGGCAGCCAGACATCCTGACCATAAACCTGTGTGGATTTTTCACCGGTGTAAATTTCCATCCAGGAGATTTTACGCTCGCCTTTATAGGCTTTCTCGACTGCAGCGTCGACCACTTTCAGCATGGCTGGGGTTACATCTACACCGATTCCATCACCTTCAATGTAAGGGATAATCGGATTTTCAGGAACGTTGAGTTTGCCGTTTTGCAGGGTGATCTTCTTGCCTTGTGCCGGAACAACTACTTTACTTTCCAT
Protein sequences of DBSCAN-SWA_5 >CP034966|2684012:2696750|2689054_2689366_+|QAS90481.1|DBSCAN-SWA MPLLCCEATYIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKASEQKVAA >CP034966|2684012:2696750|2694261_2695386_+|QAS92320.1|integrase|DBSCAN-SWA MTFHPSLTLYCKLDKRTGKVYWQYKHPLSGRFHSLGTDENEAKQVATEANTIIAEQRTRQILSVNERLERMKGRRSDITVTEWLDKYISIQEDRLQHNELRPNSYRQKGKPIRLFREHCGMQHLKDITALDIAEIIDAVKAEGHNRMAQVVRMVLIDVFKEAQHAGHVPPGFNPAQATKQPRNRVNRQRLSLPEWQAIFESVSRRQPYLKCGMLLALVTGQRLGDICNLKFSDIWDDMLHITQEKTGSKLAIPLNLKCDALNITLREVISQCRDAVVSKYLVHYRHTTSQANRGDQVSANTLTTAFKKAREKCGIKWEQGTAPTFHEQRSLSERLYREQGLDTQKLLGHKSRKMTDRYNDDRGKDWVIVDIKTA >CP034966|2684012:2696750|2692057_2692294_+|QAS90486.1|DBSCAN-SWA MSRLITLQDWAKEEFGDLAPSERVLKKYAQGKMMAPPAIKVGRYWMIDRNSRFVGTLAEPQLPINANPKLQRIIADGC >CP034966|2684012:2696750|2693436_2694219_+|QAS90488.1|DBSCAN-SWA MMMELQHQRLMALAGQLQLESLISAAPALSQQAVDQEWSYMDFLEHLLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRSLSFIERNENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRFTTAADLLLQLSTAQRQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQVIAKRYEKSAMILTSNLPFGQWDQTFAGDAALTSAMLDRILHHSHVVQIKGESYRLRQKRKAGVIAEANPE >CP034966|2684012:2696750|2692417_2693440_+|QAS90487.1|transposase|DBSCAN-SWA MVTFETVMEIKILHKQGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRGFIRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVLGYSRMLYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTGQHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMTRLRPMGITVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSMLALPPEKKEYDVHPGENLVNFDKHPLHHPLSIYDSFCRGVA >CP034966|2684012:2696750|2690039_2690198_+|QAS92319.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSITKRGWVFPGLSVIRNPLKAQRLAEKINNKQEDI >CP034966|2684012:2696750|2691678_2691918_+|QAS90485.1|DBSCAN-SWA MFRIIFPNTWYVDHHGTPCKILRSTHNKVHYIRKGRTCIASMFRFNHDFEPVNKADADRIAEEIETAEHIKKLRAIRRK >CP034966|2684012:2696750|2689362_2690043_+|QAS90482.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWNKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEICTGVAPEVNAKALAWGKQYENDARALFEFTSGVNVTESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP034966|2684012:2696750|2695499_2696750_-|QAS90489.1|DBSCAN-SWA MESKVVVPAQGKKITLQNGKLNVPENPIIPYIEGDGIGVDVTPAMLKVVDAAVEKAYKGERKISWMEIYTGEKSTQVYGQDVWLPAETLDLIREYRVAIKGPLTTPVGGGIRSLNVALRQELDLYICLRPVRYYQGTPSPVKHPELTDMVIFRENSEDIYAGIEWKADSADAEKVIKFLREEMGVKKIRFPEHCGIGIKPCSEEGTKRLVRAAIEYAIANDRDSVTLVHKGNIMKFTEGAFKDWGYQLAREEFGGELIDGGPWLKVKNPNTGKEIVIKDVIADAFLQQILLRPAEYDVIACMNLNGDYISDALAAQVGGIGIAPGANIGDECALFEATHGTAPKYAGQDKVNPGSIILSAEMMLRHMGWTEAADLIVKGMEGAINAKTVTYDFERLMEGAKLLKCSEFGDAIIKNM >CP034966|2684012:2696750|2684012_2685968_-|QAS90479.1|DBSCAN-SWA MDTAEHDGKFAWASFYEAFANALLTWRNRRDELVKGIHLIASGVEGMSHLQDKSIMGEIFPLKDICPFTTMGLFNRNLTDSNRKIIAAKLANLLGVNEPIPDSFAGIPLLNNQKSWFFGYEKSRDPNDIECLWEMFSQAMTFADNQNTNSADFTAAYDIASTVMNVGWNLTMGLYWTRPWFYPTLDSQSQYYIQTVLNIKIIKNGAKGRCSGQSYLSLMRALNEVFTQPNYPVHSFPELSLSAWNTDLSQSKDEVANRTWKASLLNKIKALCLKKNGPYFTASELEENYLDVFKVEHPESKTVTSSIYFYLQKLCNDDELKSLESGNYEYLNFEKNESQTITEETVEESAPLPKLTHVPYDISHLVQDGCFLEEAKIQLTLQRLIDKKNLILQGPPGTGKTWLARRLAYCLMGEKAPERISAVQFHPNLSYEDFIRGWRPGKEGQLTLIDGPFVNAIKTAVNNPTSKYVVIIEEINRGNPAQIFGETLTLMEADKRTPTEALSLSYPKNDDEKIYIPENLYIIGTMNIADRSLALLDLALRRRFAFIDLKPAFNDAWRNWVNYNYAIDFDMLAFIKSRLTVLNDMLAKDVTLGPQFCIGHSYVTPAIGQKINDAQAWYEQVVDTEICPLLAEYWFDAPNKVDEARKVLLAK >CP034966|2684012:2696750|2688332_2688872_-|QAS90480.1|DBSCAN-SWA MQLLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGHLVEQDSFMARLAEMEKELCEAKQAVILNAPRHQKLKEMSEGIVSMFRVDPDLAGPLMAMVTTMLGAI >CP034966|2684012:2696750|2690194_2691259_+|QAS90483.1|DBSCAN-SWA MSQVGNHSFEFPASQGVQGGTVTLFLTIPGRSLARFLASDNYGHTLERSQREINPNRVRKFLNYLTNADSRNESFIIPPLVGNCDSNIEFVPFGNTNVGIARIPLDAEIKLFDGQHRAAGIEIFCRSSPSTLMVPMMLTMNLPLKTRQQFFSDINNNVSKPSATINMAYNGRDDIAQGMISFLTQHTVFADITDFEHNVVPLKSNMWVSFKALTDATSKFARNGNQQLEMGYIESVWEAWITLTQIDSIRHGVHHATYKRDYIQFHGVMINAFGFAVQQMMVNHSIAEITSMIEKLCATTSSAEREDFFLMDNWAGICTKASQEKLSVIANVAAQKAAANRLIQAFTKGSLETT >CP034966|2684012:2696750|2691412_2691631_+|QAS90484.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPLDERRRLAVQGCRTCASCQEEIELKNKQWGL |
13 | Enterobacteria_phage(33.33%) | transposase,integrase | attL 2681985:2682008|attR 2695453:2695476 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
3011902 : 3020672
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >CP034966|3011902:3020672|DBSCAN-SWA TTTAACTGGAAACTTCCGTTGACCGGCTGTATTCATGGCTGCGAATTTTCGCCATCAGCTCGTCTGTCAGTTCGGACACCCACTGGATAGCCAGCCGCTTTTCTTCGTCGCTGCACTCACTAGCCGCTACAAGCTTGATAAAAAAATCAATACGCTGAAGCTTCAATGACTCCAAAAGATAGTCCTGCATCTTCCCTCCTATCATTACACGGATACACAAAAACTGTATATACACCCACTGTTTATATAAACAGTATAATAGGAACAGAAAAATGTAAAACTGTTTTTTGTCAGTTAATTGGATGTACTGATGTCGGTCAATAAAGCACAAAATGTTAAACAGCAGCCTTAGTACCATTGACGCCATTTGTCATCTTCCTGCAGCCTCTGGTTACGGTAAAAAATACGTAAACCGGCACCGGATGGAATGCTGCCGCCACGCAGAAGCAAATCAATCTCAGATGCACTACCTTCAAACCCCCTGGTTGTCAGTTCTGCCTCAAGCTGCAGGCGCTGCTGCTCCAAAATACTCTGTATGTATGCTTTTTTCCGCTTCGGTTTTACCAGTCTCAACCTGGCTGTCAGCTCCCGCCGTTCCTTCTGGCCCATGTTGTGGAGATATTCCTGCAGCTCCTTCTCATCCATGGTTTTAATATCGGGTAAATCACCCCCTGATTTGTTCAGATTTTCAACAGGGGGACAGTTATTGCCACGAGTCCAAGGGGCGCAAGCGCCCTGGTCGGCTGCCGCCTCCTGAACGTCAACGGCTTTACGAACCATTTTCCACTTCACTGCATGAGTGCAGATCTTGCCCTCTGCAATGGGTGACCAGATGCCATAAATACGAATGCCATGATCGCCATAGGCGGTCGGCTCTTCGTTGATTTCATAAGCGGTTCTGATGAGGTGATATTTACGGGGAACCAGTACGCCGCCCTGCTTCATGATGTAGGTGGCAAAACAACCAGCATCAGCAGCAGCCAGAATGGCATCAAGACGCGGGTTATCCAGTACCGGCGCACCTGCTTTTTTGTCACCCTGTTGCCTTGCCGCCTGACCAGCCAGCAAGCGAAGTTCACGGTAAGCCTGACGCCCCGGAATACCAAAGAAGCGGAATTGCTGAACACGATGCAGAGACGCCCAGGCATTCACGTATTCAGCGTTATCACGCAGAGATTTACCCGTTTCCTTGCTGATCTCGCCAGCCAGACCACGACCGTCAATGTTCTTACTGATATATTTCGCGATGTAGCTTGTCGGCGTTCCTTTGCGCGGGTTAATCAACTCAGACTTAAAGCGCGGCCCAGTGTTATTGCCCAGCTCCTCGCGGTCTTCACGGATGGCAAACTTACGCAGTAATGCAGTGATGGCACGGCGGTCTTTTTTGCGCATGAAACACAACAGGTGCCAGTGAACTGTGCCATCATGATGCGGCTCAGCCACCCGCACGCCATACCAGCGCAACCCGGCTTTGTGCATCGCCTTACGAAATGCAGCAAACATACCGACCAGATAATCGCTGCTTTGTCTTACCGTCGCGTTTGTCCAGGTCGGGTTTGGTCTGCCGTTATTTAGCGTGGAATGGAAACGCGACGGACAGGTGATAGTGTAGAAAACGGCGCAGTCACCGCGCATTTCCGCGATAAGCTCCAGACCTTTAACACAGGCCATCATCTCATTGCGGCGATGCGCAGGGTTGCTGCTGCTGGCGTTTACCACATCCTCCATGTCCAGCGTGTCGCCGTCTTCGTTCACCAGTTCATGAGAACGGAAAAACTCCAGCGACTTACGACGCTGCTCACGTTTATGCATCACGGCTTCATAGCTGACATAAGGAGATGCTTTTTTGCTGACCAGGCAAACAGCGCGCAACTGCTCTTCCCGCCATTCGCAACGCATCTTCCATAATTTCCGATACCACCAGTCGGCGCACAACATACGCGCCAGCGAACCCGGAATGAGTTCATAGGGCACGGGTTTACGGCGGTTTCTTTTCCGACGGAGTTGCTCAAACGCAGGTGGGATGACATCCAGACGCAGGGTTTCCGCTGCCACCTTTTCCCATGTCTTGCGGATTTCTTCTGGCTTAACGTCATCGGTGGCATACAAATCACCACAAGCTGCATCAAGGCACATACTCATATGCGCAGCTACCAGGGTGGAAAGGCGTTTCACCTGATCCTGACTCATTTCAGGCAGGATCAGCAGGCCGTCCAGCCCTTCATGGCTTGCCATAAAGCGAAAAGATGCAGATAGCTGACTGTCGCGTACATGCTCCAGTCGTTCCAGACATGGCTTAATCGTCTCACGTAAATAGCGGGAATAAGCCTTTGGCCTGCCCAGGCTGCTGAAGTATTCAATACGTTGCATCAGCGGCTTGCTGATATGGGAAGGCTGGGCGTTGACGTCCGCCAGAATGACCATGTCTGAATTAAAACGCTGCTGCTCATGCGCCAGCTTTGCCCGGCTAATGAGCTTATCCTGCTCCATTTCGCGCTGGACAGGATCACGTGATTCATTAAAGAAATAACGCTCCCAGACCTGATCACTCAGTGCCTCGCGGCGCAACTGTTCCTGCTCGTTATCGGCAGCGTACAGAGTGATCAGGTTTGAAAGCGTAGAAACCGGCGCAACTTCCGCCGGGTCCAGATAAGGGTTAATGGCCTTTTTCGGGCTGTTCCATGAGAATGCTGCGGCAGCCTCGTTAAAGCCGCAGCAGTTGTTCATATCGGCATGACTCATGCACGTACTCCGTACACGGCAGAACTATCCACGCCACGCGAATAATCAAATCCCATCCAGCAGCGCGGCCCGGAAACAGCAATGATTTCTGTTGCTGATTTACCCTCGCCAGCTGCCACACCGATGCTGCGTTTTACCTTGATATAGTGGTGAGTAAAATTGCGATACAGCGAACGGATCAGGGATGTGTCACTGTTAGAAACAATGACCGGATGTCCTTCTGATGACCGATGTTCAAGAACGGATGCCAGGTGATACTGGTCATCTTCAGTGAAACCATCAGTGTGATAGCCGGAAAACGTACCGTCATATGGCGGATCGCAATACACCACATCTCCCGCCTTCAACATCGCCAGCGTTTCATCAAAGCTGGCGCAGATAAACGTCGCTCGCTGGGCTTTTTCTGCAAATGTGCGAAGTTCTTTTTCAGGGAAATACGGATTTTTATAATTACCGTAGGGAATGTTGAAATGCCCGCTCTTGTTATAGCGACATAAACCACGGTAACCGTGACGATTGAGATACAGGAAATATACCGCTTTCATGAAATCAGTAATTTCAGTTGAGCAGTTAAACTCCTGCCTTATGTTGTAATAAGCCACCTCCCTGTTTGCGATCTCAAATAAAACTCTGGCGCGAGATATAAACGATTCACAATCAGCGGCAACCTTTTTATAGAGGTTGATTAAATCAGGATTAATATCCGCAACCAGATAGCTGGGATAATCCGTTTCCATCATCACAGCACAGGAACCCGCGAAAGGTTCAACCAGTCGCGGGCCAGCAGGAAGGTATTTTTTCAGTTCTGGCATAATGGCGGTTTTATTTCCCGCCCATTTCAGGATGGTGCTCATACAGCACCTCCGTTGTAATGTTTGCCTTTCAGTTCTGCGATTTCCTGACAGGTAATGCAAAGCTGCACTCCCGGAATGGCGCGGCGTCGTGCTGGCGGAATTGGCGCTTCACATTCAATGCAAAGTACGCGTGACACGCCCGGTGATTTGGCACGGGCTGCACGGATATGGCGCTGGCGTTCTTCTTCAACGCGCTGCTGTACGAGATCCATTGCATCAGCCATTAGTGGATCTCCTGCGCTTCGTTCTGGATTGCTTCAGCAGTCACGCGCAGCAGTTCTGCCGCTTCCACGTGGTTTAGCTGGCGGGATGAGATATGACACGCCAGGCTATCAAGGCGAGCAGCCATTGCTTCAGCCCTTGCCCGGCGTTCTTCCAGACGAGCCTCTGTCAGTAAAATATTAAGCCCTGCGTCATCCGGTCCGGTTTTAGTCGTGAGGGTTTCAATATTACGCATAATCAATTCTCCTGAATTTAGATAAAGGGATGCCCGGCGGGTTTACGCCATTAATTTCATTAGTTGGTTAATTCGGCATGGTTAGCCGTCTGGGAAATAAGCTCACCACTGCACGAAAATGATTCATTGCTTTAATCAGCTCCCGCTTTTCGTCAGTGGTCAGCTCATTAATGCTGATGCTATGACGTTCAGCTGGAATTTTTGCCATAAAGAATATGGCAGCCAGTGCCCGTTTATTTTGTTCATTATTGATATCCCGTGGATCACGCATATCTTTAATAAACCGCTCAAGCTCTGACTCAATATTCAGGCCAAAAACTTTCGCCCTTAACTCCGCAATGTGATTAAGTCCATTCAGGCGTTCACCGGGGCTTAATGGAACAGTTGCTGCAGCGCCATTAATTGCCATACTTCATATCCCCCAAACGCAGCTATCGTTCTTTGTTCTTACGGTAACGCTCAAGAGGAGATACATTTTTTCGTATCGTCTCTTTAACCTGCTCTCCCCGTAAAAACGTCCCATCCTTTAACGTGAAAAAGTAACTGCCATCGCCCGACAATGACGGATAGCAACAGAGCAAATCATCTTCAGGTACTGAATAACTCTCCCCTCTGTAACGAAACTGATAAACCACTTCACTTTCTGCCGCATACATTTGGACTTTCTCCGTTTCCTCGTGGTCAATTCAGACAGCAATTCATCTTGTGAATGACATGGATGCCAGCGTTTTCCATCCTCACCCGTGATCCAGCCGTGACCGTAGTGCATTGCCGGGCTTTGTTTTACCAGCAGCGATGCAAATGATGGTTCTTTCGTCAGCATAAACACCTCACAGCAAACCGAATGAAGCACCAAGGCCAGTCATGGTATCAACTGCACTCGCCATCGCAGGATTAGCCTGTAAACGGGCTTGCAATGAAACAGCAGCCAGCGCCATCAGTCGTGTTACAGAGTTAATGCTGCTGATAGCATCACGACGACCTGCACTGGTTTTTACATCGCCAGATACCGCACCTGCAGCAACACGCCCGATCTCTGCGGTTGCACTCATGACGTAATGCGGTAGTTTCTCTTTTGCCACCTCATTAATCGGAACACATGGCAGACAATGAATCTGTGCCAGAAAACCATCTACCAGCGTTGAATCTTCAGTCAGATCGGTAAGCAACCAAATTTCTGGTGCGGTTAATAAATGAGGTTGAGCTGGGTTCAGTTTGTTCCGCAGAATCTGCACATTCATGCCTGCACGTTCTGCCAGTTGCACCAGATTGTGGCGCAGTGCGAATGCACGACAGGCTTCATCAAAATGTGGATGTTTGGAAACTTGGTAATCAAACATGGTCGACACCTCTGATGTATCCCAAAATGGAACTAGTTGAATACAACATTGCAATCAGTAAGTGCATCAACGGTAAGAGCAGCAAGGTTGATCATTACCTTTTCTCTTTTCTTGTCTTTCCGAAGGCGATGCCGAGGGATGCGACCGTCAGCCAGCATATCGTTAATTGTGTCGATTGAAAGACCAGTAAGTTCGCTATAACGCTCAATTGTGACATGTGGCGTATTCAGGGTTATTGAAATGTTAGGGGTCATGATGCAACATCTCCTATTGGCTTGTGGTGAGTCAGTTTTAATCGTGGCTTTAACTTCACATTTCGGAGAATAGGATCACAAATCGGTTATGTCAACACATGAAATCACATTTCGCCATGTGGACGAAAAAAAGAAATCCTTAATCATGCAGAATCGCGGAGGGCAATCGGTTATAGATCGGATACTGAAAGCCTATGGTTTTTCTTCCCGACAAGCATTCTGTAATCACCTAGGTATATCGCAAAGTACAATGGCGAACAGGTATGCCCGTGACACTTTCCCTGCTGATTGGGTTGTTATCTGTAGCATGGAGACTGGAGTGCCGGTCGAGTGGTTGGCATTTGGCACTGATACCGAGAAGGGAAGCATTACAAATAATGCAGAAAAAAGTCACAACAATTGTGACAGCAAGCATCAACATCTCAATAGAGAACAAGACATCCAAAATGAGAACTCTTTTACTATTAACCAAGGTGGAAAAGCAGCAATAGAGCGAATCGTTTTGGCTTATGGATTTAAGACAAGACAAGCTTTAGCTGATCATATTGGTGTATCAAAAAGTACATTAGCCAATCGTTACATGAGAGATACCTTTCCTGCTGACTGGATTATTCAATGCTCACTGGAAACCGGTGCTTCATTAACATGGCTAACCACTGGTAACGGGGCAATGTTTGAAAAGCCTCGAAACGATACTATCACTATCCCATATCATAAAATAATTGATGGATCTCTTGCTCAAGAAACCTTCTTGACTTTTGACTCTAAGTTGTTAGAAGGAACCTTTCTGCAACCTTTAGCAGTATTCATTGATGAGGAAATATATATTGTAGAATCAAAATTTAATGAAGTTACTGATGGCAAGTGGCTTGTGAATATTGAAGGGAAAATAAGTATCAAAGATTTGACTCGCATACCCGTTGGTATGGTTAAAGTTGTAGGCACTAACGCAAGTTTTGAATGCTTACTTACTGACATTATCGTTTTGGCAAAATGTAAAAGAGTTTTTACTAAAAATGTATAAAGAGAAACATCATGACTGAACCAACCAATAAAGATAGCGAAATAAAAAAACACCTATTAGAATTTCTTGATTCACAGTCTGAAAATATAGCAAAACACTTCTACTCTCATATAAAAGACTTAATAGAAGCAGGAGAGCTTTCTGAAGCTCATAATAACCTAGCGCTAATTGAAAAATACATAACTAGGCCACCGATGGATGAAGAACCCAATATAAATGAAAATAAAGCCAATAAAAGAAAAAATGTAAAATCACTTGAACCTAATAATTATGTAGAACATATAATACAATTAGAAGAACGAAACAGCATATTAACTCTACAGTTAGAGCATTATACTCAGGATCTTAATAGAAAAAACGCAATAATCGAAAACAACGTAAAACAAATTAATTCATTGATTAGTGAAAATAAGGAACTCCGTAGCCAAGTACAGCAACAAAGAATCGATGATAAAATCCCCACCTATGTTAACGATGTTAAATCAGATCTTGGTAGTGATGACAAACATTTTATATTGATGTCTATTATCTGGTCTATTGCAGGGGTATTTTTTGGCTTCCTTGCAGTAGTATCTGCTTTTTTTACATTATACATGAACTTAGATTTAAAAAATCTCACTAACCTTCAGTTAATATATATCTTCACGCGAGGATTAGTTGGAATCGCCATTCTTTCATGGCTATCATATATCTGCCTTAGTAACTCAAAAAAGTACACACATGAATCGATCAGGCGAAAAGATCGTCGACATGCTTTGATGTTTGGTCAAGTTTTTTTGCAGATATACGGTTCTACAGCAACTAAAGAGGATGCAATAGAAGTCTTTAAGGATTGGAATATTTCAGGTGACTCTGCATTTTCAGGTCAGACAGAGCAACCACCGAGTTTTGCGTCATTTTTGAATACAATCAAAGACAAAGTTAAAGTAACTGGAAGTGATAAAGAAACAGATTAATCATGAACATGTATGCTACTAAGTAAAAAATACATTGAATACTGTTGTTATATACAGTTAAATTTAGCCCTCTGATATGAGGGCATTTTTTATGGCAGTACGAAAACTCACCACAGGAAAATGGCTTTGCGAATGTTACCCCGCCGGACGTAGCGGACGCCGTGTGCGTAAACAATTCGCCACCAAAGGCGAAGCACTGGCCTTCGAGCGATACACCATGGGGGAAATAGAAGCAAAACCCTGGCTGGGCGAATCAGTGGATCGTCGGACACTGAAAGATATGGTTGAGCTATGGTTCAAATTACATGGCAAATCTCTTACTGCCGGACAGCATGTCTACAACAAGCTGCTGTTGATGGTTGACGCCTTGGGAAATCCCCTTGCAACTGATCTCACCTCAAAAATGTTTGCTCACTATCGAGATAAACGCCTGACAGGCGAGATCTACTTCAGCGAGAAATGGAAGAAAGGAGCAAGCCCGGTCACCATTAACCTGGAGCAAAGCTATCTAAGTAGTGTTTTTAGCGAACTATCCCGTCTGGGCGAATGGTCGTATCCGAACCCACTGGAGAACATGCGAAAATTCACCATCGCAGAAAAAGAGATGGCATGGCTTACCCATGAGCAGATTGTTGAATTGCTGGCTGATTGCAAACGTCAGGACCCAATTCTGGCACTGGTAGTTAAGATATGCTTAAGCACAGGCGCACGCTGGCGTGAAGCCGTAAATCTTACCCGCTCACAGGTGACCAAATACCGAATTACCTTTGTCAGAACGAAGGGGAAGAAAAACAGAAGCATCCCTATCAGTAAAGAGCTTTACGAAGAGATCATGGCGCTCGATGGGTTCAATTTCTTCACAGACTGCTATTTTCAATTTTTATCCGTGATGGAAAAAACGTCTATCGTGCTCCCTCGCGGTCAACTCACACACGTTCTGCGCCATACGTTTGCAGCGCACTTCATGATGTCGGGTGGAAACATTCTGGCCTTACAAAAAATTCTCGGACACCACGATATAAAAATGACTATGCGTTACGCACATCTGGCACCGGATCATCTGGAAACGGCGCTCCGTTTCAATCCTCTGGCAACGCTGCCAAGTGGCGACAAAGTGGCGGCAGCGGTTGGCATTACCCCGTAA
Protein sequences of DBSCAN-SWA_6 >CP034966|3011902:3020672|3012249_3014643_-|QAS90766.1|DBSCAN-SWA MSHADMNNCCGFNEAAAAFSWNSPKKAINPYLDPAEVAPVSTLSNLITLYAADNEQEQLRREALSDQVWERYFFNESRDPVQREMEQDKLISRAKLAHEQQRFNSDMVILADVNAQPSHISKPLMQRIEYFSSLGRPKAYSRYLRETIKPCLERLEHVRDSQLSASFRFMASHEGLDGLLILPEMSQDQVKRLSTLVAAHMSMCLDAACGDLYATDDVKPEEIRKTWEKVAAETLRLDVIPPAFEQLRRKRNRRKPVPYELIPGSLARMLCADWWYRKLWKMRCEWREEQLRAVCLVSKKASPYVSYEAVMHKREQRRKSLEFFRSHELVNEDGDTLDMEDVVNASSSNPAHRRNEMMACVKGLELIAEMRGDCAVFYTITCPSRFHSTLNNGRPNPTWTNATVRQSSDYLVGMFAAFRKAMHKAGLRWYGVRVAEPHHDGTVHWHLLCFMRKKDRRAITALLRKFAIREDREELGNNTGPRFKSELINPRKGTPTSYIAKYISKNIDGRGLAGEISKETGKSLRDNAEYVNAWASLHRVQQFRFFGIPGRQAYRELRLLAGQAARQQGDKKAGAPVLDNPRLDAILAAADAGCFATYIMKQGGVLVPRKYHLIRTAYEINEEPTAYGDHGIRIYGIWSPIAEGKICTHAVKWKMVRKAVDVQEAAADQGACAPWTRGNNCPPVENLNKSGGDLPDIKTMDEKELQEYLHNMGQKERRELTARLRLVKPKRKKAYIQSILEQQRLQLEAELTTRGFEGSASEIDLLLRGGSIPSGAGLRIFYRNQRLQEDDKWRQWY >CP034966|3011902:3020672|3015493_3015721_-|QAS90768.1|DBSCAN-SWA MADAMDLVQQRVEEERQRHIRAARAKSPGVSRVLCIECEAPIPPARRRAIPGVQLCITCQEIAELKGKHYNGGAV >CP034966|3011902:3020672|3016021_3016363_-|QAS90770.1|DBSCAN-SWA MAINGAAATVPLSPGERLNGLNHIAELRAKVFGLNIESELERFIKDMRDPRDINNEQNKRALAAIFFMAKIPAERHSISINELTTDEKRELIKAMNHFRAVVSLFPRRLTMPN >CP034966|3011902:3020672|3015720_3015954_-|QAS90769.1|DBSCAN-SWA MRNIETLTTKTGPDDAGLNILLTEARLEERRARAEAMAARLDSLACHISSRQLNHVEAAELLRVTAEAIQNEAQEIH >CP034966|3011902:3020672|3019619_3020672_+|QAS90775.1|integrase|DBSCAN-SWA MAVRKLTTGKWLCECYPAGRSGRRVRKQFATKGEALAFERYTMGEIEAKPWLGESVDRRTLKDMVELWFKLHGKSLTAGQHVYNKLLLMVDALGNPLATDLTSKMFAHYRDKRLTGEIYFSEKWKKGASPVTINLEQSYLSSVFSELSRLGEWSYPNPLENMRKFTIAEKEMAWLTHEQIVELLADCKRQDPILALVVKICLSTGARWREAVNLTRSQVTKYRITFVRTKGKKNRSIPISKELYEEIMALDGFNFFTDCYFQFLSVMEKTSIVLPRGQLTHVLRHTFAAHFMMSGGNILALQKILGHHDIKMTMRYAHLAPDHLETALRFNPLATLPSGDKVAAAVGITP >CP034966|3011902:3020672|3016784_3017294_-|QAS90772.1|DBSCAN-SWA MFDYQVSKHPHFDEACRAFALRHNLVQLAERAGMNVQILRNKLNPAQPHLLTAPEIWLLTDLTEDSTLVDGFLAQIHCLPCVPINEVAKEKLPHYVMSATAEIGRVAAGAVSGDVKTSAGRRDAISSINSVTRLMALAAVSLQARLQANPAMASAVDTMTGLGASFGLL >CP034966|3011902:3020672|3018583_3019528_+|QAS90774.1|DBSCAN-SWA MTEPTNKDSEIKKHLLEFLDSQSENIAKHFYSHIKDLIEAGELSEAHNNLALIEKYITRPPMDEEPNINENKANKRKNVKSLEPNNYVEHIIQLEERNSILTLQLEHYTQDLNRKNAIIENNVKQINSLISENKELRSQVQQQRIDDKIPTYVNDVKSDLGSDDKHFILMSIIWSIAGVFFGFLAVVSAFFTLYMNLDLKNLTNLQLIYIFTRGLVGIAILSWLSYICLSNSKKYTHESIRRKDRRHALMFGQVFLQIYGSTATKEDAIEVFKDWNISGDSAFSGQTEQPPSFASFLNTIKDKVKVTGSDKETD >CP034966|3011902:3020672|3011902_3012091_-|QAS90765.1|DBSCAN-SWA MQDYLLESLKLQRIDFFIKLVAASECSDEEKRLAIQWVSELTDELMAKIRSHEYSRSTEVSS >CP034966|3011902:3020672|3016480_3016777_-|QAS90771.1|DBSCAN-SWA MLTKEPSFASLLVKQSPAMHYGHGWITGEDGKRWHPCHSQDELLSELTTRKRRKSKCMRQKVKWFISFVTEGRVIQYLKMICSVAIRHCRAMAVTFSR >CP034966|3011902:3020672|3017693_3018572_+|QAS92332.1|DBSCAN-SWA MQNRGGQSVIDRILKAYGFSSRQAFCNHLGISQSTMANRYARDTFPADWVVICSMETGVPVEWLAFGTDTEKGSITNNAEKSHNNCDSKHQHLNREQDIQNENSFTINQGGKAAIERIVLAYGFKTRQALADHIGVSKSTLANRYMRDTFPADWIIQCSLETGASLTWLTTGNGAMFEKPRNDTITIPYHKIIDGSLAQETFLTFDSKLLEGTFLQPLAVFIDEEIYIVESKFNEVTDGKWLVNIEGKISIKDLTRIPVGMVKVVGTNASFECLLTDIIVLAKCKRVFTKNV >CP034966|3011902:3020672|3014639_3015497_-|QAS90767.1|DBSCAN-SWA MSTILKWAGNKTAIMPELKKYLPAGPRLVEPFAGSCAVMMETDYPSYLVADINPDLINLYKKVAADCESFISRARVLFEIANREVAYYNIRQEFNCSTEITDFMKAVYFLYLNRHGYRGLCRYNKSGHFNIPYGNYKNPYFPEKELRTFAEKAQRATFICASFDETLAMLKAGDVVYCDPPYDGTFSGYHTDGFTEDDQYHLASVLEHRSSEGHPVIVSNSDTSLIRSLYRNFTHHYIKVKRSIGVAAGEGKSATEIIAVSGPRCWMGFDYSRGVDSSAVYGVRA >CP034966|3011902:3020672|3017326_3017548_-|QAS90773.1|DBSCAN-SWA MTPNISITLNTPHVTIERYSELTGLSIDTINDMLADGRIPRHRLRKDKKREKVMINLAALTVDALTDCNVVFN |
12 | Salmonella_phage(90.0%) | integrase | attL 3011572:3011585|attR 3020714:3020727 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
3100257 : 3127461
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >CP034966|3100257:3127461|DBSCAN-SWA TATGACAACGGACGATCTTGCCTTTGACCAACGCCATATCTGGCACCCATACACATCCATGACCTCCCCTCTGCCGGTTTATCCGGTGGTGAGCGCCGAAGGTTGCGAGCTGATTTTGTCTGACGGCAAACGCCTGGTTGACGGTATGTCGTCCTGGTGGGCGGCGATCCACGGCTACAATCACCCGCAGCTTAATGCGGCGATGAAGTCGCAAATTGATGCCATGTCGCATGTGATGTTTGGCGGTATCACCCATGCGCCAGCCATTGAGCTGTGCCGCAAACTGGTGGCGATGACGCCGCAACCGCTGGAGTGCGTTTTTCTCGCGGACTCCGGTTCCGTAGCGGTGGAAGTGGCGATGAAAATGGCGTTGCAGTACTGGCAAGCCAAAGGCGAAGCGCGCCAGCGTTTTCTGACCTTCCGCAATGGTTATCATGGCGATACCTTTGGCGCGATGTCGGTGTGCGATCCGGATAACTCAATGCACAGTCTGTGGAAAGGCTACCTGCCAGAAAACCTGTTTGCTCCCGCCCCGCAAAGCCGCATGGATGGCGAATGGGATGAGCGCGATATGGTGGGCTTTGCCCGCCTGATGGCGGCGCATCGTCATGAAATCGCGGCGGTGATCATTGAGCCGATTGTCCAGGGCGCAGGCGGGATGCGCATGTACCATCCGGAATGGTTAAAACGAATCCGCAAAATGTGCGATCGCGAAGGTATCTTGCTGATTGCCGACGAGATCGCCACCGGATTTGGTCGTACCGGCAAACTGTTTGCCTGTGAATATGCAGAAATCGCGCCGGACATTTTGTGCCTCGGTAAAGCCTTAACCGGCGGCACAATGACCCTTTCCGCCACACTTACCACGCGCGAGGTTGCAGAAACCATCAGTAACGGCGAAGCCGGCTGCTTTATGCATGGGCCAACTTTTATGGGCAATCCGCTGGCCTGCGCGGCAGCAAACGCCAGCCTGGCGATTATCGAATCCGGCGAATGGCAGCAGCAGGTGGCGGCTATTGAAGTGCAGCTGCGCGAGCAACTGGCACCAGCCCGTGATGCCGAAATGGTTGCCGATGTGCGCGTACTGGGGGCAATCGGTGTGGTCGAAACCACTCGTCCGGTGAATATGGCGGCGCTGCAAAAATTCTTTGTCGAACAGGGTGTCTGGATCCGGCCTTTTGGCAAACTGATTTACCTGATGCCGCCCTATATTATTCTCCCGCAACAGTTGCAGCGTCTGACCGCAGCGGTTAACCGCGCGGTACAGGATGAAACATTTTTTTGCCAATAACGGGAAGTCCGCGTGAGGGTTTCTGGCTACACTTTCTGCAAACAAGAAAGGAGGGTTCATGAAACTCATCAGTAACGATCTGCGCGATGGCGATAAATTGCCGCATCGTCATGTCTTTAACGGCATGGGTTACGATGGCGATAATATTTCACCGCATCTGGCGTGGGATGATGTTCCTGCGGGAACGAAAAGTTTTGTTGTCACCTGCTACGACCCGGATGCGCCAACCGGCTCCGGCTGGTGGCACTGGGTAGTTGTTAACTTACCCGCTGATACCCGCGTATTACCGCAAGGGTTTGGCTCTGGTCTGGTAGCAATGCCAGACGGCGTTTTGCAGACGCGTACCGACTTTGGTAAAACCGGGTACGATGGCGCAGCACCGCCGAAAGGCGAAACTCATCGCTACATTTTTACCGTTCACGCGCTGGATATAGAACGTATTGATGTCGATGAAGGTGCCAGCGGCGCGATGGTCGGGTTTAACGTTCATTTCCACTCTCTGGCAAGCGCCTCGATTACTGCGATGTTTAGTTAATCACTCTGCCAGATGGCGCAATGCCATCTGGTATCACTTAAAGGTATTAAAAACAACTTTTTGTCTTTTTACCTTCCCGTTTCGCTCAAGTTAGTATAAAAAAGCTGAATGCGAAACATTAAAAAACATTAATATCAATGTGTTACAATATCATTGGTCTAAAAAATAGACTACATGATGCTACAAAACACAACATATCCAGTCACTATGAATCAACTACTTAGATAGTATTAGTGACCTGAGACAGAGCATTAGCGCAAGGTGATTTTTGTCTTCTTGCGCTAATTTTTTGTTATCAAACATGTCGCACTCCAGAGAAGCACAAAGCCTTGCAATCCAGTGCAAAGATTTGTGTGCCTCAGTTTTGTCTAAGTGTTCTACTGAAAACATAGTAAAATCGGCAACAGCTGGAAATCATTCAATACTCGCACTATCGAAAGTTCACCAGCCAACCGCAGCACGTTCTTGCATACGACGTGCTGCGGTTTTCTTTATGATTTATGCACAATGGACAATTTGAAATTATTGATGATTGTATGGTGCATCGTTTTCTGAACCTACACTGATTTTTTGGTATAGCCTTGCCTAGTCAGTCTTACCGGATCAAACTCTTCGCTATTGCAATACTAACCAAAATCATCAATTTGACAGCGATTAACCAGAATAATAGTATACTATCACCAGTAAGAAATTATCGTTATTTGTAGCGATACATATTATATATATATCATTCTCAGGTGCGTACATGATTATCAACCAGGTACCTATAAAAATAAAAATCTTTATCTTTTTATTTTCATGCATCTCTATTATATTTTTGTTACTGCATGCAAATAATGGAATATACATAACACAAACAACACAAATAAGTTATAGTGTTTTCATTATTGGGCTTTTTTTCATAAACCTGATGATTTTTATTTTTCTATTGCTTTACTATGTTTCTAATCAGAGACAAAGTTATCTCTTAATTCTTTCATTCGCGTTTTTGAGCAACACGTATTATTTATTAGAAGTGGCTATTATTTCTTTATCTCCGTTAGGTAACGATTTATCTACAATCTATCAGAAATCAAATGATATCGCAATATATTATCTATTCCGTCAGTTCAGCTTTATATCTATAATCTTTCTGGCTGTTTATTCCACCAATGTTAAAAATAAAAGTGTTTTAGAAGATAAAAGAAACATAATAATTGTTGTTTTGTCAATATTAATTCTTTTTATTACTCCGTTTGTAGCAAAAAATCTAAGCAGTGACAATATAAAATATAGTCTTAATATTATACAATACTCGCTGAATCGTCATTTGCCGACGTGGAATATCGTGTACACCAAAATAATATCAGTATTTTGGCTTGTATTACTTATCAGCTCATGCATCAGCATACGTAATTACTCAAAAATATGGTTGTGTATAATACTTATTAGTATAGTGTCAGTATGCAATAATCTAATTTTATTGTATTTTATTGATAAATCCCATCCTGCATGGTATATGACAAAATTTCTTGAATTGATATCAATGATTTATATCATTTCAACACTCATGTATTATGTTTTCAGGAAATTAAATCATGCTAATCATATGGCAATTCATGATCCACTAACGAATACATACAATAGAAGATACTTTATTGACTCATTGAAGAATATATCAAAACACCATGATTTCTCAGTAATAATGTTAGATATTGACAGTTTCAAAAGCATCAATGACAAATGGGGGCATCATATGGGTGATCAAGTCATAGTAATGGTTACCAGAATAATAAAAAAATCCATCAGGAAAGAGGATGTATTAGGGCGCTTAGGCGGTGAGGAGTTCGGTATTATCATTAAAGGTAATACTCAAAAGCTCTTGCTATCAATTGCAGAGCGAATCAGAAAAAACATTGAAGAGCAATGCTCGGAAAAATTATTATCGCATGGACCTGAGAAAATAACTGTCAGTATTGGTTGCTTTACTTCAAAAGAGAATAATCTCAGCCCATCTGAAATGTTAGTCAATGCCGATAAAGCGTTATATCAAGCCAAAAGAACCGGAAAAAACAAGGTGATAATTCACTCAAAATAAACACCTTTTTAAAATACAGCCCCAATAAACTGCAGAATATTATCCCATATAATATCCTGCAGTTCGTAATGCACTATTCGATAATGGGTACTGTTGGCCATTCAATATCCGGTGCAGTTGTTGTATTAACACGGTTCAGCAACACCCGATACTTCTTCCAGGCTTCCAGCAACGAGTTTTCTTCCTCCGTTGCGATCTCCAGATCTACAGCATCCTGAAGTGGCGCAATATGCTCACTGAATTCCTGGATGTAGAACTATGTGGTGACGGTCTTCCAGCCATTCGGCTCCTGCTGTATCGAAGCATACCAGGCTATTTCAATATCGCTATGCTGCGGCAGCATTTAACCCCTTGTAATTCATCACCATAATTGATTTAATTCACAAACAAAACTATAACATGGTGAAATTAATGAAAAAAAACACAGATGATGGGGCTAAAATTTACACACCACTTACCCTAAAGCTTTATGACTGGTGGGTTTTGGGAGTATCAAATCGGCTTGCATGGGGATGTCCTACAAAGGAACACCTTCTTCCACACTTTCTGGAACATTTAGGTAACAACCATCTGGATATTGGTGTTGGAACTGGGTTTTACCTTACTCACGTACCTGAGAGTAGTCTGATATCTTTAATGGATTTGAACGAAGCTAGCCTGAACGCGGCATCTACAAGGGCTGGGGAATCAAAAATTAAACATAAAATTAGCCATGATGTTTTTGAACCTTATCCCGCGGCGTTACATGGTCAATTTGATTCCATTTCCATGTTTTACCTTCTTCACTGCCTGCCTGGAAATATATCTACAAAAAGCTGTGTAATACGCAATGCGGCGCAGGCCTTAACTGACGATGGAACTCTATACGGAGCCACAATTCTTGGCGATGGAGTTGTGCACAATAGCTTCGGTCAAAAACTGATGCGCATTTACAATCAGAAAGGCATCTTTTCAAACACAAAAGATTCCGAAGAAGGCTTAACACATATACTCTCAGAGCATTTCGAGAATGTTAAAACCAAGGTTCAAGGTACTGTAGTAATGTTTTCCGCTTCAGGGAAAAAATAGCATCCAACCGCAGCTCGCTCTTGCTTAAGACGTGCTGCGGCATAATCCCAATGATTACTCCCTGACAGGGTTCGTAGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTCTTCCAGACTTCCAGCAACAAGGTTTCTTCCTCCGTTGCGATTTCCAGCTCAACAACAGTCTGAACGTACCAGGAACAGCCTCCTTCAGGGCTTGAAGGATATCAATGTTCGCTTCCTGTTAACTGCCCGACAAGTGCAACCAGTTCGCTTACCTGATTTTCCAGAGTGGTGATCCGGGTGCCTGCTGTTGCCAGATTTTCACGTAACGTTGTATTTTCCTCTTCCAGCGCGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAACCCCGTCACGGCGGAGTAGTCAACATTAAGATAGCGAGTTTCTTCGCGTAGCTCGTTGCCGTCAACGGTCGGACCTTGCAACTCTTCACCATAATGAGTAAACGATCCCACAGCTTCAGGTATCGCCTCCATTACTTCCTGTGCAATAACGCCAGCATAAGGCATTCCGTTTTCCTTGAGCGTGTAGGTGTACCCGTTCATTTTACGGATTGCTTTTGTCGCGTCGCTGATAACGAGAATATCGTCTTTAAGGTCGCGGTCTGATGACTGATTCAGCGTTGTGCAATTAATAGCGCCATTTACATCAAACAACTGGCCTGCTGACGTTTTTTGCGCATAAAACAGATACGCAGCAGACGTTCCAACCTCAAAAACGTTTTGTCGAGTACTGGAACCCCACACCCTGACAGAAAACGGTAGTTCTGCATTACCTGAGTTCTGTAAAACAAAACGATTGCCAGTCCCTGATTGTTTTGTAAGGGTTAAATCAACTGTTGAGTTAACCTCATCCTTGTTGATAGTGAGCGCCTGCGCTTTATCAGTTCATCCAGCGCGGCTGCTTTGTTCATGGCTTTGATGATATCCCGTTTCAGGAAATCAACATGTCGGTTTTCCAGTTCCGGAAAACGCCGCTGCACCGACAGGGGGATCCCGTCGAGAATACTGGCAATTTCACCTGCGATCCGCGACAGCACGAAAGTACAGAATGCGGTTTCCACCACTTCAGCGGAGTCTCTGGCATTTTTCAGCTCCTGTGCGTCGGCCTGCGCACGCGTAAGTCGATGGCGTTCGTACTCAATAGTCCCTGGCTGGAGATCTGTCTCGCTGGCCTGCCGCAGTTCTTCAACTTCCCGGCGCAGCTTTTCGTTCTCAATTTCAGCATCCCTTTCGGCATACCATTTTATGACGGCGGCAGAATCATAAAGCACCTCATTACCCTTCCCACCACCCCGCAGAACGGGCATTCCCTGCTCCTGCCAGTTCTGAATGGTACGGATACTCGCGCCGAAAATGTCAGCCAGCTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAAAGCCCGTTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGGGGGTGTTTTAAATAAAAACATTAAGTTACGACGAAGAAGAACGGAAATGCCTTAAACCGGAAAATTTTCATAAATAGCGAAAACCCGAGAGGTCGCCGCCCCGTAACCTGTCGGATCGCCGGAAAGGACCCGCAAAATGATAATAATTATCATCTGCATGTCACAACGTGCATCTACGCCATCAAACCACGTCAAATAATCAATTATGACGCAGGTATCATATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACAATACAAATCAGCGACACTAAATACGGGACAACCTCATGTCAACGAAGAACAGAACCCGCAGAACAACAATCCGCAACATCCGCTTTCCTAACCAAATGATTGAACAAATTAACATCGCTCTTGATCAAAAAGGGTCCGGGAATTTCTCAGCCTGGGTCATTGAAGCCTGCCGCCGGAGACTGTGCTCAGAAAAAAGAGTTTCTCCTGAAGCAAACAAAGAAAAGAGTGACATTACTGAATTGCTCAGAAAACAGATCAGACCAGATTGAAGCAATTTAGATAATCGTGCAGACTACGCCCCTCATATCACATGGAAGGTACTACAATGGCTCAGGTTGCCATTTTTAAACAAATATTCGATAAAGTGCGAAATAATTTAAACTATCACTGGTTTTATTCTGAACTAAAACGTCACAATGTCTCACATTACATTTACTATTTAGCTACAGAGAATATTCATCTTGTTCTTGAAAACGATAATACGGTTTTAATAAAAGGACAGGGTAAGGTTGTAAATGTAAGATTTTCAAAAAATAAATGCCTTATAGAAGCCACCTTAAAAGGATTCAAATCAGGAGAGTTATCATTTTACGAATACAGGAAAAATCTTGCTACAGCAGGGGTTTTCAGATGGATTACAAATATCCACGAAAACAAAAGGTATTACTATACCTTTGATAATTCATTACTCTTTACTGAGAACATTCAGAACACTACACAAATATTTCCGCACTAAATCATAACGTCCGGTTTCTTCCGTGCCAGAACCGGACTCGCTGGCATGATGAAATATGTGTACCCGGTAACCCCGGTGTGCATCGTTTTTGATTATTCCCGCACACTCGCGCAGAAGGAGTTCCCCGTCGGGCTACGGTCTCTGTTAATACGGGAATACGGCGACGATACAGCGCATGATGTGTCAGGCTTGAATACCTTTATCCTTTAAAAGGGATATCAGTTAAGTTATCCCGTGTAGGGTATAAACCATTATCAAAGCCACTCTGTAGGAAGTGGCTTTTGTAATGGCAATAAAAAGCCCCGCGAATGCGAGGCTAAATCCTGGTATTTGTAATGACTGGTTCTTATCTCAACGCAGCCCCTTACCGCGCGCAAAATGCTCAATATCAAGCATCAGCAATGAGATGTTTAATCTGGATTCACTCCAGAAGTGAGCACCACCCTGTCTACAGAGCCAGATGTGAAGGATGATGAGTAAAATTATCGCTATCATCGAAGGCATTGCGTCCTGATGTATTCCTGAAGCGTTCTCAGTGCTGTTTGGTCGCGGATAATTCCGTCCCGGATACCGAGAACGTTTCGTCCAGCAACTGGAGAGAGTTCGACGGTGGCATCATTGCCCATGCCGGAGGCGCTGGAGGTTTCGGCTGAGGATGGCACAGGGCATTTTCCTTTGACGAGCACCCGACCACCATTATCAAGCTTGCGCCGAAGAGCATCATTTTCAGCTTTCGCATCAGCTAACTCCTTCGTGTATTTATCATCGAGTGCATCAGCATCACGCTGGCGCTGCTGCATGTCAGTAATGGTGGCGTTCGCCTTCTCCAGTTCACTGGCCTTGTTATCGCGCTGTTCTTTGTAGGCGATTGCATTATCACGGTAATGATTAACAGCCCATGACAGGCAGACGATGATGCAGATAACCAGAGCGGAGATAATCGCGGTTACCCTGCTCATTGTTGCCCCCACAAACAGACCTCACGCTCAATCTCACGACGAGTCATCAGGCCTTTCCATTGCTTACCGCCAGCGTATGCCCAGCGACGTAGCTGGTCACATGCGCCCTTGATATCGCCCTGGTTTATTTTGCGAAGAAGAGCGGATGTTCTGAAATTGCCTGCGCCCACGTTATAGACGAACGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCTACCGTGGCAAGGTCTTTATTCAGGAGGGCTTTGCATTCTGCTTCGGTATACGTTTTACCGGGAATGATGTCTTTTCCTGTATGCCCGTGACATACAGTCCATACGCCAACGATATCTTTGTATGGTATGTAGCTGACACCTTCCAGACCATCGTTACCACTTGGGCCAGTGATTAACACTGATGCTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACGGCTTTTCGTAATGATGGAGGCATTATTCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAAGCTACCCAGCACACCTATTGCTGCCCACTGTGAGGGCGTGACTTTATCGAGCAGCTGTAAAAACCAGTAGCCAGCACTGCCTGCGGAGGTGCCGTAGGCAATGCCCGTTGTTAACTTATCCATGGATTTCATAGCCTCACCTCCGCAAATAACGGATGGTGTACACGGTTCGGAACGAAGAGGAAAGGTATAGAAGTTACATTAGCGTAAGGCTTGAACATCTATTCAAAAAGAAAAACGCCGCGATTATTCTGGCGTAGCTGAAAGCATCATACAATTATCAAATACGAAAATTACAAAATCATTAAAACGCATCACGTTACATCATGTCTTTTTCTAAAAAAAATCTTGATGAATATTGATGGGGAGGAACACCAAAATACCTTCTGAAAACACTTACAAAATATGACGTGTTTTCATAACCGCATATCTCAGCAACTTTTCCAACAGAATATAAATTGTAGCTTAATAACCTTTCCGCCATCACCATTCGCTCTTCAAGAATTAACTTACTAAATGATAAGCCTTCGTGCTTTAATTTTCTTTTTAACAGACTTTCACTCAGATACAGTCTTGAAGATATATCACAAAGTCTCCATGCTGCAGATATATCCGTGTGAATAATAGCCTTAACTTTACTTCCTAAACTATTAAGACATCCAAATAAAAAACTTTGCACTATTTTCTCTGAAGATAAGATAGCAAGACATGCAAGTGATATTTGATTTCTAACAAAATCCACAGTTCTGCCATCACAATTCAAGCATGCAATCAAGTTCTTTAACAATGAAAAATCTTCACATTCCACCATCAAGTATGCCGGATAAAACCTTCTTACAGAAAAAGGTGAGAGTGTGTTGCTTTTAAAGAAATCATTAACTGTTTTTTCTTCAACATCTACGATCATTACATGATCTATATTTGATGAAAAAAATCTTTTAAATTGTAATCAATGAGAACAGCACTTCCTTTTTTAAACAAAATATCTTCTTTACCAATTCGGACATCAAACGAATTCAACACCAAAATGATAGAACATATGTATGGCATATTATCCACCTGATATCATTGGGGTTACACCAGGTAAGTGTAGGTGGAAAATCAATATTCGCCAGTTCAACAATAAGGAAAATTTCATTACATCACAAGTATAAAATTATGTATTTAACTCACAAAGACAAATTATTAAACCAATCTGTTATATTATATATAGCTGCGTGGAATCATAATATCATATATTTTGACTGGCATGTTTACCAACTTTAAGTTGCATCTCAATTGTTTCTTCAGCGTAAACAGAGTTTTTATACAAACTGACACTCTGGGTATCATAGTGTAGTTTTTACGATTGTAAATATCCTGCATGCAGGAACTCATCCTTTTGGATGATATCGCATACAATTAATTTACCATCAGTCTTAGAGCCAGTTCGTCCGGATAGGGATCGAAGTAATTTTGTGTAAGCAAGTAATCATTAGGATACTCACCCAGATAATGCTTCAGCAGAGTCAACGGCGCAAGAAGAGGTAATGTGCCAGAACGATAGTTAAGTATAACCTCGCTCAACTCTTTACGCTGGCGTGTACTTAAGTAGTTACTAAAATACCCCTGTATATGCATCAGCACATTCGTGTGATTTTTACGTGATGCAGGTTTTCTGAGAATCGCCATCAGCTTATCACGATACACCTCAAAGTATGATTCAAGGTCCGCCCACTCGTGTATTGCAGCCACAAATGGTCCCATATCTTTATAGCCTGCCTGACTATGCGCCAACAACTGAAGCTTATAACGACTATGAAAAGCTAATAACTCTCTTCTTGATAATTTCTCCTTGTAAAGGTGATTGAGCTCATGCAAAGCAAAAACTCTTTCAACAAAATTCTCACGAAGCACTGGATCATGTAATCGCCCATCCTCTTCAACCGGTAGCCAGGAAAACTTTTCCATCAAAGTGCTCGTAAATAGTCCCACTCCATCTTTACGACCTCGATTACCATTTTCATCATAGACACGCACGCGCTCCATGCCACAGCTGGGAGATTTAGCACAAACCACAAACCCCGATACATCCTTTAATTTGTCCATATAAGAACGACTAAACTCTGTCATTCTCTCTGTCACATCCTCATTCTGGTCGTGGCTGAAACACATCCGTATATTTCCTTGCATCGAGCGCACAAGACGTAGAGCAGGACGCGGAACTGGCAGCCCTATAGCCATTTCCGGACATACTGGTCTGAATGTTACCCATTCCACTAATTTGTCCATTAAAAAGTCAGCTCTTTTGTGACCACCATCAAAACGAACAGCAGAACCGGCCAAACAACCGCTGATTCCAATCACAGGTTTTTTTATCATATTCTCCCCCTTGACTAATTCATTAACACATAAACTGTGTAGTGCACGGAATAAATTGCCTTTCTGGCGTCATCACTGACAATTTTTCTGTTATGGACTATTCCTAATATAGTATGAAAGTTCTTTAAGTGATCGGTCGTAATCATCTATCTTTCATACTTACTCTCAACTATCAAAAGTACAGGATTTATTATGAAGTTATGGCCTGTGTTGACTGGCATTGCACTCTCTTTCACTCTTATAGCATGTAAGGCCCCGACACCACCTAAAGGTGTGCAGCCGATTACAAATTTTGACGCCAACCGCTACCTCGGAAAATGGTATGAAATAGCTCGCCTCGAGAACCGGTTCGAACGTGGTCTGGAACAGGTCAGCGCTACTTATGGAAAACGGAACGACGGAGGGATTCGTGTACTTAACCGTGGATACGATCCAACGAAAAATAAATGGAGCGAGAGCGAAGGTAAAGCATACTTTACTGGAGATACTAAAACTGCAGCATTGAAGGTTTCGTTTTTTGGCCCCTTCTATGGTGGCTATAATGTAATCAAACTGGATGATGAGTATAAGTATGCTCTTGTCAGTGGTCCGAACAGAGAATACCTATGGATTCTGGCAAGGACCCCAACTATTCCAGATAAAGTAAAAGCAGACTATGTGCGAACCGCTCAAAAGTTGGGATTCAATGTCAATGAATTATTATGGGTTAAACAATAAAATCCCTACCCGAAATAATACTTATTAGAAAAAAACCAGCCTTTGGGGAGGCTGGCTAAATCAGGAAACAAGCTGTTATATGATAATAACTACGTTGTGATTCCAACATTTAAAATGTTAGACTAATGACAATCAGACAGCAACTTTTCCTTTAATTATTTCGAACAATCAGCATCCATCTCCAATCGGAGATCCAACACCATCAGCATACCCTCCACTACGCCCTCAGCTTTCTGAAGCATCCTGCCAACCCAACAATCAGATCGCCCATGCTTACGTGCAAGCGCCATAAAAGTCATGCCGCCGACATAATAGTCCACCAATAAATCATGCAAATCGCTGTTGTTCTTTTTCAGACGGGCCATGCACCCGCAAATGATCATCGCATCATCGTCACAACATTGCGGGCGAGATTTTACTTTTGAAGGAATTAATCCCTTAAAACCGGCGGCAATGGACGACCAAGTCACATCTTCATGATTATTAGCCGCCCACGCTCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTACTCCACAAAAATCAGACCAGAACGCCAATCACAAGCAAAAATCAACAAAACAGTATTAGTTGATTGTTATCTCTGACTTCATACTCCTGCTCCTGTCAGGGTTTTGGCGTAATTCTTCAGTATTCGGTAATCGGTCAAAACAGAACCGGGGAAACGATATAAGCGCAGATGCCCCCAGCGGTGGCGAAGAAGTTCTGCCATATAAAACTCAAACATCATTCATTCCCCATTTCGGTGATGGTCAGTTCCAGCCTCCCACCTTTGGTAACAGGCATCTTCACAACGCGGTAATCAACGACCTGAGCATCATCCAGCCAGAAACCTGCTTTAGTGAGTGCGTCAAAAGCGGCTTTTTGCAGATTATCCAGGTCACGGCGACGGCGATCCGGCATGTGGCACTCAATGCGGATTTTCACAGGCATAGCCAGGCCGATATCCAGCATTGCGTTTTTAATGATTCGGGCGACGTTATCGCGGTATGCCTGCCCCTCTGCGCTGACGTGCGTGCGCCCGCGATTATGGCGGTAATAGCGATTATTGCTCGGAGGCCAGGGTAATGTGATGTTGTAGGTATTCACGCCTTGATTACCCCCTCTTTCATCCAGATAACCTGCGTTCTCGCCATACCTTCCAGCGCGCATTCTTTTGCATATGCGGCATCGACAAAATGTGTGCGGCGGTCGATTTCGTCGTGGCAGGCAGAACATGCAATGGTGGCAATCAGGTCTGGCGGTTTGGTACCGGTGCCGCACAATCCAGTCAGCCGGATATGTGCCAGTACAGACGTTTCAGGGTTGCCATTACATACGCCAGGGGTTCTTACCTGGCATTCCCGACCACGCGCTGCTTTTCTCAAATCAGCCATGATTCCTCCTTGCTGCCAGTCGCAACCATTTTTTATCAACCAGGCTGGCGGTATATCCGAGCAGTGTTGGTATTTCGGATGGCTTCAGCTCAGGTTTACGCTTACGACGATTTGGTACTCTGTAGATGTGTCCGTTCATGACACGAATAAGCGGTGTAGCCATTACGCCTCCTGCTTGTCGCGCAGCAGCTGGAACTCGCAGCTCTGCGGAATAGTCAGGTGGCAGCCAATATTCATCGCCCAGGCTTCAACCTTACACAGGAAGACATACATCTCTCCGGCATCAAGATCGGAGGTATGGCGTAACGACTGGATAGTGGTGATTTCACCGGTTACGACATCAACCAGTTCTTTGGTTTCATAACCGAGGTATGTGTGTTTGAGAGCTTCTTTTACCCATGCTGCGGTAGCGAACGATTTCCCCCTGCTGATAAGGTATTCACTGATTTCGCTGTACCACATGTGGCTGAGTGCATTCTGGGAAAGACTGCGTTTCTCACGCCACGGTTTAAGCACCATGCGAAAGCATTTTCCGTCCTCCAGATAAGGCTGGATCTGCTGGCCGATAGCGGTGAAGTTGCCGCGATGTAATTTGATGCCATCTTGTGGTAGGTTCACGCTTCACCTCCGCAGAGGTCAAACGCAGGATGCAAAAAATCGCAGGTGCATTTCTGCATCTGTGAAGGGAGAAGAGAGTTTGGATTGTATGTGCGCATAAACGTCCCCGTTTAGCGCAGAAGTCACCGGAGTTGTTCAAGCTCCGATGACTTTATTATTACGAATTGATTTTACAAAATCAAAAGGTATGTTAGTGACGCGGGTCTGTTATTATGCGAGAAGGGTTTCCGTATAAAACAAGGACCTTACTTCCTTGAGTAAATAACGGATCTTTGCCTTGAACAATGGTCATTAAATTCCCATTCTCAGTTTCGACAACATATTCCATGCCTGTTTGTTTTGTTGCTGAAGATTCGATTGCTGCCCCGGCAATACCACCAATGACTGCACCACCAACGGCACCAACGATATTAGAACGAACTCCCCCACCAAGCGCAGAACCAGCGGTTGCCCCCACGGCAGCCCCAGCAGTCCCGCCTAACGCGGAAGTCCCACTGATATCAACCCCCCTGGCACTAATAACTGTACCAGCGATAGTTCGATTAACCATGCCCACAGAGCCAACAGAATAACTATTTGGCGATATATTTTGTGCGCATCCAACCAACACTAAGAGTGGAGCAATTACGAATAATCGCTTCATTTAGCTACCCTAACAGGAAACATTGGACGAGAAAGATCAACACTTTCTAATGCTTGCAAGAACTGCGTTATGTTGTTTTGCACCGCGCGATTAACAGATTCGCGTGCTCGAACAATACCGTAGAATGCGTAACTGGCTGGAACAGTACCGGTAGACTCAATATCCTGCGTATATATAATATCACCATTCGCACGGTTGATTATTTCATACCTTGCAATTGCTTTAGTTGTCATTGAAACACCAAAAGCAGGAACGTCAAGAGCCAACACTTTAACATTTAAGCTAACCGTATTTGGTGAACTATCACGAAAAATAGTCATTCGGTCGAGTGCTTCCTGCAAAGATTCACGCCAAATTGGAGTTATAGCCTCCATACCAGCAGTGATATCCCCTTTCTGCTCATCTGGACGAGCAAGTGATACCGTTAATGACTTAATTTCAGCATCTATTTTTTTCTGGCTAACTCCCACGTTAGGTGTTGAAAAATTCAATGGTGGCACACTAGCGCAACCTGTTAAAGAACCAATAATCATGGCTAATAATATTATCTTCTTCATAAATTTACCTTATTGTTATAACCAAAGGAATTATAAAGTAAAAAAGTTCACTATCACTAGCCATTAACGACATCAATTTCAGAGAAACATGGTACTCATTTCCACAAATTTGACACAAGTCATTTTCATCTACATATTCCATCATACTTGATGCATATGTTATTGAAGCCTCTATCCTATCCGTTCATAATAGCAATAGTTACCCGGGTGATAGTACCTCTATGATTACTCGTCTTTCTGATTGATTGGATTAAATATGCGCGCCAAAATTTATCAACTTTCGTTATGGATATTTATTTCGTTTCTAGCGATCTATGCCTTTATTATCTATAAAGGTTCTTATATTGGAGTAGCATTGCATCAAATTGCTTGGATCATCATTATTGCCTCTGGCTTGATTGCTAGGCTAACTAAACCAAAGCAAAAACCAATTTCGTCCAATAATTAGACATGTATTAAAAAATGATATTTTTATGTACATAGTCTATTGAAAATTGCCGCGATAAAATGCCAACACCCGCTTCATCGCGGCACTCTGGCGACACTCCTTGAAAATCAGATTCGTGCTCACCTTTCCTTCCCGTTCTTCCCTGGTAGCGAACCGGTAATACACCGTTCGCCAGACCTTACCATCAATAACTAAGATTCCTGTCCGCGCCATTTTAGCCGCAGCCTGGTTTATGCTGGTTACTGTTGCGCCTGTTACCGCAGCAACGTCCTGCGCACAGAAGCTCTTATGCGTCCCCAGGTAATGAATAATTGCTTCTTTTCCCGTCATACACTGGCTCCTTTCAGTCCGAACTTAGCTTTGATTTCTGCAATCTTCGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGCCCATGACAGGAAGTTGTTTTACTGGTTCAGGGATCGCCTCACCACGGTTAATTCTCGCAGTCATATGGACAAGCTCATCTGCGGCCTTACGGCGTAATTCCGCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGTAACCAGCCAGTAGTGCGCGTTTGATTTCCACGGATAAGACTCCGCATCCGGATACAGGCCTCGCTTCCGGCAATACTCGTAAACCATATCAACCAGCTCGCTGACGTTTGGCAGTCCGGCGATAACGGATGCTTCTTCCCGGCACCATGCAACAAACTGCCCGGGTGATGGCAGGAATGGTCGATTCTGCCGACGGGCTACGCGCATTCCAGCGTTAACCTGTTCCATTGTGGTGATCCCGTTTTCCCGAAAAGCCAGCACCCACTGGCGGCGGATTTCGTTCAGTTCGTTCTGGTCCCGGTTAGCCAGGCTCGCTGGGAAAGTTGCCAGTAACTGGCTGAATACACCGTTGATTATCTGCGCTACCTGCTGTACCTGCGGCTTTTCGTCGTACTGTTCCGGCATGTTATTGGCGATCCGGCACATCTGCTCACGGTCAAAGTTAACCATCTGTGCGGCGATGTTTTTCATAAATCCACCCCGTAAATCCAGTCAGTGTTCGTCAGGTCGAGTTTTGGTTTGCCGGCTGTCACGCCAGCCTGTTGCTTGTTTCGGTTGATTTCGAGCTGGGTCCACTTGTCGCGGAGTTTGGCCGGACTCAGCACGTTACCGGACCAGAAGTTGTCCTGGCAGGCCCAGCGGAACAGTACACACATGTCGCGGTGGTTACGTCCATCACGTTCACGCATCAGACGGATATCGTTAGCCCACCCTGCAAAATTCGGTTTTCTGGCTGATGGCGCGATGGTCTTCACCATGTCAAACATCCACTCTGCGGCGGTCAGGTCTTCTGCTGTCCCCCACTTGCTGCCGTTCTGAATCGCAGCATCCGCTTTCACCACAGGAAGGTCGTTTTCTGGCAGGTCAGAGGATTCGCCAGAATTCTCGGACGAATAAGGTTTTATATTGTCTTTTGTTAGTTTGTCTTTTGTGTTTACCTGATTCGGGTAAGTGCCTTTACCTGATTTGGGTAAACTTTTCTTACCTGATTCAGGTAAATTTACCTCTTTCAGGTAAACTTTATTTTTCTTACCTGATTCGGGTAATGTTGACCATTCACTGACCACATTATTAATGCCTATATTCCGCCCGCTCTGAATAAAAATCCCACGCTTTACCAGAACACTTTTTGCAGCAGAACACTTGTGCGGCAATATCCCGGTCAACTCGGAAAGTTGCTCGTTGCTCACCCAATCCAGTTTTTTATTAAAGCCATATGTTTTGCGCATGACAGCCAGGAAGACCAGAAGCTGGTGCTGTGTTAATCCGGCCAGCATTACAGCTTCCAGCAACTCATTTGCAATGCGCGTATAACCATCATCGAGATCTGCCACGCGCGGCTCCTTTTGTGCCGCATCCGGCACTGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCAGCAGACCGGCTTTCGCCATTTCTGAACCTGTCATATCGCCCCCAGCATGGTAGTAACCATCGCCATCAATGGACCAGCCAGATCTGGGTCCACACGAAACATCGACACAATACCTTCACTAATTTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACAGCCTGTTTTGCCTCACTGAGTTCCTTTTCCATTTCAGCCAACCTAGCCATGAAGCTATCCTGCTCAACCAGGTAACCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATGGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAAGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTGGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATAGACGGCCTGCTGTGAAACTTCGCAAGCAGCGCCCAGTTTCTTTTGTGAACCAACGATATTGATCGCTGTTTTGATAGCTGGGTTCATAACAACCTCCGTGGTTAATTTGAATCAAGATTAAAACTATGGTTGTTTTTAGTCAACAACCATTTTCGTTTGATGGAATAAAACCTTGGTTGTACATTTGGACTATGAAAACAACACTCTCAGAAAGACTTAAAGAAGCCAGATTAGCGCGAGGCCTTACACAAAAGGCGCTTGGGGATTTGGTCGGGGTTAGCCAAGCTGCTATTCAGAAAATCGAAACAGGGAAAGCTAATCAAACAACTAAAATCGTGGAGATCGCGAACGCTTTGGGTGTGCGCGCAGAATGGTTATCTTCTGGCGTTGGAAATATGTCAGACAGTACAGTGCAACCAATACAATCAACTGTCAGCCATTCCAAATACTTCAAGATTGACGTTCTTGATATAGAAGTCAGTGCTGGGCCGGGAGTCATCAACCGTGAGTTTGTAGAAGTTCTACGCTCGGTTGAGTACTCGTTTGACGATGCTCGTCACATGTTCGATGGTAGGAAGGCGGAAAATATCCGCATCATTAACGTGCGTGGTGACAGCATGTCAGGAACGATCGAACCAGGTGATCTGCTGTTCGTTGATATCACAGTTAAATCTTTCGACGGTGATGGTATCTATGCGTTTCTGTACGACGACACAGCCCATGTAAAGCGCCTGCAAATGATGAAGGATAAGCTGCTGGTCATCTCTGATAACAAAAGCTACTCACCGTGGGACCCGATCGAGAAAGACGAGATGAACCGGGTGTTCATCTTCGGTAAGGTTATTGGGAGCATGCCGCAGACATATAGGAAGCATGGTTAAAGTGAGGCTAAAAAACAGTTACAGCAATAGGCCTGTTGTTTTTCTTTAAACACGCAGTGTTAAACCGCTCTTTGAGATGCGGAGTAATGAGATGGAAGACTTGAATCACATAAGGGTTAGTGATGGAGTGCGTAGCGAGCAGCAATAGTGCAATACCTAATGTCGTTGAAGTAATACGTCGCATCAATGAAGGTTCCACTCAGCCATTTCTTTGCAAATGTGATGATGGGCAGTTGTATGTTTTGAAGTCAAAACCATCAATGCCCCCGAAAAATCTCTTAGCTGAGTTCATTTCGGCGTGTTTGGCTAATGATATCGGCCTTCCTTTACCTGACTTTAAAATCGTATTTGTGCCAGAGGAACTTATAGAGTACTCACCTGATCTGCAGCAACAAATTTGTACAGGATATGCCTTTGCTTCATTGTTCATTGACGGTGCAATAGCGTTAACGTTTACGCAGTCAAGAAACGAAACGATCATCCCAGTCGAACAGCAAAAATTAATCTATGTTTTTGATAAATGGATATTAAATGCAGACAGAACGCTTACTGACAAAGGTGGAAACGTTAACATCCTTTATGACATCAGTAACGATAAGTATTATCTGATTGACCATAATCTCTCATTTGATCAGAATGCTGGACCTGAAGATTTTTCTGTGCACGTGTACGGCCCTGGTAACCGCAAATGGCAATATGATTTAGTGGATCGCGTAGAGTACCGCCAGAGGGTCGTTAACAGTTTACACAAGCTTCCTGCTATCCTTGACGAAATTCCAGAAGAGTGGATAGTAGATGAGGAGTTTTTACCTTTTGTCTGCACTACGCTAGACAAAGGTGATTGTGATGAATTTTGGAGCGCAATAGAATGACAACTCCATGCCTATATAGCATCGTTCGCTATGCGCCTTATGCGGAGACTGAAGAATTCGCAAACATAGGCGTACTTCTGTGCGCGCCAAAAGAAAATTACTTTGATTTCCAGCTCACAAAGCGAAATGACTCTCGTGTAAAGAATTTTTTCCATGATGATTGTATTTTCCCTGTAGCAAAAGACTCAATACAAAGAGAACTACAGTTCGCAAAAATGCATGCGACCCAGATTGTTGGACATCAACAACTTGCACAATTCTTCAGATATTTTACAAACAAAAAAGAATCAATTTTTCAGTTCAGTTCTACGAGAGTGATTCTCAGCGAAAACCCAAAAGAAGAGCTGGCCCGCATTTACAATAAATATGTAAACCACTCTGACTACACAAAAGAGCGCCGTGAAGATGTTCTAGCCAGAGAGCTAAAACGAAGTATCGATAGAATAGATGGATTGAAGAACGTCTTCAAACAAGCAACCATTGATGGGTATTTCGCAAAGTTCTCAATGCCATTGGTCGCCAAGAAGCATGACAGGATCCAATGTGCCATCAAACCTCTGGCATTCACTCAAGCTGAACCAGGAAAAATGATGGAGCATAGTGATACTTGGGTGATGAGAATAACTCGAGCAGCAGAAGAAAACCTGCTTTCACTTGATGACATTTTATTCACAATTGAAACTCCTGAATCACCAAACTCAGGCCAAAGCAAAGTTATTGACATCATAAAGAGAACTATGGATGCTAAGAAAATAAATCATATACCTGCATCCAACCACAAAGAAACTATTGATTTTGCAAAAAAAATACTTCCCCAAGTTTAAAATTTATTTTTGTATGTGATATTCCTTATTAATAACCCGGCCACCGTGCCGGGTTTTCTTTTGCCTCCCCTCATCACACAAACCGCTCAAAAAACCACCATAACCTCGCTTCAGTTATCGCTATGCGATTCAAGTCACAAAATAAATCCATCCTAAATACAACCAGTTATATCTAAAACAACCAATAAAACAACTTTTGTTGTTGACGGTAAAACAACTATAGTTTTAAATAGGTTCATCGCAACAACACAACGATACGGCAACCACCTGATTCACCGTTGCGATGACCGCTTAGATCCGCAGTTTGAATTTCAGCAGGCTTCGGGGAGTGCGAGGGGTGAAACGGACGCGTGAACGTCGGTGTGACCAGCTGAAATCAACTCAACATTTCATACCTTAGTCGCTTCAACGAGGCGGCTTAGTTATGACAACCGGCGGCCATCCACCGCCTGAATACGCGCAGAAGTCTCTATATGTTCAGCAGCCCAGCTTACGGGCAGGAGTTTTTATGGTTCATCAACATTACGGAACGCAGACCGTTAATCGCGGCGCGGTCATGCCAGGAATGCTGGTCAAACACAAAGATGGTACCTGGACTGCATCAGCTAATTTACGCGGACGGCTTTATCTGCATCGCGGCATCGAGCGCACTTATACCCGTGATTTGCTCGTGGAAGTTTTTCTCGACGGACGCGGTAACGGCCTGAATCACTAATCCCCTTTCCTGTTTTCCTAATCAGCCTGGCATTTCGCGGGCGATATTTTCACAGCCATTTTCAGGAGGTCAGCCATGAACGCTTATTACATTCAGGATCGTCTTGAGGCTCAGAGCTGGGCGCGTCACTACCAGCAGATCGCCCGTGAAGAGAAAGAGGCAGAACTGGCAGACGACATGGAAAAAGGCCTGCCCCAGCACCTGTTTGAATCGCTATGCATCGATCATTTGCAACGCCACGGGGCCAGCAAAAAAGCCATTACCCGTGCGTTTGATGACGATGTTGAGTTTCAGGAGCGCATGGCAGAACACATCCGGTACATGGTTGAAACCATTGCTCACCACCAGGTTGATATTGATTCAGAGGTATAAAACGGATGAGTACAGCACTCGCAACGCTGGCTGGGAAGCTGGCTGAACGTGTCGGCATGGATTCTGTCGACCCACAGGAACTGATCACCACTCTTCGCCAGACGGCATTTAAAGGTGATGCCAGCGATGCGCAGTTCATCGCATTGCTGATCGTCGCCAACCAGTACGGCCTTAATCCGTGGACGAAAGAAATTTACGCCTTCCCTGATAAGCAGAACGGCATTGTTCCGGTGGTGGGCGTTGATGGCTGGTCCCGCATCATCAATGAAAACCAGCAGTTTGATGGCATGGACTTTGAGCAGGACAATGAATCCTGTACATGCCGGATTTACCGCAAGGACCGTAATCATCCGATCTGCGTTACCGAATGGATGGATGAATGCCGCCGCGAACCATTCAAAACCCGCGAAGGCAGAGAAATCACGGGGCCGTGGCAGTCGCATCCCAAACGGATGTTACGGCATAAAGCCATGATTCAGTGTGCCCGTCTGGCCTTCGGATTTGCTGGTATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACTGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATCGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTTAAACAGAAAGCCACTGAGCAGAAGGTGGCAGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGGGATGATGCATGGCACAAATTACGGCTCGGCGTCATCACCGCTTCAGAAGTTCACAACGTGATAGCAAAGCCCCGCTCAGGAAAGAAGTGGCCTGACATGAAAATGTCCTACTTCCACACCCTGCTGGCTGAGGTTTGCACCGGTGTGGCTCCGGAAGTTAATGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAACGACGCCAGAACCCTGTTTGAATTCACTTCCAGCGTGAATATTACTGAATCCCCGATCATCTATCGCGACGAAAATATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAACTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAATTCCGGCTCGGTGGTTTCGAGGCAATAAAATCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACGCGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGCATGAAGCGTGAAGGCCTGCATTATGTCGTGGTTGAGCGGGATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGTGCGATCACTTTCGTCTACTCCGTTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCCCTGAAAGCACAGCGGCTGGCTGAGGAGATAAATAATAAACGGGGAGCTGTATGCACAAAGCATCTCCCGTTGAGTTAAGAACGAGTATCGAGATGGCACATAGCCTCGCTCAAATTGGAGTCAGGTTTGTGCCAATACCAGTAGAAACAGACGAAGAATTTCATACGTTAGCCGCATTCCTTTCACAAAAGCTGGAAATGATGGTGGCGAAAGCAGAAGCAGATGAGAGAGACCAGGTATGACAACCACTGAATGCATTTTTCTGGCAGCGGGCTTCATATTCTGTGTGCTTATGCTTGCCGACATGGGGCTTGTTCAATGACACCTCAGCAAGAAAACGCCCTTCGCAGCATTGCCCGTCAGGCTAATTCTGAAATCAAAAAAGCCAGACAGCAGTTTCCGGATAAAAACGTCGATGACATTTGCCGTAGCGTACTAAAGAAGCACCGCGAAACGGTAACGCTGATGGGATTCACACCGACTCATTTAAGCCTGGCGATCGGCATGTTGAACGGCGTCTTTAAGGAACGGTGAACATGAAAAGCAAAATCATCAGGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATAATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGATAGATGAACGAAGACGACTGGCCGTTCAGGGTTGTCGGACTTGTGCCAGTTGCCAGCAAGATCTGGAGCTTATCAGTAAACAGAGAGGTTCGAAGTGAGCGAAATTAACTAGAAGCCAAAGATAAAATCATCGCTGAGCAGGAGAAAATCGCTAACGGAGAAAAGACAGTAAGTCAGTATATGAAAACCGCATGATATCATCAGATAAAAATCGGTCGTAAAGCGAAATATTAATACCAGAACAAACGAGTCGAGGTAAATTATATTACCTCGATAAATTAACTAAAACTTGCCCGCTATATACTATATCATTCAGTATCATCACGCGCGGTCTGTGCATATGTCACTACCGCACCTAATATATTAATTTTCTTTTCAACATAGATAATATTATCGTACTCATAATTGCCATACGGATAGCAAATGCGAATATTCTCATGTAGATCGGGGTCATCCACCTCAGCTCCAGAACAACTTTTTGAACTACCGGAAGTATACCGATACGGTGCAACATAAGACGATGTCTCTCCAGGCAAAAAATAAGTTAGTGTCGTAAGGGGTATAATCAGAAAAAATCCAGCAAATATGCACATCCCTGCATAAACCTTAAGGTATGCTGACAGACTCTTCCAGCCGCTTTGTTTTACTATCCCCTTCTTAACCCAAAACAGAGATAACAGAAAAGCTATTCCCATGCTAAACAGAATGTAATAGTGGGATATACTCTGATTAAGAAACGTGACCCTGTAGATATCTGCCCGCCACCAGAAGAAAAGGAAAATAAAGATCAGCCCTGAAACTGTCATGCAAATCAAATAAGGATACGAATCTTTTTTCATGTTTAGCGCCCATAAAATTTTTCCTGACCCGGACAAATTTACCATCCATTTTTTGCGCAGAAAATAGCTCATTACTTACTGCACAATAATACACAAAATTGCGTAAATTTTTTGCATGGATTTTAGCTCTTTCAGCCGACATTTAAGGGGTAAATAGCATTTCCTAAAAGCAACTGCACCAACCCAACAGAATGGGCTACCGCTTACGTTGAGAGCAAAAAAGTGTATAGCAGCAATGAACAGCATCCTCGCACTGACGAGGATTTCTTTTATCTGAACTCGCTACGGCGGGTTTTGTTTTATGGAGATGATAAATGCACTTCCGAGTCACAGGTGAATGGAATGGAGAACCATTCAACAGAGTTATCGAAGCCGAGAACATCAGCGACTGCTATGACCACTGGATGCTGTGGGCGCAGATAGCACATGCAGACGTAACCAATATTCGAATTGAAGAACTGAAAGAACACCAAGCCGCCTGATGGCGGTTTTTTCTTGCGTGTAATTGCGGAGACTTTGCGATGTACTTGACACTTCAGGAGTGGAACGCTCGCCAGCGACGCCCAAGAAGCCTTGAAACAGTTCGTCGATGGGTGCGCGAATGCAGGATATTCCCTCCTCCGGTTAAGGATGGAAGAGAATATCTGTTCCACGAATCAGCGGTAAAGGTTGACTTAAATCGACCAGTAACAGGTAGCCTTTTGAAGAGGATCAGAAATGGGAAGAAGGCGAAGTCATGAGCGCCGGGATTTACCCCCTAACCTTTATATAAGAAACAATGGATATTACTGCTACAGGGACCCAAGGACGGGTAAAGAGTTTGGATTAGGCCGAGACAGGCGAATCGCAATCACTGAAGCTATACAGGCCAACATTGAGTTATTTTCAGGACACAAACACAAGCCTCTGACAGCGAGAATCAACAGTGATAATTCCGTTACGTTACATTCATGGCTTGATCGCTACGAAAAAATCCTGGCCAGCAGAGGAATCAAGCAGAAGACACTCATAAATTACATGAGCAAAATTAAAGCAATAAGGAGGGGTCTGCCTGATGCTCCACTTGAAGACATCACCACAAAAGAAATTGCGGCAATGCTCAATGGATACATAGACGAGGGCAAGGCGGCGTCAGCCAAGTTAATCAGATCAACACTGAGCGATGCATTCCGAGAGGCAATAGCTGAAGGCCATATAACAACAAACCCTGTCGCTGCCACTCGCGCAGCAAAATCAGAGGTAAGGAGATCAAGACTTACGGCTGACGAATACCTGAAAATTTATCAAGCAGCAGAATCATCACCATGTTGGCTCAGACTTGCAATGGAACTGGCTGTTGTTACCGGGCAACGAGTTGGTGATTTATGCGAAATGAAGTGGTCTGATATCGTAGATGGATATCTTTATGTCGAGCAAAGCAAAACAGGCGTAAAAATTGCCATCCCAACAGTATTGCATGTTGATGCTCTCGGAATATCAATGAAGGAAACACTTGATAAATGCAAAGAGATTCTTGGCGGAGAAACCATAATTGCATCTACTCGTCGCGAACCGCTTTCATCCGGCACAGTATCAAGGTATTTTATGCGCGCACGAAAAGCATCAGGTCTTTCCTTCGAAGGGGATCCGCCTACCTTTCACGAGTTGCGCAGTTTGTCTGCAAGACTCTATGAGAAGCAGATAAGCGATAAGTTTGCTCAACATCTTCTCGGGCATAAGTCGGACACCATGGCATCACAGTATCGTGATGACAGAGGCAGGGAGTGGGACAAAATTGAAATCAAATAA
Protein sequences of DBSCAN-SWA_7 >CP034966|3100257:3127461|3117076_3118006_-|QAS92337.1|DBSCAN-SWA MANTAEIFNFPVPDAAQKEPRVADLDDGYTRIANELLEAVMLAGLTQHQLLVFLAVMRKTYGFNKKLDWVSNEQLSELTGILPHKCSAAKSVLVKRGIFIQSGRNIGINNVVSEWSTLPESGKKNKVYLKEVNLPESGKKSLPKSGKGTYPNQVNTKDKLTKDNIKPYSSENSGESSDLPENDLPVVKADAAIQNGSKWGTAEDLTAAEWMFDMVKTIAPSARKPNFAGWANDIRLMRERDGRNHRDMCVLFRWACQDNFWSGNVLSPAKLRDKWTQLEINRNKQQAGVTAGKPKLDLTNTDWIYGVDL >CP034966|3100257:3127461|3125142_3125745_-|QAS90885.1|DBSCAN-SWA MSYFLRKKWMVNLSGSGKILWALNMKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >CP034966|3100257:3127461|3107356_3107767_+|QAS90852.1|DBSCAN-SWA MAQVAIFKQIFDKVRNNLNYHWFYSELKRHNVSHYIYYLATENIHLVLENDNTVLIKGQGKVVNVRFSKNKCLIEATLKGFKSGELSFYEYRKNLATAGVFRWITNIHENKRYYYTFDNSLLFTENIQNTTQIFPH >CP034966|3100257:3127461|3124330_3124612_+|QAS90883.1|DBSCAN-SWA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >CP034966|3100257:3127461|3100257_3101547_+|QAS90845.1|DBSCAN-SWA MTTDDLAFDQRHIWHPYTSMTSPLPVYPVVSAEGCELILSDGKRLVDGMSSWWAAIHGYNHPQLNAAMKSQIDAMSHVMFGGITHAPAIELCRKLVAMTPQPLECVFLADSGSVAVEVAMKMALQYWQAKGEARQRFLTFRNGYHGDTFGAMSVCDPDNSMHSLWKGYLPENLFAPAPQSRMDGEWDERDMVGFARLMAAHRHEIAAVIIEPIVQGAGGMRMYHPEWLKRIRKMCDREGILLIADEIATGFGRTGKLFACEYAEIAPDILCLGKALTGGTMTLSATLTTREVAETISNGEAGCFMHGPTFMGNPLACAAANASLAIIESGEWQQQVAAIEVQLREQLAPARDAEMVADVRVLGAIGVVETTRPVNMAALQKFFVEQGVWIRPFGKLIYLMPPYIILPQQLQRLTAAVNRAVQDETFFCQ >CP034966|3100257:3127461|3120594_3121422_+|QAS90876.1|DBSCAN-SWA MTTPCLYSIVRYAPYAETEEFANIGVLLCAPKENYFDFQLTKRNDSRVKNFFHDDCIFPVAKDSIQRELQFAKMHATQIVGHQQLAQFFRYFTNKKESIFQFSSTRVILSENPKEELARIYNKYVNHSDYTKERREDVLARELKRSIDRIDGLKNVFKQATIDGYFAKFSMPLVAKKHDRIQCAIKPLAFTQAEPGKMMEHSDTWVMRITRAAEENLLSLDDILFTIETPESPNSGQSKVIDIIKRTMDAKKINHIPASNHKETIDFAKKILPQV >CP034966|3100257:3127461|3109219_3109435_-|QAS90856.1|lysis|DBSCAN-SWA MKSMDKLTTGIAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >CP034966|3100257:3127461|3121930_3122137_+|QAS90877.1|DBSCAN-SWA MVHQHYGTQTVNRGAVMPGMLVKHKDGTWTASANLRGRLYLHRGIERTYTRDLLVEVFLDGRGNGLNH >CP034966|3100257:3127461|3119036_3119726_+|QAS92338.1|DBSCAN-SWA MKTTLSERLKEARLARGLTQKALGDLVGVSQAAIQKIETGKANQTTKIVEIANALGVRAEWLSSGVGNMSDSTVQPIQSTVSHSKYFKIDVLDIEVSAGPGVINREFVEVLRSVEYSFDDARHMFDGRKAENIRIINVRGDSMSGTIEPGDLLFVDITVKSFDGDGIYAFLYDDTAHVKRLQMMKDKLLVISDNKSYSPWDPIEKDEMNRVFIFGKVIGSMPQTYRKHG >CP034966|3100257:3127461|3110218_3110353_-|QAS90858.1|DBSCAN-SWA MPYICSIILVLNSFDVRIGKEDILFKKGSAVLIDYNLKDFFHQI >CP034966|3100257:3127461|3126390_3127461_+|QAS90889.1|integrase|DBSCAN-SWA MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >CP034966|3100257:3127461|3101605_3102082_+|QAS90846.1|DBSCAN-SWA MKLISNDLRDGDKLPHRHVFNGMGYDGDNISPHLAWDDVPAGTKSFVVTCYDPDAPTGSGWWHWVVVNLPADTRVLPQGFGSGLVAMPDGVLQTRTDFGKTGYDGAAPPKGETHRYIFTVHALDIERIDVDEGASGAMVGFNVHFHSLASASITAMFS >CP034966|3100257:3127461|3122212_3122509_+|QAS90878.1|DBSCAN-SWA MNAYYIQDRLEAQSWARHYQQIAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKAITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV >CP034966|3100257:3127461|3106117_3106678_-|QAS90850.1|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIKRRRSLSTRMRLTQQLI >CP034966|3100257:3127461|3112999_3113140_-|QAS90862.1|DBSCAN-SWA MMFEFYMAELLRHRWGHLRLYRFPGSVLTDYRILKNYAKTLTGAGV >CP034966|3100257:3127461|3109622_3110210_-|QAS90857.1|DBSCAN-SWA MIVDVEEKTVNDFFKSNTLSPFSVRRFYPAYLMVECEDFSLLKNLIACLNCDGRTVDFVRNQISLACLAILSSEKIVQSFLFGCLNSLGSKVKAIIHTDISAAWRLCDISSRLYLSESLLKRKLKHEGLSFSKLILEERMVMAERLLSYNLYSVGKVAEICGYENTSYFVSVFRRYFGVPPHQYSSRFFLEKDMM >CP034966|3100257:3127461|3113778_3113949_-|QAS90865.1|DBSCAN-SWA MATPLIRVMNGHIYRVPNRRKRKPELKPSEIPTLLGYTASLVDKKWLRLAARRNHG >CP034966|3100257:3127461|3116088_3116382_-|QAS90871.1|DBSCAN-SWA MTGKEAIIHYLGTHKSFCAQDVAAVTGATVTSINQAAAKMARTGILVIDGKVWRTVYYRFATREEREGKVSTNLIFKECRQSAAMKRVLAFYRGNFQ >CP034966|3100257:3127461|3125869_3126055_-|QAS90886.1|DBSCAN-SWA MFSASITLLNGSPFHSPVTRKCIYHLHKTKPAVASSDKRNPRQCEDAVHCCYTLFCSQRKR >CP034966|3100257:3127461|3108722_3109220_-|QAS90855.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIIPGKTYTEAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSALLRKINQGDIKGACDQLRRWAYAGGKQWKGLMTRREIEREVCLWGQQ >CP034966|3100257:3127461|3104558_3105227_+|QAS90848.1|DBSCAN-SWA MVKLMKKNTDDGAKIYTPLTLKLYDWWVLGVSNRLAWGCPTKEHLLPHFLEHLGNNHLDIGVGTGFYLTHVPESSLISLMDLNEASLNAASTRAGESKIKHKISHDVFEPYPAALHGQFDSISMFYLLHCLPGNISTKSCVIRNAAQALTDDGTLYGATILGDGVVHNSFGQKLMRIYNQKGIFSNTKDSEEGLTHILSEHFENVKTKVQGTVVMFSASGKK >CP034966|3100257:3127461|3114400_3114502_-|QAS90867.1|DBSCAN-SWA MRTYNPNSLLPSQMQKCTCDFLHPAFDLCGGEA >CP034966|3100257:3127461|3124128_3124320_+|QAS90882.1|DBSCAN-SWA MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV >CP034966|3100257:3127461|3116378_3117080_-|QAS90872.1|DBSCAN-SWA MKNIAAQMVNFDREQMCRIANNMPEQYDEKPQVQQVAQIINGVFSQLLATFPASLANRDQNELNEIRRQWVLAFRENGITTMEQVNAGMRVARRQNRPFLPSPGQFVAWCREEASVIAGLPNVSELVDMVYEYCRKRGLYPDAESYPWKSNAHYWLVTNLYQNMRANALTDAELRRKAADELVHMTARINRGEAIPEPVKQLPVMGGRPLNRAQALAKIAEIKAKFGLKGASV >CP034966|3100257:3127461|3112536_3112914_-|QAS90861.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLIPSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVGGMTFMALARKHGRSDCWVGRMLQKAEGVVEGMLMVLDLRLEMDADCSK >CP034966|3100257:3127461|3118701_3118932_-|QAS90874.1|DBSCAN-SWA MNPAIKTAINIVGSQKKLGAACEVSQQAVYKWLHNKAKVSPEHVGSIVTATGGVVKAYQIRPDLPKLFPHTEKNAA >CP034966|3100257:3127461|3113136_3113499_-|QAS90863.1|DBSCAN-SWA MNTYNITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE >CP034966|3100257:3127461|3108118_3108271_-|QAS90853.1|DBSCAN-SWA MPSMIAIILLIILHIWLCRQGGAHFWSESRLNISLLMLDIEHFARGKGLR >CP034966|3100257:3127461|3110704_3111664_-|QAS90859.1|DBSCAN-SWA MIKKPVIGISGCLAGSAVRFDGGHKRADFLMDKLVEWVTFRPVCPEMAIGLPVPRPALRLVRSMQGNIRMCFSHDQNEDVTERMTEFSRSYMDKLKDVSGFVVCAKSPSCGMERVRVYDENGNRGRKDGVGLFTSTLMEKFSWLPVEEDGRLHDPVLRENFVERVFALHELNHLYKEKLSRRELLAFHSRYKLQLLAHSQAGYKDMGPFVAAIHEWADLESYFEVYRDKLMAILRKPASRKNHTNVLMHIQGYFSNYLSTRQRKELSEVILNYRSGTLPLLAPLTLLKHYLGEYPNDYLLTQNYFDPYPDELALRLMVN >CP034966|3100257:3127461|3125987_3126155_+|QAS90887.1|DBSCAN-SWA MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >CP034966|3100257:3127461|3104232_3104409_-|QAS92336.1|tail|DBSCAN-SWA MQEFSEHIAPLQDAVDLEIATEEENSLLEAWKKYRVLLNRVNTTTAPDIEWPTVPIIE >CP034966|3100257:3127461|3123973_3124156_+|QAS90881.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >CP034966|3100257:3127461|3108299_3108506_-|QAS90854.1|DBSCAN-SWA MRKLKMMLFGASLIMVVGCSSKENALCHPQPKPPAPPAWAMMPPSNSLQLLDETFSVSGTELSATKQH >CP034966|3100257:3127461|3123296_3123977_+|QAS90880.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSSVNITESPIIYRDENMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP034966|3100257:3127461|3113948_3114404_-|QAS90866.1|DBSCAN-SWA MNLPQDGIKLHRGNFTAIGQQIQPYLEDGKCFRMVLKPWREKRSLSQNALSHMWYSEISEYLISRGKSFATAAWVKEALKHTYLGYETKELVDVVTGEITTIQSLRHTSDLDAGEMYVFLCKVEAWAMNIGCHLTIPQSCEFQLLRDKQEA >CP034966|3100257:3127461|3102827_3104159_+|QAS90847.1|DBSCAN-SWA MIINQVPIKIKIFIFLFSCISIIFLLLHANNGIYITQTTQISYSVFIIGLFFINLMIFIFLLLYYVSNQRQSYLLILSFAFLSNTYYLLEVAIISLSPLGNDLSTIYQKSNDIAIYYLFRQFSFISIIFLAVYSTNVKNKSVLEDKRNIIIVVLSILILFITPFVAKNLSSDNIKYSLNIIQYSLNRHLPTWNIVYTKIISVFWLVLLISSCISIRNYSKIWLCIILISIVSVCNNLILLYFIDKSHPAWYMTKFLELISMIYIISTLMYYVFRKLNHANHMAIHDPLTNTYNRRYFIDSLKNISKHHDFSVIMLDIDSFKSINDKWGHHMGDQVIVMVTRIIKKSIRKEDVLGRLGGEEFGIIIKGNTQKLLLSIAERIRKNIEEQCSEKLLSHGPEKITVSIGCFTSKENNLSPSEMLVNADKALYQAKRTGKNKVIIHSK >CP034966|3100257:3127461|3105171_3105309_-|QAS90849.1|capsid|DBSCAN-SWA MAYEPCQGVIIGIMPQHVLSKSELRLDAIFSLKRKTLLQYLEPWF >CP034966|3100257:3127461|3124710_3124932_+|QAS90884.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >CP034966|3100257:3127461|3115860_3116052_+|QAS90870.1|DBSCAN-SWA MRAKIYQLSLWIFISFLAIYAFIIYKGSYIGVALHQIAWIIIIASGLIARLTKPKQKPISSNN >CP034966|3100257:3127461|3107066_3107300_+|QAS90851.1|DBSCAN-SWA MSTKNRTRRTTIRNIRFPNQMIEQINIALDQKGSGNFSAWVIEACRRRLCSEKRVSPEANKEKSDITELLRKQIRPD >CP034966|3100257:3127461|3114594_3115047_-|QAS90868.1|DBSCAN-SWA MKRLFVIAPLLVLVGCAQNISPNSYSVGSVGMVNRTIAGTVISARGVDISGTSALGGTAGAAVGATAGSALGGGVRSNIVGAVGGAVIGGIAGAAIESSATKQTGMEYVVETENGNLMTIVQGKDPLFTQGSKVLVLYGNPSRIITDPRH >CP034966|3100257:3127461|3126194_3126413_+|QAS90888.1|DBSCAN-SWA MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >CP034966|3100257:3127461|3118092_3118632_-|QAS90873.1|DBSCAN-SWA MQPLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGYLVEQDSFMARLAEMEKELSEAKQAVILNAPRHQKLKEISEGIVSMFRVDPDLAGPLMAMVTTMLGAI >CP034966|3100257:3127461|3111856_3112381_+|QAS90860.1|DBSCAN-SWA MKLWPVLTGIALSFTLIACKAPTPPKGVQPITNFDANRYLGKWYEIARLENRFERGLEQVSATYGKRNDGGIRVLNRGYDPTKNKWSESEGKAYFTGDTKTAALKVSFFGPFYGGYNVIKLDDEYKYALVSGPNREYLWILARTPTIPDKVKADYVRTAQKLGFNVNELLWVKQ >CP034966|3100257:3127461|3122514_3123300_+|QAS90879.1|DBSCAN-SWA MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKATEQKVAA >CP034966|3100257:3127461|3119848_3120598_+|QAS90875.1|DBSCAN-SWA MECVASSNSAIPNVVEVIRRINEGSTQPFLCKCDDGQLYVLKSKPSMPPKNLLAEFISACLANDIGLPLPDFKIVFVPEELIEYSPDLQQQICTGYAFASLFIDGAIALTFTQSRNETIIPVEQQKLIYVFDKWILNADRTLTDKGGNVNILYDISNDKYYLIDHNLSFDQNAGPEDFSVHVYGPGNRKWQYDLVDRVEYRQRVVNSLHKLPAILDEIPEEWIVDEEFLPFVCTTLDKGDCDEFWSAIE >CP034966|3100257:3127461|3115043_3115604_-|QAS90869.1|DBSCAN-SWA MKKIILLAMIIGSLTGCASVPPLNFSTPNVGVSQKKIDAEIKSLTVSLARPDEQKGDITAGMEAITPIWRESLQEALDRMTIFRDSSPNTVSLNVKVLALDVPAFGVSMTTKAIARYEIINRANGDIIYTQDIESTGTVPASYAFYGIVRARESVNRAVQNNITQFLQALESVDLSRPMFPVRVAK >CP034966|3100257:3127461|3113495_3113786_-|QAS90864.1|DBSCAN-SWA MADLRKAARGRECQVRTPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDRRTHFVDAAYAKECALEGMARTQVIWMKEGVIKA |
48 | Enterobacteria_phage(47.06%) | capsid,lysis,tail,integrase | attL 3102173:3102187|attR 3127535:3127549 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
3575358 : 3639669
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >CP034966|3575358:3639669|DBSCAN-SWA ATCACGCGTCCGGGAACATCACGCTATGGCCCGGCGCTTCACGATGGAGATGAATAAAGTTAAGGTGCCGCTCGTACTGGTCAAGAATATCGGTGATCACCTGCTCTTTGCTGTAGTCCATCAGGTCGTTGCCCTGGCTGCCTTCTAACAGGAAGGTTTCCAGCCGGTAGTAGGTCGATTTACCGCTGCGTGCGCGGTAGGTAAAGCCCGGCACCGAATATTGCTGCGGCCAAATCTGATAGACAAAGTTTTGCTCTTCGCCCATATGCACCAACAAATCCAGATGCCCCAACTGTTGTCCCTCTTCCGGCGGCAGGCTTTTTAGCTCCACGTACGCGCCGCGCAACCGCAGCTCCTGCGCCACTTCTTCCATTGCCGGGTAACAGACCGTCTCCATCATCTGTTTAGTGTAACGCGTGCCCGGATAATTCATCAGGCGCGAGAGACGTTTTTTCCAGCTCAGGCGATCCTGAAGCCCCAGCGGTCGCGGCGCGGTATCGCGGTTGGCACTTTCACGGCGGTAATCTTCTACCTTCAGAGATTTATACAACCCCGCCATCACGAAGAAAATCACAAAGCTGAACGGCAGCCCCATGATTACCGTGGTGTTTTGCAGCGCGGATATCCCGTTGGTCATCAGCATGCCGAGCGTCAGCAGACCAATCGCCACCGACCAGAAGACGCGCAGCCAGCCGGGGGCGTCGCTGTTGATATCTTTAAGCTGCGAGGTGAAATTCCCCAGCACCAGCGCACCGGAGTCCGCCGAGGTCACATAAAACAGCAGGCCAGTAATGGTGGCGACGGAGGCGCTAAAGGTAAACGCCGGATACTGCGCCAGCAGGCTGTAGAAGCCGCGCTCCGGATGGACCATCGCTTCCTCGGCAAATGCCGCGCCGCCGTGGATAATTTCATACAGCGCGCTATTGCCGAACACCGAGAGCCATAACAGCGTGAAGGTAAACGGAATAATCAACGTGCCCAGCACGAACTGGCGAATGGTACGCCCACGCGAGATACGCGCAAGGAACAAGCCGACAAACGGCGACCATGCCACCCACCATGCCCAGAAGAAGAGCGTCCAGTTATTCATCCACTCAACTGGACGGTCGAAGGCAAAACTGTTGAGCGTCATGCCCATAAAGCGATTCACATAGTCGCCAACATTCAGCACCAGCGCATTAAGCAGGAACGAAGTGTCGCCCATAAACAATACGAACAGGATCAATCCCAGCGCCAGCGCGACATTAAGCTCCGATAACACGCGAATGCCCTTATCGACACCGGAGGTGACAGAGATCGTGGCGATTATCACCGACAAGGCGATCAGTGCCGCTTTTGCCGCCATCGAATCGGGAATATCAAACAGTACGCTCAAGCCATAGTTAAGCTGCACCACACCGATACCGAGCGTAGTGGCAATACCGAAGATAGTGCCGATCACCGCTGCAATATCCACTGAGTGACCTATAGGTCCGTTAATCCGTTTACCGAAGATCGGGTACAGCGCCGAGCGGATGGTGAGCGGCAAATTATAACGATAGCTAAAGTATCCGAGCGCCATGCCCATCAGCGCATACATCGACCAGCCGGTTAAGCCGTAGTGAAACAGCGTCCAGACCATCGCCTGACGCGCGGCCTCAATCGTCTGTCCCGCGCCTTCCGGCGGCTGCATATACTGCGTTACCGGTTCGGCTACGGAAAAGAACATCAGGTCGATACCGATCCCGGCAGCAAACAGCATCGCCGCCCAACTCAGCAGGCTGAATTCCGGTTTGGATTGTTCTGGCCCGAGCTTCACCGAACCAAAACGCGAACAAGCGATACAGACCACAAAGACAATATAGAGCGTTGCCGCCAGCAGATAGTACCAACCGAAGGTTTTAGACACCCAGTCCAGCGTGCGGCCAATCCACAGGGCCGAGAAGTCGCGAAACAGGATCGTTGTCAGGGAAAACAACAAAATCAGTCCGGCGGAGGTGTAAAACACCACCGGATTGATTTTGTCCTTTTCCCTGCTGTGTGAAAGGTCTGTCATCCAGTATCCCCACTGTTATTGTTACCATTTAAAATCAAATTCGTAACAATTAAGACACATTTTATATTGAACGTCCAATCAATAACCGCTTTAATAGATAAACAACGCTGATGAATGGAGTGGCGAAAATGCCCAAATTGGGGATGCAGTCGATCCGGCGCAGACAACTGATCGACGCCACACTGGAAGCAATAAATGAAGTGGGCATGCACGATGCAACGATCGCGCAGATCGCCCGCCGTGCAGGCGTTTCTACGGGGATCATCAGCCACTATTTCAGGGACAAAAATGGTCTGCTGGAAGCAACCATGCGCGACATCACCAGTCAGTTACGTGACGCGGTTTTGAATCGATTACATGCACTTCCGCAGGGCAGTGCAGAGCAGCGATTACAGGCGATTGTTGCCGGAAACTTCGATGAAACTCAGGTGAGCAGTGCGGCGATGAAAGCCTGGCTGGCGTTCTGGGCCAGCAGTATGCATCAGCCGATGCTCTATCGTTTACAGCAGGTCAGCAGCCGCCGCTTGCTGTCGAATCTGGTGAGCGAGTTTCGCCGCGAATTGCCGCGCGAACAGGCACAGGAAGCGGGCTACGGCCTTGCCGCGCTGATTGACGGATTATGGCTGCGCGCGGCACTGAGCGGCAAACCGCTGGATAAACCTCTCGCACACTCGCTGACCCGTCACTTTATTACTCAGCATTTACCCACCGATTAACCGAGGAGACGTGATGTCCCGAATGGCAGAACAGCAGCTTTATATACATGGTGGTTATACCTCCGCCACCAGCGGTCGCACCTTCGAGACCATTAACCCGGCCAACGGTAACGTGCTGGCGACCGTGCAGGCCGCCGGGCGCGAGGATGTCGATCGCGCCGTGAAAAGCGCCCTGCAGGGGCAAAAAATCTGGGCGTCGATGACCGCCATGGAGCGCTCGCGTATTCTGCGTCGGGCCGTTGATATTCTGCGTGAACGCAATGACGAACTCGCAAAACTGGAAACCCTCGACACCGGAAAAGCATATTCGGAAACCTCAACCGTCGATATCGTTACCGGTGCGGACGTGCTGGAGTACTACGCCGGGCTGATCCCGGCGCTGGAAGGCAGCCAGATCCCGTTGCGTGAAACGTCCTTTGTGTATACCCGCCGCGAACCGCTGGGCGTAGTGGCAGGGATTGGCGCATGGAACTACCCGATCCAGATTGCCCTGTGGAAATCCGCCCCGGCGCTGGCGGCAGGCAACGCAATGATTTTCAAACCGAGCGAAGTTACCCCGCTTACCGCGTTAAAGCTGGCTGAAATTTACAGCGAAGCGGGCCTGCCGGACGGCGTATTTAACGTGTTGCCGGGCGTGGGCGCGGAGACCGGGCAATATCTGACCGAGCATCCGGGCATTGCCAAAGTGTCATTTACCGGCGGTGTCGCCAGCGGCAAAAAAGTGATGGCTAACTCGGCGGCCTCTTCCCTGAAAGAAGTGACCATGGAACTGGGCGGTAAATCACCGCTGATCGTTTTCGATGATGCGGATCTCGATCTCGCCGCCGATATCGCCATGATGGCAAACTTCTTCAGCTCCGGTCAGGTGTGTACCAATGGCACCCGCGTCTTCGTTCCGGCGAAATGCAAAGCCGCATTTGAGCAGAAAATTCTGGCGCGCGTTGAGCGCATTCGCGCGGGCGACGTTTTCGATCCGCAAACTAACTTCGGCCCGCTGGTCAGCTTCCCGCATCGCGATAACGTGCTGCGCTATATCGCCAAAGGCAAAGAGGAAGGCGCGCGCGTACTGTGCGGCGGCGATGTACTGAAAGGCGATGGCTTCGATAACGGCGCATGGGTTGCACCGACAGTGTTCACCGATTGCAGCGACGATATGACCATCGTGCGTGAAGAGATCTTCGGGCCAGTGATGTCCATTCTGACCTACGAGTCGGAAGACGAAGTCATTCGCCGCGCTAACGATACCGACTACGGCCTGGCGGCGGGCATCGTGACAGCGGACCTGAACCGCGCGCATCGCGTCATTCATCAGCTGGAAGCGGGTATTTGCTGGATCAACACCTGGGGCGAATCCCCGGCAGAGATGCCCGTTGGCGGCTACAAACACTCCGGCATTGGTCGCGAGAACGGCGTGATGACGCTCCAGAGTTACACCCAGGTGAAGTCCATCCAGGTTGAGATGGCTAAATTCCAGTCCATATTCTAACCAGGAGGTTTATTTGCAATTTGACTACATCATTATTGGTGCCGGCTCAGCCGGCAACGTTCTCGCTACCCGTCTGACTGAAGATCCGAATACCTCCGTGCTGCTGCTTGAAGCGGGCGGCCCGGACTATCGCTTTGACTTCCGCACCCAGATGCCCGCTGCCCTGGCATTCCCGCTACAGGGTAAACGCTACAACTGGGCCTATGAAACGGAACCTGAACCGTTTATGAATAACCGCCGCATGGAGTGCGGACGCGGTAAAGGTCTGGGTGGATCGTCGCTGATCAACGGCATGTGCTACATCCGTGGCAATGCGCTGGATCTCGATAACTGGGCGCAAGAACCCGGTCTGGAGAACTGGAGCTACCTCGACTGCCTGCCCTACTACCGCAAGGCCGAGACTCGCGATATGGGTGAAAACGACTATCACGGCGGTGATGGCCCGGTGAGCGTCACTACCTCCAAACCCGGCGTCAATCCGCTGTTTGAAGCGATGATTGAAGCGGGCGTGCAGGCGGGCTACCCGCGCACGGACGATCTCAACGGTTATCAGCAGGAAGGTTTTGGTCCGATGGATCGCACCGTCACGCCGCAGGGCCGTCGCGCCAGCACCGCGCGTGGCTATCTCGATCAGGCCAAATCGCGTCCTAACCTGACCATTCGTACTCACGCTATGACCGATCACATCATTTTTGACGGCAAACGCGCGGTGGGCGTCGAATGGCTGGAAGGCGACAGCACCATCCCAACCCGCGCAACGGCCAACAAAGAAGTGCTGTTATGTGCAGGCGCGATTGCCTCACCGCAGATCCTGCAACGCTCCGGCGTCGGCAACGCTGAACTGCTGGCGGAGTTTGATATTCCGCTGGTGCATGAATTACCCGGCGTCGGCGAAAATCTTCAGGATCATCTGGAGATGTATCTGCAATATGAGTGCAAAGAACCGGTTTCCCTCTACCCTGCCCTGCAGTGGTGGAACCAGCCGAAAATCGGTGCGGAGTGGCTGTTTGGCGGCACTGGCGTTGGTGCCAGCAACCACTTTGAAGCAGGTGGATTTATTCGCAGCCGTGAGGAATTTGCGTGGCCGAATATTCAGTACCATTTCCTGCCAGTAGCGATTAACTATAACGGCTCGAATGCAGTGAAAGAGCACGGTTTCCAGTGCCACGTCGGCTCAATGCGCTCGCCAAGCCGTGGGCATGTGCGGATTAAATCCCGCGACCCGCACCAGCATCCGGCGATTCTGTTTAACTACATGTCGCACGAGCAGGACTGGCAGGAGTTCCGCGACGCAATTCGCATCACCCGCGAGATCATGCATCAACCCGCGCTGGATCAGTATCGTGGCCGCGAAATCAGCCCCGGTGTCGAATGCCAGACGGATGAACAGCTCGATGAGTTCGTGCGTAACCACGCCGAAACCGCCTTCCATCCGTGCGGTACCTGCAAAATGGGTTACGACGAGATGTCCGTGGTTGACGGCGAAGGCCGCGTACACGGGTTAGAAGGCCTGCGTGTGGTGGATGCGTCGATTATGCCGCAGATTATCACCGGGAATTTGAACGCCACGACAATTATGATTGGCGAGAAAATAGCGGATATGATTCGTGGACAGGAAGCGCTGCCGAGGAGCACGGCGGGATATTTTGTGGCAAATGGGATGCCGGTGAGAGCGAAAAAATGAGTCGTGATGTGAACTAACGCAGGAACCGATCAGATTGAGAGTTACCGTTCCAGAGAGGGGGACCGAATCCTTATATAAACACTGAGGTAACTCTCATGCTTCATATCCAGTATTCATGATGCGGGCTTTTTGCTGATATTTAACCCGAACATTCCTGATACAGACTATTAATACAACCTTATCTGCGACAGATTTTATTTTCTGGATATATTATGCGTGAACAAATCAAACAGGATATCGATCTGATTGAGATTTTATTTTATCTGAAGAAAAAGATTCGTGTTATCCTTTTTATTATGGCTATATGTATGGCTATGGTGCTGTTGTTTCTGTATATCAATAAAGACAATATAAAAGTGATTTACAGCCTAAAAATAAACCAGACAACGCCAGGTATACTTGTTAGCTGTGATAGCAATAATAATTTTGCCTGTCAGACTACAATGACTGAAGATGTTATTCAGCGAATTACTACATTTTTTCACACCAGCCCAGATGTCAAGAACAGAGAAATAAGGCTGGAATGGTCAGGAGATAAGAGAGCTTTACCAACTGCTGAAGAGGAAATATCTCGCGTGCAGGCCTCTATTATCAAATGGTATGCGTCAGAATATCATAATGGCAGGCAAGTTCTCGATGAGATACAAACGCCTTCAGCAATTAACAGTGAGCTTTATACAAAAATGATATACCTGACCAGGAACTGGTCATTGTATCCGAACGGTGATGGCTGTGTAACTATAAGCTCACCAGAAATAAAAAATAAATACCCTGCTGCCATTTGCCTGGCTCTGGGATTTTTTCTAAGCATTGTAATTTCTGTAATGTTTTGCCTTGTCAAAAAAATGGTAGATGAATACCAACAAAACTCTGGGCAGTAATAAACAACATTCTGATAGAGAGTGCCTGTATAAAGCTGAAGTGACAATGACCGGGAAGGGAACCCGTTTTCAGGAATGGATGCCTGTCTTTTTCACTGTAAATATCCTGGCATTGGTAATACCAGGATATTTACAGAAGAGCAGCATAACCATGACGGAACGGGGTTAAGTCAGGTTGTCCTAAACGCTATTTTCAACCTCGTATGCCTTCTTCAGGTTTATGTCCAGACTTCATATCTCTCTCAACAATCCTCAATAATCAGATACACCGCTTTCACCGGGCCATGAACTCCGACGACTTTGATAAGCTCAATATCCGCCGTTGAACTGGGGCCGCTAATGATGTTAATGCAGGAAGGCATTCGTTCACCGGCCTGCGCTTTCTGATGCAATTTTTCTGCGAGTTGCGCTACACGCGGCAGGATAGTGCTTTTACGCAGGATAAAAAGAGAATATTCCGGGAGCAGGCTCAATGAACGCCCGCGCTCGGCGGCGGAAAAAAGAATCACGCCTCCCGATTCGGTTAAACCATATTCAGCATACACAACACCCACTTTAGCCTGCTCTGCCTGCGAGATATTCTCGGCACCTTTCGCCGGATCCCAAACAACGGCATTGCATTCCTGCTGCAAACGTTCGCTAATCCCCAATTCCTCCAGCCTCGTGTCACCGCTAATCACGACCGACTGATCTCCCAGCTCTTTACACAGACGTATTGCAGCTTCTGCCGCCTTCGCCTCGCTGGTCAGCTCACAGCGCGTCAACATAACATCGCTGGCAAACTGAATAAACGCGTCACAGCGCTGCTGTTGGTTAAGTTGGGTAAGCCGCTCGTTAGCATAGTTGTTAAGCGGCGCATCTTCTGCTTGCGGTTCAAGTCGCAGCGGGCGACCCAGTGCCTGAGCAACGTTATTCAAAAATTCGCCTCGATTATCCATTCTTTTTCTCCTGCGCCTGATGTTTCTTAAACCAACTACGGAAACTCTCTCCGTCAGCTTCAGGAAGATCGCGTGCTTCCATCCAGTCGCTAATCGCGCCAAATTTGAGTGGTGTTTTGCCGCCATTGATAAACCAGCTTGCCGCATGAGCACCGGCCATCATCCCGACTTTCCACAATCCTGGATGACTATTGGCATAAGCGAACATTTTTATCGCCCGTTGCTCTGCTTTTGCGGTGATCCCTTTTTCAGCCATCACCCGACGATGACGCAAAATCAGTTTTGACAGCGGAATACGCACCGGACACACGTTGTCACAAGCTGTGCATAAAGAGCAGGCGTAGGGTAAATCTTTAAAATCTTTATAGCCGCCAAGTAGCGGAGAAATCACCGCACCAATTGGCCCTGGATAAATAGAGCCATATCCATGACCGCCAATATGGCGATATGCCGGACAAGTATTCATACAAGCCCCGCAGCGAATACAGCGCAGCACATCCCGAAATTCAGAGGCCAGCACCTCAGAACGCCCGTTATCGACAATAACCAGATGAAACTCTTCAGGACCATCAACGTGCCCAGCTTCGCGCGGTCCTGTCAGCCAGGTGTTGTATCCCGTCAAACGTGCACCAACGGCACTGCGCGCCAGCATGGTGATCAATACATCTACCTCGGCAAACGTGGGGGCAATACGCTCCATTCCCATCACTGCAATATGCGTTTTAGGCAGCGTGGTACACATTCGCGCATTACCTTCATTGGTCACCAGGCATACCGAACCGGTCTCTGCCACCGCGAAATTACAGCCGGTAATACCTATTTCAGCACTGAGGAAATCTTCGCGGATTTTTTGCCGGATGAATAAGGTCATCGCTTCAGGCGTTTCCGGCCCCTCATAGCCCAGACGTTCGTGTAGCACTCGACGGATCTGATGGCGATCTTTATGAATTGCCGGGACCACAACATGAGATGGCGGATCTTGATCCAGCTGGAGAATATATTCACCCAGATCGGTTTCAATCACCTGAATGCCAGCATCCTGCAACACATGATTGACACCAATCTCTTCGGTCACCATCGATTTAGATTTCACCACCTTCCGGGCATTTTTGCGTTGGGCAACCTGTAAAATGTAGCGGGTAGCGTCTTCTTTGGTTCTTGCAAAATAGACGTGACCGCCGTTTTGCGTCACTTTTTCTGAGAGCTGGTACAGATAAGCGTCGAGATTACTCAGAACATGATCACGTATCTGGGCGGCCCGATCGCGCCACTCCTCCCAGTGCCCCAATTCATCGACCATTTTTTGCCGATTTGCCCCAATACGCTGCTGCGCGTTTGCCACCGCTTTGCGCATGATCGGATCTTCAATTTGCTGACGGATGCGTGTCTTAAAATCTGTATTACTGGTTTTGATCGACATCTTTATATCCTCAGCGGCTCATCAACACTTCAGCAATATGCATCACTTTGACTTTCTGCCCTTCCCGTTGTAATCGCCCACTGATGTTTAGCAGGCAACTCACGTCAGCACCAATTAAATACTCAGGGCGGACTTCCATCAGGTGCGCAACCTTTTCTTTCACCATCTCGCCGGATATTTCGGCCATTTTGACCGAGAACGTGCCGCCAAATCCGCAGCAGGTATCCTGTTCAGCAAAGGTAAACAGCTCCAGTCCACGCACATTTTTCAGCAGCGTAAGTGGCTCGTCCTTCACTCCCAGCTTACGGGCCAGGCTACAAGATGGGTGATACACCGCTCTCCCTTGCAAACTGGCACCTACATCGACTACCCCTAATTTATTAACAATAAAAGAGGTGAGATCCTGCATACGCGCGGCAACCTTTGCGGCACGTGATGCCCATTCAGGTTCATCCGCCAGATACATCGGGTAACTTTTTACGGCATAGGTGCAGGAGCCAGCCGGTGAAATAATGGGATCATCGTTATCCTCCAGCGCGGCGATCAGATTTTTCATCCCTGGAATCGCTTCTTTGATATAACCGCTATTGATCGCAGGCTGACCGCAGCATCCCTGTTTCTCCGGGAAATTTACGCGACAGCCGAGTTTTTCCAGTAGCAGCACGGAGTCTCGTGCCATTCTTGATTTCAGGGCGTCACCAATACAGGTGACAAAGAAATTGACATTCAAAACTAATACTCCATTACTTCATGCCCATTTATGCCCGATGTTTCCCCCCCACCGGGGGATCAATCATCAATAAATATGGCGTTCTTTTTTAATTGGCTAGCAACATACCACCCCTCATTTGTATGACAAATGATAATCTTTCATCCAAACGTCAAATGCAATAACTAATTTTATTTCTTTCGGCATCGCTCACATTTTATTGGGAAATCGACGATCATTTTTGGCATAAGAAATCCATTTTAAAATGGCTACATCCCTTATACCTATTCAGAGCTACTTATTCAGGGGTTATTGAGATATACACTGATGGTGTACAAAAAAATCACCGACAATAATCACGTATCAAGTAATAGTCGGGATTTTATTTAATTGAAGCAATTTAGTTTGCATTTTTATTTAAAAACTGAGCTAGAGTCCATCAGAAAAAAATATATAAAGCCCAAACCACCGTTGGAAATAGAAATAGTTGAAAGGCTCACAAATCCCGGAATGCGGATTATCTTTGCCCTGATTTCTGGCGGTTTTATTCAGGGTGCAAGCTGTCTGACCCTTTCCCGATATTCTCCCGGAGTACAACCAAACTCGCGGACAAACGCCTTGTGAAAAGATGATTCACTGGCATAGCCTACTGACTCAGCGATCACCACAACAGGGAGCATTTCCCGGGAAAACATCTGGGCCGCTATTTGTAGACGCAACTTTGTTAATACAGCCAGCGGCGTGGTTCCGGAAACATCACGGAAAAGCTGGGCAAAACTTGCCCGGGACATGTGGGCGATGCTGGCCAGCGATTCGACGGTCCAGGCGTGTCCTGGCATTTCCAACATTTGCTGTATTACCGCACCAAGACGTGGATGCAGAAGCAAACTGAGAATGTTTTTCTCAGTATTAACCTGTGCAATCCACTCACGCACCGCGAGGGTAAAGAATGTAGCGCAGATCTGGCTACACAGTGCATCCACTCCCGGCATTGCCAATCTGGATTCCTGTTGTAAAAACGGGATCGCCTCCCTCAGCCAGTTATATTCTACGCTGTGGTTAACCGGTGCTAAAAACAGCGTTTCCGGCAAAGACGTTAAAAAATAACGCGCCGAATGTTGCAGCCGAAGAGTGCCGCAGACAATACAGGTTGATTCATTATCGACATGACTCAGGCGATGGGCTGAATTTTGTGGTAGCAGGACCACATTTCCCGGGCGTAATGTAAAAATCTCCCCCGTCGGCATTTCCAGCTTCGCCGCTCCTTGCGTTAACGCATGCCAACGAATAACCGATAATTCCCCGGCACCATGTGGAAGCTGCCAGTCACTTCCTAACACGCAATTCTTATCGATCGTTCCTTGTGGAGCGTTAAGCATCAACAAACGGCTAAGGGCATCCATATCAGACTCCTGAGTGGTTTTTGATTACAAACACCAACAATCTTAGTAAATAACTAATCTAATACCATCAAATAGTTAATCATGATTTTTGTTTTGTGATTCTGATTAAGAAAAAGAAAATACCAGACGATAAGACAAAATATCAAGCGTTGTGAAGAAATGTTATTTGCTCTTTTGCGTCTATAGTCATGATGTCAAATGAACGCGTTTCGACAGGAAATCATCATGAATAAATATCAGGCAGTGATTATTGGTTTTGGCAAGGCTGGAAAAACATTAGCCGTCACGCTGGCAAAAGCAGGTTGGCGAGTGGCTCTCATCGAACAATCAAATGCAATGTATGGCGGGACCTGTATTAATATCGGCTGCATCCCAACCAAAACATTGGTTCATGACGCACAGCAGCACACAGATTTTGTCCGTGCCATACAGCGTAAAAATGAAGTGGTTAATTTTTTACGTAATAAGAATTTTCATAATCTTGCGGATATGCCCAATATCGACGTGATCGACGGCCAGGCGGAGTTTATCAATAATCATAGCCTGCGTGTTCATCGGCCTGAGGGAAATCTGGAAATTCATGGCGAGAAAATTTTTATTAATACCGGTGCACAAACCGTGGTTCCGCCAATTCCTGGAATTACCACCACGCCAGGAGTATATGACAGCACCGGATTACTTAATCTAAAAGAATTGCCTGGGCATTTAGGTATTTTGGGCGGCGGATATATTGGCGTTGAGTTCGCCTCTATGTTCGCTAATTTTGGCAGCAAGGTAACCATTTTAGAAGCAGCTTCACTGTTTTTGCCTCGGGAAGAACGGGATATTGCTGATAATATCGCGACGATTTTACGCGATCAGGGCGTCGATATTATCCTCAATGCCCATGTGGAGCGAATCAGTCACCATGAAAATCAAGTGCAAGTGCATAGCGAGCACGCCCAACTGGCGGTGGATGCACTGTTAATAGCTTCCGGTCGTCAACCGGCTACCGCTTCGTTACATCCAGAAAATGCCGGTATCGCAGTAAACGAGCGCGGGGCAATTGTCGTTGACAAGCGATTACATACCACCGCAGACAATATTTGGGCGATGGGAGATGTTACCGGCGGGCTGCAATTTACTTACATATCACTGGATGATTACCGCATTGTACGTGATGAGTTACTGGGTGAAGGCAAACGTAGTACTGATGATCGGAAAAATGTGCCTTATTCCGTATTTATGACACCGCCCCTGTCCAGGGTTGGTATGACAGAAGAACAAGCCAGAGAGAGTGGTGCTGATATTCAGGTGGTGACATTGCCTGTAGCTGCAATTCCGCGCGCCAGAGTGATGAACGATACCCGTGGGGTATTAAAAGCGATTGTTGATAATAAAACCCAACGTATATTAGGGGCATCACTGCTGTGTGTTGACTCCCACGAGATGATTAATATAGTGAAAATGGTGATGGATGCCGGGCTGCCTTATAGCATATTACGCGATCAGATATTTACTCATCCGTCGATGAGCGAATCACTCAATGATCTATTTTCATTAGTCAAATAAACTCAAAATCAGACGCCAGAACAAATATTCTGGCGTCTCAGAGAAAAGAATCTTATTAATTCCCTGTTACTCTATAACTTCTTAATCACTTCATTGATGGTATTTTATATGTTTAAAAAATCCGTTTTATTTGCAACACTATTATCTGGCGTTATGGCATTTTCCACCAATGCAGATGATAAAACAATTCTGAAACATATCAGCGTCTCGTCAGTATCAGCATCACCGACAGTTCTGGAGGATGCCATTGCTGATATAGCCAGAAAATATAATGCTTCATCCTGGAAAGTCACATCGATGCGAATTGATAATAATTCAACCGCAACAGCAGTATTGTATAAATAAGGATGTTCACAATGGAAAAATACCTGCACCTGTTAAGTCGGGGAGATAAAATTGGCCTGACATTGATTCGTCTGAGTATTGCAATTGTTTTTATGTGGATTGGGTTATTAAAGTTTGTCCCTTACGAGGCAGACAGCATTACACCATTCGTCGCAAACAGTCCACTAATGTCGTTCTTTTATGAACACCCGGAAGACTATAAACAGTATCTGACTCACGAAGGCGAATACAAACCAGAAGCAAGGGCATGGCAATCGGCCAATAATACCTATGGTTTTTCCAACGGTCTTGGCGTCGTGGAGGTGATTATTGCTCTGCTGGTTTTGGCTAATCCTGTCAATCGCTGGTTAGGTTTATTGGGAGGGCTGATGGCATTTACCACACCGTTGGTAACACTCTCATTTTTAATCACCACCCCGGAGGCATGGGTACCCGCATTGGGTGACGCTCATCATGGTTTCCCTTATTTATCCGGTGCTGGTCGCCTGGTATTGAAAGATACTCTGATGCTGGCAGGTGCAGTCATGATAATGGCAGATTCGGCGCGGGAAATTCTTAAACAACGCAGTAATGAATCCAGTTCAACGTTAAAAACTGAATATTGATAATCCGAACACTGTCTGTTGTTCACTTTTTCAGGCGTCAGGCCCCCTTTCTGATGATGAAGCCTGACGCCATATTACCGGAGATATGACAGAGTGAAGATCAATATCACTCCCCTGTTAAAAGGTTTTTGTCGTCTTAGGTAAAATACATGATTAATATCAATCATGTATTTTTACAGCAAGACAACCACGAACAAATTCAGGATTAAAATGTTTTACCGACTCCCCAACATAACCCAAATCAAGCGAACTAATTTGTGCCATTTCTTTATCTGTCAGTGAGAAATCGATTTTTGATGGGCATCTGCAATACTCTGAAGCAGACTATACGGAAAATTCCACTGTCAGTCCCTCCATTAGGCATGAACAATGAGTCTACGTTAAAACGTAACCTCAAAGTAGTATGTGGATTTTGATATCACTTATGCAAAAAATTCATTAATAATGTAGGACTGAAACCTCTCTATTTTCGGGGACAACGAAGCAGACGCTACCAGTGCTTTTGCCTTCGCCCTTGCTATTTTTGATACACTTAGGGCCCAGGGTATAACGAAAATGTGCGATATGACAAAGAATTAACGGAGAATGAGATGATCAGGCAGAAGATTCTACAGCAGCTCCTGGAGTGGATTGAGTGCAATCTTGAGCACCCTATTTCAATCGAAGATATCGCACAGAAATCTGGCTACAGCAGACGCAACATCCAGCTTCTGTTCCGAAATTTCATGCATGTGCCTTTGGGAGAATACATTCGCAAACGAAGGCTTTGTCGTGCCGCCATTCTTGTCCGGCTCACCGCGAAATCTATGCTTGATATTGCACTCTCTTTGCATTTTGATTCACAGCAGTCATTCAGCCGCGAATTTAAAAAGTTATTCGGCTGCTCTCCCCGTGAATACCGCCACCGTGATTATTGGGATCTCGCAAATATCTTCCCTTCTTTTTTAATACGTCAACAGCAAAAGACGGAGTGTAGATTAATCAACTTTCCTGAGACACCTATTTTTGGCAACTCATTTAAATATGACATTGAAGTGTCGAATAAATCACCGGATGAAGAAGTCAAACTACGACGTCATCATTTAGCCAGATGTATGAAGAATTTTAAGACGGATATCTATTTCGTTTCCACGTTTGAACCGTCAACAAAATCGGTCGATTTGCTCACGGTTGAAACTTTTGCTGGTACGGTATGTGAATATGCTGACATGCCAAAAGAGTGGACAACGACCCGAGGACTTTATGCCTCTTTCCGTTATGAAGGAAACTGGGAAAATTATCCTGACTGGGTGCGTAACATCTATCTGATAGAGTTACCTGCCAGGGGGTTAGCCAGAGTGAACGGCAGCGATATTGAGCGCTTTTATTACAATGAAGATTTCGTAGAAAAGGATGGCAATGATGTTGTTTGCGAAATTTTTATTCCCGTTCGTCCGGTTTAGTTGGTCACTATCTCTTATTGAGTTTTATCCTTGCCTGATACTATTTAATCAGTATCAGGCAAGGTATTCAGTGAATAATGGCGTTGAATATTTCAACACCATTATTCTTTATTAGATCGTAACTTTCATACTATTCAAATTATGGCATCTCCTCCTCGCCATTTTCAGTCTCTGCAGCAACGGCGTTAATTGTCACCTTCACAGGCGTCATTTCGTACCCGCCATACGTCAGTGCGTTGAATGTGAACGTATAGACACCTGGCTTGCTGGTACTAAATGTACGCGTGATTTTCCCACCTGAGAAATGGTCAGCCTTCGACGGCAGGAATTGATAATCCTTCTCCGTCACTCCTTCCGGAGCCTCAATGTCGACCCACATACTTCCTCCTACGGGATTGACTCCCGCCATCAGGGTAACCGTCATTGTTGCAGTAGTCTTCCCGTCCGCCACCAGAGAATCTACACGTTCTGGAGATGCGGCGAGCACCGCGTTTTTATCATCCGCGACGAATTTAGATTCAGCAGTTTTCGTCGATACCTGACCGTTGGCCTGTTCCACTTTCGCCGTGAGTGTATATGTCGCTGCGATAGTGGTAGAGCCGGTGAAAACAGCCTGTCCTTGCTCATTAGTTTTCGCCGTCCCTTTAGGATCCAGAACTAAATTTTCCGAGCTGGCAGTCAGCGTCACGTCGCTATCCTTCAACAGGTTGTTATTGGCATCAGTCACCGTAACTTTATACGTCTGTTTGGTTTTGCCATCCGCCACTGCACGGTTGCCAATCACCTCAATACTGGAGATGGTCGCCGTAGTCTTATCGGCCACAAACGTTACAGACTGCGCATTCGAAGGCTGGTCATTTCCTGCCGACGCCGTGATCTCATACGTTCCGGCAGTCACGGAGACCACTTTCAGTTCCGCTTTACCCTCCTTGTCGGCGTTCACCATGATATTACCGTCTGCAAGCGGTTTGACGCCCCGAGGCAGATTAAAGACGACCACAGCGCCAGGCAGAAGGTTGCCGTATTGATCCTTAACCGTTGCCGTCAGCGTAAAGGCATCATTGTCGTTTGCCACTTTTGGCGTGCTGCCATCAACCTCCAGGGTCGCCTGACCGGTACTGAAATCCGCCTTGAATTTCACCGTAACCGTCTTCTGAGCATTATTCACTGAGGCCGTGATGCTGTGTTCCCCCGCAACCGTTGACATAAGCTCAATGTCCACTTTCCCTGCCGCATTGGTTGTTACTGTATTCCCCGTCTTGCTGGTCACACCCTGCGGTAAAGTCAGCGTAACTTCTTGCCCCTGCAACGGGTTACCATAGCTGTCCACGACGGTCAGGGTTATCTGGTTAGCAGACTGTCCATTAGCCAGTTGGTTATTAACCTTCACTGTCATATCACGAATCTCAGCCTTAGATGCGTCACCTGCAACGTTCAGCACCAGTGGCTGAGCAACGGCATTTTGGCCGTTCACTCGCGGCATCACAGACAACTGACCCGCGGCAGATCCCAGCGTTAAGGTCGCCACGTAGACCCCATTACCTTTCTCTGTCCAGTTTCCTGCTGAAGGACGCTCACTCCCCGTGCTGGCGGCACCGCTAAACACTGGTGCATCTGGCTTCAGCCCGGTGACCGGGTTCCCGTACGCATCCTTCACGGTGAAGGTGAGTTCCGTCGTCATTTTGACGGTCGGAGCCTTATTGTCCGCAACAAGCGTAGAGTTCGCTGATGACATCTCTCCGGCGATGACCGTCAACTGCGCGGCTTCGGTGGCTGCTGGCTGGCCGTTGAAGAGCGGCATGACACGAAGCTCGCCCGTCTTTCCGCCTGTAGTTAACGTAGCAACATAGGAACCGTCACCTTTTTCGGTCCAACTGGAAACGGTCGCCCCTTCAGAGGCGGTCCCCGTCAAACTTGCCGACAACGAAAGACCACTGATAGTGTTGCCATGTGCATCTTTTGCAATAAGCGTCACGGTTGTGCTTTCGCCGGCTTTTACGTGATCTTCTGCGACAGAGACTTTCGACTGGTTTGAAGATAACGCATCAGCCACCACGGTTACTTTTGCCGCATTTGCTGCCGCATCCTGTCCATTTAGCTTCGGCATAACTTCTAATTCACCCGCCGTAGAGCCGAGAGTAATCTGCGCAGTCCACGTCCCATCATCATTATTTGTCCAGCCAGATGCCGTAGAACCTACAGCAGCGGCACCCGCTAATGACGGCGCTTCCGGCGTGAGGCTGGTCACAGGGTTGTCATAGGCATCTTTTGCCGTCCAGATTGCCGTAACTGTACCGCCAACCACCGGTTTATCAGGATTCAGGGTGATGGACGAATGTGCTGCATCAAGCGGACCGGCAACAAACTTCAGCGTTTGTTGTAATGTGCCCAGCTGATACTGCTCGCTGAAAGCACGCACAACAACGTTTCCGGCACGGGTCGAAGAAACCGTGGCGCTGTAAACGCCTGGTTTTATTTCCGAAATGGCACCAACGGTTACACCATTAGTGTCTTGCGGAACAAATCGCAAGCGGCTGGCTTCTCCCGTCACCGGATTACCCTCGGAGTCCACCGCAGTCAACGTCAACGTATAGGCTTGCTGACCATCAGCAACCACATCACCTGACGGCTCGTTAGCGCTCAGGGTGGAGTTTGCCACATCCATCATCGTGGCCCGCAGTTCTGCAGTGACGGTTTTGCTCATGCCATCAACGCTAACAGTAATCGTTGCCTCACCTGACTGCGTTCCGGTAGTAAAGACAGACTGATACACCCCTGCTTCAGTTTCGGTGAACTCACCCAGTGTTGGCTTTGCCTGTGATTTAGTGGCCTTCAGGGAACGAGTCACAATATTTCCAGCCGGTTTGAAGGCTAGTTCAGTCTTGATCTGATCTTTCATGCCCGTGACTGGCTGCCCCTCGGCGTCGCGCAGAGACAGCACCAGCGGTCTTTGCTCATTACCGTTAGCAAGCATTTGAATACGGCTCTGACCGTCAAGCGTTAACGCCGTGCGATCGGCGCTCATACCTGCTCCGGTAATGACCACCTCTGTCTGCACGCGTTTTGAGGCATTGCCTTTGTTATCGTAGGCAACCGCAGAAATCGCATAATAATTGTCTTTGCCTGGACGATAAGCCGGGAGCGTTACTTGCCACTGACTACCCTGACCGGTAATTTTGCCACCCTCAGCCAGTAATGACGGCGCTTCCCACTGCACATTTTTCAGTCCGTGAGTTGCTTTGCTGACCACAAGCCCCAGGGAAAGTGTCTGACCACCCTTACCTTCAATACGTTCAGGCAGAGCAATACGGATCACTTCAGATTTGCGGTACTCAAGAACGATGTTGTTATTACGCTCAACCAGGTCATAGCGGCTGCCTGCCAGTACCCGACGCTCGCGAATGCTGTCTGTATCGAGTTGTTTCGCCAGAGGTTCGCCAATCCGATAATTAACTTCCAGGCCAAAGCGAGTGTCATTCTCACCACTCTTGCCCTGCTTATGCCCGGCGCTCAGTGTCAGAAGAGGCACTGGCGTATAGGTCACCTCGGCAGAAATAGCATGCGGGTCTTTCTGGCGCTTATCTTTACCAAACAGCCCGACTTCATCGCCATAATACTGTTCATACATCAGGCTTGCGCCAAGCTGCGGCCAGGCGGGTAAATAGCCCTCAGCACGAATATCCCAGCCATTCGCCGGGCGTTCCTGATAATCCTCAACATCCGGCGATTTTTTCCAGCCAGAAGCCCGGATATAACCATTGGCGCTCAGTTTCAGATAATCGCGCCAGTATTCCGCACCAACACCAATGCGGGTATGACTACGGGATAAATCATGGTCGATAAAGGTGTTCACCCCCGCCATCCAGTCATTTCCTGAAAAATGACGCCAGCCAAAACCAATATTTGACTGAGTACGATCGTCGGTACGATGTATTGCTCCCTGAGTGAACAACATATTTGTTGGCGTATCATAAATCGGATAAAGCATTTCCAGCGAAGAATCCTTCAGCGAGAAATCTTTATCGACATTCAGTTTGACGCGCGCAGTACCATATTTCCCGAGCCACTCCTGTATTTCCTGGTTAGCTTTAGCTGTGGCCATTCCGGTAATAAAGTTACGTGTCGCATCGCTATCTGGCTGACTGCTTAAAAATGTCCCGGCATTTGCGGCAAACGACGCGACATTTTTCTCCACGTTATTATCAGCAGTTACCGTAGTATTTCCCATGCTCAACCGTGGCTGAACCGCATGCTGCGCACGTGCCGCCATTACTGGGGTAAAGGTGACAGCGAGTGGAAAAAGAACCTGAACAGAGATATTTGCCCACGCCACGCAGCGGGCCAGAACTGAATAACGAAATCGTGGTTGTTTATGACCTGTTTTATAATGTGACATTGAGATAATTTCCTGATGAACGAATTAACGAACATAGGCAGACACAAATAATGGCGTCTGCTTGTCAGAAAAACCGGGGATTTCCCGATTTCCTTTTCCGCAGAAAAGGAAATACCTTTATGTAAGAAAGTGCACCGCCACCTAATATATCCAGTGCCTGTAGCCACTCATCAGGTGGTTGGGCGCGGGATTATATTTTTATTTATGGGGGAAAATTTAGCTTTTTGTGCAAGTCATTAAATAGATACCTAAGTTATATTTCTTACTCTGCGGACAGTGAATAATTCACTATTCAGAGTGAATAACAATGACCTGCCTTTAATTAAAACTATTGGAAATACGTCGAAATAGGTATATATGCCAGGATAAACCAATATATTTTTTATGGATATTCATTTTCCTTCATCGGACTGTAAATTGCGGAGATATCGAACTTGCACTACCCATCGCAACACTCAAACAGATTTATGAATTTCGTTCATAACCCCTGACAAATTCTGCCGTCTAATCGAGCTTTTCCCTTTACGTTATGCTTCTCATGAACGATAACACAACTTGTTCATGAATTAACCATTCCGGATAAACTATGGGATCAAATTCAGGCATCGAAAGGAACTGGCTGGTTTCCGTTAATGGATTAAATATCTATTATTCTCCCGAGAGTAAAACTATCAAATCAATGAGTAACCGATACAGCCTCACAAATTCTTTCAGATAATTAACCCGTCACTGCGGCACGTTCTCTCATAAGGCGTGCCTGCAGTGTAATCCAGTATGTATCCGGCATAGTTGACGGTTTAATAGAACTCAATTCCTCTCCCAGGTCACCAACAACGATATCTTGTATAACACCCCAGGAGATACGACATTATGATGAGATGCGATATGCTCCTCAGCACCAATCAACGCCCGGACTGGGCCAGTCTTGAGATCAGACTCATTCCCGGAACTGACTTCAGGCGACATATATCAATGACGTTGGTGCAATAAAAGATAGTTGATGTAATACATCTATTCAAATTTGTGATACATGTCAAAAAATTGGTTGACCAAACTCGTTATTTATATAAGGGCACTTACGAAGTGCACTCTTTTTTAAAGCGAGGAAGTACCAATGAAAGAGAATAAAGTACAGCAAATCAGTCATAAACTGATTAATATCGTTGTTTTTGTCGCAATTGTAGAATACGCCTATTTATTTCTCCATTTCTATTAATAACGGAAATAAACTGTTCACTTCAGTGATATTTAAAATATGCATCCTCTCCCTTTTTTGTAAGTAATTATTATATCCGTGGGAGAGGAATACACGCTGTCAGGTAATCAATCATACTGCGATAAATCATCGGCCAGTAAAGTGGAGATAACCTCCATTCTCGAAAAATCCATACTCTCAGCAAAACCATCATCAATCACTCATCCAGGCGTTTATGGGAGCGTCACCAATGGCTGCTAACAATGCCAGAGTTCCCCGTTGCGAAAATTCCACATCCACAAAGAGTCACAGGGATTGAGTGTTGAAATGATCCGGATGAGCATGTATCTTTATGGTTATGTTATAACATAACAGGTAAAAATGATGAAGCCCAATATCCATCCTGAGTATCGTACTGTGGTGTTCCACGACACCAGTGTTGATGAGTACTTTAAAATCGGCTCGACTATCAAAACAGACCGTGAGATTGAGCTGGATGGCGTAACGTATCCATACGTGACAATTGATGTCTCTTCTAAATCGCACCCGTTCTATACAGGGAAGCTGAGAACAGTGGCATCAGAAGGAAATGTTGCACGATTCACCCAACGTTTTGGTCGTTTTGTTAGCACGAAAAAGGGGGCGTGATGAAAGTTCTTAACTCTCTGCGTACCGCAAAAGAACGCCATCCAGACTGTCAGATTGTGAAGCGAAAAGGACGGCTATATGTGATTTGTAAATCTAATCCACGTTTTAAGGCCGTTCAGGGTCGTAAGAAAAAACGTTGATTCAAAATTCGACGGATTAACGATATTTGTCTGATTAATAATCAGATCGGATTAATGTTGGTGTGTTTATAACACCAACATTAATTTTCCTGGGGATATATTCTTCCTGTTCATTTGAGGCCAACTGCCTGACGTTTCTCTCCGAATATTCCATTATCTTAATGTTGACTTGTTGACCAGCTTCGCCCCTGTATGCTGGCATCAACCCTCTTTTAGACTGAACACGCCACTCAGTCTCCTCCCTTTGCGGCGCAGCCTGCATTTTCACTCAAACTGTTAAGATGATAAATGTGGTAAATCTGTTGGTACTAACATAAAAACGTTTACGCCACAGGAACAGTCTGATCCACCGGTAACCCCGTCGCCGACGTTCGAGTGCCAGTTAGAGTAACGCGCACAGATAACTGAATGCAGTGCCCTGACAAAAAGGCCATCGTTCCTGTGACAGCTGGCAGCCTTCGTTTAACTTCACTTAATCTGGCTCTTGGGGGCTTACCGAACAGATGACGTACATACGCCCGTTCAATTTTCCATTACTTATTGGAATGAACACCTGTAACCATTTTGTGCGGCATGTTAATCCATTAAAATATCTTACTGATTGGCAAATCATCTTCAATGACAGCTCATCATAGTTTTATATTCTATCCCTTACCCTTAAAACTTGTTTTTTTACTAGTCCATCACACAGCGCATTAAGACTATTCCTAACACTTCAGGGCAAAGTTCCTGACCAATATAAAATGCAAGTAAGAATTGAACGTTATATTGCCAATAACCTTATGAAACCAAATGTCTTTTTCTTCTTATCAAAAAAGCAATATTTTCAGTTTTTCTAAATATTGACTTAACCATTGAATTCCTTTTCCGTTCACATATTGACACTCATCGGGAAAAAAACATAAATTTAAGCCCAATCGAAAATAATTAAACTTAATCTCGTTTAACCTTTATTGATATGTACTACGTATCTTATTTACTTCCGGTTTACTAAGGAAACTGAATGCACCTGTAAAAATTACAGGTTTGGAAAGTAGTGACATGGCAAAGTGATTACAGTAGGGACTATGAGGTTAAAAACCATATGGAATGTCAAAACCGTTCTGATAAATACATCTGGTCTCCCCATGACGCCTACTTCTATAAAGGACTATCTGAACTGATTGTGGATATCGACAGATTAATTTATCTATCGTTGGAGAAAATTAGAAAAGATTTCGTGTTTATCAATCTCAGTACGGATTCTTTATCTGAATTTATAAACCGTGATAATGAATGGTTATCCGCGGTAAAGGGGAAACAGGTCGTATTGATTGCGGCCAGAAAGTCAGAAGCCTTAGCAAATTATTGGTATTACAATAGCAATATTAGGGGCGTGGTATACGCTGGACTGAGTCGTGATATTAGAAAAGAACTGGCCTATGTGATTAATGGCAGGTTCCTGAGAAAAGATATTAAGAAAGATAAAATCACGGACCGGGAAATGGAAATTATCCGCATGACGGCCCAGGGAATGCAACCTAAATCGATTGCCAGAATTGAAAATTGTAGTGTGAAGACAGTGTATACCCATCGGCGTAATGCTGAGGCCAAGCTGTACTCAAAAATATATAAGTTGGTTCAGTAAACTCCAGGCAAGTTAGTTTTAAAAAATGACTCACTGGGACATCACGTCCTCAATTCAACTCGGGAAGAAATGCAATGAAAAAAAAGGTTCTGGCAATAGCTCTGGTAACGGTGTTTACCGGCATGGGTGTGGCGCAGGCTGCTGACGTAACAGCTCAGGCTGTAGCGACCTGGTCGGCAACAGCCAAAAAAGACACCACCAGTAAGCTGGTTGTGACGCCACTCGGTAGCCTGGCGTTCCAGTATGCCGAAGGCATTAAAGGTTTTAACTCACAGAAAGGTCTATTTGACGTGGCTATCGAGGGTGACTCAACGGCTACCGCCTTTAAACTGACCTCACGTCTTATCACCAACACATTAACCCAGTTGGATACCTCAGGTTCCACACTGAATGTGGGCGTGGATTATAACGGCGCGGCAGTCGAAAAAACTGGCGATACCGTGATGATCGATACCGCCAACGGCGTACTGGGCGGCAACCTTAGCCCGCTGGCTAACGGTTACAATGCCAGCAATCGTACCACCGCACAGGATGGTTTCACCTTCACCATCATCAGCGGCACCACCAATGGTACCACCGCAGTAACCGATTACAGCACTCTACCGGAAGGCATCTGGAGCGGCGACGTTAGCGTACAGTTCGACGCGACCTGGACCAGTTAATCTCTCTGATGTACCAGCAGGGGTAGCCCCCCTGCTTTATTCCTGGACGGAATTGTTATGAAAAAGCACCTTCTGCCTCTCGCTCTGCTGTTTTCCGGAATATCTCCGGCCCAGGCGCTGGATGTCGGCGATATATCATCGTTTATGAACAGTGACAGCAGCACGCTGAGCAAAACGATCAAAAACAGTACCGACAGTGGTCGCCTTATCAATATCCGTCTCGAACGGCTCTCTTCACCGCTTGACGACGGGCAGGTTATCTCAATGGACAAGCCGGATGAGTTGCTACTCACTCCCGCCAGCTTGCTGCTACCCGCCCAAGCCAGCGAAGTGATCCGCTTCTTCTATAAGGGACCCGCAGATGAAAAAGAGCGCTACTACCGCATTGTCTGGTTTGATCAGGCCCTCAGTGATGCACAGCGCGATAATGCCAACCGCAGCGCTGTGGCCACTGCTTCCGCCCGCATCGGCACCATTCTGGTCGTCGCACCCCGCCAGGCAAACTACCACTTTCAGTACGCCAACGGCTCCCTGACAAATACAGGAAATGCGACGCTGCGGATCCTCGCCTACGGCCCCTGCCTGAAAGCCGCCAATGGTAAAGAGTGTAAAGAGAATTACTACCTAATGCCGGGCAAGTCGCGTCGTTTTACCCGCGTGGACACGGCGGATAACAAAGGACGGGTTGCACTTTGGCAGGGTGATAAGTTCATTCCCGTGAAATAGATAGCTGTGCAGATGGATAACGACAATGCCTTTACGACGGTTCTCCCCAGGACTGAAAGCCCAGTTTGCCTTCGGCATGGTCTTTTTGTTCGTTCAGCCCGATGCCAGCGCTGCTGACATAAGTGCGCAGCAAATAGGTGGGGTGATTATTCCGCAGGCCTTCAGTCAGGCGCTTCAGGACGGCATGAGCGTCCCGCTCTATATTCATCTCGCCGGTAGCCAGGGTCGCCAGGACGATCAGCGAATCGGCAGCGCTTTTATCTGGCTGGACGATGGACAGCTACGCATCCGGAAAATACAGCTGGAAGAGAGTGAAGATAACGCCAGTGTCAGCGAACAAACTCGACAGCAGCTGATGGCTCTGGCGAACGCCCCGTTCAATGAGGCCCTTACCATCCCCCTGACTGACAACGCGCAGCTGGATCTCAGCTTGCGCCAACTGCTGCTGCAGCTGGTGGTCAAGCGCGAAGCGCTGGGCACCGTACTACGCTCACGTAGCGAAGACATCGGGCAGTCCAGTGTTAACACCCTCAGCAGTAATCTGAGCTATAACTTGGGCGTCTATAACAACCAGTTGCGTAACGGCGGGAGCAACACATCCAGCTATCTGTCGCTGAATAACGTTACTGCGCTGCGCGAACATCATGTGGTGCTCGACGGCTCGCTGTACGGGATCGGTAGCGGTCAACAGGACAGTGAATTATATAAAGCGATGTATGAACGCGATTTTGCCGGTCACCGATTTGCCGGTGGAATGCTCGACACCTGGAACTTGCAGTCCTTAGGGCCGATGACCGCCATTTCAGCAGGGAAGATTTACGGCCTTTCCTGGGGAAACCAGGCCAGCTCCACCATCTTCGACAGCAGCCAGTCAGCCACGCCAGTGATCGCCTTTTTACCGGCGGCGGGCGAAGTACATCTCACCCGTGATGGGCGGCTACTAAGCGTTCAGAACTTCACTATGGGCAATCATGAAGTGGATACCCGGGGTCTACCGTACGGGATTTACGATGTGGAAGTTGAGGTGATCGTTAACGGTCGCGTGATCAGCAAACGCACCCAGCGGGTCAATAAGCTGTTTAGCCGGGGGCGCGGCGTCGGTGCACCACTGGCGTGGCAGGTATGGGGCGGTAGCTTTCATATGGATCGCTGGTCGGAAAACGGGAAAAAGACGCGACCAGCTAAAGAGAGTTGGCTAGCAGGTGCCTCGACCTCCGGCTCATTGAGTACGCTTAGCTGGGCGGCAACGGGATATGGATACGATAATCAGGCGGTGGGTGAAACCCGTCTGACGCTGCCGCTTGGGGGGGCGATCAACGTTAACCTGCAAAATATGCTGGCCAGTGACAGCTCATGGAGCAGCATCGGCAGCATCAGCGCCACTCTACCGGGAGGCTTTAGTTCGCTGTGGGTTAATCAGGAAAAAACCCGCATTGGCAATCAATTGCGACGTAGCGATGCCGATAACCGTGCAATCGGCGGCACACTCAACCTGAACTCACTGTGGTCGAAGCTGGGCACATTCAGCATCAGCTACAATGATGACCGCCGTTACAACAGCCATTATTACACGGCAGATTACTATCAAAATGTCTACAGCGGTACCTTTGGTTCGCTTGGCCTGCGGGCCGGTATTCAGCGCTATAACAACGGCGACAGCAACGCCAATACAGGGAAATATATCGCTCTCGATCTCTCGCTACCACTGGGCAACTGGTTTAGCGCAGGGATGACCCATCAAAACGGCTACACCATGGCAAACCTGTCAGCACGCAAACAGTTTGATGAAGGAACCATTCGCACTGTTGGTGCCAATCTGTCACGTGCCATCTCCGGCGATACCGGTGATGACAAAACCCTCAGCGGTGGGGCGTATGCACAGTTCGACGCTCGTTACGCCAGCGGAACGCTGAACGTCAATAGCGCGGCGGACGGCTACATCAATACCAACTTGACCGCCAATGGCAGCGTCGGCTGGCAGGGTAAAAACATCGCTGCCAGCGGGCGGACTGATGGCAACGCTGGGGTGATATTCAACACCGGGCTGGAGGACGACGGTCAGATCAGCGCCAAAATCAACGGGCGGATTTTCCCGCTTAACGGCAAGCGTAACTATCTCCCGCTCTCTCCCTATGGAAGATATGAGGTGGAGTTACAGAACAGCAAAAACTCACTCGACAGCTACGATATCGTCAGCGGCCGCAAAAGCCATCTGACTCTCTATCCAGGCAATGTCGCTGTCATTGAGCCAGAGGTGAAGCAGATGGTTACCGTCTCCGGTCGTATCCGTGCGGAAGACGGCACACTGCTGGCTAACGCACGGATTAACAACCATATCGGCCGAACCCGAACCGATGAAAACGGCGAGTTTGTCATGGACGTGGATAAGAAATATCCCACTATCGATTTTCGCTACAGTGGCAATAAAACCTGCGAAGTGGCACTGGAACTCAACCAGGCGCGCGGTGCCGTCTGGGTCGGTGATGTGGTCTGCAGCGGCCTCTCATCGTGGGCGGCGGTGACGCAGACAGGAGAAGAGAATGAGAGTTAACCTACTAATAACGATGATAATTTTCGCGCTAATCTGGCCAGTAACTGAGCTCAGAGCGGCAGTGAGCAAAACAACCTGGGCGGATGCACCGGCACGCGAGTTTGTGTTTGTCGAAAACAACTCAGACGACAACTTTTTCGTCACTCCTGGCGGGGCGCTGGATCCGCGCCTGACCGGTGCCAACCGCTGGACCGGTTTAAAATACACTGGTTCAGGAACCATCTATCAGCAAAGCCTCGGCTACATTGATAACGGTTACAACACCGGCCTTTATACCAACTGGAAGTTTGATATGTGGCTGGAAAATTCACCAGTTTCATCTCCTTTAACTGGCTTGCGCTGCATCAACTGGTACGCTGGGTGTAATATGACCACCAGTCTTATCCTGCCGCAAACCACCGACGCCAGTGGATTTTATGGCGCGACGGTGACCAGCGGCGGCGCGAAGTGGATGCACGGCATGTTGTCAGACGCGTTTTACCAGTATCTGCAACAAATGCCCGTCGGCAGCAGCTTTACAATGACCATCAATGCCTGCCAGACCTCTGTGAACTATGACGCCAGCAGCGGCGCACGCTGTAAGGATCAGGCCTCCGGCAACTGGTATGTTCGCAACGTCACCCATACGAAAGCAGCAAATCTGCGGTTAATAAATACCCACTCGCTGGCGGAGGTATTTATCAACAGCGACGGAGTACCGACTCTGGGCGAAGGGAACGCCGACTGCCGGACACAGACCATCGGCAGCCTTTCAGGATTAAGTTGTAAGATGGTCAACTATACCCTGCAAACAAACGGACTCAGCAACACCTCAATCCATATATTCCCGGCGATCGCCAACTCGTCGTTAGCCTCGGCCGTCGGGGCGTACGATATGCAGTTCAGTCTGAATGGCAGTTCATGGAAACCGGTGAGCAATACCGCCTATTACTACACCTTCAACGAGATGAAGAGCGCAGACTCGATCTATGTTTTCTTCTCGAGCAACTTCTTTAAGCAGATGGTGAACCTCGGGATCAGCGATATCAACACCAAAGATCTATTCAACTTTCGCTTTCAGAACACCACATCACCGGAGTCTGGCTGGTATGAATTTTCTACCTCCAACACGCTGATTATCAAACCCCGTGATTTCAGCATCAGTATTATCTCCGATGAATATACTCAGACACCGTCGCGGGAGGGATATGTTGGCAGCGGCGAGTCGGCACTCGATTTCGGCTATATCGTAACCACCAGCGGTAAAACAGCTGCCGACGAAGTGCTGATCAAGGTGACCGGACCCGCGCAGGTGATTGGCGGGCGCTCCTATTGTGTCTTCAGCTCCGATGACGGTAAGGCGAAAGTACCGTTCCCGGCGACACTTTCCTTTATTACCCGCAACGGAGCTACAAAAACCTACGATGCCGGGTGCGATGATAGCTGGCGGGATATGACCGATGCGCTGTGGTTGACCACACCGTGGACTGATATCTCTGGCGAAGTGGGGCAGATGGATAAGACCACAGTCAAATTTTCGATTCCAATGGATAACGCCATTTCGCTGCGTACGGTAGATGATAACGGCTGGTTTGGCGAAGTCAGCGCTTCAGGAGAGATTCATGTTCAGGCGACGTGGCGTAACATTAACTAAGGCCCTGCTGACAGCGGTCTGTATGCTGGCGGCACCTTTGACACAGGCGATTTCGGTCGGCAATCTGACATTTTCGCTGCCGTCCGAGACTGACTTTGTCAGCAAACGTGTAGTGAATAACAACAAAAGCGCACGGATATACCGTATTGCCATCAGTGCTATTGATAGCCCGGGCAGCAGTGAATTGCGCACCCGACCGGTGGATGGTGAACTGCTTTTCGCCCCCCGCCAACTGGCGTTGCAGGCTGGTGAGAGCGAGTATTTTAAATTTTACTATCATGGTCCACGGGATAACCGCGAGCGCTACTACCGGGTCTCATTTCGCGAAGTCCCCACTCGTAACCAGACAAGACGTAGCCCAACCGGCGGCGAGATCAGCACGGAGCCGGTGGTGGTGATGGATACCATTCTGGTAGTACGACCACGTCAGGTTCAGTTTAAATGGTCCTTCGACAAGGTGACAGGAACGGTAAGTAACACTGGCAACACATGGTTTAAGCTACTGATTAAGCCAGGATGTGATTCGACCGAAGAGGAAGGCGATGCCTGGTATCTACGTCCGGGAGACGTGGTTCATCAGCCTGAGTTACGTCAGCCGGGGAATCATTATCTGGTCTATAACGACAAATTCATTAAGATTAGCGATTCCTGTCCGGCTAAGCCCCCTTCGGCGGACTAAGACTTATCCATCGGTCGAGGGAAGAATTCCACCTCAGAGCTCCAAAAATCGTTTAAATGATGTGGAAGCGATCGTCAATGTCACCTGTAACGATGGGCTTTCTCATATGCACACCATAGTGTTCAGGAGACACGGAAAGGTATCAGGCAGCCTGCTAACGAGTGGTTTAATCACGAATTAGTACGTAAAATCGGTAACGGCTGGAAATCATTCAATACTCACACTATCGAAAGTTCCCCAGCCAACCGCGGTACGTTCTTACATACGATGTACCGCTGTTCTCTTTACGATTTTTAGCTGTACTGGTGAATTATGAGCAATCTGAATCCATGCATGACGTGTGGTGCCTGTTGTGCATTTTTCCGCGTCTCTTTTTACTGGGCCGAAGCCGACGATGCTGGCGGTACTATTCCCGCCAGGCTCACTGAACAAATATCCCCTTTTCACCGATGCATGAGCGGTACCAATCAGAAAAACCCCCGATGTATTGCCCTTGCAGGAACCCCGGGCAAAAATGCCTGCTGCACGATATATAAAAATCGATCGTCCACATGCAGAGAATTCGCCATGTCTGGTGAGAACGGAGTCGTCAATGAGGCTTGCAATCGTGCAAGGGCTAAATACGGGCTGACACCGTTATAAACATACAATAATTAATTGCACTGCCCCGCCAGCGATAATAGCGGGGCTTCGTTTTTCAGGGGTAACAAAACCCGATACTTCTTCTATTTGCCAGCAACAATGCCTCTCTTCTGTAGCGTTCTTCGGATCTACATCATCCTGAAGTAGCGCGATATACTCACTGACTATCTGCATCAGAATATAAAAAGCAATGTTTTTAACCTATAAAAATGGCGCTGTATTTGCGCCATTTTTATCATTCAATGCATTATCTGTTTGAGCCTAAAGGGATCTCAGGGTCTGGCTCATGAGTAATTCTGTTTCGTAAATCTCTGCGAATAATTTCAATAGACCAGAACCAGACTAAATGTCCAAAAATTTCAGAAACATTCTCATACCACGGGAGATCAAACAGAGGTGGCGTCAGTCCCATGAGAGGGAATGAAATCATATGAACAAAAAGTTGGGCTAAAGCACCTGCCAGTAAGCCCTGCCAGAGTTTAATTTTTGGAAATACTTCAGCGACCACACAATAACCGACAGCAAACACTATCGAAAAGATAATGTGCGTAACACCAACCCAGTTAAAGACATGCCCGGCAAAGGTATAAACAGCCGCATTGGGATCTGTCAGCCCCAACCAGTCTCGAAGAAAAATATACGGTGGATTGAGAAAATTACGCGAGCAATCAATTTGGCCTGCAGCCCTGATTAATGATTCCGGGCCACACGCTGCATTAAACATATCCACCGGGCTACGTGGCGGCAATGGAACTTCAGCCCCCCACTTCACGAATGCGGAAACAACGCCAGCAATCAGCCCAATGAATGCAGCAAGACCATAACGTCTGCGGTTCGGTGGAGTTTGTTCAAATATATTCATATCTACCCTGCTTGTACCATTATGTTATACACCTCTTCAGGAGTATTCATAAAACAAGGCAAATGTAAAGAACTGTATTGTTTTGTATAACAAGATGGTTTCCTAATCGCCAATGAATATAAGCTCCATCATTTCTCCCTATTTTTATATTAAAAGTGACAGAGATTTGCAGGGTGATGCAGAGCTGAAAATCACAGGTTTCCTTATTGGTTTTTGCATCGTACAACTAAAGCAATAAACCAGCTCCCATCATCATCAACATCCCGGCGAACATGAATTTATTAGCCGGGCTCATAGCTGCATTCGCGTTTTAAGAATAATCCTCCTGCTGTCGCCGACTATGCTTAACATTTAAAAAAGCATCAGCACTCTCGCAACGCACTCTTATTTTCCCCTTTAGAATACCGGAGGCCTGGTATGAGCAACCAAGGCGAATACCCCGAAGATAATCGGGTTGGGAAGCACGAGCCGCACGATTTAAGTTTGACCCGTCGCGATCTGATTAAAGTGAGCGCCGCAACAGCGGCGACCGCCGTGGTTTATCCTCATTCTACGCTGGCGGCAAGCGTTCCGGCAGCTACACCCGCGCCAGAGATAATGCCCCTGACACTGAAGGTGAACGGCAAAACCGAGCAGCTTGAGGTGGATACCCGAACCACGCTACTGGACGCTTTGCGTGAAAATCTGCATTTGATCGGTACCAAGAAAGGTTGCGATCACGGACAGTGCGGAGCCTGTACCGTGCTGGTCAATGGTCGCAGGCTTAATGCCTGCCTGACGCTTGCAGTCATGCATCAGGGGGCCGAGATCACCACCATTGAAGGCCTGGGCTCGCCAGATAATCTTCACCCCATGCAGGCGGCCTTTATCAAGCATGATGGCTTCCAGTGCGGCTACTGCACCTCCGGGCAAATTTGCTCATCAGTAGCGGTGCTAAAAGAGATTCAGGACGGCATTCCCAGTCACGTCACGGTCGATTTGGTTTCCGCTCCAGAAACAACTGCCGATGAGATCCGTGAACGTATGAGCGGCAACATCTGTCGCTGTGGTGCATACGCTAACATCCTTGCCGCCATTGAAGATGCTGCGGGGGAGATAAAATCATGAAGGCGTTTACCTATGAACGAGTGAACACCCCAGCAGAGGCGGCACTTAGCGCTCAGCGCGTACCCGGCGCAAAATTTATCGCGGGCGGGACCAATCTGCTGGACCTGATGAAGCTGGAAATTGAAACGCCCACCCACCTTATCGATGTGAACGGCCTCGGGCTCGATAAGATTGAAGTGACCGACGCGGGTGGGCTGCGCATCGGCGCACTGGTACGGAACACCGACCTGGCGGCTCACGAGCGCGTGCGTCGTGATTACGCGGTACTCTCCCGCGCCCTGCTCGCTGGCGCATCTGGTCAGTTACGTAATCAGGCAACCACCGCAGGTAATCTGCTCCAGCGCACGCGCTGCCCCTATTTTTACGACACCAATCAGCCCTGCAATAAGCGCCTGCCCGGGAGCGGCTGCGCGGCGCTTGAAGGCTTTAGCCGTCAGCACGCGGTGGTAGGCGTAAGCGAAGCCTGCATTGCCACCCATCCGAGCGATATGGCGGTCGCAATGCGGTTGCTGGATGCGGTGGTGGAAACCATCACGCCGGAGGGAAAGACTCGCAGTATCACACTGGCTGATTTTTATCACCCTCCGGGAAAAACGCCGCACATTGAAACCGCCCTGCTTCCCGGTGAGCTTATCGTTGCGGTGACGTTACCTCCGCCGCTCGGCGGAAAACATATCTACCGTAAGGTGCGCGATCGCGCCTCCTACGCCTTTGCCCAGGTATCGGTCGCGGCGATTATTCACCCTGACGGCAGCGGGCGCGTCGCGCTGGGCGGAGTAGCACATAAGCCCTGGCGCATTGAGGCTGCGGATGCTCAGCTATCCCAGGGGGCGCAGGCCGTATATGACACGCTGTTCGCCAGCGCCCATCCCACCGCTGAAAACACCTTTAAACTCCTGTTGGCGAAGCGAACGCTTGCCTCCGTACTGGCTGAAGCGAGGGCACAGGCATGAAATTTGATAAACCCGCAGGGGAAAACCCGATCGATCAGCTGAAGGTTGTCGGTCGTCCCCATGACCGCATCGACGGACCGCTGAAAACTACCGGCACGGCACGCTACGCCTACGAATGGCATGAAGAGGCCCCCAACGCCGCCTATGGCTATATCGTCGGTTCCGCCATTGCCAAAGGACGCCTCACCGCCCTTGATACGGACGCCGCGCAAAAAGCGCCGGGCGTACTGGCTGTCATTACCGCCAGTAACGCCGGGGCACTCGGCAAAGGCGACAAAAACACCGCCAGGCTGTTAGGCGGCCCCACTATTGAGCACTATCATCAGGCCATTGCGCTGGTAGTGGCCGAGACCTTCGAACAGGCGCGAGCGGCGGCCTCGCTGGTGCAGGCGCACTATCGCCGTAATAAAGGAGCTTACTCCCTGGCGGACGAAAAACAGGCCGTCAATCAGCCGCCGGAAGGCACGCCCGACAAAAACGTCGGTGACTTTGACGGGGCTTTCTCCTCCGCTGCGGTGAAGATTGATGCTACCTACACGACCCCGGACCAGAGCCATATGGCGATGGAGCCGCATGCCTCGATGGCCGTCTGGGATGGAAATAAGCTTACTCTCTGGACCTCAAATCAGATGATTGACTGGTGCCGCACCGATCTGGCAAAAACGCTGAAAGTGCCCGTGGAGAATGTGCGTATTATCTCCCCGTATATCGGCGGAGGGTTTGGCGGCAAGCTGTTCCTGAGAAGCGATGCGCTGCTGGCAGCCCTCGCCGCCCGAGCGGTGAAACGTCCGGTTAAAGTGATGCTCCCCCGCCCCACTATTCCCAATAACACCACGCACCGCCCCGCAACCCTTCAGCACCTGCGTATTGGTGCTGACCAGAGCGGGAAAATCACCGCTATCTCACATGAAAGCTGGTCCGGAAACCTGCCCGGCGGCACGCCGGAAACGGCGGTACAGCAAAGCGAATTACTCTACGCCGGGGCAAACCGTCATACCGGCCTGCGGCTCGCCACGCTTGATTTGCCGGAAGGGAACGCCATGCGTGCGCCCGGCGAAGCCCCCGGTCTGATGGCGCTCGAAATCGCGATCGACGAACTGGCGGAAAAAGCGGGCATCGATCCCGTCGAGTTTCGCATCCTGAATGACACTCAGGTTGACCCCGCCGACCCGACGCGCCGCTTCTCTCGCCGTCAGCTTATCGAGTGCTTGCGCACCGGAGCGGATAAATTTGGCTGGAAGCAGCGCAACGCCACACCCGGACAGGTGCGCGACGGGGAGTGGCTAGTCGGCCACGGCGTCGCGGCGGGCTTTCGCAATAATCTGCTGGAAAAATCGGGGGCTCGGGTTCACCTCGAACCAAACGGCACCGTTACCGTGGAAACGGACATGACCGACATTGGCACCGGCAGCTACACCATTCTGGCCCAGACGGCAGCGGAAATGCTTGGCGTACCGCTGGAGCAGGTTGCGGTTCACCTCGGCGATTCCAGTTTCCCGGTTTCTGCGGGTTCTGGTGGACAATGGGGCGCGAATACCTCCACCTCCGGCGTTTACGCCGCCTGTGTGAAGCTTCGCGAAATGATTGCCTCGGCAGTCGGGTTTGATCCTGAGCAGTCGCAGTTTGCCGACGGCAAGATTACCAACGGTACCCGAAGCGCCATGCTACATGAGGCCACCGCAGGCGGCAGACTGATAGCGGAAGAGAGCATTGAATTCGGAACACTGAGCAAGGAGTACCAGCAGTCGACCTTTGCCGGGCATTTTGTGGAGGTCGGCGTGCATAGCGCGACGGGAGAAGTTCGGGTCCGGCGTATGCTCGCTGTGTGTGCTGCAGGACGCATCCTGAATCCGAAAACTGCACGCAGCCAGGTCATTGGCGCAATGACTATGGGCATGGGCGCGGCACTGATGGAGGAGCTGGCGGTGGATGACCGTTTGGGCTACTTCGTTAATCACGATATGGCGGGGTATGAGGTGCCGGTCCATGCGGATATCCCAAAACAGGAGGTGATTTTCCTGGATGATACCGACCCCATATCCTCCCCGATGAAGGCCAAGGGTGTCGGTGAGCTGGGCCTGTGCGGCGTGAGCGCGGCTATCGCCAACGCGGTGTATAACGCCACCGGTATTCGGGTACGCGATTATCCCATCACTCTGGATAAGCTGCTCGATAAGCTGCCGGATGTGGTTTAAGGAGAAACAATGTCATACCCGCTTTTTGATAAAGACGAACACTGGCATAAGCCAGAGCAGGCGTTTCTCACCGATGACCACCGGACCATTCTGCGCTTCGCCGTAGAGGCGCTAATGTCCGGTAAAGGAGCGGTGCTGGTGACGCTGGTGGAGATACGCGGCGGCGCGGCGCGCCCGCTCGGGGCGCAGATGGTGGTGCGCGAAGATGGTCGTTACTGCGGTTTTGTCTCTGGCGGCTGCGTGGAGGCCGCTGCCGCTTTTGAAGCGCTGGAGATGATGGGCTCAGGCCGCGATCGCGAAATTCGCTATGGCGAAGGTTCGCCGTGGTTTGACATCGTTCTGCCCTGCGGCGGTGGGATCACGCTGACGCTCCATAAACTACGCTCGGCACAGCCTCTGCTCGCCGTGCTGAACCGCCTGGAACAGAGAAAACCGGCGGGGCTGCGCTACGATCCGCAAGCACAATCGTTGGTGTGCCTGCCCACGCAAACCCGAACGGGCTGGAATCTCAATGGCTTTGAGGTGGGGTTCAGGCCATGCGTCAGGCTGATGATTTACGGACGTTCTCTTGAGGCGCAGGCAACCGCGAGTCTTGCAGCAGCCACAGGCTATGACAGCCATATCTTCGATCTTTTTCCGGCCTCAGCCAGCGCTCAGATCGATACCGATACGGCGGTCATTTTGCTGTGCCATGATCTCAACCGGGAGCTGCCAGTGTTGCAGGCCGCGCGAGAAGCAAAACCCTTTTATCTCGGCGCATTGGGCAGCTATCGAACCCACACTTTACGTCTGCAAAAGCTCCACGAGCTGGGATGGTCCAGGGAGGAGACAGCGCAAATCCGGGCACCCGTCGGGATATTTCCCAAAGCCCGGGATGCGCATACTCTGGCACTCTCCGTGCTGGCAGAAGTCGCCTCTGTACGTCTCCATCAGGAGGAGGATTCATGCCTGCCCCCGTCGTCCTGATCCTTGCGGCCGGGCGTGGAGAGCGCTTTCTCGCCTCCGGGGGAAATACCCATAAGTGTATCGGCTGGCGTCAGTCCCCGGAGGTTGCGCCTTATCGCTGGCCATTTGAAGAAAACGGGAGAACTTTCGACCTTGCGATTGAACCGCAGATTACGACTAATGATCTGCGTCTGATGTTGCAGCTGGCTCTTGTCGGTGAAGGAATAACAATTGCCACTCAGGAAACTTTCAGGCCATATATTGAAAGCGGTAAGCTTGTATCGCTGCTTGATGACTTTCTTCCACAATTTCCAGGCTTCTATCTGTATTTCCCACAGCGTCGCAATATTGCACCAAAGCTCCGCGCCCTGATTGACCACGTCAAGGAATGGCGGCAGCAATTGGCTTAAATGTCTGCACCTGCATTGCCTGATGTCAGAACAGTATTTTGATGAATTGCCAGGGTTACAATGGCACAAATACGGCACAGGAGGAAACGTGGTGTATTTAAATATGGGGTAACTTATTGATTTAAATGGTGCCGATAATAGGAGTCGAACCTACGACCTTCGCATTACGAATTATAAGAATCCGCTTCTAATTCAAAGCATTACCCCATCAACACTGCGCTCACACGTCCCACCAAATCAAAACATGTAAAGCCTTGCAAGCCATTGCGAGGCCTTATGTGTCTCAGTTTTGTCCCACCTTGTATTACGACTTGCATAGCCAATGAAGATAAACGTGACGACAAACGGCGCAGCGGTCTTCTTTTCCTTTATACTTTCCCCACCCAGCATGCATACCTTCTACCATAACTGTAGTGAATGTCTGTTATGGGCGAAGAGCGGAAAAAGCATACTGAGATCGCCCGGAATATGTCCCATACTTTTGATGAGCTAATTGTGTACCTCAAGGTGGATGTCGAGGCTTAACCTCGTAGTTATCATCTGCGCGTACCTTTTCAAACAGTTTGTTATCGCTAGCAAATAACATCTAAGGCTTCAGGGCCAATAGATAGATATTGACTAAGCTAAATAGTTTGATTACTCTTCGAAAATTTATTAAATAAATTTTAATTGGATGGTAAACATGGAAAACAAAAAAAGTTGCGTTTATTGTAAAAGCACAACAAACTTAACTAAAGAACATATTTTCCCTTCAGCAATAATTAAAAGTTTTAATGTTGAACTTTTATCGATGACTGATAAGAGTGACTACCATTTTAAAGGCGATCCAGTTATTGGGGATGTATGCGCTGAGTGTAATAATGGAATACTGTCGCAACTAGATGCTGATTTCGTAACTTGTTTTAAAAATCAGATGCTAACACCTTTAAAACCCGGAAATGAAATAACATTTGAATATGAATATGATTTATTATTACGTGAACTTCTAAAGATATCATATAATTCAGCGAGGGCATCAAATGGTGGTTATAATGCCAGAGCCATTTTAGAAAAATACATACCATTCATAATCACTGGCAATAAAAACAAAAATGTAGATAGTATAATTCTTTCCCTTCTAATTGTAACTTCTGCAAATATGGTTAATCTGGAAACTGGTAAACATGAAGAGCCTTTAGAACCATATCTATTAAGAAGCGCATCGATTGATGGTTTAAACCTCAACCCTAACAATTATATTGTTAGGATGGTTGCATTTAATAGCTTTTGGTTTTTTTTGTTAATTCCAAAGAGGCCAGTTACATCCAAAGTGAAAAAAGAATTTTGGGATGAATTTAAAAGAAAAAATCATCTGCATGGAGTTTTGTTAAAAAGAAACAACACGTCAATAAAGATCACAAAAGATAAAACGACATATTTGCATCCTGACTTAATAGAAAAGATGTGGCGAAAAATAAAGTAGAACAAATTTTCCAGTTGACAATACCAAAAATAATTAATGGGGACTAGGTTAAATGACGCAGTGGAACAAACTCACCAGAGAAGAAAATTAATGGTATTAATGTTCAATGTGTTCATTACTGTAACCTAGCCCCTACTAATTTATTCACTATATTGCTGAGTATGTCCGCATCTGGCACATAGCAGCCCTAGAGATAGGAGCACACAGTCATAGATGGTCGGTGGGAGGTAGTGAAAATCCTCTCATACAATAAATACGTAAAATCGATAACGGCAAGAGACCTTTCAATACTCGCACTATCGGAAGTTAACCAACCTGCCACAGCACGTTCTCGCATACACGTGTCTGCGCCCCCCCCCCTCCATCAATCATAATCAGATAGAGCCAAAAAACAAAAACAAAAACAAAAAAAACATCAAGTTACAACAATAAAAAAGACGCCAGTCAACCGATGATGAAACAAATGAGGATACGCAAAGGAGCCGCAGCTCCCTAGTAATATGAAAGCCCAGACCATCAAAGCAGAAGACTGTTAAAATATCAAAAACCCGTTATCCGCTGAAACACAGTACTACTTTGTTTAAGTTAACGATATTGCCTCAAGGGCAGTTTTAAAATACGGTGAATACATCACACAGGTCCGATAAAACTTCTCGACAAACATGCCTATGATCGCTGAGCACAGAGTTTATATGCTGAAGTGTTATATACTCTTCATACCATTCTTGTGTTACAACATTTTTCAAATTATAGAGCCTGTGAAATGACTCTTTTGCCTCCTGCGAATTAACAAATTGTTGATTATGCCCCATTAACCTCATTAAAAACCCCTCCAAACAAAATGGTGAGGATTGAATGATTTGTATGTCTCTATTTCTAGCAGCAGATAGAGCATCAGTAGGTATTGCAAGGTCTGAGTCAATAAAAACAACTCTTTTATCAAAACTACCAGCATGGTTTATAGCAGCCATTATAGCTGATTTCGGTCCTCCACCGCCGGCATCATCTAACCTAACACTAAACCCTGCAGTCCTAGTTGTTAAGAGTCGTTTTAAACAATCAGCAAATCTTTTATCAGTAATTCCTTCGCACATCAAAAGCTTAGTTCGTTCAACTCCACGCTTTACACGTCTGGGCTTCATCATAAGTCTGGTACTCCTCCAAAAGCACCAGCTAAATATTTAGCCGCAATATTATCGTCGACTCGCGCCTCACTGGATGGAAAATCATCAAGTCGATAACATTCACTTCTATTGAATTTTTTCTCAGTAATCAACACACGGTACTTACCCAAATAATTTATCAATTCGGGTTTATGACAAGTAAAAATTAACTGTGCATGTAACGGGTTTGTTTCTTTACTGATAAACAACTCTAATATAGGTTTAACCATATGTGGGTGCAAATCATCACCAAGTTCATCAATAAAGCATGCTGTTCCTTTTTTCAGACATTGCATAATATTATGCAATCTGATAAATGCACTTTGTGTTCCAGCAGATTCGAATGCAAAGGGTAACTCAAACTTGGAACCATCTTCTTTTTCATGAATACCATTTATAATATAATAAACATCCTTCTCACCGTTTATATCAACTTTCTCTTCTTTTTCTATAGTTATATCATCTAAACCTAAATCCCATTTGCGTAAAATATTTTTTACACTTTTAAAGGCTACTGGATCTTTATATAAATCCTCCGCTGTCGCACCTAAATCTCCATAATCATAGCTATACCTTCCCATAGCATTTACATTAGTAGTACTGCTCATTTTTTCGGCTATATCTATAGCCAAAGGAACACCTACACGGCGAGCTGCAGAAATTATACTTGTATTAGCAGGAGTTCTCTTACCCTCTAAAAGACCTAAAGGAAAAAGTTCTGCTTTTTCAACATAATCATATTTAATTGTTTTTAACAGTTCTTCAACTTCTTGATGACTAAAATCATCACTACAATTATTTAACTGTTTCTCATAATTTTCTTTATCCAACTTCCTTTTAAAAACATAGATATTTTTATTTTTCTCATTTCTTACATACAATTCTTCTTTAATGACAAAATGGTCGCAGGCCGTAACAACATATGTGTATACTTTTCCATCTAAAACAAAACATATTTTAATCATACCTGGTTTTGTAAAAGGCCGATTAATGTTTAAATACAGATAATCAGTTACTTTTGCTGGTATAGACCAAAAAAACCAACTCAAGAAAGACATTGGTTTAAGGAGTGTTGTCTTACCGGAGGCGTTCGCCCCCATGATAGCAGTAATCAGATTGACCTTGTGTCCATCTACATCTACCCACAGTTCTTTTTTCTTATCTTTAGCTGTAGTAGTTAGGTCTACGAACCCACCTTCATCTTCAAAGCTACCGATATTCTTAATCTCATACCAAAGTATAGCGTTAGCCATAACCCCTCTTTTTATACAAAAAAATGTTCAAAATCGCCACATGCTGAGTTTATAGCAGGAAGCGTATATAGCAAGCAAAAAAACATAGACATGATTTAGTTAAATCGATAACGGCCGGAAATCATTCAATTCCCGCACTATCGAACGTTCACCAGCCAACCGCAAAACGTTATTGCATACAACGTTTCTGCGGCATAATCCCAATGATTACTCCCTGACAGGATTTGCAGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTTTTCCAGGCTTCCAGCAACAAGGTTTCTTTCTCCGTTGCGATCTCCAGCTCAACAACAGTCTGAACGTACCGGGAACAGCCTCCTTCAGAGCTTGAAGGATATCAATGTTCGCTTCCTGTTAACTGCCGGACAAGTGCAACCAGTTCGCTTACCTGATTTTCCAGAGTGCTGATCCGGGTGTCTGCTGTTGCCAGATTTTCACGTAGCGTTGTGTTTTCCTCTTCCAGCGCGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAAACCCGTCACGGCGGCGTAGTCAACATTAAGATAGCGCGTTTCTTCGCGTAGCTCGTTGCCGTCAACGGTCGGGCCTTGCAACTCTTCACCGTAATGAGTAAACGATCCCACAGCTTCTGGTATCGCCTCCATTACTTCCTGTGCAATAACGCCAGCATAAGGCATCCCGTTTTCCCTGAGCGTGTAGGTGTATCCGTTCATTTTACGGATTGCTTTCGTCGCGTCGCTGATAACGAGAATATCGTCTTTAAGGTCGCGGTCTGATGACTGATTCAGCGTTGTGCAATTAATAGCGCCATTTACATCAAACAACTGGCCTGCTGACGTTTTTTGCGCATAAAACAGATACGCAGCAGACGTGCCAACCTCAAAAACGTTTTGTCGAGTACTGGAACCCCACACCCTGATACCAACTGGTAGTTCTGAATTACCTGAATTAAGTAAAGCAAAACGATTGCCAGTCCCTGTTTGTTTTCTAATTACTAAATCGGCAGTTGAGTTAACCTCATCTTTGTTGATGGTTAGCGCCTGTGCTGTAGCACCGTTGACAGTACCGCTTAGTAGTTGCACCGCGCCATCATCGCCATTTAACAGCACTTGAGCGCTGCTTCTGTGGTTTTTAAGGCACAACATTTTTCCCGTGTTTTTCGATGTACCTACTGACCACGCCGAATTGGTTCCAGTGCTATCAACACCGCGCACAGCACAATTCATACTACTGTAATCTGACGTACTCCCCAGCACATCAACACGCCCGCCGCCTAATTTTGATGGAGTCGTCGAGGTTAACGACCTGACGGTTACATCCTGATTCCATCCTGAAACAACTACCTGACTCCACTCCGTCCAACTTCCATTGACAACAAAGCGAACGTATATGCGTTTTGTGTCGCTGCCAATCAGCGTTTGCATGATCGCATAATCTGAACCCGTAGTTTTACGAGTAGATTCTACACGTAGCAAAAAGTTACCGCTTACGCCGTCAGGCTTATTGGTAATATATGCACCACCTGCGACAGTTTTGCATTGATAGTATTTAACAGATCCAGCCACTGTATTAGCAATGATCAAGTCGTTTAAATCAATAGTCTTGTTGTCAATGGTTTCCGGCTCGATAGATCCCGCCATACCTGCGTTAAATGTGGCAAGGTCAGAAAAGGTTGTTTTACCGCCTACCGTCACCCCGCTATCGAGTTTTGCTTGTGTGATTACTTTGCTGTTTTTTGTGTGGTCAGAATAACTACCAAAATACAAATCACCGTCACCAGCTATACCCAGGTATTTTTGATCAACATCATCAACCTTAAACCCAATGGACAAGTTTTTAATTGTGTTGTTGCTCGTTAAAACCAAAGGCGTATGCTGAGATCCTTTGATGCTTACCGTAGTGTTATCGGAACTGGTGCTTGTGTTTGCAAATTCTGCGGTGGCTGCGCTGATTTTGTAAGTTATTTTTAGATTGCGGGCCTCAACCCTCCCGTCGTGGCGAACAATAAAATCACCACTTGATTCACCTTTTGTGTTTTTCGCCCTGATGTGGATTTCGCCAAGTGATTCGTTGTTTTCAGGCGACCAAACAACGCCGCGCTCGCTTCCGTCGTTGTTCATGAACCACAGATGAGCGGTTCCTTTTGCTGATTTTAGCCTGATAGATGGCGTTTCTTTCAATATATCAAGATCACCAGACATAACATCGCCGGATTTTTTCACCTGCGCATCGTTAGTTACGTTGCCAAGTCCAACATCCGATTTCGACGGCTTGTTTGCCGAGCCGTATAGCTCATTGATGCTAAAAGTCCCGTTGTTAAGCCGGGCATCGTTGCCGGCATAGATCTTAATCTTCGCGGTTGAGTAGTCGATATTGATCGCTGACCACGTATCGCCGCACCGCGAGAATATACCAGAACCGTGGCTATAAATGGTCGCCGTGCTTCCGGTCGGCCTATCACCGCGCCAGAAATGACCACCTTTGTCACGCAGAGCTTTTAAGATCTCTACATCGCTCATCTTGCCGTTTGTTGATATCCCGCTACCGCCAAGACCGAAAGCGCCTGTAAGCATAGCATTAGAAAGACCCAAATCTGCTTTAGTCGGCTTGTTTAACTGGTCGTAAATCCTTACAACGGGGCTTTCAACATATCCAGAAGGCGCGGCGGTTTGCTTAACAAATCCGTCTGGAATGTATAATTCCGTGCGTGCCGTCTGCGCCAGTACCGCAACCTTCAGGGGTATCCCTTTCACCTGGTTATCTCCCGCAACGGTATTCCCGGCAACCTGCCGGGTGGCAACCGAGACCGCTTTCTGTGCCACACGGTTTATCGCCCATGCGCTGGCCTGTGGCACCATACGGGTATCAAGGCTGTTCAGATTGCGGATGGCATTCTCAAGCCCCTTCATCCCACACCTCTTTACTCAATAAAGATCATTGGCTTACCGTTAAAACGTTCATGCCGTGTGACCGTCCATTGTTGTCCGTCATAAACAACGCGATCCCCGCGCCGTGGGCGGTATCCCGAAGAAAACACCACCAGAGAGACCGCAGGTCCGGACAGAGCATTCAGCTCTGCCAGTGTTTCGCCCGGGATCACAGCCATATCGACATCATTAATCGAGGCTGTCTTTCCCATCTTTCTGACCGTGATCGCATCCATACGCGCTGCCAGCCGGGAAAAGGGATCAGACATTGAGTTTTACCGGCACTTCTTCTGCACTGGTTCCGGCATCTGCCCAGACAACCCCGACCAGCGGATCAGAGCCGCTGTTAGTCAGCTGAACTTTTCCGGACTTCAGATAAACCTTCTTACCCGTTTTCATGTCATCCGTTTTCAGTTTAGGCAGGATAAACACACCTTCGGTCATGCCGTCGCCTGTTTCACCCTGTGGAATATCGGTCAGCGCCACCGCAAAAACATCACCCACCTGCACCAGATCTCCGCTGCTGATGGCTGCACTGGCAACAATCGCCACCGTTTTTCCTTCTTCTACAAAATTCTTTGCCATAACTGTCTCCGCACAGCCCCGTTCAGGGGCTGATTTCAGGTACAAAAAAAGCCCTTACGGGCCATCAGAGTTGTTGTCTGCGACGTTTACGCCGTACATTTCACCAGACCGCGGTGATCAACTGGCGCGACACCGGCGTCAATACGCACTTTCGTTGTCACGCCATCCACACTGAAGCCCTCCATCTGATCAATATATGGCGTATCCACACCGTTGAGATAAGCCACTTCAATCGTATCGGAGCCTTTTGACGCAGCCAGGTAGAAGGTGGTCTGGCTGTTATCATCAAGACGAGGCTCTGCAATAACGGTCGCAAAATCTTTCACCGGGTTAATAATACCGGCGTTAATGTCAGCCCCCTTGACACTTGAGGAGCGAATGACCTGGTTAGCAACAGACTCCATCGCCGTCGGTACCAGTACGAACGCAGGACGAATATTCAGATGACGCTCCCCCTCTTTCTGAACGCGCATCAACTGGCGGGCTTTATCCAGCGATGCCACGTCCATTGCAGCGCTCTCCAGTACGTTTGCATGTTTCGCTTTATCGAACAGACTTACATTATCTGTGGAGATTTTCGGGTTAGACGTCAGAATGGCATAAACCAGATCGGCAATAGTGGATTTCGCCGCACGGCCCAGTTTCATCGGGACATCGGTCAGCATATTCAGATCATCATTGATAATGGCCTGACGGGTGATACTGAACAGCTCGCCATAGGTCGCCAGTGCAATAGTGGCCTGTTTATCTCCGGTGGTGACGTATTTATATTCCGCCCCTTCACGCACCTGACGCAGAGCACTGAAGCCCCCCATACCCACACGATGGGCAATTTTAAAATCAGACAACTGACCTTTCCGCGTCCACTGTTCATAGGTTTCAGGGGCATCTTCCCAGCCCTGCAGAATGGCTTTGTTCGCAACATCCAGCAGAATATTACCGAAGTCAGACGTACTGTGTGTGAACGCCGCACCGACCATCTGCATCGGGTTATAACCGGAAACCCCAATACCCCGTTCAGTCAGTGACATACGGGCATATTCACGCAGGGTCATCCCGTTGTAGACATTATCACGTTCGGTTTTTTCAAATCCGGCACGCGCCATCAGCGCCTGGCGAATCCCGTCCCCCACAAAATTACCGTTACCGGCATAAATATGAGCCGGGGTATTTTTATTGGATGGCGTGGACTCGCGCCCCATCTCGTTCAACAGCTTTTCACGGGCCTGCTCCAGCGAACATTCAGGATCGGCAAGACACTGAGCCTGCAGCGTCTGATAACGCCCGCCAAACATGGCAAACAGATCATTAATACCGTTTACACGCGCTTTTTGCTCTGCCAGTACCTGTGCACGGATGCTGTTTTCATCCACCACGGGTGCTGCTGCCTGCACTGGCGTCCGGGAGGCTTCAGATTCATTATCCTGTACGCGTGGAGCACTGTTGCGTGGCGGAGTAATCATGTTTCGAATGGATTCCGGCATCTTTTTAAATTCCTCTGTACGTTTTGACTGAATACATGCCATTGCCTTAACGGCTGGCGTTACCTGATCAGCAAATCCATGTGCCAGACATTCGGCACCGGACATCCAGGTCTCATCCGCCAGCATGGCAGCAATTTCATCGGTGGTTTTCCCGGTTTTCTGTGCATAAGCGGGTAACAGAACCGCCTCAACTTTATCGAGCAGGTCGGCATAGGTGCGCATGTCCTCCGCATCACCGCCCGTAAAGCCAAATGGTTTATGAATCATCATGAAGGTGTTTTCCGGCATAATGACCGGGTTTCCCACCATCGCAATGACCGACGCCATTGACGCCGCCACACCGTCGACATAAACGGTAATGGACGCACCATGTGTTTTCAGCGCATTAAAAATGGCGATGCCTTCAAAGACATCGCCACCCACGTTGTAGACGAACGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCAACCGTGGCAAGGTCTTTATTCAGGAGGGCTTTGCATTCTGCTTCGGTATACGTTTTACCGAGCATGATGTCTTTTCCGGTGTGTCCGTGACATACAGTCCATACGCCAACGATATCTTTGTATGGTATGTAGCTGACACCTTCCAGGCCATCGTCACCACTTGGGCCAGTGATTAACACTGATGCTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACTGCTTTTCGTAATGATGGAGGCATTATTCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAGACTACCCAGCACACCTATTGCCGCCCACTGTGAGGGCGTGACTTTATCGAGCAACTGTAAAAACCAGTACCCGGCACTACCTGCTGAGGTGCCATAGGCGACACCCGTTGTTAACTTATCCATGGATTTCATAACCCCACCTCGCAGATGCGGGTGCTGTGTAATGGAAATAAAAAGGCCACCTGACGTGGCCACCAGATTATTTCCCCACCAGCTCGTTTATCTCTTTCACTGTCTGGTTAAACCGCTCTGACTCAAGCTCAACACCTAAGGCCCGACGCCCCAGCGCCATTGCTGCTTTTATTGTGGAACCGGATCCCATAAAAAAATCAGCAACCAGATCACCTGGTCGACTACTGGCATTGATTATTTGCCTGAGCATATCCGCCGGTTTCTCACACGGATGTTTCCCCGGGTAGAACTGAACGGGTTTATGCATCCAGACATCGGTATAAGGCACGGAGACTGATACGGAGAAATAGCGCCGGAGAGATTTAAACTCATCCAGCAATTCAGAATATTTGCGATTCAGTGAATCATAAGATGCCACCAGCTGGTGGTGTGGTTGTTCCAGTTGTTGTTCCTGAAACTTCTCTGCCGCTATACGGGAAAACAGTGCCTGTAACTTCCGATAGTCAGCCTCATTCGGCAACTGCCACTGACTGGCACCAAACCAGTGGGAAACCATATTTTTCTTACCTGTGGCTTCGGCAATTTGTTTTGCCGTTACACCCAGTTCGGCACGAGCATCCCTGAAATACGATATCAGCGGTGCCATTATGTGCTGTTTGAGTTCCCTTTCTTTTGCCGCATAGCCGTCACTTTTGCCGCGATATGGTCCCTGGTAATGTTCAGCAAACAGAACGCGCTCTGTGGCAGGAAAATATGCGCGCAGACTTTCTTTATTACACCCATTCCAACGTCCGGACGGCTTCGCCCAGATGATATGGTTAAGCACGTTGAAACGTTCACGCATCATGATCTCAATATCAGATGCCAGGCGATGCCCACAGAACAGGTAAAGGCTTCCGGCAGGTTTCAACACCCGCCAGAACTGGGCCAGACAGTGGTCCAGCCACTTAAGGTAATCTTCGTCCCCTTTCCACTGATTGTCCCAACCGTTGGGTTTCACCTTGAAGTAAGGCGGATCGGTAACAATCAGGTCAATGGAATCATCAGGCAGGGACTGAATAAAATGCAGGCAATCAGCGTTGATTAAATCAACACTGTTTATTTTTACAGTATTTTTCATGGATCAGTAAGCGTAACTCTGGTAGGCTCACTCTGCTTTTGCGCTAAAGCAGTGGGCCGTGGTTCGCTTGTGACCAGTAAGCATGAGCGAATGGCTGGCAGGTGCTACCAACACCCACCAGCCGCCCATTTTCACAAATTAAAAGTCCTTCATTGCTGAAGGCGTCTGTAACAGCCGAACTGGTAATCTGCCAGCCCCGCCATAACCAACTGGGTCAGTATTAACTGACAGCGTTCGCGTGAAAGATATGTGTTTTGTGCAATCTCCCCGACTGTTGCCGGTTCGATGCTTAATTCATTAAAAACAACTTTCGCCGTTTCTGTCATATCTTGCTGTTTTAGCATGTCTTTTTTCCTTCTGGTTAACATGACATACCAATAACTCTTGTCTAAAAAGCCAGCAAGATAAAAAGTCAGTATTCACGACCACCAGCGTGTTTACCGTACTGCACCAGGTTTACAGGTACAAAAAAACCCGCTCGACGGCGGGTTTAAGTTGTGTGGCGAAGTAACCACTCTTAACACAATACAATACTTTTTGCGTACGCGTTATAGTTTTCTTACAATCAACTTTCAATTAAAGGAACGAAAACATGACTACACTCAAAGAACTCAAAGAAGAGCTTGCTCAGATACAAGATGAACACGCTAAGAACAGAAAAAAGGCTGAAATTGCAGCTTTAACCGCTTCCGCTGACAATGAAATCAGACTCGCTCAGAGAAACATTGGATACAATGTACGTGAGTGGACCGTTGAAGTCATTATTCAAAAGTATGGTGATAATATTGAAAATGACAAAAACGAGCTTTTTATTCCAGATTATCAACGTGATTACAAGTGGGATACAAAAACAGCCTCTCGCTTTATAGAAAGTATTCTATTAGACTTTCCAATACCTTACCTTTACATTGCTGATGTATTTAATGAAGACCCTGAATTAGATGGTAGGGTAGAAATCATTGATGGTTCACAACGGATACGCTCTATTTATTATTTTTGGAACGATCAATTTGAGTTAAAGGATTTGAAAGAGCTTAAAAGTTTAGAAGGTTTCAAATTTTCAGATCTTTTAGCTAGCCGCCAAAGAAGATTTTTAAGAGCTTCACTAAGATTTATTGAGTTGAAAGGTGATGTTGAGGAACAACATAGAAGAGATTTATTTGAAAGGATCAACTCAGGTGTTAAAAGATTAGAAGCGATGGAAGTGAGGCATGGCTCAGATGCTGCTACCTCAATGTTCTATAAAGATGTTGTGACCCCATGCTCAACAAACTCACTTTTTTCCCAACTAGCTCCATTATCAGACCGGAAACGGTCGAATGGCGATCATCGTGAGTTAGTTTTGAGATTTTTTGCATATTTAAATGATTTAGAAAACTATAAGGGATTTGTCCGTCCCTTTATTGATAATTATTTAAATGAACAAGCAGCAGCTGTGACTACTCAACAAGATGTTGATAATTTTAAACATGATTTTGAAATGATGCTTGCTTTTGTTGCTACCCATTTCCCTATTGGCTTTAAAAAGACCGCAACAAGTAAAACCACCCCACGAGCCAGATACGAAGCCATTGCCGTGGGAACTGCACTTGCATTAAAAACTAACCCACAACTCCAGAGTCCAGCCGTACCTGTAGGTGAATGGCTATTTGAAGAGGAATTTGAAACACTTGTTACTGCTGATAGTGCAAACAATACCAGCCAGCTAAAAAACAGAATCTTTTACGTTAAAAATAAGTTGTTAGGGATTTAAAAATGAGTCTAATTGATTTACGAGATGAATATGAAGAAAGAGCAAGAGACATTATGGAACTGCTTTCTCTTGCATCATCTATAGAGATTCATACTCAGCAGTTAGATCCGCAAGCGCATCAAGATGAAATAGAATCTAATATCCTAAGGGTAAATATTTTAAAATCATCCGTTCATATGATGCTATATAATCAAGTTGAAAATACTGCCAGAGGTTGCATCGAGTCAATTTATGATCATTTACAAGATAATGAGGTGAATTACGCATCACTCAGGGAGAAGCTTCAAGTAAATATATTACATAGAATTGTTTCAGATAATGAAACAGGGCAATCCCTTTATAAAAAGATAGGCACTGACATTTCCAAAAGAATAATTTCAGCCTCATTGAATATTCGTAAAGAATTCAATGGTAATGTTTGCAAGCCTGTATTACACAAAATAACGCAGGCTTACGGAATAACTATCGCAAACTCACCTGAATGTAGAAATGGTATTGACTTAGACTTGCTTAAGGATATCAGAAACGAACTCGCGCATGGAAGTACTAGCTTCTCTAAAAAGGGGCAAATTGACCCCTTAGAAGAAGTTAAGTCTAGAGCAGAGAGAGTTGATCTATATCTTCGTTTATTAATAAACTCAACTGAAGATTATATTATCTCTAACGGATATTTATCCCCTCAACATGCCTAACAAACGTTCTCCCTATTACTTGGCCAATTATAGGGGGAACAGCATTACCAATCATCGTTCCTAACTTTTGGAATGAAAAAGGCGTATTTTTTCCAATAAATTTATAATCCATAGGAAAAGATTGTAAAATAGCAGCTTCACGCAAAGTTATTGCTCTATTTTGCTCAGGATGTCCGAATCGCCCATTACCATATCCATAACATTGGGTAGTTATTGTAGGACTAGTATCGTCCCAAACCATCCGTCCATAAACACTTTTATAGGTAGCACCTGAATGCTTTTTATGGCAATCCGCTCTAATTTCTTCAGGCCAATCATCCCACGTACCACCTGGTAAAGAGTGCAAGATTCGTTTAAGGTTAATATCCCTTAATTTAGGCGAACGATGCAATGGATCACTTTCGAGTTTCTCCCCTGCACCTATTTTTGGCAACTGACCAATAGCATCCTTTACTGTAACTTTACGGCTTACTTTTTTTTGATCAAGGCTGATTGGCCCCAATATGGACCCAATTAGAATTAATCTACGTCTATTTTGAGGCAAGCCATACTCGGAACATTTTACAACGTCGTACCACAGATGATACCCAAGAGTCTTTAATACACTAACAAACTCTTCAAAAACCTTATGATTTCTAAGTTGAGGAACATTTTCCATAGTCACAAGCTCTGGCATGACATCACTTACAATCCTCTGAAACTCAGATAAAAGACGCCACTTTGTATCATCTTTTCTGCTATTTGGATTACGATACTTGGAAAACGGTTGGCAAGGGGCACAGCCTGCAAGTAATCTAATATTTCCTTCCTTGAACATAGCAGACACATCGCTGGATTGCAGTTCAGTAACTGACTGGTTAATGAACTTCGTTAAGGGATTATTGCTCTCAATAGCGAAACGGCAGGACTCATCAATATCAATGCCATGAGAAACTTCAATCCCGGCTTTTTTTAGCCCAAAAGTTAAGCCCCCTGCGCCACAAAAAATGTCTACTGCTTGAATGTTCACAAGATTCTCCGTTACTTTACCCTGGTATTATATACACATAAAAGTTGGGAAAGTAGAATGATTTTACATAGCACTTAACTCATTGTATTAAATAAAAAAATGACAAGCATCCATCAATAAACCCCATAGCTGTTTGCATATCTTTTCGAACTGTTCCGTCTGAACATTTCCGCTTCTTTGCAATTGTACGGAGCGAAATGCCAACAACAAAGTGAGCAATTATCAGCTCATACTCTGCCGGCTTGTATTTACGCAACCTAGCGACACAACCGTCTATTATGATGCCCTCATCATCATCGCACTGAATCCGGGACTTTCTGCCATGAGGTAAAAGCCGCTTGAAGCCAGCAGCTACCGGTTGCCAGTCCACACCACTGTTTTCTGCAGCAGCCCATGCCCCCCAGCGGTCCAAAACTTCATACATATCACGCATCAACTTTCTCCACAAAATCAGGCCAGCACGCCAATTGCCAGCGCACGATCGATAAAACGAAATATCAGCTCCAGCTGGGAGCCATACATCTCTTCAAATGCCACGGTATCTGCATGCAGCTCGTCGTGATGCTTTCTGCACAAAGGCAACACAAAGAGGTCATGCGCTTTTGTACCCATTCCACCCTGACCGTGACCTATCAGGTGGTGGGGATCATCAGCAGGCTTTCCACAACATGCACACGGCTGTGTCTTAACCCAGCGCGTGTACTTTTCATTAACCCAGCGGCGACGTTTTGGGCGTAACATAAAAGACTCCGGCGACTCCGGATCCACTTTCAGCGCCAGCACCTTTTTCGCCTTATCCTGGATGATGCTGGTAGCAGGAACCGAAGGCACAAGGTCACTTTCCCGGGTGACAGACGGCACAACAGGCTTCGGTAATCTCAGTGCCTTACGGGCTGCACTTTCCGGTAAGGCATCCGCCAGGTCATTACGAATCAGCCACCAGCACAGTTCCGGCATTGTCACAACGTGACTGTCATCAAAACCGAGATCCCGACGCACAACAGACAACACCCAGCGGGCACAGTTATCCGTTGCCATTGATTCCAGCCGTTCCGTGAACTGATCACGCAGCTGGTTATCGCAGTGCCAGCACAGACGGATTGCACCCGGAGCGTGTCGCATTGTGGTCATGTTCTCGCTGTGCCAGTCGGAATGAGGCCACTGGCAGCCTTTTTCACGAAGTAACCAGCTTTCAAGACATTCCACGCCACCAGCACGACGGATCACTGCCTCATTGCGGAACACGGCCCAAACGGCAGGATCATCCGCCAGCGGTTGTGACGCCGCGGGAACGGCACCACTGGCGAAAGATGAATAATGCTCCGGCTCAGGCTCCAGCAGGACACGCCCCTGCATAAACAGGGGCATCAGCTCTGAACCGGGCCTGAACAATACGATCCCCATACGCGGGGCAATTTCAGGGGTCAGTAGTGCTCTCACGGTCACCTCAATGAACGGTATCGAGCAGCTTTAACAGCTCAGGGACCAGTTTTTCCGCCTGGCGAGTCAGGCACAAAATCACCCGGGAATCGTTAGTGCCGACATAGAAATTGCGCACAGGTCTGGTTTCACGAACTGGTTGTGGTTCCGGCTCCTGCGCTCTCTCAGTCAGGCGCGGGAAATGTCTGCGTGTATCCCCTTCACAACGGTGAGCCACACGCCCACTCTGACGTAACTTGCTTGCTGACTGCAGAACGCGCTGCCGTGAGTAACCTGCAAAAGCATCCGCAATGTCTCCGGAAGTACAGCCCGGATGGGCTTCAATGAATTTCTGAACTTCATTCAAAAGACTCATGCTCACCCCCTGAATCCTGCCGGGATCTGGCTGTAGTCCACGCTGTCGTAACTGGATTTGAAATACGGGTCTTCGCGTTTTTCTGTGTATGTGCTGATGGACGGCGATAAGCGCAGGGAAAGCTCATCCCATTTTTCCCGCAGCTTCGACGGGCTGAGCACGTTACGGCACCAGAACGGATCGCGGCTGACGCGGCTGTACATCTCGCAGATTTGTTTGTGAGTACGACCATCCTGCACACACATCAGGCGAATTTCGTTTGCCCAGGCTGTCCAGTTCGGTTCTTTGGGACGAACCACCTCGCCGTCACATTCGGCGGCCTGCTCGTACAGGGCAATGATTTTTTTCCAGAGCCACTGTGCGCAGGTCAAATCATCCTGCGTTCCCCACTGGCGCTTTTTAGGGCTGAATACAACCGCATCAGGATGGCGAGTTAAAAAATCCTGTTCAGCCGTCTGCGTGTCCGGTTGCGAAGCGTCCGGACGAGAAGGTTTTTTATCTGACGGATCATGTTTTGATTTTACTGACGGATCCCCGCCAGATTCTGACGGGTGAAAACCCGCTTTTTTGCCAGATTTCGACGCATCAAATTTTGACGTGTCAGATTTTGATGCGTCAGATTTTGACGGGTCAGAATCTGACAGTTGAGAAAATGCCGCTGCCTGAAGCTTCGCAACGTTAAGCTGATAAACATTCGACGCATTGCGGTTACCCTGGCGACGCGCCTTACGCGTTAACCAGCCTTCTGCTTCCAGCCGTGCGATAGCCGTTCTGACGGTACTCATCCCCGCGCCAATCTGACGGGCAATAGTTTCAATTGATGGCCAGCACACACCTTCGTCATTACTGAAATCAGCCAGGCGGGCCATAATTGCCACGCTGGATAATTTCATGCCTGACGCAGCGCAACCATCCCATACATAGCCGGTTAATTTAGTGCTCATGACCGACCTCTATTTCCCTGAATTTACGACGAAACTGTTCGAGCGGGCTGAAGCACTCATGCTCATAGCCTTCACGGAGGTAGATAACCCGTTGTGTTTCCGGCTCCCAACGAATGACTCTGACGGGCACTCCGTAGTGATCTTTGAACCAGCGGTTAACTTGTCGCAAAGGACTGTCTCCTTCTGCCGGTTGAAATCCCCCACAGCCCACTCAGCAAAGCTGTGGGTTACAATTTCCCTGTCACCTGGTACATTTACTGCATAGCAATACTCCACCTTCGCTTTTCCACCCGGTACAGGAAGCGCAATCAGTTGCGAGCGACGGTAGTGTGTTGTTAAACTGTTCATGCGTTAGTTTCTCCACAACCAGAAGCAATCGACGCCACGACGCCCGGAGCTGCACACTCGCGGGCGTCATTACTTTCTGAAACGCAAAAAATTTTGTAGACAAGTGCTGCATGCTCCTGCAGCTTCGAAATTGAGAGGTACAGCTCGTCGTTAATTGCTGTCTTCTCATGCGGTTCCACTACACCGTCTTCGATTGCCGAACGAATCTGTTTTGAATAACTGCCGATCTGTTCAATGACTTCCAGTAAACGCTGGTTAATATCGGCATTGTCCACATCCTCGACGTCAGGAAGAGACACAAAGACGCCATTTGCAGACTGCGCCACAGCGTCGGCAATGAAGTGAGTGCCACCAGCACGTTGTAAAATCATTGCCCATCCCAGCGGGAAAATCTGATCGCCATCGGCACGAAGGCGGTTAAATAATGCGTTCTCTGTTACATCCAGCCACTCAGCAGCTTCAGCGTAACCCCCCGGCAACGCCGCGATAGTTTTTCTGACAGCTTTCACGTACCACTCAGGCTGTTTTTCCACTTTCCAGTGATGATTACCCACGGCTTACCTCCTGTTCCTGTGGTTTAAACCCATTCTGGTTTTGGCTAGATTGAAAACGTGCCGGATAAAGAATCTGCATTTCGCTGATTTCACCCTTAAAAAAATTGGCCAGACGTTCTGCAAGATCGATAGATGGAATTTGTTCCAGTCTTTCAATACGACTCAGCGTCGCTGGATTGACCTGAACGCCAGCAGCAACATGCTGCAAAGTAAATCCGTGCGCCTTACGCACATTCCGTAATGGTGATTGCATATAACCTCCACATATTGCGTGATAAGCATATTATTTCACGCAAATATTTTGCGCAAGTTGATTTGCTTAACGCGCAATAAAGAAATGTAATAAACGCATGAACATAGGAAACCGAGTCAGACAACTTCGCCAGGCGAAGAACATGAAAATCGCCGATCTCGCTGAAGCAATAGGAGTGGATGCGGCGAATATCTCACGCCTGGAAACAGGTAAGCAGAAACAATTCACTGAACAAGCCCTGAGTAATATTGCCAGGAGCTTAGGTGTTGATATTGCTGATCTCTTTACCTCAGACGTCAAAAGTAATACTGTATGTAAAAACAGTATTAGTGAGGATGTTGCGCAGGTGAAGGATGTATTCCGTATTGAAATGCTGGATGTCAGTGCCAGTGCGGGAAATGGCCTTATCCAGGGCGGTGATGTCATTGATGTGATTCATGCCATTGAATACAGAACTGATAATGCTGTATCGATGTTTGGCGGACGGCCAGCCAATCACATTAAAGTTATCAACGTTCGTGGGGACAGTATGTGTCCAACCATTGAGCCAGGAGATCTCATCTTCGTTGATGTCAGTATCAATCAGTTTGATGGAGATGGTATCTATGTATTTGGTTTTGATGATAAAATTTATGTCAAACGACTGCAAATGATACCTGACAAACTACTGGTGATTTCTGATAACCAGATTTACCGTGAATGGGGAATTACCAGCGAAAATGAACACCGGTTTATGGTCTTTGGAAAGGTCTTAATCAGCCAGTCACAAACCCTTAAGCGACACAATTAACCCTTACCTCCTCATCAATTAGCCACCCAAAGGTGGCTTTTCATTACCCTTTAAATTGCATATCTCGCAACAAAAACACTTGCATAATGCGCAACTTCATTTTATCTTTCTTTCCAGACAAACAAACAAGGTACTAACAAAATTTGGTTGTAACACGGCGTATGGCACATGCGTCGTTAGCGGTCTGGTGACGTTAAAGGGGACAATCCACTCCTTGCTCGAGCAAACAAACCAGGTAGCCGGAATGTGCAAGTCAATGATGATGCTGATAAGACGCCTAACCAGCGTGGCGATTCGGTTTGACGCCTGGGAAGAGACCAGGGTGCAACGATGAGGGCATTTATGGAGCCGCGACAAAGTGTGGTGCCGTAACTGGCTAAGTGCTCTCAGCGTTGTGGTAATCCGCGAAATGGCGCGGCGGTAAGTATGGCGGGGTTACTCTTTCCCCGTTGAGGACACCGGATTGTCAGGTTGACCATACGCCTGAGTGACAACCCCACCACAACAGCCACTGCTTTGGCGGTACCAGTTTGTACCCTTGCTTCCGGCTGGTACCGCTCTTTTTACAAAACAGAGAAGAGCATCACCGGACGACGGGCTCATAACCCAATCCATCCGGGCGGCAGTCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACCAACCGACCTTGCAGGGTCGATATGATGAGGAGCAGCAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATCGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCTGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCGCTCAACCTGGAAATCGAACCGTTTGATGAGAACCGTGTGAAAATCAAACACAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCTGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAAGACCAGGGACAATTACTGACTCTTGCCGGTGAACCTGACGGAAAACTTCGCGCAGCAGGTCATTAATATCGTTCTTAATTAACTGATTATTTATCTCATCACTGAATATCTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACGCAACCGCAATTAATCAAATCCATGCTCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGCATCGATGACTTTACCCGTTATTCTAAAGATCTTGCAGATGAAGGCACCCGCTGCTTTATCGATGCCGATAATATGCGAGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACTCTCAAACTGAAAAAGACAGCACCGTTCTCTGCTCTGTTGTCTGTTAATGGCGAGCGTCATTCCCAGAAGTCACTGGCAGAATGGATTGAAGACTGGGCCGACTACCTTGTGGGCTTTGATGCTAATGGTGACGCTATTCAGGCAACAAAAGCGGCTGCGGCTGTCCGTAAAATCACGATTGAAGCAAACCAGACCGCTGATTTTGAAGATAATGACTTCAGCGGCAAACGCTCCCTGATGGAGTCTGTCGAAGCGAAAACCAAAGATATTATGCCAGTGGCATTTGAATTTAAATGCGTTCCGTTTGAAGGTCTGAAAGAACGTCCATTTAAATTACGCCTCAGCATTATCACTGGCGATCGTCCTGTACTGGTTCTGCGCATTATTCAGCTGGAGGCGGTGCAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAGAAATTCAAAGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCATTTATGGAAACGTAATTGACTCAATAATCGCCGGATGGTGAGGGCTTCCTTTTACCAGAATTCAGCGTGGTGCAGCACATATACGCGGAGAACAAAATGTCATTTATTAAAACTTTTTCCGGGAAGCATTTTTATTATGACAGGATAAATAAAGACGACATCGTGATTAACGATATCGCGGTTTCCCTTTCAAATATCTGTCGCTTTGCCGGTCATCTTTCACACTTCTACAGTGTCGCCCAACATGCGGTGCTTTGCAGCCAGCTGGTGCCGCAGGAATTTGCTTTTGAAGCGTTAATGCATGATGCAACAGAAGCGTATTGCCAGGACATCCCCGCGCCGCTGAAACGCCTTCTTCCTGACTATAAACGGATGGAAGAAAAAATAGACGCCGTAATCCGTGAGAAATACGGGTTACCCACGGTTATGAGCACGCCTGTGAAATATGCCGATCTCATCATGCTGGCAACCGAACGCCGCGATCTCGGGCTTGATGATGGCTCTTTCTGGCCAGTACTGGAAGGTATCCCGGCAACAGAGATGTTCAAAGTTATTCCACTGGCTCCGGGCCATGCCTACGGGATGTTTATGGAACGTTTTAACGAGTTATCGGAGTTACGCAAATGCGCATGAATGTTTTCGAAATGGAAGGGTTTCTTCGCGGGAGATGTGTACCACGAGACCTGAAAGTGAATGAAACGGATGCTGAATACCTGGTACGTAAATTCGATGCGCTTGAAGCTAAATGTGCAGCACTGGAAAACAAAGTAATACCAGTGTCAACTGAACTGCCACCAGCAAATGAAAGTGTTCTGTTATTTGATGCTAACGGAGAAGGCTGGCTGATTGGCTGGCGTTCTCTCTGGTACACCTGGGGACAAAAAGAAACCGGAGAATGGCAGTGGACATTTCAGGTCGGGGACCTTGAAAACGTCAATATCACTCACTGGGCAGTAATGCCAAAAGCACCGGAGGCTGGAGCATAATGACCACTTTTACCGACAAAGAACTGATTAAAGAAATTAAAGAGCGTATCAGCAGCCTTGACGTGCGAGACGATATTGAGCGCCGTGCTTATGAAATCGCACTCCTATCTCTGGAAGTAGAACCAGATGAACGCGAATCTTATGAATTATTCATGGAAAAGCGTTTCGGTGACTTAGTAGATCGTCGGAGAGCAAAAAACGGCGATAACGAATACATGGCATGGGATATGACTCTCGGTTGGATCGTCTGGCAGCAACGAGCTGGTATCCATTTTTCAACAATGTCACAGCAAGAGGTGAAATAATGGAGCCATACAGCCTCACACTCGATGAGGCCTGTCATTTTCTCAAAATATCCAGACCGACTGCCATTAACTGGATACGCACAGGGCGTCTTCAGGCAACACGCAAAGATCCCACTAAGAATAAATCTCCTTACCTCACAACTCGACAAGCCTGCATTGCGGCTCTTCAGTCTCCGCTGCATACTGTCCAGGTGAGCGCGGGTGATGGCATAACAGAGGAAAGAAAATGTCACTCTTCCGCAGAGGTGAAATATGGTACGCCAGTTTCACATTGCCGAACGGTAAAAGATTTAAACAGTCTCTTGGAACAAAGGACAAAAGGCAGGCGACAGAACTCCATGACAAGCTAAAGGCTGAAGCATGGCGGGTCAGCAAACTTGGTGAAATACCTGATATAACGTTCGAGGAAGCGTGTGTCAGGTGGCTTGAAGAGAAAGCACATAAAAAATCACTGGACGATGACAAAAGCCGGATCGGATTCTGGCTTCAACATTTCGCAGGAATGCAACTAAGAGACATTACTGAATCAAAAATTTATTCAGCAATGCAGAAAATGACGAACCGGCGTCATGAGGAAAACTGGAAACTCAGGGCAGAAGCATGCAGAAAAAAAGGGAAACCTGTTCCAGAATACACGCCAAAACCAGCGTCCGTTGCAACGAAGGCTACGCATCTTTCATTTATAAAGGCCCTACTAAGAGCCGCAGAGCGTGAATGGAAAATGCTGGATAAGGCACCAATTATTAAAGTGCCTCAACCAAAGAATAAACGGATCCGCTGGCTGGAGCCCCATGAAGCACAAAGGCTGATTGATGAATGTCCGGAGCCATTAAAGTCTGTTGTTGAATTTGCACTGGCAACAGGTTTAAGACGCTCGAACATCATCAACCTTGAATGGCAACAAATAGATATGCAGCGCCGGGTGGCATGGATAAACCCGGAAGAGAGTAAATCAAACCGCGCAATCGGCGTTGCGCTGAATGATACTGCATGTCGCGTTTTGAAAAAACAAATCGGGAATCATCACCGTTGGGTATTTGTGTACAAGGAAAGCTGTACCAAACCAGACGGAACGAAAGCGCCAACAGTAAGGAAGATGCGGTATGACGCAAACACAGCCTGGAAAGCGGCGCTGAGACGAGCTGGTATTGATGATTTCAGATTTCACGACTTGAGACACACCTGGGCAAGTTGGCTGGTTCAAGCCGGAGTCCCGTTGTCAGTGTTACAGGAAATGGGTGGCTGGGAGTCTATCGAAATGGTTCGTCGATATGCTCACCTCGCGCCTAATCACCTTACCGAACACGCACGGCAAATAGACTCGATTCTGAACCCATCGGTCCCAAATTTGTCCCAGTCAAAAAATAAGGAAGGTACTAATGATGTGTAACTTATTGATTTAAATGGTGCCGATAATAGGAGTCGAACCTACGACCTTCGCATTACGAATGCGCTGCTCTACCAACTGAGCTATATCGGCCCTGAAAGGACATGTTCACGAACGTGAATCACGGTGGACAAGGTTAAAACTAACCGGGCGATGCGTCAATGGCCTTGTGAATCAAATGGCTACTTTTGCATCACCCGGTTTTATTTACGCACGAATGGTGTAATCACCAATACCGATCCACTTGTAAGTGGTCAGTGCTTCCAGCCCCATTGGGCCACGCGCGTGGAGTTTTTGTGTGCTTACCGCCACTTCCGCACCTAGTCCAAACTGGCCGCCGTCGGTAAAACGCGTAGAGGCGTTAACGTAAACAGCGGACGAATCCACTTCGTTAACAAAACGCTGGGCGTTGCGCATATCGCGGGTCAGGATCGCATCGGAGTGTTGTGTGCCGTGTTCACGAATATGGGCGATGGCATCGTCAAGATCACTGACGATTTTGACGTTCAAATCTAATGACAGAAACTCATCGTCATACTCTTCCGCTTTAACAGCCACCACCTTCGCGGGGCCTGTCTGCAACTGCGCCAGCGCAGCTGCATCTGCGTGTAATGCCACGCCGCTTTCCTCCATTTGTTTGCTTAATGCGGGCAGGAAGCTATCGGCGATGTTTTTATTCACCAGCAACGTTTCTACCGTATTACATGTGCTCGGACGCTGAGTTTTCGCGTTGACGATCACTTTTAATGCTTCAGCAATCTCTACACTTTCATCAACATAAATATGGCATACGCCTATACCACCTGTGATCACCGGGATCGTCGACTGTTCGCGGCACAGTTTATGCAAACCAGCGCCACCACGCGGGATCAGCATGTCGATGTATTTATCCATACGCAGCATTTCACTGACCAGCGCACGGTCAGGATTATCAATCGCCTGCACGGCACCCACCGGTAAGCCACAGGATTTCAGGGCGTCCTGAATCACCGCCACCGTTGCCGCGTTAGTGCGACAGGTTTCTTTACCGCCACGCAGAATCACTGCGTTACCGGTTTTCAGGCACAGCGAAGCGACATCAACCGTCACGTTCGGGCGCGCTTCATAAATCACGCCAATAACCCCCAGCGGTACGCGACGACGCTCAAGACGCAGGCCGCTGTCCAGTACGCTGCCATCGATTACCTGCCCCACCGGATCGGCGAGGTTACACACCTGGCGCACATCATCGGCAATGCCTTTCAGCCGTGCGGGCGTCAGTGCCAGACGGTCAAGCATCGCTTCGCCAAGGCCATTGGCACGCGCGTCAGCAACATCCTGGGCGTTAGCGTTGAGGATGATTTCGCTTTGTGCTTCCAGTTCATCGGCGATTTTTTCCAGCACGCGATTTTTTTCGCGGCTGGAGAGTTGCGCTAATTTATACGAGGCTTGCTTCGCGGCAATGCCCATTTGTTCCAGCATCAGCCTGCTCCTTAACGGGTAATCATGTCATCACGGTGAACGGCAACCGGGCCGTATTCATATCCCAGTATTGCATCAATTTCTTGCGAGTGGTGCCCGGCAATACGGCGTAATGCATCGCTGTTGTAACGACTGACGCCGTGGGCGATATCGCGACCTTCGAGGTTGCAAATGCGGATGACTTCACCACGCGAGAAATTGCCAGTCACGCTTTTAATGCCTTTCGGCAACAGGGAGCTGCCGCGTTCAAGAATGGCGGCAGTTGCCCCTTCATCTACCGTGATTTCACCCGCCGGCGGCGCACCGAAAATCCAGCGTTTACGGTTTTCAAGCGGAGTCGCCTGGGCATGGAACAGCGTACCGACGGAAATGCCTTCCATCACATCACCAATAACGCCCGGCTTGCTGCCCGCGGCAATAATGGTGTCGATACCCGCACGGCAAGCCACGTCAGCGGCCTGCAATTTGGTACTCATGCCGCCAGTTCCGAGGCCTGAAACGCTGTCACCGGCAATCGCGCGCAGTGCGTCATCAATGCCGTAAACATCTTTAATCAGTTCTGCCTGCGGATTGCTGCGCGGATCAGCGGTATACAAACCTTTTTGATCGGTCAGCAGCAACAGTTTATCGGCACCCGCCAGAATCGCCGCCAGCGCAGAAAGGTTATCGTTATCGCCGACCTTAATCTCTGCCGTAGCGACAGCATCGTTCTCATTGATTACCGGAACGATATTGTTATCGAGCAACGCACGCAGGGTGTCGCGGGCGTTCAGGAAGCGTTCACGGTCTTCCATATCAGCACGGGTCAGCAGCATCTGCCCGACGTGAATGCCATAAATCGAAAACAGCTGTTCCCACAGTTGAATCAGTCGACTCTGCCCTACCGCCGCCAGCAGTTGTTTCGAGGCGATAGTCGCTGGCAGTTCCGGGTACCCCAGGTGCTCACGTCCGGCGGCGATCGCGCCCGACGTCACAATAACAATCCGATGCCCGGCGGCATGTAACTGCGCGCACTGGCGAACAAGTTCAACGATATGGGCACGGTTCAGACGGCGCGATCCGCCTGTTAGCACACTGGTGCCGAGTTTTACCACCAGCGTCTGGCTGTCACTCATGATTCTCTGCCATTCAATTTTAGGAAAAATGATATCAAACGAACGTTTTAGCAGGACTGTCGTCGGTTGCCAACCATCTGCAAGCAAAGCATGGCGTTTTGTTGCGCGGGATCAGCAAGCCTAGCGGCAGTTGTTTACGCTTTTATTACAGATTTAATAAATTACCACATTTTAAGAATATTATTAATCTGTAATATATCTTTAACAATCTCAGGTTAAAAACTTTCCTGTTTTCAACGGGGCTCTCCCGCTGAATATTCGCGCGTTAATTAAAATCAGGAATGAAAATGAAAAAGAGCACTCTGGCATTAGTGGTGATGGGCATTGTGGCATCTGCATCCGTACAGGCCGCAGAAATATATAACAAAGACGGTAATAAACTGGATGTCTATGGCAAAGTTAAAGCCATGCATTATATGAGTGATAACGACAGTAAAGATGGCGACCAGAGTTATATCCGTTTTGGTTTTAAAGGCGAAACACAAATTAACGATCAACTGACTGGCTATGGCCGTTGGGAAGCGGAGTTTGCCGGAAATAAAGCGGAGAGTGATACTGCACAGCAAAAAACGCGTCTCGCTTTTGCCGGATTGAAGTATAAAGATTTGGGTTCTTTCGACTATGGCCGTAACCTGGGCGCGTTGTATGACGTGGAAGCCTGGACCGATATGTTCCCGGAATTTGGTGGCGACTCCTCGGCGCAGACCGACAACTTTATGACCAAACGCGCCAGCGGTCTGGCGACGTATCGGAACACCGACTTCTTCGGCGTTATCGATGGCCTGAACTTAACCCTGCAATATCAAGGGAAAAACGAAAACCGCGACGTTAAAAAGCAAAACGGCGATGGCTTCGGCACGTCATTGACATATGACTTTGGCGGCAGCGATTTCGCCATTAGTGGTGCCTATACCAACTCAGATCGCACCAACGAGCAGAACCTGCAAAGCCGTGGCACTGGCAAGCGTGCAGAAGCATGGGCAACAGGTCTGAAATACGATGCCAATAATATTTATCTGGCAACTTTTTATTCTGAAACACGCAAAATGACGCCAATAACTGGCGGCTTTGCCAATAAGACACAGAACTTTGAAGCGGTCGCTCAATACCAGTTTGACTTTGGTCTGCGTCCATCGCTGGGTTATGTCTTATCGAAAGGGAAAGATATTGAAGGTATCGGTGATGAAGATCTGGTCAATTATATCGATGTCGGGGCTACATATTATTTCAACAAAAATATGTCAGCGTTTGTTGATTATAAAATCAACCAACTGGATAGCGATAACAAATTGAATATTAATAATGATGATATTGTCGCGGTTGGCATGACCTATCAGTTTTAA
Protein sequences of DBSCAN-SWA_8 >CP034966|3575358:3639669|3599297_3599966_+|QAS91290.1|DBSCAN-SWA MKKHLLPLALLFSGISPAQALDVGDISSFMNSDSSTLSKTIKNSTDSGRLINIRLERLSSPLDDGQVISMDKPDELLLTPASLLLPAQASEVIRFFYKGPADEKERYYRIVWFDQALSDAQRDNANRSAVATASARIGTILVVAPRQANYHFQYANGSLTNTGNATLRILAYGPCLKAANGKECKENYYLMPGKSRRFTRVDTADNKGRVALWQGDKFIPVK >CP034966|3575358:3639669|3612625_3613405_+|QAS91301.1|DBSCAN-SWA MENKKSCVYCKSTTNLTKEHIFPSAIIKSFNVELLSMTDKSDYHFKGDPVIGDVCAECNNGILSQLDADFVTCFKNQMLTPLKPGNEITFEYEYDLLLRELLKISYNSARASNGGYNARAILEKYIPFIITGNKNKNVDSIILSLLIVTSANMVNLETGKHEEPLEPYLLRSASIDGLNLNPNNYIVRMVAFNSFWFFLLIPKRPVTSKVKKEFWDEFKRKNHLHGVLLKRNNTSIKITKDKTTYLHPDLIEKMWRKIK >CP034966|3575358:3639669|3619468_3621613_-|QAS91308.1|protease|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGDDGLEGVSYIPYKDIVGVWTVCHGHTGKDIMLGKTYTEAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGGDVFEGIAIFNALKTHGASITVYVDGVAASMASVIAMVGNPVIMPENTFMMIHKPFGFTGGDAEDMRTYADLLDKVEAVLLPAYAQKTGKTTDEIAAMLADETWMSGAECLAHGFADQVTPAVKAMACIQSKRTEEFKKMPESIRNMITPPRNSAPRVQDNESEASRTPVQAAAPVVDENSIRAQVLAEQKARVNGINDLFAMFGGRYQTLQAQCLADPECSLEQAREKLLNEMGRESTPSNKNTPAHIYAGNGNFVGDGIRQALMARAGFEKTERDNVYNGMTLREYARMSLTERGIGVSGYNPMQMVGAAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLESAAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVKGADINAGIINPVKDFATVIAEPRLDDNSQTTFYLAASKGSDTIEVAYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA >CP034966|3575358:3639669|3633168_3633705_+|QAS91328.1|DBSCAN-SWA MSFIKTFSGKHFYYDRINKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYGLPTVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFKVIPLAPGHAYGMFMERFNELSELRKCA >CP034966|3575358:3639669|3577520_3578108_+|QAS91272.1|DBSCAN-SWA MPKLGMQSIRRRQLIDATLEAINEVGMHDATIAQIARRAGVSTGIISHYFRDKNGLLEATMRDITSQLRDAVLNRLHALPQGSAEQRLQAIVAGNFDETQVSSAAMKAWLAFWASSMHQPMLYRLQQVSSRRLLSNLVSEFRRELPREQAQEAGYGLAALIDGLWLRAALSGKPLDKPLAHSLTRHFITQHLPTD >CP034966|3575358:3639669|3626852_3627842_-|QAS91316.1|DBSCAN-SWA MRALLTPEIAPRMGIVLFRPGSELMPLFMQGRVLLEPEPEHYSSFASGAVPAASQPLADDPAVWAVFRNEAVIRRAGGVECLESWLLREKGCQWPHSDWHSENMTTMRHAPGAIRLCWHCDNQLRDQFTERLESMATDNCARWVLSVVRRDLGFDDSHVVTMPELCWWLIRNDLADALPESAARKALRLPKPVVPSVTRESDLVPSVPATSIIQDKAKKVLALKVDPESPESFMLRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHADTVAFEEMYGSQLELIFRFIDRALAIGVLA >CP034966|3575358:3639669|3638613_3639669_+|QAS91335.1|DBSCAN-SWA MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNDSKDGDQSYIRFGFKGETQINDQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNENRDVKKQNGDGFGTSLTYDFGGSDFAISGAYTNSDRTNEQNLQSRGTGKRAEAWATGLKYDANNIYLATFYSETRKMTPITGGFANKTQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGIGDEDLVNYIDVGATYYFNKNMSAFVDYKINQLDSDNKLNINNDDIVAVGMTYQF >CP034966|3575358:3639669|3575358_3577392_-|QAS91271.1|holin|DBSCAN-SWA MTDLSHSREKDKINPVVFYTSAGLILLFSLTTILFRDFSALWIGRTLDWVSKTFGWYYLLAATLYIVFVVCIACSRFGSVKLGPEQSKPEFSLLSWAAMLFAAGIGIDLMFFSVAEPVTQYMQPPEGAGQTIEAARQAMVWTLFHYGLTGWSMYALMGMALGYFSYRYNLPLTIRSALYPIFGKRINGPIGHSVDIAAVIGTIFGIATTLGIGVVQLNYGLSVLFDIPDSMAAKAALIALSVIIATISVTSGVDKGIRVLSELNVALALGLILFVLFMGDTSFLLNALVLNVGDYVNRFMGMTLNSFAFDRPVEWMNNWTLFFWAWWVAWSPFVGLFLARISRGRTIRQFVLGTLIIPFTFTLLWLSVFGNSALYEIIHGGAAFAEEAMVHPERGFYSLLAQYPAFTFSASVATITGLLFYVTSADSGALVLGNFTSQLKDINSDAPGWLRVFWSVAIGLLTLGMLMTNGISALQNTTVIMGLPFSFVIFFVMAGLYKSLKVEDYRRESANRDTAPRPLGLQDRLSWKKRLSRLMNYPGTRYTKQMMETVCYPAMEEVAQELRLRGAYVELKSLPPEEGQQLGHLDLLVHMGEEQNFVYQIWPQQYSVPGFTYRARSGKSTYYRLETFLLEGSQGNDLMDYSKEQVITDILDQYERHLNFIHLHREAPGHSVMFPDA >CP034966|3575358:3639669|3590711_3594968_-|QAS91284.1|DBSCAN-SWA MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDVEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLLTLSAGHKQGKSGENDTRFGLEVNYRIGEPLAKQLDTDSIRERRVLAGSRYDLVERNNNIVLEYRKSEVIRIALPERIEGKGGQTLSLGLVVSKATHGLKNVQWEAPSLLAEGGKITGQGSQWQVTLPAYRPGKDNYYAISAVAYDNKGNASKRVQTEVVITGAGMSADRTALTLDGQSRIQMLANGNEQRPLVLSLRDAEGQPVTGMKDQIKTELAFKPAGNIVTRSLKATKSQAKPTLGEFTETEAGVYQSVFTTGTQSGEATITVSVDGMSKTVTAELRATMMDVANSTLSANEPSGDVVADGQQAYTLTLTAVDSEGNPVTGEASRLRFVPQDTNGVTVGAISEIKPGVYSATVSSTRAGNVVVRAFSEQYQLGTLQQTLKFVAGPLDAAHSSITLNPDKPVVGGTVTAIWTAKDAYDNPVTSLTPEAPSLAGAAAVGSTASGWTNNDDGTWTAQITLGSTAGELEVMPKLNGQDAAANAAKVTVVADALSSNQSKVSVAEDHVKAGESTTVTLIAKDAHGNTISGLSLSASLTGTASEGATVSSWTEKGDGSYVATLTTGGKTGELRVMPLFNGQPAATEAAQLTVIAGEMSSANSTLVADNKAPTVKMTTELTFTVKDAYGNPVTGLKPDAPVFSGAASTGSERPSAGNWTEKGNGVYVATLTLGSAAGQLSVMPRVNGQNAVAQPLVLNVAGDASKAEIRDMTVKVNNQLANGQSANQITLTVVDSYGNPLQGQEVTLTLPQGVTSKTGNTVTTNAAGKVDIELMSTVAGEHSITASVNNAQKTVTVKFKADFSTGQATLEVDGSTPKVANDNDAFTLTATVKDQYGNLLPGAVVVFNLPRGVKPLADGNIMVNADKEGKAELKVVSVTAGTYEITASAGNDQPSNAQSVTFVADKTTATISSIEVIGNRAVADGKTKQTYKVTVTDANNNLLKDSDVTLTASSENLVLDPKGTAKTNEQGQAVFTGSTTIAATYTLTAKVEQANGQVSTKTAESKFVADDKNAVLAASPERVDSLVADGKTTATMTVTLMAGVNPVGGSMWVDIEAPEGVTEKDYQFLPSKADHFSGGKITRTFSTSKPGVYTFTFNALTYGGYEMTPVKVTINAVAAETENGEEEMP >CP034966|3575358:3639669|3581490_3582159_+|QAS91275.1|DBSCAN-SWA MREQIKQDIDLIEILFYLKKKIRVILFIMAICMAMVLLFLYINKDNIKVIYSLKINQTTPGILVSCDSNNNFACQTTMTEDVIQRITTFFHTSPDVKNREIRLEWSGDKRALPTAEEEISRVQASIIKWYASEYHNGRQVLDEIQTPSAINSELYTKMIYLTRNWSLYPNGDGCVTISSPEIKNKYPAAICLALGFFLSIVISVMFCLVKKMVDEYQQNSGQ >CP034966|3575358:3639669|3596082_3596184_+|QAS91285.1|DBSCAN-SWA MKENKVQQISHKLINIVVFVAIVEYAYLFLHFY >CP034966|3575358:3639669|3602506_3604150_+|QAS91292.1|DBSCAN-SWA MRVNLLITMIIFALIWPVTELRAAVSKTTWADAPAREFVFVENNSDDNFFVTPGGALDPRLTGANRWTGLKYTGSGTIYQQSLGYIDNGYNTGLYTNWKFDMWLENSPVSSPLTGLRCINWYAGCNMTTSLILPQTTDASGFYGATVTSGGAKWMHGMLSDAFYQYLQQMPVGSSFTMTINACQTSVNYDASSGARCKDQASGNWYVRNVTHTKAANLRLINTHSLAEVFINSDGVPTLGEGNADCRTQTIGSLSGLSCKMVNYTLQTNGLSNTSIHIFPAIANSSLASAVGAYDMQFSLNGSSWKPVSNTAYYYTFNEMKSADSIYVFFSSNFFKQMVNLGISDINTKDLFNFRFQNTTSPESGWYEFSTSNTLIIKPRDFSISIISDEYTQTPSREGYVGSGESALDFGYIVTTSGKTAADEVLIKVTGPAQVIGGRSYCVFSSDDGKAKVPFPATLSFITRNGATKTYDAGCDDSWRDMTDALWLTTPWTDISGEVGQMDKTTVKFSIPMDNAISLRTVDDNGWFGEVSASGEIHVQATWRNIN >CP034966|3575358:3639669|3619058_3619382_-|QAS91307.1|DBSCAN-SWA MAKNFVEEGKTVAIVASAAISSGDLVQVGDVFAVALTDIPQGETGDGMTEGVFILPKLKTDDMKTGKKVYLKSGKVQLTNSGSDPLVGVVWADAGTSAEEVPVKLNV >CP034966|3575358:3639669|3608389_3610588_+|QAS91298.1|DBSCAN-SWA MKFDKPAGENPIDQLKVVGRPHDRIDGPLKTTGTARYAYEWHEEAPNAAYGYIVGSAIAKGRLTALDTDAAQKAPGVLAVITASNAGALGKGDKNTARLLGGPTIEHYHQAIALVVAETFEQARAAASLVQAHYRRNKGAYSLADEKQAVNQPPEGTPDKNVGDFDGAFSSAAVKIDATYTTPDQSHMAMEPHASMAVWDGNKLTLWTSNQMIDWCRTDLAKTLKVPVENVRIISPYIGGGFGGKLFLRSDALLAALAARAVKRPVKVMLPRPTIPNNTTHRPATLQHLRIGADQSGKITAISHESWSGNLPGGTPETAVQQSELLYAGANRHTGLRLATLDLPEGNAMRAPGEAPGLMALEIAIDELAEKAGIDPVEFRILNDTQVDPADPTRRFSRRQLIECLRTGADKFGWKQRNATPGQVRDGEWLVGHGVAAGFRNNLLEKSGARVHLEPNGTVTVETDMTDIGTGSYTILAQTAAEMLGVPLEQVAVHLGDSSFPVSAGSGGQWGANTSTSGVYAACVKLREMIASAVGFDPEQSQFADGKITNGTRSAMLHEATAGGRLIAEESIEFGTLSKEYQQSTFAGHFVEVGVHSATGEVRVRRMLAVCAAGRILNPKTARSQVIGAMTMGMGAALMEELAVDDRLGYFVNHDMAGYEVPVHADIPKQEVIFLDDTDPISSPMKAKGVGELGLCGVSAAIANAVYNATGIRVRDYPITLDKLLDKLPDVV >CP034966|3575358:3639669|3629485_3630043_-|QAS91320.1|DBSCAN-SWA MGNHHWKVEKQPEWYVKAVRKTIAALPGGYAEAAEWLDVTENALFNRLRADGDQIFPLGWAMILQRAGGTHFIADAVAQSANGVFVSLPDVEDVDNADINQRLLEVIEQIGSYSKQIRSAIEDGVVEPHEKTAINDELYLSISKLQEHAALVYKIFCVSESNDARECAAPGVVASIASGCGETNA >CP034966|3575358:3639669|3630393_3631086_+|QAS91323.1|DBSCAN-SWA MNIGNRVRQLRQAKNMKIADLAEAIGVDAANISRLETGKQKQFTEQALSNIARSLGVDIADLFTSDVKSNTVCKNSISEDVAQVKDVFRIEMLDVSASAGNGLIQGGDVIDVIHAIEYRTDNAVSMFGGRPANHIKVINVRGDSMCPTIEPGDLIFVDVSINQFDGDGIYVFGFDDKIYVKRLQMIPDKLLVISDNQIYREWGITSENEHRFMVFGKVLISQSQTLKRHN >CP034966|3575358:3639669|3584527_3585247_-|QAS91278.1|DBSCAN-SWA MNVNFFVTCIGDALKSRMARDSVLLLEKLGCRVNFPEKQGCCGQPAINSGYIKEAIPGMKNLIAALEDNDDPIISPAGSCTYAVKSYPMYLADEPEWASRAAKVAARMQDLTSFIVNKLGVVDVGASLQGRAVYHPSCSLARKLGVKDEPLTLLKNVRGLELFTFAEQDTCCGFGGTFSVKMAEISGEMVKEKVAHLMEVRPEYLIGADVSCLLNISGRLQREGQKVKVMHIAEVLMSR >CP034966|3575358:3639669|3637222_3638326_-|QAS91334.1|DBSCAN-SWA MSDSQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQLHAAGHRIVIVTSGAIAAGREHLGYPELPATIASKQLLAAVGQSRLIQLWEQLFSIYGIHVGQMLLTRADMEDRERFLNARDTLRALLDNNIVPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLTDQKGLYTADPRSNPQAELIKDVYGIDDALRAIAGDSVSGLGTGGMSTKLQAADVACRAGIDTIIAAGSKPGVIGDVMEGISVGTLFHAQATPLENRKRWIFGAPPAGEITVDEGATAAILERGSSLLPKGIKSVTGNFSRGEVIRICNLEGRDIAHGVSRYNSDALRRIAGHHSQEIDAILGYEYGPVAVHRDDMITR >CP034966|3575358:3639669|3621612_3621828_-|QAS91309.1|lysis|DBSCAN-SWA MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >CP034966|3575358:3639669|3579607_3581278_+|QAS91274.1|holin|DBSCAN-SWA MQFDYIIIGAGSAGNVLATRLTEDPNTSVLLLEAGGPDYRFDFRTQMPAALAFPLQGKRYNWAYETEPEPFMNNRRMECGRGKGLGGSSLINGMCYIRGNALDLDNWAQEPGLENWSYLDCLPYYRKAETRDMGENDYHGGDGPVSVTTSKPGVNPLFEAMIEAGVQAGYPRTDDLNGYQQEGFGPMDRTVTPQGRRASTARGYLDQAKSRPNLTIRTHAMTDHIIFDGKRAVGVEWLEGDSTIPTRATANKEVLLCAGAIASPQILQRSGVGNAELLAEFDIPLVHELPGVGENLQDHLEMYLQYECKEPVSLYPALQWWNQPKIGAEWLFGGTGVGASNHFEAGGFIRSREEFAWPNIQYHFLPVAINYNGSNAVKEHGFQCHVGSMRSPSRGHVRIKSRDPHQHPAILFNYMSHEQDWQEFRDAIRITREIMHQPALDQYRGREISPGVECQTDEQLDEFVRNHAETAFHPCGTCKMGYDEMSVVDGEGRVHGLEGLRVVDASIMPQIITGNLNATTIMIGEKIADMIRGQEALPRSTAGYFVANGMPVRAKK >CP034966|3575358:3639669|3605141_3605471_+|QAS91294.1|DBSCAN-SWA MSNLNPCMTCGACCAFFRVSFYWAEADDAGGTIPARLTEQISPFHRCMSGTNQKNPRCIALAGTPGKNACCTIYKNRSSTCREFAMSGENGVVNEACNRARAKYGLTPL >CP034966|3575358:3639669|3606750_3607440_+|QAS91296.1|DBSCAN-SWA MSNQGEYPEDNRVGKHEPHDLSLTRRDLIKVSAATAATAVVYPHSTLAASVPAATPAPEIMPLTLKVNGKTEQLEVDTRTTLLDALRENLHLIGTKKGCDHGQCGACTVLVNGRRLNACLTLAVMHQGAEITTIEGLGSPDNLHPMQAAFIKHDGFQCGYCTSGQICSSVAVLKEIQDGIPSHVTVDLVSAPETTADEIRERMSGNICRCGAYANILAAIEDAAGEIKS >CP034966|3575358:3639669|3623540_3624701_+|QAS91312.1|DBSCAN-SWA MTTLKELKEELAQIQDEHAKNRKKAEIAALTASADNEIRLAQRNIGYNVREWTVEVIIQKYGDNIENDKNELFIPDYQRDYKWDTKTASRFIESILLDFPIPYLYIADVFNEDPELDGRVEIIDGSQRIRSIYYFWNDQFELKDLKELKSLEGFKFSDLLASRQRRFLRASLRFIELKGDVEEQHRRDLFERINSGVKRLEAMEVRHGSDAATSMFYKDVVTPCSTNSLFSQLAPLSDRKRSNGDHRELVLRFFAYLNDLENYKGFVRPFIDNYLNEQAAAVTTQQDVDNFKHDFEMMLAFVATHFPIGFKKTATSKTTPRARYEAIAVGTALALKTNPQLQSPAVPVGEWLFEEEFETLVTADSANNTSQLKNRIFYVKNKLLGI >CP034966|3575358:3639669|3615920_3616025_-|QAS91304.1|capsid|DBSCAN-SWA MGLCRRNVVCNNVLRLAGERSIVRELNDFRPLSI >CP034966|3575358:3639669|3634278_3634713_+|QAS91331.1|DBSCAN-SWA MGYDSRLDRLAATSWYPFFNNVTARGEIMEPYSLTLDEACHFLKISRPTAINWIRTGRLQATRKDPTKNKSPYLTTRQACIAALQSPLHTVQVSAGDGITEERKCHSSAEVKYGTPVSHCRTVKDLNSLLEQRTKGRRQNSMTS >CP034966|3575358:3639669|3631788_3632151_+|QAS91326.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGH >CP034966|3575358:3639669|3629130_3629310_-|QAS91319.1|DBSCAN-SWA MRQVNRWFKDHYGVPVRVIRWEPETQRVIYLREGYEHECFSPLEQFRRKFREIEVGHEH >CP034966|3575358:3639669|3596547_3596811_+|QAS92355.1|DBSCAN-SWA MKPNIHPEYRTVVFHDTSVDEYFKIGSTIKTDREIELDGVTYPYVTIDVSSKSHPFYTGKLRTVASEGNVARFTQRFGRFVSTKKGA >CP034966|3575358:3639669|3630035_3630296_-|QAS91321.1|DBSCAN-SWA MQSPLRNVRKAHGFTLQHVAAGVQVNPATLSRIERLEQIPSIDLAERLANFFKGEISEMQILYPARFQSSQNQNGFKPQEQEVSRG >CP034966|3575358:3639669|3627849_3628197_-|QAS91317.1|DBSCAN-SWA MSLLNEVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTRRHFPRLTERAQEPEPQPVRETRPVRNFYVGTNDSRVILCLTRQAEKLVPELLKLLDTVH >CP034966|3575358:3639669|3626481_3626835_-|QAS91315.1|DBSCAN-SWA MRDMYEVLDRWGAWAAAENSGVDWQPVAAGFKRLLPHGRKSRIQCDDDEGIIIDGCVARLRKYKPAEYELIIAHFVVGISLRTIAKKRKCSDGTVRKDMQTAMGFIDGCLSFFYLIQ >CP034966|3575358:3639669|3618790_3619066_-|QAS91306.1|DBSCAN-SWA MSDPFSRLAARMDAITVRKMGKTASINDVDMAVIPGETLAELNALSGPAVSLVVFSSGYRPRRGDRVVYDGQQWTVTRHERFNGKPMIFIE >CP034966|3575358:3639669|3625361_3626402_-|QAS91314.1|DBSCAN-SWA MNIQAVDIFCGAGGLTFGLKKAGIEVSHGIDIDESCRFAIESNNPLTKFINQSVTELQSSDVSAMFKEGNIRLLAGCAPCQPFSKYRNPNSRKDDTKWRLLSEFQRIVSDVMPELVTMENVPQLRNHKVFEEFVSVLKTLGYHLWYDVVKCSEYGLPQNRRRLILIGSILGPISLDQKKVSRKVTVKDAIGQLPKIGAGEKLESDPLHRSPKLRDINLKRILHSLPGGTWDDWPEEIRADCHKKHSGATYKSVYGRMVWDDTSPTITTQCYGYGNGRFGHPEQNRAITLREAAILQSFPMDYKFIGKNTPFSFQKLGTMIGNAVPPIIGQVIGRTFVRHVEGINIR >CP034966|3575358:3639669|3596810_3596951_+|QAS91286.1|DBSCAN-SWA MKVLNSLRTAKERHPDCQIVKRKGRLYVICKSNPRFKAVQGRKKKR >CP034966|3575358:3639669|3607436_3608393_+|QAS91297.1|DBSCAN-SWA MKAFTYERVNTPAEAALSAQRVPGAKFIAGGTNLLDLMKLEIETPTHLIDVNGLGLDKIEVTDAGGLRIGALVRNTDLAAHERVRRDYAVLSRALLAGASGQLRNQATTAGNLLQRTRCPYFYDTNQPCNKRLPGSGCAALEGFSRQHAVVGVSEACIATHPSDMAVAMRLLDAVVETITPEGKTRSITLADFYHPPGKTPHIETALLPGELIVAVTLPPPLGGKHIYRKVRDRASYAFAQVSVAAIIHPDGSGRVALGGVAHKPWRIEAADAQLSQGAQAVYDTLFASAHPTAENTFKLLLAKRTLASVLAEARAQA >CP034966|3575358:3639669|3583089_3584517_-|QAS91277.1|DBSCAN-SWA MSIKTSNTDFKTRIRQQIEDPIMRKAVANAQQRIGANRQKMVDELGHWEEWRDRAAQIRDHVLSNLDAYLYQLSEKVTQNGGHVYFARTKEDATRYILQVAQRKNARKVVKSKSMVTEEIGVNHVLQDAGIQVIETDLGEYILQLDQDPPSHVVVPAIHKDRHQIRRVLHERLGYEGPETPEAMTLFIRQKIREDFLSAEIGITGCNFAVAETGSVCLVTNEGNARMCTTLPKTHIAVMGMERIAPTFAEVDVLITMLARSAVGARLTGYNTWLTGPREAGHVDGPEEFHLVIVDNGRSEVLASEFRDVLRCIRCGACMNTCPAYRHIGGHGYGSIYPGPIGAVISPLLGGYKDFKDLPYACSLCTACDNVCPVRIPLSKLILRHRRVMAEKGITAKAEQRAIKMFAYANSHPGLWKVGMMAGAHAASWFINGGKTPLKFGAISDWMEARDLPEADGESFRSWFKKHQAQEKKNG >CP034966|3575358:3639669|3611532_3611943_+|QAS91300.1|DBSCAN-SWA MPAPVVLILAAGRGERFLASGGNTHKCIGWRQSPEVAPYRWPFEENGRTFDLAIEPQITTNDLRLMLQLALVGEGITIATQETFRPYIESGKLVSLLDDFLPQFPGFYLYFPQRRNIAPKLRALIDHVKEWRQQLA >CP034966|3575358:3639669|3610597_3611554_+|QAS91299.1|DBSCAN-SWA MSYPLFDKDEHWHKPEQAFLTDDHRTILRFAVEALMSGKGAVLVTLVEIRGGAARPLGAQMVVREDGRYCGFVSGGCVEAAAAFEALEMMGSGRDREIRYGEGSPWFDIVLPCGGGITLTLHKLRSAQPLLAVLNRLEQRKPAGLRYDPQAQSLVCLPTQTRTGWNLNGFEVGFRPCVRLMIYGRSLEAQATASLAAATGYDSHIFDLFPASASAQIDTDTAVILLCHDLNRELPVLQAAREAKPFYLGALGSYRTHTLRLQKLHELGWSREETAQIRAPVGIFPKARDAHTLALSVLAEVASVRLHQEEDSCLPPSS >CP034966|3575358:3639669|3596985_3597213_-|QAS91287.1|DBSCAN-SWA MQAAPQREETEWRVQSKRGLMPAYRGEAGQQVNIKIMEYSERNVRQLASNEQEEYIPRKINVGVINTPTLIRSDY >CP034966|3575358:3639669|3605718_3606333_-|QAS91295.1|DBSCAN-SWA MNIFEQTPPNRRRYGLAAFIGLIAGVVSAFVKWGAEVPLPPRSPVDMFNAACGPESLIRAAGQIDCSRNFLNPPYIFLRDWLGLTDPNAAVYTFAGHVFNWVGVTHIIFSIVFAVGYCVVAEVFPKIKLWQGLLAGALAQLFVHMISFPLMGLTPPLFDLPWYENVSEIFGHLVWFWSIEIIRRDLRNRITHEPDPEIPLGSNR >CP034966|3575358:3639669|3586854_3588180_+|QAS91280.1|DBSCAN-SWA MNKYQAVIIGFGKAGKTLAVTLAKAGWRVALIEQSNAMYGGTCINIGCIPTKTLVHDAQQHTDFVRAIQRKNEVVNFLRNKNFHNLADMPNIDVIDGQAEFINNHSLRVHRPEGNLEIHGEKIFINTGAQTVVPPIPGITTTPGVYDSTGLLNLKELPGHLGILGGGYIGVEFASMFANFGSKVTILEAASLFLPREERDIADNIATILRDQGVDIILNAHVERISHHENQVQVHSEHAQLAVDALLIASGRQPATASLHPENAGIAVNERGAIVVDKRLHTTADNIWAMGDVTGGLQFTYISLDDYRIVRDELLGEGKRSTDDRKNVPYSVFMTPPLSRVGMTEEQARESGADIQVVTLPVAAIPRARVMNDTRGVLKAIVDNKTQRILGASLLCVDSHEMINIVKMVMDAGLPYSILRDQIFTHPSMSESLNDLFSLVK >CP034966|3575358:3639669|3578121_3579594_+|QAS91273.1|DBSCAN-SWA MSRMAEQQLYIHGGYTSATSGRTFETINPANGNVLATVQAAGREDVDRAVKSALQGQKIWASMTAMERSRILRRAVDILRERNDELAKLETLDTGKAYSETSTVDIVTGADVLEYYAGLIPALEGSQIPLRETSFVYTRREPLGVVAGIGAWNYPIQIALWKSAPALAAGNAMIFKPSEVTPLTALKLAEIYSEAGLPDGVFNVLPGVGAETGQYLTEHPGIAKVSFTGGVASGKKVMANSAASSLKEVTMELGGKSPLIVFDDADLDLAADIAMMANFFSSGQVCTNGTRVFVPAKCKAAFEQKILARVERIRAGDVFDPQTNFGPLVSFPHRDNVLRYIAKGKEEGARVLCGGDVLKGDGFDNGAWVAPTVFTDCSDDMTIVREEIFGPVMSILTYESEDEVIRRANDTDYGLAAGIVTADLNRAHRVIHQLEAGICWINTWGESPAEMPVGGYKHSGIGRENGVMTLQSYTQVKSIQVEMAKFQSIF >CP034966|3575358:3639669|3582401_3583097_-|QAS91276.1|DBSCAN-SWA MDNRGEFLNNVAQALGRPLRLEPQAEDAPLNNYANERLTQLNQQQRCDAFIQFASDVMLTRCELTSEAKAAEAAIRLCKELGDQSVVISGDTRLEELGISERLQQECNAVVWDPAKGAENISQAEQAKVGVVYAEYGLTESGGVILFSAAERGRSLSLLPEYSLFILRKSTILPRVAQLAEKLHQKAQAGERMPSCINIISGPSSTADIELIKVVGVHGPVKAVYLIIEDC >CP034966|3575358:3639669|3588536_3589130_+|QAS91282.1|DBSCAN-SWA MEKYLHLLSRGDKIGLTLIRLSIAIVFMWIGLLKFVPYEADSITPFVANSPLMSFFYEHPEDYKQYLTHEGEYKPEARAWQSANNTYGFSNGLGVVEVIIALLVLANPVNRWLGLLGGLMAFTTPLVTLSFLITTPEAWVPALGDAHHGFPYLSGAGRLVLKDTLMLAGAVMIMADSAREILKQRSNESSSTLKTEY >CP034966|3575358:3639669|3634589_3635753_+|QAS91332.1|integrase|DBSCAN-SWA MSLFRRGEIWYASFTLPNGKRFKQSLGTKDKRQATELHDKLKAEAWRVSKLGEIPDITFEEACVRWLEEKAHKKSLDDDKSRIGFWLQHFAGMQLRDITESKIYSAMQKMTNRRHEENWKLRAEACRKKGKPVPEYTPKPASVATKATHLSFIKALLRAAEREWKMLDKAPIIKVPQPKNKRIRWLEPHEAQRLIDECPEPLKSVVEFALATGLRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVALNDTACRVLKKQIGNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWKAALRRAGIDDFRFHDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYAHLAPNHLTEHARQIDSILNPSVPNLSQSKNKEGTNDV >CP034966|3575358:3639669|3589720_3590572_+|QAS91283.1|DBSCAN-SWA MIRQKILQQLLEWIECNLEHPISIEDIAQKSGYSRRNIQLLFRNFMHVPLGEYIRKRRLCRAAILVRLTAKSMLDIALSLHFDSQQSFSREFKKLFGCSPREYRHRDYWDLANIFPSFLIRQQQKTECRLINFPETPIFGNSFKYDIEVSNKSPDEEVKLRRHHLARCMKNFKTDIYFVSTFEPSTKSVDLLTVETFAGTVCEYADMPKEWTTTRGLYASFRYEGNWENYPDWVRNIYLIELPARGLARVNGSDIERFYYNEDFVEKDGNDVVCEIFIPVRPV >CP034966|3575358:3639669|3598652_3599240_+|QAS91289.1|DBSCAN-SWA MKKKVLAIALVTVFTGMGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEGIKGFNSQKGLFDVAIEGDSTATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEKTGDTVMIDTANGVLGGNLSPLANGYNASNRTTAQDGFTFTIISGTTNGTTAVTDYSTLPEGIWSGDVSVQFDATWTS >CP034966|3575358:3639669|3632216_3633041_+|QAS91327.1|DBSCAN-SWA MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERHSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAVRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >CP034966|3575358:3639669|3614546_3615821_-|QAS91303.1|DBSCAN-SWA MANAILWYEIKNIGSFEDEGGFVDLTTTAKDKKKELWVDVDGHKVNLITAIMGANASGKTTLLKPMSFLSWFFWSIPAKVTDYLYLNINRPFTKPGMIKICFVLDGKVYTYVVTACDHFVIKEELYVRNEKNKNIYVFKRKLDKENYEKQLNNCSDDFSHQEVEELLKTIKYDYVEKAELFPLGLLEGKRTPANTSIISAARRVGVPLAIDIAEKMSSTTNVNAMGRYSYDYGDLGATAEDLYKDPVAFKSVKNILRKWDLGLDDITIEKEEKVDINGEKDVYYIINGIHEKEDGSKFELPFAFESAGTQSAFIRLHNIMQCLKKGTACFIDELGDDLHPHMVKPILELFISKETNPLHAQLIFTCHKPELINYLGKYRVLITEKKFNRSECYRLDDFPSSEARVDDNIAAKYLAGAFGGVPDL >CP034966|3575358:3639669|3633695_3634058_+|QAS91329.1|DBSCAN-SWA MRMNVFEMEGFLRGRCVPRDLKVNETDAEYLVRKFDALEAKCAALENKVIPVSTELPPANESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA >CP034966|3575358:3639669|3631141_3631420_+|QAS91324.1|DBSCAN-SWA MHISQQKHLHNAQLHFIFLSRQTNKVLTKFGCNTAYGTCVVSGLVTLKGTIHSLLEQTNQVAGMCKSMMMLIRRLTSVAIRFDAWEETRVQR >CP034966|3575358:3639669|3598035_3598578_+|QAS91288.1|DBSCAN-SWA MECQNRSDKYIWSPHDAYFYKGLSELIVDIDRLIYLSLEKIRKDFVFINLSTDSLSEFINRDNEWLSAVKGKQVVLIAARKSEALANYWYYNSNIRGVVYAGLSRDIRKELAYVINGRFLRKDIKKDKITDREMEIIRMTAQGMQPKSIARIENCSVKTVYTHRRNAEAKLYSKIYKLVQ >CP034966|3575358:3639669|3624703_3625393_+|QAS91313.1|DBSCAN-SWA MSLIDLRDEYEERARDIMELLSLASSIEIHTQQLDPQAHQDEIESNILRVNILKSSVHMMLYNQVENTARGCIESIYDHLQDNEVNYASLREKLQVNILHRIVSDNETGQSLYKKIGTDISKRIISASLNIRKEFNGNVCKPVLHKITQAYGITIANSPECRNGIDLDLLKDIRNELAHGSTSFSKKGQIDPLEEVKSRAERVDLYLRLLINSTEDYIISNGYLSPQHA >CP034966|3575358:3639669|3630267_3630420_-|QAS91322.1|DBSCAN-SWA MSDSVSYVHAFITFLYCALSKSTCAKYLREIICLSRNMWRLYAITITECA >CP034966|3575358:3639669|3616211_3618779_-|QAS91305.1|tail|DBSCAN-SWA MKGLENAIRNLNSLDTRMVPQASAWAINRVAQKAVSVATRQVAGNTVAGDNQVKGIPLKVAVLAQTARTELYIPDGFVKQTAAPSGYVESPVVRIYDQLNKPTKADLGLSNAMLTGAFGLGGSGISTNGKMSDVEILKALRDKGGHFWRGDRPTGSTATIYSHGSGIFSRCGDTWSAINIDYSTAKIKIYAGNDARLNNGTFSINELYGSANKPSKSDVGLGNVTNDAQVKKSGDVMSGDLDILKETPSIRLKSAKGTAHLWFMNNDGSERGVVWSPENNESLGEIHIRAKNTKGESSGDFIVRHDGRVEARNLKITYKISAATAEFANTSTSSDNTTVSIKGSQHTPLVLTSNNTIKNLSIGFKVDDVDQKYLGIAGDGDLYFGSYSDHTKNSKVITQAKLDSGVTVGGKTTFSDLATFNAGMAGSIEPETIDNKTIDLNDLIIANTVAGSVKYYQCKTVAGGAYITNKPDGVSGNFLLRVESTRKTTGSDYAIMQTLIGSDTKRIYVRFVVNGSWTEWSQVVVSGWNQDVTVRSLTSTTPSKLGGGRVDVLGSTSDYSSMNCAVRGVDSTGTNSAWSVGTSKNTGKMLCLKNHRSSAQVLLNGDDGAVQLLSGTVNGATAQALTINKDEVNSTADLVIRKQTGTGNRFALLNSGNSELPVGIRVWGSSTRQNVFEVGTSAAYLFYAQKTSAGQLFDVNGAINCTTLNQSSDRDLKDDILVISDATKAIRKMNGYTYTLRENGMPYAGVIAQEVMEAIPEAVGSFTHYGEELQGPTVDGNELREETRYLNVDYAAVTGLLVQVARETDDRVTALEEENTTLRENLATADTRISTLENQVSELVALVRQLTGSEH >CP034966|3575358:3639669|3588288_3588525_+|QAS91281.1|DBSCAN-SWA MFKKSVLFATLLSGVMAFSTNADDKTILKHISVSSVSASPTVLEDAIADIARKYNASSWKVTSMRIDNNSTATAVLYK >CP034966|3575358:3639669|3623097_3623292_-|QAS91311.1|DBSCAN-SWA MLKQQDMTETAKVVFNELSIEPATVGEIAQNTYLSRERCQLILTQLVMAGLADYQFGCYRRLQQ >CP034966|3575358:3639669|3628199_3629141_-|QAS91318.1|DBSCAN-SWA MSTKLTGYVWDGCAASGMKLSSVAIMARLADFSNDEGVCWPSIETIARQIGAGMSTVRTAIARLEAEGWLTRKARRQGNRNASNVYQLNVAKLQAAAFSQLSDSDPSKSDASKSDTSKFDASKSGKKAGFHPSESGGDPSVKSKHDPSDKKPSRPDASQPDTQTAEQDFLTRHPDAVVFSPKKRQWGTQDDLTCAQWLWKKIIALYEQAAECDGEVVRPKEPNWTAWANEIRLMCVQDGRTHKQICEMYSRVSRDPFWCRNVLSPSKLREKWDELSLRLSPSISTYTEKREDPYFKSSYDSVDYSQIPAGFRG >CP034966|3575358:3639669|3635957_3637211_-|QAS91333.1|DBSCAN-SWA MLEQMGIAAKQASYKLAQLSSREKNRVLEKIADELEAQSEIILNANAQDVADARANGLGEAMLDRLALTPARLKGIADDVRQVCNLADPVGQVIDGSVLDSGLRLERRRVPLGVIGVIYEARPNVTVDVASLCLKTGNAVILRGGKETCRTNAATVAVIQDALKSCGLPVGAVQAIDNPDRALVSEMLRMDKYIDMLIPRGGAGLHKLCREQSTIPVITGGIGVCHIYVDESVEIAEALKVIVNAKTQRPSTCNTVETLLVNKNIADSFLPALSKQMEESGVALHADAAALAQLQTGPAKVVAVKAEEYDDEFLSLDLNVKIVSDLDDAIAHIREHGTQHSDAILTRDMRNAQRFVNEVDSSAVYVNASTRFTDGGQFGLGAEVAVSTQKLHARGPMGLEALTTYKWIGIGDYTIRA >CP034966|3575358:3639669|3604118_3604829_+|QAS91293.1|DBSCAN-SWA MFRRRGVTLTKALLTAVCMLAAPLTQAISVGNLTFSLPSETDFVSKRVVNNNKSARIYRIAISAIDSPGSSELRTRPVDGELLFAPRQLALQAGESEYFKFYYHGPRDNRERYYRVSFREVPTRNQTRRSPTGGEISTEPVVVMDTILVVRPRQVQFKWSFDKVTGTVSNTGNTWFKLLIKPGCDSTEEEGDAWYLRPGDVVHQPELRQPGNHYLVYNDKFIKISDSCPAKPPSAD >CP034966|3575358:3639669|3621895_3622948_-|QAS91310.1|DBSCAN-SWA MKNTVKINSVDLINADCLHFIQSLPDDSIDLIVTDPPYFKVKPNGWDNQWKGDEDYLKWLDHCLAQFWRVLKPAGSLYLFCGHRLASDIEIMMRERFNVLNHIIWAKPSGRWNGCNKESLRAYFPATERVLFAEHYQGPYRGKSDGYAAKERELKQHIMAPLISYFRDARAELGVTAKQIAEATGKKNMVSHWFGASQWQLPNEADYRKLQALFSRIAAEKFQEQQLEQPHHQLVASYDSLNRKYSELLDEFKSLRRYFSVSVSVPYTDVWMHKPVQFYPGKHPCEKPADMLRQIINASSRPGDLVADFFMGSGSTIKAAMALGRRALGVELESERFNQTVKEINELVGK >CP034966|3575358:3639669|3614016_3614550_-|QAS91302.1|DBSCAN-SWA MMKPRRVKRGVERTKLLMCEGITDKRFADCLKRLLTTRTAGFSVRLDDAGGGGPKSAIMAAINHAGSFDKRVVFIDSDLAIPTDALSAARNRDIQIIQSSPFCLEGFLMRLMGHNQQFVNSQEAKESFHRLYNLKNVVTQEWYEEYITLQHINSVLSDHRHVCREVLSDLCDVFTVF >CP034966|3575358:3639669|3585774_3586629_-|QAS91279.1|DBSCAN-SWA MDALSRLLMLNAPQGTIDKNCVLGSDWQLPHGAGELSVIRWHALTQGAAKLEMPTGEIFTLRPGNVVLLPQNSAHRLSHVDNESTCIVCGTLRLQHSARYFLTSLPETLFLAPVNHSVEYNWLREAIPFLQQESRLAMPGVDALCSQICATFFTLAVREWIAQVNTEKNILSLLLHPRLGAVIQQMLEMPGHAWTVESLASIAHMSRASFAQLFRDVSGTTPLAVLTKLRLQIAAQMFSREMLPVVVIAESVGYASESSFHKAFVREFGCTPGEYRERVRQLAP >CP034966|3575358:3639669|3634057_3634363_+|QAS91330.1|DBSCAN-SWA MTTFTDKELIKEIKERISSLDVRDDIERRAYEIALLSLEVEPDERESYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTLGWIVWQQRAGIHFSTMSQQEVK >CP034966|3575358:3639669|3631573_3631774_+|QAS91325.1|DBSCAN-SWA MTTPPQQPLLWRYQFVPLLPAGTALFTKQRRASPDDGLITQSIRAAVTAGVLLCFVEKPTDLAGSI >CP034966|3575358:3639669|3599991_3602517_+|QAS91291.1|DBSCAN-SWA MPLRRFSPGLKAQFAFGMVFLFVQPDASAADISAQQIGGVIIPQAFSQALQDGMSVPLYIHLAGSQGRQDDQRIGSAFIWLDDGQLRIRKIQLEESEDNASVSEQTRQQLMALANAPFNEALTIPLTDNAQLDLSLRQLLLQLVVKREALGTVLRSRSEDIGQSSVNTLSSNLSYNLGVYNNQLRNGGSNTSSYLSLNNVTALREHHVVLDGSLYGIGSGQQDSELYKAMYERDFAGHRFAGGMLDTWNLQSLGPMTAISAGKIYGLSWGNQASSTIFDSSQSATPVIAFLPAAGEVHLTRDGRLLSVQNFTMGNHEVDTRGLPYGIYDVEVEVIVNGRVISKRTQRVNKLFSRGRGVGAPLAWQVWGGSFHMDRWSENGKKTRPAKESWLAGASTSGSLSTLSWAATGYGYDNQAVGETRLTLPLGGAINVNLQNMLASDSSWSSIGSISATLPGGFSSLWVNQEKTRIGNQLRRSDADNRAIGGTLNLNSLWSKLGTFSISYNDDRRYNSHYYTADYYQNVYSGTFGSLGLRAGIQRYNNGDSNANTGKYIALDLSLPLGNWFSAGMTHQNGYTMANLSARKQFDEGTIRTVGANLSRAISGDTGDDKTLSGGAYAQFDARYASGTLNVNSAADGYINTNLTANGSVGWQGKNIAASGRTDGNAGVIFNTGLEDDGQISAKINGRIFPLNGKRNYLPLSPYGRYEVELQNSKNSLDSYDIVSGRKSHLTLYPGNVAVIEPEVKQMVTVSGRIRAEDGTLLANARINNHIGRTRTDENGEFVMDVDKKYPTIDFRYSGNKTCEVALELNQARGAVWVGDVVCSGLSSWAAVTQTGEENES |
66 | Shigella_phage(31.25%) | lysis,capsid,tail,holin,protease,integrase | attL 3612056:3612115|attR 3635754:3635813 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_9 |
3940173 : 3956532
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >CP034966|3940173:3956532|DBSCAN-SWA ATTATTTAGAGGCAAGCGCCGCCTCAATTGCAGATAATCTTTGTTTTAATTCTGCGTTTTCTTCTTCCAGCGAGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAAACCCGTCACGGCGGCGTAGTCAACATTAAGATAGCGCGTTTCTTCGCGTAGCTCGTTGCCGTCAACGGTCGGGCCTTGCAACTCTTCACCGTAATGGATGAATGAACCAACGGCCTCCGGGATGGCCTCTTCAACTTCCTGGGCTATAACCCCGGCGCAATGTGCCCCATTCTCCTTGAGCGTGTACGTGTATCCATTAATTTTGCGGATTGCGTCAGTCGCGTTATCGATAATCTCGATATTTTCTTTCAGTCTGCGGTCTGATGACTGATTCAGTGTGGTGCAGTTAACACTTCCGTTTACAGTAAGGTTTTGCCCGTCTGTTGTTTTTTGCGCATAAAACAGATACGCAGCAGACGTGCCAACCTCAAAAACGTTTTGTCGAGTACTGGAACCCCACACCCTGATACTAACTGGTAGTTCTGAATTACCTGAATTAAGTAAAGCAAAACGATTGCCAGTCCCTGTTTGTTTTCTAATTACTAAATCGGCAGTTGAGTTAACCTCATCTTTGTTGATGGTTAGCGCCTGTGCTGTAGCACCGTTGACAGTACCGCTTAGTAGTTGCACCGCGCCATCATCGCCATTTAACAGCACTTGAGCGCTGCTTCTGTGGTTTTTAAGGCACAACATTTTTCCCGTGTTTTTCGATGTACCTACTGACCACGCCGAATTGGTTCCAGTGCTATCAACACCGCGCACAGCACAATTCATACTACTGTAATCTGACGTACTCCCCAGCACATCAACACGCCCGCCGCCTAATTTTGATGGAGTCGTCGAGGTTAACGACCTGACGGTTACATCCTGATTCCATCCTGAAACAACTACCTGACTCCACTCCGTCCAACTTCCATTGACAACAAAGCGAACGTATATGCGTTTTGTGTCGCTGCCAATCAGCGTTTGCATGATCGCATAATCTGAACCCGTAGTTTTACGAGTAGATTCTACACGTAGCAAAAAGTTACCGCTTACGCCGTCAGGCTTATTGGTAATATATGCACCACCTGCGACAGTTTTGCATTGATAGTATTTAACAGATCCAGCCACTGTATTAGCAATGATCAAGTCGTTTAAATCAATAGTCTTGTTGTCAATGGTTTCCGGCTCGATAGATCCCGCCATACCTGCGTTAAATGTGGCAAGGTCAGAAAAGGTTGTTTTACCGCCTACCGTCACCCCGCTATCGAGTTTTGCTTGTGTGATTACTTTGCTGTTTTTTGTGTGGTCAGAATAACTACCAAAATACAAATCACCGTCACCAGCTATACCCAGGTATTTTTGATCAACATCATCAACCTTAAACCCAATGGACAAGTTTTTAATTGTGTTGTTGCTCGTTAAAACCAAAGGCGTATGCTGAGATCCTTTGATGCTTACCGTAGTGTTATCGGAACTGGTGCTTGTGTTTGCAAATTCTGCGGTGGCTGCGCTGATTTTGTAAGTTATTTTTAGATTGCGGGCCTCAACCCTCCCGTCGTGGCGAACAATAAAATCACCACTTGATTCACCTTTTGTGTTTTTCGCCCTGATGTGGATTTCGCCAAGTGATTCGTTGTTTTCAGGCGACCAAACAACGCCGCGCTCGCTTCCGTCGTTGTTCATGAACCACAGATGAGCGGTTCCTTTTGCTGATTTTAGCCTGATAGATGGCGTTTCTTTCAATATATCAAGATCACCAGACATAACATCGCCGGATTTTTTCACCTGCGCATCGTTAGTTACGTTGCCAAGTCCAACATCCGATTTCGACGGCTTGTTTGCCGAGCCGTATAGCTCATTGATGCTAAAAGTCCCGTTGTTAAGCCGGGCATCGTTGCCGGCATAGATCTTAATCTTCGCGGTTGAGTAGTCGATATTGATCGCTGACCACGTATCGCCGCACCGCGAGAATATACCAGAACCGTGGCTATAAATGGTCGCCGTGCTTCCGGTCGGCCTATCACCGCGCCAGAAATGACCACCTTTGTCACGCAGAGCTTTTAAGATCTCTACATCGCTCATCTTGCCGTTTGTTGATATCCCGCTACCGCCAAGACCGAAAGCGCCTGTAAGCATAGCATTAGAAAGACCCAAATCTGCTTTAGTCGGCTTGTTTAACTGGTCGTAAATCCTTACAACGGGGCTTTCAACATATCCAGAAGGCGCGGCGGTTTGCTTAACAAATCCGTCTGGAATGTATAATTCCGGGCGTGCCGTCTGCGCCAGTACCGCAACCTTTGCGCCGTTGATAAATGCGCGCTGGAAGGCCCAAACCTCAATAAAGCCGTCACCTTTAACCAGGCCGTAACGCATCTGGTTGTTATCGGTCAGCCCGGTTGACCCTAAGCGGCGTATACTCAGATGACGAGAAACATTATCCGCACTAAGCAAAGAAGGCAGACCGCGCGCCGAGATCTCGATAAAGTCAATGTTTCCGTAAGGGGAGCCGTAGTTACCAGCGTTAGTAACCATTAGCGTTACATGGCAATTGCCGCTTCCGGGATCGGAAAGTTTCGCGATCTTTATGTAAAAAGACTCATTACCAGTAACAACGGGCCAATCGTATTGCGTCATCGGGCTGACGAGTCCCCCGACCTTGTCGAGATATTCTTTCGCCTTGTTCTCTGACGCTTTAGCGTTGGTTTCGCTGACCTTTGCTGCTGCCTCACTATTTTTCGCGTTGGTTTCTGATTTTTTGGCTGCTGTCGCGGAGTTTGCCGATGCAGTTTGTGAGTCTGCTGCCGCCTGTGCGCTGTTATCCGCATTCGTCTCAGACGTTTTTGCGGCCTTCGCGGAATTTCCTGCTGCCGTTGCCGAGGAAGCTGCACTGCCGGCGCTCGAGGCAGCGCTCGTTTCTGATGATTTTGCCGCCACTTTTGAAGCCGACGCATCCCGGGCTGAGGTGGCAGCTTCTGACGCTTTCGTGGTCGCGGTGGATGCAGAAGTGGCCGCAGATTTTTGTGATGCTGCGGCATTCGTTTCTGACGTTTTCGCTGCACCGGCACTGGTGGCTGCCGCGCTTTTTGAAGACTCTGCAGCGGCAGCACTTTTTGATGCTTCAGTAGCCTTTGTTGATGCCGTTCCTGCGCTGGAAGACGCTGACTGAGCCGACGACGCGGCCTGTCCGGCTGACGTGCTGGCTGCGCGTGCGGAGTCCGCAGCATCAGTCGCATGGGTTGCCGCCTCACGGGCTGATGTGCGGGCATCACTGGCTGACTTCTTCGCGGCTGCCGTGTTCTGTGCTACTGCGGACGCGTTACGCGCCACCTCTTCCACCATCAGTTCAAAGCGGCGCAGTGCCTCCGGACGGACATCATCCTCCGTCATGGCACCGAGAAAATCATTCAGCGTACCCGGTTGTGAGTCTTCATACACGGTGATGGTCCCGGCATGCGATGGCGGGAAGCCTTCCACCAACAGAATAACGCTGTACTGACCGTACTCAACGTCCATGCTGTAACGCCCGGCTTCATCCGGGTTTTCTGAGGCCACTGTGTTCACCACCACCGTGGTGCTGTTGCGCCTGGCCTTTAGCTGAATGGTGCAGTTTTGTATCGGCTTACCTGCGCCATCTTTCAGTACACCTGAAATCCGTACTGCCATATTCCCCCCACAAAAAAACCCGCCTGAACCGGCGGGCTGTCATAACACTGTGTTACCTGGCTAATCAGAATTTATAACCGACACCCACGATGAAACCGTCAGTGCGCCAGTCACCACTGCCGGAACCTTCATAAGCAATATCAATGGCCACGGATTCGGTCGGGTTAAACTGCACGCCAGCCCCCCACGCCAGAGACGTGTTGCTGTGGCGACCGTCATCACTTCCGGTCAGCACATCGTGCGTTTTCCCCTTGTTGTCAGTTACGCGGAGATAATCCCCGGAGAAAGTCGACACACGGCTGTAAGCCACACCCGCCATCGCATACGCGCTGAACCATTCATTCACGCGCACAGACGGCCCCGCCATCACGCTGAACCAGCGGTTACGCACGGAATCCTCATGCCAGCGGGTATCGCTGTAATGCGTTTTTTGCTCATCTTCAGCGTTGGCATAACTGAATGACGTAATCAGCCCCAGCGTGTCCGTAAACTCATAACGGTATTTCACGTTAATCCCGTTCAGATTATCGCTGCCGGGAGCGTTCGTACGGGCATGAAGATACCCTGCGCTCAGTGTGGCCTGCTGCTCAGACGCCCATGCAGGCGCACCGGATACGGCCAGACAGATGGCTGCGGACAAAATGGCTGCACAAACTTTACGCATAATTACCTCTCGCTTTTCTGCAATAAAAAAGGCGCCAGAAATGGCGCCCGCATATGGGTTATGAAAATTCAGCTAATCGTGATACCTGCTGTGGATTTCTTCATCACCACAACCAGCAAATCACTGATACTGGCTGTGGGATACCAGTCATTTACCAGCCACGCTGATACCGAAAACTCCAGCGTCATGTGACCGCGACCAGCAGGCATATCAATAACACCTGTATAACCTTCTGTATCTGCTGAATTGCCGCACTCTGGTCTTCCAGTTTCTTATTGACGGTCTGTGTGATTTCATTGCTGACATCCGTAATGGATGTCCTGATTTCAGCCAGGTCAGGCGCAAGCTGACCGTTATCAATCTGCGTCCACAGCTCCTGAGCCAGATGGGTTTTCCCTATCTCGCCTTTGAAAAAATCCAGATAGCCGGATGCGTCATCACTCGGCTGACCAACAGCCTCCACGAATGCCGATTTGCCAACGGTGTTCACACTGCGAACGTAAAAATAATAATTATGGCCCGGTTTGATATTGATACTGGCGGCTATCCAGTACAACGCCGTACCAAGATAGCGTGCTGTGGTTTCAACCTGCCTGATATCCGCAATCCGCTTTTCCGAGAACCAGAACTCAAACTGTACCGTCGGATCATAAACCGCAAGATGCGGCGTGGCGGTTATCTGAAAATAGCCCGGCGTCAGCTCAATCCGCGACGGCGCTGCCGGTGCGGCAATCCGGAACGATACCGACGCCGGATCGCCCTGCTGCCCCCACGCATTTACTGCCCGGACTGTCAGCCTGTAGTTCCCCAGAGCCAGTTGTGTGAAGCGGTAAGTGGTTTCCGTCGTCCGGGCCGTGCTGACCAGCCGCTCACTGCCGTCATCCGCTGCCACGGTCAGGCGAAGCAGGAAGCTCACGCCCTTCACCACCTTCGGCGTGTCCCAGCGGGCCAGCACCTGGTATTCCCCGCTGTCTGCGGTGACTTCTGCGGTCAGATGCTGCACTGCTGGCGGCGTGACACCATTCACCGTGCCGCTCTGGTCGCCGTCAAAGTGCGCCCCGTTATCCACGATGGCCTCTTTTTCCGGTACATGCTGCACGGCAGTGATGGCATACGTGCCGTCATCGTTCTCACGGATACTCACGCAGCGGAACAGGCGCTGGCGCAGCGTCGGCAGCTTCAGCCCCCACACGCTGTACTCGGCAACGCCGTCAGGAAGCCGGCTCACTTTCACCTTCACGCCGTCGGTGATGGACTGGACCTCCACGCTGACCGGATTGCCACTTCCGTCAACCAGGCTTATCAGCGTGGTACCGGAGGATGGCAGCGTGATTTCACGGTCGAGCGTCAGCGTCCGGGTCTGGCTGTTCACCGCCAGCACGCGCCCGCCGGTGCTGATACCGGCATAGTCATCATCGCAGATTTCAATGACATCGCCCGGTACATGGCGAAGCCCTTCAGCACCCACGCTGAAGTCCACGGTCTGCGTTTCCAGCAGCTCCGTTTTAATCAGCCACAGCCCGGCGCGGTGCGCCTGCCCCCGACTGGTACAGCCAAACGCATCCATCTTCGTGACGTTACGACCATAACGGGCAATGGCCTGCGAGTCCTCCACAAGCTCTGTTGCCGTCTCCCAGCCGTTATCCGGGTCAATCCAGTTCACTTCAACGGCATTATGGCGGTCTTTCAGGGCGCTGAAGCTGTAGCGGAATGGTGCGCCATCATCCGGCATCACCACATTACTGCGGTTATAGGTCCACACCTTATCCGATGGTCGGTCCTGCACGAACGTCAGCGTCTGCCCGTTCCATACCGGCATACAGCGCATCGCCGAGCAGAAATCACTGAGCACATCCCACGCCTTACGTTGTGTGGTCAGGTAAGCGTTACAGGTGATGCGCGGCTCCGTACCGCCAAAGCCGTCCGGCACCGACTGGTCGCAATTCTGGCCGATGACATACAGCGCCCATTTGTCCACATCCGCCGCACCGAGACGTTTCCCCATGCCGTAGCGCGGGTGGGTCAGCATATCCCACAGACACCAGGCCATGTTGTTGCTGTATGACGGTTTAAACGTCCCGTCCCAGATACCGCTGTATTGCCGCGTCTGCGGGTTATAATTCGACGGCACCTGCAGAATACGCCCGCGAAGATGATAATTACGGCTCACCTGCTGGCTGCCGAACTGCTCCGAATCCACCTGTACGCCGACCAGTGCCGTGTTCGGGTAGCACTGTTTCACATCGATGATTTCGGTGTATGACGACCAGAGCGTTTTGTTCTGCAGCTGGTCTGTGGTGCTGTCCGGCGTCATCCTGCGCATCCGGATGTTGAACGGGCGCGGCGGCAGATTATCCACCACTACCGAGGCCAGATACTGCGAGGTGGTTTTGCCCTTAATGGTGATGTCTTTTTCCGTCACCCAGCCACCGTTACGCTGTATCTGAACCAGCAGGCGGACTTCCGACGGATTCCGGTCCCCCTTTGAAGTGGTTTCCACCAGTGCCTGCACACCGAAGGTAAAGCGCAGACGGTCAATGTTTGCCGACGTGATGGTCCGGGTGATCGGCGTGTCGTATTTCACTTCCGTGCCCAGTACCGTCTCGGAACCTGATGATTCAAATCCCTCCGGCGGTGTCTGTTCCTGCTCACCTGCCCGGAACACCACCGTGACACCGGAGATGTTAGTATTCCCCTCACTGTCCAGCACCGGTGTACTGTTCAGCAGCACGCTTTTTAATCCATCCACCGGACCTTCAACCGGCCCTTCGCTGATGGCATCGATCACACTCAGCAACTGCGTGGACTTCAGGTTGTCCTTTGCTTCGCGCGGGGTATGCCCCTTACTGCTGCCTTTACCCATTCCTCACGCCCCATAAACGATAAAACCGCCCGGAGGCGGTTTCACATAAAACGTTTTTCATCAGCGACCAATCACCACAACCTGACCACCGTCCCCTTCGTCTGCCGTGCTGATCTCCTGAGAAACCACGCGTGATCCCACGCGCATTTCACCGTACAGAACGGGCAGAACATTGCCCTGGGCAACCATATTATCCAGTGAAGAAAAATAGGTGTTCTGTTTGCCGTTATCCGTTGTCTGTGTGCGGGGGGTTTTGGGTTTAGGGGCCAGCATCTGTGCAACACCGCCAAGCGTCATACTGGCACCGAGAGAAAACAGCAGATTACTCGCCATAATTCCTACCCCCGGCATCCATATAGCAACCGCCATAACAGCCGCCCCCAGCACAGCCTGAAACACACCGCCACTTTTGGCTCCTGCCAGACGCGGCACGATATGGATCACAGCACCATTTGCCAGCGGTTCATTAAGACGGGCTGATAATTCCGTTTCACCTGTATCACGCCCGGCAATGCGTACCTGATACCAGCCGTCGCTCAGCTTCTGACGAAACGCCGGGAGCTGTGTGGCCAGTGCCCGGATGGCTTCAGCCCCCGTTTTCACACGCAGATCGATGCGGCGGCCAAATCGTTGTAAATCCCCGTAAAGGCAGATGCGCGCCATGCCCGGTGACGCCAGAGGGAGTGTGTGCGTCGCTGCCATTTGTCGGTATACCTCTCTCGTTTGCTCAGTTGTTCAGGAATATGGTGCAGCAGCTCGCCGTCGCCGCAGTAAATTGCGGCGTGATTCGGCACCGATGAACCAAAACAGCACAGCAGCACATCGCCCGGCTGTGCCGCTGACAACGGCACCTGATACAGCCCCGTCGCCTCCAGATTATCCAGATAGAGATTCTGGCCGTTACGCCACCAGTCATCCTCACGATGAAAGTCCGGCATCTCAATCCCCGCCAGATGATAAGCATCCCGGAACAGTGTGTAACAGTCCGTCACACCGTGCTCAAAGCGCCGCCCGGTGAGATGCGGCACACAGCGGAACTTATGAATCGTCCCCCGGCAGACCAGCCACCACGGCAAATCACTCTGCACCTGCAGCCGCCGGTCGGCCTCACTCAGCCAGGGCAGACCACCGGGGTGGCTGTGGACCAGCGCCACAATCTCACCCTGCATTTCTGCCTGCAGCCAGTCTTCCGGCGACATACGGAAATACGCCTCCGGCTCACCGGAGATATTCACGCAGGGGAAATATCTTTCCCCCTCCGGCGTGCTTACCACGAAGCCGCACGACTCCGCTGGCGCACATCGCCGGGCGTGCGCCAGAATCGCTGATTCTGTCTGTGTCATGGGATTTACTGCGAAAGTTTGTTAATGGAAAGGAAGCCGCCAAAGTTGCCGACGTTATTGCGGAACTTACAACCGCTCAGGCATTTGCTGCATTTATCCTTCGTGATATCGGACGTTGGCTGGTCATATTCATCCGCGACAGCCGGACCGCTATAACCGCACTCGTCACCGCGATAGGTCCAGGTGCAGGTGTTGGCCAGCATGATACGTCCCGGAAAAACAGCGCCATCCGTTTCCGTCGGCGTGGACAGTACAAAGGAGGCACTCACCGCGCTCAGTTCGCTGCACTGCTCAATGCGCCAGCGGCTGATCACCTCCTGCTCCGGATCGGCATAACTGTTTCCGTTGACGAAGTTCACCGCATCCAGAAAACGGGCGTAAACATTACGCCGGACCACCGTTCCGCCGACCAGACTCTGCATATCTTCCGCCATCCCGGTGACCATACCGTACAGGTTAGAAACCGTCAGCGTGGGGCGCGTACCGCTCTTTTTACAAAACAGAGAAGAGCATCACCGGACGACGGGCTCATAACCCAATCCATCCGGGCGGCAGTCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACCAACCGACCTTGCAGGGTCGATATGATGGGGAGCAGCAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATCGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCAGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCGCTCAACCTGGAAATCGAACCGTTTGATGAGAACCGTGTGAAAATCAAACACAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAAGACCAGGGACAATTACTGACTCTTGCCGGTGAACCTGACGGAAAACTCCGCGCAGCAGGTCATTAATATCGTTCTTAATTAACTGATTATTTATCTCATCACTGAATATCTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACGCAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAAATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGCATCGATGACTTTACCCGTTATTCTAAAGATCTTGCAGATGAAGGCACCCGCTGCTTTATCGATGCTGATAATATGCGAGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACTCTCAAACTGAAAAAGACAGCACCGTTCTCTGCCCTGTTGTCTGTTAACGGCGAGCGTAACTCCCAGAAGTCACTGGCAGAATGGATTGAAGACTGGGCCGACTACCTTGTGGGCTTTGATGCTAATGGTGACGCCATTCAGGCAACAAAAGCGGCTGCGGCGGTCCGTAAAATCACGATTGAAGCAAACCAGACCGCTGATTTTGAAGACAATGACTTCAGCGGCAAACGCTCTCTGATGGAGTCTGTCGAAGCGAAAACCAAAGACATTATGCCAGTAGCATTTGAGTTTAAATGCGTTCCATTTGAAGGCCTGAAAGAACGTCCGTTTAAATTACGCCTCAGCATTATCACTGGTGATCGCCCTGTACTGGTTCTGCGCATTATTCAGCTGGAAGCAGTACAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAAAAATTCAAAGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCATTTATGGAAACGTAATTAACTCAATAATCGCCTGATGGCGAGGGTTTTCTTTAACCAAAATTCAGCGCGGTGCAGCGCATATACGTGGAGAACAAAATGTCATTTATTAAAACTTTTTCCGGGAAGCATTTTTATTATGACAGGATAAATAAAGACGACATCGATATTAACGATATCGCGGTTTCCCTTTCAAATATCTGTCGCTTTGCCGGTCATCTTTCGCACTTCTACAGCGTCGCCCAACATGCGGTTCTTTGCAGCCAGCTGGTGCCGCAGGAATTTGCTTTTGAAGCGTTAATGCATGATGCAACAGAAGCGTATTGCCAGGACATTCCCGCACCACTGAAACGCCTTCTTCCTGACTATAAACGGATGGAAGAAAAAATAGACGCCGTAATCCGTGAGAAATACGGGTTACCCCCGGTTATGAGCACGCCAGTGAAATATGCCGATCTCATCATGCTGGCAACCGAACGCCGCGATCTCGGACTTGATGATGGCTCTTTCTGGCCTGTACTGGAAGGTATCCCGGCAACAGAGATGTTCAAAGTGATTCCACTGGCTCCAGGCCATGCCTACGGGATGTTTATGGAACGTTTTAACGAGTTATCGGAGTTACGCAAATGCGCATGAATGTTTTCGAAATGGAAGGGTTTCTTCGCGGGAAATGTGTACCGCGAGATCTGAAAGTGAACGAAACAAATGCTGAGTACCTGGTACGTAAATTCGATGCGCTTGAAGCTAAATGTGCGGCACTGGAAAACAAAATAATACCAGTGTCAGCTGAACTACCACCAGCAAATGAAAGTGTTCTGTTATTTGATGCTAACGGAGAAGGCTGGCTGATTGGCTGGCGTTCTCTCTGGTACACCTGGGGACAAAAAGAAACCGGAGAATGGCAGTGGACATTTCAGGTCGGGGACATTGAAAGCGTCAATATCACTCACTGGGCAGTAATGCCAAAAGCACCGGAGGCTGGAGCATAATGACCACATTTACCGATAAAGAACTGATTAAAGAAATCAAAGAGCGTATCAGCAGCCTGGAGGTTCGAGACGATATTGAGCGCCGTGCTTATGAAATTGCTCTGGCATCGCTGGAAGAGGAGCCGGTGGCATGGCTACATTCAGACAATGGCTTAGGTATTCCAGCAATAACCAGGAGTAAAAACATTGCTGACAGTTGGTTATCAAAGGGCTGGTATGTTCAGCCGCTATATATAGCCAAGCCAGTGCCGGTGGTGCCAGATGCTCGTCCGACTTTAAATAATGGCATAGTCGGTTTTGATGAAGGCTGGAACGCCTGCCGCACCACCATGCTTCACGGTGTCAAACCTGTAAGCCAGACTTACAAGTTGAACAAGCTGTCTGACAACTCTCCGGTAACTCCGGATGGTTGGATAAGCTGTAGTGAGCGAATGCCGAACGATAAACAATATGTTTGGTGTTGGGGTAAGTCTTACGGCTGGACTGAGTGCGATACCTTCGAAGGGTATTACGATTGTTCGAGAAACAAATGGTGGGCAGTTACTGACGATGGGGAAGAACCGGCATTGAAAGTAACCCACTGGATGCCGCTACCGGAGCCACCGCAGGAGGTGAAGTAATGAACAACTTAATGACAACTAAACAAGTCGCCGACTTCTGTGGCGTTTCAGTATCGACAGTTCTTCGCTGGAACAGCGTAAACAGGAGAACTGGCCAGAAATATAGGCCTGACTTTCCAGATCCTGATATTAAATCCTGCCCCAATAAATGGGCATCACACAAGATATACAGGTTTGCGGGAGTTATTGAGTAATACGTATTAGTTCAGATGTGAGCTAACACATCTATGGCACAGAGCTAAACCTAATCTGACTGTCTACTCTGTGCCATAAGTGGATGTTAATTACTTCTTGGTATTTTAATGGTTGGATAGCTTATCAAGATTACTAATCACCCTGTGATGTAAAATAAAACCAATGGTATTCTAGAGCTTGCATAGACAACCCAACCTTGTCCCTCAATAGGACACCATTCAGCTCCTGATAGTACCTATCGCGCACCATATCACTTACGTGAATCCTTAAAGTCGCTGGGTCAATAGATATTAGATAGTTATCAAAGAGACGATGTATATCAGTACGCAATAACAAGCCATTTTTTATATGATTATCCTTATCTCCTCTGTAAGGATAAATATGAGCTGCATCAAGTACTGCTATTGTTTTGCACCCAGTAAAGGCACATTGCCCCCAAACGTTTATCAGTTTATTTCTGAAGTCGCTTTGACCTATACGTATCGTGACGTTTCTATTTATAACCTTTCGTTCATCAGAGTTATTTATTTTAAAACTCCCTGCTGTTGAAAAATCAAATTTTTCAGTCGCACTAAATATTTTTTTAGATTGTTTTTTTACTTTTGCAAGCTGTAAATCTGACTTTATTTGTAAGGGTTTTGAGTTTTCTTCATCAAACGATGGAAGTTGAAGTTCTTTAATCATGCTGGTAGATAGAAAATTTTTGTAACTCATTAACTTAAAATCTAATCCTCGCTCATAACGAATTTTAGAGAGCACTATTGCAGCTTCTTCAATGTATTTAAAACTACTTTTCCTTTTATCCCATAATTGAGATAAACAATCTGCATTTAACCTATCATATCTTGATAAAATCTCACCCAAACTGCTGGCTTTAGCGGCGCTATCATAGTTTATATACTGTTTTTCTTGGGAATCTTCAATGGAATCCAGAATAAAATTTATATTTACATGATTATCTGCATCTGCATCTGCAGATGCAGATGAATCATTAAAATTATCGTAGTCAAGAGAATTTTGATCACTCCAAATAGGTAGCGATAGATAGCAAATATTATCAATGCTGTCGTCCACCAACATTAATTTTTGGTGGAGTAAAAATAATATAACTGAAGCTTCAAGGTAATTCATTTCTTGTGACAAGAATTTTTCTTTAAGCATATCAAAGTCATATTGTACGTATTCATTTTTTTTCTTTGATATTACGGAGAGTTGCTCTTCACTTAAGACCACTGGCCTGTCTTCTGATATATTACGAAGTAAGTATGAAATCAATTTTTTTTGATTTTTTTTAGAATCTTCAGACCTAGGAGGTTCGAGAAACACACCTTCAAGTCTATCCAATGCTTTAAGGCAACTTCCCGCAAGCCTAGTCATCGCATCTTTTCTTGTTTGAATTAGAGGATCTGCATAGATAACAAAATTATCCCCATTACGAATATGAGCTATTTCAAGCTGATATACATCTAGATTTTTTATAAGTTTAAATACAAATCCGTATTTTTGCATTCTATGCAAAGTCAGTGTCGGTATAATATCTTTATTAGATTCTTCATCCAAATTATAGTTTTTTATTCTTTTGTACAACCAATTTGTTGCAGAGGAAGAGCAAATTAACTCATTATATTGAATTTTACTATTTTTAGTTATCAGATTTAGAAGAGACATTCCAAACAATGCCTGAATACAATCTACATAATAAGATATAGTTGTATAATCTGGATGTCCTTTATAGGGGAGAGAATCTATTGAAATAAAACCTGAGTTATAGAATCTCATAAAGTTTTTTTCAGATAACACCATGATCCCTAACGTAGTTCTGTCAATAACATTCTGTCCATTTTTTATTATCTCAAGTAGGCTTACTGAAGCTAGTAAGGAAATATAATGCGAACCTAAAGTAGCGAGATCTCTGTGCGAACTTATAGATCTATTTTTACCCTTTATTCTAGGGGGTATAAAAGCCTGGATTGAATTAAAGTTATTTGAAAAACCAAAAGCCATGTTCAATTCATTGTTGATGTAACACTCTTTAACATTCGGTTTCTGTACGCCACCCCATGCTGAAATAATACTATTAGATTTTAATATTTGTTTTTTATTGTTTTTCAAATTCAAATGATTGCAAACTAAACTGGCAACATTATTCTCTGCTTTTTTTTTACTCTCACCACTTCCGATAAATATCTGTCCATTAGGAAGTGCTATTTCAGACTCATATATTGGTGCATGCTCGGTCCCACCAGCCAAGGTGGTTTTGTATGTAGGGATTATGTGTTGATGTCTTTGGAAATATTCATGTAAAAGAGTTTTTGGGGGTTTGCCAACAAATGACTCCGTAATAGAAAGGCTTTCAACAAAAAATTTGAGATGTAATATGATATCACAATTAGATATCGCCCCTAGGTAAAATTGAAATATTTGAATTATCATAGAAGATGAGTTGATTTCTTCTTGTTTTACTCCAGGACCAAATTCTGTTTGTTGGTAGACTTTTGTTAAGGGAAAATTTGTAACAACAAGCTCTCTTAGTTTTTTCTTCAGGCTGCTAGTCATAATGCTTAAATTGTTACCATCCGCACCTTCTATACTCAGAGACCGACAAGCGCAGATTAAATCCTCAAGCGACTTACCAACTGATACAGTTGCGTTAAACAATTCAGTATTGCAATTAAAAGATGAACGATGACTTATCTGAATATAATTTTCGTTGATATGATTTATGAATAACTTATTCATTTTCACCTCGTGTATTATTCGATAAAGATCACATTGTACGTGATATCAACTACTTGCTCAAAGCAGATCGTCATATCCTATTGCGTTCTAGCTACGTAAACTATCCTATCAGTTAGAGTCTGAGTTAGTACAAGTAACAATCGATTCAACTCTCTCCCACCATGCCTGGTAGGCTTTACGCTGTTCTTCTAGATAATCGCTCTTGTCATAAACTTGCCATACCCCTGGCAGTTTATGACCTAGCATTATTTCTGCAATATGAGGCGCAGTAAGATCAGAAAAGTTTGTTCGTGCTGTTCGTCTCAAATCATGAAGAGACCAATGAGGAAATTGATACCCCAAACGCCGCCATGCGTACTGCATTAAATTGTAAGGCAGCGACTGCAATGATGTCCGACCAACTGGTTCCCTGCTTCCTTCCTTAGTAAAAAGCATATCGGAACCGTTGTTCATAGAGATAACGTATTTTATAATCTCTTCAACCGGTTCAATAATGGGCCGCTTTAGCGGTTCGCCTGTTATATCCCCTGTCTTATGTCGTTCTGGTGGTACAGTCCATATCTTATTAATGAAATCAAAATCGTCCACCTTGGCAGTAATTAGCTCTGAACTACGGCAACCAAAATGAAGCAATAGTTTAATGAAGGCCCGGTATTTAGGAACCATTCGAGAACCATCGATCGCAGCATAAAGGATTTTAATTTCATCATGTGTCAGAAACCGTTTCTTCTGACCTTTACGGATATCCATATCTTTACCCGTGATATCCGACAGCGGGCGAGTTTCAATGAGCTTTCTCTTATACGCCCAGACATGGGCCTGCTTTGCGTTAATTAGCAATCGGTCTGCTATTGCTGGAGTCTTAGTGCTAAGAGGCTCCAGGACTTCTAACCAATCATGCAATGTAGCTGCATCGTTAGGGATATTCCCGATTTTAGAGAACAGGTGCAGCTCAAACGAGCGGAGTATCTGTTCAGAACCTTTTTTATTTTTTACACAATATGCTTCATACCAGGCACGGATCACAGACTCTACCGTCATGGCTTCAGTAGCTTTTCGTTTTTCAGCCTGCTTGACCAATCGTGGATTACGGTTTGACTCGAGTTCACCACGGAGACGGATAACTTCTTCTCTGGCCTCTTTTAATCCAGTTGCCGGGTAAGTTCCGATATCAAGACGCTCACCTTTCCCTGCCCATTGATAACGATATTGGAACACTACGCGACCTTTCGGTGATACTCTGACAGACAAACCATCACGATCGGATTTAACCAAAACCTTATCACGTTCCTTTCCAACGACTGAACGCAACCACGCATCAGACAGCGCCAT
Protein sequences of DBSCAN-SWA_9 >CP034966|3940173:3956532|3948001_3948745_-|QAS91585.1|tail|DBSCAN-SWA MTQTESAILAHARRCAPAESCGFVVSTPEGERYFPCVNISGEPEAYFRMSPEDWLQAEMQGEIVALVHSHPGGLPWLSEADRRLQVQSDLPWWLVCRGTIHKFRCVPHLTGRRFEHGVTDCYTLFRDAYHLAGIEMPDFHREDDWWRNGQNLYLDNLEATGLYQVPLSAAQPGDVLLCCFGSSVPNHAAIYCGDGELLHHIPEQLSKRERYTDKWQRRTHSLWRHRAWRASAFTGIYNDLAAASICV >CP034966|3940173:3956532|3949376_3949739_+|QAS91586.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGH >CP034966|3940173:3956532|3951645_3952266_+|QAS91590.1|DBSCAN-SWA MTTFTDKELIKEIKERISSLEVRDDIERRAYEIALASLEEEPVAWLHSDNGLGIPAITRSKNIADSWLSKGWYVQPLYIAKPVPVVPDARPTLNNGIVGFDEGWNACRTTMLHGVKPVSQTYKLNKLSDNSPVTPDGWISCSERMPNDKQYVWCWGKSYGWTECDTFEGYYDCSRNKWWAVTDDGEEPALKVTHWMPLPEPPQEVK >CP034966|3940173:3956532|3949804_3950629_+|QAS91587.1|DBSCAN-SWA MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAVRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >CP034966|3940173:3956532|3944723_3947402_-|QAS91583.1|DBSCAN-SWA MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPVEGPVDGLKSVLLNSTPVLDSEGNTNISGVTVVFRAGEQEQTPPEGFESSGSETVLGTEVKYDTPITRTITSANIDRLRFTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTEKDITIKGKTTSQYLASVVVDNLPPRPFNIRMRRMTPDSTTDQLQNKTLWSSYTEIIDVKQCYPNTALVGVQVDSEQFGSQQVSRNYHLRGRILQVPSNYNPQTRQYSGIWDGTFKPSYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWALYVIGQNCDQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMPVWNGQTLTFVQDRPSDKVWTYNRSNVVMPDDGAPFRYSFSALKDRHNAVEVNWIDPDNGWETATELVEDSQAIARYGRNVTKMDAFGCTSRGQAHRAGLWLIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISTGGRVLAVNSQTRTLTLDREITLPSSGTTLISLVDGSGNPVSVEVQSITDGVKVKVSRLPDGVAEYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFLLRLTVAADDGSERLVSTARTTETTYRFTQLALGNYRLTVRAVNAWGQQGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSEKRIADIRQVETTARYLGTALYWIAASINIKPGHNYYFYVRSVNTVGKSAFVEAVGQPSDDASGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLAEIRTSITDVSNEITQTVNKKLEDQSAAIQQIQKVIQVLLICLLVAVT >CP034966|3940173:3956532|3955308_3956532_-|QAS91593.1|integrase|DBSCAN-SWA MALSDAWLRSVVGKERDKVLVKSDRDGLSVRVSPKGRVVFQYRYQWAGKGERLDIGTYPATGLKEAREEVIRLRGELESNRNPRLVKQAEKRKATEAMTVESVIRAWYEAYCVKNKKGSEQILRSFELHLFSKIGNIPNDAATLHDWLEVLEPLSTKTPAIADRLLINAKQAHVWAYKRKLIETRPLSDITGKDMDIRKGQKKRFLTHDEIKILYAAIDGSRMVPKYRAFIKLLLHFGCRSSELITAKVDDFDFINKIWTVPPERHKTGDITGEPLKRPIIEPVEEIIKYVISMNNGSDMLFTKEGSREPVGRTSLQSLPYNLMQYAWRRLGYQFPHWSLHDLRRTARTNFSDLTAPHIAEIMLGHKLPGVWQVYDKSDYLEEQRKAYQAWWERVESIVTCTNSDSN >CP034966|3940173:3956532|3940173_3943875_-|QAS91581.1|DBSCAN-SWA MAVRISGVLKDGAGKPIQNCTIQLKARRNSTTVVVNTVASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDVRPEALRRFELMVEEVARNASAVAQNTAAAKKSASDARTSAREAATHATDAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNAAASQKSAATSASTATTKASEAATSARDASASKVAAKSSETSAASSAGSAASSATAAGNSAKAAKTSETNADNSAQAAADSQTASANSATAAKKSETNAKNSEAAAKVSETNAKASENKAKEYLDKVGGLVSPMTQYDWPVVTGNESFYIKIAKLSDPGSGNCHVTLMVTNAGNYGSPYGNIDFIEISARGLPSLLSADNVSRHLSIRRLGSTGLTDNNQMRYGLVKGDGFIEVWAFQRAFINGAKVAVLAQTARPELYIPDGFVKQTAAPSGYVESPVVRIYDQLNKPTKADLGLSNAMLTGAFGLGGSGISTNGKMSDVEILKALRDKGGHFWRGDRPTGSTATIYSHGSGIFSRCGDTWSAINIDYSTAKIKIYAGNDARLNNGTFSINELYGSANKPSKSDVGLGNVTNDAQVKKSGDVMSGDLDILKETPSIRLKSAKGTAHLWFMNNDGSERGVVWSPENNESLGEIHIRAKNTKGESSGDFIVRHDGRVEARNLKITYKISAATAEFANTSTSSDNTTVSIKGSQHTPLVLTSNNTIKNLSIGFKVDDVDQKYLGIAGDGDLYFGSYSDHTKNSKVITQAKLDSGVTVGGKTTFSDLATFNAGMAGSIEPETIDNKTIDLNDLIIANTVAGSVKYYQCKTVAGGAYITNKPDGVSGNFLLRVESTRKTTGSDYAIMQTLIGSDTKRIYVRFVVNGSWTEWSQVVVSGWNQDVTVRSLTSTTPSKLGGGRVDVLGSTSDYSSMNCAVRGVDSTGTNSAWSVGTSKNTGKMLCLKNHRSSAQVLLNGDDGAVQLLSGTVNGATAQALTINKDEVNSTADLVIRKQTGTGNRFALLNSGNSELPVSIRVWGSSTRQNVFEVGTSAAYLFYAQKTTDGQNLTVNGSVNCTTLNQSSDRRLKENIEIIDNATDAIRKINGYTYTLKENGAHCAGVIAQEVEEAIPEAVGSFIHYGEELQGPTVDGNELREETRYLNVDYAAVTGLLVQVARETDDRVTSLEEENAELKQRLSAIEAALASK >CP034966|3940173:3956532|3951283_3951646_+|QAS91589.1|DBSCAN-SWA MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPANESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDIESVNITHWAVMPKAPEAGA >CP034966|3940173:3956532|3950756_3951293_+|QAS91588.1|DBSCAN-SWA MSFIKTFSGKHFYYDRINKDDIDINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFKVIPLAPGHAYGMFMERFNELSELRKCA >CP034966|3940173:3956532|3949161_3949362_+|QAS92366.1|DBSCAN-SWA MHIFRHPGDHTVQVRNRQRGARTALFTKQRRASPDDGLITQSIRAAVTAGVLLCFVEKPTDLAGSI >CP034966|3940173:3956532|3943939_3944539_-|QAS91582.1|DBSCAN-SWA MRKVCAAILSAAICLAVSGAPAWASEQQATLSAGYLHARTNAPGSDNLNGINVKYRYEFTDTLGLITSFSYANAEDEQKTHYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMAGVAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVAIDIAYEGSGSGDWRTDGFIVGVGYKF >CP034966|3940173:3956532|3952265_3952460_+|QAS91591.1|DBSCAN-SWA MNNLMTTKQVADFCGVSVSTVLRWNSVNRRTGQKYRPDFPDPDIKSCPNKWASHKIYRFAGVIE >CP034966|3940173:3956532|3952593_3955200_-|QAS91592.1|DBSCAN-SWA MNKLFINHINENYIQISHRSSFNCNTELFNATVSVGKSLEDLICACRSLSIEGADGNNLSIMTSSLKKKLRELVVTNFPLTKVYQQTEFGPGVKQEEINSSSMIIQIFQFYLGAISNCDIILHLKFFVESLSITESFVGKPPKTLLHEYFQRHQHIIPTYKTTLAGGTEHAPIYESEIALPNGQIFIGSGESKKKAENNVASLVCNHLNLKNNKKQILKSNSIISAWGGVQKPNVKECYINNELNMAFGFSNNFNSIQAFIPPRIKGKNRSISSHRDLATLGSHYISLLASVSLLEIIKNGQNVIDRTTLGIMVLSEKNFMRFYNSGFISIDSLPYKGHPDYTTISYYVDCIQALFGMSLLNLITKNSKIQYNELICSSSATNWLYKRIKNYNLDEESNKDIIPTLTLHRMQKYGFVFKLIKNLDVYQLEIAHIRNGDNFVIYADPLIQTRKDAMTRLAGSCLKALDRLEGVFLEPPRSEDSKKNQKKLISYLLRNISEDRPVVLSEEQLSVISKKKNEYVQYDFDMLKEKFLSQEMNYLEASVILFLLHQKLMLVDDSIDNICYLSLPIWSDQNSLDYDNFNDSSASADADADNHVNINFILDSIEDSQEKQYINYDSAAKASSLGEILSRYDRLNADCLSQLWDKRKSSFKYIEEAAIVLSKIRYERGLDFKLMSYKNFLSTSMIKELQLPSFDEENSKPLQIKSDLQLAKVKKQSKKIFSATEKFDFSTAGSFKINNSDERKVINRNVTIRIGQSDFRNKLINVWGQCAFTGCKTIAVLDAAHIYPYRGDKDNHIKNGLLLRTDIHRLFDNYLISIDPATLRIHVSDMVRDRYYQELNGVLLRDKVGLSMQALEYHWFYFTSQGD >CP034966|3940173:3956532|3947462_3948065_-|QAS91584.1|tail|DBSCAN-SWA MARICLYGDLQRFGRRIDLRVKTGAEAIRALATQLPAFRQKLSDGWYQVRIAGRDTGETELSARLNEPLANGAVIHIVPRLAGAKSGGVFQAVLGAAVMAVAIWMPGVGIMASNLLFSLGASMTLGGVAQMLAPKPKTPRTQTTDNGKQNTYFSSLDNMVAQGNVLPVLYGEMRVGSRVVSQEISTADEGDGGQVVVIGR |
14 | Enterobacteria_phage(42.86%) | tail,integrase | attL 3945069:3945083|attR 3958513:3958527 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_10 |
4038032 : 4044591
Sequences of DBSCAN-SWA_10
Nucleotide sequences of DBSCAN-SWA_10 >CP034966|4038032:4044591|DBSCAN-SWA GATGAAAATTGCGCTGGTTATTTTCATCACCCTTGCCCTGGCGGGCTGTGCGCTGTTATCACTCCATATGGGAGTGATCCCCGTGCCGTGGCGCGCGCTGCTGACCGACTGGCAGGCCGGACGCGAGCATTATTATGTATTGATGGAGTACCGACTGCCGCGCTTGCTGCTGGCACTGTTTGTCGGTGCAGCCCTCGCCGTGGCGGGCGTGCTGATACAGGGGATTGTGCGCAACCCTCTGGCATCACCGGATATTCTCGGCGTTAACCATGCCGCCAGCCTGGCCTCTGTGGGGGCTCTACTTCTTATGCCGTCACTGCCCGTGATGGTGCTGCCGCTGCTGGCCTTTGCGGGCGGCATGGCGGGGTTGATATTACTGAAGATGCTGGCAAAGACCCACCAGCCGATGAAGCTGGCGCTCACCGGCGTGGCGCTTTCTGCATGCTGGGCCAGCCTGACGGATTATCTGATGCTCTCGCGCCCACAGGATGTGAACAACGCCCTGCTGTGGCTGACCGGCAGCTTATGGGGCCGTGACTGGAGCTTTGTGAAGATTGCCATCCCGCTGATGATTTTATTTCTGCCGCTGAGCCTGAGTTTTTGCCGCGATCTCGACCTCCTTGCACTCGGCGATGCGCGCGCCACCACGCTCGGTGTGTCGGTGCCCCATACCCGATTCTGGGCTTTGTTACTAGCTGTCGCCATGACATCTACCGGCGTGGCCGCCTGCGGCCCGATTAGCTTTATTGGTCTCGTGGTGCCGCATATGATGCGTAGCATCACCGGTGGACGTCACCGCAGACTGCTGCCTGTTTCAGCCCTGACAGGTGCGTTGCTGTTGGTGGTTGCCGATCTGCTGGCGAGAATTATTCATCCCCCACTGGAGCTCCCGGTTGGCGTGCTGACCGCCATTATCGGTGCGCCGTGGTTTGTCTGGTTGCTTGTGAGAATGCGATAAATGACTTTACGAACTGAAAATCTGACGGTCAGTTACGGGACAGACAAGGTACTTAACGACGTTTCACTCTCACTGCCAACGGGGAAGATCACCGCCCTGATCGGTCCTAACGGTTGCGGGAAATCGACGCTGTTAAACTGTTTTTCGCGGCTTTTAATGCCGCAGTCTGGCACCGTATTTCTCGGCGATAATCCCATAAATATGCTCTCATCGCGCCAGTTGGCCCGCAGGCTTTCGCTGCTGCCTCAGCACCATTTAACGCCAGAGGGGATCACAGTCCAGGAGCTGGTTTCGTATGGTCGTAATCCCTGGCTGTCACTCTGGGGGCGTCTCTCCGCTGAAGACAATGCACGAGTTAATGTCGCCATGAACCAGACCCGGATCAATCATCTTGCCGTTCGTCGGTTAACCGAGCTTTCCGGCGGTCAGCGCCAGCGCGCATTTCTGGCGATGGTCCTGGCCCAGAATACGCCCGTTGTATTACTTGATGAGCCAACCACCTATCTTGATATCAATCACCAGGTGGACCTGATGCGGTTGATGGGCGAACTCCGGACTCAGGGGAAAACGGTGGTCGCTGTGCTGCACGACCTTAATCAGGCTAGCCGGTACTGCGATCAACTGGTGGTAATGGCAAACGGACATGTTATGGCGCAAGGCACACCAGAAGAGGTGATGACCCCAGGATTGCTGAGAACAGTATTCAGCGTGGAAGCGGAAATACACCCCGAGCCGGTATCTGGCAGGCCGATGTGCCTAATGAGGTAGATTGCACAGGCCGTAAGAACCAAACCACGACTGAATGAAACTGGACTGGCGCCAGCAAGCCTGTTCAGACTGGGGCTGAACTTTTCCGGACTCTGAAAGATTACCAATACTCATCGTCCATCCGCTTGCTTTAGGCTGACAGGTTCATAATCAACGCAAACCAGAGCTGTACAGGCTTGGGCGCGGCTTTCAAACCAGTCGTGATCACGGCAATCAATTTTGAACTCTGCTTAACGGACATTTCTGTATAACCCTTACGGCAACGAAAAACGCGAAGTTAAAATTTTAGAAACCCAAAAACGTGACATGACTAAGTTTAGATTTCAGGGGGGGAGATCAAAAAATTTCGCTCTGTGCCAGAGCGGACATTCACGGAGCTGGTTCATTACCAATGAGGTTGGGCTTTTGAGGATAAATCAATGATCAGACGCCAACGTAAATCAAAAGCACCCCTGGAACGGAATTGCTAATCCAGTTTCTGACCATCGATTTTTCTAAAAAGTGTGCGTTTGCTACTACTTAGGTAGGTGCAGCTTTCTTAATCACCGGCAGCCACGTTATACAGGCCAGTTGATGGATCGATTGTTATCAATGATATCTTTATGAGTCGGTGTCTCACCCAGCTTACCGAAGCTGGCATAAAGTGAAGGCAGACGGGCCCGTCCTTCTCCCTTTTTCGCCAGAGGGAAAGCGCGAAGCATGGTGGCAGCCTCCCAGTCACAACAATAGGATGGTGTGCACGGCTGCTGACGCCATGATTCAGCGATAGAGCCGGAAAATACGGGGTCAAAGCCGGTATCATTAACCAGAGTCATCGTGACCTGTACTGCGGCTGGATGATCTCCTGCTACTGCAACTGCTAGGCGACCGCGGCTCCCTTCAGGTAGTCTGTTGTGGTCAACTAAAACTGGCCTCCGCGTTAGAGTTTTTCCAGTATCGGTTTTCTGATTCGTTTGGTGGTAACCCACCATTATATTCGTGCGGTCTTAGTGCGCTGTAATATCCAACGATATAGTCCGTTATTGCGTGAGCTGCATCGCTGAAGCTTACATAGCCCGTCGCTGGCACCCATTCGTTCTTCAGACTCCTGAAGAAGCGCTCCATTGGGCTGTTATCCCAGCAGTTTCCACGCCGACTCATACTCTGCCTGATCCGGTATCGCCACAGTAACTGCCGGAACTGCCTGCTCGTATAATGACTGCCTTGATCGCCTGGAACATCACCCCGACGGGCTTACCACGGGTTTCCCATGCCATTTCCAGTGCTTTCATGGTAAGCCTGCTGTCCGGCGAGAACGACATGGCCCAGCCCACTGGTTTTCTTGCGAACAGGTCGAGAACAACGGCGAGGTACGCCCAGCGCTTACCCGTCCAGATATAGGTCACATCACCGCACCACACCTGATTTGGTTCCGTTACGGCGAACTGTCGCTCAAGATGATTCGGGATAGCAACGTGCTCATGACCGCCACGCTTATACCGGTGAGTCGGCTGCTGGCAACTGACCAGCCCCAGCTCTTTCATGAGTCTGCCAGCAAGCCAGTGCCCCATCTGGTAGCCTCTCCGGGTTGCCATTGTGGCGATGCTTCTTGCTCCGGCAGAGCCGTGGCTGATGCCATGCAGTTCAAGTACCTGGCTGCGTAATACAGCCCGTCTGCCGTCTGGTTTTTCAGGACGGTTTTTCCAGTATCTGTAGCTGCTGCGATGAACCCCGAACACATGGCAGAGTGTGACCACAGGATAATGCGCTCTGAGTTTCCTGATTATCGAGAACTGTTCAGGGAGTCTGACATCAAGAGCGCGGTTGTAGATTCAATTGGTCAACGCAACAGTTATGTGAAAACATGGGGTTGCGGAGGTTTTTTGAATGAGACGAACATTTACAGCAGAGGAAAAAGCCTCTGTTTTTGAACTATGGAAGAACGGAACAGGCTTCAGTGAAATAGCGAATATCCTGGGTTCAAAACCCGGAACGATCTTCACTATGTTAAGGGATACTGGCGGCATAAAACCCCATGAGCATAAGCGGGCTGTAGCTCACCTGACACTGTCTGAGCGCGAGGAGATACGAGCTGGTTTGTCAGCCAAAATGAGCATTCGTGCGATAGCTACTGCGCTGAATCGCAGTCCTTCGACGATCTCACGTGAAGTTCAGCGTAATCGGGGCAGACGCTATTACAAAGCTGTTGATGCTAATAACCGAGCCAACAGAATGGCGAAAAGGCCAAAACCGTGCTTACTGGATCAAAATTTACCATTGCGAAAGCTTGTTCTGGAAAAGCTGGAGATGAAATGGTCTCCAGAGCAAATATCAGGATGGTTAAGGCGAACAAAACCACGTCAAAAAACGCTGCGAATATCACCTGAGACAATTTATAAAACGCTGTACTTTCGTAGCCGTGAAGCGCTACACCACCTGAATATACAGCATCTGCGACGGTCGCATAGCCTTCGCCATGGCAGGCGTCATACCCGCAAAGGCGAAAGAGGTACGATTAACATAGTGAACGGAACACCAATTCACGAACGTTCCCGAAATATCGATAACAGACGCTCTCTGGGGCATTGGGAGGGCGATTTAGTCTCAGGTACAAAAAACTCTCATATAGCCACACTTGTAGACCGAAAATCACGTTATACGATCATCCTTAGACTCAGGGGCAAAGATTCTGTCTCAGTAAATCAGGCTCTTACCGACAAATTCCTGAGTTTACCGTCAGAACTCAGAAAATCACTGACATGGGACAGAGGAATGGAACTGGCCAGACATCTAGAATTTACTGTCAGCACCGGCGTTAAAGTTTACTTCTGCGATCCTCAGAGTCCTTGGCAGCGGGGAACAAATGAGAACACAAATGGGCTAATTCGGCAGTACTTTCCTAAAAAGACATGTCTTGCCCAATATACTCAACATGAACTAGATCTGGTTGCTGCTCAGCTAAACAACAGACCGAGAAAGACACTGAAGTTCAAAACACCGAAAGAGATAATTGAAAGGGGTGTTGCATTGACAGATTGAATCTACAGTAGCCTTTTTTAATATTTCATTCTCCATTTCAATGCGTTGTAGCTTTTTCCTCAGCTTACGTATTTCGATTTGTTCTGGTGTTATCGGAGAGGCTTTTGGTGTTTTGCCCTGACGCTCATCACGCAGTTGTTTGACCCATCTTGTCATTGTGGAAAGGCCAACATCCATAGCTTTGGCGGCATCTGCCACCGTGTATGTCTGGTCAACAACCAGTTGAGCGGATTCGCGTTTAAACTCTGCGCTAAAATTTCTTTTTTTCATTGGAGCACCTGTGTTGTTCTGAGGTGAGCATATCACCTCTGTTCAGGTGGCCAAATTCAGTAAACCACTTCACCACTTCAGAGCAGACGAGACAAAAAGAATGGTTGACGCATCAGCTCAACCCCATATAGTTGTAACACTTGAGCCAAACCCTTGGGCCGCTTTTTACTTTGATATTAACATTGCTAATACAGGGAACGCACCTGCCTATAATGTTGAGGTTGTGTTTGATCCTCCACTAGTAAATGCGGAGCATAGAGAAAAAAGTGAGATTCCGTTTAGTAAGGTAAGCGTGTTAAAAAATGGGCAATCACTTACCAGCAATCTCTGTAAGTATGAACAAATCAAAGATCAAATTTATAATATTAATATAAGCTGGGCAAGCAAACCTAAATCAAACGATAGAGAAACAAATGAATATGTGTATGACATGGCGACATTTGAAGGAATAAGTTATCTAGGAGCGAGAAGCCCATTGACGCAAATTGCAGAACAAATTAAAGGTATAAGAGAGGATTGGAAACCTATTGCACAAGGAGCTAAAAAAGTAAAAGCAGACGTATATACTTCAAGCGATAGAAACGAAGAACGCACGTATCTGCAAGAGCAACACGATTTGGCAATAAAAAGGAGAGATGAGAAAAGAGAAAAAAGATTAGAGTCTGGTGAATAATTTTAAAGGGAGTGGGTAACTAACCCACTCGTAACTATAAACCTGTAATTAATCACTTATTTTTGACAACAGATAATTACTGAACGCACTGCAAGTGACTAACATAAATTTAGCTTCAGCTAAGGTAGGATTTACATCTTCTTCTGTTAGCGCATGACGTATTCCACCCTGATCACTTGTATAACCATAGAGCTGACTAAAAGCGCCTTTCATTGCAGAGTGTATATATCCTTTTTCCTCTATAGCTTTAAGACAAGCCCCCAAGGTTCCTTTATCATTGCCCGTGATTTTCCTGCATAAAGATTCAATTGCAGAGATAGACTCTTTAATCGAGTTTCTGTAGTCTGGCTGCTCTCTATCCGTCATTAGTTGTAACGCCCTTTCGAAATGGCTACGCGATGAATCAGTGCCATTATCAACTGCGTTCTGAACACTTTCAATTTCGTTATCATTTGAAATAGGAGTAATACAACCATTTATTATGGTATAACCAACGCCATGCTTTTTAAAGATGGAATTGAGATGCTTCGATAGATTAATATATGAATTAGTTCTCTCAATGATGAACTCAATTAAATCATATACCAAATACCATGCTTCCCCATATATATAATCTCGGATAGCAGTCAGCAACGTCTTATCACTTTTGTATCCACTTTCATAACGAGGAATATTATCCGCAGGTTGATTTAGATAATATATCCACACAGACTGCGCACATTTTGTTGCTGTAGCAGTTTGACGATTGTTAGTCCAAAGGAAAAGATATAAGCAATTCCACAATGCCATGCGTGTATCAGAATTAAGATCATTCAGCTGGACATGCTCTCTAACGTCAACATGACCATACCTCACAGAAAATGGCTTTATCAT
Protein sequences of DBSCAN-SWA_10 >CP034966|4038032:4044591|4038989_4039757_+|QAS91668.1|DBSCAN-SWA MTLRTENLTVSYGTDKVLNDVSLSLPTGKITALIGPNGCGKSTLLNCFSRLLMPQSGTVFLGDNPINMLSSRQLARRLSLLPQHHLTPEGITVQELVSYGRNPWLSLWGRLSAEDNARVNVAMNQTRINHLAVRRLTELSGGQRQRAFLAMVLAQNTPVVLLDEPTTYLDINHQVDLMRLMGELRTQGKTVVAVLHDLNQASRYCDQLVVMANGHVMAQGTPEEVMTPGLLRTVFSVEAEIHPEPVSGRPMCLMR >CP034966|4038032:4044591|4040314_4040572_-|QAS91669.1|DBSCAN-SWA MTLVNDTGFDPVFSGSIAESWRQQPCTPSYCCDWEAATMLRAFPLAKKGEGRARLPSLYASFGKLGETPTHKDIIDNNRSINWPV >CP034966|4038032:4044591|4043145_4043718_+|QAS91672.1|DBSCAN-SWA MVDASAQPHIVVTLEPNPWAAFYFDINIANTGNAPAYNVEVVFDPPLVNAEHREKSEIPFSKVSVLKNGQSLTSNLCKYEQIKDQIYNINISWASKPKSNDRETNEYVYDMATFEGISYLGARSPLTQIAEQIKGIREDWKPIAQGAKKVKADVYTSSDRNEERTYLQEQHDLAIKRRDEKREKRLESGE >CP034966|4038032:4044591|4038032_4038989_+|QAS91667.1|DBSCAN-SWA MKIALVIFITLALAGCALLSLHMGVIPVPWRALLTDWQAGREHYYVLMEYRLPRLLLALFVGAALAVAGVLIQGIVRNPLASPDILGVNHAASLASVGALLLMPSLPVMVLPLLAFAGGMAGLILLKMLAKTHQPMKLALTGVALSACWASLTDYLMLSRPQDVNNALLWLTGSLWGRDWSFVKIAIPLMILFLPLSLSFCRDLDLLALGDARATTLGVSVPHTRFWALLLAVAMTSTGVAACGPISFIGLVVPHMMRSITGGRHRRLLPVSALTGALLLVVADLLARIIHPPLELPVGVLTAIIGAPWFVWLLVRMR >CP034966|4038032:4044591|4043766_4044591_-|QAS91673.1|DBSCAN-SWA MIKPFSVRYGHVDVREHVQLNDLNSDTRMALWNCLYLFLWTNNRQTATATKCAQSVWIYYLNQPADNIPRYESGYKSDKTLLTAIRDYIYGEAWYLVYDLIEFIIERTNSYINLSKHLNSIFKKHGVGYTIINGCITPISNDNEIESVQNAVDNGTDSSRSHFERALQLMTDREQPDYRNSIKESISAIESLCRKITGNDKGTLGACLKAIEEKGYIHSAMKGAFSQLYGYTSDQGGIRHALTEEDVNPTLAEAKFMLVTCSAFSNYLLSKISD >CP034966|4038032:4044591|4041623_4042775_+|QAS91670.1|transposase|DBSCAN-SWA MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPHEHKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRGRRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISGWLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRRHTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIATLVDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSELRKSLTWDRGMELARHLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQYFPKKTCLAQYTQHELDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD >CP034966|4038032:4044591|4042694_4043045_-|QAS91671.1|transposase|DBSCAN-SWA MKKRNFSAEFKRESAQLVVDQTYTVADAAKAMDVGLSTMTRWVKQLRDERQGKTPKASPITPEQIEIRKLRKKLQRIEMENEILKKATVDSICQCNTPFNYLFRCFELQCLSRSVV |
7 | uncultured_Caudovirales_phage(16.67%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|