assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003073955.1_ASM307395v1	CP029122	Escherichia coli strain AR434 chromosome, complete genome	1	892263-892402	1	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	Orphan	TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA	49	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	NA,NA|38aa|down_8|CP029122.1_901256_901370_+	NA|336aa|up_9|CP029122.1_884728_885736_-	PRK10508, PRK10508, luciferase-like monooxygenase	NA|293aa|up_8|CP029122.1_885941_886820_-	PRK15447, PRK15447, putative protease; Provisional	NA|332aa|up_7|CP029122.1_886828_887824_-	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|175aa|up_6|CP029122.1_888032_888557_+	COG3154, COG3154, Putative lipid carrier protein [Lipid metabolism]	NA|168aa|up_5|CP029122.1_888550_889054_+	COG3153, COG3153, Predicted acetyltransferase [General function prediction only]	NA|101aa|up_4|CP029122.1_889040_889343_-	PRK00329, PRK00329, GIY-YIG nuclease superfamily protein; Validated	NA|148aa|up_3|CP029122.1_889393_889837_+	PRK03467, PRK03467, hypothetical protein; Provisional	NA|173aa|up_2|CP029122.1_889816_890335_-	cd03134, GATase1_PfpI_like, A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus	NA|212aa|up_1|CP029122.1_890462_891098_+	cd05250, CC3_like_SDR_a, CC3(TIP30)-like, atypical (a) SDRs	NA|347aa|up_0|CP029122.1_891170_892211_+	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|192aa|down_0|CP029122.1_892415_892991_-	PRK11023, PRK11023, divisome-associated lipoprotein YraP	NA|197aa|down_1|CP029122.1_893000_893591_-	PRK10886, PRK10886, DnaA initiator-associating protein DiaA; Provisional	NA|132aa|down_2|CP029122.1_893610_894006_-	TIGR00252, UPF0102_protein_HI_1656, TIGR00252 family protein	NA|679aa|down_3|CP029122.1_893963_896000_-	COG3107, LppC, Putative lipoprotein [General function prediction only]	NA|287aa|down_4|CP029122.1_896064_896925_+	PRK14994, PRK14994, SAM-dependent 16S ribosomal RNA C1402 ribose 2'-O-methyltransferase; Provisional	NA|364aa|down_5|CP029122.1_896967_898059_-	pfam00419, Fimbrial, Fimbrial protein	NA|722aa|down_6|CP029122.1_898069_900235_-	COG3188, FimD, P pilus assembly protein, porin PapC [Cell motility and secretion / Intracellular trafficking and secretion]	NA|168aa|down_7|CP029122.1_900732_901236_+	pfam03400, DDE_Tnp_IS1, IS1 transposase	NA|38aa|down_8|CP029122.1_901256_901370_+	NA	NA|200aa|down_9|CP029122.1_901392_901992_-	COG3121, FimC, P pilus assembly protein, chaperone PapD [Cell motility and secretion / Intracellular trafficking and secretion]
GCA_003073955.1_ASM307395v1	CP029122	Escherichia coli strain AR434 chromosome, complete genome	2	937163-937280	2	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	Orphan	TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGG	40	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	NA,NA|56aa|down_0|CP029122.1_937353_937521_-	NA|73aa|up_9|CP029122.1_924884_925103_-	PRK11424, PRK11424, DNA-binding transcriptional activator TdcR; Provisional	NA|313aa|up_8|CP029122.1_925417_926356_+	PRK10341, PRK10341, transcriptional regulator TdcA	NA|330aa|up_7|CP029122.1_926454_927444_+	PRK08638, PRK08638, bifunctional threonine ammonia-lyase/L-serine ammonia-lyase TdcB	NA|444aa|up_6|CP029122.1_927465_928797_+	PRK13629, PRK13629, threonine/serine transporter TdcC; Provisional	NA|403aa|up_5|CP029122.1_928822_930031_+	PRK12379, PRK12379, propionate kinase	NA|765aa|up_4|CP029122.1_930064_932359_+	cd01678, PFL1, Pyruvate formate lyase 1	NA|130aa|up_3|CP029122.1_932372_932762_+	PRK11401, PRK11401, enamine/imine deaminase	NA|433aa|up_2|CP029122.1_932899_934198_+	PRK15040, PRK15040, L-serine ammonia-lyase	NA|444aa|up_1|CP029122.1_934472_935804_+	TIGR00814, membrane_transport_protein_YhjV, serine transporter	NA|437aa|up_0|CP029122.1_935831_937142_+	COG3681, COG3681, L-cysteine desulfidase [Amino acid transport and metabolism]	NA|56aa|down_0|CP029122.1_937353_937521_-	NA	NA|201aa|down_1|CP029122.1_937540_938143_-	COG1741, COG1741, Pirin-related protein [General function prediction only]	NA|299aa|down_2|CP029122.1_938346_939243_+	cd08431, PBP2_HupR, The C-terminal substrate binding domain of LysR-type transcriptional regulator, HupR, which regulates expression of the heme uptake receptor HupA; contains the type 2 periplasmic binding fold	NA|119aa|down_3|CP029122.1_939293_939650_-	COG3152, COG3152, Predicted membrane protein [Function unknown]	NA|122aa|down_4|CP029122.1_939891_940257_-	COG3152, COG3152, Predicted membrane protein [Function unknown]	NA|329aa|down_5|CP029122.1_940549_941536_-	COG0435, ECM4, Predicted glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|131aa|down_6|CP029122.1_941605_941998_-	COG2259, COG2259, Predicted membrane protein [Function unknown]	NA|100aa|down_7|CP029122.1_942183_942483_-	pfam13997, YqjK, YqjK-like protein	NA|135aa|down_8|CP029122.1_942472_942877_-	COG5393, COG5393, Predicted membrane protein [Function unknown]	NA|102aa|down_9|CP029122.1_942879_943185_-	COG4575, ElaB, Uncharacterized conserved protein [Function unknown]
GCA_003073955.1_ASM307395v1	CP029122	Escherichia coli strain AR434 chromosome, complete genome	3	1310603-1311119	1,3,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas7	cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	Unclear	GAGTTCCCCGCGCCAGCGGGGATAAACCG,GAGTTCCCCGCGCCAGCGGGGATAAACCG,GAGTTCCCCGCGCCAGCGGGGATAAACCG	29,29,29	0	0	NA	NA	I-E:I-E:I-E	8,8,8	8	Unclear	cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	NA|47aa|up_1|CP029122.1_1309312_1309453_-,NA	NA|434aa|up_9|CP029122.1_1300009_1301311_+	PRK13168, rumA, 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD	NA|745aa|up_8|CP029122.1_1301358_1303593_+	PRK10872, relA, (p)ppGpp synthetase I/GTP pyrophosphokinase; Provisional	NA|83aa|up_7|CP029122.1_1303670_1303919_+	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|75aa|up_6|CP029122.1_1304029_1304254_+	PRK09907, PRK09907, endoribonuclease MazF	NA|264aa|up_5|CP029122.1_1304324_1305116_+	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|546aa|up_4|CP029122.1_1305343_1306981_+	PRK05380, pyrG, CTP synthetase; Validated	NA|433aa|up_3|CP029122.1_1307068_1308367_+	PRK00077, eno, enolase; Provisional	NA|291aa|up_2|CP029122.1_1308426_1309299_-	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|47aa|up_1|CP029122.1_1309312_1309453_-	NA	NA|224aa|up_0|CP029122.1_1309591_1310263_+	TIGR04322, organic_radical_activating_enzyme, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE	NA|493aa|down_0|CP029122.1_1311756_1313235_-	cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins	NA|426aa|down_1|CP029122.1_1313261_1314539_-	cd06174, MFS, Major Facilitator Superfamily	NA|262aa|down_2|CP029122.1_1314857_1315643_+	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs	NA|485aa|down_3|CP029122.1_1315712_1317167_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|439aa|down_4|CP029122.1_1317281_1318598_+	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|260aa|down_5|CP029122.1_1318575_1319355_+	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]	NA|287aa|down_6|CP029122.1_1319351_1320212_+	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|181aa|down_7|CP029122.1_1320359_1320902_-	COG1954, GlpP, Glycerol-3-phosphate responsive antiterminator (mRNA-binding) [Transcription]	NA|87aa|down_8|CP029122.1_1320951_1321212_-	COG2440, FixX, Ferredoxin-like protein [Energy production and conversion]	NA|424aa|down_9|CP029122.1_1321202_1322474_-	PRK10015, PRK10015, oxidoreductase; Provisional
GCA_003073955.1_ASM307395v1	CP029122	Escherichia coli strain AR434 chromosome, complete genome	4	1333504-1334082	4,2,2	CRISPRCasFinder,PILER-CR,CRT	no	cas3,cas7,cas5,cas6e,cas1,cas2	cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	Unclear	TGTGTTCCCCGCGCCAGCGGGGATAAACCG,GTGTTCCCCGCGCCAGCGGGGATAAACC,GTGTTCCCCGCGCCAGCGGGGATAAACCG	30,28,29	0	0	NA	NA	I-E:I-E:I-E	9,9,9	9	Unclear	cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	NA,NA|40aa|down_5|CP029122.1_1338854_1338974_+	NA|600aa|up_9|CP029122.1_1323232_1325032_+	PRK10953, cysJ, NADPH-dependent assimilatory sulfite reductase flavoprotein subunit	NA|571aa|up_8|CP029122.1_1325031_1326744_+	PRK13504, PRK13504, NADPH-dependent assimilatory sulfite reductase hemoprotein subunit	NA|245aa|up_7|CP029122.1_1326817_1327552_+	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	NA|51aa|up_6|CP029122.1_1327816_1327969_+	pfam01848, HOK_GEF, Hok/gef family	cas3|596aa|up_5|CP029122.1_1328162_1329950_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas7|295aa|up_4|CP029122.1_1329924_1330809_+	cd09646, Cas7_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|249aa|up_3|CP029122.1_1330819_1331566_+	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|217aa|up_2|CP029122.1_1331547_1332198_+	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cas1|308aa|up_1|CP029122.1_1332194_1333118_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|96aa|up_0|CP029122.1_1333120_1333408_+	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	NA|346aa|down_0|CP029122.1_1334163_1335201_-	PRK10199, PRK10199, alkaline phosphatase isozyme conversion aminopeptidase; Provisional	NA|303aa|down_1|CP029122.1_1335452_1336361_+	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|476aa|down_2|CP029122.1_1336362_1337790_+	PRK05124, cysN, sulfate adenylyltransferase subunit 1; Provisional	NA|202aa|down_3|CP029122.1_1337789_1338395_+	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|108aa|down_4|CP029122.1_1338444_1338768_+	pfam12084, DUF3561, Protein of unknown function (DUF3561)	NA|40aa|down_5|CP029122.1_1338854_1338974_+	NA	NA|104aa|down_6|CP029122.1_1338961_1339273_+	PRK00888, ftsB, cell division protein FtsB; Reviewed	NA|237aa|down_7|CP029122.1_1339291_1340002_+	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|160aa|down_8|CP029122.1_1340001_1340481_+	PRK00084, ispF, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; Reviewed	NA|350aa|down_9|CP029122.1_1340477_1341527_+	PRK00984, truD, tRNA pseudouridine synthase D; Reviewed
GCA_003073955.1_ASM307395v1	CP029122	Escherichia coli strain AR434 chromosome, complete genome	5	1836729-1836846	5	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	Orphan	CCGAGCCGTAGGCCGGATAAGGCGTTCACGC	31	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	NA|47aa|up_6|CP029122.1_1831809_1831950_-,NA|44aa|up_4|CP029122.1_1833835_1833967_-,NA	NA|543aa|up_9|CP029122.1_1827250_1828879_-	PRK11101, glpA, anaerobic glycerol-3-phosphate dehydrogenase subunit A	NA|453aa|up_8|CP029122.1_1829151_1830510_+	PRK11273, glpT, glycerol-3-phosphate transporter	NA|359aa|up_7|CP029122.1_1830514_1831591_+	PRK11143, glpQ, glycerophosphodiester phosphodiesterase; Provisional	NA|47aa|up_6|CP029122.1_1831809_1831950_-	NA	NA|168aa|up_5|CP029122.1_1832367_1832871_+	pfam03400, DDE_Tnp_IS1, IS1 transposase	NA|44aa|up_4|CP029122.1_1833835_1833967_-	NA	NA|69aa|up_3|CP029122.1_1834122_1834329_-	PRK09729, PRK09729, hypothetical protein; Provisional	NA|217aa|up_2|CP029122.1_1834543_1835194_+	PRK09902, PRK09902, lipopolysaccharide kinase InaA	NA|85aa|up_1|CP029122.1_1835247_1835502_-	PRK10713, PRK10713, 2Fe-2S ferredoxin-like protein	NA|377aa|up_0|CP029122.1_1835501_1836632_-	PRK09101, nrdB, ribonucleotide-diphosphate reductase subunit beta; Reviewed	NA|762aa|down_0|CP029122.1_1836865_1839151_-	PRK09103, PRK09103, ribonucleoside-diphosphate reductase subunit alpha	NA|1235aa|down_1|CP029122.1_1839894_1843599_+	PRK09752, PRK09752, AIDA-I family autotransporter YfaL	NA|241aa|down_2|CP029122.1_1843726_1844449_-	PRK05134, PRK05134, bifunctional 2-polyprenyl-6-hydroxyphenol methylase/3-demethylubiquinol 3-O-methyltransferase UbiG	NA|876aa|down_3|CP029122.1_1844595_1847223_+	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|563aa|down_4|CP029122.1_1847371_1849060_+	COG4685, COG4685, Uncharacterized protein conserved in bacteria [Function unknown]	NA|208aa|down_5|CP029122.1_1849056_1849680_+	COG3234, COG3234, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1505aa|down_6|CP029122.1_1849703_1854218_+	COG2373, COG2373, Large extracellular alpha-helical protein [General function prediction only]	NA|550aa|down_7|CP029122.1_1854218_1855868_+	COG5445, COG5445, Predicted secreted protein [Function unknown]	NA|259aa|down_8|CP029122.1_1855872_1856649_+	COG4676, COG4676, Uncharacterized protein conserved in bacteria [Function unknown]	NA|395aa|down_9|CP029122.1_1856722_1857907_-	PRK05790, PRK05790, putative acyltransferase; Provisional
GCA_003073955.1_ASM307395v1	CP029122	Escherichia coli strain AR434 chromosome, complete genome	6	2457166-2457289	6	CRISPRCasFinder	no	DEDDh	cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	Unclear	CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA	43	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	NA|46aa|up_8|CP029122.1_2449088_2449226_-,NA	NA|49aa|up_9|CP029122.1_2448651_2448798_+	PRK10292, PRK10292, fumarate hydratase FumD	NA|46aa|up_8|CP029122.1_2449088_2449226_-	NA	NA|150aa|up_7|CP029122.1_2449430_2449880_+	PRK09898, PRK09898, ferredoxin-like protein	NA|701aa|up_6|CP029122.1_2449900_2452003_+	PRK09849, PRK09849, putative oxidoreductase; Provisional	NA|213aa|up_5|CP029122.1_2452015_2452654_+	PRK09947, PRK09947, YdhW family putative oxidoreductase system protein	NA|185aa|up_4|CP029122.1_2452831_2453386_+	TIGR03149, cyt_nit_nrfC, cytochrome c nitrite reductase, Fe-S protein	NA|262aa|up_3|CP029122.1_2453382_2454168_+	PRK15006, PRK15006, thiosulfate reductase cytochrome B subunit; Provisional	NA|255aa|up_2|CP029122.1_2454219_2454984_+	PRK09946, PRK09946, hypothetical protein; Provisional	NA|535aa|up_1|CP029122.1_2454995_2456600_-	PRK09897, PRK09897, FAD-NAD(P)-binding protein	NA|102aa|up_0|CP029122.1_2456725_2457031_-	PRK11118, PRK11118, putative monooxygenase; Provisional	NA|419aa|down_0|CP029122.1_2457603_2458860_+	PRK09945, PRK09945, hypothetical protein; Provisional	NA|458aa|down_1|CP029122.1_2458900_2460274_-	PRK01766, PRK01766, multidrug efflux protein; Reviewed	NA|214aa|down_2|CP029122.1_2460488_2461130_+	PRK13020, PRK13020, riboflavin synthase subunit alpha; Provisional	NA|383aa|down_3|CP029122.1_2461169_2462318_-	PRK11705, PRK11705, cyclopropane fatty acyl phospholipid synthase	NA|404aa|down_4|CP029122.1_2462608_2463820_-	PRK11043, PRK11043, Bcr/CflA family multidrug efflux MFS transporter	NA|311aa|down_5|CP029122.1_2463932_2464865_+	PRK11074, PRK11074, putative DNA-binding transcriptional regulator; Provisional	NA|342aa|down_6|CP029122.1_2464861_2465887_-	PRK10703, PRK10703, HTH-type transcriptional repressor PurR	NA|40aa|down_7|CP029122.1_2466155_2466275_+	PRK14756, small_mem_YnhF, YnhF family membrane protein; Validated	NA|390aa|down_8|CP029122.1_2466440_2467610_+	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]	NA|194aa|down_9|CP029122.1_2467755_2468337_-	PRK10543, PRK10543, superoxide dismutase [Fe]
GCA_003073955.1_ASM307395v1	CP029122	Escherichia coli strain AR434 chromosome, complete genome	8	3414320-3414464	8	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	Orphan	GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC	52	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	NA|72aa|up_4|CP029122.1_3409771_3409987_+,NA|50aa|up_3|CP029122.1_3410517_3410667_-,NA|56aa|up_2|CP029122.1_3411362_3411530_+,NA|43aa|down_3|CP029122.1_3419578_3419707_-	NA|69aa|up_9|CP029122.1_3407305_3407512_+	PRK11354, kil, FtsZ inhibitor protein; Reviewed	NA|99aa|up_8|CP029122.1_3407587_3407884_+	pfam06064, Gam, Host-nuclease inhibitor protein Gam	NA|262aa|up_7|CP029122.1_3407889_3408675_+	TIGR01913, Uncharacterized_protein_UU154, phage recombination protein Bet	NA|227aa|up_6|CP029122.1_3408671_3409352_+	pfam09588, YqaJ, YqaJ-like viral recombinase domain	NA|38aa|up_5|CP029122.1_3409581_3409695_+	pfam07131, DUF1382, Protein of unknown function (DUF1382)	NA|72aa|up_4|CP029122.1_3409771_3409987_+	NA	NA|50aa|up_3|CP029122.1_3410517_3410667_-	NA	NA|56aa|up_2|CP029122.1_3411362_3411530_+	NA	NA|357aa|up_1|CP029122.1_3411765_3412836_+	cd00800, INT_Lambda_C, C-terminal catalytic domain of Lambda integrase, a tyrosine-based site-specific recombinase	NA|428aa|up_0|CP029122.1_3412970_3414254_+	PRK10531, PRK10531, putative acyl-CoA thioester hydrolase	NA|754aa|down_0|CP029122.1_3414487_3416749_-	PRK11413, PRK11413, putative hydratase; Provisional	NA|478aa|down_1|CP029122.1_3416931_3418365_-	pfam00939, Na_sulph_symp, Sodium:sulfate symporter transmembrane region	NA|351aa|down_2|CP029122.1_3418440_3419493_-	NF033377, OMA_tautomer, 4-oxalomesaconate tautomerase	NA|43aa|down_3|CP029122.1_3419578_3419707_-	NA	NA|318aa|down_4|CP029122.1_3419676_3420630_+	cd08440, PBP2_LTTR_like_4, TThe C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|332aa|down_5|CP029122.1_3420670_3421666_-	PRK11028, PRK11028, 6-phosphogluconolactonase; Provisional	NA|273aa|down_6|CP029122.1_3421820_3422639_+	PRK10530, PRK10530, pyridoxal phosphate (PLP) phosphatase; Provisional	NA|353aa|down_7|CP029122.1_3422639_3423698_-	PRK11144, modC, molybdenum ABC transporter ATP-binding protein ModC	NA|230aa|down_8|CP029122.1_3423700_3424390_-	PRK09421, modB, molybdate ABC transporter permease subunit	NA|258aa|down_9|CP029122.1_3424389_3425163_-	PRK10677, modA, molybdate transporter periplasmic protein; Provisional
GCA_003073955.1_ASM307395v1	CP029122	Escherichia coli strain AR434 chromosome, complete genome	9	3910465-3910618	9	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	Orphan	CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCG	53	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	NA,NA|47aa|down_2|CP029122.1_3911984_3912125_+	NA|415aa|up_9|CP029122.1_3901843_3903088_-	PRK05077, frsA, esterase FrsA	NA|153aa|up_8|CP029122.1_3903179_3903638_-	PRK09177, PRK09177, xanthine-guanine phosphoribosyltransferase; Validated	NA|486aa|up_7|CP029122.1_3903898_3905356_+	PRK15026, PRK15026, aminoacyl-histidine dipeptidase; Provisional	NA|47aa|up_6|CP029122.1_3905412_3905553_-	PRK08179, prfH, peptide chain release factor-like protein; Reviewed	NA|66aa|up_5|CP029122.1_3905546_3905744_-	PRK08179, prfH, peptide chain release factor-like protein; Reviewed	NA|85aa|up_4|CP029122.1_3905712_3905967_-	PRK09588, PRK09588, hypothetical protein; Reviewed	NA|151aa|up_3|CP029122.1_3906285_3906738_-	PRK09831, PRK09831, GNAT family N-acetyltransferase	NA|352aa|up_2|CP029122.1_3906734_3907790_-	PRK02406, PRK02406, DNA polymerase IV; Validated	NA|257aa|up_1|CP029122.1_3907860_3908631_-	PRK06778, PRK06778, hypothetical protein; Validated	NA|580aa|up_0|CP029122.1_3908590_3910330_+	COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell motility and secretion / Intracellular trafficking and secretion]	NA|166aa|down_0|CP029122.1_3910647_3911145_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|209aa|down_1|CP029122.1_3911320_3911947_-	COG0791, Spr, Cell wall-associated hydrolases (invasion-associated proteins) [Cell envelope biogenesis, outer membrane]	NA|47aa|down_2|CP029122.1_3911984_3912125_+	NA	NA|247aa|down_3|CP029122.1_3912370_3913111_+	COG3034, COG3034, Uncharacterized protein conserved in bacteria [Function unknown]	NA|256aa|down_4|CP029122.1_3913081_3913849_-	pfam13230, GATase_4, Glutamine amidotransferases class-II	NA|193aa|down_5|CP029122.1_3914054_3914633_-	PRK00414, gmhA, D-sedoheptulose 7-phosphate isomerase	NA|815aa|down_6|CP029122.1_3914872_3917317_+	PRK09463, fadE, acyl-CoA dehydrogenase; Reviewed	NA|149aa|down_7|CP029122.1_3917359_3917806_-	PRK09993, PRK09993, C-lysozyme inhibitor; Provisional	NA|257aa|down_8|CP029122.1_3917986_3918757_+	PRK10438, PRK10438, C-N hydrolase family amidase; Provisional	NA|348aa|down_9|CP029122.1_3918798_3919842_-	COG5433, COG5433, Transposase [DNA replication, recombination, and repair]
GCA_003073955.1_ASM307395v1	CP029122	Escherichia coli strain AR434 chromosome, complete genome	10	4100634-4100749	10	CRISPRCasFinder	no		cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	Orphan	AACGCCTGATGCGACGCTGACGCGTCTTATC	31	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	NA,NA	NA|393aa|up_9|CP029122.1_4088649_4089828_-	TIGR00899, Sugar_efflux_transporter_A, sugar efflux transporter	NA|44aa|up_8|CP029122.1_4089929_4090061_-	pfam15894, SgrT, Inhibitor of glucose uptake transporter SgrT	NA|552aa|up_7|CP029122.1_4090149_4091805_+	PRK13626, PRK13626, HTH-type transcriptional regulator SgrR	NA|328aa|up_6|CP029122.1_4091968_4092952_+	PRK11205, tbpA, thiamine transporter substrate binding subunit; Provisional	NA|537aa|up_5|CP029122.1_4092927_4094538_+	PRK09433, thiP, thiamine transporter membrane protein; Reviewed	NA|233aa|up_4|CP029122.1_4094521_4095220_+	PRK10771, thiQ, thiamine ABC transporter ATP-binding protein ThiQ	NA|255aa|up_3|CP029122.1_4095333_4096098_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|293aa|up_2|CP029122.1_4096183_4097062_-	PRK10572, PRK10572, arabinose operon transcriptional regulator AraC	NA|567aa|up_1|CP029122.1_4097400_4099101_+	PRK04123, PRK04123, ribulokinase; Provisional	NA|501aa|up_0|CP029122.1_4099111_4100614_+	PRK02929, PRK02929, L-arabinose isomerase; Provisional	NA|232aa|down_0|CP029122.1_4100813_4101509_+	PRK08193, araD, L-ribulose-5-phosphate 4-epimerase AraD	NA|784aa|down_1|CP029122.1_4101583_4103935_+	PRK05762, PRK05762, DNA polymerase II; Reviewed	NA|920aa|down_2|CP029122.1_4104246_4107006_+	PRK04914, PRK04914, RNA polymerase-associated protein RapA	NA|220aa|down_3|CP029122.1_4107017_4107677_+	PRK10158, PRK10158, bifunctional tRNA pseudouridine(32) synthase/23S rRNA pseudouridine(746) synthase RluA	NA|272aa|down_4|CP029122.1_4107793_4108609_-	PRK09430, djlA, co-chaperone DjlA	NA|785aa|down_5|CP029122.1_4108863_4111218_+	PRK03761, PRK03761, LPS assembly outer membrane complex protein LptD; Provisional	NA|429aa|down_6|CP029122.1_4111270_4112557_+	PRK10770, PRK10770, peptidyl-prolyl cis-trans isomerase SurA; Provisional	NA|330aa|down_7|CP029122.1_4112556_4113546_+	PRK00232, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase; Reviewed	NA|274aa|down_8|CP029122.1_4113542_4114364_+	PRK00274, ksgA, 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))-dimethyltransferase RsmA	NA|84aa|down_9|CP029122.1_4114492_4114744_+	PRK05461, apaG, CO2+/MG2+ efflux protein ApaG; Reviewed
GCA_003073955.1_ASM307395v1	CP029122	Escherichia coli strain AR434 chromosome, complete genome	11	4123662-4123794	3	PILER-CR	no		cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	Orphan	ATCACCAATATTGAAAA	17	0	0	NA	NA	NA	2	2	Orphan	cas3,RT,csa3,PD-DExK,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG	NA,NA|73aa|down_7|CP029122.1_4131710_4131929_+	NA|84aa|up_9|CP029122.1_4114492_4114744_+	PRK05461, apaG, CO2+/MG2+ efflux protein ApaG; Reviewed	NA|281aa|up_8|CP029122.1_4114750_4115593_+	TIGR00668, Bis5'-nucleosyl-tetraphosphatase_symmetrical, bis(5'-nucleosyl)-tetraphosphatase (symmetrical)	NA|160aa|up_7|CP029122.1_4115670_4116150_-	PRK10769, folA, type 3 dihydrofolate reductase	NA|621aa|up_6|CP029122.1_4116341_4118204_-	PRK03562, PRK03562, glutathione-regulated potassium-efflux system protein KefC; Provisional	NA|158aa|up_5|CP029122.1_4118196_4118670_-	PRK00871, PRK00871, glutathione-regulated potassium-efflux system oxidoreductase KefF	NA|444aa|up_4|CP029122.1_4118834_4120166_-	cd17316, MFS_SV2_like, Metazoan Synaptic vesicle glycoprotein 2 (SV2) and related small molecule transporters of the Major Facilitator Superfamily	NA|96aa|up_3|CP029122.1_4120223_4120511_-	PRK15449, PRK15449, ferredoxin-like protein FixX; Provisional	NA|429aa|up_2|CP029122.1_4120507_4121794_-	PRK10157, PRK10157, putative oxidoreductase FixC; Provisional	NA|314aa|up_1|CP029122.1_4121844_4122786_-	PRK03363, fixB, electron transfer flavoprotein subunit alpha/FixB family protein	NA|257aa|up_0|CP029122.1_4122800_4123571_-	PRK03359, PRK03359, putative electron transfer flavoprotein FixA; Reviewed	NA|505aa|down_0|CP029122.1_4124044_4125559_+	PRK03356, PRK03356, L-carnitine/gamma-butyrobetaine antiport BCCT transporter	NA|381aa|down_1|CP029122.1_4125589_4126732_+	PRK03354, PRK03354, crotonobetainyl-CoA dehydrogenase; Validated	NA|406aa|down_2|CP029122.1_4126860_4128078_+	PRK03525, PRK03525, L-carnitine CoA-transferase	NA|518aa|down_3|CP029122.1_4128151_4129705_+	PRK08008, caiC, putative crotonobetaine/carnitine-CoA ligase; Validated	NA|262aa|down_4|CP029122.1_4129813_4130599_+	PRK03580, PRK03580, crotonobetainyl-CoA hydratase	NA|197aa|down_5|CP029122.1_4130604_4131195_+	PRK13627, PRK13627, carnitine operon protein CaiE; Provisional	NA|132aa|down_6|CP029122.1_4131280_4131676_-	PRK11476, PRK11476, carnitine metabolism transcriptional regulator CaiF	NA|73aa|down_7|CP029122.1_4131710_4131929_+	NA	NA|1074aa|down_8|CP029122.1_4131936_4135158_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|383aa|down_9|CP029122.1_4135175_4136324_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit
