assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009176965.1_ASM917696v1	NZ_AP019675	Escherichia coli strain GSH8M-2	1	957834-958655	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	Unclear	GTGTTCCCCGCGCCAGCGGGGATAAACCG,GTGTTCCCCGCGCCAGCGGGGATAAACCG,GTGTTCCCCGCGCCAGCGGGGATAAACCG	29,29,29	1	1	957863-957894	NZ_AP019675.1_1275946-1275977	I-E:I-E:I-E	12,13,13	13	Unclear	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	NA|47aa|up_1|NZ_AP019675.1_956543_956684_-,NA	NA|434aa|up_9|NZ_AP019675.1_947240_948542_+	PRK13168, rumA, 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD	NA|745aa|up_8|NZ_AP019675.1_948589_950824_+	PRK10872, relA, (p)ppGpp synthetase I/GTP pyrophosphokinase; Provisional	NA|83aa|up_7|NZ_AP019675.1_950901_951150_+	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|112aa|up_6|NZ_AP019675.1_951149_951485_+	PRK09907, PRK09907, endoribonuclease MazF	NA|264aa|up_5|NZ_AP019675.1_951555_952347_+	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|546aa|up_4|NZ_AP019675.1_952574_954212_+	PRK05380, pyrG, CTP synthetase; Validated	NA|433aa|up_3|NZ_AP019675.1_954299_955598_+	PRK00077, eno, enolase; Provisional	NA|291aa|up_2|NZ_AP019675.1_955657_956530_-	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|47aa|up_1|NZ_AP019675.1_956543_956684_-	NA	NA|224aa|up_0|NZ_AP019675.1_956822_957494_+	TIGR04322, organic_radical_activating_enzyme, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE	NA|493aa|down_0|NZ_AP019675.1_959293_960772_-	cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins	NA|426aa|down_1|NZ_AP019675.1_960798_962076_-	cd06174, MFS, Major Facilitator Superfamily	NA|262aa|down_2|NZ_AP019675.1_962394_963180_+	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs	NA|485aa|down_3|NZ_AP019675.1_963249_964704_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|470aa|down_4|NZ_AP019675.1_964725_966135_+	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|260aa|down_5|NZ_AP019675.1_966112_966892_+	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]	NA|287aa|down_6|NZ_AP019675.1_966888_967749_+	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|192aa|down_7|NZ_AP019675.1_967896_968472_-	COG1954, GlpP, Glycerol-3-phosphate responsive antiterminator (mRNA-binding) [Transcription]	NA|87aa|down_8|NZ_AP019675.1_968488_968749_-	COG2440, FixX, Ferredoxin-like protein [Energy production and conversion]	NA|424aa|down_9|NZ_AP019675.1_968739_970011_-	PRK10015, PRK10015, oxidoreductase; Provisional
GCF_009176965.1_ASM917696v1	NZ_AP019675	Escherichia coli strain GSH8M-2	2	984860-985742	2,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	Type I-E	GTGTTCCCCGCGCCAGCGGGGATAAACCG,GAGTTCCCCGCGTCAGCGGGGATAAACCG,GTTCCCCGCGTCAGCGGGGATAAACCG	29,29,27	0	0	NA	NA	I-E:I-E:I-E	14,14,14	14	TypeI-E	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	NA,NA	NA|245aa|up_9|NZ_AP019675.1_974354_975089_+	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	NA|51aa|up_8|NZ_AP019675.1_975353_975506_+	pfam01848, HOK_GEF, Hok/gef family	cas3|899aa|up_7|NZ_AP019675.1_975569_978266_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|521aa|up_6|NZ_AP019675.1_978876_980439_+	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1	cse2gr11|201aa|up_5|NZ_AP019675.1_980428_981031_+	TIGR02548, CRISPR_system_Cascade_subunit_CasB, CRISPR type I-E/ECOLI-associated protein CasB/Cse2	cas7|353aa|up_4|NZ_AP019675.1_981053_982112_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|242aa|up_3|NZ_AP019675.1_982121_982847_+	cd09756, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|236aa|up_2|NZ_AP019675.1_982846_983554_+	cd09664, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas1|306aa|up_1|NZ_AP019675.1_983550_984468_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|95aa|up_0|NZ_AP019675.1_984469_984754_+	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	NA|346aa|down_0|NZ_AP019675.1_985824_986862_-	PRK10199, PRK10199, alkaline phosphatase isozyme conversion aminopeptidase; Provisional	NA|303aa|down_1|NZ_AP019675.1_987113_988022_+	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|476aa|down_2|NZ_AP019675.1_988023_989451_+	PRK05124, cysN, sulfate adenylyltransferase subunit 1; Provisional	NA|202aa|down_3|NZ_AP019675.1_989450_990056_+	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|108aa|down_4|NZ_AP019675.1_990105_990429_+	pfam12084, DUF3561, Protein of unknown function (DUF3561)	NA|104aa|down_5|NZ_AP019675.1_990622_990934_+	PRK00888, ftsB, cell division protein FtsB; Reviewed	NA|237aa|down_6|NZ_AP019675.1_990952_991663_+	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|160aa|down_7|NZ_AP019675.1_991662_992142_+	PRK00084, ispF, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; Reviewed	NA|350aa|down_8|NZ_AP019675.1_992138_993188_+	PRK00984, truD, tRNA pseudouridine synthase D; Reviewed	NA|254aa|down_9|NZ_AP019675.1_993168_993930_+	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional
GCF_009176965.1_ASM917696v1	NZ_AP019675	Escherichia coli strain GSH8M-2	3	1473730-1473847	3	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	Orphan	CCGAGCCGTAGGCCGGATAAGGCGTTCACGC	31	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	NA,NA	NA|62aa|up_9|NZ_AP019675.1_1462919_1463105_-	PRK09956, PRK09956, ISNCY family transposase	NA|332aa|up_8|NZ_AP019675.1_1463117_1464113_-	PRK09956, PRK09956, ISNCY family transposase	NA|397aa|up_7|NZ_AP019675.1_1464305_1465496_-	TIGR03379, glycerol3P_GlpC, glycerol-3-phosphate dehydrogenase, anaerobic, C subunit	NA|420aa|up_6|NZ_AP019675.1_1465492_1466752_-	COG3075, GlpB, Anaerobic glycerol-3-phosphate dehydrogenase [Amino acid transport and metabolism]	NA|543aa|up_5|NZ_AP019675.1_1466741_1468370_-	PRK11101, glpA, anaerobic glycerol-3-phosphate dehydrogenase subunit A	NA|453aa|up_4|NZ_AP019675.1_1468642_1470001_+	PRK11273, glpT, glycerol-3-phosphate transporter	NA|359aa|up_3|NZ_AP019675.1_1470005_1471082_+	PRK11143, glpQ, glycerophosphodiester phosphodiesterase; Provisional	NA|217aa|up_2|NZ_AP019675.1_1471544_1472195_+	PRK09902, PRK09902, lipopolysaccharide kinase InaA	NA|85aa|up_1|NZ_AP019675.1_1472248_1472503_-	PRK10713, PRK10713, 2Fe-2S ferredoxin-like protein	NA|377aa|up_0|NZ_AP019675.1_1472502_1473633_-	PRK09101, nrdB, ribonucleotide-diphosphate reductase subunit beta; Reviewed	NA|762aa|down_0|NZ_AP019675.1_1473866_1476152_-	PRK09103, PRK09103, ribonucleoside-diphosphate reductase subunit alpha	NA|1251aa|down_1|NZ_AP019675.1_1476847_1480600_+	PRK09752, PRK09752, AIDA-I family autotransporter YfaL	NA|241aa|down_2|NZ_AP019675.1_1480727_1481450_-	PRK05134, PRK05134, bifunctional 2-polyprenyl-6-hydroxyphenol methylase/3-demethylubiquinol 3-O-methyltransferase UbiG	NA|876aa|down_3|NZ_AP019675.1_1481596_1484224_+	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|563aa|down_4|NZ_AP019675.1_1484372_1486061_+	COG4685, COG4685, Uncharacterized protein conserved in bacteria [Function unknown]	NA|208aa|down_5|NZ_AP019675.1_1486057_1486681_+	COG3234, COG3234, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1465aa|down_6|NZ_AP019675.1_1486824_1491219_+	COG2373, COG2373, Large extracellular alpha-helical protein [General function prediction only]	NA|496aa|down_7|NZ_AP019675.1_1491219_1492707_+	COG5445, COG5445, Predicted secreted protein [Function unknown]	NA|259aa|down_8|NZ_AP019675.1_1492711_1493488_+	COG4676, COG4676, Uncharacterized protein conserved in bacteria [Function unknown]	NA|395aa|down_9|NZ_AP019675.1_1493561_1494746_-	PRK05790, PRK05790, putative acyltransferase; Provisional
GCF_009176965.1_ASM917696v1	NZ_AP019675	Escherichia coli strain GSH8M-2	4	1737063-1737196	4	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	Orphan	TGTAGGCCTGATAAGACGCGCCAGCGTCGCATCAGGCA	38	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	NA,NA	NA|249aa|up_9|NZ_AP019675.1_1725833_1726580_+	PRK10063, PRK10063, colanic acid biosynthesis glycosyltransferase WcaE	NA|183aa|up_8|NZ_AP019675.1_1726595_1727144_+	TIGR04008, WcaF, colanic acid biosynthesis acetyltransferase WcaF	NA|374aa|up_7|NZ_AP019675.1_1727170_1728292_+	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|322aa|up_6|NZ_AP019675.1_1728294_1729260_+	cd05239, GDP_FS_SDR_e, GDP-fucose synthetase, extended (e) SDRs	NA|160aa|up_5|NZ_AP019675.1_1729262_1729742_+	PRK15434, PRK15434, GDP-mannose mannosyl hydrolase	NA|408aa|up_4|NZ_AP019675.1_1729738_1730962_+	TIGR04007, wcaI, colanic acid biosynthesis glycosyl transferase WcaI	NA|479aa|up_3|NZ_AP019675.1_1730964_1732401_+	PRK15460, cpsB, mannose-1-phosphate guanyltransferase; Provisional	NA|457aa|up_2|NZ_AP019675.1_1732681_1734052_+	PRK15414, PRK15414, phosphomannomutase	NA|465aa|up_1|NZ_AP019675.1_1734106_1735501_+	PRK10124, PRK10124, putative UDP-glucose lipid carrier transferase; Provisional	NA|493aa|up_0|NZ_AP019675.1_1735502_1736981_+	PRK10459, PRK10459, MOP flippase family protein	NA|427aa|down_0|NZ_AP019675.1_1737256_1738537_+	TIGR04006, wcaK, colanic acid biosynthesis pyruvyl transferase WcaK	NA|407aa|down_1|NZ_AP019675.1_1738533_1739754_+	TIGR04005, wcaL, colanic acid biosynthesis glycosyltransferase WcaL	NA|465aa|down_2|NZ_AP019675.1_1739764_1741159_+	PRK10123, wcaM, putative colanic acid biosynthesis protein; Provisional	NA|332aa|down_3|NZ_AP019675.1_1741316_1742312_+	cd05238, Gne_like_SDR_e, Escherichia coli Gne (a nucleoside-diphosphate-sugar 4-epimerase)-like, extended (e) SDRs	NA|298aa|down_4|NZ_AP019675.1_1742554_1743448_+	PRK10122, PRK10122, UTP--glucose-1-phosphate uridylyltransferase GalF	NA|359aa|down_5|NZ_AP019675.1_1743819_1744896_+	PRK10084, PRK10084, dTDP-glucose 4,6 dehydratase; Provisional	NA|291aa|down_6|NZ_AP019675.1_1744892_1745765_+	TIGR01207, Glucose-1-phosphate_thymidylyltransferase_1, glucose-1-phosphate thymidylyltransferase, short form	NA|136aa|down_7|NZ_AP019675.1_1745757_1746165_+	cd20292, cupin_QdtA-like, sugar 3,4-ketoisomerase QdtA and related proteins, cupin domain	NA|178aa|down_8|NZ_AP019675.1_1746157_1746691_+	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|368aa|down_9|NZ_AP019675.1_1746703_1747807_+	cd00616, AHBA_syn, 3-amino-5-hydroxybenzoic acid synthase family (AHBA_syn)
GCF_009176965.1_ASM917696v1	NZ_AP019675	Escherichia coli strain GSH8M-2	5	2149105-2149228	5	CRISPRCasFinder	no	DEDDh	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	Unclear	CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA	43	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	NA,NA|30aa|down_7|NZ_AP019675.1_2158124_2158214_+	NA|471aa|up_9|NZ_AP019675.1_2138559_2139972_-	PRK09206, PRK09206, pyruvate kinase PykF	NA|70aa|up_8|NZ_AP019675.1_2140528_2140738_+	PRK10292, PRK10292, fumarate hydratase FumD	NA|209aa|up_7|NZ_AP019675.1_2141192_2141819_+	PRK09898, PRK09898, ferredoxin-like protein	NA|701aa|up_6|NZ_AP019675.1_2141839_2143942_+	PRK09849, PRK09849, putative oxidoreductase; Provisional	NA|213aa|up_5|NZ_AP019675.1_2143954_2144593_+	PRK09947, PRK09947, YdhW family putative oxidoreductase system protein	NA|223aa|up_4|NZ_AP019675.1_2144656_2145325_+	TIGR03149, cyt_nit_nrfC, cytochrome c nitrite reductase, Fe-S protein	NA|262aa|up_3|NZ_AP019675.1_2145321_2146107_+	PRK15006, PRK15006, thiosulfate reductase cytochrome B subunit; Provisional	NA|271aa|up_2|NZ_AP019675.1_2146110_2146923_+	PRK09946, PRK09946, hypothetical protein; Provisional	NA|535aa|up_1|NZ_AP019675.1_2146934_2148539_-	PRK09897, PRK09897, FAD-NAD(P)-binding protein	NA|102aa|up_0|NZ_AP019675.1_2148664_2148970_-	PRK11118, PRK11118, putative monooxygenase; Provisional	NA|419aa|down_0|NZ_AP019675.1_2149542_2150799_+	PRK09945, PRK09945, hypothetical protein; Provisional	NA|458aa|down_1|NZ_AP019675.1_2150839_2152213_-	PRK01766, PRK01766, multidrug efflux protein; Reviewed	NA|214aa|down_2|NZ_AP019675.1_2152427_2153069_+	PRK13020, PRK13020, riboflavin synthase subunit alpha; Provisional	NA|383aa|down_3|NZ_AP019675.1_2153108_2154257_-	PRK11705, PRK11705, cyclopropane fatty acyl phospholipid synthase	NA|404aa|down_4|NZ_AP019675.1_2154547_2155759_-	PRK11043, PRK11043, Bcr/CflA family multidrug efflux MFS transporter	NA|311aa|down_5|NZ_AP019675.1_2155871_2156804_+	PRK11074, PRK11074, putative DNA-binding transcriptional regulator; Provisional	NA|342aa|down_6|NZ_AP019675.1_2156800_2157826_-	PRK10703, PRK10703, HTH-type transcriptional repressor PurR	NA|30aa|down_7|NZ_AP019675.1_2158124_2158214_+	NA	NA|390aa|down_8|NZ_AP019675.1_2158379_2159549_+	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]	NA|194aa|down_9|NZ_AP019675.1_2159783_2160365_-	PRK10543, PRK10543, superoxide dismutase [Fe]
GCF_009176965.1_ASM917696v1	NZ_AP019675	Escherichia coli strain GSH8M-2	6	2856592-2856683	6	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	Orphan	CCACCTTTTTTACCTGCTTCAGATGC	26	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	NA|70aa|up_9|NZ_AP019675.1_2845667_2845877_-,NA	NA|70aa|up_9|NZ_AP019675.1_2845667_2845877_-	NA	NA|1321aa|up_8|NZ_AP019675.1_2845931_2849894_+	PRK11809, putA, trifunctional transcriptional regulator/proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase; Reviewed	NA|213aa|up_7|NZ_AP019675.1_2849933_2850572_-	PRK15008, PRK15008, HTH-type transcriptional regulator RutR; Provisional	NA|364aa|up_6|NZ_AP019675.1_2850859_2851951_+	TIGR03612, RutA, pyrimidine utilization protein A	NA|231aa|up_5|NZ_AP019675.1_2851950_2852643_+	TIGR03614, RutB, pyrimidine utilization protein B	NA|129aa|up_4|NZ_AP019675.1_2852654_2853041_+	TIGR03610, RutC, pyrimidine utilization protein C	NA|267aa|up_3|NZ_AP019675.1_2853048_2853849_+	TIGR03611, RutD, pyrimidine utilization protein D	NA|197aa|up_2|NZ_AP019675.1_2853858_2854449_+	PRK05365, PRK05365, malonic semialdehyde reductase; Provisional	NA|165aa|up_1|NZ_AP019675.1_2854459_2854954_+	TIGR03615, flavoprotein_oxidoreductase, pyrimidine utilization flavin reductase protein F	NA|443aa|up_0|NZ_AP019675.1_2854974_2856303_+	TIGR03616, Putative_pyrimidine_permease_RutG, pyrimidine utilization transport protein G	NA|199aa|down_0|NZ_AP019675.1_2857106_2857703_+	PRK03767, PRK03767, NAD(P)H:quinone oxidoreductase; Provisional	NA|76aa|down_1|NZ_AP019675.1_2857723_2857951_+	PRK10174, PRK10174, hypothetical protein; Provisional	NA|414aa|down_2|NZ_AP019675.1_2857988_2859230_-	PRK10173, PRK10173, glucose-1-phosphatase/inositol phosphatase; Provisional	NA|307aa|down_3|NZ_AP019675.1_2861038_2861959_+	PRK10266, PRK10266, curved DNA-binding protein	NA|102aa|down_4|NZ_AP019675.1_2861958_2862264_+	PRK10265, PRK10265, chaperone modulator CbpM	NA|200aa|down_5|NZ_AP019675.1_2862415_2863015_-	PRK04976, torD, chaperone protein TorD; Validated	NA|849aa|down_6|NZ_AP019675.1_2863011_2865558_-	PRK15102, PRK15102, trimethylamine-N-oxide reductase TorA	NA|391aa|down_7|NZ_AP019675.1_2865557_2866730_-	PRK15032, PRK15032, pentaheme c-type cytochrome TorC	NA|231aa|down_8|NZ_AP019675.1_2866859_2867552_+	PRK10766, PRK10766, two-component system response regulator TorR	NA|343aa|down_9|NZ_AP019675.1_2867524_2868553_-	PRK10936, PRK10936, TMAO reductase system periplasmic protein TorT; Provisional
GCF_009176965.1_ASM917696v1	NZ_AP019675	Escherichia coli strain GSH8M-2	7	2998191-2998396	7,3,3	CRISPRCasFinder,CRT,PILER-CR	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	Orphan	TTTCTAAGCTGCCTGTACGGCAGTGAAC,TTTCTAAGCTGCCTGTACGGCAGTGAAC,TTTCTAAGCTGCCTGTACGGCAGTGAACG	28,28,29	0	0	NA	NA	I-F:I-F:I-F	3,3,2	3	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	NA,NA	NA|431aa|up_9|NZ_AP019675.1_2983314_2984607_-	PRK05431, PRK05431, seryl-tRNA synthetase; Provisional	NA|448aa|up_8|NZ_AP019675.1_2984697_2986041_-	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|204aa|up_7|NZ_AP019675.1_2986051_2986663_-	TIGR00547, Outer-membrane_lipoprotein_carrier_protein, periplasmic chaperone LolA	NA|1330aa|up_6|NZ_AP019675.1_2986817_2990807_-	PRK10263, PRK10263, DNA translocase FtsK; Provisional	NA|165aa|up_5|NZ_AP019675.1_2990941_2991436_-	PRK11169, PRK11169, leucine-responsive transcriptional regulator Lrp	NA|322aa|up_4|NZ_AP019675.1_2991980_2992946_+	PRK10262, PRK10262, thioredoxin reductase; Provisional	NA|589aa|up_3|NZ_AP019675.1_2993068_2994835_+	PRK11174, PRK11174, cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed	NA|574aa|up_2|NZ_AP019675.1_2994835_2996557_+	PRK11160, PRK11160, cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed	NA|235aa|up_1|NZ_AP019675.1_2996598_2997303_+	PRK00301, aat, leucyl/phenylalanyl-tRNA--protein transferase; Reviewed	NA|73aa|up_0|NZ_AP019675.1_2997587_2997806_+	PRK00276, infA, translation initiation factor IF-1; Validated	NA|759aa|down_0|NZ_AP019675.1_2998606_3000883_-	PRK11034, clpA, ATP-dependent Clp protease ATP-binding subunit; Provisional	NA|107aa|down_1|NZ_AP019675.1_3000913_3001234_-	PRK00033, clpS, ATP-dependent Clp protease adaptor protein ClpS; Reviewed	NA|75aa|down_2|NZ_AP019675.1_3001556_3001781_+	PRK09937, PRK09937, cold shock-like protein CspD	NA|649aa|down_3|NZ_AP019675.1_3001853_3003800_-	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|372aa|down_4|NZ_AP019675.1_3003796_3004912_-	PRK11578, PRK11578, macrolide transporter subunit MacA; Provisional	NA|319aa|down_5|NZ_AP019675.1_3005062_3006019_+	COG2990, VirK, Uncharacterized protein conserved in bacteria [Function unknown]	NA|553aa|down_6|NZ_AP019675.1_3006015_3007674_-	COG3593, COG3593, Predicted ATP-dependent endonuclease of the OLD family [DNA replication, recombination, and repair]	NA|232aa|down_7|NZ_AP019675.1_3008098_3008794_+	PRK05420, PRK05420, aquaporin Z; Provisional	NA|300aa|down_8|NZ_AP019675.1_3009288_3010188_+	COG2431, COG2431, Predicted membrane protein [Function unknown]	NA|551aa|down_9|NZ_AP019675.1_3010331_3011984_+	PRK05290, PRK05290, hybrid cluster protein; Provisional
GCF_009176965.1_ASM917696v1	NZ_AP019675	Escherichia coli strain GSH8M-2	8	3154936-3155080	8	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	Orphan	GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC	52	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	NA|94aa|up_9|NZ_AP019675.1_3149043_3149325_+,NA|205aa|up_6|NZ_AP019675.1_3150191_3150806_+,NA|113aa|up_5|NZ_AP019675.1_3151099_3151438_+,NA|56aa|up_3|NZ_AP019675.1_3151978_3152146_+,NA	NA|94aa|up_9|NZ_AP019675.1_3149043_3149325_+	NA	NA|74aa|up_8|NZ_AP019675.1_3149423_3149645_+	PHA00080, PHA00080, DksA-like zinc finger domain containing protein	NA|84aa|up_7|NZ_AP019675.1_3149641_3149893_+	pfam13935, Ead_Ea22, Ead/Ea22-like protein	NA|205aa|up_6|NZ_AP019675.1_3150191_3150806_+	NA	NA|113aa|up_5|NZ_AP019675.1_3151099_3151438_+	NA	NA|143aa|up_4|NZ_AP019675.1_3151466_3151895_+	pfam10711, DUF2513, Hypothetical protein (DUF2513)	NA|56aa|up_3|NZ_AP019675.1_3151978_3152146_+	NA	NA|73aa|up_2|NZ_AP019675.1_3152185_3152404_+	pfam07825, Exc, Excisionase-like protein	NA|357aa|up_1|NZ_AP019675.1_3152381_3153452_+	cd00800, INT_Lambda_C, C-terminal catalytic domain of Lambda integrase, a tyrosine-based site-specific recombinase	NA|428aa|up_0|NZ_AP019675.1_3153586_3154870_+	PRK10531, PRK10531, putative acyl-CoA thioester hydrolase	NA|754aa|down_0|NZ_AP019675.1_3155103_3157365_-	PRK11413, PRK11413, putative hydratase; Provisional	NA|351aa|down_1|NZ_AP019675.1_3159056_3160109_-	NF033377, OMA_tautomer, 4-oxalomesaconate tautomerase	NA|318aa|down_2|NZ_AP019675.1_3160292_3161246_+	cd08440, PBP2_LTTR_like_4, TThe C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|332aa|down_3|NZ_AP019675.1_3161286_3162282_-	PRK11028, PRK11028, 6-phosphogluconolactonase; Provisional	NA|273aa|down_4|NZ_AP019675.1_3162436_3163255_+	PRK10530, PRK10530, pyridoxal phosphate (PLP) phosphatase; Provisional	NA|353aa|down_5|NZ_AP019675.1_3163255_3164314_-	PRK11144, modC, molybdenum ABC transporter ATP-binding protein ModC	NA|230aa|down_6|NZ_AP019675.1_3164316_3165006_-	PRK09421, modB, molybdate ABC transporter permease subunit	NA|258aa|down_7|NZ_AP019675.1_3165005_3165779_-	PRK10677, modA, molybdate transporter periplasmic protein; Provisional	NA|50aa|down_8|NZ_AP019675.1_3165944_3166094_-	pfam10766, AcrZ, Multidrug efflux pump-associated protein AcrZ	NA|263aa|down_9|NZ_AP019675.1_3166222_3167011_+	PRK10676, PRK10676, DNA-binding transcriptional regulator ModE; Provisional
GCF_009176965.1_ASM917696v1	NZ_AP019675	Escherichia coli strain GSH8M-2	9	3387049-3387145	9	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	Orphan	TTGTAGGCCTGATAAGATGCGTCAAGC	27	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	NA,NA	NA|231aa|up_9|NZ_AP019675.1_3378067_3378760_-	PRK15195, PRK15195, molecular chaperone FimC	NA|181aa|up_8|NZ_AP019675.1_3379756_3380299_-	PRK15194, PRK15194, type 1 fimbrial protein subunit FimA	NA|289aa|up_7|NZ_AP019675.1_3380769_3381636_+	PRK10792, PRK10792, bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase FolD	NA|71aa|up_6|NZ_AP019675.1_3381637_3381850_+	PRK11507, PRK11507, ribosome-associated protein YbcJ	NA|174aa|up_5|NZ_AP019675.1_3381957_3382479_+	COG1988, COG1988, Predicted membrane-bound metal-dependent hydrolases [General function prediction only]	NA|462aa|up_4|NZ_AP019675.1_3382514_3383900_-	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|165aa|up_3|NZ_AP019675.1_3384073_3384568_+	PRK10791, PRK10791, peptidylprolyl isomerase B	NA|241aa|up_2|NZ_AP019675.1_3384570_3385293_+	PRK05340, PRK05340, UDP-2,3-diacylglucosamine hydrolase; Provisional	NA|170aa|up_1|NZ_AP019675.1_3385410_3385920_+	COG0041, PurE, Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase [Nucleotide transport and metabolism]	NA|356aa|up_0|NZ_AP019675.1_3385916_3386984_+	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed	NA|298aa|down_0|NZ_AP019675.1_3387178_3388072_-	PRK09411, PRK09411, carbamate kinase; Reviewed	NA|272aa|down_1|NZ_AP019675.1_3388068_3388884_-	pfam11392, DUF2877, Protein of unknown function (DUF2877)	NA|420aa|down_2|NZ_AP019675.1_3388894_3390154_-	pfam06545, DUF1116, Protein of unknown function (DUF1116)	NA|556aa|down_3|NZ_AP019675.1_3390163_3391831_-	PRK06091, PRK06091, membrane protein FdrA; Validated	NA|350aa|down_4|NZ_AP019675.1_3392146_3393196_+	PRK15025, PRK15025, ureidoglycolate dehydrogenase; Provisional	NA|412aa|down_5|NZ_AP019675.1_3393217_3394453_+	TIGR03176, AllC, allantoate amidohydrolase	NA|262aa|down_6|NZ_AP019675.1_3394463_3395249_+	TIGR03214, ura-cupin, putative allantoin catabolism protein	NA|382aa|down_7|NZ_AP019675.1_3395476_3396622_-	PRK09932, PRK09932, glycerate 3-kinase	NA|434aa|down_8|NZ_AP019675.1_3396643_3397945_-	PRK11412, PRK11412, uracil/xanthine transporter	NA|454aa|down_9|NZ_AP019675.1_3398001_3399363_-	PRK08044, PRK08044, allantoinase AllB
GCF_009176965.1_ASM917696v1	NZ_AP019675	Escherichia coli strain GSH8M-2	10	3938473-3938608	4	PILER-CR	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	Orphan	TGAATCACCAATATTGAAAA	20	0	0	NA	NA	NA	2	2	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	NA,NA	NA|126aa|up_9|NZ_AP019675.1_3929179_3929557_+	PRK05461, apaG, CO2+/MG2+ efflux protein ApaG; Reviewed	NA|281aa|up_8|NZ_AP019675.1_3929563_3930406_+	TIGR00668, Bis5'-nucleosyl-tetraphosphatase_symmetrical, bis(5'-nucleosyl)-tetraphosphatase (symmetrical)	NA|160aa|up_7|NZ_AP019675.1_3930483_3930963_-	PRK10769, folA, type 3 dihydrofolate reductase	NA|621aa|up_6|NZ_AP019675.1_3931154_3933017_-	PRK03562, PRK03562, glutathione-regulated potassium-efflux system protein KefC; Provisional	NA|177aa|up_5|NZ_AP019675.1_3933009_3933540_-	PRK00871, PRK00871, glutathione-regulated potassium-efflux system oxidoreductase KefF	NA|444aa|up_4|NZ_AP019675.1_3933647_3934979_-	cd17316, MFS_SV2_like, Metazoan Synaptic vesicle glycoprotein 2 (SV2) and related small molecule transporters of the Major Facilitator Superfamily	NA|96aa|up_3|NZ_AP019675.1_3935037_3935325_-	PRK15449, PRK15449, ferredoxin-like protein FixX; Provisional	NA|429aa|up_2|NZ_AP019675.1_3935321_3936608_-	PRK10157, PRK10157, putative oxidoreductase FixC; Provisional	NA|314aa|up_1|NZ_AP019675.1_3936658_3937600_-	PRK03363, fixB, electron transfer flavoprotein subunit alpha/FixB family protein	NA|257aa|up_0|NZ_AP019675.1_3937614_3938385_-	PRK03359, PRK03359, putative electron transfer flavoprotein FixA; Reviewed	NA|505aa|down_0|NZ_AP019675.1_3938857_3940372_+	PRK03356, PRK03356, L-carnitine/gamma-butyrobetaine antiport BCCT transporter	NA|381aa|down_1|NZ_AP019675.1_3940402_3941545_+	PRK03354, PRK03354, crotonobetainyl-CoA dehydrogenase; Validated	NA|406aa|down_2|NZ_AP019675.1_3941673_3942891_+	PRK03525, PRK03525, L-carnitine CoA-transferase	NA|518aa|down_3|NZ_AP019675.1_3942964_3944518_+	PRK08008, caiC, putative crotonobetaine/carnitine-CoA ligase; Validated	NA|262aa|down_4|NZ_AP019675.1_3944626_3945412_+	PRK03580, PRK03580, crotonobetainyl-CoA hydratase	NA|197aa|down_5|NZ_AP019675.1_3945417_3946008_+	PRK13627, PRK13627, carnitine operon protein CaiE; Provisional	NA|132aa|down_6|NZ_AP019675.1_3946216_3946612_-	PRK11476, PRK11476, carnitine metabolism transcriptional regulator CaiF	NA|1074aa|down_7|NZ_AP019675.1_3946873_3950095_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|383aa|down_8|NZ_AP019675.1_3950112_3951261_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|274aa|down_9|NZ_AP019675.1_3951716_3952538_-	COG0289, DapB, Dihydrodipicolinate reductase [Amino acid transport and metabolism]
GCF_009176965.1_ASM917696v1	NZ_AP019675	Escherichia coli strain GSH8M-2	11	3946046-3946187	10	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	Orphan	GCTGGAGAGCAACCGTAGGCCGGATAAGATGCGCCAGCATCGCATCCGGCGA	52	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	NA,NA	NA|96aa|up_9|NZ_AP019675.1_3935037_3935325_-	PRK15449, PRK15449, ferredoxin-like protein FixX; Provisional	NA|429aa|up_8|NZ_AP019675.1_3935321_3936608_-	PRK10157, PRK10157, putative oxidoreductase FixC; Provisional	NA|314aa|up_7|NZ_AP019675.1_3936658_3937600_-	PRK03363, fixB, electron transfer flavoprotein subunit alpha/FixB family protein	NA|257aa|up_6|NZ_AP019675.1_3937614_3938385_-	PRK03359, PRK03359, putative electron transfer flavoprotein FixA; Reviewed	NA|505aa|up_5|NZ_AP019675.1_3938857_3940372_+	PRK03356, PRK03356, L-carnitine/gamma-butyrobetaine antiport BCCT transporter	NA|381aa|up_4|NZ_AP019675.1_3940402_3941545_+	PRK03354, PRK03354, crotonobetainyl-CoA dehydrogenase; Validated	NA|406aa|up_3|NZ_AP019675.1_3941673_3942891_+	PRK03525, PRK03525, L-carnitine CoA-transferase	NA|518aa|up_2|NZ_AP019675.1_3942964_3944518_+	PRK08008, caiC, putative crotonobetaine/carnitine-CoA ligase; Validated	NA|262aa|up_1|NZ_AP019675.1_3944626_3945412_+	PRK03580, PRK03580, crotonobetainyl-CoA hydratase	NA|197aa|up_0|NZ_AP019675.1_3945417_3946008_+	PRK13627, PRK13627, carnitine operon protein CaiE; Provisional	NA|132aa|down_0|NZ_AP019675.1_3946216_3946612_-	PRK11476, PRK11476, carnitine metabolism transcriptional regulator CaiF	NA|1074aa|down_1|NZ_AP019675.1_3946873_3950095_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|383aa|down_2|NZ_AP019675.1_3950112_3951261_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|274aa|down_3|NZ_AP019675.1_3951716_3952538_-	COG0289, DapB, Dihydrodipicolinate reductase [Amino acid transport and metabolism]	NA|305aa|down_4|NZ_AP019675.1_3952704_3953619_-	PRK10768, PRK10768, ribonucleoside hydrolase RihC; Provisional	NA|317aa|down_5|NZ_AP019675.1_3953684_3954635_-	PRK01045, ispH, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed	NA|150aa|down_6|NZ_AP019675.1_3954636_3955086_-	PRK15095, PRK15095, FKBP-type peptidyl-prolyl cis-trans isomerase; Provisional	NA|165aa|down_7|NZ_AP019675.1_3955210_3955705_-	PRK00376, lspA, lipoprotein signal peptidase	NA|939aa|down_8|NZ_AP019675.1_3955704_3958521_-	PRK05743, ileS, isoleucyl-tRNA synthetase; Reviewed	NA|314aa|down_9|NZ_AP019675.1_3958563_3959505_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase
GCF_009176965.1_ASM917696v1	NZ_AP019678	Escherichia coli strain GSH8M-2 plasmid pGSH8M-2-3, complete sequence	1	49072-49191	1	CRISPRCasFinder	no			Orphan	TGAAAGGTGGATGGGTACGCACTGAAAGGTGGATAGGTACGCA	43	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,RT,DEDDh,DinG,c2c9_V-U4,Cas14u_CAS-V	NA|386aa|up_9|NZ_AP019678.1_43548_44706_-,NA|182aa|up_8|NZ_AP019678.1_44708_45254_-,NA|91aa|up_7|NZ_AP019678.1_45580_45853_-,NA|50aa|up_6|NZ_AP019678.1_45865_46015_-,NA|110aa|up_5|NZ_AP019678.1_46301_46631_-,NA|72aa|up_4|NZ_AP019678.1_46723_46939_-,NA|82aa|up_3|NZ_AP019678.1_46928_47174_-,NA|108aa|up_2|NZ_AP019678.1_47218_47542_-,NA	NA|386aa|up_9|NZ_AP019678.1_43548_44706_-	NA	NA|182aa|up_8|NZ_AP019678.1_44708_45254_-	NA	NA|91aa|up_7|NZ_AP019678.1_45580_45853_-	NA	NA|50aa|up_6|NZ_AP019678.1_45865_46015_-	NA	NA|110aa|up_5|NZ_AP019678.1_46301_46631_-	NA	NA|72aa|up_4|NZ_AP019678.1_46723_46939_-	NA	NA|82aa|up_3|NZ_AP019678.1_46928_47174_-	NA	NA|108aa|up_2|NZ_AP019678.1_47218_47542_-	NA	NA|94aa|up_1|NZ_AP019678.1_47687_47969_-	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|84aa|up_0|NZ_AP019678.1_47958_48210_-	PRK02854, PRK02854, primosomal protein DnaT	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
