assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002196475.1_ASM219647v1	NZ_CP021840	Escherichia coli strain EC974 chromosome, complete genome	1	299878-300333	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG	Type I-E	CGGTTTATCCCCGCTGGCGCGGGGAACAC,CGGTTTATCCCCGCTGGCGCGGGGAACAC,CGGTTTATCCCCGCTGGCGCGGGGAACAC	29,29,29	0	0	NA	NA	I-E:I-E:I-E	7,7,7	7	TypeI-E	cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG,RT	NA,NA	NA|254aa|up_9|NZ_CP021840.1_291690_292452_-	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional	NA|350aa|up_8|NZ_CP021840.1_292432_293482_-	PRK00984, truD, tRNA pseudouridine synthase D; Reviewed	NA|160aa|up_7|NZ_CP021840.1_293478_293958_-	PRK00084, ispF, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; Reviewed	NA|237aa|up_6|NZ_CP021840.1_293957_294668_-	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|104aa|up_5|NZ_CP021840.1_294686_294998_-	PRK00888, ftsB, cell division protein FtsB; Reviewed	NA|108aa|up_4|NZ_CP021840.1_295191_295515_-	pfam12084, DUF3561, Protein of unknown function (DUF3561)	NA|202aa|up_3|NZ_CP021840.1_295564_296170_-	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|476aa|up_2|NZ_CP021840.1_296169_297597_-	PRK05124, cysN, sulfate adenylyltransferase subunit 1; Provisional	NA|303aa|up_1|NZ_CP021840.1_297598_298507_-	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|346aa|up_0|NZ_CP021840.1_298758_299796_+	PRK10199, PRK10199, alkaline phosphatase isozyme conversion aminopeptidase; Provisional	cas2|98aa|down_0|NZ_CP021840.1_300429_300723_-	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	cas1|308aa|down_1|NZ_CP021840.1_300719_301643_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|217aa|down_2|NZ_CP021840.1_301639_302290_-	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cas5|249aa|down_3|NZ_CP021840.1_302271_303018_-	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|352aa|down_4|NZ_CP021840.1_303028_304084_-	TIGR01869, CRISPR_system_Cascade_subunit_CasC, CRISPR-associated protein Cas7/Cse4/CasC, subtype I-E/ECOLI	cse2gr11|179aa|down_5|NZ_CP021840.1_304098_304635_-	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas8e|521aa|down_6|NZ_CP021840.1_304631_306194_-	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1	cas3|900aa|down_7|NZ_CP021840.1_306283_308983_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|51aa|down_8|NZ_CP021840.1_309174_309327_-	pfam01848, HOK_GEF, Hok/gef family	NA|245aa|down_9|NZ_CP021840.1_309592_310327_-	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase
GCF_002196475.1_ASM219647v1	NZ_CP021840	Escherichia coli strain EC974 chromosome, complete genome	2	326025-326297	2,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas3	cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG	Unclear	CGGTTTATCCCCGCTGGCGCGGGGAACTC,CGGTTTATCCCCGCTGGCGCGGGGAACTC,CGGTTTATCCCCGCTGGCGCGGGGAACTC	29,29,29	0	0	NA	NA	I-E:I-E:I-E	4,4,4	4	Unclear	cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG,RT	NA,NA	NA|424aa|up_9|NZ_CP021840.1_314670_315942_+	PRK10015, PRK10015, oxidoreductase; Provisional	NA|87aa|up_8|NZ_CP021840.1_315932_316193_+	COG2440, FixX, Ferredoxin-like protein [Energy production and conversion]	NA|192aa|up_7|NZ_CP021840.1_316209_316785_+	COG1954, GlpP, Glycerol-3-phosphate responsive antiterminator (mRNA-binding) [Transcription]	NA|287aa|up_6|NZ_CP021840.1_316931_317792_-	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|260aa|up_5|NZ_CP021840.1_317788_318568_-	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]	NA|446aa|up_4|NZ_CP021840.1_318545_319883_-	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|485aa|up_3|NZ_CP021840.1_319976_321431_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|262aa|up_2|NZ_CP021840.1_321500_322286_-	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs	NA|426aa|up_1|NZ_CP021840.1_322603_323881_+	cd06174, MFS, Major Facilitator Superfamily	NA|493aa|up_0|NZ_CP021840.1_323907_325386_+	cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins	NA|224aa|down_0|NZ_CP021840.1_326635_327307_-	TIGR04322, organic_radical_activating_enzyme, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE	NA|201aa|down_1|NZ_CP021840.1_327479_328082_+	COG1704, LemA, Uncharacterized conserved protein [Function unknown]	NA|293aa|down_2|NZ_CP021840.1_328095_328974_+	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|383aa|down_3|NZ_CP021840.1_328988_330137_+	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|305aa|down_4|NZ_CP021840.1_330133_331048_+	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|433aa|down_5|NZ_CP021840.1_331107_332406_-	PRK00077, eno, enolase; Provisional	NA|546aa|down_6|NZ_CP021840.1_332493_334131_-	PRK05380, pyrG, CTP synthetase; Validated	NA|264aa|down_7|NZ_CP021840.1_334358_335150_-	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|112aa|down_8|NZ_CP021840.1_335221_335557_-	PRK09907, PRK09907, endoribonuclease MazF	NA|83aa|down_9|NZ_CP021840.1_335556_335805_-	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE
GCF_002196475.1_ASM219647v1	NZ_CP021840	Escherichia coli strain EC974 chromosome, complete genome	3	543579-543698	3	CRISPRCasFinder	no		cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG	Orphan	AGCGTCGCATCAGACGTTGATTGCCGGATGCGG	33	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG,RT	NA,NA	NA|244aa|up_9|NZ_CP021840.1_536230_536962_+	TIGR00046, Ribosomal_RNA_small_subunit_methyltransferase_E, RNA methyltransferase, RsmE family	NA|317aa|up_8|NZ_CP021840.1_536974_537925_+	PRK05246, PRK05246, glutathione synthetase; Provisional	NA|188aa|up_7|NZ_CP021840.1_538033_538597_+	PRK00228, PRK00228, YqgE/AlgH family protein	NA|139aa|up_6|NZ_CP021840.1_538596_539013_+	PRK00109, PRK00109, Holliday junction resolvase RuvX	NA|327aa|up_5|NZ_CP021840.1_539196_540177_-	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|235aa|up_4|NZ_CP021840.1_540194_540899_+	cd06824, PLPDE_III_Yggs_like, Pyridoxal 5-phosphate (PLP)-binding TIM barrel domain of Type III PLP-Dependent Enzymes, Yggs-like proteins	NA|189aa|up_3|NZ_CP021840.1_540916_541483_+	pfam02325, YGGT, YGGT family	NA|97aa|up_2|NZ_CP021840.1_541479_541770_+	PRK05090, PRK05090, hypothetical protein; Validated	NA|198aa|up_1|NZ_CP021840.1_541777_542371_+	PRK00120, PRK00120, dITP/XTP pyrophosphatase; Reviewed	NA|379aa|up_0|NZ_CP021840.1_542363_543500_+	PRK05660, PRK05660, radical SAM family heme chaperone HemW	NA|336aa|down_0|NZ_CP021840.1_543742_544750_-	pfam06717, DUF1202, Protein of unknown function (DUF1202)	NA|349aa|down_1|NZ_CP021840.1_544866_545913_-	PRK11096, ansB, L-asparaginase II; Provisional	NA|240aa|down_2|NZ_CP021840.1_546088_546808_-	PRK10626, PRK10626, hypothetical protein; Provisional	NA|109aa|down_3|NZ_CP021840.1_546991_547318_-	PRK11702, PRK11702, hypothetical protein; Provisional	NA|240aa|down_4|NZ_CP021840.1_547317_548037_-	PRK00121, trmB, tRNA (guanine-N(7)-)-methyltransferase; Reviewed	NA|351aa|down_5|NZ_CP021840.1_548197_549250_+	PRK10880, PRK10880, adenine DNA glycosylase	NA|92aa|down_6|NZ_CP021840.1_549277_549553_+	PRK05408, PRK05408, oxidative damage protection protein; Provisional	NA|360aa|down_7|NZ_CP021840.1_549617_550697_+	PRK11671, mltC, membrane-bound lytic murein transglycosylase MltC	NA|419aa|down_8|NZ_CP021840.1_550898_552155_+	TIGR00889, Putative_nucleoside_transporter_YegT, nucleoside transporter	NA|712aa|down_9|NZ_CP021840.1_552204_554340_-	PRK13578, PRK13578, ornithine decarboxylase; Provisional
GCF_002196475.1_ASM219647v1	NZ_CP021840	Escherichia coli strain EC974 chromosome, complete genome	4	2107289-2107438	4	CRISPRCasFinder	no		cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG	Orphan	CGCGTCTTATCAGGCCTACGAGTTCGGTGCTGTGTAGGTCGGATAAGGCGTTCA	54	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG,RT	NA|597aa|up_6|NZ_CP021840.1_2098394_2100185_+,NA	NA|238aa|up_9|NZ_CP021840.1_2096297_2097011_-	PRK12742, PRK12742, SDR family oxidoreductase	NA|198aa|up_8|NZ_CP021840.1_2097081_2097675_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|151aa|up_7|NZ_CP021840.1_2097819_2098272_+	COG2731, EbgC, Beta-galactosidase, beta subunit [Carbohydrate transport and metabolism]	NA|597aa|up_6|NZ_CP021840.1_2098394_2100185_+	NA	NA|338aa|up_5|NZ_CP021840.1_2100237_2101251_-	PRK03515, PRK03515, ornithine carbamoyltransferase subunit I; Provisional	NA|139aa|up_4|NZ_CP021840.1_2101412_2101829_+	PRK11191, PRK11191, ribonuclease E inhibitor RraB	NA|168aa|up_3|NZ_CP021840.1_2101874_2102378_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|399aa|up_2|NZ_CP021840.1_2102570_2103767_+	COG4269, COG4269, Predicted membrane protein [Function unknown]	NA|952aa|up_1|NZ_CP021840.1_2103822_2106678_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|148aa|up_0|NZ_CP021840.1_2106677_2107121_-	PRK05728, PRK05728, DNA polymerase III subunit chi; Validated	NA|504aa|down_0|NZ_CP021840.1_2107474_2108986_-	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|367aa|down_1|NZ_CP021840.1_2109252_2110353_+	PRK15120, PRK15120, lipopolysaccharide ABC transporter permease LptF; Provisional	NA|361aa|down_2|NZ_CP021840.1_2110352_2111435_+	PRK15071, PRK15071, lipopolysaccharide ABC transporter permease; Provisional	NA|501aa|down_3|NZ_CP021840.1_2111595_2113098_-	pfam05872, DUF853, Bacterial protein of unknown function (DUF853)	NA|333aa|down_4|NZ_CP021840.1_2113175_2114174_-	cd01575, PBP1_GntR, ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators	NA|440aa|down_5|NZ_CP021840.1_2114240_2115560_-	TIGR00791, Gluconate_permease, gluconate transporter	NA|255aa|down_6|NZ_CP021840.1_2115624_2116389_-	PRK08085, PRK08085, gluconate 5-dehydrogenase; Provisional	NA|344aa|down_7|NZ_CP021840.1_2116412_2117444_-	PRK09880, PRK09880, L-idonate 5-dehydrogenase; Provisional	NA|188aa|down_8|NZ_CP021840.1_2117660_2118224_+	PRK09825, idnK, gluconokinase	NA|340aa|down_9|NZ_CP021840.1_2118227_2119247_-	cd05283, CAD1, Cinnamyl alcohol dehydrogenases (CAD)
GCF_002196475.1_ASM219647v1	NZ_CP021840	Escherichia coli strain EC974 chromosome, complete genome	5	2335956-2336090	5	CRISPRCasFinder	no		cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG	Orphan	TGCCGGATGCGCGTTGCTTATCCGGCCTACAAAATCGCAGCGTGTAGGCC	50	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG,RT	NA|251aa|up_4|NZ_CP021840.1_2327909_2328662_+,NA|190aa|down_7|NZ_CP021840.1_2345157_2345727_+	NA|274aa|up_9|NZ_CP021840.1_2321029_2321851_-	PRK00274, ksgA, 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))-dimethyltransferase RsmA	NA|330aa|up_8|NZ_CP021840.1_2321847_2322837_-	PRK00232, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase; Reviewed	NA|429aa|up_7|NZ_CP021840.1_2322836_2324123_-	PRK10770, PRK10770, peptidyl-prolyl cis-trans isomerase SurA; Provisional	NA|790aa|up_6|NZ_CP021840.1_2324175_2326545_-	PRK03761, PRK03761, LPS assembly outer membrane complex protein LptD; Provisional	NA|272aa|up_5|NZ_CP021840.1_2326799_2327615_+	PRK09430, djlA, co-chaperone DjlA	NA|251aa|up_4|NZ_CP021840.1_2327909_2328662_+	NA	NA|220aa|up_3|NZ_CP021840.1_2329079_2329739_-	PRK10158, PRK10158, bifunctional tRNA pseudouridine(32) synthase/23S rRNA pseudouridine(746) synthase RluA	NA|969aa|up_2|NZ_CP021840.1_2329750_2332657_-	PRK04914, PRK04914, RNA polymerase-associated protein RapA	NA|784aa|up_1|NZ_CP021840.1_2332820_2335172_-	PRK05762, PRK05762, DNA polymerase II; Reviewed	NA|232aa|up_0|NZ_CP021840.1_2335246_2335942_-	PRK08193, araD, L-ribulose-5-phosphate 4-epimerase AraD	NA|501aa|down_0|NZ_CP021840.1_2336141_2337644_-	PRK02929, PRK02929, L-arabinose isomerase; Provisional	NA|567aa|down_1|NZ_CP021840.1_2337654_2339355_-	PRK04123, PRK04123, ribulokinase; Provisional	NA|293aa|down_2|NZ_CP021840.1_2339692_2340571_+	PRK10572, PRK10572, arabinose operon transcriptional regulator AraC	NA|255aa|down_3|NZ_CP021840.1_2340656_2341421_+	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|233aa|down_4|NZ_CP021840.1_2341534_2342233_-	PRK10771, thiQ, thiamine ABC transporter ATP-binding protein ThiQ	NA|537aa|down_5|NZ_CP021840.1_2342216_2343827_-	PRK09433, thiP, thiamine transporter membrane protein; Reviewed	NA|328aa|down_6|NZ_CP021840.1_2343802_2344786_-	PRK11205, tbpA, thiamine transporter substrate binding subunit; Provisional	NA|190aa|down_7|NZ_CP021840.1_2345157_2345727_+	NA	NA|553aa|down_8|NZ_CP021840.1_2345955_2347614_-	PRK13626, PRK13626, HTH-type transcriptional regulator SgrR	NA|44aa|down_9|NZ_CP021840.1_2347702_2347834_+	pfam15894, SgrT, Inhibitor of glucose uptake transporter SgrT
GCF_002196475.1_ASM219647v1	NZ_CP021840	Escherichia coli strain EC974 chromosome, complete genome	6	2548981-2549093	6	CRISPRCasFinder	no		cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG	Orphan	GATGCCTGATGCGACGCTAGCGCGTCTTATCATGCCTACAAAC	43	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG,RT	NA,NA	NA|257aa|up_9|NZ_CP021840.1_2540255_2541026_-	PRK10438, PRK10438, C-N hydrolase family amidase; Provisional	NA|158aa|up_8|NZ_CP021840.1_2541179_2541653_+	PRK09993, PRK09993, C-lysozyme inhibitor; Provisional	NA|815aa|up_7|NZ_CP021840.1_2541695_2544140_-	PRK09463, fadE, acyl-CoA dehydrogenase; Reviewed	NA|193aa|up_6|NZ_CP021840.1_2544379_2544958_+	PRK00414, gmhA, D-sedoheptulose 7-phosphate isomerase	NA|256aa|up_5|NZ_CP021840.1_2545163_2545931_+	pfam13230, GATase_4, Glutamine amidotransferases class-II	NA|247aa|up_4|NZ_CP021840.1_2545901_2546642_-	COG3034, COG3034, Uncharacterized protein conserved in bacteria [Function unknown]	NA|93aa|up_3|NZ_CP021840.1_2546797_2547076_-	COG3041, COG3041, Uncharacterized protein conserved in bacteria [Function unknown]	NA|87aa|up_2|NZ_CP021840.1_2547078_2547339_-	COG3077, RelB, DNA-damage-inducible protein J [DNA replication, recombination, and repair]	NA|250aa|up_1|NZ_CP021840.1_2547548_2548298_+	COG0791, Spr, Cell wall-associated hydrolases (invasion-associated proteins) [Cell envelope biogenesis, outer membrane]	NA|166aa|up_0|NZ_CP021840.1_2548473_2548971_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|698aa|down_0|NZ_CP021840.1_2549179_2551273_-	COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell motility and secretion / Intracellular trafficking and secretion]	NA|380aa|down_1|NZ_CP021840.1_2551256_2552396_-	PRK05702, flhB, flagellar type III secretion system protein FlhB	NA|261aa|down_2|NZ_CP021840.1_2552385_2553168_-	COG1684, FliR, Flagellar biosynthesis pathway, component FliR [Cell motility and secretion / Intracellular trafficking and secretion]	NA|91aa|down_3|NZ_CP021840.1_2553169_2553442_-	COG1987, FliQ, Flagellar biosynthesis pathway, component FliQ [Cell motility and secretion / Intracellular trafficking and secretion]	NA|251aa|down_4|NZ_CP021840.1_2553444_2554197_-	PRK05699, fliP, flagellar biosynthesis protein FliP; Reviewed	NA|124aa|down_5|NZ_CP021840.1_2554193_2554565_-	TIGR02480, Flagellar_motor_switch_protein_FliN, flagellar motor switch protein FliN	NA|284aa|down_6|NZ_CP021840.1_2554557_2555409_-	pfam01052, FliMN_C, Type III flagellar switch regulator (C-ring) FliN C-term	NA|329aa|down_7|NZ_CP021840.1_2555796_2556783_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|114aa|down_8|NZ_CP021840.1_2556797_2557139_+	pfam02049, FliE, Flagellar hook-basal body complex protein FliE	NA|552aa|down_9|NZ_CP021840.1_2557143_2558799_+	PRK07193, fliF, flagellar MS-ring protein; Reviewed
GCF_002196475.1_ASM219647v1	NZ_CP021840	Escherichia coli strain EC974 chromosome, complete genome	7	2566649-2566755	7	CRISPRCasFinder	no		cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG	Orphan	CCCCTCACCCTAACCCTCTCCCGGAGGGAGAGGG	34	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG,RT	NA,NA|310aa|down_0|NZ_CP021840.1_2566850_2567780_-	NA|329aa|up_9|NZ_CP021840.1_2555796_2556783_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|114aa|up_8|NZ_CP021840.1_2556797_2557139_+	pfam02049, FliE, Flagellar hook-basal body complex protein FliE	NA|552aa|up_7|NZ_CP021840.1_2557143_2558799_+	PRK07193, fliF, flagellar MS-ring protein; Reviewed	NA|337aa|up_6|NZ_CP021840.1_2558776_2559787_+	PRK07194, fliG, flagellar motor switch protein G; Reviewed	NA|237aa|up_5|NZ_CP021840.1_2559790_2560501_+	PRK13386, fliH, flagellar assembly protein H; Provisional	NA|446aa|up_4|NZ_CP021840.1_2560493_2561831_+	PRK07196, fliI, flagellar protein export ATPase FliI	NA|145aa|up_3|NZ_CP021840.1_2561833_2562268_+	TIGR02473, conserved_hypothetical_protein, flagellar export protein FliJ	NA|135aa|up_2|NZ_CP021840.1_2562270_2562675_+	cd02171, G3P_Cytidylyltransferase, glycerol-3-phosphate cytidylyltransferase	NA|359aa|up_1|NZ_CP021840.1_2562668_2563745_-	COG3177, COG3177, Fic family protein [Function unknown]	NA|815aa|up_0|NZ_CP021840.1_2564145_2566590_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|310aa|down_0|NZ_CP021840.1_2566850_2567780_-	NA	NA|143aa|down_1|NZ_CP021840.1_2567907_2568336_-	pfam05130, FlgN, FlgN protein	NA|93aa|down_2|NZ_CP021840.1_2568348_2568627_-	TIGR03824, FlgM_jcvi, flagellar biosynthesis anti-sigma factor FlgM	NA|246aa|down_3|NZ_CP021840.1_2568707_2569445_-	PRK06804, flgA, flagellar basal body P-ring formation protein FlgA	NA|112aa|down_4|NZ_CP021840.1_2569526_2569862_+	PRK12685, flgB, flagellar basal body rod protein FlgB; Reviewed	NA|144aa|down_5|NZ_CP021840.1_2569864_2570296_+	PRK06802, flgC, flagellar basal body rod protein FlgC; Reviewed	NA|238aa|down_6|NZ_CP021840.1_2570295_2571009_+	PRK09619, flgD, flagellar hook assembly protein FlgD	NA|401aa|down_7|NZ_CP021840.1_2571152_2572355_+	PRK06803, flgE, flagellar basal body protein FlaE	NA|246aa|down_8|NZ_CP021840.1_2572354_2573092_+	PRK12640, flgF, flagellar basal body rod protein FlgF; Reviewed	NA|262aa|down_9|NZ_CP021840.1_2573270_2574056_+	PRK12693, flgG, flagellar basal body rod protein FlgG; Provisional
GCF_002196475.1_ASM219647v1	NZ_CP021840	Escherichia coli strain EC974 chromosome, complete genome	8	3370138-3370286	8	CRISPRCasFinder	no		cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG	Orphan	GTTCACTGCCGTACAGGCAGCTTAGAAA	28	0	0	NA	NA	I-F	2	2	Orphan	cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG,RT	NA,NA	NA|551aa|up_9|NZ_CP021840.1_3356582_3358235_-	PRK05290, PRK05290, hybrid cluster protein; Provisional	NA|300aa|up_8|NZ_CP021840.1_3358378_3359278_-	COG2431, COG2431, Predicted membrane protein [Function unknown]	NA|232aa|up_7|NZ_CP021840.1_3359734_3360430_-	PRK05420, PRK05420, aquaporin Z; Provisional	NA|553aa|up_6|NZ_CP021840.1_3360855_3362514_+	COG3593, COG3593, Predicted ATP-dependent endonuclease of the OLD family [DNA replication, recombination, and repair]	NA|319aa|up_5|NZ_CP021840.1_3362510_3363467_-	COG2990, VirK, Uncharacterized protein conserved in bacteria [Function unknown]	NA|372aa|up_4|NZ_CP021840.1_3363617_3364733_+	PRK11578, PRK11578, macrolide transporter subunit MacA; Provisional	NA|649aa|up_3|NZ_CP021840.1_3364729_3366676_+	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|75aa|up_2|NZ_CP021840.1_3366748_3366973_-	PRK09937, PRK09937, cold shock-like protein CspD	NA|107aa|up_1|NZ_CP021840.1_3367295_3367616_+	PRK00033, clpS, ATP-dependent Clp protease adaptor protein ClpS; Reviewed	NA|759aa|up_0|NZ_CP021840.1_3367646_3369923_+	PRK11034, clpA, ATP-dependent Clp protease ATP-binding subunit; Provisional	NA|73aa|down_0|NZ_CP021840.1_3370671_3370890_-	PRK00276, infA, translation initiation factor IF-1; Validated	NA|235aa|down_1|NZ_CP021840.1_3371174_3371879_-	PRK00301, aat, leucyl/phenylalanyl-tRNA--protein transferase; Reviewed	NA|574aa|down_2|NZ_CP021840.1_3371920_3373642_-	PRK11160, PRK11160, cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed	NA|589aa|down_3|NZ_CP021840.1_3373642_3375409_-	PRK11174, PRK11174, cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed	NA|322aa|down_4|NZ_CP021840.1_3375531_3376497_-	PRK10262, PRK10262, thioredoxin reductase; Provisional	NA|165aa|down_5|NZ_CP021840.1_3377041_3377536_+	PRK11169, PRK11169, leucine-responsive transcriptional regulator Lrp	NA|1356aa|down_6|NZ_CP021840.1_3377670_3381738_+	PRK10263, PRK10263, DNA translocase FtsK; Provisional	NA|204aa|down_7|NZ_CP021840.1_3381892_3382504_+	TIGR00547, Outer-membrane_lipoprotein_carrier_protein, periplasmic chaperone LolA	NA|448aa|down_8|NZ_CP021840.1_3382514_3383858_+	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|431aa|down_9|NZ_CP021840.1_3383948_3385241_+	PRK05431, PRK05431, seryl-tRNA synthetase; Provisional
GCF_002196475.1_ASM219647v1	NZ_CP021840	Escherichia coli strain EC974 chromosome, complete genome	9	4633065-4633199	9	CRISPRCasFinder	no		cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG	Orphan	TGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACA	38	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG,RT	NA,NA	NA|520aa|up_9|NZ_CP021840.1_4621340_4622900_-	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|181aa|up_8|NZ_CP021840.1_4622973_4623516_-	pfam00908, dTDP_sugar_isom, dTDP-4-dehydrorhamnose 3,5-epimerase	NA|293aa|up_7|NZ_CP021840.1_4623520_4624399_-	PRK15480, PRK15480, glucose-1-phosphate thymidylyltransferase RfbA; Provisional	NA|300aa|up_6|NZ_CP021840.1_4624456_4625356_-	PRK09987, PRK09987, dTDP-4-dehydrorhamnose reductase; Provisional	NA|362aa|up_5|NZ_CP021840.1_4625355_4626441_-	PRK10084, PRK10084, dTDP-glucose 4,6 dehydratase; Provisional	NA|298aa|up_4|NZ_CP021840.1_4626812_4627706_-	PRK10122, PRK10122, UTP--glucose-1-phosphate uridylyltransferase GalF	NA|332aa|up_3|NZ_CP021840.1_4627948_4628944_-	cd05238, Gne_like_SDR_e, Escherichia coli Gne (a nucleoside-diphosphate-sugar 4-epimerase)-like, extended (e) SDRs	NA|465aa|up_2|NZ_CP021840.1_4629101_4630496_-	PRK10123, wcaM, putative colanic acid biosynthesis protein; Provisional	NA|407aa|up_1|NZ_CP021840.1_4630506_4631727_-	TIGR04005, wcaL, colanic acid biosynthesis glycosyltransferase WcaL	NA|427aa|up_0|NZ_CP021840.1_4631723_4633004_-	TIGR04006, wcaK, colanic acid biosynthesis pyruvyl transferase WcaK	NA|493aa|down_0|NZ_CP021840.1_4633280_4634759_-	PRK10459, PRK10459, MOP flippase family protein	NA|465aa|down_1|NZ_CP021840.1_4634760_4636155_-	PRK10124, PRK10124, putative UDP-glucose lipid carrier transferase; Provisional	NA|457aa|down_2|NZ_CP021840.1_4636209_4637580_-	PRK15414, PRK15414, phosphomannomutase	NA|479aa|down_3|NZ_CP021840.1_4637860_4639297_-	PRK15460, cpsB, mannose-1-phosphate guanyltransferase; Provisional	NA|408aa|down_4|NZ_CP021840.1_4639299_4640523_-	TIGR04007, wcaI, colanic acid biosynthesis glycosyl transferase WcaI	NA|160aa|down_5|NZ_CP021840.1_4640519_4640999_-	PRK15434, PRK15434, GDP-mannose mannosyl hydrolase	NA|322aa|down_6|NZ_CP021840.1_4641001_4641967_-	cd05239, GDP_FS_SDR_e, GDP-fucose synthetase, extended (e) SDRs	NA|374aa|down_7|NZ_CP021840.1_4641969_4643091_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|183aa|down_8|NZ_CP021840.1_4643117_4643666_-	TIGR04008, WcaF, colanic acid biosynthesis acetyltransferase WcaF	NA|249aa|down_9|NZ_CP021840.1_4643681_4644428_-	PRK10063, PRK10063, colanic acid biosynthesis glycosyltransferase WcaE
GCF_002196475.1_ASM219647v1	NZ_CP021840	Escherichia coli strain EC974 chromosome, complete genome	10	4637725-4637859	10	CRISPRCasFinder	no		cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG	Orphan	GGATAAGGCGTTCACGCCGCATCCGACAAACAGCGCCTGATGCGACG	47	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG,RT	NA,NA	NA|300aa|up_9|NZ_CP021840.1_4624456_4625356_-	PRK09987, PRK09987, dTDP-4-dehydrorhamnose reductase; Provisional	NA|362aa|up_8|NZ_CP021840.1_4625355_4626441_-	PRK10084, PRK10084, dTDP-glucose 4,6 dehydratase; Provisional	NA|298aa|up_7|NZ_CP021840.1_4626812_4627706_-	PRK10122, PRK10122, UTP--glucose-1-phosphate uridylyltransferase GalF	NA|332aa|up_6|NZ_CP021840.1_4627948_4628944_-	cd05238, Gne_like_SDR_e, Escherichia coli Gne (a nucleoside-diphosphate-sugar 4-epimerase)-like, extended (e) SDRs	NA|465aa|up_5|NZ_CP021840.1_4629101_4630496_-	PRK10123, wcaM, putative colanic acid biosynthesis protein; Provisional	NA|407aa|up_4|NZ_CP021840.1_4630506_4631727_-	TIGR04005, wcaL, colanic acid biosynthesis glycosyltransferase WcaL	NA|427aa|up_3|NZ_CP021840.1_4631723_4633004_-	TIGR04006, wcaK, colanic acid biosynthesis pyruvyl transferase WcaK	NA|493aa|up_2|NZ_CP021840.1_4633280_4634759_-	PRK10459, PRK10459, MOP flippase family protein	NA|465aa|up_1|NZ_CP021840.1_4634760_4636155_-	PRK10124, PRK10124, putative UDP-glucose lipid carrier transferase; Provisional	NA|457aa|up_0|NZ_CP021840.1_4636209_4637580_-	PRK15414, PRK15414, phosphomannomutase	NA|479aa|down_0|NZ_CP021840.1_4637860_4639297_-	PRK15460, cpsB, mannose-1-phosphate guanyltransferase; Provisional	NA|408aa|down_1|NZ_CP021840.1_4639299_4640523_-	TIGR04007, wcaI, colanic acid biosynthesis glycosyl transferase WcaI	NA|160aa|down_2|NZ_CP021840.1_4640519_4640999_-	PRK15434, PRK15434, GDP-mannose mannosyl hydrolase	NA|322aa|down_3|NZ_CP021840.1_4641001_4641967_-	cd05239, GDP_FS_SDR_e, GDP-fucose synthetase, extended (e) SDRs	NA|374aa|down_4|NZ_CP021840.1_4641969_4643091_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|183aa|down_5|NZ_CP021840.1_4643117_4643666_-	TIGR04008, WcaF, colanic acid biosynthesis acetyltransferase WcaF	NA|249aa|down_6|NZ_CP021840.1_4643681_4644428_-	PRK10063, PRK10063, colanic acid biosynthesis glycosyltransferase WcaE	NA|406aa|down_7|NZ_CP021840.1_4644438_4645656_-	TIGR04010, WcaD, putative colanic acid polymerase WcaD	NA|406aa|down_8|NZ_CP021840.1_4645630_4646848_-	TIGR04015, WcaC, colanic acid biosynthesis glycosyl transferase WcaC	NA|163aa|down_9|NZ_CP021840.1_4646844_4647333_-	PRK10191, PRK10191, putative acyl transferase; Provisional
GCF_002196475.1_ASM219647v1	NZ_CP021840	Escherichia coli strain EC974 chromosome, complete genome	11	4864868-4864994	11	CRISPRCasFinder	no		cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG	Orphan	TTTGTAGGCCTGATAAGACGCGCCAGCGTCGCCTCAGGC	39	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL,DEDDh,c2c9_V-U4,DinG,RT	NA,NA	NA|441aa|up_9|NZ_CP021840.1_4842409_4843732_+	pfam02667, SCFA_trans, Short chain fatty acid transporter	NA|395aa|up_8|NZ_CP021840.1_4843762_4844947_+	PRK05790, PRK05790, putative acyltransferase; Provisional	NA|259aa|up_7|NZ_CP021840.1_4845020_4845797_-	COG4676, COG4676, Uncharacterized protein conserved in bacteria [Function unknown]	NA|550aa|up_6|NZ_CP021840.1_4845801_4847451_-	COG5445, COG5445, Predicted secreted protein [Function unknown]	NA|1465aa|up_5|NZ_CP021840.1_4847451_4851846_-	COG2373, COG2373, Large extracellular alpha-helical protein [General function prediction only]	NA|208aa|up_4|NZ_CP021840.1_4851989_4852613_-	COG3234, COG3234, Uncharacterized protein conserved in bacteria [Function unknown]	NA|563aa|up_3|NZ_CP021840.1_4852609_4854298_-	COG4685, COG4685, Uncharacterized protein conserved in bacteria [Function unknown]	NA|876aa|up_2|NZ_CP021840.1_4854446_4857074_-	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|241aa|up_1|NZ_CP021840.1_4857220_4857943_+	PRK05134, PRK05134, bifunctional 2-polyprenyl-6-hydroxyphenol methylase/3-demethylubiquinol 3-O-methyltransferase UbiG	NA|762aa|up_0|NZ_CP021840.1_4862531_4864817_+	PRK09103, PRK09103, ribonucleoside-diphosphate reductase subunit alpha	NA|377aa|down_0|NZ_CP021840.1_4865050_4866181_+	PRK09101, nrdB, ribonucleotide-diphosphate reductase subunit beta; Reviewed	NA|85aa|down_1|NZ_CP021840.1_4866180_4866435_+	PRK10713, PRK10713, 2Fe-2S ferredoxin-like protein	NA|217aa|down_2|NZ_CP021840.1_4866488_4867139_-	PRK09902, PRK09902, lipopolysaccharide kinase InaA	NA|397aa|down_3|NZ_CP021840.1_4867219_4868410_-	cd17489, MFS_YfcJ_like, Escherichia coli YfcJ, YhhS, and similar transporters of the Major Facilitator Superfamily	NA|293aa|down_4|NZ_CP021840.1_4868574_4869453_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|360aa|down_5|NZ_CP021840.1_4869497_4870577_-	PRK11143, glpQ, glycerophosphodiester phosphodiesterase; Provisional	NA|453aa|down_6|NZ_CP021840.1_4870581_4871940_-	PRK11273, glpT, glycerol-3-phosphate transporter	NA|543aa|down_7|NZ_CP021840.1_4872212_4873841_+	PRK11101, glpA, anaerobic glycerol-3-phosphate dehydrogenase subunit A	NA|420aa|down_8|NZ_CP021840.1_4873830_4875090_+	COG3075, GlpB, Anaerobic glycerol-3-phosphate dehydrogenase [Amino acid transport and metabolism]	NA|397aa|down_9|NZ_CP021840.1_4875086_4876277_+	TIGR03379, glycerol3P_GlpC, glycerol-3-phosphate dehydrogenase, anaerobic, C subunit
