assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002163935.1_ASM216393v1	NZ_CP019560	Escherichia coli strain KSC1031 chromosome, complete genome	1	2146980-2147436	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	DinG,cas3,c2c9_V-U4,DEDDh,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK	Type I-E	CGGTTTATCCCCGCTGGCGCGGGGAACAC,CGGTTTATCCCCGCTGGCGCGGGGAACAC,CGGTTTATCCCCGCTGGCGCGGGGAACAC	29,29,29	0	0	NA	NA	I-E:I-E:I-E	7,7,7	7	TypeI-E	DinG,cas3,c2c9_V-U4,DEDDh,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK	NA,NA	NA|254aa|up_9|NZ_CP019560.1_2138792_2139554_-	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional	NA|350aa|up_8|NZ_CP019560.1_2139534_2140584_-	PRK00984, truD, tRNA pseudouridine synthase D; Reviewed	NA|160aa|up_7|NZ_CP019560.1_2140580_2141060_-	PRK00084, ispF, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; Reviewed	NA|237aa|up_6|NZ_CP019560.1_2141059_2141770_-	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|104aa|up_5|NZ_CP019560.1_2141788_2142100_-	PRK00888, ftsB, cell division protein FtsB; Reviewed	NA|108aa|up_4|NZ_CP019560.1_2142293_2142617_-	pfam12084, DUF3561, Protein of unknown function (DUF3561)	NA|202aa|up_3|NZ_CP019560.1_2142666_2143272_-	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|476aa|up_2|NZ_CP019560.1_2143271_2144699_-	PRK05124, cysN, sulfate adenylyltransferase subunit 1; Provisional	NA|303aa|up_1|NZ_CP019560.1_2144700_2145609_-	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|346aa|up_0|NZ_CP019560.1_2145860_2146898_+	PRK10199, PRK10199, alkaline phosphatase isozyme conversion aminopeptidase; Provisional	cas2|98aa|down_0|NZ_CP019560.1_2147532_2147826_-	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	cas1|307aa|down_1|NZ_CP019560.1_2147825_2148746_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|217aa|down_2|NZ_CP019560.1_2148742_2149393_-	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cas5|249aa|down_3|NZ_CP019560.1_2149374_2150121_-	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|358aa|down_4|NZ_CP019560.1_2150131_2151205_-	TIGR01869, CRISPR_system_Cascade_subunit_CasC, CRISPR-associated protein Cas7/Cse4/CasC, subtype I-E/ECOLI	cse2gr11|178aa|down_5|NZ_CP019560.1_2151219_2151753_-	TIGR02548, CRISPR_system_Cascade_subunit_CasB, CRISPR type I-E/ECOLI-associated protein CasB/Cse2	cas8e|521aa|down_6|NZ_CP019560.1_2151749_2153312_-	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1	cas3|900aa|down_7|NZ_CP019560.1_2153409_2156109_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|51aa|down_8|NZ_CP019560.1_2156302_2156455_-	pfam01848, HOK_GEF, Hok/gef family	NA|245aa|down_9|NZ_CP019560.1_2156719_2157454_-	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase
GCF_002163935.1_ASM216393v1	NZ_CP019560	Escherichia coli strain KSC1031 chromosome, complete genome	2	2173151-2173423	2,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas3	DinG,cas3,c2c9_V-U4,DEDDh,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK	Unclear	CGGTTTATCCCCGCTGGCGCGGGGAACT,CGGTTTATCCCCGCTGGCGCGGGGAACTC,CGGTTTATCCCCGCTGGCGCGGGGAACTC	28,29,29	0	0	NA	NA	I-E:I-E:I-E	4,4,4	4	Unclear	DinG,cas3,c2c9_V-U4,DEDDh,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK	NA,NA|47aa|down_1|NZ_CP019560.1_2174572_2174713_+	NA|423aa|up_9|NZ_CP019560.1_2161797_2163066_+	PRK10015, PRK10015, oxidoreductase; Provisional	NA|87aa|up_8|NZ_CP019560.1_2163056_2163317_+	COG2440, FixX, Ferredoxin-like protein [Energy production and conversion]	NA|192aa|up_7|NZ_CP019560.1_2163333_2163909_+	COG1954, GlpP, Glycerol-3-phosphate responsive antiterminator (mRNA-binding) [Transcription]	NA|287aa|up_6|NZ_CP019560.1_2164056_2164917_-	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|260aa|up_5|NZ_CP019560.1_2164913_2165693_-	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]	NA|446aa|up_4|NZ_CP019560.1_2165670_2167008_-	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|485aa|up_3|NZ_CP019560.1_2167101_2168556_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|262aa|up_2|NZ_CP019560.1_2168625_2169411_-	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs	NA|426aa|up_1|NZ_CP019560.1_2169729_2171007_+	cd06174, MFS, Major Facilitator Superfamily	NA|493aa|up_0|NZ_CP019560.1_2171033_2172512_+	cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins	NA|224aa|down_0|NZ_CP019560.1_2173762_2174434_-	TIGR04322, organic_radical_activating_enzyme, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE	NA|47aa|down_1|NZ_CP019560.1_2174572_2174713_+	NA	NA|433aa|down_2|NZ_CP019560.1_2175658_2176957_-	PRK00077, eno, enolase; Provisional	NA|546aa|down_3|NZ_CP019560.1_2177044_2178682_-	PRK05380, pyrG, CTP synthetase; Validated	NA|264aa|down_4|NZ_CP019560.1_2178909_2179701_-	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|112aa|down_5|NZ_CP019560.1_2179771_2180107_-	PRK09907, PRK09907, endoribonuclease MazF	NA|83aa|down_6|NZ_CP019560.1_2180106_2180355_-	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|745aa|down_7|NZ_CP019560.1_2180432_2182667_-	PRK10872, relA, (p)ppGpp synthetase I/GTP pyrophosphokinase; Provisional	NA|434aa|down_8|NZ_CP019560.1_2182714_2184016_-	PRK13168, rumA, 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD	NA|919aa|down_9|NZ_CP019560.1_2184072_2186829_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional
GCF_002163935.1_ASM216393v1	NZ_CP019560	Escherichia coli strain KSC1031 chromosome, complete genome	3	4021557-4021672	3	CRISPRCasFinder	no		DinG,cas3,c2c9_V-U4,DEDDh,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK	Orphan	GATAAGACGCGCCAGCGTCGCATCAGGCGTT	31	0	0	NA	NA	NA	1	1	Orphan	DinG,cas3,c2c9_V-U4,DEDDh,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK	NA,NA	NA|126aa|up_9|NZ_CP019560.1_4007561_4007939_-	PRK05461, apaG, CO2+/MG2+ efflux protein ApaG; Reviewed	NA|274aa|up_8|NZ_CP019560.1_4007941_4008763_-	PRK00274, ksgA, 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))-dimethyltransferase RsmA	NA|330aa|up_7|NZ_CP019560.1_4008759_4009749_-	PRK00232, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase; Reviewed	NA|429aa|up_6|NZ_CP019560.1_4009748_4011035_-	PRK10770, PRK10770, peptidyl-prolyl cis-trans isomerase SurA; Provisional	NA|785aa|up_5|NZ_CP019560.1_4011087_4013442_-	PRK03761, PRK03761, LPS assembly outer membrane complex protein LptD; Provisional	NA|272aa|up_4|NZ_CP019560.1_4013696_4014512_+	PRK09430, djlA, co-chaperone DjlA	NA|220aa|up_3|NZ_CP019560.1_4014628_4015288_-	PRK10158, PRK10158, bifunctional tRNA pseudouridine(32) synthase/23S rRNA pseudouridine(746) synthase RluA	NA|969aa|up_2|NZ_CP019560.1_4015299_4018206_-	PRK04914, PRK04914, RNA polymerase-associated protein RapA	NA|784aa|up_1|NZ_CP019560.1_4018370_4020722_-	PRK05762, PRK05762, DNA polymerase II; Reviewed	NA|232aa|up_0|NZ_CP019560.1_4020796_4021492_-	PRK08193, araD, L-ribulose-5-phosphate 4-epimerase AraD	NA|501aa|down_0|NZ_CP019560.1_4021691_4023194_-	PRK02929, PRK02929, L-arabinose isomerase; Provisional	NA|567aa|down_1|NZ_CP019560.1_4023204_4024905_-	PRK04123, PRK04123, ribulokinase; Provisional	NA|293aa|down_2|NZ_CP019560.1_4025243_4026122_+	PRK10572, PRK10572, arabinose operon transcriptional regulator AraC	NA|255aa|down_3|NZ_CP019560.1_4026207_4026972_+	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|233aa|down_4|NZ_CP019560.1_4027085_4027784_-	PRK10771, thiQ, thiamine ABC transporter ATP-binding protein ThiQ	NA|537aa|down_5|NZ_CP019560.1_4027767_4029378_-	PRK09433, thiP, thiamine transporter membrane protein; Reviewed	NA|328aa|down_6|NZ_CP019560.1_4029353_4030337_-	PRK11205, tbpA, thiamine transporter substrate binding subunit; Provisional	NA|552aa|down_7|NZ_CP019560.1_4030500_4032156_-	PRK13626, PRK13626, HTH-type transcriptional regulator SgrR	NA|44aa|down_8|NZ_CP019560.1_4032244_4032376_+	pfam15894, SgrT, Inhibitor of glucose uptake transporter SgrT	NA|393aa|down_9|NZ_CP019560.1_4032477_4033656_+	TIGR00899, Sugar_efflux_transporter_A, sugar efflux transporter
