assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002863685.1_ASM286368v1	NZ_CP024618	Escherichia coli strain SMN152SH1 chromosome, complete genome	1	562443-562558	1	CRISPRCasFinder	no		DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas6e,cas7,cas8e	Orphan	GATAAGACGCGCCAGCGTCGCATCAGGCGTT	31	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas6e,cas7,cas8e	NA,NA	NA|126aa|up_9|NZ_CP024618.1_548448_548826_-	PRK05461, apaG, CO2+/MG2+ efflux protein ApaG; Reviewed	NA|274aa|up_8|NZ_CP024618.1_548828_549650_-	PRK00274, ksgA, 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))-dimethyltransferase RsmA	NA|330aa|up_7|NZ_CP024618.1_549646_550636_-	PRK00232, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase; Reviewed	NA|429aa|up_6|NZ_CP024618.1_550635_551922_-	PRK10770, PRK10770, peptidyl-prolyl cis-trans isomerase SurA; Provisional	NA|785aa|up_5|NZ_CP024618.1_551974_554329_-	PRK03761, PRK03761, LPS assembly outer membrane complex protein LptD; Provisional	NA|272aa|up_4|NZ_CP024618.1_554583_555399_+	PRK09430, djlA, co-chaperone DjlA	NA|220aa|up_3|NZ_CP024618.1_555515_556175_-	PRK10158, PRK10158, bifunctional tRNA pseudouridine(32) synthase/23S rRNA pseudouridine(746) synthase RluA	NA|969aa|up_2|NZ_CP024618.1_556186_559093_-	PRK04914, PRK04914, RNA polymerase-associated protein RapA	NA|784aa|up_1|NZ_CP024618.1_559256_561608_-	PRK05762, PRK05762, DNA polymerase II; Reviewed	NA|232aa|up_0|NZ_CP024618.1_561682_562378_-	PRK08193, araD, L-ribulose-5-phosphate 4-epimerase AraD	NA|501aa|down_0|NZ_CP024618.1_562577_564080_-	PRK02929, PRK02929, L-arabinose isomerase; Provisional	NA|567aa|down_1|NZ_CP024618.1_564090_565791_-	PRK04123, PRK04123, ribulokinase; Provisional	NA|293aa|down_2|NZ_CP024618.1_566129_567008_+	PRK10572, PRK10572, arabinose operon transcriptional regulator AraC	NA|255aa|down_3|NZ_CP024618.1_567093_567858_+	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|233aa|down_4|NZ_CP024618.1_567927_568626_-	PRK10771, thiQ, thiamine ABC transporter ATP-binding protein ThiQ	NA|537aa|down_5|NZ_CP024618.1_568609_570220_-	PRK09433, thiP, thiamine transporter membrane protein; Reviewed	NA|328aa|down_6|NZ_CP024618.1_570195_571179_-	PRK11205, tbpA, thiamine transporter substrate binding subunit; Provisional	NA|552aa|down_7|NZ_CP024618.1_571342_572998_-	PRK13626, PRK13626, HTH-type transcriptional regulator SgrR	NA|44aa|down_8|NZ_CP024618.1_573086_573218_+	pfam15894, SgrT, Inhibitor of glucose uptake transporter SgrT	NA|202aa|down_9|NZ_CP024618.1_574545_575151_-	PRK01641, leuD, 3-isopropylmalate dehydratase small subunit
GCF_002863685.1_ASM286368v1	NZ_CP024618	Escherichia coli strain SMN152SH1 chromosome, complete genome	2	1623895-1623986	2	CRISPRCasFinder	no		DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas6e,cas7,cas8e	Orphan	GCATCAGAAGCAGGTAAAAAAGGTGG	26	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas6e,cas7,cas8e	NA,NA	NA|231aa|up_9|NZ_CP024618.1_1613082_1613775_-	PRK10766, PRK10766, two-component system response regulator TorR	NA|391aa|up_8|NZ_CP024618.1_1613904_1615077_+	PRK15032, PRK15032, pentaheme c-type cytochrome TorC	NA|849aa|up_7|NZ_CP024618.1_1615076_1617623_+	PRK15102, PRK15102, trimethylamine-N-oxide reductase TorA	NA|200aa|up_6|NZ_CP024618.1_1617619_1618219_+	PRK04976, torD, chaperone protein TorD; Validated	NA|102aa|up_5|NZ_CP024618.1_1618311_1618617_-	PRK10265, PRK10265, chaperone modulator CbpM	NA|307aa|up_4|NZ_CP024618.1_1618616_1619537_-	PRK10266, PRK10266, curved DNA-binding protein	NA|420aa|up_3|NZ_CP024618.1_1619797_1621057_+	PRK09784, PRK09784, YccE family protein	NA|414aa|up_2|NZ_CP024618.1_1621347_1622589_+	PRK10173, PRK10173, glucose-1-phosphatase/inositol phosphatase; Provisional	NA|76aa|up_1|NZ_CP024618.1_1622626_1622854_-	PRK10174, PRK10174, hypothetical protein; Provisional	NA|199aa|up_0|NZ_CP024618.1_1622874_1623471_-	PRK03767, PRK03767, NAD(P)H:quinone oxidoreductase; Provisional	NA|443aa|down_0|NZ_CP024618.1_1624099_1625428_-	TIGR03616, Putative_pyrimidine_permease_RutG, pyrimidine utilization transport protein G	NA|165aa|down_1|NZ_CP024618.1_1625448_1625943_-	TIGR03615, flavoprotein_oxidoreductase, pyrimidine utilization flavin reductase protein F	NA|197aa|down_2|NZ_CP024618.1_1625953_1626544_-	PRK05365, PRK05365, malonic semialdehyde reductase; Provisional	NA|267aa|down_3|NZ_CP024618.1_1626553_1627354_-	TIGR03611, RutD, pyrimidine utilization protein D	NA|129aa|down_4|NZ_CP024618.1_1627361_1627748_-	TIGR03610, RutC, pyrimidine utilization protein C	NA|231aa|down_5|NZ_CP024618.1_1627759_1628452_-	TIGR03614, RutB, pyrimidine utilization protein B	NA|364aa|down_6|NZ_CP024618.1_1628451_1629543_-	TIGR03612, RutA, pyrimidine utilization protein A	NA|213aa|down_7|NZ_CP024618.1_1629830_1630469_+	PRK15008, PRK15008, HTH-type transcriptional regulator RutR; Provisional	NA|1321aa|down_8|NZ_CP024618.1_1630508_1634471_-	PRK11809, putA, trifunctional transcriptional regulator/proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase; Reviewed	NA|232aa|down_9|NZ_CP024618.1_1634614_1635309_-	pfam03400, DDE_Tnp_IS1, IS1 transposase
GCF_002863685.1_ASM286368v1	NZ_CP024618	Escherichia coli strain SMN152SH1 chromosome, complete genome	3	3892101-3892251	1	PILER-CR	no	cas2,cas6e,cas7,cas8e,cas3	DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas6e,cas7,cas8e	Type I-E	GAACGGTTTATCCCCGCTGGCGCGGGGAACAC	32	0	0	NA	NA	I-E	2	2	TypeI-E	DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas6e,cas7,cas8e	NA,NA	NA|254aa|up_9|NZ_CP024618.1_3883913_3884675_-	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional	NA|350aa|up_8|NZ_CP024618.1_3884655_3885705_-	PRK00984, truD, tRNA pseudouridine synthase D; Reviewed	NA|160aa|up_7|NZ_CP024618.1_3885701_3886181_-	PRK00084, ispF, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; Reviewed	NA|237aa|up_6|NZ_CP024618.1_3886180_3886891_-	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|104aa|up_5|NZ_CP024618.1_3886909_3887221_-	PRK00888, ftsB, cell division protein FtsB; Reviewed	NA|108aa|up_4|NZ_CP024618.1_3887414_3887738_-	pfam12084, DUF3561, Protein of unknown function (DUF3561)	NA|202aa|up_3|NZ_CP024618.1_3887787_3888393_-	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|476aa|up_2|NZ_CP024618.1_3888392_3889820_-	PRK05124, cysN, sulfate adenylyltransferase subunit 1; Provisional	NA|303aa|up_1|NZ_CP024618.1_3889821_3890730_-	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|346aa|up_0|NZ_CP024618.1_3890981_3892019_+	PRK10199, PRK10199, alkaline phosphatase isozyme conversion aminopeptidase; Provisional	cas2|98aa|down_0|NZ_CP024618.1_3892347_3892641_-	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	cas6e|217aa|down_1|NZ_CP024618.1_3893556_3894207_-	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cas7|424aa|down_2|NZ_CP024618.1_3894946_3896218_-	TIGR01869, CRISPR_system_Cascade_subunit_CasC, CRISPR-associated protein Cas7/Cse4/CasC, subtype I-E/ECOLI	cas8e|520aa|down_3|NZ_CP024618.1_3896214_3897774_-	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1	cas3|900aa|down_4|NZ_CP024618.1_3899181_3901881_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|51aa|down_5|NZ_CP024618.1_3902073_3902226_-	pfam01848, HOK_GEF, Hok/gef family	NA|245aa|down_6|NZ_CP024618.1_3902490_3903225_-	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	NA|571aa|down_7|NZ_CP024618.1_3903299_3905012_-	PRK13504, PRK13504, NADPH-dependent assimilatory sulfite reductase hemoprotein subunit	NA|600aa|down_8|NZ_CP024618.1_3905011_3906811_-	PRK10953, cysJ, NADPH-dependent assimilatory sulfite reductase flavoprotein subunit	NA|122aa|down_9|NZ_CP024618.1_3907126_3907492_+	cd00470, PTPS, 6-pyruvoyl tetrahydropterin synthase (PTPS)
GCF_002863685.1_ASM286368v1	NZ_CP024618	Escherichia coli strain SMN152SH1 chromosome, complete genome	4	3919701-3919789	3	CRISPRCasFinder	no		DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas6e,cas7,cas8e	Orphan	GGTTTATCCCCGCTGGCGCGGGGAACAC	28	0	0	NA	NA	I-E	1	1	Orphan	DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas6e,cas7,cas8e	NA,NA|47aa|down_1|NZ_CP024618.1_3920938_3921079_+	NA|87aa|up_9|NZ_CP024618.1_3908831_3909092_+	COG2440, FixX, Ferredoxin-like protein [Energy production and conversion]	NA|192aa|up_8|NZ_CP024618.1_3909108_3909684_+	COG1954, GlpP, Glycerol-3-phosphate responsive antiterminator (mRNA-binding) [Transcription]	NA|287aa|up_7|NZ_CP024618.1_3909831_3910692_-	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|260aa|up_6|NZ_CP024618.1_3910688_3911468_-	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]	NA|446aa|up_5|NZ_CP024618.1_3911445_3912783_-	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|485aa|up_4|NZ_CP024618.1_3912876_3914331_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|262aa|up_3|NZ_CP024618.1_3914400_3915186_-	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs	NA|426aa|up_2|NZ_CP024618.1_3915503_3916781_+	cd06174, MFS, Major Facilitator Superfamily	NA|493aa|up_1|NZ_CP024618.1_3916807_3918286_+	cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins	NA|232aa|up_0|NZ_CP024618.1_3918452_3919147_+	pfam03400, DDE_Tnp_IS1, IS1 transposase	NA|224aa|down_0|NZ_CP024618.1_3920128_3920800_-	TIGR04322, organic_radical_activating_enzyme, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE	NA|47aa|down_1|NZ_CP024618.1_3920938_3921079_+	NA	NA|433aa|down_2|NZ_CP024618.1_3922013_3923312_-	PRK00077, eno, enolase; Provisional	NA|546aa|down_3|NZ_CP024618.1_3923399_3925037_-	PRK05380, pyrG, CTP synthetase; Validated	NA|264aa|down_4|NZ_CP024618.1_3925264_3926056_-	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|112aa|down_5|NZ_CP024618.1_3926126_3926462_-	PRK09907, PRK09907, endoribonuclease MazF	NA|83aa|down_6|NZ_CP024618.1_3926461_3926710_-	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|745aa|down_7|NZ_CP024618.1_3926787_3929022_-	PRK10872, relA, (p)ppGpp synthetase I/GTP pyrophosphokinase; Provisional	NA|434aa|down_8|NZ_CP024618.1_3929069_3930371_-	PRK13168, rumA, 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD	NA|919aa|down_9|NZ_CP024618.1_3930427_3933184_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional
GCF_002863685.1_ASM286368v1	NZ_CP024618	Escherichia coli strain SMN152SH1 chromosome, complete genome	5	4263111-4263231	4	CRISPRCasFinder	no		DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas6e,cas7,cas8e	Orphan	CACCCCGTAGGCCGGATAAGATGCGCCAGCATCGCATCCGGCA	43	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas6e,cas7,cas8e	NA|55aa|up_0|NZ_CP024618.1_4262875_4263040_+,NA	NA|135aa|up_9|NZ_CP024618.1_4256742_4257147_+	COG5393, COG5393, Predicted membrane protein [Function unknown]	NA|100aa|up_8|NZ_CP024618.1_4257136_4257436_+	pfam13997, YqjK, YqjK-like protein	NA|161aa|up_7|NZ_CP024618.1_4257531_4258014_+	COG2259, COG2259, Predicted membrane protein [Function unknown]	NA|329aa|up_6|NZ_CP024618.1_4258083_4259070_+	COG0435, ECM4, Predicted glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|122aa|up_5|NZ_CP024618.1_4259362_4259728_+	COG3152, COG3152, Predicted membrane protein [Function unknown]	NA|232aa|up_4|NZ_CP024618.1_4259913_4260608_-	pfam03400, DDE_Tnp_IS1, IS1 transposase	NA|119aa|up_3|NZ_CP024618.1_4260743_4261100_+	COG3152, COG3152, Predicted membrane protein [Function unknown]	NA|299aa|up_2|NZ_CP024618.1_4261150_4262047_-	cd08431, PBP2_HupR, The C-terminal substrate binding domain of LysR-type transcriptional regulator, HupR, which regulates expression of the heme uptake receptor HupA; contains the type 2 periplasmic binding fold	NA|234aa|up_1|NZ_CP024618.1_4262151_4262853_+	COG1741, COG1741, Pirin-related protein [General function prediction only]	NA|55aa|up_0|NZ_CP024618.1_4262875_4263040_+	NA	NA|437aa|down_0|NZ_CP024618.1_4263251_4264562_-	COG3681, COG3681, L-cysteine desulfidase [Amino acid transport and metabolism]	NA|444aa|down_1|NZ_CP024618.1_4264589_4265921_-	TIGR00814, membrane_transport_protein_YhjV, serine transporter	NA|455aa|down_2|NZ_CP024618.1_4266193_4267558_-	PRK15040, PRK15040, L-serine ammonia-lyase	NA|130aa|down_3|NZ_CP024618.1_4267629_4268019_-	PRK11401, PRK11401, enamine/imine deaminase	NA|703aa|down_4|NZ_CP024618.1_4268032_4270141_-	cd01678, PFL1, Pyruvate formate lyase 1	NA|403aa|down_5|NZ_CP024618.1_4270359_4271568_-	PRK12379, PRK12379, propionate kinase	NA|444aa|down_6|NZ_CP024618.1_4271593_4272925_-	PRK13629, PRK13629, threonine/serine transporter TdcC; Provisional	NA|330aa|down_7|NZ_CP024618.1_4272946_4273936_-	PRK08638, PRK08638, bifunctional threonine ammonia-lyase/L-serine ammonia-lyase TdcB	NA|313aa|down_8|NZ_CP024618.1_4274034_4274973_-	PRK10341, PRK10341, transcriptional regulator TdcA	NA|112aa|down_9|NZ_CP024618.1_4275161_4275497_+	PRK11424, PRK11424, DNA-binding transcriptional activator TdcR; Provisional
