assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003691425.1_ASM369142v1	CP033096	Escherichia coli strain CP53 chromosome, complete genome	1	1006499-1006832	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	Type I-E	GTGTTCCCCGCGCCAGCGGGGATAAACCG,GTGTTCCCCGCGCCAGCGGGGATAAACCG,GTGTTCCCCGCGCCAGCGGGGATAAACCG	29,29,29	0	0	NA	NA	I-E:I-E:I-E	5,5,5	5	TypeI-E	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	NA|47aa|up_1|CP033096.1_1005208_1005349_-,NA	NA|434aa|up_9|CP033096.1_995905_997207_+	PRK13168, rumA, 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD	NA|745aa|up_8|CP033096.1_997254_999489_+	PRK10872, relA, (p)ppGpp synthetase I/GTP pyrophosphokinase; Provisional	NA|83aa|up_7|CP033096.1_999566_999815_+	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|112aa|up_6|CP033096.1_999814_1000150_+	PRK09907, PRK09907, endoribonuclease MazF	NA|264aa|up_5|CP033096.1_1000220_1001012_+	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|546aa|up_4|CP033096.1_1001239_1002877_+	PRK05380, pyrG, CTP synthetase; Validated	NA|433aa|up_3|CP033096.1_1002964_1004263_+	PRK00077, eno, enolase; Provisional	NA|291aa|up_2|CP033096.1_1004322_1005195_-	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|47aa|up_1|CP033096.1_1005208_1005349_-	NA	NA|224aa|up_0|CP033096.1_1005487_1006159_+	TIGR04322, organic_radical_activating_enzyme, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE	NA|91aa|down_0|CP033096.1_1007699_1007972_-	COG2440, FixX, Ferredoxin-like protein [Energy production and conversion]	NA|424aa|down_1|CP033096.1_1007962_1009234_-	PRK10015, PRK10015, oxidoreductase; Provisional	NA|122aa|down_2|CP033096.1_1009311_1009677_-	cd00470, PTPS, 6-pyruvoyl tetrahydropterin synthase (PTPS)	NA|600aa|down_3|CP033096.1_1009992_1011792_+	PRK10953, cysJ, NADPH-dependent assimilatory sulfite reductase flavoprotein subunit	NA|571aa|down_4|CP033096.1_1011791_1013504_+	PRK13504, PRK13504, NADPH-dependent assimilatory sulfite reductase hemoprotein subunit	NA|245aa|down_5|CP033096.1_1013578_1014313_+	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	NA|51aa|down_6|CP033096.1_1014577_1014730_+	pfam01848, HOK_GEF, Hok/gef family	NA|371aa|down_7|CP033096.1_1014922_1016035_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|339aa|down_8|CP033096.1_1016114_1017131_+	COG3039, COG3039, Transposase and inactivated derivatives, IS5 family [DNA replication, recombination, and repair]	cas8e|521aa|down_9|CP033096.1_1020269_1021832_+	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1
GCA_003691425.1_ASM369142v1	CP033096	Escherichia coli strain CP53 chromosome, complete genome	2	1026128-1026583	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	Type I-E	GTGTTCCCCGCGCCAGCGGGGATAAACCG,GTGTTCCCCGCGCCAGCGGGGATAAACCG,GTGTTCCCCGCGCCAGCGGGGATAAACCG	29,29,29	0	0	NA	NA	I-E:I-E:I-E	7,7,7	7	TypeI-E	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	NA,NA	NA|51aa|up_9|CP033096.1_1014577_1014730_+	pfam01848, HOK_GEF, Hok/gef family	NA|371aa|up_8|CP033096.1_1014922_1016035_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|339aa|up_7|CP033096.1_1016114_1017131_+	COG3039, COG3039, Transposase and inactivated derivatives, IS5 family [DNA replication, recombination, and repair]	cas8e|521aa|up_6|CP033096.1_1020269_1021832_+	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1	cse2gr11|179aa|up_5|CP033096.1_1021828_1022365_+	TIGR02548, CRISPR_system_Cascade_subunit_CasB, CRISPR type I-E/ECOLI-associated protein CasB/Cse2	cas7|352aa|up_4|CP033096.1_1022376_1023432_+	TIGR01869, CRISPR_system_Cascade_subunit_CasC, CRISPR-associated protein Cas7/Cse4/CasC, subtype I-E/ECOLI	cas5|249aa|up_3|CP033096.1_1023442_1024189_+	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|217aa|up_2|CP033096.1_1024170_1024821_+	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cas1|308aa|up_1|CP033096.1_1024817_1025741_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|98aa|up_0|CP033096.1_1025737_1026031_+	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	NA|346aa|down_0|CP033096.1_1026664_1027702_-	PRK10199, PRK10199, alkaline phosphatase isozyme conversion aminopeptidase; Provisional	NA|303aa|down_1|CP033096.1_1027953_1028862_+	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|476aa|down_2|CP033096.1_1028863_1030291_+	PRK05124, cysN, sulfate adenylyltransferase subunit 1; Provisional	NA|202aa|down_3|CP033096.1_1030290_1030896_+	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|108aa|down_4|CP033096.1_1030945_1031269_+	pfam12084, DUF3561, Protein of unknown function (DUF3561)	NA|104aa|down_5|CP033096.1_1031462_1031774_+	PRK00888, ftsB, cell division protein FtsB; Reviewed	NA|237aa|down_6|CP033096.1_1031792_1032503_+	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|160aa|down_7|CP033096.1_1032502_1032982_+	PRK00084, ispF, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; Reviewed	NA|350aa|down_8|CP033096.1_1032978_1034028_+	PRK00984, truD, tRNA pseudouridine synthase D; Reviewed	NA|254aa|down_9|CP033096.1_1034008_1034770_+	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional
GCA_003691425.1_ASM369142v1	CP033096	Escherichia coli strain CP53 chromosome, complete genome	4	1555695-1555812	4	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	Orphan	CCGAGCCGTAGGCCGGATAAGGCGTTCACGC	31	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	NA,NA	NA|300aa|up_9|CP033096.1_1545178_1546078_-	PRK09956, PRK09956, ISNCY family transposase	NA|397aa|up_8|CP033096.1_1546270_1547461_-	TIGR03379, glycerol3P_GlpC, glycerol-3-phosphate dehydrogenase, anaerobic, C subunit	NA|420aa|up_7|CP033096.1_1547457_1548717_-	COG3075, GlpB, Anaerobic glycerol-3-phosphate dehydrogenase [Amino acid transport and metabolism]	NA|543aa|up_6|CP033096.1_1548706_1550335_-	PRK11101, glpA, anaerobic glycerol-3-phosphate dehydrogenase subunit A	NA|453aa|up_5|CP033096.1_1550607_1551966_+	PRK11273, glpT, glycerol-3-phosphate transporter	NA|359aa|up_4|CP033096.1_1551970_1553047_+	PRK11143, glpQ, glycerophosphodiester phosphodiesterase; Provisional	NA|69aa|up_3|CP033096.1_1553088_1553295_-	PRK09729, PRK09729, hypothetical protein; Provisional	NA|217aa|up_2|CP033096.1_1553509_1554160_+	PRK09902, PRK09902, lipopolysaccharide kinase InaA	NA|85aa|up_1|CP033096.1_1554213_1554468_-	PRK10713, PRK10713, 2Fe-2S ferredoxin-like protein	NA|377aa|up_0|CP033096.1_1554467_1555598_-	PRK09101, nrdB, ribonucleotide-diphosphate reductase subunit beta; Reviewed	NA|762aa|down_0|CP033096.1_1555831_1558117_-	PRK09103, PRK09103, ribonucleoside-diphosphate reductase subunit alpha	NA|1251aa|down_1|CP033096.1_1558812_1562565_+	PRK09752, PRK09752, AIDA-I family autotransporter YfaL	NA|241aa|down_2|CP033096.1_1562692_1563415_-	PRK05134, PRK05134, bifunctional 2-polyprenyl-6-hydroxyphenol methylase/3-demethylubiquinol 3-O-methyltransferase UbiG	NA|876aa|down_3|CP033096.1_1563561_1566189_+	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|563aa|down_4|CP033096.1_1566337_1568026_+	COG4685, COG4685, Uncharacterized protein conserved in bacteria [Function unknown]	NA|208aa|down_5|CP033096.1_1568022_1568646_+	COG3234, COG3234, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1535aa|down_6|CP033096.1_1568579_1573184_+	COG2373, COG2373, Large extracellular alpha-helical protein [General function prediction only]	NA|550aa|down_7|CP033096.1_1573184_1574834_+	COG5445, COG5445, Predicted secreted protein [Function unknown]	NA|252aa|down_8|CP033096.1_1574858_1575614_+	COG4676, COG4676, Uncharacterized protein conserved in bacteria [Function unknown]	NA|395aa|down_9|CP033096.1_1575687_1576872_-	PRK05790, PRK05790, putative acyltransferase; Provisional
GCA_003691425.1_ASM369142v1	CP033096	Escherichia coli strain CP53 chromosome, complete genome	5	2140068-2140191	5	CRISPRCasFinder	no	DEDDh	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	Unclear	CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA	43	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	NA,NA|30aa|down_7|CP033096.1_2149087_2149177_+	NA|471aa|up_9|CP033096.1_2129522_2130935_-	PRK09206, PRK09206, pyruvate kinase PykF	NA|70aa|up_8|CP033096.1_2131491_2131701_+	PRK10292, PRK10292, fumarate hydratase FumD	NA|209aa|up_7|CP033096.1_2132155_2132782_+	PRK09898, PRK09898, ferredoxin-like protein	NA|701aa|up_6|CP033096.1_2132802_2134905_+	PRK09849, PRK09849, putative oxidoreductase; Provisional	NA|216aa|up_5|CP033096.1_2134908_2135556_+	PRK09947, PRK09947, YdhW family putative oxidoreductase system protein	NA|223aa|up_4|CP033096.1_2135619_2136288_+	TIGR03149, cyt_nit_nrfC, cytochrome c nitrite reductase, Fe-S protein	NA|262aa|up_3|CP033096.1_2136284_2137070_+	PRK15006, PRK15006, thiosulfate reductase cytochrome B subunit; Provisional	NA|271aa|up_2|CP033096.1_2137073_2137886_+	PRK09946, PRK09946, hypothetical protein; Provisional	NA|535aa|up_1|CP033096.1_2137897_2139502_-	PRK09897, PRK09897, FAD-NAD(P)-binding protein	NA|102aa|up_0|CP033096.1_2139627_2139933_-	PRK11118, PRK11118, putative monooxygenase; Provisional	NA|419aa|down_0|CP033096.1_2140505_2141762_+	PRK09945, PRK09945, hypothetical protein; Provisional	NA|458aa|down_1|CP033096.1_2141802_2143176_-	PRK01766, PRK01766, multidrug efflux protein; Reviewed	NA|214aa|down_2|CP033096.1_2143390_2144032_+	PRK13020, PRK13020, riboflavin synthase subunit alpha; Provisional	NA|383aa|down_3|CP033096.1_2144071_2145220_-	PRK11705, PRK11705, cyclopropane fatty acyl phospholipid synthase	NA|404aa|down_4|CP033096.1_2145510_2146722_-	PRK11043, PRK11043, Bcr/CflA family multidrug efflux MFS transporter	NA|311aa|down_5|CP033096.1_2146834_2147767_+	PRK11074, PRK11074, putative DNA-binding transcriptional regulator; Provisional	NA|342aa|down_6|CP033096.1_2147763_2148789_-	PRK10703, PRK10703, HTH-type transcriptional repressor PurR	NA|30aa|down_7|CP033096.1_2149087_2149177_+	NA	NA|390aa|down_8|CP033096.1_2149342_2150512_+	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]	NA|194aa|down_9|CP033096.1_2150746_2151328_-	PRK10543, PRK10543, superoxide dismutase [Fe]
GCA_003691425.1_ASM369142v1	CP033096	Escherichia coli strain CP53 chromosome, complete genome	7	2952721-2952926	7,3,3	CRISPRCasFinder,CRT,PILER-CR	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	Orphan	TTTCTAAGCTGCCTGTACGGCAGTGAAC,TTTCTAAGCTGCCTGTACGGCAGTGAAC,TTTCTAAGCTGCCTGTACGGCAGTGAACG	28,28,29	0	0	NA	NA	I-F:I-F:I-F	3,3,2	3	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	NA|71aa|up_5|CP033096.1_2946065_2946278_-,NA	NA|448aa|up_9|CP033096.1_2939239_2940583_-	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|204aa|up_8|CP033096.1_2940593_2941205_-	TIGR00547, Outer-membrane_lipoprotein_carrier_protein, periplasmic chaperone LolA	NA|1326aa|up_7|CP033096.1_2941359_2945337_-	PRK10263, PRK10263, DNA translocase FtsK; Provisional	NA|165aa|up_6|CP033096.1_2945471_2945966_-	PRK11169, PRK11169, leucine-responsive transcriptional regulator Lrp	NA|71aa|up_5|CP033096.1_2946065_2946278_-	NA	NA|322aa|up_4|CP033096.1_2946510_2947476_+	PRK10262, PRK10262, thioredoxin reductase; Provisional	NA|589aa|up_3|CP033096.1_2947598_2949365_+	PRK11174, PRK11174, cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed	NA|574aa|up_2|CP033096.1_2949365_2951087_+	PRK11160, PRK11160, cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed	NA|235aa|up_1|CP033096.1_2951128_2951833_+	PRK00301, aat, leucyl/phenylalanyl-tRNA--protein transferase; Reviewed	NA|73aa|up_0|CP033096.1_2952117_2952336_+	PRK00276, infA, translation initiation factor IF-1; Validated	NA|759aa|down_0|CP033096.1_2953136_2955413_-	PRK11034, clpA, ATP-dependent Clp protease ATP-binding subunit; Provisional	NA|107aa|down_1|CP033096.1_2955443_2955764_-	PRK00033, clpS, ATP-dependent Clp protease adaptor protein ClpS; Reviewed	NA|75aa|down_2|CP033096.1_2956086_2956311_+	PRK09937, PRK09937, cold shock-like protein CspD	NA|647aa|down_3|CP033096.1_2956389_2958330_-	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|372aa|down_4|CP033096.1_2958326_2959442_-	PRK11578, PRK11578, macrolide transporter subunit MacA; Provisional	NA|331aa|down_5|CP033096.1_2959556_2960549_+	COG2990, VirK, Uncharacterized protein conserved in bacteria [Function unknown]	NA|553aa|down_6|CP033096.1_2960545_2962204_-	COG3593, COG3593, Predicted ATP-dependent endonuclease of the OLD family [DNA replication, recombination, and repair]	NA|232aa|down_7|CP033096.1_2962628_2963324_+	PRK05420, PRK05420, aquaporin Z; Provisional	NA|300aa|down_8|CP033096.1_2963818_2964718_+	COG2431, COG2431, Predicted membrane protein [Function unknown]	NA|551aa|down_9|CP033096.1_2964861_2966514_+	PRK05290, PRK05290, hybrid cluster protein; Provisional
GCA_003691425.1_ASM369142v1	CP033096	Escherichia coli strain CP53 chromosome, complete genome	8	3119666-3119810	8	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	Orphan	GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC	52	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	NA|94aa|up_5|CP033096.1_3115906_3116188_+,NA	NA|262aa|up_9|CP033096.1_3114090_3114876_+	TIGR01913, Uncharacterized_protein_UU154, phage recombination protein Bet	NA|227aa|up_8|CP033096.1_3114872_3115553_+	pfam09588, YqaJ, YqaJ-like viral recombinase domain	NA|61aa|up_7|CP033096.1_3115549_3115732_+	pfam07026, DUF1317, Protein of unknown function (DUF1317)	NA|64aa|up_6|CP033096.1_3115704_3115896_+	pfam07131, DUF1382, Protein of unknown function (DUF1382)	NA|94aa|up_5|CP033096.1_3115906_3116188_+	NA	NA|73aa|up_4|CP033096.1_3116286_3116505_+	PHA00080, PHA00080, DksA-like zinc finger domain containing protein	NA|93aa|up_3|CP033096.1_3116551_3116830_+	pfam13973, DUF4222, Domain of unknown function (DUF4222)	NA|73aa|up_2|CP033096.1_3116915_3117134_+	pfam07825, Exc, Excisionase-like protein	NA|357aa|up_1|CP033096.1_3117111_3118182_+	cd00800, INT_Lambda_C, C-terminal catalytic domain of Lambda integrase, a tyrosine-based site-specific recombinase	NA|428aa|up_0|CP033096.1_3118316_3119600_+	PRK10531, PRK10531, putative acyl-CoA thioester hydrolase	NA|754aa|down_0|CP033096.1_3119833_3122095_-	PRK11413, PRK11413, putative hydratase; Provisional	NA|478aa|down_1|CP033096.1_3122277_3123711_-	pfam00939, Na_sulph_symp, Sodium:sulfate symporter transmembrane region	NA|351aa|down_2|CP033096.1_3123786_3124839_-	NF033377, OMA_tautomer, 4-oxalomesaconate tautomerase	NA|318aa|down_3|CP033096.1_3125022_3125976_+	cd08440, PBP2_LTTR_like_4, TThe C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|332aa|down_4|CP033096.1_3126016_3127012_-	PRK11028, PRK11028, 6-phosphogluconolactonase; Provisional	NA|273aa|down_5|CP033096.1_3127166_3127985_+	PRK10530, PRK10530, pyridoxal phosphate (PLP) phosphatase; Provisional	NA|353aa|down_6|CP033096.1_3127985_3129044_-	PRK11144, modC, molybdenum ABC transporter ATP-binding protein ModC	NA|230aa|down_7|CP033096.1_3129046_3129736_-	PRK09421, modB, molybdate ABC transporter permease subunit	NA|258aa|down_8|CP033096.1_3129735_3130509_-	PRK10677, modA, molybdate transporter periplasmic protein; Provisional	NA|50aa|down_9|CP033096.1_3130675_3130825_-	pfam10766, AcrZ, Multidrug efflux pump-associated protein AcrZ
GCA_003691425.1_ASM369142v1	CP033096	Escherichia coli strain CP53 chromosome, complete genome	9	3366523-3366619	9	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	Orphan	TTGTAGGCCTGATAAGATGCGTCAAGC	27	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	NA,NA	NA|231aa|up_9|CP033096.1_3358318_3359011_-	PRK15195, PRK15195, molecular chaperone FimC	NA|181aa|up_8|CP033096.1_3359230_3359773_-	PRK15194, PRK15194, type 1 fimbrial protein subunit FimA	NA|289aa|up_7|CP033096.1_3360243_3361110_+	PRK10792, PRK10792, bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase FolD	NA|71aa|up_6|CP033096.1_3361111_3361324_+	PRK11507, PRK11507, ribosome-associated protein YbcJ	NA|174aa|up_5|CP033096.1_3361431_3361953_+	COG1988, COG1988, Predicted membrane-bound metal-dependent hydrolases [General function prediction only]	NA|462aa|up_4|CP033096.1_3361988_3363374_-	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|165aa|up_3|CP033096.1_3363547_3364042_+	PRK10791, PRK10791, peptidylprolyl isomerase B	NA|241aa|up_2|CP033096.1_3364044_3364767_+	PRK05340, PRK05340, UDP-2,3-diacylglucosamine hydrolase; Provisional	NA|170aa|up_1|CP033096.1_3364884_3365394_+	COG0041, PurE, Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase [Nucleotide transport and metabolism]	NA|356aa|up_0|CP033096.1_3365390_3366458_+	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed	NA|298aa|down_0|CP033096.1_3366652_3367546_-	PRK09411, PRK09411, carbamate kinase; Reviewed	NA|272aa|down_1|CP033096.1_3367542_3368358_-	pfam11392, DUF2877, Protein of unknown function (DUF2877)	NA|420aa|down_2|CP033096.1_3368368_3369628_-	pfam06545, DUF1116, Protein of unknown function (DUF1116)	NA|556aa|down_3|CP033096.1_3369637_3371305_-	PRK06091, PRK06091, membrane protein FdrA; Validated	NA|339aa|down_4|CP033096.1_3371687_3372704_+	COG3039, COG3039, Transposase and inactivated derivatives, IS5 family [DNA replication, recombination, and repair]	NA|382aa|down_5|CP033096.1_3373341_3374487_-	PRK09932, PRK09932, glycerate 3-kinase	NA|434aa|down_6|CP033096.1_3374508_3375810_-	PRK11412, PRK11412, uracil/xanthine transporter	NA|454aa|down_7|CP033096.1_3375866_3377228_-	PRK08044, PRK08044, allantoinase AllB	NA|485aa|down_8|CP033096.1_3377287_3378742_-	PRK11375, PRK11375, putative allantoin permease	NA|293aa|down_9|CP033096.1_3378910_3379789_-	PRK15059, PRK15059, 2-hydroxy-3-oxopropionate reductase
GCA_003691425.1_ASM369142v1	CP033096	Escherichia coli strain CP53 chromosome, complete genome	10	3507385-3507529	10	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	Orphan	TTTTGCAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAT	43	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	NA,NA	NA|357aa|up_9|CP033096.1_3492523_3493594_-	PRK00147, queA, S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional	NA|194aa|up_8|CP033096.1_3493686_3494268_+	PRK10045, PRK10045, ACP phosphodiesterase	NA|605aa|up_7|CP033096.1_3494272_3496087_-	PRK10785, PRK10785, maltodextrin glucosidase; Provisional	NA|458aa|up_6|CP033096.1_3496245_3497619_-	PRK10580, proY, putative proline-specific permease; Provisional	NA|440aa|up_5|CP033096.1_3497694_3499014_-	PRK15433, PRK15433, branched-chain amino acid transporter carrier protein BrnQ	NA|432aa|up_4|CP033096.1_3499420_3500716_-	PRK11006, phoR, phosphate regulon sensor histidine kinase PhoR	NA|230aa|up_3|CP033096.1_3500773_3501463_-	PRK10161, PRK10161, phosphate response regulator transcription factor PhoB	NA|401aa|up_2|CP033096.1_3501652_3502855_+	PRK10966, PRK10966, exonuclease subunit SbcD; Provisional	NA|1049aa|up_1|CP033096.1_3502851_3505998_+	PRK10246, PRK10246, exonuclease subunit SbcC; Provisional	NA|395aa|up_0|CP033096.1_3506123_3507308_+	PRK10091, PRK10091, MFS transport protein AraJ; Provisional	NA|303aa|down_0|CP033096.1_3507552_3508461_-	PRK09557, PRK09557, fructokinase; Reviewed	NA|304aa|down_1|CP033096.1_3508585_3509497_+	PRK00321, rdgC, recombination associated protein; Reviewed	NA|95aa|down_2|CP033096.1_3510143_3510428_-	PRK10579, PRK10579, pyrimidine/purine nucleoside phosphorylase	NA|226aa|down_3|CP033096.1_3510499_3511177_-	PRK10481, PRK10481, hypothetical protein; Provisional	NA|64aa|down_4|CP033096.1_3511434_3511626_-	PRK10380, PRK10380, hypothetical protein; Provisional	NA|175aa|down_5|CP033096.1_3511675_3512200_-	PRK03731, aroL, shikimate kinase AroL	NA|155aa|down_6|CP033096.1_3512376_3512841_-	PRK00124, PRK00124, YaiI/YqxD family protein	NA|270aa|down_7|CP033096.1_3512960_3513770_+	PRK11880, PRK11880, pyrroline-5-carboxylate reductase; Reviewed	NA|372aa|down_8|CP033096.1_3513786_3514902_-	PRK10245, adrA, diguanylate cyclase AdrA; Provisional	NA|107aa|down_9|CP033096.1_3515003_3515324_-	PRK11505, PRK11505, phosphate starvation-inducible protein PsiF
GCA_003691425.1_ASM369142v1	CP033096	Escherichia coli strain CP53 chromosome, complete genome	12	3890675-3890816	12	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	Orphan	GCTGGAGAGCAACCGTAGGCCGGATAAGATGCGCCAGCATCGCATCCGGCGA	52	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	NA,NA	NA|96aa|up_9|CP033096.1_3878467_3878755_-	PRK15449, PRK15449, ferredoxin-like protein FixX; Provisional	NA|429aa|up_8|CP033096.1_3878751_3880038_-	PRK10157, PRK10157, putative oxidoreductase FixC; Provisional	NA|314aa|up_7|CP033096.1_3880088_3881030_-	PRK03363, fixB, electron transfer flavoprotein subunit alpha/FixB family protein	NA|339aa|up_6|CP033096.1_3881820_3882837_-	COG3039, COG3039, Transposase and inactivated derivatives, IS5 family [DNA replication, recombination, and repair]	NA|505aa|up_5|CP033096.1_3883486_3885001_+	PRK03356, PRK03356, L-carnitine/gamma-butyrobetaine antiport BCCT transporter	NA|381aa|up_4|CP033096.1_3885031_3886174_+	PRK03354, PRK03354, crotonobetainyl-CoA dehydrogenase; Validated	NA|406aa|up_3|CP033096.1_3886302_3887520_+	PRK03525, PRK03525, L-carnitine CoA-transferase	NA|518aa|up_2|CP033096.1_3887593_3889147_+	PRK08008, caiC, putative crotonobetaine/carnitine-CoA ligase; Validated	NA|262aa|up_1|CP033096.1_3889255_3890041_+	PRK03580, PRK03580, crotonobetainyl-CoA hydratase	NA|197aa|up_0|CP033096.1_3890046_3890637_+	PRK13627, PRK13627, carnitine operon protein CaiE; Provisional	NA|132aa|down_0|CP033096.1_3890845_3891241_-	PRK11476, PRK11476, carnitine metabolism transcriptional regulator CaiF	NA|1074aa|down_1|CP033096.1_3891502_3894724_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|383aa|down_2|CP033096.1_3894741_3895890_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|274aa|down_3|CP033096.1_3896345_3897167_-	COG0289, DapB, Dihydrodipicolinate reductase [Amino acid transport and metabolism]	NA|305aa|down_4|CP033096.1_3897333_3898248_-	PRK10768, PRK10768, ribonucleoside hydrolase RihC; Provisional	NA|317aa|down_5|CP033096.1_3898313_3899264_-	PRK01045, ispH, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed	NA|150aa|down_6|CP033096.1_3899265_3899715_-	PRK15095, PRK15095, FKBP-type peptidyl-prolyl cis-trans isomerase; Provisional	NA|165aa|down_7|CP033096.1_3899839_3900334_-	PRK00376, lspA, lipoprotein signal peptidase	NA|939aa|down_8|CP033096.1_3900333_3903150_-	PRK05743, ileS, isoleucyl-tRNA synthetase; Reviewed	NA|314aa|down_9|CP033096.1_3903192_3904134_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase
GCA_003691425.1_ASM369142v1	CP033096	Escherichia coli strain CP53 chromosome, complete genome	13	4033689-4033838	13	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	Orphan	TGAACGCCTTATCCGACCTACACAGCACTGAACTCGTAGGCCTGATAAGACGCG	54	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	NA|103aa|up_9|CP033096.1_4021921_4022230_-,NA|77aa|down_6|CP033096.1_4040978_4041209_-,NA|448aa|down_7|CP033096.1_4041195_4042539_-	NA|103aa|up_9|CP033096.1_4021921_4022230_-	NA	NA|301aa|up_8|CP033096.1_4022756_4023659_-	pfam09160, FimH_man-bind, FimH, mannose binding	NA|168aa|up_7|CP033096.1_4023678_4024182_-	COG3539, FimA, P pilus assembly protein, pilin FimA [Cell motility and secretion / Intracellular trafficking and secretion]	NA|110aa|up_6|CP033096.1_4024735_4025065_-	PRK15193, PRK15193, outer membrane usher protein; Provisional	NA|238aa|up_5|CP033096.1_4025805_4026519_-	PRK09692, PRK09692, integrase; Provisional	NA|340aa|up_4|CP033096.1_4026921_4027941_+	cd05283, CAD1, Cinnamyl alcohol dehydrogenases (CAD)	NA|501aa|up_3|CP033096.1_4028070_4029573_+	pfam05872, DUF853, Bacterial protein of unknown function (DUF853)	NA|361aa|up_2|CP033096.1_4029691_4030774_-	PRK15071, PRK15071, lipopolysaccharide ABC transporter permease; Provisional	NA|367aa|up_1|CP033096.1_4030773_4031874_-	PRK15120, PRK15120, lipopolysaccharide ABC transporter permease LptF; Provisional	NA|504aa|up_0|CP033096.1_4032140_4033652_+	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|148aa|down_0|CP033096.1_4034005_4034449_+	PRK05728, PRK05728, DNA polymerase III subunit chi; Validated	NA|952aa|down_1|CP033096.1_4034448_4037304_+	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|399aa|down_2|CP033096.1_4037357_4038554_-	COG4269, COG4269, Predicted membrane protein [Function unknown]	NA|168aa|down_3|CP033096.1_4038746_4039250_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|139aa|down_4|CP033096.1_4039295_4039712_-	PRK11191, PRK11191, ribonuclease E inhibitor RraB	NA|335aa|down_5|CP033096.1_4039873_4040878_+	PRK03515, PRK03515, ornithine carbamoyltransferase subunit I; Provisional	NA|77aa|down_6|CP033096.1_4040978_4041209_-	NA	NA|448aa|down_7|CP033096.1_4041195_4042539_-	NA	NA|151aa|down_8|CP033096.1_4042661_4043114_-	COG2731, EbgC, Beta-galactosidase, beta subunit [Carbohydrate transport and metabolism]	NA|198aa|down_9|CP033096.1_4043258_4043852_-	COG1309, AcrR, Transcriptional regulator [Transcription]
GCA_003691425.1_ASM369142v1	CP033094	Escherichia coli strain CP53 plasmid pCP53-mcr, complete sequence	1	99262-99419	1	PILER-CR	no			Orphan	CCGTACCCGGTATAGTGGAT	20	0	0	NA	NA	NA	2	2	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	NA|114aa|up_9|CP033094.1_91335_91677_+,NA|120aa|up_8|CP033094.1_91681_92041_+,NA|60aa|up_7|CP033094.1_92092_92272_+,NA|235aa|up_6|CP033094.1_92268_92973_+,NA|140aa|up_5|CP033094.1_93005_93425_-,NA|166aa|down_0|CP033094.1_99608_100106_-,NA|68aa|down_1|CP033094.1_100120_100324_-,NA|59aa|down_3|CP033094.1_103676_103853_-,NA|386aa|down_5|CP033094.1_105807_106965_-,NA|274aa|down_7|CP033094.1_108163_108985_-,NA|67aa|down_9|CP033094.1_109971_110172_-	NA|114aa|up_9|CP033094.1_91335_91677_+	NA	NA|120aa|up_8|CP033094.1_91681_92041_+	NA	NA|60aa|up_7|CP033094.1_92092_92272_+	NA	NA|235aa|up_6|CP033094.1_92268_92973_+	NA	NA|140aa|up_5|CP033094.1_93005_93425_-	NA	NA|795aa|up_4|CP033094.1_93512_95897_-	pfam13750, Big_3_3, Bacterial Ig-like domain (group 3)	NA|201aa|up_3|CP033094.1_96147_96750_+	pfam13752, DUF4165, Domain of unknown function (DUF4165)	NA|79aa|up_2|CP033094.1_96761_96998_+	pfam12245, Big_3_2, Bacterial Ig-like domain (group 3)	NA|199aa|up_1|CP033094.1_97153_97750_+	pfam13750, Big_3_3, Bacterial Ig-like domain (group 3)	NA|294aa|up_0|CP033094.1_98141_99023_+	pfam16441, DUF5038, Domain of unknown function (DUF5038)	NA|166aa|down_0|CP033094.1_99608_100106_-	NA	NA|68aa|down_1|CP033094.1_100120_100324_-	NA	NA|819aa|down_2|CP033094.1_100774_103231_-	pfam13750, Big_3_3, Bacterial Ig-like domain (group 3)	NA|59aa|down_3|CP033094.1_103676_103853_-	NA	NA|585aa|down_4|CP033094.1_103867_105622_-	pfam13708, DUF4942, Domain of unknown function (DUF4942)	NA|386aa|down_5|CP033094.1_105807_106965_-	NA	NA|275aa|down_6|CP033094.1_107324_108149_+	TIGR00571, DNA_adenine_methylase, DNA adenine methylase (dam)	NA|274aa|down_7|CP033094.1_108163_108985_-	NA	NA|236aa|down_8|CP033094.1_109267_109975_-	cd10719, DnaJ_zf, Zinc finger domain of DnaJ and HSP40	NA|67aa|down_9|CP033094.1_109971_110172_-	NA
GCA_003691425.1_ASM369142v1	CP033093	Escherichia coli strain CP53 plasmid pCP53-38k, complete sequence	1	19489-19608	1	CRISPRCasFinder	no	csa3	csa3	Type I-A	TGAAAGGTGGATGGGTACGCACTGAAAGGTGGATGGGTACGCA	43	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,WYL,DinG,c2c9_V-U4	NA|65aa|up_8|CP033093.1_9599_9794_-,NA|77aa|down_0|CP033093.1_19691_19922_-,NA|119aa|down_2|CP033093.1_21319_21676_+,NA|326aa|down_3|CP033093.1_21672_22650_-,NA|94aa|down_4|CP033093.1_22674_22956_-,NA|28aa|down_7|CP033093.1_24895_24979_-	NA|287aa|up_9|CP033093.1_8192_9053_+	PRK15442, PRK15442, beta-lactamase TEM; Provisional	NA|65aa|up_8|CP033093.1_9599_9794_-	NA	NA|250aa|up_7|CP033093.1_12621_13371_+	COG3121, FimC, P pilus assembly protein, chaperone PapD [Cell motility and secretion / Intracellular trafficking and secretion]	NA|355aa|up_6|CP033093.1_13372_14437_+	pfam00419, Fimbrial, Fimbrial protein	NA|67aa|up_5|CP033093.1_14479_14680_-	PRK02922, PRK02922, cell surface composition regulator GlgS	NA|198aa|up_4|CP033093.1_14948_15542_+	pfam07290, DUF1449, Protein of unknown function (DUF1449)	NA|104aa|up_3|CP033093.1_16336_16648_+	pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit	NA|161aa|up_2|CP033093.1_16856_17339_+	pfam04471, Mrr_cat, Restriction endonuclease	NA|94aa|up_1|CP033093.1_18104_18386_-	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|84aa|up_0|CP033093.1_18375_18627_-	PRK02854, PRK02854, primosomal protein DnaT	NA|77aa|down_0|CP033093.1_19691_19922_-	NA	NA|279aa|down_1|CP033093.1_19860_20697_+	pfam01051, Rep_3, Initiator Replication protein	NA|119aa|down_2|CP033093.1_21319_21676_+	NA	NA|326aa|down_3|CP033093.1_21672_22650_-	NA	NA|94aa|down_4|CP033093.1_22674_22956_-	NA	NA|221aa|down_5|CP033093.1_23053_23716_-	PHA02518, PHA02518, ParA-like protein; Provisional	NA|215aa|down_6|CP033093.1_24096_24741_+	cd03767, SR_Res_par, Serine recombinase (SR) family, Partitioning (par)-Resolvase subfamily, catalytic domain; Serine recombinases catalyze site-specific recombination of DNA molecules by a concerted, four-strand cleavage and rejoining mechanism which involves a transient phosphoserine linkage between DNA and the enzyme	NA|28aa|down_7|CP033093.1_24895_24979_-	NA	NA|996aa|down_8|CP033093.1_25331_28319_-	pfam01526, DDE_Tnp_Tn3, Tn3 transposase DDE domain	NA|378aa|down_9|CP033093.1_29018_30152_-	COG0701, COG0701, Predicted permeases [General function prediction only]
