assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009223885.1_ASM922388v1	NZ_CP045144	Ancylobacter sp. TS-1 chromosome, complete genome	1	31726-31828	1	CRISPRCasFinder	no		csa3,DEDDh	Orphan	GGCGGCATGGACTTCTGAGGAAGTCC	26	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh	NA,NA|239aa|down_5|NZ_CP045144.1_38355_39072_+,NA|154aa|down_7|NZ_CP045144.1_39985_40447_-	NA|159aa|up_9|NZ_CP045144.1_17158_17635_+	cd00907, Bacterioferritin, Bacterioferritin, ferritin-like diiron-binding domain	NA|347aa|up_8|NZ_CP045144.1_17906_18947_+	cd13542, PBP2_FutA1_ilke, Substrate binding domain of ferric iron-binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|555aa|up_7|NZ_CP045144.1_19047_20712_+	COG1178, ThiP, ABC-type Fe3+ transport system, permease component [Inorganic ion transport and metabolism]	NA|656aa|up_6|NZ_CP045144.1_20848_22816_-	PRK03584, PRK03584, acetoacetate--CoA ligase	NA|630aa|up_5|NZ_CP045144.1_23021_24911_+	COG2982, AsmA, Uncharacterized protein involved in outer membrane biogenesis [Cell envelope biogenesis, outer membrane]	NA|151aa|up_4|NZ_CP045144.1_24917_25370_-	COG1238, COG1238, Predicted membrane protein [Function unknown]	NA|885aa|up_3|NZ_CP045144.1_25418_28073_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|124aa|up_2|NZ_CP045144.1_28395_28767_+	TIGR01985, kDa_protein, phasin	NA|165aa|up_1|NZ_CP045144.1_29008_29503_+	TIGR01985, kDa_protein, phasin	NA|99aa|up_0|NZ_CP045144.1_29758_30055_+	PRK00364, groES, co-chaperonin GroES; Reviewed	NA|210aa|down_0|NZ_CP045144.1_32113_32743_+	PRK00015, rnhB, ribonuclease HII; Validated	NA|341aa|down_1|NZ_CP045144.1_32764_33787_-	COG1294, AppB, Cytochrome bd-type quinol oxidase, subunit 2 [Energy production and conversion]	NA|471aa|down_2|NZ_CP045144.1_33790_35203_-	pfam01654, Cyt_bd_oxida_I, Cytochrome bd terminal oxidase subunit I	NA|303aa|down_3|NZ_CP045144.1_35324_36233_-	cd10030, UDG-F4_TTUDGA_SPO1dp_like, Uracil DNA glycosylase family 4, includes Thermotoga maritima TTUDGA, Bacillus phage SPO1 DNA polymerase, and similar proteins	NA|556aa|down_4|NZ_CP045144.1_36344_38012_+	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|239aa|down_5|NZ_CP045144.1_38355_39072_+	NA	NA|298aa|down_6|NZ_CP045144.1_39105_39999_+	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|154aa|down_7|NZ_CP045144.1_39985_40447_-	NA	NA|281aa|down_8|NZ_CP045144.1_40797_41640_+	cd03256, ABC_PhnC_transporter, ATP-binding cassette domain of the binding protein-dependent phosphonate transport system	NA|336aa|down_9|NZ_CP045144.1_41653_42661_+	cd13575, PBP2_PnhD, Substrate binding domain of ABC-type phosphonate uptake system; contains the type 2 periplasmic binding fold
GCF_009223885.1_ASM922388v1	NZ_CP045144	Ancylobacter sp. TS-1 chromosome, complete genome	2	1159459-1159539	2	CRISPRCasFinder	no		csa3,DEDDh	Orphan	AAAAGCCGCCCTCCGGGGCGGCTTT	25	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh	NA,NA	NA|436aa|up_9|NZ_CP045144.1_1148885_1150193_+	PRK07492, PRK07492, adenylosuccinate lyase; Provisional	NA|189aa|up_8|NZ_CP045144.1_1150220_1150787_-	COG4991, COG4991, Uncharacterized protein with a bacterial SH3 domain homologue [Function unknown]	NA|942aa|up_7|NZ_CP045144.1_1150977_1153803_-	COG4991, COG4991, Uncharacterized protein with a bacterial SH3 domain homologue [Function unknown]	NA|108aa|up_6|NZ_CP045144.1_1153976_1154300_-	pfam07345, DUF1476, Domain of unknown function (DUF1476)	NA|270aa|up_5|NZ_CP045144.1_1154601_1155411_+	PRK09362, PRK09362, phosphoribosylaminoimidazole-succinocarboxamide synthase; Reviewed	NA|81aa|up_4|NZ_CP045144.1_1155551_1155794_+	PRK05974, PRK05974, phosphoribosylformylglycinamidine synthase subunit PurS; Reviewed	NA|233aa|up_3|NZ_CP045144.1_1155799_1156498_+	PRK03619, PRK03619, phosphoribosylformylglycinamidine synthase subunit PurQ	NA|738aa|up_2|NZ_CP045144.1_1156553_1158767_+	PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase subunit PurL	NA|78aa|up_1|NZ_CP045144.1_1158780_1159014_+	COG0271, BolA, Stress-induced morphogen (activity unknown) [Signal transduction mechanisms]	NA|113aa|up_0|NZ_CP045144.1_1159092_1159431_+	TIGR00365, TIGR00365, monothiol glutaredoxin, Grx4 family	NA|206aa|down_0|NZ_CP045144.1_1159563_1160181_-	PRK05327, rpsD, 30S ribosomal protein S4; Validated	NA|399aa|down_1|NZ_CP045144.1_1160482_1161679_-	pfam13406, SLT_2, Transglycosylase SLT domain	NA|288aa|down_2|NZ_CP045144.1_1161835_1162699_-	PRK00865, PRK00865, glutamate racemase; Provisional	NA|279aa|down_3|NZ_CP045144.1_1162870_1163707_-	COG0565, LasT, rRNA methylase [Translation, ribosomal structure and biogenesis]	NA|404aa|down_4|NZ_CP045144.1_1163994_1165206_+	PRK08299, PRK08299, NADP-dependent isocitrate dehydrogenase	NA|417aa|down_5|NZ_CP045144.1_1165275_1166526_-	pfam00520, Ion_trans, Ion transport protein	NA|884aa|down_6|NZ_CP045144.1_1166568_1169220_-	PRK00252, alaS, alanyl-tRNA synthetase; Reviewed	NA|362aa|down_7|NZ_CP045144.1_1169539_1170625_-	PRK09354, recA, recombinase A; Provisional	NA|440aa|down_8|NZ_CP045144.1_1170751_1172071_-	cd10231, YegD_like, Escherichia coli YegD, a putative chaperone protein, and related proteins	NA|501aa|down_9|NZ_CP045144.1_1172160_1173663_-	COG3333, COG3333, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_009223885.1_ASM922388v1	NZ_CP045144	Ancylobacter sp. TS-1 chromosome, complete genome	3	1324398-1324478	3	CRISPRCasFinder	no	csa3	csa3,DEDDh	Type I-A	CCTCTCCCCGACGGGGAGAGGTG	23	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh	NA,NA|227aa|down_2|NZ_CP045144.1_1327485_1328166_-,NA|63aa|down_6|NZ_CP045144.1_1331655_1331844_-,NA|62aa|down_7|NZ_CP045144.1_1331840_1332026_-	NA|306aa|up_9|NZ_CP045144.1_1314155_1315073_-	PRK09429, mepA, penicillin-insensitive murein endopeptidase; Reviewed	NA|340aa|up_8|NZ_CP045144.1_1315081_1316101_-	PRK06132, PRK06132, hypothetical protein; Provisional	NA|128aa|up_7|NZ_CP045144.1_1316338_1316722_+	pfam05239, PRC, PRC-barrel domain	NA|62aa|up_6|NZ_CP045144.1_1316812_1316998_-	COG3422, COG3422, Uncharacterized conserved protein [Function unknown]	NA|602aa|up_5|NZ_CP045144.1_1317068_1318874_-	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|212aa|up_4|NZ_CP045144.1_1318988_1319624_+	cd07737, YcbL-like_MBL-fold, Salmonella enterica serovar typhimurium YcbL and related proteins; MBL-fold metallo hydrolase domain	NA|282aa|up_3|NZ_CP045144.1_1319876_1320722_+	pfam13483, Lactamase_B_3, Beta-lactamase superfamily domain	NA|432aa|up_2|NZ_CP045144.1_1320893_1322189_-	PRK07572, PRK07572, cytosine deaminase; Validated	NA|315aa|up_1|NZ_CP045144.1_1322199_1323144_-	COG1079, COG1079, Uncharacterized ABC-type transport system, permease component [General function prediction only]	NA|356aa|up_0|NZ_CP045144.1_1323146_1324214_-	COG4603, COG4603, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|516aa|down_0|NZ_CP045144.1_1324602_1326150_-	COG3845, COG3845, ABC-type uncharacterized transport systems, ATPase components [General function prediction only]	NA|346aa|down_1|NZ_CP045144.1_1326339_1327377_-	cd06304, PBP1_BmpA_Med_PnrA-like, periplasmic binding component of a family of basic membrane lipoproteins from Borrelia and various putative lipoproteins from other bacteria	NA|227aa|down_2|NZ_CP045144.1_1327485_1328166_-	NA	NA|299aa|down_3|NZ_CP045144.1_1328244_1329141_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|283aa|down_4|NZ_CP045144.1_1329146_1329995_-	PRK05756, PRK05756, pyridoxal kinase PdxY	NA|408aa|down_5|NZ_CP045144.1_1330136_1331360_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|63aa|down_6|NZ_CP045144.1_1331655_1331844_-	NA	NA|62aa|down_7|NZ_CP045144.1_1331840_1332026_-	NA	NA|423aa|down_8|NZ_CP045144.1_1332182_1333451_+	COG0154, GatA, Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases [Translation, ribosomal structure and biogenesis]	NA|256aa|down_9|NZ_CP045144.1_1333567_1334335_+	COG3637, COG3637, Opacity protein and related surface antigens [Cell envelope biogenesis, outer membrane]
GCF_009223885.1_ASM922388v1	NZ_CP045144	Ancylobacter sp. TS-1 chromosome, complete genome	4	2256545-2256660	4	CRISPRCasFinder	no		csa3,DEDDh	Orphan	GCCGGGCCCGCAGGACCTGCGGGACC	26	0	0	NA	NA	NA	2	2	Orphan	csa3,DEDDh	NA,NA|98aa|down_1|NZ_CP045144.1_2258022_2258316_+,NA|153aa|down_8|NZ_CP045144.1_2264040_2264499_-	NA|404aa|up_9|NZ_CP045144.1_2240832_2242044_+	COG1485, COG1485, Predicted ATPase [General function prediction only]	NA|239aa|up_8|NZ_CP045144.1_2242129_2242846_+	pfam06210, DUF1003, Protein of unknown function (DUF1003)	NA|322aa|up_7|NZ_CP045144.1_2243004_2243970_+	PRK06223, PRK06223, malate dehydrogenase; Reviewed	NA|994aa|up_6|NZ_CP045144.1_2244278_2247260_+	PRK09404, sucA, 2-oxoglutarate dehydrogenase E1 component; Reviewed	NA|414aa|up_5|NZ_CP045144.1_2247322_2248564_+	PRK05704, PRK05704, 2-oxoglutarate dehydrogenase complex dihydrolipoyllysine-residue succinyltransferase	NA|556aa|up_4|NZ_CP045144.1_2248669_2250337_-	COG1620, LldP, L-lactate permease [Energy production and conversion]	NA|467aa|up_3|NZ_CP045144.1_2250627_2252028_+	PRK06115, PRK06115, dihydrolipoamide dehydrogenase; Reviewed	NA|145aa|up_2|NZ_CP045144.1_2252465_2252900_-	cd07262, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|334aa|up_1|NZ_CP045144.1_2252930_2253932_-	PRK00236, xerC, site-specific tyrosine recombinase XerC; Reviewed	NA|739aa|up_0|NZ_CP045144.1_2254052_2256269_+	PRK05580, PRK05580, primosome assembly protein PriA; Validated	NA|199aa|down_0|NZ_CP045144.1_2257239_2257836_+	COG1495, DsbB, Disulfide bond formation protein DsbB [Posttranslational modification, protein turnover, chaperones]	NA|98aa|down_1|NZ_CP045144.1_2258022_2258316_+	NA	NA|315aa|down_2|NZ_CP045144.1_2258318_2259263_+	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|428aa|down_3|NZ_CP045144.1_2259371_2260655_-	cd08560, GDPD_EcGlpQ_like_1, Glycerophosphodiester phosphodiesterase domain similar to Escherichia coli periplasmic phosphodiesterase (GlpQ) include uncharacterized proteins	NA|224aa|down_4|NZ_CP045144.1_2260731_2261403_-	COG5482, COG5482, Uncharacterized conserved protein [Function unknown]	NA|230aa|down_5|NZ_CP045144.1_2261408_2262098_-	COG0421, SpeE, Spermidine synthase [Amino acid transport and metabolism]	NA|292aa|down_6|NZ_CP045144.1_2262165_2263041_-	PRK05710, PRK05710, tRNA glutamyl-Q(34) synthetase GluQRS	NA|261aa|down_7|NZ_CP045144.1_2263232_2264015_-	PRK05950, sdhB, succinate dehydrogenase iron-sulfur subunit; Reviewed	NA|153aa|down_8|NZ_CP045144.1_2264040_2264499_-	NA	NA|612aa|down_9|NZ_CP045144.1_2264612_2266448_-	PRK09078, sdhA, succinate dehydrogenase flavoprotein subunit; Reviewed
GCF_009223885.1_ASM922388v1	NZ_CP045144	Ancylobacter sp. TS-1 chromosome, complete genome	5	3019464-3019573	5	CRISPRCasFinder	no		csa3,DEDDh	Orphan	CCTCTCCCCCACGGGGAGAGGTGGC	25	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh	NA,NA	NA|344aa|up_9|NZ_CP045144.1_3008282_3009314_-	COG2334, COG2334, Putative homoserine kinase type II (protein kinase fold) [General function prediction only]	NA|304aa|up_8|NZ_CP045144.1_3009310_3010222_-	TIGR01250, Proline_iminopeptidase, proline-specific peptidase, Bacillus coagulans-type subfamily	NA|270aa|up_7|NZ_CP045144.1_3010227_3011037_-	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|265aa|up_6|NZ_CP045144.1_3011033_3011828_-	COG1177, PotC, ABC-type spermidine/putrescine transport system, permease component II [Amino acid transport and metabolism]	NA|302aa|up_5|NZ_CP045144.1_3011829_3012735_-	COG1176, PotB, ABC-type spermidine/putrescine transport system, permease component I [Amino acid transport and metabolism]	NA|374aa|up_4|NZ_CP045144.1_3012739_3013861_-	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|385aa|up_3|NZ_CP045144.1_3014061_3015216_-	COG0687, PotD, Spermidine/putrescine-binding periplasmic protein [Amino acid transport and metabolism]	NA|260aa|up_2|NZ_CP045144.1_3015475_3016255_+	PRK05653, fabG, 3-oxoacyl-ACP reductase FabG	NA|430aa|up_1|NZ_CP045144.1_3016289_3017579_-	pfam02530, Porin_2, Porin subfamily	NA|574aa|up_0|NZ_CP045144.1_3017635_3019357_-	cd07578, nitrilase_1_R1, First nitrilase domain of an uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases)	NA|331aa|down_0|NZ_CP045144.1_3019581_3020574_-	COG2421, COG2421, Predicted acetamidase/formamidase [Energy production and conversion]	NA|238aa|down_1|NZ_CP045144.1_3020598_3021312_-	cd03224, ABC_TM1139_LivF_branched, ATP-binding cassette domain of branched-chain amino acid transporter	NA|252aa|down_2|NZ_CP045144.1_3021304_3022060_-	cd03219, ABC_Mj1267_LivG_branched, ATP-binding cassette component of branched chain amino acids transport system	NA|355aa|down_3|NZ_CP045144.1_3022059_3023124_-	TIGR03727, urea_t_UrtC_arc, urea ABC transporter, permease protein UrtC, archaeal type	NA|291aa|down_4|NZ_CP045144.1_3023145_3024018_-	TIGR03622, urea_t_UrtB_arc, urea ABC transporter, permease protein UrtB	NA|413aa|down_5|NZ_CP045144.1_3024321_3025560_-	TIGR03669, urea_ABC_arch, urea ABC transporter, substrate-binding protein, archaeal type	NA|219aa|down_6|NZ_CP045144.1_3025579_3026236_-	COG3707, AmiR, Response regulator with putative antiterminator output domain [Signal transduction mechanisms]	NA|373aa|down_7|NZ_CP045144.1_3026235_3027354_-	cd06357, PBP1_AmiC, periplasmic binding domain of amidase (AmiC) that belongs to the type 1 periplasmic binding fold protein family	NA|680aa|down_8|NZ_CP045144.1_3027748_3029788_+	PRK02628, nadE, NAD synthetase; Reviewed	NA|262aa|down_9|NZ_CP045144.1_3029825_3030611_+	COG1611, COG1611, Predicted Rossmann fold nucleotide-binding protein [General function prediction only]
GCF_009223885.1_ASM922388v1	NZ_CP045144	Ancylobacter sp. TS-1 chromosome, complete genome	6	3300517-3300612	6	CRISPRCasFinder	no		csa3,DEDDh	Orphan	CCCTCTCCCCGTCGGGGAGAGGGCT	25	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh	NA,NA|100aa|down_3|NZ_CP045144.1_3304162_3304462_+	NA|409aa|up_9|NZ_CP045144.1_3288220_3289447_-	PRK09064, PRK09064, 5-aminolevulinate synthase; Validated	NA|168aa|up_8|NZ_CP045144.1_3289581_3290085_-	PRK10019, PRK10019, nickel/cobalt efflux transporter RcnA	NA|402aa|up_7|NZ_CP045144.1_3290234_3291440_-	cd09990, Agmatinase-like, Agmatinase-like family	NA|222aa|up_6|NZ_CP045144.1_3291742_3292408_-	PRK00129, upp, uracil phosphoribosyltransferase; Reviewed	NA|423aa|up_5|NZ_CP045144.1_3292628_3293897_+	PRK07198, PRK07198, GTP cyclohydrolase II	NA|414aa|up_4|NZ_CP045144.1_3294041_3295283_+	pfam07958, DUF1688, Protein of unknown function (DUF1688)	NA|329aa|up_3|NZ_CP045144.1_3295420_3296407_-	cd13578, PBP2_Bug27, Aromatic solutes transporter of Bug (Bordetella uptake gene) protein family;  contains the type 2 periplasmic binding fold	NA|329aa|up_2|NZ_CP045144.1_3296648_3297635_-	cd13578, PBP2_Bug27, Aromatic solutes transporter of Bug (Bordetella uptake gene) protein family;  contains the type 2 periplasmic binding fold	NA|365aa|up_1|NZ_CP045144.1_3297989_3299084_+	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|445aa|up_0|NZ_CP045144.1_3299148_3300483_+	COG0687, PotD, Spermidine/putrescine-binding periplasmic protein [Amino acid transport and metabolism]	NA|284aa|down_0|NZ_CP045144.1_3300646_3301498_+	COG1176, PotB, ABC-type spermidine/putrescine transport system, permease component I [Amino acid transport and metabolism]	NA|277aa|down_1|NZ_CP045144.1_3301666_3302497_+	COG1177, PotC, ABC-type spermidine/putrescine transport system, permease component II [Amino acid transport and metabolism]	NA|455aa|down_2|NZ_CP045144.1_3302623_3303988_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|100aa|down_3|NZ_CP045144.1_3304162_3304462_+	NA	NA|323aa|down_4|NZ_CP045144.1_3304947_3305916_+	smart00342, HTH_ARAC, helix_turn_helix, arabinose operon control protein	NA|1048aa|down_5|NZ_CP045144.1_3306454_3309598_+	COG4625, COG4625, Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain [Function unknown]	NA|180aa|down_6|NZ_CP045144.1_3309648_3310188_+	pfam06776, IalB, Invasion associated locus B (IalB) protein	NA|430aa|down_7|NZ_CP045144.1_3311277_3312567_+	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|161aa|down_8|NZ_CP045144.1_3312673_3313156_+	pfam11164, DUF2948, Protein of unknown function (DUF2948)	NA|431aa|down_9|NZ_CP045144.1_3313324_3314617_+	PRK00877, hisD, bifunctional histidinal dehydrogenase/ histidinol dehydrogenase; Reviewed
