assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002443035.1_ASM244303v1	NZ_CP022547	Streptococcus thermophilus strain B59671 chromosome, complete genome	1	295224-295498	1,1,1	CRT,CRISPRCasFinder,PILER-CR	no	cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	DinG,cas1,cas2,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,DEDDh,RT,csn2	Type I-E	NNGGATCACCCCCGCGTGTGCGGGAAAAAC,GGATCACCCCCGCGTGTGCGGGAAAAAC,AGGATCACCCCCGCGTGTGCGGGAAAAAC	30,28,29	0	0	NA	NA	I-C,I-E,II-B:I-C,I-E,II-B:I-C,I-E,II-B	4,4,3	4	TypeI-E	DinG,cas1,cas2,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,DEDDh,RT,csn2	NA|183aa|up_8|NZ_CP022547.1_277528_278077_-,NA|537aa|up_7|NZ_CP022547.1_278069_279680_-,NA|24aa|up_5|NZ_CP022547.1_283122_283194_-,NA	NA|999aa|up_9|NZ_CP022547.1_274487_277484_-	cd05930, A_NRPS, The adenylation domain of nonribosomal peptide synthetases (NRPS)	NA|183aa|up_8|NZ_CP022547.1_277528_278077_-	NA	NA|537aa|up_7|NZ_CP022547.1_278069_279680_-	NA	NA|340aa|up_6|NZ_CP022547.1_279666_280686_-	TIGR01072, UDP-N-acetylglucosamine_1-carboxyvinyltransferase, UDP-N-acetylglucosamine 1-carboxyvinyltransferase	NA|24aa|up_5|NZ_CP022547.1_283122_283194_-	NA	NA|285aa|up_4|NZ_CP022547.1_283275_284130_+	TIGR01716, HTH-type_transcriptional_regulator_rgg, transcriptional activator, Rgg/GadR/MutR family, C-terminal domain	NA|585aa|up_3|NZ_CP022547.1_287147_288902_-	TIGR01350, Dihydrolipoyl_dehydrogenase, dihydrolipoamide dehydrogenase	NA|463aa|up_2|NZ_CP022547.1_289072_290461_-	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed	NA|333aa|up_1|NZ_CP022547.1_290617_291616_-	COG0022, AcoB, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [Energy production and conversion]	NA|324aa|up_0|NZ_CP022547.1_291640_292612_-	COG1071, AcoA, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit [Energy production and conversion]	cas1|314aa|down_0|NZ_CP022547.1_296424_297366_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|213aa|down_1|NZ_CP022547.1_297369_298008_-	pfam08798, CRISPR_assoc, CRISPR associated protein	cas5|242aa|down_2|NZ_CP022547.1_298012_298738_-	cd09756, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|356aa|down_3|NZ_CP022547.1_298751_299819_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|198aa|down_4|NZ_CP022547.1_299808_300402_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|556aa|down_5|NZ_CP022547.1_300411_302079_-	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cas3|927aa|down_6|NZ_CP022547.1_302065_304846_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|423aa|down_7|NZ_CP022547.1_307131_308400_-	PRK09357, pyrC, dihydroorotase; Validated	NA|218aa|down_8|NZ_CP022547.1_308419_309073_-	PRK05254, PRK05254, uracil-DNA glycosylase; Provisional	NA|399aa|down_9|NZ_CP022547.1_309287_310484_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]
GCF_002443035.1_ASM244303v1	NZ_CP022547	Streptococcus thermophilus strain B59671 chromosome, complete genome	2	1815698-1816920	2,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas1,cas2,csn2	DinG,cas1,cas2,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,DEDDh,RT,csn2	Type II-A	GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC	36,36,36	0	0	NA	NA	NA:NA:NA	18,18,8	18	TypeII-A	DinG,cas1,cas2,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,DEDDh,RT,csn2	NA|65aa|up_4|NZ_CP022547.1_1810172_1810367_+,NA	NA|447aa|up_9|NZ_CP022547.1_1804432_1805773_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|52aa|up_8|NZ_CP022547.1_1805820_1805976_-	NF033215, BlpC_Streptocco, quorum-sensing system pheromone BlpC	NA|454aa|up_7|NZ_CP022547.1_1806074_1807436_-	TIGR01000, Mesentericin_Y105_secretion_protein_MesE, bacteriocin secretion accessory protein	NA|718aa|up_6|NZ_CP022547.1_1807447_1809601_-	TIGR01193, transport/processing_ATP-binding_protein, ABC-type bacteriocin transporter	NA|86aa|up_5|NZ_CP022547.1_1809898_1810156_+	TIGR03789, pdsO, proteobacterial sortase system peptidoglycan-associated protein	NA|65aa|up_4|NZ_CP022547.1_1810172_1810367_+	NA	NA|451aa|up_3|NZ_CP022547.1_1810969_1812321_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas1|304aa|up_2|NZ_CP022547.1_1813348_1814260_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|108aa|up_1|NZ_CP022547.1_1814261_1814585_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|351aa|up_0|NZ_CP022547.1_1814581_1815634_+	pfam16813, Cas_St_Csn2, CRISPR-associated protein Csn2 subfamily St	NA|272aa|down_0|NZ_CP022547.1_1816961_1817777_-	COG3689, COG3689, Predicted membrane protein [Function unknown]	NA|301aa|down_1|NZ_CP022547.1_1817776_1818679_-	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
