assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001705585.1_ASM170558v1	NZ_CP016877	Streptococcus thermophilus strain KLDS 3.1003 chromosome, complete genome	1	121258-122019	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	csn2,cas2,cas1,cas9	csn2,cas2,cas1,cas9,DEDDh,cas3,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	Type II-C,Type II-B,Type II-A	GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAACT,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC	36,37,36	0	0	NA	NA	II-A,II-B:II-A,II-B:II-A,II-B	11,11,11	11	TypeII-C,TypeII-B,TypeII-A	csn2,cas2,cas1,cas9,DEDDh,cas3,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	NA|66aa|up_7|NZ_CP016877.1_114968_115166_-,NA|148aa|up_4|NZ_CP016877.1_118051_118495_-,NA|81aa|up_2|NZ_CP016877.1_119596_119839_+,NA|80aa|up_0|NZ_CP016877.1_120934_121174_-,NA	NA|737aa|up_9|NZ_CP016877.1_111945_114156_+	cd13619, PBP2_GlnP, Glutamine-binding domain of ABC transporter, a member of the type 2 periplasmic binding fold protein superfamily	NA|247aa|up_8|NZ_CP016877.1_114155_114896_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|66aa|up_7|NZ_CP016877.1_114968_115166_-	NA	NA|438aa|up_6|NZ_CP016877.1_115332_116646_-	PRK12297, obgE, GTPase CgtA; Reviewed	NA|43aa|up_5|NZ_CP016877.1_116739_116868_-	pfam13253, DUF4044, Protein of unknown function (DUF4044)	NA|148aa|up_4|NZ_CP016877.1_118051_118495_-	NA	NA|128aa|up_3|NZ_CP016877.1_118525_118909_-	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis,    transport, and catabolism]	NA|81aa|up_2|NZ_CP016877.1_119596_119839_+	NA	NA|297aa|up_1|NZ_CP016877.1_120006_120897_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|80aa|up_0|NZ_CP016877.1_120934_121174_-	NA	csn2|220aa|down_0|NZ_CP016877.1_122339_122999_-	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	cas2|115aa|down_1|NZ_CP016877.1_122988_123333_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|290aa|down_2|NZ_CP016877.1_123329_124199_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1389aa|down_3|NZ_CP016877.1_124198_128365_-	TIGR01865, conserved_hypothetical_protein, CRISPR subtype II/NMENI RNA-guided endonuclease Cas9/Csn1	NA|216aa|down_4|NZ_CP016877.1_128698_129346_-	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|575aa|down_5|NZ_CP016877.1_129457_131182_-	PRK04778, PRK04778, septation ring formation regulator EzrA; Provisional	NA|651aa|down_6|NZ_CP016877.1_131278_133231_-	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|202aa|down_7|NZ_CP016877.1_133231_133837_-	cd07523, HAD_YsbA-like, uncharacterized family of the haloacid dehalogenase-like superfamily, similar to the uncharacterized Lactococcus lactis YsbA	NA|232aa|down_8|NZ_CP016877.1_133877_134573_-	pfam04172, LrgB, LrgB-like family	NA|125aa|down_9|NZ_CP016877.1_134565_134940_-	pfam03788, LrgA, LrgA family
GCF_001705585.1_ASM170558v1	NZ_CP016877	Streptococcus thermophilus strain KLDS 3.1003 chromosome, complete genome	2	1254932-1255890	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,csn2	csn2,cas2,cas1,cas9,DEDDh,cas3,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	Type II-A	GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC	36,36,36	1	1	1255561-1255590	NZ_CP016877.1_1497926-1497897	NA:NA:NA	14,14,14	14	TypeII-A	csn2,cas2,cas1,cas9,DEDDh,cas3,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	NA|51aa|up_7|NZ_CP016877.1_1243284_1243437_+,NA	NA|344aa|up_9|NZ_CP016877.1_1241524_1242556_+	TIGR01295, Pediocin_PA-1_biosynthesis_protein_PedC, bacteriocin transport accessory protein, putative	NA|232aa|up_8|NZ_CP016877.1_1242559_1243255_+	pfam02517, Abi, CAAX protease self-immunity	NA|51aa|up_7|NZ_CP016877.1_1243284_1243437_+	NA	NA|132aa|up_6|NZ_CP016877.1_1243622_1244018_+	COG1191, FliA, DNA-directed RNA polymerase specialized sigma subunit [Transcription]	NA|598aa|up_5|NZ_CP016877.1_1244524_1246318_+	cd00338, Ser_Recombinase, Serine Recombinase family, catalytic domain; a DNA binding domain may be present either N- or C-terminal to the catalytic domain	NA|277aa|up_4|NZ_CP016877.1_1246522_1247353_+	cd10447, GIY-YIG_unchar_2, GIY-YIG domain of uncharacterized hypothetical protein found in bacteria and archaea	NA|205aa|up_3|NZ_CP016877.1_1248246_1248861_+	pfam09911, DUF2140, Uncharacterized protein conserved in bacteria (DUF2140)	cas1|304aa|up_2|NZ_CP016877.1_1252583_1253495_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas2|108aa|up_1|NZ_CP016877.1_1253496_1253820_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|351aa|up_0|NZ_CP016877.1_1253816_1254869_+	pfam16813, Cas_St_Csn2, CRISPR-associated protein Csn2 subfamily St	NA|272aa|down_0|NZ_CP016877.1_1255931_1256747_-	COG3689, COG3689, Predicted membrane protein [Function unknown]	NA|301aa|down_1|NZ_CP016877.1_1256746_1257649_-	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|711aa|down_2|NZ_CP016877.1_1258012_1260145_+	COG2183, Tex, Transcriptional accessory protein [Transcription]	NA|147aa|down_3|NZ_CP016877.1_1260131_1260572_+	PRK04351, PRK04351, SprT family protein	NA|88aa|down_4|NZ_CP016877.1_1260637_1260901_+	COG1983, PspC, Putative stress-responsive transcriptional regulator [Transcription / Signal transduction mechanisms]	NA|310aa|down_5|NZ_CP016877.1_1261028_1261958_+	PRK05428, PRK05428, HPr kinase/phosphorylase; Provisional	NA|264aa|down_6|NZ_CP016877.1_1261957_1262749_+	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|133aa|down_7|NZ_CP016877.1_1262871_1263270_+	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|147aa|down_8|NZ_CP016877.1_1263282_1263723_+	pfam12732, YtxH, YtxH-like protein	NA|99aa|down_9|NZ_CP016877.1_1263743_1264040_-	pfam11674, DUF3270, Protein of unknown function (DUF3270)
GCF_001705585.1_ASM170558v1	NZ_CP016877	Streptococcus thermophilus strain KLDS 3.1003 chromosome, complete genome	3	1506836-1507597	3,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	csn2,cas2,cas1,cas9,DEDDh,cas3,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	Type III-B,Type III-A,Type III-C,Type III-D	GATATAAACCTAATTACCTCGAGAGGGGACGGAAAC,GATATAAACCTAATTACCTCGAGAGGGGACGGAAAC,GATATAAACCTAATTACCTCGAGAGGGGACGGAAAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	9,10,10	10	TypeIII-B,TypeIII-A,TypeIII-C,TypeIII-D	csn2,cas2,cas1,cas9,DEDDh,cas3,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	NA|175aa|up_7|NZ_CP016877.1_1501912_1502437_+,NA|72aa|up_4|NZ_CP016877.1_1503308_1503524_-,NA	NA|532aa|up_9|NZ_CP016877.1_1498548_1500144_+	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|394aa|up_8|NZ_CP016877.1_1500133_1501315_+	cd17262, RMtype1_S_Aco12261I-TRD2-CR2, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Aminobacterium colombiense DSM 12261 S subunit (S	NA|175aa|up_7|NZ_CP016877.1_1501912_1502437_+	NA	NA|75aa|up_6|NZ_CP016877.1_1502437_1502662_+	pfam05857, TraX, TraX protein	NA|183aa|up_5|NZ_CP016877.1_1502703_1503252_+	cd03135, GATase1_DJ-1, Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1	NA|72aa|up_4|NZ_CP016877.1_1503308_1503524_-	NA	NA|268aa|up_3|NZ_CP016877.1_1503492_1504296_+	PRK00054, PRK00054, dihydroorotate dehydrogenase electron transfer subunit; Reviewed	NA|316aa|up_2|NZ_CP016877.1_1504314_1505262_+	PRK07259, PRK07259, dihydroorotate dehydrogenase	cas1|326aa|up_1|NZ_CP016877.1_1505428_1506406_+	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	cas2|110aa|up_0|NZ_CP016877.1_1506405_1506735_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas6|244aa|down_0|NZ_CP016877.1_1507718_1508450_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas10|759aa|down_1|NZ_CP016877.1_1508430_1510707_+	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csm2gr11|131aa|down_2|NZ_CP016877.1_1510710_1511103_+	pfam03750, Csm2_III-A, Csm2 Type III-A	csm3gr7|221aa|down_3|NZ_CP016877.1_1511102_1511765_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|300aa|down_4|NZ_CP016877.1_1511766_1512666_+	pfam17953, Csm4_C, CRISPR Csm4 C-terminal domain	csm5gr7|358aa|down_5|NZ_CP016877.1_1512668_1513742_+	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm6|392aa|down_6|NZ_CP016877.1_1513895_1515071_+	cd09699, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	NA|232aa|down_7|NZ_CP016877.1_1515270_1515966_+	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	NA|210aa|down_8|NZ_CP016877.1_1516054_1516684_+	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated	NA|484aa|down_9|NZ_CP016877.1_1517377_1518829_-	COG3104, PTR2, Dipeptide/tripeptide permease [Amino acid transport and metabolism]
GCF_001705585.1_ASM170558v1	NZ_CP016877	Streptococcus thermophilus strain KLDS 3.1003 chromosome, complete genome	4	1681483-1681556	4	CRISPRCasFinder	no		csn2,cas2,cas1,cas9,DEDDh,cas3,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	Orphan	GGTTGGGGCGGTCCAGAAGATGG	23	0	0	NA	NA	NA	1	1	Orphan	csn2,cas2,cas1,cas9,DEDDh,cas3,DinG,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csm6	NA,NA|57aa|down_0|NZ_CP016877.1_1682308_1682479_+,NA|169aa|down_1|NZ_CP016877.1_1682645_1683152_+,NA|124aa|down_9|NZ_CP016877.1_1689868_1690240_-	NA|79aa|up_9|NZ_CP016877.1_1671966_1672203_-	TIGR02327, conserved_hypothetical_protein, conserved hypothetical integral membrane protein	NA|338aa|up_8|NZ_CP016877.1_1672391_1673405_-	cd05247, UDP_G4E_1_SDR_e, UDP-glucose 4 epimerase, subgroup 1, extended (e) SDRs	NA|167aa|up_7|NZ_CP016877.1_1673419_1673920_-	pfam16116, DUF4832, Domain of unknown function (DUF4832)	NA|398aa|up_6|NZ_CP016877.1_1674898_1676092_-	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|313aa|up_5|NZ_CP016877.1_1676371_1677310_+	PRK11886, PRK11886, bifunctional biotin--[acetyl-CoA-carboxylase] ligase/biotin operon repressor BirA	NA|62aa|up_4|NZ_CP016877.1_1677296_1677482_-	pfam11676, DUF3272, Protein of unknown function (DUF3272)	NA|551aa|up_3|NZ_CP016877.1_1677708_1679361_-	PRK05563, PRK05563, DNA polymerase III subunits gamma and tau; Validated	NA|170aa|up_2|NZ_CP016877.1_1679360_1679870_-	COG1956, COG1956, GAF domain-containing protein [Signal transduction mechanisms]	NA|300aa|up_1|NZ_CP016877.1_1679905_1680805_-	PRK00091, miaA, tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed	NA|59aa|up_0|NZ_CP016877.1_1680880_1681057_+	pfam11240, DUF3042, Protein of unknown function (DUF3042)	NA|57aa|down_0|NZ_CP016877.1_1682308_1682479_+	NA	NA|169aa|down_1|NZ_CP016877.1_1682645_1683152_+	NA	NA|350aa|down_2|NZ_CP016877.1_1683240_1684290_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|116aa|down_3|NZ_CP016877.1_1684452_1684800_-	PRK05338, rplS, 50S ribosomal protein L19; Provisional	NA|318aa|down_4|NZ_CP016877.1_1685225_1686179_+	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|411aa|down_5|NZ_CP016877.1_1686165_1687398_-	cd03682, ClC_sycA_like, ClC sycA-like chloride channel proteins	NA|91aa|down_6|NZ_CP016877.1_1687407_1687680_-	PRK07248, PRK07248, chorismate mutase	NA|512aa|down_7|NZ_CP016877.1_1687754_1689290_-	cd01031, EriC, ClC chloride channel EriC	NA|148aa|down_8|NZ_CP016877.1_1689340_1689784_-	PRK07308, PRK07308, flavodoxin; Validated	NA|124aa|down_9|NZ_CP016877.1_1689868_1690240_-	NA
