assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_010120595.1_ASM1012059v1	NZ_CP038020	Streptococcus thermophilus strain ATCC 19258 chromosome, complete genome	1	249240-249321	1	CRISPRCasFinder	no	csa3	cas3,csa3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG	Type I-A	AATAACATTCAAGTGTTTGTTTGAATA	27	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG	NA|113aa|up_5|NZ_CP038020.1_241791_242130_-,NA|119aa|up_4|NZ_CP038020.1_242162_242519_-,NA|100aa|down_6|NZ_CP038020.1_255406_255706_-,NA|122aa|down_7|NZ_CP038020.1_255698_256064_-	NA|49aa|up_9|NZ_CP038020.1_240291_240438_-	TIGR01992, phosphotransferase_system_trehalose_permease, PTS system, trehalose-specific IIBC component	NA|53aa|up_8|NZ_CP038020.1_240452_240611_+	COG2188, PhnF, Transcriptional regulators [Transcription]	NA|183aa|up_7|NZ_CP038020.1_240614_241163_+	TIGR02404, Trehalose_operon_transcriptional_repressor, trehalose operon repressor, B	NA|187aa|up_6|NZ_CP038020.1_241218_241779_-	PRK13690, PRK13690, hypothetical protein; Provisional	NA|113aa|up_5|NZ_CP038020.1_241791_242130_-	NA	NA|119aa|up_4|NZ_CP038020.1_242162_242519_-	NA	NA|399aa|up_3|NZ_CP038020.1_244221_245418_-	pfam00589, Phage_integrase, Phage integrase family	NA|68aa|up_2|NZ_CP038020.1_245497_245701_-	pfam09035, Tn916-Xis, Excisionase from transposon Tn916	NA|77aa|up_1|NZ_CP038020.1_246090_246321_-	pfam12645, HTH_16, Helix-turn-helix domain	NA|708aa|up_0|NZ_CP038020.1_247098_249222_-	cd07545, P-type_ATPase_Cd-like, P-type heavy metal-transporting ATPase, similar to Staphylococcus aureus plasmid pI258 CadA, a cadmium-efflux ATPase	NA|168aa|down_0|NZ_CP038020.1_249448_249952_+	pfam01252, Peptidase_A8, Signal peptidase (SPase) II	csa3|112aa|down_1|NZ_CP038020.1_250122_250458_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|382aa|down_2|NZ_CP038020.1_250629_251774_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|419aa|down_3|NZ_CP038020.1_251836_253093_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|91aa|down_4|NZ_CP038020.1_253122_253395_-	cd04793, LanC, Cyclases involved in the biosynthesis of lantibiotics	NA|335aa|down_5|NZ_CP038020.1_253693_254698_+	pfam05598, DUF772, Transposase domain (DUF772)	NA|100aa|down_6|NZ_CP038020.1_255406_255706_-	NA	NA|122aa|down_7|NZ_CP038020.1_255698_256064_-	NA	NA|294aa|down_8|NZ_CP038020.1_258801_259683_-	PRK07315, PRK07315, fructose-bisphosphate aldolase; Provisional	NA|84aa|down_9|NZ_CP038020.1_260763_261015_-	COG3572, GshA, Gamma-glutamylcysteine synthetase [Coenzyme metabolism]
GCF_010120595.1_ASM1012059v1	NZ_CP038020	Streptococcus thermophilus strain ATCC 19258 chromosome, complete genome	2	934656-934765	2	CRISPRCasFinder	no		cas3,csa3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG	Orphan	TAACATCTAGAGAGGACCGGATAGGTCCTTTTTTTATG	38	1	21	934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727|934694-934727	NZ_CP038020.1_910-877|NZ_CP038020.1_160010-159977|NZ_CP038020.1_258643-258610|NZ_CP038020.1_282795-282762|NZ_CP038020.1_711581-711614|NZ_CP038020.1_922208-922241|NZ_CP038020.1_1184557-1184524|NZ_CP038020.1_1188611-1188578|NZ_CP038020.1_1218754-1218787|NZ_CP038020.1_1483999-1483966|NZ_CP038020.1_1517276-1517243|NZ_CP038020.1_1600905-1600872|NZ_CP038020.1_1778578-1778545|NZ_CP038020.1_1807817-1807850|NZ_CP038020.1_1990379-1990412|NZ_CP038020.1_2093615-2093582|NZ_CP038020.1_391036-391069|NZ_CP038020.1_502241-502274|NZ_CP038020.1_783785-783818|NZ_CP038020.1_1282378-1282345|NZ_CP038020.1_1978536-1978503	NA	1	1	Orphan	cas3,csa3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG	NA|71aa|up_5|NZ_CP038020.1_927447_927660_+,NA|212aa|down_1|NZ_CP038020.1_937731_938367_-,NA|88aa|down_5|NZ_CP038020.1_941730_941994_+,NA|159aa|down_7|NZ_CP038020.1_942461_942938_+,NA|75aa|down_8|NZ_CP038020.1_943081_943306_+	NA|412aa|up_9|NZ_CP038020.1_923757_924993_+	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|114aa|up_8|NZ_CP038020.1_925003_925345_+	COG0347, GlnK, Nitrogen regulatory protein PII [Amino acid transport and metabolism]	NA|219aa|up_7|NZ_CP038020.1_925442_926099_+	COG1705, FlgJ, Muramidase (flagellum-specific) [Cell motility and secretion / Intracellular trafficking and secretion]	NA|274aa|up_6|NZ_CP038020.1_926121_926943_+	cd07516, HAD_Pase, phosphatase, similar to Escherichia coli Cof and Thermotoga maritima TM0651; belongs to the haloacid dehalogenase-like superfamily	NA|71aa|up_5|NZ_CP038020.1_927447_927660_+	NA	NA|197aa|up_4|NZ_CP038020.1_927647_928238_+	COG1283, NptA, Na+/phosphate symporter [Inorganic ion transport and metabolism]	NA|383aa|up_3|NZ_CP038020.1_928322_929471_+	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|306aa|up_2|NZ_CP038020.1_931100_932018_+	PRK09348, glyQ, glycyl-tRNA synthetase subunit alpha; Validated	NA|679aa|up_1|NZ_CP038020.1_932303_934340_+	PRK01233, glyS, glycyl-tRNA synthetase subunit beta; Validated	NA|86aa|up_0|NZ_CP038020.1_934352_934610_+	PRK02539, PRK02539, DUF896 family protein	NA|464aa|down_0|NZ_CP038020.1_936270_937662_+	COG1113, AnsP, Gamma-aminobutyrate permease and related permeases [Amino acid transport and metabolism]	NA|212aa|down_1|NZ_CP038020.1_937731_938367_-	NA	NA|59aa|down_2|NZ_CP038020.1_938668_938845_+	TIGR01995, beta-glucosides_PTS_EIIBCA, PTS system, beta-glucoside-specific IIABC component	NA|412aa|down_3|NZ_CP038020.1_939340_940576_+	TIGR01995, beta-glucosides_PTS_EIIBCA, PTS system, beta-glucoside-specific IIABC component	NA|253aa|down_4|NZ_CP038020.1_940611_941370_-	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|88aa|down_5|NZ_CP038020.1_941730_941994_+	NA	NA|136aa|down_6|NZ_CP038020.1_942057_942465_+	pfam02322, Cyt_bd_oxida_II, Cytochrome bd terminal oxidase subunit II	NA|159aa|down_7|NZ_CP038020.1_942461_942938_+	NA	NA|75aa|down_8|NZ_CP038020.1_943081_943306_+	NA	NA|302aa|down_9|NZ_CP038020.1_943420_944326_+	COG0583, LysR, Transcriptional regulator [Transcription]
GCF_010120595.1_ASM1012059v1	NZ_CP038020	Streptococcus thermophilus strain ATCC 19258 chromosome, complete genome	3	1166167-1168509	3,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas9,cas1,cas2,csn2	cas3,csa3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG	Type II-A,Type II-B,Type II-C	GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC,GTTTTTGTACTCTCAAGATTTAAGTAACTGTACAAC	36,36,36	1	1	1166928-1166957	NZ_CP038020.1_1104924-1104953	NA:NA:NA	35,35,16	35	TypeII-A,TypeII-B,TypeII-C	cas3,csa3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG	NA|121aa|up_5|NZ_CP038020.1_1157843_1158206_+,NA	NA|371aa|up_9|NZ_CP038020.1_1154519_1155631_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|66aa|up_8|NZ_CP038020.1_1155804_1156002_-	pfam03029, ATP_bind_1, Conserved hypothetical ATP binding protein	NA|131aa|up_7|NZ_CP038020.1_1156697_1157090_+	pfam08349, DUF1722, Protein of unknown function (DUF1722)	NA|121aa|up_6|NZ_CP038020.1_1157086_1157449_+	TIGR02328, TIGR02328, conserved hypothetical protein	NA|121aa|up_5|NZ_CP038020.1_1157843_1158206_+	NA	NA|204aa|up_4|NZ_CP038020.1_1159411_1160023_+	pfam09911, DUF2140, Uncharacterized protein conserved in bacteria (DUF2140)	cas9|1121aa|up_3|NZ_CP038020.1_1160278_1163641_+	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	cas1|304aa|up_2|NZ_CP038020.1_1163818_1164730_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|108aa|up_1|NZ_CP038020.1_1164731_1165055_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|351aa|up_0|NZ_CP038020.1_1165051_1166104_+	pfam16813, Cas_St_Csn2, CRISPR-associated protein Csn2 subfamily St	NA|272aa|down_0|NZ_CP038020.1_1168550_1169366_-	COG3689, COG3689, Predicted membrane protein [Function unknown]	NA|301aa|down_1|NZ_CP038020.1_1169365_1170268_-	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|711aa|down_2|NZ_CP038020.1_1170631_1172764_+	COG2183, Tex, Transcriptional accessory protein [Transcription]	NA|147aa|down_3|NZ_CP038020.1_1172750_1173191_+	PRK04351, PRK04351, SprT family protein	NA|88aa|down_4|NZ_CP038020.1_1173255_1173519_+	COG1983, PspC, Putative stress-responsive transcriptional regulator [Transcription / Signal transduction mechanisms]	NA|310aa|down_5|NZ_CP038020.1_1173646_1174576_+	PRK05428, PRK05428, HPr kinase/phosphorylase; Provisional	NA|264aa|down_6|NZ_CP038020.1_1174575_1175367_+	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|133aa|down_7|NZ_CP038020.1_1175488_1175887_+	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|147aa|down_8|NZ_CP038020.1_1175899_1176340_+	pfam12732, YtxH, YtxH-like protein	NA|99aa|down_9|NZ_CP038020.1_1176360_1176657_-	pfam11674, DUF3270, Protein of unknown function (DUF3270)
GCF_010120595.1_ASM1012059v1	NZ_CP038020	Streptococcus thermophilus strain ATCC 19258 chromosome, complete genome	4	1930999-1932488	2,4,2	PILER-CR,CRISPRCasFinder,CRT	no	csn2,cas2,cas1,cas9	cas3,csa3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG	Type II-A,Type II-B,Type II-C	GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC	36,36,36	0	0	NA	NA	II-A,II-B:II-A,II-B:II-A,II-B	22,22,22	22	TypeII-A,TypeII-B,TypeII-C	cas3,csa3,DEDDh,RT,cas9,cas1,cas2,csn2,DinG	NA|66aa|up_8|NZ_CP038020.1_1924576_1924774_-,NA|148aa|up_5|NZ_CP038020.1_1927791_1928235_-,NA|80aa|up_0|NZ_CP038020.1_1930675_1930915_-,NA	NA|247aa|up_9|NZ_CP038020.1_1923763_1924504_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|66aa|up_8|NZ_CP038020.1_1924576_1924774_-	NA	NA|438aa|up_7|NZ_CP038020.1_1924940_1926254_-	PRK12297, obgE, GTPase CgtA; Reviewed	NA|43aa|up_6|NZ_CP038020.1_1926347_1926476_-	pfam13253, DUF4044, Protein of unknown function (DUF4044)	NA|148aa|up_5|NZ_CP038020.1_1927791_1928235_-	NA	NA|128aa|up_4|NZ_CP038020.1_1928265_1928649_-	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis,    transport, and catabolism]	NA|102aa|up_3|NZ_CP038020.1_1928742_1929048_-	pfam02566, OsmC, OsmC-like protein	NA|81aa|up_2|NZ_CP038020.1_1929337_1929580_+	COG2182, MalE, Maltose-binding periplasmic proteins/domains [Carbohydrate transport and metabolism]	NA|297aa|up_1|NZ_CP038020.1_1929747_1930638_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|80aa|up_0|NZ_CP038020.1_1930675_1930915_-	NA	csn2|220aa|down_0|NZ_CP038020.1_1932809_1933469_-	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	cas2|115aa|down_1|NZ_CP038020.1_1933458_1933803_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|290aa|down_2|NZ_CP038020.1_1933799_1934669_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1389aa|down_3|NZ_CP038020.1_1934668_1938835_-	TIGR01865, conserved_hypothetical_protein, CRISPR subtype II/NMENI RNA-guided endonuclease Cas9/Csn1	NA|216aa|down_4|NZ_CP038020.1_1939167_1939815_-	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|575aa|down_5|NZ_CP038020.1_1939926_1941651_-	PRK04778, PRK04778, septation ring formation regulator EzrA; Provisional	NA|651aa|down_6|NZ_CP038020.1_1941747_1943700_-	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|232aa|down_7|NZ_CP038020.1_1944348_1945044_-	pfam04172, LrgB, LrgB-like family	NA|125aa|down_8|NZ_CP038020.1_1945036_1945411_-	pfam03788, LrgA, LrgA family	NA|118aa|down_9|NZ_CP038020.1_1945690_1946044_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC
