assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009856565.1_ASM985656v1	NZ_CP047191	Streptococcus thermophilus strain EU01 chromosome, complete genome	1	1085434-1086855	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,csn2	cas3,DEDDh,csa3,cas1,cas2,csn2,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,cas9	Type II-A	GTTTTAGAGCTGTGTTGTTTCGAATGGTTCCAAAAC,GTTTTAGAGCTGTGTTGTTTCGAATGGTTCCAAAAC,GTTTTAGAGCTGTGTTGTTTCGAATGGTTCCAAAAC	36,36,36	0	0	NA	NA	II-A,II-B:II-A,II-B:II-A,II-B	21,21,21	21	TypeII-A	cas3,DEDDh,csa3,cas1,cas2,csn2,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,cas9	NA,NA|80aa|down_0|NZ_CP047191.1_1086938_1087178_+,NA|148aa|down_4|NZ_CP047191.1_1089618_1090062_+,NA|66aa|down_9|NZ_CP047191.1_1092947_1093145_+	NA|118aa|up_9|NZ_CP047191.1_1071883_1072237_+	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|125aa|up_8|NZ_CP047191.1_1072516_1072891_+	pfam03788, LrgA, LrgA family	NA|232aa|up_7|NZ_CP047191.1_1072883_1073579_+	pfam04172, LrgB, LrgB-like family	NA|202aa|up_6|NZ_CP047191.1_1073619_1074225_+	cd07523, HAD_YsbA-like, uncharacterized family of the haloacid dehalogenase-like superfamily, similar to the uncharacterized Lactococcus lactis YsbA	NA|651aa|up_5|NZ_CP047191.1_1074225_1076178_+	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|575aa|up_4|NZ_CP047191.1_1076274_1077999_+	PRK04778, PRK04778, septation ring formation regulator EzrA; Provisional	NA|216aa|up_3|NZ_CP047191.1_1078110_1078758_+	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	cas1|290aa|up_2|NZ_CP047191.1_1083252_1084122_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|115aa|up_1|NZ_CP047191.1_1084118_1084463_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|220aa|up_0|NZ_CP047191.1_1084452_1085112_+	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	NA|80aa|down_0|NZ_CP047191.1_1086938_1087178_+	NA	NA|297aa|down_1|NZ_CP047191.1_1087215_1088106_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|81aa|down_2|NZ_CP047191.1_1088273_1088516_-	COG2182, MalE, Maltose-binding periplasmic proteins/domains [Carbohydrate transport and metabolism]	NA|128aa|down_3|NZ_CP047191.1_1089204_1089588_+	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis,    transport, and catabolism]	NA|148aa|down_4|NZ_CP047191.1_1089618_1090062_+	NA	NA|78aa|down_5|NZ_CP047191.1_1090198_1090432_+	smart00871, AraC_E_bind, Bacterial transcription activator, effector binding domain	NA|244aa|down_6|NZ_CP047191.1_1090432_1091164_+	COG1187, RsuA, 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases [Translation, ribosomal structure and biogenesis]	NA|43aa|down_7|NZ_CP047191.1_1091245_1091374_+	pfam13253, DUF4044, Protein of unknown function (DUF4044)	NA|438aa|down_8|NZ_CP047191.1_1091467_1092781_+	PRK12297, obgE, GTPase CgtA; Reviewed	NA|66aa|down_9|NZ_CP047191.1_1092947_1093145_+	NA
GCF_009856565.1_ASM985656v1	NZ_CP047191	Streptococcus thermophilus strain EU01 chromosome, complete genome	2	1577770-1578028	2,2,2	CRT,CRISPRCasFinder,PILER-CR	no	csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,cas2,cas1	cas3,DEDDh,csa3,cas1,cas2,csn2,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,cas9	Type III-B,Type III-D,Type III-C,Type III-A	GTTTCCGTCCCCTCTCGAGGTAATTAGGTTTATATC,GTCCCCTCTCGAGGTAATTAGGTTTATATC,GTTTCCGTCCCCTCTCGAGGTAATTAGGTTTATATC	36,30,36	0	0	NA	NA	II-B,III-A:NA:II-B,III-A	3,3,2	3	TypeIII-B,TypeIII-D,TypeIII-C,TypeIII-A	cas3,DEDDh,csa3,cas1,cas2,csn2,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,cas9	NA|43aa|up_9|NZ_CP047191.1_1562718_1562847_+,NA|72aa|down_4|NZ_CP047191.1_1581338_1581554_+,NA|73aa|down_8|NZ_CP047191.1_1583295_1583514_-	NA|43aa|up_9|NZ_CP047191.1_1562718_1562847_+	NA	NA|484aa|up_8|NZ_CP047191.1_1567071_1568523_+	COG3104, PTR2, Dipeptide/tripeptide permease [Amino acid transport and metabolism]	NA|210aa|up_7|NZ_CP047191.1_1569225_1569855_-	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated	NA|232aa|up_6|NZ_CP047191.1_1569943_1570639_-	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	csm5gr7|358aa|up_5|NZ_CP047191.1_1571634_1572708_-	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm4gr5|300aa|up_4|NZ_CP047191.1_1572710_1573610_-	pfam17953, Csm4_C, CRISPR Csm4 C-terminal domain	csm3gr7|221aa|up_3|NZ_CP047191.1_1573611_1574274_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|127aa|up_2|NZ_CP047191.1_1574273_1574654_-	pfam03750, Csm2_III-A, Csm2 Type III-A	cas10|757aa|up_1|NZ_CP047191.1_1574657_1576928_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|244aa|up_0|NZ_CP047191.1_1576908_1577640_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas2|110aa|down_0|NZ_CP047191.1_1578128_1578458_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_1|NZ_CP047191.1_1578457_1579462_-	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	NA|316aa|down_2|NZ_CP047191.1_1579600_1580548_-	PRK07259, PRK07259, dihydroorotate dehydrogenase	NA|268aa|down_3|NZ_CP047191.1_1580566_1581370_-	PRK00054, PRK00054, dihydroorotate dehydrogenase electron transfer subunit; Reviewed	NA|72aa|down_4|NZ_CP047191.1_1581338_1581554_+	NA	NA|183aa|down_5|NZ_CP047191.1_1581610_1582159_-	cd03135, GATase1_DJ-1, Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1	NA|70aa|down_6|NZ_CP047191.1_1582200_1582410_-	pfam05857, TraX, TraX protein	NA|223aa|down_7|NZ_CP047191.1_1582425_1583094_-	pfam05857, TraX, TraX protein	NA|73aa|down_8|NZ_CP047191.1_1583295_1583514_-	NA	NA|64aa|down_9|NZ_CP047191.1_1583950_1584142_-	COG4405, COG4405, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_009856565.1_ASM985656v1	NZ_CP047191	Streptococcus thermophilus strain EU01 chromosome, complete genome	3	1824944-1827552	3,3,3	CRISPRCasFinder,CRT,PILER-CR	no	csn2,cas2,cas1,cas9	cas3,DEDDh,csa3,cas1,cas2,csn2,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,cas9	Type II-C,Type II-B,Type II-A	GTTGTACAGTTACTTAAATCTTGAGAGTACAAAAAC,GTTGTACAGTTACTTAAATCTTGAGAGTACAAAAAC,GTTGTACAGTTACTTAAATCTTGAGAGTACAAAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	39,39,14	39	TypeII-C,TypeII-B,TypeII-A	cas3,DEDDh,csa3,cas1,cas2,csn2,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6,DinG,cas9	NA,NA|86aa|down_5|NZ_CP047191.1_1834477_1834735_+	NA|99aa|up_9|NZ_CP047191.1_1816794_1817091_+	pfam11674, DUF3270, Protein of unknown function (DUF3270)	NA|147aa|up_8|NZ_CP047191.1_1817111_1817552_-	pfam12732, YtxH, YtxH-like protein	NA|133aa|up_7|NZ_CP047191.1_1817564_1817963_-	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|264aa|up_6|NZ_CP047191.1_1818085_1818877_-	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|310aa|up_5|NZ_CP047191.1_1818876_1819806_-	PRK05428, PRK05428, HPr kinase/phosphorylase; Provisional	NA|88aa|up_4|NZ_CP047191.1_1819935_1820199_-	COG1983, PspC, Putative stress-responsive transcriptional regulator [Transcription / Signal transduction mechanisms]	NA|147aa|up_3|NZ_CP047191.1_1820264_1820705_-	PRK04351, PRK04351, SprT family protein	NA|711aa|up_2|NZ_CP047191.1_1820691_1822824_-	COG2183, Tex, Transcriptional accessory protein [Transcription]	NA|300aa|up_1|NZ_CP047191.1_1823187_1824087_+	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|272aa|up_0|NZ_CP047191.1_1824086_1824902_+	COG3689, COG3689, Predicted membrane protein [Function unknown]	csn2|351aa|down_0|NZ_CP047191.1_1827615_1828668_-	pfam16813, Cas_St_Csn2, CRISPR-associated protein Csn2 subfamily St	cas2|108aa|down_1|NZ_CP047191.1_1828664_1828988_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|304aa|down_2|NZ_CP047191.1_1828989_1829901_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1121aa|down_3|NZ_CP047191.1_1830078_1833441_-	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	NA|204aa|down_4|NZ_CP047191.1_1833696_1834308_-	pfam09911, DUF2140, Uncharacterized protein conserved in bacteria (DUF2140)	NA|86aa|down_5|NZ_CP047191.1_1834477_1834735_+	NA	NA|153aa|down_6|NZ_CP047191.1_1834901_1835360_-	COG3392, COG3392, Adenine-specific DNA methylase [DNA replication, recombination, and repair]	NA|74aa|down_7|NZ_CP047191.1_1835359_1835581_-	COG3655, COG3655, Predicted transcriptional regulator [Transcription]	NA|454aa|down_8|NZ_CP047191.1_1836787_1838149_-	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|183aa|down_9|NZ_CP047191.1_1838364_1838913_+	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]
