assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_013307265.1_ASM1330726v1	NZ_CP030927	Streptococcus thermophilus strain CS9 chromosome, complete genome	1	154999-155150	1	CRISPRCasFinder	no		cas3,DEDDh,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,DinG,csn2,cas2,cas1,cas9	Orphan	CATATCATGCATATTGTCCATAT	23	1	1	155106-155127	NZ_CP030927.1_155151-155172	NA	3	3	Orphan	cas3,DEDDh,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,DinG,csn2,cas2,cas1,cas9	NA,NA|133aa|down_7|NZ_CP030927.1_169804_170203_+	NA|164aa|up_9|NZ_CP030927.1_144253_144745_+	PRK13258, PRK13258, 7-cyano-7-deazaguanine reductase; Provisional	NA|257aa|up_8|NZ_CP030927.1_145962_146733_+	pfam06210, DUF1003, Protein of unknown function (DUF1003)	NA|297aa|up_7|NZ_CP030927.1_146713_147604_+	PRK05416, PRK05416, RNase adapter RapZ	NA|325aa|up_6|NZ_CP030927.1_147600_148575_+	TIGR01826, Putative_gluconeogenesis_factor, conserved hypothetical protein, cofD-related	NA|304aa|up_5|NZ_CP030927.1_148571_149483_+	COG1481, COG1481, Uncharacterized protein conserved in bacteria [Function unknown]	NA|250aa|up_4|NZ_CP030927.1_149500_150250_+	pfam03577, Peptidase_C69, Peptidase family C69	NA|79aa|up_3|NZ_CP030927.1_150256_150493_+	pfam03577, Peptidase_C69, Peptidase family C69	NA|67aa|up_2|NZ_CP030927.1_150792_150993_+	pfam00313, CSD, 'Cold-shock' DNA-binding domain	NA|67aa|up_1|NZ_CP030927.1_151272_151473_+	COG1278, CspC, Cold shock proteins [Transcription]	NA|392aa|up_0|NZ_CP030927.1_151803_152979_+	pfam00872, Transposase_mut, Transposase, Mutator family	NA|304aa|down_0|NZ_CP030927.1_158483_159395_+	cd01561, CBS_like, CBS_like: This subgroup includes Cystathionine beta-synthase (CBS) and Cysteine synthase	NA|395aa|down_1|NZ_CP030927.1_159416_160601_+	PRK07671, PRK07671, bifunctional cystathionine gamma-lyase/homocysteine desulfhydrase	NA|186aa|down_2|NZ_CP030927.1_160566_161124_+	TIGR01172, Serine_acetyltransferase, serine O-acetyltransferase	NA|211aa|down_3|NZ_CP030927.1_163559_164192_-	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|296aa|down_4|NZ_CP030927.1_164670_165558_+	COG4975, GlcU, Putative glucose uptake permease [Carbohydrate transport and metabolism]	NA|65aa|down_5|NZ_CP030927.1_166173_166368_+	PRK13977, PRK13977, myosin-cross-reactive antigen; Provisional	NA|515aa|down_6|NZ_CP030927.1_168041_169586_+	cd01017, AdcA, Metal binding protein AdcA	NA|133aa|down_7|NZ_CP030927.1_169804_170203_+	NA	NA|336aa|down_8|NZ_CP030927.1_170718_171726_-	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|238aa|down_9|NZ_CP030927.1_171898_172612_+	COG1705, FlgJ, Muramidase (flagellum-specific) [Cell motility and secretion / Intracellular trafficking and secretion]
GCF_013307265.1_ASM1330726v1	NZ_CP030927	Streptococcus thermophilus strain CS9 chromosome, complete genome	2	239071-239324	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7	cas3,DEDDh,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,DinG,csn2,cas2,cas1,cas9	Type III-D,Type III-B,Type III-C,Type III-A	GATATAAACCTAATTACCTCGAGAGGGGACGGAAACG,GATATAAACCTAATTACCTCGAGAGGGGACGGAAAC,GATATAAACCTAATTACCTCGAGAGGGGACGGAAACG	37,36,37	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	2,3,3	3	TypeIII-D,TypeIII-B,TypeIII-C,TypeIII-A	cas3,DEDDh,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,DinG,csn2,cas2,cas1,cas9	NA|124aa|up_7|NZ_CP030927.1_226952_227324_+,NA|72aa|up_2|NZ_CP030927.1_235541_235757_-,NA|77aa|down_9|NZ_CP030927.1_253038_253269_+	NA|299aa|up_9|NZ_CP030927.1_222128_223025_+	COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]	NA|365aa|up_8|NZ_CP030927.1_223136_224231_+	COG0276, HemH, Protoheme ferro-lyase (ferrochelatase) [Coenzyme metabolism]	NA|124aa|up_7|NZ_CP030927.1_226952_227324_+	NA	NA|255aa|up_6|NZ_CP030927.1_227750_228515_+	cd09007, NP-I_spr0068, uncharacterized subfamily of the nucleoside phosphorylase-I family	NA|212aa|up_5|NZ_CP030927.1_228576_229212_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|553aa|up_4|NZ_CP030927.1_232076_233735_-	pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA)	NA|183aa|up_3|NZ_CP030927.1_234936_235485_+	cd03135, GATase1_DJ-1, Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1	NA|72aa|up_2|NZ_CP030927.1_235541_235757_-	NA	NA|268aa|up_1|NZ_CP030927.1_235725_236529_+	PRK00054, PRK00054, dihydroorotate dehydrogenase electron transfer subunit; Reviewed	NA|316aa|up_0|NZ_CP030927.1_236547_237495_+	PRK07259, PRK07259, dihydroorotate dehydrogenase	cas6|244aa|down_0|NZ_CP030927.1_239453_240185_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas10|757aa|down_1|NZ_CP030927.1_240165_242436_+	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csm2gr11|127aa|down_2|NZ_CP030927.1_242439_242820_+	pfam03750, Csm2_III-A, Csm2 Type III-A	csm3gr7|221aa|down_3|NZ_CP030927.1_242819_243482_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|300aa|down_4|NZ_CP030927.1_243483_244383_+	pfam17953, Csm4_C, CRISPR Csm4 C-terminal domain	csm5gr7|358aa|down_5|NZ_CP030927.1_244385_245459_+	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	NA|232aa|down_6|NZ_CP030927.1_246648_247344_+	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	NA|210aa|down_7|NZ_CP030927.1_247432_248062_+	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated	NA|484aa|down_8|NZ_CP030927.1_248756_250208_-	COG3104, PTR2, Dipeptide/tripeptide permease [Amino acid transport and metabolism]	NA|77aa|down_9|NZ_CP030927.1_253038_253269_+	NA
GCF_013307265.1_ASM1330726v1	NZ_CP030927	Streptococcus thermophilus strain CS9 chromosome, complete genome	3	406855-406926	3	CRISPRCasFinder	no		cas3,DEDDh,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,DinG,csn2,cas2,cas1,cas9	Orphan	ACCTAAAAAGGTACACTGGACCT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,DinG,csn2,cas2,cas1,cas9	NA,NA|180aa|down_4|NZ_CP030927.1_410329_410869_+	NA|217aa|up_9|NZ_CP030927.1_397992_398643_+	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|294aa|up_8|NZ_CP030927.1_398714_399596_-	smart00892, Endonuclease_NS, DNA/RNA non-specific endonuclease	NA|63aa|up_7|NZ_CP030927.1_399657_399846_-	pfam11772, EpuA, DNA-directed RNA polymerase subunit beta	NA|424aa|up_6|NZ_CP030927.1_399847_401119_-	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|79aa|up_5|NZ_CP030927.1_401188_401425_-	TIGR02327, conserved_hypothetical_protein, conserved hypothetical integral membrane protein	NA|338aa|up_4|NZ_CP030927.1_401612_402626_-	cd05247, UDP_G4E_1_SDR_e, UDP-glucose 4 epimerase, subgroup 1, extended (e) SDRs	NA|170aa|up_3|NZ_CP030927.1_402585_403095_-	pfam16116, DUF4832, Domain of unknown function (DUF4832)	NA|398aa|up_2|NZ_CP030927.1_404117_405311_-	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|313aa|up_1|NZ_CP030927.1_405591_406530_+	PRK11886, PRK11886, bifunctional biotin--[acetyl-CoA-carboxylase] ligase/biotin operon repressor BirA	NA|62aa|up_0|NZ_CP030927.1_406516_406702_-	pfam11676, DUF3272, Protein of unknown function (DUF3272)	NA|551aa|down_0|NZ_CP030927.1_406928_408581_-	PRK05563, PRK05563, DNA polymerase III subunits gamma and tau; Validated	NA|170aa|down_1|NZ_CP030927.1_408580_409090_-	COG1956, COG1956, GAF domain-containing protein [Signal transduction mechanisms]	NA|300aa|down_2|NZ_CP030927.1_409125_410025_-	PRK00091, miaA, tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed	NA|59aa|down_3|NZ_CP030927.1_410100_410277_+	pfam11240, DUF3042, Protein of unknown function (DUF3042)	NA|180aa|down_4|NZ_CP030927.1_410329_410869_+	NA	NA|116aa|down_5|NZ_CP030927.1_411220_411568_-	PRK05338, rplS, 50S ribosomal protein L19; Provisional	NA|409aa|down_6|NZ_CP030927.1_411695_412922_-	cd03682, ClC_sycA_like, ClC sycA-like chloride channel proteins	NA|91aa|down_7|NZ_CP030927.1_412931_413204_-	PRK07248, PRK07248, chorismate mutase	NA|512aa|down_8|NZ_CP030927.1_413278_414814_-	cd01031, EriC, ClC chloride channel EriC	NA|148aa|down_9|NZ_CP030927.1_414864_415308_-	PRK07308, PRK07308, flavodoxin; Validated
GCF_013307265.1_ASM1330726v1	NZ_CP030927	Streptococcus thermophilus strain CS9 chromosome, complete genome	4	779107-779933	4,2,2,3	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	csn2,cas2,cas1,cas9	cas3,DEDDh,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,DinG,csn2,cas2,cas1,cas9	Type II-B,Type II-A,Type II-C	GTTGTACAGTTACTTAAATCTTGAGAGTACAAAAAC,GTTGTACAGTTACTTAAATCTTGAGAGTACAAAAAC,ATTGGTTGTACAGTTACTTAAATCTTGAGAGTACAAAAAC,GTTGTACAGTTACTTAAATCTTGAGAGTACAAAAAC	36,36,40,36	0	0	NA	NA	NA:NA:NA:NA	12,12,6,6	12	TypeII-B,TypeII-A,TypeII-C	cas3,DEDDh,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,DinG,csn2,cas2,cas1,cas9	NA,NA|86aa|down_5|NZ_CP030927.1_786864_787122_+	NA|99aa|up_9|NZ_CP030927.1_770955_771252_+	pfam11674, DUF3270, Protein of unknown function (DUF3270)	NA|147aa|up_8|NZ_CP030927.1_771272_771713_-	pfam12732, YtxH, YtxH-like protein	NA|133aa|up_7|NZ_CP030927.1_771725_772124_-	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|264aa|up_6|NZ_CP030927.1_772246_773038_-	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|310aa|up_5|NZ_CP030927.1_773037_773967_-	PRK05428, PRK05428, HPr kinase/phosphorylase; Provisional	NA|88aa|up_4|NZ_CP030927.1_774095_774359_-	COG1983, PspC, Putative stress-responsive transcriptional regulator [Transcription / Signal transduction mechanisms]	NA|147aa|up_3|NZ_CP030927.1_774424_774865_-	PRK04351, PRK04351, SprT family protein	NA|711aa|up_2|NZ_CP030927.1_774851_776984_-	COG2183, Tex, Transcriptional accessory protein [Transcription]	NA|301aa|up_1|NZ_CP030927.1_777347_778250_+	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|272aa|up_0|NZ_CP030927.1_778249_779065_+	COG3689, COG3689, Predicted membrane protein [Function unknown]	csn2|351aa|down_0|NZ_CP030927.1_779996_781049_-	pfam16813, Cas_St_Csn2, CRISPR-associated protein Csn2 subfamily St	cas2|108aa|down_1|NZ_CP030927.1_781045_781369_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|304aa|down_2|NZ_CP030927.1_781370_782282_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1123aa|down_3|NZ_CP030927.1_782458_785827_-	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	NA|204aa|down_4|NZ_CP030927.1_786083_786695_-	pfam09911, DUF2140, Uncharacterized protein conserved in bacteria (DUF2140)	NA|86aa|down_5|NZ_CP030927.1_786864_787122_+	NA	NA|153aa|down_6|NZ_CP030927.1_787288_787747_-	COG3392, COG3392, Adenine-specific DNA methylase [DNA replication, recombination, and repair]	NA|74aa|down_7|NZ_CP030927.1_787746_787968_-	COG3655, COG3655, Predicted transcriptional regulator [Transcription]	NA|277aa|down_8|NZ_CP030927.1_788135_788966_-	cd10447, GIY-YIG_unchar_2, GIY-YIG domain of uncharacterized hypothetical protein found in bacteria and archaea	NA|599aa|down_9|NZ_CP030927.1_789167_790964_-	cd00338, Ser_Recombinase, Serine Recombinase family, catalytic domain; a DNA binding domain may be present either N- or C-terminal to the catalytic domain
GCF_013307265.1_ASM1330726v1	NZ_CP030927	Streptococcus thermophilus strain CS9 chromosome, complete genome	5	1394006-1395229	5,3,4	CRISPRCasFinder,CRT,PILER-CR	no	csn2,cas2,cas1,cas9	cas3,DEDDh,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,DinG,csn2,cas2,cas1,cas9	Type II-B,Type II-A,Type II-C	GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC,GTTTTGGAACCATTCGAAACAACACAGCTCTAAAAC	36,36,36	0	0	NA	NA	II-A,II-B:II-A,II-B:II-A,II-B	18,18,9	18	TypeII-B,TypeII-A,TypeII-C	cas3,DEDDh,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,DinG,csn2,cas2,cas1,cas9	NA|66aa|up_9|NZ_CP030927.1_1387721_1387919_-,NA|148aa|up_4|NZ_CP030927.1_1390805_1391249_-,NA|59aa|up_2|NZ_CP030927.1_1392385_1392562_-,NA|80aa|up_0|NZ_CP030927.1_1393682_1393922_-,NA	NA|66aa|up_9|NZ_CP030927.1_1387721_1387919_-	NA	NA|438aa|up_8|NZ_CP030927.1_1388085_1389399_-	PRK12297, obgE, GTPase CgtA; Reviewed	NA|43aa|up_7|NZ_CP030927.1_1389492_1389621_-	pfam13253, DUF4044, Protein of unknown function (DUF4044)	NA|244aa|up_6|NZ_CP030927.1_1389702_1390434_-	COG1187, RsuA, 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases [Translation, ribosomal structure and biogenesis]	NA|78aa|up_5|NZ_CP030927.1_1390434_1390668_-	COG3708, COG3708, Uncharacterized protein conserved in bacteria [Function unknown]	NA|148aa|up_4|NZ_CP030927.1_1390805_1391249_-	NA	NA|128aa|up_3|NZ_CP030927.1_1391279_1391663_-	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis,    transport, and catabolism]	NA|59aa|up_2|NZ_CP030927.1_1392385_1392562_-	NA	NA|297aa|up_1|NZ_CP030927.1_1392754_1393645_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|80aa|up_0|NZ_CP030927.1_1393682_1393922_-	NA	csn2|220aa|down_0|NZ_CP030927.1_1395550_1396210_-	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	cas2|115aa|down_1|NZ_CP030927.1_1396199_1396544_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|290aa|down_2|NZ_CP030927.1_1396540_1397410_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1389aa|down_3|NZ_CP030927.1_1397409_1401576_-	TIGR01865, conserved_hypothetical_protein, CRISPR subtype II/NMENI RNA-guided endonuclease Cas9/Csn1	NA|216aa|down_4|NZ_CP030927.1_1401909_1402557_-	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|575aa|down_5|NZ_CP030927.1_1402668_1404393_-	PRK04778, PRK04778, septation ring formation regulator EzrA; Provisional	NA|651aa|down_6|NZ_CP030927.1_1404489_1406442_-	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|187aa|down_7|NZ_CP030927.1_1406442_1407003_-	cd07523, HAD_YsbA-like, uncharacterized family of the haloacid dehalogenase-like superfamily, similar to the uncharacterized Lactococcus lactis YsbA	NA|232aa|down_8|NZ_CP030927.1_1407088_1407784_-	pfam04172, LrgB, LrgB-like family	NA|125aa|down_9|NZ_CP030927.1_1407776_1408151_-	pfam03788, LrgA, LrgA family
