assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009676685.2_ASM967668v2	NZ_CP046042	Streptococcus equi subsp. zooepidemicus strain TN-714097 chromosome, complete genome	1	500685-501236	1	CRT	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	Orphan	CTNAGGTTNNGGNTTAGGCTCAGG	24	2	8	500847-500870|500847-500870|500847-500870|500847-500870|500847-500870|500847-500870|501075-501092|501075-501092	NZ_CP046042.2_1544997-1544974|NZ_CP046042.2_924421-924398|NZ_CP046042.2_924445-924422|NZ_CP046042.2_924523-924500|NZ_CP046042.2_924613-924590|NZ_CP046042.2_924691-924668|NZ_CP046042.2_485982-485999|NZ_CP046042.2_1544985-1544968	NA	9	9	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA|229aa|up_3|NZ_CP046042.2_496491_497178_+,NA|362aa|down_1|NZ_CP046042.2_504527_505613_-,NA|560aa|down_8|NZ_CP046042.2_515135_516815_+	NA|86aa|up_9|NZ_CP046042.2_491288_491546_+	PRK02539, PRK02539, DUF896 family protein	NA|55aa|up_8|NZ_CP046042.2_492074_492239_+	PLN02866, PLN02866, phospholipase D	NA|317aa|up_7|NZ_CP046042.2_492466_493417_+	TIGR03605, antibiot_sagB, SagB-type dehydrogenase domain	NA|355aa|up_6|NZ_CP046042.2_493413_494478_+	TIGR03603, cyclo_dehy_ocin, thiazole/oxazole-forming peptide maturase, SagC family component	NA|453aa|up_5|NZ_CP046042.2_494490_495849_+	TIGR03604, hypothetical_protein, thiazole/oxazole-forming peptide maturase, SagD family component	NA|224aa|up_4|NZ_CP046042.2_495823_496495_+	pfam02517, Abi, CAAX protease self-immunity	NA|229aa|up_3|NZ_CP046042.2_496491_497178_+	NA	NA|308aa|up_2|NZ_CP046042.2_497200_498124_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|376aa|up_1|NZ_CP046042.2_498132_499260_+	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|373aa|up_0|NZ_CP046042.2_499256_500375_+	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|619aa|down_0|NZ_CP046042.2_502256_504113_-	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|362aa|down_1|NZ_CP046042.2_504527_505613_-	NA	NA|210aa|down_2|NZ_CP046042.2_506086_506716_-	pfam03932, CutC, CutC family	NA|185aa|down_3|NZ_CP046042.2_506926_507481_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|171aa|down_4|NZ_CP046042.2_507937_508450_+	COG0350, Ada, Methylated DNA-protein cysteine methyltransferase [DNA replication, recombination, and repair]	NA|118aa|down_5|NZ_CP046042.2_508453_508807_+	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|276aa|down_6|NZ_CP046042.2_508882_509710_-	cd09087, Ape1-like_AP-endo, Human Ape1-like subfamily of the ExoIII family apurinic/apyrimidinic (AP) endonucleases	NA|1633aa|down_7|NZ_CP046042.2_510062_514961_+	cd07475, Peptidases_S8_C5a_Peptidase, Peptidase S8 family domain in Streptococcal C5a peptidases	NA|560aa|down_8|NZ_CP046042.2_515135_516815_+	NA	NA|417aa|down_9|NZ_CP046042.2_517349_518600_+	PRK10819, PRK10819, transport protein TonB; Provisional
GCF_009676685.2_ASM967668v2	NZ_CP046042	Streptococcus equi subsp. zooepidemicus strain TN-714097 chromosome, complete genome	2	563039-564324	1,1,2,2	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	Type I-C,Type I-U, Type I-U?	GTCTCGCCTTTCATGGGCGAGTGGATTGAAATCT,GTCTCGCCTTTCATGGGCGAGTGGATTGAAAT,GTCTCGCCTTTCATGGGCGAGTGGATTGAAAT,GTCTCGCCTTTCATGGGCGAGTGGATTGAAAT	34,32,32,32	0	0	NA	NA	I-C:I-C:I-C:I-C	14,19,19,14	19	TypeI-C,TypeI-U,TypeI-U?	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA|139aa|up_9|NZ_CP046042.2_552254_552671_+,NA	NA|139aa|up_9|NZ_CP046042.2_552254_552671_+	NA	NA|140aa|up_8|NZ_CP046042.2_552900_553320_+	pfam14021, TNT, Tuberculosis necrotizing toxin	NA|107aa|up_7|NZ_CP046042.2_553319_553640_+	pfam15597, Imm59, Immunity protein 59	cas3|818aa|up_6|NZ_CP046042.2_554230_556684_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|240aa|up_5|NZ_CP046042.2_557440_558160_+	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	cas8c|628aa|up_4|NZ_CP046042.2_558159_560043_+	cd09642, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas7|283aa|up_3|NZ_CP046042.2_560043_560892_+	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas4|224aa|up_2|NZ_CP046042.2_560893_561565_+	COG1468, COG1468, CRISPR-associated protein Cas4 (RecB family exonuclease) [Defense    mechanisms]	cas1|342aa|up_1|NZ_CP046042.2_561561_562587_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|98aa|up_0|NZ_CP046042.2_562597_562891_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|237aa|down_0|NZ_CP046042.2_565762_566473_+	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|217aa|down_1|NZ_CP046042.2_566486_567137_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|371aa|down_2|NZ_CP046042.2_567154_568267_+	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|247aa|down_3|NZ_CP046042.2_568534_569275_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|573aa|down_4|NZ_CP046042.2_569271_570990_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|363aa|down_5|NZ_CP046042.2_571029_572118_+	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|235aa|down_6|NZ_CP046042.2_572132_572837_+	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|163aa|down_7|NZ_CP046042.2_573073_573562_+	cd04335, PrdX_deacylase, This CD includes bacterial (Agrobacterium tumefaciens and Caulobacter crescentus ProX, and Clostridium sticklandii PrdX) and eukaryotic (Plasmodium falciparum N-terminal ProRS editing domain) sequences	NA|158aa|down_8|NZ_CP046042.2_574205_574679_-	COG1438, ArgR, Arginine repressor [Transcription]	NA|227aa|down_9|NZ_CP046042.2_574819_575500_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]
GCF_009676685.2_ASM967668v2	NZ_CP046042	Streptococcus equi subsp. zooepidemicus strain TN-714097 chromosome, complete genome	3	573919-574074	2	CRISPRCasFinder	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	Type I-C,Type I-U, Type I-U?	CTCCTTTTGCAGGAGTGTGGATTG	24	0	0	NA	NA	NA	2	2	TypeI-C,TypeI-U,TypeI-U?	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA,NA	cas1|342aa|up_9|NZ_CP046042.2_561561_562587_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|98aa|up_8|NZ_CP046042.2_562597_562891_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|237aa|up_7|NZ_CP046042.2_565762_566473_+	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|217aa|up_6|NZ_CP046042.2_566486_567137_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|371aa|up_5|NZ_CP046042.2_567154_568267_+	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|247aa|up_4|NZ_CP046042.2_568534_569275_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|573aa|up_3|NZ_CP046042.2_569271_570990_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|363aa|up_2|NZ_CP046042.2_571029_572118_+	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|235aa|up_1|NZ_CP046042.2_572132_572837_+	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|163aa|up_0|NZ_CP046042.2_573073_573562_+	cd04335, PrdX_deacylase, This CD includes bacterial (Agrobacterium tumefaciens and Caulobacter crescentus ProX, and Clostridium sticklandii PrdX) and eukaryotic (Plasmodium falciparum N-terminal ProRS editing domain) sequences	NA|158aa|down_0|NZ_CP046042.2_574205_574679_-	COG1438, ArgR, Arginine repressor [Transcription]	NA|227aa|down_1|NZ_CP046042.2_574819_575500_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|412aa|down_2|NZ_CP046042.2_575768_577004_+	PRK01388, PRK01388, arginine deiminase; Provisional	NA|144aa|down_3|NZ_CP046042.2_577013_577445_+	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|338aa|down_4|NZ_CP046042.2_577459_578473_+	PRK02102, PRK02102, ornithine carbamoyltransferase; Validated	NA|498aa|down_5|NZ_CP046042.2_578634_580128_+	COG1288, COG1288, Predicted membrane protein [Function unknown]	NA|444aa|down_6|NZ_CP046042.2_580144_581476_+	PRK07205, PRK07205, hypothetical protein; Provisional	NA|317aa|down_7|NZ_CP046042.2_581493_582444_+	PRK12353, PRK12353, putative amino acid kinase; Reviewed	NA|331aa|down_8|NZ_CP046042.2_582662_583655_+	COG2502, AsnA, Asparagine synthetase A [Amino acid transport and metabolism]	NA|180aa|down_9|NZ_CP046042.2_583824_584364_+	pfam03602, Cons_hypoth95, Conserved hypothetical protein 95
GCF_009676685.2_ASM967668v2	NZ_CP046042	Streptococcus equi subsp. zooepidemicus strain TN-714097 chromosome, complete genome	4	612926-613003	3	CRISPRCasFinder	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	Orphan	AGGCCCAGCAGGCCCTTGTTCGCC	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA,NA	NA|224aa|up_9|NZ_CP046042.2_601210_601882_+	COG0325, COG0325, Predicted enzyme with a TIM-barrel fold [General function prediction only]	NA|221aa|up_8|NZ_CP046042.2_601894_602557_+	COG1799, COG1799, Uncharacterized protein conserved in bacteria [Function unknown]	NA|86aa|up_7|NZ_CP046042.2_602561_602819_+	COG0762, COG0762, Predicted integral membrane protein [Function unknown]	NA|265aa|up_6|NZ_CP046042.2_602815_603610_+	COG2302, COG2302, Uncharacterized conserved protein, contains S4-like domain [Function unknown]	NA|252aa|up_5|NZ_CP046042.2_603619_604375_+	pfam05103, DivIVA, DivIVA protein	NA|933aa|up_4|NZ_CP046042.2_604645_607444_+	PRK13804, ileS, isoleucyl-tRNA synthetase; Provisional	NA|101aa|up_3|NZ_CP046042.2_607912_608215_-	pfam08860, DUF1827, Domain of unknown function (DUF1827)	NA|149aa|up_2|NZ_CP046042.2_608276_608723_-	cd04684, Nudix_Hydrolase_25, Contains a crystal structure of the Nudix hydrolase from Enterococcus faecalis, which has an unknown function	NA|752aa|up_1|NZ_CP046042.2_608870_611126_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|77aa|up_0|NZ_CP046042.2_611407_611638_+	COG4703, COG4703, Uncharacterized protein conserved in bacteria [Function unknown]	NA|228aa|down_0|NZ_CP046042.2_613595_614279_+	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|246aa|down_1|NZ_CP046042.2_614280_615018_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|565aa|down_2|NZ_CP046042.2_615464_617159_+	cd05799, PGM2, This CD includes PGM2 (phosphoglucomutase 2) and PGM2L1 (phosphoglucomutase 2-like 1)	NA|285aa|down_3|NZ_CP046042.2_617270_618125_+	PRK14179, PRK14179, bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase	NA|285aa|down_4|NZ_CP046042.2_618121_618976_+	cd01171, YXKO-related, B	NA|447aa|down_5|NZ_CP046042.2_619259_620600_+	PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed	NA|72aa|down_6|NZ_CP046042.2_620577_620793_+	PRK00977, PRK00977, exodeoxyribonuclease VII small subunit; Provisional	NA|290aa|down_7|NZ_CP046042.2_620792_621662_+	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]	NA|276aa|down_8|NZ_CP046042.2_621654_622482_+	COG1189, COG1189, Predicted rRNA methylase [Translation, ribosomal structure and biogenesis]	NA|157aa|down_9|NZ_CP046042.2_622468_622939_+	COG1438, ArgR, Arginine repressor [Transcription]
GCF_009676685.2_ASM967668v2	NZ_CP046042	Streptococcus equi subsp. zooepidemicus strain TN-714097 chromosome, complete genome	5	931312-931421	4	CRISPRCasFinder	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	Orphan	CTGACTTGATGACAGCTTATACTAAAA	27	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,RT,DinG,csm6	NA|183aa|up_3|NZ_CP046042.2_925067_925616_-,NA|96aa|up_2|NZ_CP046042.2_925608_925896_-,NA|152aa|up_1|NZ_CP046042.2_925895_926351_-,NA|714aa|down_5|NZ_CP046042.2_935585_937727_+	NA|192aa|up_9|NZ_CP046042.2_917158_917734_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|657aa|up_8|NZ_CP046042.2_917820_919791_+	pfam05738, Cna_B, Cna protein B-type domain	NA|481aa|up_7|NZ_CP046042.2_919805_921248_+	pfam16569, GramPos_pilinBB, Gram-positive pilin backbone subunit 2, Cna-B-like domain	NA|270aa|up_6|NZ_CP046042.2_921328_922138_+	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|393aa|up_5|NZ_CP046042.2_922452_923631_+	pfam09028, Mac-1, Mac 1	NA|393aa|up_4|NZ_CP046042.2_923710_924889_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|183aa|up_3|NZ_CP046042.2_925067_925616_-	NA	NA|96aa|up_2|NZ_CP046042.2_925608_925896_-	NA	NA|152aa|up_1|NZ_CP046042.2_925895_926351_-	NA	NA|1239aa|up_0|NZ_CP046042.2_927456_931173_+	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|180aa|down_0|NZ_CP046042.2_931461_932001_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|212aa|down_1|NZ_CP046042.2_931990_932626_+	pfam13490, zf-HC2, Putative zinc-finger	NA|255aa|down_2|NZ_CP046042.2_932684_933449_+	cd07750, PolyPPase_VTC_like, Polyphosphate(polyP) polymerase domain of yeast vacuolar transport chaperone (VTC) proteins VTC-2, -3 and- 4, and similar proteins	NA|226aa|down_3|NZ_CP046042.2_933462_934140_+	pfam16316, DUF4956, Domain of unknown function (DUF4956)	NA|478aa|down_4|NZ_CP046042.2_934145_935579_+	pfam08757, CotH, CotH kinase protein	NA|714aa|down_5|NZ_CP046042.2_935585_937727_+	NA	NA|465aa|down_6|NZ_CP046042.2_937874_939269_+	cd06423, CESA_like, CESA_like is  the cellulose synthase superfamily	NA|1075aa|down_7|NZ_CP046042.2_939479_942704_+	TIGR02774, putative_ATP-dependent_exonuclease_subunit_B, ATP-dependent nuclease subunit B	NA|1214aa|down_8|NZ_CP046042.2_942690_946332_+	TIGR02785, ATP-dependent_helicase/nuclease_subunit_A, helicase-exonuclease AddAB, AddA subunit, Firmicutes type	NA|375aa|down_9|NZ_CP046042.2_946594_947719_+	PRK07324, PRK07324, transaminase; Validated
