assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000020765.1_ASM2076v1	NC_011134	Streptococcus equi subsp. zooepidemicus MGCS10565, complete sequence	1	581380-582010	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,DinG,csm6,csn2,cas9	 Type I-U?,Type I-C,Type I-U	GTCTCGCCCCATACGGGCGAGTGGATTGAAAT,GTCTCGCCCCATACGGGCGAGTGGATTGAAAT,GTCTCGCCCCATACGGGCGAGTGGATTGAAAT	32,32,32	0	0	NA	NA	I-C:I-C:I-C	8,9,9	9	TypeI-U?,TypeI-C,TypeI-U	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,DinG,csm6,csn2,cas9	NA|110aa|up_9|NC_011134.1_570967_571297_+,NA|173aa|up_7|NC_011134.1_572102_572621_+,NA	NA|110aa|up_9|NC_011134.1_570967_571297_+	NA	NA|124aa|up_8|NC_011134.1_571655_572027_+	pfam14021, TNT, Tuberculosis necrotizing toxin	NA|173aa|up_7|NC_011134.1_572102_572621_+	NA	cas3|818aa|up_6|NC_011134.1_573086_575540_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|240aa|up_5|NC_011134.1_575780_576500_+	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	cas8c|628aa|up_4|NC_011134.1_576499_578383_+	cd09642, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas7|283aa|up_3|NC_011134.1_578383_579232_+	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas4|224aa|up_2|NC_011134.1_579233_579905_+	COG1468, COG1468, CRISPR-associated protein Cas4 (RecB family exonuclease) [Defense    mechanisms]	cas1|342aa|up_1|NC_011134.1_579901_580927_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|98aa|up_0|NC_011134.1_580937_581231_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|237aa|down_0|NC_011134.1_584154_584865_+	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|217aa|down_1|NC_011134.1_584878_585529_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|371aa|down_2|NC_011134.1_585546_586659_+	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|247aa|down_3|NC_011134.1_586927_587668_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|573aa|down_4|NC_011134.1_587664_589383_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|363aa|down_5|NC_011134.1_589422_590511_+	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|235aa|down_6|NC_011134.1_590525_591230_+	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|163aa|down_7|NC_011134.1_591488_591977_+	cd04335, PrdX_deacylase, This CD includes bacterial (Agrobacterium tumefaciens and Caulobacter crescentus ProX, and Clostridium sticklandii PrdX) and eukaryotic (Plasmodium falciparum N-terminal ProRS editing domain) sequences	NA|158aa|down_8|NC_011134.1_592487_592961_-	COG1438, ArgR, Arginine repressor [Transcription]	NA|227aa|down_9|NC_011134.1_593101_593782_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]
GCF_000020765.1_ASM2076v1	NC_011134	Streptococcus equi subsp. zooepidemicus MGCS10565, complete sequence	2	858453-858562	2	CRISPRCasFinder	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,DinG,csm6,csn2,cas9	Orphan	CTGACTTGATGACAGCTTATACTAAAA	27	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,DinG,csm6,csn2,cas9	NA|183aa|up_3|NC_011134.1_852206_852755_-,NA|96aa|up_2|NC_011134.1_852747_853035_-,NA|152aa|up_1|NC_011134.1_853034_853490_-,NA|714aa|down_5|NC_011134.1_862726_864868_+	NA|198aa|up_9|NC_011134.1_844294_844888_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|655aa|up_8|NC_011134.1_844956_846921_+	pfam05738, Cna_B, Cna protein B-type domain	NA|479aa|up_7|NC_011134.1_846935_848372_+	pfam16569, GramPos_pilinBB, Gram-positive pilin backbone subunit 2, Cna-B-like domain	NA|270aa|up_6|NC_011134.1_848452_849262_+	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|393aa|up_5|NC_011134.1_849576_850755_+	pfam09028, Mac-1, Mac 1	NA|401aa|up_4|NC_011134.1_850834_852037_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|183aa|up_3|NC_011134.1_852206_852755_-	NA	NA|96aa|up_2|NC_011134.1_852747_853035_-	NA	NA|152aa|up_1|NC_011134.1_853034_853490_-	NA	NA|1239aa|up_0|NC_011134.1_854597_858314_+	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|180aa|down_0|NC_011134.1_858602_859142_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|212aa|down_1|NC_011134.1_859131_859767_+	pfam13490, zf-HC2, Putative zinc-finger	NA|232aa|down_2|NC_011134.1_859894_860590_+	cd07750, PolyPPase_VTC_like, Polyphosphate(polyP) polymerase domain of yeast vacuolar transport chaperone (VTC) proteins VTC-2, -3 and- 4, and similar proteins	NA|226aa|down_3|NC_011134.1_860603_861281_+	pfam16316, DUF4956, Domain of unknown function (DUF4956)	NA|478aa|down_4|NC_011134.1_861286_862720_+	pfam08757, CotH, CotH kinase protein	NA|714aa|down_5|NC_011134.1_862726_864868_+	NA	NA|467aa|down_6|NC_011134.1_865008_866409_+	cd06423, CESA_like, CESA_like is  the cellulose synthase superfamily	NA|1074aa|down_7|NC_011134.1_866619_869841_+	TIGR02774, putative_ATP-dependent_exonuclease_subunit_B, ATP-dependent nuclease subunit B	NA|1215aa|down_8|NC_011134.1_869830_873475_+	TIGR02785, ATP-dependent_helicase/nuclease_subunit_A, helicase-exonuclease AddAB, AddA subunit, Firmicutes type	NA|372aa|down_9|NC_011134.1_873740_874856_+	PRK07324, PRK07324, transaminase; Validated
GCF_000020765.1_ASM2076v1	NC_011134	Streptococcus equi subsp. zooepidemicus MGCS10565, complete sequence	3	879872-879943	3	CRISPRCasFinder	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,DinG,csm6,csn2,cas9	Orphan	CCTTTTTAACGGCGAGCTAAAAA	23	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,DinG,csm6,csn2,cas9	NA|714aa|up_9|NC_011134.1_862726_864868_+,NA	NA|714aa|up_9|NC_011134.1_862726_864868_+	NA	NA|467aa|up_8|NC_011134.1_865008_866409_+	cd06423, CESA_like, CESA_like is  the cellulose synthase superfamily	NA|1074aa|up_7|NC_011134.1_866619_869841_+	TIGR02774, putative_ATP-dependent_exonuclease_subunit_B, ATP-dependent nuclease subunit B	NA|1215aa|up_6|NC_011134.1_869830_873475_+	TIGR02785, ATP-dependent_helicase/nuclease_subunit_A, helicase-exonuclease AddAB, AddA subunit, Firmicutes type	NA|372aa|up_5|NC_011134.1_873740_874856_+	PRK07324, PRK07324, transaminase; Validated	NA|271aa|up_4|NC_011134.1_874923_875736_+	cd13620, PBP2_GltS, Substrate binding domain of glutamate or arginine ABC transporter, a member of the type 2 periplasmic binding fold protein superfamily	NA|59aa|up_3|NC_011134.1_875902_876079_+	PRK00270, rpsU, 30S ribosomal protein S21; Reviewed	NA|125aa|up_2|NC_011134.1_876306_876681_-	pfam01741, MscL, Large-conductance mechanosensitive channel, MscL	NA|604aa|up_1|NC_011134.1_876927_878739_+	PRK05667, dnaG, DNA primase; Validated	NA|370aa|up_0|NC_011134.1_878747_879857_+	PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated	NA|113aa|down_0|NC_011134.1_880208_880547_+	COG2151, PaaD, Predicted metal-sulfur cluster biosynthetic enzyme [General function prediction only]	NA|283aa|down_1|NC_011134.1_880687_881536_+	pfam04321, RmlD_sub_bind, RmlD substrate binding domain	NA|385aa|down_2|NC_011134.1_881660_882815_+	cd04955, GT4-like, glycosyltransferase family 4 proteins	NA|312aa|down_3|NC_011134.1_882804_883740_+	cd04196, GT_2_like_d, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|268aa|down_4|NC_011134.1_883739_884543_+	COG1682, TagG, ABC-type polysaccharide/polyol phosphate export systems, permease component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|403aa|down_5|NC_011134.1_884542_885751_+	COG1134, TagH, ABC-type polysaccharide/polyol phosphate transport system, ATPase component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|336aa|down_6|NC_011134.1_885771_886779_+	cd04196, GT_2_like_d, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|582aa|down_7|NC_011134.1_886775_888521_+	COG3754, RgpF, Lipopolysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]	NA|230aa|down_8|NC_011134.1_888550_889240_+	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|119aa|down_9|NC_011134.1_889244_889601_+	pfam10066, DUF2304, Uncharacterized conserved protein (DUF2304)
GCF_000020765.1_ASM2076v1	NC_011134	Streptococcus equi subsp. zooepidemicus MGCS10565, complete sequence	4	1366029-1367186	4,2,2	CRISPRCasFinder,CRT,PILER-CR	no	csn2,cas2,cas1,cas9	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,DinG,csm6,csn2,cas9	Type II-C,Type II-B,Type II-A	GTTTTGGAACCATTCAATACAGCATAACTCTAAAAC,GTTTTGGAACCATTCAATACAGCATAACTCTAAAAC,GTTTTGGAACCATTCAATACAGCATAACTCTAAAAC	36,36,36	1	1	1366065-1366094	NC_011134.1_1865617-1865646	II-A,II-B:II-A,II-B:II-A,II-B	17,17,10	17	TypeII-C,TypeII-B,TypeII-A	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,DinG,csm6,csn2,cas9	NA,NA|141aa|down_7|NC_011134.1_1376965_1377388_-	NA|434aa|up_9|NC_011134.1_1351868_1353170_+	COG1078, COG1078, HD superfamily phosphohydrolases [General function prediction only]	NA|270aa|up_8|NC_011134.1_1353381_1354191_+	PRK10513, PRK10513, sugar phosphate phosphatase; Provisional	NA|405aa|up_7|NC_011134.1_1354190_1355405_+	COG2348, COG2348, Peptidoglycan interpeptide bridge formation enzyme [Cell wall/membrane/envelope biogenesis]	NA|412aa|up_6|NC_011134.1_1355404_1356640_+	pfam02388, FemAB, FemAB family	NA|253aa|up_5|NC_011134.1_1356952_1357711_-	PRK00042, tpiA, triosephosphate isomerase; Provisional	NA|399aa|up_4|NC_011134.1_1357976_1359173_-	PRK00049, PRK00049, elongation factor Tu; Reviewed	NA|1049aa|up_3|NC_011134.1_1360121_1363268_+	pfam03272, Mucin_bdg, Putative mucin or carbohydrate-binding module	NA|363aa|up_2|NC_011134.1_1363557_1364646_-	pfam13800, Sigma_reg_N, Sigma factor regulator N-terminal	NA|157aa|up_1|NC_011134.1_1364629_1365100_-	TIGR02950, RNA_polymerase_ECF-type_sigma_factor, RNA polymerase sigma factor, SigM family	NA|70aa|up_0|NC_011134.1_1365456_1365666_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	csn2|226aa|down_0|NC_011134.1_1367461_1368139_-	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	cas2|115aa|down_1|NC_011134.1_1368128_1368473_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|290aa|down_2|NC_011134.1_1368469_1369339_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas9|1349aa|down_3|NC_011134.1_1369338_1373385_-	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	NA|108aa|down_4|NC_011134.1_1373898_1374222_-	pfam15597, Imm59, Immunity protein 59	NA|131aa|down_5|NC_011134.1_1374222_1374615_-	pfam14021, TNT, Tuberculosis necrotizing toxin	NA|249aa|down_6|NC_011134.1_1374914_1375661_-	pfam08929, DUF1911, Domain of unknown function (DUF1911)	NA|141aa|down_7|NC_011134.1_1376965_1377388_-	NA	NA|135aa|down_8|NC_011134.1_1377577_1377982_-	pfam15599, Imm63, Immunity protein 63	NA|403aa|down_9|NC_011134.1_1377990_1379199_-	pfam14021, TNT, Tuberculosis necrotizing toxin
GCF_000020765.1_ASM2076v1	NC_011134	Streptococcus equi subsp. zooepidemicus MGCS10565, complete sequence	5	1669119-1669283	5	CRISPRCasFinder	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,DinG,csm6,csn2,cas9	Orphan	AATGTCAATCCACGCTCCCATGAAGGGCGAGAC	33	0	0	NA	NA	I-C	2	2	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,DinG,csm6,csn2,cas9	NA,NA|95aa|down_8|NC_011134.1_1678616_1678901_-,NA|138aa|down_9|NC_011134.1_1680256_1680670_-	NA|95aa|up_9|NC_011134.1_1659296_1659581_-	PRK02302, PRK02302, hypothetical protein; Provisional	NA|149aa|up_8|NC_011134.1_1659581_1660028_-	COG3679, COG3679, Regulatory protein involved in competence development and sporulation [Replication, recombination and repair;    Signal transduction mechanisms]	NA|197aa|up_7|NC_011134.1_1660132_1660723_-	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|210aa|up_6|NC_011134.1_1660954_1661584_-	PRK00129, upp, uracil phosphoribosyltransferase; Reviewed	NA|544aa|up_5|NC_011134.1_1661733_1663365_-	cd13124, MATE_SpoVB_like, Stage V sporulation protein B, also known as Stage III sporulation protein F, and related proteins	NA|482aa|up_4|NC_011134.1_1663450_1664896_+	PRK14022, PRK14022, UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase	NA|260aa|up_3|NC_011134.1_1665155_1665935_+	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|313aa|up_2|NC_011134.1_1665959_1666898_+	cd01138, FeuA, Periplasmic binding protein FeuA	NA|342aa|up_1|NC_011134.1_1666908_1667934_+	pfam01032, FecCD, FecCD transport family	NA|333aa|up_0|NC_011134.1_1667930_1668929_+	pfam01032, FecCD, FecCD transport family	NA|218aa|down_0|NC_011134.1_1669665_1670319_-	pfam08820, DUF1803, Domain of unknown function (DUF1803)	NA|312aa|down_1|NC_011134.1_1670385_1671321_-	PRK05427, PRK05427, putative manganese-dependent inorganic pyrophosphatase; Provisional	NA|264aa|down_2|NC_011134.1_1671805_1672597_-	TIGR02493, PFLA, pyruvate formate-lyase 1-activating enzyme	NA|445aa|down_3|NC_011134.1_1672673_1674008_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|186aa|down_4|NC_011134.1_1674135_1674693_-	pfam06962, rRNA_methylase, Putative rRNA methylase	NA|308aa|down_5|NC_011134.1_1675500_1676424_-	COG1242, COG1242, Predicted Fe-S oxidoreductase [General function prediction only]	NA|216aa|down_6|NC_011134.1_1676615_1677263_-	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|193aa|down_7|NC_011134.1_1677252_1677831_-	COG3601, COG3601, Predicted membrane protein [Function unknown]	NA|95aa|down_8|NC_011134.1_1678616_1678901_-	NA	NA|138aa|down_9|NC_011134.1_1680256_1680670_-	NA
