assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_008693725.1_ASM869372v1	NZ_CP044102	Streptococcus dysgalactiae strain FDAARGOS_654 chromosome, complete genome	1	826865-827011	1	CRISPRCasFinder	no		DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,cas9,csn2,csm6	Orphan	AGCTTTCTCTGCTTCTAATTTCTCACGGATATGAGTTTGAACTTCAGCAAG	51	0	0	NA	NA	NA	1	1	Orphan	DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,cas9,csn2,csm6	NA|97aa|up_7|NZ_CP044102.1_812691_812982_+,NA|184aa|down_3|NZ_CP044102.1_831437_831989_+	NA|120aa|up_9|NZ_CP044102.1_811719_812079_-	PRK06531, yajC, preprotein translocase subunit YajC; Validated	NA|117aa|up_8|NZ_CP044102.1_812185_812536_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|97aa|up_7|NZ_CP044102.1_812691_812982_+	NA	NA|1208aa|up_6|NZ_CP044102.1_813134_816758_-	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|538aa|up_5|NZ_CP044102.1_816923_818537_-	cd11333, AmyAc_SI_OligoGlu_DGase, Alpha amylase catalytic domain found in Sucrose isomerases, oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), dextran glucosidase (also called glucan 1,6-alpha-glucosidase), and related proteins	NA|378aa|up_4|NZ_CP044102.1_818618_819752_-	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|283aa|up_3|NZ_CP044102.1_820050_820899_-	COG2508, COG2508, Regulator of polyketide synthase expression [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|441aa|up_2|NZ_CP044102.1_821227_822550_+	pfam02821, Staphylokinase, Staphylokinase/Streptokinase family	NA|148aa|up_1|NZ_CP044102.1_822647_823091_-	PRK05273, PRK05273, D-tyrosyl-tRNA(Tyr) deacylase; Provisional	NA|740aa|up_0|NZ_CP044102.1_823104_825324_-	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|514aa|down_0|NZ_CP044102.1_827677_829219_-	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|156aa|down_1|NZ_CP044102.1_829705_830173_+	PRK02551, PRK02551, flavoprotein NrdI; Provisional	NA|327aa|down_2|NZ_CP044102.1_830453_831434_+	cd02653, nuc_hydro_3, NH_3: A subgroup of nucleoside hydrolases	NA|184aa|down_3|NZ_CP044102.1_831437_831989_+	NA	NA|346aa|down_4|NZ_CP044102.1_832059_833097_+	cd05657, M42_glucanase_like, M42 Peptidase, endoglucanase-like subfamily	NA|171aa|down_5|NZ_CP044102.1_833120_833633_+	PRK06762, PRK06762, hypothetical protein; Provisional	NA|275aa|down_6|NZ_CP044102.1_833641_834466_-	cd09079, RgfB-like, Streptococcus agalactiae RgfB, part of a putative two component signal transduction system, and related proteins	NA|729aa|down_7|NZ_CP044102.1_834544_836731_-	TIGR02003, PTS_system_glucose-specific_IIBC_component, PTS system, IIBC component	NA|336aa|down_8|NZ_CP044102.1_837000_838008_-	cd06294, PBP1_MalR-like, ligand-binding domain of maltose transcription regulator MalR which is a member of the LacI-GalR family repressors	NA|253aa|down_9|NZ_CP044102.1_838173_838932_-	COG1385, COG1385, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_008693725.1_ASM869372v1	NZ_CP044102	Streptococcus dysgalactiae strain FDAARGOS_654 chromosome, complete genome	2	1085998-1086094	2	CRISPRCasFinder	no		DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,cas9,csn2,csm6	Orphan	ATAAGACGAAAAAAAATTGGATTTTT	26	0	0	NA	NA	NA	1	1	Orphan	DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,cas9,csn2,csm6	NA|175aa|up_1|NZ_CP044102.1_1084662_1085187_-,NA|81aa|down_2|NZ_CP044102.1_1086939_1087182_+,NA|111aa|down_3|NZ_CP044102.1_1087405_1087738_+,NA|66aa|down_4|NZ_CP044102.1_1087855_1088053_+,NA|123aa|down_5|NZ_CP044102.1_1087961_1088330_+,NA|174aa|down_6|NZ_CP044102.1_1088339_1088861_+,NA|538aa|down_7|NZ_CP044102.1_1088850_1090464_+,NA|58aa|down_8|NZ_CP044102.1_1090750_1090924_+,NA|82aa|down_9|NZ_CP044102.1_1091426_1091672_+	NA|633aa|up_9|NZ_CP044102.1_1075579_1077478_+	PRK05192, PRK05192, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis enzyme MnmG	NA|659aa|up_8|NZ_CP044102.1_1077568_1079545_+	COG3887, COG3887, Predicted signaling protein consisting of a modified GGDEF domain and a DHH domain [Signal transduction mechanisms]	NA|151aa|up_7|NZ_CP044102.1_1079541_1079994_+	PRK00137, rplI, 50S ribosomal protein L9; Reviewed	NA|454aa|up_6|NZ_CP044102.1_1080029_1081391_+	PRK05748, PRK05748, replicative DNA helicase; Provisional	NA|91aa|up_5|NZ_CP044102.1_1081406_1081679_+	COG4466, Veg, Uncharacterized protein conserved in bacteria [Function unknown]	NA|156aa|up_4|NZ_CP044102.1_1081830_1082298_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|204aa|up_3|NZ_CP044102.1_1082486_1083098_+	PRK05327, rpsD, 30S ribosomal protein S4; Validated	NA|388aa|up_2|NZ_CP044102.1_1083254_1084418_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|175aa|up_1|NZ_CP044102.1_1084662_1085187_-	NA	NA|148aa|up_0|NZ_CP044102.1_1085497_1085941_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|63aa|down_0|NZ_CP044102.1_1086109_1086298_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|188aa|down_1|NZ_CP044102.1_1086313_1086877_+	COG3617, COG3617, Prophage antirepressor [Transcription]	NA|81aa|down_2|NZ_CP044102.1_1086939_1087182_+	NA	NA|111aa|down_3|NZ_CP044102.1_1087405_1087738_+	NA	NA|66aa|down_4|NZ_CP044102.1_1087855_1088053_+	NA	NA|123aa|down_5|NZ_CP044102.1_1087961_1088330_+	NA	NA|174aa|down_6|NZ_CP044102.1_1088339_1088861_+	NA	NA|538aa|down_7|NZ_CP044102.1_1088850_1090464_+	NA	NA|58aa|down_8|NZ_CP044102.1_1090750_1090924_+	NA	NA|82aa|down_9|NZ_CP044102.1_1091426_1091672_+	NA
GCF_008693725.1_ASM869372v1	NZ_CP044102	Streptococcus dysgalactiae strain FDAARGOS_654 chromosome, complete genome	3	1659210-1659975	1,3,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,cas9,csn2,csm6	Type I-U, Type I-U?,Type I-C	GTCTCACCCTTCGCGGGTGAGTGGATTGAAAT,GTCTCACCCTTCGCGGGTGAGTGGATTGAAAT,GTCTCACCCTTCGCGGGTGAGTGGATTGAAAT	32,32,32	0	0	NA	NA	I-C:I-C:I-C	10,11,11	11	TypeI-U,TypeI-U?,TypeI-C	DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,cas9,csn2,csm6	NA,NA	NA|197aa|up_9|NZ_CP044102.1_1646826_1647417_+	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|188aa|up_8|NZ_CP044102.1_1647413_1647977_+	pfam13238, AAA_18, AAA domain	NA|883aa|up_7|NZ_CP044102.1_1647978_1650627_+	PRK05729, valS, valyl-tRNA synthetase; Reviewed	cas3|809aa|up_6|NZ_CP044102.1_1650893_1653320_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|243aa|up_5|NZ_CP044102.1_1653582_1654311_+	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	cas8c|632aa|up_4|NZ_CP044102.1_1654310_1656206_+	cd09642, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas7|283aa|up_3|NZ_CP044102.1_1656210_1657059_+	COG3649, COG3649, CRISPR system related protein [Defense mechanisms]	cas4|225aa|up_2|NZ_CP044102.1_1657060_1657735_+	COG1468, COG1468, CRISPR-associated protein Cas4 (RecB family exonuclease) [Defense    mechanisms]	cas1|342aa|up_1|NZ_CP044102.1_1657731_1658757_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|98aa|up_0|NZ_CP044102.1_1658767_1659061_+	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	NA|320aa|down_0|NZ_CP044102.1_1660100_1661060_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|311aa|down_1|NZ_CP044102.1_1661151_1662084_+	cd12827, EcCorA_ZntB-like_u2, uncharacterized bacterial subfamily of the Escherichia coli CorA-Salmonella typhimurium ZntB family	NA|237aa|down_2|NZ_CP044102.1_1662303_1663014_+	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|208aa|down_3|NZ_CP044102.1_1663026_1663650_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|369aa|down_4|NZ_CP044102.1_1663692_1664799_+	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|247aa|down_5|NZ_CP044102.1_1664886_1665627_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|578aa|down_6|NZ_CP044102.1_1665623_1667357_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|360aa|down_7|NZ_CP044102.1_1667429_1668509_+	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|239aa|down_8|NZ_CP044102.1_1668522_1669239_+	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|158aa|down_9|NZ_CP044102.1_1669416_1669890_-	COG1438, ArgR, Arginine repressor [Transcription]
GCF_008693725.1_ASM869372v1	NZ_CP044102	Streptococcus dysgalactiae strain FDAARGOS_654 chromosome, complete genome	4	2019038-2020262	2,4,2	PILER-CR,CRISPRCasFinder,CRT	no	cas9,cas1,cas2,csn2	DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,cas9,csn2,csm6	Type II-A,Type II-B,Type II-C	GTTTTAGAGCTATGTTGTTTTGAATGGTCCCAAAAC,GTTTTAGAGCTATGTTGTTTTGAATGGTCCCAAAAC,GTTTTAGAGCTATGTTGTTTTGAATGGTCCCAAAAC	36,36,36	0	0	NA	NA	II-A,II-B:II-A,II-B:II-A,II-B	17,18,18	18	TypeII-A,TypeII-B,TypeII-C	DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,cas9,csn2,csm6	NA|214aa|up_8|NZ_CP044102.1_2008494_2009136_+,NA	NA|452aa|up_9|NZ_CP044102.1_2007015_2008371_+	PRK14316, glmM, phosphoglucosamine mutase; Provisional	NA|214aa|up_8|NZ_CP044102.1_2008494_2009136_+	NA	NA|377aa|up_7|NZ_CP044102.1_2009197_2010328_+	PRK08599, PRK08599, oxygen-independent coproporphyrinogen III oxidase	NA|251aa|up_6|NZ_CP044102.1_2010338_2011091_+	COG3884, FatA, Acyl-ACP thioesterase [Lipid metabolism]	NA|255aa|up_5|NZ_CP044102.1_2011090_2011855_+	cd07530, HAD_Pase_UmpH-like, UmpH/NagD family phosphatase, similar to Escherichia coli UmpH UMP phosphatase/NagD nucleotide phosphatase and Mycobacterium tuberculosis Rv1692 glycerol 3-phosphate phosphatase	NA|210aa|up_4|NZ_CP044102.1_2011854_2012484_+	COG4478, COG4478, Predicted membrane protein [Function unknown]	cas9|1372aa|up_3|NZ_CP044102.1_2012960_2017076_+	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	cas1|290aa|up_2|NZ_CP044102.1_2017075_2017945_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|114aa|up_1|NZ_CP044102.1_2017941_2018283_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|221aa|up_0|NZ_CP044102.1_2018272_2018935_+	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	NA|153aa|down_0|NZ_CP044102.1_2020377_2020836_+	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|611aa|down_1|NZ_CP044102.1_2020905_2022738_+	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|146aa|down_2|NZ_CP044102.1_2022913_2023351_+	PRK00222, PRK00222, peptide-methionine (R)-S-oxide reductase MsrB	NA|77aa|down_3|NZ_CP044102.1_2024221_2024452_+	pfam01721, Bacteriocin_II, Class II bacteriocin	NA|99aa|down_4|NZ_CP044102.1_2024451_2024748_+	pfam08951, EntA_Immun, Enterocin A Immunity	NA|390aa|down_5|NZ_CP044102.1_2024937_2026107_+	pfam11187, DUF2974, Protein of unknown function (DUF2974)	NA|340aa|down_6|NZ_CP044102.1_2026217_2027237_+	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|142aa|down_7|NZ_CP044102.1_2027443_2027869_+	COG2893, ManX, Phosphotransferase system, mannose/fructose-specific component IIA [Carbohydrate transport and metabolism]	NA|164aa|down_8|NZ_CP044102.1_2027888_2028380_+	COG3444, COG3444, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB [Carbohydrate transport and metabolism]	NA|270aa|down_9|NZ_CP044102.1_2028396_2029206_+	COG3715, ManY, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC [Carbohydrate transport and metabolism]
