assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000018125.1_ASM1812v1	NC_011375	Streptococcus pyogenes NZ131, complete sequence	1	827277-827578	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas9,cas1,cas2,csn2	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	Type II-C,Type II-B,Type II-A	GGTTTTAGAGCTATGCTGTTTTGAATGGTCCCAAAAC,GTTTTAGAGCTATGCTGTTTTGAATGGTCCCAAAAC,GTTTTAGAGCTATGCTGTTTTGAATGGTCCCAAAAC	37,36,36	0	0	NA	NA	II-A:II-A:II-A	3,4,4	4	TypeII-C,TypeII-B,TypeII-A	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	NA|214aa|up_8|NC_011375.1_816738_817380_+,NA	NA|452aa|up_9|NC_011375.1_815260_816616_+	PRK14316, glmM, phosphoglucosamine mutase; Provisional	NA|214aa|up_8|NC_011375.1_816738_817380_+	NA	NA|377aa|up_7|NC_011375.1_817442_818573_+	PRK08599, PRK08599, oxygen-independent coproporphyrinogen III oxidase	NA|251aa|up_6|NC_011375.1_818582_819335_+	COG3884, FatA, Acyl-ACP thioesterase [Lipid metabolism]	NA|255aa|up_5|NC_011375.1_819334_820099_+	cd07530, HAD_Pase_UmpH-like, UmpH/NagD family phosphatase, similar to Escherichia coli UmpH UMP phosphatase/NagD nucleotide phosphatase and Mycobacterium tuberculosis Rv1692 glycerol 3-phosphate phosphatase	NA|211aa|up_4|NC_011375.1_820098_820731_+	COG4478, COG4478, Predicted membrane protein [Function unknown]	cas9|1369aa|up_3|NC_011375.1_821209_825316_+	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	cas1|290aa|up_2|NC_011375.1_825315_826185_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|114aa|up_1|NC_011375.1_826181_826523_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|221aa|up_0|NC_011375.1_826512_827175_+	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	NA|611aa|down_0|NC_011375.1_828222_830055_+	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|146aa|down_1|NC_011375.1_831619_832057_+	PRK00222, PRK00222, peptide-methionine (R)-S-oxide reductase MsrB	NA|340aa|down_2|NC_011375.1_832186_833206_+	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|142aa|down_3|NC_011375.1_833412_833838_+	COG2893, ManX, Phosphotransferase system, mannose/fructose-specific component IIA [Carbohydrate transport and metabolism]	NA|164aa|down_4|NC_011375.1_833864_834356_+	COG3444, COG3444, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB [Carbohydrate transport and metabolism]	NA|280aa|down_5|NC_011375.1_835179_836019_+	COG3716, ManZ, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID [Carbohydrate transport and metabolism]	NA|550aa|down_6|NC_011375.1_836143_837793_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|263aa|down_7|NC_011375.1_837796_838585_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|349aa|down_8|NC_011375.1_838578_839625_+	cd13546, PBP2_BitB, Substrate binding domain of a putative iron transporter BitB, a member of the type 2 periplasmic binding fold superfamily	NA|466aa|down_9|NC_011375.1_839740_841138_+	cd07100, ALDH_SSADH1_GabD1, Mycobacterium tuberculosis succinate-semialdehyde dehydrogenase 1-like
GCF_000018125.1_ASM1812v1	NC_011375	Streptococcus pyogenes NZ131, complete sequence	2	1009852-1009953	2	CRISPRCasFinder	no		cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	Orphan	AATAATTGGTATAGTCTAATTATA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	NA,NA|262aa|down_6|NC_011375.1_1017783_1018569_+	NA|83aa|up_9|NC_011375.1_997568_997817_-	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|773aa|up_8|NC_011375.1_998179_1000498_-	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|441aa|up_7|NC_011375.1_1001026_1002349_+	COG1115, AlsT, Na+/alanine symporter [Amino acid transport and metabolism]	NA|412aa|up_6|NC_011375.1_1002468_1003704_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|258aa|up_5|NC_011375.1_1004073_1004847_-	pfam07373, CAMP_factor, CAMP factor (Cfa)	NA|279aa|up_4|NC_011375.1_1005216_1006053_-	cd00996, PBP2_AatB_like, Polar amino acids-binding domain of ATP-binding cassette transporter-like systems that belong to the type 2 periplasmic binding fold protein superfamily	NA|210aa|up_3|NC_011375.1_1006068_1006698_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|214aa|up_2|NC_011375.1_1006707_1007349_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|112aa|up_1|NC_011375.1_1007455_1007791_-	COG2824, PhnA, Uncharacterized Zn-ribbon-containing protein involved in phosphonate metabolism [Inorganic ion transport and metabolism]	NA|605aa|up_0|NC_011375.1_1007986_1009801_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|186aa|down_0|NC_011375.1_1009976_1010534_-	TIGR02227, Inactive_signal_peptidase_IA	NA|501aa|down_1|NC_011375.1_1010751_1012254_-	PRK05826, PRK05826, pyruvate kinase; Provisional	NA|338aa|down_2|NC_011375.1_1012316_1013330_-	PRK03202, PRK03202, ATP-dependent 6-phosphofructokinase	NA|1037aa|down_3|NC_011375.1_1013409_1016520_-	PRK07279, dnaE, DNA polymerase III DnaE; Reviewed	NA|124aa|down_4|NC_011375.1_1016704_1017076_+	COG1725, COG1725, Predicted transcriptional regulators [Transcription]	NA|233aa|down_5|NC_011375.1_1017075_1017774_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|262aa|down_6|NC_011375.1_1017783_1018569_+	NA	NA|202aa|down_7|NC_011375.1_1018705_1019311_-	COG0398, COG0398, Uncharacterized conserved protein [Function unknown]	NA|207aa|down_8|NC_011375.1_1020049_1020670_+	pfam12978, DUF3862, Domain of Unknown Function with PDB structure (DUF3862)	NA|755aa|down_9|NC_011375.1_1020926_1023191_-	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins
GCF_000018125.1_ASM1812v1	NC_011375	Streptococcus pyogenes NZ131, complete sequence	3	1201584-1201948	3,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	Type I-U,Type I-C, Type I-U?	ATTTCAATCCACTCACCCATGAAGGGTGAGAC,ATTTCAATCCACTCACCCATGAAGGGTGAGAC,ATTTCAATCCACTCACCCATGAAGGGTGAGAC	32,32,32	0	0	NA	NA	I-C:I-C:I-C	5,5,5	5	TypeI-U,TypeI-C,TypeI-U?	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	NA,NA	NA|412aa|up_9|NC_011375.1_1191518_1192754_-	PRK01388, PRK01388, arginine deiminase; Provisional	NA|227aa|up_8|NC_011375.1_1193027_1193708_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|158aa|up_7|NC_011375.1_1193849_1194323_+	COG1438, ArgR, Arginine repressor [Transcription]	NA|239aa|up_6|NC_011375.1_1194489_1195206_-	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|360aa|up_5|NC_011375.1_1195219_1196299_-	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|578aa|up_4|NC_011375.1_1196371_1198105_-	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|247aa|up_3|NC_011375.1_1198101_1198842_-	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|369aa|up_2|NC_011375.1_1198929_1200036_-	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|208aa|up_1|NC_011375.1_1200078_1200702_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|230aa|up_0|NC_011375.1_1200714_1201404_-	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	cas2|98aa|down_0|NC_011375.1_1202096_1202390_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas1|342aa|down_1|NC_011375.1_1202400_1203426_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|225aa|down_2|NC_011375.1_1203422_1204097_-	COG1468, COG1468, CRISPR-associated protein Cas4 (RecB family exonuclease) [Defense    mechanisms]	cas7|283aa|down_3|NC_011375.1_1204098_1204947_-	COG3649, COG3649, CRISPR system related protein [Defense mechanisms]	cas8c|632aa|down_4|NC_011375.1_1204951_1206847_-	TIGR01863, CRISPR-associated_protein_CT1133_family, CRISPR-associated protein Cas8c/Csd1, subtype I-C/DVULG	cas5|243aa|down_5|NC_011375.1_1206846_1207575_-	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	cas3|803aa|down_6|NC_011375.1_1207707_1210116_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|883aa|down_7|NC_011375.1_1210269_1212918_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|188aa|down_8|NC_011375.1_1212919_1213483_-	pfam13238, AAA_18, AAA domain	NA|132aa|down_9|NC_011375.1_1214085_1214481_-	PRK07758, PRK07758, hypothetical protein; Provisional
GCF_000018125.1_ASM1812v1	NC_011375	Streptococcus pyogenes NZ131, complete sequence	4	1616950-1617119	3	PILER-CR	no		cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	Orphan	GCCTCACCTTGTGGGCCTTGTGCGCCAGTTTCCGGTGTCACCTTTTTCACC	51	0	0	NA	NA	NA	2	2	Orphan	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	NA,NA|361aa|down_5|NC_011375.1_1624044_1625127_-	NA|250aa|up_9|NC_011375.1_1602022_1602772_-	PRK14830, PRK14830, undecaprenyl pyrophosphate synthase; Provisional	NA|122aa|up_8|NC_011375.1_1602990_1603356_-	PRK06531, yajC, preprotein translocase subunit YajC; Validated	NA|125aa|up_7|NC_011375.1_1603471_1603846_-	TIGR01295, Pediocin_PA-1_biosynthesis_protein_PedC, bacteriocin transport accessory protein, putative	NA|1166aa|up_6|NC_011375.1_1603960_1607458_-	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|538aa|up_5|NC_011375.1_1607655_1609269_-	cd11333, AmyAc_SI_OligoGlu_DGase, Alpha amylase catalytic domain found in Sucrose isomerases, oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), dextran glucosidase (also called glucan 1,6-alpha-glucosidase), and related proteins	NA|378aa|up_4|NC_011375.1_1609397_1610531_-	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|283aa|up_3|NC_011375.1_1610828_1611677_-	COG2508, COG2508, Regulator of polyketide synthase expression [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|441aa|up_2|NC_011375.1_1612016_1613339_+	pfam02821, Staphylokinase, Staphylokinase/Streptokinase family	NA|148aa|up_1|NC_011375.1_1613436_1613880_-	PRK05273, PRK05273, D-tyrosyl-tRNA(Tyr) deacylase; Provisional	NA|740aa|up_0|NC_011375.1_1613894_1616114_-	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|161aa|down_0|NC_011375.1_1618007_1618490_+	PRK02551, PRK02551, flavoprotein NrdI; Provisional	NA|273aa|down_1|NC_011375.1_1618882_1619701_-	cd09079, RgfB-like, Streptococcus agalactiae RgfB, part of a putative two component signal transduction system, and related proteins	NA|729aa|down_2|NC_011375.1_1619783_1621970_-	TIGR02003, PTS_system_glucose-specific_IIBC_component, PTS system, IIBC component	NA|250aa|down_3|NC_011375.1_1622325_1623075_-	COG1385, COG1385, Uncharacterized protein conserved in bacteria [Function unknown]	NA|318aa|down_4|NC_011375.1_1623074_1624028_-	pfam06325, PrmA, Ribosomal protein L11 methyltransferase (PrmA)	NA|361aa|down_5|NC_011375.1_1624044_1625127_-	NA	NA|106aa|down_6|NC_011375.1_1625687_1626005_-	cd06555, ASCH_PF0470_like, ASC-1 homology domain, subfamily similar to Pyrococcus furiosus Pf0470	NA|157aa|down_7|NC_011375.1_1626018_1626489_-	pfam11217, DUF3013, Protein of unknown function (DUF3013)	NA|74aa|down_8|NC_011375.1_1626553_1626775_-	pfam13310, Virulence_RhuM, Virulence protein RhuM family	NA|193aa|down_9|NC_011375.1_1626953_1627532_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed
