assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001635935.1_ASM163593v1	NZ_CP013839	Streptococcus pyogenes strain MGAS23530 chromosome, complete genome	1	112086-112182	1	CRISPRCasFinder	no		cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	Orphan	GCTAGATGGTGAAGAAGTCCCAGAA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	NA,NA|222aa|down_8|NZ_CP013839.1_124703_125369_-	NA|257aa|up_9|NZ_CP013839.1_102397_103168_-	PRK11880, PRK11880, pyrroline-5-carboxylate reductase; Reviewed	NA|356aa|up_8|NZ_CP013839.1_103215_104283_-	TIGR03107, Glutamyl_aminopeptidase, glutamyl aminopeptidase	NA|98aa|up_7|NZ_CP013839.1_104738_105032_+	pfam15513, DUF4651, Domain of unknown function (DUF4651)	NA|106aa|up_6|NZ_CP013839.1_105028_105346_+	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|209aa|up_5|NZ_CP013839.1_105363_105990_+	cd02796, tRNA_bind_bactPheRS, tRNA-binding-domain-containing prokaryotic phenylalanly tRNA synthetase (PheRS) beta chain	NA|132aa|up_4|NZ_CP013839.1_106141_106537_+	PRK07274, PRK07274, single-stranded DNA-binding protein; Provisional	NA|214aa|up_3|NZ_CP013839.1_106796_107438_-	COG1428, COG1428, Deoxynucleoside kinases [Nucleotide transport and metabolism]	NA|326aa|up_2|NZ_CP013839.1_107457_108435_-	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|291aa|up_1|NZ_CP013839.1_108421_109294_-	PRK00114, hslO, Hsp33 family molecular chaperone HslO	NA|498aa|up_0|NZ_CP013839.1_109440_110934_-	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|283aa|down_0|NZ_CP013839.1_112976_113825_+	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|754aa|down_1|NZ_CP013839.1_114110_116372_+	NF033396, pilus_ancill_1, pilus ancillary protein 1	NA|174aa|down_2|NZ_CP013839.1_116368_116890_+	TIGR02227, Inactive_signal_peptidase_IA	NA|350aa|down_3|NZ_CP013839.1_116911_117961_+	TIGR03786, strep_pil_rpt, streptococcal pilin isopeptide linkage domain	NA|242aa|down_4|NZ_CP013839.1_117976_118702_+	TIGR03064, sortase_srtB, sortase, SrtB family	NA|196aa|down_5|NZ_CP013839.1_118718_119306_+	TIGR03786, strep_pil_rpt, streptococcal pilin isopeptide linkage domain	NA|402aa|down_6|NZ_CP013839.1_119464_120670_-	TIGR04094, AraC_family_transcriptional_regulator, YSIRK-targeted surface antigen transcriptional regulator	NA|1126aa|down_7|NZ_CP013839.1_121060_124438_+	pfam05738, Cna_B, Cna protein B-type domain	NA|222aa|down_8|NZ_CP013839.1_124703_125369_-	NA	NA|469aa|down_9|NZ_CP013839.1_125721_127128_+	COG2031, AtoE, Short chain fatty acids transporter [Lipid metabolism]
GCF_001635935.1_ASM163593v1	NZ_CP013839	Streptococcus pyogenes strain MGAS23530 chromosome, complete genome	2	756938-757303	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas9,cas1,cas2,csn2	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	Type II-A,Type II-C,Type II-B	GTTTTAGAGCTATGCTGTTTTGAATGGTCCCAAAAC,GTTTTAGAGCTATGCTGTTTTGAATGGTCCCAAAAC,GTTTTAGAGCTATGCTGTTTTGAATGGTCCCAAAAC	36,36,36	0	0	NA	NA	II-A:II-A:II-A	4,5,5	5	TypeII-A,TypeII-C,TypeII-B	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	NA|214aa|up_8|NZ_CP013839.1_746399_747041_+,NA	NA|452aa|up_9|NZ_CP013839.1_744920_746276_+	PRK14316, glmM, phosphoglucosamine mutase; Provisional	NA|214aa|up_8|NZ_CP013839.1_746399_747041_+	NA	NA|377aa|up_7|NZ_CP013839.1_747103_748234_+	PRK08599, PRK08599, oxygen-independent coproporphyrinogen III oxidase	NA|251aa|up_6|NZ_CP013839.1_748243_748996_+	COG3884, FatA, Acyl-ACP thioesterase [Lipid metabolism]	NA|255aa|up_5|NZ_CP013839.1_748995_749760_+	cd07530, HAD_Pase_UmpH-like, UmpH/NagD family phosphatase, similar to Escherichia coli UmpH UMP phosphatase/NagD nucleotide phosphatase and Mycobacterium tuberculosis Rv1692 glycerol 3-phosphate phosphatase	NA|211aa|up_4|NZ_CP013839.1_749759_750392_+	COG4478, COG4478, Predicted membrane protein [Function unknown]	cas9|1369aa|up_3|NZ_CP013839.1_750869_754976_+	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	cas1|290aa|up_2|NZ_CP013839.1_754975_755845_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|114aa|up_1|NZ_CP013839.1_755841_756183_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|221aa|up_0|NZ_CP013839.1_756172_756835_+	cd09758, Csn2, CRISPR/Cas system-associated protein Csn2	NA|611aa|down_0|NZ_CP013839.1_757953_759786_+	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|471aa|down_1|NZ_CP013839.1_759934_761347_+	TIGR00927, retinal_rod, K+-dependent Na+/Ca+ exchanger	NA|146aa|down_2|NZ_CP013839.1_761546_761984_+	PRK00222, PRK00222, peptide-methionine (R)-S-oxide reductase MsrB	NA|340aa|down_3|NZ_CP013839.1_762113_763133_+	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|142aa|down_4|NZ_CP013839.1_763339_763765_+	COG2893, ManX, Phosphotransferase system, mannose/fructose-specific component IIA [Carbohydrate transport and metabolism]	NA|164aa|down_5|NZ_CP013839.1_763783_764275_+	COG3444, COG3444, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB [Carbohydrate transport and metabolism]	NA|270aa|down_6|NZ_CP013839.1_764291_765101_+	COG3715, ManY, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC [Carbohydrate transport and metabolism]	NA|276aa|down_7|NZ_CP013839.1_765097_765925_+	COG3716, ManZ, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID [Carbohydrate transport and metabolism]	NA|550aa|down_8|NZ_CP013839.1_766060_767710_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|263aa|down_9|NZ_CP013839.1_767713_768502_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]
GCF_001635935.1_ASM163593v1	NZ_CP013839	Streptococcus pyogenes strain MGAS23530 chromosome, complete genome	3	953307-953408	3	CRISPRCasFinder	no		cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	Orphan	AATAATTGGTATAGTCTAATTATA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	NA,NA|262aa|down_6|NZ_CP013839.1_961238_962024_+	NA|83aa|up_9|NZ_CP013839.1_941023_941272_-	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|773aa|up_8|NZ_CP013839.1_941634_943953_-	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|441aa|up_7|NZ_CP013839.1_944481_945804_+	COG1115, AlsT, Na+/alanine symporter [Amino acid transport and metabolism]	NA|412aa|up_6|NZ_CP013839.1_945923_947159_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|258aa|up_5|NZ_CP013839.1_947528_948302_-	pfam07373, CAMP_factor, CAMP factor (Cfa)	NA|279aa|up_4|NZ_CP013839.1_948671_949508_-	cd00996, PBP2_AatB_like, Polar amino acids-binding domain of ATP-binding cassette transporter-like systems that belong to the type 2 periplasmic binding fold protein superfamily	NA|210aa|up_3|NZ_CP013839.1_949523_950153_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|214aa|up_2|NZ_CP013839.1_950162_950804_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|112aa|up_1|NZ_CP013839.1_950910_951246_-	COG2824, PhnA, Uncharacterized Zn-ribbon-containing protein involved in phosphonate metabolism [Inorganic ion transport and metabolism]	NA|605aa|up_0|NZ_CP013839.1_951441_953256_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|186aa|down_0|NZ_CP013839.1_953431_953989_-	TIGR02227, Inactive_signal_peptidase_IA	NA|501aa|down_1|NZ_CP013839.1_954206_955709_-	PRK05826, PRK05826, pyruvate kinase; Provisional	NA|338aa|down_2|NZ_CP013839.1_955771_956785_-	PRK03202, PRK03202, ATP-dependent 6-phosphofructokinase	NA|1037aa|down_3|NZ_CP013839.1_956864_959975_-	PRK07279, dnaE, DNA polymerase III DnaE; Reviewed	NA|124aa|down_4|NZ_CP013839.1_960159_960531_+	COG1725, COG1725, Predicted transcriptional regulators [Transcription]	NA|233aa|down_5|NZ_CP013839.1_960530_961229_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|262aa|down_6|NZ_CP013839.1_961238_962024_+	NA	NA|205aa|down_7|NZ_CP013839.1_962154_962769_-	COG0398, COG0398, Uncharacterized conserved protein [Function unknown]	NA|207aa|down_8|NZ_CP013839.1_963509_964130_+	pfam12978, DUF3862, Domain of Unknown Function with PDB structure (DUF3862)	NA|755aa|down_9|NZ_CP013839.1_964386_966651_-	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins
GCF_001635935.1_ASM163593v1	NZ_CP013839	Streptococcus pyogenes strain MGAS23530 chromosome, complete genome	4	1147958-1148191	4,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	 Type I-U?,Type I-C,Type I-U	TTATTTCAATCCACTCACCCATGAAGGGTGAGAC,TTATTTCAATCCACTCACCCATGAAGGGTGAGAC,TTATTTCAATCCACTCACCCATGAAGGGTGAGAC	34,34,34	0	0	NA	NA	I-C:I-C:I-C	3,3,2	3	TypeI-U?,TypeI-C,TypeI-U	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	NA,NA	NA|412aa|up_9|NZ_CP013839.1_1137894_1139130_-	PRK01388, PRK01388, arginine deiminase; Provisional	NA|227aa|up_8|NZ_CP013839.1_1139403_1140084_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|158aa|up_7|NZ_CP013839.1_1140225_1140699_+	COG1438, ArgR, Arginine repressor [Transcription]	NA|239aa|up_6|NZ_CP013839.1_1140864_1141581_-	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|360aa|up_5|NZ_CP013839.1_1141594_1142674_-	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|578aa|up_4|NZ_CP013839.1_1142746_1144480_-	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|247aa|up_3|NZ_CP013839.1_1144476_1145217_-	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|369aa|up_2|NZ_CP013839.1_1145304_1146411_-	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|208aa|up_1|NZ_CP013839.1_1146453_1147077_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|237aa|up_0|NZ_CP013839.1_1147089_1147800_-	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	cas2|98aa|down_0|NZ_CP013839.1_1148339_1148633_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas1|342aa|down_1|NZ_CP013839.1_1148643_1149669_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|225aa|down_2|NZ_CP013839.1_1149665_1150340_-	COG1468, COG1468, CRISPR-associated protein Cas4 (RecB family exonuclease) [Defense    mechanisms]	cas7|283aa|down_3|NZ_CP013839.1_1150341_1151190_-	COG3649, COG3649, CRISPR system related protein [Defense mechanisms]	cas8c|632aa|down_4|NZ_CP013839.1_1151194_1153090_-	TIGR01863, CRISPR-associated_protein_CT1133_family, CRISPR-associated protein Cas8c/Csd1, subtype I-C/DVULG	cas5|243aa|down_5|NZ_CP013839.1_1153089_1153818_-	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	cas3|803aa|down_6|NZ_CP013839.1_1153950_1156359_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|883aa|down_7|NZ_CP013839.1_1156512_1159161_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|188aa|down_8|NZ_CP013839.1_1159162_1159726_-	pfam13238, AAA_18, AAA domain	NA|132aa|down_9|NZ_CP013839.1_1160326_1160722_-	PRK07758, PRK07758, hypothetical protein; Provisional
GCF_001635935.1_ASM163593v1	NZ_CP013839	Streptococcus pyogenes strain MGAS23530 chromosome, complete genome	5	1542363-1542617	3	PILER-CR	no		cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	Orphan	TGGAGTTTCTGGGGCTGCTGGAGTTGATGGAGATGGGGTTTCCT	44	0	0	NA	NA	NA	2	2	Orphan	cas3,DinG,csm6,cas9,cas1,cas2,csn2,cas4,cas7,cas8c,cas5,DEDDh,csa3	NA|75aa|up_8|NZ_CP013839.1_1532474_1532699_-,NA|84aa|down_5|NZ_CP013839.1_1553417_1553669_-	NA|234aa|up_9|NZ_CP013839.1_1531447_1532149_+	pfam02876, Stap_Strp_tox_C, Staphylococcal/Streptococcal toxin, beta-grasp domain	NA|75aa|up_8|NZ_CP013839.1_1532474_1532699_-	NA	NA|543aa|up_7|NZ_CP013839.1_1532877_1534506_+	cd08518, PBP2_NikA_DppA_OppA_like_19, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|326aa|up_6|NZ_CP013839.1_1534618_1535596_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|274aa|up_5|NZ_CP013839.1_1535592_1536414_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|268aa|up_4|NZ_CP013839.1_1536425_1537229_+	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|209aa|up_3|NZ_CP013839.1_1537212_1537839_+	COG1124, DppF, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|67aa|up_2|NZ_CP013839.1_1537918_1538119_-	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|824aa|up_1|NZ_CP013839.1_1538287_1540759_-	TIGR01363, pneumococcal_histidine_triad_A_protein, streptococcal histidine triad protein	NA|307aa|up_0|NZ_CP013839.1_1540771_1541692_-	cd01017, AdcA, Metal binding protein AdcA	NA|1166aa|down_0|NZ_CP013839.1_1543209_1546707_-	cd07475, Peptidases_S8_C5a_Peptidase, Peptidase S8 family domain in Streptococcal C5a peptidases	NA|369aa|down_1|NZ_CP013839.1_1547041_1548148_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|368aa|down_2|NZ_CP013839.1_1548357_1549461_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|426aa|down_3|NZ_CP013839.1_1549685_1550963_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|531aa|down_4|NZ_CP013839.1_1551149_1552742_-	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|84aa|down_5|NZ_CP013839.1_1553417_1553669_-	NA	NA|543aa|down_6|NZ_CP013839.1_1553746_1555375_-	COG3942, COG3942, Surface antigen [General function prediction only]	NA|463aa|down_7|NZ_CP013839.1_1555476_1556865_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|218aa|down_8|NZ_CP013839.1_1556861_1557515_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|406aa|down_9|NZ_CP013839.1_1557608_1558826_-	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB
