assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900475005.1_41906_F02	NZ_LS483340	Streptococcus pyogenes strain NCTC12062 chromosome 1	1	574255-574748	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,csm6,DinG,csa3	Type I-U, Type I-U?,Type I-C	GTCTCACCCTTCATGGGTGAGTGGATTGAAAT,GTCTCACCCTTCATGGGTGAGTGGATTGAAAT,GTCTCACCCTTCATGGGTGAGTGGATTGAAAT	32,32,32	0	0	NA	NA	I-C:I-C:I-C	7,7,7	7	TypeI-U,TypeI-U?,TypeI-C	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,csm6,DinG,csa3	NA,NA	NA|132aa|up_9|NZ_LS483340.1_561723_562119_+	PRK07758, PRK07758, hypothetical protein; Provisional	NA|188aa|up_8|NZ_LS483340.1_562719_563283_+	pfam13238, AAA_18, AAA domain	NA|883aa|up_7|NZ_LS483340.1_563284_565933_+	PRK05729, valS, valyl-tRNA synthetase; Reviewed	cas3|803aa|up_6|NZ_LS483340.1_566086_568495_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|243aa|up_5|NZ_LS483340.1_568627_569356_+	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	cas8c|632aa|up_4|NZ_LS483340.1_569355_571251_+	TIGR01863, CRISPR-associated_protein_CT1133_family, CRISPR-associated protein Cas8c/Csd1, subtype I-C/DVULG	cas7|283aa|up_3|NZ_LS483340.1_571255_572104_+	COG3649, COG3649, CRISPR system related protein [Defense mechanisms]	cas4|225aa|up_2|NZ_LS483340.1_572105_572780_+	COG1468, COG1468, CRISPR-associated protein Cas4 (RecB family exonuclease) [Defense    mechanisms]	cas1|342aa|up_1|NZ_LS483340.1_572776_573802_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|98aa|up_0|NZ_LS483340.1_573812_574106_+	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	NA|237aa|down_0|NZ_LS483340.1_574907_575618_+	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|208aa|down_1|NZ_LS483340.1_575630_576254_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|369aa|down_2|NZ_LS483340.1_576296_577403_+	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|247aa|down_3|NZ_LS483340.1_577490_578231_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|578aa|down_4|NZ_LS483340.1_578227_579961_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|360aa|down_5|NZ_LS483340.1_580033_581113_+	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|239aa|down_6|NZ_LS483340.1_581126_581843_+	COG3382, COG3382, Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only]	NA|158aa|down_7|NZ_LS483340.1_582009_582483_-	COG1438, ArgR, Arginine repressor [Transcription]	NA|227aa|down_8|NZ_LS483340.1_582624_583305_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|412aa|down_9|NZ_LS483340.1_583578_584814_+	PRK01388, PRK01388, arginine deiminase; Provisional
GCF_900475005.1_41906_F02	NZ_LS483340	Streptococcus pyogenes strain NCTC12062 chromosome 1	2	795233-795333	2	CRISPRCasFinder	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,csm6,DinG,csa3	Orphan	TATAATTAGACTATACCAATTTT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,csm6,DinG,csa3	NA|262aa|up_6|NZ_LS483340.1_786616_787402_-,NA	NA|755aa|up_9|NZ_LS483340.1_781993_784258_+	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins	NA|207aa|up_8|NZ_LS483340.1_784514_785135_-	pfam12978, DUF3862, Domain of Unknown Function with PDB structure (DUF3862)	NA|205aa|up_7|NZ_LS483340.1_785875_786490_+	COG0398, COG0398, Uncharacterized conserved protein [Function unknown]	NA|262aa|up_6|NZ_LS483340.1_786616_787402_-	NA	NA|233aa|up_5|NZ_LS483340.1_787411_788110_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|124aa|up_4|NZ_LS483340.1_788109_788481_-	COG1725, COG1725, Predicted transcriptional regulators [Transcription]	NA|1037aa|up_3|NZ_LS483340.1_788665_791776_+	PRK07279, dnaE, DNA polymerase III DnaE; Reviewed	NA|338aa|up_2|NZ_LS483340.1_791855_792869_+	PRK03202, PRK03202, ATP-dependent 6-phosphofructokinase	NA|501aa|up_1|NZ_LS483340.1_792931_794434_+	PRK05826, PRK05826, pyruvate kinase; Provisional	NA|186aa|up_0|NZ_LS483340.1_794651_795209_+	TIGR02227, Inactive_signal_peptidase_IA	NA|605aa|down_0|NZ_LS483340.1_795384_797199_+	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|112aa|down_1|NZ_LS483340.1_797394_797730_+	COG2824, PhnA, Uncharacterized Zn-ribbon-containing protein involved in phosphonate metabolism [Inorganic ion transport and metabolism]	NA|214aa|down_2|NZ_LS483340.1_797836_798478_+	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|210aa|down_3|NZ_LS483340.1_798487_799117_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|279aa|down_4|NZ_LS483340.1_799132_799969_+	cd00996, PBP2_AatB_like, Polar amino acids-binding domain of ATP-binding cassette transporter-like systems that belong to the type 2 periplasmic binding fold protein superfamily	NA|258aa|down_5|NZ_LS483340.1_800338_801112_+	pfam07373, CAMP_factor, CAMP factor (Cfa)	NA|412aa|down_6|NZ_LS483340.1_801481_802717_-	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|441aa|down_7|NZ_LS483340.1_802836_804159_-	COG1115, AlsT, Na+/alanine symporter [Amino acid transport and metabolism]	NA|773aa|down_8|NZ_LS483340.1_804687_807006_+	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|83aa|down_9|NZ_LS483340.1_807367_807616_+	COG2261, COG2261, Predicted membrane protein [Function unknown]
GCF_900475005.1_41906_F02	NZ_LS483340	Streptococcus pyogenes strain NCTC12062 chromosome 1	3	994971-995130	2	PILER-CR	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,csm6,DinG,csa3	Orphan	GTTTTGGGACCATTCAAAACAGCATAGCT	29	0	0	NA	NA	II-A	2	2	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,csm6,DinG,csa3	NA,NA|214aa|down_4|NZ_LS483340.1_998640_999282_-	NA|550aa|up_9|NZ_LS483340.1_983038_984688_-	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|276aa|up_8|NZ_LS483340.1_984823_985651_-	COG3716, ManZ, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID [Carbohydrate transport and metabolism]	NA|270aa|up_7|NZ_LS483340.1_985647_986457_-	COG3715, ManY, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC [Carbohydrate transport and metabolism]	NA|164aa|up_6|NZ_LS483340.1_986473_986965_-	COG3444, COG3444, Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB [Carbohydrate transport and metabolism]	NA|142aa|up_5|NZ_LS483340.1_986991_987417_-	COG2893, ManX, Phosphotransferase system, mannose/fructose-specific component IIA [Carbohydrate transport and metabolism]	NA|340aa|up_4|NZ_LS483340.1_987623_988643_-	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|152aa|up_3|NZ_LS483340.1_988772_989228_-	PRK00222, PRK00222, peptide-methionine (R)-S-oxide reductase MsrB	NA|415aa|up_2|NZ_LS483340.1_989397_990642_-	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|611aa|up_1|NZ_LS483340.1_991098_992931_-	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|378aa|up_0|NZ_LS483340.1_993099_994233_-	COG5433, COG5433, Transposase [DNA replication, recombination, and repair]	NA|211aa|down_0|NZ_LS483340.1_995289_995922_-	COG4478, COG4478, Predicted membrane protein [Function unknown]	NA|255aa|down_1|NZ_LS483340.1_995921_996686_-	cd07530, HAD_Pase_UmpH-like, UmpH/NagD family phosphatase, similar to Escherichia coli UmpH UMP phosphatase/NagD nucleotide phosphatase and Mycobacterium tuberculosis Rv1692 glycerol 3-phosphate phosphatase	NA|251aa|down_2|NZ_LS483340.1_996685_997438_-	COG3884, FatA, Acyl-ACP thioesterase [Lipid metabolism]	NA|377aa|down_3|NZ_LS483340.1_997447_998578_-	PRK08599, PRK08599, oxygen-independent coproporphyrinogen III oxidase	NA|214aa|down_4|NZ_LS483340.1_998640_999282_-	NA	NA|452aa|down_5|NZ_LS483340.1_999405_1000761_-	PRK14316, glmM, phosphoglucosamine mutase; Provisional	NA|319aa|down_6|NZ_LS483340.1_1000814_1001771_-	COG4856, COG4856, Uncharacterized protein conserved in bacteria [Function unknown]	NA|284aa|down_7|NZ_LS483340.1_1001767_1002619_-	COG1624, COG1624, Uncharacterized conserved protein [Function unknown]	NA|448aa|down_8|NZ_LS483340.1_1002725_1004069_+	COG0769, MurE, UDP-N-acetylmuramyl tripeptide synthase [Cell envelope biogenesis, outer membrane]	NA|264aa|down_9|NZ_LS483340.1_1004068_1004860_+	COG3442, COG3442, Predicted glutamine amidotransferase [General function prediction only]
GCF_900475005.1_41906_F02	NZ_LS483340	Streptococcus pyogenes strain NCTC12062 chromosome 1	4	1616068-1616345	3	CRISPRCasFinder	no		cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,csm6,DinG,csa3	Orphan	CGCCTGGTTGGCCTGCTTCACCTTGTGGGCCT	32	1	1	1616100-1616133	NZ_LS483340.1_1616052-1616085	NA	4	4	Orphan	cas3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,csm6,DinG,csa3	NA,NA|361aa|down_5|NZ_LS483340.1_1623313_1624396_-	NA|250aa|up_9|NZ_LS483340.1_1601257_1602007_-	PRK14830, PRK14830, undecaprenyl pyrophosphate synthase; Provisional	NA|122aa|up_8|NZ_LS483340.1_1602225_1602591_-	PRK06531, yajC, preprotein translocase subunit YajC; Validated	NA|125aa|up_7|NZ_LS483340.1_1602706_1603081_-	TIGR01295, Pediocin_PA-1_biosynthesis_protein_PedC, bacteriocin transport accessory protein, putative	NA|1166aa|up_6|NZ_LS483340.1_1603195_1606693_-	TIGR02102, alkaline_amylopullulanase, pullulanase, extracellular, Gram-positive	NA|538aa|up_5|NZ_LS483340.1_1606890_1608504_-	cd11333, AmyAc_SI_OligoGlu_DGase, Alpha amylase catalytic domain found in Sucrose isomerases, oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), dextran glucosidase (also called glucan 1,6-alpha-glucosidase), and related proteins	NA|378aa|up_4|NZ_LS483340.1_1608632_1609766_-	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|283aa|up_3|NZ_LS483340.1_1610063_1610912_-	COG2508, COG2508, Regulator of polyketide synthase expression [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|433aa|up_2|NZ_LS483340.1_1611251_1612550_+	pfam02821, Staphylokinase, Staphylokinase/Streptokinase family	NA|148aa|up_1|NZ_LS483340.1_1612647_1613091_-	PRK05273, PRK05273, D-tyrosyl-tRNA(Tyr) deacylase; Provisional	NA|740aa|up_0|NZ_LS483340.1_1613105_1615325_-	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|161aa|down_0|NZ_LS483340.1_1617274_1617757_+	PRK02551, PRK02551, flavoprotein NrdI; Provisional	NA|273aa|down_1|NZ_LS483340.1_1618150_1618969_-	cd09079, RgfB-like, Streptococcus agalactiae RgfB, part of a putative two component signal transduction system, and related proteins	NA|729aa|down_2|NZ_LS483340.1_1619051_1621238_-	TIGR02003, PTS_system_glucose-specific_IIBC_component, PTS system, IIBC component	NA|250aa|down_3|NZ_LS483340.1_1621594_1622344_-	COG1385, COG1385, Uncharacterized protein conserved in bacteria [Function unknown]	NA|318aa|down_4|NZ_LS483340.1_1622343_1623297_-	pfam06325, PrmA, Ribosomal protein L11 methyltransferase (PrmA)	NA|361aa|down_5|NZ_LS483340.1_1623313_1624396_-	NA	NA|147aa|down_6|NZ_LS483340.1_1624489_1624930_-	cd04682, Nudix_Hydrolase_23, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|106aa|down_7|NZ_LS483340.1_1624957_1625275_-	cd06555, ASCH_PF0470_like, ASC-1 homology domain, subfamily similar to Pyrococcus furiosus Pf0470	NA|157aa|down_8|NZ_LS483340.1_1625288_1625759_-	pfam11217, DUF3013, Protein of unknown function (DUF3013)	NA|586aa|down_9|NZ_LS483340.1_1628089_1629847_+	COG0147, TrpE, Anthranilate/para-aminobenzoate synthases component I [Amino acid transport and metabolism / Coenzyme metabolism]
