assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_003351725.1_ASM335172v1	NZ_CP031246	Streptococcus pneumoniae strain M26368 chromosome, complete genome	1	99789-99874	1	CRISPRCasFinder	no		RT,cas3,DEDDh,DinG	Orphan	TCCTTTTTTGAAACGTTTCATTTTT	25	0	0	NA	NA	NA	1	1	Orphan	RT,cas3,DEDDh,DinG	NA|63aa|up_6|NZ_CP031246.1_92318_92507_-,NA|80aa|up_5|NZ_CP031246.1_92661_92901_-,NA|244aa|down_9|NZ_CP031246.1_107561_108293_-	NA|157aa|up_9|NZ_CP031246.1_90752_91223_-	pfam11217, DUF3013, Protein of unknown function (DUF3013)	NA|151aa|up_8|NZ_CP031246.1_91513_91966_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|60aa|up_7|NZ_CP031246.1_92002_92182_-	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|63aa|up_6|NZ_CP031246.1_92318_92507_-	NA	NA|80aa|up_5|NZ_CP031246.1_92661_92901_-	NA	NA|424aa|up_4|NZ_CP031246.1_93170_94442_+	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|440aa|up_3|NZ_CP031246.1_94988_96308_-	COG1621, SacC, Beta-fructosidases (levanase/invertase) [Carbohydrate transport and metabolism]	NA|539aa|up_2|NZ_CP031246.1_96317_97934_-	cd13581, PBP2_AlgQ_like_2, Periplasmic-binding component of alginate-specific ABC uptake system-like; contains the type 2 periplasmic binding fold	NA|297aa|up_1|NZ_CP031246.1_97962_98853_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|306aa|up_0|NZ_CP031246.1_98863_99781_-	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|334aa|down_0|NZ_CP031246.1_99930_100932_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|494aa|down_1|NZ_CP031246.1_101293_102775_-	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|189aa|down_2|NZ_CP031246.1_103466_104033_+	NF033218, anchor_AmaP, alkaline shock response membrane anchor protein AmaP	NA|57aa|down_3|NZ_CP031246.1_104044_104215_+	COG5547, COG5547, Small integral membrane protein [Function unknown]	NA|203aa|down_4|NZ_CP031246.1_104253_104862_+	COG1302, COG1302, Uncharacterized protein conserved in bacteria [Function unknown]	NA|68aa|down_5|NZ_CP031246.1_104892_105096_+	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|149aa|down_6|NZ_CP031246.1_105168_105615_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|220aa|down_7|NZ_CP031246.1_105684_106344_+	pfam06750, DiS_P_DiS, Bacterial Peptidase A24 N-terminal domain	NA|384aa|down_8|NZ_CP031246.1_106395_107547_-	COG2856, COG2856, Predicted Zn peptidase [Amino acid transport and metabolism]	NA|244aa|down_9|NZ_CP031246.1_107561_108293_-	NA
GCF_003351725.1_ASM335172v1	NZ_CP031246	Streptococcus pneumoniae strain M26368 chromosome, complete genome	2	654880-654975	2	CRISPRCasFinder	no		RT,cas3,DEDDh,DinG	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	RT,cas3,DEDDh,DinG	NA|74aa|up_9|NZ_CP031246.1_645367_645589_+,NA|107aa|down_7|NZ_CP031246.1_662798_663119_-	NA|74aa|up_9|NZ_CP031246.1_645367_645589_+	NA	NA|310aa|up_8|NZ_CP031246.1_645907_646837_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|NZ_CP031246.1_646850_647774_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|NZ_CP031246.1_648031_649507_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NZ_CP031246.1_649778_650765_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|NZ_CP031246.1_650886_651747_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NZ_CP031246.1_651924_652989_-	pfam05262, Borrelia_P83, Borrelia P83/100 protein	NA|304aa|up_2|NZ_CP031246.1_653051_653963_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NZ_CP031246.1_653955_654549_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NZ_CP031246.1_654535_654862_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NZ_CP031246.1_655000_656167_-	cd17339, MFS_NIMT_CynX_like, 2-nitroimidazole and cyanate transporters and similar proteins of the Major Facilitator Superfamily of transporters	NA|386aa|down_1|NZ_CP031246.1_656224_657382_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NZ_CP031246.1_657423_659274_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|NZ_CP031246.1_659711_660344_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|NZ_CP031246.1_660365_661238_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|NZ_CP031246.1_661246_661918_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|196aa|down_6|NZ_CP031246.1_662159_662747_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|107aa|down_7|NZ_CP031246.1_662798_663119_-	NA	NA|84aa|down_8|NZ_CP031246.1_663577_663829_+	TIGR01653, hypothetical_protein, bacteriocin, lactococcin 972 family	NA|703aa|down_9|NZ_CP031246.1_663880_665989_+	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein
GCF_003351725.1_ASM335172v1	NZ_CP031246	Streptococcus pneumoniae strain M26368 chromosome, complete genome	3	1743892-1743963	3	CRISPRCasFinder	no		RT,cas3,DEDDh,DinG	Orphan	ATTTACAAAATCAACCTCGCTCT	23	0	0	NA	NA	NA	1	1	Orphan	RT,cas3,DEDDh,DinG	NA|106aa|up_4|NZ_CP031246.1_1739340_1739658_+,NA|592aa|down_6|NZ_CP031246.1_1750442_1752218_-	NA|360aa|up_9|NZ_CP031246.1_1735196_1736276_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|308aa|up_8|NZ_CP031246.1_1736325_1737249_-	PRK00856, pyrB, aspartate carbamoyltransferase catalytic subunit	NA|174aa|up_7|NZ_CP031246.1_1737267_1737789_-	PRK05205, PRK05205, bifunctional pyr operon transcriptional regulator/uracil phosphoribosyltransferase PyrR	NA|210aa|up_6|NZ_CP031246.1_1737999_1738629_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|181aa|up_5|NZ_CP031246.1_1738628_1739171_-	COG1399, COG1399, Predicted metal-binding, possibly nucleic acid-binding protein [General function prediction only]	NA|106aa|up_4|NZ_CP031246.1_1739340_1739658_+	NA	NA|520aa|up_3|NZ_CP031246.1_1739896_1741456_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|300aa|up_2|NZ_CP031246.1_1741604_1742504_-	PRK04897, PRK04897, heat shock protein HtpX; Provisional	NA|187aa|up_1|NZ_CP031246.1_1742505_1743066_-	COG1704, LemA, Uncharacterized conserved protein [Function unknown]	NA|238aa|up_0|NZ_CP031246.1_1743159_1743873_+	PRK00107, gidB, 16S rRNA (guanine(527)-N(7))-methyltransferase RsmG	NA|428aa|down_0|NZ_CP031246.1_1744175_1745459_+	PRK10720, PRK10720, uracil transporter; Provisional	NA|524aa|down_1|NZ_CP031246.1_1745653_1747225_-	PRK10867, PRK10867, signal recognition particle protein; Provisional	NA|111aa|down_2|NZ_CP031246.1_1747236_1747569_-	PRK00118, PRK00118, putative DNA-binding protein; Validated	NA|122aa|down_3|NZ_CP031246.1_1747659_1748025_-	pfam09148, DUF1934, Domain of unknown function (DUF1934)	NA|435aa|down_4|NZ_CP031246.1_1748108_1749413_+	COG1078, COG1078, HD superfamily phosphohydrolases [General function prediction only]	NA|269aa|down_5|NZ_CP031246.1_1749425_1750232_+	PRK10513, PRK10513, sugar phosphate phosphatase; Provisional	NA|592aa|down_6|NZ_CP031246.1_1750442_1752218_-	NA	NA|394aa|down_7|NZ_CP031246.1_1752703_1753885_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|407aa|down_8|NZ_CP031246.1_1753897_1755118_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|285aa|down_9|NZ_CP031246.1_1755211_1756066_+	TIGR01716, HTH-type_transcriptional_regulator_rgg, transcriptional activator, Rgg/GadR/MutR family, C-terminal domain
