assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900476445.1_55312_D02	NZ_LS483448	Streptococcus pneumoniae strain 4041STDY6836167 chromosome 1	1	11280-11381	1	CRISPRCasFinder	no	cas3	cas3,DEDDh,DinG	Unclear	GTTGTTTCTTATACAGTTTTTCTTG	25	0	0	NA	NA	NA	1	1	Unclear	cas3,DEDDh,DinG	NA|56aa|up_8|NZ_LS483448.1_6384_6552_-,NA|92aa|up_5|NZ_LS483448.1_9414_9690_-,NA|114aa|up_4|NZ_LS483448.1_9682_10024_-,NA|98aa|up_3|NZ_LS483448.1_10004_10298_-,NA|72aa|up_2|NZ_LS483448.1_10294_10510_-,NA|67aa|up_1|NZ_LS483448.1_10511_10712_-,NA|151aa|up_0|NZ_LS483448.1_10708_11161_-,NA|335aa|down_3|NZ_LS483448.1_13823_14828_+	NA|137aa|up_9|NZ_LS483448.1_4104_4515_-	pfam07374, DUF1492, Protein of unknown function (DUF1492)	NA|56aa|up_8|NZ_LS483448.1_6384_6552_-	NA	NA|490aa|up_7|NZ_LS483448.1_6892_8362_-	TIGR01613, putative_primase, phage/plasmid primase, P4 family, C-terminal domain	NA|287aa|up_6|NZ_LS483448.1_8414_9275_-	smart00942, PriCT_1, Primase C terminal 1 (PriCT-1)	NA|92aa|up_5|NZ_LS483448.1_9414_9690_-	NA	NA|114aa|up_4|NZ_LS483448.1_9682_10024_-	NA	NA|98aa|up_3|NZ_LS483448.1_10004_10298_-	NA	NA|72aa|up_2|NZ_LS483448.1_10294_10510_-	NA	NA|67aa|up_1|NZ_LS483448.1_10511_10712_-	NA	NA|151aa|up_0|NZ_LS483448.1_10708_11161_-	NA	NA|209aa|down_0|NZ_LS483448.1_11612_12239_-	COG3646, COG3646, Uncharacterized phage-encoded protein [Function unknown]	NA|78aa|down_1|NZ_LS483448.1_12328_12562_-	pfam01381, HTH_3, Helix-turn-helix	NA|319aa|down_2|NZ_LS483448.1_12718_13675_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|335aa|down_3|NZ_LS483448.1_13823_14828_+	NA	NA|151aa|down_4|NZ_LS483448.1_14827_15280_+	COG3654, Doc, Prophage maintenance system killer protein [General function prediction only]	NA|389aa|down_5|NZ_LS483448.1_15375_16542_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|372aa|down_6|NZ_LS483448.1_16661_17777_+	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|190aa|down_7|NZ_LS483448.1_17847_18417_+	COG0193, Pth, Peptidyl-tRNA hydrolase [Translation, ribosomal structure and biogenesis]	cas3|1170aa|down_8|NZ_LS483448.1_18417_21927_+	COG1197, Mfd, Transcription-repair coupling factor (superfamily II helicase) [DNA replication, recombination, and repair / Transcription]	NA|89aa|down_9|NZ_LS483448.1_21994_22261_+	COG1188, COG1188, Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]
GCF_900476445.1_55312_D02	NZ_LS483448	Streptococcus pneumoniae strain 4041STDY6836167 chromosome 1	2	141563-141658	2	CRISPRCasFinder	no		cas3,DEDDh,DinG	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG	NA|74aa|up_9|NZ_LS483448.1_132051_132273_+,NA|107aa|down_7|NZ_LS483448.1_149339_149660_-	NA|74aa|up_9|NZ_LS483448.1_132051_132273_+	NA	NA|310aa|up_8|NZ_LS483448.1_132591_133521_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|NZ_LS483448.1_133534_134458_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|NZ_LS483448.1_134715_136191_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NZ_LS483448.1_136462_137449_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|NZ_LS483448.1_137570_138431_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NZ_LS483448.1_138608_139673_-	pfam16001, DUF4775, Domain of unknown function (DUF4775)	NA|304aa|up_2|NZ_LS483448.1_139734_140646_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NZ_LS483448.1_140638_141232_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NZ_LS483448.1_141218_141545_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NZ_LS483448.1_141683_142850_-	COG2807, CynX, Cyanate permease [Inorganic ion transport and metabolism]	NA|386aa|down_1|NZ_LS483448.1_142907_144065_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NZ_LS483448.1_144106_145957_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|NZ_LS483448.1_146266_146899_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|NZ_LS483448.1_146920_147793_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|NZ_LS483448.1_147801_148473_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|191aa|down_6|NZ_LS483448.1_148714_149287_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|107aa|down_7|NZ_LS483448.1_149339_149660_-	NA	NA|96aa|down_8|NZ_LS483448.1_150084_150372_+	TIGR01653, hypothetical_protein, bacteriocin, lactococcin 972 family	NA|703aa|down_9|NZ_LS483448.1_150423_152532_+	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein
