assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002813955.1_ASM281395v1	NZ_CP018838	Streptococcus pneumoniae strain 11A chromosome, complete genome	1	1168178-1168261	1	CRISPRCasFinder	no		cas3,DinG,DEDDh	Orphan	CTTTTTTGAAACGTTTCATTTTT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,DEDDh	NA|60aa|up_6|NZ_CP018838.1_1160707_1160887_-,NA|80aa|up_5|NZ_CP018838.1_1161051_1161291_-,NA	NA|157aa|up_9|NZ_CP018838.1_1159141_1159612_-	pfam11217, DUF3013, Protein of unknown function (DUF3013)	NA|151aa|up_8|NZ_CP018838.1_1159902_1160355_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General function prediction only]	NA|69aa|up_7|NZ_CP018838.1_1160391_1160598_-	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|60aa|up_6|NZ_CP018838.1_1160707_1160887_-	NA	NA|80aa|up_5|NZ_CP018838.1_1161051_1161291_-	NA	NA|424aa|up_4|NZ_CP018838.1_1161560_1162832_+	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|440aa|up_3|NZ_CP018838.1_1163375_1164695_-	COG1621, SacC, Beta-fructosidases (levanase/invertase) [Carbohydrate transport and metabolism]	NA|539aa|up_2|NZ_CP018838.1_1164704_1166321_-	cd13581, PBP2_AlgQ_like_2, Periplasmic-binding component of alginate-specific ABC uptake system-like; contains the type 2 periplasmic binding fold	NA|297aa|up_1|NZ_CP018838.1_1166349_1167240_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|306aa|up_0|NZ_CP018838.1_1167250_1168168_-	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|334aa|down_0|NZ_CP018838.1_1168317_1169319_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|494aa|down_1|NZ_CP018838.1_1169680_1171162_-	pfam08270, PRD_Mga, M protein trans-acting positive regulator (MGA) PRD domain	NA|55aa|down_2|NZ_CP018838.1_1171564_1171729_+	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|189aa|down_3|NZ_CP018838.1_1171812_1172379_+	NF033218, anchor_AmaP, alkaline shock response membrane anchor protein AmaP	NA|57aa|down_4|NZ_CP018838.1_1172390_1172561_+	COG5547, COG5547, Small integral membrane protein [Function unknown]	NA|203aa|down_5|NZ_CP018838.1_1172599_1173208_+	COG1302, COG1302, Uncharacterized protein conserved in bacteria [Function unknown]	NA|68aa|down_6|NZ_CP018838.1_1173238_1173442_+	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|96aa|down_7|NZ_CP018838.1_1174133_1174421_+	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|257aa|down_8|NZ_CP018838.1_1174698_1175469_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|283aa|down_9|NZ_CP018838.1_1175483_1176332_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1
GCF_002813955.1_ASM281395v1	NZ_CP018838	Streptococcus pneumoniae strain 11A chromosome, complete genome	2	1628110-1628211	2	CRISPRCasFinder	no	cas3	cas3,DinG,DEDDh	Unclear	GTTGTTTCTTATACAGTTTTTCTTG	25	0	0	NA	NA	NA	1	1	Unclear	cas3,DinG,DEDDh	NA|131aa|up_9|NZ_CP018838.1_1620386_1620779_-,NA|92aa|up_5|NZ_CP018838.1_1626244_1626520_-,NA|114aa|up_4|NZ_CP018838.1_1626512_1626854_-,NA|98aa|up_3|NZ_CP018838.1_1626834_1627128_-,NA|72aa|up_2|NZ_CP018838.1_1627124_1627340_-,NA|67aa|up_1|NZ_CP018838.1_1627341_1627542_-,NA|132aa|up_0|NZ_CP018838.1_1627538_1627934_-,NA|335aa|down_3|NZ_CP018838.1_1630653_1631658_+	NA|131aa|up_9|NZ_CP018838.1_1620386_1620779_-	NA	NA|137aa|up_8|NZ_CP018838.1_1620934_1621345_-	pfam07374, DUF1492, Protein of unknown function (DUF1492)	NA|490aa|up_7|NZ_CP018838.1_1623722_1625192_-	TIGR01613, putative_primase, phage/plasmid primase, P4 family, C-terminal domain	NA|287aa|up_6|NZ_CP018838.1_1625244_1626105_-	smart00942, PriCT_1, Primase C terminal 1 (PriCT-1)	NA|92aa|up_5|NZ_CP018838.1_1626244_1626520_-	NA	NA|114aa|up_4|NZ_CP018838.1_1626512_1626854_-	NA	NA|98aa|up_3|NZ_CP018838.1_1626834_1627128_-	NA	NA|72aa|up_2|NZ_CP018838.1_1627124_1627340_-	NA	NA|67aa|up_1|NZ_CP018838.1_1627341_1627542_-	NA	NA|132aa|up_0|NZ_CP018838.1_1627538_1627934_-	NA	NA|209aa|down_0|NZ_CP018838.1_1628442_1629069_-	COG3646, COG3646, Uncharacterized phage-encoded protein [Function unknown]	NA|78aa|down_1|NZ_CP018838.1_1629158_1629392_-	pfam01381, HTH_3, Helix-turn-helix	NA|319aa|down_2|NZ_CP018838.1_1629548_1630505_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|335aa|down_3|NZ_CP018838.1_1630653_1631658_+	NA	NA|151aa|down_4|NZ_CP018838.1_1631657_1632110_+	COG3654, Doc, Prophage maintenance system killer protein [General function prediction only]	NA|389aa|down_5|NZ_CP018838.1_1632205_1633372_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|372aa|down_6|NZ_CP018838.1_1633491_1634607_+	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|190aa|down_7|NZ_CP018838.1_1634677_1635247_+	COG0193, Pth, Peptidyl-tRNA hydrolase [Translation, ribosomal structure and biogenesis]	cas3|1175aa|down_8|NZ_CP018838.1_1635247_1638772_+	COG1197, Mfd, Transcription-repair coupling factor (superfamily II helicase) [DNA replication, recombination, and repair / Transcription]	NA|89aa|down_9|NZ_CP018838.1_1638839_1639106_+	COG1188, COG1188, Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]
GCF_002813955.1_ASM281395v1	NZ_CP018838	Streptococcus pneumoniae strain 11A chromosome, complete genome	3	1729834-1729929	3	CRISPRCasFinder	no		cas3,DinG,DEDDh	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,DEDDh	NA|47aa|up_7|NZ_CP018838.1_1722802_1722943_-,NA|196aa|down_8|NZ_CP018838.1_1738878_1739466_+	NA|310aa|up_9|NZ_CP018838.1_1720862_1721792_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_8|NZ_CP018838.1_1721805_1722729_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|47aa|up_7|NZ_CP018838.1_1722802_1722943_-	NA	NA|492aa|up_6|NZ_CP018838.1_1722986_1724462_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|NZ_CP018838.1_1724733_1725720_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|338aa|up_4|NZ_CP018838.1_1725688_1726702_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|NZ_CP018838.1_1726879_1727944_-	pfam16001, DUF4775, Domain of unknown function (DUF4775)	NA|304aa|up_2|NZ_CP018838.1_1728005_1728917_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|NZ_CP018838.1_1728909_1729503_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|NZ_CP018838.1_1729489_1729816_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|NZ_CP018838.1_1729954_1731121_-	COG2807, CynX, Cyanate permease [Inorganic ion transport and metabolism]	NA|386aa|down_1|NZ_CP018838.1_1731178_1732336_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|NZ_CP018838.1_1732377_1734228_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|NZ_CP018838.1_1734533_1735166_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|NZ_CP018838.1_1735187_1736060_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|NZ_CP018838.1_1736068_1736740_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|217aa|down_6|NZ_CP018838.1_1736979_1737630_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|288aa|down_7|NZ_CP018838.1_1737681_1738545_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|196aa|down_8|NZ_CP018838.1_1738878_1739466_+	NA	NA|291aa|down_9|NZ_CP018838.1_1739709_1740582_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins
