assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003966505.1_ASM396650v1	AP018937	Streptococcus pneumoniae HU-OH DNA, complete genome	1	94186-94281	1	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	AATGTGTAAGATTTTTATATATAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA|74aa|up_9|AP018937.1_84569_84791_+,NA	NA|74aa|up_9|AP018937.1_84569_84791_+	NA	NA|310aa|up_8|AP018937.1_85109_86039_+	COG4209, LplB, ABC-type polysaccharide transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|up_7|AP018937.1_86052_86976_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|492aa|up_6|AP018937.1_87185_88661_+	pfam12010, DUF3502, Domain of unknown function (DUF3502)	NA|329aa|up_5|AP018937.1_88932_89919_+	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|287aa|up_4|AP018937.1_90193_91054_+	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|355aa|up_3|AP018937.1_91231_92296_-	pfam16001, DUF4775, Domain of unknown function (DUF4775)	NA|304aa|up_2|AP018937.1_92357_93269_-	pfam13349, DUF4097, Putative adhesin	NA|198aa|up_1|AP018937.1_93261_93855_-	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|109aa|up_0|AP018937.1_93841_94168_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|389aa|down_0|AP018937.1_94306_95473_-	cd17339, MFS_NIMT_CynX_like, 2-nitroimidazole and cyanate transporters and similar proteins of the Major Facilitator Superfamily of transporters	NA|356aa|down_1|AP018937.1_95623_96691_+	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|617aa|down_2|AP018937.1_96729_98580_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|211aa|down_3|AP018937.1_98882_99515_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|291aa|down_4|AP018937.1_99536_100409_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|224aa|down_5|AP018937.1_100417_101089_-	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|196aa|down_6|AP018937.1_101330_101918_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|96aa|down_7|AP018937.1_102721_103009_+	TIGR01653, hypothetical_protein, bacteriocin, lactococcin 972 family	NA|703aa|down_8|AP018937.1_103060_105169_+	TIGR01654, unnamed_protein_product, bacteriocin-associated integral membrane (putative immunity) protein	NA|214aa|down_9|AP018937.1_105165_105807_+	TIGR03608, L_ocin_972_ABC, putative bacteriocin export ABC transporter, lactococcin 972 group
GCA_003966505.1_ASM396650v1	AP018937	Streptococcus pneumoniae HU-OH DNA, complete genome	2	1036691-1036770	2	CRISPRCasFinder	no		cas3,DEDDh,DinG,RT	Orphan	GGGCCAAGCGGTGGCGGACACCAGAA	26	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,RT	NA|100aa|up_7|AP018937.1_1027258_1027558_+,NA|163aa|up_6|AP018937.1_1027547_1028036_+,NA|81aa|up_4|AP018937.1_1029933_1030176_+,NA|285aa|up_3|AP018937.1_1030192_1031047_+,NA|44aa|down_7|AP018937.1_1044104_1044236_-	NA|197aa|up_9|AP018937.1_1025549_1026140_+	pfam13338, AbiEi_4, Transcriptional regulator, AbiEi antitoxin	NA|279aa|up_8|AP018937.1_1026139_1026976_+	pfam08843, AbiEii, Nucleotidyl transferase AbiEii toxin, Type IV TA system	NA|100aa|up_7|AP018937.1_1027258_1027558_+	NA	NA|163aa|up_6|AP018937.1_1027547_1028036_+	NA	NA|627aa|up_5|AP018937.1_1028032_1029913_+	COG3505, VirD4, Type IV secretory pathway, VirD4 components [Intracellular trafficking and secretion]	NA|81aa|up_4|AP018937.1_1029933_1030176_+	NA	NA|285aa|up_3|AP018937.1_1030192_1031047_+	NA	NA|120aa|up_2|AP018937.1_1031100_1031460_+	pfam12666, PrgI, PrgI family protein	NA|786aa|up_1|AP018937.1_1031410_1033768_+	TIGR02746, hypothetical_protein, type-IV secretion system protein TraC	NA|938aa|up_0|AP018937.1_1033779_1036593_+	pfam18013, Phage_lysozyme2, Phage tail lysozyme	NA|406aa|down_0|AP018937.1_1036997_1038215_-	pfam00589, Phage_integrase, Phage integrase family	NA|68aa|down_1|AP018937.1_1038296_1038500_-	pfam09035, Tn916-Xis, Excisionase from transposon Tn916	NA|77aa|down_2|AP018937.1_1038960_1039191_-	pfam12645, HTH_16, Helix-turn-helix domain	NA|141aa|down_3|AP018937.1_1039187_1039610_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|80aa|down_4|AP018937.1_1040114_1040354_+	pfam01381, HTH_3, Helix-turn-helix	NA|973aa|down_5|AP018937.1_1040409_1043328_-	pfam01526, DDE_Tnp_Tn3, Tn3 transposase DDE domain	NA|185aa|down_6|AP018937.1_1043331_1043886_-	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|44aa|down_7|AP018937.1_1044104_1044236_-	NA	NA|246aa|down_8|AP018937.1_1044240_1044978_-	pfam00398, RrnaAD, Ribosomal RNA adenine dimethylase	NA|640aa|down_9|AP018937.1_1045902_1047822_-	cd04168, TetM_like, Tet(M)-like family includes Tet(M), Tet(O), Tet(W), and OtrA, containing tetracycline resistant proteins
