assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_010202265.1_ASM1020226v1	NZ_CP047918	Candidatus Saccharibacteria bacterium FS14P chromosome, complete genome	1	90313-90528	1	CRT	no		csn2,cas2,cas1,cas9,DEDDh	Orphan	CCGGAGCCGACACCGACGCCAGAT	24	2	2	90337-90354|90433-90450	NZ_CP047918.1_90295-90312|NZ_CP047918.1_90295-90312	NA	4	4	Orphan	csn2,cas2,cas1,cas9,DEDDh	NA|299aa|up_9|NZ_CP047918.1_79906_80803_+,NA|170aa|down_1|NZ_CP047918.1_91933_92443_+,NA|720aa|down_2|NZ_CP047918.1_92439_94599_+,NA|246aa|down_4|NZ_CP047918.1_95693_96431_+,NA|227aa|down_5|NZ_CP047918.1_96430_97111_+,NA|78aa|down_6|NZ_CP047918.1_97110_97344_+	NA|299aa|up_9|NZ_CP047918.1_79906_80803_+	NA	NA|193aa|up_8|NZ_CP047918.1_80858_81437_+	cd10030, UDG-F4_TTUDGA_SPO1dp_like, Uracil DNA glycosylase family 4, includes Thermotoga maritima TTUDGA, Bacillus phage SPO1 DNA polymerase, and similar proteins	NA|710aa|up_7|NZ_CP047918.1_81562_83692_+	COG0595, COG0595, mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair]	NA|743aa|up_6|NZ_CP047918.1_83838_86067_+	COG1674, FtsK, DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning]	NA|269aa|up_5|NZ_CP047918.1_86063_86870_-	COG0077, PheA, Prephenate dehydratase [Amino acid transport and metabolism]	NA|121aa|up_4|NZ_CP047918.1_86987_87350_+	cd00473, bS6, Bacterial ribosomal protein S6	NA|153aa|up_3|NZ_CP047918.1_87429_87888_+	pfam00436, SSB, Single-strand binding protein family	NA|66aa|up_2|NZ_CP047918.1_87897_88095_+	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|187aa|up_1|NZ_CP047918.1_88098_88659_+	PRK00529, PRK00529, elongation factor P; Validated	NA|242aa|up_0|NZ_CP047918.1_88758_89484_+	COG1189, COG1189, Predicted rRNA methylase [Translation, ribosomal structure and biogenesis]	NA|361aa|down_0|NZ_CP047918.1_90760_91843_-	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|170aa|down_1|NZ_CP047918.1_91933_92443_+	NA	NA|720aa|down_2|NZ_CP047918.1_92439_94599_+	NA	NA|348aa|down_3|NZ_CP047918.1_94650_95694_+	COG4972, PilM, Tfp pilus assembly protein, ATPase PilM [Cell motility and secretion / Intracellular trafficking and secretion]	NA|246aa|down_4|NZ_CP047918.1_95693_96431_+	NA	NA|227aa|down_5|NZ_CP047918.1_96430_97111_+	NA	NA|78aa|down_6|NZ_CP047918.1_97110_97344_+	NA	NA|587aa|down_7|NZ_CP047918.1_97356_99117_+	COG2804, PulE, Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB [Cell motility and secretion / Intracellular trafficking and secretion]	NA|357aa|down_8|NZ_CP047918.1_99132_100203_+	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|403aa|down_9|NZ_CP047918.1_100207_101416_+	COG1459, PulF, Type II secretory pathway, component PulF [Cell motility and secretion / Intracellular trafficking and secretion]
GCF_010202265.1_ASM1020226v1	NZ_CP047918	Candidatus Saccharibacteria bacterium FS14P chromosome, complete genome	2	170353-170428	1	CRISPRCasFinder	no		csn2,cas2,cas1,cas9,DEDDh	Orphan	ACGGAACCGTCAATGGTGATGTCTA	25	0	0	NA	NA	NA	1	1	Orphan	csn2,cas2,cas1,cas9,DEDDh	NA|92aa|up_3|NZ_CP047918.1_165164_165440_+,NA|82aa|down_1|NZ_CP047918.1_171693_171939_+,NA|262aa|down_7|NZ_CP047918.1_176446_177232_+	NA|152aa|up_9|NZ_CP047918.1_160362_160818_+	pfam09424, YqeY, Yqey-like protein	NA|184aa|up_8|NZ_CP047918.1_160837_161389_+	PRK00279, adk, adenylate kinase; Reviewed	NA|255aa|up_7|NZ_CP047918.1_161376_162141_+	cd01086, MetAP1, Methionine Aminopeptidase 1	NA|412aa|up_6|NZ_CP047918.1_162176_163412_+	COG0849, ftsA, Cell division ATPase FtsA [Cell division and chromosome partitioning]	NA|398aa|up_5|NZ_CP047918.1_163489_164683_+	PRK09330, PRK09330, cell division protein FtsZ; Validated	NA|150aa|up_4|NZ_CP047918.1_164708_165158_+	PRK00464, nrdR, transcriptional repressor NrdR	NA|92aa|up_3|NZ_CP047918.1_165164_165440_+	NA	NA|203aa|up_2|NZ_CP047918.1_165423_166032_+	cd12922, VKOR_5, Vitamin K epoxide reductase family in bacteria	NA|280aa|up_1|NZ_CP047918.1_166306_167146_+	PRK00865, PRK00865, glutamate racemase; Provisional	NA|959aa|up_0|NZ_CP047918.1_167229_170106_+	PRK06039, ileS, isoleucyl-tRNA synthetase; Reviewed	NA|57aa|down_0|NZ_CP047918.1_171379_171550_+	pfam04024, PspC, PspC domain	NA|82aa|down_1|NZ_CP047918.1_171693_171939_+	NA	NA|102aa|down_2|NZ_CP047918.1_172007_172313_+	pfam00829, Ribosomal_L21p, Ribosomal prokaryotic L21 protein	NA|255aa|down_3|NZ_CP047918.1_172402_173167_+	pfam13614, AAA_31, AAA domain	NA|296aa|down_4|NZ_CP047918.1_173156_174044_+	TIGR04285, parB-like_partition_protein, nucleoid occlusion protein	NA|203aa|down_5|NZ_CP047918.1_174053_174662_-	TIGR02227, Inactive_signal_peptidase_IA	NA|556aa|down_6|NZ_CP047918.1_174757_176425_+	COG1316, LytR, Transcriptional regulator [Transcription]	NA|262aa|down_7|NZ_CP047918.1_176446_177232_+	NA	NA|384aa|down_8|NZ_CP047918.1_177260_178412_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|274aa|down_9|NZ_CP047918.1_178404_179226_-	TIGR03534, RF_mod_PrmC, protein-(glutamine-N5) methyltransferase, release factor-specific
GCF_010202265.1_ASM1020226v1	NZ_CP047918	Candidatus Saccharibacteria bacterium FS14P chromosome, complete genome	3	261888-262789	2,1,2	CRT,PILER-CR,CRISPRCasFinder	no	csn2,cas2,cas1,cas9	csn2,cas2,cas1,cas9,DEDDh	Type II-C,Type II-B,Type II-A	GTTTCAGTACCATTCAATATAACACTACTGTAAAAC,GTTTCAGTACCATTCAATATAACACTACTGTAAAAC,GTTTCAGTACCATTCAATATAACACTACTGTAAAAC	36,36,36	1	1	261924-261961	NZ_CP047918.1_290646-290683	NA:NA:NA	13,12,12	13	TypeII-C,TypeII-B,TypeII-A	csn2,cas2,cas1,cas9,DEDDh	NA|117aa|up_8|NZ_CP047918.1_246053_246404_+,NA|317aa|up_3|NZ_CP047918.1_251677_252628_+,NA|133aa|up_2|NZ_CP047918.1_253424_253823_-,NA|110aa|down_7|NZ_CP047918.1_269035_269365_-,NA|82aa|down_9|NZ_CP047918.1_269911_270157_+	NA|297aa|up_9|NZ_CP047918.1_244952_245843_+	PRK00050, PRK00050, 16S rRNA (cytosine(1402)-N(4))-methyltransferase RsmH	NA|117aa|up_8|NZ_CP047918.1_246053_246404_+	NA	NA|584aa|up_7|NZ_CP047918.1_246416_248168_+	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|349aa|up_6|NZ_CP047918.1_248187_249234_+	cd06852, GT_MraY, Phospho-N-acetylmuramoyl-pentapeptide-transferase (mraY) is an enzyme responsible for the formation of the first lipid intermediate in the synthesis of bacterial cell wall peptidoglycan	NA|440aa|up_5|NZ_CP047918.1_249236_250556_+	COG0772, FtsW, Bacterial cell division membrane protein [Cell division and chromosome partitioning]	NA|386aa|up_4|NZ_CP047918.1_250482_251640_+	PRK00726, murG, undecaprenyldiphospho-muramoylpentapeptide beta-N- acetylglucosaminyltransferase; Provisional	NA|317aa|up_3|NZ_CP047918.1_251677_252628_+	NA	NA|133aa|up_2|NZ_CP047918.1_253424_253823_-	NA	NA|2177aa|up_1|NZ_CP047918.1_254060_260591_+	pfam03382, DUF285, Mycoplasma protein of unknown function, DUF285	NA|171aa|up_0|NZ_CP047918.1_260651_261164_-	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	csn2|230aa|down_0|NZ_CP047918.1_262843_263533_-	cd09644, Csn2, CRISPR/Cas system-associated protein Csn2	cas2|107aa|down_1|NZ_CP047918.1_263529_263850_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|175aa|down_2|NZ_CP047918.1_263852_264377_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas1|134aa|down_3|NZ_CP047918.1_264328_264730_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas9|214aa|down_4|NZ_CP047918.1_264733_265375_-	pfam16595, Cas9_PI, PAM-interacting domain of CRISPR-associated endonuclease Cas9	cas9|1004aa|down_5|NZ_CP047918.1_265236_268248_-	pfam16592, Cas9_REC, REC lobe of CRISPR-associated endonuclease Cas9	cas9|167aa|down_6|NZ_CP047918.1_268253_268754_-	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	NA|110aa|down_7|NZ_CP047918.1_269035_269365_-	NA	NA|148aa|down_8|NZ_CP047918.1_269440_269884_-	COG3505, VirD4, Type IV secretory pathway, VirD4 components [Intracellular trafficking and secretion]	NA|82aa|down_9|NZ_CP047918.1_269911_270157_+	NA
GCF_010202265.1_ASM1020226v1	NZ_CP047918	Candidatus Saccharibacteria bacterium FS14P chromosome, complete genome	4	290684-291511	2,3,3	PILER-CR,CRISPRCasFinder,CRT	no	csn2,cas2,cas1,cas9	csn2,cas2,cas1,cas9,DEDDh	Type II-C,Type II-B,Type II-A	GTTTCAGTACCATTCAATATAACACTACTGTAAAAC,GTTTCAGTACCATTCAATATAACACTACTGTAAAAC,GTTTCAGTACCATTCAATATAACACTACTGTAAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	12,12,12	12	TypeII-C,TypeII-B,TypeII-A	csn2,cas2,cas1,cas9,DEDDh	NA|93aa|up_7|NZ_CP047918.1_285445_285724_+,NA|51aa|up_6|NZ_CP047918.1_285741_285894_+,NA|78aa|up_4|NZ_CP047918.1_286782_287016_+,NA|84aa|up_1|NZ_CP047918.1_289057_289309_+,NA|123aa|down_7|NZ_CP047918.1_304655_305024_-,NA|215aa|down_8|NZ_CP047918.1_305353_305998_-,NA|143aa|down_9|NZ_CP047918.1_306253_306682_-	NA|118aa|up_9|NZ_CP047918.1_284147_284501_+	pfam03382, DUF285, Mycoplasma protein of unknown function, DUF285	NA|140aa|up_8|NZ_CP047918.1_284478_284898_+	pfam03382, DUF285, Mycoplasma protein of unknown function, DUF285	NA|93aa|up_7|NZ_CP047918.1_285445_285724_+	NA	NA|51aa|up_6|NZ_CP047918.1_285741_285894_+	NA	NA|78aa|up_5|NZ_CP047918.1_286164_286398_+	pfam09479, Flg_new, Listeria-Bacteroides repeat domain (List_Bact_rpt)	NA|78aa|up_4|NZ_CP047918.1_286782_287016_+	NA	NA|96aa|up_3|NZ_CP047918.1_286969_287257_+	NF033189, internalin_A, class 1 internalin InlA	NA|462aa|up_2|NZ_CP047918.1_287600_288986_+	NF012196, Ig_like_ice, Ig-like domain-containing protein	NA|84aa|up_1|NZ_CP047918.1_289057_289309_+	NA	NA|171aa|up_0|NZ_CP047918.1_289374_289887_-	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	csn2|230aa|down_0|NZ_CP047918.1_291565_292255_-	cd09644, Csn2, CRISPR/Cas system-associated protein Csn2	cas2|107aa|down_1|NZ_CP047918.1_292251_292572_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|293aa|down_2|NZ_CP047918.1_292574_293453_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1341aa|down_3|NZ_CP047918.1_293456_297479_-	pfam16592, Cas9_REC, REC lobe of CRISPR-associated endonuclease Cas9	NA|941aa|down_4|NZ_CP047918.1_297760_300583_-	COG0433, COG0433,  HerA helicase [Replication, recombination, and repair]	NA|353aa|down_5|NZ_CP047918.1_300708_301767_-	COG0568, RpoD, DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) [Transcription]	NA|781aa|down_6|NZ_CP047918.1_302205_304548_-	PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated	NA|123aa|down_7|NZ_CP047918.1_304655_305024_-	NA	NA|215aa|down_8|NZ_CP047918.1_305353_305998_-	NA	NA|143aa|down_9|NZ_CP047918.1_306253_306682_-	NA
