assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001457555.1_NCTC10562	NZ_LN831027	Fusobacterium nucleatum subsp. polymorphum strain NCTC10562 chromosome 1	1	863498-863575	1	CRISPRCasFinder	no		WYL,cas14k,cas3,cas14j,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG	Orphan	CAGTATAATCAAAAGAAATACAC	23	0	0	NA	NA	NA	1	1	Orphan	WYL,cas14k,cas3,cas14j,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG	NA|140aa|up_7|NZ_LN831027.1_854206_854626_+,NA	NA|599aa|up_9|NZ_LN831027.1_851652_853449_+	cd03819, GT4_WavL-like, Vibrio cholerae WavL and similar sequences	NA|247aa|up_8|NZ_LN831027.1_853451_854192_+	COG3713, OmpV, Outer membrane protein V [Cell envelope biogenesis, outer membrane]	NA|140aa|up_7|NZ_LN831027.1_854206_854626_+	NA	NA|328aa|up_6|NZ_LN831027.1_854694_855678_+	pfam14568, SUKH_6, SMI1-KNR4 cell-wall	NA|288aa|up_5|NZ_LN831027.1_855680_856544_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|512aa|up_4|NZ_LN831027.1_856784_858320_+	cd08499, PBP2_Ylib_like, The substrate-binding component of an uncharacterized ABC-type peptide import system Ylib contains the type 2 periplasmic binding fold	NA|309aa|up_3|NZ_LN831027.1_858403_859330_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|290aa|up_2|NZ_LN831027.1_859339_860209_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|336aa|up_1|NZ_LN831027.1_860227_861235_+	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|325aa|up_0|NZ_LN831027.1_861227_862202_+	PRK15079, PRK15079, oligopeptide ABC transporter ATP-binding protein OppF; Provisional	NA|95aa|down_0|NZ_LN831027.1_864466_864751_+	COG1862, YajC, Preprotein translocase subunit YajC [Intracellular trafficking and secretion]	NA|343aa|down_1|NZ_LN831027.1_864846_865875_+	COG0860, AmiC, N-acetylmuramoyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]	NA|145aa|down_2|NZ_LN831027.1_865879_866314_+	COG2031, AtoE, Short chain fatty acids transporter [Lipid metabolism]	NA|358aa|down_3|NZ_LN831027.1_866338_867412_+	PRK00591, prfA, peptide chain release factor 1; Validated	NA|384aa|down_4|NZ_LN831027.1_867411_868563_+	PRK09328, PRK09328, N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional	NA|344aa|down_5|NZ_LN831027.1_868543_869575_+	PRK00147, queA, S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional	NA|183aa|down_6|NZ_LN831027.1_869587_870136_+	COG0742, COG0742, N6-adenine-specific methylase [DNA replication, recombination, and repair]	NA|71aa|down_7|NZ_LN831027.1_870346_870559_+	COG1722, XseB, Exonuclease VII small subunit [DNA replication, recombination, and repair]	NA|298aa|down_8|NZ_LN831027.1_870560_871454_+	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]	NA|231aa|down_9|NZ_LN831027.1_871586_872279_+	pfam01255, Prenyltransf, Putative undecaprenyl diphosphate synthase
GCF_001457555.1_NCTC10562	NZ_LN831027	Fusobacterium nucleatum subsp. polymorphum strain NCTC10562 chromosome 1	2	1026216-1028166	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2	WYL,cas14k,cas3,cas14j,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG	Unclear	ATTTATGTATTTCTATATTAGAATTTAAA,ATTTATGTATTTCTATATTAGAATTTAAAT,ATTTATGTATTTCTATATTAGAATTTAAAT	29,30,30	0	0	NA	NA	NA:NA:NA	28,29,29	29	Unclear	WYL,cas14k,cas3,cas14j,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG	NA,NA	NA|249aa|up_9|NZ_LN831027.1_1015177_1015924_+	cd01411, SIR2H, SIR2H: Uncharacterized prokaryotic Sir2 homologs from several gram positive bacterial species and Fusobacteria; and are members of the SIR2 family of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation	NA|300aa|up_8|NZ_LN831027.1_1016396_1017296_+	COG4823, AbiF, Abortive infection bacteriophage resistance protein [Defense mechanisms]	cas6|251aa|up_7|NZ_LN831027.1_1017324_1018077_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas8b2|515aa|up_6|NZ_LN831027.1_1018066_1019611_+	pfam09657, Cas_Csx8, CRISPR-associated protein Csx8 (Cas_Csx8)	cas7|301aa|up_5|NZ_LN831027.1_1019623_1020526_+	TIGR01875, CRISPR-associated_protein_Cas7/Cst2/DevR, CRISPR-associated autoregulator DevR family	cas5|367aa|up_4|NZ_LN831027.1_1020539_1021640_+	TIGR02593, CRISPR-associated_protein_Cas5, CRISPR-associated protein Cas5, N-terminal domain	cas3|813aa|up_3|NZ_LN831027.1_1021731_1024170_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas4|165aa|up_2|NZ_LN831027.1_1024239_1024734_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|331aa|up_1|NZ_LN831027.1_1024745_1025738_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|107aa|up_0|NZ_LN831027.1_1025700_1026021_+	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	NA|335aa|down_0|NZ_LN831027.1_1028608_1029613_+	PRK09653, eutD, phosphotransacetylase	NA|399aa|down_1|NZ_LN831027.1_1029680_1030877_+	PRK00180, PRK00180, acetate kinase A/propionate kinase 2; Reviewed	NA|1192aa|down_2|NZ_LN831027.1_1031022_1034598_+	TIGR02176, pyruvate_flavodoxin/ferrodoxin_oxidoreductase, pyruvate:ferredoxin (flavodoxin) oxidoreductase, homodimeric	NA|319aa|down_3|NZ_LN831027.1_1034816_1035773_+	TIGR01771, L-lactate_dehydrogenase, L-lactate dehydrogenase	NA|405aa|down_4|NZ_LN831027.1_1035793_1037008_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|340aa|down_5|NZ_LN831027.1_1037043_1038063_-	PRK09478, mglC, galactose/methyl galactoside ABC transporter permease MglC	NA|501aa|down_6|NZ_LN831027.1_1038083_1039586_-	PRK10982, PRK10982, galactose/methyl galaxtoside transporter ATP-binding protein; Provisional	NA|342aa|down_7|NZ_LN831027.1_1039676_1040702_-	PRK15395, PRK15395, galactose/glucose ABC transporter substrate-binding protein MglB	NA|316aa|down_8|NZ_LN831027.1_1040898_1041846_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|381aa|down_9|NZ_LN831027.1_1041867_1043010_-	cd16283, RomA-like_MBL-fold, Enterobacter cloacae RomA and related proteins; MBL-fold metallo hydrolase domain
GCF_001457555.1_NCTC10562	NZ_LN831027	Fusobacterium nucleatum subsp. polymorphum strain NCTC10562 chromosome 1	3	1913843-1913939	3	CRISPRCasFinder	no		WYL,cas14k,cas3,cas14j,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG	Orphan	TGTTTAACTTTATATTGGTATTATTGAGAAAA	32	0	0	NA	NA	NA	1	1	Orphan	WYL,cas14k,cas3,cas14j,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG	NA|137aa|up_4|NZ_LN831027.1_1911367_1911778_-,NA|145aa|up_3|NZ_LN831027.1_1912032_1912467_-,NA|64aa|up_2|NZ_LN831027.1_1912574_1912766_-,NA|61aa|up_1|NZ_LN831027.1_1912789_1912972_-,NA|139aa|down_2|NZ_LN831027.1_1917765_1918182_-,NA|112aa|down_9|NZ_LN831027.1_1922439_1922775_-	NA|369aa|up_9|NZ_LN831027.1_1905994_1907101_-	TIGR01554, prophage_Lp3_protein_18, phage major capsid protein, HK97 family	NA|237aa|up_8|NZ_LN831027.1_1907105_1907816_-	cd07016, S14_ClpP_1, Caseinolytic protease (ClpP) is an ATP-dependent, highly conserved serine protease	NA|385aa|up_7|NZ_LN831027.1_1907808_1908963_-	pfam04860, Phage_portal, Phage portal protein	NA|575aa|up_6|NZ_LN831027.1_1909019_1910744_-	COG4626, COG4626, Phage terminase-like protein, large subunit [General function prediction only]	NA|162aa|up_5|NZ_LN831027.1_1910744_1911230_-	TIGR01558, hypothetical_protein, phage terminase, small subunit, putative, P27 family	NA|137aa|up_4|NZ_LN831027.1_1911367_1911778_-	NA	NA|145aa|up_3|NZ_LN831027.1_1912032_1912467_-	NA	NA|64aa|up_2|NZ_LN831027.1_1912574_1912766_-	NA	NA|61aa|up_1|NZ_LN831027.1_1912789_1912972_-	NA	NA|146aa|up_0|NZ_LN831027.1_1912972_1913410_-	pfam11753, DUF3310, Protein of unknwon function (DUF3310)	NA|640aa|down_0|NZ_LN831027.1_1913974_1915894_-	COG3378, COG3378, Phage associated DNA primase [General function prediction only]	NA|588aa|down_1|NZ_LN831027.1_1915924_1917688_-	cd05538, POLBc_Pol_II_B, DNA polymerase type-II B subfamily catalytic domain	NA|139aa|down_2|NZ_LN831027.1_1917765_1918182_-	NA	NA|219aa|down_3|NZ_LN831027.1_1918366_1919023_-	pfam13479, AAA_24, AAA domain	NA|221aa|down_4|NZ_LN831027.1_1919060_1919723_-	COG3617, COG3617, Prophage antirepressor [Transcription]	NA|311aa|down_5|NZ_LN831027.1_1919770_1920703_-	TIGR03033, hypothetical_protein, putative phage-type endonuclease	NA|402aa|down_6|NZ_LN831027.1_1920695_1921901_-	cd18010, DEXHc_HARP_SMARCAL1, DEXH-box helicase domain of SMARCAL1	NA|108aa|down_7|NZ_LN831027.1_1921897_1922221_-	smart00990, VRR_NUC, This model contains proteins with the VRR-NUC domain	NA|61aa|down_8|NZ_LN831027.1_1922264_1922447_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|112aa|down_9|NZ_LN831027.1_1922439_1922775_-	NA
GCF_001457555.1_NCTC10562	NZ_LN831027	Fusobacterium nucleatum subsp. polymorphum strain NCTC10562 chromosome 1	4	2136381-2136509	4	CRISPRCasFinder	no		WYL,cas14k,cas3,cas14j,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG	Orphan	TAGAAAGTTACTAGAAAATACTAGAAAGTTACTTGTTAGTTTT	43	0	0	NA	NA	NA	1	1	Orphan	WYL,cas14k,cas3,cas14j,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG	NA|184aa|up_9|NZ_LN831027.1_2126341_2126893_+,NA|126aa|up_8|NZ_LN831027.1_2126917_2127295_+,NA|59aa|up_7|NZ_LN831027.1_2127405_2127582_-,NA|108aa|down_1|NZ_LN831027.1_2139628_2139952_-,NA|196aa|down_2|NZ_LN831027.1_2139990_2140578_-	NA|184aa|up_9|NZ_LN831027.1_2126341_2126893_+	NA	NA|126aa|up_8|NZ_LN831027.1_2126917_2127295_+	NA	NA|59aa|up_7|NZ_LN831027.1_2127405_2127582_-	NA	NA|62aa|up_6|NZ_LN831027.1_2127630_2127816_-	pfam08139, LPAM_1, Prokaryotic membrane lipoprotein lipid attachment site	NA|198aa|up_5|NZ_LN831027.1_2127812_2128406_-	COG1853, COG1853, Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family [General function prediction only]	NA|441aa|up_4|NZ_LN831027.1_2128407_2129730_-	cd13143, MATE_MepA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Streptococcus aureus MepA	NA|501aa|up_3|NZ_LN831027.1_2129852_2131355_+	cd03399, SPFH_flotillin, Flotillin or reggie family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|1014aa|up_2|NZ_LN831027.1_2131396_2134438_-	COG0610, COG0610, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|179aa|up_1|NZ_LN831027.1_2134455_2134992_-	cd17288, RMtype1_S_LlaAI06ORF1089P_TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Lactococcus lactis S subunit (S	NA|427aa|up_0|NZ_LN831027.1_2135003_2136284_-	cd17281, RMtype1_S_HpyAXIII_TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Helicobacter pylori 26695 S subunit (S	NA|521aa|down_0|NZ_LN831027.1_2137948_2139511_-	TIGR00497, hsdM, type I restriction system adenine methylase (hsdM)	NA|108aa|down_1|NZ_LN831027.1_2139628_2139952_-	NA	NA|196aa|down_2|NZ_LN831027.1_2139990_2140578_-	NA	NA|444aa|down_3|NZ_LN831027.1_2140601_2141933_-	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|378aa|down_4|NZ_LN831027.1_2142157_2143291_-	PRK00112, tgt, queuine tRNA-ribosyltransferase; Provisional	NA|726aa|down_5|NZ_LN831027.1_2143305_2145483_-	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|171aa|down_6|NZ_LN831027.1_2145501_2146014_-	PRK02304, PRK02304, adenine phosphoribosyltransferase; Provisional	NA|268aa|down_7|NZ_LN831027.1_2146028_2146832_-	sd00006, TPR, Tetratricopeptide repeat	NA|225aa|down_8|NZ_LN831027.1_2146846_2147521_-	COG2928, COG2928, Uncharacterized conserved protein [Function unknown]	NA|427aa|down_9|NZ_LN831027.1_2147517_2148798_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]
GCF_001457555.1_NCTC10562	NZ_LN831028	Fusobacterium nucleatum subsp. polymorphum strain NCTC10562 plasmid 2, complete sequence	1	3467-3546	1	CRISPRCasFinder	no			Orphan	GTCGCCTTTATATTCATTTATAATA	25	0	0	NA	NA	NA	1	1	Orphan	WYL,cas14k,cas3,cas14j,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG	NA|103aa|up_0|NZ_LN831028.1_1767_2076_-,NA|60aa|down_0|NZ_LN831028.1_3579_3759_+,NA|82aa|down_2|NZ_LN831028.1_5190_5436_+,NA|103aa|down_3|NZ_LN831028.1_5512_5821_-,NA|107aa|down_4|NZ_LN831028.1_6220_6541_+,NA|109aa|down_5|NZ_LN831028.1_6559_6886_+,NA|199aa|down_7|NZ_LN831028.1_7014_7611_+	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|330aa|up_2|NZ_LN831028.1_0_990_+	pfam01051, Rep_3, Initiator Replication protein	NA|203aa|up_1|NZ_LN831028.1_1080_1689_+	pfam13654, AAA_32, AAA domain	NA|103aa|up_0|NZ_LN831028.1_1767_2076_-	NA	NA|60aa|down_0|NZ_LN831028.1_3579_3759_+	NA	NA|248aa|down_1|NZ_LN831028.1_4115_4859_+	pfam01051, Rep_3, Initiator Replication protein	NA|82aa|down_2|NZ_LN831028.1_5190_5436_+	NA	NA|103aa|down_3|NZ_LN831028.1_5512_5821_-	NA	NA|107aa|down_4|NZ_LN831028.1_6220_6541_+	NA	NA|109aa|down_5|NZ_LN831028.1_6559_6886_+	NA	NA|72aa|down_6|NZ_LN831028.1_6857_7073_+	pfam03432, Relaxase, Relaxase/Mobilisation nuclease domain	NA|199aa|down_7|NZ_LN831028.1_7014_7611_+	NA	NA|398aa|down_8|NZ_LN831028.1_7570_8764_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|202aa|down_9|NZ_LN831028.1_9132_9738_+	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain
