assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000400875.1_ASM40087v1	NC_021281	Fusobacterium nucleatum subsp. animalis 4_8, complete sequence	1	172969-176486	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas5,cas3,cas4,cas1,cas2	WYL,cas14j,cas6,cas5,cas3,cas4,cas1,cas2,DinG,PD-DExK,csa3	Unclear	ATGAACTGTAAACTTGAAAAGTTTTGAAAT,ATGAACTGTAAACTTGAAAAGTTTTGAAAT,ATGAACTNTAAACTTGAAAAGTTTTGAAAT	30,30,30	0	0	NA	NA	NA:NA:NA	51,51,53	53	Unclear	WYL,cas14j,cas6,cas5,cas3,cas4,cas1,cas2,DinG,PD-DExK,csa3	NA,NA|85aa|down_0|NC_021281.1_176754_177009_+	NA|248aa|up_9|NC_021281.1_158124_158868_+	pfam14080, DUF4261, Domain of unknown function (DUF4261)	NA|473aa|up_8|NC_021281.1_160240_161659_+	PRK09206, PRK09206, pyruvate kinase PykF	NA|435aa|up_7|NC_021281.1_161684_162989_+	PRK00077, eno, enolase; Provisional	NA|185aa|up_6|NC_021281.1_163115_163670_+	COG3758, COG3758, Uncharacterized protein conserved in bacteria [Function unknown]	cas6|248aa|up_5|NC_021281.1_164171_164915_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas5|257aa|up_4|NC_021281.1_167606_168377_+	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas3|846aa|up_3|NC_021281.1_168446_170984_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas4|163aa|up_2|NC_021281.1_170994_171483_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|333aa|up_1|NC_021281.1_171494_172493_+	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas2|95aa|up_0|NC_021281.1_172511_172796_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|85aa|down_0|NC_021281.1_176754_177009_+	NA	NA|245aa|down_1|NC_021281.1_177204_177939_+	cd08563, GDPD_TtGDE_like, Glycerophosphodiester phosphodiesterase domain of Thermoanaerobacter tengcongensis and similar proteins	NA|455aa|down_2|NC_021281.1_177944_179309_+	COG1115, AlsT, Na+/alanine symporter [Amino acid transport and metabolism]	NA|176aa|down_3|NC_021281.1_179317_179845_+	cd02908, Macro_OAADPr_deacetylase, macrodomain, O-acetyl-ADP-ribose (OAADPr) family	NA|463aa|down_4|NC_021281.1_179988_181377_+	cd01087, Prolidase, Prolidase	NA|120aa|down_5|NC_021281.1_181386_181746_+	cd11535, NTP-PPase_SsMazG, Nucleoside Triphosphate Pyrophosphohydrolase (EC 3	NA|157aa|down_6|NC_021281.1_186983_187454_+	COG0779, COG0779, Uncharacterized protein conserved in bacteria [Function unknown]	NA|360aa|down_7|NC_021281.1_187481_188561_+	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|177aa|down_8|NC_021281.1_188553_189084_+	COG2740, COG2740, Predicted nucleic-acid-binding protein implicated in transcription termination [Transcription]	NA|748aa|down_9|NC_021281.1_189080_191324_+	PRK05306, infB, translation initiation factor IF-2; Validated
GCF_000400875.1_ASM40087v1	NC_021281	Fusobacterium nucleatum subsp. animalis 4_8, complete sequence	2	312280-312401	2	CRISPRCasFinder	no		WYL,cas14j,cas6,cas5,cas3,cas4,cas1,cas2,DinG,PD-DExK,csa3	Orphan	TTTTTCAGGATTTTTTCAAGATAAAAAAAATCCTTTT	37	0	0	NA	NA	NA	1	1	Orphan	WYL,cas14j,cas6,cas5,cas3,cas4,cas1,cas2,DinG,PD-DExK,csa3	NA|89aa|up_3|NC_021281.1_308615_308882_-,NA	NA|184aa|up_9|NC_021281.1_302170_302722_-	COG1971, COG1971, Predicted membrane protein [Function unknown]	NA|501aa|up_8|NC_021281.1_302835_304338_+	COG0606, COG0606, Predicted ATPase with chaperone activity [Posttranslational modification, protein turnover, chaperones]	NA|317aa|up_7|NC_021281.1_304573_305524_+	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|470aa|up_6|NC_021281.1_305526_306936_+	PRK08207, PRK08207, coproporphyrinogen III oxidase; Provisional	NA|181aa|up_5|NC_021281.1_306932_307475_+	COG1555, ComEA, DNA uptake protein and related DNA-binding proteins [DNA replication, recombination, and repair]	NA|286aa|up_4|NC_021281.1_307462_308320_+	COG1281, COG1281, Disulfide bond chaperones of the HSP33 family [Posttranslational modification, protein turnover, chaperones]	NA|89aa|up_3|NC_021281.1_308615_308882_-	NA	NA|264aa|up_2|NC_021281.1_309003_309795_+	pfam13277, YmdB, YmdB-like protein	NA|311aa|up_1|NC_021281.1_309775_310708_+	COG2264, PrmA, Ribosomal protein L11 methylase [Translation, ribosomal structure and biogenesis]	NA|508aa|up_0|NC_021281.1_310750_312274_+	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|220aa|down_0|NC_021281.1_312432_313092_+	PRK00023, cmk, (d)CMP kinase	NA|156aa|down_1|NC_021281.1_313104_313572_+	pfam08239, SH3_3, Bacterial SH3 domain	NA|641aa|down_2|NC_021281.1_313584_315507_+	COG1519, KdtA, 3-deoxy-D-manno-octulosonic-acid transferase [Cell envelope biogenesis, outer membrane]	NA|426aa|down_3|NC_021281.1_315833_317111_+	PRK01117, PRK01117, adenylosuccinate synthetase; Provisional	NA|731aa|down_4|NC_021281.1_317372_319565_+	COG5324, COG5324, Uncharacterized conserved protein [Function unknown]	NA|157aa|down_5|NC_021281.1_319582_320053_+	COG1683, COG1683, Uncharacterized conserved protein [Function unknown]	NA|196aa|down_6|NC_021281.1_320281_320869_-	COG1739, COG1739, Uncharacterized conserved protein [Function unknown]	NA|479aa|down_7|NC_021281.1_320944_322381_-	COG0260, PepB, Leucyl aminopeptidase [Amino acid transport and metabolism]	NA|1510aa|down_8|NC_021281.1_322639_327169_+	pfam03797, Autotransporter, Autotransporter beta-domain	NA|162aa|down_9|NC_021281.1_327288_327774_+	COG2131, ComEB, Deoxycytidylate deaminase [Nucleotide transport and metabolism]
GCF_000400875.1_ASM40087v1	NC_021281	Fusobacterium nucleatum subsp. animalis 4_8, complete sequence	3	712046-712125	3	CRISPRCasFinder	no		WYL,cas14j,cas6,cas5,cas3,cas4,cas1,cas2,DinG,PD-DExK,csa3	Orphan	GAGCAGTATAATCAAAAGAAATACA	25	0	0	NA	NA	NA	1	1	Orphan	WYL,cas14j,cas6,cas5,cas3,cas4,cas1,cas2,DinG,PD-DExK,csa3	NA|43aa|up_0|NC_021281.1_711828_711957_+,NA|149aa|down_3|NC_021281.1_714431_714878_+	NA|288aa|up_9|NC_021281.1_703600_704464_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|509aa|up_8|NC_021281.1_704646_706173_+	cd08499, PBP2_Ylib_like, The substrate-binding component of an uncharacterized ABC-type peptide import system Ylib contains the type 2 periplasmic binding fold	NA|309aa|up_7|NC_021281.1_706262_707189_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|290aa|up_6|NC_021281.1_707198_708068_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|336aa|up_5|NC_021281.1_708086_709094_+	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|325aa|up_4|NC_021281.1_709086_710061_+	PRK11308, dppF, dipeptide transporter ATP-binding subunit; Provisional	NA|89aa|up_3|NC_021281.1_710115_710382_-	pfam06769, YoeB_toxin, YoeB-like toxin of bacterial type II toxin-antitoxin system	NA|89aa|up_2|NC_021281.1_710381_710648_-	TIGR02384, Putative_antitoxin_RelB, addiction module antitoxin, RelB/DinJ family	NA|159aa|up_1|NC_021281.1_710963_711440_-	cd17492, toxin_CptN, type III toxin-antitoxin system toxin CptN and similar proteins	NA|43aa|up_0|NC_021281.1_711828_711957_+	NA	NA|141aa|down_0|NC_021281.1_712187_712610_+	cd17493, toxin_TenpN, type III toxin-antitoxin system toxin TenpN and similar proteins	NA|95aa|down_1|NC_021281.1_713018_713303_+	COG1862, YajC, Preprotein translocase subunit YajC [Intracellular trafficking and secretion]	NA|343aa|down_2|NC_021281.1_713397_714426_+	COG0860, AmiC, N-acetylmuramoyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]	NA|149aa|down_3|NC_021281.1_714431_714878_+	NA	NA|358aa|down_4|NC_021281.1_714901_715975_+	PRK00591, prfA, peptide chain release factor 1; Validated	NA|384aa|down_5|NC_021281.1_715974_717126_+	PRK09328, PRK09328, N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional	NA|344aa|down_6|NC_021281.1_717106_718138_+	PRK00147, queA, S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional	NA|183aa|down_7|NC_021281.1_718150_718699_+	COG0742, COG0742, N6-adenine-specific methylase [DNA replication, recombination, and repair]	NA|71aa|down_8|NC_021281.1_718906_719119_+	COG1722, XseB, Exonuclease VII small subunit [DNA replication, recombination, and repair]	NA|298aa|down_9|NC_021281.1_719120_720014_+	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]
GCF_000400875.1_ASM40087v1	NC_021281	Fusobacterium nucleatum subsp. animalis 4_8, complete sequence	4	1858245-1858431	2	PILER-CR	no		WYL,cas14j,cas6,cas5,cas3,cas4,cas1,cas2,DinG,PD-DExK,csa3	Orphan	TCTCCTAACATACCTGCTGATGTTTCTTTTTCTGTTGTTATAG	43	0	0	NA	NA	NA	2	2	Orphan	WYL,cas14j,cas6,cas5,cas3,cas4,cas1,cas2,DinG,PD-DExK,csa3	NA|195aa|up_5|NC_021281.1_1847026_1847611_-,NA|197aa|up_4|NC_021281.1_1847628_1848219_-,NA	NA|175aa|up_9|NC_021281.1_1842537_1843062_+	PRK11281, PRK11281, mechanosensitive channel MscK	NA|355aa|up_8|NC_021281.1_1843137_1844202_+	cd00430, PLPDE_III_AR, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Alanine Racemase	NA|326aa|up_7|NC_021281.1_1844285_1845263_+	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	NA|511aa|up_6|NC_021281.1_1845414_1846947_-	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|195aa|up_5|NC_021281.1_1847026_1847611_-	NA	NA|197aa|up_4|NC_021281.1_1847628_1848219_-	NA	NA|518aa|up_3|NC_021281.1_1848205_1849759_-	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|367aa|up_2|NC_021281.1_1849879_1850981_-	PRK00578, prfB, peptide chain release factor 2; Validated	NA|123aa|up_1|NC_021281.1_1851037_1851406_-	COG0736, AcpS, Phosphopantetheinyl transferase (holo-ACP synthase) [Lipid metabolism]	NA|253aa|up_0|NC_021281.1_1851402_1852161_-	cd01310, TatD_DNAse, TatD like proteins;  E	NA|473aa|down_0|NC_021281.1_1863848_1865267_-	COG2985, COG2985, Predicted permease [General function prediction only]	NA|361aa|down_1|NC_021281.1_1865419_1866502_-	PRK09330, PRK09330, cell division protein FtsZ; Validated	NA|444aa|down_2|NC_021281.1_1866524_1867856_-	COG0849, ftsA, Cell division ATPase FtsA [Cell division and chromosome partitioning]	NA|236aa|down_3|NC_021281.1_1867852_1868560_-	COG1589, FtsQ, Cell division septal protein [Cell envelope biogenesis, outer membrane]	NA|288aa|down_4|NC_021281.1_1868572_1869436_-	PRK01372, ddl, D-alanine--D-alanine ligase; Reviewed	NA|282aa|down_5|NC_021281.1_1869448_1870294_-	COG0812, MurB, UDP-N-acetylmuramate dehydrogenase [Cell envelope biogenesis, outer membrane]	NA|461aa|down_6|NC_021281.1_1870290_1871673_-	COG0773, MurC, UDP-N-acetylmuramate-alanine ligase [Cell envelope biogenesis, outer membrane]	NA|355aa|down_7|NC_021281.1_1871677_1872742_-	cd03785, GT28_MurG, undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase	NA|433aa|down_8|NC_021281.1_1872749_1874048_-	COG0771, MurD, UDP-N-acetylmuramoylalanine-D-glutamate ligase [Cell envelope biogenesis, outer membrane]	NA|362aa|down_9|NC_021281.1_1874047_1875133_-	PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional
GCF_000400875.1_ASM40087v1	NC_021281	Fusobacterium nucleatum subsp. animalis 4_8, complete sequence	5	1970958-1971043	4	CRISPRCasFinder	no		WYL,cas14j,cas6,cas5,cas3,cas4,cas1,cas2,DinG,PD-DExK,csa3	Orphan	TCTTCAGGTTTTTCCGTTGCCTGCCAACTG	30	0	0	NA	NA	NA	1	1	Orphan	WYL,cas14j,cas6,cas5,cas3,cas4,cas1,cas2,DinG,PD-DExK,csa3	NA|130aa|up_3|NC_021281.1_1968442_1968832_-,NA|128aa|up_2|NC_021281.1_1968847_1969231_-,NA	NA|313aa|up_9|NC_021281.1_1952240_1953179_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|277aa|up_8|NC_021281.1_1953179_1954010_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|527aa|up_7|NC_021281.1_1954288_1955869_+	cd08490, PBP2_NikA_DppA_OppA_like_3, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|262aa|up_6|NC_021281.1_1955880_1956666_+	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|329aa|up_5|NC_021281.1_1956646_1957633_+	COG4608, AppF, ABC-type oligopeptide transport system, ATPase component [Amino acid transport and metabolism]	NA|3549aa|up_4|NC_021281.1_1957782_1968429_-	NF033175, fuso_auto_Nterm, autotransporter-associated N-terminal domain-containing protein	NA|130aa|up_3|NC_021281.1_1968442_1968832_-	NA	NA|128aa|up_2|NC_021281.1_1968847_1969231_-	NA	NA|124aa|up_1|NC_021281.1_1969243_1969615_-	pfam09403, FadA, Adhesion protein FadA	NA|312aa|up_0|NC_021281.1_1969967_1970903_-	pfam18813, PBECR4, phage-Barnase-EndoU-ColicinE5/D-RelE like nuclease4	NA|236aa|down_0|NC_021281.1_1971228_1971936_-	COG1346, LrgB, Putative effector of murein hydrolase [Cell envelope biogenesis, outer membrane]	NA|128aa|down_1|NC_021281.1_1971928_1972312_-	COG1380, COG1380, Putative effector of murein hydrolase LrgA [General function prediction only]	NA|324aa|down_2|NC_021281.1_1972342_1973314_-	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|260aa|down_3|NC_021281.1_1973325_1974105_-	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]	NA|476aa|down_4|NC_021281.1_1975285_1976713_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|389aa|down_5|NC_021281.1_1976941_1978108_-	COG0003, ArsA, Predicted ATPase involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|397aa|down_6|NC_021281.1_1978104_1979295_-	COG0003, ArsA, Predicted ATPase involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|184aa|down_7|NC_021281.1_1979507_1980059_+	COG1556, COG1556, Uncharacterized conserved protein [Function unknown]	NA|720aa|down_8|NC_021281.1_1980101_1982261_+	COG1139, COG1139, Uncharacterized conserved protein containing a ferredoxin-like domain [Energy production and conversion]	NA|327aa|down_9|NC_021281.1_1982261_1983242_+	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]
GCF_000400875.1_ASM40087v1	NC_021281	Fusobacterium nucleatum subsp. animalis 4_8, complete sequence	6	2087591-2087784	3	PILER-CR	no		WYL,cas14j,cas6,cas5,cas3,cas4,cas1,cas2,DinG,PD-DExK,csa3	Orphan	AAGACTGGCGCTCTACCAACTGAGCTA	27	0	0	NA	NA	NA	2	2	Orphan	WYL,cas14j,cas6,cas5,cas3,cas4,cas1,cas2,DinG,PD-DExK,csa3	NA,NA|71aa|down_1|NC_021281.1_2089355_2089568_-	NA|97aa|up_9|NC_021281.1_2075795_2076086_-	pfam15738, YafQ_toxin, Bacterial toxin of type II toxin-antitoxin system, YafQ	NA|96aa|up_8|NC_021281.1_2076082_2076370_-	TIGR02384, Putative_antitoxin_RelB, addiction module antitoxin, RelB/DinJ family	NA|316aa|up_7|NC_021281.1_2076643_2077591_-	pfam03382, DUF285, Mycoplasma protein of unknown function, DUF285	NA|230aa|up_6|NC_021281.1_2077609_2078299_-	COG2964, COG2964, Uncharacterized protein conserved in bacteria [Function unknown]	NA|546aa|up_5|NC_021281.1_2078762_2080400_+	COG3033, TnaA, Tryptophanase [Amino acid transport and metabolism]	NA|445aa|up_4|NC_021281.1_2080523_2081858_+	COG0733, COG0733, Na+-dependent transporters of the SNF family [General function prediction only]	NA|482aa|up_3|NC_021281.1_2082175_2083621_-	COG1115, AlsT, Na+/alanine symporter [Amino acid transport and metabolism]	NA|242aa|up_2|NC_021281.1_2083647_2084373_-	COG2071, COG2071, Predicted glutamine amidotransferases [General function prediction only]	NA|515aa|up_1|NC_021281.1_2084465_2086010_-	COG2978, AbgT, Putative p-aminobenzoyl-glutamate transporter [Coenzyme metabolism]	NA|283aa|up_0|NC_021281.1_2086238_2087087_-	COG1737, RpiR, Transcriptional regulators [Transcription]	NA|346aa|down_0|NC_021281.1_2087934_2088972_-	sd00006, TPR, Tetratricopeptide repeat	NA|71aa|down_1|NC_021281.1_2089355_2089568_-	NA	NA|317aa|down_2|NC_021281.1_2089642_2090593_-	pfam01261, AP_endonuc_2, Xylose isomerase-like TIM barrel	NA|258aa|down_3|NC_021281.1_2090855_2091629_-	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|342aa|down_4|NC_021281.1_2091625_2092651_-	pfam01032, FecCD, FecCD transport family	NA|290aa|down_5|NC_021281.1_2092653_2093523_-	COG0614, FepB, ABC-type Fe3+-hydroxamate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|658aa|down_6|NC_021281.1_2093830_2095804_-	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|124aa|down_7|NC_021281.1_2096233_2096605_-	cd01109, HTH_YyaN, Helix-Turn-Helix DNA binding domain of the MerR-like transcription regulators YyaN and YraB	NA|261aa|down_8|NC_021281.1_2096686_2097469_+	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|126aa|down_9|NC_021281.1_2097550_2097928_-	TIGR00004, RutC_family_protein, reactive intermediate/imine deaminase
