assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000953215.1_DG5	NZ_LM995447	[Clostridium] cellulosi genome assembly DG5, chromosome : I	1	352008-352101	1	CRISPRCasFinder	no		DEDDh,cas3,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	Orphan	GCTGGCCCGGCTGCAGTTCCAGGTTTAGGT	30	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	NA|72aa|up_2|NZ_LM995447.1_347328_347544_-,NA|180aa|down_1|NZ_LM995447.1_354644_355184_-,NA|63aa|down_3|NZ_LM995447.1_355634_355823_-	NA|290aa|up_9|NZ_LM995447.1_341386_342256_+	PRK00274, ksgA, 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))-dimethyltransferase RsmA	NA|614aa|up_8|NZ_LM995447.1_342510_344352_+	TIGR01536, Asparagine_synthetase_1, asparagine synthase (glutamine-hydrolyzing)	NA|213aa|up_7|NZ_LM995447.1_344590_345229_-	pfam17117, DUF5104, Domain of unknown function (DUF5104)	NA|176aa|up_6|NZ_LM995447.1_345201_345729_-	pfam17117, DUF5104, Domain of unknown function (DUF5104)	NA|173aa|up_5|NZ_LM995447.1_345832_346351_-	pfam17117, DUF5104, Domain of unknown function (DUF5104)	NA|173aa|up_4|NZ_LM995447.1_346441_346960_-	pfam17117, DUF5104, Domain of unknown function (DUF5104)	NA|113aa|up_3|NZ_LM995447.1_347026_347365_-	pfam17117, DUF5104, Domain of unknown function (DUF5104)	NA|72aa|up_2|NZ_LM995447.1_347328_347544_-	NA	NA|1154aa|up_1|NZ_LM995447.1_347543_351005_-	cd00198, vWFA, Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|178aa|up_0|NZ_LM995447.1_351267_351801_-	COG0842, COG0842, ABC-type multidrug transport system, permease component [Defense mechanisms]	NA|684aa|down_0|NZ_LM995447.1_352477_354529_-	cd02931, ER_like_FMN, Enoate reductase (ER)-like FMN-binding domain	NA|180aa|down_1|NZ_LM995447.1_354644_355184_-	NA	NA|64aa|down_2|NZ_LM995447.1_355264_355456_-	cd00592, HTH_MerR-like, Helix-Turn-Helix DNA binding domain of MerR-like transcription regulators	NA|63aa|down_3|NZ_LM995447.1_355634_355823_-	NA	NA|505aa|down_4|NZ_LM995447.1_356026_357541_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|115aa|down_5|NZ_LM995447.1_357542_357887_-	PRK03760, PRK03760, hypothetical protein; Provisional	NA|309aa|down_6|NZ_LM995447.1_357889_358816_-	COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]	NA|293aa|down_7|NZ_LM995447.1_358826_359705_-	COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]	NA|443aa|down_8|NZ_LM995447.1_359770_361099_-	COG4962, CpaF, Flp pilus assembly protein, ATPase CpaF [Intracellular trafficking and secretion]	NA|393aa|down_9|NZ_LM995447.1_361135_362314_-	cd03111, CpaE-like, pilus assembly ATPase CpaE
GCF_000953215.1_DG5	NZ_LM995447	[Clostridium] cellulosi genome assembly DG5, chromosome : I	2	491213-491359	2	CRISPRCasFinder	no		DEDDh,cas3,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	Orphan	CAGTTGCCAACGTCCGCGGGGAACAGAGAGGTTAACTCTGCGAAC	45	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	NA|235aa|up_4|NZ_LM995447.1_486814_487519_+,NA	NA|323aa|up_9|NZ_LM995447.1_479276_480245_+	TIGR00950, Uncharacterized_inner_membrane_transporter_YicL, Carboxylate/Amino Acid/Amine Transporter	NA|870aa|up_8|NZ_LM995447.1_480435_483045_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|372aa|up_7|NZ_LM995447.1_483292_484408_+	COG0116, COG0116, Predicted N6-adenine-specific DNA methylase [DNA replication, recombination, and repair]	NA|95aa|up_6|NZ_LM995447.1_484690_484975_+	PRK00364, groES, co-chaperonin GroES; Reviewed	NA|546aa|up_5|NZ_LM995447.1_485001_486639_+	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|235aa|up_4|NZ_LM995447.1_486814_487519_+	NA	NA|180aa|up_3|NZ_LM995447.1_487945_488485_+	PRK00028, infC, translation initiation factor IF-3; Reviewed	NA|66aa|up_2|NZ_LM995447.1_488514_488712_+	PRK00172, rpmI, 50S ribosomal protein L35; Reviewed	NA|118aa|up_1|NZ_LM995447.1_488751_489105_+	PRK05185, rplT, 50S ribosomal protein L20; Provisional	NA|327aa|up_0|NZ_LM995447.1_488992_489973_+	cd18095, SpoU-like_rRNA-MTase, SAM-dependent rRNA methylase related to SpoU-TrmH	NA|693aa|down_0|NZ_LM995447.1_491753_493832_+	PRK05582, PRK05582, type I DNA topoisomerase	NA|438aa|down_1|NZ_LM995447.1_493828_495142_+	PRK05335, PRK05335, tRNA (uracil-5-)-methyltransferase Gid; Reviewed	NA|332aa|down_2|NZ_LM995447.1_495160_496156_+	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|78aa|down_3|NZ_LM995447.1_496396_496630_+	PRK00982, acpP, acyl carrier protein; Provisional	NA|228aa|down_4|NZ_LM995447.1_496829_497513_+	PRK00102, rnc, ribonuclease III; Reviewed	NA|338aa|down_5|NZ_LM995447.1_497509_498523_+	COG1243, ELP3, Histone acetyltransferase [Transcription / Chromatin structure and dynamics]	NA|1191aa|down_6|NZ_LM995447.1_498546_502119_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|304aa|down_7|NZ_LM995447.1_502400_503312_+	PRK10416, PRK10416, signal recognition particle-docking protein FtsY; Provisional	NA|199aa|down_8|NZ_LM995447.1_503327_503924_+	PRK00120, PRK00120, dITP/XTP pyrophosphatase; Reviewed	NA|96aa|down_9|NZ_LM995447.1_503974_504262_+	pfam01985, CRS1_YhbY, CRS1 / YhbY (CRM) domain
GCF_000953215.1_DG5	NZ_LM995447	[Clostridium] cellulosi genome assembly DG5, chromosome : I	3	1504440-1504552	3	CRISPRCasFinder	no		DEDDh,cas3,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	Orphan	TCAGCAGCTCTCAGTTCTGGCTCATCTCA	29	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	NA,NA	NA|428aa|up_9|NZ_LM995447.1_1487354_1488638_+	COG0621, MiaB, 2-methylthioadenine synthetase [Translation, ribosomal structure and biogenesis]	NA|1239aa|up_8|NZ_LM995447.1_1488823_1492540_+	TIGR01857, FGAM-synthase, phosphoribosylformylglycinamidine synthase, clade II	NA|246aa|up_7|NZ_LM995447.1_1492684_1493422_-	smart00257, LysM, Lysin motif	NA|232aa|up_6|NZ_LM995447.1_1493614_1494310_-	pfam11738, DUF3298, Protein of unknown function (DUF3298)	NA|210aa|up_5|NZ_LM995447.1_1494402_1495032_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|298aa|up_4|NZ_LM995447.1_1495251_1496145_+	cd03402, SPFH_like_u2, Uncharacterized family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|75aa|up_3|NZ_LM995447.1_1496157_1496382_+	COG4877, COG4877, Uncharacterized protein conserved in bacteria [Function unknown]	NA|964aa|up_2|NZ_LM995447.1_1496676_1499568_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|605aa|up_1|NZ_LM995447.1_1499911_1501726_+	PRK08645, PRK08645, bifunctional homocysteine S-methyltransferase/5,10-methylenetetrahydrofolate reductase protein; Reviewed	NA|196aa|up_0|NZ_LM995447.1_1502027_1502615_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|211aa|down_0|NZ_LM995447.1_1505497_1506130_-	cd00564, TMP_TenI, Thiamine monophosphate synthase (TMP synthase)/TenI	NA|418aa|down_1|NZ_LM995447.1_1506122_1507376_-	PRK09240, thiH, 2-iminoacetate synthase ThiH	NA|257aa|down_2|NZ_LM995447.1_1507389_1508160_-	PRK00208, thiG, thiazole synthase; Reviewed	NA|205aa|down_3|NZ_LM995447.1_1508176_1508791_-	PRK08644, PRK08644, sulfur carrier protein ThiS adenylyltransferase ThiF	NA|65aa|down_4|NZ_LM995447.1_1508813_1509008_-	cd00565, Ubl_ThiS, ubiquitin-like (Ubl) domain found in sulfur carrier protein ThiS	NA|134aa|down_5|NZ_LM995447.1_1509424_1509826_+	pfam01894, UPF0047, Uncharacterized protein family UPF0047	NA|365aa|down_6|NZ_LM995447.1_1510046_1511141_+	COG3274, COG3274, Predicted O-acyltransferase [General function prediction only]	NA|844aa|down_7|NZ_LM995447.1_1511190_1513722_-	PRK08315, PRK08315, AMP-binding domain protein; Validated	NA|339aa|down_8|NZ_LM995447.1_1514358_1515375_+	PRK03743, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase PdxA	NA|210aa|down_9|NZ_LM995447.1_1515547_1516177_+	PRK00215, PRK00215, transcriptional repressor LexA
GCF_000953215.1_DG5	NZ_LM995447	[Clostridium] cellulosi genome assembly DG5, chromosome : I	4	2129406-2135837	1,1,4	CRT,PILER-CR,CRISPRCasFinder	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	DEDDh,cas3,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	 Type I-U?,Type I-C,Type I-U	ATTTCAATCCACGCTCCCGTGTGGGGAGCGAC,ATTTCAATCCACGCTCCCGTGTGGGGAGCGAC,ATTTCAATCCACGCTCCCGTGTGGGGAGCGAC	32,32,32	0	0	NA	NA	I-C:I-C:I-C	95,94,94	95	TypeI-U,TypeI-U?,TypeI-C	DEDDh,cas3,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	NA|72aa|up_8|NZ_LM995447.1_2117271_2117487_-,NA	NA|434aa|up_9|NZ_LM995447.1_2115966_2117268_-	pfam15979, Glyco_hydro_115, Glycosyl hydrolase family 115	NA|72aa|up_8|NZ_LM995447.1_2117271_2117487_-	NA	NA|282aa|up_7|NZ_LM995447.1_2117620_2118466_-	PRK08277, PRK08277, D-mannonate oxidoreductase; Provisional	NA|346aa|up_6|NZ_LM995447.1_2119533_2120571_-	COG2222, AgaS, Predicted phosphosugar isomerases [Cell envelope biogenesis, outer membrane]	NA|277aa|up_5|NZ_LM995447.1_2120622_2121453_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|296aa|up_4|NZ_LM995447.1_2121467_2122355_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|453aa|up_3|NZ_LM995447.1_2122429_2123788_-	cd13585, PBP2_TMBP_like, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose and similar oligosaccharides; possess type 2 periplasmic binding fold	NA|884aa|up_2|NZ_LM995447.1_2124200_2126852_+	pfam03200, Glyco_hydro_63, Glycosyl hydrolase family 63 C-terminal domain	NA|231aa|up_1|NZ_LM995447.1_2126905_2127598_-	COG1802, GntR, Transcriptional regulators [Transcription]	NA|352aa|up_0|NZ_LM995447.1_2127933_2128989_+	PRK03906, PRK03906, mannonate dehydratase; Provisional	cas2|97aa|down_0|NZ_LM995447.1_2135995_2136286_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|NZ_LM995447.1_2136312_2137344_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|242aa|down_2|NZ_LM995447.1_2137279_2138005_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas7|296aa|down_3|NZ_LM995447.1_2138007_2138895_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|604aa|down_4|NZ_LM995447.1_2138896_2140708_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|219aa|down_5|NZ_LM995447.1_2140704_2141361_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|729aa|down_6|NZ_LM995447.1_2141652_2143839_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	NA|166aa|down_7|NZ_LM995447.1_2144332_2144830_+	pfam05103, DivIVA, DivIVA protein	NA|313aa|down_8|NZ_LM995447.1_2144994_2145933_-	cd05239, GDP_FS_SDR_e, GDP-fucose synthetase, extended (e) SDRs	NA|341aa|down_9|NZ_LM995447.1_2145951_2146974_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]
GCF_000953215.1_DG5	NZ_LM995447	[Clostridium] cellulosi genome assembly DG5, chromosome : I	5	2144047-2144139	5	CRISPRCasFinder	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	DEDDh,cas3,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	 Type I-U?,Type I-C,Type I-U	GCTCCCGCGTGGGGAGCGACAAT	23	0	0	NA	NA	NA	1	1	TypeI-U,TypeI-U?,TypeI-C	DEDDh,cas3,PD-DExK,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	NA,NA	NA|884aa|up_9|NZ_LM995447.1_2124200_2126852_+	pfam03200, Glyco_hydro_63, Glycosyl hydrolase family 63 C-terminal domain	NA|231aa|up_8|NZ_LM995447.1_2126905_2127598_-	COG1802, GntR, Transcriptional regulators [Transcription]	NA|352aa|up_7|NZ_LM995447.1_2127933_2128989_+	PRK03906, PRK03906, mannonate dehydratase; Provisional	cas2|97aa|up_6|NZ_LM995447.1_2135995_2136286_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|up_5|NZ_LM995447.1_2136312_2137344_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|242aa|up_4|NZ_LM995447.1_2137279_2138005_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas7|296aa|up_3|NZ_LM995447.1_2138007_2138895_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|604aa|up_2|NZ_LM995447.1_2138896_2140708_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|219aa|up_1|NZ_LM995447.1_2140704_2141361_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|729aa|up_0|NZ_LM995447.1_2141652_2143839_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	NA|166aa|down_0|NZ_LM995447.1_2144332_2144830_+	pfam05103, DivIVA, DivIVA protein	NA|313aa|down_1|NZ_LM995447.1_2144994_2145933_-	cd05239, GDP_FS_SDR_e, GDP-fucose synthetase, extended (e) SDRs	NA|341aa|down_2|NZ_LM995447.1_2145951_2146974_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|303aa|down_3|NZ_LM995447.1_2147067_2147976_-	pfam01636, APH, Phosphotransferase enzyme family	NA|320aa|down_4|NZ_LM995447.1_2148009_2148969_-	cd11548, NodZ_like, Alpha 1,6-fucosyltransferase similar to Bradyrhizobium NodZ	NA|354aa|down_5|NZ_LM995447.1_2148958_2150020_-	cd11296, O-FucT_like, GDP-fucose protein O-fucosyltransferase and related proteins	NA|382aa|down_6|NZ_LM995447.1_2150050_2151196_-	cd11296, O-FucT_like, GDP-fucose protein O-fucosyltransferase and related proteins	NA|346aa|down_7|NZ_LM995447.1_2151326_2152364_-	cd05230, UGD_SDR_e, UDP-glucuronate decarboxylase (UGD) and related proteins, extended (e) SDRs	NA|348aa|down_8|NZ_LM995447.1_2152375_2153419_-	COG0022, AcoB, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [Energy production and conversion]	NA|322aa|down_9|NZ_LM995447.1_2153452_2154418_-	cd02000, TPP_E1_PDC_ADC_BCADC, Thiamine pyrophosphate (TPP) family, E1 of PDC_ADC_BCADC subfamily, TPP-binding module; composed of proteins similar to the E1 components of the human pyruvate dehydrogenase complex (PDC), the acetoin dehydrogenase complex (ADC) and the branched chain alpha-keto acid dehydrogenase/2-oxoisovalerate dehydrogenase complex (BCADC)
