assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000833105.2_ASM83310v2	NZ_CP010086	Clostridium beijerinckii strain NCIMB 14988 chromosome, complete genome	1	1340091-1340296	1	CRISPRCasFinder	no		cas3,DEDDh,RT,csa3,DinG,WYL	Orphan	CCAGTTTGCATATCTCCAGATGAATT	26	0	0	NA	NA	NA	3	3	Orphan	cas3,DEDDh,RT,csa3,DinG,WYL	NA|180aa|up_8|NZ_CP010086.2_1326362_1326902_-,NA|453aa|up_5|NZ_CP010086.2_1332027_1333386_+,NA|220aa|down_0|NZ_CP010086.2_1340884_1341544_+,NA|435aa|down_1|NZ_CP010086.2_1341605_1342910_+	NA|350aa|up_9|NZ_CP010086.2_1325278_1326328_+	cd01539, PBP1_GGBP, periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species	NA|180aa|up_8|NZ_CP010086.2_1326362_1326902_-	NA	NA|635aa|up_7|NZ_CP010086.2_1327775_1329680_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|533aa|up_6|NZ_CP010086.2_1330442_1332041_+	COG1696, DltB, Predicted membrane protein involved in D-alanine export [Cell envelope biogenesis, outer membrane]	NA|453aa|up_5|NZ_CP010086.2_1332027_1333386_+	NA	NA|504aa|up_4|NZ_CP010086.2_1333401_1334913_+	cd05930, A_NRPS, The adenylation domain of nonribosomal peptide synthetases (NRPS)	NA|76aa|up_3|NZ_CP010086.2_1334927_1335155_+	COG0236, AcpP, Acyl carrier protein [Lipid metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|174aa|up_2|NZ_CP010086.2_1335333_1335855_+	COG5652, COG5652, Predicted integral membrane protein [Function unknown]	NA|293aa|up_1|NZ_CP010086.2_1336365_1337244_+	COG1210, GalU, UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane]	NA|660aa|up_0|NZ_CP010086.2_1337282_1339262_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|220aa|down_0|NZ_CP010086.2_1340884_1341544_+	NA	NA|435aa|down_1|NZ_CP010086.2_1341605_1342910_+	NA	NA|351aa|down_2|NZ_CP010086.2_1343012_1344065_+	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|415aa|down_3|NZ_CP010086.2_1345020_1346265_+	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|151aa|down_4|NZ_CP010086.2_1346387_1346840_+	TIGR00738, Putative_HTH-type_transcriptional_regulator, Rrf2 family protein	NA|394aa|down_5|NZ_CP010086.2_1346842_1348024_+	TIGR03402, Cysteine_desulfurase_NifS, cysteine desulfurase NifS	NA|146aa|down_6|NZ_CP010086.2_1348025_1348463_+	TIGR03419, NifU_clost, FeS cluster assembly scaffold protein NifU, Clostridium type	NA|359aa|down_7|NZ_CP010086.2_1348474_1349551_+	PRK00143, mnmA, tRNA-specific 2-thiouridylase MnmA; Reviewed	NA|99aa|down_8|NZ_CP010086.2_1349962_1350259_+	PTZ00395, PTZ00395, Sec24-related protein; Provisional	NA|167aa|down_9|NZ_CP010086.2_1351049_1351550_+	COG3881, COG3881, PRC-barrel domain containing protein [General function prediction only]
GCF_000833105.2_ASM83310v2	NZ_CP010086	Clostridium beijerinckii strain NCIMB 14988 chromosome, complete genome	2	1658747-1658830	2	CRISPRCasFinder	no		cas3,DEDDh,RT,csa3,DinG,WYL	Orphan	AAAAAAGCGCGAAGCGCAACCAAGAT	26	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,RT,csa3,DinG,WYL	NA|165aa|up_5|NZ_CP010086.2_1653789_1654284_-,NA|302aa|up_3|NZ_CP010086.2_1655072_1655978_-,NA|216aa|up_0|NZ_CP010086.2_1657919_1658567_-,NA|347aa|down_2|NZ_CP010086.2_1661779_1662820_+,NA|166aa|down_6|NZ_CP010086.2_1665067_1665565_+	NA|466aa|up_9|NZ_CP010086.2_1649398_1650796_+	PRK02256, PRK02256, putative aminopeptidase 1; Provisional	NA|280aa|up_8|NZ_CP010086.2_1650882_1651722_-	COG0613, COG0613, Predicted metal-dependent phosphoesterases (PHP family) [General function prediction only]	NA|161aa|up_7|NZ_CP010086.2_1651969_1652452_-	pfam16224, DUF4883, DOmain of unknown function (DUF4883)	NA|341aa|up_6|NZ_CP010086.2_1652596_1653619_+	COG0469, PykF, Pyruvate kinase [Carbohydrate transport and metabolism]	NA|165aa|up_5|NZ_CP010086.2_1653789_1654284_-	NA	NA|202aa|up_4|NZ_CP010086.2_1654467_1655073_-	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|302aa|up_3|NZ_CP010086.2_1655072_1655978_-	NA	NA|181aa|up_2|NZ_CP010086.2_1656387_1656930_+	cd01046, Rubrerythrin_like, rubrerythrin-like, diiron-binding domain	NA|215aa|up_1|NZ_CP010086.2_1657033_1657678_-	cd02908, Macro_OAADPr_deacetylase, macrodomain, O-acetyl-ADP-ribose (OAADPr) family	NA|216aa|up_0|NZ_CP010086.2_1657919_1658567_-	NA	NA|456aa|down_0|NZ_CP010086.2_1659164_1660532_+	cd13134, MATE_like_8, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins	NA|345aa|down_1|NZ_CP010086.2_1660730_1661765_+	COG4632, EpsL, Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase [Carbohydrate transport and metabolism]	NA|347aa|down_2|NZ_CP010086.2_1661779_1662820_+	NA	NA|84aa|down_3|NZ_CP010086.2_1662939_1663191_+	TIGR03959, putative_iron-only_hydrogenase_system_regulator, putative iron-only hydrogenase system regulator	NA|259aa|down_4|NZ_CP010086.2_1663467_1664244_+	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|193aa|down_5|NZ_CP010086.2_1664332_1664911_-	COG2112, COG2112, Predicted Ser/Thr protein kinase [Signal transduction mechanisms]	NA|166aa|down_6|NZ_CP010086.2_1665067_1665565_+	NA	NA|214aa|down_7|NZ_CP010086.2_1666015_1666657_+	pfam02589, LUD_dom, LUD domain	NA|554aa|down_8|NZ_CP010086.2_1666921_1668583_+	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|588aa|down_9|NZ_CP010086.2_1668665_1670429_-	COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and secretion / Signal transduction mechanisms]
GCF_000833105.2_ASM83310v2	NZ_CP010086	Clostridium beijerinckii strain NCIMB 14988 chromosome, complete genome	3	3571711-3571799	3	CRISPRCasFinder	no		cas3,DEDDh,RT,csa3,DinG,WYL	Orphan	ACATTGGTTCCAGTTAAAGAAAG	23	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,RT,csa3,DinG,WYL	NA|274aa|up_8|NZ_CP010086.2_3560865_3561687_-,NA|261aa|up_6|NZ_CP010086.2_3562433_3563216_-,NA|130aa|up_4|NZ_CP010086.2_3566299_3566689_-,NA|56aa|up_3|NZ_CP010086.2_3566755_3566923_-,NA|140aa|up_1|NZ_CP010086.2_3568721_3569141_-,NA|124aa|down_0|NZ_CP010086.2_3574135_3574507_-,NA|114aa|down_1|NZ_CP010086.2_3574841_3575183_-,NA|206aa|down_5|NZ_CP010086.2_3578674_3579292_-,NA|197aa|down_7|NZ_CP010086.2_3580778_3581369_-,NA|67aa|down_8|NZ_CP010086.2_3581829_3582030_-	NA|272aa|up_9|NZ_CP010086.2_3559805_3560621_-	pfam14253, AbiH, Bacteriophage abortive infection AbiH	NA|274aa|up_8|NZ_CP010086.2_3560865_3561687_-	NA	NA|121aa|up_7|NZ_CP010086.2_3561767_3562130_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|261aa|up_6|NZ_CP010086.2_3562433_3563216_-	NA	NA|559aa|up_5|NZ_CP010086.2_3563443_3565120_-	COG0595, COG0595, mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair]	NA|130aa|up_4|NZ_CP010086.2_3566299_3566689_-	NA	NA|56aa|up_3|NZ_CP010086.2_3566755_3566923_-	NA	NA|67aa|up_2|NZ_CP010086.2_3568176_3568377_+	pfam12841, YvrJ, YvrJ protein family	NA|140aa|up_1|NZ_CP010086.2_3568721_3569141_-	NA	NA|232aa|up_0|NZ_CP010086.2_3569688_3570384_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|124aa|down_0|NZ_CP010086.2_3574135_3574507_-	NA	NA|114aa|down_1|NZ_CP010086.2_3574841_3575183_-	NA	NA|768aa|down_2|NZ_CP010086.2_3575261_3577565_-	TIGR03920, T7SS_EccD, type VII secretion integral membrane protein EccD	NA|142aa|down_3|NZ_CP010086.2_3577606_3578032_-	pfam14107, DUF4280, Domain of unknown function (DUF4280)	NA|175aa|down_4|NZ_CP010086.2_3578078_3578603_-	PRK12300, leuS, leucyl-tRNA synthetase; Reviewed	NA|206aa|down_5|NZ_CP010086.2_3578674_3579292_-	NA	NA|473aa|down_6|NZ_CP010086.2_3579320_3580739_-	TIGR01646, conserved_hypothetical_protein, Rhs element Vgr protein	NA|197aa|down_7|NZ_CP010086.2_3580778_3581369_-	NA	NA|67aa|down_8|NZ_CP010086.2_3581829_3582030_-	NA	NA|264aa|down_9|NZ_CP010086.2_3582048_3582840_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)
GCF_000833105.2_ASM83310v2	NZ_CP010086	Clostridium beijerinckii strain NCIMB 14988 chromosome, complete genome	4	5033871-5033967	4	CRISPRCasFinder	no		cas3,DEDDh,RT,csa3,DinG,WYL	Orphan	CCATACTTGTATGGTAGTTATATTT	25	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,RT,csa3,DinG,WYL	NA,NA|99aa|down_4|NZ_CP010086.2_5041030_5041327_+,NA|104aa|down_8|NZ_CP010086.2_5043659_5043971_-	NA|306aa|up_9|NZ_CP010086.2_5019800_5020718_-	cd01174, ribokinase, Ribokinase catalyses the phosphorylation of ribose to ribose-5-phosphate using ATP	NA|566aa|up_8|NZ_CP010086.2_5020883_5022581_-	COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and secretion / Signal transduction mechanisms]	NA|488aa|up_7|NZ_CP010086.2_5022663_5024127_-	cd01536, PBP1_ABC_sugar_binding-like, periplasmic sugar-binding domain of active transport systems that are members of the type 1 periplasmic binding protein (PBP1) superfamily	NA|186aa|up_6|NZ_CP010086.2_5024705_5025263_-	COG3963, COG3963, Phospholipid N-methyltransferase [Lipid metabolism]	NA|141aa|up_5|NZ_CP010086.2_5026149_5026572_-	COG0537, Hit, Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only]	NA|502aa|up_4|NZ_CP010086.2_5026855_5028361_-	COG2211, MelB, Na+/melibiose symporter and related transporters [Carbohydrate transport and metabolism]	NA|471aa|up_3|NZ_CP010086.2_5028399_5029812_-	pfam02614, UxaC, Glucuronate isomerase	NA|538aa|up_2|NZ_CP010086.2_5030222_5031836_-	COG0246, MtlD, Mannitol-1-phosphate/altronate dehydrogenases [Carbohydrate transport and metabolism]	NA|354aa|up_1|NZ_CP010086.2_5031934_5032996_-	PRK03906, PRK03906, mannonate dehydratase; Provisional	NA|225aa|up_0|NZ_CP010086.2_5033165_5033840_-	COG1802, GntR, Transcriptional regulators [Transcription]	NA|560aa|down_0|NZ_CP010086.2_5034186_5035866_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|392aa|down_1|NZ_CP010086.2_5035947_5037123_+	COG4134, COG4134, ABC-type uncharacterized transport system, periplasmic component [General function prediction only]	NA|621aa|down_2|NZ_CP010086.2_5037202_5039065_-	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|406aa|down_3|NZ_CP010086.2_5039362_5040580_+	pfam13546, DDE_5, DDE superfamily endonuclease	NA|99aa|down_4|NZ_CP010086.2_5041030_5041327_+	NA	NA|148aa|down_5|NZ_CP010086.2_5041463_5041907_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|52aa|down_6|NZ_CP010086.2_5042194_5042350_-	PRK09857, PRK09857, recombination-promoting nuclease RpnA	NA|354aa|down_7|NZ_CP010086.2_5042502_5043564_-	cd08174, G1PDH-like, Glycerol-1-phosphate dehydrogenase-like	NA|104aa|down_8|NZ_CP010086.2_5043659_5043971_-	NA	NA|64aa|down_9|NZ_CP010086.2_5044054_5044246_-	pfam04024, PspC, PspC domain
GCF_000833105.2_ASM83310v2	NZ_CP010086	Clostridium beijerinckii strain NCIMB 14988 chromosome, complete genome	5	5924674-5924820	5	CRISPRCasFinder	no	RT	cas3,DEDDh,RT,csa3,DinG,WYL	Unclear	CATAGCACCTGATCCATTTAAGTAGTA	27	0	0	NA	NA	NA	2	2	Orphan	cas3,DEDDh,RT,csa3,DinG,WYL	NA|236aa|up_3|NZ_CP010086.2_5920391_5921099_-,NA|514aa|up_1|NZ_CP010086.2_5921951_5923493_-,NA|120aa|up_0|NZ_CP010086.2_5923567_5923927_-,NA|74aa|down_6|NZ_CP010086.2_5936913_5937135_+,NA|84aa|down_7|NZ_CP010086.2_5937233_5937485_+,NA|48aa|down_8|NZ_CP010086.2_5937530_5937674_+,NA|89aa|down_9|NZ_CP010086.2_5937740_5938007_+	NA|1055aa|up_9|NZ_CP010086.2_5909315_5912480_-	COG5610, COG5610, Predicted hydrolase (HAD superfamily) [General function prediction only]	NA|461aa|up_8|NZ_CP010086.2_5912544_5913927_-	COG5610, COG5610, Predicted hydrolase (HAD superfamily) [General function prediction only]	NA|346aa|up_7|NZ_CP010086.2_5913878_5914916_-	PRK10073, PRK10073, putative glycosyl transferase; Provisional	NA|475aa|up_6|NZ_CP010086.2_5915314_5916739_+	TIGR03023, Sugar_transferase	NA|379aa|up_5|NZ_CP010086.2_5916735_5917872_+	cd03808, GT4_CapM-like, capsular polysaccharide biosynthesis glycosyltransferase CapM and similar proteins	NA|611aa|up_4|NZ_CP010086.2_5918168_5920001_+	pfam04932, Wzy_C, O-Antigen ligase	NA|236aa|up_3|NZ_CP010086.2_5920391_5921099_-	NA	NA|293aa|up_2|NZ_CP010086.2_5921055_5921934_-	cd03264, ABC_drug_resistance_like, ABC-type multidrug transport system, ATPase component	NA|514aa|up_1|NZ_CP010086.2_5921951_5923493_-	NA	NA|120aa|up_0|NZ_CP010086.2_5923567_5923927_-	NA	NA|405aa|down_0|NZ_CP010086.2_5926958_5928173_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|415aa|down_1|NZ_CP010086.2_5928316_5929561_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|516aa|down_2|NZ_CP010086.2_5929611_5931159_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|425aa|down_3|NZ_CP010086.2_5931389_5932664_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|434aa|down_4|NZ_CP010086.2_5932828_5934130_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|576aa|down_5|NZ_CP010086.2_5934674_5936402_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|74aa|down_6|NZ_CP010086.2_5936913_5937135_+	NA	NA|84aa|down_7|NZ_CP010086.2_5937233_5937485_+	NA	NA|48aa|down_8|NZ_CP010086.2_5937530_5937674_+	NA	NA|89aa|down_9|NZ_CP010086.2_5937740_5938007_+	NA
