assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000178115.2_ASM17811v2	NC_014828	Ethanoligenens harbinense YUAN-3, complete genome	1	228227-228313	1	CRISPRCasFinder	no	cas3	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	Unclear	TTTTTCGCCGCATTTCGGGCAGGT	24	0	0	NA	NA	NA	1	1	Unclear	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	NA|87aa|up_1|NC_014828.1_226971_227232_+,NA	NA|563aa|up_9|NC_014828.1_216715_218404_-	cd08513, PBP2_thermophilic_Hb8_like, The substrate-binding component of ABC-type thermophilic oligopeptide-binding protein Hb8-like import systems, contains the type 2 periplasmic binding fold	NA|87aa|up_8|NC_014828.1_218463_218724_-	pfam04232, SpoVS, Stage V sporulation protein S (SpoVS)	NA|260aa|up_7|NC_014828.1_218909_219689_-	pfam13277, YmdB, YmdB-like protein	NA|517aa|up_6|NC_014828.1_219805_221356_-	PRK12704, PRK12704, phosphodiesterase; Provisional	NA|826aa|up_5|NC_014828.1_221800_224278_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|349aa|up_4|NC_014828.1_224293_225340_-	PRK01059, PRK01059, ATP:guanido phosphotransferase; Provisional	NA|204aa|up_3|NC_014828.1_225336_225948_-	COG3880, COG3880, Modulator of heat shock repressor CtsR, McsA [Signal transduction    mechanisms]	NA|152aa|up_2|NC_014828.1_225895_226351_-	pfam05848, CtsR, Firmicute transcriptional repressor of class III stress genes (CtsR)	NA|87aa|up_1|NC_014828.1_226971_227232_+	NA	NA|256aa|up_0|NC_014828.1_227235_228003_-	cd06259, YdcF-like, YdcF-like	NA|186aa|down_0|NC_014828.1_228716_229274_-	PRK00529, PRK00529, elongation factor P; Validated	NA|356aa|down_1|NC_014828.1_229486_230554_-	cd01092, APP-like, Similar to Prolidase and Aminopeptidase P	NA|146aa|down_2|NC_014828.1_230592_231030_-	pfam01220, DHquinase_II, Dehydroquinase class II	NA|286aa|down_3|NC_014828.1_231085_231943_-	COG0648, Nfo, Endonuclease IV [DNA replication, recombination, and repair]	NA|202aa|down_4|NC_014828.1_231939_232545_-	COG2179, COG2179, Predicted hydrolase of the HAD superfamily [General function prediction only]	NA|59aa|down_5|NC_014828.1_232545_232722_-	PRK00270, rpsU, 30S ribosomal protein S21; Reviewed	NA|439aa|down_6|NC_014828.1_233261_234578_+	pfam14403, CP_ATPgrasp_2, Circularly permuted ATP-grasp type 2	NA|219aa|down_7|NC_014828.1_234559_235216_+	pfam04168, Alpha-E, A predicted alpha-helical domain with a conserved ER motif	NA|274aa|down_8|NC_014828.1_235278_236100_+	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|311aa|down_9|NC_014828.1_236104_237037_+	cd06253, M14_ASTE_ASPA-like, Peptidase M14 Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA)-like; uncharacterized subgroup
GCF_000178115.2_ASM17811v2	NC_014828	Ethanoligenens harbinense YUAN-3, complete genome	2	674783-693586	2,1,1,2,3	CRISPRCasFinder,CRT,PILER-CR,PILER-CR,PILER-CR	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	Type I-U, Type I-U?,Type I-C	ATTTCAATCCACGCTCTCCGTGTGGAGAGCGAC,ATTTCAATCCACGCTCTCCGTGTGGAGAGCGAC,ATTTCAATCCACGCTCTCCGTGTGGAGAGCGAC,ATTTCAATCCACGCTCTCCGTGTGGAGAGCGAC,ATTTCAATCCACGCTCTCCGTGTGGAGAGCGAC	33,33,33,33,33	0	0	NA	NA	NA:NA:NA:NA:NA	279,279,273,273,273	279	TypeI-U,TypeI-U?,TypeI-C	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	NA|100aa|up_7|NC_014828.1_666376_666676_-,NA|77aa|up_6|NC_014828.1_666792_667023_+,NA|53aa|up_5|NC_014828.1_667089_667248_-,NA	NA|368aa|up_9|NC_014828.1_663509_664613_-	TIGR00326, eubact_ribD, riboflavin biosynthesis protein RibD	NA|295aa|up_8|NC_014828.1_665479_666364_-	pfam03432, Relaxase, Relaxase/Mobilisation nuclease domain	NA|100aa|up_7|NC_014828.1_666376_666676_-	NA	NA|77aa|up_6|NC_014828.1_666792_667023_+	NA	NA|53aa|up_5|NC_014828.1_667089_667248_-	NA	NA|68aa|up_4|NC_014828.1_667323_667527_+	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|118aa|up_3|NC_014828.1_667734_668088_+	PRK00215, PRK00215, transcriptional repressor LexA	NA|387aa|up_2|NC_014828.1_668393_669554_-	pfam07907, YibE_F, YibE/F-like protein	NA|1532aa|up_1|NC_014828.1_669697_674293_-	cd07399, MPP_YvnB, Bacillus subtilis YvnB and related proteins, metallophosphatase domain	NA|54aa|up_0|NC_014828.1_674563_674725_-	TIGR02224, Tyrosine_recombinase_XerC, tyrosine recombinase XerC	cas2|97aa|down_0|NC_014828.1_693759_694050_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|NC_014828.1_694058_695090_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|222aa|down_2|NC_014828.1_695086_695752_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas7|283aa|down_3|NC_014828.1_695741_696590_-	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8c|656aa|down_4|NC_014828.1_696589_698557_-	cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas5|248aa|down_5|NC_014828.1_698531_699275_-	cd09651, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|815aa|down_6|NC_014828.1_699327_701772_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|418aa|down_7|NC_014828.1_701966_703220_-	PRK15057, PRK15057, UDP-glucose 6-dehydrogenase; Provisional	NA|376aa|down_8|NC_014828.1_703486_704614_+	COG0562, Glf, UDP-galactopyranose mutase [Cell envelope biogenesis, outer membrane]	NA|216aa|down_9|NC_014828.1_704830_705478_+	PRK01362, PRK01362, fructose-6-phosphate aldolase
GCF_000178115.2_ASM17811v2	NC_014828	Ethanoligenens harbinense YUAN-3, complete genome	3	2919357-2919457	3	CRISPRCasFinder	no	csa3	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	Type I-A	AACGTGGGCTGTGCCCACTGCCTGCTTCA	29	0	0	NA	NA	NA	1	1	Orphan	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	NA,NA|441aa|down_5|NC_014828.1_2927797_2929120_+,NA|218aa|down_7|NC_014828.1_2931004_2931658_-	NA|291aa|up_9|NC_014828.1_2908735_2909608_+	cd02431, Ferritin_CCC1_C, CCC1-related domain of ferritin	NA|115aa|up_8|NC_014828.1_2909884_2910229_+	PTZ00397, PTZ00397, macrophage migration inhibition factor-like protein; Provisional	NA|413aa|up_7|NC_014828.1_2910207_2911446_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|176aa|up_6|NC_014828.1_2911570_2912098_+	TIGR04002, TIGR04002_family_protein, TIGR04002 family protein	NA|713aa|up_5|NC_014828.1_2912285_2914424_-	cd07548, P-type_ATPase-Cd_Zn_Co_like, P-type heavy metal-transporting ATPase, similar to Bacillus subtilis CadA which appears to transport cadmium, zinc and cobalt but not copper out of the cell	csa3|126aa|up_4|NC_014828.1_2914430_2914808_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|257aa|up_3|NC_014828.1_2915108_2915879_-	pfam00872, Transposase_mut, Transposase, Mutator family	NA|238aa|up_2|NC_014828.1_2916232_2916946_+	cd00710, LbH_gamma_CA, Gamma carbonic anhydrases (CA): Carbonic anhydrases are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|166aa|up_1|NC_014828.1_2916891_2917389_+	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|433aa|up_0|NC_014828.1_2917892_2919191_+	PRK00062, PRK00062, glutamate-1-semialdehyde 2,1-aminomutase	NA|205aa|down_0|NC_014828.1_2919480_2920095_-	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|1181aa|down_1|NC_014828.1_2920199_2923742_-	TIGR02176, pyruvate_flavodoxin/ferrodoxin_oxidoreductase, pyruvate:ferredoxin (flavodoxin) oxidoreductase, homodimeric	NA|456aa|down_2|NC_014828.1_2924172_2925540_+	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|257aa|down_3|NC_014828.1_2925569_2926340_+	cd09087, Ape1-like_AP-endo, Human Ape1-like subfamily of the ExoIII family apurinic/apyrimidinic (AP) endonucleases	NA|349aa|down_4|NC_014828.1_2926703_2927750_+	COG4632, EpsL, Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase [Carbohydrate transport and metabolism]	NA|441aa|down_5|NC_014828.1_2927797_2929120_+	NA	NA|574aa|down_6|NC_014828.1_2929194_2930916_-	cd05799, PGM2, This CD includes PGM2 (phosphoglucomutase 2) and PGM2L1 (phosphoglucomutase 2-like 1)	NA|218aa|down_7|NC_014828.1_2931004_2931658_-	NA	NA|99aa|down_8|NC_014828.1_2931732_2932029_-	pfam12637, TSCPD, TSCPD domain	NA|296aa|down_9|NC_014828.1_2932073_2932961_-	cd07438, PHP_HisPPase_AMP, Polymerase and Histidinol Phosphatase domain of Histidinol phosphate phosphatase (HisPPase) AMP bound
