assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000348765.2_ASM34876v2	NC_020814	Hydrogenobaculum sp. SN, complete sequence	1	199499-199597	1	CRISPRCasFinder	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	AAATGCGGACGTTACCGCTCTATT	24	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|152aa|up_5|NC_020814.1_195181_195637_+,NA|76aa|up_0|NC_020814.1_198930_199158_-,NA|207aa|down_0|NC_020814.1_199840_200461_-,NA|65aa|down_1|NC_020814.1_200472_200667_-,NA|193aa|down_2|NC_020814.1_200712_201291_-,NA|138aa|down_9|NC_020814.1_206103_206517_+	NA|155aa|up_9|NC_020814.1_190627_191092_+	COG1905, NuoE, NADH:ubiquinone oxidoreductase 24 kD subunit [Energy production and conversion]	NA|426aa|up_8|NC_020814.1_191066_192344_+	COG1894, NuoF, NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [Energy production and conversion]	NA|555aa|up_7|NC_020814.1_192360_194025_+	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|377aa|up_6|NC_020814.1_194034_195165_+	PRK00112, tgt, queuine tRNA-ribosyltransferase; Provisional	NA|152aa|up_5|NC_020814.1_195181_195637_+	NA	NA|218aa|up_4|NC_020814.1_195712_196366_-	TIGR01093, 3-dehydroquinate_dehydratase, 3-dehydroquinate dehydratase, type I	NA|386aa|up_3|NC_020814.1_196365_197523_-	PRK05382, PRK05382, chorismate synthase; Validated	NA|171aa|up_2|NC_020814.1_197859_198372_-	cd04645, LbH_gamma_CA_like, Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E	NA|149aa|up_1|NC_020814.1_198397_198844_-	cd00851, MTH1175, This uncharacterized conserved protein belongs to a family of iron-molybdenum cluster-binding proteins that includes NifX, NifB, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme	NA|76aa|up_0|NC_020814.1_198930_199158_-	NA	NA|207aa|down_0|NC_020814.1_199840_200461_-	NA	NA|65aa|down_1|NC_020814.1_200472_200667_-	NA	NA|193aa|down_2|NC_020814.1_200712_201291_-	NA	NA|311aa|down_3|NC_020814.1_201585_202518_+	TIGR02197, heptose_epim, ADP-L-glycero-D-manno-heptose-6-epimerase	NA|322aa|down_4|NC_020814.1_202514_203480_+	TIGR01138, Cysteine_synthase_B, cysteine synthase B	NA|172aa|down_5|NC_020814.1_203481_203997_+	pfam02620, DUF177, Uncharacterized ACR, COG1399	NA|61aa|down_6|NC_020814.1_203977_204160_+	PRK12286, rpmF, 50S ribosomal protein L32; Reviewed	NA|340aa|down_7|NC_020814.1_204171_205191_+	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|304aa|down_8|NC_020814.1_205190_206102_+	PRK09352, PRK09352, beta-ketoacyl-ACP synthase 3	NA|138aa|down_9|NC_020814.1_206103_206517_+	NA
GCF_000348765.2_ASM34876v2	NC_020814	Hydrogenobaculum sp. SN, complete sequence	2	340547-340846	1,2,1	CRT,CRISPRCasFinder,PILER-CR	no	cas6,cas4,cas1,csm3gr7,DEDDh	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	ANGTTTTNTNTGTNCCTATAGGGGATTGAAAC,GTTTTTTGTGTACCTATAGGGGATTGAAAC,GGTTTTTTGTGTACCTATAGGGGATTGAAACG	32,30,32	0	0	NA	NA	NA:NA:NA	4,4,2	4	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|139aa|up_7|NC_020814.1_332883_333300_+,NA|400aa|down_8|NC_020814.1_350194_351394_-	NA|117aa|up_9|NC_020814.1_331632_331983_+	cd06664, IscU_like, Iron-sulfur cluster scaffold-like proteins	NA|309aa|up_8|NC_020814.1_331960_332887_+	pfam04463, DUF523, Protein of unknown function (DUF523)	NA|139aa|up_7|NC_020814.1_332883_333300_+	NA	NA|536aa|up_6|NC_020814.1_333309_334917_+	PRK01611, argS, arginyl-tRNA synthetase; Reviewed	NA|244aa|up_5|NC_020814.1_334989_335721_+	cd13519, PBP2_PEB3_AcfC, Ligand-binding domain of a glycoprotein adhesion and an accessory colonization factor, a member of the type 2 periplasmic binding fold superfamily	NA|230aa|up_4|NC_020814.1_335883_336573_-	cd18669, M20_18_42, M20, M18 and M42 Zn-peptidases include aminopeptidases and carboxypeptidases	NA|512aa|up_3|NC_020814.1_336813_338349_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	cas6|175aa|up_2|NC_020814.1_338422_338947_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas6|72aa|up_1|NC_020814.1_338943_339159_+	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|166aa|up_0|NC_020814.1_339163_339661_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	NA|411aa|down_0|NC_020814.1_341008_342241_-	pfam00872, Transposase_mut, Transposase, Mutator family	csm3gr7|440aa|down_1|NC_020814.1_342268_343588_+	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	NA|203aa|down_2|NC_020814.1_344116_344725_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	DEDDh|197aa|down_3|NC_020814.1_344744_345335_-	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	NA|317aa|down_4|NC_020814.1_345348_346299_-	pfam09312, SurA_N, SurA N-terminal domain	NA|422aa|down_5|NC_020814.1_347526_348792_-	cd18773, PDC1_HK_sensor, first PDC (PhoQ/DcuS/CitA) domain of methyl-accepting chemotaxis proteins, diguanylate-cyclase and similar domains	NA|190aa|down_6|NC_020814.1_348778_349348_-	cd02165, NMNAT, Nicotinamide/nicotinate mononucleotide adenylyltransferase	NA|268aa|down_7|NC_020814.1_349338_350142_-	pfam01904, DUF72, Protein of unknown function DUF72	NA|400aa|down_8|NC_020814.1_350194_351394_-	NA	NA|326aa|down_9|NC_020814.1_351374_352352_-	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000348765.2_ASM34876v2	NC_020814	Hydrogenobaculum sp. SN, complete sequence	3	346954-347117	3,2	CRISPRCasFinder,PILER-CR	no	cas6,cas4,cas1,csm3gr7,DEDDh	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	GAGTTTCATCTGAACCGTGTGGGTTAAGAA,GAGTTTCATCTGAACCGTGTGGGTTAAGAAGC	30,32	0	0	NA	NA	NA:NA	2,2	2	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA,NA|400aa|down_3|NC_020814.1_350194_351394_-	NA|512aa|up_9|NC_020814.1_336813_338349_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	cas6|175aa|up_8|NC_020814.1_338422_338947_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas6|72aa|up_7|NC_020814.1_338943_339159_+	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|166aa|up_6|NC_020814.1_339163_339661_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|301aa|up_5|NC_020814.1_339664_340567_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	NA|411aa|up_4|NC_020814.1_341008_342241_-	pfam00872, Transposase_mut, Transposase, Mutator family	csm3gr7|440aa|up_3|NC_020814.1_342268_343588_+	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	NA|203aa|up_2|NC_020814.1_344116_344725_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	DEDDh|197aa|up_1|NC_020814.1_344744_345335_-	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	NA|317aa|up_0|NC_020814.1_345348_346299_-	pfam09312, SurA_N, SurA N-terminal domain	NA|422aa|down_0|NC_020814.1_347526_348792_-	cd18773, PDC1_HK_sensor, first PDC (PhoQ/DcuS/CitA) domain of methyl-accepting chemotaxis proteins, diguanylate-cyclase and similar domains	NA|190aa|down_1|NC_020814.1_348778_349348_-	cd02165, NMNAT, Nicotinamide/nicotinate mononucleotide adenylyltransferase	NA|268aa|down_2|NC_020814.1_349338_350142_-	pfam01904, DUF72, Protein of unknown function DUF72	NA|400aa|down_3|NC_020814.1_350194_351394_-	NA	NA|326aa|down_4|NC_020814.1_351374_352352_-	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|620aa|down_5|NC_020814.1_352344_354204_-	COG1034, NuoG, NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) [Energy production and conversion]	NA|225aa|down_6|NC_020814.1_354193_354868_-	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	NA|284aa|down_7|NC_020814.1_354860_355712_-	COG2445, COG2445, Uncharacterized conserved protein [Function unknown]	NA|611aa|down_8|NC_020814.1_355756_357589_-	PRK05192, PRK05192, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis enzyme MnmG	NA|175aa|down_9|NC_020814.1_357594_358119_-	COG2143, COG2143, Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones]
GCF_000348765.2_ASM34876v2	NC_020814	Hydrogenobaculum sp. SN, complete sequence	4	591414-592102	3,4,2	PILER-CR,CRISPRCasFinder,CRT	no	Cas14b_CAS-V-F,cas6,cas7,cas5,cas3,cas4,cas14j	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	GTTTCATCTGAACCGTGTGGGATATAAA,GTTTCATCTGAACCGTGTGGGATATAAAGT,GTTTCATCTGAACCGTGTGGGATATAAA	28,30,28	0	0	NA	NA	NA:NA:NA	10,10,10	10	TypeV	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|116aa|up_9|NC_020814.1_582509_582857_-,NA|49aa|up_8|NC_020814.1_582863_583010_-,NA|54aa|up_7|NC_020814.1_583011_583173_-,NA|144aa|up_0|NC_020814.1_588980_589412_-,NA|53aa|down_6|NC_020814.1_599461_599620_+	NA|116aa|up_9|NC_020814.1_582509_582857_-	NA	NA|49aa|up_8|NC_020814.1_582863_583010_-	NA	NA|54aa|up_7|NC_020814.1_583011_583173_-	NA	NA|453aa|up_6|NC_020814.1_583469_584828_-	sd00006, TPR, Tetratricopeptide repeat	NA|277aa|up_5|NC_020814.1_584865_585696_-	TIGR02163, Ferredoxin-type_protein_NapH_homolog, ferredoxin-type protein, NapH/MauN family	NA|133aa|up_4|NC_020814.1_585790_586189_-	pfam09969, DUF2203, Uncharacterized conserved protein (DUF2203)	NA|305aa|up_3|NC_020814.1_586236_587151_-	cd03789, GT9_LPS_heptosyltransferase, lipopolysaccharide heptosyltransferase and similar proteins	NA|74aa|up_2|NC_020814.1_587143_587365_-	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|465aa|up_1|NC_020814.1_587361_588756_-	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|144aa|up_0|NC_020814.1_588980_589412_-	NA	cas6|266aa|down_0|NC_020814.1_592393_593191_+	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|556aa|down_1|NC_020814.1_593187_594855_+	pfam09706, Cas_CXXC_CXXC, CRISPR-associated protein (Cas_CXXC_CXXC)	cas7|329aa|down_2|NC_020814.1_594892_595879_+	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	cas5|224aa|down_3|NC_020814.1_595886_596558_+	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas3|764aa|down_4|NC_020814.1_596545_598837_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas4|72aa|down_5|NC_020814.1_598833_599049_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	NA|53aa|down_6|NC_020814.1_599461_599620_+	NA	cas14j|407aa|down_7|NC_020814.1_599734_600955_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|260aa|down_8|NC_020814.1_600963_601743_+	smart00756, VKc, Family of likely enzymes that includes the catalytic subunit of vitamin K epoxide reductase	NA|983aa|down_9|NC_020814.1_601823_604772_+	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB
GCF_000348765.2_ASM34876v2	NC_020814	Hydrogenobaculum sp. SN, complete sequence	5	928122-929267	5,3,4	CRISPRCasFinder,CRT,PILER-CR	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	CTTCTTAACCCACACGGTTCAGATGAAAC,CTTCTTAACCCACACGGTTCAGATGAAAC,GTTTCATCTGAACCGTGTGGGTTAAGAAG	29,29,29	2	2	929072-929107|929073-929108	NC_020814.1_1338011-1338046|NC_020814.1_1338011-1338046	NA:NA:NA	17,17,17	17	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA,NA|102aa|down_2|NC_020814.1_932832_933138_-,NA|59aa|down_6|NC_020814.1_937935_938112_-	NA|332aa|up_9|NC_020814.1_919817_920813_+	TIGR00433, biotin_synthase, biotin synthase	NA|148aa|up_8|NC_020814.1_920803_921247_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|134aa|up_7|NC_020814.1_921251_921653_+	pfam01894, UPF0047, Uncharacterized protein family UPF0047	NA|439aa|up_6|NC_020814.1_921649_922966_+	TIGR00054, Putative_zinc_metalloprotease_slr1821, RIP metalloprotease RseP	NA|313aa|up_5|NC_020814.1_922983_923922_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|566aa|up_4|NC_020814.1_923893_925591_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|167aa|up_3|NC_020814.1_925583_926084_-	pfam14385, DUF4416, Domain of unknown function (DUF4416)	NA|335aa|up_2|NC_020814.1_926080_927085_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|147aa|up_1|NC_020814.1_927069_927510_-	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|145aa|up_0|NC_020814.1_927522_927957_-	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|320aa|down_0|NC_020814.1_929853_930813_-	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|344aa|down_1|NC_020814.1_931804_932836_-	COG0787, Alr, Alanine racemase [Cell envelope biogenesis, outer membrane]	NA|102aa|down_2|NC_020814.1_932832_933138_-	NA	NA|262aa|down_3|NC_020814.1_933130_933916_-	COG3494, COG3494, Uncharacterized protein conserved in bacteria [Function unknown]	NA|441aa|down_4|NC_020814.1_933931_935254_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|843aa|down_5|NC_020814.1_935260_937789_-	PRK05306, infB, translation initiation factor IF-2; Validated	NA|59aa|down_6|NC_020814.1_937935_938112_-	NA	NA|503aa|down_7|NC_020814.1_938242_939751_-	PRK09281, PRK09281, F0F1 ATP synthase subunit alpha; Validated	NA|183aa|down_8|NC_020814.1_939857_940406_-	COG0712, AtpH, F0F1-type ATP synthase, delta subunit (mitochondrial oligomycin sensitivity protein) [Energy production and conversion]	NA|161aa|down_9|NC_020814.1_940402_940885_-	COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production and conversion]
GCF_000348765.2_ASM34876v2	NC_020814	Hydrogenobaculum sp. SN, complete sequence	6	930862-931286	5,6,4	PILER-CR,CRISPRCasFinder,CRT	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	GTTTCATCTGAACCGTGTGGGTTAAGAAG,CTTCTTAACCCACACGGTTCAGATGAAAC,CTTCTTAACCCACACGGTTCAGATGAAAC	29,29,29	0	0	NA	NA	NA:NA:NA	6,6,6	6	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA,NA|102aa|down_1|NC_020814.1_932832_933138_-,NA|59aa|down_5|NC_020814.1_937935_938112_-	NA|148aa|up_9|NC_020814.1_920803_921247_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|134aa|up_8|NC_020814.1_921251_921653_+	pfam01894, UPF0047, Uncharacterized protein family UPF0047	NA|439aa|up_7|NC_020814.1_921649_922966_+	TIGR00054, Putative_zinc_metalloprotease_slr1821, RIP metalloprotease RseP	NA|313aa|up_6|NC_020814.1_922983_923922_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|566aa|up_5|NC_020814.1_923893_925591_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|167aa|up_4|NC_020814.1_925583_926084_-	pfam14385, DUF4416, Domain of unknown function (DUF4416)	NA|335aa|up_3|NC_020814.1_926080_927085_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|147aa|up_2|NC_020814.1_927069_927510_-	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|145aa|up_1|NC_020814.1_927522_927957_-	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|320aa|up_0|NC_020814.1_929853_930813_-	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|344aa|down_0|NC_020814.1_931804_932836_-	COG0787, Alr, Alanine racemase [Cell envelope biogenesis, outer membrane]	NA|102aa|down_1|NC_020814.1_932832_933138_-	NA	NA|262aa|down_2|NC_020814.1_933130_933916_-	COG3494, COG3494, Uncharacterized protein conserved in bacteria [Function unknown]	NA|441aa|down_3|NC_020814.1_933931_935254_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|843aa|down_4|NC_020814.1_935260_937789_-	PRK05306, infB, translation initiation factor IF-2; Validated	NA|59aa|down_5|NC_020814.1_937935_938112_-	NA	NA|503aa|down_6|NC_020814.1_938242_939751_-	PRK09281, PRK09281, F0F1 ATP synthase subunit alpha; Validated	NA|183aa|down_7|NC_020814.1_939857_940406_-	COG0712, AtpH, F0F1-type ATP synthase, delta subunit (mitochondrial oligomycin sensitivity protein) [Energy production and conversion]	NA|161aa|down_8|NC_020814.1_940402_940885_-	COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production and conversion]	NA|143aa|down_9|NC_020814.1_940889_941318_-	cd06503, ATP-synt_Fo_b, F-type ATP synthase, membrane subunit b
GCF_000348765.2_ASM34876v2	NC_020814	Hydrogenobaculum sp. SN, complete sequence	7	1024045-1024541	7,5,6	CRISPRCasFinder,CRT,PILER-CR	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	GTTTCAATCCCCTATAGGTACAAACAAAAC,GTTTCAATCCCCTATAGGTACAAACAAAAC,GTTTTGTTTGTACCTATAGGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	7,7,6	7	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|129aa|up_4|NC_020814.1_1019022_1019409_-,NA|200aa|up_2|NC_020814.1_1021068_1021668_-,NA|140aa|down_8|NC_020814.1_1032116_1032536_-	NA|530aa|up_9|NC_020814.1_1012173_1013763_-	COG1009, NuoL, NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit [Energy production and conversion / Inorganic ion transport and metabolism]	NA|234aa|up_8|NC_020814.1_1014105_1014807_-	PRK09362, PRK09362, phosphoribosylaminoimidazole-succinocarboxamide synthase; Reviewed	NA|194aa|up_7|NC_020814.1_1014803_1015385_-	pfam13174, TPR_6, Tetratricopeptide repeat	NA|94aa|up_6|NC_020814.1_1015371_1015653_-	pfam01649, Ribosomal_S20p, Ribosomal protein S20	NA|1088aa|up_5|NC_020814.1_1015762_1019026_-	cd18808, SF1_C_Upf1, C-terminal helicase domain of Upf1-like family helicases	NA|129aa|up_4|NC_020814.1_1019022_1019409_-	NA	NA|328aa|up_3|NC_020814.1_1019823_1020807_+	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|200aa|up_2|NC_020814.1_1021068_1021668_-	NA	NA|217aa|up_1|NC_020814.1_1021794_1022445_-	sd00010, SLR, Sel1-like repeat	NA|316aa|up_0|NC_020814.1_1022986_1023934_-	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|555aa|down_0|NC_020814.1_1024933_1026598_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|242aa|down_1|NC_020814.1_1026667_1027393_-	pfam00902, TatC, Sec-independent protein translocase protein (TatC)	NA|93aa|down_2|NC_020814.1_1027355_1027634_-	COG1826, TatA, Sec-independent protein secretion pathway components [Intracellular trafficking and secretion]	NA|273aa|down_3|NC_020814.1_1027624_1028443_-	pfam06750, DiS_P_DiS, Bacterial Peptidase A24 N-terminal domain	NA|501aa|down_4|NC_020814.1_1028439_1029942_-	PRK05812, secD, preprotein translocase subunit SecD; Reviewed	NA|201aa|down_5|NC_020814.1_1029944_1030547_-	TIGR00741, Probable_sigma54_modulation_protein_ORF3_ORF95	NA|252aa|down_6|NC_020814.1_1030550_1031306_-	TIGR01352, Protein_TonB, TonB family C-terminal domain	NA|258aa|down_7|NC_020814.1_1031341_1032115_-	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|140aa|down_8|NC_020814.1_1032116_1032536_-	NA	NA|408aa|down_9|NC_020814.1_1032532_1033756_-	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]
GCF_000348765.2_ASM34876v2	NC_020814	Hydrogenobaculum sp. SN, complete sequence	8	1088919-1092251	7,8,6	PILER-CR,CRISPRCasFinder,CRT	no	cas7,cas5,cas8b2,cas3,cas4,cas1,cas2,TnpB_regular.1	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	GTTTCTAATTAACCGTGTGGAGTTGAAAG,GTTTCTAATTAACCGTGTGGAGTTGAAAG,GTTTCTAATTAACCGTGTGGAGTTGAAAG	29,29,29	1	1	1088948-1088986	NC_020814.1_578092-578130	NA:NA:NA	50,50,50	50	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|95aa|up_9|NC_020814.1_1079674_1079959_+,NA|64aa|up_0|NC_020814.1_1088492_1088684_-,NA|143aa|down_1|NC_020814.1_1093035_1093464_-,NA|322aa|down_2|NC_020814.1_1093445_1094411_-	NA|95aa|up_9|NC_020814.1_1079674_1079959_+	NA	NA|188aa|up_8|NC_020814.1_1079958_1080522_+	cd00564, TMP_TenI, Thiamine monophosphate synthase (TMP synthase)/TenI	cas7|329aa|up_7|NC_020814.1_1080567_1081554_+	TIGR01875, CRISPR-associated_protein_Cas7/Cst2/DevR, CRISPR-associated autoregulator DevR family	cas5|524aa|up_6|NC_020814.1_1081540_1083112_+	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas8b2|220aa|up_5|NC_020814.1_1083102_1083762_+	cd09665, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas3|736aa|up_4|NC_020814.1_1083733_1085941_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas4|171aa|up_3|NC_020814.1_1085942_1086455_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|318aa|up_2|NC_020814.1_1086447_1087401_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|94aa|up_1|NC_020814.1_1087397_1087679_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|64aa|up_0|NC_020814.1_1088492_1088684_-	NA	NA|262aa|down_0|NC_020814.1_1092253_1093039_-	COG3031, PulC, Type II secretory pathway, component PulC [Intracellular trafficking and secretion]	NA|143aa|down_1|NC_020814.1_1093035_1093464_-	NA	NA|322aa|down_2|NC_020814.1_1093445_1094411_-	NA	NA|129aa|down_3|NC_020814.1_1094411_1094798_-	PRK13258, PRK13258, 7-cyano-7-deazaguanine reductase; Provisional	TnpB_regular.1|485aa|down_4|NC_020814.1_1094988_1096443_+	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|582aa|down_5|NC_020814.1_1096486_1098232_+	cd08500, PBP2_NikA_DppA_OppA_like_4, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|516aa|down_6|NC_020814.1_1098331_1099879_+	PRK00915, PRK00915, 2-isopropylmalate synthase; Validated	NA|274aa|down_7|NC_020814.1_1099875_1100697_-	PRK00258, aroE, shikimate 5-dehydrogenase; Reviewed	NA|360aa|down_8|NC_020814.1_1100693_1101773_-	PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional	NA|188aa|down_9|NC_020814.1_1101773_1102337_-	pfam09936, Methyltrn_RNA_4, SAM-dependent RNA methyltransferase
GCF_000348765.2_ASM34876v2	NC_020814	Hydrogenobaculum sp. SN, complete sequence	9	1151862-1151968	9	CRISPRCasFinder	no	cas3	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	AGAAATGCCGTTGGCACACTAGAGCG	26	0	0	NA	NA	NA	1	1	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|115aa|up_5|NC_020814.1_1147548_1147893_-,NA|120aa|down_8|NC_020814.1_1159241_1159601_+,NA|94aa|down_9|NC_020814.1_1159584_1159866_+	NA|287aa|up_9|NC_020814.1_1142826_1143687_+	PRK00072, hemC, porphobilinogen deaminase; Reviewed	NA|348aa|up_8|NC_020814.1_1143695_1144739_+	cd07432, PHP_HisPPase, Polymerase and Histidinol Phosphatase domain of Histidinol phosphate phosphatase	NA|455aa|up_7|NC_020814.1_1144730_1146095_-	COG0084, TatD, Mg-dependent DNase [DNA replication, recombination, and repair]	NA|454aa|up_6|NC_020814.1_1146147_1147509_-	PRK06292, PRK06292, dihydrolipoamide dehydrogenase; Validated	NA|115aa|up_5|NC_020814.1_1147548_1147893_-	NA	NA|380aa|up_4|NC_020814.1_1147909_1149049_-	pfam13975, gag-asp_proteas, gag-polyprotein putative aspartyl protease	NA|259aa|up_3|NC_020814.1_1149069_1149846_-	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]	NA|311aa|up_2|NC_020814.1_1149842_1150775_-	PRK07259, PRK07259, dihydroorotate dehydrogenase	NA|115aa|up_1|NC_020814.1_1151068_1151413_-	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|110aa|up_0|NC_020814.1_1151409_1151739_-	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|566aa|down_0|NC_020814.1_1152010_1153708_-	PRK14667, uvrC, excinuclease ABC subunit C; Provisional	NA|358aa|down_1|NC_020814.1_1153734_1154808_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|223aa|down_2|NC_020814.1_1154788_1155457_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|263aa|down_3|NC_020814.1_1155453_1156242_-	PRK13111, trpA, tryptophan synthase subunit alpha; Provisional	NA|183aa|down_4|NC_020814.1_1156238_1156787_-	PRK00083, frr, ribosome recycling factor; Reviewed	NA|90aa|down_5|NC_020814.1_1156869_1157139_+	pfam17209, Hfq, Hfq protein	NA|240aa|down_6|NC_020814.1_1157178_1157898_+	pfam01255, Prenyltransf, Putative undecaprenyl diphosphate synthase	NA|426aa|down_7|NC_020814.1_1158001_1159279_+	PRK00077, eno, enolase; Provisional	NA|120aa|down_8|NC_020814.1_1159241_1159601_+	NA	NA|94aa|down_9|NC_020814.1_1159584_1159866_+	NA
GCF_000348765.2_ASM34876v2	NC_020814	Hydrogenobaculum sp. SN, complete sequence	10	1215274-1215375	10	CRISPRCasFinder	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	GAAATGCGATTGCATCGCTCTAGT	24	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|120aa|up_7|NC_020814.1_1203773_1204133_+,NA|408aa|up_6|NC_020814.1_1204176_1205400_+,NA|69aa|down_3|NC_020814.1_1219102_1219309_+	NA|87aa|up_9|NC_020814.1_1202138_1202399_-	pfam01381, HTH_3, Helix-turn-helix	NA|396aa|up_8|NC_020814.1_1202409_1203597_-	pfam07804, HipA_C, HipA-like C-terminal domain	NA|120aa|up_7|NC_020814.1_1203773_1204133_+	NA	NA|408aa|up_6|NC_020814.1_1204176_1205400_+	NA	NA|165aa|up_5|NC_020814.1_1205485_1205980_-	COG3260, COG3260, Ni,Fe-hydrogenase III small subunit [Energy production and conversion]	NA|479aa|up_4|NC_020814.1_1205989_1207426_-	COG3261, HycE, Ni,Fe-hydrogenase III large subunit [Energy production and conversion]	NA|481aa|up_3|NC_020814.1_1207422_1208865_-	PRK06458, PRK06458, hydrogenase 4 subunit F; Validated	NA|223aa|up_2|NC_020814.1_1208861_1209530_-	COG4237, HyfE, Hydrogenase 4 membrane component (E) [Energy production and conversion]	NA|306aa|up_1|NC_020814.1_1209539_1210457_-	COG0650, HyfC, Formate hydrogenlyase subunit 4 [Energy production and conversion]	NA|622aa|up_0|NC_020814.1_1210449_1212315_-	PRK06521, PRK06521, hydrogenase 4 subunit B; Validated	NA|273aa|down_0|NC_020814.1_1215576_1216395_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|182aa|down_1|NC_020814.1_1216391_1216937_-	PRK05456, PRK05456, ATP-dependent protease subunit HslV	NA|655aa|down_2|NC_020814.1_1216927_1218892_-	COG0021, TktA, Transketolase [Carbohydrate transport and metabolism]	NA|69aa|down_3|NC_020814.1_1219102_1219309_+	NA	NA|100aa|down_4|NC_020814.1_1219532_1219832_+	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|308aa|down_5|NC_020814.1_1219818_1220742_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|279aa|down_6|NC_020814.1_1220753_1221590_-	pfam04454, Linocin_M18, Encapsulating protein for peroxidase	NA|515aa|down_7|NC_020814.1_1221643_1223188_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	NA|175aa|down_8|NC_020814.1_1225337_1225862_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|354aa|down_9|NC_020814.1_1225896_1226958_-	pfam07680, DoxA, TQO small subunit DoxA
