assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000213785.1_ASM21378v1	NC_015557	Hydrogenobaculum sp. 3684, complete sequence	1	199502-199600	1	CRISPRCasFinder	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	AAATGCGGACGTTACCGCTCTATT	24	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|152aa|up_5|NC_015557.1_195184_195640_+,NA|76aa|up_0|NC_015557.1_198933_199161_-,NA|207aa|down_0|NC_015557.1_199843_200464_-,NA|65aa|down_1|NC_015557.1_200475_200670_-,NA|193aa|down_2|NC_015557.1_200715_201294_-,NA|138aa|down_9|NC_015557.1_206106_206520_+	NA|155aa|up_9|NC_015557.1_190630_191095_+	COG1905, NuoE, NADH:ubiquinone oxidoreductase 24 kD subunit [Energy production and conversion]	NA|426aa|up_8|NC_015557.1_191069_192347_+	COG1894, NuoF, NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [Energy production and conversion]	NA|555aa|up_7|NC_015557.1_192363_194028_+	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|377aa|up_6|NC_015557.1_194037_195168_+	PRK00112, tgt, queuine tRNA-ribosyltransferase; Provisional	NA|152aa|up_5|NC_015557.1_195184_195640_+	NA	NA|218aa|up_4|NC_015557.1_195715_196369_-	TIGR01093, 3-dehydroquinate_dehydratase, 3-dehydroquinate dehydratase, type I	NA|386aa|up_3|NC_015557.1_196368_197526_-	PRK05382, PRK05382, chorismate synthase; Validated	NA|171aa|up_2|NC_015557.1_197862_198375_-	cd04645, LbH_gamma_CA_like, Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E	NA|149aa|up_1|NC_015557.1_198400_198847_-	cd00851, MTH1175, This uncharacterized conserved protein belongs to a family of iron-molybdenum cluster-binding proteins that includes NifX, NifB, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme	NA|76aa|up_0|NC_015557.1_198933_199161_-	NA	NA|207aa|down_0|NC_015557.1_199843_200464_-	NA	NA|65aa|down_1|NC_015557.1_200475_200670_-	NA	NA|193aa|down_2|NC_015557.1_200715_201294_-	NA	NA|311aa|down_3|NC_015557.1_201588_202521_+	TIGR02197, heptose_epim, ADP-L-glycero-D-manno-heptose-6-epimerase	NA|322aa|down_4|NC_015557.1_202517_203483_+	TIGR01138, Cysteine_synthase_B, cysteine synthase B	NA|172aa|down_5|NC_015557.1_203484_204000_+	pfam02620, DUF177, Uncharacterized ACR, COG1399	NA|61aa|down_6|NC_015557.1_203980_204163_+	PRK12286, rpmF, 50S ribosomal protein L32; Reviewed	NA|340aa|down_7|NC_015557.1_204174_205194_+	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|304aa|down_8|NC_015557.1_205193_206105_+	PRK09352, PRK09352, beta-ketoacyl-ACP synthase 3	NA|138aa|down_9|NC_015557.1_206106_206520_+	NA
GCF_000213785.1_ASM21378v1	NC_015557	Hydrogenobaculum sp. 3684, complete sequence	2	340552-340851	1,2,1	CRT,CRISPRCasFinder,PILER-CR	no	cas6,cas4,cas1,csm3gr7,DEDDh	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	ANGTTTTNTNTGTNCCTATAGGGGATTGAAAC,GTTTTTTGTGTACCTATAGGGGATTGAAAC,GGTTTTTTGTGTACCTATAGGGGATTGAAACG	32,30,32	0	0	NA	NA	NA:NA:NA	4,4,2	4	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|139aa|up_7|NC_015557.1_332888_333305_+,NA|400aa|down_7|NC_015557.1_350200_351400_-	NA|117aa|up_9|NC_015557.1_331637_331988_+	cd06664, IscU_like, Iron-sulfur cluster scaffold-like proteins	NA|309aa|up_8|NC_015557.1_331965_332892_+	pfam04463, DUF523, Protein of unknown function (DUF523)	NA|139aa|up_7|NC_015557.1_332888_333305_+	NA	NA|536aa|up_6|NC_015557.1_333314_334922_+	PRK01611, argS, arginyl-tRNA synthetase; Reviewed	NA|244aa|up_5|NC_015557.1_334994_335726_+	cd13519, PBP2_PEB3_AcfC, Ligand-binding domain of a glycoprotein adhesion and an accessory colonization factor, a member of the type 2 periplasmic binding fold superfamily	NA|230aa|up_4|NC_015557.1_335888_336578_-	cd18669, M20_18_42, M20, M18 and M42 Zn-peptidases include aminopeptidases and carboxypeptidases	NA|512aa|up_3|NC_015557.1_336818_338354_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	cas6|175aa|up_2|NC_015557.1_338427_338952_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas6|72aa|up_1|NC_015557.1_338948_339164_+	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|166aa|up_0|NC_015557.1_339168_339666_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	csm3gr7|440aa|down_0|NC_015557.1_342274_343594_+	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	NA|203aa|down_1|NC_015557.1_344122_344731_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	DEDDh|197aa|down_2|NC_015557.1_344750_345341_-	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	NA|317aa|down_3|NC_015557.1_345354_346305_-	pfam09312, SurA_N, SurA N-terminal domain	NA|422aa|down_4|NC_015557.1_347532_348798_-	cd18773, PDC1_HK_sensor, first PDC (PhoQ/DcuS/CitA) domain of methyl-accepting chemotaxis proteins, diguanylate-cyclase and similar domains	NA|190aa|down_5|NC_015557.1_348784_349354_-	cd02165, NMNAT, Nicotinamide/nicotinate mononucleotide adenylyltransferase	NA|268aa|down_6|NC_015557.1_349344_350148_-	pfam01904, DUF72, Protein of unknown function DUF72	NA|400aa|down_7|NC_015557.1_350200_351400_-	NA	NA|326aa|down_8|NC_015557.1_351380_352358_-	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|620aa|down_9|NC_015557.1_352350_354210_-	COG1034, NuoG, NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) [Energy production and conversion]
GCF_000213785.1_ASM21378v1	NC_015557	Hydrogenobaculum sp. 3684, complete sequence	3	346960-347123	3,2	CRISPRCasFinder,PILER-CR	no	cas6,cas4,cas1,csm3gr7,DEDDh	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	GAGTTTCATCTGAACCGTGTGGGTTAAGAA,GAGTTTCATCTGAACCGTGTGGGTTAAGAAGC	30,32	0	0	NA	NA	NA:NA	2,2	2	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA,NA|400aa|down_3|NC_015557.1_350200_351400_-	NA|230aa|up_9|NC_015557.1_335888_336578_-	cd18669, M20_18_42, M20, M18 and M42 Zn-peptidases include aminopeptidases and carboxypeptidases	NA|512aa|up_8|NC_015557.1_336818_338354_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	cas6|175aa|up_7|NC_015557.1_338427_338952_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas6|72aa|up_6|NC_015557.1_338948_339164_+	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|166aa|up_5|NC_015557.1_339168_339666_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|301aa|up_4|NC_015557.1_339669_340572_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	csm3gr7|440aa|up_3|NC_015557.1_342274_343594_+	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	NA|203aa|up_2|NC_015557.1_344122_344731_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	DEDDh|197aa|up_1|NC_015557.1_344750_345341_-	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	NA|317aa|up_0|NC_015557.1_345354_346305_-	pfam09312, SurA_N, SurA N-terminal domain	NA|422aa|down_0|NC_015557.1_347532_348798_-	cd18773, PDC1_HK_sensor, first PDC (PhoQ/DcuS/CitA) domain of methyl-accepting chemotaxis proteins, diguanylate-cyclase and similar domains	NA|190aa|down_1|NC_015557.1_348784_349354_-	cd02165, NMNAT, Nicotinamide/nicotinate mononucleotide adenylyltransferase	NA|268aa|down_2|NC_015557.1_349344_350148_-	pfam01904, DUF72, Protein of unknown function DUF72	NA|400aa|down_3|NC_015557.1_350200_351400_-	NA	NA|326aa|down_4|NC_015557.1_351380_352358_-	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|620aa|down_5|NC_015557.1_352350_354210_-	COG1034, NuoG, NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) [Energy production and conversion]	NA|225aa|down_6|NC_015557.1_354199_354874_-	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	NA|284aa|down_7|NC_015557.1_354866_355718_-	COG2445, COG2445, Uncharacterized conserved protein [Function unknown]	NA|611aa|down_8|NC_015557.1_355762_357595_-	PRK05192, PRK05192, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis enzyme MnmG	NA|175aa|down_9|NC_015557.1_357600_358125_-	COG2143, COG2143, Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones]
GCF_000213785.1_ASM21378v1	NC_015557	Hydrogenobaculum sp. 3684, complete sequence	4	591419-592107	3,4,2	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas7,cas5,cas3,cas4,cas14j	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	GTTTCATCTGAACCGTGTGGGATATAAA,GTTTCATCTGAACCGTGTGGGATATAAAGT,GTTTCATCTGAACCGTGTGGGATATAAA	28,30,28	0	0	NA	NA	NA:NA:NA	10,10,10	10	TypeV	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|116aa|up_9|NC_015557.1_582514_582862_-,NA|49aa|up_8|NC_015557.1_582868_583015_-,NA|54aa|up_7|NC_015557.1_583016_583178_-,NA|144aa|up_0|NC_015557.1_588985_589417_-,NA|53aa|down_6|NC_015557.1_599466_599625_+	NA|116aa|up_9|NC_015557.1_582514_582862_-	NA	NA|49aa|up_8|NC_015557.1_582868_583015_-	NA	NA|54aa|up_7|NC_015557.1_583016_583178_-	NA	NA|453aa|up_6|NC_015557.1_583474_584833_-	sd00006, TPR, Tetratricopeptide repeat	NA|277aa|up_5|NC_015557.1_584870_585701_-	TIGR02163, Ferredoxin-type_protein_NapH_homolog, ferredoxin-type protein, NapH/MauN family	NA|133aa|up_4|NC_015557.1_585795_586194_-	pfam09969, DUF2203, Uncharacterized conserved protein (DUF2203)	NA|305aa|up_3|NC_015557.1_586241_587156_-	cd03789, GT9_LPS_heptosyltransferase, lipopolysaccharide heptosyltransferase and similar proteins	NA|74aa|up_2|NC_015557.1_587148_587370_-	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|465aa|up_1|NC_015557.1_587366_588761_-	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|144aa|up_0|NC_015557.1_588985_589417_-	NA	cas6|266aa|down_0|NC_015557.1_592398_593196_+	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|556aa|down_1|NC_015557.1_593192_594860_+	pfam09706, Cas_CXXC_CXXC, CRISPR-associated protein (Cas_CXXC_CXXC)	cas7|329aa|down_2|NC_015557.1_594897_595884_+	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	cas5|224aa|down_3|NC_015557.1_595891_596563_+	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas3|764aa|down_4|NC_015557.1_596550_598842_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas4|72aa|down_5|NC_015557.1_598838_599054_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	NA|53aa|down_6|NC_015557.1_599466_599625_+	NA	cas14j|407aa|down_7|NC_015557.1_599739_600960_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|260aa|down_8|NC_015557.1_600968_601748_+	smart00756, VKc, Family of likely enzymes that includes the catalytic subunit of vitamin K epoxide reductase	NA|983aa|down_9|NC_015557.1_601828_604777_+	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB
GCF_000213785.1_ASM21378v1	NC_015557	Hydrogenobaculum sp. 3684, complete sequence	5	927965-929110	5,3,4	CRISPRCasFinder,CRT,PILER-CR	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	CTTCTTAACCCACACGGTTCAGATGAAAC,CTTCTTAACCCACACGGTTCAGATGAAAC,GTTTCATCTGAACCGTGTGGGTTAAGAAG	29,29,29	2	2	928915-928950|928916-928951	NC_015557.1_1337855-1337890|NC_015557.1_1337855-1337890	NA:NA:NA	17,17,17	17	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA,NA|102aa|down_2|NC_015557.1_932675_932981_-,NA|59aa|down_6|NC_015557.1_937778_937955_-	NA|332aa|up_9|NC_015557.1_919660_920656_+	TIGR00433, biotin_synthase, biotin synthase	NA|148aa|up_8|NC_015557.1_920646_921090_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|134aa|up_7|NC_015557.1_921094_921496_+	pfam01894, UPF0047, Uncharacterized protein family UPF0047	NA|439aa|up_6|NC_015557.1_921492_922809_+	TIGR00054, Putative_zinc_metalloprotease_slr1821, RIP metalloprotease RseP	NA|313aa|up_5|NC_015557.1_922826_923765_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|566aa|up_4|NC_015557.1_923736_925434_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|167aa|up_3|NC_015557.1_925426_925927_-	pfam14385, DUF4416, Domain of unknown function (DUF4416)	NA|335aa|up_2|NC_015557.1_925923_926928_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|147aa|up_1|NC_015557.1_926912_927353_-	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|145aa|up_0|NC_015557.1_927365_927800_-	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|320aa|down_0|NC_015557.1_929696_930656_-	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|344aa|down_1|NC_015557.1_931647_932679_-	COG0787, Alr, Alanine racemase [Cell envelope biogenesis, outer membrane]	NA|102aa|down_2|NC_015557.1_932675_932981_-	NA	NA|262aa|down_3|NC_015557.1_932973_933759_-	COG3494, COG3494, Uncharacterized protein conserved in bacteria [Function unknown]	NA|441aa|down_4|NC_015557.1_933774_935097_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|843aa|down_5|NC_015557.1_935103_937632_-	PRK05306, infB, translation initiation factor IF-2; Validated	NA|59aa|down_6|NC_015557.1_937778_937955_-	NA	NA|503aa|down_7|NC_015557.1_938085_939594_-	PRK09281, PRK09281, F0F1 ATP synthase subunit alpha; Validated	NA|183aa|down_8|NC_015557.1_939700_940249_-	COG0712, AtpH, F0F1-type ATP synthase, delta subunit (mitochondrial oligomycin sensitivity protein) [Energy production and conversion]	NA|161aa|down_9|NC_015557.1_940245_940728_-	COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production and conversion]
GCF_000213785.1_ASM21378v1	NC_015557	Hydrogenobaculum sp. 3684, complete sequence	6	930705-931129	5,6,4	PILER-CR,CRISPRCasFinder,CRT	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	GTTTCATCTGAACCGTGTGGGTTAAGAAG,CTTCTTAACCCACACGGTTCAGATGAAAC,CTTCTTAACCCACACGGTTCAGATGAAAC	29,29,29	0	0	NA	NA	NA:NA:NA	6,6,6	6	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA,NA|102aa|down_1|NC_015557.1_932675_932981_-,NA|59aa|down_5|NC_015557.1_937778_937955_-	NA|148aa|up_9|NC_015557.1_920646_921090_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|134aa|up_8|NC_015557.1_921094_921496_+	pfam01894, UPF0047, Uncharacterized protein family UPF0047	NA|439aa|up_7|NC_015557.1_921492_922809_+	TIGR00054, Putative_zinc_metalloprotease_slr1821, RIP metalloprotease RseP	NA|313aa|up_6|NC_015557.1_922826_923765_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|566aa|up_5|NC_015557.1_923736_925434_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|167aa|up_4|NC_015557.1_925426_925927_-	pfam14385, DUF4416, Domain of unknown function (DUF4416)	NA|335aa|up_3|NC_015557.1_925923_926928_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|147aa|up_2|NC_015557.1_926912_927353_-	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|145aa|up_1|NC_015557.1_927365_927800_-	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|320aa|up_0|NC_015557.1_929696_930656_-	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|344aa|down_0|NC_015557.1_931647_932679_-	COG0787, Alr, Alanine racemase [Cell envelope biogenesis, outer membrane]	NA|102aa|down_1|NC_015557.1_932675_932981_-	NA	NA|262aa|down_2|NC_015557.1_932973_933759_-	COG3494, COG3494, Uncharacterized protein conserved in bacteria [Function unknown]	NA|441aa|down_3|NC_015557.1_933774_935097_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|843aa|down_4|NC_015557.1_935103_937632_-	PRK05306, infB, translation initiation factor IF-2; Validated	NA|59aa|down_5|NC_015557.1_937778_937955_-	NA	NA|503aa|down_6|NC_015557.1_938085_939594_-	PRK09281, PRK09281, F0F1 ATP synthase subunit alpha; Validated	NA|183aa|down_7|NC_015557.1_939700_940249_-	COG0712, AtpH, F0F1-type ATP synthase, delta subunit (mitochondrial oligomycin sensitivity protein) [Energy production and conversion]	NA|161aa|down_8|NC_015557.1_940245_940728_-	COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production and conversion]	NA|143aa|down_9|NC_015557.1_940732_941161_-	cd06503, ATP-synt_Fo_b, F-type ATP synthase, membrane subunit b
GCF_000213785.1_ASM21378v1	NC_015557	Hydrogenobaculum sp. 3684, complete sequence	7	1023888-1024384	7,5,6	CRISPRCasFinder,CRT,PILER-CR	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	GTTTCAATCCCCTATAGGTACAAACAAAAC,GTTTCAATCCCCTATAGGTACAAACAAAAC,GTTTTGTTTGTACCTATAGGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	7,7,6	7	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|129aa|up_4|NC_015557.1_1018865_1019252_-,NA|200aa|up_2|NC_015557.1_1020911_1021511_-,NA|140aa|down_8|NC_015557.1_1031959_1032379_-	NA|530aa|up_9|NC_015557.1_1012016_1013606_-	COG1009, NuoL, NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit [Energy production and conversion / Inorganic ion transport and metabolism]	NA|234aa|up_8|NC_015557.1_1013948_1014650_-	PRK09362, PRK09362, phosphoribosylaminoimidazole-succinocarboxamide synthase; Reviewed	NA|194aa|up_7|NC_015557.1_1014646_1015228_-	pfam13174, TPR_6, Tetratricopeptide repeat	NA|94aa|up_6|NC_015557.1_1015214_1015496_-	pfam01649, Ribosomal_S20p, Ribosomal protein S20	NA|1088aa|up_5|NC_015557.1_1015605_1018869_-	cd18808, SF1_C_Upf1, C-terminal helicase domain of Upf1-like family helicases	NA|129aa|up_4|NC_015557.1_1018865_1019252_-	NA	NA|328aa|up_3|NC_015557.1_1019666_1020650_+	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|200aa|up_2|NC_015557.1_1020911_1021511_-	NA	NA|217aa|up_1|NC_015557.1_1021637_1022288_-	sd00010, SLR, Sel1-like repeat	NA|316aa|up_0|NC_015557.1_1022829_1023777_-	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|555aa|down_0|NC_015557.1_1024776_1026441_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|242aa|down_1|NC_015557.1_1026510_1027236_-	pfam00902, TatC, Sec-independent protein translocase protein (TatC)	NA|93aa|down_2|NC_015557.1_1027198_1027477_-	COG1826, TatA, Sec-independent protein secretion pathway components [Intracellular trafficking and secretion]	NA|273aa|down_3|NC_015557.1_1027467_1028286_-	pfam06750, DiS_P_DiS, Bacterial Peptidase A24 N-terminal domain	NA|501aa|down_4|NC_015557.1_1028282_1029785_-	PRK05812, secD, preprotein translocase subunit SecD; Reviewed	NA|201aa|down_5|NC_015557.1_1029787_1030390_-	TIGR00741, Probable_sigma54_modulation_protein_ORF3_ORF95	NA|252aa|down_6|NC_015557.1_1030393_1031149_-	TIGR01352, Protein_TonB, TonB family C-terminal domain	NA|258aa|down_7|NC_015557.1_1031184_1031958_-	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|140aa|down_8|NC_015557.1_1031959_1032379_-	NA	NA|408aa|down_9|NC_015557.1_1032375_1033599_-	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]
GCF_000213785.1_ASM21378v1	NC_015557	Hydrogenobaculum sp. 3684, complete sequence	8	1088763-1092095	7,8,6	PILER-CR,CRISPRCasFinder,CRT	no	cas7,cas8b2,cas3,cas4,cas1,cas2,TnpB_regular.1	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	GTTTCTAATTAACCGTGTGGAGTTGAAAG,GTTTCTAATTAACCGTGTGGAGTTGAAAG,GTTTCTAATTAACCGTGTGGAGTTGAAAG	29,29,29	1	1	1088792-1088830	NC_015557.1_578097-578135	NA:NA:NA	50,50,50	50	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|95aa|up_8|NC_015557.1_1079517_1079802_+,NA|64aa|up_0|NC_015557.1_1088336_1088528_-,NA|143aa|down_1|NC_015557.1_1092879_1093308_-,NA|322aa|down_2|NC_015557.1_1093289_1094255_-	NA|905aa|up_9|NC_015557.1_1076786_1079501_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|95aa|up_8|NC_015557.1_1079517_1079802_+	NA	NA|188aa|up_7|NC_015557.1_1079801_1080365_+	cd00564, TMP_TenI, Thiamine monophosphate synthase (TMP synthase)/TenI	cas7|329aa|up_6|NC_015557.1_1080410_1081397_+	TIGR01875, CRISPR-associated_protein_Cas7/Cst2/DevR, CRISPR-associated autoregulator DevR family	cas8b2|741aa|up_5|NC_015557.1_1081383_1083606_+	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas3|736aa|up_4|NC_015557.1_1083577_1085785_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas4|171aa|up_3|NC_015557.1_1085786_1086299_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|318aa|up_2|NC_015557.1_1086291_1087245_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|94aa|up_1|NC_015557.1_1087241_1087523_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|64aa|up_0|NC_015557.1_1088336_1088528_-	NA	NA|262aa|down_0|NC_015557.1_1092097_1092883_-	COG3031, PulC, Type II secretory pathway, component PulC [Intracellular trafficking and secretion]	NA|143aa|down_1|NC_015557.1_1092879_1093308_-	NA	NA|322aa|down_2|NC_015557.1_1093289_1094255_-	NA	NA|129aa|down_3|NC_015557.1_1094255_1094642_-	PRK13258, PRK13258, 7-cyano-7-deazaguanine reductase; Provisional	TnpB_regular.1|485aa|down_4|NC_015557.1_1094832_1096287_+	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|582aa|down_5|NC_015557.1_1096330_1098076_+	cd08500, PBP2_NikA_DppA_OppA_like_4, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|516aa|down_6|NC_015557.1_1098175_1099723_+	PRK00915, PRK00915, 2-isopropylmalate synthase; Validated	NA|274aa|down_7|NC_015557.1_1099719_1100541_-	PRK00258, aroE, shikimate 5-dehydrogenase; Reviewed	NA|360aa|down_8|NC_015557.1_1100537_1101617_-	PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional	NA|188aa|down_9|NC_015557.1_1101617_1102181_-	pfam09936, Methyltrn_RNA_4, SAM-dependent RNA methyltransferase
GCF_000213785.1_ASM21378v1	NC_015557	Hydrogenobaculum sp. 3684, complete sequence	9	1151706-1151812	9	CRISPRCasFinder	no	cas3	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	AGAAATGCCGTTGGCACACTAGAGCG	26	0	0	NA	NA	NA	1	1	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|115aa|up_5|NC_015557.1_1147392_1147737_-,NA|120aa|down_8|NC_015557.1_1159085_1159445_+,NA|94aa|down_9|NC_015557.1_1159428_1159710_+	NA|287aa|up_9|NC_015557.1_1142670_1143531_+	PRK00072, hemC, porphobilinogen deaminase; Reviewed	NA|348aa|up_8|NC_015557.1_1143539_1144583_+	cd07432, PHP_HisPPase, Polymerase and Histidinol Phosphatase domain of Histidinol phosphate phosphatase	NA|455aa|up_7|NC_015557.1_1144574_1145939_-	COG0084, TatD, Mg-dependent DNase [DNA replication, recombination, and repair]	NA|454aa|up_6|NC_015557.1_1145991_1147353_-	PRK06292, PRK06292, dihydrolipoamide dehydrogenase; Validated	NA|115aa|up_5|NC_015557.1_1147392_1147737_-	NA	NA|380aa|up_4|NC_015557.1_1147753_1148893_-	pfam13975, gag-asp_proteas, gag-polyprotein putative aspartyl protease	NA|259aa|up_3|NC_015557.1_1148913_1149690_-	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]	NA|311aa|up_2|NC_015557.1_1149686_1150619_-	PRK07259, PRK07259, dihydroorotate dehydrogenase	NA|115aa|up_1|NC_015557.1_1150912_1151257_-	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|110aa|up_0|NC_015557.1_1151253_1151583_-	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|566aa|down_0|NC_015557.1_1151854_1153552_-	PRK14667, uvrC, excinuclease ABC subunit C; Provisional	NA|358aa|down_1|NC_015557.1_1153578_1154652_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|223aa|down_2|NC_015557.1_1154632_1155301_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|263aa|down_3|NC_015557.1_1155297_1156086_-	PRK13111, trpA, tryptophan synthase subunit alpha; Provisional	NA|183aa|down_4|NC_015557.1_1156082_1156631_-	PRK00083, frr, ribosome recycling factor; Reviewed	NA|90aa|down_5|NC_015557.1_1156713_1156983_+	pfam17209, Hfq, Hfq protein	NA|240aa|down_6|NC_015557.1_1157022_1157742_+	pfam01255, Prenyltransf, Putative undecaprenyl diphosphate synthase	NA|426aa|down_7|NC_015557.1_1157845_1159123_+	PRK00077, eno, enolase; Provisional	NA|120aa|down_8|NC_015557.1_1159085_1159445_+	NA	NA|94aa|down_9|NC_015557.1_1159428_1159710_+	NA
GCF_000213785.1_ASM21378v1	NC_015557	Hydrogenobaculum sp. 3684, complete sequence	10	1215118-1215219	10	CRISPRCasFinder	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	GAAATGCGATTGCATCGCTCTAGT	24	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|120aa|up_7|NC_015557.1_1203617_1203977_+,NA|408aa|up_6|NC_015557.1_1204020_1205244_+,NA|69aa|down_3|NC_015557.1_1218946_1219153_+	NA|87aa|up_9|NC_015557.1_1201982_1202243_-	pfam01381, HTH_3, Helix-turn-helix	NA|396aa|up_8|NC_015557.1_1202253_1203441_-	pfam07804, HipA_C, HipA-like C-terminal domain	NA|120aa|up_7|NC_015557.1_1203617_1203977_+	NA	NA|408aa|up_6|NC_015557.1_1204020_1205244_+	NA	NA|165aa|up_5|NC_015557.1_1205329_1205824_-	COG3260, COG3260, Ni,Fe-hydrogenase III small subunit [Energy production and conversion]	NA|479aa|up_4|NC_015557.1_1205833_1207270_-	COG3261, HycE, Ni,Fe-hydrogenase III large subunit [Energy production and conversion]	NA|481aa|up_3|NC_015557.1_1207266_1208709_-	PRK06458, PRK06458, hydrogenase 4 subunit F; Validated	NA|223aa|up_2|NC_015557.1_1208705_1209374_-	COG4237, HyfE, Hydrogenase 4 membrane component (E) [Energy production and conversion]	NA|306aa|up_1|NC_015557.1_1209383_1210301_-	COG0650, HyfC, Formate hydrogenlyase subunit 4 [Energy production and conversion]	NA|622aa|up_0|NC_015557.1_1210293_1212159_-	PRK06521, PRK06521, hydrogenase 4 subunit B; Validated	NA|273aa|down_0|NC_015557.1_1215420_1216239_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|182aa|down_1|NC_015557.1_1216235_1216781_-	PRK05456, PRK05456, ATP-dependent protease subunit HslV	NA|655aa|down_2|NC_015557.1_1216771_1218736_-	COG0021, TktA, Transketolase [Carbohydrate transport and metabolism]	NA|69aa|down_3|NC_015557.1_1218946_1219153_+	NA	NA|100aa|down_4|NC_015557.1_1219376_1219676_+	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|308aa|down_5|NC_015557.1_1219662_1220586_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|279aa|down_6|NC_015557.1_1220597_1221434_-	pfam04454, Linocin_M18, Encapsulating protein for peroxidase	NA|515aa|down_7|NC_015557.1_1221487_1223032_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	NA|175aa|down_8|NC_015557.1_1225181_1225706_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|354aa|down_9|NC_015557.1_1225740_1226802_-	pfam07680, DoxA, TQO small subunit DoxA
