assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000020785.1_ASM2078v1	NC_011126	Hydrogenobaculum sp. Y04AAS1, complete genome	1	201703-201815	1	CRISPRCasFinder	no		Cas9_archaeal,cas14k,cas3,DEDDh,DinG,csa3,Cas14b_CAS-V-F	Orphan	TAGTGTGCGTAGCATTTCTTACACTACCAGTGCCT	35	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,cas14k,cas3,DEDDh,DinG,csa3,Cas14b_CAS-V-F	NA|51aa|up_2|NC_011126.1_199660_199813_+,NA|76aa|down_3|NC_011126.1_204753_204981_-,NA|195aa|down_4|NC_011126.1_205559_206144_-,NA|65aa|down_5|NC_011126.1_206155_206350_-,NA|193aa|down_6|NC_011126.1_206396_206975_-	NA|375aa|up_9|NC_011126.1_191806_192931_+	COG0772, FtsW, Bacterial cell division membrane protein [Cell division and chromosome partitioning]	NA|356aa|up_8|NC_011126.1_192912_193980_+	PRK00726, murG, undecaprenyldiphospho-muramoylpentapeptide beta-N- acetylglucosaminyltransferase; Provisional	NA|363aa|up_7|NC_011126.1_193976_195065_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|155aa|up_6|NC_011126.1_195106_195571_+	COG1905, NuoE, NADH:ubiquinone oxidoreductase 24 kD subunit [Energy production and conversion]	NA|426aa|up_5|NC_011126.1_195545_196823_+	COG1894, NuoF, NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [Energy production and conversion]	NA|555aa|up_4|NC_011126.1_196838_198503_+	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|377aa|up_3|NC_011126.1_198513_199644_+	PRK00112, tgt, queuine tRNA-ribosyltransferase; Provisional	NA|51aa|up_2|NC_011126.1_199660_199813_+	NA	NA|218aa|up_1|NC_011126.1_199849_200503_-	TIGR01093, 3-dehydroquinate_dehydratase, 3-dehydroquinate dehydratase, type I	NA|386aa|up_0|NC_011126.1_200502_201660_-	PRK05382, PRK05382, chorismate synthase; Validated	NA|322aa|down_0|NC_011126.1_201963_202929_-	PRK15285, PRK15285, fimbrial chaperone StfD	NA|170aa|down_1|NC_011126.1_203361_203871_-	cd04645, LbH_gamma_CA_like, Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E	NA|139aa|down_2|NC_011126.1_204250_204667_-	cd00851, MTH1175, This uncharacterized conserved protein belongs to a family of iron-molybdenum cluster-binding proteins that includes NifX, NifB, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme	NA|76aa|down_3|NC_011126.1_204753_204981_-	NA	NA|195aa|down_4|NC_011126.1_205559_206144_-	NA	NA|65aa|down_5|NC_011126.1_206155_206350_-	NA	NA|193aa|down_6|NC_011126.1_206396_206975_-	NA	NA|311aa|down_7|NC_011126.1_207146_208079_+	TIGR02197, heptose_epim, ADP-L-glycero-D-manno-heptose-6-epimerase	NA|322aa|down_8|NC_011126.1_208075_209041_+	TIGR01138, Cysteine_synthase_B, cysteine synthase B	NA|172aa|down_9|NC_011126.1_209042_209558_+	pfam02620, DUF177, Uncharacterized ACR, COG1399
GCF_000020785.1_ASM2078v1	NC_011126	Hydrogenobaculum sp. Y04AAS1, complete genome	2	205259-205359	2	CRISPRCasFinder	no		Cas9_archaeal,cas14k,cas3,DEDDh,DinG,csa3,Cas14b_CAS-V-F	Orphan	TAAAATGCGATTGTATCGCTCTA	23	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,cas14k,cas3,DEDDh,DinG,csa3,Cas14b_CAS-V-F	NA|51aa|up_7|NC_011126.1_199660_199813_+,NA|58aa|up_4|NC_011126.1_201680_201854_+,NA|76aa|up_0|NC_011126.1_204753_204981_-,NA|195aa|down_0|NC_011126.1_205559_206144_-,NA|65aa|down_1|NC_011126.1_206155_206350_-,NA|193aa|down_2|NC_011126.1_206396_206975_-,NA|138aa|down_9|NC_011126.1_211664_212078_+	NA|555aa|up_9|NC_011126.1_196838_198503_+	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|377aa|up_8|NC_011126.1_198513_199644_+	PRK00112, tgt, queuine tRNA-ribosyltransferase; Provisional	NA|51aa|up_7|NC_011126.1_199660_199813_+	NA	NA|218aa|up_6|NC_011126.1_199849_200503_-	TIGR01093, 3-dehydroquinate_dehydratase, 3-dehydroquinate dehydratase, type I	NA|386aa|up_5|NC_011126.1_200502_201660_-	PRK05382, PRK05382, chorismate synthase; Validated	NA|58aa|up_4|NC_011126.1_201680_201854_+	NA	NA|322aa|up_3|NC_011126.1_201963_202929_-	PRK15285, PRK15285, fimbrial chaperone StfD	NA|170aa|up_2|NC_011126.1_203361_203871_-	cd04645, LbH_gamma_CA_like, Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E	NA|139aa|up_1|NC_011126.1_204250_204667_-	cd00851, MTH1175, This uncharacterized conserved protein belongs to a family of iron-molybdenum cluster-binding proteins that includes NifX, NifB, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme	NA|76aa|up_0|NC_011126.1_204753_204981_-	NA	NA|195aa|down_0|NC_011126.1_205559_206144_-	NA	NA|65aa|down_1|NC_011126.1_206155_206350_-	NA	NA|193aa|down_2|NC_011126.1_206396_206975_-	NA	NA|311aa|down_3|NC_011126.1_207146_208079_+	TIGR02197, heptose_epim, ADP-L-glycero-D-manno-heptose-6-epimerase	NA|322aa|down_4|NC_011126.1_208075_209041_+	TIGR01138, Cysteine_synthase_B, cysteine synthase B	NA|172aa|down_5|NC_011126.1_209042_209558_+	pfam02620, DUF177, Uncharacterized ACR, COG1399	NA|61aa|down_6|NC_011126.1_209538_209721_+	PRK12286, rpmF, 50S ribosomal protein L32; Reviewed	NA|340aa|down_7|NC_011126.1_209732_210752_+	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|304aa|down_8|NC_011126.1_210751_211663_+	PRK09352, PRK09352, beta-ketoacyl-ACP synthase 3	NA|138aa|down_9|NC_011126.1_211664_212078_+	NA
GCF_000020785.1_ASM2078v1	NC_011126	Hydrogenobaculum sp. Y04AAS1, complete genome	3	709507-709601	3	CRISPRCasFinder	no		Cas9_archaeal,cas14k,cas3,DEDDh,DinG,csa3,Cas14b_CAS-V-F	Orphan	TGGAGAACGACTAAAGCACACTT	23	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,cas14k,cas3,DEDDh,DinG,csa3,Cas14b_CAS-V-F	NA|92aa|up_4|NC_011126.1_702548_702824_-,NA	NA|217aa|up_9|NC_011126.1_697906_698557_+	PRK00300, gmk, guanylate kinase; Provisional	NA|76aa|up_8|NC_011126.1_698531_698759_+	PRK00392, rpoZ, DNA-directed RNA polymerase subunit omega; Reviewed	NA|416aa|up_7|NC_011126.1_698739_699987_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|452aa|up_6|NC_011126.1_699983_701339_+	COG1236, YSH1, Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis]	NA|365aa|up_5|NC_011126.1_701469_702564_+	COG5542, COG5542, Predicted integral membrane protein [Function unknown]	NA|92aa|up_4|NC_011126.1_702548_702824_-	NA	NA|962aa|up_3|NC_011126.1_702865_705751_-	TIGR00618, Nuclease_SbcCD_subunit_C, exonuclease SbcC	NA|118aa|up_2|NC_011126.1_705747_706101_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|126aa|up_1|NC_011126.1_706084_706462_-	pfam02261, Asp_decarbox, Aspartate decarboxylase	NA|938aa|up_0|NC_011126.1_706528_709342_-	PRK05743, ileS, isoleucyl-tRNA synthetase; Reviewed	NA|406aa|down_0|NC_011126.1_709679_710897_+	pfam13524, Glyco_trans_1_2, Glycosyl transferases group 1	NA|316aa|down_1|NC_011126.1_710922_711870_+	pfam02606, LpxK, Tetraacyldisaccharide-1-P 4'-kinase	NA|250aa|down_2|NC_011126.1_711856_712606_+	PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional	NA|476aa|down_3|NC_011126.1_712626_714054_+	PRK08591, PRK08591, acetyl-CoA carboxylase biotin carboxylase subunit; Validated	NA|248aa|down_4|NC_011126.1_714050_714794_+	COG0345, ProC, Pyrroline-5-carboxylate reductase [Amino acid transport and metabolism]	NA|515aa|down_5|NC_011126.1_714790_716335_+	COG0497, RecN, ATPase involved in DNA repair [DNA replication, recombination, and repair]	NA|335aa|down_6|NC_011126.1_716368_717373_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|1016aa|down_7|NC_011126.1_717369_720417_+	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|288aa|down_8|NC_011126.1_720467_721331_+	COG2264, PrmA, Ribosomal protein L11 methylase [Translation, ribosomal structure and biogenesis]	NA|825aa|down_9|NC_011126.1_721331_723806_+	cd05914, LC_FACL_like, Uncharacterized subfamily of fatty acid CoA ligase (FACL)
GCF_000020785.1_ASM2078v1	NC_011126	Hydrogenobaculum sp. Y04AAS1, complete genome	4	1037710-1039665	1,4,1	PILER-CR,CRISPRCasFinder,CRT	no		Cas9_archaeal,cas14k,cas3,DEDDh,DinG,csa3,Cas14b_CAS-V-F	Orphan	GTTTTGTTTGTACCTATAGGGGATTGAAAC,GTTTTGTTTGTACCTATAGGGGATTGAAAC,GTTTTGTTTGTACCTATAGGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	28,29,29	29	Orphan	Cas9_archaeal,cas14k,cas3,DEDDh,DinG,csa3,Cas14b_CAS-V-F	NA|140aa|up_8|NC_011126.1_1029695_1030115_+,NA|159aa|down_2|NC_011126.1_1042162_1042639_+,NA|196aa|down_4|NC_011126.1_1043829_1044417_+	NA|408aa|up_9|NC_011126.1_1028475_1029699_+	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]	NA|140aa|up_8|NC_011126.1_1029695_1030115_+	NA	NA|258aa|up_7|NC_011126.1_1030116_1030890_+	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|252aa|up_6|NC_011126.1_1030925_1031681_+	TIGR01352, Protein_TonB, TonB family C-terminal domain	NA|201aa|up_5|NC_011126.1_1031684_1032287_+	TIGR00741, Probable_sigma54_modulation_protein_ORF3_ORF95	NA|501aa|up_4|NC_011126.1_1032289_1033792_+	PRK05812, secD, preprotein translocase subunit SecD; Reviewed	NA|279aa|up_3|NC_011126.1_1033788_1034625_+	pfam06750, DiS_P_DiS, Bacterial Peptidase A24 N-terminal domain	NA|93aa|up_2|NC_011126.1_1034615_1034894_+	PRK14858, tatA, twin arginine translocase protein A; Provisional	NA|242aa|up_1|NC_011126.1_1034856_1035582_+	pfam00902, TatC, Sec-independent protein translocase protein (TatC)	NA|555aa|up_0|NC_011126.1_1035652_1037317_+	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|315aa|down_0|NC_011126.1_1039775_1040720_+	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|223aa|down_1|NC_011126.1_1041345_1042014_+	sd00010, SLR, Sel1-like repeat	NA|159aa|down_2|NC_011126.1_1042162_1042639_+	NA	NA|344aa|down_3|NC_011126.1_1042685_1043717_+	COG0790, COG0790, FOG: TPR repeat, SEL1 subfamily [General function prediction only]	NA|196aa|down_4|NC_011126.1_1043829_1044417_+	NA	NA|316aa|down_5|NC_011126.1_1044690_1045638_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|94aa|down_6|NC_011126.1_1045936_1046218_+	pfam01649, Ribosomal_S20p, Ribosomal protein S20	NA|194aa|down_7|NC_011126.1_1046204_1046786_+	TIGR02795, Uncharacterized_protein_in_oprL_3'region, tol-pal system protein YbgF	NA|234aa|down_8|NC_011126.1_1046782_1047484_+	PRK09362, PRK09362, phosphoribosylaminoimidazole-succinocarboxamide synthase; Reviewed	NA|530aa|down_9|NC_011126.1_1047831_1049421_+	COG1009, NuoL, NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit [Energy production and conversion / Inorganic ion transport and metabolism]
GCF_000020785.1_ASM2078v1	NC_011126	Hydrogenobaculum sp. Y04AAS1, complete genome	5	1053443-1053545	5	CRISPRCasFinder	no		Cas9_archaeal,cas14k,cas3,DEDDh,DinG,csa3,Cas14b_CAS-V-F	Orphan	AAATGCGATGTTATCGCTCTATAGT	25	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,cas14k,cas3,DEDDh,DinG,csa3,Cas14b_CAS-V-F	NA|67aa|up_4|NC_011126.1_1049650_1049851_-,NA	NA|316aa|up_9|NC_011126.1_1044690_1045638_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|94aa|up_8|NC_011126.1_1045936_1046218_+	pfam01649, Ribosomal_S20p, Ribosomal protein S20	NA|194aa|up_7|NC_011126.1_1046204_1046786_+	TIGR02795, Uncharacterized_protein_in_oprL_3'region, tol-pal system protein YbgF	NA|234aa|up_6|NC_011126.1_1046782_1047484_+	PRK09362, PRK09362, phosphoribosylaminoimidazole-succinocarboxamide synthase; Reviewed	NA|530aa|up_5|NC_011126.1_1047831_1049421_+	COG1009, NuoL, NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit [Energy production and conversion / Inorganic ion transport and metabolism]	NA|67aa|up_4|NC_011126.1_1049650_1049851_-	NA	NA|415aa|up_3|NC_011126.1_1049938_1051183_-	PRK01117, PRK01117, adenylosuccinate synthetase; Provisional	NA|112aa|up_2|NC_011126.1_1051443_1051779_-	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|350aa|up_1|NC_011126.1_1051775_1052825_-	TIGR01141, Histidinol-phosphate_aminotransferase, histidinol-phosphate aminotransferase	NA|158aa|up_0|NC_011126.1_1052846_1053320_+	pfam00731, AIRC, AIR carboxylase	NA|124aa|down_0|NC_011126.1_1053604_1053976_+	pfam14213, DUF4325, STAS-like domain of unknown function (DUF4325)	NA|227aa|down_1|NC_011126.1_1054817_1055498_-	TIGR02227, Inactive_signal_peptidase_IA	NA|406aa|down_2|NC_011126.1_1055565_1056783_+	cd00887, MoeA, MoeA family	NA|69aa|down_3|NC_011126.1_1056805_1057012_+	cd00565, Ubl_ThiS, ubiquitin-like (Ubl) domain found in sulfur carrier protein ThiS	NA|264aa|down_4|NC_011126.1_1057012_1057804_+	PRK00208, thiG, thiazole synthase; Reviewed	NA|442aa|down_5|NC_011126.1_1057800_1059126_+	PLN02576, PLN02576, protoporphyrinogen oxidase	NA|209aa|down_6|NC_011126.1_1059112_1059739_+	PRK08317, PRK08317, hypothetical protein; Provisional	NA|241aa|down_7|NC_011126.1_1059747_1060470_+	cd07363, 45_DOPA_Dioxygenase, The Class III extradiol dioxygenase, 4,5-DOPA Dioxygenase, catalyzes the incorporation of both atoms of molecular oxygen into 4,5-dihydroxy-phenylalanine	NA|105aa|down_8|NC_011126.1_1060502_1060817_+	cd05403, NT_KNTase_like, Nucleotidyltransferase (NT) domain of Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins	NA|309aa|down_9|NC_011126.1_1060854_1061781_+	COG4105, ComL, DNA uptake lipoprotein [General function prediction only]
GCF_000020785.1_ASM2078v1	NC_011126	Hydrogenobaculum sp. Y04AAS1, complete genome	6	1221928-1222029	6	CRISPRCasFinder	no		Cas9_archaeal,cas14k,cas3,DEDDh,DinG,csa3,Cas14b_CAS-V-F	Orphan	AGAAATGCTAAAGCACACTAGAGCG	25	1	1	1221953-1222004	NC_011126.1_1222128-1222179	NA	1	1	Orphan	Cas9_archaeal,cas14k,cas3,DEDDh,DinG,csa3,Cas14b_CAS-V-F	NA|69aa|up_0|NC_011126.1_1221613_1221820_+,NA|471aa|down_5|NC_011126.1_1229796_1231209_+,NA|179aa|down_9|NC_011126.1_1234478_1235015_+	NA|479aa|up_9|NC_011126.1_1208535_1209972_-	COG3261, HycE, Ni,Fe-hydrogenase III large subunit [Energy production and conversion]	NA|481aa|up_8|NC_011126.1_1209968_1211411_-	PRK06458, PRK06458, hydrogenase 4 subunit F; Validated	NA|223aa|up_7|NC_011126.1_1211407_1212076_-	COG4237, HyfE, Hydrogenase 4 membrane component (E) [Energy production and conversion]	NA|306aa|up_6|NC_011126.1_1212085_1213003_-	COG0650, HyfC, Formate hydrogenlyase subunit 4 [Energy production and conversion]	NA|634aa|up_5|NC_011126.1_1212995_1214897_-	PRK06521, PRK06521, hydrogenase 4 subunit B; Validated	NA|1006aa|up_4|NC_011126.1_1215032_1218050_-	PRK07207, PRK07207, ribonucleoside-diphosphate reductase subunit alpha	NA|273aa|up_3|NC_011126.1_1218120_1218939_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|182aa|up_2|NC_011126.1_1218935_1219481_-	PRK05456, PRK05456, ATP-dependent protease subunit HslV	NA|655aa|up_1|NC_011126.1_1219471_1221436_-	COG0021, TktA, Transketolase [Carbohydrate transport and metabolism]	NA|69aa|up_0|NC_011126.1_1221613_1221820_+	NA	NA|98aa|down_0|NC_011126.1_1222265_1222559_+	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|308aa|down_1|NC_011126.1_1222545_1223469_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|279aa|down_2|NC_011126.1_1223480_1224317_-	pfam04454, Linocin_M18, Encapsulating protein for peroxidase	NA|515aa|down_3|NC_011126.1_1224371_1225916_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	NA|435aa|down_4|NC_011126.1_1228302_1229607_+	pfam13185, GAF_2, GAF domain	NA|471aa|down_5|NC_011126.1_1229796_1231209_+	NA	NA|211aa|down_6|NC_011126.1_1231210_1231843_+	cd02137, MhqN-like, nitroreductase family protein similar to the NAD(P)H nitroreductase MhqN	NA|442aa|down_7|NC_011126.1_1231858_1233184_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|354aa|down_8|NC_011126.1_1233267_1234329_-	pfam07680, DoxA, TQO small subunit DoxA	NA|179aa|down_9|NC_011126.1_1234478_1235015_+	NA
