assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003945285.1_ASM394528v1	AP019005	Hydrogenimonas sp. MAG DNA, complete genome	1	858099-858237	1	CRISPRCasFinder	no		DEDDh,cas4,cas9,cas1,cas2,csa3,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csx1,csx20,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GGGTGCAGGGGACGGTCAGTCCCCGCCGGAAGATGGGCTTTGCCC	45	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas4,cas9,cas1,cas2,csa3,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csx1,csx20,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA|145aa|down_3|AP019005.1_862439_862874_-,NA|103aa|down_6|AP019005.1_864374_864683_+	NA|305aa|up_9|AP019005.1_849915_850830_-	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|302aa|up_8|AP019005.1_850916_851822_+	PRK15068, PRK15068, tRNA 5-methoxyuridine(34)/uridine 5-oxyacetic acid(34) synthase CmoB	NA|159aa|up_7|AP019005.1_851830_852307_+	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis, transport, and catabolism]	NA|199aa|up_6|AP019005.1_852320_852917_-	cd06262, metallo-hydrolase-like_MBL-fold, mainly hydrolytic enzymes and related proteins which carry out various biological functions; MBL-fold metallohydrolase domain	NA|276aa|up_5|AP019005.1_852986_853814_+	pfam04305, DUF455, Protein of unknown function (DUF455)	NA|379aa|up_4|AP019005.1_854035_855172_+	COG0399, WecE, Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis [Cell envelope biogenesis, outer membrane]	NA|309aa|up_3|AP019005.1_855171_856098_+	PRK00652, lpxK, tetraacyldisaccharide 4'-kinase; Reviewed	NA|284aa|up_2|AP019005.1_856422_857274_+	PRK00942, PRK00942, acetylglutamate kinase; Provisional	NA|121aa|up_1|AP019005.1_857301_857664_+	COG1359, COG1359, Uncharacterized conserved protein [Function unknown]	NA|114aa|up_0|AP019005.1_857632_857974_+	pfam03091, CutA1, CutA1 divalent ion tolerance protein	NA|489aa|down_0|AP019005.1_858275_859742_+	PRK09225, PRK09225, threonine synthase; Validated	NA|410aa|down_1|AP019005.1_859741_860971_+	TIGR00275, TIGR00275, flavoprotein, HI0933 family	NA|435aa|down_2|AP019005.1_860997_862302_-	COG0520, csdA, Selenocysteine lyase/Cysteine desulfurase [Posttranslational modification, protein turnover, chaperones]	NA|145aa|down_3|AP019005.1_862439_862874_-	NA	NA|106aa|down_4|AP019005.1_863016_863334_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|273aa|down_5|AP019005.1_863565_864384_+	COG0846, SIR2, NAD-dependent protein deacetylases, SIR2 family [Transcription]	NA|103aa|down_6|AP019005.1_864374_864683_+	NA	NA|650aa|down_7|AP019005.1_864683_866633_-	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|416aa|down_8|AP019005.1_866925_868173_+	cd07343, M48A_Zmpste24p_like, Peptidase M48 subfamily A, a type 1 CaaX endopeptidase	NA|278aa|down_9|AP019005.1_868169_869003_+	PRK09328, PRK09328, N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional
GCA_003945285.1_ASM394528v1	AP019005	Hydrogenimonas sp. MAG DNA, complete genome	2	1605480-1606242	1,1,2,2	CRT,PILER-CR,CRISPRCasFinder,PILER-CR	no	cas9,cas1,cas2,csa3	DEDDh,cas4,cas9,cas1,cas2,csa3,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csx1,csx20,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type II-A,Type II-B,Type II-C	TGTTTTAAGACCCCTCAAAACCCTGACCTGTTACAAT,GTTTTAAGACCCCTCAAAACCCTGACCTGTTACAAT,GTTTTAAGACCCCTCAAAACCCTGACCTGTTACAAT,TGTTTTAAGACCCCTCAAAACCCTGACCTGTTACAAT	37,36,36,37	0	0	NA	NA	NA:NA:NA:NA	11,8,11,8	11	TypeII-A,TypeII-B,TypeII-C	DEDDh,cas4,cas9,cas1,cas2,csa3,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csx1,csx20,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|230aa|up_6|AP019005.1_1598032_1598722_+,NA|48aa|up_4|AP019005.1_1599694_1599838_+,NA|46aa|up_3|AP019005.1_1600097_1600235_-,NA|44aa|down_2|AP019005.1_1609002_1609134_+,NA|54aa|down_5|AP019005.1_1610531_1610693_+,NA|62aa|down_6|AP019005.1_1610843_1611029_+,NA|70aa|down_9|AP019005.1_1614517_1614727_-	NA|85aa|up_9|AP019005.1_1594027_1594282_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|397aa|up_8|AP019005.1_1594486_1595677_+	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|653aa|up_7|AP019005.1_1596069_1598028_+	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|230aa|up_6|AP019005.1_1598032_1598722_+	NA	NA|228aa|up_5|AP019005.1_1598763_1599447_+	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	NA|48aa|up_4|AP019005.1_1599694_1599838_+	NA	NA|46aa|up_3|AP019005.1_1600097_1600235_-	NA	cas9|1130aa|up_2|AP019005.1_1600712_1604102_+	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	cas1|305aa|up_1|AP019005.1_1604070_1604985_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas2|100aa|up_0|AP019005.1_1605004_1605304_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|629aa|down_0|AP019005.1_1606305_1608192_-	pfam16448, LapD_MoxY_N, LapD/MoxY periplasmic domain	NA|210aa|down_1|AP019005.1_1608188_1608818_-	COG3672, COG3672, Predicted transglutaminase-like cysteine proteinase [General function prediction only]	NA|44aa|down_2|AP019005.1_1609002_1609134_+	NA	NA|242aa|down_3|AP019005.1_1609140_1609866_-	PRK00481, PRK00481, NAD-dependent deacetylase; Provisional	NA|140aa|down_4|AP019005.1_1609888_1610308_-	COG3399, COG3399, Uncharacterized protein conserved in bacteria [Function unknown]	NA|54aa|down_5|AP019005.1_1610531_1610693_+	NA	NA|62aa|down_6|AP019005.1_1610843_1611029_+	NA	NA|691aa|down_7|AP019005.1_1611124_1613197_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|445aa|down_8|AP019005.1_1613190_1614525_+	pfam03929, PepSY_TM, PepSY-associated TM region	NA|70aa|down_9|AP019005.1_1614517_1614727_-	NA
GCA_003945285.1_ASM394528v1	AP019005	Hydrogenimonas sp. MAG DNA, complete genome	3	1932552-1933279	3,2,3	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	DEDDh,cas4,cas9,cas1,cas2,csa3,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csx1,csx20,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type I-E	CGGTTCATCCCCACGTCCGTGGGGAACAC,CGGTTCATCCCCACGTCCGTGGGGAACAC,GGTTCATCCCCACGTCCGTGGGGAACAC	29,29,28	0	0	NA	NA	I-E,II-B:I-E,II-B:I-E,II-B	11,11,10	11	TypeI-E	DEDDh,cas4,cas9,cas1,cas2,csa3,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csx1,csx20,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|49aa|up_5|AP019005.1_1924537_1924684_+,NA	NA|1031aa|up_9|AP019005.1_1918069_1921162_-	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|284aa|up_8|AP019005.1_1921228_1922080_+	COG2996, COG2996, Predicted RNA-bindining protein (contains S1 and HTH domains) [General function prediction only]	NA|271aa|up_7|AP019005.1_1922076_1922889_+	cd12110, PHP_HisPPase_Hisj_like, Polymerase and Histidinol Phosphatase domain of Histidinol phosphate phosphatase of Hisj like	NA|477aa|up_6|AP019005.1_1923000_1924431_+	TIGR00653, Glutamine_synthetase, glutamine synthetase, type I	NA|49aa|up_5|AP019005.1_1924537_1924684_+	NA	NA|222aa|up_4|AP019005.1_1924916_1925582_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|419aa|up_3|AP019005.1_1925578_1926835_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|604aa|up_2|AP019005.1_1926866_1928678_-	PRK05218, PRK05218, heat shock protein 90; Provisional	NA|398aa|up_1|AP019005.1_1928902_1930096_-	COG1835, COG1835, Predicted acyltransferases [Lipid metabolism]	NA|463aa|up_0|AP019005.1_1930092_1931481_-	cd09913, EHD, Eps15 homology domain (EHD), C-terminal domain	cas2|61aa|down_0|AP019005.1_1933339_1933522_-	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	NA|277aa|down_1|AP019005.1_1933615_1934446_-	pfam13683, rve_3, Integrase core domain	NA|88aa|down_2|AP019005.1_1934439_1934703_-	pfam01527, HTH_Tnp_1, Transposase	NA|42aa|down_3|AP019005.1_1934725_1934851_-	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	cas1|305aa|down_4|AP019005.1_1934831_1935746_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|168aa|down_5|AP019005.1_1935745_1936249_-	cd09727, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas5|225aa|down_6|AP019005.1_1936325_1937000_-	pfam09704, Cas_Cas5d, CRISPR-associated protein (Cas_Cas5)	cas7|375aa|down_7|AP019005.1_1936999_1938124_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|159aa|down_8|AP019005.1_1938127_1938604_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|494aa|down_9|AP019005.1_1938600_1940082_-	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1
GCA_003945285.1_ASM394528v1	AP019005	Hydrogenimonas sp. MAG DNA, complete genome	5	2079054-2081115	5,3,4,5	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	WYL,csx1,csx20,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	DEDDh,cas4,cas9,cas1,cas2,csa3,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csx1,csx20,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type III-D,Type III-A,Type III-C,Type III-B	GTCTCAATCCCCTCAAAATCGGGTCTGATTGTAAT,GTCTCAATCCCCTCAAAATCGGGTCTGATTGTAAT,GTCTCAATCCCCTCAAAATCGGGTCTGATTGTAAT,GTCTCAATCCCCTCAAAATCGGGTCTGATTGTAAT	35,35,35,35	0	0	NA	NA	NA:NA:NA:NA	29,29,26,26	29	TypeIII-D,TypeIII-A,TypeIII-C,TypeIII-B	DEDDh,cas4,cas9,cas1,cas2,csa3,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csx1,csx20,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|83aa|up_9|AP019005.1_2072593_2072842_+,NA|84aa|up_6|AP019005.1_2074666_2074918_+,NA|134aa|up_5|AP019005.1_2074928_2075330_+,NA|116aa|up_4|AP019005.1_2075331_2075679_-,csx20|139aa|up_1|AP019005.1_2078079_2078496_-,NA|131aa|up_0|AP019005.1_2078492_2078885_-,NA|80aa|down_1|AP019005.1_2081535_2081775_-	NA|83aa|up_9|AP019005.1_2072593_2072842_+	NA	NA|379aa|up_8|AP019005.1_2072974_2074111_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|196aa|up_7|AP019005.1_2074107_2074695_+	PRK11071, PRK11071, esterase YqiA; Provisional	NA|84aa|up_6|AP019005.1_2074666_2074918_+	NA	NA|134aa|up_5|AP019005.1_2074928_2075330_+	NA	NA|116aa|up_4|AP019005.1_2075331_2075679_-	NA	WYL|336aa|up_3|AP019005.1_2075874_2076882_-	pfam13280, WYL, WYL domain	csx1|401aa|up_2|AP019005.1_2076884_2078087_-	TIGR02221, CRISPR-associated_protein_Csx1_2, CRISPR-associated protein, TM1812 family	csx20|139aa|up_1|AP019005.1_2078079_2078496_-	NA	NA|131aa|up_0|AP019005.1_2078492_2078885_-	NA	cas2|93aa|down_0|AP019005.1_2081260_2081539_-	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|80aa|down_1|AP019005.1_2081535_2081775_-	NA	cas1|289aa|down_2|AP019005.1_2081771_2082638_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|91aa|down_3|AP019005.1_2082634_2082907_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	csm5gr7|432aa|down_4|AP019005.1_2083058_2084354_-	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm4gr5|297aa|down_5|AP019005.1_2084350_2085241_-	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm3gr7|211aa|down_6|AP019005.1_2085244_2085877_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|132aa|down_7|AP019005.1_2085887_2086283_-	cd09647, Csm2_III-A, CRISPR/Cas system-associated protein Csm2	cas10|779aa|down_8|AP019005.1_2086279_2088616_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|281aa|down_9|AP019005.1_2088602_2089445_-	cd09760, Cas6_III, CRISPR/Cas system-associated RAMP superfamily protein Cas6
