assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000020325.1_ASM2032v1	NC_010730	Sulfurihydrogenibium sp. YO3AOP1, complete genome	1	26281-27614	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no		Cas14b_CAS-V-F,c2c9_V-U4,csa3,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,cas14j,DEDDh,cas14k	Orphan	CTTTATAACCCACACGGTTCAGATGTAAC,CTTTATAACCCACACGGTTCAGATGTAAC,CTTTATAACCCACACGGTTCAGATGTAAC	29,29,29	0	0	NA	NA	NA:NA:NA	20,20,16	20	Orphan	Cas14b_CAS-V-F,c2c9_V-U4,csa3,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,cas14j,DEDDh,cas14k	NA|178aa|up_9|NC_010730.1_17177_17711_-,NA|175aa|up_1|NC_010730.1_24051_24576_-,NA	NA|178aa|up_9|NC_010730.1_17177_17711_-	NA	NA|318aa|up_8|NC_010730.1_17707_18661_-	cd12836, HpCorA-like, Mg2+ transporter Helicobacter pylori CorA-like subgroup	NA|203aa|up_7|NC_010730.1_18669_19278_-	COG1354, scpA, Rec8/ScpA/Scc1-like protein (kleisin family) [Replication,    recombination, and repair]	NA|326aa|up_6|NC_010730.1_19255_20233_-	pfam02606, LpxK, Tetraacyldisaccharide-1-P 4'-kinase	NA|327aa|up_5|NC_010730.1_20222_21203_-	PRK02126, PRK02126, ribonuclease Z; Provisional	NA|221aa|up_4|NC_010730.1_21206_21869_-	pfam09568, RE_MjaI, MjaI restriction endonuclease	NA|413aa|up_3|NC_010730.1_21861_23100_-	pfam01555, N6_N4_Mtase, DNA methylase	NA|324aa|up_2|NC_010730.1_23092_24064_-	PRK09293, PRK09293, class 1 fructose-bisphosphatase	NA|175aa|up_1|NC_010730.1_24051_24576_-	NA	NA|473aa|up_0|NC_010730.1_24576_25995_-	pfam01116, F_bP_aldolase, Fructose-bisphosphate aldolase class-II	NA|433aa|down_0|NC_010730.1_28268_29567_-	cd01298, ATZ_TRZ_like, TRZ/ATZ family contains enzymes from the atrazine degradation pathway and related hydrolases	NA|516aa|down_1|NC_010730.1_29566_31114_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	NA|753aa|down_2|NC_010730.1_31118_33377_-	PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase subunit PurL	NA|157aa|down_3|NC_010730.1_33525_33996_+	PRK05988, PRK05988, formate dehydrogenase subunit gamma; Validated	NA|524aa|down_4|NC_010730.1_33979_35551_+	COG1894, NuoF, NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [Energy production and conversion]	NA|1001aa|down_5|NC_010730.1_35560_38563_+	COG3383, COG3383, Uncharacterized anaerobic dehydrogenase [General function prediction only]	NA|294aa|down_6|NC_010730.1_39140_40022_-	PRK03170, PRK03170, dihydrodipicolinate synthase; Provisional	NA|132aa|down_7|NC_010730.1_40094_40490_+	pfam02367, TsaE, Threonylcarbamoyl adenosine biosynthesis protein TsaE	NA|176aa|down_8|NC_010730.1_41089_41617_+	sd00006, TPR, Tetratricopeptide repeat	NA|82aa|down_9|NC_010730.1_41701_41947_+	pfam13424, TPR_12, Tetratricopeptide repeat
GCF_000020325.1_ASM2032v1	NC_010730	Sulfurihydrogenibium sp. YO3AOP1, complete genome	2	578053-578190	2,2	CRT,CRISPRCasFinder	no		Cas14b_CAS-V-F,c2c9_V-U4,csa3,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,cas14j,DEDDh,cas14k	Orphan	TTGTTGTTCAGCTGGTGCTGG,GTTGTTCAGCTGGTGCTGGAGCAGCTTGTTGAGCTGGTTGTT	21,42	2	4	578113-578130|578113-578130|578152-578169|578152-578169	NC_010730.1_312113-312130|NC_010730.1_141156-141173|NC_010730.1_578179-578196|NC_010730.1_141456-141473	NA:NA	3,1	3	Orphan	Cas14b_CAS-V-F,c2c9_V-U4,csa3,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,cas14j,DEDDh,cas14k	NA,NA	NA|82aa|up_9|NC_010730.1_567903_568149_-	pfam03776, MinE, Septum formation topological specificity factor MinE	NA|261aa|up_8|NC_010730.1_568164_568947_-	COG2894, MinD, Septum formation inhibitor-activating ATPase [Cell division and chromosome partitioning]	NA|203aa|up_7|NC_010730.1_568961_569570_-	PRK00513, minC, septum formation inhibitor; Reviewed	NA|420aa|up_6|NC_010730.1_569653_570913_+	cd06828, PLPDE_III_DapDC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Diaminopimelate Decarboxylase	NA|425aa|up_5|NC_010730.1_570909_572184_+	PRK00885, PRK00885, phosphoribosylamine--glycine ligase; Provisional	NA|492aa|up_4|NC_010730.1_572358_573834_-	TIGR03701, mena_SCO4490, menaquinone biosynthesis decarboxylase, SCO4490 family	NA|215aa|up_3|NC_010730.1_573823_574468_-	COG2928, COG2928, Uncharacterized conserved protein [Function unknown]	NA|284aa|up_2|NC_010730.1_574464_575316_-	pfam01972, SDH_sah, Serine dehydrogenase proteinase	NA|228aa|up_1|NC_010730.1_575308_575992_-	pfam08761, dUTPase_2, dUTPase	NA|514aa|up_0|NC_010730.1_576218_577760_-	cd10792, GH57N_AmyC_like, N-terminal catalytic domain of  alpha-amylase ( AmyC ) and similar proteins	NA|212aa|down_0|NC_010730.1_580386_581022_+	cd00446, GrpE, nucleotide exchange factor GrpE	NA|297aa|down_1|NC_010730.1_581021_581912_+	TIGR02349, Chaperone_protein_DnaJ, chaperone protein DnaJ	NA|109aa|down_2|NC_010730.1_582192_582519_-	cd02980, TRX_Fd_family, Thioredoxin (TRX)-like [2Fe-2S] Ferredoxin (Fd) family; composed of [2Fe-2S] Fds with a TRX fold (TRX-like Fds) and proteins containing domains similar to TRX-like Fd including formate dehydrogenases, NAD-reducing hydrogenases and the subunit E of NADH:ubiquinone oxidoreductase (NuoE)	NA|357aa|down_3|NC_010730.1_582620_583691_-	COG0758, Smf, Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion]	NA|217aa|down_4|NC_010730.1_583749_584400_+	pfam07238, PilZ, PilZ domain	NA|218aa|down_5|NC_010730.1_584396_585050_-	COG0602, NrdG, Organic radical activating enzymes [Posttranslational modification, protein turnover, chaperones]	NA|929aa|down_6|NC_010730.1_585030_587817_-	PRK05306, infB, translation initiation factor IF-2; Validated	NA|99aa|down_7|NC_010730.1_587829_588126_-	COG1358, RPL8A, Ribosomal protein HS6-type (S12/L30/L7a) [Translation, ribosomal structure and biogenesis]	NA|372aa|down_8|NC_010730.1_588125_589241_-	PRK09202, nusA, transcription elongation factor NusA; Validated	NA|148aa|down_9|NC_010730.1_589255_589699_-	PRK00092, PRK00092, ribosome maturation protein RimP; Reviewed
GCF_000020325.1_ASM2032v1	NC_010730	Sulfurihydrogenibium sp. YO3AOP1, complete genome	3	693082-694222	2,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2	Cas14b_CAS-V-F,c2c9_V-U4,csa3,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,cas14j,DEDDh,cas14k	Type III-D,Type III-B,Type III-C,Type III-A	TTTATAACCCACACGGTTCAGATGTAAC,GTTACATCTGAACCGTGTGGGTTATAAAG,GTTACATCTGAACCGTGTGGGTTATAAA	28,29,28	0	0	NA	NA	NA:NA:NA	16,17,17	17	TypeIII-D,TypeIII-B,TypeIII-C,TypeIII-A	Cas14b_CAS-V-F,c2c9_V-U4,csa3,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,cas14j,DEDDh,cas14k	NA,NA	NA|161aa|up_9|NC_010730.1_683317_683800_+	pfam02517, Abi, CAAX protease self-immunity	cas6|253aa|up_8|NC_010730.1_683843_684602_+	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|120aa|up_7|NC_010730.1_684702_685062_+	cd16377, 23S_rRNA_IVP_like, 23S rRNA-intervening sequence protein and similar proteins	cas8b2|540aa|up_6|NC_010730.1_685073_686693_+	cd09665, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas7|305aa|up_5|NC_010730.1_686689_687604_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|231aa|up_4|NC_010730.1_687628_688321_+	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas3|730aa|up_3|NC_010730.1_688322_690512_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas4|170aa|up_2|NC_010730.1_690512_691022_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|327aa|up_1|NC_010730.1_691034_692015_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|89aa|up_0|NC_010730.1_692016_692283_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|431aa|down_0|NC_010730.1_694584_695877_+	TIGR01125, Ribosomal_protein_S12_methylthiotransferase_RimO, ribosomal protein S12 methylthiotransferase RimO	NA|340aa|down_1|NC_010730.1_695873_696893_+	COG0182, COG0182, Predicted translation initiation factor 2B subunit, eIF-2B alpha/beta/delta family [Translation, ribosomal structure and biogenesis]	NA|331aa|down_2|NC_010730.1_696889_697882_+	TIGR01826, Putative_gluconeogenesis_factor, conserved hypothetical protein, cofD-related	NA|526aa|down_3|NC_010730.1_697871_699449_+	COG0029, NadB, Aspartate oxidase [Coenzyme metabolism]	NA|169aa|down_4|NC_010730.1_699451_699958_+	PRK00028, infC, translation initiation factor IF-3; Reviewed	NA|255aa|down_5|NC_010730.1_701641_702406_-	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|207aa|down_6|NC_010730.1_702405_703026_-	PRK14824, PRK14824, putative deoxyribonucleotide triphosphate pyrophosphatase; Provisional	NA|155aa|down_7|NC_010730.1_703329_703794_-	cd16962, RuvC, Crossover junction endodeoxyribonuclease RuvC	NA|252aa|down_8|NC_010730.1_703793_704549_-	PRK00110, PRK00110, YebC/PmpR family DNA-binding transcriptional regulator	NA|207aa|down_9|NC_010730.1_704853_705474_+	COG2231, COG2231, Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair]
GCF_000020325.1_ASM2032v1	NC_010730	Sulfurihydrogenibium sp. YO3AOP1, complete genome	4	855939-856163	4,4,3	CRT,CRISPRCasFinder,PILER-CR	no		Cas14b_CAS-V-F,c2c9_V-U4,csa3,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,cas14j,DEDDh,cas14k	Orphan	GTTATTAGCCTACCTATGAGGAATTGAAA,TAGCCTACCTATGAGGAATTAAA,GTTATTAGCCTACCTATGAGGAATTGA	29,23,27	0	0	NA	NA	NA:NA:NA	3,3,2	3	Orphan	Cas14b_CAS-V-F,c2c9_V-U4,csa3,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,cas14j,DEDDh,cas14k	NA|91aa|up_9|NC_010730.1_848784_849057_+,NA|236aa|up_3|NC_010730.1_853134_853842_-,NA|100aa|down_0|NC_010730.1_856415_856715_+	NA|91aa|up_9|NC_010730.1_848784_849057_+	NA	NA|93aa|up_8|NC_010730.1_849070_849349_+	cd17074, Ubl_CysO_like, ubiquitin-like (Ubl) domain found in Mycobacterium tuberculosis CysO and similar proteins	NA|81aa|up_7|NC_010730.1_849348_849591_+	pfam09383, NIL, NIL domain	NA|306aa|up_6|NC_010730.1_849590_850508_+	COG0248, GppA, Exopolyphosphatase [Nucleotide transport and metabolism / Inorganic ion transport and metabolism]	NA|664aa|up_5|NC_010730.1_850522_852514_+	PRK05298, PRK05298, excinuclease ABC subunit UvrB	NA|92aa|up_4|NC_010730.1_852846_853122_-	cd02232, cupin_ARD, acireductone dioxygenase (ARD), cupin domain	NA|236aa|up_3|NC_010730.1_853134_853842_-	NA	NA|260aa|up_2|NC_010730.1_853905_854685_-	pfam01148, CTP_transf_1, Cytidylyltransferase family	NA|59aa|up_1|NC_010730.1_854681_854858_-	COG2835, COG2835, Uncharacterized conserved protein [Function unknown]	NA|231aa|up_0|NC_010730.1_854847_855540_-	cd00635, PLPDE_III_YBL036c_like, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzymes, YBL036c-like proteins	NA|100aa|down_0|NC_010730.1_856415_856715_+	NA	NA|483aa|down_1|NC_010730.1_856733_858182_+	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|439aa|down_2|NC_010730.1_858178_859495_+	cd01360, Adenylsuccinate_lyase_1, Adenylsuccinate lyase (ASL)_subgroup 1	NA|438aa|down_3|NC_010730.1_859924_861238_+	PRK14331, PRK14331, (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional	NA|161aa|down_4|NC_010730.1_861240_861723_+	pfam02577, DNase-RNase, Bifunctional nuclease	NA|592aa|down_5|NC_010730.1_861800_863576_+	COG1331, COG1331, Highly conserved protein containing a thioredoxin domain [Posttranslational modification, protein turnover, chaperones]	NA|311aa|down_6|NC_010730.1_863565_864498_-	PRK09352, PRK09352, beta-ketoacyl-ACP synthase 3	NA|353aa|down_7|NC_010730.1_864490_865549_-	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|59aa|down_8|NC_010730.1_865549_865726_-	PRK12286, rpmF, 50S ribosomal protein L32; Reviewed	NA|176aa|down_9|NC_010730.1_865775_866303_-	pfam02620, DUF177, Uncharacterized ACR, COG1399
GCF_000020325.1_ASM2032v1	NC_010730	Sulfurihydrogenibium sp. YO3AOP1, complete genome	5	1561790-1562087	4,5,5	PILER-CR,CRISPRCasFinder,CRT	no		Cas14b_CAS-V-F,c2c9_V-U4,csa3,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,cas14j,DEDDh,cas14k	Orphan	CTTTATAACCCACACGGTTCAGATGTAACCC,GGTTACATCTGAACCGTGTGGGTTATAAAG,GGTTACATCTGAACCGTGTGGGTTATAAAG	31,30,30	0	0	NA	NA	NA:NA:NA	3,4,4	4	Orphan	Cas14b_CAS-V-F,c2c9_V-U4,csa3,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,cas14j,DEDDh,cas14k	NA,NA|182aa|down_0|NC_010730.1_1562187_1562733_+,NA|70aa|down_1|NC_010730.1_1562956_1563166_+,NA|146aa|down_6|NC_010730.1_1567918_1568356_-	NA|178aa|up_9|NC_010730.1_1552156_1552690_+	COG0242, Def, N-formylmethionyl-tRNA deformylase [Translation, ribosomal structure and biogenesis]	NA|261aa|up_8|NC_010730.1_1552699_1553482_+	pfam01790, LGT, Prolipoprotein diacylglyceryl transferase	NA|414aa|up_7|NC_010730.1_1553465_1554707_+	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|284aa|up_6|NC_010730.1_1554681_1555533_+	PRK00130, truB, tRNA pseudouridine synthase B; Provisional	NA|379aa|up_5|NC_010730.1_1555851_1556988_+	smart00933, NurA, NurA nuclease	NA|107aa|up_4|NC_010730.1_1556984_1557305_+	pfam12836, HHH_3, Helix-hairpin-helix motif	NA|106aa|up_3|NC_010730.1_1557292_1557610_+	sd00006, TPR, Tetratricopeptide repeat	NA|326aa|up_2|NC_010730.1_1557591_1558569_+	cd03789, GT9_LPS_heptosyltransferase, lipopolysaccharide heptosyltransferase and similar proteins	NA|161aa|up_1|NC_010730.1_1558603_1559086_+	COG1905, NuoE, NADH:ubiquinone oxidoreductase 24 kD subunit [Energy production and conversion]	NA|428aa|up_0|NC_010730.1_1559060_1560344_+	COG1894, NuoF, NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [Energy production and conversion]	NA|182aa|down_0|NC_010730.1_1562187_1562733_+	NA	NA|70aa|down_1|NC_010730.1_1562956_1563166_+	NA	NA|632aa|down_2|NC_010730.1_1563392_1565288_-	pfam11992, DUF3488, Domain of unknown function (DUF3488)	NA|294aa|down_3|NC_010730.1_1565271_1566153_-	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|306aa|down_4|NC_010730.1_1566149_1567067_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|281aa|down_5|NC_010730.1_1567076_1567919_-	pfam02569, Pantoate_ligase, Pantoate-beta-alanine ligase	NA|146aa|down_6|NC_010730.1_1567918_1568356_-	NA	NA|222aa|down_7|NC_010730.1_1568361_1569027_-	COG0705, COG0705, Membrane associated serine protease [Amino acid transport and metabolism]	NA|408aa|down_8|NC_010730.1_1569013_1570237_-	cd17472, MFS_YajR_like, Escherichia coli inner membrane transport protein YajR and similar multidrug-efflux transporters of the Major Facilitator Superfamily	NA|333aa|down_9|NC_010730.1_1570233_1571232_-	PRK00094, gpsA, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase
