assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002355415.1_ASM235541v1	NZ_AP014936	Sulfurifustis variabilis strain skN76	1	772501-772609	1	CRISPRCasFinder	no		cas3,RT,csa3,DEDDh,DinG,cas6,Cas9_archaeal	Orphan	TCGCGGGAGAAGGAATGAGGATGAG	25	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,DEDDh,DinG,cas6,Cas9_archaeal	NA|98aa|up_6|NZ_AP014936.1_757695_757989_-,NA|104aa|up_4|NZ_AP014936.1_759937_760249_-,NA|117aa|down_2|NZ_AP014936.1_778425_778776_-,NA|261aa|down_3|NZ_AP014936.1_779131_779914_-,NA|85aa|down_4|NZ_AP014936.1_779989_780244_+	NA|132aa|up_9|NZ_AP014936.1_755485_755881_+	pfam12680, SnoaL_2, SnoaL-like domain	NA|159aa|up_8|NZ_AP014936.1_756327_756804_-	cd08353, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|94aa|up_7|NZ_AP014936.1_757136_757418_+	pfam05589, DUF768, Protein of unknown function (DUF768)	NA|98aa|up_6|NZ_AP014936.1_757695_757989_-	NA	NA|364aa|up_5|NZ_AP014936.1_758063_759154_+	pfam13683, rve_3, Integrase core domain	NA|104aa|up_4|NZ_AP014936.1_759937_760249_-	NA	NA|336aa|up_3|NZ_AP014936.1_760617_761625_+	TIGR02249, Integrase/recombinase_E2_protein	NA|1714aa|up_2|NZ_AP014936.1_761647_766789_+	PRK05672, dnaE2, error-prone DNA polymerase; Validated	NA|952aa|up_1|NZ_AP014936.1_767001_769857_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|369aa|up_0|NZ_AP014936.1_770003_771110_+	pfam06727, DUF1207, Protein of unknown function (DUF1207)	NA|420aa|down_0|NZ_AP014936.1_775927_777187_-	PRK00711, PRK00711, D-amino acid dehydrogenase	NA|327aa|down_1|NZ_AP014936.1_777436_778417_-	COG0530, ECM27, Ca2+/Na+ antiporter [Inorganic ion transport and metabolism]	NA|117aa|down_2|NZ_AP014936.1_778425_778776_-	NA	NA|261aa|down_3|NZ_AP014936.1_779131_779914_-	NA	NA|85aa|down_4|NZ_AP014936.1_779989_780244_+	NA	NA|319aa|down_5|NZ_AP014936.1_780746_781703_-	pfam12762, DDE_Tnp_IS1595, ISXO2-like transposase domain	NA|148aa|down_6|NZ_AP014936.1_782701_783145_+	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|157aa|down_7|NZ_AP014936.1_783249_783720_+	pfam01322, Cytochrom_C_2, Cytochrome C'	NA|323aa|down_8|NZ_AP014936.1_783677_784646_+	COG2010, CccA, Cytochrome c, mono- and diheme variants [Energy production and conversion]	NA|562aa|down_9|NZ_AP014936.1_784783_786469_+	TIGR02094, Glycogen_phosphorylase, alpha-glucan phosphorylases
GCF_002355415.1_ASM235541v1	NZ_AP014936	Sulfurifustis variabilis strain skN76	2	881016-881103	2	CRISPRCasFinder	no	cas3	cas3,RT,csa3,DEDDh,DinG,cas6,Cas9_archaeal	Unclear	TGTCATTGCGAGTGCAGCGAAGCAATCTC	29	0	0	NA	NA	NA	1	1	Unclear	cas3,RT,csa3,DEDDh,DinG,cas6,Cas9_archaeal	NA|131aa|up_9|NZ_AP014936.1_871823_872216_+,NA|93aa|up_3|NZ_AP014936.1_877842_878121_-,NA|139aa|up_2|NZ_AP014936.1_878314_878731_-,NA|211aa|up_0|NZ_AP014936.1_880318_880951_+,NA|452aa|down_0|NZ_AP014936.1_881184_882540_+,NA|195aa|down_5|NZ_AP014936.1_892159_892744_+,NA|124aa|down_7|NZ_AP014936.1_895804_896176_+	NA|131aa|up_9|NZ_AP014936.1_871823_872216_+	NA	NA|182aa|up_8|NZ_AP014936.1_872419_872965_-	COG0309, HypE, Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]	NA|325aa|up_7|NZ_AP014936.1_873270_874245_+	COG0330, HflC, Membrane protease subunits, stomatin/prohibitin homologs [Posttranslational modification, protein turnover, chaperones]	NA|148aa|up_6|NZ_AP014936.1_874255_874699_+	COG1585, COG1585, Membrane protein implicated in regulation of membrane protease activity [Posttranslational modification, protein turnover, chaperones / Intracellular trafficking and secretion]	NA|133aa|up_5|NZ_AP014936.1_874791_875190_-	cd04622, CBS_pair_HRP1_like, CBS pair domain found in Hypoxic Response Protein 1 (HRP1) -like proteinds	NA|647aa|up_4|NZ_AP014936.1_875717_877658_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|93aa|up_3|NZ_AP014936.1_877842_878121_-	NA	NA|139aa|up_2|NZ_AP014936.1_878314_878731_-	NA	NA|318aa|up_1|NZ_AP014936.1_879073_880027_+	cd01846, fatty_acyltransferase_like, Fatty acyltransferase-like subfamily of the SGNH hydrolases, a diverse family of lipases and esterases	NA|211aa|up_0|NZ_AP014936.1_880318_880951_+	NA	NA|452aa|down_0|NZ_AP014936.1_881184_882540_+	NA	NA|403aa|down_1|NZ_AP014936.1_882872_884081_+	COG4850, COG4850, Uncharacterized conserved protein [Function unknown]	cas3|1162aa|down_2|NZ_AP014936.1_884326_887812_+	cd18011, DEXDc_RapA, DEXH-box helicase domain of RapA	NA|491aa|down_3|NZ_AP014936.1_887815_889288_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|958aa|down_4|NZ_AP014936.1_889284_892158_+	COG1743, COG1743, Adenine-specific DNA methylase containing a Zn-ribbon [DNA replication, recombination, and repair]	NA|195aa|down_5|NZ_AP014936.1_892159_892744_+	NA	NA|944aa|down_6|NZ_AP014936.1_892753_895585_+	COG1483, COG1483, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|124aa|down_7|NZ_AP014936.1_895804_896176_+	NA	NA|235aa|down_8|NZ_AP014936.1_898046_898751_+	pfam13462, Thioredoxin_4, Thioredoxin	NA|444aa|down_9|NZ_AP014936.1_899107_900439_+	PRK11649, PRK11649, putative peptidase; Provisional
GCF_002355415.1_ASM235541v1	NZ_AP014936	Sulfurifustis variabilis strain skN76	3	909305-909430	3	CRISPRCasFinder	no		cas3,RT,csa3,DEDDh,DinG,cas6,Cas9_archaeal	Orphan	TCACCCTCTCCCCGGCCCTCCCCCATCGAGGGGGAGG	37	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,DEDDh,DinG,cas6,Cas9_archaeal	NA|133aa|up_8|NZ_AP014936.1_902660_903059_+,NA|207aa|up_6|NZ_AP014936.1_903837_904458_+,NA|87aa|up_5|NZ_AP014936.1_904582_904843_-,NA|61aa|up_1|NZ_AP014936.1_907717_907900_-,NA|415aa|up_0|NZ_AP014936.1_907930_909175_+,NA|99aa|down_1|NZ_AP014936.1_910811_911108_-,NA|168aa|down_2|NZ_AP014936.1_911309_911813_+,NA|192aa|down_4|NZ_AP014936.1_913264_913840_-	NA|128aa|up_9|NZ_AP014936.1_902245_902629_+	COG3654, Doc, Prophage maintenance system killer protein [General function prediction only]	NA|133aa|up_8|NZ_AP014936.1_902660_903059_+	NA	NA|114aa|up_7|NZ_AP014936.1_903207_903549_+	COG2824, PhnA, Uncharacterized Zn-ribbon-containing protein involved in phosphonate metabolism [Inorganic ion transport and metabolism]	NA|207aa|up_6|NZ_AP014936.1_903837_904458_+	NA	NA|87aa|up_5|NZ_AP014936.1_904582_904843_-	NA	NA|421aa|up_4|NZ_AP014936.1_905020_906283_+	pfam04286, DUF445, Protein of unknown function (DUF445)	NA|134aa|up_3|NZ_AP014936.1_906279_906681_+	COG1188, COG1188, Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]	NA|119aa|up_2|NZ_AP014936.1_906943_907300_-	pfam11399, DUF3192, Protein of unknown function (DUF3192)	NA|61aa|up_1|NZ_AP014936.1_907717_907900_-	NA	NA|415aa|up_0|NZ_AP014936.1_907930_909175_+	NA	NA|313aa|down_0|NZ_AP014936.1_909676_910615_+	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|99aa|down_1|NZ_AP014936.1_910811_911108_-	NA	NA|168aa|down_2|NZ_AP014936.1_911309_911813_+	NA	NA|239aa|down_3|NZ_AP014936.1_912076_912793_+	PRK09038, PRK09038, flagellar motor protein MotD; Reviewed	NA|192aa|down_4|NZ_AP014936.1_913264_913840_-	NA	NA|369aa|down_5|NZ_AP014936.1_914169_915276_-	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor	NA|205aa|down_6|NZ_AP014936.1_915829_916444_+	TIGR04239, GlpG_protein, rhomboid family protease GlpG	NA|509aa|down_7|NZ_AP014936.1_916571_918098_+	pfam10670, DUF4198, Domain of unknown function (DUF4198)	NA|234aa|down_8|NZ_AP014936.1_918341_919043_+	PRK12378, PRK12378, YebC/PmpR family DNA-binding transcriptional regulator	NA|238aa|down_9|NZ_AP014936.1_919510_920224_+	pfam01863, DUF45, Protein of unknown function DUF45
GCF_002355415.1_ASM235541v1	NZ_AP014936	Sulfurifustis variabilis strain skN76	4	1253009-1253157	4	CRISPRCasFinder	no		cas3,RT,csa3,DEDDh,DinG,cas6,Cas9_archaeal	Orphan	CGCTCGGTTGGACCCCTCTCCCTGTCCCTCTCCCGCGAGGGGAGAGGGGACCAAC	55	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,DEDDh,DinG,cas6,Cas9_archaeal	NA|106aa|up_0|NZ_AP014936.1_1252630_1252948_+,NA|194aa|down_5|NZ_AP014936.1_1261529_1262111_+	NA|392aa|up_9|NZ_AP014936.1_1243097_1244273_+	cd03886, M20_Acy1, M20 Peptidase Aminoacylase 1 family	NA|376aa|up_8|NZ_AP014936.1_1244272_1245400_+	PRK13517, PRK13517, glutamate--cysteine ligase	NA|507aa|up_7|NZ_AP014936.1_1245634_1247155_-	cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2	NA|235aa|up_6|NZ_AP014936.1_1247327_1248032_-	COG0410, LivF, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]	NA|244aa|up_5|NZ_AP014936.1_1248018_1248750_-	cd03219, ABC_Mj1267_LivG_branched, ATP-binding cassette component of branched chain amino acids transport system	NA|323aa|up_4|NZ_AP014936.1_1248736_1249705_-	cd06581, TM_PBP1_LivM_like, Transmembrane subunit (TM) of Escherichia coli LivM and related proteins	NA|289aa|up_3|NZ_AP014936.1_1249701_1250568_-	cd06582, TM_PBP1_LivH_like, Transmembrane subunit (TM) of Escherichia coli LivH and related proteins	NA|410aa|up_2|NZ_AP014936.1_1250613_1251843_-	cd19982, PBP1_ABC_ligand_binding-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems predicted to be involved in transport of amino acids, peptides, or inorganic ions	NA|203aa|up_1|NZ_AP014936.1_1251976_1252585_+	COG3124, COG3124, Uncharacterized protein conserved in bacteria [Function unknown]	NA|106aa|up_0|NZ_AP014936.1_1252630_1252948_+	NA	NA|518aa|down_0|NZ_AP014936.1_1253648_1255202_-	cd03477, Rieske_YhfW_C, YhfW family, C-terminal Rieske domain; YhfW is a protein of unknown function with an N-terminal DadA-like (glycine/D-amino acid dehydrogenase) domain and a C-terminal Rieske domain	NA|123aa|down_1|NZ_AP014936.1_1255294_1255663_-	pfam08850, DUF1820, Domain of unknown function (DUF1820)	NA|535aa|down_2|NZ_AP014936.1_1255703_1257308_-	pfam05977, MFS_3, Transmembrane secretion effector	NA|729aa|down_3|NZ_AP014936.1_1257389_1259576_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|74aa|down_4|NZ_AP014936.1_1259755_1259977_-	pfam04102, SlyX, SlyX	NA|194aa|down_5|NZ_AP014936.1_1261529_1262111_+	NA	NA|110aa|down_6|NZ_AP014936.1_1262186_1262516_-	pfam02600, DsbB, Disulfide bond formation protein DsbB	NA|157aa|down_7|NZ_AP014936.1_1262544_1263015_+	cd06989, cupin_DRT102, Arabidopsis thaliana DRT102 and related proteins, cupin domain	NA|155aa|down_8|NZ_AP014936.1_1263162_1263627_+	pfam01957, NfeD, NfeD-like C-terminal, partner-binding	NA|124aa|down_9|NZ_AP014936.1_1263744_1264116_+	COG3695, COG3695, Predicted methylated DNA-protein cysteine methyltransferase [DNA replication, recombination, and repair]
GCF_002355415.1_ASM235541v1	NZ_AP014936	Sulfurifustis variabilis strain skN76	5	1446755-1446845	5	CRISPRCasFinder	no		cas3,RT,csa3,DEDDh,DinG,cas6,Cas9_archaeal	Orphan	TCTCACCAACTCCCCCTCTCCCGC	24	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,DEDDh,DinG,cas6,Cas9_archaeal	NA|178aa|up_4|NZ_AP014936.1_1442459_1442993_+,NA|103aa|down_3|NZ_AP014936.1_1452407_1452716_+,NA|67aa|down_4|NZ_AP014936.1_1452712_1452913_-	NA|439aa|up_9|NZ_AP014936.1_1437438_1438755_+	COG1593, DctQ, TRAP-type C4-dicarboxylate transport system, large permease component [Carbohydrate transport and metabolism]	NA|462aa|up_8|NZ_AP014936.1_1438751_1440137_+	pfam05292, MCD, Malonyl-CoA decarboxylase C-terminal domain	NA|257aa|up_7|NZ_AP014936.1_1440327_1441098_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|217aa|up_6|NZ_AP014936.1_1441124_1441775_-	sd00010, SLR, Sel1-like repeat	NA|170aa|up_5|NZ_AP014936.1_1441885_1442395_+	PRK03661, PRK03661, nicotinamide-nucleotide amidase	NA|178aa|up_4|NZ_AP014936.1_1442459_1442993_+	NA	NA|383aa|up_3|NZ_AP014936.1_1443012_1444161_-	pfam04551, GcpE, GcpE protein	NA|249aa|up_2|NZ_AP014936.1_1444237_1444984_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|349aa|up_1|NZ_AP014936.1_1445170_1446217_+	PRK09354, recA, recombinase A; Provisional	NA|155aa|up_0|NZ_AP014936.1_1446216_1446681_+	PRK00117, recX, recombination regulator RecX; Reviewed	NA|863aa|down_0|NZ_AP014936.1_1446954_1449543_+	PRK00252, alaS, alanyl-tRNA synthetase; Reviewed	NA|419aa|down_1|NZ_AP014936.1_1449646_1450903_+	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|70aa|down_2|NZ_AP014936.1_1451236_1451446_+	PRK01712, PRK01712, carbon storage regulator CsrA	NA|103aa|down_3|NZ_AP014936.1_1452407_1452716_+	NA	NA|67aa|down_4|NZ_AP014936.1_1452712_1452913_-	NA	NA|146aa|down_5|NZ_AP014936.1_1452951_1453389_+	pfam11645, PDDEXK_5, PD-(D/E)XK endonuclease	NA|449aa|down_6|NZ_AP014936.1_1454146_1455493_-	TIGR02915, Two_Component_Transcriptional_Regulator_Fis_family, PEP-CTERM-box response regulator transcription factor	NA|726aa|down_7|NZ_AP014936.1_1455462_1457640_-	TIGR02916, sensor_histidine_kinase, putative PEP-CTERM system histidine kinase	NA|796aa|down_8|NZ_AP014936.1_1458088_1460476_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|541aa|down_9|NZ_AP014936.1_1460509_1462132_-	cd06853, GT_WecA_like, This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins
GCF_002355415.1_ASM235541v1	NZ_AP014936	Sulfurifustis variabilis strain skN76	6	2937112-2937174	6	CRISPRCasFinder	no		cas3,RT,csa3,DEDDh,DinG,cas6,Cas9_archaeal	Orphan	GGCATCAACGCCTGCAAGGGCAAG	24	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,DEDDh,DinG,cas6,Cas9_archaeal	NA|137aa|up_6|NZ_AP014936.1_2928026_2928437_+,NA	NA|246aa|up_9|NZ_AP014936.1_2923960_2924698_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|300aa|up_8|NZ_AP014936.1_2924805_2925705_-	PRK10797, PRK10797, glutamate and aspartate transporter subunit; Provisional	NA|672aa|up_7|NZ_AP014936.1_2925909_2927925_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|137aa|up_6|NZ_AP014936.1_2928026_2928437_+	NA	NA|163aa|up_5|NZ_AP014936.1_2928671_2929160_+	COG3748, COG3748, Predicted membrane protein [Function unknown]	NA|388aa|up_4|NZ_AP014936.1_2929316_2930480_+	PRK02627, PRK02627, acetylornithine aminotransferase; Provisional	NA|303aa|up_3|NZ_AP014936.1_2930476_2931385_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|357aa|up_2|NZ_AP014936.1_2931493_2932564_+	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|741aa|up_1|NZ_AP014936.1_2932662_2934885_+	cd01948, EAL, EAL domain	NA|559aa|up_0|NZ_AP014936.1_2934991_2936668_+	PRK11819, PRK11819, putative ABC transporter ATP-binding protein; Reviewed	NA|289aa|down_0|NZ_AP014936.1_2937295_2938162_+	pfam05114, DUF692, Protein of unknown function (DUF692)	NA|260aa|down_1|NZ_AP014936.1_2938189_2938969_+	pfam09836, DUF2063, Putative DNA-binding domain	NA|902aa|down_2|NZ_AP014936.1_2938930_2941636_-	cd02754, MopB_Nitrate-R-NapA-like, Nitrate reductases, NapA (Nitrate-R-NapA), NasA, and NarB catalyze the reduction of nitrate to nitrite	NA|121aa|down_3|NZ_AP014936.1_2941652_2942015_-	cd03530, Rieske_NirD_small_Bacillus, Small subunit of nitrite reductase (NirD) family, Rieske domain; composed of proteins similar to the Bacillus subtilis small subunit of assimilatory nitrite reductase containing a Rieske domain	NA|815aa|down_4|NZ_AP014936.1_2942352_2944797_-	COG1251, NirB, NAD(P)H-nitrite reductase [Energy production and conversion]	NA|267aa|down_5|NZ_AP014936.1_2944804_2945605_-	COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|300aa|down_6|NZ_AP014936.1_2945616_2946516_-	TIGR01183, Nitrate_transport_permease_protein_NrtB, nitrate ABC transporter, permease protein	NA|476aa|down_7|NZ_AP014936.1_2946532_2947960_-	pfam13379, NMT1_2, NMT1-like family	NA|197aa|down_8|NZ_AP014936.1_2948306_2948897_-	pfam11845, DUF3365, Protein of unknown function (DUF3365)	NA|456aa|down_9|NZ_AP014936.1_2948893_2950261_-	TIGR03508, decahem_SO, decaheme c-type cytochrome, DmsE family
