assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_007855915.1_ASM785591v1	NZ_CP042251	Geobacillus thermoleovorans strain ARTRW1 chromosome, complete genome	1	394716-398091	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no		cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	Orphan	GTTTTTATCGTACCTATGAGGGATTGAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	50,50,50	50	Orphan	cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	NA,NA|75aa|down_0|NZ_CP042251.1_398166_398391_-,NA|175aa|down_3|NZ_CP042251.1_401977_402502_-,NA|62aa|down_8|NZ_CP042251.1_409026_409212_+	NA|210aa|up_9|NZ_CP042251.1_379367_379997_-	COG1174, OpuBB, ABC-type proline/glycine betaine transport systems, permease component [Amino acid transport and metabolism]	NA|376aa|up_8|NZ_CP042251.1_380251_381379_-	cd03295, ABC_OpuCA_Osmoprotection, ATP-binding cassette domain of the osmoprotectant transporter	NA|301aa|up_7|NZ_CP042251.1_381395_382298_-	cd13528, PBP2_osmoprotectants, Substrate-binding domain of osmoregulatory ABC-type transporters; the type 2 periplasmic-binding protein fold	NA|553aa|up_6|NZ_CP042251.1_382878_384537_+	pfam14104, DUF4277, Domain of unknown function (DUF4277)	NA|216aa|up_5|NZ_CP042251.1_384665_385313_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|497aa|up_4|NZ_CP042251.1_385457_386948_+	TIGR02677, conserved_hypothetical_protein, TIGR02677 family protein	NA|399aa|up_3|NZ_CP042251.1_386950_388147_+	TIGR02678, hypothetical_protein, TIGR02678 family protein	NA|1374aa|up_2|NZ_CP042251.1_388106_392228_+	TIGR02680, conserved_hypothetical_protein, TIGR02680 family protein	NA|407aa|up_1|NZ_CP042251.1_392224_393445_+	TIGR02679, conserved_hypothetical_protein, TIGR02679 family protein	NA|309aa|up_0|NZ_CP042251.1_393587_394514_+	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|75aa|down_0|NZ_CP042251.1_398166_398391_-	NA	NA|411aa|down_1|NZ_CP042251.1_399932_401165_-	cd17391, MFS_MdtG_MDR_like, Multidrug resistance protein MdtG and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|218aa|down_2|NZ_CP042251.1_401364_402018_-	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|175aa|down_3|NZ_CP042251.1_401977_402502_-	NA	NA|770aa|down_4|NZ_CP042251.1_403410_405720_-	COG1511, COG1511, Predicted membrane protein [Function unknown]	NA|458aa|down_5|NZ_CP042251.1_406372_407746_-	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|221aa|down_6|NZ_CP042251.1_407735_408398_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|142aa|down_7|NZ_CP042251.1_408577_409003_+	pfam03929, PepSY_TM, PepSY-associated TM region	NA|62aa|down_8|NZ_CP042251.1_409026_409212_+	NA	NA|402aa|down_9|NZ_CP042251.1_409192_410398_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family
GCF_007855915.1_ASM785591v1	NZ_CP042251	Geobacillus thermoleovorans strain ARTRW1 chromosome, complete genome	2	1128907-1129000	2	CRISPRCasFinder	no		cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	Orphan	GTAGACACAAAATATAGTGCGGAAGAATCG	30	0	0	NA	NA	NA	1	1	Orphan	cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	NA|73aa|up_9|NZ_CP042251.1_1119437_1119656_+,NA	NA|73aa|up_9|NZ_CP042251.1_1119437_1119656_+	NA	NA|262aa|up_8|NZ_CP042251.1_1119759_1120545_+	cd05344, BKR_like_SDR_like, putative beta-ketoacyl acyl carrier protein [ACP] reductase (BKR)-like, SDR	NA|289aa|up_7|NZ_CP042251.1_1120571_1121438_+	COG2084, MmsB, 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases [Lipid metabolism]	NA|404aa|up_6|NZ_CP042251.1_1121612_1122824_+	cd01155, ACAD_FadE2, Acyl-CoA dehydrogenases similar to fadE2	NA|261aa|up_5|NZ_CP042251.1_1122842_1123625_+	PRK08213, PRK08213, gluconate 5-dehydrogenase; Provisional	NA|540aa|up_4|NZ_CP042251.1_1123646_1125266_+	cd05936, FC-FACS_FadD_like, Prokaryotic long-chain fatty acid CoA synthetases similar to Escherichia coli FadD	NA|195aa|up_3|NZ_CP042251.1_1125359_1125944_+	pfam17932, TetR_C_24, Tetracyclin repressor-like, C-terminal domain	NA|353aa|up_2|NZ_CP042251.1_1126041_1127100_+	cd05154, ACAD10_11_N-like, N-terminal domain of Acyl-CoA dehydrogenase (ACAD) 10 and 11, and similar proteins	NA|261aa|up_1|NZ_CP042251.1_1127109_1127892_+	pfam04029, 2-ph_phosp, 2-phosphosulpholactate phosphatase	NA|325aa|up_0|NZ_CP042251.1_1127888_1128863_+	cd08241, QOR1, Quinone oxidoreductase (QOR)	NA|386aa|down_0|NZ_CP042251.1_1129163_1130321_-	PRK07683, PRK07683, aminotransferase A; Validated	NA|76aa|down_1|NZ_CP042251.1_1130476_1130704_-	PRK03636, PRK03636, hypothetical protein; Provisional	NA|331aa|down_2|NZ_CP042251.1_1130899_1131892_+	cd12831, TmCorA-like_u2, Uncharacterized bacterial subfamily of the Thermotoga maritima CorA-like family	NA|217aa|down_3|NZ_CP042251.1_1131968_1132619_+	cd06418, GH25_BacA-like, BacA is a bacterial lysin from Enterococcus faecalis that degrades bacterial cell walls by catalyzing the hydrolysis of 1,4-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues	NA|543aa|down_4|NZ_CP042251.1_1132792_1134421_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|341aa|down_5|NZ_CP042251.1_1134420_1135443_+	PRK13800, PRK13800, fumarate reductase/succinate dehydrogenase flavoprotein subunit	NA|469aa|down_6|NZ_CP042251.1_1135442_1136849_+	cd06423, CESA_like, CESA_like is  the cellulose synthase superfamily	NA|145aa|down_7|NZ_CP042251.1_1137000_1137435_-	pfam14177, YkyB, YkyB-like protein	NA|290aa|down_8|NZ_CP042251.1_1137503_1138373_-	cd07385, MPP_YkuE_C, Bacillus subtilis YkuE and related proteins, C-terminal metallophosphatase domain	NA|256aa|down_9|NZ_CP042251.1_1138516_1139284_+	PRK07677, PRK07677, short chain dehydrogenase; Provisional
GCF_007855915.1_ASM785591v1	NZ_CP042251	Geobacillus thermoleovorans strain ARTRW1 chromosome, complete genome	3	2063815-2064713	3,2,2	CRISPRCasFinder,CRT,PILER-CR	no		cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	Orphan	GTTTCAATCCCTCATAGGTACGATAAAAAC,GTTTCAATCCCTCATAGGTACGATAAAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	13,13,12	13	Orphan	cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	NA,NA	NA|450aa|up_9|NZ_CP042251.1_2051287_2052637_-	pfam02447, GntP_permease, GntP family permease	NA|515aa|up_8|NZ_CP042251.1_2052793_2054338_-	TIGR01314, gntK_FGGY, gluconate kinase, FGGY type	NA|349aa|up_7|NZ_CP042251.1_2054324_2055371_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|312aa|up_6|NZ_CP042251.1_2055891_2056827_-	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|229aa|up_5|NZ_CP042251.1_2056888_2057575_-	COG4565, CitB, Response regulator of citrate/malate metabolism [Transcription / Signal transduction mechanisms]	NA|533aa|up_4|NZ_CP042251.1_2057647_2059246_-	COG3290, CitA, Signal transduction histidine kinase regulating citrate/malate metabolism [Signal transduction mechanisms]	NA|509aa|up_3|NZ_CP042251.1_2059318_2060845_-	COG3333, COG3333, Uncharacterized protein conserved in bacteria [Function unknown]	NA|155aa|up_2|NZ_CP042251.1_2060858_2061323_-	pfam07331, TctB, Tripartite tricarboxylate transporter TctB family	NA|347aa|up_1|NZ_CP042251.1_2061377_2062418_-	cd07012, PBP2_Bug_TTT, Bug (Bordetella uptake gene) protein family of periplasmic solute-binding receptors; contains the type 2 periplasmic binding fold	NA|310aa|up_0|NZ_CP042251.1_2062784_2063714_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|387aa|down_0|NZ_CP042251.1_2064893_2066054_-	PRK02318, PRK02318, mannitol-1-phosphate 5-dehydrogenase; Provisional	NA|148aa|down_1|NZ_CP042251.1_2066053_2066497_-	COG4668, MtlA, Mannitol/fructose-specific phosphotransferase system, IIA domain [Carbohydrate transport and metabolism]	NA|697aa|down_2|NZ_CP042251.1_2066502_2068593_-	COG3711, BglG, Transcriptional antiterminator [Transcription]	NA|483aa|down_3|NZ_CP042251.1_2068904_2070353_-	COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC component [Carbohydrate transport and metabolism]	NA|168aa|down_4|NZ_CP042251.1_2070495_2070999_-	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|97aa|down_5|NZ_CP042251.1_2070991_2071282_-	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|455aa|down_6|NZ_CP042251.1_2071508_2072873_-	pfam06782, UPF0236, Uncharacterized protein family (UPF0236)	NA|487aa|down_7|NZ_CP042251.1_2073488_2074949_-	cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2	NA|396aa|down_8|NZ_CP042251.1_2075150_2076338_-	cd08194, Fe-ADH-like, Iron-containing alcohol dehydrogenases-like	NA|570aa|down_9|NZ_CP042251.1_2076532_2078242_-	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]
GCF_007855915.1_ASM785591v1	NZ_CP042251	Geobacillus thermoleovorans strain ARTRW1 chromosome, complete genome	4	2997928-2998010	4	CRISPRCasFinder	no	Cas14u_CAS-V	cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	Unclear	ATGGCCACCAAAGACGAACTCGC	23	0	0	NA	NA	NA	1	1	Unclear	cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	NA|96aa|up_5|NZ_CP042251.1_2991520_2991808_-,NA|71aa|down_1|NZ_CP042251.1_3010310_3010523_-,NA|84aa|down_7|NZ_CP042251.1_3016078_3016330_-	NA|390aa|up_9|NZ_CP042251.1_2986203_2987373_-	PRK05293, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|667aa|up_8|NZ_CP042251.1_2987254_2989255_-	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|317aa|up_7|NZ_CP042251.1_2989473_2990424_+	COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]	NA|209aa|up_6|NZ_CP042251.1_2990556_2991183_+	COG1280, RhtB, Putative threonine efflux protein [Amino acid transport and metabolism]	NA|96aa|up_5|NZ_CP042251.1_2991520_2991808_-	NA	NA|185aa|up_4|NZ_CP042251.1_2991817_2992372_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|501aa|up_3|NZ_CP042251.1_2992601_2994104_-	pfam17936, Big_6, Bacterial Ig domain	NA|327aa|up_2|NZ_CP042251.1_2994156_2995137_-	pfam01032, FecCD, FecCD transport family	NA|333aa|up_1|NZ_CP042251.1_2995129_2996128_-	pfam01032, FecCD, FecCD transport family	NA|314aa|up_0|NZ_CP042251.1_2996158_2997100_-	cd01146, FhuD, Fe3+-siderophore binding domain FhuD	NA|154aa|down_0|NZ_CP042251.1_2998646_2999108_-	pfam11518, DUF3221, Protein of unknown function (DUF3221)	NA|71aa|down_1|NZ_CP042251.1_3010310_3010523_-	NA	NA|273aa|down_2|NZ_CP042251.1_3010567_3011386_+	PRK03187, tgl, transglutaminase; Provisional	NA|303aa|down_3|NZ_CP042251.1_3011356_3012265_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|226aa|down_4|NZ_CP042251.1_3012391_3013069_+	COG1285, SapB, Uncharacterized membrane protein [Function unknown]	NA|434aa|down_5|NZ_CP042251.1_3013445_3014747_-	COG3935, DnaD, Putative primosome component and related proteins [DNA replication, recombination, and repair]	NA|294aa|down_6|NZ_CP042251.1_3014984_3015866_+	smart00318, SNc, Staphylococcal nuclease homologues	NA|84aa|down_7|NZ_CP042251.1_3016078_3016330_-	NA	NA|167aa|down_8|NZ_CP042251.1_3016387_3016888_-	COG1247, COG1247, Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane]	NA|63aa|down_9|NZ_CP042251.1_3017823_3018012_-	pfam08141, SspH, Small acid-soluble spore protein H family
GCF_007855915.1_ASM785591v1	NZ_CP042251	Geobacillus thermoleovorans strain ARTRW1 chromosome, complete genome	5	3499820-3499916	5	CRISPRCasFinder	no		cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	Orphan	GTTTTCCAAAAACAGACAACGAATCGTTGT	30	1	1	3499850-3499886	NZ_CP042251.1_1852475-1852439	NA	1	1	Orphan	cas3,c2c10_CAS-V-U3,cas14k,csa3,RT,DEDDh,DinG,cas14j,Cas14u_CAS-V	NA,NA|64aa|down_2|NZ_CP042251.1_3502510_3502702_-	NA|429aa|up_9|NZ_CP042251.1_3487397_3488684_-	PRK12830, PRK12830, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Reviewed	NA|214aa|up_8|NZ_CP042251.1_3488816_3489458_-	PRK01362, PRK01362, fructose-6-phosphate aldolase	NA|288aa|up_7|NZ_CP042251.1_3489533_3490397_-	PRK07709, PRK07709, fructose-bisphosphate aldolase; Provisional	NA|121aa|up_6|NZ_CP042251.1_3490605_3490968_-	cd17553, REC_Spo0F-like, phosphoacceptor receiver (REC) domain of Spo0F and similar domains	NA|174aa|up_5|NZ_CP042251.1_3491091_3491613_+	pfam10740, DUF2529, Domain of unknown function (DUF2529)	NA|532aa|up_4|NZ_CP042251.1_3491793_3493389_-	PRK05380, pyrG, CTP synthetase; Validated	NA|186aa|up_3|NZ_CP042251.1_3493549_3494107_-	TIGR04567, DNA-directed_RNA_polymerase_subunit_delta, DNA-directed RNA polymerase delta subunit	NA|1087aa|up_2|NZ_CP042251.1_3494495_3497756_-	cd03678, MM_CoA_mutase_1, Coenzyme B12-dependent-methylmalonyl coenzyme A (CoA) mutase (MCM) family, unknown subfamily 1; composed of uncharacterized bacterial proteins containing a C-terminal MCM domain	NA|211aa|up_1|NZ_CP042251.1_3497773_3498406_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|381aa|up_0|NZ_CP042251.1_3498613_3499756_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|382aa|down_0|NZ_CP042251.1_3500267_3501413_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|284aa|down_1|NZ_CP042251.1_3501646_3502498_-	PRK05808, PRK05808, 3-hydroxybutyryl-CoA dehydrogenase; Validated	NA|64aa|down_2|NZ_CP042251.1_3502510_3502702_-	NA	NA|393aa|down_3|NZ_CP042251.1_3502820_3503999_-	PRK08235, PRK08235, acetyl-CoA C-acetyltransferase	NA|699aa|down_4|NZ_CP042251.1_3504155_3506252_-	COG0247, GlpC, Fe-S oxidoreductase [Energy production and conversion]	NA|397aa|down_5|NZ_CP042251.1_3506588_3507779_+	PRK01642, cls, cardiolipin synthetase; Reviewed	NA|56aa|down_6|NZ_CP042251.1_3507908_3508076_-	COG4317, COG4317, Uncharacterized protein conserved in bacteria [Function unknown]	NA|558aa|down_7|NZ_CP042251.1_3508150_3509824_-	PRK01611, argS, arginyl-tRNA synthetase; Reviewed	NA|144aa|down_8|NZ_CP042251.1_3509827_3510259_-	pfam09148, DUF1934, Domain of unknown function (DUF1934)	NA|126aa|down_9|NZ_CP042251.1_3510513_3510891_-	PRK14485, PRK14485, putative bifunctional cbb3-type cytochrome c oxidase subunit I/II; Provisional
