assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000174795.2_ASM17479v2	NC_014915	Geobacillus sp. Y412MC52, complete sequence	1	339380-341000	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas8b1,cas7,cas5,cas3,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11	cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	Type III-A,Type III-C,Type III-D,Type III-B,Type I-B	GTTTTTATCGTACCTATGAGGGATTGAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	23,24,24	24	TypeIII-A,TypeIII-C,TypeIII-D,TypeIII-B,TypeI-B	cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	NA,NA	NA|196aa|up_9|NC_014915.1_324822_325410_+	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|210aa|up_8|NC_014915.1_325857_326487_-	COG1174, OpuBB, ABC-type proline/glycine betaine transport systems, permease component [Amino acid transport and metabolism]	NA|376aa|up_7|NC_014915.1_326741_327869_-	cd03295, ABC_OpuCA_Osmoprotection, ATP-binding cassette domain of the osmoprotectant transporter	NA|301aa|up_6|NC_014915.1_327885_328788_-	cd13528, PBP2_osmoprotectants, Substrate-binding domain of osmoregulatory ABC-type transporters; the type 2 periplasmic-binding protein fold	NA|244aa|up_5|NC_014915.1_329258_329990_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|497aa|up_4|NC_014915.1_330134_331625_+	TIGR02677, conserved_hypothetical_protein, TIGR02677 family protein	NA|399aa|up_3|NC_014915.1_331627_332824_+	TIGR02678, hypothetical_protein, TIGR02678 family protein	NA|1354aa|up_2|NC_014915.1_332783_336845_+	TIGR02680, conserved_hypothetical_protein, TIGR02680 family protein	NA|407aa|up_1|NC_014915.1_336901_338122_+	TIGR02679, conserved_hypothetical_protein, TIGR02679 family protein	NA|305aa|up_0|NC_014915.1_338266_339181_+	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	cas8b1|572aa|down_0|NC_014915.1_341179_342895_+	pfam09484, Cas_TM1802, CRISPR-associated protein TM1802 (cas_TM1802)	cas7|319aa|down_1|NC_014915.1_342896_343853_+	TIGR02590, hypothetical_protein_MM_0563, CRISPR-associated protein Cas7/Csh2, subtype I-B/HMARI	cas5|248aa|down_2|NC_014915.1_343868_344612_+	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas3|777aa|down_3|NC_014915.1_344613_346944_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas4|170aa|down_4|NC_014915.1_346953_347463_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|334aa|down_5|NC_014915.1_347466_348468_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|88aa|down_6|NC_014915.1_348478_348742_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas6|249aa|down_7|NC_014915.1_348778_349525_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	csm6|399aa|down_8|NC_014915.1_352687_353884_+	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	csx1|447aa|down_9|NC_014915.1_353886_355227_+	pfam09455, Cas_DxTHG, CRISPR-associated (Cas) DxTHG family
GCF_000174795.2_ASM17479v2	NC_014915	Geobacillus sp. Y412MC52, complete sequence	2	349707-352139	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas8b1,cas7,cas5,cas3,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT	cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	Type III-A,Type III-C,Type III-D, Type III-B?,Type III-B,Type I-B	GTTTTTATCGTACCTATGAGGGATTGAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	36,36,36	36	TypeIII-A,TypeIII-C,TypeIII-D,TypeIII-B?,TypeIII-B,TypeI-B	cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	NA,NA	NA|407aa|up_9|NC_014915.1_336901_338122_+	TIGR02679, conserved_hypothetical_protein, TIGR02679 family protein	NA|305aa|up_8|NC_014915.1_338266_339181_+	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	cas8b1|572aa|up_7|NC_014915.1_341179_342895_+	pfam09484, Cas_TM1802, CRISPR-associated protein TM1802 (cas_TM1802)	cas7|319aa|up_6|NC_014915.1_342896_343853_+	TIGR02590, hypothetical_protein_MM_0563, CRISPR-associated protein Cas7/Csh2, subtype I-B/HMARI	cas5|248aa|up_5|NC_014915.1_343868_344612_+	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas3|777aa|up_4|NC_014915.1_344613_346944_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas4|170aa|up_3|NC_014915.1_346953_347463_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|334aa|up_2|NC_014915.1_347466_348468_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|88aa|up_1|NC_014915.1_348478_348742_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas6|249aa|up_0|NC_014915.1_348778_349525_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	csm6|399aa|down_0|NC_014915.1_352687_353884_+	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	csx1|447aa|down_1|NC_014915.1_353886_355227_+	pfam09455, Cas_DxTHG, CRISPR-associated (Cas) DxTHG family	NA|303aa|down_2|NC_014915.1_355226_356135_+	TIGR01894, hypothetical_protein, CRISPR type III-B/RAMP module RAMP protein Cmr1	cas10|547aa|down_3|NC_014915.1_356131_357772_+	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr3gr5|377aa|down_4|NC_014915.1_357771_358902_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|298aa|down_5|NC_014915.1_358901_359795_+	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr5gr11|136aa|down_6|NC_014915.1_359808_360216_+	pfam09701, Cas_Cmr5, CRISPR-associated protein (Cas_Cmr5)	cmr6gr7|383aa|down_7|NC_014915.1_360229_361378_+	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	NA|556aa|down_8|NC_014915.1_361609_363277_-	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|348aa|down_9|NC_014915.1_366369_367413_+	cd02253, DmpA, L-Aminopeptidase D-amidase/D-esterase (DmpA) family; DmpA catalyzes the release of N-terminal D and L amino acids from peptide susbtrates
GCF_000174795.2_ASM17479v2	NC_014915	Geobacillus sp. Y412MC52, complete sequence	3	363492-366113	3,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas5,cas3,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT	cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	Type III-D,Type III-A,Type III-C,Type III-B	GTTTTTATCGTACCTATGAGGGATTGAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC	30,30,30	1	1	364790-364824	NC_014915.1_3001657-3001691	NA:NA:NA	39,39,39	39	TypeIII-D,TypeIII-A,TypeIII-C,TypeIII-B	cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	NA,NA|174aa|down_4|NC_014915.1_371635_372157_-,NA|143aa|down_5|NC_014915.1_372290_372719_+,NA|234aa|down_6|NC_014915.1_372809_373511_-,NA|137aa|down_7|NC_014915.1_373866_374277_+	cas6|249aa|up_9|NC_014915.1_348778_349525_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	csm6|399aa|up_8|NC_014915.1_352687_353884_+	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	csx1|447aa|up_7|NC_014915.1_353886_355227_+	pfam09455, Cas_DxTHG, CRISPR-associated (Cas) DxTHG family	NA|303aa|up_6|NC_014915.1_355226_356135_+	TIGR01894, hypothetical_protein, CRISPR type III-B/RAMP module RAMP protein Cmr1	cas10|547aa|up_5|NC_014915.1_356131_357772_+	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr3gr5|377aa|up_4|NC_014915.1_357771_358902_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|298aa|up_3|NC_014915.1_358901_359795_+	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr5gr11|136aa|up_2|NC_014915.1_359808_360216_+	pfam09701, Cas_Cmr5, CRISPR-associated protein (Cas_Cmr5)	cmr6gr7|383aa|up_1|NC_014915.1_360229_361378_+	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	NA|556aa|up_0|NC_014915.1_361609_363277_-	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|348aa|down_0|NC_014915.1_366369_367413_+	cd02253, DmpA, L-Aminopeptidase D-amidase/D-esterase (DmpA) family; DmpA catalyzes the release of N-terminal D and L amino acids from peptide susbtrates	RT|421aa|down_1|NC_014915.1_367495_368758_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|411aa|down_2|NC_014915.1_369389_370622_-	cd17391, MFS_MdtG_MDR_like, Multidrug resistance protein MdtG and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|218aa|down_3|NC_014915.1_371022_371676_-	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|174aa|down_4|NC_014915.1_371635_372157_-	NA	NA|143aa|down_5|NC_014915.1_372290_372719_+	NA	NA|234aa|down_6|NC_014915.1_372809_373511_-	NA	NA|137aa|down_7|NC_014915.1_373866_374277_+	NA	NA|770aa|down_8|NC_014915.1_374666_376976_-	COG1511, COG1511, Predicted membrane protein [Function unknown]	NA|458aa|down_9|NC_014915.1_377973_379347_-	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK
GCF_000174795.2_ASM17479v2	NC_014915	Geobacillus sp. Y412MC52, complete sequence	4	1337850-1337948	4	CRISPRCasFinder	no		cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	Orphan	AACCAAACAAAAAGGAGACATGAACCAGAC	30	0	0	NA	NA	NA	1	1	Orphan	cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	NA,NA|173aa|down_5|NC_014915.1_1348969_1349488_+,NA|70aa|down_9|NC_014915.1_1354018_1354228_+	NA|408aa|up_9|NC_014915.1_1324019_1325243_+	pfam04143, Sulf_transp, Sulphur transport	NA|314aa|up_8|NC_014915.1_1325424_1326366_+	cd01941, YeiC_kinase_like, YeiC-like sugar kinase	NA|302aa|up_7|NC_014915.1_1326365_1327271_+	pfam04227, Indigoidine_A, Indigoidine synthase A like protein	NA|118aa|up_6|NC_014915.1_1327324_1327678_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|389aa|up_5|NC_014915.1_1327955_1329122_+	PRK05590, PRK05590, hypothetical protein; Provisional	NA|387aa|up_4|NC_014915.1_1329552_1330713_+	COG2856, COG2856, Predicted Zn peptidase [Amino acid transport and metabolism]	NA|179aa|up_3|NC_014915.1_1330681_1331218_+	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|256aa|up_2|NC_014915.1_1331458_1332226_-	cd07363, 45_DOPA_Dioxygenase, The Class III extradiol dioxygenase, 4,5-DOPA Dioxygenase, catalyzes the incorporation of both atoms of molecular oxygen into 4,5-dihydroxy-phenylalanine	NA|560aa|up_1|NC_014915.1_1333541_1335221_+	pfam12102, DUF3578, Domain of unknown function (DUF3578)	NA|834aa|up_0|NC_014915.1_1335189_1337691_+	pfam09823, DUF2357, Domain of unknown function (DUF2357)	NA|98aa|down_0|NC_014915.1_1338947_1339241_+	pfam01381, HTH_3, Helix-turn-helix	NA|1082aa|down_1|NC_014915.1_1339684_1342930_+	cd10311, PLDc_N_DEXD_c, N-terminal putative catalytic domain of uncharacterized prokaryotic and archeal HKD family nucleases fused to a DEAD/DEAH box helicase domain	NA|275aa|down_2|NC_014915.1_1342935_1343760_+	pfam14335, DUF4391, Domain of unknown function (DUF4391)	NA|649aa|down_3|NC_014915.1_1343710_1345657_+	COG2189, COG2189, Adenine specific DNA methylase Mod [DNA replication, recombination, and repair]	NA|985aa|down_4|NC_014915.1_1345660_1348615_+	COG3587, COG3587, Restriction endonuclease [Defense mechanisms]	NA|173aa|down_5|NC_014915.1_1348969_1349488_+	NA	NA|427aa|down_6|NC_014915.1_1349758_1351039_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|330aa|down_7|NC_014915.1_1351434_1352424_+	COG4127, COG4127, Uncharacterized conserved protein [Function unknown]	NA|435aa|down_8|NC_014915.1_1352494_1353799_-	pfam03069, FmdA_AmdA, Acetamidase/Formamidase family	NA|70aa|down_9|NC_014915.1_1354018_1354228_+	NA
GCF_000174795.2_ASM17479v2	NC_014915	Geobacillus sp. Y412MC52, complete sequence	5	1981010-1981107	5	CRISPRCasFinder	no		cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	Orphan	GTTTCAGTTCCTCATGGGCACGATAAAAAACC	32	1	1	1981042-1981075	NC_014915.1_1608939-1608906	NA	1	1	Orphan	cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	NA,NA	NA|255aa|up_9|NC_014915.1_1968918_1969683_-	TIGR03411, urea_trans_UrtD, urea ABC transporter, ATP-binding protein UrtD	NA|353aa|up_8|NC_014915.1_1969657_1970716_-	TIGR03408, urea_trans_UrtC, urea ABC transporter, permease protein UrtC	NA|300aa|up_7|NC_014915.1_1970725_1971625_-	TIGR03409, urea_trans_UrtB, urea ABC transporter, permease protein UrtB	NA|419aa|up_6|NC_014915.1_1971681_1972938_-	TIGR03407, urea_ABC_UrtA, urea ABC transporter, urea binding protein	NA|466aa|up_5|NC_014915.1_1973256_1974654_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|450aa|up_4|NC_014915.1_1974772_1976122_-	pfam02447, GntP_permease, GntP family permease	NA|524aa|up_3|NC_014915.1_1976236_1977808_-	TIGR01314, gntK_FGGY, gluconate kinase, FGGY type	NA|339aa|up_2|NC_014915.1_1977825_1978842_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|174aa|up_1|NC_014915.1_1979083_1979605_+	cd14247, Lmo2686_like, Uncharacterized hexameric protein conserved in Bacilli	NA|309aa|up_0|NC_014915.1_1979910_1980837_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|312aa|down_0|NC_014915.1_1981286_1982222_-	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|231aa|down_1|NC_014915.1_1982281_1982974_-	COG4565, CitB, Response regulator of citrate/malate metabolism [Transcription / Signal transduction mechanisms]	NA|537aa|down_2|NC_014915.1_1982987_1984598_-	COG3290, CitA, Signal transduction histidine kinase regulating citrate/malate metabolism [Signal transduction mechanisms]	NA|509aa|down_3|NC_014915.1_1984670_1986197_-	COG3333, COG3333, Uncharacterized protein conserved in bacteria [Function unknown]	NA|152aa|down_4|NC_014915.1_1986210_1986666_-	pfam07331, TctB, Tripartite tricarboxylate transporter TctB family	NA|347aa|down_5|NC_014915.1_1986721_1987762_-	cd07012, PBP2_Bug_TTT, Bug (Bordetella uptake gene) protein family of periplasmic solute-binding receptors; contains the type 2 periplasmic binding fold	NA|302aa|down_6|NC_014915.1_1988124_1989030_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|387aa|down_7|NC_014915.1_1991776_1992937_-	PRK02318, PRK02318, mannitol-1-phosphate 5-dehydrogenase; Provisional	NA|148aa|down_8|NC_014915.1_1992936_1993380_-	COG4668, MtlA, Mannitol/fructose-specific phosphotransferase system, IIA domain [Carbohydrate transport and metabolism]	NA|697aa|down_9|NC_014915.1_1993385_1995476_-	COG3711, BglG, Transcriptional antiterminator [Transcription]
GCF_000174795.2_ASM17479v2	NC_014915	Geobacillus sp. Y412MC52, complete sequence	6	1989131-1991594	6,4,4	CRISPRCasFinder,CRT,PILER-CR	no		cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	Orphan	GTTTCAATCCCTCATAGGTACGATAAAAAC,GTTTCAATCCCTCATAGGTACGATAAAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	37,37,35	37	Orphan	cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	NA,NA	NA|339aa|up_9|NC_014915.1_1977825_1978842_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|174aa|up_8|NC_014915.1_1979083_1979605_+	cd14247, Lmo2686_like, Uncharacterized hexameric protein conserved in Bacilli	NA|309aa|up_7|NC_014915.1_1979910_1980837_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|312aa|up_6|NC_014915.1_1981286_1982222_-	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|231aa|up_5|NC_014915.1_1982281_1982974_-	COG4565, CitB, Response regulator of citrate/malate metabolism [Transcription / Signal transduction mechanisms]	NA|537aa|up_4|NC_014915.1_1982987_1984598_-	COG3290, CitA, Signal transduction histidine kinase regulating citrate/malate metabolism [Signal transduction mechanisms]	NA|509aa|up_3|NC_014915.1_1984670_1986197_-	COG3333, COG3333, Uncharacterized protein conserved in bacteria [Function unknown]	NA|152aa|up_2|NC_014915.1_1986210_1986666_-	pfam07331, TctB, Tripartite tricarboxylate transporter TctB family	NA|347aa|up_1|NC_014915.1_1986721_1987762_-	cd07012, PBP2_Bug_TTT, Bug (Bordetella uptake gene) protein family of periplasmic solute-binding receptors; contains the type 2 periplasmic binding fold	NA|302aa|up_0|NC_014915.1_1988124_1989030_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|387aa|down_0|NC_014915.1_1991776_1992937_-	PRK02318, PRK02318, mannitol-1-phosphate 5-dehydrogenase; Provisional	NA|148aa|down_1|NC_014915.1_1992936_1993380_-	COG4668, MtlA, Mannitol/fructose-specific phosphotransferase system, IIA domain [Carbohydrate transport and metabolism]	NA|697aa|down_2|NC_014915.1_1993385_1995476_-	COG3711, BglG, Transcriptional antiterminator [Transcription]	NA|293aa|down_3|NC_014915.1_1997346_1998225_+	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|168aa|down_4|NC_014915.1_1998358_1998862_-	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|97aa|down_5|NC_014915.1_1998854_1999145_-	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|487aa|down_6|NC_014915.1_1999640_2001101_-	cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2	NA|396aa|down_7|NC_014915.1_2001302_2002490_-	cd08194, Fe-ADH-like, Iron-containing alcohol dehydrogenases-like	NA|570aa|down_8|NC_014915.1_2002683_2004393_-	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]	NA|316aa|down_9|NC_014915.1_2004606_2005554_-	cd19074, Aldo_ket_red_shaker-like, Shaker potassium channel beta subunit family and similar proteins
GCF_000174795.2_ASM17479v2	NC_014915	Geobacillus sp. Y412MC52, complete sequence	7	2960188-2961416	7,5,5	CRISPRCasFinder,CRT,PILER-CR	no		cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	Orphan	GTTTCAATCCCTCATAGGTACGATAAAAAC,GTTTCAATCCCTCATAGGTACGATAAAAAC,GTTTTTATCGTACCTATGAGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	18,18,18	18	Orphan	cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	NA|55aa|up_6|NC_014915.1_2951832_2951997_-,NA|57aa|down_6|NC_014915.1_2969234_2969405_-	NA|105aa|up_9|NC_014915.1_2947245_2947560_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|806aa|up_8|NC_014915.1_2947617_2950035_-	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|400aa|up_7|NC_014915.1_2950422_2951622_-	cd17329, MFS_MdtH_MDR_like, Multidrug resistance protein MdtH and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|55aa|up_6|NC_014915.1_2951832_2951997_-	NA	NA|88aa|up_5|NC_014915.1_2952015_2952279_-	pfam10732, DUF2524, Protein of unknown function (DUF2524)	NA|196aa|up_4|NC_014915.1_2952571_2953159_+	pfam06962, rRNA_methylase, Putative rRNA methylase	NA|361aa|up_3|NC_014915.1_2953216_2954299_-	pfam10776, DUF2600, Protein of unknown function (DUF2600)	NA|183aa|up_2|NC_014915.1_2954664_2955213_+	COG0663, PaaY, Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily [General function prediction only]	NA|404aa|up_1|NC_014915.1_2955235_2956447_-	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|529aa|up_0|NC_014915.1_2956909_2958496_+	PRK09344, PRK09344, phosphoenolpyruvate carboxykinase	NA|293aa|down_0|NC_014915.1_2961523_2962402_+	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|301aa|down_1|NC_014915.1_2964157_2965060_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|81aa|down_2|NC_014915.1_2965339_2965582_-	pfam10763, DUF2584, Protein of unknown function (DUF2584)	NA|263aa|down_3|NC_014915.1_2965674_2966463_-	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|199aa|down_4|NC_014915.1_2967399_2967996_-	pfam13814, Replic_Relax, Replication-relaxation	NA|387aa|down_5|NC_014915.1_2967943_2969104_-	COG1674, FtsK, DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning]	NA|57aa|down_6|NC_014915.1_2969234_2969405_-	NA	NA|128aa|down_7|NC_014915.1_2969401_2969785_-	pfam08006, DUF1700, Protein of unknown function (DUF1700)	NA|69aa|down_8|NC_014915.1_2970021_2970228_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|162aa|down_9|NC_014915.1_2970220_2970706_+	pfam14470, bPH_3, Bacterial PH domain
GCF_000174795.2_ASM17479v2	NC_014915	Geobacillus sp. Y412MC52, complete sequence	8	2962467-2963956	6,8,6	PILER-CR,CRISPRCasFinder,CRT	no		cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	Orphan	GTTTTTATCGTACCTATGAGGGATTGAAAC,GTTTCAATCCCTCATAGGTACGATAAAAAC,GTTTCAATCCCTCATAGGTACGATAAAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	22,22,22	22	Orphan	cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	NA|55aa|up_7|NC_014915.1_2951832_2951997_-,NA|57aa|down_5|NC_014915.1_2969234_2969405_-	NA|806aa|up_9|NC_014915.1_2947617_2950035_-	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|400aa|up_8|NC_014915.1_2950422_2951622_-	cd17329, MFS_MdtH_MDR_like, Multidrug resistance protein MdtH and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|55aa|up_7|NC_014915.1_2951832_2951997_-	NA	NA|88aa|up_6|NC_014915.1_2952015_2952279_-	pfam10732, DUF2524, Protein of unknown function (DUF2524)	NA|196aa|up_5|NC_014915.1_2952571_2953159_+	pfam06962, rRNA_methylase, Putative rRNA methylase	NA|361aa|up_4|NC_014915.1_2953216_2954299_-	pfam10776, DUF2600, Protein of unknown function (DUF2600)	NA|183aa|up_3|NC_014915.1_2954664_2955213_+	COG0663, PaaY, Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily [General function prediction only]	NA|404aa|up_2|NC_014915.1_2955235_2956447_-	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|529aa|up_1|NC_014915.1_2956909_2958496_+	PRK09344, PRK09344, phosphoenolpyruvate carboxykinase	NA|293aa|up_0|NC_014915.1_2961523_2962402_+	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|301aa|down_0|NC_014915.1_2964157_2965060_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|81aa|down_1|NC_014915.1_2965339_2965582_-	pfam10763, DUF2584, Protein of unknown function (DUF2584)	NA|263aa|down_2|NC_014915.1_2965674_2966463_-	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|199aa|down_3|NC_014915.1_2967399_2967996_-	pfam13814, Replic_Relax, Replication-relaxation	NA|387aa|down_4|NC_014915.1_2967943_2969104_-	COG1674, FtsK, DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning]	NA|57aa|down_5|NC_014915.1_2969234_2969405_-	NA	NA|128aa|down_6|NC_014915.1_2969401_2969785_-	pfam08006, DUF1700, Protein of unknown function (DUF1700)	NA|69aa|down_7|NC_014915.1_2970021_2970228_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|162aa|down_8|NC_014915.1_2970220_2970706_+	pfam14470, bPH_3, Bacterial PH domain	NA|227aa|down_9|NC_014915.1_2970771_2971452_-	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3
GCF_000174795.2_ASM17479v2	NC_014915	Geobacillus sp. Y412MC52, complete sequence	9	3047909-3047991	9	CRISPRCasFinder	no	c2c10_CAS-V-U3	cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	Type V-U3	ATGGCCACCAAAGACGAACTCGC	23	0	0	NA	NA	NA	1	1	TypeV-U3	cas3,cas8b1,cas7,cas5,cas4,cas1,cas2,cas6,csm6,csx1,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,RT,csa3,c2c10_CAS-V-U3,cas14j,DEDDh,DinG	NA|96aa|up_4|NC_014915.1_3041549_3041837_-,NA|57aa|down_0|NC_014915.1_3048376_3048547_+,NA|84aa|down_3|NC_014915.1_3050871_3051123_-	NA|345aa|up_9|NC_014915.1_3035337_3036372_-	TIGR02092, Glycogen_biosynthesis_protein_GlgD, glucose-1-phosphate adenylyltransferase, GlgD subunit	NA|390aa|up_8|NC_014915.1_3036368_3037538_-	PRK05293, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|667aa|up_7|NC_014915.1_3037419_3039420_-	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|317aa|up_6|NC_014915.1_3039638_3040589_+	COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]	NA|209aa|up_5|NC_014915.1_3040719_3041346_+	COG1280, RhtB, Putative threonine efflux protein [Amino acid transport and metabolism]	NA|96aa|up_4|NC_014915.1_3041549_3041837_-	NA	NA|507aa|up_3|NC_014915.1_3042600_3044121_-	pfam17936, Big_6, Bacterial Ig domain	NA|327aa|up_2|NC_014915.1_3044137_3045118_-	pfam01032, FecCD, FecCD transport family	NA|333aa|up_1|NC_014915.1_3045110_3046109_-	pfam01032, FecCD, FecCD transport family	NA|314aa|up_0|NC_014915.1_3046139_3047081_-	cd01146, FhuD, Fe3+-siderophore binding domain FhuD	NA|57aa|down_0|NC_014915.1_3048376_3048547_+	NA	NA|181aa|down_1|NC_014915.1_3048660_3049203_-	pfam11518, DUF3221, Protein of unknown function (DUF3221)	NA|154aa|down_2|NC_014915.1_3050197_3050659_-	cd09281, UPF0066, Escherichia coli YaeB and related proteins	NA|84aa|down_3|NC_014915.1_3050871_3051123_-	NA	NA|403aa|down_4|NC_014915.1_3051197_3052406_-	pfam01548, DEDD_Tnp_IS110, Transposase	NA|148aa|down_5|NC_014915.1_3052644_3053088_-	TIGR02225, Tyrosine_recombinase_XerD, tyrosine recombinase XerD	NA|273aa|down_6|NC_014915.1_3061213_3062032_+	PRK03187, tgl, transglutaminase; Provisional	NA|303aa|down_7|NC_014915.1_3062002_3062911_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|226aa|down_8|NC_014915.1_3063028_3063706_+	COG1285, SapB, Uncharacterized membrane protein [Function unknown]	NA|431aa|down_9|NC_014915.1_3064082_3065375_-	COG3935, DnaD, Putative primosome component and related proteins [DNA replication, recombination, and repair]
