assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000445995.2_ASM44599v2	NC_022080	Geobacillus genomosp. 3, complete sequence	1	301645-302140	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas9	cas3,cas2,cas1,cas9,cas14j,csa3,DEDDh,c2c10_CAS-V-U3,DinG,RT	 or Type II-C?,Type II-C,Type II-A,Type II-B, Type II-B	GTCATAGTTCCCCTGAGATTATCGCTGTGGTATAAT,GTCATAGTTCCCCTGAGATTATCGCTGTGGTATAAT,GTCATAGTTCCCCTGAGATTATCGCTGTGGTATAAT	36,36,36	0	0	NA	NA	NA:NA:NA	7,7,7	7	orTypeII-C?,TypeII-C,TypeII-A,TypeII-B,TypeII-B	cas3,cas2,cas1,cas9,cas14j,csa3,DEDDh,c2c10_CAS-V-U3,DinG,RT	NA,NA|97aa|down_5|NC_022080.4_310344_310635_-,NA|175aa|down_7|NC_022080.4_311301_311826_-	NA|477aa|up_9|NC_022080.4_286280_287711_+	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|435aa|up_8|NC_022080.4_287897_289202_+	PRK08248, PRK08248, homocysteine synthase	NA|210aa|up_7|NC_022080.4_289414_290044_-	COG1174, OpuBB, ABC-type proline/glycine betaine transport systems, permease component [Amino acid transport and metabolism]	NA|376aa|up_6|NC_022080.4_290281_291409_-	cd03295, ABC_OpuCA_Osmoprotection, ATP-binding cassette domain of the osmoprotectant transporter	NA|301aa|up_5|NC_022080.4_291425_292328_-	cd13528, PBP2_osmoprotectants, Substrate-binding domain of osmoregulatory ABC-type transporters; the type 2 periplasmic-binding protein fold	NA|146aa|up_4|NC_022080.4_292781_293219_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|503aa|up_3|NC_022080.4_293360_294869_+	TIGR02677, conserved_hypothetical_protein, TIGR02677 family protein	NA|399aa|up_2|NC_022080.4_294852_296049_+	TIGR02678, hypothetical_protein, TIGR02678 family protein	NA|1374aa|up_1|NC_022080.4_296008_300130_+	TIGR02680, conserved_hypothetical_protein, TIGR02680 family protein	NA|407aa|up_0|NC_022080.4_300126_301347_+	TIGR02679, conserved_hypothetical_protein, TIGR02679 family protein	cas2|103aa|down_0|NC_022080.4_302240_302549_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|300aa|down_1|NC_022080.4_302553_303453_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1088aa|down_2|NC_022080.4_303382_306646_-	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	NA|351aa|down_3|NC_022080.4_307743_308796_+	cd02253, DmpA, L-Aminopeptidase D-amidase/D-esterase (DmpA) family; DmpA catalyzes the release of N-terminal D and L amino acids from peptide susbtrates	NA|414aa|down_4|NC_022080.4_309062_310304_-	cd17391, MFS_MdtG_MDR_like, Multidrug resistance protein MdtG and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|97aa|down_5|NC_022080.4_310344_310635_-	NA	NA|218aa|down_6|NC_022080.4_310688_311342_-	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|175aa|down_7|NC_022080.4_311301_311826_-	NA	NA|458aa|down_8|NC_022080.4_312498_313872_-	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|224aa|down_9|NC_022080.4_313852_314524_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]
GCF_000445995.2_ASM44599v2	NC_022080	Geobacillus genomosp. 3, complete sequence	2	1885394-1885564	2	PILER-CR	no		cas3,cas2,cas1,cas9,cas14j,csa3,DEDDh,c2c10_CAS-V-U3,DinG,RT	Orphan	GAAGGTTTCAATCCCTCATAGGTACGATAAAAACC	35	0	0	NA	NA	NA	2	2	Orphan	cas3,cas2,cas1,cas9,cas14j,csa3,DEDDh,c2c10_CAS-V-U3,DinG,RT	NA,NA|101aa|down_3|NC_022080.4_1889419_1889722_-	NA|450aa|up_9|NC_022080.4_1872790_1874140_-	pfam02447, GntP_permease, GntP family permease	NA|520aa|up_8|NC_022080.4_1874259_1875819_-	TIGR01314, gntK_FGGY, gluconate kinase, FGGY type	NA|339aa|up_7|NC_022080.4_1875836_1876853_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|312aa|up_6|NC_022080.4_1877510_1878446_-	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|231aa|up_5|NC_022080.4_1878538_1879231_-	COG4565, CitB, Response regulator of citrate/malate metabolism [Transcription / Signal transduction mechanisms]	NA|537aa|up_4|NC_022080.4_1879244_1880855_-	COG3290, CitA, Signal transduction histidine kinase regulating citrate/malate metabolism [Signal transduction mechanisms]	NA|509aa|up_3|NC_022080.4_1880925_1882452_-	COG3333, COG3333, Uncharacterized protein conserved in bacteria [Function unknown]	NA|152aa|up_2|NC_022080.4_1882465_1882921_-	pfam07331, TctB, Tripartite tricarboxylate transporter TctB family	NA|347aa|up_1|NC_022080.4_1882976_1884017_-	cd07012, PBP2_Bug_TTT, Bug (Bordetella uptake gene) protein family of periplasmic solute-binding receptors; contains the type 2 periplasmic binding fold	NA|299aa|up_0|NC_022080.4_1884396_1885293_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|388aa|down_0|NC_022080.4_1885751_1886915_-	PRK02318, PRK02318, mannitol-1-phosphate 5-dehydrogenase; Provisional	NA|148aa|down_1|NC_022080.4_1886914_1887358_-	COG4668, MtlA, Mannitol/fructose-specific phosphotransferase system, IIA domain [Carbohydrate transport and metabolism]	NA|697aa|down_2|NC_022080.4_1887363_1889454_-	COG3711, BglG, Transcriptional antiterminator [Transcription]	NA|101aa|down_3|NC_022080.4_1889419_1889722_-	NA	NA|484aa|down_4|NC_022080.4_1889724_1891176_-	COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC component [Carbohydrate transport and metabolism]	NA|488aa|down_5|NC_022080.4_1891764_1893228_-	cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2	NA|397aa|down_6|NC_022080.4_1893329_1894520_-	cd08194, Fe-ADH-like, Iron-containing alcohol dehydrogenases-like	NA|569aa|down_7|NC_022080.4_1894712_1896419_-	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]	NA|334aa|down_8|NC_022080.4_1896900_1897902_-	cd05299, CtBP_dh, C-terminal binding protein (CtBP), D-isomer-specific 2-hydroxyacid dehydrogenases related repressor	NA|383aa|down_9|NC_022080.4_1897963_1899112_-	PRK14017, PRK14017, galactonate dehydratase; Provisional
GCF_000445995.2_ASM44599v2	NC_022080	Geobacillus genomosp. 3, complete sequence	3	1985340-1985438	2	CRISPRCasFinder	no		cas3,cas2,cas1,cas9,cas14j,csa3,DEDDh,c2c10_CAS-V-U3,DinG,RT	Orphan	TAAACGAGGCCCATTTCTATCAAG	24	0	0	NA	NA	NA	1	1	Orphan	cas3,cas2,cas1,cas9,cas14j,csa3,DEDDh,c2c10_CAS-V-U3,DinG,RT	NA,NA	NA|444aa|up_9|NC_022080.4_1972562_1973894_-	cd05913, PaaK, Phenylacetate-CoA ligase (also known as PaaK)	NA|560aa|up_8|NC_022080.4_1974670_1976350_-	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|375aa|up_7|NC_022080.4_1976846_1977971_+	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|414aa|up_6|NC_022080.4_1977967_1979209_+	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|98aa|up_5|NC_022080.4_1979309_1979603_+	pfam01212, Beta_elim_lyase, Beta-eliminating lyase	NA|135aa|up_4|NC_022080.4_1979649_1980054_-	pfam04138, GtrA, GtrA-like protein	NA|189aa|up_3|NC_022080.4_1980926_1981493_-	PRK00131, aroK, shikimate kinase; Reviewed	NA|256aa|up_2|NC_022080.4_1981563_1982331_-	PRK02412, aroD, type I 3-dehydroquinate dehydratase	NA|436aa|up_1|NC_022080.4_1982662_1983970_+	cd17319, MFS_ExuT_GudP_like, Hexuronate transporter, Glucarate transporter, and similar transporters of the Major Facilitator Superfamily	NA|328aa|up_0|NC_022080.4_1984161_1985145_-	cd08261, Zn_ADH7, Alcohol dehydrogenases of the MDR family	NA|283aa|down_0|NC_022080.4_1985664_1986513_-	cd07516, HAD_Pase, phosphatase, similar to Escherichia coli Cof and Thermotoga maritima TM0651; belongs to the haloacid dehalogenase-like superfamily	NA|138aa|down_1|NC_022080.4_1986662_1987076_+	COG2510, COG2510, Predicted membrane protein [Function unknown]	NA|373aa|down_2|NC_022080.4_1987333_1988452_+	cd03506, Delta6-FADS-like, The Delta6 Fatty Acid Desaturase (Delta6-FADS)-like CD includes the integral-membrane enzymes: delta-4, delta-5, delta-6, delta-8, delta-8-sphingolipid, and delta-11 desaturases found in vertebrates, higher plants, fungi, and bacteria	NA|129aa|down_3|NC_022080.4_1988581_1988968_-	cd14797, DUF302, Uncharacterized domain family DUF302	NA|261aa|down_4|NC_022080.4_1989110_1989893_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|379aa|down_5|NC_022080.4_1989965_1991102_-	cd07724, POD-like_MBL-fold, ETHE1 (PDO type I), persulfide dioxygenase A (PDOA, PDO type II) and related proteins; MBL-fold metallo-hydrolase domain	NA|190aa|down_6|NC_022080.4_1991152_1991722_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|108aa|down_7|NC_022080.4_1991718_1992042_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|122aa|down_8|NC_022080.4_1992054_1992420_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|161aa|down_9|NC_022080.4_1992434_1992917_-	pfam13686, DrsE_2, DsrE/DsrF/DrsH-like family
GCF_000445995.2_ASM44599v2	NC_022080	Geobacillus genomosp. 3, complete sequence	4	3406240-3406389	3	CRISPRCasFinder	no		cas3,cas2,cas1,cas9,cas14j,csa3,DEDDh,c2c10_CAS-V-U3,DinG,RT	Orphan	TTATTTACGGAGTTAGGTTCGTTA	24	0	0	NA	NA	NA	3	3	Orphan	cas3,cas2,cas1,cas9,cas14j,csa3,DEDDh,c2c10_CAS-V-U3,DinG,RT	NA,NA	NA|287aa|up_9|NC_022080.4_3395507_3396368_+	cd07574, nitrilase_Rim1_like, Uncharacterized subgroup of the nitrilase superfamily; some members of this subgroup have an N-terminal RimI domain (class 12 nitrilases)	NA|486aa|up_8|NC_022080.4_3396702_3398160_-	TIGR02121, Osmoregulated_proline_transporter, sodium/proline symporter	NA|274aa|up_7|NC_022080.4_3398565_3399387_+	cd13624, PBP2_Arg_Lys_His, Substrate binding domain of the arginine-, lysine-, histidine-binding protein ArtJ; the type 2 periplasmic binding protein fold	NA|219aa|up_6|NC_022080.4_3399455_3400112_+	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|241aa|up_5|NC_022080.4_3400108_3400831_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|100aa|up_4|NC_022080.4_3400933_3401233_-	TIGR02901, cytochrome_aa3_quinol_oxidase_subunit_IV, cytochrome aa3 quinol oxidase, subunit IV	NA|205aa|up_3|NC_022080.4_3401234_3401849_-	cd02863, Ubiquinol_oxidase_III, Ubiquinol oxidase subunit III subfamily	NA|649aa|up_2|NC_022080.4_3401852_3403799_-	TIGR02882, cytochrome_aa3_quinol_oxidase_subunit_I, cytochrome aa3 quinol oxidase, subunit I	NA|303aa|up_1|NC_022080.4_3403816_3404725_-	TIGR01432, AA3-600_quinol_oxidase_subunit_II, cytochrome aa3 quinol oxidase, subunit II	NA|242aa|up_0|NC_022080.4_3404994_3405720_-	cd18092, SpoU-like_TrmH, SAM-dependent tRNA methylase related to TrmH	NA|509aa|down_0|NC_022080.4_3407461_3408988_+	COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism]	NA|321aa|down_1|NC_022080.4_3409163_3410126_+	COG1482, ManA, Phosphomannose isomerase [Carbohydrate transport and metabolism]	NA|385aa|down_2|NC_022080.4_3410170_3411325_-	pfam02595, Gly_kinase, Glycerate kinase family	NA|413aa|down_3|NC_022080.4_3411934_3413173_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|310aa|down_4|NC_022080.4_3413352_3414282_+	cd05256, UDP_AE_SDR_e, UDP-N-acetylglucosamine 4-epimerase, extended (e) SDRs	NA|255aa|down_5|NC_022080.4_3414501_3415266_+	cd04194, GT8_A4GalT_like, A4GalT_like proteins catalyze the addition of galactose or glucose residues to the lipooligosaccharide (LOS) or lipopolysaccharide (LPS) of the bacterial cell surface	NA|160aa|down_6|NC_022080.4_3415422_3415902_-	pfam02590, SPOUT_MTase, Predicted SPOUT methyltransferase	NA|50aa|down_7|NC_022080.4_3415985_3416135_-	pfam14116, YyzF, YyzF-like protein	NA|406aa|down_8|NC_022080.4_3416198_3417416_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|265aa|down_9|NC_022080.4_3417517_3418312_-	cd07733, YycJ-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis YycJ and related proteins; MBL-fold metallo hydrolase domain
GCF_000445995.2_ASM44599v2	NC_022092	Geobacillus genomosp. 3 plasmid pBt40, complete sequence	1	32525-32663	1	CRISPRCasFinder	no			Orphan	TACATTTTTGGTGTTAGGCGTACATTTTTGGTG	33	0	0	NA	NA	NA	1	1	Orphan	cas3,cas2,cas1,cas9,cas14j,csa3,DEDDh,c2c10_CAS-V-U3,DinG,RT	NA|440aa|up_9|NC_022092.1_25180_26500_-,NA|136aa|up_2|NC_022092.1_30311_30719_+,NA|96aa|up_0|NC_022092.1_31604_31892_+,NA	NA|440aa|up_9|NC_022092.1_25180_26500_-	NA	NA|190aa|up_8|NC_022092.1_27438_28008_-	cd01192, INT_C_like_3, Uncharacterized site-specific tyrosine recombinase, C-terminal catalytic domain	NA|157aa|up_7|NC_022092.1_28189_28660_-	cd14845, L-Ala-D-Glu_peptidase_like, L-Ala-D-Glu peptidase, also known as L-alanyl-D-glutamate endopeptidase	NA|49aa|up_6|NC_022092.1_28679_28826_-	pfam12841, YvrJ, YvrJ protein family	NA|71aa|up_5|NC_022092.1_28890_29103_-	pfam11148, DUF2922, Protein of unknown function (DUF2922)	NA|74aa|up_4|NC_022092.1_29121_29343_-	pfam07872, DUF1659, Protein of unknown function (DUF1659)	NA|177aa|up_3|NC_022092.1_29484_30015_-	pfam06338, ComK, ComK protein	NA|136aa|up_2|NC_022092.1_30311_30719_+	NA	NA|266aa|up_1|NC_022092.1_30823_31621_+	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|96aa|up_0|NC_022092.1_31604_31892_+	NA	NA|383aa|down_0|NC_022092.1_32834_33983_+	pfam01051, Rep_3, Initiator Replication protein	NA|357aa|down_1|NC_022092.1_34067_35138_+	pfam14020, DUF4236, Protein of unknown function (DUF4236)	NA|356aa|down_2|NC_022092.1_35215_36283_+	smart00318, SNc, Staphylococcal nuclease homologues	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
