assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000015065.1_ASM1506v1	NC_008600	Bacillus thuringiensis str. Al Hakam, complete sequence	1	1216021-1216129	1	CRISPRCasFinder	no		cas3,csa3,WYL,cas14k,DinG,cas14j,DEDDh,RT,c2c9_V-U4	Orphan	TGTATGATTACCTTCCGCATGAGAA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL,cas14k,DinG,cas14j,DEDDh,RT,c2c9_V-U4	NA|58aa|up_7|NC_008600.1_1209958_1210132_-,NA|124aa|up_3|NC_008600.1_1212558_1212930_+,NA	NA|415aa|up_9|NC_008600.1_1206717_1207962_+	COG4469, CoiA, Competence protein CoiA-like family, contains a predicted nuclease    domain [General function prediction only]	NA|609aa|up_8|NC_008600.1_1208012_1209839_+	cd09608, M3B_PepF, Peptidase family M3B, oligopeptidase F (PepF)	NA|58aa|up_7|NC_008600.1_1209958_1210132_-	NA	NA|298aa|up_6|NC_008600.1_1210361_1211255_-	pfam13743, Thioredoxin_5, Thioredoxin	NA|133aa|up_5|NC_008600.1_1211254_1211653_-	cd14772, TrHb2_Bs-trHb-like_O, Truncated hemoglobins, group 2 (O); Bacillus subtilis TrHb like	NA|193aa|up_4|NC_008600.1_1211833_1212412_-	cd07762, CYTH-like_Pase_1, Uncharacterized subgroup 1 of the CYTH-like superfamily	NA|124aa|up_3|NC_008600.1_1212558_1212930_+	NA	NA|213aa|up_2|NC_008600.1_1212960_1213599_+	COG2357, COG2357, PpGpp synthetase catalytic domain [General function prediction only]	NA|266aa|up_1|NC_008600.1_1213617_1214415_+	PRK04885, ppnK, inorganic polyphosphate/ATP-NAD kinase; Provisional	NA|298aa|up_0|NC_008600.1_1214430_1215324_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|247aa|down_0|NC_008600.1_1216902_1217643_-	PRK13625, PRK13625, bis(5'-nucleosyl)-tetraphosphatase PrpE; Provisional	NA|387aa|down_1|NC_008600.1_1217717_1218878_-	TIGR02210, Rod_shape-determining_protein_RodA, rod shape-determining protein RodA	NA|309aa|down_2|NC_008600.1_1218996_1219923_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|251aa|down_3|NC_008600.1_1219936_1220689_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|282aa|down_4|NC_008600.1_1220903_1221749_-	pfam05711, TylF, Macrocin-O-methyltransferase (TylF)	NA|302aa|down_5|NC_008600.1_1221871_1222777_-	pfam18573, BclA_C, BclA C-terminal domain	NA|366aa|down_6|NC_008600.1_1222942_1224040_+	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|230aa|down_7|NC_008600.1_1224251_1224941_+	pfam08242, Methyltransf_12, Methyltransferase domain	NA|229aa|down_8|NC_008600.1_1224937_1225624_+	pfam13712, Glyco_tranf_2_5, Glycosyltransferase like family	NA|227aa|down_9|NC_008600.1_1225638_1226319_+	pfam13712, Glyco_tranf_2_5, Glycosyltransferase like family
GCF_000015065.1_ASM1506v1	NC_008600	Bacillus thuringiensis str. Al Hakam, complete sequence	2	4478629-4478736	2	CRISPRCasFinder	no		cas3,csa3,WYL,cas14k,DinG,cas14j,DEDDh,RT,c2c9_V-U4	Orphan	TATATCAGCGATTTTTTGAATATATC	26	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL,cas14k,DinG,cas14j,DEDDh,RT,c2c9_V-U4	NA|99aa|up_3|NC_008600.1_4476317_4476614_+,NA|145aa|down_4|NC_008600.1_4482253_4482688_-,NA|61aa|down_5|NC_008600.1_4482703_4482886_-	NA|230aa|up_9|NC_008600.1_4471334_4472024_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|485aa|up_8|NC_008600.1_4472025_4473480_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|227aa|up_7|NC_008600.1_4473629_4474310_+	sd00045, ANK, ankyrin repeats	NA|309aa|up_6|NC_008600.1_4474389_4475316_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|133aa|up_5|NC_008600.1_4475444_4475843_+	PRK13955, mscL, large conductance mechanosensitive channel protein MscL	NA|91aa|up_4|NC_008600.1_4475894_4476167_-	pfam13055, DUF3917, Protein of unknown function (DUF3917)	NA|99aa|up_3|NC_008600.1_4476317_4476614_+	NA	NA|79aa|up_2|NC_008600.1_4476695_4476932_+	pfam13133, DUF3949, Protein of unknown function (DUF3949)	NA|121aa|up_1|NC_008600.1_4476933_4477296_+	pfam14119, DUF4288, Domain of unknown function (DUF4288)	NA|333aa|up_0|NC_008600.1_4477331_4478330_-	TIGR01481, catabolite_control_protein_A, catabolite control protein A	NA|339aa|down_0|NC_008600.1_4478940_4479957_-	COG4851, CamS, Protein involved in sex pheromone biosynthesis [General function prediction only]	NA|109aa|down_1|NC_008600.1_4480074_4480401_-	pfam11009, DUF2847, Protein of unknown function (DUF2847)	NA|171aa|down_2|NC_008600.1_4480400_4480913_-	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|372aa|down_3|NC_008600.1_4481100_4482216_+	COG2309, AmpS, Leucyl aminopeptidase (aminopeptidase T) [Amino acid transport and metabolism]	NA|145aa|down_4|NC_008600.1_4482253_4482688_-	NA	NA|61aa|down_5|NC_008600.1_4482703_4482886_-	NA	NA|135aa|down_6|NC_008600.1_4482882_4483287_-	pfam06713, bPH_4, Bacterial PH domain	NA|185aa|down_7|NC_008600.1_4483378_4483933_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|437aa|down_8|NC_008600.1_4484131_4485442_-	PRK00421, murC, UDP-N-acetylmuramate--L-alanine ligase; Provisional	NA|373aa|down_9|NC_008600.1_4485694_4486813_-	PRK07188, PRK07188, nicotinate phosphoribosyltransferase; Provisional
GCF_000015065.1_ASM1506v1	NC_008600	Bacillus thuringiensis str. Al Hakam, complete sequence	3	4650894-4650980	3	CRISPRCasFinder	no		cas3,csa3,WYL,cas14k,DinG,cas14j,DEDDh,RT,c2c9_V-U4	Orphan	GATATATCTTAAAAATCGCTGATATA	26	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL,cas14k,DinG,cas14j,DEDDh,RT,c2c9_V-U4	NA,NA	NA|482aa|up_9|NC_008600.1_4639746_4641192_-	PRK03640, PRK03640, o-succinylbenzoate--CoA ligase	NA|273aa|up_8|NC_008600.1_4641422_4642241_-	PRK07396, PRK07396, dihydroxynaphthoic acid synthetase; Validated	NA|271aa|up_7|NC_008600.1_4642310_4643123_-	TIGR03695, menH_SHCHC, 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase	NA|585aa|up_6|NC_008600.1_4643119_4644874_-	PRK07449, PRK07449, 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase; Validated	NA|465aa|up_5|NC_008600.1_4644870_4646265_-	COG1169, MenF, Isochorismate synthase [Coenzyme metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|318aa|up_4|NC_008600.1_4646459_4647413_+	PRK06080, PRK06080, 1,4-dihydroxy-2-naphthoate octaprenyltransferase; Validated	NA|244aa|up_3|NC_008600.1_4647522_4648254_+	TIGR02890, conserved_hypothetical_protein, regulatory protein, yteA family	NA|67aa|up_2|NC_008600.1_4648298_4648499_-	COG1278, CspC, Cold shock proteins [Transcription]	NA|261aa|up_1|NC_008600.1_4648825_4649608_+	pfam01987, AIM24, Mitochondrial biogenesis AIM24	NA|241aa|up_0|NC_008600.1_4649886_4650609_+	cd00519, Lipase_3, Lipase (class 3)	NA|803aa|down_0|NC_008600.1_4651035_4653444_-	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins	NA|477aa|down_1|NC_008600.1_4653462_4654893_-	PRK00654, glgA, glycogen synthase GlgA	NA|345aa|down_2|NC_008600.1_4655005_4656040_-	TIGR02092, Glycogen_biosynthesis_protein_GlgD, glucose-1-phosphate adenylyltransferase, GlgD subunit	NA|377aa|down_3|NC_008600.1_4656058_4657189_-	PRK05293, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|646aa|down_4|NC_008600.1_4657136_4659074_-	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|202aa|down_5|NC_008600.1_4659542_4660148_+	pfam12389, Peptidase_M73, Camelysin metallo-endopeptidase	NA|1408aa|down_6|NC_008600.1_4660180_4664404_+	cd07474, Peptidases_S8_subtilisin_Vpr-like, Peptidase S8 family domain in Vpr-like proteins	NA|315aa|down_7|NC_008600.1_4664575_4665520_-	PRK00066, ldh, L-lactate dehydrogenase; Reviewed	NA|227aa|down_8|NC_008600.1_4673431_4674112_+	COG1285, SapB, Uncharacterized membrane protein [Function unknown]	NA|332aa|down_9|NC_008600.1_4674207_4675203_+	pfam07885, Ion_trans_2, Ion channel
GCF_000015065.1_ASM1506v1	NC_008600	Bacillus thuringiensis str. Al Hakam, complete sequence	4	4761639-4761761	4	CRISPRCasFinder	no		cas3,csa3,WYL,cas14k,DinG,cas14j,DEDDh,RT,c2c9_V-U4	Orphan	TTAAACAAACGTTTGATTAACTCCCTATTTTTCTTTGTTCAC	42	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL,cas14k,DinG,cas14j,DEDDh,RT,c2c9_V-U4	NA|115aa|up_4|NC_008600.1_4758995_4759340_-,NA|130aa|down_1|NC_008600.1_4762714_4763104_-,NA|64aa|down_2|NC_008600.1_4763428_4763620_-,NA|77aa|down_3|NC_008600.1_4763635_4763866_-,NA|176aa|down_4|NC_008600.1_4763921_4764449_-,NA|62aa|down_9|NC_008600.1_4767790_4767976_-	NA|262aa|up_9|NC_008600.1_4754039_4754825_-	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]	NA|269aa|up_8|NC_008600.1_4755064_4755871_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|271aa|up_7|NC_008600.1_4755943_4756756_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|222aa|up_6|NC_008600.1_4756779_4757445_-	COG2011, AbcD, ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]	NA|342aa|up_5|NC_008600.1_4757437_4758463_-	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|115aa|up_4|NC_008600.1_4758995_4759340_-	NA	NA|103aa|up_3|NC_008600.1_4759494_4759803_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|115aa|up_2|NC_008600.1_4759806_4760151_-	COG1658, COG1658, Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair]	NA|128aa|up_1|NC_008600.1_4760641_4761025_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|122aa|up_0|NC_008600.1_4761066_4761432_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|80aa|down_0|NC_008600.1_4761962_4762202_-	pfam13073, DUF3937, Protein of unknown function (DUF3937)	NA|130aa|down_1|NC_008600.1_4762714_4763104_-	NA	NA|64aa|down_2|NC_008600.1_4763428_4763620_-	NA	NA|77aa|down_3|NC_008600.1_4763635_4763866_-	NA	NA|176aa|down_4|NC_008600.1_4763921_4764449_-	NA	NA|216aa|down_5|NC_008600.1_4764592_4765240_+	cd03386, PAP2_Aur1_like, PAP2_like proteins, Aur1_like subfamily	NA|338aa|down_6|NC_008600.1_4765295_4766309_-	pfam13303, PTS_EIIC_2, Phosphotransferase system, EIIC	NA|378aa|down_7|NC_008600.1_4766331_4767465_-	cd05291, HicDH_like, L-2-hydroxyisocapronate dehydrogenases and some bacterial L-lactate dehydrogenases	NA|83aa|down_8|NC_008600.1_4767528_4767777_-	pfam07875, Coat_F, Coat F domain	NA|62aa|down_9|NC_008600.1_4767790_4767976_-	NA
