assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_013267335.1_ASM1326733v1	NZ_CP053981	Bacillus thuringiensis strain FDAARGOS_793 chromosome, complete genome	1	94270-94377	1	CRISPRCasFinder	no		csa3,cas14j,DEDDh,c2c9_V-U4,cas3,WYL,cas14k,DinG,RT	Orphan	TATATCAGCGATTTTTTGAATATATC	26	0	0	NA	NA	NA	1	1	Orphan	csa3,cas14j,DEDDh,c2c9_V-U4,cas3,WYL,cas14k,DinG,RT	NA|99aa|up_3|NZ_CP053981.1_91958_92255_+,NA|145aa|down_4|NZ_CP053981.1_97894_98329_-,NA|61aa|down_5|NZ_CP053981.1_98344_98527_-	NA|230aa|up_9|NZ_CP053981.1_86975_87665_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|485aa|up_8|NZ_CP053981.1_87666_89121_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|227aa|up_7|NZ_CP053981.1_89270_89951_+	sd00045, ANK, ankyrin repeats	NA|309aa|up_6|NZ_CP053981.1_90030_90957_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|133aa|up_5|NZ_CP053981.1_91085_91484_+	PRK13955, mscL, large conductance mechanosensitive channel protein MscL	NA|91aa|up_4|NZ_CP053981.1_91535_91808_-	pfam13055, DUF3917, Protein of unknown function (DUF3917)	NA|99aa|up_3|NZ_CP053981.1_91958_92255_+	NA	NA|79aa|up_2|NZ_CP053981.1_92336_92573_+	pfam13133, DUF3949, Protein of unknown function (DUF3949)	NA|121aa|up_1|NZ_CP053981.1_92574_92937_+	pfam14119, DUF4288, Domain of unknown function (DUF4288)	NA|333aa|up_0|NZ_CP053981.1_92972_93971_-	TIGR01481, catabolite_control_protein_A, catabolite control protein A	NA|339aa|down_0|NZ_CP053981.1_94581_95598_-	COG4851, CamS, Protein involved in sex pheromone biosynthesis [General function prediction only]	NA|109aa|down_1|NZ_CP053981.1_95715_96042_-	pfam11009, DUF2847, Protein of unknown function (DUF2847)	NA|171aa|down_2|NZ_CP053981.1_96041_96554_-	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|372aa|down_3|NZ_CP053981.1_96741_97857_+	COG2309, AmpS, Leucyl aminopeptidase (aminopeptidase T) [Amino acid transport and metabolism]	NA|145aa|down_4|NZ_CP053981.1_97894_98329_-	NA	NA|61aa|down_5|NZ_CP053981.1_98344_98527_-	NA	NA|135aa|down_6|NZ_CP053981.1_98523_98928_-	pfam06713, bPH_4, Bacterial PH domain	NA|185aa|down_7|NZ_CP053981.1_99019_99574_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|437aa|down_8|NZ_CP053981.1_99772_101083_-	PRK00421, murC, UDP-N-acetylmuramate--L-alanine ligase; Provisional	NA|373aa|down_9|NZ_CP053981.1_101335_102454_-	PRK07188, PRK07188, nicotinate phosphoribosyltransferase; Provisional
GCF_013267335.1_ASM1326733v1	NZ_CP053981	Bacillus thuringiensis strain FDAARGOS_793 chromosome, complete genome	2	266535-266621	2	CRISPRCasFinder	no		csa3,cas14j,DEDDh,c2c9_V-U4,cas3,WYL,cas14k,DinG,RT	Orphan	GATATATCTTAAAAATCGCTGATATA	26	0	0	NA	NA	NA	1	1	Orphan	csa3,cas14j,DEDDh,c2c9_V-U4,cas3,WYL,cas14k,DinG,RT	NA,NA	NA|482aa|up_9|NZ_CP053981.1_255387_256833_-	PRK03640, PRK03640, o-succinylbenzoate--CoA ligase	NA|273aa|up_8|NZ_CP053981.1_257063_257882_-	PRK07396, PRK07396, dihydroxynaphthoic acid synthetase; Validated	NA|271aa|up_7|NZ_CP053981.1_257951_258764_-	TIGR03695, menH_SHCHC, 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase	NA|585aa|up_6|NZ_CP053981.1_258760_260515_-	PRK07449, PRK07449, 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase; Validated	NA|465aa|up_5|NZ_CP053981.1_260511_261906_-	COG1169, MenF, Isochorismate synthase [Coenzyme metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|318aa|up_4|NZ_CP053981.1_262100_263054_+	PRK06080, PRK06080, 1,4-dihydroxy-2-naphthoate octaprenyltransferase; Validated	NA|244aa|up_3|NZ_CP053981.1_263163_263895_+	TIGR02890, conserved_hypothetical_protein, regulatory protein, yteA family	NA|67aa|up_2|NZ_CP053981.1_263939_264140_-	COG1278, CspC, Cold shock proteins [Transcription]	NA|261aa|up_1|NZ_CP053981.1_264466_265249_+	pfam01987, AIM24, Mitochondrial biogenesis AIM24	NA|241aa|up_0|NZ_CP053981.1_265527_266250_+	cd00519, Lipase_3, Lipase (class 3)	NA|803aa|down_0|NZ_CP053981.1_266676_269085_-	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins	NA|477aa|down_1|NZ_CP053981.1_269103_270534_-	PRK00654, glgA, glycogen synthase GlgA	NA|345aa|down_2|NZ_CP053981.1_270646_271681_-	TIGR02092, Glycogen_biosynthesis_protein_GlgD, glucose-1-phosphate adenylyltransferase, GlgD subunit	NA|377aa|down_3|NZ_CP053981.1_271699_272830_-	PRK05293, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|646aa|down_4|NZ_CP053981.1_272777_274715_-	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|202aa|down_5|NZ_CP053981.1_275183_275789_+	pfam12389, Peptidase_M73, Camelysin metallo-endopeptidase	NA|1408aa|down_6|NZ_CP053981.1_275821_280045_+	cd07474, Peptidases_S8_subtilisin_Vpr-like, Peptidase S8 family domain in Vpr-like proteins	NA|315aa|down_7|NZ_CP053981.1_280216_281161_-	PRK00066, ldh, L-lactate dehydrogenase; Reviewed	NA|227aa|down_8|NZ_CP053981.1_289071_289752_+	COG1285, SapB, Uncharacterized membrane protein [Function unknown]	NA|332aa|down_9|NZ_CP053981.1_289847_290843_+	pfam07885, Ion_trans_2, Ion channel
GCF_013267335.1_ASM1326733v1	NZ_CP053981	Bacillus thuringiensis strain FDAARGOS_793 chromosome, complete genome	3	377279-377401	3	CRISPRCasFinder	no		csa3,cas14j,DEDDh,c2c9_V-U4,cas3,WYL,cas14k,DinG,RT	Orphan	TTAAACAAACGTTTGATTAACTCCCTATTTTTCTTTGTTCAC	42	0	0	NA	NA	NA	1	1	Orphan	csa3,cas14j,DEDDh,c2c9_V-U4,cas3,WYL,cas14k,DinG,RT	NA|115aa|up_4|NZ_CP053981.1_374635_374980_-,NA|130aa|down_1|NZ_CP053981.1_378354_378744_-,NA|64aa|down_2|NZ_CP053981.1_379068_379260_-,NA|77aa|down_3|NZ_CP053981.1_379275_379506_-,NA|176aa|down_4|NZ_CP053981.1_379561_380089_-,NA|62aa|down_9|NZ_CP053981.1_383430_383616_-	NA|262aa|up_9|NZ_CP053981.1_369679_370465_-	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]	NA|269aa|up_8|NZ_CP053981.1_370704_371511_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|271aa|up_7|NZ_CP053981.1_371583_372396_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|222aa|up_6|NZ_CP053981.1_372419_373085_-	COG2011, AbcD, ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]	NA|342aa|up_5|NZ_CP053981.1_373077_374103_-	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|115aa|up_4|NZ_CP053981.1_374635_374980_-	NA	NA|103aa|up_3|NZ_CP053981.1_375134_375443_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|115aa|up_2|NZ_CP053981.1_375446_375791_-	COG1658, COG1658, Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair]	NA|128aa|up_1|NZ_CP053981.1_376281_376665_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|122aa|up_0|NZ_CP053981.1_376706_377072_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|80aa|down_0|NZ_CP053981.1_377602_377842_-	pfam13073, DUF3937, Protein of unknown function (DUF3937)	NA|130aa|down_1|NZ_CP053981.1_378354_378744_-	NA	NA|64aa|down_2|NZ_CP053981.1_379068_379260_-	NA	NA|77aa|down_3|NZ_CP053981.1_379275_379506_-	NA	NA|176aa|down_4|NZ_CP053981.1_379561_380089_-	NA	NA|216aa|down_5|NZ_CP053981.1_380232_380880_+	cd03386, PAP2_Aur1_like, PAP2_like proteins, Aur1_like subfamily	NA|338aa|down_6|NZ_CP053981.1_380935_381949_-	pfam13303, PTS_EIIC_2, Phosphotransferase system, EIIC	NA|378aa|down_7|NZ_CP053981.1_381971_383105_-	cd05291, HicDH_like, L-2-hydroxyisocapronate dehydrogenases and some bacterial L-lactate dehydrogenases	NA|83aa|down_8|NZ_CP053981.1_383168_383417_-	pfam07875, Coat_F, Coat F domain	NA|62aa|down_9|NZ_CP053981.1_383430_383616_-	NA
GCF_013267335.1_ASM1326733v1	NZ_CP053981	Bacillus thuringiensis strain FDAARGOS_793 chromosome, complete genome	4	2088751-2088859	4	CRISPRCasFinder	no		csa3,cas14j,DEDDh,c2c9_V-U4,cas3,WYL,cas14k,DinG,RT	Orphan	TGTATGATTACCTTCCGCATGAGAA	25	0	0	NA	NA	NA	1	1	Orphan	csa3,cas14j,DEDDh,c2c9_V-U4,cas3,WYL,cas14k,DinG,RT	NA|58aa|up_7|NZ_CP053981.1_2082688_2082862_-,NA|124aa|up_3|NZ_CP053981.1_2085288_2085660_+,NA	NA|415aa|up_9|NZ_CP053981.1_2079447_2080692_+	COG4469, CoiA, Competence protein CoiA-like family, contains a predicted nuclease    domain [General function prediction only]	NA|609aa|up_8|NZ_CP053981.1_2080742_2082569_+	cd09608, M3B_PepF, Peptidase family M3B, oligopeptidase F (PepF)	NA|58aa|up_7|NZ_CP053981.1_2082688_2082862_-	NA	NA|298aa|up_6|NZ_CP053981.1_2083091_2083985_-	pfam13743, Thioredoxin_5, Thioredoxin	NA|133aa|up_5|NZ_CP053981.1_2083984_2084383_-	cd14772, TrHb2_Bs-trHb-like_O, Truncated hemoglobins, group 2 (O); Bacillus subtilis TrHb like	NA|193aa|up_4|NZ_CP053981.1_2084563_2085142_-	cd07762, CYTH-like_Pase_1, Uncharacterized subgroup 1 of the CYTH-like superfamily	NA|124aa|up_3|NZ_CP053981.1_2085288_2085660_+	NA	NA|213aa|up_2|NZ_CP053981.1_2085690_2086329_+	COG2357, COG2357, PpGpp synthetase catalytic domain [General function prediction only]	NA|266aa|up_1|NZ_CP053981.1_2086347_2087145_+	PRK04885, ppnK, inorganic polyphosphate/ATP-NAD kinase; Provisional	NA|298aa|up_0|NZ_CP053981.1_2087160_2088054_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|247aa|down_0|NZ_CP053981.1_2089632_2090373_-	PRK13625, PRK13625, bis(5'-nucleosyl)-tetraphosphatase PrpE; Provisional	NA|387aa|down_1|NZ_CP053981.1_2090447_2091608_-	TIGR02210, Rod_shape-determining_protein_RodA, rod shape-determining protein RodA	NA|309aa|down_2|NZ_CP053981.1_2091726_2092653_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|251aa|down_3|NZ_CP053981.1_2092666_2093419_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|282aa|down_4|NZ_CP053981.1_2093633_2094479_-	pfam05711, TylF, Macrocin-O-methyltransferase (TylF)	NA|302aa|down_5|NZ_CP053981.1_2094601_2095507_-	pfam18573, BclA_C, BclA C-terminal domain	NA|366aa|down_6|NZ_CP053981.1_2095672_2096770_+	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|230aa|down_7|NZ_CP053981.1_2096981_2097671_+	pfam08242, Methyltransf_12, Methyltransferase domain	NA|229aa|down_8|NZ_CP053981.1_2097667_2098354_+	pfam13712, Glyco_tranf_2_5, Glycosyltransferase like family	NA|227aa|down_9|NZ_CP053981.1_2098368_2099049_+	pfam13712, Glyco_tranf_2_5, Glycosyltransferase like family
