assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_013267295.1_ASM1326729v1	NZ_CP053972	Bacillus thuringiensis strain FDAARGOS_796 chromosome, complete genome	1	1890641-1890757	1	CRISPRCasFinder	no		csa3,WYL,DEDDh,cas14k,cas3,c2c9_V-U4,cas14j,DinG	Orphan	CTTAAACAAGCGTTTGATTAATTCTCCATTTTTCTT	36	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,DEDDh,cas14k,cas3,c2c9_V-U4,cas14j,DinG,cas4,RT	NA|115aa|up_4|NZ_CP053972.1_1887770_1888115_-,NA|176aa|down_0|NZ_CP053972.1_1890856_1891384_-,NA|62aa|down_5|NZ_CP053972.1_1894729_1894915_-	NA|262aa|up_9|NZ_CP053972.1_1882935_1883721_-	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]	NA|269aa|up_8|NZ_CP053972.1_1883959_1884766_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|271aa|up_7|NZ_CP053972.1_1884837_1885650_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|222aa|up_6|NZ_CP053972.1_1885673_1886339_-	COG2011, AbcD, ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]	NA|342aa|up_5|NZ_CP053972.1_1886331_1887357_-	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|115aa|up_4|NZ_CP053972.1_1887770_1888115_-	NA	NA|103aa|up_3|NZ_CP053972.1_1888267_1888576_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|115aa|up_2|NZ_CP053972.1_1888579_1888924_-	COG1658, COG1658, Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair]	NA|128aa|up_1|NZ_CP053972.1_1889654_1890038_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|122aa|up_0|NZ_CP053972.1_1890079_1890445_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|176aa|down_0|NZ_CP053972.1_1890856_1891384_-	NA	NA|216aa|down_1|NZ_CP053972.1_1891528_1892176_+	cd03386, PAP2_Aur1_like, PAP2_like proteins, Aur1_like subfamily	NA|338aa|down_2|NZ_CP053972.1_1892235_1893249_-	pfam13303, PTS_EIIC_2, Phosphotransferase system, EIIC	NA|390aa|down_3|NZ_CP053972.1_1893271_1894441_-	cd05291, HicDH_like, L-2-hydroxyisocapronate dehydrogenases and some bacterial L-lactate dehydrogenases	NA|83aa|down_4|NZ_CP053972.1_1894467_1894716_-	pfam07875, Coat_F, Coat F domain	NA|62aa|down_5|NZ_CP053972.1_1894729_1894915_-	NA	NA|240aa|down_6|NZ_CP053972.1_1895028_1895748_-	cd07721, yflN-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis yflN; MBL-fold metallo hydrolase domain	NA|595aa|down_7|NZ_CP053972.1_1895863_1897648_-	cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase	NA|391aa|down_8|NZ_CP053972.1_1898036_1899209_-	PRK07661, PRK07661, acetyl-CoA C-acetyltransferase	NA|794aa|down_9|NZ_CP053972.1_1899230_1901612_-	COG1250, FadB, 3-hydroxyacyl-CoA dehydrogenase [Lipid metabolism]
GCF_013267295.1_ASM1326729v1	NZ_CP053972	Bacillus thuringiensis strain FDAARGOS_796 chromosome, complete genome	2	2146079-2146212	2	CRISPRCasFinder	no		csa3,WYL,DEDDh,cas14k,cas3,c2c9_V-U4,cas14j,DinG	Orphan	GTTGATTTCTCTTCTTTTTGAGA	23	0	0	NA	NA	NA	2	2	Orphan	csa3,WYL,DEDDh,cas14k,cas3,c2c9_V-U4,cas14j,DinG,cas4,RT	NA|45aa|up_0|NZ_CP053972.1_2145756_2145891_-,NA	NA|220aa|up_9|NZ_CP053972.1_2137758_2138418_-	TIGR03025, EPS_sugtrans, exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase	NA|293aa|up_8|NZ_CP053972.1_2138436_2139315_-	COG1210, GalU, UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane]	NA|256aa|up_7|NZ_CP053972.1_2139554_2140322_-	COG4464, CapC, Capsular polysaccharide biosynthesis protein [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|234aa|up_6|NZ_CP053972.1_2140433_2141135_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|248aa|up_5|NZ_CP053972.1_2141124_2141868_-	COG3944, COG3944, Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]	NA|226aa|up_4|NZ_CP053972.1_2142131_2142809_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|145aa|up_3|NZ_CP053972.1_2143150_2143585_-	PRK00006, fabZ, 3-hydroxyacyl-ACP dehydratase FabZ	NA|334aa|up_2|NZ_CP053972.1_2144013_2145015_-	PRK13928, PRK13928, rod shape-determining protein Mbl; Provisional	NA|91aa|up_1|NZ_CP053972.1_2145175_2145448_-	pfam12116, SpoIIID, Stage III sporulation protein D	NA|45aa|up_0|NZ_CP053972.1_2145756_2145891_-	NA	NA|236aa|down_0|NZ_CP053972.1_2147100_2147808_-	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|281aa|down_1|NZ_CP053972.1_2147807_2148650_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|336aa|down_2|NZ_CP053972.1_2148831_2149839_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|340aa|down_3|NZ_CP053972.1_2149937_2150957_-	TIGR02870, Stage_II_sporulation_protein_D, stage II sporulation protein D	NA|435aa|down_4|NZ_CP053972.1_2151165_2152470_-	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|237aa|down_5|NZ_CP053972.1_2152509_2153220_-	pfam08680, DUF1779, TATA-box binding	NA|79aa|down_6|NZ_CP053972.1_2153265_2153502_-	COG4836, COG4836, Predicted membrane protein [Function unknown]	NA|507aa|down_7|NZ_CP053972.1_2153704_2155225_-	PRK05777, PRK05777, NADH-quinone oxidoreductase subunit NuoN	NA|501aa|down_8|NZ_CP053972.1_2155226_2156729_-	PRK05846, PRK05846, NADH:ubiquinone oxidoreductase subunit M; Reviewed	NA|621aa|down_9|NZ_CP053972.1_2156725_2158588_-	PRK06590, PRK06590, NADH:ubiquinone oxidoreductase subunit L; Reviewed
GCF_013267295.1_ASM1326729v1	NZ_CP053972	Bacillus thuringiensis strain FDAARGOS_796 chromosome, complete genome	3	3057784-3057859	3	CRISPRCasFinder	no	csa3	csa3,WYL,DEDDh,cas14k,cas3,c2c9_V-U4,cas14j,DinG	Type I-A	ATCATCATCATGGAGGACACAATCA	25	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,DEDDh,cas14k,cas3,c2c9_V-U4,cas14j,DinG,cas4,RT	NA,NA|48aa|down_8|NZ_CP053972.1_3070866_3071010_+	NA|335aa|up_9|NZ_CP053972.1_3046505_3047510_+	pfam01032, FecCD, FecCD transport family	NA|353aa|up_8|NZ_CP053972.1_3047506_3048565_+	pfam01032, FecCD, FecCD transport family	NA|274aa|up_7|NZ_CP053972.1_3048577_3049399_+	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|244aa|up_6|NZ_CP053972.1_3049430_3050162_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|397aa|up_5|NZ_CP053972.1_3050375_3051566_+	PRK06939, PRK06939, 2-amino-3-ketobutyrate coenzyme A ligase; Provisional	NA|322aa|up_4|NZ_CP053972.1_3051610_3052576_+	cd05272, TDH_SDR_e, L-threonine dehydrogenase, extended (e) SDRs	NA|141aa|up_3|NZ_CP053972.1_3052635_3053058_+	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|628aa|up_2|NZ_CP053972.1_3053095_3054979_-	COG4548, NorD, Nitric oxide reductase activation protein [Inorganic ion transport and metabolism]	NA|298aa|up_1|NZ_CP053972.1_3054982_3055876_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|510aa|up_0|NZ_CP053972.1_3056003_3057533_-	PRK12452, PRK12452, cardiolipin synthase	NA|568aa|down_0|NZ_CP053972.1_3058582_3060286_+	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|466aa|down_1|NZ_CP053972.1_3060317_3061715_-	TIGR00905, Arginine/ornithine_antiporter, transporter, basic amino acid/polyamine antiporter (APA) family	NA|237aa|down_2|NZ_CP053972.1_3062167_3062878_+	TIGR02404, Trehalose_operon_transcriptional_repressor, trehalose operon repressor, B	NA|476aa|down_3|NZ_CP053972.1_3063020_3064448_+	TIGR01992, phosphotransferase_system_trehalose_permease, PTS system, trehalose-specific IIBC component	NA|554aa|down_4|NZ_CP053972.1_3064461_3066123_+	TIGR02403, Trehalose-6-phosphate_hydrolase, alpha,alpha-phosphotrehalase	NA|369aa|down_5|NZ_CP053972.1_3067260_3068367_-	pfam03845, Spore_permease, Spore germination protein	NA|501aa|down_6|NZ_CP053972.1_3068347_3069850_-	pfam03323, GerA, Bacillus/Clostridium GerA spore germination protein	NA|274aa|down_7|NZ_CP053972.1_3070038_3070860_+	COG2334, COG2334, Putative homoserine kinase type II (protein kinase fold) [General function prediction only]	NA|48aa|down_8|NZ_CP053972.1_3070866_3071010_+	NA	NA|487aa|down_9|NZ_CP053972.1_3071169_3072630_+	pfam01235, Na_Ala_symp, Sodium:alanine symporter family
GCF_013267295.1_ASM1326729v1	NZ_CP053970	Bacillus thuringiensis strain FDAARGOS_796 plasmid unnamed2, complete sequence	1	6354-6814	1	CRISPRCasFinder	no			Orphan	GATATCATTGAAGACGAAGATGA	23	3	5	6512-6530|6512-6530|6554-6572|6773-6791|6773-6791	NZ_CP053970.1_195788-195806|NZ_CP053972.1_1454814-1454796|NZ_CP053970.1_195788-195806|NZ_CP053972.1_3327578-3327596|NZ_CP053972.1_2688848-2688866	NA	9	9	Orphan	csa3,WYL,DEDDh,cas14k,cas3,c2c9_V-U4,cas14j,DinG,cas4,RT	NA|69aa|up_9|NZ_CP053970.1_90_297_+,NA|62aa|up_8|NZ_CP053970.1_283_469_+,NA|59aa|up_7|NZ_CP053970.1_478_655_+,NA|65aa|up_6|NZ_CP053970.1_671_866_+,NA|339aa|up_5|NZ_CP053970.1_897_1914_+,NA|120aa|up_2|NZ_CP053970.1_3965_4325_+,NA|178aa|down_0|NZ_CP053970.1_6945_7479_+,NA|186aa|down_1|NZ_CP053970.1_7594_8152_+,NA|164aa|down_2|NZ_CP053970.1_8155_8647_+,NA|226aa|down_3|NZ_CP053970.1_8664_9342_+,NA|207aa|down_4|NZ_CP053970.1_9364_9985_+,NA|161aa|down_5|NZ_CP053970.1_10008_10491_+,NA|160aa|down_6|NZ_CP053970.1_10513_10993_+,NA|173aa|down_7|NZ_CP053970.1_11293_11812_+,NA|176aa|down_8|NZ_CP053970.1_11846_12374_+,NA|246aa|down_9|NZ_CP053970.1_12416_13154_+	NA|69aa|up_9|NZ_CP053970.1_90_297_+	NA	NA|62aa|up_8|NZ_CP053970.1_283_469_+	NA	NA|59aa|up_7|NZ_CP053970.1_478_655_+	NA	NA|65aa|up_6|NZ_CP053970.1_671_866_+	NA	NA|339aa|up_5|NZ_CP053970.1_897_1914_+	NA	NA|418aa|up_4|NZ_CP053970.1_1901_3155_+	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|263aa|up_3|NZ_CP053970.1_3161_3950_+	cd17792, CtkA, serine/threonine-protein kinase CtkA and similar proteins	NA|120aa|up_2|NZ_CP053970.1_3965_4325_+	NA	NA|242aa|up_1|NZ_CP053970.1_4355_5081_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|316aa|up_0|NZ_CP053970.1_5179_6127_+	PRK13480, PRK13480, 3'-5' exoribonuclease YhaM; Provisional	NA|178aa|down_0|NZ_CP053970.1_6945_7479_+	NA	NA|186aa|down_1|NZ_CP053970.1_7594_8152_+	NA	NA|164aa|down_2|NZ_CP053970.1_8155_8647_+	NA	NA|226aa|down_3|NZ_CP053970.1_8664_9342_+	NA	NA|207aa|down_4|NZ_CP053970.1_9364_9985_+	NA	NA|161aa|down_5|NZ_CP053970.1_10008_10491_+	NA	NA|160aa|down_6|NZ_CP053970.1_10513_10993_+	NA	NA|173aa|down_7|NZ_CP053970.1_11293_11812_+	NA	NA|176aa|down_8|NZ_CP053970.1_11846_12374_+	NA	NA|246aa|down_9|NZ_CP053970.1_12416_13154_+	NA
GCF_013267295.1_ASM1326729v1	NZ_CP053970	Bacillus thuringiensis strain FDAARGOS_796 plasmid unnamed2, complete sequence	2	162785-162922	2	CRISPRCasFinder	no			Orphan	CTAAACCTGTACAAAAGCAAGAAG	24	0	0	NA	NA	NA	2	2	Orphan	csa3,WYL,DEDDh,cas14k,cas3,c2c9_V-U4,cas14j,DinG,cas4,RT	NA|137aa|up_9|NZ_CP053970.1_133622_134033_+,NA|194aa|up_3|NZ_CP053970.1_147982_148564_+,NA|139aa|up_2|NZ_CP053970.1_148643_149060_+,NA|133aa|down_1|NZ_CP053970.1_164451_164850_+,NA|204aa|down_2|NZ_CP053970.1_164927_165539_+,NA|247aa|down_3|NZ_CP053970.1_165646_166387_+,NA|139aa|down_4|NZ_CP053970.1_166406_166823_+,NA|127aa|down_5|NZ_CP053970.1_166828_167209_+,NA|120aa|down_6|NZ_CP053970.1_167227_167587_+,NA|237aa|down_7|NZ_CP053970.1_167598_168309_+,NA|86aa|down_8|NZ_CP053970.1_168633_168891_+,NA|108aa|down_9|NZ_CP053970.1_168911_169235_+	NA|137aa|up_9|NZ_CP053970.1_133622_134033_+	NA	NA|396aa|up_8|NZ_CP053970.1_134269_135457_+	smart00475, 53EXOc, 5'-3' exonuclease	NA|423aa|up_7|NZ_CP053970.1_135666_136935_+	COG3786, COG3786, Uncharacterized protein conserved in bacteria [Function unknown]	NA|2561aa|up_6|NZ_CP053970.1_137574_145257_+	COG4932, COG4932, Predicted outer membrane protein [Cell envelope biogenesis, outer membrane]	NA|471aa|up_5|NZ_CP053970.1_145679_147092_+	COG4990, COG4990, Uncharacterized protein conserved in bacteria [Function unknown]	NA|237aa|up_4|NZ_CP053970.1_147252_147963_+	cd06165, Sortase_A, Sortase domain found in class A sortases	NA|194aa|up_3|NZ_CP053970.1_147982_148564_+	NA	NA|139aa|up_2|NZ_CP053970.1_148643_149060_+	NA	NA|353aa|up_1|NZ_CP053970.1_149311_150370_+	pfam16403, DUF5011, Domain of unknown function (DUF5011)	NA|3527aa|up_0|NZ_CP053970.1_150948_161529_+	TIGR04226, Fimbrial_subunit_type_2, fimbrial isopeptide formation D2 domain	NA|283aa|down_0|NZ_CP053970.1_163436_164285_+	cd10446, GIY-YIG_unchar_1, GIY-YIG domain of uncharacterized hypothetical protein found in bacteria	NA|133aa|down_1|NZ_CP053970.1_164451_164850_+	NA	NA|204aa|down_2|NZ_CP053970.1_164927_165539_+	NA	NA|247aa|down_3|NZ_CP053970.1_165646_166387_+	NA	NA|139aa|down_4|NZ_CP053970.1_166406_166823_+	NA	NA|127aa|down_5|NZ_CP053970.1_166828_167209_+	NA	NA|120aa|down_6|NZ_CP053970.1_167227_167587_+	NA	NA|237aa|down_7|NZ_CP053970.1_167598_168309_+	NA	NA|86aa|down_8|NZ_CP053970.1_168633_168891_+	NA	NA|108aa|down_9|NZ_CP053970.1_168911_169235_+	NA
