assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000835025.1_ASM83502v1	NZ_CP009351	Bacillus thuringiensis HD1002 chromosome, complete genome	1	429891-429987	1	CRISPRCasFinder	no		DEDDh,cas14k,WYL,csa3,cas3,DinG,cas14j,c2c9_V-U4	Orphan	AAATACTTTGCAACAAGCAAATTT	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas14k,WYL,csa3,cas3,DinG,cas14j,c2c9_V-U4,RT,cas4	NA|147aa|up_9|NZ_CP009351.1_415827_416268_+,NA|118aa|up_8|NZ_CP009351.1_416805_417159_-,NA|85aa|up_6|NZ_CP009351.1_421093_421348_-,NA|77aa|up_2|NZ_CP009351.1_427413_427644_+,NA|82aa|down_0|NZ_CP009351.1_430035_430281_+,NA|105aa|down_3|NZ_CP009351.1_431773_432088_+,NA|335aa|down_5|NZ_CP009351.1_433253_434258_+,NA|89aa|down_7|NZ_CP009351.1_435074_435341_+	NA|147aa|up_9|NZ_CP009351.1_415827_416268_+	NA	NA|118aa|up_8|NZ_CP009351.1_416805_417159_-	NA	NA|1094aa|up_7|NZ_CP009351.1_417734_421016_+	COG4932, COG4932, Predicted outer membrane protein [Cell envelope biogenesis, outer membrane]	NA|85aa|up_6|NZ_CP009351.1_421093_421348_-	NA	NA|248aa|up_5|NZ_CP009351.1_421606_422350_+	cd14852, LD-carboxypeptidase, L,D-carboxypeptidase DacB and LdcB, and related proteins	NA|411aa|up_4|NZ_CP009351.1_423081_424314_+	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|354aa|up_3|NZ_CP009351.1_425247_426309_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|77aa|up_2|NZ_CP009351.1_427413_427644_+	NA	NA|418aa|up_1|NZ_CP009351.1_427785_429039_+	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|110aa|up_0|NZ_CP009351.1_429459_429789_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|82aa|down_0|NZ_CP009351.1_430035_430281_+	NA	NA|63aa|down_1|NZ_CP009351.1_430425_430614_+	COG1476, COG1476, Predicted transcriptional regulators [Transcription]	NA|260aa|down_2|NZ_CP009351.1_430831_431611_+	pfam10552, ORF6C, ORF6C domain	NA|105aa|down_3|NZ_CP009351.1_431773_432088_+	NA	NA|216aa|down_4|NZ_CP009351.1_432361_433009_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|335aa|down_5|NZ_CP009351.1_433253_434258_+	NA	NA|271aa|down_6|NZ_CP009351.1_434220_435033_+	PRK06921, PRK06921, hypothetical protein; Provisional	NA|89aa|down_7|NZ_CP009351.1_435074_435341_+	NA	NA|55aa|down_8|NZ_CP009351.1_435412_435577_+	pfam13128, DUF3954, Protein of unknown function (DUF3954)	NA|72aa|down_9|NZ_CP009351.1_435594_435810_+	pfam11195, DUF2829, Protein of unknown function (DUF2829)
GCF_000835025.1_ASM83502v1	NZ_CP009351	Bacillus thuringiensis HD1002 chromosome, complete genome	2	3224236-3224311	2	CRISPRCasFinder	no	csa3	DEDDh,cas14k,WYL,csa3,cas3,DinG,cas14j,c2c9_V-U4	Type I-A	TGATTGTGTCCTCCATGGTGATGAT	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas14k,WYL,csa3,cas3,DinG,cas14j,c2c9_V-U4,RT,cas4	NA|48aa|up_8|NZ_CP009351.1_3211084_3211228_-,NA	NA|487aa|up_9|NZ_CP009351.1_3209464_3210925_-	pfam01235, Na_Ala_symp, Sodium:alanine symporter family	NA|48aa|up_8|NZ_CP009351.1_3211084_3211228_-	NA	NA|274aa|up_7|NZ_CP009351.1_3211234_3212056_-	COG2334, COG2334, Putative homoserine kinase type II (protein kinase fold) [General function prediction only]	NA|501aa|up_6|NZ_CP009351.1_3212244_3213747_+	pfam03323, GerA, Bacillus/Clostridium GerA spore germination protein	NA|369aa|up_5|NZ_CP009351.1_3213727_3214834_+	pfam03845, Spore_permease, Spore germination protein	NA|554aa|up_4|NZ_CP009351.1_3215971_3217633_-	TIGR02403, Trehalose-6-phosphate_hydrolase, alpha,alpha-phosphotrehalase	NA|476aa|up_3|NZ_CP009351.1_3217646_3219074_-	TIGR01992, phosphotransferase_system_trehalose_permease, PTS system, trehalose-specific IIBC component	NA|237aa|up_2|NZ_CP009351.1_3219216_3219927_-	TIGR02404, Trehalose_operon_transcriptional_repressor, trehalose operon repressor, B	NA|466aa|up_1|NZ_CP009351.1_3220379_3221777_+	TIGR00905, Arginine/ornithine_antiporter, transporter, basic amino acid/polyamine antiporter (APA) family	NA|568aa|up_0|NZ_CP009351.1_3221808_3223512_-	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|510aa|down_0|NZ_CP009351.1_3224561_3226091_+	PRK12452, PRK12452, cardiolipin synthase	NA|298aa|down_1|NZ_CP009351.1_3226218_3227112_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|628aa|down_2|NZ_CP009351.1_3227115_3228999_+	COG4548, NorD, Nitric oxide reductase activation protein [Inorganic ion transport and metabolism]	NA|141aa|down_3|NZ_CP009351.1_3229036_3229459_-	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|322aa|down_4|NZ_CP009351.1_3229518_3230484_-	cd05272, TDH_SDR_e, L-threonine dehydrogenase, extended (e) SDRs	NA|397aa|down_5|NZ_CP009351.1_3230528_3231719_-	PRK06939, PRK06939, 2-amino-3-ketobutyrate coenzyme A ligase; Provisional	NA|244aa|down_6|NZ_CP009351.1_3231932_3232664_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|274aa|down_7|NZ_CP009351.1_3232695_3233517_-	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|353aa|down_8|NZ_CP009351.1_3233529_3234588_-	pfam01032, FecCD, FecCD transport family	NA|335aa|down_9|NZ_CP009351.1_3234584_3235589_-	pfam01032, FecCD, FecCD transport family
GCF_000835025.1_ASM83502v1	NZ_CP009351	Bacillus thuringiensis HD1002 chromosome, complete genome	3	4128951-4129084	3	CRISPRCasFinder	no		DEDDh,cas14k,WYL,csa3,cas3,DinG,cas14j,c2c9_V-U4	Orphan	GACAAATCTCAAAAAGAAGAGAA	23	0	0	NA	NA	NA	2	2	Orphan	DEDDh,cas14k,WYL,csa3,cas3,DinG,cas14j,c2c9_V-U4,RT,cas4	NA,NA|45aa|down_0|NZ_CP009351.1_4129277_4129412_+	NA|621aa|up_9|NZ_CP009351.1_4116580_4118443_+	PRK06590, PRK06590, NADH:ubiquinone oxidoreductase subunit L; Reviewed	NA|501aa|up_8|NZ_CP009351.1_4118439_4119942_+	PRK05846, PRK05846, NADH:ubiquinone oxidoreductase subunit M; Reviewed	NA|507aa|up_7|NZ_CP009351.1_4119943_4121464_+	PRK05777, PRK05777, NADH-quinone oxidoreductase subunit NuoN	NA|79aa|up_6|NZ_CP009351.1_4121666_4121903_+	COG4836, COG4836, Predicted membrane protein [Function unknown]	NA|237aa|up_5|NZ_CP009351.1_4121948_4122659_+	pfam08680, DUF1779, TATA-box binding	NA|435aa|up_4|NZ_CP009351.1_4122698_4124003_+	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|340aa|up_3|NZ_CP009351.1_4124211_4125231_+	TIGR02870, Stage_II_sporulation_protein_D, stage II sporulation protein D	NA|336aa|up_2|NZ_CP009351.1_4125329_4126337_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|281aa|up_1|NZ_CP009351.1_4126518_4127361_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|236aa|up_0|NZ_CP009351.1_4127360_4128068_+	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|45aa|down_0|NZ_CP009351.1_4129277_4129412_+	NA	NA|91aa|down_1|NZ_CP009351.1_4129720_4129993_+	pfam12116, SpoIIID, Stage III sporulation protein D	NA|334aa|down_2|NZ_CP009351.1_4130153_4131155_+	PRK13928, PRK13928, rod shape-determining protein Mbl; Provisional	NA|145aa|down_3|NZ_CP009351.1_4131583_4132018_+	PRK00006, fabZ, 3-hydroxyacyl-ACP dehydratase FabZ	NA|226aa|down_4|NZ_CP009351.1_4132359_4133037_+	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|248aa|down_5|NZ_CP009351.1_4133300_4134044_+	COG3944, COG3944, Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]	NA|234aa|down_6|NZ_CP009351.1_4134033_4134735_+	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|256aa|down_7|NZ_CP009351.1_4134846_4135614_+	COG4464, CapC, Capsular polysaccharide biosynthesis protein [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|293aa|down_8|NZ_CP009351.1_4135853_4136732_+	COG1210, GalU, UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane]	NA|220aa|down_9|NZ_CP009351.1_4136750_4137410_+	TIGR03025, EPS_sugtrans, exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
GCF_000835025.1_ASM83502v1	NZ_CP009351	Bacillus thuringiensis HD1002 chromosome, complete genome	4	4384412-4384528	4	CRISPRCasFinder	no		DEDDh,cas14k,WYL,csa3,cas3,DinG,cas14j,c2c9_V-U4	Orphan	AAGAAAAATGGAGAATTAATCAAACGCTTGTTTAAG	36	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas14k,WYL,csa3,cas3,DinG,cas14j,c2c9_V-U4,RT,cas4	NA|62aa|up_5|NZ_CP009351.1_4380253_4380439_+,NA|176aa|up_0|NZ_CP009351.1_4383784_4384312_+,NA|115aa|down_4|NZ_CP009351.1_4387053_4387398_+	NA|794aa|up_9|NZ_CP009351.1_4373556_4375938_+	COG1250, FadB, 3-hydroxyacyl-CoA dehydrogenase [Lipid metabolism]	NA|391aa|up_8|NZ_CP009351.1_4375959_4377132_+	PRK07661, PRK07661, acetyl-CoA C-acetyltransferase	NA|601aa|up_7|NZ_CP009351.1_4377502_4379305_+	cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase	NA|240aa|up_6|NZ_CP009351.1_4379420_4380140_+	cd07721, yflN-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis yflN; MBL-fold metallo hydrolase domain	NA|62aa|up_5|NZ_CP009351.1_4380253_4380439_+	NA	NA|83aa|up_4|NZ_CP009351.1_4380452_4380701_+	pfam07875, Coat_F, Coat F domain	NA|390aa|up_3|NZ_CP009351.1_4380727_4381897_+	cd05291, HicDH_like, L-2-hydroxyisocapronate dehydrogenases and some bacterial L-lactate dehydrogenases	NA|338aa|up_2|NZ_CP009351.1_4381919_4382933_+	pfam13303, PTS_EIIC_2, Phosphotransferase system, EIIC	NA|216aa|up_1|NZ_CP009351.1_4382992_4383640_-	cd03386, PAP2_Aur1_like, PAP2_like proteins, Aur1_like subfamily	NA|176aa|up_0|NZ_CP009351.1_4383784_4384312_+	NA	NA|122aa|down_0|NZ_CP009351.1_4384723_4385089_+	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|128aa|down_1|NZ_CP009351.1_4385130_4385514_+	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|115aa|down_2|NZ_CP009351.1_4386244_4386589_+	COG1658, COG1658, Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair]	NA|100aa|down_3|NZ_CP009351.1_4386601_4386901_+	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|115aa|down_4|NZ_CP009351.1_4387053_4387398_+	NA	NA|342aa|down_5|NZ_CP009351.1_4387811_4388837_+	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|222aa|down_6|NZ_CP009351.1_4388829_4389495_+	COG2011, AbcD, ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]	NA|271aa|down_7|NZ_CP009351.1_4389518_4390331_+	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|269aa|down_8|NZ_CP009351.1_4390402_4391209_+	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|262aa|down_9|NZ_CP009351.1_4391447_4392233_+	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]
GCF_000835025.1_ASM83502v1	NZ_CP009348	Bacillus thuringiensis HD1002 plasmid 2, complete sequence	1	132888-133348	1	CRISPRCasFinder	no			Orphan	GATATCATTGAAGACGAAGATGA	23	4	7	133007-133022|133007-133022|133046-133064|133046-133064|133088-133106|133307-133325|133307-133325	NZ_CP009348.1_3703-3718|NZ_CP009348.1_345124-345139|NZ_CP009351.1_4820705-4820723|NZ_CP009348.1_322324-322342|NZ_CP009348.1_322324-322342|NZ_CP009351.1_2954328-2954310|NZ_CP009351.1_3586044-3586026	NA	9	9	Orphan	DEDDh,cas14k,WYL,csa3,cas3,DinG,cas14j,c2c9_V-U4,RT,cas4	NA|172aa|up_9|NZ_CP009348.1_126096_126612_+,NA|69aa|up_8|NZ_CP009348.1_126624_126831_+,NA|62aa|up_7|NZ_CP009348.1_126817_127003_+,NA|65aa|up_6|NZ_CP009348.1_127205_127400_+,NA|339aa|up_5|NZ_CP009348.1_127431_128448_+,NA|120aa|up_2|NZ_CP009348.1_130499_130859_+,NA|178aa|down_0|NZ_CP009348.1_133479_134013_+,NA|186aa|down_1|NZ_CP009348.1_134128_134686_+,NA|164aa|down_2|NZ_CP009348.1_134689_135181_+,NA|226aa|down_3|NZ_CP009348.1_135198_135876_+,NA|207aa|down_4|NZ_CP009348.1_135898_136519_+,NA|161aa|down_5|NZ_CP009348.1_136542_137025_+,NA|160aa|down_6|NZ_CP009348.1_137047_137527_+,NA|173aa|down_7|NZ_CP009348.1_137827_138346_+,NA|187aa|down_8|NZ_CP009348.1_138347_138908_+,NA|246aa|down_9|NZ_CP009348.1_138950_139688_+	NA|172aa|up_9|NZ_CP009348.1_126096_126612_+	NA	NA|69aa|up_8|NZ_CP009348.1_126624_126831_+	NA	NA|62aa|up_7|NZ_CP009348.1_126817_127003_+	NA	NA|65aa|up_6|NZ_CP009348.1_127205_127400_+	NA	NA|339aa|up_5|NZ_CP009348.1_127431_128448_+	NA	NA|418aa|up_4|NZ_CP009348.1_128435_129689_+	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|263aa|up_3|NZ_CP009348.1_129695_130484_+	cd17792, CtkA, serine/threonine-protein kinase CtkA and similar proteins	NA|120aa|up_2|NZ_CP009348.1_130499_130859_+	NA	NA|242aa|up_1|NZ_CP009348.1_130889_131615_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|316aa|up_0|NZ_CP009348.1_131713_132661_+	PRK13480, PRK13480, 3'-5' exoribonuclease YhaM; Provisional	NA|178aa|down_0|NZ_CP009348.1_133479_134013_+	NA	NA|186aa|down_1|NZ_CP009348.1_134128_134686_+	NA	NA|164aa|down_2|NZ_CP009348.1_134689_135181_+	NA	NA|226aa|down_3|NZ_CP009348.1_135198_135876_+	NA	NA|207aa|down_4|NZ_CP009348.1_135898_136519_+	NA	NA|161aa|down_5|NZ_CP009348.1_136542_137025_+	NA	NA|160aa|down_6|NZ_CP009348.1_137047_137527_+	NA	NA|173aa|down_7|NZ_CP009348.1_137827_138346_+	NA	NA|187aa|down_8|NZ_CP009348.1_138347_138908_+	NA	NA|246aa|down_9|NZ_CP009348.1_138950_139688_+	NA
GCF_000835025.1_ASM83502v1	NZ_CP009348	Bacillus thuringiensis HD1002 plasmid 2, complete sequence	2	289321-289458	2	CRISPRCasFinder	no			Orphan	CTAAACCTGTACAAAAGCAAGAAG	24	0	0	NA	NA	NA	2	2	Orphan	DEDDh,cas14k,WYL,csa3,cas3,DinG,cas14j,c2c9_V-U4,RT,cas4	NA|137aa|up_9|NZ_CP009348.1_260157_260568_+,NA|194aa|up_3|NZ_CP009348.1_274517_275099_+,NA|139aa|up_2|NZ_CP009348.1_275178_275595_+,NA|133aa|down_1|NZ_CP009348.1_290987_291386_+,NA|204aa|down_2|NZ_CP009348.1_291463_292075_+,NA|247aa|down_3|NZ_CP009348.1_292182_292923_+,NA|139aa|down_4|NZ_CP009348.1_292942_293359_+,NA|127aa|down_5|NZ_CP009348.1_293364_293745_+,NA|120aa|down_6|NZ_CP009348.1_293763_294123_+,NA|237aa|down_7|NZ_CP009348.1_294134_294845_+,NA|86aa|down_8|NZ_CP009348.1_295169_295427_+,NA|108aa|down_9|NZ_CP009348.1_295447_295771_+	NA|137aa|up_9|NZ_CP009348.1_260157_260568_+	NA	NA|396aa|up_8|NZ_CP009348.1_260804_261992_+	smart00475, 53EXOc, 5'-3' exonuclease	NA|423aa|up_7|NZ_CP009348.1_262201_263470_+	COG3786, COG3786, Uncharacterized protein conserved in bacteria [Function unknown]	NA|2561aa|up_6|NZ_CP009348.1_264109_271792_+	COG4932, COG4932, Predicted outer membrane protein [Cell envelope biogenesis, outer membrane]	NA|471aa|up_5|NZ_CP009348.1_272214_273627_+	COG4990, COG4990, Uncharacterized protein conserved in bacteria [Function unknown]	NA|237aa|up_4|NZ_CP009348.1_273787_274498_+	cd06165, Sortase_A, Sortase domain found in class A sortases	NA|194aa|up_3|NZ_CP009348.1_274517_275099_+	NA	NA|139aa|up_2|NZ_CP009348.1_275178_275595_+	NA	NA|353aa|up_1|NZ_CP009348.1_275847_276906_+	pfam16403, DUF5011, Domain of unknown function (DUF5011)	NA|3527aa|up_0|NZ_CP009348.1_277484_288065_+	TIGR04226, Fimbrial_subunit_type_2, fimbrial isopeptide formation D2 domain	NA|283aa|down_0|NZ_CP009348.1_289972_290821_+	cd10446, GIY-YIG_unchar_1, GIY-YIG domain of uncharacterized hypothetical protein found in bacteria	NA|133aa|down_1|NZ_CP009348.1_290987_291386_+	NA	NA|204aa|down_2|NZ_CP009348.1_291463_292075_+	NA	NA|247aa|down_3|NZ_CP009348.1_292182_292923_+	NA	NA|139aa|down_4|NZ_CP009348.1_292942_293359_+	NA	NA|127aa|down_5|NZ_CP009348.1_293364_293745_+	NA	NA|120aa|down_6|NZ_CP009348.1_293763_294123_+	NA	NA|237aa|down_7|NZ_CP009348.1_294134_294845_+	NA	NA|86aa|down_8|NZ_CP009348.1_295169_295427_+	NA	NA|108aa|down_9|NZ_CP009348.1_295447_295771_+	NA
