assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002215175.1_ASM221517v1	NZ_CP016595	Bacillus cereus strain K8, complete sequence	1	626765-626840	1	CRISPRCasFinder	no	csa3	cas3,c2c9_V-U4,csa3,WYL,DinG,DEDDh	Type I-A	ATCATCATCATGGAGGACACAATCA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,c2c9_V-U4,csa3,WYL,DinG,DEDDh,cas14j	NA,NA	NA|335aa|up_9|NZ_CP016595.1_615489_616494_+	pfam01032, FecCD, FecCD transport family	NA|353aa|up_8|NZ_CP016595.1_616490_617549_+	pfam01032, FecCD, FecCD transport family	NA|274aa|up_7|NZ_CP016595.1_617561_618383_+	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|244aa|up_6|NZ_CP016595.1_618410_619142_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|397aa|up_5|NZ_CP016595.1_619355_620546_+	PRK06939, PRK06939, 2-amino-3-ketobutyrate coenzyme A ligase; Provisional	NA|322aa|up_4|NZ_CP016595.1_620590_621556_+	cd05272, TDH_SDR_e, L-threonine dehydrogenase, extended (e) SDRs	NA|141aa|up_3|NZ_CP016595.1_621615_622038_+	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|628aa|up_2|NZ_CP016595.1_622075_623959_-	COG4548, NorD, Nitric oxide reductase activation protein [Inorganic ion transport and metabolism]	NA|298aa|up_1|NZ_CP016595.1_623962_624856_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|510aa|up_0|NZ_CP016595.1_624983_626513_-	PRK12452, PRK12452, cardiolipin synthase	NA|568aa|down_0|NZ_CP016595.1_627564_629268_+	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|466aa|down_1|NZ_CP016595.1_629299_630697_-	TIGR00905, Arginine/ornithine_antiporter, transporter, basic amino acid/polyamine antiporter (APA) family	NA|237aa|down_2|NZ_CP016595.1_631152_631863_+	TIGR02404, Trehalose_operon_transcriptional_repressor, trehalose operon repressor, B	NA|476aa|down_3|NZ_CP016595.1_632004_633432_+	TIGR01992, phosphotransferase_system_trehalose_permease, PTS system, trehalose-specific IIBC component	NA|554aa|down_4|NZ_CP016595.1_633445_635107_+	TIGR02403, Trehalose-6-phosphate_hydrolase, alpha,alpha-phosphotrehalase	NA|375aa|down_5|NZ_CP016595.1_635140_636265_-	TIGR02887, Spore_germination_protein_B3, germination protein, Ger(x)C family	NA|369aa|down_6|NZ_CP016595.1_636245_637352_-	pfam03845, Spore_permease, Spore germination protein	NA|324aa|down_7|NZ_CP016595.1_639022_639994_+	COG2334, COG2334, Putative homoserine kinase type II (protein kinase fold) [General function prediction only]	NA|487aa|down_8|NZ_CP016595.1_640153_641614_+	pfam01235, Na_Ala_symp, Sodium:alanine symporter family	NA|244aa|down_9|NZ_CP016595.1_641746_642478_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]
GCF_002215175.1_ASM221517v1	NZ_CP016595	Bacillus cereus strain K8, complete sequence	2	1072671-1072764	2	CRISPRCasFinder	no		cas3,c2c9_V-U4,csa3,WYL,DinG,DEDDh	Orphan	GGTTTAAATACGTTAAATAGCAAAA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,c2c9_V-U4,csa3,WYL,DinG,DEDDh,cas14j	NA|100aa|up_4|NZ_CP016595.1_1067098_1067398_+,NA|282aa|up_3|NZ_CP016595.1_1067451_1068297_-,NA|71aa|down_0|NZ_CP016595.1_1074694_1074907_+,NA|143aa|down_8|NZ_CP016595.1_1079829_1080258_+	NA|349aa|up_9|NZ_CP016595.1_1059506_1060553_+	PRK00115, hemE, uroporphyrinogen decarboxylase; Validated	NA|312aa|up_8|NZ_CP016595.1_1060567_1061503_+	PRK12435, PRK12435, ferrochelatase; Provisional	NA|474aa|up_7|NZ_CP016595.1_1061522_1062944_+	PRK11883, PRK11883, protoporphyrinogen oxidase; Reviewed	NA|451aa|up_6|NZ_CP016595.1_1063005_1064358_-	pfam13218, DUF4026, Protein of unknown function (DUF4026)	NA|789aa|up_5|NZ_CP016595.1_1064599_1066966_+	COG2374, COG2374, Predicted extracellular nuclease [General function prediction only]	NA|100aa|up_4|NZ_CP016595.1_1067098_1067398_+	NA	NA|282aa|up_3|NZ_CP016595.1_1067451_1068297_-	NA	NA|134aa|up_2|NZ_CP016595.1_1068526_1068928_+	pfam03965, Penicillinase_R, Penicillinase repressor	NA|650aa|up_1|NZ_CP016595.1_1068930_1070880_+	pfam05569, Peptidase_M56, BlaR1 peptidase M56	NA|191aa|up_0|NZ_CP016595.1_1071152_1071725_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|71aa|down_0|NZ_CP016595.1_1074694_1074907_+	NA	NA|102aa|down_1|NZ_CP016595.1_1074910_1075216_+	pfam09860, DUF2087, Uncharacterized protein conserved in bacteria (DUF2087)	NA|118aa|down_2|NZ_CP016595.1_1075242_1075596_-	pfam14470, bPH_3, Bacterial PH domain	NA|170aa|down_3|NZ_CP016595.1_1075729_1076239_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|338aa|down_4|NZ_CP016595.1_1076433_1077447_+	COG1609, PurR, Transcriptional regulators [Transcription]	NA|43aa|down_5|NZ_CP016595.1_1077486_1077615_-	pfam14149, YhfH, YhfH-like protein	NA|245aa|down_6|NZ_CP016595.1_1077795_1078530_+	cd07716, RNaseZ_short-form-like_MBL-fold, uncharacterized bacterial subgroup of Ribonuclease Z, short form; MBL-fold metallo-hydrolase domain	NA|330aa|down_7|NZ_CP016595.1_1078539_1079529_+	TIGR00545, Probable_lipoate-protein_ligase_A, lipoyltransferase and lipoate-protein ligase	NA|143aa|down_8|NZ_CP016595.1_1079829_1080258_+	NA	NA|511aa|down_9|NZ_CP016595.1_1080419_1081952_+	PRK07656, PRK07656, long-chain-fatty-acid--CoA ligase; Validated
GCF_002215175.1_ASM221517v1	NZ_CP016595	Bacillus cereus strain K8, complete sequence	3	4610281-4610820	1	CRT	no	csa3	cas3,c2c9_V-U4,csa3,WYL,DinG,DEDDh	Type I-A	TGGACCTTGAATTCCTTG	18	1	2	4610371-4610388|4610371-4610388	NZ_CP016595.1_3190197-3190214|NZ_CP016595.1_4611715-4611732	NA	9	9	Orphan	cas3,c2c9_V-U4,csa3,WYL,DinG,DEDDh,cas14j	NA,NA|108aa|down_0|NZ_CP016595.1_4613819_4614143_-,NA|61aa|down_3|NZ_CP016595.1_4614969_4615152_+,NA|62aa|down_8|NZ_CP016595.1_4620774_4620960_+	NA|262aa|up_9|NZ_CP016595.1_4602254_4603040_-	PRK08936, PRK08936, glucose-1-dehydrogenase; Provisional	NA|286aa|up_8|NZ_CP016595.1_4603053_4603911_-	COG4975, GlcU, Putative glucose uptake permease [Carbohydrate transport and metabolism]	NA|140aa|up_7|NZ_CP016595.1_4603947_4604367_-	pfam13027, DUF3888, Protein of unknown function (DUF3888)	NA|78aa|up_6|NZ_CP016595.1_4604460_4604694_-	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|155aa|up_5|NZ_CP016595.1_4604696_4605161_-	COG0314, MoaE, Molybdopterin converting factor, large subunit [Coenzyme metabolism]	NA|174aa|up_4|NZ_CP016595.1_4605157_4605679_-	cd03116, MobB, molybdopterin-guanine dinucleotide biosynthesis protein B	NA|430aa|up_3|NZ_CP016595.1_4605642_4606932_-	cd00887, MoeA, MoeA family	NA|162aa|up_2|NZ_CP016595.1_4607015_4607501_+	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|338aa|up_1|NZ_CP016595.1_4607538_4608552_-	PRK12475, PRK12475, thiamine/molybdopterin biosynthesis MoeB-like protein; Provisional	NA|338aa|up_0|NZ_CP016595.1_4608568_4609582_-	TIGR02666, Cyclic_pyranopterin_monophosphate_synthase, molybdenum cofactor biosynthesis protein A, bacterial	NA|108aa|down_0|NZ_CP016595.1_4613819_4614143_-	NA	NA|75aa|down_1|NZ_CP016595.1_4614163_4614388_-	pfam10676, gerPA, Spore germination protein gerPA/gerPF	NA|102aa|down_2|NZ_CP016595.1_4614466_4614772_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|61aa|down_3|NZ_CP016595.1_4614969_4615152_+	NA	NA|375aa|down_4|NZ_CP016595.1_4615201_4616326_-	PRK06765, PRK06765, homoserine O-acetyltransferase; Provisional	NA|677aa|down_5|NZ_CP016595.1_4616478_4618509_+	pfam03323, GerA, Bacillus/Clostridium GerA spore germination protein	NA|367aa|down_6|NZ_CP016595.1_4618526_4619627_+	pfam03845, Spore_permease, Spore germination protein	NA|362aa|down_7|NZ_CP016595.1_4619596_4620682_+	TIGR02887, Spore_germination_protein_B3, germination protein, Ger(x)C family	NA|62aa|down_8|NZ_CP016595.1_4620774_4620960_+	NA	NA|141aa|down_9|NZ_CP016595.1_4621057_4621480_+	cd11545, NTP-PPase_YP_001813558, Nucleoside Triphosphate Pyrophosphohydrolase (EC 3
GCF_002215175.1_ASM221517v1	NZ_CP016595	Bacillus cereus strain K8, complete sequence	4	4851423-4851541	3	CRISPRCasFinder	no		cas3,c2c9_V-U4,csa3,WYL,DinG,DEDDh	Orphan	GTTTCTTCCTACCGAACATACAGCTTAAACAAACGTTT	38	0	0	NA	NA	NA	1	1	Orphan	cas3,c2c9_V-U4,csa3,WYL,DinG,DEDDh,cas14j	NA|115aa|up_4|NZ_CP016595.1_4848610_4848955_-,NA|176aa|down_0|NZ_CP016595.1_4851580_4852108_-,NA|62aa|down_5|NZ_CP016595.1_4855460_4855646_-	NA|431aa|up_9|NZ_CP016595.1_4843217_4844510_-	COG0719, SufB, Cysteine desulfurase activator SufB [Posttranslational modification, protein turnover, chaperones]	NA|262aa|up_8|NZ_CP016595.1_4844525_4845311_-	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]	NA|271aa|up_7|NZ_CP016595.1_4845549_4846362_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|222aa|up_6|NZ_CP016595.1_4846384_4847050_-	COG2011, AbcD, ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]	NA|342aa|up_5|NZ_CP016595.1_4847042_4848068_-	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|115aa|up_4|NZ_CP016595.1_4848610_4848955_-	NA	NA|100aa|up_3|NZ_CP016595.1_4849107_4849407_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|115aa|up_2|NZ_CP016595.1_4849419_4849764_-	COG1658, COG1658, Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair]	NA|128aa|up_1|NZ_CP016595.1_4850378_4850762_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|122aa|up_0|NZ_CP016595.1_4850803_4851169_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|176aa|down_0|NZ_CP016595.1_4851580_4852108_-	NA	NA|216aa|down_1|NZ_CP016595.1_4852251_4852899_+	cd03386, PAP2_Aur1_like, PAP2_like proteins, Aur1_like subfamily	NA|338aa|down_2|NZ_CP016595.1_4852964_4853978_-	pfam13303, PTS_EIIC_2, Phosphotransferase system, EIIC	NA|398aa|down_3|NZ_CP016595.1_4854000_4855194_-	cd05291, HicDH_like, L-2-hydroxyisocapronate dehydrogenases and some bacterial L-lactate dehydrogenases	NA|83aa|down_4|NZ_CP016595.1_4855198_4855447_-	pfam07875, Coat_F, Coat F domain	NA|62aa|down_5|NZ_CP016595.1_4855460_4855646_-	NA	NA|183aa|down_6|NZ_CP016595.1_4856336_4856885_-	pfam13305, WHG, WHG domain	NA|241aa|down_7|NZ_CP016595.1_4856888_4857611_-	cd07721, yflN-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis yflN; MBL-fold metallo hydrolase domain	NA|595aa|down_8|NZ_CP016595.1_4857752_4859537_-	cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase	NA|391aa|down_9|NZ_CP016595.1_4859758_4860931_-	PRK07661, PRK07661, acetyl-CoA C-acetyltransferase
GCF_002215175.1_ASM221517v1	NZ_CP016595	Bacillus cereus strain K8, complete sequence	5	5115689-5115822	4	CRISPRCasFinder	no		cas3,c2c9_V-U4,csa3,WYL,DinG,DEDDh	Orphan	GTTGATTTCTCTTCTTTTTGAGA	23	0	0	NA	NA	NA	2	2	Orphan	cas3,c2c9_V-U4,csa3,WYL,DinG,DEDDh,cas14j	NA|45aa|up_0|NZ_CP016595.1_5115366_5115501_-,NA	NA|229aa|up_9|NZ_CP016595.1_5107340_5108027_-	pfam02397, Bac_transf, Bacterial sugar transferase	NA|294aa|up_8|NZ_CP016595.1_5108044_5108926_-	COG1210, GalU, UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane]	NA|256aa|up_7|NZ_CP016595.1_5109168_5109936_-	COG4464, CapC, Capsular polysaccharide biosynthesis protein [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|234aa|up_6|NZ_CP016595.1_5110047_5110749_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|248aa|up_5|NZ_CP016595.1_5110738_5111482_-	COG3944, COG3944, Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]	NA|226aa|up_4|NZ_CP016595.1_5111740_5112418_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|145aa|up_3|NZ_CP016595.1_5112761_5113196_-	PRK00006, fabZ, 3-hydroxyacyl-ACP dehydratase FabZ	NA|334aa|up_2|NZ_CP016595.1_5113623_5114625_-	PRK13928, PRK13928, rod shape-determining protein Mbl; Provisional	NA|91aa|up_1|NZ_CP016595.1_5114785_5115058_-	pfam12116, SpoIIID, Stage III sporulation protein D	NA|45aa|up_0|NZ_CP016595.1_5115366_5115501_-	NA	NA|235aa|down_0|NZ_CP016595.1_5116710_5117415_-	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|281aa|down_1|NZ_CP016595.1_5117414_5118257_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|336aa|down_2|NZ_CP016595.1_5118437_5119445_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|340aa|down_3|NZ_CP016595.1_5119544_5120564_-	TIGR02870, Stage_II_sporulation_protein_D, stage II sporulation protein D	NA|435aa|down_4|NZ_CP016595.1_5120770_5122075_-	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|237aa|down_5|NZ_CP016595.1_5122114_5122825_-	pfam08680, DUF1779, TATA-box binding	NA|79aa|down_6|NZ_CP016595.1_5122870_5123107_-	COG4836, COG4836, Predicted membrane protein [Function unknown]	NA|507aa|down_7|NZ_CP016595.1_5123309_5124830_-	PRK05777, PRK05777, NADH-quinone oxidoreductase subunit NuoN	NA|501aa|down_8|NZ_CP016595.1_5124831_5126334_-	PRK05846, PRK05846, NADH:ubiquinone oxidoreductase subunit M; Reviewed	NA|621aa|down_9|NZ_CP016595.1_5126330_5128193_-	PRK06590, PRK06590, NADH:ubiquinone oxidoreductase subunit L; Reviewed
