assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002214725.1_ASM221472v1	NZ_CP016316	Bacillus cereus strain M3, complete sequence	1	865607-865698	1	CRISPRCasFinder	no		cas3,c2c9_V-U4,csa3,cas14k,WYL,cas4,c2c10_CAS-V-U3,DinG,cas14j,RT,Cas14u_CAS-V,DEDDh	Orphan	TTTGTTAGCAATTGCAAATGCTT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,c2c9_V-U4,csa3,cas14k,WYL,cas4,c2c10_CAS-V-U3,DinG,cas14j,RT,Cas14u_CAS-V,DEDDh	NA|68aa|up_4|NZ_CP016316.1_860114_860318_-,NA|82aa|down_6|NZ_CP016316.1_874362_874608_+	NA|113aa|up_9|NZ_CP016316.1_855276_855615_-	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|180aa|up_8|NZ_CP016316.1_855784_856324_+	pfam07366, SnoaL, SnoaL-like polyketide cyclase	NA|183aa|up_7|NZ_CP016316.1_856347_856896_+	pfam02525, Flavodoxin_2, Flavodoxin-like fold	NA|587aa|up_6|NZ_CP016316.1_856968_858729_-	cd08507, PBP2_SgrR_like, The C-terminal solute-binding domain of DNA-binding transcriptional regulator SgrR is related to the ABC-type oligopeptide-binding proteins and contains the type 2 periplasmic-binding fold	NA|419aa|up_5|NZ_CP016316.1_858813_860070_+	cd17329, MFS_MdtH_MDR_like, Multidrug resistance protein MdtH and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|68aa|up_4|NZ_CP016316.1_860114_860318_-	NA	NA|172aa|up_3|NZ_CP016316.1_860557_861073_-	cd03134, GATase1_PfpI_like, A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus	NA|400aa|up_2|NZ_CP016316.1_861352_862552_-	pfam05975, EcsB, Bacterial ABC transporter protein EcsB	NA|475aa|up_1|NZ_CP016316.1_862612_864037_-	TIGR03810, arginine-ornithine_antiporter, arginine-ornithine antiporter	NA|276aa|up_0|NZ_CP016316.1_864568_865396_+	PRK00865, PRK00865, glutamate racemase; Provisional	NA|587aa|down_0|NZ_CP016316.1_867477_869238_+	PRK10789, PRK10789, SmdA family multidrug ABC transporter permease/ATP-binding protein	NA|667aa|down_1|NZ_CP016316.1_869234_871235_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|266aa|down_2|NZ_CP016316.1_871505_872303_+	cd13711, PBP2_Ngo0372_TcyA, Substrate binding domain of ABC transporters involved in cystine import; the type 2 periplasmic binding protein fold	NA|233aa|down_3|NZ_CP016316.1_872289_872988_+	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|245aa|down_4|NZ_CP016316.1_873016_873751_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|68aa|down_5|NZ_CP016316.1_873802_874006_-	pfam00269, SASP, Small, acid-soluble spore proteins, alpha/beta type	NA|82aa|down_6|NZ_CP016316.1_874362_874608_+	NA	NA|487aa|down_7|NZ_CP016316.1_874624_876085_-	pfam14398, ATPgrasp_YheCD, YheC/D like ATP-grasp	NA|351aa|down_8|NZ_CP016316.1_876105_877158_-	pfam14398, ATPgrasp_YheCD, YheC/D like ATP-grasp	NA|379aa|down_9|NZ_CP016316.1_877292_878429_+	COG4399, COG4399, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_002214725.1_ASM221472v1	NZ_CP016316	Bacillus cereus strain M3, complete sequence	2	2746050-2746244	2	CRISPRCasFinder	no		cas3,c2c9_V-U4,csa3,cas14k,WYL,cas4,c2c10_CAS-V-U3,DinG,cas14j,RT,Cas14u_CAS-V,DEDDh	Orphan	TTTCGGAATGAACGTTCATTCCT	23	0	0	NA	NA	NA	3	3	Orphan	cas3,c2c9_V-U4,csa3,cas14k,WYL,cas4,c2c10_CAS-V-U3,DinG,cas14j,RT,Cas14u_CAS-V,DEDDh	NA|142aa|up_9|NZ_CP016316.1_2731422_2731848_-,NA|152aa|up_2|NZ_CP016316.1_2741077_2741533_+,NA	NA|142aa|up_9|NZ_CP016316.1_2731422_2731848_-	NA	NA|424aa|up_8|NZ_CP016316.1_2732018_2733290_-	PRK09410, ulaA, PTS system ascorbate-specific transporter subunit IIC; Reviewed	NA|90aa|up_7|NZ_CP016316.1_2733314_2733584_-	cd05563, PTS_IIB_ascorbate, PTS_IIB_ascorbate: subunit IIB of enzyme II (EII) of the L-ascorbate-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS)	NA|702aa|up_6|NZ_CP016316.1_2733597_2735703_-	COG3711, BglG, Transcriptional antiterminator [Transcription]	NA|224aa|up_5|NZ_CP016316.1_2736009_2736681_+	TIGR03717, R_switched_YjbE, integral membrane protein, YjbE family	NA|658aa|up_4|NZ_CP016316.1_2736739_2738713_-	COG1368, MdoB, Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily [Cell envelope biogenesis, outer membrane]	NA|542aa|up_3|NZ_CP016316.1_2739233_2740859_-	PRK15064, PRK15064, ABC transporter ATP-binding protein; Provisional	NA|152aa|up_2|NZ_CP016316.1_2741077_2741533_+	NA	NA|283aa|up_1|NZ_CP016316.1_2741712_2742561_-	PRK06761, PRK06761, hypothetical protein; Provisional	NA|1039aa|up_0|NZ_CP016316.1_2742897_2746014_-	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|189aa|down_0|NZ_CP016316.1_2746264_2746831_-	pfam16295, TetR_C_10, Tetracycline repressor, C-terminal all-alpha domain	NA|386aa|down_1|NZ_CP016316.1_2747051_2748209_-	COG4552, Eis, Predicted acetyltransferase involved in intracellular survival and related acetyltransferases [General function prediction only]	NA|430aa|down_2|NZ_CP016316.1_2748345_2749635_-	PRK02427, PRK02427, 3-phosphoshikimate 1-carboxyvinyltransferase; Provisional	NA|367aa|down_3|NZ_CP016316.1_2749651_2750752_-	PRK06545, PRK06545, prephenate dehydrogenase; Validated	NA|367aa|down_4|NZ_CP016316.1_2750744_2751845_-	PRK01533, PRK01533, histidinol-phosphate aminotransferase; Validated	NA|391aa|down_5|NZ_CP016316.1_2751863_2753036_-	PRK12463, PRK12463, chorismate synthase; Reviewed	NA|359aa|down_6|NZ_CP016316.1_2753315_2754392_-	PRK12595, PRK12595, bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase; Reviewed	NA|1172aa|down_7|NZ_CP016316.1_2755222_2758738_-	cd01406, SIR2-like, Sir2-like: Prokaryotic group of uncharacterized Sir2-like proteins which lack certain key catalytic residues and conserved zinc binding cysteines; and are members of the SIR2 superfamily of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation	NA|146aa|down_8|NZ_CP016316.1_2758888_2759326_+	pfam10710, DUF2512, Protein of unknown function (DUF2512)	NA|177aa|down_9|NZ_CP016316.1_2759668_2760199_-	cd01014, nicotinamidase_related, Nicotinamidase_ related amidohydrolases
GCF_002214725.1_ASM221472v1	NZ_CP016316	Bacillus cereus strain M3, complete sequence	3	3145294-3145496	1	PILER-CR	no		cas3,c2c9_V-U4,csa3,cas14k,WYL,cas4,c2c10_CAS-V-U3,DinG,cas14j,RT,Cas14u_CAS-V,DEDDh	Orphan	GGTCCCTGTATGCCTTGAATTCCTTGTGGACCGGTTGGACC	41	0	0	NA	NA	NA	2	2	Orphan	cas3,c2c9_V-U4,csa3,cas14k,WYL,cas4,c2c10_CAS-V-U3,DinG,cas14j,RT,Cas14u_CAS-V,DEDDh	NA,NA|126aa|down_4|NZ_CP016316.1_3152036_3152414_+	NA|280aa|up_9|NZ_CP016316.1_3135683_3136523_+	pfam10978, DUF2785, Protein of unknown function (DUF2785)	NA|150aa|up_8|NZ_CP016316.1_3136540_3136990_+	cd04677, Nudix_Hydrolase_18, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|127aa|up_7|NZ_CP016316.1_3137049_3137430_-	cd07241, VOC_BsYyaH, vicinal oxygen chelate (VOC) family protein similar to Bacillus subtilis YyaH	NA|343aa|up_6|NZ_CP016316.1_3137491_3138520_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|188aa|up_5|NZ_CP016316.1_3138856_3139420_-	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|333aa|up_4|NZ_CP016316.1_3139630_3140629_-	PRK12556, PRK12556, tryptophanyl-tRNA synthetase; Provisional	NA|121aa|up_3|NZ_CP016316.1_3140987_3141350_-	cd07245, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|374aa|up_2|NZ_CP016316.1_3141737_3142859_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|83aa|up_1|NZ_CP016316.1_3142922_3143171_-	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|154aa|up_0|NZ_CP016316.1_3143307_3143769_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|812aa|down_0|NZ_CP016316.1_3146649_3149085_-	pfam03157, Glutenin_hmw, High molecular weight glutenin subunit	NA|252aa|down_1|NZ_CP016316.1_3149232_3149988_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|180aa|down_2|NZ_CP016316.1_3150030_3150570_-	pfam11706, zf-CGNR, CGNR zinc finger	NA|396aa|down_3|NZ_CP016316.1_3150682_3151870_+	cd17329, MFS_MdtH_MDR_like, Multidrug resistance protein MdtH and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|126aa|down_4|NZ_CP016316.1_3152036_3152414_+	NA	NA|80aa|down_5|NZ_CP016316.1_3152467_3152707_+	pfam13061, DUF3923, Protein of unknown function (DUF3923)	NA|183aa|down_6|NZ_CP016316.1_3152813_3153362_-	PRK04164, PRK04164, hypothetical protein; Provisional	NA|138aa|down_7|NZ_CP016316.1_3153560_3153974_-	pfam03899, ATP-synt_I, ATP synthase I chain	NA|443aa|down_8|NZ_CP016316.1_3154027_3155356_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|136aa|down_9|NZ_CP016316.1_3155475_3155883_-	cd04779, HTH_MerR-like_sg4, Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily
GCF_002214725.1_ASM221472v1	NZ_CP016316	Bacillus cereus strain M3, complete sequence	4	4529294-4529377	3	CRISPRCasFinder	no		cas3,c2c9_V-U4,csa3,cas14k,WYL,cas4,c2c10_CAS-V-U3,DinG,cas14j,RT,Cas14u_CAS-V,DEDDh	Orphan	AATCGCCGATATAATTTGATTTA	23	0	0	NA	NA	NA	1	1	Orphan	cas3,c2c9_V-U4,csa3,cas14k,WYL,cas4,c2c10_CAS-V-U3,DinG,cas14j,RT,Cas14u_CAS-V,DEDDh	NA|76aa|up_6|NZ_CP016316.1_4524897_4525125_-,NA|372aa|up_5|NZ_CP016316.1_4525339_4526455_-,NA|106aa|up_2|NZ_CP016316.1_4527902_4528220_-,NA|77aa|up_0|NZ_CP016316.1_4528454_4528685_+,NA	NA|336aa|up_9|NZ_CP016316.1_4522280_4523288_+	COG0715, TauA, ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components [Inorganic ion transport and metabolism]	NA|255aa|up_8|NZ_CP016316.1_4523299_4524064_+	cd03293, ABC_NrtD_SsuB_transporters, ATP-binding cassette domain of the nitrate and sulfonate transporters	NA|269aa|up_7|NZ_CP016316.1_4524056_4524863_+	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]	NA|76aa|up_6|NZ_CP016316.1_4524897_4525125_-	NA	NA|372aa|up_5|NZ_CP016316.1_4525339_4526455_-	NA	NA|180aa|up_4|NZ_CP016316.1_4526684_4527224_-	pfam11667, DUF3267, Putative zincin peptidase	NA|160aa|up_3|NZ_CP016316.1_4527338_4527818_-	TIGR02705, conserved_hypothetical_protein, nucleoside triphosphatase YtkD	NA|106aa|up_2|NZ_CP016316.1_4527902_4528220_-	NA	NA|50aa|up_1|NZ_CP016316.1_4528303_4528453_+	NF033232, small_YtzI, YtzI protein	NA|77aa|up_0|NZ_CP016316.1_4528454_4528685_+	NA	NA|158aa|down_0|NZ_CP016316.1_4529421_4529895_-	PRK02260, PRK02260, S-ribosylhomocysteine lyase	NA|79aa|down_1|NZ_CP016316.1_4530023_4530260_+	PRK00041, PRK00041, hypothetical protein; Validated	NA|188aa|down_2|NZ_CP016316.1_4530256_4530820_-	cd03379, beta_CA_cladeD, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|450aa|down_3|NZ_CP016316.1_4531044_4532394_+	pfam01654, Cyt_bd_oxida_I, Cytochrome bd terminal oxidase subunit I	NA|343aa|down_4|NZ_CP016316.1_4532383_4533412_+	pfam02322, Cyt_bd_oxida_II, Cytochrome bd terminal oxidase subunit II	NA|78aa|down_5|NZ_CP016316.1_4533578_4533812_-	pfam10676, gerPA, Spore germination protein gerPA/gerPF	NA|346aa|down_6|NZ_CP016316.1_4534678_4535716_+	cd06259, YdcF-like, YdcF-like	NA|291aa|down_7|NZ_CP016316.1_4535903_4536776_+	COG1284, COG1284, Uncharacterized conserved protein [Function unknown]	NA|272aa|down_8|NZ_CP016316.1_4537183_4537999_-	pfam14042, DUF4247, Domain of unknown function (DUF4247)	NA|169aa|down_9|NZ_CP016316.1_4538011_4538518_-	pfam13785, DUF4178, Domain of unknown function (DUF4178)
GCF_002214725.1_ASM221472v1	NZ_CP016317	Bacillus cereus strain M3 plasmid pBCM301, complete sequence	1	143440-143816	1	CRT	no	cas14j	csa3,RT,cas14j	Unclear	CCAGAAGAAAAACCAGAAGTGAAGCCAGA	29	3	3	143469-143487|143517-143535|143565-143583	NZ_CP016316.1_2981888-2981870|NZ_CP016316.1_1779977-1779995|NZ_CP016316.1_1779977-1779995	NA	7	7	TypeV	cas3,c2c9_V-U4,csa3,cas14k,WYL,cas4,c2c10_CAS-V-U3,DinG,cas14j,RT,Cas14u_CAS-V,DEDDh	NA|112aa|up_7|NZ_CP016317.1_133365_133701_-,NA|379aa|up_4|NZ_CP016317.1_137349_138486_+,NA|318aa|up_3|NZ_CP016317.1_139195_140149_+,NA|500aa|up_2|NZ_CP016317.1_140168_141668_+,NA|150aa|up_1|NZ_CP016317.1_141684_142134_+,NA|784aa|down_0|NZ_CP016317.1_144396_146748_+,NA|85aa|down_3|NZ_CP016317.1_150101_150356_+	NA|429aa|up_9|NZ_CP016317.1_130308_131595_-	pfam13814, Replic_Relax, Replication-relaxation	NA|436aa|up_8|NZ_CP016317.1_132031_133339_-	smart00864, Tubulin, Tubulin/FtsZ family, GTPase domain	NA|112aa|up_7|NZ_CP016317.1_133365_133701_-	NA	NA|202aa|up_6|NZ_CP016317.1_134287_134893_+	cd04765, HTH_MlrA-like_sg2, Helix-Turn-Helix DNA binding domain of putative MlrA-like transcription regulators	cas14j|371aa|up_5|NZ_CP016317.1_135210_136323_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|379aa|up_4|NZ_CP016317.1_137349_138486_+	NA	NA|318aa|up_3|NZ_CP016317.1_139195_140149_+	NA	NA|500aa|up_2|NZ_CP016317.1_140168_141668_+	NA	NA|150aa|up_1|NZ_CP016317.1_141684_142134_+	NA	NA|132aa|up_0|NZ_CP016317.1_142159_142555_+	pfam14208, DUF4320, Domain of unknown function (DUF4320)	NA|784aa|down_0|NZ_CP016317.1_144396_146748_+	NA	NA|454aa|down_1|NZ_CP016317.1_147294_148655_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|414aa|down_2|NZ_CP016317.1_148729_149971_+	pfam03738, GSP_synth, Glutathionylspermidine synthase preATP-grasp	NA|85aa|down_3|NZ_CP016317.1_150101_150356_+	NA	NA|131aa|down_4|NZ_CP016317.1_150548_150941_+	cd02077, P-type_ATPase_Mg, magnesium transporting ATPase (MgtA), similar to Escherichia coli MgtA and Salmonella typhimurium MgtA	NA|236aa|down_5|NZ_CP016317.1_150990_151698_+	pfam02517, Abi, CAAX protease self-immunity	NA|172aa|down_6|NZ_CP016317.1_151742_152258_+	cd07503, HAD_HisB-N, histidinol phosphate phosphatase and related phosphatases	NA|988aa|down_7|NZ_CP016317.1_152522_155486_-	pfam01526, DDE_Tnp_Tn3, Tn3 transposase DDE domain	NA|376aa|down_8|NZ_CP016317.1_155795_156923_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|188aa|down_9|NZ_CP016317.1_157271_157835_+	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain
