assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000021305.1_ASM2130v1	NC_011772	Bacillus cereus G9842, complete genome	1	619757-619832	1	CRISPRCasFinder	no	csa3	cas3,cas14k,csa3,WYL,RT,c2c10_CAS-V-U3,DinG,cas14j,DEDDh,Cas14u_CAS-V	Type I-A	ATCATCATCATGGAGGACACAATCA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14k,csa3,WYL,RT,c2c10_CAS-V-U3,DinG,cas14j,DEDDh,Cas14u_CAS-V	NA,NA	NA|335aa|up_9|NC_011772.1_608478_609483_+	pfam01032, FecCD, FecCD transport family	NA|353aa|up_8|NC_011772.1_609479_610538_+	pfam01032, FecCD, FecCD transport family	NA|274aa|up_7|NC_011772.1_610550_611372_+	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|244aa|up_6|NC_011772.1_611403_612135_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|397aa|up_5|NC_011772.1_612348_613539_+	PRK06939, PRK06939, 2-amino-3-ketobutyrate coenzyme A ligase; Provisional	NA|322aa|up_4|NC_011772.1_613583_614549_+	cd05272, TDH_SDR_e, L-threonine dehydrogenase, extended (e) SDRs	NA|141aa|up_3|NC_011772.1_614608_615031_+	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|628aa|up_2|NC_011772.1_615068_616952_-	COG4548, NorD, Nitric oxide reductase activation protein [Inorganic ion transport and metabolism]	NA|298aa|up_1|NC_011772.1_616955_617849_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|510aa|up_0|NC_011772.1_617976_619506_-	PRK12452, PRK12452, cardiolipin synthase	NA|568aa|down_0|NC_011772.1_620579_622283_+	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|466aa|down_1|NC_011772.1_622314_623712_-	TIGR00905, Arginine/ornithine_antiporter, transporter, basic amino acid/polyamine antiporter (APA) family	NA|237aa|down_2|NC_011772.1_624164_624875_+	TIGR02404, Trehalose_operon_transcriptional_repressor, trehalose operon repressor, B	NA|476aa|down_3|NC_011772.1_625017_626445_+	TIGR01992, phosphotransferase_system_trehalose_permease, PTS system, trehalose-specific IIBC component	NA|554aa|down_4|NC_011772.1_626458_628120_+	TIGR02403, Trehalose-6-phosphate_hydrolase, alpha,alpha-phosphotrehalase	NA|375aa|down_5|NC_011772.1_628152_629277_-	TIGR02887, Spore_germination_protein_B3, germination protein, Ger(x)C family	NA|369aa|down_6|NC_011772.1_629257_630364_-	pfam03845, Spore_permease, Spore germination protein	NA|501aa|down_7|NC_011772.1_630344_631847_-	pfam03323, GerA, Bacillus/Clostridium GerA spore germination protein	NA|324aa|down_8|NC_011772.1_632035_633007_+	COG2334, COG2334, Putative homoserine kinase type II (protein kinase fold) [General function prediction only]	NA|487aa|down_9|NC_011772.1_633166_634627_+	pfam01235, Na_Ala_symp, Sodium:alanine symporter family
GCF_000021305.1_ASM2130v1	NC_011772	Bacillus cereus G9842, complete genome	2	4895991-4896107	2	CRISPRCasFinder	no		cas3,cas14k,csa3,WYL,RT,c2c10_CAS-V-U3,DinG,cas14j,DEDDh,Cas14u_CAS-V	Orphan	CTTAAACAAGCGTTTGATTAATTCTCCATTTTTCTT	36	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14k,csa3,WYL,RT,c2c10_CAS-V-U3,DinG,cas14j,DEDDh,Cas14u_CAS-V	NA|115aa|up_4|NC_011772.1_4893000_4893345_-,NA|176aa|down_0|NC_011772.1_4896206_4896734_-,NA|62aa|down_5|NC_011772.1_4900079_4900265_-	NA|262aa|up_9|NC_011772.1_4888164_4888950_-	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]	NA|269aa|up_8|NC_011772.1_4889188_4889995_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|271aa|up_7|NC_011772.1_4890066_4890879_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|222aa|up_6|NC_011772.1_4890902_4891568_-	COG2011, AbcD, ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]	NA|342aa|up_5|NC_011772.1_4891560_4892586_-	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|115aa|up_4|NC_011772.1_4893000_4893345_-	NA	NA|100aa|up_3|NC_011772.1_4893497_4893797_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|115aa|up_2|NC_011772.1_4893809_4894154_-	COG1658, COG1658, Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair]	NA|128aa|up_1|NC_011772.1_4895004_4895388_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|122aa|up_0|NC_011772.1_4895429_4895795_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|176aa|down_0|NC_011772.1_4896206_4896734_-	NA	NA|216aa|down_1|NC_011772.1_4896878_4897526_+	cd03386, PAP2_Aur1_like, PAP2_like proteins, Aur1_like subfamily	NA|338aa|down_2|NC_011772.1_4897585_4898599_-	pfam13303, PTS_EIIC_2, Phosphotransferase system, EIIC	NA|390aa|down_3|NC_011772.1_4898621_4899791_-	cd05291, HicDH_like, L-2-hydroxyisocapronate dehydrogenases and some bacterial L-lactate dehydrogenases	NA|83aa|down_4|NC_011772.1_4899817_4900066_-	pfam07875, Coat_F, Coat F domain	NA|62aa|down_5|NC_011772.1_4900079_4900265_-	NA	NA|240aa|down_6|NC_011772.1_4900378_4901098_-	cd07721, yflN-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis yflN; MBL-fold metallo hydrolase domain	NA|601aa|down_7|NC_011772.1_4901213_4903016_-	cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase	NA|391aa|down_8|NC_011772.1_4903142_4904315_-	PRK07661, PRK07661, acetyl-CoA C-acetyltransferase	NA|794aa|down_9|NC_011772.1_4904336_4906718_-	COG1250, FadB, 3-hydroxyacyl-CoA dehydrogenase [Lipid metabolism]
GCF_000021305.1_ASM2130v1	NC_011772	Bacillus cereus G9842, complete genome	3	5154822-5154955	3	CRISPRCasFinder	no		cas3,cas14k,csa3,WYL,RT,c2c10_CAS-V-U3,DinG,cas14j,DEDDh,Cas14u_CAS-V	Orphan	GTTGATTTCTCTTCTTTTTGAGA	23	0	0	NA	NA	NA	2	2	Orphan	cas3,cas14k,csa3,WYL,RT,c2c10_CAS-V-U3,DinG,cas14j,DEDDh,Cas14u_CAS-V	NA|45aa|up_0|NC_011772.1_5154499_5154634_-,NA	NA|229aa|up_9|NC_011772.1_5146474_5147161_-	pfam02397, Bac_transf, Bacterial sugar transferase	NA|293aa|up_8|NC_011772.1_5147177_5148056_-	COG1210, GalU, UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane]	NA|256aa|up_7|NC_011772.1_5148295_5149063_-	COG4464, CapC, Capsular polysaccharide biosynthesis protein [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|234aa|up_6|NC_011772.1_5149175_5149877_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|248aa|up_5|NC_011772.1_5149866_5150610_-	COG3944, COG3944, Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]	NA|226aa|up_4|NC_011772.1_5150873_5151551_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|145aa|up_3|NC_011772.1_5151892_5152327_-	PRK00006, fabZ, 3-hydroxyacyl-ACP dehydratase FabZ	NA|334aa|up_2|NC_011772.1_5152756_5153758_-	PRK13928, PRK13928, rod shape-determining protein Mbl; Provisional	NA|91aa|up_1|NC_011772.1_5153918_5154191_-	pfam12116, SpoIIID, Stage III sporulation protein D	NA|45aa|up_0|NC_011772.1_5154499_5154634_-	NA	NA|236aa|down_0|NC_011772.1_5155843_5156551_-	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|281aa|down_1|NC_011772.1_5156550_5157393_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|336aa|down_2|NC_011772.1_5157574_5158582_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|340aa|down_3|NC_011772.1_5158679_5159699_-	TIGR02870, Stage_II_sporulation_protein_D, stage II sporulation protein D	NA|435aa|down_4|NC_011772.1_5159908_5161213_-	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|237aa|down_5|NC_011772.1_5161252_5161963_-	pfam08680, DUF1779, TATA-box binding	NA|79aa|down_6|NC_011772.1_5162008_5162245_-	COG4836, COG4836, Predicted membrane protein [Function unknown]	NA|507aa|down_7|NC_011772.1_5162447_5163968_-	PRK05777, PRK05777, NADH-quinone oxidoreductase subunit NuoN	NA|501aa|down_8|NC_011772.1_5163969_5165472_-	PRK05846, PRK05846, NADH:ubiquinone oxidoreductase subunit M; Reviewed	NA|621aa|down_9|NC_011772.1_5165468_5167331_-	PRK06590, PRK06590, NADH:ubiquinone oxidoreductase subunit L; Reviewed
GCF_000021305.1_ASM2130v1	NC_011775	Bacillus cereus G9842 plasmid pG9842_209, complete sequence	1	30936-31372	1	CRT	no		csa3	Orphan	ACAAAGCCAGANACAAAACCAGAGACAAANCCAGANACAAA	41	1	2	31097-31127|31097-31127	NC_011775.1_31349-31379|NC_011775.1_31361-31391	NA	6	6	Orphan	cas3,cas14k,csa3,WYL,RT,c2c10_CAS-V-U3,DinG,cas14j,DEDDh,Cas14u_CAS-V	NA|159aa|up_9|NC_011775.1_17155_17632_+,NA|571aa|up_7|NC_011775.1_19429_21142_-,NA|485aa|up_6|NC_011775.1_21848_23303_+,NA|147aa|up_4|NC_011775.1_25567_26008_-,NA|248aa|up_3|NC_011775.1_26313_27057_+,NA|200aa|up_2|NC_011775.1_27110_27710_+,NA|215aa|up_0|NC_011775.1_28963_29608_-,NA|189aa|down_0|NC_011775.1_31827_32394_+,NA|288aa|down_2|NC_011775.1_35984_36848_+,NA|236aa|down_8|NC_011775.1_42477_43185_+,NA|180aa|down_9|NC_011775.1_43196_43736_+	NA|159aa|up_9|NC_011775.1_17155_17632_+	NA	NA|136aa|up_8|NC_011775.1_17839_18247_+	cd04496, SSB_OBF, SSB_OBF: A subfamily of OB folds similar to the OB fold of ssDNA-binding protein (SSB)	NA|571aa|up_7|NC_011775.1_19429_21142_-	NA	NA|485aa|up_6|NC_011775.1_21848_23303_+	NA	NA|575aa|up_5|NC_011775.1_23353_25078_+	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|147aa|up_4|NC_011775.1_25567_26008_-	NA	NA|248aa|up_3|NC_011775.1_26313_27057_+	NA	NA|200aa|up_2|NC_011775.1_27110_27710_+	NA	NA|362aa|up_1|NC_011775.1_27805_28891_+	pfam01170, UPF0020, Putative RNA methylase family UPF0020	NA|215aa|up_0|NC_011775.1_28963_29608_-	NA	NA|189aa|down_0|NC_011775.1_31827_32394_+	NA	NA|1117aa|down_1|NC_011775.1_32608_35959_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|288aa|down_2|NC_011775.1_35984_36848_+	NA	NA|333aa|down_3|NC_011775.1_37076_38075_+	pfam12697, Abhydrolase_6, Alpha/beta hydrolase family	NA|451aa|down_4|NC_011775.1_38096_39449_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|187aa|down_5|NC_011775.1_39479_40040_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|64aa|down_6|NC_011775.1_40477_40669_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|392aa|down_7|NC_011775.1_40880_42056_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|236aa|down_8|NC_011775.1_42477_43185_+	NA	NA|180aa|down_9|NC_011775.1_43196_43736_+	NA
