assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_002250925.2_ASM225092v2	CP024104	Bacillus cytotoxicus strain CH_23 chromosome, complete genome	1	1577501-1577629	1	CRISPRCasFinder	no		cas14j,cas3,csa3,WYL,DinG,cas6,cas8b2,cas7,cas5,cas2,cas1,cas9,cas4,DEDDh	Orphan	AACTAGGGTGGTACCGCGATGTTTTAATCGTCCCTACG	38	0	0	NA	NA	NA	1	1	Orphan	cas14j,cas3,csa3,WYL,DinG,cas6,cas8b2,cas7,cas5,cas2,cas1,cas9,cas4,DEDDh,TnsE_C	NA|63aa|up_9|CP024104.1_1563655_1563844_-,NA|155aa|up_7|CP024104.1_1565843_1566308_-,NA	NA|63aa|up_9|CP024104.1_1563655_1563844_-	NA	NA|459aa|up_8|CP024104.1_1564411_1565788_+	COG4193, LytD, Beta- N-acetylglucosaminidase [Carbohydrate transport and metabolism]	NA|155aa|up_7|CP024104.1_1565843_1566308_-	NA	NA|271aa|up_6|CP024104.1_1566815_1567628_+	TIGR03708, poly_P_AMP_trns, polyphosphate:AMP phosphotransferase	NA|420aa|up_5|CP024104.1_1568126_1569386_+	TIGR03156, GTP_HflX, GTP-binding protein HflX	NA|523aa|up_4|CP024104.1_1569717_1571286_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|547aa|up_3|CP024104.1_1571388_1573029_-	cd08504, PBP2_OppA, The substrate-binding component of an ABC-type oligopetide import system contains the type 2 periplasmic binding fold	NA|403aa|up_2|CP024104.1_1573246_1574455_+	cd17478, MFS_FsR, Fosmidomycin resistance protein of the Major Facilitator Superfamily of transporters	NA|306aa|up_1|CP024104.1_1574470_1575388_+	TIGR01139, Cysteine_synthase, cysteine synthase A	NA|545aa|up_0|CP024104.1_1575457_1577092_-	cd13124, MATE_SpoVB_like, Stage V sporulation protein B, also known as Stage III sporulation protein F, and related proteins	NA|446aa|down_0|CP024104.1_1577732_1579070_+	cd10336, SLC6sbd_Tyt1-Like, solute carrier 6 subfamily, Fusobacterium nucleatum Tyt1-like; solute-binding domain	NA|89aa|down_1|CP024104.1_1579341_1579608_+	pfam08765, Mor, Mor transcription activator family	NA|213aa|down_2|CP024104.1_1579747_1580386_+	cd16342, FusC_FusB, Fusidic acid resistance protein (FusC/FusB)	NA|179aa|down_3|CP024104.1_1580410_1580947_-	pfam01625, PMSR, Peptide methionine sulfoxide reductase	NA|300aa|down_4|CP024104.1_1581865_1582765_+	PRK12479, PRK12479, branched-chain-amino-acid transaminase	NA|33aa|down_5|CP024104.1_1583263_1583362_+	pfam12323, HTH_OrfB_IS605, Helix-turn-helix domain	NA|558aa|down_6|CP024104.1_1583696_1585370_+	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|421aa|down_7|CP024104.1_1585403_1586666_+	PRK08639, PRK08639, threonine dehydratase; Validated	NA|847aa|down_8|CP024104.1_1587665_1590206_+	pfam00899, ThiF, ThiF family	NA|405aa|down_9|CP024104.1_1590364_1591579_+	cd17391, MFS_MdtG_MDR_like, Multidrug resistance protein MdtG and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily
GCA_002250925.2_ASM225092v2	CP024104	Bacillus cytotoxicus strain CH_23 chromosome, complete genome	2	2037713-2038144	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas9	cas14j,cas3,csa3,WYL,DinG,cas6,cas8b2,cas7,cas5,cas2,cas1,cas9,cas4,DEDDh	 Type II-B,Type II-C, or Type II-C?,Type II-B,Type II-A	GCCATAATTCCTCTTTAAATCTTGGTATGATATGAT,GCCATAATTCCTCTTTAAATCTTGGTATGATATGAT,GCCATAATTCCTCTTTAAATCTTGGTATGATATGAT	36,36,36	0	0	NA	NA	NA:NA:NA	6,6,6	6	TypeII-B,TypeII-C,orTypeII-C?,TypeII-B,TypeII-A	cas14j,cas3,csa3,WYL,DinG,cas6,cas8b2,cas7,cas5,cas2,cas1,cas9,cas4,DEDDh,TnsE_C	NA|137aa|up_4|CP024104.1_2032253_2032664_-,NA|178aa|up_2|CP024104.1_2033835_2034369_-,NA|101aa|up_1|CP024104.1_2034444_2034747_-,NA|103aa|down_4|CP024104.1_2044066_2044375_-,NA|56aa|down_8|CP024104.1_2049058_2049226_-	NA|285aa|up_9|CP024104.1_2027177_2028032_-	cd07516, HAD_Pase, phosphatase, similar to Escherichia coli Cof and Thermotoga maritima TM0651; belongs to the haloacid dehalogenase-like superfamily	NA|193aa|up_8|CP024104.1_2028233_2028812_-	cd10548, cupin_CDO, cysteine dioxygenase, cupin domain	NA|52aa|up_7|CP024104.1_2029081_2029237_-	pfam07561, DUF1540, Domain of Unknown Function (DUF1540)	NA|428aa|up_6|CP024104.1_2029309_2030593_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|412aa|up_5|CP024104.1_2030864_2032100_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|137aa|up_4|CP024104.1_2032253_2032664_-	NA	NA|335aa|up_3|CP024104.1_2032818_2033823_-	pfam12395, DUF3658, Protein of unknown function	NA|178aa|up_2|CP024104.1_2033835_2034369_-	NA	NA|101aa|up_1|CP024104.1_2034444_2034747_-	NA	NA|55aa|up_0|CP024104.1_2037444_2037609_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	cas2|103aa|down_0|CP024104.1_2038298_2038607_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|301aa|down_1|CP024104.1_2038611_2039514_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas9|1078aa|down_2|CP024104.1_2039503_2042737_-	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	NA|254aa|down_3|CP024104.1_2043048_2043810_-	pfam13105, DUF3959, Protein of unknown function (DUF3959)	NA|103aa|down_4|CP024104.1_2044066_2044375_-	NA	NA|782aa|down_5|CP024104.1_2044523_2046869_-	pfam13032, DUF3893, Domain of unknown function (DUF3893)	NA|228aa|down_6|CP024104.1_2047122_2047806_-	cd19364, TenA_C_BsTenA-like, TenA_C proteins similar to Bacillus subtilis TenA	NA|216aa|down_7|CP024104.1_2048124_2048772_-	PRK10484, PRK10484, putative transporter; Provisional	NA|56aa|down_8|CP024104.1_2049058_2049226_-	NA	NA|167aa|down_9|CP024104.1_2049310_2049811_-	TIGR01575, rimI, ribosomal-protein-alanine acetyltransferase
GCA_002250925.2_ASM225092v2	CP024104	Bacillus cytotoxicus strain CH_23 chromosome, complete genome	3	2165612-2166364	3,2	CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas3,cas5,cas7,cas8b2,cas6	cas14j,cas3,csa3,WYL,DinG,cas6,cas8b2,cas7,cas5,cas2,cas1,cas9,cas4,DEDDh	Unclear	CTTTATATCCCACTACGTTCAGATAAAAC,CTTTATATCCCACTACGTTCAGATAAAAC	29,29	0	0	NA	NA	I-A:I-A	11,11	11	Unclear	cas14j,cas3,csa3,WYL,DinG,cas6,cas8b2,cas7,cas5,cas2,cas1,cas9,cas4,DEDDh,TnsE_C	NA|84aa|up_0|CP024104.1_2164830_2165082_-,NA|115aa|down_1|CP024104.1_2167318_2167663_+,NA|82aa|down_4|CP024104.1_2169031_2169277_+	NA|128aa|up_9|CP024104.1_2154698_2155082_-	cd07263, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	cas2|88aa|up_8|CP024104.1_2156504_2156768_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|328aa|up_7|CP024104.1_2156772_2157756_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas4|166aa|up_6|CP024104.1_2157749_2158247_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|742aa|up_5|CP024104.1_2158309_2160535_-	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas5|233aa|up_4|CP024104.1_2160588_2161287_-	cd09658, Cas5_I-B, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|298aa|up_3|CP024104.1_2161283_2162177_-	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8b2|583aa|up_2|CP024104.1_2162169_2163918_-	cd09665, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas6|234aa|up_1|CP024104.1_2163917_2164619_-	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|84aa|up_0|CP024104.1_2164830_2165082_-	NA	NA|109aa|down_0|CP024104.1_2166650_2166977_-	COG2329, COG2329, Uncharacterized enzyme involved in biosynthesis of extracellular polysaccharides [General function prediction only]	NA|115aa|down_1|CP024104.1_2167318_2167663_+	NA	NA|110aa|down_2|CP024104.1_2167787_2168117_-	PRK07667, PRK07667, uridine kinase; Provisional	NA|189aa|down_3|CP024104.1_2168095_2168662_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|82aa|down_4|CP024104.1_2169031_2169277_+	NA	NA|179aa|down_5|CP024104.1_2169514_2170051_-	pfam11193, DUF2812, Protein of unknown function (DUF2812)	NA|108aa|down_6|CP024104.1_2170052_2170376_-	TIGR03433, padR_acidobact, transcriptional regulator, Acidobacterial, PadR-family	NA|328aa|down_7|CP024104.1_2170977_2171961_-	pfam13170, DUF4003, Protein of unknown function (DUF4003)	NA|156aa|down_8|CP024104.1_2172041_2172509_-	cd04688, Nudix_Hydrolase_29, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|82aa|down_9|CP024104.1_2172867_2173113_-	pfam00144, Beta-lactamase, Beta-lactamase
