assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_002250965.2_ASM225096v2	CP024111	Bacillus cytotoxicus strain CH_4 chromosome, complete genome	1	692701-693316	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas8b2,cas7,cas5,cas3,cas4,cas2	cas14j,cas3,cas6,cas8b2,cas7,cas5,cas4,cas2,csa3,cas9,cas1,DinG,DEDDh,WYL	Unclear	GTTTTATCTGAACGTAGTGGGATATAAAG,GTTTTATCTGAACGTAGTGGGATATAAAG,GTTTTATCTGAACGTAGTGGGATATAAAG	29,29,29	0	0	NA	NA	I-A:I-A:I-A	8,9,9	9	Unclear	cas14j,cas3,cas6,cas8b2,cas7,cas5,cas4,cas2,csa3,cas9,cas1,DinG,DEDDh,WYL,TnsE_C	NA|82aa|up_4|CP024111.1_689787_690033_-,NA|115aa|up_1|CP024111.1_691401_691746_-,NA	NA|82aa|up_9|CP024111.1_685951_686197_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|155aa|up_8|CP024111.1_686555_687020_+	cd04688, Nudix_Hydrolase_29, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|328aa|up_7|CP024111.1_687103_688087_+	pfam13170, DUF4003, Protein of unknown function (DUF4003)	NA|108aa|up_6|CP024111.1_688688_689012_+	TIGR03433, padR_acidobact, transcriptional regulator, Acidobacterial, PadR-family	NA|179aa|up_5|CP024111.1_689013_689550_+	pfam11193, DUF2812, Protein of unknown function (DUF2812)	NA|82aa|up_4|CP024111.1_689787_690033_-	NA	NA|189aa|up_3|CP024111.1_690402_690969_+	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|110aa|up_2|CP024111.1_690947_691277_+	PRK07667, PRK07667, uridine kinase; Provisional	NA|115aa|up_1|CP024111.1_691401_691746_-	NA	NA|109aa|up_0|CP024111.1_692087_692414_+	COG2329, COG2329, Uncharacterized enzyme involved in biosynthesis of extracellular polysaccharides [General function prediction only]	cas6|234aa|down_0|CP024111.1_694650_695352_+	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	cas8b2|583aa|down_1|CP024111.1_695351_697100_+	cd09665, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas7|298aa|down_2|CP024111.1_697092_697986_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|233aa|down_3|CP024111.1_697982_698681_+	cd09658, Cas5_I-B, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|742aa|down_4|CP024111.1_698734_700960_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas4|166aa|down_5|CP024111.1_701022_701520_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas2|88aa|down_6|CP024111.1_702500_702764_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|85aa|down_7|CP024111.1_703705_703960_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|128aa|down_8|CP024111.1_704578_704962_+	cd07263, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|446aa|down_9|CP024111.1_705260_706598_+	PRK02991, PRK02991, D-serine dehydratase; Provisional
GCA_002250965.2_ASM225096v2	CP024111	Bacillus cytotoxicus strain CH_4 chromosome, complete genome	2	702988-703475	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas8b2,cas7,cas5,cas3,cas4,cas2	cas14j,cas3,cas6,cas8b2,cas7,cas5,cas4,cas2,csa3,cas9,cas1,DinG,DEDDh,WYL	Unclear	GTTTTATCTGAACGTAGTGGGATATAAAG,GTTTTATCTGAACGTAGTGGGATATAAAG,GTTTTATCTGAACGTAGTGGGATATAAAG	29,29,29	1	1	703082-703118	CP024111.1_2120264-2120300	I-A:I-A:I-A	6,7,7	7	Unclear	cas14j,cas3,cas6,cas8b2,cas7,cas5,cas4,cas2,csa3,cas9,cas1,DinG,DEDDh,WYL,TnsE_C	NA|115aa|up_8|CP024111.1_691401_691746_-,NA	NA|110aa|up_9|CP024111.1_690947_691277_+	PRK07667, PRK07667, uridine kinase; Provisional	NA|115aa|up_8|CP024111.1_691401_691746_-	NA	NA|109aa|up_7|CP024111.1_692087_692414_+	COG2329, COG2329, Uncharacterized enzyme involved in biosynthesis of extracellular polysaccharides [General function prediction only]	cas6|234aa|up_6|CP024111.1_694650_695352_+	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	cas8b2|583aa|up_5|CP024111.1_695351_697100_+	cd09665, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas7|298aa|up_4|CP024111.1_697092_697986_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|233aa|up_3|CP024111.1_697982_698681_+	cd09658, Cas5_I-B, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|742aa|up_2|CP024111.1_698734_700960_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas4|166aa|up_1|CP024111.1_701022_701520_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas2|88aa|up_0|CP024111.1_702500_702764_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|85aa|down_0|CP024111.1_703705_703960_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|128aa|down_1|CP024111.1_704578_704962_+	cd07263, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|446aa|down_2|CP024111.1_705260_706598_+	PRK02991, PRK02991, D-serine dehydratase; Provisional	NA|146aa|down_3|CP024111.1_706644_707082_-	pfam10710, DUF2512, Protein of unknown function (DUF2512)	NA|1177aa|down_4|CP024111.1_707241_710772_+	cd01406, SIR2-like, Sir2-like: Prokaryotic group of uncharacterized Sir2-like proteins which lack certain key catalytic residues and conserved zinc binding cysteines; and are members of the SIR2 superfamily of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation	NA|359aa|down_5|CP024111.1_711539_712616_+	PRK12595, PRK12595, bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase; Reviewed	NA|542aa|down_6|CP024111.1_712915_714541_+	PRK15064, PRK15064, ABC transporter ATP-binding protein; Provisional	NA|658aa|down_7|CP024111.1_715025_716999_+	COG1368, MdoB, Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily [Cell envelope biogenesis, outer membrane]	NA|702aa|down_8|CP024111.1_717634_719740_+	COG3711, BglG, Transcriptional antiterminator [Transcription]	NA|90aa|down_9|CP024111.1_719752_720022_+	cd05563, PTS_IIB_ascorbate, PTS_IIB_ascorbate: subunit IIB of enzyme II (EII) of the L-ascorbate-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS)
GCA_002250965.2_ASM225096v2	CP024111	Bacillus cytotoxicus strain CH_4 chromosome, complete genome	3	831163-831856	3,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas9,cas1,cas2	cas14j,cas3,cas6,cas8b2,cas7,cas5,cas4,cas2,csa3,cas9,cas1,DinG,DEDDh,WYL	Type II-B,Type II-C, Type II-B, or Type II-C?,Type II-A	ATCATATCATACCAAGATTTAAAGAGGAATTATGGC,ATCATATCATACCAAGATTTAAAGAGGAATTATGGC,ATCATATCATACCAAGATTTAAAGAGGAATTATGGC	36,36,36	0	0	NA	NA	NA:NA:NA	10,10,10	10	TypeII-B,TypeII-C,TypeII-B,orTypeII-C?,TypeII-A	cas14j,cas3,cas6,cas8b2,cas7,cas5,cas4,cas2,csa3,cas9,cas1,DinG,DEDDh,WYL,TnsE_C	NA|56aa|up_8|CP024111.1_820079_820247_+,NA|101aa|down_2|CP024111.1_834831_835134_+,NA|178aa|down_3|CP024111.1_835209_835743_+,NA|137aa|down_5|CP024111.1_836914_837325_+	NA|167aa|up_9|CP024111.1_819494_819995_+	TIGR01575, rimI, ribosomal-protein-alanine acetyltransferase	NA|56aa|up_8|CP024111.1_820079_820247_+	NA	NA|235aa|up_7|CP024111.1_820476_821181_+	PRK10484, PRK10484, putative transporter; Provisional	NA|228aa|up_6|CP024111.1_821499_822183_+	cd19364, TenA_C_BsTenA-like, TenA_C proteins similar to Bacillus subtilis TenA	NA|782aa|up_5|CP024111.1_822437_824783_+	pfam13032, DUF3893, Domain of unknown function (DUF3893)	NA|140aa|up_4|CP024111.1_824820_825240_+	pfam14184, YrvL, Regulatory protein YrvL	NA|254aa|up_3|CP024111.1_825496_826258_+	pfam13105, DUF3959, Protein of unknown function (DUF3959)	cas9|1078aa|up_2|CP024111.1_826569_829803_+	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	cas1|301aa|up_1|CP024111.1_829792_830695_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas2|103aa|up_0|CP024111.1_830699_831008_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|55aa|down_0|CP024111.1_831959_832124_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|355aa|down_1|CP024111.1_832462_833527_+	pfam18801, RapH_N, response regulator aspartate phosphatase H, N terminal	NA|101aa|down_2|CP024111.1_834831_835134_+	NA	NA|178aa|down_3|CP024111.1_835209_835743_+	NA	NA|335aa|down_4|CP024111.1_835755_836760_+	pfam12395, DUF3658, Protein of unknown function	NA|137aa|down_5|CP024111.1_836914_837325_+	NA	NA|412aa|down_6|CP024111.1_837478_838714_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|428aa|down_7|CP024111.1_838985_840269_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|52aa|down_8|CP024111.1_840341_840497_+	pfam07561, DUF1540, Domain of Unknown Function (DUF1540)	NA|193aa|down_9|CP024111.1_840767_841346_+	cd10548, cupin_CDO, cysteine dioxygenase, cupin domain
GCA_002250965.2_ASM225096v2	CP024111	Bacillus cytotoxicus strain CH_4 chromosome, complete genome	4	1334505-1334633	4	CRISPRCasFinder	no		cas14j,cas3,cas6,cas8b2,cas7,cas5,cas4,cas2,csa3,cas9,cas1,DinG,DEDDh,WYL	Orphan	CGTAGGGACGATTAAAATATCGCGGTACCACCCTAGTT	38	0	0	NA	NA	NA	1	1	Orphan	cas14j,cas3,cas6,cas8b2,cas7,cas5,cas4,cas2,csa3,cas9,cas1,DinG,DEDDh,WYL,TnsE_C	NA,NA|155aa|down_6|CP024111.1_1345824_1346289_+,NA|63aa|down_8|CP024111.1_1348289_1348478_+	NA|240aa|up_9|CP024111.1_1322976_1323696_-	pfam04474, DUF554, Protein of unknown function (DUF554)	NA|421aa|up_8|CP024111.1_1323917_1325180_-	PRK08639, PRK08639, threonine dehydratase; Validated	NA|558aa|up_7|CP024111.1_1325213_1326887_-	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|33aa|up_6|CP024111.1_1327221_1327320_-	pfam12323, HTH_OrfB_IS605, Helix-turn-helix domain	NA|300aa|up_5|CP024111.1_1327817_1328717_-	PRK12479, PRK12479, branched-chain-amino-acid transaminase	NA|179aa|up_4|CP024111.1_1329635_1330172_+	pfam01625, PMSR, Peptide methionine sulfoxide reductase	NA|464aa|up_3|CP024111.1_1330294_1331686_+	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|213aa|up_2|CP024111.1_1331747_1332386_-	cd16342, FusC_FusB, Fusidic acid resistance protein (FusC/FusB)	NA|89aa|up_1|CP024111.1_1332525_1332792_-	pfam08765, Mor, Mor transcription activator family	NA|446aa|up_0|CP024111.1_1333063_1334401_-	cd10336, SLC6sbd_Tyt1-Like, solute carrier 6 subfamily, Fusobacterium nucleatum Tyt1-like; solute-binding domain	NA|545aa|down_0|CP024111.1_1335041_1336676_+	cd13124, MATE_SpoVB_like, Stage V sporulation protein B, also known as Stage III sporulation protein F, and related proteins	NA|306aa|down_1|CP024111.1_1336745_1337663_-	TIGR01139, Cysteine_synthase, cysteine synthase A	NA|403aa|down_2|CP024111.1_1337678_1338887_-	cd17478, MFS_FsR, Fosmidomycin resistance protein of the Major Facilitator Superfamily of transporters	NA|547aa|down_3|CP024111.1_1339104_1340745_+	cd08504, PBP2_OppA, The substrate-binding component of an ABC-type oligopetide import system contains the type 2 periplasmic binding fold	NA|523aa|down_4|CP024111.1_1340847_1342416_-	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|420aa|down_5|CP024111.1_1342747_1344007_-	TIGR03156, GTP_HflX, GTP-binding protein HflX	NA|155aa|down_6|CP024111.1_1345824_1346289_+	NA	NA|459aa|down_7|CP024111.1_1346344_1347721_-	COG4193, LytD, Beta- N-acetylglucosaminidase [Carbohydrate transport and metabolism]	NA|63aa|down_8|CP024111.1_1348289_1348478_+	NA	NA|411aa|down_9|CP024111.1_1348619_1349852_+	PRK06635, PRK06635, aspartate kinase; Reviewed
