assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002243645.1_ASM224364v1	NZ_CP017704	Bacillus simplex NBRC 15720 = DSM 1321, complete genome	1	1334243-1334328	1	CRISPRCasFinder	no		csa3,DEDDh,RT,cas14j,DinG,cas3	Orphan	CGGTGACCAGCATAAGACAGAGGAAATCG	29	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,RT,cas14j,DinG,cas3	NA,NA|44aa|down_3|NZ_CP017704.1_1336792_1336924_+	NA|361aa|up_9|NZ_CP017704.1_1323972_1325055_-	PRK12838, PRK12838, carbamoyl phosphate synthase small subunit; Reviewed	NA|384aa|up_8|NZ_CP017704.1_1325137_1326289_-	PRK02936, argD, acetylornithine transaminase	NA|258aa|up_7|NZ_CP017704.1_1326285_1327059_-	PRK00942, PRK00942, acetylglutamate kinase; Provisional	NA|411aa|up_6|NZ_CP017704.1_1327070_1328303_-	PRK05388, argJ, bifunctional glutamate N-acetyltransferase/amino-acid acetyltransferase ArgJ	NA|346aa|up_5|NZ_CP017704.1_1328319_1329357_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|230aa|up_4|NZ_CP017704.1_1329589_1330279_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|338aa|up_3|NZ_CP017704.1_1330536_1331550_-	PRK00164, moaA, GTP 3',8-cyclase MoaA	NA|103aa|up_2|NZ_CP017704.1_1331658_1331967_-	COG2151, PaaD, Predicted metal-sulfur cluster biosynthetic enzyme [General function prediction only]	NA|263aa|up_1|NZ_CP017704.1_1332085_1332874_-	PRK10566, PRK10566, esterase; Provisional	NA|272aa|up_0|NZ_CP017704.1_1333105_1333921_+	cd07516, HAD_Pase, phosphatase, similar to Escherichia coli Cof and Thermotoga maritima TM0651; belongs to the haloacid dehalogenase-like superfamily	NA|84aa|down_0|NZ_CP017704.1_1334470_1334722_-	COG3729, GsiB, General stress protein [General function prediction only]	NA|281aa|down_1|NZ_CP017704.1_1334858_1335701_-	COG1284, COG1284, Uncharacterized conserved protein [Function unknown]	NA|289aa|down_2|NZ_CP017704.1_1335857_1336724_+	TIGR00762, DegV, EDD domain protein, DegV family	NA|44aa|down_3|NZ_CP017704.1_1336792_1336924_+	NA	NA|164aa|down_4|NZ_CP017704.1_1337027_1337519_+	cd11740, YajQ_like, Proteins similar to Escherichia coli YajQ	NA|292aa|down_5|NZ_CP017704.1_1337754_1338630_+	cd05355, SDR_c1, classical (c) SDR, subgroup 1	NA|267aa|down_6|NZ_CP017704.1_1338724_1339525_+	cd07712, MBLAC2-like_MBL-fold, uncharacterized human metallo-beta-lactamase domain-containing protein 2 and related proteins; MBL-fold metallo hydrolase domain	NA|348aa|down_7|NZ_CP017704.1_1339563_1340607_-	cd06294, PBP1_MalR-like, ligand-binding domain of maltose transcription regulator MalR which is a member of the LacI-GalR family repressors	NA|550aa|down_8|NZ_CP017704.1_1340949_1342599_-	cd11333, AmyAc_SI_OligoGlu_DGase, Alpha amylase catalytic domain found in Sucrose isomerases, oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), dextran glucosidase (also called glucan 1,6-alpha-glucosidase), and related proteins	NA|512aa|down_9|NZ_CP017704.1_1342678_1344214_-	cd11339, AmyAc_bac_CMD_like_2, Alpha amylase catalytic domain found in bacterial cyclomaltodextrinases and related proteins
GCF_002243645.1_ASM224364v1	NZ_CP017704	Bacillus simplex NBRC 15720 = DSM 1321, complete genome	2	1389026-1389145	2	CRISPRCasFinder	no		csa3,DEDDh,RT,cas14j,DinG,cas3	Orphan	TGATTCACATCATAATGTAAATGAGGTGTCAGTTAAAGACGTAT	44	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,RT,cas14j,DinG,cas3	NA,NA|61aa|down_7|NZ_CP017704.1_1394911_1395094_+,NA|28aa|down_8|NZ_CP017704.1_1395206_1395290_-,NA|28aa|down_9|NZ_CP017704.1_1395473_1395557_-	NA|43aa|up_9|NZ_CP017704.1_1379969_1380098_+	pfam14149, YhfH, YhfH-like protein	NA|192aa|up_8|NZ_CP017704.1_1380261_1380837_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|312aa|up_7|NZ_CP017704.1_1381120_1382056_-	PRK12435, PRK12435, ferrochelatase; Provisional	NA|348aa|up_6|NZ_CP017704.1_1382134_1383178_-	PRK00115, hemE, uroporphyrinogen decarboxylase; Validated	NA|172aa|up_5|NZ_CP017704.1_1383449_1383965_+	COG2329, COG2329, Uncharacterized enzyme involved in biosynthesis of extracellular polysaccharides [General function prediction only]	NA|208aa|up_4|NZ_CP017704.1_1384149_1384773_-	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|404aa|up_3|NZ_CP017704.1_1384865_1386077_+	cd08021, M20_Acy1_YhaA-like, M20 Peptidase aminoacylase 1 subfamily, includes Bacillus subtilis YhaA and Staphylococcus aureus amidohydrolase, SACOL0085	NA|235aa|up_2|NZ_CP017704.1_1386106_1386811_-	pfam12787, EcsC, EcsC protein family	NA|403aa|up_1|NZ_CP017704.1_1386916_1388125_-	pfam05975, EcsB, Bacterial ABC transporter protein EcsB	NA|248aa|up_0|NZ_CP017704.1_1388117_1388861_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|141aa|down_0|NZ_CP017704.1_1389521_1389944_+	cd01277, HINT_subgroup, HINT (histidine triad nucleotide-binding protein) subgroup: Members of this CD belong to the superfamily of histidine triad hydrolases that act on alpha-phosphate of ribonucleotides	NA|368aa|down_1|NZ_CP017704.1_1390217_1391321_+	PRK05355, PRK05355, 3-phosphoserine/phosphohydroxythreonine transaminase	NA|173aa|down_2|NZ_CP017704.1_1391447_1391966_+	pfam17099, TrpP, Tryptophan transporter TrpP	NA|121aa|down_3|NZ_CP017704.1_1392562_1392925_+	COG4980, GvpP, Gas vesicle protein [General function prediction only]	NA|193aa|down_4|NZ_CP017704.1_1393113_1393692_+	PRK13777, PRK13777, HTH-type transcriptional regulator Hpr	NA|109aa|down_5|NZ_CP017704.1_1393694_1394021_-	pfam08963, DUF1878, Protein of unknown function (DUF1878)	NA|180aa|down_6|NZ_CP017704.1_1394263_1394803_+	pfam11667, DUF3267, Putative zincin peptidase	NA|61aa|down_7|NZ_CP017704.1_1394911_1395094_+	NA	NA|28aa|down_8|NZ_CP017704.1_1395206_1395290_-	NA	NA|28aa|down_9|NZ_CP017704.1_1395473_1395557_-	NA
GCF_002243645.1_ASM224364v1	NZ_CP017704	Bacillus simplex NBRC 15720 = DSM 1321, complete genome	3	4909179-4909265	3	CRISPRCasFinder	no		csa3,DEDDh,RT,cas14j,DinG,cas3	Orphan	GCTGAATATCGGAGCTGTCTTTAAGGCA	28	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,RT,cas14j,DinG,cas3	NA|167aa|up_2|NZ_CP017704.1_4907187_4907688_+,NA|219aa|up_0|NZ_CP017704.1_4908478_4909135_+,NA|115aa|down_1|NZ_CP017704.1_4910202_4910547_-,NA|34aa|down_2|NZ_CP017704.1_4910787_4910889_+,NA|150aa|down_3|NZ_CP017704.1_4911011_4911461_+,NA|62aa|down_5|NZ_CP017704.1_4915199_4915385_+	NA|75aa|up_9|NZ_CP017704.1_4897845_4898070_+	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|63aa|up_8|NZ_CP017704.1_4898136_4898325_+	PRK14861, tatA, twin arginine translocase protein A; Provisional	NA|246aa|up_7|NZ_CP017704.1_4898388_4899126_+	COG0805, TatC, Sec-independent protein secretion pathway component TatC [Intracellular trafficking and secretion]	NA|425aa|up_6|NZ_CP017704.1_4899421_4900696_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|206aa|up_5|NZ_CP017704.1_4901333_4901951_-	pfam14278, TetR_C_8, Transcriptional regulator C-terminal region	NA|565aa|up_4|NZ_CP017704.1_4902245_4903940_+	COG2936, COG2936, Predicted acyl esterases [General function prediction only]	NA|313aa|up_3|NZ_CP017704.1_4904259_4905198_+	cd12827, EcCorA_ZntB-like_u2, uncharacterized bacterial subfamily of the Escherichia coli CorA-Salmonella typhimurium ZntB family	NA|167aa|up_2|NZ_CP017704.1_4907187_4907688_+	NA	NA|68aa|up_1|NZ_CP017704.1_4907999_4908203_-	pfam00269, SASP, Small, acid-soluble spore proteins, alpha/beta type	NA|219aa|up_0|NZ_CP017704.1_4908478_4909135_+	NA	NA|173aa|down_0|NZ_CP017704.1_4909434_4909953_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|115aa|down_1|NZ_CP017704.1_4910202_4910547_-	NA	NA|34aa|down_2|NZ_CP017704.1_4910787_4910889_+	NA	NA|150aa|down_3|NZ_CP017704.1_4911011_4911461_+	NA	NA|415aa|down_4|NZ_CP017704.1_4912495_4913740_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|62aa|down_5|NZ_CP017704.1_4915199_4915385_+	NA	NA|513aa|down_6|NZ_CP017704.1_4915495_4917034_-	cd13891, CuRO_3_CotA_like, The third Cupredoxin domain of bacterial laccases including CotA, a bacterial endospore coat component	NA|144aa|down_7|NZ_CP017704.1_4918207_4918639_-	TIGR03187, hypothetical_protein, DGQHR domain	NA|725aa|down_8|NZ_CP017704.1_4919065_4921240_+	PRK07726, PRK07726, DNA topoisomerase 3	NA|143aa|down_9|NZ_CP017704.1_4921342_4921771_+	PRK03902, PRK03902, transcriptional regulator MntR
