assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009740005.1_ASM974000v1	NZ_CP040344	Bacillus albus strain DLOU-Yingkou chromosome, complete genome	1	1140202-1140331	1	CRISPRCasFinder	no		Cas14u_CAS-V,DEDDh,cas3,c2c9_V-U4,csa3,cas14k,WYL,DinG,cas14j	Orphan	CCGAACATACATCTTAAACAAACGTTTGATTAACTCCCTATTTTTCTTT	49	0	0	NA	NA	NA	1	1	Orphan	Cas14u_CAS-V,DEDDh,cas3,c2c9_V-U4,csa3,cas14k,WYL,DinG,cas14j,RT	NA|115aa|up_4|NZ_CP040344.1_1137562_1137907_-,NA|176aa|down_0|NZ_CP040344.1_1140429_1140957_-,NA|62aa|down_4|NZ_CP040344.1_1144296_1144482_-	NA|262aa|up_9|NZ_CP040344.1_1132607_1133393_-	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]	NA|269aa|up_8|NZ_CP040344.1_1133631_1134438_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|271aa|up_7|NZ_CP040344.1_1134509_1135322_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|222aa|up_6|NZ_CP040344.1_1135345_1136011_-	COG2011, AbcD, ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]	NA|342aa|up_5|NZ_CP040344.1_1136003_1137029_-	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|115aa|up_4|NZ_CP040344.1_1137562_1137907_-	NA	NA|100aa|up_3|NZ_CP040344.1_1138061_1138361_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|115aa|up_2|NZ_CP040344.1_1138373_1138718_-	COG1658, COG1658, Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair]	NA|128aa|up_1|NZ_CP040344.1_1139217_1139601_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|122aa|up_0|NZ_CP040344.1_1139642_1140008_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC	NA|176aa|down_0|NZ_CP040344.1_1140429_1140957_-	NA	NA|216aa|down_1|NZ_CP040344.1_1141100_1141748_+	cd03386, PAP2_Aur1_like, PAP2_like proteins, Aur1_like subfamily	NA|338aa|down_2|NZ_CP040344.1_1141803_1142817_-	pfam13303, PTS_EIIC_2, Phosphotransferase system, EIIC	NA|84aa|down_3|NZ_CP040344.1_1144031_1144283_-	pfam07875, Coat_F, Coat F domain	NA|62aa|down_4|NZ_CP040344.1_1144296_1144482_-	NA	NA|183aa|down_5|NZ_CP040344.1_1145162_1145711_-	pfam13305, WHG, WHG domain	NA|241aa|down_6|NZ_CP040344.1_1145714_1146437_-	cd07721, yflN-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis yflN; MBL-fold metallo hydrolase domain	NA|601aa|down_7|NZ_CP040344.1_1146575_1148378_-	cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase	NA|391aa|down_8|NZ_CP040344.1_1148626_1149799_-	PRK07661, PRK07661, acetyl-CoA C-acetyltransferase	NA|794aa|down_9|NZ_CP040344.1_1149820_1152202_-	COG1250, FadB, 3-hydroxyacyl-CoA dehydrogenase [Lipid metabolism]
GCF_009740005.1_ASM974000v1	NZ_CP040344	Bacillus albus strain DLOU-Yingkou chromosome, complete genome	2	2674579-2674672	2	CRISPRCasFinder	no	WYL	Cas14u_CAS-V,DEDDh,cas3,c2c9_V-U4,csa3,cas14k,WYL,DinG,cas14j	Unclear	GGTTTAAATACGTTAAATAGCAAAA	25	0	0	NA	NA	NA	1	1	Orphan	Cas14u_CAS-V,DEDDh,cas3,c2c9_V-U4,csa3,cas14k,WYL,DinG,cas14j,RT	NA|100aa|up_4|NZ_CP040344.1_2669118_2669418_+,NA|282aa|up_3|NZ_CP040344.1_2669472_2670318_-,NA|237aa|down_0|NZ_CP040344.1_2676549_2677260_-,NA|72aa|down_1|NZ_CP040344.1_2677600_2677816_+	NA|349aa|up_9|NZ_CP040344.1_2661531_2662578_+	PRK00115, hemE, uroporphyrinogen decarboxylase; Validated	NA|312aa|up_8|NZ_CP040344.1_2662592_2663528_+	PRK12435, PRK12435, ferrochelatase; Provisional	NA|474aa|up_7|NZ_CP040344.1_2663548_2664970_+	PRK11883, PRK11883, protoporphyrinogen oxidase; Reviewed	NA|451aa|up_6|NZ_CP040344.1_2665009_2666362_-	pfam13218, DUF4026, Protein of unknown function (DUF4026)	NA|789aa|up_5|NZ_CP040344.1_2666601_2668968_+	COG2374, COG2374, Predicted extracellular nuclease [General function prediction only]	NA|100aa|up_4|NZ_CP040344.1_2669118_2669418_+	NA	NA|282aa|up_3|NZ_CP040344.1_2669472_2670318_-	NA	NA|133aa|up_2|NZ_CP040344.1_2670548_2670947_+	pfam03965, Penicillinase_R, Penicillinase repressor	NA|632aa|up_1|NZ_CP040344.1_2670952_2672848_+	pfam05569, Peptidase_M56, BlaR1 peptidase M56	NA|191aa|up_0|NZ_CP040344.1_2673062_2673635_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|237aa|down_0|NZ_CP040344.1_2676549_2677260_-	NA	NA|72aa|down_1|NZ_CP040344.1_2677600_2677816_+	NA	NA|102aa|down_2|NZ_CP040344.1_2677852_2678158_+	pfam09860, DUF2087, Uncharacterized protein conserved in bacteria (DUF2087)	NA|168aa|down_3|NZ_CP040344.1_2678229_2678733_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|167aa|down_4|NZ_CP040344.1_2678849_2679350_-	pfam12867, DinB_2, DinB superfamily	WYL|314aa|down_5|NZ_CP040344.1_2679434_2680376_-	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|338aa|down_6|NZ_CP040344.1_2680573_2681587_+	COG1609, PurR, Transcriptional regulators [Transcription]	NA|43aa|down_7|NZ_CP040344.1_2681626_2681755_-	pfam14149, YhfH, YhfH-like protein	NA|245aa|down_8|NZ_CP040344.1_2681935_2682670_+	cd07716, RNaseZ_short-form-like_MBL-fold, uncharacterized bacterial subgroup of Ribonuclease Z, short form; MBL-fold metallo-hydrolase domain	NA|330aa|down_9|NZ_CP040344.1_2682679_2683669_+	TIGR00545, Probable_lipoate-protein_ligase_A, lipoyltransferase and lipoate-protein ligase
GCF_009740005.1_ASM974000v1	NZ_CP040344	Bacillus albus strain DLOU-Yingkou chromosome, complete genome	3	3389763-3389854	3	CRISPRCasFinder	no	csa3	Cas14u_CAS-V,DEDDh,cas3,c2c9_V-U4,csa3,cas14k,WYL,DinG,cas14j	Type I-A	TTTAAATCAATTTTTTTTGATTAA	24	0	0	NA	NA	NA	1	1	Orphan	Cas14u_CAS-V,DEDDh,cas3,c2c9_V-U4,csa3,cas14k,WYL,DinG,cas14j,RT	NA|182aa|up_4|NZ_CP040344.1_3380662_3381208_-,NA|429aa|up_1|NZ_CP040344.1_3387689_3388976_-,NA|68aa|down_1|NZ_CP040344.1_3391366_3391570_+	NA|160aa|up_9|NZ_CP040344.1_3373651_3374131_+	pfam07301, DUF1453, Protein of unknown function (DUF1453)	NA|77aa|up_8|NZ_CP040344.1_3374178_3374409_-	PRK01631, PRK01631, hypothetical protein; Provisional	NA|228aa|up_7|NZ_CP040344.1_3375628_3376312_-	pfam13386, DsbD_2, Cytochrome C biogenesis protein transmembrane region	NA|428aa|up_6|NZ_CP040344.1_3377569_3378853_+	cd01087, Prolidase, Prolidase	NA|547aa|up_5|NZ_CP040344.1_3378970_3380611_+	COG2132, SufI, Putative multicopper oxidases [Secondary metabolites biosynthesis, transport, and catabolism]	NA|182aa|up_4|NZ_CP040344.1_3380662_3381208_-	NA	NA|715aa|up_3|NZ_CP040344.1_3381540_3383685_+	PRK07726, PRK07726, DNA topoisomerase 3	NA|960aa|up_2|NZ_CP040344.1_3384535_3387415_+	pfam13475, DUF4116, Domain of unknown function (DUF4116)	NA|429aa|up_1|NZ_CP040344.1_3387689_3388976_-	NA	csa3|107aa|up_0|NZ_CP040344.1_3389408_3389729_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|462aa|down_0|NZ_CP040344.1_3389901_3391287_+	pfam13738, Pyr_redox_3, Pyridine nucleotide-disulphide oxidoreductase	NA|68aa|down_1|NZ_CP040344.1_3391366_3391570_+	NA	NA|165aa|down_2|NZ_CP040344.1_3391629_3392124_+	COG1247, COG1247, Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane]	NA|249aa|down_3|NZ_CP040344.1_3392249_3392996_-	cd10440, GIY-YIG_COG3680, GIY-YIG domain of uncharacterized proteins from bacteria and their eukaryotic homologs	csa3|116aa|down_4|NZ_CP040344.1_3393566_3393914_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|350aa|down_5|NZ_CP040344.1_3393942_3394992_+	TIGR00832, Uncharacterized_transporter_slr0944, arsenical-resistance protein	NA|135aa|down_6|NZ_CP040344.1_3395024_3395429_+	PRK13530, PRK13530, arsenate reductase (thioredoxin)	NA|120aa|down_7|NZ_CP040344.1_3395493_3395853_+	pfam06953, ArsD, Arsenical resistance operon trans-acting repressor ArsD	NA|587aa|down_8|NZ_CP040344.1_3395871_3397632_+	TIGR04291, Arsenical_pump-driving_ATPase, arsenical pump-driving ATPase	NA|94aa|down_9|NZ_CP040344.1_3397671_3397953_+	TIGR00049, Uncharacterized_protein_in_nifU_5'region, Iron-sulfur cluster assembly accessory protein
GCF_009740005.1_ASM974000v1	NZ_CP040344	Bacillus albus strain DLOU-Yingkou chromosome, complete genome	4	5253301-5255316	1	CRT	no		Cas14u_CAS-V,DEDDh,cas3,c2c9_V-U4,csa3,cas14k,WYL,DinG,cas14j	Orphan	GGNGCTACNGGNGCTACT	18	9	17	5253319-5253345|5254057-5254083|5254057-5254083|5254210-5254236|5254210-5254236|5254345-5254371|5254345-5254371|5254480-5254506|5254480-5254506|5254615-5254641|5254615-5254641|5254876-5254902|5254876-5254902|5255101-5255127|5255101-5255127|5255281-5255298|5255281-5255298	NZ_CP040344.1_3006992-3006966|NZ_CP040344.1_5255398-5255424|NZ_CP040344.1_5255443-5255469|NZ_CP040344.1_5255398-5255424|NZ_CP040344.1_5255443-5255469|NZ_CP040344.1_5255398-5255424|NZ_CP040344.1_5255443-5255469|NZ_CP040344.1_5255398-5255424|NZ_CP040344.1_5255443-5255469|NZ_CP040344.1_5255398-5255424|NZ_CP040344.1_5255443-5255469|NZ_CP040344.1_5255398-5255424|NZ_CP040344.1_5255443-5255469|NZ_CP040344.1_5255398-5255424|NZ_CP040344.1_5255443-5255469|NZ_CP040344.1_5255488-5255505|NZ_CP040344.1_5255443-5255460	NA	40	40	Orphan	Cas14u_CAS-V,DEDDh,cas3,c2c9_V-U4,csa3,cas14k,WYL,DinG,cas14j,RT	NA|65aa|up_9|NZ_CP040344.1_5241426_5241621_-,NA|83aa|up_8|NZ_CP040344.1_5242578_5242827_-,NA|82aa|down_2|NZ_CP040344.1_5258819_5259065_+,NA|113aa|down_9|NZ_CP040344.1_5267520_5267859_-	NA|65aa|up_9|NZ_CP040344.1_5241426_5241621_-	NA	NA|83aa|up_8|NZ_CP040344.1_5242578_5242827_-	NA	NA|220aa|up_7|NZ_CP040344.1_5243031_5243691_+	COG1974, LexA, SOS-response transcriptional repressors (RecA-mediated autopeptidases) [Transcription / Signal transduction mechanisms]	NA|386aa|up_6|NZ_CP040344.1_5243910_5245067_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|445aa|up_5|NZ_CP040344.1_5245181_5246516_-	TIGR00653, Glutamine_synthetase, glutamine synthetase, type I	NA|130aa|up_4|NZ_CP040344.1_5246564_5246954_-	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|424aa|up_3|NZ_CP040344.1_5247129_5248401_-	pfam06838, Met_gamma_lyase, Methionine gamma-lyase	NA|425aa|up_2|NZ_CP040344.1_5248393_5249668_-	TIGR03156, GTP_HflX, GTP-binding protein HflX	NA|208aa|up_1|NZ_CP040344.1_5249760_5250384_+	COG2860, COG2860, Predicted membrane protein [Function unknown]	NA|322aa|up_0|NZ_CP040344.1_5251723_5252689_+	COG4974, XerD, Site-specific recombinase XerD [DNA replication, recombination, and repair]	NA|75aa|down_0|NZ_CP040344.1_5257437_5257662_-	PRK00395, hfq, RNA-binding protein Hfq; Provisional	NA|318aa|down_1|NZ_CP040344.1_5257683_5258637_-	PRK00091, miaA, tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed	NA|82aa|down_2|NZ_CP040344.1_5258819_5259065_+	NA	NA|311aa|down_3|NZ_CP040344.1_5259127_5260060_-	COG3584, COG3584, Uncharacterized protein conserved in bacteria [Function unknown]	NA|619aa|down_4|NZ_CP040344.1_5260407_5262264_-	PRK10712, PRK10712, PTS system fructose-specific transporter subunits IIBC; Provisional	NA|304aa|down_5|NZ_CP040344.1_5262277_5263189_-	cd01164, FruK_PfkB_like, 1-phosphofructokinase (FruK), minor 6-phosphofructokinase (pfkB) and related sugar kinases	NA|251aa|down_6|NZ_CP040344.1_5263185_5263938_-	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|388aa|down_7|NZ_CP040344.1_5264095_5265259_-	cd08187, BDH, Butanol dehydrogenase catalyzes the conversion of butyraldehyde to butanol with the cofactor NAD(P)H being oxidized in the process	NA|181aa|down_8|NZ_CP040344.1_5266659_5267202_-	COG2322, COG2322, Predicted membrane protein [Function unknown]	NA|113aa|down_9|NZ_CP040344.1_5267520_5267859_-	NA
GCF_009740005.1_ASM974000v1	NZ_CP040345	Bacillus albus strain DLOU-Yingkou plasmid unnamed1, complete sequence	1	46190-46483	1	CRT	no	Cas14u_CAS-V	cas14j,csa3,Cas14u_CAS-V,RT	Unclear	AAAACCAGAAACAAAACCAGAGACAAANCC	30	2	2	46220-46249|46436-46453	NZ_CP040345.1_468452-468423|NZ_CP040345.1_46184-46201	NA	5	5	Unclear	Cas14u_CAS-V,DEDDh,cas3,c2c9_V-U4,csa3,cas14k,WYL,DinG,cas14j,RT	NA|56aa|up_8|NZ_CP040345.1_38019_38187_+,NA|64aa|up_2|NZ_CP040345.1_42722_42914_-,NA|46aa|down_0|NZ_CP040345.1_47748_47886_-,NA|91aa|down_4|NZ_CP040345.1_51127_51400_-,NA|37aa|down_5|NZ_CP040345.1_52414_52525_+,NA|37aa|down_6|NZ_CP040345.1_52564_52675_+,NA|188aa|down_8|NZ_CP040345.1_52971_53535_+	NA|466aa|up_9|NZ_CP040345.1_36486_37884_+	TIGR01000, Mesentericin_Y105_secretion_protein_MesE, bacteriocin secretion accessory protein	NA|56aa|up_8|NZ_CP040345.1_38019_38187_+	NA	NA|207aa|up_7|NZ_CP040345.1_38755_39376_+	cd03025, DsbA_FrnE_like, DsbA family, FrnE-like subfamily; composed of uncharacterized proteins containing a CXXC motif with similarity to DsbA and FrnE	NA|274aa|up_6|NZ_CP040345.1_39547_40369_+	cd07739, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|131aa|up_5|NZ_CP040345.1_40373_40766_+	COG3631, COG3631, Ketosteroid isomerase-related protein [General function prediction only]	NA|176aa|up_4|NZ_CP040345.1_40920_41448_+	COG4922, COG4922, Uncharacterized protein conserved in bacteria [Function unknown]	NA|322aa|up_3|NZ_CP040345.1_41671_42637_-	cd04275, ZnMc_pappalysin_like, Zinc-dependent metalloprotease, pappalysin_like subfamily	NA|64aa|up_2|NZ_CP040345.1_42722_42914_-	NA	NA|308aa|up_1|NZ_CP040345.1_42928_43852_-	pfam04389, Peptidase_M28, Peptidase family M28	NA|320aa|up_0|NZ_CP040345.1_44383_45343_+	cd07477, Peptidases_S8_Subtilisin_subset, Peptidase S8 family domain in Subtilisin proteins	NA|46aa|down_0|NZ_CP040345.1_47748_47886_-	NA	NA|365aa|down_1|NZ_CP040345.1_47882_48977_-	pfam18801, RapH_N, response regulator aspartate phosphatase H, N terminal	NA|187aa|down_2|NZ_CP040345.1_49639_50200_-	PTZ00121, PTZ00121, MAEBL; Provisional	NA|46aa|down_3|NZ_CP040345.1_50492_50630_+	pfam12960, DUF3849, Protein of unknown function (DUF3849)	NA|91aa|down_4|NZ_CP040345.1_51127_51400_-	NA	NA|37aa|down_5|NZ_CP040345.1_52414_52525_+	NA	NA|37aa|down_6|NZ_CP040345.1_52564_52675_+	NA	NA|42aa|down_7|NZ_CP040345.1_52713_52839_+	pfam09680, Tiny_TM_bacill, Protein of unknown function (Tiny_TM_bacill)	NA|188aa|down_8|NZ_CP040345.1_52971_53535_+	NA	NA|71aa|down_9|NZ_CP040345.1_53953_54166_+	smart00843, Ftsk_gamma, This domain directs oriented DNA translocation and forms a winged helix structure
