assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_010726645.1_ASM1072664v1	NZ_AP022563	Mycolicibacterium duvalii strain JCM 6396	1	2243189-2243303	1	PILER-CR	no	cas3	DinG,DEDDh,csa3,c2c9_V-U4,cas4,WYL,cas3	Unclear	GCCGCCGTCGCCGCCGTCGAGCCCG	25	1	1	2243214-2243233	NZ_AP022563.1_2894449-2894430	NA	2	2	Unclear	DinG,DEDDh,csa3,c2c9_V-U4,cas4,WYL,cas3	NA|240aa|up_8|NZ_AP022563.1_2234983_2235703_+,NA|261aa|down_0|NZ_AP022563.1_2243894_2244677_-,NA|64aa|down_2|NZ_AP022563.1_2245227_2245419_+	NA|124aa|up_9|NZ_AP022563.1_2234474_2234846_-	cd04488, RecG_wedge_OBF, RecG_wedge_OBF: A subfamily of OB folds corresponding to the OB fold found in the N-terminal (wedge) domain of Escherichia coli RecG	NA|240aa|up_8|NZ_AP022563.1_2234983_2235703_+	NA	NA|251aa|up_7|NZ_AP022563.1_2235718_2236471_-	pfam12502, DUF3710, Protein of unknown function (DUF3710)	NA|155aa|up_6|NZ_AP022563.1_2236490_2236955_-	PRK00601, dut, dUTP diphosphatase	NA|158aa|up_5|NZ_AP022563.1_2236980_2237454_+	pfam11292, DUF3093, Protein of unknown function (DUF3093)	NA|101aa|up_4|NZ_AP022563.1_2237473_2237776_-	pfam13834, DUF4193, Domain of unknown function (DUF4193)	NA|218aa|up_3|NZ_AP022563.1_2237962_2238616_+	pfam13399, LytR_C, LytR cell envelope-related transcriptional attenuator	NA|285aa|up_2|NZ_AP022563.1_2238623_2239478_-	cd01639, IMPase, IMPase, inositol monophosphatase and related domains	NA|260aa|up_1|NZ_AP022563.1_2239580_2240360_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|501aa|up_0|NZ_AP022563.1_2240540_2242043_+	PRK05901, PRK05901, RNA polymerase sigma factor; Provisional	NA|261aa|down_0|NZ_AP022563.1_2243894_2244677_-	NA	NA|123aa|down_1|NZ_AP022563.1_2244725_2245094_-	pfam06108, DUF952, Protein of unknown function (DUF952)	NA|64aa|down_2|NZ_AP022563.1_2245227_2245419_+	NA	NA|323aa|down_3|NZ_AP022563.1_2245446_2246415_+	COG1295, Rbn, Ribonuclease BN family enzyme [Replication, recombination, and repair]	NA|79aa|down_4|NZ_AP022563.1_2246404_2246641_-	pfam11238, DUF3039, Protein of unknown function (DUF3039)	NA|149aa|down_5|NZ_AP022563.1_2246702_2247149_+	pfam11298, DUF3099, Protein of unknown function (DUF3099)	NA|320aa|down_6|NZ_AP022563.1_2247296_2248256_+	PRK07921, PRK07921, RNA polymerase sigma factor SigB; Reviewed	NA|231aa|down_7|NZ_AP022563.1_2248387_2249080_+	COG1321, TroR, Mn-dependent transcriptional regulator [Transcription]	NA|350aa|down_8|NZ_AP022563.1_2249085_2250135_-	pfam13830, DUF4192, Domain of unknown function (DUF4192)	NA|330aa|down_9|NZ_AP022563.1_2250440_2251430_+	pfam09754, PAC2, PAC2 family
GCF_010726645.1_ASM1072664v1	NZ_AP022563	Mycolicibacterium duvalii strain JCM 6396	2	3950646-3951225	1	CRISPRCasFinder	no		DinG,DEDDh,csa3,c2c9_V-U4,cas4,WYL,cas3	Orphan	GGCGGAAACGGCGGCAACGGCGGC	24	0	0	NA	NA	NA	9	9	Orphan	DinG,DEDDh,csa3,c2c9_V-U4,cas4,WYL,cas3	NA|94aa|up_6|NZ_AP022563.1_3941904_3942186_-,NA|215aa|down_3|NZ_AP022563.1_3955000_3955645_+,NA|108aa|down_4|NZ_AP022563.1_3955646_3955970_+,NA|283aa|down_9|NZ_AP022563.1_3959774_3960623_-	NA|475aa|up_9|NZ_AP022563.1_3938766_3940191_+	COG3800, COG3800, Predicted transcriptional regulator [General function prediction only]	NA|185aa|up_8|NZ_AP022563.1_3940187_3940742_+	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|374aa|up_7|NZ_AP022563.1_3940704_3941826_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|94aa|up_6|NZ_AP022563.1_3941904_3942186_-	NA	NA|467aa|up_5|NZ_AP022563.1_3942186_3943587_-	PRK07818, PRK07818, dihydrolipoamide dehydrogenase; Reviewed	NA|215aa|up_4|NZ_AP022563.1_3943583_3944228_-	cd03257, ABC_NikE_OppD_transporters, ATP-binding cassette domain of nickel/oligopeptides specific transporters	NA|256aa|up_3|NZ_AP022563.1_3944224_3944992_-	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|277aa|up_2|NZ_AP022563.1_3944988_3945819_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|324aa|up_1|NZ_AP022563.1_3945815_3946787_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|529aa|up_0|NZ_AP022563.1_3946837_3948424_-	cd08518, PBP2_NikA_DppA_OppA_like_19, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|153aa|down_0|NZ_AP022563.1_3952189_3952648_-	COG2203, FhlA, FOG: GAF domain [Signal transduction mechanisms]	NA|93aa|down_1|NZ_AP022563.1_3952640_3952919_-	pfam16936, Holin_9, Putative holin	NA|670aa|down_2|NZ_AP022563.1_3952989_3954999_+	COG1505, COG1505, Serine proteases of the peptidase family S9A [Amino acid transport and metabolism]	NA|215aa|down_3|NZ_AP022563.1_3955000_3955645_+	NA	NA|108aa|down_4|NZ_AP022563.1_3955646_3955970_+	NA	NA|223aa|down_5|NZ_AP022563.1_3955896_3956565_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|299aa|down_6|NZ_AP022563.1_3956727_3957624_+	PRK12478, PRK12478, crotonase/enoyl-CoA hydratase family protein	NA|502aa|down_7|NZ_AP022563.1_3957662_3959168_+	PRK06184, PRK06184, hypothetical protein; Provisional	NA|203aa|down_8|NZ_AP022563.1_3959164_3959773_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|283aa|down_9|NZ_AP022563.1_3959774_3960623_-	NA
GCF_010726645.1_ASM1072664v1	NZ_AP022563	Mycolicibacterium duvalii strain JCM 6396	3	3951417-3951537	2	CRISPRCasFinder	no		DinG,DEDDh,csa3,c2c9_V-U4,cas4,WYL,cas3	Orphan	GGCGGAAACGGCGGCAACGGCGGC	24	1	1	3951441-3951455	NZ_AP022563.1_1707604-1707618	NA	2	2	Orphan	DinG,DEDDh,csa3,c2c9_V-U4,cas4,WYL,cas3	NA|94aa|up_6|NZ_AP022563.1_3941904_3942186_-,NA|215aa|down_3|NZ_AP022563.1_3955000_3955645_+,NA|108aa|down_4|NZ_AP022563.1_3955646_3955970_+,NA|283aa|down_9|NZ_AP022563.1_3959774_3960623_-	NA|475aa|up_9|NZ_AP022563.1_3938766_3940191_+	COG3800, COG3800, Predicted transcriptional regulator [General function prediction only]	NA|185aa|up_8|NZ_AP022563.1_3940187_3940742_+	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|374aa|up_7|NZ_AP022563.1_3940704_3941826_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|94aa|up_6|NZ_AP022563.1_3941904_3942186_-	NA	NA|467aa|up_5|NZ_AP022563.1_3942186_3943587_-	PRK07818, PRK07818, dihydrolipoamide dehydrogenase; Reviewed	NA|215aa|up_4|NZ_AP022563.1_3943583_3944228_-	cd03257, ABC_NikE_OppD_transporters, ATP-binding cassette domain of nickel/oligopeptides specific transporters	NA|256aa|up_3|NZ_AP022563.1_3944224_3944992_-	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|277aa|up_2|NZ_AP022563.1_3944988_3945819_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|324aa|up_1|NZ_AP022563.1_3945815_3946787_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|529aa|up_0|NZ_AP022563.1_3946837_3948424_-	cd08518, PBP2_NikA_DppA_OppA_like_19, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|153aa|down_0|NZ_AP022563.1_3952189_3952648_-	COG2203, FhlA, FOG: GAF domain [Signal transduction mechanisms]	NA|93aa|down_1|NZ_AP022563.1_3952640_3952919_-	pfam16936, Holin_9, Putative holin	NA|670aa|down_2|NZ_AP022563.1_3952989_3954999_+	COG1505, COG1505, Serine proteases of the peptidase family S9A [Amino acid transport and metabolism]	NA|215aa|down_3|NZ_AP022563.1_3955000_3955645_+	NA	NA|108aa|down_4|NZ_AP022563.1_3955646_3955970_+	NA	NA|223aa|down_5|NZ_AP022563.1_3955896_3956565_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|299aa|down_6|NZ_AP022563.1_3956727_3957624_+	PRK12478, PRK12478, crotonase/enoyl-CoA hydratase family protein	NA|502aa|down_7|NZ_AP022563.1_3957662_3959168_+	PRK06184, PRK06184, hypothetical protein; Provisional	NA|203aa|down_8|NZ_AP022563.1_3959164_3959773_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|283aa|down_9|NZ_AP022563.1_3959774_3960623_-	NA
GCF_010726645.1_ASM1072664v1	NZ_AP022563	Mycolicibacterium duvalii strain JCM 6396	4	3951609-3951680	3	CRISPRCasFinder	no		DinG,DEDDh,csa3,c2c9_V-U4,cas4,WYL,cas3	Orphan	GGCGGAAACGGCGGCAACGGCGGC	24	0	0	NA	NA	NA	1	1	Orphan	DinG,DEDDh,csa3,c2c9_V-U4,cas4,WYL,cas3	NA|94aa|up_6|NZ_AP022563.1_3941904_3942186_-,NA|215aa|down_3|NZ_AP022563.1_3955000_3955645_+,NA|108aa|down_4|NZ_AP022563.1_3955646_3955970_+,NA|283aa|down_9|NZ_AP022563.1_3959774_3960623_-	NA|475aa|up_9|NZ_AP022563.1_3938766_3940191_+	COG3800, COG3800, Predicted transcriptional regulator [General function prediction only]	NA|185aa|up_8|NZ_AP022563.1_3940187_3940742_+	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|374aa|up_7|NZ_AP022563.1_3940704_3941826_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|94aa|up_6|NZ_AP022563.1_3941904_3942186_-	NA	NA|467aa|up_5|NZ_AP022563.1_3942186_3943587_-	PRK07818, PRK07818, dihydrolipoamide dehydrogenase; Reviewed	NA|215aa|up_4|NZ_AP022563.1_3943583_3944228_-	cd03257, ABC_NikE_OppD_transporters, ATP-binding cassette domain of nickel/oligopeptides specific transporters	NA|256aa|up_3|NZ_AP022563.1_3944224_3944992_-	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|277aa|up_2|NZ_AP022563.1_3944988_3945819_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|324aa|up_1|NZ_AP022563.1_3945815_3946787_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|529aa|up_0|NZ_AP022563.1_3946837_3948424_-	cd08518, PBP2_NikA_DppA_OppA_like_19, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|153aa|down_0|NZ_AP022563.1_3952189_3952648_-	COG2203, FhlA, FOG: GAF domain [Signal transduction mechanisms]	NA|93aa|down_1|NZ_AP022563.1_3952640_3952919_-	pfam16936, Holin_9, Putative holin	NA|670aa|down_2|NZ_AP022563.1_3952989_3954999_+	COG1505, COG1505, Serine proteases of the peptidase family S9A [Amino acid transport and metabolism]	NA|215aa|down_3|NZ_AP022563.1_3955000_3955645_+	NA	NA|108aa|down_4|NZ_AP022563.1_3955646_3955970_+	NA	NA|223aa|down_5|NZ_AP022563.1_3955896_3956565_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|299aa|down_6|NZ_AP022563.1_3956727_3957624_+	PRK12478, PRK12478, crotonase/enoyl-CoA hydratase family protein	NA|502aa|down_7|NZ_AP022563.1_3957662_3959168_+	PRK06184, PRK06184, hypothetical protein; Provisional	NA|203aa|down_8|NZ_AP022563.1_3959164_3959773_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|283aa|down_9|NZ_AP022563.1_3959774_3960623_-	NA
