assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000454045.1_ASM45404v1	NC_022115	Rhodococcus erythropolis CCM2595, complete sequence	1	127997-128113	1	CRISPRCasFinder	no		cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	Orphan	GAAGCTGAACGGGTTGTCGACGAGGGCGTTGGTGCC	36	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	NA,NA|151aa|down_6|NC_022115.1_133430_133883_-,NA|416aa|down_8|NC_022115.1_135601_136849_-	NA|401aa|up_9|NC_022115.1_117779_118982_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|349aa|up_8|NC_022115.1_119096_120143_+	COG0577, SalY, ABC-type antimicrobial peptide transport system, permease component [Defense mechanisms]	NA|247aa|up_7|NC_022115.1_120139_120880_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|147aa|up_6|NC_022115.1_120991_121432_+	pfam13426, PAS_9, PAS domain	NA|209aa|up_5|NC_022115.1_121438_122065_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|324aa|up_4|NC_022115.1_122116_123088_+	COG1748, LYS9, Saccharopine dehydrogenase and related proteins [Amino acid transport and metabolism]	NA|431aa|up_3|NC_022115.1_123034_124327_-	TIGR03604, hypothetical_protein, thiazole/oxazole-forming peptide maturase, SagD family component	NA|447aa|up_2|NC_022115.1_124323_125664_-	TIGR03604, hypothetical_protein, thiazole/oxazole-forming peptide maturase, SagD family component	NA|476aa|up_1|NC_022115.1_125656_127084_-	COG2936, COG2936, Predicted acyl esterases [General function prediction only]	NA|271aa|up_0|NC_022115.1_127080_127893_-	TIGR03882, hypothetical_protein, bacteriocin biosynthesis cyclodehydratase domain	NA|420aa|down_0|NC_022115.1_128268_129528_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|356aa|down_1|NC_022115.1_129612_130680_+	pfam01032, FecCD, FecCD transport family	NA|265aa|down_2|NC_022115.1_130676_131471_+	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|327aa|down_3|NC_022115.1_131467_132448_+	cd01148, TroA_a, Metal binding protein TroA_a	NA|229aa|down_4|NC_022115.1_132452_133139_+	TIGR03605, antibiot_sagB, SagB-type dehydrogenase domain	NA|98aa|down_5|NC_022115.1_133140_133434_-	pfam09851, SHOCT, Short C-terminal domain	NA|151aa|down_6|NC_022115.1_133430_133883_-	NA	NA|549aa|down_7|NC_022115.1_133886_135533_-	cd17321, MFS_MMR_MDR_like, Methylenomycin A resistance protein (also called MMR peptide) and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|416aa|down_8|NC_022115.1_135601_136849_-	NA	NA|196aa|down_9|NC_022115.1_136867_137455_+	pfam04978, DUF664, Protein of unknown function (DUF664)
GCF_000454045.1_ASM45404v1	NC_022115	Rhodococcus erythropolis CCM2595, complete sequence	2	2186716-2186801	2	CRISPRCasFinder	no		cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	Orphan	CACCGGATGCTGCGCCGCCGCCG	23	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	NA|117aa|up_1|NC_022115.1_2185654_2186005_-,NA	NA|904aa|up_9|NC_022115.1_2177659_2180371_-	PRK09800, PRK09800, putative hypoxanthine oxidase; Provisional	NA|270aa|up_8|NC_022115.1_2180367_2181177_-	pfam00941, FAD_binding_5, FAD binding domain in molybdopterin dehydrogenase	NA|457aa|up_7|NC_022115.1_2181184_2182555_-	PRK08203, PRK08203, hydroxydechloroatrazine ethylaminohydrolase; Reviewed	NA|316aa|up_6|NC_022115.1_2182551_2183499_-	TIGR03383, Structure_Of_Uricase, urate oxidase	NA|110aa|up_5|NC_022115.1_2183533_2183863_-	pfam00576, Transthyretin, HIUase/Transthyretin family	NA|173aa|up_4|NC_022115.1_2183859_2184378_-	PRK13798, PRK13798, putative OHCU decarboxylase; Provisional	NA|267aa|up_3|NC_022115.1_2184494_2185295_+	pfam13350, Y_phosphatase3, Tyrosine phosphatase family	NA|121aa|up_2|NC_022115.1_2185295_2185658_-	COG5652, COG5652, Predicted integral membrane protein [Function unknown]	NA|117aa|up_1|NC_022115.1_2185654_2186005_-	NA	NA|142aa|up_0|NC_022115.1_2186004_2186430_-	pfam08044, DUF1707, Domain of unknown function (DUF1707)	NA|467aa|down_0|NC_022115.1_2188441_2189842_-	pfam00668, Condensation, Condensation domain	NA|139aa|down_1|NC_022115.1_2189846_2190263_-	pfam02657, SufE, Fe-S metabolism associated domain	NA|302aa|down_2|NC_022115.1_2190259_2191165_-	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|488aa|down_3|NC_022115.1_2191283_2192747_-	pfam00668, Condensation, Condensation domain	NA|498aa|down_4|NC_022115.1_2192746_2194240_-	pfam00668, Condensation, Condensation domain	NA|566aa|down_5|NC_022115.1_2194390_2196088_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|437aa|down_6|NC_022115.1_2196152_2197463_+	pfam02720, DUF222, Domain of unknown function (DUF222)	NA|214aa|down_7|NC_022115.1_2197587_2198229_-	PRK00148, PRK00148, Maf-like protein; Reviewed	NA|114aa|down_8|NC_022115.1_2198244_2198586_-	pfam13822, ACC_epsilon, Acyl-CoA carboxylase epsilon subunit	NA|547aa|down_9|NC_022115.1_2198582_2200223_-	COG4799, COG4799, Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism]
GCF_000454045.1_ASM45404v1	NC_022115	Rhodococcus erythropolis CCM2595, complete sequence	3	3061754-3061827	3	CRISPRCasFinder	no	csa3	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	Type I-A	CCCTGGTACCCGCCGCGGCTCTC	23	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	NA|293aa|up_6|NC_022115.1_3052088_3052967_-,NA|294aa|up_2|NC_022115.1_3057185_3058067_-,NA|261aa|down_6|NC_022115.1_3071259_3072042_+	NA|345aa|up_9|NC_022115.1_3049221_3050256_-	cd02653, nuc_hydro_3, NH_3: A subgroup of nucleoside hydrolases	NA|159aa|up_8|NC_022115.1_3050248_3050725_-	COG1764, osmC, Organic hydroperoxide reductase [Secondary metabolites biosynthesis, transport and catabolism]	NA|438aa|up_7|NC_022115.1_3050756_3052070_-	pfam02515, CoA_transf_3, CoA-transferase family III	NA|293aa|up_6|NC_022115.1_3052088_3052967_-	NA	NA|363aa|up_5|NC_022115.1_3052970_3054059_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|461aa|up_4|NC_022115.1_3054051_3055434_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|479aa|up_3|NC_022115.1_3055745_3057182_-	PRK07807, PRK07807, GuaB1 family IMP dehydrogenase-related protein	NA|294aa|up_2|NC_022115.1_3057185_3058067_-	NA	NA|501aa|up_1|NC_022115.1_3058071_3059574_-	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|486aa|up_0|NC_022115.1_3059636_3061094_-	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|318aa|down_0|NC_022115.1_3063492_3064446_-	cd07326, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|138aa|down_1|NC_022115.1_3064471_3064885_-	COG3682, COG3682, Predicted transcriptional regulator [Transcription]	NA|858aa|down_2|NC_022115.1_3065076_3067650_+	pfam09924, DUF2156, Uncharacterized conserved protein (DUF2156)	NA|284aa|down_3|NC_022115.1_3067660_3068512_-	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|364aa|down_4|NC_022115.1_3068508_3069600_-	COG4779, FepG, ABC-type enterobactin transport system, permease component [Inorganic ion transport and metabolism]	NA|487aa|down_5|NC_022115.1_3069596_3071057_-	PRK10441, PRK10441, Fe(3+)-siderophore ABC transporter permease	NA|261aa|down_6|NC_022115.1_3071259_3072042_+	NA	NA|143aa|down_7|NC_022115.1_3072278_3072707_+	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|472aa|down_8|NC_022115.1_3072794_3074210_-	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	csa3|255aa|down_9|NC_022115.1_3074653_3075418_+	cd08893, SRPBCC_CalC_Aha1-like_GntR-HTH, Putative hydrophobic ligand-binding SRPBCC domain of an uncharacterized subgroup of CalC- and Aha1-like proteins; some contain an N-terminal GntR family winged HTH DNA-binding domain
GCF_000454045.1_ASM45404v1	NC_022115	Rhodococcus erythropolis CCM2595, complete sequence	4	3152160-3152239	4	CRISPRCasFinder	no	WYL,cas4	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	Unclear	CCCCACGGCTGTTGCTGGCCGGG	23	0	0	NA	NA	NA	1	1	Unclear	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	NA,NA	NA|120aa|up_9|NC_022115.1_3143139_3143499_-	cd07238, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|214aa|up_8|NC_022115.1_3143509_3144151_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|322aa|up_7|NC_022115.1_3144310_3145276_-	cd06225, HAMP, Histidine kinase, Adenylyl cyclase, Methyl-accepting protein, and Phosphatase (HAMP) domain	NA|249aa|up_6|NC_022115.1_3145376_3146123_-	PRK08057, PRK08057, cobalt-precorrin-6x reductase; Reviewed	NA|250aa|up_5|NC_022115.1_3146146_3146896_-	COG2875, CobM, Precorrin-4 methylase [Coenzyme metabolism]	NA|428aa|up_4|NC_022115.1_3146892_3148176_-	COG2242, CobL, Precorrin-6B methylase 2 [Coenzyme metabolism]	NA|254aa|up_3|NC_022115.1_3148172_3148934_-	PRK05599, PRK05599, SDR family oxidoreductase	NA|142aa|up_2|NC_022115.1_3148964_3149390_+	TIGR03618, Rv1155_F420, PPOX class probable F420-dependent enzyme	NA|379aa|up_1|NC_022115.1_3149390_3150527_-	COG0006, PepP, Xaa-Pro aminopeptidase [Amino acid transport and metabolism]	NA|322aa|up_0|NC_022115.1_3150564_3151530_+	smart00475, 53EXOc, 5'-3' exonuclease	NA|904aa|down_0|NC_022115.1_3152432_3155144_-	COG4581, COG4581, Superfamily II RNA helicase [DNA replication, recombination, and repair]	NA|343aa|down_1|NC_022115.1_3155192_3156221_-	pfam00902, TatC, Sec-independent protein translocase protein (TatC)	NA|92aa|down_2|NC_022115.1_3156294_3156570_-	PRK00575, tatA, Sec-independent protein translocase subunit TatA	WYL|326aa|down_3|NC_022115.1_3156704_3157682_-	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	WYL|338aa|down_4|NC_022115.1_3157681_3158695_-	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|448aa|down_5|NC_022115.1_3158805_3160149_-	TIGR03686, pupylate_PafA, Pup--protein ligase	NA|488aa|down_6|NC_022115.1_3160263_3161727_+	smart00046, DAGKc, Diacylglycerol kinase catalytic domain (presumed)	NA|256aa|down_7|NC_022115.1_3161764_3162532_-	TIGR03691, 20S_bact_alpha, proteasome, alpha subunit, bacterial type	NA|293aa|down_8|NC_022115.1_3162528_3163407_-	TIGR03690, 20S_bact_beta, proteasome, beta subunit, bacterial type	NA|65aa|down_9|NC_022115.1_3163403_3163598_-	pfam05639, Pup, Pup-like protein
GCF_000454045.1_ASM45404v1	NC_022115	Rhodococcus erythropolis CCM2595, complete sequence	5	4195851-4195987	5	CRISPRCasFinder	no		cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	Orphan	TGATCGTGCGAACGGTACGTTCGGTCGATCTGAGGCGACG	40	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	NA,NA|94aa|down_4|NC_022115.1_4211397_4211679_-,NA|142aa|down_6|NC_022115.1_4215793_4216219_-,NA|257aa|down_8|NC_022115.1_4217352_4218123_-,NA|410aa|down_9|NC_022115.1_4218259_4219489_-	NA|472aa|up_9|NC_022115.1_4181528_4182944_+	pfam11380, Stealth_CR2, Stealth protein CR2, conserved region 2	NA|456aa|up_8|NC_022115.1_4182995_4184363_-	TIGR03860, FMN_nitrolo, FMN-dependent oxidoreductase, nitrilotriacetate monooxygenase family	NA|449aa|up_7|NC_022115.1_4184383_4185730_-	TIGR03860, FMN_nitrolo, FMN-dependent oxidoreductase, nitrilotriacetate monooxygenase family	NA|302aa|up_6|NC_022115.1_4185904_4186810_-	cd07402, MPP_GpdQ, Enterobacter aerogenes GpdQ and related proteins, metallophosphatase domain	NA|458aa|up_5|NC_022115.1_4187149_4188523_+	pfam00375, SDF, Sodium:dicarboxylate symporter family	NA|120aa|up_4|NC_022115.1_4188695_4189055_+	pfam05901, Excalibur, Excalibur calcium-binding domain	NA|373aa|up_3|NC_022115.1_4189107_4190226_-	COG0627, COG0627, Predicted esterase [General function prediction only]	NA|1061aa|up_2|NC_022115.1_4190337_4193520_+	cd18012, DEXQc_arch_SWI2_SNF2, DEAQ-box helicase domain of archaeal and bacterial SNF2-related proteins	NA|192aa|up_1|NC_022115.1_4193633_4194209_+	pfam02342, TerD, TerD domain	NA|463aa|up_0|NC_022115.1_4194291_4195680_-	pfam07929, PRiA4_ORF3, Plasmid pRiA4b ORF-3-like protein	NA|137aa|down_0|NC_022115.1_4202106_4202517_+	COG3607, COG3607, Predicted lactoylglutathione lyase [General function prediction only]	NA|325aa|down_1|NC_022115.1_4202589_4203564_-	PRK08241, PRK08241, RNA polymerase subunit sigma-70	NA|244aa|down_2|NC_022115.1_4203617_4204349_-	pfam08445, FR47, FR47-like protein	NA|1828aa|down_3|NC_022115.1_4205844_4211328_+	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|94aa|down_4|NC_022115.1_4211397_4211679_-	NA	NA|1368aa|down_5|NC_022115.1_4211690_4215794_-	cd07539, P-type_ATPase, uncharacterized subfamily of P-type ATPase transporters	NA|142aa|down_6|NC_022115.1_4215793_4216219_-	NA	NA|316aa|down_7|NC_022115.1_4216402_4217350_+	cd19081, AKR_AKR9C1, AKR9C family of aldo-keto reductase (AKR)	NA|257aa|down_8|NC_022115.1_4217352_4218123_-	NA	NA|410aa|down_9|NC_022115.1_4218259_4219489_-	NA
GCF_000454045.1_ASM45404v1	NC_022115	Rhodococcus erythropolis CCM2595, complete sequence	6	6234301-6234443	6	CRISPRCasFinder	no		cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	Orphan	AGATCCGCGTTGCGGAGGTTCGC	23	0	0	NA	NA	NA	2	2	Orphan	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	NA|109aa|up_9|NC_022115.1_6224072_6224399_+,NA|227aa|up_3|NC_022115.1_6228693_6229374_-,NA|148aa|up_2|NC_022115.1_6230169_6230613_+,NA|66aa|up_1|NC_022115.1_6230713_6230911_+,NA|199aa|down_2|NC_022115.1_6239959_6240556_-	NA|109aa|up_9|NC_022115.1_6224072_6224399_+	NA	NA|265aa|up_8|NC_022115.1_6224387_6225182_-	pfam04454, Linocin_M18, Encapsulating protein for peroxidase	NA|353aa|up_7|NC_022115.1_6225178_6226237_-	pfam04261, Dyp_perox, Dyp-type peroxidase family	NA|319aa|up_6|NC_022115.1_6226305_6227262_-	pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region	NA|319aa|up_5|NC_022115.1_6227295_6228252_-	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|125aa|up_4|NC_022115.1_6228253_6228628_-	cd00781, ketosteroid_isomerase, ketosteroid isomerase: Many biological reactions proceed by enzymatic cleavage of a C-H bond adjacent to carbonyl or a carboxyl group, leading to an enol or a enolate intermediate that is subsequently re-protonated at the same or an adjacent carbon	NA|227aa|up_3|NC_022115.1_6228693_6229374_-	NA	NA|148aa|up_2|NC_022115.1_6230169_6230613_+	NA	NA|66aa|up_1|NC_022115.1_6230713_6230911_+	NA	NA|531aa|up_0|NC_022115.1_6232393_6233986_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|588aa|down_0|NC_022115.1_6236328_6238092_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|502aa|down_1|NC_022115.1_6238171_6239677_-	cd06179, MFS_TRI12_like, Fungal trichothecene efflux pump (TRI12) of the Major Facilitator Superfamily of transporters	NA|199aa|down_2|NC_022115.1_6239959_6240556_-	NA	NA|368aa|down_3|NC_022115.1_6240781_6241885_-	PRK04247, PRK04247, endonuclease NucS	NA|949aa|down_4|NC_022115.1_6242029_6244876_-	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|165aa|down_5|NC_022115.1_6244977_6245472_+	pfam13630, SdpI, SdpI/YfhL protein family	NA|316aa|down_6|NC_022115.1_6245489_6246437_+	pfam07853, DUF1648, Protein of unknown function (DUF1648)	NA|232aa|down_7|NC_022115.1_6246405_6247101_-	COG2186, FadR, Transcriptional regulators [Transcription]	NA|390aa|down_8|NC_022115.1_6247263_6248433_+	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|327aa|down_9|NC_022115.1_6248437_6249418_+	pfam13593, SBF_like, SBF-like CPA transporter family (DUF4137)
