assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002501585.1_ASM250158v1	NZ_CP023720	Rhodococcus sp. H-CA8f chromosome, complete genome	1	214611-214724	1	CRISPRCasFinder	no		csa3,RT,Cas9_archaeal,WYL,DEDDh,cas3,DinG,cas4	Orphan	CAGGGTCGTCCGCCGCAAGCCCGACCT	27	0	0	NA	NA	NA	1	1	Orphan	csa3,RT,Cas9_archaeal,WYL,DEDDh,cas3,DinG,cas4,c2c10_CAS-V-U3,cas6e,cas5,cas7,cas8e,csf1gr8,csf4gr11,csf2gr7,csf3gr5	NA,NA	NA|236aa|up_9|NZ_CP023720.1_204076_204784_-	pfam12900, Pyridox_ox_2, Pyridoxamine 5'-phosphate oxidase	NA|462aa|up_8|NZ_CP023720.1_204881_206267_+	COG1167, ARO8, Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism]	NA|706aa|up_7|NZ_CP023720.1_206326_208444_+	cd02803, OYE_like_FMN_family, Old yellow enzyme (OYE)-like FMN binding domain	NA|340aa|up_6|NZ_CP023720.1_208452_209472_-	COG3491, PcbC, Isopenicillin N synthase and related dioxygenases [General function prediction only]	NA|368aa|up_5|NZ_CP023720.1_209681_210785_-	TIGR03450, mycothiol_INO1, inositol 1-phosphate synthase, Actinobacterial type	NA|188aa|up_4|NZ_CP023720.1_210777_211341_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|257aa|up_3|NZ_CP023720.1_211457_212228_-	pfam08044, DUF1707, Domain of unknown function (DUF1707)	NA|274aa|up_2|NZ_CP023720.1_212237_213059_-	pfam08044, DUF1707, Domain of unknown function (DUF1707)	NA|280aa|up_1|NZ_CP023720.1_213068_213908_-	pfam08044, DUF1707, Domain of unknown function (DUF1707)	NA|140aa|up_0|NZ_CP023720.1_214036_214456_+	pfam17249, DUF5318, Family of unknown function (DUF5318)	NA|716aa|down_0|NZ_CP023720.1_215033_217181_+	COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding protein) [Cell envelope biogenesis, outer membrane]	NA|526aa|down_1|NZ_CP023720.1_217220_218798_+	COG5650, COG5650, Predicted integral membrane protein [Function unknown]	NA|100aa|down_2|NZ_CP023720.1_218800_219100_-	COG2771, CsgD, DNA-binding HTH domain-containing proteins [Transcription]	NA|1281aa|down_3|NZ_CP023720.1_219287_223130_+	COG2319, COG2319, FOG: WD40 repeat [General function prediction only]	NA|260aa|down_4|NZ_CP023720.1_223136_223916_-	pfam13828, DUF4190, Domain of unknown function (DUF4190)	NA|195aa|down_5|NZ_CP023720.1_223961_224546_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|452aa|down_6|NZ_CP023720.1_224545_225901_-	COG5650, COG5650, Predicted integral membrane protein [Function unknown]	NA|96aa|down_7|NZ_CP023720.1_226111_226399_+	PRK00453, rpsF, 30S ribosomal protein S6; Reviewed	NA|175aa|down_8|NZ_CP023720.1_226463_226988_+	PRK07772, PRK07772, single-stranded DNA-binding protein; Provisional	NA|81aa|down_9|NZ_CP023720.1_227027_227270_+	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed
GCF_002501585.1_ASM250158v1	NZ_CP023720	Rhodococcus sp. H-CA8f chromosome, complete genome	2	4014543-4014628	2	CRISPRCasFinder	no		csa3,RT,Cas9_archaeal,WYL,DEDDh,cas3,DinG,cas4	Orphan	CGGCGGCGGCGCAGCATCCGGTG	23	0	0	NA	NA	NA	1	1	Orphan	csa3,RT,Cas9_archaeal,WYL,DEDDh,cas3,DinG,cas4,c2c10_CAS-V-U3,cas6e,cas5,cas7,cas8e,csf1gr8,csf4gr11,csf2gr7,csf3gr5	NA,NA|117aa|down_1|NZ_CP023720.1_4015338_4015689_+	NA|547aa|up_9|NZ_CP023720.1_4001127_4002768_+	COG4799, COG4799, Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism]	NA|114aa|up_8|NZ_CP023720.1_4002764_4003106_+	pfam13822, ACC_epsilon, Acyl-CoA carboxylase epsilon subunit	NA|214aa|up_7|NZ_CP023720.1_4003121_4003763_+	PRK00148, PRK00148, Maf-like protein; Reviewed	NA|435aa|up_6|NZ_CP023720.1_4003886_4005191_-	pfam02720, DUF222, Domain of unknown function (DUF222)	NA|566aa|up_5|NZ_CP023720.1_4005255_4006953_-	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|498aa|up_4|NZ_CP023720.1_4007103_4008597_+	pfam00668, Condensation, Condensation domain	NA|488aa|up_3|NZ_CP023720.1_4008596_4010060_+	pfam00668, Condensation, Condensation domain	NA|302aa|up_2|NZ_CP023720.1_4010178_4011084_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|139aa|up_1|NZ_CP023720.1_4011080_4011497_+	pfam02657, SufE, Fe-S metabolism associated domain	NA|467aa|up_0|NZ_CP023720.1_4011501_4012902_+	pfam00668, Condensation, Condensation domain	NA|142aa|down_0|NZ_CP023720.1_4014913_4015339_+	pfam08044, DUF1707, Domain of unknown function (DUF1707)	NA|117aa|down_1|NZ_CP023720.1_4015338_4015689_+	NA	NA|121aa|down_2|NZ_CP023720.1_4015685_4016048_+	COG5652, COG5652, Predicted integral membrane protein [Function unknown]	NA|267aa|down_3|NZ_CP023720.1_4016048_4016849_-	pfam13350, Y_phosphatase3, Tyrosine phosphatase family	NA|173aa|down_4|NZ_CP023720.1_4016965_4017484_+	PRK13798, PRK13798, putative OHCU decarboxylase; Provisional	NA|110aa|down_5|NZ_CP023720.1_4017480_4017810_+	pfam00576, Transthyretin, HIUase/Transthyretin family	NA|326aa|down_6|NZ_CP023720.1_4017814_4018792_+	TIGR03383, Structure_Of_Uricase, urate oxidase	NA|457aa|down_7|NZ_CP023720.1_4018788_4020159_+	PRK08203, PRK08203, hydroxydechloroatrazine ethylaminohydrolase; Reviewed	NA|270aa|down_8|NZ_CP023720.1_4020166_4020976_+	pfam00941, FAD_binding_5, FAD binding domain in molybdopterin dehydrogenase	NA|904aa|down_9|NZ_CP023720.1_4020972_4023684_+	PRK09800, PRK09800, putative hypoxanthine oxidase; Provisional
GCF_002501585.1_ASM250158v1	NZ_CP023720	Rhodococcus sp. H-CA8f chromosome, complete genome	3	5836601-5836692	3	CRISPRCasFinder	no		csa3,RT,Cas9_archaeal,WYL,DEDDh,cas3,DinG,cas4	Orphan	CCGCAGGGTGGAAGTTATCCCCC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,RT,Cas9_archaeal,WYL,DEDDh,cas3,DinG,cas4,c2c10_CAS-V-U3,cas6e,cas5,cas7,cas8e,csf1gr8,csf4gr11,csf2gr7,csf3gr5	NA,NA	NA|260aa|up_9|NZ_CP023720.1_5827281_5828061_+	pfam12833, HTH_18, Helix-turn-helix domain	NA|430aa|up_8|NZ_CP023720.1_5828083_5829373_-	cd17368, MFS_CitA, Citrate-proton symporter of the Major Facilitator Superfamily of transporters	NA|396aa|up_7|NZ_CP023720.1_5829399_5830587_-	PRK07522, PRK07522, acetylornithine deacetylase; Provisional	NA|364aa|up_6|NZ_CP023720.1_5830674_5831766_+	COG1522, Lrp, Transcriptional regulators [Transcription]	NA|187aa|up_5|NZ_CP023720.1_5831817_5832378_+	TIGR03086, TIGR03086, TIGR03086 family protein	NA|203aa|up_4|NZ_CP023720.1_5832429_5833038_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|299aa|up_3|NZ_CP023720.1_5833039_5833936_+	PRK05710, PRK05710, tRNA glutamyl-Q(34) synthetase GluQRS	NA|247aa|up_2|NZ_CP023720.1_5834191_5834932_+	cd01948, EAL, EAL domain	NA|218aa|up_1|NZ_CP023720.1_5834960_5835614_-	pfam13828, DUF4190, Domain of unknown function (DUF4190)	NA|145aa|up_0|NZ_CP023720.1_5836032_5836467_+	pfam06271, RDD, RDD family	NA|205aa|down_0|NZ_CP023720.1_5836807_5837422_+	COG5473, COG5473, Predicted integral membrane protein [Function unknown]	NA|245aa|down_1|NZ_CP023720.1_5837539_5838274_+	pfam02592, Vut_1, Putative vitamin uptake transporter	NA|430aa|down_2|NZ_CP023720.1_5838264_5839554_-	PRK00112, tgt, queuine tRNA-ribosyltransferase; Provisional	NA|264aa|down_3|NZ_CP023720.1_5839745_5840537_+	PRK06688, PRK06688, enoyl-CoA hydratase; Provisional	NA|324aa|down_4|NZ_CP023720.1_5840563_5841535_+	COG0715, TauA, ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components [Inorganic ion transport and metabolism]	NA|780aa|down_5|NZ_CP023720.1_5841651_5843991_+	pfam02515, CoA_transf_3, CoA-transferase family III	NA|316aa|down_6|NZ_CP023720.1_5844041_5844989_+	COG0604, Qor, NADPH:quinone reductase and related Zn-dependent oxidoreductases [Energy production and conversion / General function prediction only]	NA|508aa|down_7|NZ_CP023720.1_5845004_5846528_+	PRK06155, PRK06155, crotonobetaine/carnitine-CoA ligase; Provisional	NA|301aa|down_8|NZ_CP023720.1_5846539_5847442_-	COG1414, IclR, Transcriptional regulator [Transcription]	NA|275aa|down_9|NZ_CP023720.1_5847609_5848434_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]
GCF_002501585.1_ASM250158v1	NZ_CP023720	Rhodococcus sp. H-CA8f chromosome, complete genome	4	6096601-6096705	4	CRISPRCasFinder	no		csa3,RT,Cas9_archaeal,WYL,DEDDh,cas3,DinG,cas4	Orphan	CTCGTCGACAACCCGTTCACCTTC	24	0	0	NA	NA	NA	1	1	Orphan	csa3,RT,Cas9_archaeal,WYL,DEDDh,cas3,DinG,cas4,c2c10_CAS-V-U3,cas6e,cas5,cas7,cas8e,csf1gr8,csf4gr11,csf2gr7,csf3gr5	NA|416aa|up_8|NZ_CP023720.1_6087860_6089108_+,NA|151aa|up_6|NZ_CP023720.1_6090826_6091279_+,NA	NA|196aa|up_9|NZ_CP023720.1_6087254_6087842_-	pfam04978, DUF664, Protein of unknown function (DUF664)	NA|416aa|up_8|NZ_CP023720.1_6087860_6089108_+	NA	NA|549aa|up_7|NZ_CP023720.1_6089176_6090823_+	cd17321, MFS_MMR_MDR_like, Methylenomycin A resistance protein (also called MMR peptide) and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|151aa|up_6|NZ_CP023720.1_6090826_6091279_+	NA	NA|98aa|up_5|NZ_CP023720.1_6091275_6091569_+	pfam09851, SHOCT, Short C-terminal domain	NA|229aa|up_4|NZ_CP023720.1_6091570_6092257_-	TIGR03605, antibiot_sagB, SagB-type dehydrogenase domain	NA|327aa|up_3|NZ_CP023720.1_6092253_6093234_-	cd01148, TroA_a, Metal binding protein TroA_a	NA|265aa|up_2|NZ_CP023720.1_6093230_6094025_-	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|356aa|up_1|NZ_CP023720.1_6094021_6095089_-	pfam01032, FecCD, FecCD transport family	NA|420aa|up_0|NZ_CP023720.1_6095173_6096433_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|271aa|down_0|NZ_CP023720.1_6096808_6097621_+	TIGR03882, hypothetical_protein, bacteriocin biosynthesis cyclodehydratase domain	NA|476aa|down_1|NZ_CP023720.1_6097617_6099045_+	COG2936, COG2936, Predicted acyl esterases [General function prediction only]	NA|447aa|down_2|NZ_CP023720.1_6099037_6100378_+	TIGR03604, hypothetical_protein, thiazole/oxazole-forming peptide maturase, SagD family component	NA|431aa|down_3|NZ_CP023720.1_6100374_6101667_+	TIGR03604, hypothetical_protein, thiazole/oxazole-forming peptide maturase, SagD family component	NA|324aa|down_4|NZ_CP023720.1_6101613_6102585_-	COG1748, LYS9, Saccharopine dehydrogenase and related proteins [Amino acid transport and metabolism]	NA|209aa|down_5|NZ_CP023720.1_6102636_6103263_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|147aa|down_6|NZ_CP023720.1_6103269_6103710_-	pfam13426, PAS_9, PAS domain	NA|247aa|down_7|NZ_CP023720.1_6103821_6104562_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|374aa|down_8|NZ_CP023720.1_6104558_6105680_-	COG0577, SalY, ABC-type antimicrobial peptide transport system, permease component [Defense mechanisms]	NA|401aa|down_9|NZ_CP023720.1_6105719_6106922_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]
GCF_002501585.1_ASM250158v1	NZ_CP023721	Rhodococcus sp. H-CA8f plasmid unnamed, complete sequence	1	84720-85176	1	CRISPRCasFinder	no	c2c10_CAS-V-U3,cas6e,cas5,cas7,cas8e,DEDDh	c2c10_CAS-V-U3,cas6e,cas5,cas7,cas8e,DEDDh,RT,csf1gr8,csf4gr11,csf2gr7,csf3gr5	Type I-E	TCACCCCCGAGCACGCGGGGATCAACCCG	29	0	0	NA	NA	NA	7	7	TypeI-E	csa3,RT,Cas9_archaeal,WYL,DEDDh,cas3,DinG,cas4,c2c10_CAS-V-U3,cas6e,cas5,cas7,cas8e,csf1gr8,csf4gr11,csf2gr7,csf3gr5	NA|66aa|up_9|NZ_CP023721.1_75025_75223_-,NA|222aa|up_8|NZ_CP023721.1_75352_76018_+,NA|125aa|up_7|NZ_CP023721.1_76310_76685_+,NA|215aa|up_6|NZ_CP023721.1_76905_77550_+,NA|81aa|down_0|NZ_CP023721.1_85980_86223_-,NA|107aa|down_1|NZ_CP023721.1_86344_86665_-,NA|72aa|down_2|NZ_CP023721.1_87239_87455_-,NA|78aa|down_3|NZ_CP023721.1_87470_87704_-,NA|170aa|down_4|NZ_CP023721.1_87756_88266_-,NA|120aa|down_5|NZ_CP023721.1_88267_88627_-,NA|89aa|down_6|NZ_CP023721.1_89032_89299_-,NA|105aa|down_7|NZ_CP023721.1_89516_89831_+	NA|66aa|up_9|NZ_CP023721.1_75025_75223_-	NA	NA|222aa|up_8|NZ_CP023721.1_75352_76018_+	NA	NA|125aa|up_7|NZ_CP023721.1_76310_76685_+	NA	NA|215aa|up_6|NZ_CP023721.1_76905_77550_+	NA	cas6e|231aa|up_5|NZ_CP023721.1_78012_78705_-	cd09727, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas5|270aa|up_4|NZ_CP023721.1_78701_79511_-	pfam09704, Cas_Cas5d, CRISPR-associated protein (Cas_Cas5)	cas7|389aa|up_3|NZ_CP023721.1_79507_80674_-	pfam09344, Cas_CT1975, CT1975-like protein	NA|185aa|up_2|NZ_CP023721.1_80709_81264_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|457aa|up_1|NZ_CP023721.1_81263_82634_-	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	NA|668aa|up_0|NZ_CP023721.1_82670_84674_+	COG4889, COG4889, Predicted helicase [General function prediction only]	NA|81aa|down_0|NZ_CP023721.1_85980_86223_-	NA	NA|107aa|down_1|NZ_CP023721.1_86344_86665_-	NA	NA|72aa|down_2|NZ_CP023721.1_87239_87455_-	NA	NA|78aa|down_3|NZ_CP023721.1_87470_87704_-	NA	NA|170aa|down_4|NZ_CP023721.1_87756_88266_-	NA	NA|120aa|down_5|NZ_CP023721.1_88267_88627_-	NA	NA|89aa|down_6|NZ_CP023721.1_89032_89299_-	NA	NA|105aa|down_7|NZ_CP023721.1_89516_89831_+	NA	NA|164aa|down_8|NZ_CP023721.1_89968_90460_-	pfam13384, HTH_23, Homeodomain-like domain	NA|878aa|down_9|NZ_CP023721.1_90802_93436_-	cd18010, DEXHc_HARP_SMARCAL1, DEXH-box helicase domain of SMARCAL1
GCF_002501585.1_ASM250158v1	NZ_CP023721	Rhodococcus sp. H-CA8f plasmid unnamed, complete sequence	2	110708-110867	2	CRISPRCasFinder	no	DEDDh	c2c10_CAS-V-U3,cas6e,cas5,cas7,cas8e,DEDDh,RT,csf1gr8,csf4gr11,csf2gr7,csf3gr5	Unclear	CGGCTGCTCTACGTACGACTGCTCC	25	0	0	NA	NA	NA	3	3	Orphan	csa3,RT,Cas9_archaeal,WYL,DEDDh,cas3,DinG,cas4,c2c10_CAS-V-U3,cas6e,cas5,cas7,cas8e,csf1gr8,csf4gr11,csf2gr7,csf3gr5	NA|197aa|up_9|NZ_CP023721.1_100432_101023_+,NA|95aa|up_7|NZ_CP023721.1_102212_102497_-,NA|100aa|up_6|NZ_CP023721.1_102535_102835_-,NA|105aa|up_4|NZ_CP023721.1_104004_104319_-,NA|97aa|up_0|NZ_CP023721.1_109216_109507_-,NA|130aa|down_2|NZ_CP023721.1_116220_116610_-,NA|105aa|down_3|NZ_CP023721.1_116616_116931_-,NA|228aa|down_4|NZ_CP023721.1_117717_118401_+	NA|197aa|up_9|NZ_CP023721.1_100432_101023_+	NA	NA|306aa|up_8|NZ_CP023721.1_101034_101952_+	pfam14011, ESX-1_EspG, EspG family	NA|95aa|up_7|NZ_CP023721.1_102212_102497_-	NA	NA|100aa|up_6|NZ_CP023721.1_102535_102835_-	NA	NA|365aa|up_5|NZ_CP023721.1_102899_103994_-	pfam00823, PPE, PPE family	NA|105aa|up_4|NZ_CP023721.1_104004_104319_-	NA	NA|514aa|up_3|NZ_CP023721.1_104814_106356_-	TIGR03923, T7SS_EccE, type VII secretion protein EccE	NA|456aa|up_2|NZ_CP023721.1_106352_107720_-	TIGR03921, T7SS_mycosin, type VII secretion-associated serine protease mycosin	NA|493aa|up_1|NZ_CP023721.1_107719_109198_-	TIGR03920, T7SS_EccD, type VII secretion integral membrane protein EccD	NA|97aa|up_0|NZ_CP023721.1_109216_109507_-	NA	NA|738aa|down_0|NZ_CP023721.1_111391_113605_-	cd05844, GT4-like, glycosyltransferase family 4 proteins	NA|866aa|down_1|NZ_CP023721.1_113597_116195_-	pfam12846, AAA_10, AAA-like domain	NA|130aa|down_2|NZ_CP023721.1_116220_116610_-	NA	NA|105aa|down_3|NZ_CP023721.1_116616_116931_-	NA	NA|228aa|down_4|NZ_CP023721.1_117717_118401_+	NA	NA|319aa|down_5|NZ_CP023721.1_118397_119354_-	pfam12642, TpcC, Conjugative transposon protein TcpC	NA|502aa|down_6|NZ_CP023721.1_119403_120909_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|617aa|down_7|NZ_CP023721.1_120925_122776_+	TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA	NA|485aa|down_8|NZ_CP023721.1_122854_124309_+	pfam05108, T7SS_ESX1_EccB, Type VII secretion system ESX-1, transport TM domain B	NA|1381aa|down_9|NZ_CP023721.1_124443_128586_+	TIGR03924, T7SS_EccC_a, type VII secretion protein EccCa
