assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_011046535.1_ASM1104653v1	NZ_CP049257	Nocardioides sp. R-3366 chromosome, complete genome	1	1516134-1516207	1	CRISPRCasFinder	no	csa3	DinG,WYL,csa3,DEDDh,cas4,cas3,casR	Type I-A	CGCCGCGCCGCCGCGGTCGGCCGGCGC	27	0	0	NA	NA	NA	1	1	Orphan	DinG,WYL,csa3,DEDDh,cas4,cas3,casR	NA,NA|494aa|down_1|NZ_CP049257.1_1517413_1518895_+,NA|139aa|down_2|NZ_CP049257.1_1518899_1519316_-,NA|152aa|down_5|NZ_CP049257.1_1520482_1520938_+,NA|83aa|down_6|NZ_CP049257.1_1522845_1523094_+,NA|79aa|down_7|NZ_CP049257.1_1523135_1523372_-,NA|132aa|down_8|NZ_CP049257.1_1523567_1523963_+	NA|318aa|up_9|NZ_CP049257.1_1504462_1505416_-	PRK05901, PRK05901, RNA polymerase sigma factor; Provisional	NA|183aa|up_8|NZ_CP049257.1_1505650_1506199_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|240aa|up_7|NZ_CP049257.1_1506282_1507002_+	PRK08264, PRK08264, SDR family oxidoreductase	NA|716aa|up_6|NZ_CP049257.1_1507011_1509159_-	COG1770, PtrB, Protease II [Amino acid transport and metabolism]	NA|268aa|up_5|NZ_CP049257.1_1509097_1509901_+	pfam03746, LamB_YcsF, LamB/YcsF family	NA|213aa|up_4|NZ_CP049257.1_1510747_1511386_+	pfam02682, CT_C_D, Carboxyltransferase domain, subdomain C and D	NA|282aa|up_3|NZ_CP049257.1_1511389_1512235_+	pfam02626, CT_A_B, Carboxyltransferase domain, subdomain A and B	NA|337aa|up_2|NZ_CP049257.1_1512221_1513232_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|445aa|up_1|NZ_CP049257.1_1513274_1514609_-	PRK03348, PRK03348, DNA polymerase IV; Provisional	NA|302aa|up_0|NZ_CP049257.1_1514736_1515642_+	pfam13830, DUF4192, Domain of unknown function (DUF4192)	NA|371aa|down_0|NZ_CP049257.1_1516291_1517404_+	PRK13517, PRK13517, glutamate--cysteine ligase	NA|494aa|down_1|NZ_CP049257.1_1517413_1518895_+	NA	NA|139aa|down_2|NZ_CP049257.1_1518899_1519316_-	NA	NA|133aa|down_3|NZ_CP049257.1_1519441_1519840_+	cd00293, USP_Like, Usp: Universal stress protein family	NA|192aa|down_4|NZ_CP049257.1_1519874_1520450_+	TIGR03252, TIGR03252, uncharacterized HhH-GPD family protein	NA|152aa|down_5|NZ_CP049257.1_1520482_1520938_+	NA	NA|83aa|down_6|NZ_CP049257.1_1522845_1523094_+	NA	NA|79aa|down_7|NZ_CP049257.1_1523135_1523372_-	NA	NA|132aa|down_8|NZ_CP049257.1_1523567_1523963_+	NA	NA|152aa|down_9|NZ_CP049257.1_1524135_1524591_+	cd08898, SRPBCC_CalC_Aha1-like_5, Putative hydrophobic ligand-binding SRPBCC domain of an uncharacterized subgroup of CalC- and Aha1-like proteins
GCF_011046535.1_ASM1104653v1	NZ_CP049257	Nocardioides sp. R-3366 chromosome, complete genome	2	3494493-3494682	1	CRT	no		DinG,WYL,csa3,DEDDh,cas4,cas3,casR	Orphan	GGCCAGGAGCTGGTCGGCCTCG	22	0	0	NA	NA	NA	3	3	Orphan	DinG,WYL,csa3,DEDDh,cas4,cas3,casR	NA|86aa|up_5|NZ_CP049257.1_3488586_3488844_+,NA|107aa|down_4|NZ_CP049257.1_3498026_3498347_-,NA|74aa|down_9|NZ_CP049257.1_3501265_3501487_-	NA|402aa|up_9|NZ_CP049257.1_3483597_3484803_-	PRK08198, PRK08198, threonine dehydratase; Provisional	NA|442aa|up_8|NZ_CP049257.1_3484807_3486133_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|442aa|up_7|NZ_CP049257.1_3486129_3487455_-	pfam13941, MutL, MutL protein	NA|386aa|up_6|NZ_CP049257.1_3487451_3488609_-	PRK07811, PRK07811, cystathionine gamma-synthase; Provisional	NA|86aa|up_5|NZ_CP049257.1_3488586_3488844_+	NA	NA|356aa|up_4|NZ_CP049257.1_3488918_3489986_-	cd05240, UDP_G4E_3_SDR_e, UDP-glucose 4 epimerase (G4E), subgroup 3, extended (e) SDRs	NA|215aa|up_3|NZ_CP049257.1_3490118_3490763_+	PRK00058, PRK00058, peptide-methionine (S)-S-oxide reductase MsrA	NA|124aa|up_2|NZ_CP049257.1_3490850_3491222_+	COG3795, COG3795, Uncharacterized protein conserved in bacteria [Function unknown]	NA|204aa|up_1|NZ_CP049257.1_3491218_3491830_-	COG0262, FolA, Dihydrofolate reductase [Coenzyme metabolism]	NA|270aa|up_0|NZ_CP049257.1_3491826_3492636_-	cd07814, SRPBCC_CalC_Aha1-like, Putative hydrophobic ligand-binding SRPBCC domain of Micromonospora echinospora CalC, human Aha1, and related proteins	NA|450aa|down_0|NZ_CP049257.1_3495102_3496452_+	cd13970, ABC1_ADCK3, Activator of bc1 complex (ABC1) kinases, also called aarF domain containing kinase 3	NA|160aa|down_1|NZ_CP049257.1_3496448_3496928_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|119aa|down_2|NZ_CP049257.1_3496936_3497293_-	pfam12680, SnoaL_2, SnoaL-like domain	NA|247aa|down_3|NZ_CP049257.1_3497289_3498030_-	pfam18029, Glyoxalase_6, Glyoxalase-like domain	NA|107aa|down_4|NZ_CP049257.1_3498026_3498347_-	NA	NA|260aa|down_5|NZ_CP049257.1_3498369_3499149_-	PRK10621, PRK10621, hypothetical protein; Provisional	NA|281aa|down_6|NZ_CP049257.1_3499336_3500179_+	cd01144, BtuF, Cobalamin binding protein BtuF	NA|189aa|down_7|NZ_CP049257.1_3500180_3500747_+	pfam13305, WHG, WHG domain	NA|165aa|down_8|NZ_CP049257.1_3500774_3501269_+	pfam16571, FBP_C, FBP C-terminal treble-clef zinc-finger	NA|74aa|down_9|NZ_CP049257.1_3501265_3501487_-	NA
GCF_011046535.1_ASM1104653v1	NZ_CP049257	Nocardioides sp. R-3366 chromosome, complete genome	3	3584050-3584192	2	CRISPRCasFinder	no		DinG,WYL,csa3,DEDDh,cas4,cas3,casR	Orphan	AAGGCCCGCCAGGCGGGGCAGCACCCACAAGCCCGGCAGACCAACCGGG	49	0	0	NA	NA	NA	1	1	Orphan	DinG,WYL,csa3,DEDDh,cas4,cas3,casR	NA,NA|182aa|down_1|NZ_CP049257.1_3585341_3585887_+,NA|222aa|down_2|NZ_CP049257.1_3585897_3586563_+,NA|144aa|down_6|NZ_CP049257.1_3590663_3591095_+	NA|204aa|up_9|NZ_CP049257.1_3572351_3572963_+	PRK05365, PRK05365, malonic semialdehyde reductase; Provisional	NA|790aa|up_8|NZ_CP049257.1_3572988_3575358_+	TIGR02412, Aminopeptidase_N, aminopeptidase N, Streptomyces lividans type	NA|292aa|up_7|NZ_CP049257.1_3575344_3576220_+	pfam13539, Peptidase_M15_4, D-alanyl-D-alanine carboxypeptidase	NA|439aa|up_6|NZ_CP049257.1_3576200_3577517_-	sd00045, ANK, ankyrin repeats	NA|324aa|up_5|NZ_CP049257.1_3577657_3578629_-	PRK03092, PRK03092, ribose-phosphate diphosphokinase	NA|464aa|up_4|NZ_CP049257.1_3578669_3580061_-	PRK14352, glmU, bifunctional UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase GlmU	NA|340aa|up_3|NZ_CP049257.1_3580362_3581382_-	PRK08241, PRK08241, RNA polymerase subunit sigma-70	NA|194aa|up_2|NZ_CP049257.1_3581515_3582097_+	pfam01872, RibD_C, RibD C-terminal domain	NA|472aa|up_1|NZ_CP049257.1_3582093_3583509_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|130aa|up_0|NZ_CP049257.1_3583534_3583924_-	COG0251, TdcF, Putative translation initiation inhibitor, yjgF family [Translation, ribosomal structure and biogenesis]	NA|363aa|down_0|NZ_CP049257.1_3584243_3585332_+	PRK08224, ligC, ATP-dependent DNA ligase; Reviewed	NA|182aa|down_1|NZ_CP049257.1_3585341_3585887_+	NA	NA|222aa|down_2|NZ_CP049257.1_3585897_3586563_+	NA	NA|504aa|down_3|NZ_CP049257.1_3586559_3588071_+	cd03820, GT4_AmsD-like, amylovoran biosynthesis glycosyltransferase AmsD and similar proteins	NA|436aa|down_4|NZ_CP049257.1_3587887_3589195_+	pfam11380, Stealth_CR2, Stealth protein CR2, conserved region 2	NA|364aa|down_5|NZ_CP049257.1_3589531_3590623_-	cd04865, LigD_Pol_like_2, LigD_Pol_like_2: Polymerase (Pol) domain of bacterial LigD proteins similar to Pseudomonas aeruginosa (Pae) LigD, subgroup 2	NA|144aa|down_6|NZ_CP049257.1_3590663_3591095_+	NA	NA|376aa|down_7|NZ_CP049257.1_3591120_3592248_+	COG1887, TagB, Putative glycosyl/glycerophosphate transferases involved in teichoic acid biosynthesis TagF/TagB/EpsJ/RodC [Cell envelope biogenesis, outer membrane]	NA|529aa|down_8|NZ_CP049257.1_3592197_3593784_-	COG1233, COG1233, Phytoene dehydrogenase and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|400aa|down_9|NZ_CP049257.1_3593791_3594991_-	COG0404, GcvT, Glycine cleavage system T protein (aminomethyltransferase) [Amino acid transport and metabolism]
GCF_011046535.1_ASM1104653v1	NZ_CP049257	Nocardioides sp. R-3366 chromosome, complete genome	4	4801905-4802012	3	CRISPRCasFinder	no		DinG,WYL,csa3,DEDDh,cas4,cas3,casR	Orphan	GGTCGGGCTGCTGCTGGCCGGGGG	24	0	0	NA	NA	NA	1	1	Orphan	DinG,WYL,csa3,DEDDh,cas4,cas3,casR	NA|224aa|up_5|NZ_CP049257.1_4795304_4795976_-,NA	NA|326aa|up_9|NZ_CP049257.1_4790551_4791529_-	cd05265, SDR_a1, atypical (a) SDRs, subgroup 1	NA|189aa|up_8|NZ_CP049257.1_4791673_4792240_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|513aa|up_7|NZ_CP049257.1_4792247_4793786_+	cd09603, M1_APN_like, Peptidase M1 family similar to aminopeptidase N catalytic domain	NA|492aa|up_6|NZ_CP049257.1_4793832_4795308_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|224aa|up_5|NZ_CP049257.1_4795304_4795976_-	NA	NA|480aa|up_4|NZ_CP049257.1_4796109_4797549_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|175aa|up_3|NZ_CP049257.1_4797536_4798061_+	cd08899, SRPBCC_CalC_Aha1-like_6, Putative hydrophobic ligand-binding SRPBCC domain of an uncharacterized subgroup of CalC- and Aha1-like proteins	NA|236aa|up_2|NZ_CP049257.1_4798057_4798765_-	cd12108, Hr-like, Hemerythrin-like domain	NA|503aa|up_1|NZ_CP049257.1_4798856_4800365_+	cd01300, YtcJ_like, YtcJ_like metal dependent amidohydrolases	NA|275aa|up_0|NZ_CP049257.1_4800353_4801178_-	cd01823, SEST_like, SEST_like	NA|382aa|down_0|NZ_CP049257.1_4802311_4803457_-	COG4941, COG4941, Predicted RNA polymerase sigma factor containing a TPR repeat domain [Transcription]	NA|135aa|down_1|NZ_CP049257.1_4803467_4803872_-	COG3795, COG3795, Uncharacterized protein conserved in bacteria [Function unknown]	NA|166aa|down_2|NZ_CP049257.1_4803885_4804383_-	TIGR03618, Rv1155_F420, PPOX class probable F420-dependent enzyme	NA|374aa|down_3|NZ_CP049257.1_4804480_4805602_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|429aa|down_4|NZ_CP049257.1_4805682_4806969_+	cd14747, PBP2_MalE, Maltose-binding protein MalE; possesses type 2 periplasmic binding fold	NA|318aa|down_5|NZ_CP049257.1_4807093_4808047_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|250aa|down_6|NZ_CP049257.1_4808140_4808890_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|445aa|down_7|NZ_CP049257.1_4808886_4810221_+	pfam00933, Glyco_hydro_3, Glycosyl hydrolase family 3 N terminal domain	NA|290aa|down_8|NZ_CP049257.1_4810229_4811099_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|192aa|down_9|NZ_CP049257.1_4811935_4812511_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family
GCF_011046535.1_ASM1104653v1	NZ_CP049257	Nocardioides sp. R-3366 chromosome, complete genome	5	5031231-5031469	1	PILER-CR	no		DinG,WYL,csa3,DEDDh,cas4,cas3,casR	Orphan	GGGGGGCCCGGCCCCCATCGTCGAGCGGGCGGGTCCCCAGAGAC	44	0	0	NA	NA	NA	2	2	Orphan	DinG,WYL,csa3,DEDDh,cas4,cas3,casR	NA,NA|148aa|down_8|NZ_CP049257.1_5038662_5039106_-	NA|424aa|up_9|NZ_CP049257.1_5020489_5021761_+	cd08191, Fe-ADH-like, Iron-containing alcohol dehydrogenases-like	NA|490aa|up_8|NZ_CP049257.1_5021757_5023227_+	cd07103, ALDH_F5_SSADH_GabD, Mitochondrial succinate-semialdehyde dehydrogenase and ALDH family members 5A1 and 5F1-like	NA|481aa|up_7|NZ_CP049257.1_5023301_5024744_+	cd14750, PBP2_TMBP, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose; possesses type 2 periplasmic binding fold	NA|305aa|up_6|NZ_CP049257.1_5024777_5025692_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|303aa|up_5|NZ_CP049257.1_5025691_5026600_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|356aa|up_4|NZ_CP049257.1_5026599_5027667_+	COG3839, MalK, ABC-type sugar transport systems, ATPase components [Carbohydrate transport and metabolism]	NA|111aa|up_3|NZ_CP049257.1_5028919_5029252_-	pfam12680, SnoaL_2, SnoaL-like domain	NA|287aa|up_2|NZ_CP049257.1_5029244_5030105_-	pfam17765, MLTR_LBD, MmyB-like transcription regulator ligand binding domain	NA|245aa|up_1|NZ_CP049257.1_5030205_5030940_+	cd05362, THN_reductase-like_SDR_c, tetrahydroxynaphthalene/trihydroxynaphthalene reductase-like, classical (c) SDRs	NA|72aa|up_0|NZ_CP049257.1_5030971_5031187_+	COG1942, COG1942, Uncharacterized protein, 4-oxalocrotonate tautomerase homolog [General function prediction only]	NA|553aa|down_0|NZ_CP049257.1_5031679_5033338_+	cd00198, vWFA, Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|115aa|down_1|NZ_CP049257.1_5033591_5033936_+	cd12952, MMP_ACEL2062, Minimal MMP-like domain found in Acidothermus cellulolyticus hypothetical protein ACEL2062 and similar protein	NA|111aa|down_2|NZ_CP049257.1_5033926_5034259_-	COG0239, CrcB, Integral membrane protein possibly involved in chromosome condensation [Cell division and chromosome partitioning]	NA|122aa|down_3|NZ_CP049257.1_5034255_5034621_-	pfam02537, CRCB, CrcB-like protein, Camphor Resistance (CrcB)	NA|314aa|down_4|NZ_CP049257.1_5034630_5035572_-	cd16936, HATPase_RsbW-like, Histidine kinase-like ATPase domain of RsbW, an anti sigma-B factor and serine-protein kinase involved in regulating sigma-B during stress in Bacilli, and related domains	NA|517aa|down_5|NZ_CP049257.1_5035859_5037410_+	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|216aa|down_6|NZ_CP049257.1_5037423_5038071_-	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|190aa|down_7|NZ_CP049257.1_5038145_5038715_+	COG3981, COG3981, Predicted acetyltransferase [General function prediction only]	NA|148aa|down_8|NZ_CP049257.1_5038662_5039106_-	NA	NA|598aa|down_9|NZ_CP049257.1_5040931_5042725_+	smart00701, PGRP, Animal peptidoglycan recognition proteins homologous to Bacteriophage T3 lysozyme
