assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_900683635.1_54424_D01	LR215973	Nocardia cyriacigeorgica strain 3012STDY6756504 genome assembly, chromosome: 1	2	558471-558558	2	CRISPRCasFinder	no		DEDDh,csa3,WYL,cas4,Cas14u_CAS-V,cas3,DinG	Orphan	CGGGTGCCTGTCGTGCACGGCCTGG	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,WYL,cas4,Cas14u_CAS-V,cas3,DinG	NA|122aa|up_2|LR215973.1_555868_556234_-,NA|150aa|down_0|LR215973.1_558896_559346_-,NA|78aa|down_7|LR215973.1_569240_569474_-	NA|276aa|up_9|LR215973.1_547172_548000_-	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|274aa|up_8|LR215973.1_548320_549142_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|195aa|up_7|LR215973.1_549321_549906_+	TIGR03083, TIGR03083, uncharacterized Actinobacterial protein TIGR03083	NA|651aa|up_6|LR215973.1_549954_551907_-	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|546aa|up_5|LR215973.1_551959_553597_-	cd11480, SLC5sbd_u4, Uncharacterized bacterial solute carrier 5 subfamily; putative solute-binding domain	NA|119aa|up_4|LR215973.1_553593_553950_-	pfam04341, DUF485, Protein of unknown function, DUF485	NA|587aa|up_3|LR215973.1_554090_555851_-	cd11480, SLC5sbd_u4, Uncharacterized bacterial solute carrier 5 subfamily; putative solute-binding domain	NA|122aa|up_2|LR215973.1_555868_556234_-	NA	NA|259aa|up_1|LR215973.1_556230_557007_-	COG3279, LytT, Response regulator of the LytR/AlgR family [Transcription / Signal transduction mechanisms]	NA|406aa|up_0|LR215973.1_557190_558408_-	COG3275, LytS, Putative regulator of cell autolysis [Signal transduction mechanisms]	NA|150aa|down_0|LR215973.1_558896_559346_-	NA	NA|169aa|down_1|LR215973.1_559689_560196_+	cd12820, LbR_YadA-like, YadA-like, left-handed beta-roll	NA|552aa|down_2|LR215973.1_560235_561891_+	pfam01636, APH, Phosphotransferase enzyme family	NA|79aa|down_3|LR215973.1_562355_562592_+	pfam13453, zf-TFIIB, Transcription factor zinc-finger	NA|1193aa|down_4|LR215973.1_562720_566299_-	PRK05673, dnaE, DNA polymerase III subunit alpha; Validated	NA|419aa|down_5|LR215973.1_566640_567897_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|399aa|down_6|LR215973.1_567915_569112_-	pfam17885, Smoa_sbd, Styrene monooxygenase A putative substrate binding domain	NA|78aa|down_7|LR215973.1_569240_569474_-	NA	NA|838aa|down_8|LR215973.1_569710_572224_+	COG3903, COG3903, Predicted ATPase [General function prediction only]	NA|406aa|down_9|LR215973.1_572269_573487_-	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]
GCA_900683635.1_54424_D01	LR215973	Nocardia cyriacigeorgica strain 3012STDY6756504 genome assembly, chromosome: 1	3	2045647-2045733	3	CRISPRCasFinder	no		DEDDh,csa3,WYL,cas4,Cas14u_CAS-V,cas3,DinG	Orphan	GGGTTAGCCGGCGTCGCCCGAGCGC	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,WYL,cas4,Cas14u_CAS-V,cas3,DinG	NA|73aa|up_7|LR215973.1_2036708_2036927_-,NA|107aa|up_3|LR215973.1_2039994_2040315_+,NA|159aa|down_0|LR215973.1_2046428_2046905_-	NA|193aa|up_9|LR215973.1_2034997_2035576_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|279aa|up_8|LR215973.1_2035608_2036445_-	TIGR03621, F420_MSMEG_2516, probable F420-dependent oxidoreductase, MSMEG_2516 family	NA|73aa|up_7|LR215973.1_2036708_2036927_-	NA	NA|353aa|up_6|LR215973.1_2036939_2037998_-	cd05227, AR_SDR_e, aldehyde reductase, extended (e) SDRs	NA|205aa|up_5|LR215973.1_2038107_2038722_+	pfam13305, WHG, WHG domain	NA|381aa|up_4|LR215973.1_2038767_2039910_+	pfam01663, Phosphodiest, Type I phosphodiesterase / nucleotide pyrophosphatase	NA|107aa|up_3|LR215973.1_2039994_2040315_+	NA	NA|405aa|up_2|LR215973.1_2040436_2041651_+	COG3284, AcoR, Transcriptional activator of acetoin/glycerol metabolism [Secondary metabolites biosynthesis, transport, and catabolism / Transcription]	NA|575aa|up_1|LR215973.1_2041691_2043416_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|591aa|up_0|LR215973.1_2043412_2045185_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|159aa|down_0|LR215973.1_2046428_2046905_-	NA	NA|322aa|down_1|LR215973.1_2046928_2047894_+	cd08414, PBP2_LTTR_aromatics_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators involved in the catabolism of aromatic compounds and that of other related regulators, contains type 2 periplasmic binding fold	NA|154aa|down_2|LR215973.1_2047984_2048446_+	pfam05610, DUF779, Protein of unknown function (DUF779)	NA|148aa|down_3|LR215973.1_2048447_2048891_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|215aa|down_4|LR215973.1_2049012_2049657_+	TIGR03083, TIGR03083, uncharacterized Actinobacterial protein TIGR03083	NA|148aa|down_5|LR215973.1_2049644_2050088_-	COG3801, COG3801, Uncharacterized protein conserved in bacteria [Function unknown]	NA|518aa|down_6|LR215973.1_2050128_2051682_-	PRK09407, gabD2, succinic semialdehyde dehydrogenase; Reviewed	NA|798aa|down_7|LR215973.1_2051881_2054275_+	pfam08376, NIT, Nitrate and nitrite sensing	NA|140aa|down_8|LR215973.1_2054275_2054695_+	pfam03259, Robl_LC7, Roadblock/LC7 domain	NA|125aa|down_9|LR215973.1_2054709_2055084_+	pfam05331, DUF742, Protein of unknown function (DUF742)
GCA_900683635.1_54424_D01	LR215973	Nocardia cyriacigeorgica strain 3012STDY6756504 genome assembly, chromosome: 1	7	4507574-4507673	7	CRISPRCasFinder	no		DEDDh,csa3,WYL,cas4,Cas14u_CAS-V,cas3,DinG	Orphan	CGCCTGACCCGGTCAAGGGCCGG	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,WYL,cas4,Cas14u_CAS-V,cas3,DinG	NA|405aa|up_9|LR215973.1_4491950_4493165_+,NA|98aa|up_5|LR215973.1_4497786_4498080_-,NA|515aa|up_2|LR215973.1_4502467_4504012_-,NA|380aa|up_1|LR215973.1_4503998_4505138_-,NA|43aa|down_5|LR215973.1_4515791_4515920_-	NA|405aa|up_9|LR215973.1_4491950_4493165_+	NA	NA|790aa|up_8|LR215973.1_4493161_4495531_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|388aa|up_7|LR215973.1_4495580_4496744_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|283aa|up_6|LR215973.1_4496901_4497750_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|98aa|up_5|LR215973.1_4497786_4498080_-	NA	NA|482aa|up_4|LR215973.1_4498290_4499736_+	cd07139, ALDH_AldA-Rv0768, Mycobacterium tuberculosis aldehyde dehydrogenase  AldA-like	NA|611aa|up_3|LR215973.1_4500400_4502233_-	PRK04210, PRK04210, phosphoenolpyruvate carboxykinase (GTP)	NA|515aa|up_2|LR215973.1_4502467_4504012_-	NA	NA|380aa|up_1|LR215973.1_4503998_4505138_-	NA	NA|261aa|up_0|LR215973.1_4505210_4505993_-	PRK08267, PRK08267, SDR family oxidoreductase	NA|273aa|down_0|LR215973.1_4507738_4508557_+	PRK00121, trmB, tRNA (guanine-N(7)-)-methyltransferase; Reviewed	NA|237aa|down_1|LR215973.1_4508556_4509267_+	pfam01936, NYN, NYN domain	NA|1109aa|down_2|LR215973.1_4509266_4512593_+	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|523aa|down_3|LR215973.1_4512682_4514251_-	pfam10101, DUF2339, Predicted membrane protein (DUF2339)	NA|241aa|down_4|LR215973.1_4515058_4515781_-	pfam08044, DUF1707, Domain of unknown function (DUF1707)	NA|43aa|down_5|LR215973.1_4515791_4515920_-	NA	NA|219aa|down_6|LR215973.1_4515926_4516583_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|398aa|down_7|LR215973.1_4516586_4517780_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|451aa|down_8|LR215973.1_4517757_4519110_-	pfam01594, AI-2E_transport, AI-2E family transporter	NA|520aa|down_9|LR215973.1_4519172_4520732_+	cd07302, CHD, cyclase homology domain
GCA_900683635.1_54424_D01	LR215973	Nocardia cyriacigeorgica strain 3012STDY6756504 genome assembly, chromosome: 1	10	5342200-5342309	9	CRISPRCasFinder	no		DEDDh,csa3,WYL,cas4,Cas14u_CAS-V,cas3,DinG	Orphan	CCGCATCGGCGTGGGGTGGCCGGTT	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,WYL,cas4,Cas14u_CAS-V,cas3,DinG	NA|57aa|up_6|LR215973.1_5336717_5336888_-,NA	NA|509aa|up_9|LR215973.1_5332587_5334114_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|230aa|up_8|LR215973.1_5334110_5334800_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|551aa|up_7|LR215973.1_5334904_5336557_-	cd03788, GT20_TPS, trehalose-6-phosphate synthase	NA|57aa|up_6|LR215973.1_5336717_5336888_-	NA	NA|278aa|up_5|LR215973.1_5337050_5337884_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|81aa|up_4|LR215973.1_5338036_5338279_+	COG3251, COG3251, Uncharacterized protein conserved in bacteria [Function unknown]	NA|361aa|up_3|LR215973.1_5338530_5339613_-	pfam13847, Methyltransf_31, Methyltransferase domain	NA|142aa|up_2|LR215973.1_5339772_5340198_-	COG0537, Hit, Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only]	NA|126aa|up_1|LR215973.1_5340253_5340631_-	pfam08000, bPH_1, Bacterial PH domain	NA|424aa|up_0|LR215973.1_5340844_5342116_+	PRK00885, PRK00885, phosphoribosylamine--glycine ligase; Provisional	NA|560aa|down_0|LR215973.1_5342537_5344217_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|371aa|down_1|LR215973.1_5344316_5345429_+	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|182aa|down_2|LR215973.1_5345670_5346216_+	COG3226, COG3226, Uncharacterized protein conserved in bacteria [Function unknown]	NA|474aa|down_3|LR215973.1_5346222_5347644_+	cd03302, Adenylsuccinate_lyase_2, Adenylsuccinate lyase (ASL)_subgroup 2	NA|150aa|down_4|LR215973.1_5347697_5348147_+	cd01277, HINT_subgroup, HINT (histidine triad nucleotide-binding protein) subgroup: Members of this CD belong to the superfamily of histidine triad hydrolases that act on alpha-phosphate of ribonucleotides	NA|320aa|down_5|LR215973.1_5348280_5349240_+	COG0627, COG0627, Predicted esterase [General function prediction only]	NA|729aa|down_6|LR215973.1_5349443_5351630_+	COG1770, PtrB, Protease II [Amino acid transport and metabolism]	NA|234aa|down_7|LR215973.1_5351781_5352483_+	COG3233, COG3233, Predicted deacetylase [General function prediction only]	NA|334aa|down_8|LR215973.1_5352681_5353683_+	TIGR03813, put_Glu_GABA_T, putative glutamate/gamma-aminobutyrate antiporter	NA|137aa|down_9|LR215973.1_5353689_5354100_+	TIGR03813, put_Glu_GABA_T, putative glutamate/gamma-aminobutyrate antiporter
GCA_900683635.1_54424_D01	LR215973	Nocardia cyriacigeorgica strain 3012STDY6756504 genome assembly, chromosome: 1	11	5868031-5868315	10	CRISPRCasFinder	no		DEDDh,csa3,WYL,cas4,Cas14u_CAS-V,cas3,DinG	Orphan	GCGCCAGGGTGTCGCCGCGCAGCGTCAGGG	30	0	0	NA	NA	NA	4	4	Orphan	DEDDh,csa3,WYL,cas4,Cas14u_CAS-V,cas3,DinG	NA|63aa|up_0|LR215973.1_5866484_5866673_+,NA|64aa|down_4|LR215973.1_5873273_5873465_-	NA|503aa|up_9|LR215973.1_5856093_5857602_-	cd05799, PGM2, This CD includes PGM2 (phosphoglucomutase 2) and PGM2L1 (phosphoglucomutase 2-like 1)	NA|298aa|up_8|LR215973.1_5857602_5858496_-	PRK08202, PRK08202, purine nucleoside phosphorylase; Provisional	NA|235aa|up_7|LR215973.1_5858498_5859203_+	PLN02267, PLN02267, enoyl-CoA hydratase/isomerase family protein	NA|436aa|up_6|LR215973.1_5859257_5860565_+	cd05672, M20_ACY1L2-like, M20 Peptidase aminoacylase 1-like protein 2-like, amidohydrolase subfamily	NA|423aa|up_5|LR215973.1_5860561_5861830_+	cd08014, M20_Acy1-like, M20 Peptidase aminoacylase 1 subfamily	NA|160aa|up_4|LR215973.1_5861922_5862402_-	pfam13772, AIG2_2, AIG2-like family	NA|467aa|up_3|LR215973.1_5862536_5863937_+	PRK07845, PRK07845, flavoprotein disulfide reductase; Reviewed	NA|365aa|up_2|LR215973.1_5864146_5865241_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|339aa|up_1|LR215973.1_5865330_5866347_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|63aa|up_0|LR215973.1_5866484_5866673_+	NA	NA|500aa|down_0|LR215973.1_5868377_5869877_-	PRK00047, glpK, glycerol kinase GlpK	NA|578aa|down_1|LR215973.1_5870065_5871799_+	COG0578, GlpA, Glycerol-3-phosphate dehydrogenase [Energy production and conversion]	NA|277aa|down_2|LR215973.1_5871849_5872680_-	cd07742, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|188aa|down_3|LR215973.1_5872676_5873240_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|64aa|down_4|LR215973.1_5873273_5873465_-	NA	NA|261aa|down_5|LR215973.1_5873553_5874336_-	COG0266, Nei, Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]	NA|1564aa|down_6|LR215973.1_5874360_5879052_-	PRK09751, PRK09751, putative ATP-dependent helicase Lhr; Provisional	NA|182aa|down_7|LR215973.1_5879120_5879666_+	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|510aa|down_8|LR215973.1_5879679_5881209_-	cd07130, ALDH_F7_AASADH, NAD+-dependent alpha-aminoadipic semialdehyde dehydrogenase, ALDH family members 7A1 and 7B	NA|151aa|down_9|LR215973.1_5881320_5881773_+	smart00344, HTH_ASNC, helix_turn_helix ASNC type
