assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_010731635.1_ASM1073163v1	NZ_AP022583	Mycobacterium noviomagense strain JCM 16367	1	390788-391381	1	CRT	no	DinG	csa3,c2c9_V-U4,DEDDh,DinG,cas3,WYL,cas4	Type IV-A	CCGCGCCCGCGCCTCNGCGGCCANNGCTTC	30	3	3	391022-391039|391070-391087|391118-391135	NZ_AP022583.1_4291514-4291531|NZ_AP022583.1_899910-899893|NZ_AP022583.1_4291514-4291531	NA	11	11	Orphan	csa3,c2c9_V-U4,DEDDh,DinG,cas3,WYL,cas4	NA,NA	NA|238aa|up_9|NZ_AP022583.1_380072_380786_-	COG1802, GntR, Transcriptional regulators [Transcription]	NA|398aa|up_8|NZ_AP022583.1_380789_381983_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|398aa|up_7|NZ_AP022583.1_382008_383202_-	TIGR04253, mesacon_CoA_iso, mesaconyl-CoA isomerase	NA|402aa|up_6|NZ_AP022583.1_383198_384404_-	PRK06205, PRK06205, acetyl-CoA C-acetyltransferase	NA|247aa|up_5|NZ_AP022583.1_384481_385222_-	PRK05653, fabG, 3-oxoacyl-ACP reductase FabG	NA|388aa|up_4|NZ_AP022583.1_385355_386519_+	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|274aa|up_3|NZ_AP022583.1_386550_387372_+	TIGR03971, short-chain_dehydrogenase/reductase_SDR, SDR family mycofactocin-dependent oxidoreductase	NA|143aa|up_2|NZ_AP022583.1_387428_387857_+	TIGR03618, Rv1155_F420, PPOX class probable F420-dependent enzyme	NA|359aa|up_1|NZ_AP022583.1_387866_388943_-	pfam02958, EcKinase, Ecdysteroid kinase	NA|216aa|up_0|NZ_AP022583.1_388930_389578_+	pfam05481, Myco_19_kDa, Mycobacterium 19 kDa lipoprotein antigen	NA|127aa|down_0|NZ_AP022583.1_392774_393155_+	COG4578, GutM, Glucitol operon activator [Transcription]	NA|119aa|down_1|NZ_AP022583.1_393151_393508_+	pfam12823, DUF3817, Domain of unknown function (DUF3817)	NA|200aa|down_2|NZ_AP022583.1_393575_394175_-	PRK00120, PRK00120, dITP/XTP pyrophosphatase; Reviewed	NA|260aa|down_3|NZ_AP022583.1_394171_394951_-	PRK00173, rph, ribonuclease PH; Reviewed	NA|254aa|down_4|NZ_AP022583.1_394968_395730_-	cd07716, RNaseZ_short-form-like_MBL-fold, uncharacterized bacterial subgroup of Ribonuclease Z, short form; MBL-fold metallo-hydrolase domain	NA|273aa|down_5|NZ_AP022583.1_395815_396634_-	PRK00865, PRK00865, glutamate racemase; Provisional	NA|221aa|down_6|NZ_AP022583.1_396630_397293_-	pfam01694, Rhomboid, Rhomboid family	NA|321aa|down_7|NZ_AP022583.1_397283_398246_-	TIGR01136, Cysteine_synthase, cysteine synthase	NA|91aa|down_8|NZ_AP022583.1_398248_398521_-	cd17074, Ubl_CysO_like, ubiquitin-like (Ubl) domain found in Mycobacterium tuberculosis CysO and similar proteins	NA|138aa|down_9|NZ_AP022583.1_398540_398954_-	cd08070, MPN_like, Mpr1p, Pad1p N-terminal (MPN) domains with catalytic isopeptidase activity (metal-binding)
GCF_010731635.1_ASM1073163v1	NZ_AP022583	Mycobacterium noviomagense strain JCM 16367	2	580972-581053	1	CRISPRCasFinder	no		csa3,c2c9_V-U4,DEDDh,DinG,cas3,WYL,cas4	Orphan	CCGCTGGCGGCCCTCGTCGTCACCCG	26	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,DEDDh,DinG,cas3,WYL,cas4	NA,NA	NA|141aa|up_9|NZ_AP022583.1_571431_571854_-	PRK03100, PRK03100, Sec-independent protein translocase subunit TatB	NA|501aa|up_8|NZ_AP022583.1_571854_573357_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|140aa|up_7|NZ_AP022583.1_573431_573851_-	cd20301, cupin_ChrR, anti-ECFsigma factor, ChrR , cupin domain	NA|255aa|up_6|NZ_AP022583.1_573957_574722_-	PRK09647, PRK09647, RNA polymerase sigma factor SigE; Reviewed	NA|223aa|up_5|NZ_AP022583.1_574921_575590_+	COG4122, COG4122, Predicted O-methyltransferase [General function prediction only]	NA|221aa|up_4|NZ_AP022583.1_575657_576320_+	pfam17933, TetR_C_25, Tetracyclin repressor-like, C-terminal domain	NA|312aa|up_3|NZ_AP022583.1_576309_577245_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|520aa|up_2|NZ_AP022583.1_577328_578888_+	COG3559, TnrB3, Putative exporter of polyketide antibiotics [Cell envelope biogenesis, outer membrane]	NA|225aa|up_1|NZ_AP022583.1_578906_579581_+	COG2020, STE14, Putative protein-S-isoprenylcysteine methyltransferase [Posttranslational modification, protein turnover, chaperones]	NA|405aa|up_0|NZ_AP022583.1_579598_580813_-	PRK00844, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|388aa|down_0|NZ_AP022583.1_581068_582232_+	TIGR02149, glgA_Coryne, glycogen synthase, Corynebacterium family	NA|56aa|down_1|NZ_AP022583.1_582276_582444_-	pfam11314, DUF3117, Protein of unknown function (DUF3117)	NA|191aa|down_2|NZ_AP022583.1_582594_583167_-	pfam03352, Adenine_glyco, Methyladenine glycosylase	NA|116aa|down_3|NZ_AP022583.1_583163_583511_-	TIGR03544, cell_division_initiation_protein_DivIVA, DivIVA domain	NA|322aa|down_4|NZ_AP022583.1_583553_584519_-	PRK13915, PRK13915, putative glucosyl-3-phosphoglycerate synthase; Provisional	NA|292aa|down_5|NZ_AP022583.1_584515_585391_-	TIGR01496, Dihydropteroate_synthase, dihydropteroate synthase	NA|593aa|down_6|NZ_AP022583.1_585529_587308_-	PRK08279, PRK08279, long-chain-acyl-CoA synthetase; Validated	NA|185aa|down_7|NZ_AP022583.1_587370_587925_-	TIGR00730, LOG_family_protein_YJL055W, TIGR00730 family protein	NA|745aa|down_8|NZ_AP022583.1_588046_590281_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|355aa|down_9|NZ_AP022583.1_590265_591330_-	PRK13007, PRK13007, succinyl-diaminopimelate desuccinylase; Reviewed
GCF_010731635.1_ASM1073163v1	NZ_AP022583	Mycobacterium noviomagense strain JCM 16367	3	598998-599076	2	CRISPRCasFinder	no		csa3,c2c9_V-U4,DEDDh,DinG,cas3,WYL,cas4	Orphan	CCGGCTTCGCCGCTCTTGCGATCGCC	26	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,DEDDh,DinG,cas3,WYL,cas4	NA,NA	NA|593aa|up_9|NZ_AP022583.1_585529_587308_-	PRK08279, PRK08279, long-chain-acyl-CoA synthetase; Validated	NA|185aa|up_8|NZ_AP022583.1_587370_587925_-	TIGR00730, LOG_family_protein_YJL055W, TIGR00730 family protein	NA|745aa|up_7|NZ_AP022583.1_588046_590281_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|355aa|up_6|NZ_AP022583.1_590265_591330_-	PRK13007, PRK13007, succinyl-diaminopimelate desuccinylase; Reviewed	NA|316aa|up_5|NZ_AP022583.1_591469_592417_+	TIGR03535, DapD_actino, 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase	NA|117aa|up_4|NZ_AP022583.1_592484_592835_-	pfam18029, Glyoxalase_6, Glyoxalase-like domain	NA|465aa|up_3|NZ_AP022583.1_592825_594220_-	PRK07787, PRK07787, acyl-CoA synthetase; Validated	NA|504aa|up_2|NZ_AP022583.1_595475_596987_-	cd05936, FC-FACS_FadD_like, Prokaryotic long-chain fatty acid CoA synthetases similar to Escherichia coli FadD	NA|278aa|up_1|NZ_AP022583.1_597004_597838_-	COG1075, LipA, Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold [General function prediction only]	NA|320aa|up_0|NZ_AP022583.1_598010_598970_-	COG0506, PutA, Proline dehydrogenase [Amino acid transport and metabolism]	NA|544aa|down_0|NZ_AP022583.1_599082_600714_-	cd07123, ALDH_F4-17_P5CDH, Delta(1)-pyrroline-5-carboxylate dehydrogenase, ALDH families 4 and 17	NA|426aa|down_1|NZ_AP022583.1_600796_602074_-	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|395aa|down_2|NZ_AP022583.1_602140_603325_-	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|195aa|down_3|NZ_AP022583.1_603411_603996_+	pfam16859, TetR_C_11, Bacterial transcriptional repressor C-terminal	NA|776aa|down_4|NZ_AP022583.1_604648_606976_-	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|592aa|down_5|NZ_AP022583.1_607014_608790_-	cd01154, AidB, Proteins involved in DNA damage response, similar to the AidB gene product	NA|362aa|down_6|NZ_AP022583.1_608865_609951_-	PRK07865, PRK07865, N-succinyldiaminopimelate aminotransferase; Reviewed	NA|109aa|down_7|NZ_AP022583.1_609977_610304_-	COG1146, COG1146, Ferredoxin [Energy production and conversion]	NA|197aa|down_8|NZ_AP022583.1_610523_611114_-	cd00865, PEBP_bact_arch, PhosphatidylEthanolamine-Binding Protein (PEBP) domain present in bacteria and archaea	NA|255aa|down_9|NZ_AP022583.1_611153_611918_-	TIGR03083, TIGR03083, uncharacterized Actinobacterial protein TIGR03083
GCF_010731635.1_ASM1073163v1	NZ_AP022583	Mycobacterium noviomagense strain JCM 16367	4	1133382-1133510	3	CRISPRCasFinder	no	csa3	csa3,c2c9_V-U4,DEDDh,DinG,cas3,WYL,cas4	Type I-A	AGCCCGGCGACGATGCAGAGCGCGCAGCGCGATGAGG	37	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,DEDDh,DinG,cas3,WYL,cas4	NA,NA	NA|359aa|up_9|NZ_AP022583.1_1122203_1123280_+	cd01050, Acyl_ACP_Desat, Acyl ACP desaturase, ferritin-like diiron-binding domain	NA|378aa|up_8|NZ_AP022583.1_1123283_1124417_+	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|708aa|up_7|NZ_AP022583.1_1124642_1126766_+	TIGR00350, Transcriptional_regulator_LytR, cell envelope-related function transcriptional attenuator common domain	NA|228aa|up_6|NZ_AP022583.1_1126869_1127553_+	TIGR02135, Uncharacterized_protein, phosphate transport system regulatory protein PhoU	NA|259aa|up_5|NZ_AP022583.1_1127549_1128326_-	PRK14241, PRK14241, phosphate transporter ATP-binding protein; Provisional	NA|305aa|up_4|NZ_AP022583.1_1128334_1129249_-	TIGR00974, 3a0107s02c, phosphate ABC transporter, permease protein PstA	NA|342aa|up_3|NZ_AP022583.1_1129245_1130271_-	TIGR02138, phosphate_transport_system_permease_protein_PstC, phosphate ABC transporter, permease protein PstC	NA|367aa|up_2|NZ_AP022583.1_1130353_1131454_-	TIGR00975, precursor_PBP-3_PstS-3_Antigen_Ag88	NA|307aa|up_1|NZ_AP022583.1_1131596_1132517_-	TIGR03448, mycothiol_MshD, mycothiol synthase	NA|256aa|up_0|NZ_AP022583.1_1132513_1133281_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|271aa|down_0|NZ_AP022583.1_1133563_1134376_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|141aa|down_1|NZ_AP022583.1_1134419_1134842_+	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|171aa|down_2|NZ_AP022583.1_1135042_1135555_+	pfam14340, DUF4395, Domain of unknown function (DUF4395)	NA|278aa|down_3|NZ_AP022583.1_1135533_1136367_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|101aa|down_4|NZ_AP022583.1_1136369_1136672_+	pfam07210, DUF1416, Protein of unknown function (DUF1416)	NA|223aa|down_5|NZ_AP022583.1_1136823_1137492_+	pfam08768, DUF1794, Domain of unknown function (DUF1794)	NA|290aa|down_6|NZ_AP022583.1_1137733_1138603_-	PRK07849, PRK07849, aminodeoxychorismate lyase	NA|365aa|down_7|NZ_AP022583.1_1138708_1139803_+	COG0354, COG0354, Predicted aminomethyltransferase related to GcvT [General function prediction only]	NA|64aa|down_8|NZ_AP022583.1_1139940_1140132_+	pfam11273, DUF3073, Protein of unknown function (DUF3073)	NA|365aa|down_9|NZ_AP022583.1_1140153_1141248_-	PRK05385, PRK05385, phosphoribosylaminoimidazole synthetase; Provisional
GCF_010731635.1_ASM1073163v1	NZ_AP022583	Mycobacterium noviomagense strain JCM 16367	5	3233643-3233726	4	CRISPRCasFinder	no		csa3,c2c9_V-U4,DEDDh,DinG,cas3,WYL,cas4	Orphan	TCCCTCGTCGTCATCGGGCTAGGCTT	26	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,DEDDh,DinG,cas3,WYL,cas4	NA,NA|128aa|down_5|NZ_AP022583.1_3238381_3238765_+	NA|349aa|up_9|NZ_AP022583.1_3222460_3223507_-	cd01146, FhuD, Fe3+-siderophore binding domain FhuD	NA|582aa|up_8|NZ_AP022583.1_3223755_3225501_+	cd01662, Ubiquinol_Oxidase_I, Ubiquinol oxidase subunit I	NA|412aa|up_7|NZ_AP022583.1_3225526_3226762_+	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|281aa|up_6|NZ_AP022583.1_3226869_3227712_+	COG1119, ModF, ABC-type molybdenum transport system, ATPase component/photorepair protein PhrA [Inorganic ion transport and metabolism]	NA|272aa|up_5|NZ_AP022583.1_3227708_3228524_+	cd03426, CoAse, Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes	NA|247aa|up_4|NZ_AP022583.1_3228520_3229261_+	PRK05869, PRK05869, enoyl-CoA hydratase; Validated	NA|328aa|up_3|NZ_AP022583.1_3229272_3230256_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|410aa|up_2|NZ_AP022583.1_3230189_3231419_+	pfam18096, Thump_like, THUMP domain-like	NA|229aa|up_1|NZ_AP022583.1_3231495_3232182_+	pfam11738, DUF3298, Protein of unknown function (DUF3298)	NA|365aa|up_0|NZ_AP022583.1_3232204_3233299_-	COG1520, COG1520, FOG: WD40-like repeat [Function unknown]	NA|246aa|down_0|NZ_AP022583.1_3233733_3234471_+	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|160aa|down_1|NZ_AP022583.1_3234467_3234947_-	cd15904, TSPO_MBR, Translocator protein (TSPO)/peripheral-type benzodiazepine receptor (MBR) family	NA|354aa|down_2|NZ_AP022583.1_3235180_3236242_+	PRK05437, PRK05437, isopentenyl pyrophosphate isomerase; Provisional	NA|418aa|down_3|NZ_AP022583.1_3236245_3237499_+	cd03784, GT1_Gtf-like, UDP-glycosyltransferases and similar proteins	NA|169aa|down_4|NZ_AP022583.1_3237504_3238011_-	COG1846, MarR, Transcriptional regulators [Transcription]	NA|128aa|down_5|NZ_AP022583.1_3238381_3238765_+	NA	NA|491aa|down_6|NZ_AP022583.1_3238769_3240242_+	cd05245, SDR_a2, atypical (a) SDRs, subgroup 2	NA|449aa|down_7|NZ_AP022583.1_3240320_3241667_+	COG0415, PhrB, Deoxyribodipyrimidine photolyase [DNA replication, recombination, and repair]	NA|123aa|down_8|NZ_AP022583.1_3241663_3242032_-	pfam10861, DUF2784, Protein of Unknown function (DUF2784)	NA|333aa|down_9|NZ_AP022583.1_3242049_3243048_-	pfam00355, Rieske, Rieske [2Fe-2S] domain
