assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002356315.1_ASM235631v1	NZ_AP018164	Mycobacterium shigaense strain JCM 32072	1	332464-332582	1	CRISPRCasFinder	no		DinG,csa3,cas3,WYL,cas4,DEDDh	Orphan	TGAGCCGCCGCCGATGGCGGCAGGGCCTCCTGCGCCACCGC	41	0	0	NA	NA	NA	1	1	Orphan	DinG,csa3,cas3,WYL,cas4,DEDDh	NA|42aa|up_4|NZ_AP018164.1_327819_327945_-,NA|51aa|down_0|NZ_AP018164.1_332665_332818_-	NA|407aa|up_9|NZ_AP018164.1_323032_324253_-	cd01155, ACAD_FadE2, Acyl-CoA dehydrogenases similar to fadE2	NA|154aa|up_8|NZ_AP018164.1_324386_324848_+	pfam07080, DUF1348, Protein of unknown function (DUF1348)	NA|367aa|up_7|NZ_AP018164.1_324922_326023_+	cd05304, Rubrum_tdh, Rubrum transdehydrogenase NAD-binding and catalytic domains	NA|106aa|up_6|NZ_AP018164.1_326023_326341_+	pfam12769, PNTB_4TM, 4TM region of pyridine nucleotide transhydrogenase, mitoch	NA|475aa|up_5|NZ_AP018164.1_326337_327762_+	pfam02233, PNTB, NAD(P) transhydrogenase beta subunit	NA|42aa|up_4|NZ_AP018164.1_327819_327945_-	NA	NA|212aa|up_3|NZ_AP018164.1_328121_328757_+	pfam17932, TetR_C_24, Tetracyclin repressor-like, C-terminal domain	NA|367aa|up_2|NZ_AP018164.1_328965_330066_+	COG3804, COG3804, Uncharacterized conserved protein related to dihydrodipicolinate reductase [Function unknown]	NA|247aa|up_1|NZ_AP018164.1_330082_330823_+	cd05341, 3beta-17beta-HSD_like_SDR_c, 3beta17beta hydroxysteroid dehydrogenase-like, classical (c) SDRs	NA|448aa|up_0|NZ_AP018164.1_330844_332188_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|51aa|down_0|NZ_AP018164.1_332665_332818_-	NA	NA|167aa|down_1|NZ_AP018164.1_333315_333816_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|180aa|down_2|NZ_AP018164.1_333838_334378_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|278aa|down_3|NZ_AP018164.1_334431_335265_-	pfam13622, 4HBT_3, Thioesterase-like superfamily	NA|273aa|down_4|NZ_AP018164.1_335284_336103_-	pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region	NA|294aa|down_5|NZ_AP018164.1_336381_337263_+	COG1946, TesB, Acyl-CoA thioesterase [Lipid metabolism]	NA|307aa|down_6|NZ_AP018164.1_337268_338189_+	TIGR03619, F420_Rv2161c, probable F420-dependent oxidoreductase, Rv2161c family	NA|243aa|down_7|NZ_AP018164.1_339420_340149_+	COG0428, COG0428, Predicted divalent heavy-metal cations transporter [Inorganic ion transport and metabolism]	NA|568aa|down_8|NZ_AP018164.1_340218_341922_-	PRK06184, PRK06184, hypothetical protein; Provisional	NA|222aa|down_9|NZ_AP018164.1_342294_342960_-	pfam04248, NTP_transf_9, Domain of unknown function (DUF427)
GCF_002356315.1_ASM235631v1	NZ_AP018164	Mycobacterium shigaense strain JCM 32072	2	845080-845155	2	CRISPRCasFinder	no		DinG,csa3,cas3,WYL,cas4,DEDDh	Orphan	CGGGGGTCATCGGCGCCGCTCCT	23	1	8	845103-845132|845103-845132|845103-845132|845103-845132|845103-845132|845103-845132|845103-845132|845103-845132	NZ_AP018164.1_1219549-1219520|NZ_AP018164.1_2987274-2987245|NZ_AP018164.1_2987327-2987298|NZ_AP018164.1_1427504-1427533|NZ_AP018164.1_2011540-2011511|NZ_AP018164.1_2011593-2011564|NZ_AP018164.1_3122757-3122786|NZ_AP018164.1_4795872-4795901	NA	1	1	Orphan	DinG,csa3,cas3,WYL,cas4,DEDDh	NA|449aa|up_9|NZ_AP018164.1_827373_828720_-,NA|162aa|up_4|NZ_AP018164.1_836544_837030_-,NA|278aa|down_3|NZ_AP018164.1_848755_849589_-	NA|449aa|up_9|NZ_AP018164.1_827373_828720_-	NA	NA|864aa|up_8|NZ_AP018164.1_828814_831406_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|722aa|up_7|NZ_AP018164.1_831817_833983_-	pfam13515, FUSC_2, Fusaric acid resistance protein-like	NA|365aa|up_6|NZ_AP018164.1_834071_835166_+	cd00608, GalT, Galactose-1-phosphate uridyl transferase (GalT): This enzyme plays a key role in galactose metabolism by catalysing the transfer of a uridine 5'-phosphoryl group from UDP-galactose 1-phosphate	NA|374aa|up_5|NZ_AP018164.1_835162_836284_+	PRK00555, PRK00555, galactokinase; Provisional	NA|162aa|up_4|NZ_AP018164.1_836544_837030_-	NA	NA|874aa|up_3|NZ_AP018164.1_837118_839740_+	COG3537, COG3537, Putative alpha-1,2-mannosidase [Carbohydrate transport and metabolism]	NA|796aa|up_2|NZ_AP018164.1_839748_842136_-	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|316aa|up_1|NZ_AP018164.1_842297_843245_+	TIGR00027, Hypothetical_protein_Rv0893c/MT0917/Mb0917c	NA|172aa|up_0|NZ_AP018164.1_843277_843793_+	cd02139, nitroreductase, nitroreductase family protein	NA|67aa|down_0|NZ_AP018164.1_846539_846740_-	pfam13370, Fer4_13, 4Fe-4S single cluster domain of Ferredoxin I	NA|340aa|down_1|NZ_AP018164.1_846834_847854_+	cd07207, Pat_ExoU_VipD_like, ExoU and VipD-like proteins; homologus to patatin, cPLA2, and iPLA2	NA|232aa|down_2|NZ_AP018164.1_847981_848677_-	PRK06213, PRK06213, crotonase/enoyl-CoA hydratase family protein	NA|278aa|down_3|NZ_AP018164.1_848755_849589_-	NA	NA|241aa|down_4|NZ_AP018164.1_849684_850407_-	cd16275, BaeB-like_MBL-fold, Bacillus amyloliquefaciens BaeB and related proteins; MBL-fold metallo hydrolase domain	NA|67aa|down_5|NZ_AP018164.1_850520_850721_+	pfam09957, VapB_antitoxin, Bacterial antitoxin of type II TA system, VapB	NA|56aa|down_6|NZ_AP018164.1_851046_851214_+	PRK00504, rpmG, 50S ribosomal protein L33; Validated	NA|160aa|down_7|NZ_AP018164.1_851271_851751_+	PRK13692, PRK13692, (3R)-hydroxyacyl-ACP dehydratase subunit HadA; Provisional	NA|143aa|down_8|NZ_AP018164.1_851737_852166_+	PRK13693, PRK13693, (3R)-hydroxyacyl-ACP dehydratase subunit HadB; Provisional	NA|167aa|down_9|NZ_AP018164.1_852179_852680_+	PRK13691, PRK13691, (3R)-hydroxyacyl-ACP dehydratase subunit HadC; Provisional
GCF_002356315.1_ASM235631v1	NZ_AP018164	Mycobacterium shigaense strain JCM 32072	3	1944836-1944922	3	CRISPRCasFinder	no		DinG,csa3,cas3,WYL,cas4,DEDDh	Orphan	CCCGCTGCGCCCGGCTGCGCCGCG	24	0	0	NA	NA	NA	1	1	Orphan	DinG,csa3,cas3,WYL,cas4,DEDDh	NA,NA|178aa|down_4|NZ_AP018164.1_1950270_1950804_-,NA|204aa|down_6|NZ_AP018164.1_1951320_1951932_-	NA|106aa|up_9|NZ_AP018164.1_1934956_1935274_-	pfam10824, T7SS_ESX_EspC, Excreted virulence factor EspC, type VII ESX diderm	NA|408aa|up_8|NZ_AP018164.1_1935270_1936494_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|432aa|up_7|NZ_AP018164.1_1936662_1937958_-	COG0349, Rnd, Ribonuclease D [Translation, ribosomal structure and biogenesis]	NA|212aa|up_6|NZ_AP018164.1_1937959_1938595_-	pfam11452, DUF3000, Protein of unknown function (DUF3000)	NA|269aa|up_5|NZ_AP018164.1_1938716_1939523_-	PRK07327, PRK07327, enoyl-CoA hydratase/isomerase family protein	NA|350aa|up_4|NZ_AP018164.1_1939587_1940637_+	PRK00115, hemE, uroporphyrinogen decarboxylase; Validated	NA|459aa|up_3|NZ_AP018164.1_1940633_1942010_+	PRK11883, PRK11883, protoporphyrinogen oxidase; Reviewed	NA|232aa|up_2|NZ_AP018164.1_1942015_1942711_+	COG3253, ywfI, Predicted heme peroxidase involved in anaerobic stress response [General function prediction only]	NA|164aa|up_1|NZ_AP018164.1_1943871_1944363_+	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|141aa|up_0|NZ_AP018164.1_1944384_1944807_-	pfam01641, SelR, SelR domain	NA|520aa|down_0|NZ_AP018164.1_1946303_1947863_-	pfam08386, Abhydrolase_4, TAP-like protein	NA|260aa|down_1|NZ_AP018164.1_1947887_1948667_-	PRK14059, PRK14059, pyrimidine reductase family protein	NA|363aa|down_2|NZ_AP018164.1_1948665_1949754_+	pfam03969, AFG1_ATPase, AFG1-like ATPase	NA|184aa|down_3|NZ_AP018164.1_1949731_1950283_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|178aa|down_4|NZ_AP018164.1_1950270_1950804_-	NA	NA|116aa|down_5|NZ_AP018164.1_1950965_1951313_+	pfam05305, DUF732, Protein of unknown function (DUF732)	NA|204aa|down_6|NZ_AP018164.1_1951320_1951932_-	NA	NA|123aa|down_7|NZ_AP018164.1_1952020_1952389_+	pfam05305, DUF732, Protein of unknown function (DUF732)	NA|314aa|down_8|NZ_AP018164.1_1952445_1953387_+	pfam09678, Caa3_CtaG, Cytochrome c oxidase caa3 assembly factor (Caa3_CtaG)	NA|378aa|down_9|NZ_AP018164.1_1953401_1954535_-	COG1819, COG1819, Glycosyl transferases, related to UDP-glucuronosyltransferase [Carbohydrate transport and metabolism / Signal transduction mechanisms]
GCF_002356315.1_ASM235631v1	NZ_AP018164	Mycobacterium shigaense strain JCM 32072	4	2208245-2208447	1	PILER-CR	no		DinG,csa3,cas3,WYL,cas4,DEDDh	Orphan	ACCGACTCCGGCACCTTCACCGATGCCGGCACCGTCACCGT	41	0	0	NA	NA	NA	2	2	Orphan	DinG,csa3,cas3,WYL,cas4,DEDDh	NA|113aa|up_8|NZ_AP018164.1_2196174_2196513_+,NA|134aa|up_0|NZ_AP018164.1_2205681_2206083_-,NA|231aa|down_7|NZ_AP018164.1_2217362_2218055_+	NA|272aa|up_9|NZ_AP018164.1_2194769_2195585_-	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|113aa|up_8|NZ_AP018164.1_2196174_2196513_+	NA	NA|583aa|up_7|NZ_AP018164.1_2196654_2198403_+	pfam12077, DUF3556, Transmembrane protein of unknown function (DUF3556)	NA|344aa|up_6|NZ_AP018164.1_2198576_2199608_+	COG0057, GapA, Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase [Carbohydrate transport and metabolism]	NA|416aa|up_5|NZ_AP018164.1_2199610_2200858_+	PRK00073, pgk, phosphoglycerate kinase; Provisional	NA|262aa|up_4|NZ_AP018164.1_2200854_2201640_+	PRK00042, tpiA, triosephosphate isomerase; Provisional	NA|78aa|up_3|NZ_AP018164.1_2201693_2201927_+	PRK06870, secG, preprotein translocase subunit SecG; Reviewed	NA|933aa|up_2|NZ_AP018164.1_2201980_2204779_+	PRK00009, PRK00009, phosphoenolpyruvate carboxylase; Reviewed	NA|269aa|up_1|NZ_AP018164.1_2204862_2205669_+	pfam06724, DUF1206, Domain of Unknown Function (DUF1206)	NA|134aa|up_0|NZ_AP018164.1_2205681_2206083_-	NA	NA|250aa|down_0|NZ_AP018164.1_2208689_2209439_-	pfam01182, Glucosamine_iso, Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase	NA|304aa|down_1|NZ_AP018164.1_2209435_2210347_-	TIGR00534, Putative_OxPP_cycle_protein_OpcA, glucose-6-phosphate dehydrogenase assembly protein OpcA	NA|515aa|down_2|NZ_AP018164.1_2210396_2211941_-	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|373aa|down_3|NZ_AP018164.1_2211937_2213056_-	PRK03343, PRK03343, transaldolase; Validated	NA|697aa|down_4|NZ_AP018164.1_2213070_2215161_-	COG0021, TktA, Transketolase [Carbohydrate transport and metabolism]	NA|305aa|down_5|NZ_AP018164.1_2215428_2216343_+	PRK04375, PRK04375, protoheme IX farnesyltransferase; Provisional	NA|326aa|down_6|NZ_AP018164.1_2216365_2217343_-	cd05286, QOR2, Quinone oxidoreductase (QOR)	NA|231aa|down_7|NZ_AP018164.1_2217362_2218055_+	NA	NA|315aa|down_8|NZ_AP018164.1_2218004_2218949_-	COG1612, CtaA, Uncharacterized protein required for cytochrome oxidase assembly [Posttranslational modification, protein turnover, chaperones]	NA|201aa|down_9|NZ_AP018164.1_2219091_2219694_-	COG1309, AcrR, Transcriptional regulator [Transcription]
