assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_010729105.1_ASM1072910v1	NZ_AP022573	Mycobacterium saskatchewanense strain JCM 13016	1	460936-461050	1,1	PILER-CR,CRISPRCasFinder	no		WYL,cas3,cas4,DEDDh,csa3,DinG	Orphan	ACCGGGCTCCTCGGGGGTCTGACCGGCGGCG,ACCGGGCTCCTCGGGGGTCTGACC	31,24	0	0	NA	NA	NA:NA	2,2	2	Orphan	WYL,cas3,cas4,DEDDh,csa3,DinG	NA,NA	NA|65aa|up_9|NZ_AP022573.1_446928_447123_+	PRK00172, rpmI, 50S ribosomal protein L35; Reviewed	NA|132aa|up_8|NZ_AP022573.1_447202_447598_+	PRK05185, rplT, 50S ribosomal protein L20; Provisional	NA|260aa|up_7|NZ_AP022573.1_447610_448390_+	COG0566, SpoU, rRNA methylases [Translation, ribosomal structure and biogenesis]	NA|356aa|up_6|NZ_AP022573.1_448413_449481_-	COG3662, COG3662, Uncharacterized protein conserved in bacteria [Function unknown]	NA|310aa|up_5|NZ_AP022573.1_449682_450612_+	cd07302, CHD, cyclase homology domain	NA|348aa|up_4|NZ_AP022573.1_451559_452603_+	PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha; Validated	NA|829aa|up_3|NZ_AP022573.1_452602_455089_+	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|556aa|up_2|NZ_AP022573.1_455474_457142_+	pfam00934, PE, PE family	NA|519aa|up_1|NZ_AP022573.1_457417_458974_+	pfam00934, PE, PE family	NA|349aa|up_0|NZ_AP022573.1_459165_460212_+	pfam00934, PE, PE family	NA|1606aa|down_0|NZ_AP022573.1_462397_467215_+	pfam00934, PE, PE family	NA|348aa|down_1|NZ_AP022573.1_467374_468418_+	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|405aa|down_2|NZ_AP022573.1_468414_469629_+	PRK05388, argJ, bifunctional glutamate N-acetyltransferase/amino-acid acetyltransferase ArgJ	NA|296aa|down_3|NZ_AP022573.1_469625_470513_+	PRK00942, PRK00942, acetylglutamate kinase; Provisional	NA|396aa|down_4|NZ_AP022573.1_470509_471697_+	PRK03244, argD, acetylornithine transaminase	NA|309aa|down_5|NZ_AP022573.1_471689_472616_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|165aa|down_6|NZ_AP022573.1_472612_473107_+	PRK03341, PRK03341, arginine repressor; Provisional	NA|399aa|down_7|NZ_AP022573.1_473122_474319_+	PRK00509, PRK00509, argininosuccinate synthase; Provisional	NA|470aa|down_8|NZ_AP022573.1_474422_475832_+	PRK00855, PRK00855, argininosuccinate lyase; Provisional	NA|354aa|down_9|NZ_AP022573.1_475934_476996_+	COG3424, BcsA, Predicted naringenin-chalcone synthase [Secondary metabolites biosynthesis, transport, and catabolism]
GCF_010729105.1_ASM1072910v1	NZ_AP022573	Mycobacterium saskatchewanense strain JCM 13016	2	1046279-1046358	2	CRISPRCasFinder	no		WYL,cas3,cas4,DEDDh,csa3,DinG	Orphan	CTCGCCGATGAGGCCGCCGACCGGGC	26	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,cas4,DEDDh,csa3,DinG	NA,NA|66aa|down_6|NZ_AP022573.1_1067170_1067368_-	NA|279aa|up_9|NZ_AP022573.1_1036895_1037732_-	pfam11296, DUF3097, Protein of unknown function (DUF3097)	NA|96aa|up_8|NZ_AP022573.1_1037949_1038237_+	pfam11829, DUF3349, Protein of unknown function (DUF3349)	NA|334aa|up_7|NZ_AP022573.1_1038233_1039235_-	cd07207, Pat_ExoU_VipD_like, ExoU and VipD-like proteins; homologus to patatin, cPLA2, and iPLA2	NA|357aa|up_6|NZ_AP022573.1_1039231_1040302_-	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|281aa|up_5|NZ_AP022573.1_1040304_1041147_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|320aa|up_4|NZ_AP022573.1_1041133_1042093_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|441aa|up_3|NZ_AP022573.1_1042089_1043412_-	cd13585, PBP2_TMBP_like, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose and similar oligosaccharides; possess type 2 periplasmic binding fold	NA|117aa|up_2|NZ_AP022573.1_1043698_1044049_+	pfam13564, DoxX_2, DoxX-like family	NA|266aa|up_1|NZ_AP022573.1_1044053_1044851_-	pfam02136, NTF2, Nuclear transport factor 2 (NTF2) domain	NA|187aa|up_0|NZ_AP022573.1_1044850_1045411_-	cd01011, nicotinamidase, Nicotinamidase/pyrazinamidase (PZase)	NA|162aa|down_0|NZ_AP022573.1_1046574_1047060_-	COG3265, GntK, Gluconate kinase [Carbohydrate transport and metabolism]	NA|668aa|down_1|NZ_AP022573.1_1047083_1049087_-	pfam02958, EcKinase, Ecdysteroid kinase	NA|502aa|down_2|NZ_AP022573.1_1049104_1050610_-	COG2272, PnbA, Carboxylesterase type B [Lipid metabolism]	NA|227aa|down_3|NZ_AP022573.1_1050746_1051427_+	PRK14959, PRK14959, DNA polymerase III subunits gamma and tau; Provisional	NA|874aa|down_4|NZ_AP022573.1_1051680_1054302_-	PRK05865, PRK05865, sugar epimerase family protein	NA|4169aa|down_5|NZ_AP022573.1_1054323_1066830_-	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|66aa|down_6|NZ_AP022573.1_1067170_1067368_-	NA	NA|112aa|down_7|NZ_AP022573.1_1067685_1068021_+	pfam13397, RbpA, RNA polymerase-binding protein	NA|268aa|down_8|NZ_AP022573.1_1068039_1068843_-	cd06442, DPM1_like, DPM1_like represents putative enzymes similar to eukaryotic DPM1	NA|511aa|down_9|NZ_AP022573.1_1070925_1072458_-	cd01300, YtcJ_like, YtcJ_like metal dependent amidohydrolases
GCF_010729105.1_ASM1072910v1	NZ_AP022573	Mycobacterium saskatchewanense strain JCM 13016	3	1782052-1782134	3	CRISPRCasFinder	no		WYL,cas3,cas4,DEDDh,csa3,DinG	Orphan	ACGTTCGGGTCGAAGCCCACCGG	23	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,cas4,DEDDh,csa3,DinG	NA,NA	NA|371aa|up_9|NZ_AP022573.1_1768818_1769931_-	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|485aa|up_8|NZ_AP022573.1_1769927_1771382_-	PRK12296, obgE, GTPase CgtA; Reviewed	NA|85aa|up_7|NZ_AP022573.1_1771444_1771699_-	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|104aa|up_6|NZ_AP022573.1_1771713_1772025_-	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|960aa|up_5|NZ_AP022573.1_1772260_1775140_-	TIGR00757, Ribonuclease_E/G-like_protein, ribonuclease, Rne/Rng family	NA|137aa|up_4|NZ_AP022573.1_1775445_1775856_-	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|129aa|up_3|NZ_AP022573.1_1775885_1776272_-	pfam14017, DUF4233, Protein of unknown function (DUF4233)	NA|490aa|up_2|NZ_AP022573.1_1776268_1777738_-	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|885aa|up_1|NZ_AP022573.1_1777734_1780389_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|420aa|up_0|NZ_AP022573.1_1780415_1781675_-	COG3268, COG3268, Uncharacterized conserved protein [Function unknown]	NA|203aa|down_0|NZ_AP022573.1_1783437_1784046_-	PRK00576, PRK00576, molybdopterin-guanine dinucleotide biosynthesis protein A; Provisional	NA|369aa|down_1|NZ_AP022573.1_1784060_1785167_-	PRK11867, PRK11867, 2-oxoglutarate ferredoxin oxidoreductase subunit beta; Reviewed	NA|659aa|down_2|NZ_AP022573.1_1785163_1787140_-	TIGR03710, OAFO_sf, 2-oxoacid:acceptor oxidoreductase, alpha subunit	NA|427aa|down_3|NZ_AP022573.1_1787541_1788822_-	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX	NA|215aa|down_4|NZ_AP022573.1_1790478_1791123_-	PRK12553, PRK12553, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|197aa|down_5|NZ_AP022573.1_1791119_1791710_-	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|478aa|down_6|NZ_AP022573.1_1791837_1793271_-	PRK01490, tig, trigger factor; Provisional	NA|403aa|down_7|NZ_AP022573.1_1793649_1794858_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|267aa|down_8|NZ_AP022573.1_1795722_1796523_-	COG0266, Nei, Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]	NA|163aa|down_9|NZ_AP022573.1_1796526_1797015_-	PRK05571, PRK05571, ribose-5-phosphate isomerase B; Provisional
GCF_010729105.1_ASM1072910v1	NZ_AP022573	Mycobacterium saskatchewanense strain JCM 13016	4	4945287-4945394	4	CRISPRCasFinder	no		WYL,cas3,cas4,DEDDh,csa3,DinG	Orphan	CAAGCGAGCGCGGGGCGGGCGCCCGGCTGGCCGGGCGC	38	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,cas4,DEDDh,csa3,DinG	NA,NA	NA|131aa|up_9|NZ_AP022573.1_4932490_4932883_+	pfam05899, Cupin_3, Protein of unknown function (DUF861)	NA|467aa|up_8|NZ_AP022573.1_4932966_4934367_+	TIGR03329, Phn_aa_oxid, putative aminophosphonate oxidoreductase	NA|487aa|up_7|NZ_AP022573.1_4934445_4935906_+	cd07114, ALDH_DhaS, Uncharacterized Candidatus pelagibacter aldehyde dehydrogenase, DhaS-like	NA|291aa|up_6|NZ_AP022573.1_4935953_4936826_+	pfam13561, adh_short_C2, Enoyl-(Acyl carrier protein) reductase	NA|487aa|up_5|NZ_AP022573.1_4936937_4938398_+	COG1457, CodB, Purine-cytosine permease and related proteins [Nucleotide transport and metabolism]	NA|222aa|up_4|NZ_AP022573.1_4938415_4939081_+	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|545aa|up_3|NZ_AP022573.1_4939111_4940746_+	cd05936, FC-FACS_FadD_like, Prokaryotic long-chain fatty acid CoA synthetases similar to Escherichia coli FadD	NA|746aa|up_2|NZ_AP022573.1_4940884_4943122_+	pfam03971, IDH, Monomeric isocitrate dehydrogenase	NA|412aa|up_1|NZ_AP022573.1_4943149_4944385_+	PRK08299, PRK08299, NADP-dependent isocitrate dehydrogenase	NA|280aa|up_0|NZ_AP022573.1_4944412_4945252_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|340aa|down_0|NZ_AP022573.1_4945461_4946481_+	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	NA|343aa|down_1|NZ_AP022573.1_4946490_4947519_+	TIGR00766, Uncharacterized_protein_Dda3937_02003, inner membrane protein YhjD	NA|367aa|down_2|NZ_AP022573.1_4947515_4948616_-	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|454aa|down_3|NZ_AP022573.1_4948630_4949992_-	pfam00083, Sugar_tr, Sugar (and other) transporter	NA|416aa|down_4|NZ_AP022573.1_4949988_4951236_-	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]	NA|444aa|down_5|NZ_AP022573.1_4951322_4952654_-	PRK06541, PRK06541, aspartate aminotransferase family protein	NA|314aa|down_6|NZ_AP022573.1_4952701_4953643_+	PRK09636, PRK09636, RNA polymerase sigma factor SigJ; Provisional	NA|201aa|down_7|NZ_AP022573.1_4953578_4954181_-	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|267aa|down_8|NZ_AP022573.1_4954286_4955087_-	PRK05950, sdhB, succinate dehydrogenase iron-sulfur subunit; Reviewed	NA|585aa|down_9|NZ_AP022573.1_4955086_4956841_-	PRK08205, sdhA, succinate dehydrogenase flavoprotein subunit; Reviewed
GCF_010729105.1_ASM1072910v1	NZ_AP022573	Mycobacterium saskatchewanense strain JCM 13016	5	5540822-5540901	5	CRISPRCasFinder	no		WYL,cas3,cas4,DEDDh,csa3,DinG	Orphan	GGACTTGACGAGTGGCGACCCGC	23	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,cas4,DEDDh,csa3,DinG	NA,NA	NA|483aa|up_9|NZ_AP022573.1_5530928_5532377_+	COG2186, FadR, Transcriptional regulators [Transcription]	NA|323aa|up_8|NZ_AP022573.1_5532508_5533477_+	pfam14417, MEDS, MEDS: MEthanogen/methylotroph, DcmR Sensory domain	NA|174aa|up_7|NZ_AP022573.1_5533990_5534512_+	TIGR00970, 2-isopropylmalate_synthase, 2-isopropylmalate synthase, yeast type	NA|494aa|up_6|NZ_AP022573.1_5534522_5536004_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|216aa|up_5|NZ_AP022573.1_5536130_5536778_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|311aa|up_4|NZ_AP022573.1_5536805_5537738_+	PRK07832, PRK07832, SDR family oxidoreductase	NA|345aa|up_3|NZ_AP022573.1_5537672_5538707_-	PRK03352, PRK03352, DNA polymerase IV; Validated	NA|203aa|up_2|NZ_AP022573.1_5538706_5539315_-	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|191aa|up_1|NZ_AP022573.1_5539405_5539978_+	COG0431, COG0431, Predicted flavoprotein [General function prediction only]	NA|80aa|up_0|NZ_AP022573.1_5540535_5540775_+	TIGR02194, Glutaredoxin-like_protein_NrdH, Glutaredoxin-like protein NrdH	NA|722aa|down_0|NZ_AP022573.1_5541310_5543476_+	PRK08188, PRK08188, ribonucleotide-diphosphate reductase subunit alpha; Validated	NA|248aa|down_1|NZ_AP022573.1_5543614_5544358_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|520aa|down_2|NZ_AP022573.1_5544414_5545974_-	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|196aa|down_3|NZ_AP022573.1_5546162_5546750_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|524aa|down_4|NZ_AP022573.1_5546850_5548422_+	COG2072, TrkA, Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism]	NA|306aa|down_5|NZ_AP022573.1_5548412_5549330_-	cd05374, 17beta-HSD-like_SDR_c, 17beta hydroxysteroid dehydrogenase-like, classical (c) SDRs	NA|191aa|down_6|NZ_AP022573.1_5549422_5549995_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|183aa|down_7|NZ_AP022573.1_5549971_5550520_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|149aa|down_8|NZ_AP022573.1_5550503_5550950_-	cd08862, SRPBCC_Smu440-like, Ligand-binding SRPBCC domain of Streptococcus mutans Smu	NA|459aa|down_9|NZ_AP022573.1_5551192_5552569_+	pfam02720, DUF222, Domain of unknown function (DUF222)
GCF_010729105.1_ASM1072910v1	NZ_AP022573	Mycobacterium saskatchewanense strain JCM 13016	6	5637447-5637530	6	CRISPRCasFinder	no		WYL,cas3,cas4,DEDDh,csa3,DinG	Orphan	CTGATGGCGGTGACCCGCATCGCCCGGCA	29	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,cas4,DEDDh,csa3,DinG	NA,NA	NA|266aa|up_9|NZ_AP022573.1_5626062_5626860_+	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|489aa|up_8|NZ_AP022573.1_5626856_5628323_+	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|171aa|up_7|NZ_AP022573.1_5628696_5629209_-	TIGR03618, Rv1155_F420, PPOX class probable F420-dependent enzyme	NA|234aa|up_6|NZ_AP022573.1_5629538_5630240_-	COG1414, IclR, Transcriptional regulator [Transcription]	NA|483aa|up_5|NZ_AP022573.1_5630307_5631756_+	PRK05478, PRK05478, 3-isopropylmalate dehydratase large subunit	NA|199aa|up_4|NZ_AP022573.1_5631774_5632371_+	PRK01641, leuD, 3-isopropylmalate dehydratase small subunit	NA|212aa|up_3|NZ_AP022573.1_5632585_5633221_+	COG0776, HimA, Bacterial nucleoid DNA-binding protein [DNA replication, recombination, and repair]	NA|314aa|up_2|NZ_AP022573.1_5633285_5634227_-	cd03673, Ap6A_hydrolase, Diadenosine hexaphosphate (Ap6A) hydrolase is a member of the Nudix hydrolase superfamily	NA|730aa|up_1|NZ_AP022573.1_5634312_5636502_-	PRK05443, PRK05443, polyphosphate kinase; Provisional	NA|217aa|up_0|NZ_AP022573.1_5636750_5637401_-	COG1920, COG1920, Predicted nucleotidyltransferase, CobY/MobA/RfbA family [General function prediction only]	NA|340aa|down_0|NZ_AP022573.1_5637624_5638644_+	PRK00094, gpsA, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase	NA|373aa|down_1|NZ_AP022573.1_5638711_5639830_+	PRK01966, ddl, D-alanine--D-alanine ligase	NA|191aa|down_2|NZ_AP022573.1_5639826_5640399_-	pfam12028, DUF3515, Protein of unknown function (DUF3515)	NA|323aa|down_3|NZ_AP022573.1_5640518_5641487_+	PRK05731, PRK05731, thiamine monophosphate kinase; Provisional	NA|228aa|down_4|NZ_AP022573.1_5641528_5642212_+	PRK05254, PRK05254, uracil-DNA glycosylase; Provisional	NA|177aa|down_5|NZ_AP022573.1_5642217_5642748_+	cd05289, MDR_like_2, alcohol dehydrogenase and quinone reductase-like medium chain degydrogenases/reductases	NA|217aa|down_6|NZ_AP022573.1_5642780_5643431_+	cd02136, PnbA_NfnB-like, nitroreductase similar to Mycobacterium smegmatis NfnB	NA|65aa|down_7|NZ_AP022573.1_5643441_5643636_-	PRK00359, rpmB, 50S ribosomal protein L28; Reviewed	NA|561aa|down_8|NZ_AP022573.1_5643903_5645586_+	TIGR03599, YloV, DAK2 domain fusion protein YloV	NA|742aa|down_9|NZ_AP022573.1_5645588_5647814_+	PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional
