assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_010727605.1_ASM1072760v1	NZ_AP022568	Mycobacterium simiae strain JCM 12377	1	306706-306828	1	CRISPRCasFinder	no		csa3,DinG,DEDDh,cas3,cas4,WYL,PD-DExK	Orphan	CATCACTTCGTTCTGCATCGTCGCGGCGCGACTCATTTGCGCCGCTT	47	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,DEDDh,cas3,cas4,WYL,PD-DExK	NA|228aa|up_2|NZ_AP022568.1_305023_305707_-,NA|121aa|down_1|NZ_AP022568.1_308427_308790_-,NA|92aa|down_2|NZ_AP022568.1_308977_309253_-	NA|420aa|up_9|NZ_AP022568.1_296863_298123_+	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|463aa|up_8|NZ_AP022568.1_298127_299516_-	TIGR02946, Putative_diacyglycerol_O-acyltransferase_Mb3115, acyltransferase, WS/DGAT/MGAT	NA|326aa|up_7|NZ_AP022568.1_299569_300547_-	TIGR00647, DNA_bind_WhiA, DNA-binding protein WhiA	NA|359aa|up_6|NZ_AP022568.1_300543_301620_-	TIGR01826, Putative_gluconeogenesis_factor, conserved hypothetical protein, cofD-related	NA|303aa|up_5|NZ_AP022568.1_301616_302525_-	COG1660, COG1660, Predicted P-loop-containing kinase [General function prediction only]	NA|660aa|up_4|NZ_AP022568.1_302521_304501_-	PRK00558, uvrC, excinuclease ABC subunit UvrC	NA|113aa|up_3|NZ_AP022568.1_304677_305016_+	TIGR04529, hypothetical_protein, hemophore-related protein, Rv0203/Rv1174c family	NA|228aa|up_2|NZ_AP022568.1_305023_305707_-	NA	NA|157aa|up_1|NZ_AP022568.1_305742_306213_-	pfam10756, bPH_6, Bacterial PH domain	NA|160aa|up_0|NZ_AP022568.1_306209_306689_-	PRK00061, ribH, 6,7-dimethyl-8-ribityllumazine synthase; Provisional	NA|426aa|down_0|NZ_AP022568.1_306846_308124_-	PRK09311, PRK09311, bifunctional 3,4-dihydroxy-2-butanone-4-phosphate synthase/GTP cyclohydrolase II	NA|121aa|down_1|NZ_AP022568.1_308427_308790_-	NA	NA|92aa|down_2|NZ_AP022568.1_308977_309253_-	NA	NA|203aa|down_3|NZ_AP022568.1_309249_309858_-	PRK09289, PRK09289, riboflavin synthase	NA|235aa|down_4|NZ_AP022568.1_309989_310694_+	pfam07161, LppX_LprAFG, LppX_LprAFG lipoprotein	NA|516aa|down_5|NZ_AP022568.1_310699_312247_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|342aa|down_6|NZ_AP022568.1_312247_313273_-	TIGR00326, eubact_ribD, riboflavin biosynthesis protein RibD	NA|230aa|down_7|NZ_AP022568.1_313269_313959_-	PRK05581, PRK05581, ribulose-phosphate 3-epimerase; Validated	NA|463aa|down_8|NZ_AP022568.1_313997_315386_-	PRK14902, PRK14902, 16S rRNA (cytosine(967)-C(5))-methyltransferase RsmB	NA|311aa|down_9|NZ_AP022568.1_315382_316315_-	PRK00005, fmt, methionyl-tRNA formyltransferase; Reviewed
GCF_010727605.1_ASM1072760v1	NZ_AP022568	Mycobacterium simiae strain JCM 12377	2	587978-588081	2	CRISPRCasFinder	no		csa3,DinG,DEDDh,cas3,cas4,WYL,PD-DExK	Orphan	GCCGGCTCAGGGCCGGCTCAGGCTTCGGTCGCT	33	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,DEDDh,cas3,cas4,WYL,PD-DExK	NA,NA|110aa|down_9|NZ_AP022568.1_597686_598016_+	NA|276aa|up_9|NZ_AP022568.1_577775_578603_-	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|106aa|up_8|NZ_AP022568.1_578743_579061_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|221aa|up_7|NZ_AP022568.1_579119_579782_-	TIGR00560, pgsA, CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase	NA|179aa|up_6|NZ_AP022568.1_579748_580285_+	PRK07922, PRK07922, amino-acid N-acetyltransferase	NA|847aa|up_5|NZ_AP022568.1_580342_582883_-	COG1674, FtsK, DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning]	NA|105aa|up_4|NZ_AP022568.1_583161_583476_+	COG1359, COG1359, Uncharacterized conserved protein [Function unknown]	NA|278aa|up_3|NZ_AP022568.1_583472_584306_+	TIGR03971, short-chain_dehydrogenase/reductase_SDR, SDR family mycofactocin-dependent oxidoreductase	NA|291aa|up_2|NZ_AP022568.1_584311_585184_+	TIGR00027, Hypothetical_protein_Rv0893c/MT0917/Mb0917c	NA|559aa|up_1|NZ_AP022568.1_585180_586857_-	COG0595, COG0595, mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair]	NA|301aa|up_0|NZ_AP022568.1_586876_587779_-	PRK03170, PRK03170, dihydrodipicolinate synthase; Provisional	NA|354aa|down_0|NZ_AP022568.1_588850_589912_-	cd17266, RMtype1_S_Sau1132ORF3780P-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Staphylococcus aureus subsp	NA|459aa|down_1|NZ_AP022568.1_589917_591294_-	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|408aa|down_2|NZ_AP022568.1_591293_592517_-	COG3214, COG3214, Uncharacterized protein conserved in bacteria [Function unknown]	NA|233aa|down_3|NZ_AP022568.1_592558_593257_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|169aa|down_4|NZ_AP022568.1_593357_593864_-	pfam00186, DHFR_1, Dihydrofolate reductase	NA|267aa|down_5|NZ_AP022568.1_593860_594661_-	PRK01827, thyA, thymidylate synthase; Reviewed	NA|250aa|down_6|NZ_AP022568.1_594725_595475_+	pfam01738, DLH, Dienelactone hydrolase family	NA|410aa|down_7|NZ_AP022568.1_595471_596701_-	cd03784, GT1_Gtf-like, UDP-glycosyltransferases and similar proteins	NA|259aa|down_8|NZ_AP022568.1_596775_597552_-	PRK07231, FabG-like, SDR family oxidoreductase	NA|110aa|down_9|NZ_AP022568.1_597686_598016_+	NA
GCF_010727605.1_ASM1072760v1	NZ_AP022568	Mycobacterium simiae strain JCM 12377	3	2654050-2654156	3	CRISPRCasFinder	no		csa3,DinG,DEDDh,cas3,cas4,WYL,PD-DExK	Orphan	TTGATGCCAGCCGGGGTCGAGGAGTTGCGCCCGTGACCC	39	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,DEDDh,cas3,cas4,WYL,PD-DExK	NA|86aa|up_3|NZ_AP022568.1_2648208_2648466_+,NA	NA|296aa|up_9|NZ_AP022568.1_2640266_2641154_-	smart00956, RQC, This DNA-binding domain is found in the RecQ helicase among others and has a helix-turn-helix structure	NA|111aa|up_8|NZ_AP022568.1_2641280_2641613_-	COG4122, COG4122, Predicted O-methyltransferase [General function prediction only]	NA|226aa|up_7|NZ_AP022568.1_2641853_2642531_-	pfam17939, TetR_C_30, Tetracyclin repressor-like, C-terminal domain	NA|311aa|up_6|NZ_AP022568.1_2642527_2643460_-	pfam01557, FAA_hydrolase, Fumarylacetoacetate (FAA) hydrolase family	NA|252aa|up_5|NZ_AP022568.1_2643553_2644309_+	cd07721, yflN-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis yflN; MBL-fold metallo hydrolase domain	NA|453aa|up_4|NZ_AP022568.1_2644347_2645706_+	PRK05077, frsA, esterase FrsA	NA|86aa|up_3|NZ_AP022568.1_2648208_2648466_+	NA	NA|291aa|up_2|NZ_AP022568.1_2650499_2651372_-	pfam18860, AbiJ_NTD3, AbiJ N-terminal domain 3	NA|552aa|up_1|NZ_AP022568.1_2651785_2653441_+	pfam12401, DUF3662, Protein of unknown function (DUF2662)	NA|156aa|up_0|NZ_AP022568.1_2653549_2654017_+	COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms]	NA|470aa|down_0|NZ_AP022568.1_2655654_2657064_+	pfam01098, FTSW_RODA_SPOVE, Cell cycle protein	NA|493aa|down_1|NZ_AP022568.1_2657060_2658539_+	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|444aa|down_2|NZ_AP022568.1_2658535_2659867_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|624aa|down_3|NZ_AP022568.1_2659863_2661735_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|232aa|down_4|NZ_AP022568.1_2661712_2662408_-	PRK07765, PRK07765, aminodeoxychorismate/anthranilate synthase component II	NA|248aa|down_5|NZ_AP022568.1_2662457_2663201_-	COG3879, COG3879, Uncharacterized protein conserved in bacteria [Function unknown]	NA|94aa|down_6|NZ_AP022568.1_2663319_2663601_+	PRK00159, PRK00159, putative septation inhibitor protein; Reviewed	NA|145aa|down_7|NZ_AP022568.1_2663756_2664191_+	pfam10756, bPH_6, Bacterial PH domain	NA|183aa|down_8|NZ_AP022568.1_2664217_2664766_-	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|146aa|down_9|NZ_AP022568.1_2664903_2665341_+	pfam10814, CwsA, Cell wall synthesis protein CwsA
GCF_010727605.1_ASM1072760v1	NZ_AP022568	Mycobacterium simiae strain JCM 12377	4	2695513-2695598	4	CRISPRCasFinder	no		csa3,DinG,DEDDh,cas3,cas4,WYL,PD-DExK	Orphan	GCGGGCCGTCCCGGCCCGCATCGTCACCCG	30	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,DEDDh,cas3,cas4,WYL,PD-DExK	NA|251aa|up_7|NZ_AP022568.1_2683654_2684407_+,NA|233aa|up_3|NZ_AP022568.1_2687211_2687910_-,NA|800aa|up_1|NZ_AP022568.1_2692285_2694685_-,NA|170aa|down_1|NZ_AP022568.1_2697093_2697603_+	NA|272aa|up_9|NZ_AP022568.1_2681538_2682354_+	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|328aa|up_8|NZ_AP022568.1_2682376_2683360_+	TIGR04285, parB-like_partition_protein, nucleoid occlusion protein	NA|251aa|up_7|NZ_AP022568.1_2683654_2684407_+	NA	NA|407aa|up_6|NZ_AP022568.1_2684403_2685624_-	COG0860, AmiC, N-acetylmuramoyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]	NA|118aa|up_5|NZ_AP022568.1_2685758_2686112_-	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	NA|337aa|up_4|NZ_AP022568.1_2686108_2687119_-	TIGR01292, Thioredoxin_reductase, thioredoxin-disulfide reductase	NA|233aa|up_3|NZ_AP022568.1_2687211_2687910_-	NA	NA|1203aa|up_2|NZ_AP022568.1_2688680_2692289_-	COG0728, MviN, Uncharacterized membrane protein, putative virulence factor [General function prediction only]	NA|800aa|up_1|NZ_AP022568.1_2692285_2694685_-	NA	NA|247aa|up_0|NZ_AP022568.1_2694681_2695422_-	cd03673, Ap6A_hydrolase, Diadenosine hexaphosphate (Ap6A) hydrolase is a member of the Nudix hydrolase superfamily	NA|485aa|down_0|NZ_AP022568.1_2695614_2697069_+	TIGR02692, putative_tRNA_nucleotidyltransferase, tRNA adenylyltransferase	NA|170aa|down_1|NZ_AP022568.1_2697093_2697603_+	NA	NA|108aa|down_2|NZ_AP022568.1_2697743_2698067_+	pfam06013, WXG100, Proteins of 100 residues with WXG	NA|97aa|down_3|NZ_AP022568.1_2698118_2698409_+	COG4842, COG4842, Uncharacterized protein conserved in bacteria [Function unknown]	NA|278aa|down_4|NZ_AP022568.1_2698468_2699302_+	pfam14011, ESX-1_EspG, EspG family	NA|302aa|down_5|NZ_AP022568.1_2699460_2700366_+	COG0455, flhG, Antiactivator of flagellar biosynthesis FleN, an ATPase [Cell motility]	NA|494aa|down_6|NZ_AP022568.1_2700377_2701859_+	TIGR03920, T7SS_EccD, type VII secretion integral membrane protein EccD	NA|554aa|down_7|NZ_AP022568.1_2701855_2703517_+	TIGR03921, T7SS_mycosin, type VII secretion-associated serine protease mycosin	NA|517aa|down_8|NZ_AP022568.1_2703513_2705064_+	TIGR03923, T7SS_EccE, type VII secretion protein EccE	NA|614aa|down_9|NZ_AP022568.1_2705060_2706902_+	TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA
