assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009834245.1_ASM983424v1	NZ_CP047183	Rathayibacter sp. VKM Ac-2801 chromosome, complete genome	1	85829-85956	1	CRISPRCasFinder	no		WYL,csa3,cas3,DEDDh	Orphan	TCGGGTGTCGAGTGCGACAACTTCTGGACAGTGGCGAGCGGCGGACGG	48	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,cas3,DEDDh	NA,NA|225aa|down_9|NZ_CP047183.1_98055_98730_+	NA|232aa|up_9|NZ_CP047183.1_71986_72682_-	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|146aa|up_8|NZ_CP047183.1_72743_73181_-	pfam07179, SseB, SseB protein N-terminal domain	NA|142aa|up_7|NZ_CP047183.1_73399_73825_+	pfam11774, Lsr2, Lsr2	NA|232aa|up_6|NZ_CP047183.1_73944_74640_-	PRK03298, PRK03298, endonuclease NucS	NA|233aa|up_5|NZ_CP047183.1_74680_75379_-	PRK14875, PRK14875, acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional	NA|404aa|up_4|NZ_CP047183.1_75477_76689_-	cd17393, MFS_MosC_like, Membrane protein MosC and similar proteins of the Major Facilitator Superfamily of transporters	NA|398aa|up_3|NZ_CP047183.1_76687_77881_+	cd06279, PBP1_LacI-like, ligand-binding domain of an uncharacterized transcription regulator from Corynebacterium glutamicum and its close homologs from other bacteria	NA|434aa|up_2|NZ_CP047183.1_77924_79226_-	cd13136, MATE_DinF_like, DinF and similar proteins, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|299aa|up_1|NZ_CP047183.1_79278_80175_+	cd19098, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|1756aa|up_0|NZ_CP047183.1_80475_85743_+	COG3693, XynA, Beta-1,4-xylanase [Carbohydrate transport and metabolism]	NA|210aa|down_0|NZ_CP047183.1_85982_86612_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|249aa|down_1|NZ_CP047183.1_86789_87536_-	PRK06500, PRK06500, SDR family oxidoreductase	NA|98aa|down_2|NZ_CP047183.1_87687_87981_-	COG1359, COG1359, Uncharacterized conserved protein [Function unknown]	NA|266aa|down_3|NZ_CP047183.1_87977_88775_-	COG1129, MglA, ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism]	NA|349aa|down_4|NZ_CP047183.1_88771_89818_-	COG1172, AraH, Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components [Carbohydrate transport and metabolism]	NA|369aa|down_5|NZ_CP047183.1_89889_90996_-	cd19973, PBP1_ABC_sugar_binding-like, monosaccharide ABC transporter substrate-binding protein	NA|333aa|down_6|NZ_CP047183.1_91159_92158_+	COG1609, PurR, Transcriptional regulators [Transcription]	NA|319aa|down_7|NZ_CP047183.1_95874_96831_-	COG1216, COG1216, Predicted glycosyltransferases [General function prediction only]	NA|269aa|down_8|NZ_CP047183.1_96842_97649_-	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|225aa|down_9|NZ_CP047183.1_98055_98730_+	NA
GCF_009834245.1_ASM983424v1	NZ_CP047183	Rathayibacter sp. VKM Ac-2801 chromosome, complete genome	2	327891-328003	2	CRISPRCasFinder	no		WYL,csa3,cas3,DEDDh	Orphan	TGCCGCCCGGCTCCGCTCGCGAGATGCCACTT	32	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,cas3,DEDDh	NA|104aa|up_9|NZ_CP047183.1_315369_315681_+,NA|182aa|up_2|NZ_CP047183.1_324234_324780_-,NA|68aa|down_7|NZ_CP047183.1_336630_336834_-	NA|104aa|up_9|NZ_CP047183.1_315369_315681_+	NA	NA|305aa|up_8|NZ_CP047183.1_315905_316820_-	pfam00480, ROK, ROK family	NA|554aa|up_7|NZ_CP047183.1_316854_318516_-	NF033435, S-layer_Clost, S-layer protein SlpA	NA|724aa|up_6|NZ_CP047183.1_318708_320880_-	cd00063, FN3, Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin	NA|279aa|up_5|NZ_CP047183.1_321320_322157_+	COG1834, COG1834, N-Dimethylarginine dimethylaminohydrolase [Amino acid transport and metabolism]	NA|410aa|up_4|NZ_CP047183.1_322284_323514_+	PRK04073, rocD, ornithine--oxo-acid transaminase; Provisional	NA|209aa|up_3|NZ_CP047183.1_323611_324238_-	cd01014, nicotinamidase_related, Nicotinamidase_ related amidohydrolases	NA|182aa|up_2|NZ_CP047183.1_324234_324780_-	NA	NA|637aa|up_1|NZ_CP047183.1_325077_326988_+	pfam13520, AA_permease_2, Amino acid permease	NA|199aa|up_0|NZ_CP047183.1_327282_327879_+	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|480aa|down_0|NZ_CP047183.1_328074_329514_-	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|294aa|down_1|NZ_CP047183.1_329759_330641_+	pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region	NA|444aa|down_2|NZ_CP047183.1_330658_331990_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|142aa|down_3|NZ_CP047183.1_332146_332572_-	pfam05025, RbsD_FucU, RbsD / FucU transport protein family	NA|334aa|down_4|NZ_CP047183.1_332581_333583_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|340aa|down_5|NZ_CP047183.1_333584_334604_-	cd19071, AKR_AKR1-5-like, AKR1/2/3/4/5 family of aldo-keto reductase (AKR) and similar proteins	NA|550aa|down_6|NZ_CP047183.1_334984_336634_-	cd11334, AmyAc_TreS, Alpha amylase catalytic domain found in Trehalose synthetase	NA|68aa|down_7|NZ_CP047183.1_336630_336834_-	NA	NA|399aa|down_8|NZ_CP047183.1_337204_338401_+	COG3844, COG3844, Kynureninase [Amino acid transport and metabolism]	NA|132aa|down_9|NZ_CP047183.1_338517_338913_-	pfam06150, ChaB, ChaB
GCF_009834245.1_ASM983424v1	NZ_CP047183	Rathayibacter sp. VKM Ac-2801 chromosome, complete genome	3	337028-337129	3	CRISPRCasFinder	no		WYL,csa3,cas3,DEDDh	Orphan	CGAGTGCGACGACTTCTGGACAG	23	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,cas3,DEDDh	NA|68aa|up_0|NZ_CP047183.1_336630_336834_-,NA	NA|637aa|up_9|NZ_CP047183.1_325077_326988_+	pfam13520, AA_permease_2, Amino acid permease	NA|199aa|up_8|NZ_CP047183.1_327282_327879_+	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|480aa|up_7|NZ_CP047183.1_328074_329514_-	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|294aa|up_6|NZ_CP047183.1_329759_330641_+	pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region	NA|444aa|up_5|NZ_CP047183.1_330658_331990_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|142aa|up_4|NZ_CP047183.1_332146_332572_-	pfam05025, RbsD_FucU, RbsD / FucU transport protein family	NA|334aa|up_3|NZ_CP047183.1_332581_333583_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|340aa|up_2|NZ_CP047183.1_333584_334604_-	cd19071, AKR_AKR1-5-like, AKR1/2/3/4/5 family of aldo-keto reductase (AKR) and similar proteins	NA|550aa|up_1|NZ_CP047183.1_334984_336634_-	cd11334, AmyAc_TreS, Alpha amylase catalytic domain found in Trehalose synthetase	NA|68aa|up_0|NZ_CP047183.1_336630_336834_-	NA	NA|399aa|down_0|NZ_CP047183.1_337204_338401_+	COG3844, COG3844, Kynureninase [Amino acid transport and metabolism]	NA|132aa|down_1|NZ_CP047183.1_338517_338913_-	pfam06150, ChaB, ChaB	NA|409aa|down_2|NZ_CP047183.1_339062_340289_+	COG0415, PhrB, Deoxyribodipyrimidine photolyase [DNA replication, recombination, and repair]	NA|419aa|down_3|NZ_CP047183.1_340334_341591_-	TIGR01678, D-arabinono-14-lactone_oxidase, sugar 1,4-lactone oxidases	NA|554aa|down_4|NZ_CP047183.1_342268_343930_+	pfam02720, DUF222, Domain of unknown function (DUF222)	NA|302aa|down_5|NZ_CP047183.1_344306_345212_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|308aa|down_6|NZ_CP047183.1_345220_346144_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|417aa|down_7|NZ_CP047183.1_346143_347394_-	cd13585, PBP2_TMBP_like, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose and similar oligosaccharides; possess type 2 periplasmic binding fold	NA|349aa|down_8|NZ_CP047183.1_347479_348526_-	pfam01636, APH, Phosphotransferase enzyme family	NA|199aa|down_9|NZ_CP047183.1_348989_349586_-	pfam11377, DUF3180, Protein of unknown function (DUF3180)
GCF_009834245.1_ASM983424v1	NZ_CP047183	Rathayibacter sp. VKM Ac-2801 chromosome, complete genome	4	620326-620436	4	CRISPRCasFinder	no		WYL,csa3,cas3,DEDDh	Orphan	AGTGCGACAACTCCTGCACATTCGCGGGAG	30	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,cas3,DEDDh	NA|526aa|up_0|NZ_CP047183.1_618662_620240_+,NA|214aa|down_6|NZ_CP047183.1_628961_629603_+	NA|280aa|up_9|NZ_CP047183.1_607905_608745_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|520aa|up_8|NZ_CP047183.1_608741_610301_+	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|353aa|up_7|NZ_CP047183.1_610520_611579_-	pfam07995, GSDH, Glucose / Sorbosone dehydrogenase	NA|461aa|up_6|NZ_CP047183.1_611722_613105_+	COG1239, ChlI, Mg-chelatase subunit ChlI [Coenzyme metabolism]	NA|683aa|up_5|NZ_CP047183.1_613097_615146_+	COG4867, COG4867, Uncharacterized protein with a von Willebrand factor type A (vWA) domain [General function prediction only]	NA|255aa|up_4|NZ_CP047183.1_615244_616009_-	TIGR03843, Phosphatidylinositol_3-_and_4-kinase, conserved hypothetical protein	NA|178aa|up_3|NZ_CP047183.1_616005_616539_-	pfam11290, DUF3090, Protein of unknown function (DUF3090)	NA|229aa|up_2|NZ_CP047183.1_616819_617506_-	TIGR03848, MSMEG_4193, probable phosphomutase, MSMEG_4193 family	NA|307aa|up_1|NZ_CP047183.1_617760_618681_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|526aa|up_0|NZ_CP047183.1_618662_620240_+	NA	NA|162aa|down_0|NZ_CP047183.1_620711_621197_+	pfam13563, 2_5_RNA_ligase2, 2'-5' RNA ligase superfamily	NA|712aa|down_1|NZ_CP047183.1_621323_623459_+	TIGR01389, recQ, ATP-dependent DNA helicase RecQ	NA|169aa|down_2|NZ_CP047183.1_623586_624093_-	PRK05415, PRK05415, hypothetical protein; Provisional	NA|715aa|down_3|NZ_CP047183.1_624390_626535_+	cd01150, AXO, Peroxisomal acyl-CoA oxidase	NA|328aa|down_4|NZ_CP047183.1_626628_627612_-	PRK05442, PRK05442, malate dehydrogenase; Provisional	NA|300aa|down_5|NZ_CP047183.1_627652_628552_-	cd06267, PBP1_LacI_sugar_binding-like, ligand binding domain of the LacI transcriptional regulator family belonging to the type 1 periplasmic-binding fold protein superfamily	NA|214aa|down_6|NZ_CP047183.1_628961_629603_+	NA	NA|966aa|down_7|NZ_CP047183.1_629599_632497_+	cd09001, GH43_FsAxh1-like, Glycosyl hydrolase family 43 such as Fibrobacter succinogenes subsp	NA|794aa|down_8|NZ_CP047183.1_632634_635016_+	cd09003, GH43_XynD-like, Glycosyl hydrolase family 43 protein such as Bacillus subtilis arabinoxylan arabinofuranohydrolase  (XynD;BsAXH-m23;BSU18160)	NA|316aa|down_9|NZ_CP047183.1_635161_636109_-	COG0583, LysR, Transcriptional regulator [Transcription]
GCF_009834245.1_ASM983424v1	NZ_CP047183	Rathayibacter sp. VKM Ac-2801 chromosome, complete genome	5	1925035-1925223	1	CRT	no		WYL,csa3,cas3,DEDDh	Orphan	GCCCTCGAGGCCGAGGTCGCC	21	0	0	NA	NA	NA	3	3	Orphan	WYL,csa3,cas3,DEDDh	NA|599aa|up_0|NZ_CP047183.1_1922375_1924172_-,NA|79aa|down_1|NZ_CP047183.1_1926608_1926845_-	NA|833aa|up_9|NZ_CP047183.1_1906139_1908638_-	PRK07956, ligA, NAD-dependent DNA ligase LigA; Validated	NA|368aa|up_8|NZ_CP047183.1_1908634_1909738_-	PRK00143, mnmA, tRNA-specific 2-thiouridylase MnmA; Reviewed	NA|430aa|up_7|NZ_CP047183.1_1909796_1911086_-	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	NA|691aa|up_6|NZ_CP047183.1_1911084_1913157_+	cd11326, AmyAc_Glg_debranch, Alpha amylase catalytic domain found in glycogen debranching enzymes	NA|161aa|up_5|NZ_CP047183.1_1913156_1913639_+	cd00002, YbaK_deacylase, This CD includes cysteinyl-tRNA(Pro) deacylases from Haemophilus influenzae and Escherichia coli and other related bacterial proteins	NA|858aa|up_4|NZ_CP047183.1_1913701_1916275_-	cd04299, GT35_Glycogen_Phosphorylase-like, proteins similar to glycogen phosphorylase	NA|739aa|up_3|NZ_CP047183.1_1916499_1918716_+	cd11344, AmyAc_GlgE_like, Alpha amylase catalytic domain found in GlgE-like proteins	NA|877aa|up_2|NZ_CP047183.1_1918712_1921343_+	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|320aa|up_1|NZ_CP047183.1_1921419_1922379_-	COG3118, COG3118, Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]	NA|599aa|up_0|NZ_CP047183.1_1922375_1924172_-	NA	NA|242aa|down_0|NZ_CP047183.1_1925886_1926612_-	COG2945, COG2945, Predicted hydrolase of the alpha/beta superfamily [General function prediction only]	NA|79aa|down_1|NZ_CP047183.1_1926608_1926845_-	NA	NA|583aa|down_2|NZ_CP047183.1_1926951_1928700_-	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|247aa|down_3|NZ_CP047183.1_1928759_1929500_-	TIGR04216, halo_surf_glyco, major cell surface glycoprotein	NA|184aa|down_4|NZ_CP047183.1_1929551_1930103_-	TIGR03543, divI1A_rptt_fam, DivIVA domain repeat protein	NA|313aa|down_5|NZ_CP047183.1_1930165_1931104_-	pfam01148, CTP_transf_1, Cytidylyltransferase family	NA|185aa|down_6|NZ_CP047183.1_1931184_1931739_-	PRK00083, frr, ribosome recycling factor; Reviewed	NA|244aa|down_7|NZ_CP047183.1_1931771_1932503_-	PRK00358, pyrH, uridylate kinase; Provisional	NA|276aa|down_8|NZ_CP047183.1_1932638_1933466_-	PRK09377, tsf, elongation factor Ts; Provisional	NA|316aa|down_9|NZ_CP047183.1_1933647_1934595_-	PRK05299, rpsB, 30S ribosomal protein S2; Provisional
GCF_009834245.1_ASM983424v1	NZ_CP047183	Rathayibacter sp. VKM Ac-2801 chromosome, complete genome	6	3274657-3274794	5	CRISPRCasFinder	no		WYL,csa3,cas3,DEDDh	Orphan	AAGTTGTCGCACTCGGCGTCGACTGCGACAACTCCTGGC	39	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,cas3,DEDDh	NA|230aa|up_9|NZ_CP047183.1_3265693_3266383_+,NA|185aa|up_7|NZ_CP047183.1_3267053_3267608_+,NA|68aa|up_6|NZ_CP047183.1_3267757_3267961_-,NA|62aa|up_5|NZ_CP047183.1_3267980_3268166_-,NA|242aa|down_9|NZ_CP047183.1_3288548_3289274_+	NA|230aa|up_9|NZ_CP047183.1_3265693_3266383_+	NA	NA|179aa|up_8|NZ_CP047183.1_3266379_3266916_+	TIGR02228, Signal_peptidase_I_W, signal peptidase I, archaeal type	NA|185aa|up_7|NZ_CP047183.1_3267053_3267608_+	NA	NA|68aa|up_6|NZ_CP047183.1_3267757_3267961_-	NA	NA|62aa|up_5|NZ_CP047183.1_3267980_3268166_-	NA	NA|252aa|up_4|NZ_CP047183.1_3268357_3269113_-	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|357aa|up_3|NZ_CP047183.1_3269109_3270180_-	pfam01032, FecCD, FecCD transport family	NA|351aa|up_2|NZ_CP047183.1_3270149_3271202_-	cd01148, TroA_a, Metal binding protein TroA_a	NA|696aa|up_1|NZ_CP047183.1_3271502_3273590_+	COG1835, COG1835, Predicted acyltransferases [Lipid metabolism]	NA|300aa|up_0|NZ_CP047183.1_3273655_3274555_+	COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]	NA|1225aa|down_0|NZ_CP047183.1_3275024_3278699_+	pfam14587, Glyco_hydr_30_2, O-Glycosyl hydrolase family 30	NA|297aa|down_1|NZ_CP047183.1_3279139_3280030_-	pfam05661, DUF808, Protein of unknown function (DUF808)	NA|153aa|down_2|NZ_CP047183.1_3280099_3280558_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|396aa|down_3|NZ_CP047183.1_3280616_3281804_+	PRK07588, PRK07588, FAD-binding domain	NA|347aa|down_4|NZ_CP047183.1_3281814_3282855_-	cd01574, PBP1_LacI, ligand-binding domain of DNA transcription repressor LacI specific for lactose, a member of the LacI-GalR family of bacterial transcription regulators	NA|314aa|down_5|NZ_CP047183.1_3283016_3283958_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|306aa|down_6|NZ_CP047183.1_3283957_3284875_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|450aa|down_7|NZ_CP047183.1_3284960_3286310_+	cd13585, PBP2_TMBP_like, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose and similar oligosaccharides; possess type 2 periplasmic binding fold	NA|588aa|down_8|NZ_CP047183.1_3286408_3288172_+	pfam01301, Glyco_hydro_35, Glycosyl hydrolases family 35	NA|242aa|down_9|NZ_CP047183.1_3288548_3289274_+	NA
