assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_902381775.1_UHGG_MGYG-HGUT-01563	NZ_LR698969	Murdochiella vaginalis isolate MGYG-HGUT-01563 chromosome 1	1	250976-251244	1,1	PILER-CR,CRISPRCasFinder	no		RT,csa3,cas3,DEDDh,cas4	Orphan	AAAATGTTTCATGTGAAACATTTT,AAAATGTTTCATGTGAAACATTTT	24,24	0	0	NA	NA	NA:NA	3,2	3	Orphan	RT,csa3,cas3,DEDDh,cas4	NA,NA	NA|370aa|up_9|NZ_LR698969.1_240783_241893_-	PRK05643, PRK05643, DNA polymerase III subunit beta; Validated	NA|488aa|up_8|NZ_LR698969.1_242059_243523_-	TIGR00362, DnaA, chromosomal replication initiator protein DnaA	NA|45aa|up_7|NZ_LR698969.1_244018_244153_+	pfam00468, Ribosomal_L34, Ribosomal protein L34	NA|161aa|up_6|NZ_LR698969.1_244213_244696_+	pfam00825, Ribonuclease_P, Ribonuclease P	NA|72aa|up_5|NZ_LR698969.1_244692_244908_+	pfam01809, Haemolytic, Haemolytic domain	NA|246aa|up_4|NZ_LR698969.1_245008_245746_+	pfam02096, 60KD_IMP, 60Kd inner membrane protein	NA|368aa|up_3|NZ_LR698969.1_245745_246849_+	COG1847, Jag, Predicted RNA-binding protein [General function prediction only]	NA|474aa|up_2|NZ_LR698969.1_246885_248307_+	PRK05291, trmE, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis GTPase MnmE	NA|634aa|up_1|NZ_LR698969.1_248316_250218_+	PRK05192, PRK05192, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis enzyme MnmG	NA|243aa|up_0|NZ_LR698969.1_250219_250948_+	PRK00107, gidB, 16S rRNA (guanine(527)-N(7))-methyltransferase RsmG	NA|414aa|down_0|NZ_LR698969.1_251255_252497_+	PRK00549, PRK00549, competence damage-inducible protein A; Provisional	NA|266aa|down_1|NZ_LR698969.1_252484_253282_+	pfam13614, AAA_31, AAA domain	NA|373aa|down_2|NZ_LR698969.1_253284_254403_+	TIGR04285, parB-like_partition_protein, nucleoid occlusion protein	NA|380aa|down_3|NZ_LR698969.1_254414_255554_+	pfam14584, DUF4446, Protein of unknown function (DUF4446)	NA|326aa|down_4|NZ_LR698969.1_255675_256653_+	PRK09653, eutD, phosphotransacetylase	NA|485aa|down_5|NZ_LR698969.1_256733_258188_-	PRK09243, PRK09243, nicotinate phosphoribosyltransferase; Validated	NA|222aa|down_6|NZ_LR698969.1_258328_258994_+	COG0546, Gph, Predicted phosphatases [General function prediction only]	NA|113aa|down_7|NZ_LR698969.1_259121_259460_+	pfam07784, DUF1622, Protein of unknown function (DUF1622)	NA|103aa|down_8|NZ_LR698969.1_259446_259755_+	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|308aa|down_9|NZ_LR698969.1_259768_260692_+	COG0492, TrxB, Thioredoxin reductase [Posttranslational modification, protein turnover, chaperones]
GCF_902381775.1_UHGG_MGYG-HGUT-01563	NZ_LR698969	Murdochiella vaginalis isolate MGYG-HGUT-01563 chromosome 1	2	358112-358368	1	CRT	no		RT,csa3,cas3,DEDDh,cas4	Orphan	NGAAAACGCNTTTCCACCACCATCGTTTNTGCCTTCGG	38	3	31	358204-358221|358204-358221|358204-358221|358204-358221|358204-358221|358204-358221|358204-358221|358204-358221|358204-358221|358204-358221|358204-358221|358259-358276|358259-358276|358259-358276|358259-358276|358259-358276|358259-358276|358259-358276|358259-358276|358259-358276|358314-358331|358314-358331|358314-358331|358314-358331|358314-358331|358314-358331|358314-358331|358314-358331|358314-358331|358314-358331|358314-358331	NZ_LR698969.1_290473-290490|NZ_LR698969.1_310378-310395|NZ_LR698969.1_1416082-1416065|NZ_LR698969.1_1416137-1416120|NZ_LR698969.1_1416192-1416175|NZ_LR698969.1_1416247-1416230|NZ_LR698969.1_1416357-1416340|NZ_LR698969.1_290308-290325|NZ_LR698969.1_290363-290380|NZ_LR698969.1_290528-290545|NZ_LR698969.1_1416302-1416285|NZ_LR698969.1_358369-358386|NZ_LR698969.1_290253-290270|NZ_LR698969.1_343895-343878|NZ_LR698969.1_606740-606723|NZ_LR698969.1_606795-606778|NZ_LR698969.1_606850-606833|NZ_LR698969.1_606905-606888|NZ_LR698969.1_606960-606943|NZ_LR698969.1_683032-683049|NZ_LR698969.1_290473-290490|NZ_LR698969.1_310378-310395|NZ_LR698969.1_1416082-1416065|NZ_LR698969.1_1416137-1416120|NZ_LR698969.1_1416192-1416175|NZ_LR698969.1_1416247-1416230|NZ_LR698969.1_1416357-1416340|NZ_LR698969.1_290308-290325|NZ_LR698969.1_290363-290380|NZ_LR698969.1_290528-290545|NZ_LR698969.1_1416302-1416285	NA	4	4	Orphan	RT,csa3,cas3,DEDDh,cas4	NA|82aa|up_9|NZ_LR698969.1_343708_343954_-,NA|268aa|up_5|NZ_LR698969.1_349888_350692_-,NA|133aa|up_3|NZ_LR698969.1_352606_353005_-,NA|180aa|up_2|NZ_LR698969.1_353417_353957_+,NA	NA|82aa|up_9|NZ_LR698969.1_343708_343954_-	NA	NA|941aa|up_8|NZ_LR698969.1_344082_346905_+	PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed	NA|373aa|up_7|NZ_LR698969.1_346914_348034_+	PRK00578, prfB, peptide chain release factor 2; Validated	NA|572aa|up_6|NZ_LR698969.1_348035_349751_+	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|268aa|up_5|NZ_LR698969.1_349888_350692_-	NA	NA|227aa|up_4|NZ_LR698969.1_350860_351541_-	pfam02588, YitT_membrane, Uncharacterized 5xTM membrane BCR, YitT family COG1284	NA|133aa|up_3|NZ_LR698969.1_352606_353005_-	NA	NA|180aa|up_2|NZ_LR698969.1_353417_353957_+	NA	NA|422aa|up_1|NZ_LR698969.1_354102_355368_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|720aa|up_0|NZ_LR698969.1_355740_357900_+	cd02931, ER_like_FMN, Enoate reductase (ER)-like FMN-binding domain	NA|68aa|down_0|NZ_LR698969.1_358615_358819_+	pfam01197, Ribosomal_L31, Ribosomal protein L31	NA|295aa|down_1|NZ_LR698969.1_359641_360526_+	PRK09328, PRK09328, N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional	NA|362aa|down_2|NZ_LR698969.1_360587_361673_+	PRK00591, prfA, peptide chain release factor 1; Validated	NA|793aa|down_3|NZ_LR698969.1_361684_364063_+	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins	NA|289aa|down_4|NZ_LR698969.1_364169_365036_+	TIGR01859, Fructose-bisphosphate_aldolase, fructose-1,6-bisphosphate aldolase, class II, various bacterial and amitochondriate protist	NA|55aa|down_5|NZ_LR698969.1_366164_366329_+	NF012221, MARTX_Nterm, MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin RtxA	NA|305aa|down_6|NZ_LR698969.1_366656_367571_-	pfam07501, G5, G5 domain	NA|482aa|down_7|NZ_LR698969.1_368977_370423_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|532aa|down_8|NZ_LR698969.1_370695_372291_-	cd00338, Ser_Recombinase, Serine Recombinase family, catalytic domain; a DNA binding domain may be present either N- or C-terminal to the catalytic domain	NA|132aa|down_9|NZ_LR698969.1_372253_372649_-	pfam07508, Recombinase, Recombinase
GCF_902381775.1_UHGG_MGYG-HGUT-01563	NZ_LR698969	Murdochiella vaginalis isolate MGYG-HGUT-01563 chromosome 1	3	368491-368577	2	CRISPRCasFinder	no		RT,csa3,cas3,DEDDh,cas4	Orphan	CGAATATGGGCTCCTGGGCTTTGG	24	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,cas3,DEDDh,cas4	NA|180aa|up_9|NZ_LR698969.1_353417_353957_+,NA|65aa|down_4|NZ_LR698969.1_374299_374494_-,NA|132aa|down_7|NZ_LR698969.1_376035_376431_-	NA|180aa|up_9|NZ_LR698969.1_353417_353957_+	NA	NA|422aa|up_8|NZ_LR698969.1_354102_355368_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|720aa|up_7|NZ_LR698969.1_355740_357900_+	cd02931, ER_like_FMN, Enoate reductase (ER)-like FMN-binding domain	NA|68aa|up_6|NZ_LR698969.1_358615_358819_+	pfam01197, Ribosomal_L31, Ribosomal protein L31	NA|295aa|up_5|NZ_LR698969.1_359641_360526_+	PRK09328, PRK09328, N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional	NA|362aa|up_4|NZ_LR698969.1_360587_361673_+	PRK00591, prfA, peptide chain release factor 1; Validated	NA|793aa|up_3|NZ_LR698969.1_361684_364063_+	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins	NA|289aa|up_2|NZ_LR698969.1_364169_365036_+	TIGR01859, Fructose-bisphosphate_aldolase, fructose-1,6-bisphosphate aldolase, class II, various bacterial and amitochondriate protist	NA|55aa|up_1|NZ_LR698969.1_366164_366329_+	NF012221, MARTX_Nterm, MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin RtxA	NA|305aa|up_0|NZ_LR698969.1_366656_367571_-	pfam07501, G5, G5 domain	NA|482aa|down_0|NZ_LR698969.1_368977_370423_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|532aa|down_1|NZ_LR698969.1_370695_372291_-	cd00338, Ser_Recombinase, Serine Recombinase family, catalytic domain; a DNA binding domain may be present either N- or C-terminal to the catalytic domain	NA|132aa|down_2|NZ_LR698969.1_372253_372649_-	pfam07508, Recombinase, Recombinase	NA|524aa|down_3|NZ_LR698969.1_372679_374251_-	cd00338, Ser_Recombinase, Serine Recombinase family, catalytic domain; a DNA binding domain may be present either N- or C-terminal to the catalytic domain	NA|65aa|down_4|NZ_LR698969.1_374299_374494_-	NA	NA|313aa|down_5|NZ_LR698969.1_374688_375627_-	cd06414, GH25_LytC-like, The LytC lysozyme of Streptococcus pneumoniae is a bacterial cell wall hydrolase that cleaves the beta1-4-glycosydic bond located between the N-acetylmuramoyl-N-glucosaminyl residues of the cell wall polysaccharide chains	NA|140aa|down_6|NZ_LR698969.1_375619_376039_-	pfam05105, Phage_holin_4_1, Bacteriophage holin family	NA|132aa|down_7|NZ_LR698969.1_376035_376431_-	NA	NA|734aa|down_8|NZ_LR698969.1_376441_378643_-	pfam05895, DUF859, Siphovirus protein of unknown function (DUF859)	NA|566aa|down_9|NZ_LR698969.1_378657_380355_-	pfam06605, Prophage_tail, Prophage endopeptidase tail
GCF_902381775.1_UHGG_MGYG-HGUT-01563	NZ_LR698969	Murdochiella vaginalis isolate MGYG-HGUT-01563 chromosome 1	4	646546-646660	3	CRISPRCasFinder	no	csa3	RT,csa3,cas3,DEDDh,cas4	Type I-A	CCCTGTTTTAACGCGATATTTCCCCTTCG	29	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,cas3,DEDDh,cas4	NA|406aa|up_8|NZ_LR698969.1_637091_638309_+,NA|143aa|up_2|NZ_LR698969.1_644882_645311_+,NA|49aa|down_2|NZ_LR698969.1_649201_649348_-	NA|639aa|up_9|NZ_LR698969.1_634685_636602_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|406aa|up_8|NZ_LR698969.1_637091_638309_+	NA	NA|454aa|up_7|NZ_LR698969.1_638539_639901_+	pfam08353, DUF1727, Domain of unknown function (DUF1727)	NA|233aa|up_6|NZ_LR698969.1_639897_640596_+	COG3442, COG3442, Predicted glutamine amidotransferase [General function prediction only]	NA|164aa|up_5|NZ_LR698969.1_641040_641532_+	COG0824, FcbC, Predicted thioesterase [General function prediction only]	NA|232aa|up_4|NZ_LR698969.1_641621_642317_+	COG1136, SalX, ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms]	NA|757aa|up_3|NZ_LR698969.1_642316_644587_+	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|143aa|up_2|NZ_LR698969.1_644882_645311_+	NA	NA|129aa|up_1|NZ_LR698969.1_645450_645837_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|164aa|up_0|NZ_LR698969.1_645843_646335_+	COG0668, MscS, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|326aa|down_0|NZ_LR698969.1_647122_648100_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|308aa|down_1|NZ_LR698969.1_648285_649209_-	COG0348, NapH, Polyferredoxin [Energy production and conversion]	NA|49aa|down_2|NZ_LR698969.1_649201_649348_-	NA	NA|226aa|down_3|NZ_LR698969.1_649451_650129_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|441aa|down_4|NZ_LR698969.1_650125_651448_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	csa3|122aa|down_5|NZ_LR698969.1_651453_651819_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|73aa|down_6|NZ_LR698969.1_652511_652730_+	COG2217, ZntA, Cation transport ATPase [Inorganic ion transport and metabolism]	NA|651aa|down_7|NZ_LR698969.1_653557_655510_+	cd07548, P-type_ATPase-Cd_Zn_Co_like, P-type heavy metal-transporting ATPase, similar to Bacillus subtilis CadA which appears to transport cadmium, zinc and cobalt but not copper out of the cell	NA|318aa|down_8|NZ_LR698969.1_655554_656508_+	cd05292, LDH_2, A subgroup of L-lactate dehydrogenases	NA|409aa|down_9|NZ_LR698969.1_656586_657813_+	cd17333, MFS_FucP_MFSD4_like, Bacterial fucose permease, eukaryotic Major facilitator superfamily domain-containing protein 4, and similar proteins
GCF_902381775.1_UHGG_MGYG-HGUT-01563	NZ_LR698969	Murdochiella vaginalis isolate MGYG-HGUT-01563 chromosome 1	5	652089-652252	4	CRISPRCasFinder	no	csa3	RT,csa3,cas3,DEDDh,cas4	Type I-A	TTTTGTGTCCGAGAAAACGCAAAATGCGAGAATAGGG	37	1	1	652126-652151	NZ_LR698969.1_652062-652087	NA	2	2	Orphan	RT,csa3,cas3,DEDDh,cas4	NA|143aa|up_8|NZ_LR698969.1_644882_645311_+,NA|49aa|up_3|NZ_LR698969.1_649201_649348_-,NA	NA|757aa|up_9|NZ_LR698969.1_642316_644587_+	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|143aa|up_8|NZ_LR698969.1_644882_645311_+	NA	NA|129aa|up_7|NZ_LR698969.1_645450_645837_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|164aa|up_6|NZ_LR698969.1_645843_646335_+	COG0668, MscS, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|326aa|up_5|NZ_LR698969.1_647122_648100_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|308aa|up_4|NZ_LR698969.1_648285_649209_-	COG0348, NapH, Polyferredoxin [Energy production and conversion]	NA|49aa|up_3|NZ_LR698969.1_649201_649348_-	NA	NA|226aa|up_2|NZ_LR698969.1_649451_650129_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|441aa|up_1|NZ_LR698969.1_650125_651448_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	csa3|122aa|up_0|NZ_LR698969.1_651453_651819_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|73aa|down_0|NZ_LR698969.1_652511_652730_+	COG2217, ZntA, Cation transport ATPase [Inorganic ion transport and metabolism]	NA|651aa|down_1|NZ_LR698969.1_653557_655510_+	cd07548, P-type_ATPase-Cd_Zn_Co_like, P-type heavy metal-transporting ATPase, similar to Bacillus subtilis CadA which appears to transport cadmium, zinc and cobalt but not copper out of the cell	NA|318aa|down_2|NZ_LR698969.1_655554_656508_+	cd05292, LDH_2, A subgroup of L-lactate dehydrogenases	NA|409aa|down_3|NZ_LR698969.1_656586_657813_+	cd17333, MFS_FucP_MFSD4_like, Bacterial fucose permease, eukaryotic Major facilitator superfamily domain-containing protein 4, and similar proteins	NA|289aa|down_4|NZ_LR698969.1_657915_658782_+	smart00729, Elp3, Elongator protein 3, MiaB family, Radical SAM	NA|659aa|down_5|NZ_LR698969.1_660077_662054_+	PRK05563, PRK05563, DNA polymerase III subunits gamma and tau; Validated	NA|112aa|down_6|NZ_LR698969.1_662063_662399_+	PRK00153, PRK00153, YbaB/EbfC family nucleoid-associated protein	NA|207aa|down_7|NZ_LR698969.1_662458_663079_+	PRK00076, recR, recombination protein RecR; Reviewed	NA|253aa|down_8|NZ_LR698969.1_663083_663842_-	COG2071, COG2071, Predicted glutamine amidotransferases [General function prediction only]	NA|426aa|down_9|NZ_LR698969.1_663972_665250_+	cd00430, PLPDE_III_AR, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Alanine Racemase
GCF_902381775.1_UHGG_MGYG-HGUT-01563	NZ_LR698969	Murdochiella vaginalis isolate MGYG-HGUT-01563 chromosome 1	6	935971-936065	5	CRISPRCasFinder	no		RT,csa3,cas3,DEDDh,cas4	Orphan	CAACGGATACGGCTATGCCAGTG	23	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,cas3,DEDDh,cas4	NA|68aa|up_3|NZ_LR698969.1_931628_931832_-,NA|107aa|down_2|NZ_LR698969.1_943221_943542_+	NA|269aa|up_9|NZ_LR698969.1_925799_926606_+	pfam01784, NIF3, NIF3 (NGG1p interacting factor 3)	NA|320aa|up_8|NZ_LR698969.1_926691_927651_-	pfam06207, DUF1002, Protein of unknown function (DUF1002)	NA|329aa|up_7|NZ_LR698969.1_927875_928862_+	pfam07751, Abi_2, Abi-like protein	NA|428aa|up_6|NZ_LR698969.1_928874_930158_+	TIGR01499, Folylpolyglutamate_synthase, folylpolyglutamate synthase/dihydrofolate synthase	NA|212aa|up_5|NZ_LR698969.1_930221_930857_-	PRK00215, PRK00215, transcriptional repressor LexA	NA|124aa|up_4|NZ_LR698969.1_931100_931472_+	TIGR00320, Desulfoferrodoxin_homolog, desulfoferrodoxin	NA|68aa|up_3|NZ_LR698969.1_931628_931832_-	NA	NA|187aa|up_2|NZ_LR698969.1_932074_932635_+	pfam02659, Mntp, Putative manganese efflux pump	NA|435aa|up_1|NZ_LR698969.1_932701_934006_+	PRK02813, PRK02813, putative aminopeptidase 2; Provisional	NA|469aa|up_0|NZ_LR698969.1_934034_935441_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|67aa|down_0|NZ_LR698969.1_940718_940919_+	pfam12161, HsdM_N, HsdM N-terminal domain	NA|428aa|down_1|NZ_LR698969.1_941428_942712_+	pfam04326, AlbA_2, Putative DNA-binding domain	NA|107aa|down_2|NZ_LR698969.1_943221_943542_+	NA	NA|557aa|down_3|NZ_LR698969.1_944243_945914_+	cd02028, UMPK_like, Uridine monophosphate kinase_like (UMPK_like) is a family of proteins highly similar to the uridine monophosphate kinase (UMPK, EC 2	NA|285aa|down_4|NZ_LR698969.1_946052_946907_+	PRK07105, PRK07105, pyridoxamine kinase; Validated	NA|292aa|down_5|NZ_LR698969.1_946913_947789_-	TIGR03709, PPK2_rel_1, polyphosphate:nucleotide phosphotransferase, PPK2 family	NA|495aa|down_6|NZ_LR698969.1_947831_949316_-	COG1966, CstA, Carbon starvation protein, predicted membrane protein [Signal transduction mechanisms]	NA|124aa|down_7|NZ_LR698969.1_949312_949684_-	pfam01817, CM_2, Chorismate mutase type II	NA|513aa|down_8|NZ_LR698969.1_949783_951322_+	TIGR00785, Uncharacterized_transporter_HI_0020, anion transporter	NA|335aa|down_9|NZ_LR698969.1_951744_952749_+	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]
GCF_902381775.1_UHGG_MGYG-HGUT-01563	NZ_LR698969	Murdochiella vaginalis isolate MGYG-HGUT-01563 chromosome 1	7	943638-943784	2	PILER-CR	no		RT,csa3,cas3,DEDDh,cas4	Orphan	CGAAAAAAGGAGCCTAAAATAC	22	0	0	NA	NA	NA	2	2	Orphan	RT,csa3,cas3,DEDDh,cas4	NA|68aa|up_7|NZ_LR698969.1_931628_931832_-,NA|107aa|up_0|NZ_LR698969.1_943221_943542_+,NA	NA|212aa|up_9|NZ_LR698969.1_930221_930857_-	PRK00215, PRK00215, transcriptional repressor LexA	NA|124aa|up_8|NZ_LR698969.1_931100_931472_+	TIGR00320, Desulfoferrodoxin_homolog, desulfoferrodoxin	NA|68aa|up_7|NZ_LR698969.1_931628_931832_-	NA	NA|187aa|up_6|NZ_LR698969.1_932074_932635_+	pfam02659, Mntp, Putative manganese efflux pump	NA|435aa|up_5|NZ_LR698969.1_932701_934006_+	PRK02813, PRK02813, putative aminopeptidase 2; Provisional	NA|469aa|up_4|NZ_LR698969.1_934034_935441_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|1507aa|up_3|NZ_LR698969.1_935846_940367_+	pfam05738, Cna_B, Cna protein B-type domain	NA|67aa|up_2|NZ_LR698969.1_940718_940919_+	pfam12161, HsdM_N, HsdM N-terminal domain	NA|428aa|up_1|NZ_LR698969.1_941428_942712_+	pfam04326, AlbA_2, Putative DNA-binding domain	NA|107aa|up_0|NZ_LR698969.1_943221_943542_+	NA	NA|557aa|down_0|NZ_LR698969.1_944243_945914_+	cd02028, UMPK_like, Uridine monophosphate kinase_like (UMPK_like) is a family of proteins highly similar to the uridine monophosphate kinase (UMPK, EC 2	NA|285aa|down_1|NZ_LR698969.1_946052_946907_+	PRK07105, PRK07105, pyridoxamine kinase; Validated	NA|292aa|down_2|NZ_LR698969.1_946913_947789_-	TIGR03709, PPK2_rel_1, polyphosphate:nucleotide phosphotransferase, PPK2 family	NA|495aa|down_3|NZ_LR698969.1_947831_949316_-	COG1966, CstA, Carbon starvation protein, predicted membrane protein [Signal transduction mechanisms]	NA|124aa|down_4|NZ_LR698969.1_949312_949684_-	pfam01817, CM_2, Chorismate mutase type II	NA|513aa|down_5|NZ_LR698969.1_949783_951322_+	TIGR00785, Uncharacterized_transporter_HI_0020, anion transporter	NA|335aa|down_6|NZ_LR698969.1_951744_952749_+	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|215aa|down_7|NZ_LR698969.1_952745_953390_+	COG2011, AbcD, ABC-type metal ion transport system, permease component [Inorganic ion transport and metabolism]	NA|290aa|down_8|NZ_LR698969.1_953475_954345_+	cd13597, PBP2_lipoprotein_Tp32, The substrate-binding domain of the 32-kilodalton lipoprotein (Tp32) from Treponema pallidum binds L-methionine; the type 2 periplasmic-binding protein fold	NA|534aa|down_9|NZ_LR698969.1_954572_956174_+	cd01456, vWA_ywmD_type, VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)
GCF_902381775.1_UHGG_MGYG-HGUT-01563	NZ_LR698969	Murdochiella vaginalis isolate MGYG-HGUT-01563 chromosome 1	8	1281519-1281646	6	CRISPRCasFinder	no		RT,csa3,cas3,DEDDh,cas4	Orphan	CGCCCTGCGCGCTTCGCGCTTGGTCTTGCGGGAGCTCTGGCAATC	45	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,cas3,DEDDh,cas4	NA|559aa|up_2|NZ_LR698969.1_1277052_1278729_-,NA|207aa|down_5|NZ_LR698969.1_1293825_1294446_-	NA|494aa|up_9|NZ_LR698969.1_1270841_1272323_-	cd01992, PP-ATPase, N-terminal domain of predicted ATPase of the PP-loop faimly implicated in cell cycle control [Cell division and chromosome partitioning]	NA|228aa|up_8|NZ_LR698969.1_1272333_1273017_-	pfam04977, DivIC, Septum formation initiator	NA|84aa|up_7|NZ_LR698969.1_1273013_1273265_-	COG1188, COG1188, Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]	NA|158aa|up_6|NZ_LR698969.1_1273348_1273822_-	PRK00601, dut, dUTP diphosphatase	NA|266aa|up_5|NZ_LR698969.1_1273899_1274697_-	cd07516, HAD_Pase, phosphatase, similar to Escherichia coli Cof and Thermotoga maritima TM0651; belongs to the haloacid dehalogenase-like superfamily	NA|290aa|up_4|NZ_LR698969.1_1274876_1275746_+	pfam04228, Zn_peptidase, Putative neutral zinc metallopeptidase	NA|172aa|up_3|NZ_LR698969.1_1276042_1276558_-	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|559aa|up_2|NZ_LR698969.1_1277052_1278729_-	NA	NA|423aa|up_1|NZ_LR698969.1_1279073_1280342_-	COG3681, COG3681, L-cysteine desulfidase [Amino acid transport and metabolism]	NA|180aa|up_0|NZ_LR698969.1_1280508_1281048_+	COG1592, COG1592, Rubrerythrin [Energy production and conversion]	NA|335aa|down_0|NZ_LR698969.1_1287346_1288351_-	cd08963, L-asparaginase_I, Type I (cytosolic) bacterial L-asparaginase	NA|505aa|down_1|NZ_LR698969.1_1288445_1289960_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|211aa|down_2|NZ_LR698969.1_1290048_1290681_-	COG1394, NtpD, Archaeal/vacuolar-type H+-ATPase subunit D [Energy production and conversion]	NA|456aa|down_3|NZ_LR698969.1_1290684_1292052_-	PRK04196, PRK04196, V-type ATP synthase subunit B; Provisional	NA|595aa|down_4|NZ_LR698969.1_1292048_1293833_-	PRK04192, PRK04192, V-type ATP synthase subunit A; Provisional	NA|207aa|down_5|NZ_LR698969.1_1293825_1294446_-	NA	NA|84aa|down_6|NZ_LR698969.1_1294448_1294700_-	pfam01990, ATP-synt_F, ATP synthase (F/14-kDa) subunit	NA|142aa|down_7|NZ_LR698969.1_1294709_1295135_-	cd18120, ATP-synt_Vo_Ao_c, Membrane-bound Vo/Ao complexes of V/A-type ATP synthases, subunit c	NA|648aa|down_8|NZ_LR698969.1_1295168_1297112_-	PRK05771, PRK05771, V-type ATP synthase subunit I; Validated	NA|342aa|down_9|NZ_LR698969.1_1297122_1298148_-	pfam01992, vATP-synt_AC39, ATP synthase (C/AC39) subunit
GCF_902381775.1_UHGG_MGYG-HGUT-01563	NZ_LR698969	Murdochiella vaginalis isolate MGYG-HGUT-01563 chromosome 1	9	1389928-1390043	7	CRISPRCasFinder	no		RT,csa3,cas3,DEDDh,cas4	Orphan	CCGCCCCTGTTAAACGCCCGACTTCGACC	29	1	1	1389957-1390014	NZ_LR698969.1_1389828-1389885	NA	1	1	Orphan	RT,csa3,cas3,DEDDh,cas4	NA|115aa|up_6|NZ_LR698969.1_1380041_1380386_-,NA	NA|402aa|up_9|NZ_LR698969.1_1376794_1378000_-	COG3919, COG3919, Predicted ATP-grasp enzyme [General function prediction only]	NA|227aa|up_8|NZ_LR698969.1_1377986_1378667_-	pfam01177, Asp_Glu_race, Asp/Glu/Hydantoin racemase	NA|443aa|up_7|NZ_LR698969.1_1378676_1380005_-	COG1114, BrnQ, Branched-chain amino acid permeases [Amino acid transport and metabolism]	NA|115aa|up_6|NZ_LR698969.1_1380041_1380386_-	NA	NA|434aa|up_5|NZ_LR698969.1_1380719_1382021_+	cd01539, PBP1_GGBP, periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species	NA|503aa|up_4|NZ_LR698969.1_1382105_1383614_+	COG1129, MglA, ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism]	NA|563aa|up_3|NZ_LR698969.1_1383629_1385318_+	COG4211, MglC, ABC-type glucose/galactose transport system, permease component [Carbohydrate transport and metabolism]	NA|63aa|up_2|NZ_LR698969.1_1386034_1386223_-	pfam12669, P12, Virus attachment protein p12 family	NA|839aa|up_1|NZ_LR698969.1_1386324_1388841_-	COG0370, FeoB, Fe2+ transport system protein B [Inorganic ion transport and metabolism]	NA|76aa|up_0|NZ_LR698969.1_1388954_1389182_-	COG1918, FeoA, Fe2+ transport system protein A [Inorganic ion transport and metabolism]	NA|454aa|down_0|NZ_LR698969.1_1390817_1392179_-	COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide transport and    metabolism]	NA|215aa|down_1|NZ_LR698969.1_1392542_1393187_+	pfam12672, DUF3793, Protein of unknown function (DUF3793)	NA|142aa|down_2|NZ_LR698969.1_1393273_1393699_+	PRK05568, PRK05568, flavodoxin; Provisional	NA|195aa|down_3|NZ_LR698969.1_1393836_1394421_-	cd03135, GATase1_DJ-1, Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1	NA|282aa|down_4|NZ_LR698969.1_1394712_1395558_+	cd01558, D-AAT_like, D-Alanine aminotransferase (D-AAT_like): D-amino acid aminotransferase catalyzes transamination between D-amino acids and their respective alpha-keto acids	NA|491aa|down_5|NZ_LR698969.1_1395578_1397051_+	NF033460, glycerol3P_ox_II, type 2 glycerol-3-phosphate oxidase	NA|417aa|down_6|NZ_LR698969.1_1397037_1398288_+	pfam07992, Pyr_redox_2, Pyridine nucleotide-disulphide oxidoreductase	NA|123aa|down_7|NZ_LR698969.1_1398284_1398653_+	COG3862, COG3862, Uncharacterized protein with conserved CXXC pairs [Function unknown]	NA|296aa|down_8|NZ_LR698969.1_1398670_1399558_+	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|292aa|down_9|NZ_LR698969.1_1399652_1400528_+	PRK04180, PRK04180, pyridoxal 5'-phosphate synthase lyase subunit PdxS
GCF_902381775.1_UHGG_MGYG-HGUT-01563	NZ_LR698969	Murdochiella vaginalis isolate MGYG-HGUT-01563 chromosome 1	10	1667949-1668057	8	CRISPRCasFinder	no		RT,csa3,cas3,DEDDh,cas4	Orphan	CTTGCGGGAGCTCTGGCAATCCCAC	25	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,cas3,DEDDh,cas4	NA,NA	NA|147aa|up_9|NZ_LR698969.1_1656700_1657141_-	cd03446, MaoC_like, MoaC_like    Similar to the MaoC (monoamine oxidase C) dehydratase regulatory protein but without the N-terminal PutA domain	NA|432aa|up_8|NZ_LR698969.1_1657190_1658486_-	COG2851, CitM, H+/citrate symporter [Energy production and conversion]	NA|400aa|up_7|NZ_LR698969.1_1658600_1659800_-	pfam02515, CoA_transf_3, CoA-transferase family III	NA|587aa|up_6|NZ_LR698969.1_1660136_1661897_-	pfam06506, PrpR_N, Propionate catabolism activator	NA|343aa|up_5|NZ_LR698969.1_1662102_1663131_+	cd08233, butanediol_DH_like, (2R,3R)-2,3-butanediol dehydrogenase	NA|331aa|up_4|NZ_LR698969.1_1663144_1664137_+	cd13603, PBP2_TRAP_Siap_TeaA_like, Substrate-binding domain of a sialic acid binding Tripartite ATP-independent  Periplasmic transport system (SiaP) and related proteins; the type 2 periplasmic-binding protein fold	NA|158aa|up_3|NZ_LR698969.1_1664133_1664607_+	COG3090, DctM, TRAP-type C4-dicarboxylate transport system, small permease component [Carbohydrate transport and metabolism]	NA|422aa|up_2|NZ_LR698969.1_1664603_1665869_+	COG1593, DctQ, TRAP-type C4-dicarboxylate transport system, large permease component [Carbohydrate transport and metabolism]	NA|444aa|up_1|NZ_LR698969.1_1665877_1667209_+	COG3395, COG3395, Uncharacterized protein conserved in bacteria [Function unknown]	NA|192aa|up_0|NZ_LR698969.1_1667225_1667801_+	pfam00596, Aldolase_II, Class II Aldolase and Adducin N-terminal domain	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
