assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_007833915.1_ASM783391v1	NZ_CP041061	Micromonospora sp. HM134 chromosome, complete genome	1	1034599-1034693	1	CRISPRCasFinder	no		csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	Orphan	CGGTGATCAAGAGGTTTACGTCA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	NA|1634aa|up_4|NZ_CP041061.1_1025217_1030119_-,NA	NA|816aa|up_9|NZ_CP041061.1_1017370_1019818_+	cd01948, EAL, EAL domain	NA|345aa|up_8|NZ_CP041061.1_1019926_1020961_-	cd08589, PI-PLCc_SaPLC1_like, Catalytic domain of Streptomyces antibioticus phosphatidylinositol-specific phospholipase C1-like proteins	NA|693aa|up_7|NZ_CP041061.1_1021097_1023176_+	COG1331, COG1331, Highly conserved protein containing a thioredoxin domain [Posttranslational modification, protein turnover, chaperones]	NA|387aa|up_6|NZ_CP041061.1_1023304_1024465_+	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed	NA|164aa|up_5|NZ_CP041061.1_1024461_1024953_+	pfam00731, AIRC, AIR carboxylase	NA|1634aa|up_4|NZ_CP041061.1_1025217_1030119_-	NA	NA|476aa|up_3|NZ_CP041061.1_1030274_1031702_-	COG1004, Ugd, Predicted UDP-glucose 6-dehydrogenase [Cell envelope biogenesis, outer membrane]	NA|388aa|up_2|NZ_CP041061.1_1031951_1033115_+	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|224aa|up_1|NZ_CP041061.1_1033256_1033928_-	PLN03209, PLN03209, translocon at the inner envelope of chloroplast subunit 62; Provisional	NA|172aa|up_0|NZ_CP041061.1_1033951_1034467_+	COG1607, COG1607, Acyl-CoA hydrolase [Lipid metabolism]	NA|658aa|down_0|NZ_CP041061.1_1034933_1036907_+	PRK03584, PRK03584, acetoacetate--CoA ligase	NA|232aa|down_1|NZ_CP041061.1_1037199_1037895_-	TIGR03089, conserved_hypothetical_protein, TIGR03089 family protein	NA|382aa|down_2|NZ_CP041061.1_1038046_1039192_+	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|364aa|down_3|NZ_CP041061.1_1039255_1040347_+	COG0836, {ManC}, Mannose-1-phosphate guanylyltransferase [Cell envelope biogenesis, outer membrane]	NA|211aa|down_4|NZ_CP041061.1_1040495_1041128_-	cd03674, Nudix_Hydrolase_1, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|346aa|down_5|NZ_CP041061.1_1041167_1042205_+	pfam13810, DUF4185, Domain of unknown function (DUF4185)	NA|334aa|down_6|NZ_CP041061.1_1042484_1043486_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|376aa|down_7|NZ_CP041061.1_1043610_1044738_-	PRK13294, PRK13294, F420-0--gamma-glutamyl ligase; Provisional	NA|316aa|down_8|NZ_CP041061.1_1044734_1045682_-	PRK13606, PRK13606, LPPG:FO 2-phospho-L-lactate transferase; Provisional	NA|289aa|down_9|NZ_CP041061.1_1045781_1046648_+	pfam09849, DUF2076, Uncharacterized protein conserved in bacteria (DUF2076)
GCF_007833915.1_ASM783391v1	NZ_CP041061	Micromonospora sp. HM134 chromosome, complete genome	2	1464150-1464242	2	CRISPRCasFinder	no	csa3	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	Type I-A	AGCGCAGCGGAGTCCCGCAGTCGCGAACGAAAGG	34	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	NA,NA|304aa|down_3|NZ_CP041061.1_1469743_1470655_-,NA|298aa|down_4|NZ_CP041061.1_1470930_1471824_+	NA|786aa|up_9|NZ_CP041061.1_1448522_1450880_+	COG2609, AceE, Pyruvate dehydrogenase complex, dehydrogenase (E1) component [Energy production and conversion]	NA|545aa|up_8|NZ_CP041061.1_1451327_1452962_-	pfam00149, Metallophos, Calcineurin-like phosphoesterase	NA|388aa|up_7|NZ_CP041061.1_1453064_1454228_-	pfam07995, GSDH, Glucose / Sorbosone dehydrogenase	NA|602aa|up_6|NZ_CP041061.1_1454298_1456104_-	pfam07228, SpoIIE, Stage II sporulation protein E (SpoIIE)	NA|309aa|up_5|NZ_CP041061.1_1456272_1457199_+	cd12166, 2-Hacid_dh_7, Putative D-isomer specific 2-hydroxyacid dehydrogenases	NA|125aa|up_4|NZ_CP041061.1_1457237_1457612_-	pfam10756, bPH_6, Bacterial PH domain	NA|817aa|up_3|NZ_CP041061.1_1457910_1460361_+	cd01948, EAL, EAL domain	NA|340aa|up_2|NZ_CP041061.1_1460389_1461409_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|426aa|up_1|NZ_CP041061.1_1461794_1463072_+	cd13585, PBP2_TMBP_like, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose and similar oligosaccharides; possess type 2 periplasmic binding fold	NA|338aa|up_0|NZ_CP041061.1_1463122_1464136_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|287aa|down_0|NZ_CP041061.1_1464266_1465127_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|824aa|down_1|NZ_CP041061.1_1465123_1467595_+	COG3250, LacZ, Beta-galactosidase/beta-glucuronidase [Carbohydrate transport and metabolism]	NA|670aa|down_2|NZ_CP041061.1_1467727_1469737_+	pfam12831, FAD_oxidored, FAD dependent oxidoreductase	NA|304aa|down_3|NZ_CP041061.1_1469743_1470655_-	NA	NA|298aa|down_4|NZ_CP041061.1_1470930_1471824_+	NA	csa3|137aa|down_5|NZ_CP041061.1_1471967_1472378_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|555aa|down_6|NZ_CP041061.1_1472374_1474039_+	PRK11660, PRK11660, putative transporter; Provisional	NA|272aa|down_7|NZ_CP041061.1_1474399_1475215_+	pfam11611, DUF4352, Domain of unknown function (DUF4352)	NA|616aa|down_8|NZ_CP041061.1_1475355_1477203_-	PRK12448, PRK12448, dihydroxy-acid dehydratase; Provisional	NA|805aa|down_9|NZ_CP041061.1_1477313_1479728_+	cd01948, EAL, EAL domain
GCF_007833915.1_ASM783391v1	NZ_CP041061	Micromonospora sp. HM134 chromosome, complete genome	3	1606750-1607200	3	CRISPRCasFinder	no		csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	Orphan	GGCGACACCGGCACCGGCCGGCTGG	25	1	2	1607156-1607175|1607156-1607175	NZ_CP041061.1_3691314-3691333|NZ_CP041061.1_2481304-2481285	NA	8	8	Orphan	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	NA,NA	NA|160aa|up_9|NZ_CP041061.1_1595076_1595556_+	cd04685, Nudix_Hydrolase_26, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|271aa|up_8|NZ_CP041061.1_1595549_1596362_+	PRK00015, rnhB, ribonuclease HII; Validated	NA|108aa|up_7|NZ_CP041061.1_1596358_1596682_+	pfam10611, DUF2469, Protein of unknown function (DUF2469)	NA|369aa|up_6|NZ_CP041061.1_1596740_1597847_-	cd02110, SO_family_Moco_dimer, Subgroup of sulfite oxidase (SO) family molybdopterin binding domains that contains conserved dimerization domain	NA|397aa|up_5|NZ_CP041061.1_1598871_1600062_-	COG0520, csdA, Selenocysteine lyase/Cysteine desulfurase [Posttranslational modification, protein turnover, chaperones]	NA|393aa|up_4|NZ_CP041061.1_1600257_1601436_-	PRK00236, xerC, site-specific tyrosine recombinase XerC; Reviewed	NA|448aa|up_3|NZ_CP041061.1_1601717_1603061_-	pfam00144, Beta-lactamase, Beta-lactamase	NA|391aa|up_2|NZ_CP041061.1_1603120_1604293_-	pfam02481, DNA_processg_A, DNA recombination-mediator protein A	NA|507aa|up_1|NZ_CP041061.1_1604289_1605810_-	COG0606, COG0606, Predicted ATPase with chaperone activity [Posttranslational modification, protein turnover, chaperones]	NA|120aa|up_0|NZ_CP041061.1_1605874_1606234_-	PRK12497, PRK12497, YraN family protein	NA|286aa|down_0|NZ_CP041061.1_1607968_1608826_+	PRK05299, rpsB, 30S ribosomal protein S2; Provisional	NA|276aa|down_1|NZ_CP041061.1_1608960_1609788_+	PRK09377, tsf, elongation factor Ts; Provisional	NA|256aa|down_2|NZ_CP041061.1_1609959_1610727_+	PRK00358, pyrH, uridylate kinase; Provisional	NA|186aa|down_3|NZ_CP041061.1_1610814_1611372_+	PRK00083, frr, ribosome recycling factor; Reviewed	NA|427aa|down_4|NZ_CP041061.1_1611738_1613019_+	pfam01148, CTP_transf_1, Cytidylyltransferase family	NA|392aa|down_5|NZ_CP041061.1_1613123_1614299_+	PRK14459, PRK14459, ribosomal RNA large subunit methyltransferase N; Provisional	NA|416aa|down_6|NZ_CP041061.1_1614298_1615546_+	TIGR03544, cell_division_initiation_protein_DivIVA, DivIVA domain	NA|74aa|down_7|NZ_CP041061.1_1615615_1615837_-	pfam10939, DUF2631, Protein of unknown function (DUF2631)	NA|527aa|down_8|NZ_CP041061.1_1616088_1617669_+	COG2220, COG2220, Predicted Zn-dependent hydrolases of the beta-lactamase fold [General function prediction only]	NA|225aa|down_9|NZ_CP041061.1_1617769_1618444_+	pfam06736, DUF1211, Protein of unknown function (DUF1211)
GCF_007833915.1_ASM783391v1	NZ_CP041061	Micromonospora sp. HM134 chromosome, complete genome	4	1615198-1615268	4	CRISPRCasFinder	no		csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	Orphan	GCCGCTGGGTGGTCCGCCGATGG	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	NA|408aa|up_6|NZ_CP041061.1_1606458_1607682_+,NA|111aa|down_7|NZ_CP041061.1_1626240_1626573_-,NA|87aa|down_8|NZ_CP041061.1_1626662_1626923_-	NA|391aa|up_9|NZ_CP041061.1_1603120_1604293_-	pfam02481, DNA_processg_A, DNA recombination-mediator protein A	NA|507aa|up_8|NZ_CP041061.1_1604289_1605810_-	COG0606, COG0606, Predicted ATPase with chaperone activity [Posttranslational modification, protein turnover, chaperones]	NA|120aa|up_7|NZ_CP041061.1_1605874_1606234_-	PRK12497, PRK12497, YraN family protein	NA|408aa|up_6|NZ_CP041061.1_1606458_1607682_+	NA	NA|286aa|up_5|NZ_CP041061.1_1607968_1608826_+	PRK05299, rpsB, 30S ribosomal protein S2; Provisional	NA|276aa|up_4|NZ_CP041061.1_1608960_1609788_+	PRK09377, tsf, elongation factor Ts; Provisional	NA|256aa|up_3|NZ_CP041061.1_1609959_1610727_+	PRK00358, pyrH, uridylate kinase; Provisional	NA|186aa|up_2|NZ_CP041061.1_1610814_1611372_+	PRK00083, frr, ribosome recycling factor; Reviewed	NA|427aa|up_1|NZ_CP041061.1_1611738_1613019_+	pfam01148, CTP_transf_1, Cytidylyltransferase family	NA|392aa|up_0|NZ_CP041061.1_1613123_1614299_+	PRK14459, PRK14459, ribosomal RNA large subunit methyltransferase N; Provisional	NA|74aa|down_0|NZ_CP041061.1_1615615_1615837_-	pfam10939, DUF2631, Protein of unknown function (DUF2631)	NA|527aa|down_1|NZ_CP041061.1_1616088_1617669_+	COG2220, COG2220, Predicted Zn-dependent hydrolases of the beta-lactamase fold [General function prediction only]	NA|225aa|down_2|NZ_CP041061.1_1617769_1618444_+	pfam06736, DUF1211, Protein of unknown function (DUF1211)	NA|534aa|down_3|NZ_CP041061.1_1618468_1620070_-	COG1233, COG1233, Phytoene dehydrogenase and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|783aa|down_4|NZ_CP041061.1_1620406_1622755_+	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|190aa|down_5|NZ_CP041061.1_1623588_1624158_+	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|520aa|down_6|NZ_CP041061.1_1624495_1626055_+	COG3866, PelB, Pectate lyase [Carbohydrate transport and metabolism]	NA|111aa|down_7|NZ_CP041061.1_1626240_1626573_-	NA	NA|87aa|down_8|NZ_CP041061.1_1626662_1626923_-	NA	NA|264aa|down_9|NZ_CP041061.1_1627136_1627928_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins
GCF_007833915.1_ASM783391v1	NZ_CP041061	Micromonospora sp. HM134 chromosome, complete genome	5	3939441-3939514	5	CRISPRCasFinder	no		csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	Orphan	GCGTTCCCGTTGCCCGCGTTGCC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	NA|213aa|up_5|NZ_CP041061.1_3931418_3932057_+,NA|196aa|up_3|NZ_CP041061.1_3934550_3935138_+,NA|77aa|down_2|NZ_CP041061.1_3942953_3943184_-,NA|189aa|down_7|NZ_CP041061.1_3947044_3947611_+,NA|174aa|down_9|NZ_CP041061.1_3949315_3949837_-	NA|626aa|up_9|NZ_CP041061.1_3924626_3926504_-	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|485aa|up_8|NZ_CP041061.1_3926692_3928147_+	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|410aa|up_7|NZ_CP041061.1_3928292_3929522_+	cd03794, GT4_WbuB-like, Escherichia coli WbuB and similar proteins	NA|595aa|up_6|NZ_CP041061.1_3929621_3931406_+	pfam13196, DUF4012, Protein of unknown function (DUF4012)	NA|213aa|up_5|NZ_CP041061.1_3931418_3932057_+	NA	NA|104aa|up_4|NZ_CP041061.1_3933793_3934105_+	cd11533, NTP-PPase_Af0060_like, Nucleoside Triphosphate Pyrophosphohydrolase (EC 3	NA|196aa|up_3|NZ_CP041061.1_3934550_3935138_+	NA	NA|138aa|up_2|NZ_CP041061.1_3935425_3935839_+	cd14505, CDKN3-like, cyclin-dependent kinase inhibitor 3 and similar proteins	NA|759aa|up_1|NZ_CP041061.1_3935909_3938186_-	COG0178, UvrA, Excinuclease ATPase subunit [DNA replication, recombination, and repair]	NA|237aa|up_0|NZ_CP041061.1_3938331_3939042_+	cd04772, HTH_TioE_rpt1, First Helix-Turn-Helix DNA binding domain of the regulatory protein TioE	NA|417aa|down_0|NZ_CP041061.1_3940681_3941932_+	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]	NA|243aa|down_1|NZ_CP041061.1_3942151_3942880_+	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|77aa|down_2|NZ_CP041061.1_3942953_3943184_-	NA	NA|411aa|down_3|NZ_CP041061.1_3943402_3944635_+	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]	NA|271aa|down_4|NZ_CP041061.1_3944661_3945474_+	cd14817, D-Ala-D-Ala_dipeptidase_VanX, D-Ala-D-Ala dipeptidase VanX	NA|169aa|down_5|NZ_CP041061.1_3945639_3946146_+	pfam04978, DUF664, Protein of unknown function (DUF664)	NA|260aa|down_6|NZ_CP041061.1_3946196_3946976_-	COG0627, COG0627, Predicted esterase [General function prediction only]	NA|189aa|down_7|NZ_CP041061.1_3947044_3947611_+	NA	NA|400aa|down_8|NZ_CP041061.1_3947991_3949191_+	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|174aa|down_9|NZ_CP041061.1_3949315_3949837_-	NA
GCF_007833915.1_ASM783391v1	NZ_CP041061	Micromonospora sp. HM134 chromosome, complete genome	6	5031591-5032839	1,6,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	Type I-E	GTCGTCCCCGCACGCGCGGGGGTCTT,GTCGTCCCCGCACGCGCGGGGGTCTTCC,GTCGTCCCCGCACGCGCGGGGGTCTTCC	26,28,28	0	0	NA	NA	NA:NA:NA	16,20,20	20	TypeI-E	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	NA|222aa|up_7|NZ_CP041061.1_5015773_5016439_-,NA	NA|159aa|up_9|NZ_CP041061.1_5014689_5015166_-	cd00161, RICIN, Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication	NA|110aa|up_8|NZ_CP041061.1_5015478_5015808_+	PRK15251, PRK15251, cytolethal distending toxin subunit B family protein	NA|222aa|up_7|NZ_CP041061.1_5015773_5016439_-	NA	NA|91aa|up_6|NZ_CP041061.1_5016577_5016850_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|1458aa|up_5|NZ_CP041061.1_5017234_5021608_+	pfam00759, Glyco_hydro_9, Glycosyl hydrolase family 9	NA|105aa|up_4|NZ_CP041061.1_5022417_5022732_-	cd13442, CDI_toxin_Bp1026b_like, Mg-dependent tRNAse of the contact-dependent growth inhibition (CDI) system of Burkholderia pseudomallei 1026b, and related proteins	NA|344aa|up_3|NZ_CP041061.1_5023250_5024282_+	PRK03624, PRK03624, putative acetyltransferase; Provisional	NA|883aa|up_2|NZ_CP041061.1_5024285_5026934_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|650aa|up_1|NZ_CP041061.1_5027053_5029003_+	pfam12810, Gly_rich, Glycine rich protein	NA|383aa|up_0|NZ_CP041061.1_5029290_5030439_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|400aa|down_0|NZ_CP041061.1_5032868_5034068_-	pfam01548, DEDD_Tnp_IS110, Transposase	cas2|93aa|down_1|NZ_CP041061.1_5035339_5035618_-	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	cas1|331aa|down_2|NZ_CP041061.1_5035614_5036607_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|244aa|down_3|NZ_CP041061.1_5036608_5037340_-	pfam08798, CRISPR_assoc, CRISPR associated protein	cas5|270aa|down_4|NZ_CP041061.1_5037345_5038155_-	pfam09704, Cas_Cas5d, CRISPR-associated protein (Cas_Cas5)	cas7|387aa|down_5|NZ_CP041061.1_5038151_5039312_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|195aa|down_6|NZ_CP041061.1_5039370_5039955_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|532aa|down_7|NZ_CP041061.1_5039951_5041547_-	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cas3|922aa|down_8|NZ_CP041061.1_5041884_5044650_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|327aa|down_9|NZ_CP041061.1_5047718_5048699_-	pfam03372, Exo_endo_phos, Endonuclease/Exonuclease/phosphatase family
GCF_007833915.1_ASM783391v1	NZ_CP041061	Micromonospora sp. HM134 chromosome, complete genome	7	5034997-5035207	7,2	CRISPRCasFinder,CRT	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	Type I-E	GTCGTCCCCGCACGCGCGGGGGTCTTCC,GTCGTCCCCGCACGCGCGGGG	28,21	0	0	NA	NA	NA:NA	3,3	3	TypeI-E	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	NA|222aa|up_8|NZ_CP041061.1_5015773_5016439_-,NA	NA|110aa|up_9|NZ_CP041061.1_5015478_5015808_+	PRK15251, PRK15251, cytolethal distending toxin subunit B family protein	NA|222aa|up_8|NZ_CP041061.1_5015773_5016439_-	NA	NA|91aa|up_7|NZ_CP041061.1_5016577_5016850_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|1458aa|up_6|NZ_CP041061.1_5017234_5021608_+	pfam00759, Glyco_hydro_9, Glycosyl hydrolase family 9	NA|105aa|up_5|NZ_CP041061.1_5022417_5022732_-	cd13442, CDI_toxin_Bp1026b_like, Mg-dependent tRNAse of the contact-dependent growth inhibition (CDI) system of Burkholderia pseudomallei 1026b, and related proteins	NA|344aa|up_4|NZ_CP041061.1_5023250_5024282_+	PRK03624, PRK03624, putative acetyltransferase; Provisional	NA|883aa|up_3|NZ_CP041061.1_5024285_5026934_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|650aa|up_2|NZ_CP041061.1_5027053_5029003_+	pfam12810, Gly_rich, Glycine rich protein	NA|383aa|up_1|NZ_CP041061.1_5029290_5030439_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|400aa|up_0|NZ_CP041061.1_5032868_5034068_-	pfam01548, DEDD_Tnp_IS110, Transposase	cas2|93aa|down_0|NZ_CP041061.1_5035339_5035618_-	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	cas1|331aa|down_1|NZ_CP041061.1_5035614_5036607_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|244aa|down_2|NZ_CP041061.1_5036608_5037340_-	pfam08798, CRISPR_assoc, CRISPR associated protein	cas5|270aa|down_3|NZ_CP041061.1_5037345_5038155_-	pfam09704, Cas_Cas5d, CRISPR-associated protein (Cas_Cas5)	cas7|387aa|down_4|NZ_CP041061.1_5038151_5039312_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|195aa|down_5|NZ_CP041061.1_5039370_5039955_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|532aa|down_6|NZ_CP041061.1_5039951_5041547_-	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cas3|922aa|down_7|NZ_CP041061.1_5041884_5044650_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|327aa|down_8|NZ_CP041061.1_5047718_5048699_-	pfam03372, Exo_endo_phos, Endonuclease/Exonuclease/phosphatase family	NA|675aa|down_9|NZ_CP041061.1_5048695_5050720_-	COG4889, COG4889, Predicted helicase [General function prediction only]
GCF_007833915.1_ASM783391v1	NZ_CP041061	Micromonospora sp. HM134 chromosome, complete genome	8	5044978-5047682	2,8,3	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	Type I-E	GTCGTCCCCGCACGCGCGGGG,GTCGTCCCCGCACGCGCGGGGGTCTTCC,GTCGTCCCCGCACGCGCGGGGGTCTTCC	21,28,28	1	1	5047380-5047412	NZ_CP041061.1_475992-476024	NA:NA:NA	44,44,44	44	TypeI-E	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	NA,NA|199aa|down_5|NZ_CP041061.1_5054452_5055049_-,NA|111aa|down_7|NZ_CP041061.1_5056944_5057277_-,NA|226aa|down_8|NZ_CP041061.1_5057459_5058137_-	NA|383aa|up_9|NZ_CP041061.1_5029290_5030439_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|400aa|up_8|NZ_CP041061.1_5032868_5034068_-	pfam01548, DEDD_Tnp_IS110, Transposase	cas2|93aa|up_7|NZ_CP041061.1_5035339_5035618_-	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	cas1|331aa|up_6|NZ_CP041061.1_5035614_5036607_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|244aa|up_5|NZ_CP041061.1_5036608_5037340_-	pfam08798, CRISPR_assoc, CRISPR associated protein	cas5|270aa|up_4|NZ_CP041061.1_5037345_5038155_-	pfam09704, Cas_Cas5d, CRISPR-associated protein (Cas_Cas5)	cas7|387aa|up_3|NZ_CP041061.1_5038151_5039312_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|195aa|up_2|NZ_CP041061.1_5039370_5039955_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|532aa|up_1|NZ_CP041061.1_5039951_5041547_-	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cas3|922aa|up_0|NZ_CP041061.1_5041884_5044650_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|327aa|down_0|NZ_CP041061.1_5047718_5048699_-	pfam03372, Exo_endo_phos, Endonuclease/Exonuclease/phosphatase family	NA|675aa|down_1|NZ_CP041061.1_5048695_5050720_-	COG4889, COG4889, Predicted helicase [General function prediction only]	NA|375aa|down_2|NZ_CP041061.1_5050716_5051841_-	pfam12895, ANAPC3, Anaphase-promoting complex, cyclosome, subunit 3	NA|379aa|down_3|NZ_CP041061.1_5051934_5053071_-	cd09996, HDAC_classII_1, Histone deacetylases and histone-like deacetylases, classII	NA|423aa|down_4|NZ_CP041061.1_5053187_5054456_-	PRK02769, PRK02769, histidine decarboxylase; Provisional	NA|199aa|down_5|NZ_CP041061.1_5054452_5055049_-	NA	NA|486aa|down_6|NZ_CP041061.1_5055490_5056948_-	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|111aa|down_7|NZ_CP041061.1_5056944_5057277_-	NA	NA|226aa|down_8|NZ_CP041061.1_5057459_5058137_-	NA	NA|400aa|down_9|NZ_CP041061.1_5058145_5059345_-	cd06187, O2ase_reductase_like, The oxygenase reductase FAD/NADH binding domain acts as part of the multi-component bacterial oxygenases which oxidize hydrocarbons using oxygen as the oxidant
GCF_007833915.1_ASM783391v1	NZ_CP041061	Micromonospora sp. HM134 chromosome, complete genome	9	5647539-5647798	9	CRISPRCasFinder	no		csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	Orphan	ACGGACCGGCAGACCAGCGCGGACCG	26	0	0	NA	NA	NA	4	4	Orphan	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	NA,NA|67aa|down_2|NZ_CP041061.1_5651062_5651263_-,NA|432aa|down_5|NZ_CP041061.1_5654221_5655517_+,NA|110aa|down_8|NZ_CP041061.1_5658464_5658794_-	NA|174aa|up_9|NZ_CP041061.1_5634216_5634738_-	pfam13671, AAA_33, AAA domain	NA|326aa|up_8|NZ_CP041061.1_5634734_5635712_-	cd01942, ribokinase_group_A, Ribokinase-like subgroup A	NA|123aa|up_7|NZ_CP041061.1_5635946_5636315_-	TIGR00049, Uncharacterized_protein_in_nifU_5'region, Iron-sulfur cluster assembly accessory protein	NA|379aa|up_6|NZ_CP041061.1_5636487_5637624_-	pfam02595, Gly_kinase, Glycerate kinase family	NA|400aa|up_5|NZ_CP041061.1_5637718_5638918_+	PRK09375, PRK09375, quinolinate synthase NadA	NA|452aa|up_4|NZ_CP041061.1_5639157_5640513_+	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|208aa|up_3|NZ_CP041061.1_5640614_5641238_+	pfam11241, DUF3043, Protein of unknown function (DUF3043)	NA|327aa|up_2|NZ_CP041061.1_5641341_5642322_-	cd03267, ABC_NatA_like, ATP-binding cassette domain of an uncharacterized transporter similar in sequence to NatA	NA|273aa|up_1|NZ_CP041061.1_5642318_5643137_-	COG3694, COG3694, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|283aa|up_0|NZ_CP041061.1_5643129_5643978_-	COG4587, COG4587, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|333aa|down_0|NZ_CP041061.1_5649218_5650217_+	cd19074, Aldo_ket_red_shaker-like, Shaker potassium channel beta subunit family and similar proteins	NA|266aa|down_1|NZ_CP041061.1_5650265_5651063_-	cd06158, S2P-M50_like_1, Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|67aa|down_2|NZ_CP041061.1_5651062_5651263_-	NA	NA|654aa|down_3|NZ_CP041061.1_5651431_5653393_+	pfam02277, DBI_PRT, Phosphoribosyltransferase	NA|258aa|down_4|NZ_CP041061.1_5653382_5654156_+	PRK00235, cobS, cobalamin synthase; Reviewed	NA|432aa|down_5|NZ_CP041061.1_5654221_5655517_+	NA	NA|377aa|down_6|NZ_CP041061.1_5655527_5656658_-	PRK00389, gcvT, glycine cleavage system aminomethyltransferase GcvT	NA|524aa|down_7|NZ_CP041061.1_5656819_5658391_+	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|110aa|down_8|NZ_CP041061.1_5658464_5658794_-	NA	NA|463aa|down_9|NZ_CP041061.1_5658975_5660364_+	PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed
GCF_007833915.1_ASM783391v1	NZ_CP041061	Micromonospora sp. HM134 chromosome, complete genome	10	5945098-5945225	10	CRISPRCasFinder	no		csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	Orphan	GTGATCAAGAAGTTTGCGGCGGGAATCGGGCCGGGG	36	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DEDDh,RT,WYL,cas4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,DinG	NA,NA|288aa|down_6|NZ_CP041061.1_5951568_5952432_+	NA|299aa|up_9|NZ_CP041061.1_5934672_5935569_-	PRK00085, recO, DNA repair protein RecO; Reviewed	NA|253aa|up_8|NZ_CP041061.1_5935689_5936448_+	pfam13349, DUF4097, Putative adhesin	NA|361aa|up_7|NZ_CP041061.1_5936464_5937547_-	pfam01757, Acyl_transf_3, Acyltransferase family	NA|299aa|up_6|NZ_CP041061.1_5937686_5938583_-	PRK00089, era, GTPase Era; Reviewed	NA|129aa|up_5|NZ_CP041061.1_5938579_5938966_-	cd01283, cytidine_deaminase, Cytidine deaminase zinc-binding domain	NA|453aa|up_4|NZ_CP041061.1_5938958_5940317_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|158aa|up_3|NZ_CP041061.1_5940316_5940790_-	PRK00016, PRK00016, metal-binding heat shock protein; Provisional	NA|709aa|up_2|NZ_CP041061.1_5940808_5942935_-	COG1702, PhoH, Phosphate starvation-inducible protein PhoH, predicted ATPase [Signal transduction mechanisms]	NA|470aa|up_1|NZ_CP041061.1_5943148_5944558_-	pfam00144, Beta-lactamase, Beta-lactamase	NA|120aa|up_0|NZ_CP041061.1_5944557_5944917_-	cd01276, PKCI_related, Protein Kinase C Interacting protein related (PKCI): PKCI and related proteins belong to the ubiquitous HIT family of hydrolases that act on alpha-phosphates of ribonucleotides	NA|245aa|down_0|NZ_CP041061.1_5945358_5946093_-	PRK11713, PRK11713, 16S ribosomal RNA methyltransferase RsmE; Provisional	NA|388aa|down_1|NZ_CP041061.1_5946282_5947446_-	PRK14278, PRK14278, chaperone protein DnaJ; Provisional	NA|341aa|down_2|NZ_CP041061.1_5947502_5948525_-	PRK00082, hrcA, heat-inducible transcription repressor; Provisional	NA|201aa|down_3|NZ_CP041061.1_5948760_5949363_+	pfam09685, DUF4870, Domain of unknown function (DUF4870)	NA|415aa|down_4|NZ_CP041061.1_5949446_5950691_-	PRK05628, PRK05628, coproporphyrinogen III oxidase; Validated	NA|261aa|down_5|NZ_CP041061.1_5950689_5951472_+	PRK07827, PRK07827, enoyl-CoA hydratase family protein	NA|288aa|down_6|NZ_CP041061.1_5951568_5952432_+	NA	NA|214aa|down_7|NZ_CP041061.1_5952469_5953111_-	COG2258, COG2258, Uncharacterized protein conserved in bacteria [Function unknown]	NA|277aa|down_8|NZ_CP041061.1_5953153_5953984_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|329aa|down_9|NZ_CP041061.1_5953980_5954967_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]
