assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	1	57117-57236	1	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	GGGTGTGGGGTGTAGGGTTTTACCGATTTTGAGG	34	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|50aa|up_8|NC_010296.1_44113_44263_+,NA|67aa|up_1|NC_010296.1_54014_54215_+,NA|164aa|up_0|NC_010296.1_56559_57051_-,NA|81aa|down_0|NC_010296.1_57484_57727_-,NA|47aa|down_8|NC_010296.1_70533_70674_-	NA|465aa|up_9|NC_010296.1_40980_42375_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|50aa|up_8|NC_010296.1_44113_44263_+	NA	NA|86aa|up_7|NC_010296.1_47568_47826_+	TIGR04220, hypothetical_protein_L8106_29040, cyanobactin biosynthesis protein, PatB/AcyB/McaB family	NA|320aa|up_6|NC_010296.1_47949_48909_+	TIGR04447, hypothetical_protein, cyanobactin cluster PatC/TenC/TruC protein	NA|52aa|up_5|NC_010296.1_49053_49209_+	TIGR04446, anacyclamide_precursor, prenylated cyclic peptide, anacyclamide/piricyclamide family	NA|264aa|up_4|NC_010296.1_49280_50072_-	pfam05685, Uma2, Putative restriction endonuclease	NA|342aa|up_3|NC_010296.1_52525_53551_+	cd19080, AKR_AKR9A_9B, AKR9A and AKR9B families of aldo-keto reductase (AKR)	NA|136aa|up_2|NC_010296.1_53547_53955_+	COG2105, COG2105, Uncharacterized conserved protein [Function unknown]	NA|67aa|up_1|NC_010296.1_54014_54215_+	NA	NA|164aa|up_0|NC_010296.1_56559_57051_-	NA	NA|81aa|down_0|NC_010296.1_57484_57727_-	NA	NA|408aa|down_1|NC_010296.1_57925_59148_+	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|74aa|down_2|NC_010296.1_59332_59554_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|453aa|down_3|NC_010296.1_59706_61065_+	PRK00093, PRK00093, GTP-binding protein Der; Reviewed	NA|292aa|down_4|NC_010296.1_61068_61944_+	pfam02361, CbiQ, Cobalt transport protein	NA|369aa|down_5|NC_010296.1_64127_65234_+	NF033203, entero_EhxA, enterohemolysin EhxA	NA|204aa|down_6|NC_010296.1_67538_68150_+	pfam11780, DUF3318, Protein of unknown function (DUF3318)	NA|704aa|down_7|NC_010296.1_68265_70377_+	COG0514, RecQ, Superfamily II DNA helicase [DNA replication, recombination, and repair]	NA|47aa|down_8|NC_010296.1_70533_70674_-	NA	NA|483aa|down_9|NC_010296.1_71387_72836_+	CHL00060, atpB, ATP synthase CF1 beta subunit
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	2	80946-81749	1	CRT	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	AGNAGCCGTATTACCNGNTATTGTACTGTTGGTTAA	36	0	0	NA	NA	NA	10	10	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|47aa|up_8|NC_010296.1_70533_70674_-,NA|129aa|down_0|NC_010296.1_82781_83168_-,NA|337aa|down_1|NC_010296.1_83218_84229_-,NA|340aa|down_3|NC_010296.1_84708_85728_-,NA|51aa|down_7|NC_010296.1_87101_87254_+	NA|704aa|up_9|NC_010296.1_68265_70377_+	COG0514, RecQ, Superfamily II DNA helicase [DNA replication, recombination, and repair]	NA|47aa|up_8|NC_010296.1_70533_70674_-	NA	NA|483aa|up_7|NC_010296.1_71387_72836_+	CHL00060, atpB, ATP synthase CF1 beta subunit	NA|139aa|up_6|NC_010296.1_72911_73328_+	CHL00063, atpE, ATP synthase CF1 epsilon subunit	NA|108aa|up_5|NC_010296.1_73472_73796_-	PRK02724, PRK02724, 30S ribosomal protein PSRP-3	NA|135aa|up_4|NC_010296.1_73899_74304_+	cd03425, MutT_pyrophosphohydrolase, The MutT pyrophosphohydrolase is a prototypical Nudix hydrolase that catalyzes the hydrolysis of nucleoside and deoxynucleoside triphosphates (NTPs and dNTPs) by substitution at a beta-phosphorus to yield a nucleotide monophosphate (NMP) and inorganic pyrophosphate (PPi)	NA|88aa|up_3|NC_010296.1_74678_74942_+	pfam11332, DUF3134, Protein of unknown function (DUF3134)	NA|356aa|up_2|NC_010296.1_74987_76055_+	PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional	NA|108aa|up_1|NC_010296.1_76055_76379_-	COG0316, sufA, Fe-S cluster assembly scaffold protein [Posttranslational modification, protein turnover, chaperones]	NA|97aa|up_0|NC_010296.1_76940_77231_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|129aa|down_0|NC_010296.1_82781_83168_-	NA	NA|337aa|down_1|NC_010296.1_83218_84229_-	NA	NA|116aa|down_2|NC_010296.1_84225_84573_-	pfam14252, DUF4347, Domain of unknown function (DUF4347)	NA|340aa|down_3|NC_010296.1_84708_85728_-	NA	NA|97aa|down_4|NC_010296.1_85781_86072_-	pfam14252, DUF4347, Domain of unknown function (DUF4347)	NA|166aa|down_5|NC_010296.1_86226_86724_-	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|82aa|down_6|NC_010296.1_86720_86966_-	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|51aa|down_7|NC_010296.1_87101_87254_+	NA	NA|307aa|down_8|NC_010296.1_87225_88146_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|170aa|down_9|NC_010296.1_88743_89253_+	PRK07571, PRK07571, bidirectional hydrogenase complex protein HoxE; Reviewed
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	3	335211-335347	2	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	ACCCAAGGAAACCCAACAAAAAGATTCAGACGGATTTAGGT	41	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|58aa|up_9|NC_010296.1_329771_329945_+,NA|115aa|up_3|NC_010296.1_333326_333671_-,NA|416aa|down_3|NC_010296.1_343246_344494_-	NA|58aa|up_9|NC_010296.1_329771_329945_+	NA	NA|79aa|up_8|NC_010296.1_330212_330449_+	pfam13711, DUF4160, Domain of unknown function (DUF4160)	NA|83aa|up_7|NC_010296.1_330432_330681_+	pfam10387, DUF2442, Protein of unknown function (DUF2442)	NA|364aa|up_6|NC_010296.1_331149_332241_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|114aa|up_5|NC_010296.1_332416_332758_-	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|98aa|up_4|NC_010296.1_332754_333048_-	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|115aa|up_3|NC_010296.1_333326_333671_-	NA	NA|116aa|up_2|NC_010296.1_333679_334027_-	pfam18648, ADPRTs_Tse2, Tse2 ADP-ribosyltransferase toxins	NA|99aa|up_1|NC_010296.1_334511_334808_-	pfam15919, HicB_lk_antitox, HicB_like antitoxin of bacterial toxin-antitoxin system	NA|63aa|up_0|NC_010296.1_334819_335008_-	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|1380aa|down_0|NC_010296.1_335427_339567_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|823aa|down_1|NC_010296.1_339632_342101_-	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|376aa|down_2|NC_010296.1_342116_343244_-	pfam08852, DUF1822, Protein of unknown function (DUF1822)	NA|416aa|down_3|NC_010296.1_343246_344494_-	NA	NA|618aa|down_4|NC_010296.1_344767_346621_+	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|316aa|down_5|NC_010296.1_347168_348116_+	PRK00091, miaA, tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed	NA|469aa|down_6|NC_010296.1_348057_349464_-	pfam09852, DUF2079, Predicted membrane protein (DUF2079)	NA|286aa|down_7|NC_010296.1_349476_350334_-	cd13688, PBP2_GltI_DEBP, Substrate-binding domain of ABC aspartate-glutamate transporter; the type 2 periplasmic binding protein fold	NA|346aa|down_8|NC_010296.1_350352_351390_-	pfam01594, AI-2E_transport, AI-2E family transporter	NA|266aa|down_9|NC_010296.1_352029_352827_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	4	473126-473248	3	CRISPRCasFinder	no	Cas14c_CAS-V-F	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Unclear	CTAGAAAAAAAGAGGTGCGTTACATTTCATTAAC	34	0	0	NA	NA	NA	1	1	TypeV	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|74aa|up_3|NC_010296.1_470606_470828_+,NA|115aa|up_1|NC_010296.1_472292_472637_+,NA|85aa|up_0|NC_010296.1_472794_473049_+,NA|101aa|down_0|NC_010296.1_473265_473568_-,NA|89aa|down_6|NC_010296.1_480306_480573_-	NA|246aa|up_9|NC_010296.1_463957_464695_+	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|380aa|up_8|NC_010296.1_464983_466123_+	PRK14036, PRK14036, citrate synthase; Provisional	NA|581aa|up_7|NC_010296.1_466321_468064_+	cd08500, PBP2_NikA_DppA_OppA_like_4, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|423aa|up_6|NC_010296.1_468063_469332_+	COG1819, COG1819, Glycosyl transferases, related to UDP-glucuronosyltransferase [Carbohydrate transport and metabolism / Signal transduction mechanisms]	NA|115aa|up_5|NC_010296.1_469594_469939_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|98aa|up_4|NC_010296.1_469935_470229_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|74aa|up_3|NC_010296.1_470606_470828_+	NA	Cas14c_CAS-V-F|396aa|up_2|NC_010296.1_471055_472243_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|115aa|up_1|NC_010296.1_472292_472637_+	NA	NA|85aa|up_0|NC_010296.1_472794_473049_+	NA	NA|101aa|down_0|NC_010296.1_473265_473568_-	NA	NA|332aa|down_1|NC_010296.1_474076_475072_-	cd05292, LDH_2, A subgroup of L-lactate dehydrogenases	NA|114aa|down_2|NC_010296.1_475455_475797_+	pfam06108, DUF952, Protein of unknown function (DUF952)	NA|438aa|down_3|NC_010296.1_475773_477087_-	PRK07583, PRK07583, cytosine deaminase	NA|296aa|down_4|NC_010296.1_477645_478533_-	PRK06245, cofG, FO synthase subunit 1; Reviewed	NA|555aa|down_5|NC_010296.1_478645_480310_+	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|89aa|down_6|NC_010296.1_480306_480573_-	NA	NA|132aa|down_7|NC_010296.1_480711_481107_-	COG5499, COG5499, Predicted transcription regulator containing HTH domain [Transcription]	NA|87aa|down_8|NC_010296.1_481093_481354_-	pfam09907, HigB_toxin, HigB_toxin, RelE-like toxic component of a toxin-antitoxin system	NA|237aa|down_9|NC_010296.1_481787_482498_+	COG2138, COG2138, Sirohydrochlorin ferrochelatase [Inorganic ion transport and metabolism]
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	5	546309-546474	2	CRT	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	CCCTTGATAAGGGGGGTG	18	2	37	546364-546382|546364-546382|546364-546382|546364-546382|546364-546382|546364-546382|546364-546382|546364-546382|546364-546382|546364-546382|546364-546382|546364-546382|546364-546382|546364-546382|546364-546382|546364-546382|546364-546382|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456|546438-546456	NC_010296.1_403693-403675|NC_010296.1_1661250-1661232|NC_010296.1_1761578-1761596|NC_010296.1_1979029-1979047|NC_010296.1_3091314-3091296|NC_010296.1_3217867-3217885|NC_010296.1_3541378-3541360|NC_010296.1_3541415-3541397|NC_010296.1_3541452-3541434|NC_010296.1_3541489-3541471|NC_010296.1_5277947-5277965|NC_010296.1_5364281-5364299|NC_010296.1_5514547-5514565|NC_010296.1_1485334-1485352|NC_010296.1_4223594-4223576|NC_010296.1_5364244-5364262|NC_010296.1_2778633-2778651|NC_010296.1_225126-225144|NC_010296.1_2151615-2151597|NC_010296.1_2190724-2190742|NC_010296.1_2569747-2569729|NC_010296.1_3915512-3915530|NC_010296.1_4698694-4698676|NC_010296.1_5354021-5354039|NC_010296.1_5474736-5474718|NC_010296.1_1975549-1975567|NC_010296.1_2340175-2340157|NC_010296.1_2878721-2878703|NC_010296.1_3845455-3845473|NC_010296.1_3845492-3845510|NC_010296.1_3845529-3845547|NC_010296.1_5083876-5083858|NC_010296.1_5277984-5278002|NC_010296.1_5347566-5347584|NC_010296.1_5733075-5733093|NC_010296.1_5795386-5795368|NC_010296.1_2289924-2289942	NA	4	4	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|68aa|up_8|NC_010296.1_533422_533626_-,NA|129aa|up_5|NC_010296.1_536380_536767_+,NA|51aa|down_5|NC_010296.1_552105_552258_-,NA|51aa|down_7|NC_010296.1_552543_552696_+	NA|225aa|up_9|NC_010296.1_532761_533436_-	TIGR04283, glycosyl_transferase_family_2, transferase 2, rSAM/selenodomain-associated	NA|68aa|up_8|NC_010296.1_533422_533626_-	NA	NA|139aa|up_7|NC_010296.1_533667_534084_+	COG1832, COG1832, Predicted CoA-binding protein [General function prediction only]	NA|620aa|up_6|NC_010296.1_534131_535991_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|129aa|up_5|NC_010296.1_536380_536767_+	NA	NA|459aa|up_4|NC_010296.1_537089_538466_-	PRK01077, PRK01077, cobyrinate a,c-diamide synthase	NA|553aa|up_3|NC_010296.1_538978_540637_-	pfam14104, DUF4277, Domain of unknown function (DUF4277)	NA|393aa|up_2|NC_010296.1_541177_542355_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|560aa|up_1|NC_010296.1_542852_544532_+	PRK09344, PRK09344, phosphoenolpyruvate carboxykinase	NA|408aa|up_0|NC_010296.1_545026_546250_-	TIGR00275, TIGR00275, flavoprotein, HI0933 family	NA|233aa|down_0|NC_010296.1_546521_547220_-	TIGR03410, urea_trans_UrtE, urea ABC transporter, ATP-binding protein UrtE	NA|249aa|down_1|NC_010296.1_547258_548005_-	TIGR03411, urea_trans_UrtD, urea ABC transporter, ATP-binding protein UrtD	NA|385aa|down_2|NC_010296.1_548118_549273_-	TIGR03408, urea_trans_UrtC, urea ABC transporter, permease protein UrtC	NA|387aa|down_3|NC_010296.1_549276_550437_-	TIGR03409, urea_trans_UrtB, urea ABC transporter, permease protein UrtB	NA|439aa|down_4|NC_010296.1_550531_551848_-	pfam13433, Peripla_BP_5, Periplasmic binding protein domain	NA|51aa|down_5|NC_010296.1_552105_552258_-	NA	NA|92aa|down_6|NC_010296.1_552254_552530_-	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|51aa|down_7|NC_010296.1_552543_552696_+	NA	NA|111aa|down_8|NC_010296.1_552852_553185_-	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|202aa|down_9|NC_010296.1_553398_554004_+	cd01457, vWA_ORF176_type, VWA ORF176 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	6	835367-835457	4	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	GATTTGCGTTGAAGCACTTTTTTGC	25	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|117aa|up_9|NC_010296.1_823757_824108_+,NA|56aa|up_4|NC_010296.1_827999_828167_+,NA|84aa|down_0|NC_010296.1_837257_837509_+,NA|263aa|down_5|NC_010296.1_841228_842017_+,NA|264aa|down_9|NC_010296.1_845537_846329_+	NA|117aa|up_9|NC_010296.1_823757_824108_+	NA	NA|283aa|up_8|NC_010296.1_824159_825008_-	pfam01710, HTH_Tnp_IS630, Transposase	NA|376aa|up_7|NC_010296.1_825631_826759_+	PRK11783, rlmL, bifunctional 23S rRNA (guanine(2069)-N(7))-methyltransferase RlmK/23S rRNA (guanine(2445)-N(2))-methyltransferase RlmL	NA|326aa|up_6|NC_010296.1_826759_827737_-	cd01339, LDH-like_MDH, L-lactate dehydrogenase-like malate dehydrogenase proteins	NA|72aa|up_5|NC_010296.1_827763_827979_-	pfam11910, NdhO, Cyanobacterial and plant NDH-1 subunit O	NA|56aa|up_4|NC_010296.1_827999_828167_+	NA	NA|884aa|up_3|NC_010296.1_828209_830861_-	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|450aa|up_2|NC_010296.1_831039_832389_-	PRK02705, murD, UDP-N-acetylmuramoyl-L-alanine--D-glutamate ligase	NA|371aa|up_1|NC_010296.1_832777_833890_+	pfam12565, DUF3747, Protein of unknown function (DUF3747)	NA|369aa|up_0|NC_010296.1_834179_835287_+	PRK00578, prfB, peptide chain release factor 2; Validated	NA|84aa|down_0|NC_010296.1_837257_837509_+	NA	NA|388aa|down_1|NC_010296.1_837601_838765_-	PLN02449, PLN02449, ferrochelatase	NA|208aa|down_2|NC_010296.1_838884_839508_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|185aa|down_3|NC_010296.1_839607_840162_+	COG2179, COG2179, Predicted hydrolase of the HAD superfamily [General function prediction only]	NA|252aa|down_4|NC_010296.1_840414_841170_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|263aa|down_5|NC_010296.1_841228_842017_+	NA	NA|331aa|down_6|NC_010296.1_842141_843134_-	PRK05479, PRK05479, ketol-acid reductoisomerase; Provisional	NA|372aa|down_7|NC_010296.1_843645_844761_+	COG4972, PilM, Tfp pilus assembly protein, ATPase PilM [Cell motility and secretion / Intracellular trafficking and secretion]	NA|257aa|down_8|NC_010296.1_844770_845541_+	COG3166, PilN, Tfp pilus assembly protein PilN [Cell motility and secretion / Intracellular trafficking and secretion]	NA|264aa|down_9|NC_010296.1_845537_846329_+	NA
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	7	1085282-1085367	5	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	GATCGAACCATACCAAGCAACGAAGAACC	29	1	23	1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338|1085311-1085338	NC_010296.1_671718-671745|NC_010296.1_1120867-1120894|NC_010296.1_2672468-2672495|NC_010296.1_2850671-2850698|NC_010296.1_3577743-3577770|NC_010296.1_4068809-4068836|NC_010296.1_4332668-4332641|NC_010296.1_4412396-4412369|NC_010296.1_5353229-5353202|NC_010296.1_1807024-1807051|NC_010296.1_3243055-3243028|NC_010296.1_3244284-3244257|NC_010296.1_3711016-3710989|NC_010296.1_4149675-4149648|NC_010296.1_4454381-4454408|NC_010296.1_5624224-5624197|NC_010296.1_632708-632681|NC_010296.1_759411-759438|NC_010296.1_1879074-1879047|NC_010296.1_3262259-3262286|NC_010296.1_3377991-3378018|NC_010296.1_5110961-5110934|NC_010296.1_5635024-5634997	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|75aa|up_6|NC_010296.1_1074253_1074478_-,NA|63aa|up_3|NC_010296.1_1078737_1078926_+,NA|61aa|down_0|NC_010296.1_1093935_1094118_+,NA|87aa|down_7|NC_010296.1_1107288_1107549_-	NA|149aa|up_9|NC_010296.1_1070895_1071342_-	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|769aa|up_8|NC_010296.1_1071280_1073587_-	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|94aa|up_7|NC_010296.1_1073975_1074257_-	COG3041, COG3041, Uncharacterized protein conserved in bacteria [Function unknown]	NA|75aa|up_6|NC_010296.1_1074253_1074478_-	NA	NA|684aa|up_5|NC_010296.1_1074779_1076831_-	cd06456, M3A_DCP, Peptidase family M3, dipeptidyl carboxypeptidase (DCP)	NA|345aa|up_4|NC_010296.1_1077499_1078534_-	TIGR02475, Probable_cobalamine_biosynthesis_protein, cobalamin biosynthesis protein CobW	NA|63aa|up_3|NC_010296.1_1078737_1078926_+	NA	NA|635aa|up_2|NC_010296.1_1079259_1081164_-	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|504aa|up_1|NC_010296.1_1081727_1083239_+	TIGR02730, Carotenoid_isomerase, carotene isomerase	NA|560aa|up_0|NC_010296.1_1083453_1085133_+	COG0661, AarF, Predicted unusual protein kinase [General function prediction only]	NA|61aa|down_0|NC_010296.1_1093935_1094118_+	NA	NA|541aa|down_1|NC_010296.1_1097822_1099445_+	smart00237, Calx_beta, Domains in Na-Ca exchangers and integrin-beta4	NA|227aa|down_2|NC_010296.1_1100081_1100762_-	cd17877, NP_MTAN-like, nucleoside phosphorylases similar to 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidases	NA|352aa|down_3|NC_010296.1_1100885_1101941_+	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|302aa|down_4|NC_010296.1_1102129_1103035_-	PRK10446, PRK10446, 30S ribosomal protein S6--L-glutamate ligase	NA|1067aa|down_5|NC_010296.1_1103496_1106697_-	pfam12770, CHAT, CHAT domain	NA|137aa|down_6|NC_010296.1_1106891_1107302_-	cd18696, PIN_MtVapC26-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC26 and related proteins	NA|87aa|down_7|NC_010296.1_1107288_1107549_-	NA	NA|345aa|down_8|NC_010296.1_1107673_1108708_-	pfam14420, Clr5, Clr5 domain	NA|204aa|down_9|NC_010296.1_1108970_1109582_+	NF033203, entero_EhxA, enterohemolysin EhxA
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	8	1394867-1394970	6	CRISPRCasFinder	no	cas14j	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Unclear	AGTTTTATTGTCTAGGGGTGTCTCTGGTTGAG	32	0	0	NA	NA	NA	1	1	TypeV	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|62aa|up_5|NC_010296.1_1388625_1388811_+,NA|200aa|down_6|NC_010296.1_1401757_1402357_-,NA|405aa|down_7|NC_010296.1_1402405_1403620_-,NA|168aa|down_8|NC_010296.1_1404208_1404712_-,NA|105aa|down_9|NC_010296.1_1405002_1405317_+	NA|488aa|up_9|NC_010296.1_1384014_1385478_-	PRK07349, PRK07349, amidophosphoribosyltransferase; Provisional	NA|757aa|up_8|NC_010296.1_1385745_1388016_-	PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase subunit PurL	NA|67aa|up_7|NC_010296.1_1388104_1388305_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|69aa|up_6|NC_010296.1_1388301_1388508_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|62aa|up_5|NC_010296.1_1388625_1388811_+	NA	NA|261aa|up_4|NC_010296.1_1389030_1389813_+	pfam11103, DUF2887, Protein of unknown function (DUF2887)	NA|118aa|up_3|NC_010296.1_1390024_1390378_-	pfam13744, HTH_37, Helix-turn-helix domain	NA|100aa|up_2|NC_010296.1_1390481_1390781_-	COG3668, ParE, Plasmid stabilization system protein [General function prediction only]	NA|87aa|up_1|NC_010296.1_1390783_1391044_-	TIGR02606, Antitoxin_ParD, putative addiction module antidote protein, CC2985 family	NA|393aa|up_0|NC_010296.1_1391285_1392463_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|216aa|down_0|NC_010296.1_1396191_1396839_+	pfam11866, DUF3386, Protein of unknown function (DUF3386)	NA|79aa|down_1|NC_010296.1_1397139_1397376_+	pfam01106, NifU, NifU-like domain	NA|363aa|down_2|NC_010296.1_1397663_1398752_+	pfam00180, Iso_dh, Isocitrate/isopropylmalate dehydrogenase	NA|208aa|down_3|NC_010296.1_1398912_1399536_+	pfam14218, COP23, Circadian oscillating protein COP23	NA|310aa|down_4|NC_010296.1_1399573_1400503_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|402aa|down_5|NC_010296.1_1400531_1401737_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|200aa|down_6|NC_010296.1_1401757_1402357_-	NA	NA|405aa|down_7|NC_010296.1_1402405_1403620_-	NA	NA|168aa|down_8|NC_010296.1_1404208_1404712_-	NA	NA|105aa|down_9|NC_010296.1_1405002_1405317_+	NA
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	9	1396906-1397012	7	CRISPRCasFinder	no	cas14j	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Unclear	GATCCCCCCGCCTATCGGCACCCCCCTTATCAAGGG	36	0	0	NA	NA	NA	1	1	TypeV	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|62aa|up_7|NC_010296.1_1388625_1388811_+,NA|200aa|down_5|NC_010296.1_1401757_1402357_-,NA|405aa|down_6|NC_010296.1_1402405_1403620_-,NA|168aa|down_7|NC_010296.1_1404208_1404712_-,NA|105aa|down_8|NC_010296.1_1405002_1405317_+,NA|58aa|down_9|NC_010296.1_1405345_1405519_+	NA|67aa|up_9|NC_010296.1_1388104_1388305_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|69aa|up_8|NC_010296.1_1388301_1388508_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|62aa|up_7|NC_010296.1_1388625_1388811_+	NA	NA|261aa|up_6|NC_010296.1_1389030_1389813_+	pfam11103, DUF2887, Protein of unknown function (DUF2887)	NA|118aa|up_5|NC_010296.1_1390024_1390378_-	pfam13744, HTH_37, Helix-turn-helix domain	NA|100aa|up_4|NC_010296.1_1390481_1390781_-	COG3668, ParE, Plasmid stabilization system protein [General function prediction only]	NA|87aa|up_3|NC_010296.1_1390783_1391044_-	TIGR02606, Antitoxin_ParD, putative addiction module antidote protein, CC2985 family	NA|393aa|up_2|NC_010296.1_1391285_1392463_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|617aa|up_1|NC_010296.1_1394158_1396009_-	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|216aa|up_0|NC_010296.1_1396191_1396839_+	pfam11866, DUF3386, Protein of unknown function (DUF3386)	NA|79aa|down_0|NC_010296.1_1397139_1397376_+	pfam01106, NifU, NifU-like domain	NA|363aa|down_1|NC_010296.1_1397663_1398752_+	pfam00180, Iso_dh, Isocitrate/isopropylmalate dehydrogenase	NA|208aa|down_2|NC_010296.1_1398912_1399536_+	pfam14218, COP23, Circadian oscillating protein COP23	NA|310aa|down_3|NC_010296.1_1399573_1400503_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|402aa|down_4|NC_010296.1_1400531_1401737_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|200aa|down_5|NC_010296.1_1401757_1402357_-	NA	NA|405aa|down_6|NC_010296.1_1402405_1403620_-	NA	NA|168aa|down_7|NC_010296.1_1404208_1404712_-	NA	NA|105aa|down_8|NC_010296.1_1405002_1405317_+	NA	NA|58aa|down_9|NC_010296.1_1405345_1405519_+	NA
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	10	1585741-1585902	8	CRISPRCasFinder	no	cas14j	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Unclear	CTCCTGGATAATAGATATAGCTATCGGAAATTTCCAATCCCTCTGGATAAT	51	0	0	NA	NA	NA	1	1	TypeV	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|47aa|up_0|NC_010296.1_1585306_1585447_-,NA|77aa|down_4|NC_010296.1_1590409_1590640_-	NA|759aa|up_9|NC_010296.1_1573041_1575318_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|65aa|up_8|NC_010296.1_1576008_1576203_+	pfam09957, VapB_antitoxin, Bacterial antitoxin of type II TA system, VapB	NA|124aa|up_7|NC_010296.1_1576199_1576571_+	cd18760, PIN_MtVapC3-like, uncharacterized subgroup of the VapC3-like nuclease subfamily of the PIN domain superfamily	NA|79aa|up_6|NC_010296.1_1576825_1577062_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|232aa|up_5|NC_010296.1_1577151_1577847_-	cd19927, REC_Ycf29, phosphoacceptor receiver (REC) domain of probable transcriptional regulator Ycf29	NA|719aa|up_4|NC_010296.1_1578923_1581080_+	TIGR01418, Phosphoenolpyruvate_synthase, phosphoenolpyruvate synthase	NA|917aa|up_3|NC_010296.1_1581659_1584410_-	PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated	NA|55aa|up_2|NC_010296.1_1584460_1584625_+	pfam07994, NAD_binding_5, Myo-inositol-1-phosphate synthase	NA|205aa|up_1|NC_010296.1_1584613_1585228_-	pfam05685, Uma2, Putative restriction endonuclease	NA|47aa|up_0|NC_010296.1_1585306_1585447_-	NA	NA|335aa|down_0|NC_010296.1_1586226_1587231_+	PRK10130, PRK10130, HTH-type transcriptional regulator EutR	NA|218aa|down_1|NC_010296.1_1587427_1588081_-	pfam04955, HupE_UreJ, HupE / UreJ protein	NA|309aa|down_2|NC_010296.1_1588095_1589022_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|399aa|down_3|NC_010296.1_1589224_1590421_+	pfam01837, HcyBio, Homocysteine biosynthesis enzyme, sulfur-incorporation	NA|77aa|down_4|NC_010296.1_1590409_1590640_-	NA	NA|507aa|down_5|NC_010296.1_1591264_1592785_+	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|421aa|down_6|NC_010296.1_1592929_1594192_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	cas14j|396aa|down_7|NC_010296.1_1594282_1595470_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|223aa|down_8|NC_010296.1_1595513_1596182_-	cd16444, LipB, lipoyl/octanoyl transferase	NA|445aa|down_9|NC_010296.1_1596808_1598143_-	COG2268, COG2268, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	11	1635598-1635708	9	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	TCTTTTAGAAAAGGAACAAGAACGACA	27	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|127aa|up_2|NC_010296.1_1632582_1632963_-,NA|52aa|down_3|NC_010296.1_1638839_1638995_+,NA|89aa|down_8|NC_010296.1_1643834_1644101_-	NA|143aa|up_9|NC_010296.1_1625642_1626071_+	cd09881, PIN_VapC4-5_FitB-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4 and VapC5, and Neisseria gonorrhoeae FitB and related proteins	NA|309aa|up_8|NC_010296.1_1626279_1627206_+	cd05230, UGD_SDR_e, UDP-glucuronate decarboxylase (UGD) and related proteins, extended (e) SDRs	NA|462aa|up_7|NC_010296.1_1627251_1628637_+	COG1004, Ugd, Predicted UDP-glucose 6-dehydrogenase [Cell envelope biogenesis, outer membrane]	NA|228aa|up_6|NC_010296.1_1628876_1629560_+	pfam06967, Mo-nitro_C, Mo-dependent nitrogenase C-terminus	NA|563aa|up_5|NC_010296.1_1629910_1631599_+	TIGR00815, Sulfate_transporter, high affinity sulphate transporter 1	NA|73aa|up_4|NC_010296.1_1631909_1632128_+	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|107aa|up_3|NC_010296.1_1632129_1632450_+	cd18770, PIN_Mut7-C-like, uncharacterized subgroup of the Mut7-C-like family of the PIN domain superfamily	NA|127aa|up_2|NC_010296.1_1632582_1632963_-	NA	NA|167aa|up_1|NC_010296.1_1633138_1633639_+	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|262aa|up_0|NC_010296.1_1634088_1634874_+	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|202aa|down_0|NC_010296.1_1635843_1636449_+	PRK05986, PRK05986, cob(I)yrinic acid a,c-diamide adenosyltransferase	NA|394aa|down_1|NC_010296.1_1636669_1637851_-	COG1453, COG1453, Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]	NA|255aa|down_2|NC_010296.1_1638008_1638773_-	PRK05716, PRK05716, methionine aminopeptidase; Validated	NA|52aa|down_3|NC_010296.1_1638839_1638995_+	NA	NA|270aa|down_4|NC_010296.1_1639321_1640131_-	COG1117, PstB, ABC-type phosphate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|295aa|down_5|NC_010296.1_1640142_1641027_-	TIGR00974, 3a0107s02c, phosphate ABC transporter, permease protein PstA	NA|325aa|down_6|NC_010296.1_1641031_1642006_-	COG0573, PstC, ABC-type phosphate transport system, permease component [Inorganic ion transport and metabolism]	NA|338aa|down_7|NC_010296.1_1642399_1643413_-	cd13565, PBP2_PstS, Substrate binding domain of ABC-type phosphate transporter, a member of the type 2 periplasmic-binding fold superfamily	NA|89aa|down_8|NC_010296.1_1643834_1644101_-	NA	NA|266aa|down_9|NC_010296.1_1644615_1645413_-	PRK14243, PRK14243, phosphate transporter ATP-binding protein; Provisional
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	12	1656795-1656934	10	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	ATCGGCACCCCCCTTATCAAGGG	23	0	0	NA	NA	NA	3	3	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|50aa|up_3|NC_010296.1_1653615_1653765_+,NA|127aa|down_2|NC_010296.1_1658001_1658382_-,NA|66aa|down_4|NC_010296.1_1659797_1659995_+	NA|268aa|up_9|NC_010296.1_1645504_1646308_-	PRK14243, PRK14243, phosphate transporter ATP-binding protein; Provisional	NA|298aa|up_8|NC_010296.1_1646516_1647410_-	TIGR00974, 3a0107s02c, phosphate ABC transporter, permease protein PstA	NA|320aa|up_7|NC_010296.1_1647633_1648593_-	COG0573, PstC, ABC-type phosphate transport system, permease component [Inorganic ion transport and metabolism]	NA|374aa|up_6|NC_010296.1_1648687_1649809_-	TIGR00975, precursor_PBP-3_PstS-3_Antigen_Ag88	NA|345aa|up_5|NC_010296.1_1650408_1651443_+	cd13654, PBP2_phosphate_like_2, Substrate binding domain of putative ABC-type phosphate transporter, a member of the type 2 periplasmic binding fold superfamily	NA|514aa|up_4|NC_010296.1_1651866_1653408_-	PRK09566, nirA, ferredoxin-nitrite reductase; Reviewed	NA|50aa|up_3|NC_010296.1_1653615_1653765_+	NA	NA|213aa|up_2|NC_010296.1_1653879_1654518_+	COG2802, COG2802, Uncharacterized protein, similar to the N-terminal domain of Lon protease [General function prediction only]	NA|201aa|up_1|NC_010296.1_1654748_1655351_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|306aa|up_0|NC_010296.1_1655352_1656270_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|164aa|down_0|NC_010296.1_1656993_1657485_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|156aa|down_1|NC_010296.1_1657495_1657963_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|127aa|down_2|NC_010296.1_1658001_1658382_-	NA	NA|351aa|down_3|NC_010296.1_1658724_1659777_-	COG2334, COG2334, Putative homoserine kinase type II (protein kinase fold) [General function prediction only]	NA|66aa|down_4|NC_010296.1_1659797_1659995_+	NA	NA|348aa|down_5|NC_010296.1_1660100_1661144_-	cd07025, Peptidase_S66, LD-Carboxypeptidase, a serine protease, includes microcin C7 self immunity protein	NA|97aa|down_6|NC_010296.1_1661198_1661489_-	pfam10779, XhlA, Haemolysin XhlA	NA|156aa|down_7|NC_010296.1_1661531_1661999_-	cd04586, CBS_pair_BON_assoc, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain	NA|189aa|down_8|NC_010296.1_1662160_1662727_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|283aa|down_9|NC_010296.1_1662798_1663647_+	PRK07428, PRK07428, carboxylating nicotinate-nucleotide diphosphorylase
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	13	1760800-1760934	11	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	AATGATTGAATGAATACTATGATTGTGTTGTCTTCAGCA	39	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|56aa|up_3|NC_010296.1_1757056_1757224_-,NA|115aa|down_0|NC_010296.1_1761231_1761576_+	NA|88aa|up_9|NC_010296.1_1749287_1749551_+	pfam12441, CopG_antitoxin, CopG antitoxin of type II toxin-antitoxin system	NA|354aa|up_8|NC_010296.1_1750150_1751212_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|88aa|up_7|NC_010296.1_1751466_1751730_-	pfam11344, DUF3146, Protein of unknown function (DUF3146)	NA|330aa|up_6|NC_010296.1_1751816_1752806_-	COG1808, COG1808, Predicted membrane protein [Function unknown]	NA|186aa|up_5|NC_010296.1_1755000_1755558_-	PRK05618, PRK05618, 50S ribosomal protein L25/general stress protein Ctc; Reviewed	NA|448aa|up_4|NC_010296.1_1755585_1756929_-	PRK01117, PRK01117, adenylosuccinate synthetase; Provisional	NA|56aa|up_3|NC_010296.1_1757056_1757224_-	NA	NA|388aa|up_2|NC_010296.1_1757284_1758448_-	pfam03739, YjgP_YjgQ, Predicted permease YjgP/YjgQ family	NA|348aa|up_1|NC_010296.1_1758655_1759699_+	cd03319, L-Ala-DL-Glu_epimerase, L-Ala-D/L-Glu epimerase catalyzes the epimerization of L-Ala-D/L-Glu and other dipeptides	NA|348aa|up_0|NC_010296.1_1759701_1760745_+	COG3367, COG3367, Uncharacterized conserved protein [Function unknown]	NA|115aa|down_0|NC_010296.1_1761231_1761576_+	NA	NA|336aa|down_1|NC_010296.1_1761705_1762713_-	PRK14299, PRK14299, chaperone protein DnaJ; Provisional	NA|334aa|down_2|NC_010296.1_1765012_1766014_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|504aa|down_3|NC_010296.1_1766563_1768075_-	COG5305, COG5305, Predicted membrane protein [Function unknown]	NA|746aa|down_4|NC_010296.1_1768098_1770336_-	pfam05231, MASE1, MASE1	NA|211aa|down_5|NC_010296.1_1770804_1771437_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|226aa|down_6|NC_010296.1_1771456_1772134_-	pfam12697, Abhydrolase_6, Alpha/beta hydrolase family	NA|251aa|down_7|NC_010296.1_1772325_1773078_+	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|355aa|down_8|NC_010296.1_1773264_1774329_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|298aa|down_9|NC_010296.1_1774492_1775386_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	14	2288925-2289064	12	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	GCTTTTTCCCCAATTTCTGTCACTGATTCTGTAATAGTTTCCCCGATAGCCGC	53	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|283aa|up_5|NC_010296.1_2281988_2282837_+,NA|79aa|up_2|NC_010296.1_2284405_2284642_-,NA|119aa|up_1|NC_010296.1_2284759_2285116_-,NA|55aa|down_3|NC_010296.1_2294426_2294591_+,NA|371aa|down_7|NC_010296.1_2299545_2300658_-	NA|243aa|up_9|NC_010296.1_2277787_2278516_+	pfam02683, DsbD, Cytochrome C biogenesis protein transmembrane region	NA|448aa|up_8|NC_010296.1_2278566_2279910_+	CHL00177, ccs1, c-type cytochrome biogenensis protein; Validated	NA|286aa|up_7|NC_010296.1_2280025_2280883_+	PRK13945, PRK13945, formamidopyrimidine-DNA glycosylase; Provisional	NA|277aa|up_6|NC_010296.1_2280883_2281714_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|283aa|up_5|NC_010296.1_2281988_2282837_+	NA	NA|168aa|up_4|NC_010296.1_2282874_2283378_-	PRK12886, ubiA, prenyltransferase; Reviewed	NA|308aa|up_3|NC_010296.1_2283416_2284340_+	COG0392, COG0392, Predicted integral membrane protein [Function unknown]	NA|79aa|up_2|NC_010296.1_2284405_2284642_-	NA	NA|119aa|up_1|NC_010296.1_2284759_2285116_-	NA	NA|1109aa|up_0|NC_010296.1_2285230_2288557_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|328aa|down_0|NC_010296.1_2290009_2290993_+	PRK07452, PRK07452, DNA polymerase III subunit delta; Validated	NA|470aa|down_1|NC_010296.1_2291116_2292526_-	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|551aa|down_2|NC_010296.1_2292767_2294420_+	pfam00498, FHA, FHA domain	NA|55aa|down_3|NC_010296.1_2294426_2294591_+	NA	NA|386aa|down_4|NC_010296.1_2294635_2295793_-	cd12828, TmCorA-like_1, Thermotoga maritima CorA_like subfamily	NA|343aa|down_5|NC_010296.1_2295984_2297013_+	PRK02746, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase PdxA	NA|455aa|down_6|NC_010296.1_2297037_2298402_+	cd07100, ALDH_SSADH1_GabD1, Mycobacterium tuberculosis succinate-semialdehyde dehydrogenase 1-like	NA|371aa|down_7|NC_010296.1_2299545_2300658_-	NA	NA|105aa|down_8|NC_010296.1_2300788_2301103_-	pfam08855, DUF1825, Domain of unknown function (DUF1825)	NA|250aa|down_9|NC_010296.1_2301355_2302105_-	PRK14831, PRK14831, undecaprenyl pyrophosphate synthase; Provisional
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	15	2382045-2382122	13	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	GATCGCCGTTTTCAAGAAACAGAA	24	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|332aa|up_6|NC_010296.1_2366132_2367128_-,NA|153aa|up_3|NC_010296.1_2369783_2370242_-,NA|424aa|up_2|NC_010296.1_2370366_2371638_-,NA|78aa|down_1|NC_010296.1_2383412_2383646_-,NA|110aa|down_3|NC_010296.1_2384754_2385084_-,NA|51aa|down_7|NC_010296.1_2389028_2389181_-,NA|62aa|down_8|NC_010296.1_2389236_2389422_+	NA|379aa|up_9|NC_010296.1_2362994_2364131_-	TIGR02032, Uncharacterized_protein_MJ1520, geranylgeranyl reductase family	NA|395aa|up_8|NC_010296.1_2364370_2365555_+	COG4637, COG4637, Predicted ATPase [General function prediction only]	NA|173aa|up_7|NC_010296.1_2365554_2366073_+	pfam14103, DUF4276, Domain of unknown function (DUF4276)	NA|332aa|up_6|NC_010296.1_2366132_2367128_-	NA	NA|341aa|up_5|NC_010296.1_2367365_2368388_-	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|422aa|up_4|NC_010296.1_2368387_2369653_-	cd01299, Met_dep_hydrolase_A, Metallo-dependent hydrolases, subgroup A is part of the superfamily of metallo-dependent hydrolases, a large group of proteins that show conservation in their 3-dimensional fold (TIM barrel) and in details of their active site	NA|153aa|up_3|NC_010296.1_2369783_2370242_-	NA	NA|424aa|up_2|NC_010296.1_2370366_2371638_-	NA	NA|347aa|up_1|NC_010296.1_2371776_2372817_-	pfam01764, Lipase_3, Lipase (class 3)	NA|219aa|up_0|NC_010296.1_2374869_2375526_-	pfam08924, DUF1906, Domain of unknown function (DUF1906)	NA|245aa|down_0|NC_010296.1_2382605_2383340_-	pfam01764, Lipase_3, Lipase (class 3)	NA|78aa|down_1|NC_010296.1_2383412_2383646_-	NA	NA|262aa|down_2|NC_010296.1_2383843_2384629_-	COG3179, COG3179, Predicted chitinase [General function prediction only]	NA|110aa|down_3|NC_010296.1_2384754_2385084_-	NA	NA|131aa|down_4|NC_010296.1_2385606_2385999_-	pfam14105, DUF4278, Domain of unknown function (DUF4278)	NA|134aa|down_5|NC_010296.1_2386153_2386555_-	pfam00498, FHA, FHA domain	NA|350aa|down_6|NC_010296.1_2387045_2388095_+	pfam01594, AI-2E_transport, AI-2E family transporter	NA|51aa|down_7|NC_010296.1_2389028_2389181_-	NA	NA|62aa|down_8|NC_010296.1_2389236_2389422_+	NA	NA|431aa|down_9|NC_010296.1_2389411_2390704_-	TIGR00225, Tail-specific_protease, C-terminal peptidase (prc)
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	16	2498662-2498764	14	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	GCTTGGGCTGCCGCAATGGTTTCAG	25	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|71aa|up_5|NC_010296.1_2486444_2486657_-,NA|325aa|down_0|NC_010296.1_2499324_2500299_-,NA|259aa|down_1|NC_010296.1_2500797_2501574_-	NA|333aa|up_9|NC_010296.1_2482561_2483560_-	cd01168, adenosine_kinase, Adenosine kinase (AK) catalyzes the phosphorylation of ribofuranosyl-containing nucleoside analogues at the 5'-hydroxyl using ATP or GTP as the phosphate donor	NA|71aa|up_8|NC_010296.1_2483724_2483937_-	pfam10742, DUF2555, Protein of unknown function (DUF2555)	NA|229aa|up_7|NC_010296.1_2485401_2486088_-	pfam12836, HHH_3, Helix-hairpin-helix motif	NA|118aa|up_6|NC_010296.1_2486094_2486448_-	pfam05154, TM2, TM2 domain	NA|71aa|up_5|NC_010296.1_2486444_2486657_-	NA	NA|94aa|up_4|NC_010296.1_2486711_2486993_-	pfam10615, DUF2470, Protein of unknown function (DUF2470)	NA|178aa|up_3|NC_010296.1_2487721_2488255_-	pfam10615, DUF2470, Protein of unknown function (DUF2470)	NA|553aa|up_2|NC_010296.1_2488650_2490309_-	pfam14104, DUF4277, Domain of unknown function (DUF4277)	NA|423aa|up_1|NC_010296.1_2491677_2492946_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|1033aa|up_0|NC_010296.1_2494315_2497414_-	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|325aa|down_0|NC_010296.1_2499324_2500299_-	NA	NA|259aa|down_1|NC_010296.1_2500797_2501574_-	NA	NA|301aa|down_2|NC_010296.1_2502645_2503548_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|173aa|down_3|NC_010296.1_2503648_2504167_-	TIGR04353, hypothetical_protein_MarpuDRAFT_0194, PqqD family protein, HPr-rel-A system	NA|88aa|down_4|NC_010296.1_2504452_2504716_+	pfam00550, PP-binding, Phosphopantetheine attachment site	NA|599aa|down_5|NC_010296.1_2504778_2506575_+	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|600aa|down_6|NC_010296.1_2506642_2508442_+	TIGR02032, Uncharacterized_protein_MJ1520, geranylgeranyl reductase family	NA|1578aa|down_7|NC_010296.1_2508555_2513289_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|1608aa|down_8|NC_010296.1_2513353_2518177_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|978aa|down_9|NC_010296.1_2518395_2521329_+	cd05930, A_NRPS, The adenylation domain of nonribosomal peptide synthetases (NRPS)
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	17	2623111-2623214	15	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	CCCGCACAGTAAAATCGGACAAAATCAATTTCT	33	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|314aa|up_9|NC_010296.1_2602385_2603327_+,NA|343aa|up_7|NC_010296.1_2604643_2605672_+,NA|74aa|down_0|NC_010296.1_2623573_2623795_+,NA|256aa|down_2|NC_010296.1_2625686_2626454_-,NA|69aa|down_7|NC_010296.1_2632905_2633112_-,NA|78aa|down_8|NC_010296.1_2633083_2633317_-	NA|314aa|up_9|NC_010296.1_2602385_2603327_+	NA	NA|319aa|up_8|NC_010296.1_2603636_2604593_+	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|343aa|up_7|NC_010296.1_2604643_2605672_+	NA	NA|1060aa|up_6|NC_010296.1_2605803_2608983_+	NF033203, entero_EhxA, enterohemolysin EhxA	NA|386aa|up_5|NC_010296.1_2609482_2610640_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|562aa|up_4|NC_010296.1_2615787_2617473_+	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|287aa|up_3|NC_010296.1_2617787_2618648_+	sd00006, TPR, Tetratricopeptide repeat	NA|145aa|up_2|NC_010296.1_2618892_2619327_+	COG2193, Bfr, Bacterioferritin (cytochrome b1) [Inorganic ion transport and metabolism]	NA|307aa|up_1|NC_010296.1_2619361_2620282_+	TIGR00145, Uncharacterized_protein_slr0964, FTR1 family protein	NA|538aa|up_0|NC_010296.1_2620491_2622105_+	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|74aa|down_0|NC_010296.1_2623573_2623795_+	NA	NA|413aa|down_1|NC_010296.1_2624399_2625638_-	TIGR00675, Modification_methylase, DNA-methyltransferase (dcm)	NA|256aa|down_2|NC_010296.1_2625686_2626454_-	NA	NA|317aa|down_3|NC_010296.1_2627095_2628045_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|333aa|down_4|NC_010296.1_2629908_2630907_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|357aa|down_5|NC_010296.1_2631173_2632244_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|143aa|down_6|NC_010296.1_2632271_2632700_-	pfam06951, PLA2G12, Group XII secretory phospholipase A2 precursor (PLA2G12)	NA|69aa|down_7|NC_010296.1_2632905_2633112_-	NA	NA|78aa|down_8|NC_010296.1_2633083_2633317_-	NA	NA|195aa|down_9|NC_010296.1_2633378_2633963_-	cd06260, DUF820, Domain of unknown function (DUF820)
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	18	2662579-2662675	16	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	TCAGTAAACAGTAAACAGTAATCAG	25	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|161aa|up_7|NC_010296.1_2652026_2652509_-,NA|67aa|up_4|NC_010296.1_2655764_2655965_+,NA|139aa|down_0|NC_010296.1_2663160_2663577_-,NA|50aa|down_2|NC_010296.1_2665713_2665863_-	NA|845aa|up_9|NC_010296.1_2648279_2650814_+	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|143aa|up_8|NC_010296.1_2651324_2651753_+	cd03467, Rieske, Rieske domain; a [2Fe-2S] cluster binding domain commonly found in Rieske non-heme iron oxygenase (RO) systems such as naphthalene and biphenyl dioxygenases, as well as in plant/cyanobacterial chloroplast b6f and mitochondrial cytochrome bc(1) complexes	NA|161aa|up_7|NC_010296.1_2652026_2652509_-	NA	NA|881aa|up_6|NC_010296.1_2652753_2655396_+	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|64aa|up_5|NC_010296.1_2655557_2655749_+	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|67aa|up_4|NC_010296.1_2655764_2655965_+	NA	NA|85aa|up_3|NC_010296.1_2655922_2656177_-	pfam00145, DNA_methylase, C-5 cytosine-specific DNA methylase	NA|111aa|up_2|NC_010296.1_2656356_2656689_-	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|1535aa|up_1|NC_010296.1_2656757_2661362_-	PRK11750, gltB, glutamate synthase subunit alpha; Provisional	NA|296aa|up_0|NC_010296.1_2661668_2662556_-	COG0074, SucD, Succinyl-CoA synthetase, alpha subunit [Energy production and conversion]	NA|139aa|down_0|NC_010296.1_2663160_2663577_-	NA	NA|288aa|down_1|NC_010296.1_2663666_2664530_-	TIGR02069, cyanophycinase, cyanophycinase	NA|50aa|down_2|NC_010296.1_2665713_2665863_-	NA	NA|159aa|down_3|NC_010296.1_2665913_2666390_-	pfam05532, CsbD, CsbD-like	NA|61aa|down_4|NC_010296.1_2666682_2666865_-	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|152aa|down_5|NC_010296.1_2669847_2670303_-	pfam04972, BON, BON domain	NA|201aa|down_6|NC_010296.1_2670331_2670934_-	pfam11181, YflT, Heat induced stress protein YflT	NA|110aa|down_7|NC_010296.1_2671208_2671538_-	pfam13744, HTH_37, Helix-turn-helix domain	NA|120aa|down_8|NC_010296.1_2671534_2671894_-	COG4679, COG4679, Phage-related protein [Function unknown]	NA|79aa|down_9|NC_010296.1_2672079_2672316_-	TIGR02595, conserved_hypothetical_protein, PEP-CTERM protein-sorting domain
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	19	2784147-2784258	17	CRISPRCasFinder	no	WYL,cas3,csc2gr7,csc1gr5	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Type I-D	TTTCCACCTTTTCGAGTCGTTTATCCATCCC	31	0	0	NA	NA	NA	1	1	TypeI-D	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|58aa|up_6|NC_010296.1_2776683_2776857_+,NA|194aa|up_1|NC_010296.1_2782697_2783279_+,NA|223aa|down_2|NC_010296.1_2788393_2789062_+	NA|293aa|up_9|NC_010296.1_2771957_2772836_+	PRK00050, PRK00050, 16S rRNA (cytosine(1402)-N(4))-methyltransferase RsmH	NA|314aa|up_8|NC_010296.1_2773198_2774140_-	pfam13524, Glyco_trans_1_2, Glycosyl transferases group 1	NA|389aa|up_7|NC_010296.1_2774463_2775630_-	PRK05957, PRK05957, pyridoxal phosphate-dependent aminotransferase	NA|58aa|up_6|NC_010296.1_2776683_2776857_+	NA	NA|560aa|up_5|NC_010296.1_2776917_2778597_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|427aa|up_4|NC_010296.1_2778843_2780124_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|183aa|up_3|NC_010296.1_2781130_2781679_-	PRK00083, frr, ribosome recycling factor; Reviewed	NA|241aa|up_2|NC_010296.1_2781665_2782388_-	cd04254, AAK_UMPK-PyrH-Ec, UMP kinase (UMPK)-Ec, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway)	NA|194aa|up_1|NC_010296.1_2782697_2783279_+	NA	NA|152aa|up_0|NC_010296.1_2783500_2783956_-	PRK05395, PRK05395, type II 3-dehydroquinate dehydratase	NA|461aa|down_0|NC_010296.1_2784655_2786038_+	cd03480, Rieske_RO_Alpha_PaO, Rieske non-heme iron oxygenase (RO) family, Pheophorbide a oxygenase (PaO) subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; composed of the oxygenase alpha subunits of a small subfamily of enzymes found in plants as well as oxygenic cyanobacterial photosynthesizers including LLS1 (lethal leaf spot 1, also known as PaO) and ACD1 (accelerated cell death 1)	NA|708aa|down_1|NC_010296.1_2786060_2788184_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|223aa|down_2|NC_010296.1_2788393_2789062_+	NA	NA|390aa|down_3|NC_010296.1_2789431_2790601_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	WYL|288aa|down_4|NC_010296.1_2790669_2791533_-	pfam13280, WYL, WYL domain	cas3|722aa|down_5|NC_010296.1_2791632_2793798_+	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	NA|140aa|down_6|NC_010296.1_2793967_2794387_+	pfam18765, Polbeta, Polymerase beta, Nucleotidyltransferase	NA|133aa|down_7|NC_010296.1_2794428_2794827_+	pfam01934, DUF86, Protein of unknown function DUF86	NA|153aa|down_8|NC_010296.1_2794819_2795278_+	pfam18765, Polbeta, Polymerase beta, Nucleotidyltransferase	NA|145aa|down_9|NC_010296.1_2795281_2795716_+	pfam01934, DUF86, Protein of unknown function DUF86
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	20	2814769-2822884	1,18,3	PILER-CR,CRISPRCasFinder,CRT	no	csc2gr7,csc1gr5,WYL,cas8b5,cas7,cas5,cas3,cas6,cas4,cas1,cas2	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Type I-D	GTTCCAATTAATCTTAAACCCTATTAGGGATTGAAAC,GTTCCAATTAATCTTAAACCCTATTAGGGATTGAAAC,GTTCCAATTAATCTTAAACCCTATTAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	111,112,112	112	TypeI-D	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	cas7|298aa|up_8|NC_010296.1_2805433_2806327_+,cas5|275aa|up_7|NC_010296.1_2806330_2807155_+,NA|49aa|down_7|NC_010296.1_2834968_2835115_+	cas8b5|895aa|up_9|NC_010296.1_2802746_2805431_+	TIGR03319, RNase_Y, ribonuclease Y	cas7|298aa|up_8|NC_010296.1_2805433_2806327_+	NA	cas5|275aa|up_7|NC_010296.1_2806330_2807155_+	NA	cas3|908aa|up_6|NC_010296.1_2807147_2809871_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|189aa|up_5|NC_010296.1_2809918_2810485_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|189aa|up_4|NC_010296.1_2811003_2811570_+	cd06260, DUF820, Domain of unknown function (DUF820)	cas6|278aa|up_3|NC_010296.1_2811810_2812644_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|198aa|up_2|NC_010296.1_2812646_2813240_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas1|335aa|up_1|NC_010296.1_2813245_2814250_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|91aa|up_0|NC_010296.1_2814262_2814535_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|408aa|down_0|NC_010296.1_2824734_2825957_+	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|91aa|down_1|NC_010296.1_2829464_2829737_-	pfam06305, LapA_dom, Lipopolysaccharide assembly protein A domain	NA|133aa|down_2|NC_010296.1_2829902_2830301_-	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|310aa|down_3|NC_010296.1_2830388_2831318_-	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|144aa|down_4|NC_010296.1_2831641_2832073_+	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|406aa|down_5|NC_010296.1_2832133_2833351_-	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|493aa|down_6|NC_010296.1_2833548_2835027_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|49aa|down_7|NC_010296.1_2834968_2835115_+	NA	NA|294aa|down_8|NC_010296.1_2835149_2836031_+	sd00006, TPR, Tetratricopeptide repeat	NA|80aa|down_9|NC_010296.1_2838224_2838464_+	COG4456, VagC, Virulence-associated protein and related proteins [Function unknown]
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	21	2823032-2824595	4,2,19	CRT,PILER-CR,CRISPRCasFinder	no	cas7,cas5,cas3,cas6,cas4,cas1,cas2	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Unclear	GTTCCAATTAATCTTAAACCCTATTAGGGATTGAAAC,GTTCCAATTAATCTTAAACCCTATTAGGGATTGAAAC,GTTCCAATTAATCTTAAACCCTATTAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	21,19,20	21	Unclear	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	cas7|298aa|up_8|NC_010296.1_2805433_2806327_+,cas5|275aa|up_7|NC_010296.1_2806330_2807155_+,NA|49aa|down_7|NC_010296.1_2834968_2835115_+	cas8b5|895aa|up_9|NC_010296.1_2802746_2805431_+	TIGR03319, RNase_Y, ribonuclease Y	cas7|298aa|up_8|NC_010296.1_2805433_2806327_+	NA	cas5|275aa|up_7|NC_010296.1_2806330_2807155_+	NA	cas3|908aa|up_6|NC_010296.1_2807147_2809871_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|189aa|up_5|NC_010296.1_2809918_2810485_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|189aa|up_4|NC_010296.1_2811003_2811570_+	cd06260, DUF820, Domain of unknown function (DUF820)	cas6|278aa|up_3|NC_010296.1_2811810_2812644_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|198aa|up_2|NC_010296.1_2812646_2813240_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas1|335aa|up_1|NC_010296.1_2813245_2814250_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|91aa|up_0|NC_010296.1_2814262_2814535_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|408aa|down_0|NC_010296.1_2824734_2825957_+	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|91aa|down_1|NC_010296.1_2829464_2829737_-	pfam06305, LapA_dom, Lipopolysaccharide assembly protein A domain	NA|133aa|down_2|NC_010296.1_2829902_2830301_-	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|310aa|down_3|NC_010296.1_2830388_2831318_-	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|144aa|down_4|NC_010296.1_2831641_2832073_+	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|406aa|down_5|NC_010296.1_2832133_2833351_-	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|493aa|down_6|NC_010296.1_2833548_2835027_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|49aa|down_7|NC_010296.1_2834968_2835115_+	NA	NA|294aa|down_8|NC_010296.1_2835149_2836031_+	sd00006, TPR, Tetratricopeptide repeat	NA|80aa|down_9|NC_010296.1_2838224_2838464_+	COG4456, VagC, Virulence-associated protein and related proteins [Function unknown]
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	22	2826228-2829166	3,20,5	PILER-CR,CRISPRCasFinder,CRT	no	cas5,cas3,cas6,cas4,cas1,cas2	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Unclear	GTTCCAATTAATCTTAAACCCTATTAGGGATTGAAAC,GTTCCAATTAATCTTAAACCCTATTAGGGATTGAAAC,GTTCCAATTAATCTTAAACCCTATTAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	40,40,40	40	Unclear	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	cas7|298aa|up_9|NC_010296.1_2805433_2806327_+,cas5|275aa|up_8|NC_010296.1_2806330_2807155_+,NA|49aa|down_6|NC_010296.1_2834968_2835115_+	cas7|298aa|up_9|NC_010296.1_2805433_2806327_+	NA	cas5|275aa|up_8|NC_010296.1_2806330_2807155_+	NA	cas3|908aa|up_7|NC_010296.1_2807147_2809871_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|189aa|up_6|NC_010296.1_2809918_2810485_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|189aa|up_5|NC_010296.1_2811003_2811570_+	cd06260, DUF820, Domain of unknown function (DUF820)	cas6|278aa|up_4|NC_010296.1_2811810_2812644_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|198aa|up_3|NC_010296.1_2812646_2813240_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas1|335aa|up_2|NC_010296.1_2813245_2814250_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|91aa|up_1|NC_010296.1_2814262_2814535_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|408aa|up_0|NC_010296.1_2824734_2825957_+	pfam07592, DDE_Tnp_ISAZ013, Rhodopirellula transposase DDE domain	NA|91aa|down_0|NC_010296.1_2829464_2829737_-	pfam06305, LapA_dom, Lipopolysaccharide assembly protein A domain	NA|133aa|down_1|NC_010296.1_2829902_2830301_-	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|310aa|down_2|NC_010296.1_2830388_2831318_-	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|144aa|down_3|NC_010296.1_2831641_2832073_+	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|406aa|down_4|NC_010296.1_2832133_2833351_-	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|493aa|down_5|NC_010296.1_2833548_2835027_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|49aa|down_6|NC_010296.1_2834968_2835115_+	NA	NA|294aa|down_7|NC_010296.1_2835149_2836031_+	sd00006, TPR, Tetratricopeptide repeat	NA|80aa|down_8|NC_010296.1_2838224_2838464_+	COG4456, VagC, Virulence-associated protein and related proteins [Function unknown]	NA|143aa|down_9|NC_010296.1_2838473_2838902_+	cd18748, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	23	2869751-2869835	21	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	CTCATATTTTTAGGTTTTTTGTAG	24	1	48	2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811|2869775-2869811	NC_010296.1_9446-9482|NC_010296.1_485513-485477|NC_010296.1_497582-497546|NC_010296.1_542436-542400|NC_010296.1_667141-667105|NC_010296.1_821893-821857|NC_010296.1_825421-825385|NC_010296.1_993923-993959|NC_010296.1_1048739-1048703|NC_010296.1_1318381-1318345|NC_010296.1_1520731-1520767|NC_010296.1_1646488-1646452|NC_010296.1_1650078-1650042|NC_010296.1_1725458-1725494|NC_010296.1_2157520-2157484|NC_010296.1_2500548-2500512|NC_010296.1_2551489-2551453|NC_010296.1_2854099-2854063|NC_010296.1_2999853-2999817|NC_010296.1_3070735-3070699|NC_010296.1_3673120-3673156|NC_010296.1_3863889-3863925|NC_010296.1_3920740-3920704|NC_010296.1_4211874-4211910|NC_010296.1_4301777-4301813|NC_010296.1_4604768-4604804|NC_010296.1_4778841-4778877|NC_010296.1_4943938-4943974|NC_010296.1_5050052-5050016|NC_010296.1_5369102-5369138|NC_010296.1_5435823-5435787|NC_010296.1_5447675-5447639|NC_010296.1_309072-309036|NC_010296.1_347085-347049|NC_010296.1_587711-587675|NC_010296.1_1051072-1051108|NC_010296.1_1164280-1164244|NC_010296.1_1797311-1797347|NC_010296.1_2617681-2617717|NC_010296.1_3093968-3094004|NC_010296.1_3162674-3162710|NC_010296.1_3414059-3414095|NC_010296.1_3465592-3465628|NC_010296.1_3711893-3711857|NC_010296.1_3995719-3995683|NC_010296.1_4854252-4854216|NC_010296.1_995671-995707|NC_010296.1_2126113-2126077	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|99aa|up_8|NC_010296.1_2857153_2857450_+,NA|185aa|down_5|NC_010296.1_2874887_2875442_+,NA|260aa|down_9|NC_010296.1_2880574_2881354_+	NA|245aa|up_9|NC_010296.1_2856090_2856825_-	PRK09362, PRK09362, phosphoribosylaminoimidazole-succinocarboxamide synthase; Reviewed	NA|99aa|up_8|NC_010296.1_2857153_2857450_+	NA	NA|348aa|up_7|NC_010296.1_2857503_2858547_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|600aa|up_6|NC_010296.1_2858851_2860651_-	PRK12305, thrS, threonyl-tRNA synthetase; Reviewed	NA|307aa|up_5|NC_010296.1_2860785_2861706_+	cd01137, PsaA, Metal binding protein PsaA	NA|368aa|up_4|NC_010296.1_2862008_2863112_-	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|146aa|up_3|NC_010296.1_2863367_2863805_-	pfam13301, DUF4079, Protein of unknown function (DUF4079)	NA|879aa|up_2|NC_010296.1_2863809_2866446_+	COG0308, PepN, Aminopeptidase N [Amino acid transport and metabolism]	NA|204aa|up_1|NC_010296.1_2866583_2867195_-	pfam14103, DUF4276, Domain of unknown function (DUF4276)	NA|396aa|up_0|NC_010296.1_2867219_2868407_-	COG4637, COG4637, Predicted ATPase [General function prediction only]	NA|281aa|down_0|NC_010296.1_2870719_2871562_-	TIGR02139, permease_CysT, sulfate ABC transporter, permease protein CysT	NA|114aa|down_1|NC_010296.1_2871581_2871923_-	pfam09383, NIL, NIL domain	NA|348aa|down_2|NC_010296.1_2871934_2872978_-	cd01005, PBP2_CysP, Substrate binding domain of an active sulfate transporter, a member of the type 2 periplasmic binding fold superfamily	NA|260aa|down_3|NC_010296.1_2873148_2873928_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|227aa|down_4|NC_010296.1_2874148_2874829_-	TIGR02191, Ribonuclease_3, ribonuclease III, bacterial	NA|185aa|down_5|NC_010296.1_2874887_2875442_+	NA	NA|619aa|down_6|NC_010296.1_2875650_2877507_+	PRK12305, thrS, threonyl-tRNA synthetase; Reviewed	NA|344aa|down_7|NC_010296.1_2877620_2878652_+	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|364aa|down_8|NC_010296.1_2878829_2879921_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|260aa|down_9|NC_010296.1_2880574_2881354_+	NA
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	24	3148751-3148912	22	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	ACCCCCCTTATCAAGGGAGATCCCCCCGCCTACCGGCACCCCCCTTATCAAGGG	54	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|82aa|up_9|NC_010296.1_3136638_3136884_-,NA|241aa|up_8|NC_010296.1_3137230_3137953_+,NA|85aa|up_7|NC_010296.1_3138118_3138373_+,NA|88aa|up_4|NC_010296.1_3139443_3139707_-,NA|67aa|down_1|NC_010296.1_3151025_3151226_+,NA|64aa|down_3|NC_010296.1_3152197_3152389_+,NA|60aa|down_8|NC_010296.1_3154170_3154350_-	NA|82aa|up_9|NC_010296.1_3136638_3136884_-	NA	NA|241aa|up_8|NC_010296.1_3137230_3137953_+	NA	NA|85aa|up_7|NC_010296.1_3138118_3138373_+	NA	NA|112aa|up_6|NC_010296.1_3138373_3138709_+	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|156aa|up_5|NC_010296.1_3138975_3139443_-	cd09874, PIN_MT3492-like, VapC-like PIN domain of the hypothetical protein MT3492 of Mycobacterium tuberculosis CDC1551 and other uncharacterized, annotated PilT protein domain proteins	NA|88aa|up_4|NC_010296.1_3139443_3139707_-	NA	NA|346aa|up_3|NC_010296.1_3140534_3141572_-	PLN02498, PLN02498, omega-3 fatty acid desaturase	NA|347aa|up_2|NC_010296.1_3141870_3142911_+	TIGR00675, Modification_methylase, DNA-methyltransferase (dcm)	NA|351aa|up_1|NC_010296.1_3144106_3145158_-	pfam06782, UPF0236, Uncharacterized protein family (UPF0236)	NA|89aa|up_0|NC_010296.1_3146824_3147091_-	pfam13529, Peptidase_C39_2, Peptidase_C39 like family	NA|450aa|down_0|NC_010296.1_3149130_3150480_-	pfam01637, ATPase_2, ATPase domain predominantly from Archaea	NA|67aa|down_1|NC_010296.1_3151025_3151226_+	NA	NA|331aa|down_2|NC_010296.1_3151164_3152157_+	COG3596, COG3596, Predicted GTPase [General function prediction only]	NA|64aa|down_3|NC_010296.1_3152197_3152389_+	NA	NA|112aa|down_4|NC_010296.1_3152459_3152795_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|108aa|down_5|NC_010296.1_3152778_3153102_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|80aa|down_6|NC_010296.1_3153372_3153612_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|165aa|down_7|NC_010296.1_3153605_3154100_+	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|60aa|down_8|NC_010296.1_3154170_3154350_-	NA	NA|83aa|down_9|NC_010296.1_3154991_3155240_+	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	25	3168657-3168819	23	CRISPRCasFinder	no	cas14j	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Unclear	AACTGTGGTTTGGATATACCCAAGCTACAAGCTTTGTTGGAAGAAATCGAAC	52	0	0	NA	NA	NA	1	1	TypeV	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|123aa|up_7|NC_010296.1_3161269_3161638_+,NA|147aa|up_1|NC_010296.1_3167020_3167461_-,NA|85aa|down_2|NC_010296.1_3172756_3173011_-,NA|51aa|down_9|NC_010296.1_3180946_3181099_+	NA|807aa|up_9|NC_010296.1_3156761_3159182_-	PRK05261, PRK05261, phosphoketolase	NA|345aa|up_8|NC_010296.1_3160059_3161094_+	COG0057, GapA, Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase [Carbohydrate transport and metabolism]	NA|123aa|up_7|NC_010296.1_3161269_3161638_+	NA	NA|338aa|up_6|NC_010296.1_3161634_3162648_+	COG0354, COG0354, Predicted aminomethyltransferase related to GcvT [General function prediction only]	NA|553aa|up_5|NC_010296.1_3162936_3164595_-	PRK13981, PRK13981, NAD synthetase; Provisional	NA|243aa|up_4|NC_010296.1_3164607_3165336_-	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|189aa|up_3|NC_010296.1_3165332_3165899_-	PRK00071, nadD, nicotinate-nucleotide adenylyltransferase	NA|197aa|up_2|NC_010296.1_3166430_3167021_-	pfam08808, RES, RES domain	NA|147aa|up_1|NC_010296.1_3167020_3167461_-	NA	NA|80aa|up_0|NC_010296.1_3167518_3167758_-	pfam08865, DUF1830, Domain of unknown function (DUF1830)	NA|42aa|down_0|NC_010296.1_3171581_3171707_-	pfam06769, YoeB_toxin, YoeB-like toxin of bacterial type II toxin-antitoxin system	NA|204aa|down_1|NC_010296.1_3172027_3172639_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|85aa|down_2|NC_010296.1_3172756_3173011_-	NA	NA|569aa|down_3|NC_010296.1_3173007_3174714_-	COG1226, Kch, Kef-type K+ transport systems, predicted NAD-binding component [Inorganic ion transport and metabolism]	NA|315aa|down_4|NC_010296.1_3174864_3175809_+	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|693aa|down_5|NC_010296.1_3176087_3178166_+	COG1523, PulA, Type II secretory pathway, pullulanase PulA and related glycosidases [Carbohydrate transport and metabolism]	NA|254aa|down_6|NC_010296.1_3178281_3179043_+	pfam13026, DUF3887, Protein of unknown function (DUF3887)	NA|433aa|down_7|NC_010296.1_3179089_3180388_+	PRK00077, eno, enolase; Provisional	NA|121aa|down_8|NC_010296.1_3180592_3180955_-	pfam04020, Phage_holin_4_2, Mycobacterial 4 TMS phage holin, superfamily IV	NA|51aa|down_9|NC_010296.1_3180946_3181099_+	NA
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	26	4229997-4230132	24	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	AATCATCTGTAGGGTGGGTTAGGCAAAAATAAGTTATACTCTGACT	46	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|47aa|up_6|NC_010296.1_4220163_4220304_-,NA|109aa|down_4|NC_010296.1_4233898_4234225_+	NA|206aa|up_9|NC_010296.1_4216532_4217150_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|180aa|up_8|NC_010296.1_4217219_4217759_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|539aa|up_7|NC_010296.1_4218283_4219900_+	cd08519, PBP2_NikA_DppA_OppA_like_20, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|47aa|up_6|NC_010296.1_4220163_4220304_-	NA	NA|261aa|up_5|NC_010296.1_4220317_4221100_+	cd09083, EEP-1, Exonuclease-Endonuclease-Phosphatase domain; uncharacterized family 1	NA|178aa|up_4|NC_010296.1_4221425_4221959_-	PRK05205, PRK05205, bifunctional pyr operon transcriptional regulator/uracil phosphoribosyltransferase PyrR	NA|466aa|up_3|NC_010296.1_4222103_4223501_+	COG0154, GatA, Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases [Translation, ribosomal structure and biogenesis]	NA|625aa|up_2|NC_010296.1_4223632_4225507_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|417aa|up_1|NC_010296.1_4226096_4227347_-	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|774aa|up_0|NC_010296.1_4227672_4229994_+	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|144aa|down_0|NC_010296.1_4230459_4230891_+	pfam14159, CAAD, CAAD domains of cyanobacterial aminoacyl-tRNA synthetase	NA|391aa|down_1|NC_010296.1_4231303_4232476_-	cd17774, CBS_two-component_sensor_histidine_kinase_repeat2, 2 tandem repeats of the CBS domain in the two-component sensor histidine kinase and related-proteins, repeat 2	NA|152aa|down_2|NC_010296.1_4232699_4233155_-	pfam07508, Recombinase, Recombinase	NA|108aa|down_3|NC_010296.1_4233494_4233818_+	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|109aa|down_4|NC_010296.1_4233898_4234225_+	NA	NA|206aa|down_5|NC_010296.1_4234275_4234893_+	cd04630, CBS_pair_bac, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains present in bacteria	NA|104aa|down_6|NC_010296.1_4235817_4236129_+	PRK00364, groES, co-chaperonin GroES; Reviewed	NA|542aa|down_7|NC_010296.1_4236175_4237801_+	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|499aa|down_8|NC_010296.1_4238132_4239629_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|638aa|down_9|NC_010296.1_4239932_4241846_+	cd15832, SNAP, Soluble N-ethylmaleimide-sensitive factor (NSF) Attachment Protein family
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	27	4275989-4276059	25	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	CCCCCCTTGATAAGGGGGGTGCC	23	1	65	4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036|4276012-4276036	NC_010296.1_4223592-4223568|NC_010296.1_5795384-5795360|NC_010296.1_225128-225152|NC_010296.1_1074608-1074632|NC_010296.1_1389909-1389885|NC_010296.1_1661248-1661224|NC_010296.1_1797597-1797573|NC_010296.1_1975551-1975575|NC_010296.1_2151613-2151589|NC_010296.1_2190689-2190713|NC_010296.1_2190726-2190750|NC_010296.1_2354142-2354166|NC_010296.1_2577428-2577452|NC_010296.1_2855268-2855244|NC_010296.1_3091312-3091288|NC_010296.1_3217869-3217893|NC_010296.1_3361991-3362015|NC_010296.1_3543015-3542991|NC_010296.1_3845420-3845444|NC_010296.1_3845457-3845481|NC_010296.1_3845494-3845518|NC_010296.1_3845531-3845555|NC_010296.1_3915514-3915538|NC_010296.1_4239785-4239761|NC_010296.1_4698692-4698668|NC_010296.1_4767142-4767166|NC_010296.1_4826410-4826386|NC_010296.1_4977664-4977640|NC_010296.1_5083874-5083850|NC_010296.1_5364283-5364307|NC_010296.1_5474734-5474710|NC_010296.1_5646751-5646727|NC_010296.1_5646787-5646763|NC_010296.1_5682691-5682715|NC_010296.1_5733077-5733101|NC_010296.1_151400-151424|NC_010296.1_237846-237822|NC_010296.1_403691-403667|NC_010296.1_404249-404273|NC_010296.1_700515-700491|NC_010296.1_1370690-1370666|NC_010296.1_1761580-1761604|NC_010296.1_1797560-1797536|NC_010296.1_1797634-1797610|NC_010296.1_1979031-1979055|NC_010296.1_2289926-2289950|NC_010296.1_2354105-2354129|NC_010296.1_2407145-2407121|NC_010296.1_2569745-2569721|NC_010296.1_2622251-2622227|NC_010296.1_2778657-2778681|NC_010296.1_3005449-3005425|NC_010296.1_3005481-3005457|NC_010296.1_3541376-3541352|NC_010296.1_3541413-3541389|NC_010296.1_3541450-3541426|NC_010296.1_3541487-3541463|NC_010296.1_4211805-4211781|NC_010296.1_4993780-4993756|NC_010296.1_5277949-5277973|NC_010296.1_5354023-5354047|NC_010296.1_5364246-5364270|NC_010296.1_5514512-5514536|NC_010296.1_5514549-5514573|NC_010296.1_5823419-5823443	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|661aa|up_9|NC_010296.1_4263882_4265865_-,NA|263aa|down_2|NC_010296.1_4278458_4279247_-	NA|661aa|up_9|NC_010296.1_4263882_4265865_-	NA	NA|355aa|up_8|NC_010296.1_4265892_4266957_-	cd05258, CDP_TE_SDR_e, CDP-tyvelose 2-epimerase, extended (e) SDRs	NA|424aa|up_7|NC_010296.1_4266984_4268256_-	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|405aa|up_6|NC_010296.1_4268305_4269520_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|508aa|up_5|NC_010296.1_4269725_4271249_-	COG1928, PMT1, Dolichyl-phosphate-mannose--protein O-mannosyl transferase [Posttranslational modification, protein turnover, chaperones]	NA|356aa|up_4|NC_010296.1_4271325_4272393_-	cd05258, CDP_TE_SDR_e, CDP-tyvelose 2-epimerase, extended (e) SDRs	NA|260aa|up_3|NC_010296.1_4272463_4273243_-	PRK13331, PRK13331, pantothenate kinase; Reviewed	NA|278aa|up_2|NC_010296.1_4273346_4274180_-	COG1408, COG1408, Predicted phosphohydrolases [General function prediction only]	NA|412aa|up_1|NC_010296.1_4274291_4275527_-	PRK07590, PRK07590, L,L-diaminopimelate aminotransferase; Validated	NA|71aa|up_0|NC_010296.1_4275680_4275893_+	pfam10999, DUF2839, Protein of unknown function (DUF2839)	NA|314aa|down_0|NC_010296.1_4276112_4277054_-	PLN00016, PLN00016, RNA-binding protein; Provisional	NA|152aa|down_1|NC_010296.1_4277994_4278450_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|263aa|down_2|NC_010296.1_4278458_4279247_-	NA	NA|169aa|down_3|NC_010296.1_4279477_4279984_-	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|71aa|down_4|NC_010296.1_4280037_4280250_-	TIGR04216, halo_surf_glyco, major cell surface glycoprotein	NA|175aa|down_5|NC_010296.1_4280369_4280894_-	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|110aa|down_6|NC_010296.1_4281529_4281859_-	pfam10779, XhlA, Haemolysin XhlA	NA|144aa|down_7|NC_010296.1_4281943_4282375_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|144aa|down_8|NC_010296.1_4282461_4282893_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|133aa|down_9|NC_010296.1_4282976_4283375_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	28	4460210-4460329	26	CRISPRCasFinder	no	RT	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Unclear	GAGTGCGATGCCAAAGGCACTGCGTAGCAG	30	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|130aa|up_9|NC_010296.1_4452722_4453112_+,NA|130aa|up_8|NC_010296.1_4453125_4453515_+,NA|76aa|up_4|NC_010296.1_4456173_4456401_+,NA|73aa|up_2|NC_010296.1_4458796_4459015_-,NA|99aa|down_1|NC_010296.1_4465600_4465897_-	NA|130aa|up_9|NC_010296.1_4452722_4453112_+	NA	NA|130aa|up_8|NC_010296.1_4453125_4453515_+	NA	NA|217aa|up_7|NC_010296.1_4453645_4454296_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|300aa|up_6|NC_010296.1_4454468_4455368_+	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|158aa|up_5|NC_010296.1_4455697_4456171_+	cd18687, PIN_VapC-like, uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|76aa|up_4|NC_010296.1_4456173_4456401_+	NA	NA|131aa|up_3|NC_010296.1_4458411_4458804_-	cd00303, retropepsin_like, Retropepsins; pepsin-like aspartate proteases	NA|73aa|up_2|NC_010296.1_4458796_4459015_-	NA	NA|75aa|up_1|NC_010296.1_4459206_4459431_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|66aa|up_0|NC_010296.1_4459798_4459996_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|458aa|down_0|NC_010296.1_4464142_4465516_+	PRK05291, trmE, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis GTPase MnmE	NA|99aa|down_1|NC_010296.1_4465600_4465897_-	NA	NA|103aa|down_2|NC_010296.1_4466009_4466318_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|92aa|down_3|NC_010296.1_4466431_4466707_-	pfam10779, XhlA, Haemolysin XhlA	NA|92aa|down_4|NC_010296.1_4466820_4467096_-	pfam10779, XhlA, Haemolysin XhlA	RT|587aa|down_5|NC_010296.1_4468887_4470648_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|399aa|down_6|NC_010296.1_4471392_4472589_+	PRK00053, alr, alanine racemase; Reviewed	NA|435aa|down_7|NC_010296.1_4472802_4474107_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|664aa|down_8|NC_010296.1_4474235_4476227_-	COG0661, AarF, Predicted unusual protein kinase [General function prediction only]	NA|364aa|down_9|NC_010296.1_4476348_4477440_-	cd17507, GT28_Beta-DGS-like, beta-diglucosyldiacylglycerol synthase and similar proteins
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	29	4595670-4595967	27	CRISPRCasFinder	no	cas14j	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Unclear	CTCTACGGCGGTAACGGGAACGAT	24	0	0	NA	NA	NA	4	4	TypeV	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|121aa|up_8|NC_010296.1_4588121_4588484_+,NA|188aa|up_0|NC_010296.1_4594830_4595394_+,NA	NA|147aa|up_9|NC_010296.1_4587056_4587497_+	TIGR03042, hypothetical_protein, photosystem II protein PsbQ	NA|121aa|up_8|NC_010296.1_4588121_4588484_+	NA	NA|250aa|up_7|NC_010296.1_4588542_4589292_+	CHL00046, atpI, ATP synthase CF0 A subunit	NA|82aa|up_6|NC_010296.1_4589522_4589768_+	CHL00061, atpH, ATP synthase CF0 C subunit	NA|144aa|up_5|NC_010296.1_4589998_4590430_+	PRK07353, PRK07353, F0F1 ATP synthase subunit B'; Validated	NA|181aa|up_4|NC_010296.1_4590563_4591106_+	PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated	NA|183aa|up_3|NC_010296.1_4591106_4591655_+	PRK05758, PRK05758, F0F1 ATP synthase subunit delta; Validated	NA|503aa|up_2|NC_010296.1_4591723_4593232_+	PRK09281, PRK09281, F0F1 ATP synthase subunit alpha; Validated	NA|316aa|up_1|NC_010296.1_4593427_4594375_+	PRK05621, PRK05621, F0F1 ATP synthase subunit gamma; Validated	NA|188aa|up_0|NC_010296.1_4594830_4595394_+	NA	NA|202aa|down_0|NC_010296.1_4601003_4601609_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|426aa|down_1|NC_010296.1_4601798_4603076_-	PRK05476, PRK05476, S-adenosyl-L-homocysteine hydrolase; Provisional	NA|396aa|down_2|NC_010296.1_4603319_4604507_+	COG2856, COG2856, Predicted Zn peptidase [Amino acid transport and metabolism]	NA|224aa|down_3|NC_010296.1_4605159_4605831_-	PRK09289, PRK09289, riboflavin synthase	NA|432aa|down_4|NC_010296.1_4605846_4607142_-	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|391aa|down_5|NC_010296.1_4607476_4608649_-	PRK05447, PRK05447, 1-deoxy-D-xylulose 5-phosphate reductoisomerase; Provisional	NA|278aa|down_6|NC_010296.1_4609316_4610150_-	COG1398, OLE1, Fatty-acid desaturase [Lipid metabolism]	NA|408aa|down_7|NC_010296.1_4610269_4611493_+	pfam06838, Met_gamma_lyase, Methionine gamma-lyase	NA|272aa|down_8|NC_010296.1_4611939_4612755_+	cd03401, SPFH_prohibitin, Prohibitin family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|421aa|down_9|NC_010296.1_4613072_4614335_+	cd01465, vWA_subgroup, VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	30	4596180-4596338	28	CRISPRCasFinder	no	cas14j	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Unclear	CTCTACGGCGGTAACGGGAACGAT	24	0	0	NA	NA	NA	2	2	TypeV	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|121aa|up_8|NC_010296.1_4588121_4588484_+,NA|188aa|up_0|NC_010296.1_4594830_4595394_+,NA	NA|147aa|up_9|NC_010296.1_4587056_4587497_+	TIGR03042, hypothetical_protein, photosystem II protein PsbQ	NA|121aa|up_8|NC_010296.1_4588121_4588484_+	NA	NA|250aa|up_7|NC_010296.1_4588542_4589292_+	CHL00046, atpI, ATP synthase CF0 A subunit	NA|82aa|up_6|NC_010296.1_4589522_4589768_+	CHL00061, atpH, ATP synthase CF0 C subunit	NA|144aa|up_5|NC_010296.1_4589998_4590430_+	PRK07353, PRK07353, F0F1 ATP synthase subunit B'; Validated	NA|181aa|up_4|NC_010296.1_4590563_4591106_+	PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated	NA|183aa|up_3|NC_010296.1_4591106_4591655_+	PRK05758, PRK05758, F0F1 ATP synthase subunit delta; Validated	NA|503aa|up_2|NC_010296.1_4591723_4593232_+	PRK09281, PRK09281, F0F1 ATP synthase subunit alpha; Validated	NA|316aa|up_1|NC_010296.1_4593427_4594375_+	PRK05621, PRK05621, F0F1 ATP synthase subunit gamma; Validated	NA|188aa|up_0|NC_010296.1_4594830_4595394_+	NA	NA|202aa|down_0|NC_010296.1_4601003_4601609_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|426aa|down_1|NC_010296.1_4601798_4603076_-	PRK05476, PRK05476, S-adenosyl-L-homocysteine hydrolase; Provisional	NA|396aa|down_2|NC_010296.1_4603319_4604507_+	COG2856, COG2856, Predicted Zn peptidase [Amino acid transport and metabolism]	NA|224aa|down_3|NC_010296.1_4605159_4605831_-	PRK09289, PRK09289, riboflavin synthase	NA|432aa|down_4|NC_010296.1_4605846_4607142_-	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|391aa|down_5|NC_010296.1_4607476_4608649_-	PRK05447, PRK05447, 1-deoxy-D-xylulose 5-phosphate reductoisomerase; Provisional	NA|278aa|down_6|NC_010296.1_4609316_4610150_-	COG1398, OLE1, Fatty-acid desaturase [Lipid metabolism]	NA|408aa|down_7|NC_010296.1_4610269_4611493_+	pfam06838, Met_gamma_lyase, Methionine gamma-lyase	NA|272aa|down_8|NC_010296.1_4611939_4612755_+	cd03401, SPFH_prohibitin, Prohibitin family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|421aa|down_9|NC_010296.1_4613072_4614335_+	cd01465, vWA_subgroup, VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	31	4808313-4808414	29	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	TCATGACTGAGATATGTCTGAGTTTTTTGCGATAAAA	37	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|99aa|up_0|NC_010296.1_4807946_4808243_-,NA|167aa|down_4|NC_010296.1_4812764_4813265_+,NA|68aa|down_6|NC_010296.1_4815529_4815733_-	NA|156aa|up_9|NC_010296.1_4787183_4787651_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|783aa|up_8|NC_010296.1_4787647_4789996_-	COG1480, COG1480, Predicted membrane-associated HD superfamily hydrolase [General function prediction only]	NA|405aa|up_7|NC_010296.1_4790256_4791471_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|395aa|up_6|NC_010296.1_4795421_4796606_-	PRK07415, PRK07415, NAD(P)H-quinone oxidoreductase subunit H; Validated	NA|990aa|up_5|NC_010296.1_4798214_4801184_+	cd07124, ALDH_PutA-P5CDH-RocA, Delta(1)-pyrroline-5-carboxylate dehydrogenase, RocA	NA|545aa|up_4|NC_010296.1_4801628_4803263_+	cd03085, PGM1, Phosphoglucomutase 1 (PGM1) catalyzes the bidirectional interconversion of glucose-1-phosphate (G-1-P) and glucose-6-phosphate (G-6-P) via a glucose 1,6-diphosphate intermediate, an important metabolic step in prokaryotes and eukaryotes	NA|137aa|up_3|NC_010296.1_4803549_4803960_-	cd07177, terB_like, tellurium resistance terB-like protein	NA|451aa|up_2|NC_010296.1_4803949_4805302_-	PRK14901, PRK14901, 16S rRNA methyltransferase B; Provisional	NA|233aa|up_1|NC_010296.1_4805298_4805997_-	COG0400, COG0400, Predicted esterase [General function prediction only]	NA|99aa|up_0|NC_010296.1_4807946_4808243_-	NA	NA|419aa|down_0|NC_010296.1_4808442_4809699_+	pfam14516, AAA_35, AAA-like domain	NA|374aa|down_1|NC_010296.1_4809875_4810997_+	pfam14516, AAA_35, AAA-like domain	NA|277aa|down_2|NC_010296.1_4811453_4812284_-	pfam02517, Abi, CAAX protease self-immunity	NA|96aa|down_3|NC_010296.1_4812276_4812564_-	PRK00033, clpS, ATP-dependent Clp protease adaptor protein ClpS; Reviewed	NA|167aa|down_4|NC_010296.1_4812764_4813265_+	NA	NA|718aa|down_5|NC_010296.1_4813288_4815442_+	pfam13424, TPR_12, Tetratricopeptide repeat	NA|68aa|down_6|NC_010296.1_4815529_4815733_-	NA	NA|78aa|down_7|NC_010296.1_4815893_4816127_-	CHL00136, rpl31, ribosomal protein L31; Validated	NA|138aa|down_8|NC_010296.1_4816146_4816560_-	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|152aa|down_9|NC_010296.1_4816564_4817020_-	PRK09216, rplM, 50S ribosomal protein L13; Reviewed
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	32	5076925-5077097	30	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	CCCCCTTGATAAGGGGGGTGCCGAT	25	2	27	5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5076950-5076998|5077024-5077072|5077024-5077072|5077024-5077072|5077024-5077072|5077024-5077072|5077024-5077072|5077024-5077072|5077024-5077072|5077024-5077072|5077024-5077072	NC_010296.1_1443358-1443310|NC_010296.1_1443395-1443347|NC_010296.1_1443432-1443384|NC_010296.1_1443469-1443421|NC_010296.1_1443506-1443458|NC_010296.1_3938448-3938496|NC_010296.1_3938485-3938533|NC_010296.1_4290672-4290624|NC_010296.1_4290709-4290661|NC_010296.1_4290746-4290698|NC_010296.1_4290783-4290735|NC_010296.1_1786572-1786524|NC_010296.1_1786609-1786561|NC_010296.1_4972351-4972399|NC_010296.1_4972388-4972436|NC_010296.1_4972277-4972325|NC_010296.1_4972314-4972362|NC_010296.1_3845423-3845471|NC_010296.1_3845460-3845508|NC_010296.1_3845497-3845545|NC_010296.1_2190692-2190740|NC_010296.1_2354108-2354156|NC_010296.1_3541410-3541362|NC_010296.1_3541447-3541399|NC_010296.1_3541484-3541436|NC_010296.1_5277952-5278000|NC_010296.1_5514515-5514563	NA	2	2	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|231aa|up_9|NC_010296.1_5062784_5063477_-,NA|141aa|up_6|NC_010296.1_5069689_5070112_+,NA|92aa|up_5|NC_010296.1_5070228_5070504_+,NA|78aa|up_4|NC_010296.1_5071086_5071320_+,NA|180aa|down_4|NC_010296.1_5084695_5085235_-,NA|85aa|down_6|NC_010296.1_5087905_5088160_+,NA|131aa|down_7|NC_010296.1_5088509_5088902_-	NA|231aa|up_9|NC_010296.1_5062784_5063477_-	NA	NA|1386aa|up_8|NC_010296.1_5063690_5067848_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|472aa|up_7|NC_010296.1_5067897_5069313_-	pfam14516, AAA_35, AAA-like domain	NA|141aa|up_6|NC_010296.1_5069689_5070112_+	NA	NA|92aa|up_5|NC_010296.1_5070228_5070504_+	NA	NA|78aa|up_4|NC_010296.1_5071086_5071320_+	NA	NA|165aa|up_3|NC_010296.1_5071342_5071837_+	COG2236, COG2236, Predicted phosphoribosyltransferases [General function prediction only]	NA|636aa|up_2|NC_010296.1_5072170_5074078_+	PRK05192, PRK05192, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis enzyme MnmG	NA|470aa|up_1|NC_010296.1_5074260_5075670_-	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|357aa|up_0|NC_010296.1_5075753_5076824_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|283aa|down_0|NC_010296.1_5080676_5081525_-	pfam01710, HTH_Tnp_IS630, Transposase	NA|136aa|down_1|NC_010296.1_5082766_5083174_-	COG3654, Doc, Prophage maintenance system killer protein [General function prediction only]	NA|74aa|down_2|NC_010296.1_5083178_5083400_-	TIGR02609, hypothetical_protein_XAC1195, putative addiction module antidote	NA|84aa|down_3|NC_010296.1_5083469_5083721_-	pfam10049, DUF2283, Protein of unknown function (DUF2283)	NA|180aa|down_4|NC_010296.1_5084695_5085235_-	NA	NA|503aa|down_5|NC_010296.1_5086155_5087664_+	NF033203, entero_EhxA, enterohemolysin EhxA	NA|85aa|down_6|NC_010296.1_5087905_5088160_+	NA	NA|131aa|down_7|NC_010296.1_5088509_5088902_-	NA	NA|146aa|down_8|NC_010296.1_5090203_5090641_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|491aa|down_9|NC_010296.1_5090755_5092228_-	cd06176, MFS_BCD_PucC-like, Bacteriochlorophyll delivery (BCD) family, also called PucC family, of the Major Facilitator Superfamily
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	33	5174081-5174180	31	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	CAAGTTTGACAGCCTGTCTTTTGACAA	27	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|78aa|up_8|NC_010296.1_5165783_5166017_-,NA|76aa|up_7|NC_010296.1_5166208_5166436_-,NA|111aa|up_4|NC_010296.1_5167914_5168247_-,NA|104aa|up_1|NC_010296.1_5173306_5173618_-,NA|83aa|down_2|NC_010296.1_5176683_5176932_+,NA|65aa|down_3|NC_010296.1_5176899_5177094_-,NA|124aa|down_4|NC_010296.1_5177793_5178165_-,NA|307aa|down_6|NC_010296.1_5181529_5182450_-	NA|86aa|up_9|NC_010296.1_5165519_5165777_-	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|78aa|up_8|NC_010296.1_5165783_5166017_-	NA	NA|76aa|up_7|NC_010296.1_5166208_5166436_-	NA	NA|112aa|up_6|NC_010296.1_5167168_5167504_-	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|81aa|up_5|NC_010296.1_5167661_5167904_-	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|111aa|up_4|NC_010296.1_5167914_5168247_-	NA	NA|962aa|up_3|NC_010296.1_5168248_5171134_-	sd00006, TPR, Tetratricopeptide repeat	NA|576aa|up_2|NC_010296.1_5171567_5173295_-	PRK05945, sdhA, succinate dehydrogenase/fumarate reductase flavoprotein subunit	NA|104aa|up_1|NC_010296.1_5173306_5173618_-	NA	NA|126aa|up_0|NC_010296.1_5173671_5174049_-	PRK02710, PRK02710, plastocyanin; Provisional	NA|107aa|down_0|NC_010296.1_5174241_5174562_+	PRK13697, PRK13697, cytochrome c6; Provisional	NA|560aa|down_1|NC_010296.1_5174678_5176358_+	pfam11832, DUF3352, Protein of unknown function (DUF3352)	NA|83aa|down_2|NC_010296.1_5176683_5176932_+	NA	NA|65aa|down_3|NC_010296.1_5176899_5177094_-	NA	NA|124aa|down_4|NC_010296.1_5177793_5178165_-	NA	NA|499aa|down_5|NC_010296.1_5179730_5181227_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|307aa|down_6|NC_010296.1_5181529_5182450_-	NA	NA|268aa|down_7|NC_010296.1_5182526_5183330_-	COG3442, COG3442, Predicted glutamine amidotransferase [General function prediction only]	NA|446aa|down_8|NC_010296.1_5183518_5184856_-	COG0769, MurE, UDP-N-acetylmuramyl tripeptide synthase [Cell envelope biogenesis, outer membrane]	NA|397aa|down_9|NC_010296.1_5185019_5186210_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	34	5383058-5383144	32	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	TTACCGCGTTCTATCCCTCGTAACTCC	27	1	12	5383085-5383117|5383085-5383117|5383085-5383117|5383085-5383117|5383085-5383117|5383085-5383117|5383085-5383117|5383085-5383117|5383085-5383117|5383085-5383117|5383085-5383117|5383085-5383117	NC_010296.1_83017-83049|NC_010296.1_5663252-5663284|NC_010296.1_310007-310039|NC_010296.1_2605473-2605441|NC_010296.1_5665885-5665917|NC_010296.1_309995-310027|NC_010296.1_2605485-2605453|NC_010296.1_3837050-3837018|NC_010296.1_3837062-3837030|NC_010296.1_3837074-3837042|NC_010296.1_5603455-5603487|NC_010296.1_5663180-5663212	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|273aa|up_9|NC_010296.1_5368106_5368925_-,NA|47aa|up_6|NC_010296.1_5372079_5372220_+,NA|320aa|up_1|NC_010296.1_5381407_5382367_-,NA|348aa|down_1|NC_010296.1_5384357_5385401_-,NA|172aa|down_3|NC_010296.1_5385881_5386397_-,NA|79aa|down_4|NC_010296.1_5386433_5386670_-,NA|63aa|down_9|NC_010296.1_5390282_5390471_+	NA|273aa|up_9|NC_010296.1_5368106_5368925_-	NA	NA|388aa|up_8|NC_010296.1_5369290_5370454_+	PRK14012, PRK14012, IscS subfamily cysteine desulfurase	NA|491aa|up_7|NC_010296.1_5370516_5371989_-	PRK06473, PRK06473, NADH-quinone oxidoreductase subunit M	NA|47aa|up_6|NC_010296.1_5372079_5372220_+	NA	NA|458aa|up_5|NC_010296.1_5372198_5373572_-	PRK07445, PRK07445, O-succinylbenzoic acid--CoA ligase; Reviewed	NA|322aa|up_4|NC_010296.1_5373547_5374513_-	PRK02714, PRK02714, o-succinylbenzoate synthase	NA|348aa|up_3|NC_010296.1_5374841_5375885_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|393aa|up_2|NC_010296.1_5376102_5377280_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|320aa|up_1|NC_010296.1_5381407_5382367_-	NA	NA|116aa|up_0|NC_010296.1_5382363_5382711_-	pfam14252, DUF4347, Domain of unknown function (DUF4347)	NA|116aa|down_0|NC_010296.1_5383874_5384222_-	pfam14252, DUF4347, Domain of unknown function (DUF4347)	NA|348aa|down_1|NC_010296.1_5384357_5385401_-	NA	NA|97aa|down_2|NC_010296.1_5385455_5385746_-	pfam14252, DUF4347, Domain of unknown function (DUF4347)	NA|172aa|down_3|NC_010296.1_5385881_5386397_-	NA	NA|79aa|down_4|NC_010296.1_5386433_5386670_-	NA	NA|332aa|down_5|NC_010296.1_5386865_5387861_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|71aa|down_6|NC_010296.1_5388240_5388453_+	pfam10047, DUF2281, Protein of unknown function (DUF2281)	NA|145aa|down_7|NC_010296.1_5388443_5388878_+	pfam01850, PIN, PIN domain	NA|336aa|down_8|NC_010296.1_5389028_5390036_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|63aa|down_9|NC_010296.1_5390282_5390471_+	NA
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	35	5384581-5384667	33	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	TTACCGCGTTCTATCCCTCGTAACTCC	27	1	12	5384608-5384640|5384608-5384640|5384608-5384640|5384608-5384640|5384608-5384640|5384608-5384640|5384608-5384640|5384608-5384640|5384608-5384640|5384608-5384640|5384608-5384640|5384608-5384640	NC_010296.1_83017-83049|NC_010296.1_5663252-5663284|NC_010296.1_310007-310039|NC_010296.1_2605473-2605441|NC_010296.1_5665885-5665917|NC_010296.1_309995-310027|NC_010296.1_2605485-2605453|NC_010296.1_3837050-3837018|NC_010296.1_3837062-3837030|NC_010296.1_3837074-3837042|NC_010296.1_5603455-5603487|NC_010296.1_5663180-5663212	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|47aa|up_8|NC_010296.1_5372079_5372220_+,NA|320aa|up_3|NC_010296.1_5381407_5382367_-,NA|344aa|up_1|NC_010296.1_5382846_5383878_-,NA|172aa|down_1|NC_010296.1_5385881_5386397_-,NA|79aa|down_2|NC_010296.1_5386433_5386670_-,NA|63aa|down_7|NC_010296.1_5390282_5390471_+	NA|491aa|up_9|NC_010296.1_5370516_5371989_-	PRK06473, PRK06473, NADH-quinone oxidoreductase subunit M	NA|47aa|up_8|NC_010296.1_5372079_5372220_+	NA	NA|458aa|up_7|NC_010296.1_5372198_5373572_-	PRK07445, PRK07445, O-succinylbenzoic acid--CoA ligase; Reviewed	NA|322aa|up_6|NC_010296.1_5373547_5374513_-	PRK02714, PRK02714, o-succinylbenzoate synthase	NA|348aa|up_5|NC_010296.1_5374841_5375885_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|393aa|up_4|NC_010296.1_5376102_5377280_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|320aa|up_3|NC_010296.1_5381407_5382367_-	NA	NA|116aa|up_2|NC_010296.1_5382363_5382711_-	pfam14252, DUF4347, Domain of unknown function (DUF4347)	NA|344aa|up_1|NC_010296.1_5382846_5383878_-	NA	NA|116aa|up_0|NC_010296.1_5383874_5384222_-	pfam14252, DUF4347, Domain of unknown function (DUF4347)	NA|97aa|down_0|NC_010296.1_5385455_5385746_-	pfam14252, DUF4347, Domain of unknown function (DUF4347)	NA|172aa|down_1|NC_010296.1_5385881_5386397_-	NA	NA|79aa|down_2|NC_010296.1_5386433_5386670_-	NA	NA|332aa|down_3|NC_010296.1_5386865_5387861_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|71aa|down_4|NC_010296.1_5388240_5388453_+	pfam10047, DUF2281, Protein of unknown function (DUF2281)	NA|145aa|down_5|NC_010296.1_5388443_5388878_+	pfam01850, PIN, PIN domain	NA|336aa|down_6|NC_010296.1_5389028_5390036_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|63aa|down_7|NC_010296.1_5390282_5390471_+	NA	NA|476aa|down_8|NC_010296.1_5390708_5392136_-	COG1215, COG1215, Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]	NA|412aa|down_9|NC_010296.1_5392347_5393583_+	PRK07424, PRK07424, bifunctional sterol desaturase/short chain dehydrogenase; Validated
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	36	5470868-5470989	34	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	GGTAGGTGTTAAAAACTGTCAGACACCCCCCTTATCAAG	39	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|47aa|up_3|NC_010296.1_5466031_5466172_+,NA|52aa|down_4|NC_010296.1_5476387_5476543_-,NA|228aa|down_5|NC_010296.1_5476545_5477229_-	NA|175aa|up_9|NC_010296.1_5457225_5457750_-	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|178aa|up_8|NC_010296.1_5457789_5458323_-	COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and chromosome partitioning]	NA|657aa|up_7|NC_010296.1_5458645_5460616_-	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|82aa|up_6|NC_010296.1_5460867_5461113_+	CHL00065, psaC, photosystem I subunit VII	NA|364aa|up_5|NC_010296.1_5461628_5462720_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|553aa|up_4|NC_010296.1_5464230_5465889_-	pfam14104, DUF4277, Domain of unknown function (DUF4277)	NA|47aa|up_3|NC_010296.1_5466031_5466172_+	NA	NA|244aa|up_2|NC_010296.1_5468429_5469161_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|243aa|up_1|NC_010296.1_5469236_5469965_+	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|257aa|up_0|NC_010296.1_5469967_5470738_-	pfam13230, GATase_4, Glutamine amidotransferases class-II	NA|470aa|down_0|NC_010296.1_5471101_5472511_+	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|370aa|down_1|NC_010296.1_5472890_5474000_+	PRK07409, PRK07409, threonine synthase; Validated	NA|210aa|down_2|NC_010296.1_5474029_5474659_-	PRK00951, hisB, imidazoleglycerol-phosphate dehydratase HisB	NA|505aa|down_3|NC_010296.1_5474865_5476380_-	PRK14950, PRK14950, DNA polymerase III subunits gamma and tau; Provisional	NA|52aa|down_4|NC_010296.1_5476387_5476543_-	NA	NA|228aa|down_5|NC_010296.1_5476545_5477229_-	NA	NA|71aa|down_6|NC_010296.1_5477730_5477943_+	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|282aa|down_7|NC_010296.1_5478001_5478847_-	PLN02536, PLN02536, diaminopimelate epimerase	NA|71aa|down_8|NC_010296.1_5478908_5479121_+	cd01716, Hfq, bacterial Hfq-like	NA|391aa|down_9|NC_010296.1_5479481_5480654_+	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	37	5490949-5491051	35	CRISPRCasFinder	no		Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Orphan	TATGAAACGTGCTGTCAAGCGGATC	25	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|100aa|up_2|NC_010296.1_5486293_5486593_-,NA|259aa|up_0|NC_010296.1_5489598_5490375_+,NA|56aa|down_6|NC_010296.1_5496966_5497134_-,NA|207aa|down_9|NC_010296.1_5500191_5500812_-	NA|391aa|up_9|NC_010296.1_5479481_5480654_+	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|206aa|up_8|NC_010296.1_5480713_5481331_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|339aa|up_7|NC_010296.1_5481350_5482367_+	cd06433, GT_2_WfgS_like, WfgS and WfeV are involved in O-antigen biosynthesis	NA|211aa|up_6|NC_010296.1_5482376_5483009_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|322aa|up_5|NC_010296.1_5483032_5483998_+	cd06433, GT_2_WfgS_like, WfgS and WfeV are involved in O-antigen biosynthesis	NA|137aa|up_4|NC_010296.1_5484015_5484426_+	pfam04138, GtrA, GtrA-like protein	NA|516aa|up_3|NC_010296.1_5484623_5486171_-	pfam13231, PMT_2, Dolichyl-phosphate-mannose-protein mannosyltransferase	NA|100aa|up_2|NC_010296.1_5486293_5486593_-	NA	NA|822aa|up_1|NC_010296.1_5486668_5489134_-	CHL00095, clpC, Clp protease ATP binding subunit	NA|259aa|up_0|NC_010296.1_5489598_5490375_+	NA	NA|89aa|down_0|NC_010296.1_5492290_5492557_+	pfam04365, BrnT_toxin, Ribonuclease toxin, BrnT, of type II toxin-antitoxin system	NA|86aa|down_1|NC_010296.1_5492559_5492817_+	pfam14384, BrnA_antitoxin, BrnA antitoxin of type II toxin-antitoxin system	NA|83aa|down_2|NC_010296.1_5492875_5493124_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|316aa|down_3|NC_010296.1_5493287_5494235_-	pfam05239, PRC, PRC-barrel domain	NA|420aa|down_4|NC_010296.1_5494499_5495759_-	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed	NA|248aa|down_5|NC_010296.1_5496130_5496874_-	pfam04481, DUF561, Protein of unknown function (DUF561)	NA|56aa|down_6|NC_010296.1_5496966_5497134_-	NA	NA|451aa|down_7|NC_010296.1_5497325_5498678_+	PRK14360, glmU, bifunctional UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase GlmU	NA|383aa|down_8|NC_010296.1_5499006_5500155_-	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|207aa|down_9|NC_010296.1_5500191_5500812_-	NA
GCF_000010625.1_ASM1062v1	NC_010296	Microcystis aeruginosa NIES-843, complete genome	38	5821983-5822248	6,4,36	CRT,PILER-CR,CRISPRCasFinder	no	Cas14c_CAS-V-F,cas14k,c2c9_V-U4	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	Unclear	CCTTACCTATTAGGTCAAATAGGATTAGTTGGAAAC,ATACCTTACCTATTAGGTCAAATAGGATTAGTTGGAAAC,CCTTACCTATTAGGTCAAATAGGATTAGTTGGAAA	36,39,35	0	0	NA	NA	NA:NA:NA	3,2,2	3	TypeV	Cas14c_CAS-V-F,csa3,cas14j,RT,cas14k,WYL,cas3,csc2gr7,csc1gr5,cas8b5,cas7,cas5,cas6,cas4,cas1,cas2,c2c9_V-U4,2OG_CAS,Cas14u_CAS-V,DinG	NA|117aa|up_6|NC_010296.1_5815999_5816350_+,NA|63aa|down_3|NC_010296.1_5825382_5825571_-,NA|47aa|down_9|NC_010296.1_5832131_5832272_+	NA|286aa|up_9|NC_010296.1_5812779_5813637_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|317aa|up_8|NC_010296.1_5813898_5814849_-	PRK00281, PRK00281, undecaprenyl-diphosphate phosphatase	NA|244aa|up_7|NC_010296.1_5815059_5815791_+	COG0678, AHP1, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|117aa|up_6|NC_010296.1_5815999_5816350_+	NA	Cas14c_CAS-V-F|429aa|up_5|NC_010296.1_5816528_5817815_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|69aa|up_4|NC_010296.1_5818013_5818220_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|76aa|up_3|NC_010296.1_5818222_5818450_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|178aa|up_2|NC_010296.1_5818708_5819242_+	PRK09448, PRK09448, DNA starvation/stationary phase protection protein Dps; Provisional	NA|428aa|up_1|NC_010296.1_5819274_5820558_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|286aa|up_0|NC_010296.1_5820819_5821677_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|303aa|down_0|NC_010296.1_5822456_5823365_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|479aa|down_1|NC_010296.1_5823519_5824956_+	cd00880, Era_like, E	NA|99aa|down_2|NC_010296.1_5825101_5825398_-	pfam05016, ParE_toxin, ParE toxin of type II toxin-antitoxin system, parDE	NA|63aa|down_3|NC_010296.1_5825382_5825571_-	NA	NA|231aa|down_4|NC_010296.1_5826044_5826737_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|190aa|down_5|NC_010296.1_5826827_5827397_+	cd03769, SR_IS607_transposase_like, Serine Recombinase (SR) family, IS607-like transposase subfamily, catalytic domain; members contain a DNA binding domain with homology to MerR/SoxR located N-terminal to the catalytic domain	cas14k|355aa|down_6|NC_010296.1_5827399_5828464_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|359aa|down_7|NC_010296.1_5828946_5830023_-	PRK10983, PRK10983, AI-2E family transporter YdiK	NA|256aa|down_8|NC_010296.1_5830297_5831065_+	COG0411, LivG, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]	NA|47aa|down_9|NC_010296.1_5832131_5832272_+	NA
