assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	1	62023-62135	1	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	TGATCAGTATATAACAATGACCTTAAT	27	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|34aa|up_7|NZ_CP020771.1_52910_53012_-,NA|98aa|down_4|NZ_CP020771.1_66258_66552_-,NA|213aa|down_5|NZ_CP020771.1_66682_67321_-,NA|531aa|down_6|NZ_CP020771.1_67428_69021_-	NA|152aa|up_9|NZ_CP020771.1_51915_52371_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|157aa|up_8|NZ_CP020771.1_52367_52838_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|34aa|up_7|NZ_CP020771.1_52910_53012_-	NA	NA|509aa|up_6|NZ_CP020771.1_53057_54584_-	PRK11823, PRK11823, DNA repair protein RadA; Provisional	NA|247aa|up_5|NZ_CP020771.1_54805_55546_+	CHL00148, orf27, Ycf27; Reviewed	NA|214aa|up_4|NZ_CP020771.1_55555_56197_+	pfam11152, CCB2_CCB4, Cofactor assembly of complex C subunit B, CCB2/CCB4	NA|219aa|up_3|NZ_CP020771.1_57392_58049_+	PRK09347, folE, GTP cyclohydrolase I; Provisional	NA|327aa|up_2|NZ_CP020771.1_58165_59146_+	PRK09283, PRK09283, porphobilinogen synthase	NA|204aa|up_1|NZ_CP020771.1_59142_59754_-	pfam06206, CpeT, CpeT/CpcT family (DUF1001)	NA|693aa|up_0|NZ_CP020771.1_59831_61910_+	COG1331, COG1331, Highly conserved protein containing a thioredoxin domain [Posttranslational modification, protein turnover, chaperones]	NA|251aa|down_0|NZ_CP020771.1_62751_63504_-	COG0411, LivG, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]	NA|372aa|down_1|NZ_CP020771.1_63608_64724_-	COG4177, LivM, ABC-type branched-chain amino acid transport system, permease component [Amino acid transport and metabolism]	NA|219aa|down_2|NZ_CP020771.1_65066_65723_+	cd19927, REC_Ycf29, phosphoacceptor receiver (REC) domain of probable transcriptional regulator Ycf29	NA|173aa|down_3|NZ_CP020771.1_65737_66256_-	cd16343, LMWPTP, Low molecular weight protein tyrosine phosphatase	NA|98aa|down_4|NZ_CP020771.1_66258_66552_-	NA	NA|213aa|down_5|NZ_CP020771.1_66682_67321_-	NA	NA|531aa|down_6|NZ_CP020771.1_67428_69021_-	NA	NA|159aa|down_7|NZ_CP020771.1_69366_69843_+	cd18687, PIN_VapC-like, uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|932aa|down_8|NZ_CP020771.1_70184_72980_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|626aa|down_9|NZ_CP020771.1_73152_75030_-	PRK00558, uvrC, excinuclease ABC subunit UvrC
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	2	72556-72896	1	CRT	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	GGGCAAATCTTGAAGGGGCAAATCTT	26	1	2	72627-72645|72627-72645	NZ_CP020771.1_72882-72900|NZ_CP020771.1_4252953-4252935	NA	7	7	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|98aa|up_3|NZ_CP020771.1_66258_66552_-,NA|213aa|up_2|NZ_CP020771.1_66682_67321_-,NA|531aa|up_1|NZ_CP020771.1_67428_69021_-,NA|87aa|down_1|NZ_CP020771.1_75112_75373_-	NA|693aa|up_9|NZ_CP020771.1_59831_61910_+	COG1331, COG1331, Highly conserved protein containing a thioredoxin domain [Posttranslational modification, protein turnover, chaperones]	NA|213aa|up_8|NZ_CP020771.1_62124_62763_+	PRK13141, hisH, imidazole glycerol phosphate synthase subunit HisH; Provisional	NA|251aa|up_7|NZ_CP020771.1_62751_63504_-	COG0411, LivG, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]	NA|372aa|up_6|NZ_CP020771.1_63608_64724_-	COG4177, LivM, ABC-type branched-chain amino acid transport system, permease component [Amino acid transport and metabolism]	NA|219aa|up_5|NZ_CP020771.1_65066_65723_+	cd19927, REC_Ycf29, phosphoacceptor receiver (REC) domain of probable transcriptional regulator Ycf29	NA|173aa|up_4|NZ_CP020771.1_65737_66256_-	cd16343, LMWPTP, Low molecular weight protein tyrosine phosphatase	NA|98aa|up_3|NZ_CP020771.1_66258_66552_-	NA	NA|213aa|up_2|NZ_CP020771.1_66682_67321_-	NA	NA|531aa|up_1|NZ_CP020771.1_67428_69021_-	NA	NA|159aa|up_0|NZ_CP020771.1_69366_69843_+	cd18687, PIN_VapC-like, uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|626aa|down_0|NZ_CP020771.1_73152_75030_-	PRK00558, uvrC, excinuclease ABC subunit UvrC	NA|87aa|down_1|NZ_CP020771.1_75112_75373_-	NA	NA|362aa|down_2|NZ_CP020771.1_75452_76538_-	PRK00772, PRK00772, 3-isopropylmalate dehydrogenase; Provisional	NA|358aa|down_3|NZ_CP020771.1_76709_77783_-	TIGR01208, rmlA_long, glucose-1-phosphate thymidylylransferase, long form	NA|292aa|down_4|NZ_CP020771.1_77787_78663_-	pfam04321, RmlD_sub_bind, RmlD substrate binding domain	NA|182aa|down_5|NZ_CP020771.1_78668_79214_-	TIGR01221, dTDP-4-dehydrorhamnose_35-epimerase, dTDP-4-dehydrorhamnose 3,5-epimerase	NA|361aa|down_6|NZ_CP020771.1_79347_80430_+	TIGR00433, biotin_synthase, biotin synthase	NA|217aa|down_7|NZ_CP020771.1_80413_81064_+	pfam02632, BioY, BioY family	NA|157aa|down_8|NZ_CP020771.1_81075_81546_+	PRK00376, lspA, lipoprotein signal peptidase	NA|598aa|down_9|NZ_CP020771.1_81737_83531_-	cd05931, FAAL, Fatty acyl-AMP ligase (FAAL)
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	3	267538-267637	2	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	CCCTTGATTAACAATTCTCTCATAA	25	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|67aa|up_8|NZ_CP020771.1_257750_257951_+,NA|62aa|up_6|NZ_CP020771.1_259222_259408_+,NA|97aa|up_2|NZ_CP020771.1_264556_264847_-,NA|121aa|up_1|NZ_CP020771.1_264806_265169_-,NA|86aa|down_3|NZ_CP020771.1_273576_273834_+,NA|63aa|down_5|NZ_CP020771.1_276652_276841_-,NA|69aa|down_7|NZ_CP020771.1_278058_278265_-	NA|130aa|up_9|NZ_CP020771.1_257075_257465_-	pfam14105, DUF4278, Domain of unknown function (DUF4278)	NA|67aa|up_8|NZ_CP020771.1_257750_257951_+	NA	NA|350aa|up_7|NZ_CP020771.1_257972_259022_+	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|62aa|up_6|NZ_CP020771.1_259222_259408_+	NA	NA|431aa|up_5|NZ_CP020771.1_259397_260690_-	TIGR00225, Tail-specific_protease, C-terminal peptidase (prc)	NA|390aa|up_4|NZ_CP020771.1_262105_263275_+	pfam01636, APH, Phosphotransferase enzyme family	NA|368aa|up_3|NZ_CP020771.1_263299_264403_+	pfam17914, HopA1, HopA1 effector protein family	NA|97aa|up_2|NZ_CP020771.1_264556_264847_-	NA	NA|121aa|up_1|NZ_CP020771.1_264806_265169_-	NA	NA|249aa|up_0|NZ_CP020771.1_266079_266826_-	TIGR04500, PpiC_rel_mature, putative peptide maturation system protein	NA|905aa|down_0|NZ_CP020771.1_268605_271320_-	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|193aa|down_1|NZ_CP020771.1_271377_271956_-	pfam13384, HTH_23, Homeodomain-like domain	NA|354aa|down_2|NZ_CP020771.1_272103_273165_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|86aa|down_3|NZ_CP020771.1_273576_273834_+	NA	NA|405aa|down_4|NZ_CP020771.1_274402_275617_+	pfam01610, DDE_Tnp_ISL3, Transposase	NA|63aa|down_5|NZ_CP020771.1_276652_276841_-	NA	NA|286aa|down_6|NZ_CP020771.1_276856_277714_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|69aa|down_7|NZ_CP020771.1_278058_278265_-	NA	NA|445aa|down_8|NZ_CP020771.1_278286_279621_-	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX	NA|234aa|down_9|NZ_CP020771.1_279628_280330_-	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	4	707651-707749	3	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	CTTCCCTACACCCCACACCCCACACCCCACACC	33	1	3	707684-707716|707684-707716|707684-707716	NZ_CP020771.1_2169424-2169392|NZ_CP020771.1_2262545-2262513|NZ_CP020771.1_4835717-4835685	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|102aa|up_7|NZ_CP020771.1_701467_701773_-,NA|66aa|up_6|NZ_CP020771.1_701903_702101_-,NA|192aa|down_3|NZ_CP020771.1_710532_711108_-,NA|61aa|down_8|NZ_CP020771.1_719133_719316_+,NA|354aa|down_9|NZ_CP020771.1_719673_720734_+	NA|152aa|up_9|NZ_CP020771.1_699560_700016_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|405aa|up_8|NZ_CP020771.1_700103_701318_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|102aa|up_7|NZ_CP020771.1_701467_701773_-	NA	NA|66aa|up_6|NZ_CP020771.1_701903_702101_-	NA	NA|74aa|up_5|NZ_CP020771.1_702495_702717_+	pfam09907, HigB_toxin, HigB_toxin, RelE-like toxic component of a toxin-antitoxin system	NA|139aa|up_4|NZ_CP020771.1_702682_703099_+	COG5499, COG5499, Predicted transcription regulator containing HTH domain [Transcription]	NA|176aa|up_3|NZ_CP020771.1_704832_705360_-	cd10450, GIY-YIG_AtGrxS16_like, GIY-YIG domain found in CAXIP1-like proteins, iron-sulfur cluster assembly proteins, and similar proteins	NA|196aa|up_2|NZ_CP020771.1_705553_706141_+	COG0009, SUA5, Putative translation factor (SUA5) [Translation, ribosomal structure and biogenesis]	NA|286aa|up_1|NZ_CP020771.1_706095_706953_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|157aa|up_0|NZ_CP020771.1_706888_707359_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|332aa|down_0|NZ_CP020771.1_707779_708775_+	cd01167, bac_FRK, Fructokinases (FRKs) mainly from bacteria and plants are enzymes with high specificity for fructose, as are all FRKs, but they catalyzes the conversion of fructose to fructose-6-phosphate, which is an entry point into glycolysis via conversion into glucose-6-phosphate	NA|357aa|down_1|NZ_CP020771.1_709021_710092_-	COG0429, COG0429, Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]	NA|123aa|down_2|NZ_CP020771.1_710157_710526_-	pfam08848, DUF1818, Domain of unknown function (DUF1818)	NA|192aa|down_3|NZ_CP020771.1_710532_711108_-	NA	NA|851aa|down_4|NZ_CP020771.1_711736_714289_+	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|251aa|down_5|NZ_CP020771.1_714309_715062_-	pfam05116, S6PP, Sucrose-6F-phosphate phosphohydrolase	NA|810aa|down_6|NZ_CP020771.1_715068_717498_-	TIGR02470, Sucrose_synthase_1, sucrose synthase	NA|486aa|down_7|NZ_CP020771.1_717662_719120_-	cd03800, GT4_sucrose_synthase, sucrose-phosphate synthase and similar proteins	NA|61aa|down_8|NZ_CP020771.1_719133_719316_+	NA	NA|354aa|down_9|NZ_CP020771.1_719673_720734_+	NA
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	5	741327-741465	4	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	ATTTAAAAGTGTTGGGTTTCACTATCGTTCAACCCAACCTACGATCT	47	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA,NA|64aa|down_0|NZ_CP020771.1_742214_742406_-,NA|153aa|down_1|NZ_CP020771.1_742457_742916_-,NA|354aa|down_5|NZ_CP020771.1_746287_747349_+,NA|360aa|down_9|NZ_CP020771.1_750613_751693_+	NA|356aa|up_9|NZ_CP020771.1_726060_727128_-	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|332aa|up_8|NZ_CP020771.1_727452_728448_+	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|249aa|up_7|NZ_CP020771.1_728463_729210_+	pfam17784, Sulfotransfer_4, Sulfotransferase domain	NA|315aa|up_6|NZ_CP020771.1_729760_730705_+	cd08419, PBP2_CbbR_RubisCO_like, The C-terminal substrate binding of LysR-type transcriptional regulator (CbbR) of RubisCO operon, which is involved in the carbon dioxide fixation, contains the type 2 periplasmic binding fold	NA|618aa|up_5|NZ_CP020771.1_730751_732605_+	PRK07390, PRK07390, NAD(P)H-quinone oxidoreductase subunit F; Validated	NA|242aa|up_4|NZ_CP020771.1_732685_733411_+	cd03378, beta_CA_cladeC, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|517aa|up_3|NZ_CP020771.1_733489_735040_+	PRK07363, PRK07363, NADH-quinone oxidoreductase subunit M	NA|432aa|up_2|NZ_CP020771.1_735097_736393_+	TIGR01964, chpXY, CO2 hydration protein	NA|148aa|up_1|NZ_CP020771.1_736607_737051_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|553aa|up_0|NZ_CP020771.1_739366_741025_-	pfam14104, DUF4277, Domain of unknown function (DUF4277)	NA|64aa|down_0|NZ_CP020771.1_742214_742406_-	NA	NA|153aa|down_1|NZ_CP020771.1_742457_742916_-	NA	NA|349aa|down_2|NZ_CP020771.1_742943_743990_-	PRK09293, PRK09293, class 1 fructose-bisphosphatase	NA|82aa|down_3|NZ_CP020771.1_745278_745524_-	pfam08972, DUF1902, Domain of unknown function (DUF1902)	NA|69aa|down_4|NZ_CP020771.1_745734_745941_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|354aa|down_5|NZ_CP020771.1_746287_747349_+	NA	NA|66aa|down_6|NZ_CP020771.1_747535_747733_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|493aa|down_7|NZ_CP020771.1_747736_749215_-	PRK07208, PRK07208, hypothetical protein; Provisional	NA|333aa|down_8|NZ_CP020771.1_749267_750266_-	cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|360aa|down_9|NZ_CP020771.1_750613_751693_+	NA
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	6	791075-791177	5	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	CTAATCAAGTTCAGCAGCTACTCTC	25	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|155aa|up_7|NZ_CP020771.1_782451_782916_-,NA|85aa|down_0|NZ_CP020771.1_791279_791534_+	NA|1247aa|up_9|NZ_CP020771.1_774733_778474_-	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|1255aa|up_8|NZ_CP020771.1_778522_782287_-	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|155aa|up_7|NZ_CP020771.1_782451_782916_-	NA	NA|159aa|up_6|NZ_CP020771.1_783517_783994_-	PRK07956, ligA, NAD-dependent DNA ligase LigA; Validated	NA|288aa|up_5|NZ_CP020771.1_784404_785268_+	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|108aa|up_4|NZ_CP020771.1_785298_785622_+	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|192aa|up_3|NZ_CP020771.1_785793_786369_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|299aa|up_2|NZ_CP020771.1_786418_787315_+	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|429aa|up_1|NZ_CP020771.1_787945_789232_-	pfam04932, Wzy_C, O-Antigen ligase	NA|276aa|up_0|NZ_CP020771.1_789233_790061_-	COG4735, COG4735, Uncharacterized protein conserved in bacteria [Function unknown]	NA|85aa|down_0|NZ_CP020771.1_791279_791534_+	NA	NA|688aa|down_1|NZ_CP020771.1_791752_793816_+	COG4178, COG4178, ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only]	NA|325aa|down_2|NZ_CP020771.1_794088_795063_-	TIGR04184, hypothetical_protein_HMPREF0204_12500, ATP-grasp ribosomal peptide maturase, MvdD family	NA|327aa|down_3|NZ_CP020771.1_795116_796097_-	TIGR04185, RimK-like_ATP-grasp_domain_protein, ATP-grasp ribosomal peptide maturase, MvdC family	NA|74aa|down_4|NZ_CP020771.1_796139_796361_-	pfam10049, DUF2283, Protein of unknown function (DUF2283)	NA|427aa|down_5|NZ_CP020771.1_796371_797652_-	TIGR02210, Rod_shape-determining_protein_RodA, rod shape-determining protein RodA	NA|354aa|down_6|NZ_CP020771.1_797877_798939_-	pfam10609, ParA, NUBPL iron-transfer P-loop NTPase	NA|264aa|down_7|NZ_CP020771.1_799121_799913_-	COG3217, COG3217, Uncharacterized Fe-S protein [General function prediction only]	NA|426aa|down_8|NZ_CP020771.1_799899_801177_-	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|444aa|down_9|NZ_CP020771.1_801447_802779_-	PRK09201, PRK09201, AtzE family amidohydrolase
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	7	1117610-1117722	6	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	GGATTCTGTGGTTAAAAAACAGGAGACAG	29	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|91aa|up_8|NZ_CP020771.1_1101814_1102087_+,NA|99aa|up_4|NZ_CP020771.1_1109994_1110291_+,NA|65aa|down_2|NZ_CP020771.1_1120329_1120524_+	NA|247aa|up_9|NZ_CP020771.1_1101068_1101809_+	PRK07994, PRK07994, DNA polymerase III subunits gamma and tau; Validated	NA|91aa|up_8|NZ_CP020771.1_1101814_1102087_+	NA	NA|248aa|up_7|NZ_CP020771.1_1102473_1103217_+	pfam09353, DUF1995, Domain of unknown function (DUF1995)	NA|75aa|up_6|NZ_CP020771.1_1108292_1108517_+	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|101aa|up_5|NZ_CP020771.1_1108894_1109197_-	pfam08869, XisI, XisI protein	NA|99aa|up_4|NZ_CP020771.1_1109994_1110291_+	NA	NA|246aa|up_3|NZ_CP020771.1_1111004_1111741_-	COG1662, InsB, Transposase and inactivated derivatives, IS1 family [DNA replication, recombination, and repair]	NA|534aa|up_2|NZ_CP020771.1_1112972_1114574_+	PRK06850, PRK06850, hypothetical protein; Provisional	NA|231aa|up_1|NZ_CP020771.1_1114644_1115337_+	COG2173, DdpX, D-alanyl-D-alanine dipeptidase [Cell envelope biogenesis, outer membrane]	NA|135aa|up_0|NZ_CP020771.1_1115578_1115983_-	cd17548, REC_DivK-like, phosphoacceptor receiver (REC) domain of DivK and similar proteins	NA|228aa|down_0|NZ_CP020771.1_1118724_1119408_+	cd06259, YdcF-like, YdcF-like	NA|215aa|down_1|NZ_CP020771.1_1119540_1120185_-	PRK14003, PRK14003, K(+)-transporting ATPase subunit C	NA|65aa|down_2|NZ_CP020771.1_1120329_1120524_+	NA	NA|69aa|down_3|NZ_CP020771.1_1120488_1120695_-	pfam09604, Potass_KdpF, F subunit of K+-transporting ATPase (Potass_KdpF)	NA|307aa|down_4|NZ_CP020771.1_1120852_1121773_-	cd00315, Cyt_C5_DNA_methylase, Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology	NA|720aa|down_5|NZ_CP020771.1_1123064_1125224_-	PRK01122, PRK01122, potassium-transporting ATPase subunit KdpB	NA|582aa|down_6|NZ_CP020771.1_1125451_1127197_-	pfam03814, KdpA, Potassium-transporting ATPase A subunit	NA|118aa|down_7|NZ_CP020771.1_1128417_1128771_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|605aa|down_8|NZ_CP020771.1_1128828_1130643_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|110aa|down_9|NZ_CP020771.1_1130877_1131207_-	COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair]
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	8	1475089-1475189	7	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	CCAGAAATTGATTTTGTCCGATTTCACTGT	30	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|559aa|up_8|NZ_CP020771.1_1464069_1465746_-,NA|118aa|up_7|NZ_CP020771.1_1465742_1466096_-,NA|260aa|up_1|NZ_CP020771.1_1472717_1473497_+,NA|99aa|down_7|NZ_CP020771.1_1484553_1484850_+	NA|145aa|up_9|NZ_CP020771.1_1463333_1463768_-	pfam14229, DUF4332, Domain of unknown function (DUF4332)	NA|559aa|up_8|NZ_CP020771.1_1464069_1465746_-	NA	NA|118aa|up_7|NZ_CP020771.1_1465742_1466096_-	NA	NA|480aa|up_6|NZ_CP020771.1_1466319_1467759_-	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|438aa|up_5|NZ_CP020771.1_1467800_1469114_-	TIGR01125, Ribosomal_protein_S12_methylthiotransferase_RimO, ribosomal protein S12 methylthiotransferase RimO	NA|284aa|up_4|NZ_CP020771.1_1469295_1470147_+	COG0434, SgcQ, Predicted TIM-barrel enzyme [General function prediction only]	NA|329aa|up_3|NZ_CP020771.1_1470326_1471313_+	cd12916, VKOR_1, Vitamin K epoxide reductase family in bacteria and plants	NA|398aa|up_2|NZ_CP020771.1_1471422_1472616_+	pfam01555, N6_N4_Mtase, DNA methylase	NA|260aa|up_1|NZ_CP020771.1_1472717_1473497_+	NA	NA|413aa|up_0|NZ_CP020771.1_1473545_1474784_+	TIGR00675, Modification_methylase, DNA-methyltransferase (dcm)	NA|538aa|down_0|NZ_CP020771.1_1475684_1477298_-	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|307aa|down_1|NZ_CP020771.1_1477453_1478374_-	TIGR00145, Uncharacterized_protein_slr0964, FTR1 family protein	NA|145aa|down_2|NZ_CP020771.1_1478408_1478843_-	COG2193, Bfr, Bacterioferritin (cytochrome b1) [Inorganic ion transport and metabolism]	NA|287aa|down_3|NZ_CP020771.1_1479072_1479933_-	sd00006, TPR, Tetratricopeptide repeat	NA|562aa|down_4|NZ_CP020771.1_1480292_1481978_-	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|104aa|down_5|NZ_CP020771.1_1482638_1482950_-	CHL00015, ndhE, NADH dehydrogenase subunit 4L	NA|453aa|down_6|NZ_CP020771.1_1483014_1484373_-	pfam13173, AAA_14, AAA domain	NA|99aa|down_7|NZ_CP020771.1_1484553_1484850_+	NA	NA|123aa|down_8|NZ_CP020771.1_1484892_1485261_+	pfam03576, Peptidase_S58, Peptidase family S58	NA|264aa|down_9|NZ_CP020771.1_1485713_1486505_+	NF033188, internalin_H, InlH/InlC2 family class 1 internalin
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	9	1485511-1485601	8	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	AGAGCGATCGCCTTTTTGTTTGTA	24	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|99aa|up_1|NZ_CP020771.1_1484553_1484850_+,NA|151aa|down_6|NZ_CP020771.1_1490950_1491403_+	NA|153aa|up_9|NZ_CP020771.1_1475182_1475641_-	PRK14950, PRK14950, DNA polymerase III subunits gamma and tau; Provisional	NA|538aa|up_8|NZ_CP020771.1_1475684_1477298_-	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|307aa|up_7|NZ_CP020771.1_1477453_1478374_-	TIGR00145, Uncharacterized_protein_slr0964, FTR1 family protein	NA|145aa|up_6|NZ_CP020771.1_1478408_1478843_-	COG2193, Bfr, Bacterioferritin (cytochrome b1) [Inorganic ion transport and metabolism]	NA|287aa|up_5|NZ_CP020771.1_1479072_1479933_-	sd00006, TPR, Tetratricopeptide repeat	NA|562aa|up_4|NZ_CP020771.1_1480292_1481978_-	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|104aa|up_3|NZ_CP020771.1_1482638_1482950_-	CHL00015, ndhE, NADH dehydrogenase subunit 4L	NA|453aa|up_2|NZ_CP020771.1_1483014_1484373_-	pfam13173, AAA_14, AAA domain	NA|99aa|up_1|NZ_CP020771.1_1484553_1484850_+	NA	NA|123aa|up_0|NZ_CP020771.1_1484892_1485261_+	pfam03576, Peptidase_S58, Peptidase family S58	NA|264aa|down_0|NZ_CP020771.1_1485713_1486505_+	NF033188, internalin_H, InlH/InlC2 family class 1 internalin	NA|104aa|down_1|NZ_CP020771.1_1486852_1487164_-	CHL00015, ndhE, NADH dehydrogenase subunit 4L	NA|199aa|down_2|NZ_CP020771.1_1487183_1487780_-	CHL00016, ndhG, NADH dehydrogenase subunit 6	NA|191aa|down_3|NZ_CP020771.1_1487894_1488467_-	TIGR00403, NADPH-quinone_oxidoreductase_subunit_I, NADH-plastoquinone oxidoreductase subunit I protein	NA|373aa|down_4|NZ_CP020771.1_1488509_1489628_-	CHL00032, ndhA, NADH dehydrogenase subunit 1	NA|196aa|down_5|NZ_CP020771.1_1489958_1490546_-	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated	NA|151aa|down_6|NZ_CP020771.1_1490950_1491403_+	NA	NA|665aa|down_7|NZ_CP020771.1_1491657_1493652_-	COG4178, COG4178, ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only]	NA|203aa|down_8|NZ_CP020771.1_1493824_1494433_-	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|1486aa|down_9|NZ_CP020771.1_1494525_1498983_-	PRK12467, PRK12467, peptide synthase; Provisional
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	10	1485924-1486274	2	CRT	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	TTAGCTAATTTAACTAATCTT	21	0	0	NA	NA	NA	5	5	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|99aa|up_1|NZ_CP020771.1_1484553_1484850_+,NA|151aa|down_5|NZ_CP020771.1_1490950_1491403_+	NA|153aa|up_9|NZ_CP020771.1_1475182_1475641_-	PRK14950, PRK14950, DNA polymerase III subunits gamma and tau; Provisional	NA|538aa|up_8|NZ_CP020771.1_1475684_1477298_-	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|307aa|up_7|NZ_CP020771.1_1477453_1478374_-	TIGR00145, Uncharacterized_protein_slr0964, FTR1 family protein	NA|145aa|up_6|NZ_CP020771.1_1478408_1478843_-	COG2193, Bfr, Bacterioferritin (cytochrome b1) [Inorganic ion transport and metabolism]	NA|287aa|up_5|NZ_CP020771.1_1479072_1479933_-	sd00006, TPR, Tetratricopeptide repeat	NA|562aa|up_4|NZ_CP020771.1_1480292_1481978_-	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|104aa|up_3|NZ_CP020771.1_1482638_1482950_-	CHL00015, ndhE, NADH dehydrogenase subunit 4L	NA|453aa|up_2|NZ_CP020771.1_1483014_1484373_-	pfam13173, AAA_14, AAA domain	NA|99aa|up_1|NZ_CP020771.1_1484553_1484850_+	NA	NA|123aa|up_0|NZ_CP020771.1_1484892_1485261_+	pfam03576, Peptidase_S58, Peptidase family S58	NA|104aa|down_0|NZ_CP020771.1_1486852_1487164_-	CHL00015, ndhE, NADH dehydrogenase subunit 4L	NA|199aa|down_1|NZ_CP020771.1_1487183_1487780_-	CHL00016, ndhG, NADH dehydrogenase subunit 6	NA|191aa|down_2|NZ_CP020771.1_1487894_1488467_-	TIGR00403, NADPH-quinone_oxidoreductase_subunit_I, NADH-plastoquinone oxidoreductase subunit I protein	NA|373aa|down_3|NZ_CP020771.1_1488509_1489628_-	CHL00032, ndhA, NADH dehydrogenase subunit 1	NA|196aa|down_4|NZ_CP020771.1_1489958_1490546_-	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated	NA|151aa|down_5|NZ_CP020771.1_1490950_1491403_+	NA	NA|665aa|down_6|NZ_CP020771.1_1491657_1493652_-	COG4178, COG4178, ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only]	NA|203aa|down_7|NZ_CP020771.1_1493824_1494433_-	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|1486aa|down_8|NZ_CP020771.1_1494525_1498983_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|259aa|down_9|NZ_CP020771.1_1499059_1499836_-	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	11	1521956-1522710	1,9,3,2	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	CTTGCTTCCAATTCGTGAAGCGTATGAATGGAAAC,CTTGCTTCCAATTCGTGAAGCGTATGAATGGAAAC,CTTGCTTCCAATTCGTGAAGCGTATGAATGGAAAC,CCTTGCTTCCAATTCGTGAAGCGTATGAATGGAAAC	35,35,35,36	0	0	NA	NA	NA:NA:NA:NA	7,10,10,7	10	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA,NA|60aa|down_0|NZ_CP020771.1_1522771_1522951_+,NA|73aa|down_1|NZ_CP020771.1_1523456_1523675_-,NA|72aa|down_2|NZ_CP020771.1_1523909_1524125_+,NA|85aa|down_4|NZ_CP020771.1_1525633_1525888_-,NA|145aa|down_7|NZ_CP020771.1_1528537_1528972_-	NA|259aa|up_9|NZ_CP020771.1_1499059_1499836_-	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs	NA|1093aa|up_8|NZ_CP020771.1_1499891_1503170_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|267aa|up_7|NZ_CP020771.1_1503874_1504675_-	cd05344, BKR_like_SDR_like, putative beta-ketoacyl acyl carrier protein [ACP] reductase (BKR)-like, SDR	NA|209aa|up_6|NZ_CP020771.1_1504723_1505350_-	cd20307, cupin_BacB_N, Bacillus subtilis bacilysin and related proteins, N-terminal cupin domain	NA|211aa|up_5|NZ_CP020771.1_1505446_1506079_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|348aa|up_4|NZ_CP020771.1_1506072_1507116_-	PRK05437, PRK05437, isopentenyl pyrophosphate isomerase; Provisional	NA|1587aa|up_3|NZ_CP020771.1_1507155_1511916_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|626aa|up_2|NZ_CP020771.1_1512039_1513917_-	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|1420aa|up_1|NZ_CP020771.1_1514153_1518413_-	cd05906, A_NRPS_TubE_like, The adenylation domain (A domain) of a family of nonribosomal peptide synthetases (NRPSs) synthesizing toxins and antitumor agents	NA|723aa|up_0|NZ_CP020771.1_1519549_1521718_+	PRK05443, PRK05443, polyphosphate kinase; Provisional	NA|60aa|down_0|NZ_CP020771.1_1522771_1522951_+	NA	NA|73aa|down_1|NZ_CP020771.1_1523456_1523675_-	NA	NA|72aa|down_2|NZ_CP020771.1_1523909_1524125_+	NA	NA|86aa|down_3|NZ_CP020771.1_1524124_1524382_+	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|85aa|down_4|NZ_CP020771.1_1525633_1525888_-	NA	NA|348aa|down_5|NZ_CP020771.1_1527125_1528169_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|136aa|down_6|NZ_CP020771.1_1528124_1528532_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|145aa|down_7|NZ_CP020771.1_1528537_1528972_-	NA	NA|575aa|down_8|NZ_CP020771.1_1529448_1531173_-	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|349aa|down_9|NZ_CP020771.1_1532400_1533447_-	PRK00292, glk, glucokinase; Provisional
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	12	1524835-1525521	3,10,4	PILER-CR,CRISPRCasFinder,CRT	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	CTTGCTTCCAATTCGTGAAGCGTATGAATGGAAAC,CTTGCTTCCAATTCGTGAAGCGTATGAATGGAAAC,CTTGCTTCCAATTCGTGAAGCGTATGAATGGAAAC	35,35,35	0	0	NA	NA	NA:NA:NA	8,9,9	9	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|60aa|up_3|NZ_CP020771.1_1522771_1522951_+,NA|73aa|up_2|NZ_CP020771.1_1523456_1523675_-,NA|72aa|up_1|NZ_CP020771.1_1523909_1524125_+,NA|85aa|down_0|NZ_CP020771.1_1525633_1525888_-,NA|145aa|down_3|NZ_CP020771.1_1528537_1528972_-,NA|85aa|down_7|NZ_CP020771.1_1534665_1534920_-	NA|211aa|up_9|NZ_CP020771.1_1505446_1506079_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|348aa|up_8|NZ_CP020771.1_1506072_1507116_-	PRK05437, PRK05437, isopentenyl pyrophosphate isomerase; Provisional	NA|1587aa|up_7|NZ_CP020771.1_1507155_1511916_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|626aa|up_6|NZ_CP020771.1_1512039_1513917_-	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|1420aa|up_5|NZ_CP020771.1_1514153_1518413_-	cd05906, A_NRPS_TubE_like, The adenylation domain (A domain) of a family of nonribosomal peptide synthetases (NRPSs) synthesizing toxins and antitumor agents	NA|723aa|up_4|NZ_CP020771.1_1519549_1521718_+	PRK05443, PRK05443, polyphosphate kinase; Provisional	NA|60aa|up_3|NZ_CP020771.1_1522771_1522951_+	NA	NA|73aa|up_2|NZ_CP020771.1_1523456_1523675_-	NA	NA|72aa|up_1|NZ_CP020771.1_1523909_1524125_+	NA	NA|86aa|up_0|NZ_CP020771.1_1524124_1524382_+	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|85aa|down_0|NZ_CP020771.1_1525633_1525888_-	NA	NA|348aa|down_1|NZ_CP020771.1_1527125_1528169_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|136aa|down_2|NZ_CP020771.1_1528124_1528532_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|145aa|down_3|NZ_CP020771.1_1528537_1528972_-	NA	NA|575aa|down_4|NZ_CP020771.1_1529448_1531173_-	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|349aa|down_5|NZ_CP020771.1_1532400_1533447_-	PRK00292, glk, glucokinase; Provisional	NA|214aa|down_6|NZ_CP020771.1_1533637_1534279_+	cd07051, BMC_like_1_repeat1, Bacterial Micro-Compartment (BMC)-like domain 1 repeat 1	NA|85aa|down_7|NZ_CP020771.1_1534665_1534920_-	NA	NA|460aa|down_8|NZ_CP020771.1_1534973_1536353_-	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|364aa|down_9|NZ_CP020771.1_1536987_1538079_-	PRK01372, ddl, D-alanine--D-alanine ligase; Reviewed
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	13	1731195-1732179	4,11,5	PILER-CR,CRISPRCasFinder,CRT	no	cmr3gr5,cas10	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Type III-D,Type III-A,Type III-B,Type III-C	CCTTACCTATTAGGTCAAATAGGATTAGTTGGAAAC,CCTTACCTATTAGGTCAAATAGGATTAGTTGGAAAC,CCTTACCTATTAGGTCAAATAGGATTAGTTGGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	13,13,13	13	TypeIII-B,TypeIII-D,TypeIII-A,TypeIII-C	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|315aa|up_9|NZ_CP020771.1_1715509_1716454_+,NA|204aa|down_3|NZ_CP020771.1_1735573_1736185_+,NA|111aa|down_8|NZ_CP020771.1_1743166_1743499_+,NA|122aa|down_9|NZ_CP020771.1_1743602_1743968_-	NA|315aa|up_9|NZ_CP020771.1_1715509_1716454_+	NA	NA|266aa|up_8|NZ_CP020771.1_1717400_1718198_+	cd13537, PBP2_YvgL_like, Substrate binding domain of putative molybdate-binding protein YvgL and similar proteins;the type 2 periplasmic binding protein fold	NA|613aa|up_7|NZ_CP020771.1_1718201_1720040_+	COG1118, CysA, ABC-type sulfate/molybdate transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|351aa|up_6|NZ_CP020771.1_1720553_1721605_+	pfam06782, UPF0236, Uncharacterized protein family (UPF0236)	NA|73aa|up_5|NZ_CP020771.1_1721923_1722142_-	PRK09974, PRK09974, type II toxin-antitoxin system PrlF family antitoxin	cmr3gr5|328aa|up_4|NZ_CP020771.1_1722191_1723175_-	pfam09700, Cas_Cmr3, CRISPR-associated protein (Cas_Cmr3)	cas10|475aa|up_3|NZ_CP020771.1_1723185_1724610_-	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	NA|335aa|up_2|NZ_CP020771.1_1725242_1726247_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|201aa|up_1|NZ_CP020771.1_1728335_1728938_-	pfam05685, Uma2, Putative restriction endonuclease	NA|470aa|up_0|NZ_CP020771.1_1729256_1730666_+	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|197aa|down_0|NZ_CP020771.1_1733083_1733674_-	PRK10502, PRK10502, putative acyl transferase; Provisional	NA|317aa|down_1|NZ_CP020771.1_1733679_1734630_-	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|222aa|down_2|NZ_CP020771.1_1734778_1735444_-	COG0830, UreF, Urease accessory protein UreF [Posttranslational modification, protein turnover, chaperones]	NA|204aa|down_3|NZ_CP020771.1_1735573_1736185_+	NA	NA|168aa|down_4|NZ_CP020771.1_1736186_1736690_-	cd00886, MogA_MoaB, MogA_MoaB family	NA|288aa|down_5|NZ_CP020771.1_1737062_1737926_+	pfam13413, HTH_25, Helix-turn-helix domain	NA|171aa|down_6|NZ_CP020771.1_1737994_1738507_-	pfam09626, DHC, Dihaem cytochrome c	NA|1223aa|down_7|NZ_CP020771.1_1739508_1743177_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|111aa|down_8|NZ_CP020771.1_1743166_1743499_+	NA	NA|122aa|down_9|NZ_CP020771.1_1743602_1743968_-	NA
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	14	1831398-1831501	12	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	CTGTCAAAGAATAAAATTTTCCCAAGCTCTCAGA	34	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|101aa|up_5|NZ_CP020771.1_1824539_1824842_+,NA|132aa|up_4|NZ_CP020771.1_1824842_1825238_+,NA|75aa|down_8|NZ_CP020771.1_1838576_1838801_-	NA|334aa|up_9|NZ_CP020771.1_1822058_1823060_-	PRK06270, PRK06270, homoserine dehydrogenase; Provisional	NA|72aa|up_8|NZ_CP020771.1_1823370_1823586_+	pfam11211, DUF2997, Protein of unknown function (DUF2997)	NA|129aa|up_7|NZ_CP020771.1_1823630_1824017_+	CHL00193, ycf35, Ycf35; Provisional	NA|136aa|up_6|NZ_CP020771.1_1824016_1824424_+	pfam13370, Fer4_13, 4Fe-4S single cluster domain of Ferredoxin I	NA|101aa|up_5|NZ_CP020771.1_1824539_1824842_+	NA	NA|132aa|up_4|NZ_CP020771.1_1824842_1825238_+	NA	NA|239aa|up_3|NZ_CP020771.1_1825694_1826411_+	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]	NA|543aa|up_2|NZ_CP020771.1_1826367_1827996_-	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|244aa|up_1|NZ_CP020771.1_1829118_1829850_-	pfam02397, Bac_transf, Bacterial sugar transferase	NA|264aa|up_0|NZ_CP020771.1_1830375_1831167_-	PRK09510, tolA, cell envelope integrity inner membrane protein TolA; Provisional	NA|241aa|down_0|NZ_CP020771.1_1831560_1832283_+	TIGR01198, 6-phosphogluconolactonase_6PGL	NA|209aa|down_1|NZ_CP020771.1_1832430_1833057_+	COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms]	NA|423aa|down_2|NZ_CP020771.1_1833180_1834449_+	cd06164, S2P-M50_SpoIVFB_CBS, SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation	NA|255aa|down_3|NZ_CP020771.1_1834346_1835111_-	cd00144, MPP_PPP_family, phosphoprotein phosphatases of the metallophosphatase superfamily, metallophosphatase domain	NA|242aa|down_4|NZ_CP020771.1_1835480_1836206_+	PRK00042, tpiA, triosephosphate isomerase; Provisional	NA|261aa|down_5|NZ_CP020771.1_1836202_1836985_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|130aa|down_6|NZ_CP020771.1_1837130_1837520_+	cd15487, bS6_chloro_cyano, 30S ribosomal protein S6 of chloroplasts and cyanobacteria	NA|233aa|down_7|NZ_CP020771.1_1837526_1838225_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|75aa|down_8|NZ_CP020771.1_1838576_1838801_-	NA	NA|568aa|down_9|NZ_CP020771.1_1839105_1840809_+	TIGR03156, GTP_HflX, GTP-binding protein HflX
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	15	1893159-1893341	13	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	TTTGAGGGGATTTGGGCGCACGCGATGCGCCCCTACGGGTTTTTG	45	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|67aa|up_7|NZ_CP020771.1_1882032_1882233_-,NA|283aa|up_4|NZ_CP020771.1_1884343_1885192_+,NA|60aa|down_1|NZ_CP020771.1_1894202_1894382_-,NA|516aa|down_2|NZ_CP020771.1_1894447_1895995_-,NA|164aa|down_3|NZ_CP020771.1_1896135_1896627_+,NA|90aa|down_6|NZ_CP020771.1_1899508_1899778_+	NA|265aa|up_9|NZ_CP020771.1_1878280_1879075_-	TIGR03611, RutD, pyrimidine utilization protein D	NA|200aa|up_8|NZ_CP020771.1_1879264_1879864_-	COG0605, SodA, Superoxide dismutase [Inorganic ion transport and metabolism]	NA|67aa|up_7|NZ_CP020771.1_1882032_1882233_-	NA	NA|249aa|up_6|NZ_CP020771.1_1882800_1883547_+	COG1187, RsuA, 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases [Translation, ribosomal structure and biogenesis]	NA|254aa|up_5|NZ_CP020771.1_1883573_1884335_+	COG1426, COG1426, Predicted transcriptional regulator contains Xre-like HTH domain [Function unknown]	NA|283aa|up_4|NZ_CP020771.1_1884343_1885192_+	NA	NA|331aa|up_3|NZ_CP020771.1_1885402_1886395_-	cd19093, AKR_AtPLR-like, Arabidopsis thaliana pyridoxal reductase (PLR) and similar proteins	NA|1022aa|up_2|NZ_CP020771.1_1886539_1889605_-	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|487aa|up_1|NZ_CP020771.1_1889828_1891289_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|475aa|up_0|NZ_CP020771.1_1891537_1892962_+	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|209aa|down_0|NZ_CP020771.1_1893440_1894067_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|60aa|down_1|NZ_CP020771.1_1894202_1894382_-	NA	NA|516aa|down_2|NZ_CP020771.1_1894447_1895995_-	NA	NA|164aa|down_3|NZ_CP020771.1_1896135_1896627_+	NA	NA|345aa|down_4|NZ_CP020771.1_1896623_1897658_-	cd05657, M42_glucanase_like, M42 Peptidase, endoglucanase-like subfamily	NA|446aa|down_5|NZ_CP020771.1_1897772_1899110_-	PRK07373, PRK07373, DNA polymerase III subunit alpha; Reviewed	NA|90aa|down_6|NZ_CP020771.1_1899508_1899778_+	NA	NA|72aa|down_7|NZ_CP020771.1_1900766_1900982_+	PRK09371, PRK09371, gas vesicle structural protein GvpA	NA|72aa|down_8|NZ_CP020771.1_1901336_1901552_+	PRK09371, PRK09371, gas vesicle structural protein GvpA	NA|72aa|down_9|NZ_CP020771.1_1901882_1902098_+	PRK09371, PRK09371, gas vesicle structural protein GvpA
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	16	2221669-2221763	14	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	GGTCCTAAACTACTTGGAGCGTT	23	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|236aa|up_6|NZ_CP020771.1_2213700_2214408_+,NA|156aa|up_4|NZ_CP020771.1_2217621_2218089_-,NA|78aa|up_3|NZ_CP020771.1_2218075_2218309_-,NA|191aa|up_2|NZ_CP020771.1_2218301_2218874_-,NA|278aa|up_1|NZ_CP020771.1_2219028_2219862_-,NA|108aa|down_0|NZ_CP020771.1_2221828_2222152_-,NA|49aa|down_2|NZ_CP020771.1_2222966_2223113_-,NA|292aa|down_5|NZ_CP020771.1_2224408_2225284_+,NA|90aa|down_6|NZ_CP020771.1_2225623_2225893_+,NA|110aa|down_7|NZ_CP020771.1_2225885_2226215_+,NA|80aa|down_8|NZ_CP020771.1_2227498_2227738_+	NA|372aa|up_9|NZ_CP020771.1_2209784_2210900_-	COG2205, KdpD, Osmosensitive K+ channel histidine kinase [Signal transduction mechanisms]	NA|482aa|up_8|NZ_CP020771.1_2210981_2212427_-	cd11646, Precorrin_3B_C17_MT, Precorrin-3B C(17)-methyltransferase (also named CobJ or CbiH)	NA|243aa|up_7|NZ_CP020771.1_2212423_2213152_-	pfam05685, Uma2, Putative restriction endonuclease	NA|236aa|up_6|NZ_CP020771.1_2213700_2214408_+	NA	NA|143aa|up_5|NZ_CP020771.1_2214574_2215003_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|156aa|up_4|NZ_CP020771.1_2217621_2218089_-	NA	NA|78aa|up_3|NZ_CP020771.1_2218075_2218309_-	NA	NA|191aa|up_2|NZ_CP020771.1_2218301_2218874_-	NA	NA|278aa|up_1|NZ_CP020771.1_2219028_2219862_-	NA	NA|510aa|up_0|NZ_CP020771.1_2219876_2221406_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|108aa|down_0|NZ_CP020771.1_2221828_2222152_-	NA	NA|190aa|down_1|NZ_CP020771.1_2222348_2222918_+	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|49aa|down_2|NZ_CP020771.1_2222966_2223113_-	NA	NA|234aa|down_3|NZ_CP020771.1_2223133_2223835_-	NF033186, internalin_K, class 1 internalin InlK	NA|73aa|down_4|NZ_CP020771.1_2223899_2224118_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|292aa|down_5|NZ_CP020771.1_2224408_2225284_+	NA	NA|90aa|down_6|NZ_CP020771.1_2225623_2225893_+	NA	NA|110aa|down_7|NZ_CP020771.1_2225885_2226215_+	NA	NA|80aa|down_8|NZ_CP020771.1_2227498_2227738_+	NA	NA|854aa|down_9|NZ_CP020771.1_2228339_2230901_+	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	17	2223265-2223402	6	CRT	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	TAGGGAGAGGGTTAGGGA	18	3	17	2223283-2223300|2223283-2223300|2223283-2223300|2223283-2223300|2223319-2223348|2223319-2223348|2223319-2223348|2223319-2223348|2223367-2223384|2223367-2223384|2223367-2223384|2223367-2223384|2223367-2223384|2223367-2223384|2223367-2223384|2223367-2223384|2223367-2223384	NZ_CP020771.1_2684475-2684458|NZ_CP020771.1_4795386-4795369|NZ_CP020771.1_2684415-2684398|NZ_CP020771.1_4795326-4795309|NZ_CP020771.1_2684439-2684410|NZ_CP020771.1_2684463-2684434|NZ_CP020771.1_4795350-4795321|NZ_CP020771.1_4795374-4795345|NZ_CP020771.1_2684379-2684362|NZ_CP020771.1_2684391-2684374|NZ_CP020771.1_4795290-4795273|NZ_CP020771.1_4795302-4795285|NZ_CP020771.1_2223259-2223276|NZ_CP020771.1_2684403-2684386|NZ_CP020771.1_2684499-2684482|NZ_CP020771.1_4795314-4795297|NZ_CP020771.1_4795410-4795393	NA	3	3	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|236aa|up_9|NZ_CP020771.1_2213700_2214408_+,NA|156aa|up_7|NZ_CP020771.1_2217621_2218089_-,NA|78aa|up_6|NZ_CP020771.1_2218075_2218309_-,NA|191aa|up_5|NZ_CP020771.1_2218301_2218874_-,NA|278aa|up_4|NZ_CP020771.1_2219028_2219862_-,NA|108aa|up_2|NZ_CP020771.1_2221828_2222152_-,NA|49aa|up_0|NZ_CP020771.1_2222966_2223113_-,NA|292aa|down_1|NZ_CP020771.1_2224408_2225284_+,NA|90aa|down_2|NZ_CP020771.1_2225623_2225893_+,NA|110aa|down_3|NZ_CP020771.1_2225885_2226215_+,NA|80aa|down_4|NZ_CP020771.1_2227498_2227738_+,NA|184aa|down_9|NZ_CP020771.1_2234486_2235038_-	NA|236aa|up_9|NZ_CP020771.1_2213700_2214408_+	NA	NA|143aa|up_8|NZ_CP020771.1_2214574_2215003_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|156aa|up_7|NZ_CP020771.1_2217621_2218089_-	NA	NA|78aa|up_6|NZ_CP020771.1_2218075_2218309_-	NA	NA|191aa|up_5|NZ_CP020771.1_2218301_2218874_-	NA	NA|278aa|up_4|NZ_CP020771.1_2219028_2219862_-	NA	NA|510aa|up_3|NZ_CP020771.1_2219876_2221406_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|108aa|up_2|NZ_CP020771.1_2221828_2222152_-	NA	NA|190aa|up_1|NZ_CP020771.1_2222348_2222918_+	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|49aa|up_0|NZ_CP020771.1_2222966_2223113_-	NA	NA|73aa|down_0|NZ_CP020771.1_2223899_2224118_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|292aa|down_1|NZ_CP020771.1_2224408_2225284_+	NA	NA|90aa|down_2|NZ_CP020771.1_2225623_2225893_+	NA	NA|110aa|down_3|NZ_CP020771.1_2225885_2226215_+	NA	NA|80aa|down_4|NZ_CP020771.1_2227498_2227738_+	NA	NA|854aa|down_5|NZ_CP020771.1_2228339_2230901_+	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|383aa|down_6|NZ_CP020771.1_2231328_2232477_+	COG0758, Smf, Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion]	NA|119aa|down_7|NZ_CP020771.1_2232543_2232900_-	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	NA|402aa|down_8|NZ_CP020771.1_2233277_2234483_+	PRK00180, PRK00180, acetate kinase A/propionate kinase 2; Reviewed	NA|184aa|down_9|NZ_CP020771.1_2234486_2235038_-	NA
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	18	2305298-2305397	15	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	GATTTATTTTTGCACAAATTCCCGAA	26	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|63aa|up_2|NZ_CP020771.1_2303354_2303543_-,NA|263aa|down_3|NZ_CP020771.1_2310006_2310795_-,NA|93aa|down_8|NZ_CP020771.1_2313722_2314001_-	NA|156aa|up_9|NZ_CP020771.1_2296258_2296726_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|192aa|up_8|NZ_CP020771.1_2296803_2297379_-	pfam05685, Uma2, Putative restriction endonuclease	NA|190aa|up_7|NZ_CP020771.1_2297405_2297975_-	pfam05685, Uma2, Putative restriction endonuclease	NA|449aa|up_6|NZ_CP020771.1_2298018_2299365_-	PRK08591, PRK08591, acetyl-CoA carboxylase biotin carboxylase subunit; Validated	NA|419aa|up_5|NZ_CP020771.1_2299648_2300905_+	cd06572, Histidinol_dh, Histidinol dehydrogenase, HisD, E	NA|348aa|up_4|NZ_CP020771.1_2300999_2302043_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|62aa|up_3|NZ_CP020771.1_2303144_2303330_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|63aa|up_2|NZ_CP020771.1_2303354_2303543_-	NA	NA|147aa|up_1|NZ_CP020771.1_2304043_2304484_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|44aa|up_0|NZ_CP020771.1_2304891_2305023_+	pfam14104, DUF4277, Domain of unknown function (DUF4277)	NA|312aa|down_0|NZ_CP020771.1_2307812_2308748_-	PLN00016, PLN00016, RNA-binding protein; Provisional	NA|149aa|down_1|NZ_CP020771.1_2308992_2309439_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|156aa|down_2|NZ_CP020771.1_2309542_2310010_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|263aa|down_3|NZ_CP020771.1_2310006_2310795_-	NA	NA|171aa|down_4|NZ_CP020771.1_2311242_2311755_-	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|174aa|down_5|NZ_CP020771.1_2312055_2312577_-	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|147aa|down_6|NZ_CP020771.1_2312712_2313153_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|133aa|down_7|NZ_CP020771.1_2313238_2313637_-	pfam13747, DUF4164, Domain of unknown function (DUF4164)	NA|93aa|down_8|NZ_CP020771.1_2313722_2314001_-	NA	NA|94aa|down_9|NZ_CP020771.1_2314117_2314399_+	COG1045, CysE, Serine acetyltransferase [Amino acid transport and metabolism]
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	19	2443839-2443936	16	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	CCCTTGATAAGGGGGGTGCCGATA	24	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|59aa|up_9|NZ_CP020771.1_2432790_2432967_+,NA|77aa|up_5|NZ_CP020771.1_2437915_2438146_+,NA|76aa|up_3|NZ_CP020771.1_2438408_2438636_+,NA	NA|59aa|up_9|NZ_CP020771.1_2432790_2432967_+	NA	NA|105aa|up_8|NZ_CP020771.1_2433094_2433409_+	TIGR00442, hisS, histidyl-tRNA synthetase	NA|132aa|up_7|NZ_CP020771.1_2433435_2433831_+	TIGR03319, RNase_Y, ribonuclease Y	NA|1202aa|up_6|NZ_CP020771.1_2433896_2437502_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|77aa|up_5|NZ_CP020771.1_2437915_2438146_+	NA	NA|75aa|up_4|NZ_CP020771.1_2438149_2438374_+	pfam10049, DUF2283, Protein of unknown function (DUF2283)	NA|76aa|up_3|NZ_CP020771.1_2438408_2438636_+	NA	NA|470aa|up_2|NZ_CP020771.1_2438888_2440298_-	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|405aa|up_1|NZ_CP020771.1_2440664_2441879_+	pfam01610, DDE_Tnp_ISL3, Transposase	NA|403aa|up_0|NZ_CP020771.1_2442508_2443717_-	PRK03202, PRK03202, ATP-dependent 6-phosphofructokinase	NA|342aa|down_0|NZ_CP020771.1_2444186_2445212_-	PRK07565, PRK07565, dihydroorotate dehydrogenase-like protein	NA|1185aa|down_1|NZ_CP020771.1_2445214_2448769_-	TIGR02176, pyruvate_flavodoxin/ferrodoxin_oxidoreductase, pyruvate:ferredoxin (flavodoxin) oxidoreductase, homodimeric	NA|205aa|down_2|NZ_CP020771.1_2449444_2450059_+	COG1974, LexA, SOS-response transcriptional repressors (RecA-mediated autopeptidases) [Transcription / Signal transduction mechanisms]	NA|89aa|down_3|NZ_CP020771.1_2450188_2450455_+	pfam09493, DUF2389, Tryptophan-rich protein (DUF2389)	NA|445aa|down_4|NZ_CP020771.1_2450470_2451805_-	pfam14233, DUF4335, Domain of unknown function (DUF4335)	NA|180aa|down_5|NZ_CP020771.1_2451958_2452498_-	pfam11237, DUF3038, Protein of unknown function (DUF3038)	NA|524aa|down_6|NZ_CP020771.1_2452633_2454205_+	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|163aa|down_7|NZ_CP020771.1_2454490_2454979_-	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|84aa|down_8|NZ_CP020771.1_2454975_2455227_-	COG2886, COG2886, Uncharacterized small protein [Function unknown]	NA|172aa|down_9|NZ_CP020771.1_2455441_2455957_-	cd04496, SSB_OBF, SSB_OBF: A subfamily of OB folds similar to the OB fold of ssDNA-binding protein (SSB)
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	20	2575920-2576013	17	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	TGATCGGTGAATAATATCATTTAAA	25	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|265aa|up_7|NZ_CP020771.1_2567269_2568064_+,NA	NA|669aa|up_9|NZ_CP020771.1_2564680_2566687_+	PLN02790, PLN02790, transketolase	NA|87aa|up_8|NZ_CP020771.1_2566800_2567061_+	pfam01381, HTH_3, Helix-turn-helix	NA|265aa|up_7|NZ_CP020771.1_2567269_2568064_+	NA	NA|298aa|up_6|NZ_CP020771.1_2568073_2568967_-	COG0338, Dam, Site-specific DNA methylase [DNA replication, recombination, and repair]	NA|405aa|up_5|NZ_CP020771.1_2570042_2571257_+	pfam01610, DDE_Tnp_ISL3, Transposase	NA|157aa|up_4|NZ_CP020771.1_2571460_2571931_+	pfam01850, PIN, PIN domain	NA|292aa|up_3|NZ_CP020771.1_2572774_2573650_-	cd07325, M48_Ste24p_like, M48 Ste24 endopeptidase-like, integral membrane metallopeptidase	NA|140aa|up_2|NZ_CP020771.1_2573654_2574074_-	sd00006, TPR, Tetratricopeptide repeat	NA|246aa|up_1|NZ_CP020771.1_2574193_2574931_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|256aa|up_0|NZ_CP020771.1_2574962_2575730_-	cd03765, proteasome_beta_bacterial, Bacterial proteasome, beta subunit	NA|255aa|down_0|NZ_CP020771.1_2576138_2576903_-	PRK05557, fabG, 3-ketoacyl-(acyl-carrier-protein) reductase; Validated	NA|300aa|down_1|NZ_CP020771.1_2577056_2577956_-	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|164aa|down_2|NZ_CP020771.1_2578196_2578688_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|405aa|down_3|NZ_CP020771.1_2579964_2581179_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|168aa|down_4|NZ_CP020771.1_2581512_2582016_+	pfam01610, DDE_Tnp_ISL3, Transposase	NA|260aa|down_5|NZ_CP020771.1_2582298_2583078_+	cd06442, DPM1_like, DPM1_like represents putative enzymes similar to eukaryotic DPM1	NA|439aa|down_6|NZ_CP020771.1_2583508_2584825_+	pfam13546, DDE_5, DDE superfamily endonuclease	NA|405aa|down_7|NZ_CP020771.1_2585340_2586555_+	pfam01610, DDE_Tnp_ISL3, Transposase	NA|553aa|down_8|NZ_CP020771.1_2587494_2589153_+	pfam14104, DUF4277, Domain of unknown function (DUF4277)	NA|267aa|down_9|NZ_CP020771.1_2590592_2591393_-	pfam04851, ResIII, Type III restriction enzyme, res subunit
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	21	2685995-2686089	18	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	AACGCTCCAAGTAGTTTAGGACC	23	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|80aa|up_8|NZ_CP020771.1_2680019_2680259_-,NA|110aa|up_7|NZ_CP020771.1_2681542_2681872_-,NA|90aa|up_6|NZ_CP020771.1_2681864_2682134_-,NA|292aa|up_5|NZ_CP020771.1_2682473_2683349_-,NA|49aa|up_2|NZ_CP020771.1_2684644_2684791_+,NA|108aa|up_0|NZ_CP020771.1_2685605_2685929_+,NA|278aa|down_1|NZ_CP020771.1_2687895_2688729_+,NA|191aa|down_2|NZ_CP020771.1_2688883_2689456_+,NA|78aa|down_3|NZ_CP020771.1_2689448_2689682_+,NA|156aa|down_4|NZ_CP020771.1_2689668_2690136_+,NA|143aa|down_8|NZ_CP020771.1_2695733_2696162_+	NA|75aa|up_9|NZ_CP020771.1_2678728_2678953_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|80aa|up_8|NZ_CP020771.1_2680019_2680259_-	NA	NA|110aa|up_7|NZ_CP020771.1_2681542_2681872_-	NA	NA|90aa|up_6|NZ_CP020771.1_2681864_2682134_-	NA	NA|292aa|up_5|NZ_CP020771.1_2682473_2683349_-	NA	NA|83aa|up_4|NZ_CP020771.1_2683609_2683858_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|234aa|up_3|NZ_CP020771.1_2683922_2684624_+	NF033186, internalin_K, class 1 internalin InlK	NA|49aa|up_2|NZ_CP020771.1_2684644_2684791_+	NA	NA|190aa|up_1|NZ_CP020771.1_2684839_2685409_-	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|108aa|up_0|NZ_CP020771.1_2685605_2685929_+	NA	NA|510aa|down_0|NZ_CP020771.1_2686351_2687881_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|278aa|down_1|NZ_CP020771.1_2687895_2688729_+	NA	NA|191aa|down_2|NZ_CP020771.1_2688883_2689456_+	NA	NA|78aa|down_3|NZ_CP020771.1_2689448_2689682_+	NA	NA|156aa|down_4|NZ_CP020771.1_2689668_2690136_+	NA	NA|127aa|down_5|NZ_CP020771.1_2692320_2692701_+	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|309aa|down_6|NZ_CP020771.1_2693544_2694471_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|359aa|down_7|NZ_CP020771.1_2694517_2695594_+	COG1088, RfbB, dTDP-D-glucose 4,6-dehydratase [Cell envelope biogenesis, outer membrane]	NA|143aa|down_8|NZ_CP020771.1_2695733_2696162_+	NA	NA|181aa|down_9|NZ_CP020771.1_2696192_2696735_+	pfam08872, KGK, KGK domain
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	22	2719783-2719882	19	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	CAAGTTTGACAGCCTGTCTTTTGACAA	27	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|73aa|up_9|NZ_CP020771.1_2709131_2709350_-,NA|445aa|up_4|NZ_CP020771.1_2713773_2715108_-,NA|217aa|up_3|NZ_CP020771.1_2715380_2716031_-,NA|67aa|down_5|NZ_CP020771.1_2726825_2727026_+,NA|133aa|down_9|NZ_CP020771.1_2731043_2731442_+	NA|73aa|up_9|NZ_CP020771.1_2709131_2709350_-	NA	NA|97aa|up_8|NZ_CP020771.1_2709605_2709896_-	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|68aa|up_7|NZ_CP020771.1_2709892_2710096_-	pfam18506, RelB_N, RelB Antitoxin alpha helical domain	NA|76aa|up_6|NZ_CP020771.1_2710240_2710468_-	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|920aa|up_5|NZ_CP020771.1_2710512_2713272_-	sd00006, TPR, Tetratricopeptide repeat	NA|445aa|up_4|NZ_CP020771.1_2713773_2715108_-	NA	NA|217aa|up_3|NZ_CP020771.1_2715380_2716031_-	NA	NA|238aa|up_2|NZ_CP020771.1_2716335_2717049_+	pfam02668, TauD, Taurine catabolism dioxygenase TauD, TfdA family	NA|576aa|up_1|NZ_CP020771.1_2717242_2718970_-	PRK05945, sdhA, succinate dehydrogenase/fumarate reductase flavoprotein subunit	NA|126aa|up_0|NZ_CP020771.1_2719373_2719751_-	PRK02710, PRK02710, plastocyanin; Provisional	NA|107aa|down_0|NZ_CP020771.1_2719943_2720264_+	PRK13697, PRK13697, cytochrome c6; Provisional	NA|560aa|down_1|NZ_CP020771.1_2720380_2722060_+	pfam11832, DUF3352, Protein of unknown function (DUF3352)	NA|125aa|down_2|NZ_CP020771.1_2722193_2722568_-	pfam01242, PTPS, 6-pyruvoyl tetrahydropterin synthase	NA|228aa|down_3|NZ_CP020771.1_2723031_2723715_+	cd07438, PHP_HisPPase_AMP, Polymerase and Histidinol Phosphatase domain of Histidinol phosphate phosphatase (HisPPase) AMP bound	NA|633aa|down_4|NZ_CP020771.1_2723958_2725857_+	PRK07390, PRK07390, NAD(P)H-quinone oxidoreductase subunit F; Validated	NA|67aa|down_5|NZ_CP020771.1_2726825_2727026_+	NA	NA|129aa|down_6|NZ_CP020771.1_2727290_2727677_+	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|481aa|down_7|NZ_CP020771.1_2728412_2729855_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|300aa|down_8|NZ_CP020771.1_2729945_2730845_-	cd06257, DnaJ, DnaJ domain or J-domain	NA|133aa|down_9|NZ_CP020771.1_2731043_2731442_+	NA
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	23	2914765-2914913	7	CRT	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	GGGACGACACCTCCCGCACC	20	0	0	NA	NA	NA	3	3	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|264aa|up_9|NZ_CP020771.1_2903637_2904429_+,NA|62aa|up_0|NZ_CP020771.1_2912429_2912615_+,NA|350aa|down_0|NZ_CP020771.1_2915077_2916127_+,NA|80aa|down_9|NZ_CP020771.1_2924082_2924322_+	NA|264aa|up_9|NZ_CP020771.1_2903637_2904429_+	NA	NA|702aa|up_8|NZ_CP020771.1_2904502_2906608_+	pfam00263, Secretin, Bacterial type II and III secretion system protein	NA|349aa|up_7|NZ_CP020771.1_2906785_2907832_-	cd07025, Peptidase_S66, LD-Carboxypeptidase, a serine protease, includes microcin C7 self immunity protein	NA|84aa|up_6|NZ_CP020771.1_2908024_2908276_-	pfam10779, XhlA, Haemolysin XhlA	NA|156aa|up_5|NZ_CP020771.1_2908318_2908786_-	cd04586, CBS_pair_BON_assoc, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain	NA|189aa|up_4|NZ_CP020771.1_2908947_2909514_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|283aa|up_3|NZ_CP020771.1_2909585_2910434_+	PRK07428, PRK07428, carboxylating nicotinate-nucleotide diphosphorylase	NA|272aa|up_2|NZ_CP020771.1_2910571_2911387_-	pfam01887, SAM_adeno_trans, S-adenosyl-l-methionine hydroxide adenosyltransferase	NA|152aa|up_1|NZ_CP020771.1_2911989_2912445_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|62aa|up_0|NZ_CP020771.1_2912429_2912615_+	NA	NA|350aa|down_0|NZ_CP020771.1_2915077_2916127_+	NA	NA|533aa|down_1|NZ_CP020771.1_2916313_2917912_+	COG2385, SpoIID, Sporulation protein and related proteins [Cell division and chromosome partitioning]	NA|230aa|down_2|NZ_CP020771.1_2918044_2918734_-	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]	NA|445aa|down_3|NZ_CP020771.1_2918822_2920157_-	TIGR00933, Trk_system_potassium_uptake_protein_trkH	NA|383aa|down_4|NZ_CP020771.1_2920166_2921315_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|52aa|down_5|NZ_CP020771.1_2921484_2921640_-	pfam10013, DUF2256, Uncharacterized protein conserved in bacteria (DUF2256)	NA|267aa|down_6|NZ_CP020771.1_2921676_2922477_+	COG0565, LasT, rRNA methylase [Translation, ribosomal structure and biogenesis]	NA|394aa|down_7|NZ_CP020771.1_2922504_2923686_+	pfam13354, Beta-lactamase2, Beta-lactamase enzyme family	NA|89aa|down_8|NZ_CP020771.1_2923689_2923956_-	pfam05768, DUF836, Glutaredoxin-like domain (DUF836)	NA|80aa|down_9|NZ_CP020771.1_2924082_2924322_+	NA
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	24	3201876-3202229	8	CRT	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	CCACGCCGGACACCACGC	18	0	0	NA	NA	NA	7	7	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|142aa|up_8|NZ_CP020771.1_3189099_3189525_+,NA|375aa|up_6|NZ_CP020771.1_3191660_3192785_-,NA|293aa|up_4|NZ_CP020771.1_3194180_3195059_+,NA|72aa|down_2|NZ_CP020771.1_3205892_3206108_+,NA|185aa|down_6|NZ_CP020771.1_3209101_3209656_-	NA|309aa|up_9|NZ_CP020771.1_3188040_3188967_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|142aa|up_8|NZ_CP020771.1_3189099_3189525_+	NA	NA|501aa|up_7|NZ_CP020771.1_3189795_3191298_-	PRK14508, PRK14508, 4-alpha-glucanotransferase; Provisional	NA|375aa|up_6|NZ_CP020771.1_3191660_3192785_-	NA	NA|115aa|up_5|NZ_CP020771.1_3193441_3193786_+	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|293aa|up_4|NZ_CP020771.1_3194180_3195059_+	NA	NA|167aa|up_3|NZ_CP020771.1_3195088_3195589_-	pfam04545, Sigma70_r4, Sigma-70, region 4	NA|746aa|up_2|NZ_CP020771.1_3195773_3198011_+	TIGR01701, Hypothetical_protein_Rv2900c/MT2968/Mb2924c	NA|142aa|up_1|NZ_CP020771.1_3198156_3198582_+	COG3755, COG3755, Uncharacterized protein conserved in bacteria [Function unknown]	NA|246aa|up_0|NZ_CP020771.1_3198797_3199535_-	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|470aa|down_0|NZ_CP020771.1_3202690_3204100_-	pfam14706, Tnp_DNA_bind, Transposase DNA-binding	NA|470aa|down_1|NZ_CP020771.1_3204341_3205751_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|72aa|down_2|NZ_CP020771.1_3205892_3206108_+	NA	NA|65aa|down_3|NZ_CP020771.1_3206346_3206541_+	TIGR02349, Chaperone_protein_DnaJ, chaperone protein DnaJ	NA|139aa|down_4|NZ_CP020771.1_3206567_3206984_-	pfam08846, DUF1816, Domain of unknown function (DUF1816)	NA|371aa|down_5|NZ_CP020771.1_3207347_3208460_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|185aa|down_6|NZ_CP020771.1_3209101_3209656_-	NA	NA|790aa|down_7|NZ_CP020771.1_3210585_3212955_+	cd16025, PAS_like, Bacterial Arylsulfatase of Pseudomonas aeruginosa and related proteins	NA|225aa|down_8|NZ_CP020771.1_3213044_3213719_+	COG2095, MarC, Multiple antibiotic transporter [Intracellular trafficking and secretion]	NA|468aa|down_9|NZ_CP020771.1_3214252_3215656_+	TIGR01788, Glutamate_decarboxylase_alpha_GAD-alpha
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	25	3251309-3251419	20	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	TTAGAACAAGAGCGTCTTAAAGCTGAA	27	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|69aa|up_8|NZ_CP020771.1_3240087_3240294_-,NA|135aa|down_5|NZ_CP020771.1_3259658_3260063_-	NA|162aa|up_9|NZ_CP020771.1_3239472_3239958_-	PRK13619, psbV, cytochrome c-550; Provisional	NA|69aa|up_8|NZ_CP020771.1_3240087_3240294_-	NA	NA|90aa|up_7|NZ_CP020771.1_3240756_3241026_+	PRK04323, PRK04323, hypothetical protein; Provisional	NA|185aa|up_6|NZ_CP020771.1_3241038_3241593_+	PRK00300, gmk, guanylate kinase; Provisional	NA|200aa|up_5|NZ_CP020771.1_3241785_3242385_+	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|405aa|up_4|NZ_CP020771.1_3243099_3244314_+	COG0520, csdA, Selenocysteine lyase/Cysteine desulfurase [Posttranslational modification, protein turnover, chaperones]	NA|1391aa|up_3|NZ_CP020771.1_3244332_3248505_-	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|157aa|up_2|NZ_CP020771.1_3248559_3249030_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|224aa|up_1|NZ_CP020771.1_3249307_3249979_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|238aa|up_0|NZ_CP020771.1_3250000_3250714_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|475aa|down_0|NZ_CP020771.1_3251586_3253011_+	COG3670, COG3670, Lignostilbene-alpha,beta-dioxygenase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|574aa|down_1|NZ_CP020771.1_3253113_3254835_-	pfam13231, PMT_2, Dolichyl-phosphate-mannose-protein mannosyltransferase	NA|296aa|down_2|NZ_CP020771.1_3254838_3255726_-	PRK00971, PRK00971, glutaminase; Provisional	NA|407aa|down_3|NZ_CP020771.1_3255737_3256958_+	cd00887, MoeA, MoeA family	NA|553aa|down_4|NZ_CP020771.1_3257559_3259218_-	pfam14104, DUF4277, Domain of unknown function (DUF4277)	NA|135aa|down_5|NZ_CP020771.1_3259658_3260063_-	NA	NA|237aa|down_6|NZ_CP020771.1_3260265_3260976_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|641aa|down_7|NZ_CP020771.1_3261465_3263388_+	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|110aa|down_8|NZ_CP020771.1_3263584_3263914_-	pfam04365, BrnT_toxin, Ribonuclease toxin, BrnT, of type II toxin-antitoxin system	NA|179aa|down_9|NZ_CP020771.1_3263935_3264472_-	cd01286, deoxycytidylate_deaminase, Deoxycytidylate deaminase domain
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	26	3559317-3565202	9,5,21,6	CRT,PILER-CR,CRISPRCasFinder,PILER-CR	no	cas2,cas1,csx18,WYL,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Type III-B,Type III-D,Type III-A,Type III-C	GTTTCCATTCAATTAATTTCTCNTAGCGAGTAGAGAG,GTTTCCATTCAATTAATTTCTCTAGCGAGTAGAGAG,GTTTCCATTCAATTAATTTCTCTAGCGAGTAGAGAG,GTTTCCATTCAATTAATTTCTCTAGCGAGTAGAGAG	37,36,36,36	0	0	NA	NA	NA:NA:NA:NA	79,77,78,77	79	TypeIII-B,TypeIII-D,TypeIII-A,TypeIII-C	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|115aa|up_2|NZ_CP020771.1_3556342_3556687_+,csx18|93aa|down_2|NZ_CP020771.1_3566802_3567081_-,cmr5gr11|185aa|down_7|NZ_CP020771.1_3573807_3574362_+	NA|253aa|up_9|NZ_CP020771.1_3543166_3543925_-	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family	NA|400aa|up_8|NZ_CP020771.1_3544095_3545295_+	PRK02507, PRK02507, proton extrusion protein PcxA; Provisional	NA|1101aa|up_7|NZ_CP020771.1_3545301_3548604_+	TIGR00915, Probable_aminoglycoside_efflux_pump, The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family	NA|522aa|up_6|NZ_CP020771.1_3549044_3550610_+	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|250aa|up_5|NZ_CP020771.1_3550628_3551378_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|150aa|up_4|NZ_CP020771.1_3551390_3551840_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|278aa|up_3|NZ_CP020771.1_3555074_3555908_-	pfam01716, MSP, Manganese-stabilizing protein / photosystem II polypeptide	NA|115aa|up_2|NZ_CP020771.1_3556342_3556687_+	NA	NA|478aa|up_1|NZ_CP020771.1_3556747_3558181_+	cd05800, PGM_like2, This PGM-like (phosphoglucomutase-like) protein of unknown function belongs to the alpha-D-phosphohexomutase superfamily and is found in both archaea and bacteria	NA|340aa|up_0|NZ_CP020771.1_3558225_3559245_-	pfam00891, Methyltransf_2, O-methyltransferase	cas2|93aa|down_0|NZ_CP020771.1_3565397_3565676_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|331aa|down_1|NZ_CP020771.1_3565796_3566789_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	csx18|93aa|down_2|NZ_CP020771.1_3566802_3567081_-	NA	WYL|418aa|down_3|NZ_CP020771.1_3567275_3568529_+	pfam13280, WYL, WYL domain	cas10|1028aa|down_4|NZ_CP020771.1_3568647_3571731_+	pfam12469, DUF3692, CRISPR-associated protein	cmr3gr5|380aa|down_5|NZ_CP020771.1_3571730_3572870_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|291aa|down_6|NZ_CP020771.1_3572928_3573801_+	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr5gr11|185aa|down_7|NZ_CP020771.1_3573807_3574362_+	NA	csm3gr7|558aa|down_8|NZ_CP020771.1_3574425_3576099_+	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	NA|190aa|down_9|NZ_CP020771.1_3576109_3576679_+	cd06260, DUF820, Domain of unknown function (DUF820)
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	27	3570142-3570249	22	CRISPRCasFinder	no	cas2,cas1,csx18,WYL,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Type III-B,Type III-D,Type III-A,Type III-C	CCCGTCGTCCATCCCGATAGTAACAACGATTGGATT	36	0	0	NA	NA	NA	1	1	TypeIII-B,TypeIII-D,TypeIII-A,TypeIII-C	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|115aa|up_6|NZ_CP020771.1_3556342_3556687_+,csx18|93aa|up_1|NZ_CP020771.1_3566802_3567081_-,cmr5gr11|185aa|down_2|NZ_CP020771.1_3573807_3574362_+,NA|151aa|down_6|NZ_CP020771.1_3578967_3579420_-,NA|103aa|down_8|NZ_CP020771.1_3580841_3581150_+	NA|250aa|up_9|NZ_CP020771.1_3550628_3551378_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|150aa|up_8|NZ_CP020771.1_3551390_3551840_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|278aa|up_7|NZ_CP020771.1_3555074_3555908_-	pfam01716, MSP, Manganese-stabilizing protein / photosystem II polypeptide	NA|115aa|up_6|NZ_CP020771.1_3556342_3556687_+	NA	NA|478aa|up_5|NZ_CP020771.1_3556747_3558181_+	cd05800, PGM_like2, This PGM-like (phosphoglucomutase-like) protein of unknown function belongs to the alpha-D-phosphohexomutase superfamily and is found in both archaea and bacteria	NA|340aa|up_4|NZ_CP020771.1_3558225_3559245_-	pfam00891, Methyltransf_2, O-methyltransferase	cas2|93aa|up_3|NZ_CP020771.1_3565397_3565676_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|331aa|up_2|NZ_CP020771.1_3565796_3566789_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	csx18|93aa|up_1|NZ_CP020771.1_3566802_3567081_-	NA	WYL|418aa|up_0|NZ_CP020771.1_3567275_3568529_+	pfam13280, WYL, WYL domain	cmr3gr5|380aa|down_0|NZ_CP020771.1_3571730_3572870_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|291aa|down_1|NZ_CP020771.1_3572928_3573801_+	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr5gr11|185aa|down_2|NZ_CP020771.1_3573807_3574362_+	NA	csm3gr7|558aa|down_3|NZ_CP020771.1_3574425_3576099_+	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	NA|190aa|down_4|NZ_CP020771.1_3576109_3576679_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|91aa|down_5|NZ_CP020771.1_3578321_3578594_-	pfam06305, LapA_dom, Lipopolysaccharide assembly protein A domain	NA|151aa|down_6|NZ_CP020771.1_3578967_3579420_-	NA	NA|284aa|down_7|NZ_CP020771.1_3579651_3580503_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|103aa|down_8|NZ_CP020771.1_3580841_3581150_+	NA	NA|154aa|down_9|NZ_CP020771.1_3581312_3581774_+	cd04210, Cupredoxin_like_1, Uncharacterized Cupredoxin-like subfamily
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	28	3572806-3572902	23	CRISPRCasFinder	no	cas2,cas1,csx18,WYL,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csm3gr7	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Type III-B,Type III-D,Type III-A,Type III-C	GCAAATTAGGCTACTCAGAATTACTTTGGA	30	0	0	NA	NA	NA	1	1	TypeIII-B,TypeIII-D,TypeIII-A,TypeIII-C	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|115aa|up_7|NZ_CP020771.1_3556342_3556687_+,csx18|93aa|up_2|NZ_CP020771.1_3566802_3567081_-,cmr5gr11|185aa|down_1|NZ_CP020771.1_3573807_3574362_+,NA|151aa|down_5|NZ_CP020771.1_3578967_3579420_-,NA|103aa|down_7|NZ_CP020771.1_3580841_3581150_+	NA|150aa|up_9|NZ_CP020771.1_3551390_3551840_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|278aa|up_8|NZ_CP020771.1_3555074_3555908_-	pfam01716, MSP, Manganese-stabilizing protein / photosystem II polypeptide	NA|115aa|up_7|NZ_CP020771.1_3556342_3556687_+	NA	NA|478aa|up_6|NZ_CP020771.1_3556747_3558181_+	cd05800, PGM_like2, This PGM-like (phosphoglucomutase-like) protein of unknown function belongs to the alpha-D-phosphohexomutase superfamily and is found in both archaea and bacteria	NA|340aa|up_5|NZ_CP020771.1_3558225_3559245_-	pfam00891, Methyltransf_2, O-methyltransferase	cas2|93aa|up_4|NZ_CP020771.1_3565397_3565676_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|331aa|up_3|NZ_CP020771.1_3565796_3566789_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	csx18|93aa|up_2|NZ_CP020771.1_3566802_3567081_-	NA	WYL|418aa|up_1|NZ_CP020771.1_3567275_3568529_+	pfam13280, WYL, WYL domain	cas10|1028aa|up_0|NZ_CP020771.1_3568647_3571731_+	pfam12469, DUF3692, CRISPR-associated protein	cmr4gr7|291aa|down_0|NZ_CP020771.1_3572928_3573801_+	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr5gr11|185aa|down_1|NZ_CP020771.1_3573807_3574362_+	NA	csm3gr7|558aa|down_2|NZ_CP020771.1_3574425_3576099_+	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	NA|190aa|down_3|NZ_CP020771.1_3576109_3576679_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|91aa|down_4|NZ_CP020771.1_3578321_3578594_-	pfam06305, LapA_dom, Lipopolysaccharide assembly protein A domain	NA|151aa|down_5|NZ_CP020771.1_3578967_3579420_-	NA	NA|284aa|down_6|NZ_CP020771.1_3579651_3580503_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|103aa|down_7|NZ_CP020771.1_3580841_3581150_+	NA	NA|154aa|down_8|NZ_CP020771.1_3581312_3581774_+	cd04210, Cupredoxin_like_1, Uncharacterized Cupredoxin-like subfamily	NA|362aa|down_9|NZ_CP020771.1_3581770_3582856_+	TIGR00378, cax, calcium/proton exchanger (cax)
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	29	3852102-3852222	24	CRISPRCasFinder	no	csa3,Cas14c_CAS-V-F	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Type I-A	TTCCCCCTGTCTTTCGGTGCGTTCCCCTTGACGG	34	0	0	NA	NA	NA	1	1	TypeV,TypeI-A	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|126aa|up_7|NZ_CP020771.1_3843212_3843590_-,NA|445aa|down_6|NZ_CP020771.1_3859838_3861173_-	NA|348aa|up_9|NZ_CP020771.1_3839304_3840348_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|936aa|up_8|NZ_CP020771.1_3840394_3843202_-	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|126aa|up_7|NZ_CP020771.1_3843212_3843590_-	NA	NA|376aa|up_6|NZ_CP020771.1_3844072_3845200_-	PRK00064, recF, recombination protein F; Reviewed	NA|191aa|up_5|NZ_CP020771.1_3845350_3845923_+	pfam10989, DUF2808, Protein of unknown function (DUF2808)	NA|215aa|up_4|NZ_CP020771.1_3846010_3846655_-	pfam05685, Uma2, Putative restriction endonuclease	NA|55aa|up_3|NZ_CP020771.1_3847001_3847166_+	pfam02069, Metallothio_Pro, Prokaryotic metallothionein	csa3|124aa|up_2|NZ_CP020771.1_3847264_3847636_+	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|263aa|up_1|NZ_CP020771.1_3847632_3848421_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|1021aa|up_0|NZ_CP020771.1_3848622_3851685_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|259aa|down_0|NZ_CP020771.1_3852536_3853313_-	pfam09865, DUF2092, Predicted periplasmic protein (DUF2092)	NA|221aa|down_1|NZ_CP020771.1_3853346_3854009_-	pfam07885, Ion_trans_2, Ion channel	NA|887aa|down_2|NZ_CP020771.1_3854450_3857111_-	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB	NA|162aa|down_3|NZ_CP020771.1_3857652_3858138_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|101aa|down_4|NZ_CP020771.1_3858160_3858463_-	pfam01744, GLTT, GLTT repeat (6 copies)	NA|298aa|down_5|NZ_CP020771.1_3858582_3859476_-	PRK08645, PRK08645, bifunctional homocysteine S-methyltransferase/5,10-methylenetetrahydrofolate reductase protein; Reviewed	NA|445aa|down_6|NZ_CP020771.1_3859838_3861173_-	NA	NA|243aa|down_7|NZ_CP020771.1_3861576_3862305_+	cd01835, SGNH_hydrolase_like_3, SGNH_hydrolase subfamily	NA|55aa|down_8|NZ_CP020771.1_3862549_3862714_-	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|237aa|down_9|NZ_CP020771.1_3862893_3863604_-	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	30	3915922-3920830	7,25,10,8	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Type I-D	GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC,GTTTCAATCCCTAATAGGGTTTAAGATTAATTGGAAC	37,37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B:I-D,II-B	66,67,67,66	67	TypeI-D	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|59aa|up_7|NZ_CP020771.1_3900391_3900568_+,NA|59aa|up_4|NZ_CP020771.1_3906149_3906326_-,NA|445aa|up_3|NZ_CP020771.1_3906721_3908056_-,NA|107aa|up_2|NZ_CP020771.1_3909502_3909823_+,NA	NA|617aa|up_9|NZ_CP020771.1_3897134_3898985_+	cd10918, CE4_NodB_like_5s_6s, Putative catalytic NodB homology domain of PgaB, IcaB, and similar proteins which consist of a deformed (beta/alpha)8 barrel fold with 5- or 6-strands	NA|258aa|up_8|NZ_CP020771.1_3899031_3899805_+	TIGR03413, GSH_gloB, hydroxyacylglutathione hydrolase	NA|59aa|up_7|NZ_CP020771.1_3900391_3900568_+	NA	NA|405aa|up_6|NZ_CP020771.1_3900626_3901841_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|303aa|up_5|NZ_CP020771.1_3905263_3906172_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|59aa|up_4|NZ_CP020771.1_3906149_3906326_-	NA	NA|445aa|up_3|NZ_CP020771.1_3906721_3908056_-	NA	NA|107aa|up_2|NZ_CP020771.1_3909502_3909823_+	NA	NA|364aa|up_1|NZ_CP020771.1_3910010_3911102_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|889aa|up_0|NZ_CP020771.1_3912910_3915577_-	PRK13805, PRK13805, bifunctional acetaldehyde-CoA/alcohol dehydrogenase; Provisional	cas2|91aa|down_0|NZ_CP020771.1_3921063_3921336_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas4|198aa|down_1|NZ_CP020771.1_3922358_3922952_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|279aa|down_2|NZ_CP020771.1_3922954_3923791_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|189aa|down_3|NZ_CP020771.1_3923918_3924485_-	cd06260, DUF820, Domain of unknown function (DUF820)	csc1gr5|225aa|down_4|NZ_CP020771.1_3924494_3925169_-	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	csc2gr7|336aa|down_5|NZ_CP020771.1_3925224_3926232_-	pfam18320, Csc2, Csc2 Crispr	cas10d|1149aa|down_6|NZ_CP020771.1_3926249_3929696_-	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	NA|153aa|down_7|NZ_CP020771.1_3929917_3930376_-	pfam18765, Polbeta, Polymerase beta, Nucleotidyltransferase	NA|133aa|down_8|NZ_CP020771.1_3930368_3930767_-	pfam01934, DUF86, Protein of unknown function DUF86	NA|140aa|down_9|NZ_CP020771.1_3930808_3931228_-	pfam18765, Polbeta, Polymerase beta, Nucleotidyltransferase
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	31	4164097-4164213	26	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	ACAAAGGACACAAAGATCGATCACTACATTCAGTTAAACTG	41	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|74aa|up_9|NZ_CP020771.1_4154834_4155056_+,NA|84aa|up_8|NZ_CP020771.1_4154982_4155234_-,NA|179aa|up_4|NZ_CP020771.1_4157778_4158315_+,NA|217aa|up_2|NZ_CP020771.1_4160028_4160679_-,NA|103aa|down_2|NZ_CP020771.1_4167946_4168255_+,NA|54aa|down_6|NZ_CP020771.1_4173643_4173805_-	NA|74aa|up_9|NZ_CP020771.1_4154834_4155056_+	NA	NA|84aa|up_8|NZ_CP020771.1_4154982_4155234_-	NA	NA|148aa|up_7|NZ_CP020771.1_4155220_4155664_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|82aa|up_6|NZ_CP020771.1_4156726_4156972_-	pfam10387, DUF2442, Protein of unknown function (DUF2442)	NA|80aa|up_5|NZ_CP020771.1_4156979_4157219_-	pfam13711, DUF4160, Domain of unknown function (DUF4160)	NA|179aa|up_4|NZ_CP020771.1_4157778_4158315_+	NA	NA|576aa|up_3|NZ_CP020771.1_4158259_4159987_+	TIGR01695, Putative_lipid_II_flippase_MurJ, murein biosynthesis integral membrane protein MurJ	NA|217aa|up_2|NZ_CP020771.1_4160028_4160679_-	NA	NA|427aa|up_1|NZ_CP020771.1_4160720_4162001_-	PRK00037, hisS, histidyl-tRNA synthetase; Reviewed	NA|637aa|up_0|NZ_CP020771.1_4162021_4163932_-	cd07551, P-type_ATPase_HM_ZosA_PfeT-like, P-type heavy metal-transporting ATPase, similar to Bacillus subtilis ZosA/PfeT which transports copper, and perhaps zinc under oxidative stress, and perhaps ferrous iron	NA|971aa|down_0|NZ_CP020771.1_4164375_4167288_+	pfam12770, CHAT, CHAT domain	NA|216aa|down_1|NZ_CP020771.1_4167291_4167939_+	COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair]	NA|103aa|down_2|NZ_CP020771.1_4167946_4168255_+	NA	NA|693aa|down_3|NZ_CP020771.1_4168809_4170888_+	COG1523, PulA, Type II secretory pathway, pullulanase PulA and related glycosidases [Carbohydrate transport and metabolism]	NA|254aa|down_4|NZ_CP020771.1_4171003_4171765_+	pfam13026, DUF3887, Protein of unknown function (DUF3887)	NA|433aa|down_5|NZ_CP020771.1_4171811_4173110_+	PRK00077, eno, enolase; Provisional	NA|54aa|down_6|NZ_CP020771.1_4173643_4173805_-	NA	NA|250aa|down_7|NZ_CP020771.1_4173839_4174589_+	COG2859, COG2859, Uncharacterized protein conserved in bacteria [Function unknown]	NA|130aa|down_8|NZ_CP020771.1_4175485_4175875_-	pfam14250, AbrB-like, AbrB-like transcriptional regulator	NA|148aa|down_9|NZ_CP020771.1_4176175_4176619_+	TIGR00738, Putative_HTH-type_transcriptional_regulator, Rrf2 family protein
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	32	4311300-4311399	27	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	CGTTACATTTCATTAACGCACCCTACAAGATC	32	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|46aa|up_8|NZ_CP020771.1_4305340_4305478_-,NA|76aa|up_6|NZ_CP020771.1_4305969_4306197_-,NA|92aa|up_3|NZ_CP020771.1_4309239_4309515_-,NA|72aa|up_1|NZ_CP020771.1_4310325_4310541_-,NA|129aa|up_0|NZ_CP020771.1_4310661_4311048_-,NA|416aa|down_0|NZ_CP020771.1_4311499_4312747_-,NA|186aa|down_1|NZ_CP020771.1_4312821_4313379_-,NA|445aa|down_3|NZ_CP020771.1_4316161_4317496_-	NA|511aa|up_9|NZ_CP020771.1_4303135_4304668_+	TIGR03794, conserved_hypothetical_protein, NHLM bacteriocin system secretion protein	NA|46aa|up_8|NZ_CP020771.1_4305340_4305478_-	NA	NA|143aa|up_7|NZ_CP020771.1_4305538_4305967_-	cd09881, PIN_VapC4-5_FitB-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4 and VapC5, and Neisseria gonorrhoeae FitB and related proteins	NA|76aa|up_6|NZ_CP020771.1_4305969_4306197_-	NA	NA|737aa|up_5|NZ_CP020771.1_4306477_4308688_+	COG4129, COG4129, Predicted membrane protein [Function unknown]	NA|124aa|up_4|NZ_CP020771.1_4308871_4309243_-	TIGR03698, clan_AA_DTGF, clan AA aspartic protease, AF_0612 family	NA|92aa|up_3|NZ_CP020771.1_4309239_4309515_-	NA	NA|146aa|up_2|NZ_CP020771.1_4309895_4310333_-	pfam13650, Asp_protease_2, Aspartyl protease	NA|72aa|up_1|NZ_CP020771.1_4310325_4310541_-	NA	NA|129aa|up_0|NZ_CP020771.1_4310661_4311048_-	NA	NA|416aa|down_0|NZ_CP020771.1_4311499_4312747_-	NA	NA|186aa|down_1|NZ_CP020771.1_4312821_4313379_-	NA	NA|553aa|down_2|NZ_CP020771.1_4313973_4315632_+	pfam14104, DUF4277, Domain of unknown function (DUF4277)	NA|445aa|down_3|NZ_CP020771.1_4316161_4317496_-	NA	NA|136aa|down_4|NZ_CP020771.1_4317778_4318186_+	TIGR02249, Integrase/recombinase_E2_protein	NA|285aa|down_5|NZ_CP020771.1_4318343_4319198_-	pfam00582, Usp, Universal stress protein family	NA|214aa|down_6|NZ_CP020771.1_4321056_4321698_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|353aa|down_7|NZ_CP020771.1_4321803_4322862_+	PRK00143, mnmA, tRNA-specific 2-thiouridylase MnmA; Reviewed	NA|251aa|down_8|NZ_CP020771.1_4322951_4323704_-	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|194aa|down_9|NZ_CP020771.1_4323954_4324536_-	pfam09367, CpeS, CpeS-like protein
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	33	4783945-4784028	28	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	GGGAAGCTTTTTTCAGTGATCAGT	24	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|178aa|up_9|NZ_CP020771.1_4771917_4772451_+,NA|80aa|down_6|NZ_CP020771.1_4790930_4791170_-,NA|110aa|down_7|NZ_CP020771.1_4792453_4792783_-,NA|90aa|down_8|NZ_CP020771.1_4792775_4793045_-,NA|292aa|down_9|NZ_CP020771.1_4793384_4794260_-	NA|178aa|up_9|NZ_CP020771.1_4771917_4772451_+	NA	NA|123aa|up_8|NZ_CP020771.1_4773002_4773371_+	TIGR04256, conserved_hypothetical_protein, GxxExxY protein	NA|148aa|up_7|NZ_CP020771.1_4773895_4774339_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|880aa|up_6|NZ_CP020771.1_4774962_4777602_+	pfam12770, CHAT, CHAT domain	NA|79aa|up_5|NZ_CP020771.1_4778269_4778506_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|98aa|up_4|NZ_CP020771.1_4778400_4778694_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|577aa|up_3|NZ_CP020771.1_4778842_4780573_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|600aa|up_2|NZ_CP020771.1_4780829_4782629_+	cd00567, ACAD, Acyl-CoA dehydrogenase	NA|121aa|up_1|NZ_CP020771.1_4782606_4782969_+	smart00823, PKS_PP, Phosphopantetheine attachment site	NA|208aa|up_0|NZ_CP020771.1_4783290_4783914_-	COG1845, CyoC, Heme/copper-type cytochrome/quinol oxidase, subunit 3 [Energy production and conversion]	NA|559aa|down_0|NZ_CP020771.1_4784077_4785754_-	TIGR02891, Probable_cytochrome_c_oxidase_subunit_1-beta, cytochrome c oxidase, subunit I	NA|322aa|down_1|NZ_CP020771.1_4785783_4786749_-	COG1622, CyoA, Heme/copper-type cytochrome/quinol oxidases, subunit 2 [Energy production and conversion]	NA|307aa|down_2|NZ_CP020771.1_4787063_4787984_+	COG1612, CtaA, Uncharacterized protein required for cytochrome oxidase assembly [Posttranslational modification, protein turnover, chaperones]	NA|317aa|down_3|NZ_CP020771.1_4787997_4788948_+	PRK04375, PRK04375, protoheme IX farnesyltransferase; Provisional	NA|445aa|down_4|NZ_CP020771.1_4789057_4790392_-	PRK00149, dnaA, chromosomal replication initiator protein DnaA	NA|170aa|down_5|NZ_CP020771.1_4790512_4791022_+	pfam01327, Pep_deformylase, Polypeptide deformylase	NA|80aa|down_6|NZ_CP020771.1_4790930_4791170_-	NA	NA|110aa|down_7|NZ_CP020771.1_4792453_4792783_-	NA	NA|90aa|down_8|NZ_CP020771.1_4792775_4793045_-	NA	NA|292aa|down_9|NZ_CP020771.1_4793384_4794260_-	NA
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	34	4796906-4797000	29	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	AACGCTCCAAGTAGTTTAGGACC	23	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|80aa|up_8|NZ_CP020771.1_4790930_4791170_-,NA|110aa|up_7|NZ_CP020771.1_4792453_4792783_-,NA|90aa|up_6|NZ_CP020771.1_4792775_4793045_-,NA|292aa|up_5|NZ_CP020771.1_4793384_4794260_-,NA|49aa|up_2|NZ_CP020771.1_4795555_4795702_+,NA|108aa|up_0|NZ_CP020771.1_4796516_4796840_+,NA|278aa|down_1|NZ_CP020771.1_4798806_4799640_+,NA|191aa|down_2|NZ_CP020771.1_4799794_4800367_+,NA|78aa|down_3|NZ_CP020771.1_4800359_4800593_+,NA|156aa|down_4|NZ_CP020771.1_4800579_4801047_+,NA|291aa|down_9|NZ_CP020771.1_4805060_4805933_+	NA|170aa|up_9|NZ_CP020771.1_4790512_4791022_+	pfam01327, Pep_deformylase, Polypeptide deformylase	NA|80aa|up_8|NZ_CP020771.1_4790930_4791170_-	NA	NA|110aa|up_7|NZ_CP020771.1_4792453_4792783_-	NA	NA|90aa|up_6|NZ_CP020771.1_4792775_4793045_-	NA	NA|292aa|up_5|NZ_CP020771.1_4793384_4794260_-	NA	NA|73aa|up_4|NZ_CP020771.1_4794550_4794769_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|234aa|up_3|NZ_CP020771.1_4794833_4795535_+	NF033186, internalin_K, class 1 internalin InlK	NA|49aa|up_2|NZ_CP020771.1_4795555_4795702_+	NA	NA|190aa|up_1|NZ_CP020771.1_4795750_4796320_-	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|108aa|up_0|NZ_CP020771.1_4796516_4796840_+	NA	NA|510aa|down_0|NZ_CP020771.1_4797262_4798792_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|278aa|down_1|NZ_CP020771.1_4798806_4799640_+	NA	NA|191aa|down_2|NZ_CP020771.1_4799794_4800367_+	NA	NA|78aa|down_3|NZ_CP020771.1_4800359_4800593_+	NA	NA|156aa|down_4|NZ_CP020771.1_4800579_4801047_+	NA	NA|154aa|down_5|NZ_CP020771.1_4801795_4802257_+	pfam11210, DUF2996, Protein of unknown function (DUF2996)	NA|126aa|down_6|NZ_CP020771.1_4802336_4802714_+	pfam11360, DUF3110, Protein of unknown function (DUF3110)	NA|140aa|down_7|NZ_CP020771.1_4802998_4803418_+	pfam02021, UPF0102, Uncharacterized protein family UPF0102	NA|273aa|down_8|NZ_CP020771.1_4803967_4804786_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|291aa|down_9|NZ_CP020771.1_4805060_4805933_+	NA
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	35	4819130-4819269	30	CRISPRCasFinder	no	Cas14c_CAS-V-F	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Unclear	GCTTTTTCCCCAATTTCTGTCACTGATTCTGTAATAGGTTCCCCGATAGCTGC	53	0	0	NA	NA	NA	1	1	TypeV	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|291aa|up_9|NZ_CP020771.1_4805060_4805933_+,NA|158aa|up_8|NZ_CP020771.1_4806002_4806476_-,NA|79aa|up_6|NZ_CP020771.1_4807485_4807722_-,NA|119aa|up_5|NZ_CP020771.1_4807839_4808196_-,NA|931aa|up_1|NZ_CP020771.1_4814873_4817666_-,NA|152aa|up_0|NZ_CP020771.1_4817746_4818202_-,NA	NA|291aa|up_9|NZ_CP020771.1_4805060_4805933_+	NA	NA|158aa|up_8|NZ_CP020771.1_4806002_4806476_-	NA	NA|308aa|up_7|NZ_CP020771.1_4806496_4807420_+	COG0392, COG0392, Predicted integral membrane protein [Function unknown]	NA|79aa|up_6|NZ_CP020771.1_4807485_4807722_-	NA	NA|119aa|up_5|NZ_CP020771.1_4807839_4808196_-	NA	NA|1109aa|up_4|NZ_CP020771.1_4808309_4811636_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|281aa|up_3|NZ_CP020771.1_4811944_4812787_-	cd14852, LD-carboxypeptidase, L,D-carboxypeptidase DacB and LdcB, and related proteins	NA|689aa|up_2|NZ_CP020771.1_4812820_4814887_-	pfam08239, SH3_3, Bacterial SH3 domain	NA|931aa|up_1|NZ_CP020771.1_4814873_4817666_-	NA	NA|152aa|up_0|NZ_CP020771.1_4817746_4818202_-	NA	NA|62aa|down_0|NZ_CP020771.1_4819851_4820037_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|328aa|down_1|NZ_CP020771.1_4820214_4821198_+	PRK07452, PRK07452, DNA polymerase III subunit delta; Validated	NA|552aa|down_2|NZ_CP020771.1_4821261_4822917_+	pfam00498, FHA, FHA domain	NA|386aa|down_3|NZ_CP020771.1_4823045_4824203_-	cd12828, TmCorA-like_1, Thermotoga maritima CorA_like subfamily	Cas14c_CAS-V-F|429aa|down_4|NZ_CP020771.1_4824317_4825604_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|339aa|down_5|NZ_CP020771.1_4825862_4826879_+	PRK02746, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase PdxA	NA|455aa|down_6|NZ_CP020771.1_4827116_4828481_+	cd07100, ALDH_SSADH1_GabD1, Mycobacterium tuberculosis succinate-semialdehyde dehydrogenase 1-like	NA|1110aa|down_7|NZ_CP020771.1_4828636_4831966_+	cd09178, PLDc_N_Snf2_like, N-terminal putative catalytic domain of uncharacterized HKD family nucleases fused to putative helicases from the Snf2-like family	NA|1141aa|down_8|NZ_CP020771.1_4832243_4835666_+	TIGR02987, m6_adenine_and_m5_cytosine_DNA_methyltransferase, type II restriction m6 adenine DNA methyltransferase, Alw26I/Eco31I/Esp3I family	NA|271aa|down_9|NZ_CP020771.1_4836238_4837051_+	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	36	4959041-4959131	31	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	GCTAAAAAGTGCTTCAACGCAAATC	25	0	0	NA	NA	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA|445aa|up_3|NZ_CP020771.1_4954201_4955536_-,NA|84aa|up_1|NZ_CP020771.1_4957171_4957423_-,NA	NA|119aa|up_9|NZ_CP020771.1_4948801_4949158_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|222aa|up_8|NZ_CP020771.1_4949163_4949829_+	COG2856, COG2856, Predicted Zn peptidase [Amino acid transport and metabolism]	NA|332aa|up_7|NZ_CP020771.1_4950321_4951317_-	CHL00180, rbcR, LysR transcriptional regulator; Provisional	NA|252aa|up_6|NZ_CP020771.1_4951615_4952371_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|185aa|up_5|NZ_CP020771.1_4952623_4953178_-	COG2179, COG2179, Predicted hydrolase of the HAD superfamily [General function prediction only]	NA|208aa|up_4|NZ_CP020771.1_4953294_4953918_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|445aa|up_3|NZ_CP020771.1_4954201_4955536_-	NA	NA|388aa|up_2|NZ_CP020771.1_4955912_4957076_+	PLN02449, PLN02449, ferrochelatase	NA|84aa|up_1|NZ_CP020771.1_4957171_4957423_-	NA	NA|473aa|up_0|NZ_CP020771.1_4957573_4958992_+	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|369aa|down_0|NZ_CP020771.1_4959210_4960318_-	PRK00578, prfB, peptide chain release factor 2; Validated	NA|371aa|down_1|NZ_CP020771.1_4960416_4961529_-	pfam12565, DUF3747, Protein of unknown function (DUF3747)	NA|87aa|down_2|NZ_CP020771.1_4962035_4962296_-	COG3609, COG3609, Predicted transcriptional regulators containing the CopG/Arc/MetJ DNA-binding domain [Transcription]	NA|450aa|down_3|NZ_CP020771.1_4962467_4963817_+	PRK02705, murD, UDP-N-acetylmuramoyl-L-alanine--D-glutamate ligase	NA|876aa|down_4|NZ_CP020771.1_4963973_4966601_+	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|61aa|down_5|NZ_CP020771.1_4966769_4966952_-	PRK00270, rpsU, 30S ribosomal protein S21; Reviewed	NA|502aa|down_6|NZ_CP020771.1_4967334_4968840_-	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|421aa|down_7|NZ_CP020771.1_4968873_4970136_-	PLN02855, PLN02855, Bifunctional selenocysteine lyase/cysteine desulfurase	NA|442aa|down_8|NZ_CP020771.1_4970191_4971517_-	TIGR01981, UPF0051_protein_Rv1462/MT1509, FeS assembly protein SufD	NA|257aa|down_9|NZ_CP020771.1_4971716_4972487_-	CHL00131, ycf16, sulfate ABC transporter protein; Validated
GCF_002095975.1_ASM209597v1	NZ_CP020771	Microcystis aeruginosa PCC 7806SL chromosome, complete genome	37	5074940-5075081	32	CRISPRCasFinder	no		Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	Orphan	TTGCCTTTTGCCTATCTTGACTAGGAAATTAATTTTGCACGACTACTTATA	51	1	15	5074991-5075030|5074991-5075030|5074991-5075030|5074991-5075030|5074991-5075030|5074991-5075030|5074991-5075030|5074991-5075030|5074991-5075030|5074991-5075030|5074991-5075030|5074991-5075030|5074991-5075030|5074991-5075030|5074991-5075030	NZ_CP020771.1_737107-737146|NZ_CP020771.1_1589270-1589309|NZ_CP020771.1_2856882-2856843|NZ_CP020771.1_3399749-3399788|NZ_CP020771.1_4285856-4285895|NZ_CP020771.1_4331121-4331082|NZ_CP020771.1_4565975-4566014|NZ_CP020771.1_5019228-5019189|NZ_CP020771.1_783259-783298|NZ_CP020771.1_923369-923330|NZ_CP020771.1_1034159-1034198|NZ_CP020771.1_3439282-3439243|NZ_CP020771.1_3630238-3630277|NZ_CP020771.1_3695353-3695314|NZ_CP020771.1_4356174-4356135	NA	1	1	Orphan	Cas14c_CAS-V-F,cas14k,RT,cmr5gr11,cas3,cmr3gr5,cas10,cas14j,DinG,cas2,cas1,csx18,WYL,cmr4gr7,csm3gr7,csa3,cas4,cas6,csc1gr5,csc2gr7,cas10d,2OG_CAS	NA,NA|54aa|down_0|NZ_CP020771.1_5075221_5075383_+,NA|127aa|down_3|NZ_CP020771.1_5078571_5078952_+	NA|510aa|up_9|NZ_CP020771.1_5060770_5062300_+	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|124aa|up_8|NZ_CP020771.1_5062346_5062718_+	cd16377, 23S_rRNA_IVP_like, 23S rRNA-intervening sequence protein and similar proteins	NA|452aa|up_7|NZ_CP020771.1_5062997_5064353_+	COG3429, COG3429, Glucose-6-P dehydrogenase subunit [Carbohydrate transport and metabolism]	NA|196aa|up_6|NZ_CP020771.1_5064510_5065098_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|403aa|up_5|NZ_CP020771.1_5065399_5066608_-	PRK00073, pgk, phosphoglycerate kinase; Provisional	NA|137aa|up_4|NZ_CP020771.1_5066771_5067182_+	cd00293, USP_Like, Usp: Universal stress protein family	NA|161aa|up_3|NZ_CP020771.1_5067559_5068042_+	PRK00704, PRK00704, photosystem I reaction center protein subunit XI; Provisional	NA|39aa|up_2|NZ_CP020771.1_5068099_5068216_+	PRK11877, psaI, photosystem I reaction center subunit VIII; Reviewed	NA|734aa|up_1|NZ_CP020771.1_5068370_5070572_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|1214aa|up_0|NZ_CP020771.1_5070960_5074602_+	PRK05989, cobN, cobaltochelatase subunit CobN; Reviewed	NA|54aa|down_0|NZ_CP020771.1_5075221_5075383_+	NA	NA|469aa|down_1|NZ_CP020771.1_5075339_5076746_-	TIGR00400, mgtE, Mg2+ transporter (mgtE)	NA|351aa|down_2|NZ_CP020771.1_5077312_5078365_+	COG2334, COG2334, Putative homoserine kinase type II (protein kinase fold) [General function prediction only]	NA|127aa|down_3|NZ_CP020771.1_5078571_5078952_+	NA	NA|164aa|down_4|NZ_CP020771.1_5078961_5079453_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|164aa|down_5|NZ_CP020771.1_5079466_5079958_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|164aa|down_6|NZ_CP020771.1_5079968_5080460_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|300aa|down_7|NZ_CP020771.1_5080762_5081662_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|89aa|down_8|NZ_CP020771.1_5081621_5081888_-	PRK10856, PRK10856, cytoskeleton protein RodZ	NA|333aa|down_9|NZ_CP020771.1_5083122_5084121_-	pfam09194, Endonuc-BsobI, Restriction endonuclease BsobI
