assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	1	314435-314553	1	PILER-CR	no	DinG	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Type IV-A	AGTAGTGAGTGGTGAGTGCTGAGTT	25	2	215	314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314473-314495|314534-314544	CP023278.1_48165-48143|CP023278.1_149222-149244|CP023278.1_151380-151358|CP023278.1_269334-269312|CP023278.1_393278-393256|CP023278.1_422074-422096|CP023278.1_422105-422127|CP023278.1_662049-662027|CP023278.1_778520-778542|CP023278.1_1143152-1143130|CP023278.1_1200192-1200214|CP023278.1_1291585-1291607|CP023278.1_1305679-1305657|CP023278.1_1306556-1306578|CP023278.1_1462202-1462180|CP023278.1_1557970-1557992|CP023278.1_1620262-1620240|CP023278.1_1627078-1627100|CP023278.1_2089111-2089133|CP023278.1_2136198-2136176|CP023278.1_2150091-2150069|CP023278.1_2205694-2205716|CP023278.1_2241889-2241867|CP023278.1_2287341-2287319|CP023278.1_2449224-2449246|CP023278.1_2535124-2535102|CP023278.1_2591356-2591334|CP023278.1_2673767-2673745|CP023278.1_2836677-2836655|CP023278.1_3134902-3134880|CP023278.1_3140999-3141021|CP023278.1_3171242-3171264|CP023278.1_3191218-3191240|CP023278.1_3350182-3350160|CP023278.1_3455359-3455337|CP023278.1_3512399-3512377|CP023278.1_3550962-3550984|CP023278.1_3557859-3557837|CP023278.1_3621455-3621477|CP023278.1_3627345-3627323|CP023278.1_3627376-3627354|CP023278.1_3996619-3996597|CP023278.1_4031782-4031804|CP023278.1_4045097-4045075|CP023278.1_4075670-4075692|CP023278.1_4111548-4111526|CP023278.1_4121411-4121433|CP023278.1_4188310-4188288|CP023278.1_4343532-4343510|CP023278.1_4347210-4347188|CP023278.1_4347283-4347261|CP023278.1_4469806-4469828|CP023278.1_4535468-4535490|CP023278.1_4604234-4604256|CP023278.1_4620920-4620942|CP023278.1_4673743-4673765|CP023278.1_4673781-4673803|CP023278.1_4685600-4685622|CP023278.1_4985428-4985406|CP023278.1_5070447-5070469|CP023278.1_5082396-5082418|CP023278.1_5109478-5109500|CP023278.1_5117378-5117400|CP023278.1_5179821-5179799|CP023278.1_5261906-5261928|CP023278.1_5291128-5291150|CP023278.1_5347667-5347645|CP023278.1_5402296-5402274|CP023278.1_5403084-5403106|CP023278.1_5406575-5406553|CP023278.1_5446475-5446453|CP023278.1_5539913-5539891|CP023278.1_5652047-5652025|CP023278.1_5701798-5701820|CP023278.1_5871561-5871583|CP023278.1_5875349-5875327|CP023278.1_5975641-5975619|CP023278.1_6112075-6112097|CP023278.1_6180404-6180426|CP023278.1_6236716-6236694|CP023278.1_6287086-6287064|CP023278.1_6362577-6362555|CP023278.1_6518144-6518122|CP023278.1_6569337-6569315|CP023278.1_6569473-6569451|CP023278.1_6634472-6634450|CP023278.1_6712155-6712177|CP023278.1_6724974-6724996|CP023278.1_6737433-6737411|CP023278.1_6919343-6919321|CP023278.1_260997-261019|CP023278.1_379866-379888|CP023278.1_632666-632688|CP023278.1_1004293-1004271|CP023278.1_1040725-1040703|CP023278.1_1227206-1227184|CP023278.1_1336343-1336365|CP023278.1_1375451-1375429|CP023278.1_1844837-1844815|CP023278.1_2013136-2013114|CP023278.1_2150122-2150100|CP023278.1_2301965-2301987|CP023278.1_2436121-2436143|CP023278.1_2436152-2436174|CP023278.1_2436183-2436205|CP023278.1_2546134-2546156|CP023278.1_2614933-2614911|CP023278.1_3450157-3450179|CP023278.1_3618762-3618740|CP023278.1_3631153-3631131|CP023278.1_4188258-4188236|CP023278.1_4462991-4462969|CP023278.1_4673045-4673023|CP023278.1_4673076-4673054|CP023278.1_4871045-4871023|CP023278.1_4991584-4991606|CP023278.1_5091027-5091049|CP023278.1_5446423-5446401|CP023278.1_5617083-5617105|CP023278.1_5666759-5666781|CP023278.1_5666790-5666812|CP023278.1_5701767-5701789|CP023278.1_5730604-5730582|CP023278.1_5790415-5790393|CP023278.1_6128241-6128263|CP023278.1_6180373-6180395|CP023278.1_6220375-6220353|CP023278.1_6329211-6329233|CP023278.1_6439376-6439398|CP023278.1_6603448-6603470|CP023278.1_367762-367740|CP023278.1_378200-378178|CP023278.1_422043-422065|CP023278.1_613309-613331|CP023278.1_632697-632719|CP023278.1_643808-643786|CP023278.1_660231-660209|CP023278.1_662080-662058|CP023278.1_743322-743344|CP023278.1_768383-768361|CP023278.1_778489-778511|CP023278.1_1012533-1012511|CP023278.1_1119550-1119528|CP023278.1_1145479-1145457|CP023278.1_1193295-1193317|CP023278.1_1273310-1273288|CP023278.1_1338305-1338327|CP023278.1_1338357-1338379|CP023278.1_1461934-1461956|CP023278.1_1528774-1528752|CP023278.1_1537046-1537068|CP023278.1_1615757-1615735|CP023278.1_1615869-1615847|CP023278.1_1617761-1617783|CP023278.1_1745148-1745170|CP023278.1_1882145-1882167|CP023278.1_1882176-1882198|CP023278.1_1953508-1953530|CP023278.1_1976856-1976834|CP023278.1_2298716-2298738|CP023278.1_2309826-2309848|CP023278.1_2401323-2401345|CP023278.1_2405115-2405093|CP023278.1_2559182-2559160|CP023278.1_2560673-2560651|CP023278.1_2562300-2562278|CP023278.1_2875012-2875034|CP023278.1_3015589-3015611|CP023278.1_3063181-3063159|CP023278.1_3103605-3103583|CP023278.1_3150569-3150591|CP023278.1_3171287-3171309|CP023278.1_3191187-3191209|CP023278.1_3425354-3425376|CP023278.1_3447536-3447558|CP023278.1_3512337-3512315|CP023278.1_3512368-3512346|CP023278.1_3557828-3557806|CP023278.1_3570862-3570884|CP023278.1_3790880-3790858|CP023278.1_3790918-3790896|CP023278.1_3998994-3999016|CP023278.1_4035775-4035797|CP023278.1_4121449-4121471|CP023278.1_4188213-4188191|CP023278.1_4188341-4188319|CP023278.1_4535437-4535459|CP023278.1_4734353-4734331|CP023278.1_4828749-4828727|CP023278.1_5109509-5109531|CP023278.1_5237428-5237450|CP023278.1_5291097-5291119|CP023278.1_5461693-5461715|CP023278.1_5461724-5461746|CP023278.1_5461755-5461777|CP023278.1_5498714-5498736|CP023278.1_5518545-5518567|CP023278.1_5539944-5539922|CP023278.1_5539975-5539953|CP023278.1_5611690-5611668|CP023278.1_5617268-5617246|CP023278.1_5738560-5738582|CP023278.1_5975672-5975650|CP023278.1_6142289-6142311|CP023278.1_6329180-6329202|CP023278.1_6461639-6461661|CP023278.1_6507384-6507406|CP023278.1_6514537-6514515|CP023278.1_6515249-6515271|CP023278.1_6522188-6522210|CP023278.1_6584346-6584324|CP023278.1_6712110-6712132|CP023278.1_6725068-6725090|CP023278.1_6919312-6919290|CP023279.1_23315-23325	NA	2	2	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|72aa|up_8|CP023278.1_301638_301854_+,NA|142aa|up_7|CP023278.1_301846_302272_+,NA|141aa|up_2|CP023278.1_310465_310888_-,NA|137aa|up_1|CP023278.1_311268_311679_-,NA|66aa|down_4|CP023278.1_319623_319821_-	NA|87aa|up_9|CP023278.1_301171_301432_+	pfam03693, ParD_antitoxin, Bacterial antitoxin of ParD toxin-antitoxin type II system and RHH	NA|72aa|up_8|CP023278.1_301638_301854_+	NA	NA|142aa|up_7|CP023278.1_301846_302272_+	NA	NA|649aa|up_6|CP023278.1_303149_305096_+	pfam13520, AA_permease_2, Amino acid permease	NA|644aa|up_5|CP023278.1_305682_307614_-	TIGR01536, Asparagine_synthetase_1, asparagine synthase (glutamine-hydrolyzing)	NA|509aa|up_4|CP023278.1_308153_309680_-	PRK00484, lysS, lysyl-tRNA synthetase; Reviewed	NA|186aa|up_3|CP023278.1_309832_310390_+	pfam05685, Uma2, Putative restriction endonuclease	NA|141aa|up_2|CP023278.1_310465_310888_-	NA	NA|137aa|up_1|CP023278.1_311268_311679_-	NA	NA|321aa|up_0|CP023278.1_311766_312729_-	cd07325, M48_Ste24p_like, M48 Ste24 endopeptidase-like, integral membrane metallopeptidase	NA|71aa|down_0|CP023278.1_314689_314902_+	pfam10999, DUF2839, Protein of unknown function (DUF2839)	NA|115aa|down_1|CP023278.1_316031_316376_+	pfam08844, DUF1815, Domain of unknown function (DUF1815)	NA|686aa|down_2|CP023278.1_316760_318818_-	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|167aa|down_3|CP023278.1_318940_319441_+	COG0824, FcbC, Predicted thioesterase [General function prediction only]	NA|66aa|down_4|CP023278.1_319623_319821_-	NA	NA|399aa|down_5|CP023278.1_320408_321605_-	PLN02696, PLN02696, 1-deoxy-D-xylulose-5-phosphate reductoisomerase	NA|431aa|down_6|CP023278.1_321880_323173_+	COG5542, COG5542, Predicted integral membrane protein [Function unknown]	NA|469aa|down_7|CP023278.1_323293_324700_+	pfam09852, DUF2079, Predicted membrane protein (DUF2079)	NA|516aa|down_8|CP023278.1_324708_326256_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|305aa|down_9|CP023278.1_327208_328123_+	cd10917, CE4_NodB_like_6s_7s, Catalytic NodB homology domain of rhizobial NodB-like proteins
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	2	691624-694180	1,1,2	CRISPRCasFinder,CRT,PILER-CR	no		PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Orphan	GTTTCAGTCCCCTTGCGGGGAAATGGTTAATGGAAAC,GTTTCAGTCCCCTTGCGGGGAAATGGTTAATGGAAAC,GTTTCAGTCCCCTTGCGGGGAAATGGTTAATGGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	34,34,33	34	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|554aa|up_3|CP023278.1_685791_687453_-,NA|237aa|up_0|CP023278.1_690802_691513_+,NA|214aa|down_6|CP023278.1_704386_705028_-,NA|277aa|down_8|CP023278.1_706424_707255_+	NA|104aa|up_9|CP023278.1_681748_682060_+	pfam08681, DUF1778, Protein of unknown function (DUF1778)	NA|170aa|up_8|CP023278.1_682060_682570_+	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|247aa|up_7|CP023278.1_682654_683395_-	pfam05023, Phytochelatin, Phytochelatin synthase	NA|223aa|up_6|CP023278.1_683532_684201_-	COG5031, COQ4, Uncharacterized protein involved in ubiquinone biosynthesis [Coenzyme metabolism]	NA|211aa|up_5|CP023278.1_684306_684939_-	COG1174, OpuBB, ABC-type proline/glycine betaine transport systems, permease component [Amino acid transport and metabolism]	NA|253aa|up_4|CP023278.1_684935_685694_-	cd03295, ABC_OpuCA_Osmoprotection, ATP-binding cassette domain of the osmoprotectant transporter	NA|554aa|up_3|CP023278.1_685791_687453_-	NA	NA|634aa|up_2|CP023278.1_687563_689465_-	pfam06182, ABC2_membrane_6, ABC-2 family transporter protein	NA|317aa|up_1|CP023278.1_689489_690440_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|237aa|up_0|CP023278.1_690802_691513_+	NA	NA|817aa|down_0|CP023278.1_694533_696984_-	CHL00095, clpC, Clp protease ATP binding subunit	NA|99aa|down_1|CP023278.1_697216_697513_+	pfam11691, DUF3288, Protein of unknown function (DUF3288)	NA|462aa|down_2|CP023278.1_697591_698977_-	PLN03094, PLN03094, Substrate binding subunit of ER-derived-lipid transporter; Provisional	NA|261aa|down_3|CP023278.1_698993_699776_-	COG1127, Ttg2A, ABC-type transport system involved in resistance to organic solvents, ATPase component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|500aa|down_4|CP023278.1_699931_701431_+	TIGR02730, Carotenoid_isomerase, carotene isomerase	NA|303aa|down_5|CP023278.1_703295_704204_+	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|214aa|down_6|CP023278.1_704386_705028_-	NA	NA|357aa|down_7|CP023278.1_705269_706340_-	cd03785, GT28_MurG, undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase	NA|277aa|down_8|CP023278.1_706424_707255_+	NA	NA|256aa|down_9|CP023278.1_707382_708150_+	cd05346, SDR_c5, classical (c) SDR, subgroup 5
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	4	1531654-1531792	3	CRISPRCasFinder	no		PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Orphan	TCAGCACCGGCTAAACGCCGCGCTACCGCTAACAGCACTCA	41	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|99aa|up_3|CP023278.1_1528212_1528509_-,NA|222aa|down_5|CP023278.1_1538679_1539345_-	NA|753aa|up_9|CP023278.1_1517893_1520152_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|152aa|up_8|CP023278.1_1520274_1520730_+	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|906aa|up_7|CP023278.1_1520732_1523450_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|471aa|up_6|CP023278.1_1523602_1525015_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|541aa|up_5|CP023278.1_1525164_1526787_-	PRK00074, guaA, GMP synthase; Reviewed	NA|372aa|up_4|CP023278.1_1527005_1528121_-	PRK00075, cbiD, cobalt-precorrin-6A synthase; Reviewed	NA|99aa|up_3|CP023278.1_1528212_1528509_-	NA	NA|199aa|up_2|CP023278.1_1528798_1529395_-	pfam11375, DUF3177, Protein of unknown function (DUF3177)	NA|79aa|up_1|CP023278.1_1529814_1530051_-	pfam02672, CP12, CP12 domain	NA|410aa|up_0|CP023278.1_1530395_1531625_+	COG4398, COG4398, Uncharacterized protein conserved in bacteria [Function unknown]	NA|372aa|down_0|CP023278.1_1531806_1532922_-	COG1409, Icc, Predicted phosphohydrolases [General function prediction only]	NA|214aa|down_1|CP023278.1_1533202_1533844_+	PRK00121, trmB, tRNA (guanine-N(7)-)-methyltransferase; Reviewed	NA|451aa|down_2|CP023278.1_1534109_1535462_+	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|471aa|down_3|CP023278.1_1535619_1537032_+	COG3670, COG3670, Lignostilbene-alpha,beta-dioxygenase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|226aa|down_4|CP023278.1_1537844_1538522_+	PRK05986, PRK05986, cob(I)yrinic acid a,c-diamide adenosyltransferase	NA|222aa|down_5|CP023278.1_1538679_1539345_-	NA	NA|328aa|down_6|CP023278.1_1539953_1540937_-	PRK05949, PRK05949, RNA polymerase sigma factor; Validated	NA|579aa|down_7|CP023278.1_1541573_1543310_-	COG0433, COG0433,  HerA helicase [Replication, recombination, and repair]	NA|264aa|down_8|CP023278.1_1543496_1544288_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|273aa|down_9|CP023278.1_1544724_1545543_+	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	5	1557873-1557980	4	CRISPRCasFinder	no		PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Orphan	TGAGTGCTGAGTGCTGTTAGCGGAAGCG	28	1	2	1557901-1557952|1557901-1557952	CP023278.1_5518556-5518607|CP023278.1_1273299-1273248	NA	1	1	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|61aa|up_4|CP023278.1_1550649_1550832_-,NA|73aa|down_4|CP023278.1_1564035_1564254_-,NA|122aa|down_5|CP023278.1_1564314_1564680_-	NA|264aa|up_9|CP023278.1_1543496_1544288_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|273aa|up_8|CP023278.1_1544724_1545543_+	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]	NA|245aa|up_7|CP023278.1_1545960_1546695_-	pfam13847, Methyltransf_31, Methyltransferase domain	NA|523aa|up_6|CP023278.1_1546707_1548276_-	cd07378, MPP_ACP5, Homo sapiens acid phosphatase 5 and related proteins, metallophosphatase domain	NA|446aa|up_5|CP023278.1_1548615_1549953_+	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|61aa|up_4|CP023278.1_1550649_1550832_-	NA	NA|323aa|up_3|CP023278.1_1550918_1551887_+	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|455aa|up_2|CP023278.1_1552021_1553386_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|607aa|up_1|CP023278.1_1553754_1555575_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|245aa|up_0|CP023278.1_1557131_1557866_+	cd14840, D-Ala-D-Ala_dipeptidase_Aad, D-Ala-D-Ala dipeptidase (includes Lactobacillus plantarum Aad peptidase)	NA|279aa|down_0|CP023278.1_1558317_1559154_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|565aa|down_1|CP023278.1_1559279_1560974_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|559aa|down_2|CP023278.1_1561018_1562695_-	NF033432, ThioGly_TfuA_rel, TfuA-related McrA-glycine thioamidation protein	NA|396aa|down_3|CP023278.1_1562774_1563962_-	pfam02624, YcaO, YcaO cyclodehydratase, ATP-ad Mg2+-binding	NA|73aa|down_4|CP023278.1_1564035_1564254_-	NA	NA|122aa|down_5|CP023278.1_1564314_1564680_-	NA	NA|284aa|down_6|CP023278.1_1564926_1565778_-	TIGR03891, lantibiotic_dehydratase, thiopeptide-type bacteriocin biosynthesis domain	NA|346aa|down_7|CP023278.1_1565876_1566914_+	pfam13621, Cupin_8, Cupin-like domain	NA|750aa|down_8|CP023278.1_1566950_1569200_-	TIGR03796, ABC_transporter_related, NHLM bacteriocin system ABC transporter, peptidase/ATP-binding protein	NA|480aa|down_9|CP023278.1_1569234_1570674_-	TIGR03794, conserved_hypothetical_protein, NHLM bacteriocin system secretion protein
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	6	1757234-1757341	5	CRISPRCasFinder	no	RT	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Unclear	GTTTTAGTCCCCTTGCGGGGAAAAGGTTGATGGAAAC	37	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|85aa|up_8|CP023278.1_1749502_1749757_+,NA|86aa|up_7|CP023278.1_1750182_1750440_+,NA|84aa|up_6|CP023278.1_1750578_1750830_+,NA|83aa|up_5|CP023278.1_1750968_1751217_+,NA|179aa|down_7|CP023278.1_1768590_1769127_-	NA|491aa|up_9|CP023278.1_1747749_1749222_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|85aa|up_8|CP023278.1_1749502_1749757_+	NA	NA|86aa|up_7|CP023278.1_1750182_1750440_+	NA	NA|84aa|up_6|CP023278.1_1750578_1750830_+	NA	NA|83aa|up_5|CP023278.1_1750968_1751217_+	NA	NA|391aa|up_4|CP023278.1_1751392_1752565_+	pfam01636, APH, Phosphotransferase enzyme family	NA|365aa|up_3|CP023278.1_1752744_1753839_+	pfam17914, HopA1, HopA1 effector protein family	NA|269aa|up_2|CP023278.1_1753905_1754712_-	pfam05685, Uma2, Putative restriction endonuclease	NA|312aa|up_1|CP023278.1_1754826_1755762_-	cd00537, MTHFR, Methylenetetrahydrofolate reductase (MTHFR)	NA|336aa|up_0|CP023278.1_1755917_1756925_-	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	RT|590aa|down_0|CP023278.1_1757469_1759239_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|293aa|down_1|CP023278.1_1760190_1761069_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|181aa|down_2|CP023278.1_1762171_1762714_+	cd05379, CAP_bacterial, Bacterial CAP (cysteine-rich secretory proteins, antigen 5, and pathogenesis-related 1 proteins) domain proteins	NA|461aa|down_3|CP023278.1_1762735_1764118_-	COG5659, COG5659, FOG: Transposase [DNA replication, recombination, and repair]	NA|633aa|down_4|CP023278.1_1764360_1766259_+	COG3596, COG3596, Predicted GTPase [General function prediction only]	NA|225aa|down_5|CP023278.1_1766354_1767029_+	COG3544, COG3544, Uncharacterized protein conserved in bacteria [Function unknown]	NA|450aa|down_6|CP023278.1_1767097_1768447_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|179aa|down_7|CP023278.1_1768590_1769127_-	NA	NA|1784aa|down_8|CP023278.1_1769369_1774721_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|770aa|down_9|CP023278.1_1774908_1777218_-	COG3349, COG3349, Uncharacterized conserved protein [Function unknown]
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	7	1759827-1760161	6,2,3	CRISPRCasFinder,CRT,PILER-CR	no	RT	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Unclear	GTTTCAGTCCCCTTGCGGGGAAAAGGTTAATGGAAAC,GTTTCAGTCCCCTTGCGGGGAAAAGGTTAATGGAAAC,GTTTCAGTCCCCTTGCGGGGAAAAGGTTAATGGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	4,4,3	4	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|85aa|up_9|CP023278.1_1749502_1749757_+,NA|86aa|up_8|CP023278.1_1750182_1750440_+,NA|84aa|up_7|CP023278.1_1750578_1750830_+,NA|83aa|up_6|CP023278.1_1750968_1751217_+,NA|179aa|down_6|CP023278.1_1768590_1769127_-,NA|416aa|down_9|CP023278.1_1777289_1778537_-	NA|85aa|up_9|CP023278.1_1749502_1749757_+	NA	NA|86aa|up_8|CP023278.1_1750182_1750440_+	NA	NA|84aa|up_7|CP023278.1_1750578_1750830_+	NA	NA|83aa|up_6|CP023278.1_1750968_1751217_+	NA	NA|391aa|up_5|CP023278.1_1751392_1752565_+	pfam01636, APH, Phosphotransferase enzyme family	NA|365aa|up_4|CP023278.1_1752744_1753839_+	pfam17914, HopA1, HopA1 effector protein family	NA|269aa|up_3|CP023278.1_1753905_1754712_-	pfam05685, Uma2, Putative restriction endonuclease	NA|312aa|up_2|CP023278.1_1754826_1755762_-	cd00537, MTHFR, Methylenetetrahydrofolate reductase (MTHFR)	NA|336aa|up_1|CP023278.1_1755917_1756925_-	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	RT|590aa|up_0|CP023278.1_1757469_1759239_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|293aa|down_0|CP023278.1_1760190_1761069_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|181aa|down_1|CP023278.1_1762171_1762714_+	cd05379, CAP_bacterial, Bacterial CAP (cysteine-rich secretory proteins, antigen 5, and pathogenesis-related 1 proteins) domain proteins	NA|461aa|down_2|CP023278.1_1762735_1764118_-	COG5659, COG5659, FOG: Transposase [DNA replication, recombination, and repair]	NA|633aa|down_3|CP023278.1_1764360_1766259_+	COG3596, COG3596, Predicted GTPase [General function prediction only]	NA|225aa|down_4|CP023278.1_1766354_1767029_+	COG3544, COG3544, Uncharacterized protein conserved in bacteria [Function unknown]	NA|450aa|down_5|CP023278.1_1767097_1768447_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|179aa|down_6|CP023278.1_1768590_1769127_-	NA	NA|1784aa|down_7|CP023278.1_1769369_1774721_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|770aa|down_8|CP023278.1_1774908_1777218_-	COG3349, COG3349, Uncharacterized conserved protein [Function unknown]	NA|416aa|down_9|CP023278.1_1777289_1778537_-	NA
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	8	1761163-1761569	4,7,3	PILER-CR,CRISPRCasFinder,CRT	no	RT	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Unclear	GTTTCAGTCCCCTTGCGGGGAAAAGGTTAATGGAAAC,GTTTCAGTCCCCTTGCGGGGAAAAGGTTAATGGAAAC,GTTTCAGTCCCCTTGCGGGGAAAAGGTTAATGGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	5,5,5	5	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|86aa|up_9|CP023278.1_1750182_1750440_+,NA|84aa|up_8|CP023278.1_1750578_1750830_+,NA|83aa|up_7|CP023278.1_1750968_1751217_+,NA|179aa|down_5|CP023278.1_1768590_1769127_-,NA|416aa|down_8|CP023278.1_1777289_1778537_-	NA|86aa|up_9|CP023278.1_1750182_1750440_+	NA	NA|84aa|up_8|CP023278.1_1750578_1750830_+	NA	NA|83aa|up_7|CP023278.1_1750968_1751217_+	NA	NA|391aa|up_6|CP023278.1_1751392_1752565_+	pfam01636, APH, Phosphotransferase enzyme family	NA|365aa|up_5|CP023278.1_1752744_1753839_+	pfam17914, HopA1, HopA1 effector protein family	NA|269aa|up_4|CP023278.1_1753905_1754712_-	pfam05685, Uma2, Putative restriction endonuclease	NA|312aa|up_3|CP023278.1_1754826_1755762_-	cd00537, MTHFR, Methylenetetrahydrofolate reductase (MTHFR)	NA|336aa|up_2|CP023278.1_1755917_1756925_-	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	RT|590aa|up_1|CP023278.1_1757469_1759239_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|293aa|up_0|CP023278.1_1760190_1761069_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|181aa|down_0|CP023278.1_1762171_1762714_+	cd05379, CAP_bacterial, Bacterial CAP (cysteine-rich secretory proteins, antigen 5, and pathogenesis-related 1 proteins) domain proteins	NA|461aa|down_1|CP023278.1_1762735_1764118_-	COG5659, COG5659, FOG: Transposase [DNA replication, recombination, and repair]	NA|633aa|down_2|CP023278.1_1764360_1766259_+	COG3596, COG3596, Predicted GTPase [General function prediction only]	NA|225aa|down_3|CP023278.1_1766354_1767029_+	COG3544, COG3544, Uncharacterized protein conserved in bacteria [Function unknown]	NA|450aa|down_4|CP023278.1_1767097_1768447_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|179aa|down_5|CP023278.1_1768590_1769127_-	NA	NA|1784aa|down_6|CP023278.1_1769369_1774721_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|770aa|down_7|CP023278.1_1774908_1777218_-	COG3349, COG3349, Uncharacterized conserved protein [Function unknown]	NA|416aa|down_8|CP023278.1_1777289_1778537_-	NA	NA|165aa|down_9|CP023278.1_1778914_1779409_+	COG4446, COG4446, Uncharacterized protein conserved in bacteria [Function unknown]
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	9	1876938-1879661	8,4,5,6	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no		PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Orphan	GTTTCAGTCCCCTTGCGGGGAAATGATGTTTAGAAAC,GTTTCAGTCCCCTTGCGGGGAAATGATGTTTAGAAAC,GTTTCAGTCCCCTTGCGGGGAAATGATGTTTAGAAAC,GTTTCAGTCCCCTTGCGGGGAAATGATGTTTAGAAACT	37,37,37,38	0	0	NA	NA	NA:NA:NA:NA	36,36,35,35	36	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|191aa|up_8|CP023278.1_1868313_1868886_-,NA|165aa|up_6|CP023278.1_1871244_1871739_-,NA|227aa|down_2|CP023278.1_1885709_1886390_-	NA|123aa|up_9|CP023278.1_1867805_1868174_+	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|191aa|up_8|CP023278.1_1868313_1868886_-	NA	NA|597aa|up_7|CP023278.1_1869234_1871025_-	COG1217, TypA, Predicted membrane GTPase involved in stress response [Signal transduction mechanisms]	NA|165aa|up_6|CP023278.1_1871244_1871739_-	NA	NA|189aa|up_5|CP023278.1_1871808_1872375_-	pfam05685, Uma2, Putative restriction endonuclease	NA|332aa|up_4|CP023278.1_1872804_1873800_+	TIGR04185, RimK-like_ATP-grasp_domain_protein, ATP-grasp ribosomal peptide maturase, MvdC family	NA|504aa|up_3|CP023278.1_1873915_1875427_+	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|175aa|up_2|CP023278.1_1875510_1876035_-	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|106aa|up_1|CP023278.1_1876034_1876352_-	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|140aa|up_0|CP023278.1_1876394_1876814_-	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|117aa|down_0|CP023278.1_1882365_1882716_-	COG2146, {NirD}, Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases [Inorganic ion transport and metabolism / General function prediction only]	NA|864aa|down_1|CP023278.1_1883099_1885691_+	cd07302, CHD, cyclase homology domain	NA|227aa|down_2|CP023278.1_1885709_1886390_-	NA	NA|170aa|down_3|CP023278.1_1887080_1887590_-	COG2236, COG2236, Predicted phosphoribosyltransferases [General function prediction only]	NA|486aa|down_4|CP023278.1_1887682_1889140_+	COG2211, MelB, Na+/melibiose symporter and related transporters [Carbohydrate transport and metabolism]	NA|245aa|down_5|CP023278.1_1889196_1889931_-	cd03378, beta_CA_cladeC, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|662aa|down_6|CP023278.1_1890306_1892292_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|381aa|down_7|CP023278.1_1892641_1893784_+	PRK13914, PRK13914, invasion associated endopeptidase	NA|466aa|down_8|CP023278.1_1893815_1895213_+	cd01298, ATZ_TRZ_like, TRZ/ATZ family contains enzymes from the atrazine degradation pathway and related hydrolases	NA|168aa|down_9|CP023278.1_1895549_1896053_+	sd00006, TPR, Tetratricopeptide repeat
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	10	2115351-2116662	7,9,5	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Type I-D	GTTTCAGTCCCCTTGCGGGGAAATGGTTGATGGAAAC,GTTTCAGTCCCCTTGCGGGGAAATGGTTGATGGAAAC,GTTTCAGTCCCCTTGCGGGGAAATGGTTGATGGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	17,17,17	17	TypeI-D	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|124aa|up_6|CP023278.1_2108870_2109242_-,NA	NA|461aa|up_9|CP023278.1_2105503_2106886_+	COG5659, COG5659, FOG: Transposase [DNA replication, recombination, and repair]	NA|192aa|up_8|CP023278.1_2107104_2107680_-	pfam13975, gag-asp_proteas, gag-polyprotein putative aspartyl protease	NA|258aa|up_7|CP023278.1_2108011_2108785_-	cd17767, UP_EcUdp-like, uridine phosphorylases similar to Escherichia coli Udp and related phosphorylases	NA|124aa|up_6|CP023278.1_2108870_2109242_-	NA	NA|303aa|up_5|CP023278.1_2109457_2110366_-	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain	NA|60aa|up_4|CP023278.1_2110487_2110667_+	PRK08264, PRK08264, SDR family oxidoreductase	NA|330aa|up_3|CP023278.1_2110972_2111962_-	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|550aa|up_2|CP023278.1_2111951_2113601_-	pfam00665, rve, Integrase core domain	NA|212aa|up_1|CP023278.1_2113602_2114238_-	PRK13413, mpi, master DNA invertase Mpi family serine-type recombinase	NA|204aa|up_0|CP023278.1_2114542_2115154_+	PRK08264, PRK08264, SDR family oxidoreductase	cas2|98aa|down_0|CP023278.1_2116882_2117176_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|326aa|down_1|CP023278.1_2117178_2118156_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|201aa|down_2|CP023278.1_2118189_2118792_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas6|276aa|down_3|CP023278.1_2118822_2119650_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csc1gr5|242aa|down_4|CP023278.1_2119618_2120344_-	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	csc2gr7|331aa|down_5|CP023278.1_2120467_2121460_-	pfam18320, Csc2, Csc2 Crispr	cas10d|986aa|down_6|CP023278.1_2121460_2124418_-	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	cas3|706aa|down_7|CP023278.1_2124437_2126555_-	TIGR03158, cas3_cyano, CRISPR-associated helicase Cas3, subtype CYANO	WYL|320aa|down_8|CP023278.1_2126606_2127566_+	pfam13280, WYL, WYL domain	NA|193aa|down_9|CP023278.1_2127604_2128183_-	pfam05685, Uma2, Putative restriction endonuclease
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	11	2262107-2263410	8,10,6	PILER-CR,CRISPRCasFinder,CRT	no	PD-DExK	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Unclear	GTTTCAGTCCCCTTGCGGGGAAATGTTTGATGGAAAC,GTTTCCATCAAACATTTCCCCGCAAGGGGACTGAAAC,GTTTCCATCAAACATTTCCCCGCAAGGGGACTGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	16,17,17	17	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|156aa|up_7|CP023278.1_2256500_2256968_+,NA|84aa|up_6|CP023278.1_2256939_2257191_+,NA|250aa|up_5|CP023278.1_2257228_2257978_-,NA|142aa|up_4|CP023278.1_2258104_2258530_+,PD-DExK|202aa|down_4|CP023278.1_2268650_2269256_+	NA|640aa|up_9|CP023278.1_2253345_2255265_+	cd07551, P-type_ATPase_HM_ZosA_PfeT-like, P-type heavy metal-transporting ATPase, similar to Bacillus subtilis ZosA/PfeT which transports copper, and perhaps zinc under oxidative stress, and perhaps ferrous iron	NA|380aa|up_8|CP023278.1_2255261_2256401_+	COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism]	NA|156aa|up_7|CP023278.1_2256500_2256968_+	NA	NA|84aa|up_6|CP023278.1_2256939_2257191_+	NA	NA|250aa|up_5|CP023278.1_2257228_2257978_-	NA	NA|142aa|up_4|CP023278.1_2258104_2258530_+	NA	NA|99aa|up_3|CP023278.1_2258539_2258836_+	COG2329, COG2329, Uncharacterized enzyme involved in biosynthesis of extracellular polysaccharides [General function prediction only]	NA|717aa|up_2|CP023278.1_2258939_2261090_+	TIGR00644, recJ, single-stranded-DNA-specific exonuclease RecJ	NA|41aa|up_1|CP023278.1_2261232_2261355_-	PRK13686, PRK13686, photosystem II reaction center protein Ycf12	NA|117aa|up_0|CP023278.1_2261481_2261832_+	COG0727, COG0727, Predicted Fe-S-cluster oxidoreductase [General function prediction only]	NA|242aa|down_0|CP023278.1_2263738_2264464_+	COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase [Posttranslational modification, protein turnover, chaperones]	NA|282aa|down_1|CP023278.1_2264677_2265523_-	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|314aa|down_2|CP023278.1_2265610_2266552_+	COG0204, PlsC, 1-acyl-sn-glycerol-3-phosphate acyltransferase [Lipid metabolism]	NA|591aa|down_3|CP023278.1_2266554_2268327_-	COG3391, COG3391, Uncharacterized conserved protein [Function unknown]	PD-DExK|202aa|down_4|CP023278.1_2268650_2269256_+	NA	NA|172aa|down_5|CP023278.1_2269295_2269811_-	pfam00582, Usp, Universal stress protein family	NA|297aa|down_6|CP023278.1_2270122_2271013_-	COG1176, PotB, ABC-type spermidine/putrescine transport system, permease component I [Amino acid transport and metabolism]	NA|361aa|down_7|CP023278.1_2271156_2272239_-	cd13590, PBP2_PotD_PotF_like, The periplasmic-binding component of ABC transporters involved in uptake of polyamines; possess the type 2 periplasmic binding fold	NA|124aa|down_8|CP023278.1_2272340_2272712_-	PRK12275, PRK12275, hypothetical protein; Reviewed	NA|378aa|down_9|CP023278.1_2272780_2273914_-	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	12	2659277-2659849	11,7,9	CRISPRCasFinder,CRT,PILER-CR	no		PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Orphan	GTTTCCGTCCCCTTGCGGGGAAAAGGTGGTGTTTAACTA,GTTTCCGTCCCCTTGCGGGGAAAAGGTGGTGTTTAAC,GTTTCCGTCCCCTTGCGGGGAAAAGGTGGTGTTTAAC	39,37,37	0	0	NA	NA	NA:NA:NA	7,7,6	7	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA,NA|93aa|down_7|CP023278.1_2675122_2675401_-,NA|351aa|down_8|CP023278.1_2675462_2676515_-,NA|220aa|down_9|CP023278.1_2676516_2677176_-	NA|457aa|up_9|CP023278.1_2648154_2649525_-	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|454aa|up_8|CP023278.1_2650017_2651379_-	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|204aa|up_7|CP023278.1_2651822_2652434_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|431aa|up_6|CP023278.1_2652523_2653816_+	TIGR02971, devB-like_secretion_protein, ABC exporter membrane fusion protein, DevB family	NA|387aa|up_5|CP023278.1_2653921_2655082_+	TIGR01185, membrane_spanning_subunit, DevC protein	NA|251aa|up_4|CP023278.1_2655124_2655877_+	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|74aa|up_3|CP023278.1_2655962_2656184_-	pfam02594, DUF167, Uncharacterized ACR, YggU family COG1872	NA|187aa|up_2|CP023278.1_2656391_2656952_-	COG4244, COG4244, Predicted membrane protein [Function unknown]	NA|249aa|up_1|CP023278.1_2656963_2657710_-	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|352aa|up_0|CP023278.1_2657975_2659031_+	COG5592, COG5592, Uncharacterized conserved protein [Function unknown]	NA|395aa|down_0|CP023278.1_2660596_2661781_-	cd18079, S-AdoMet_synt, S-adenosylmethionine synthetase	NA|1004aa|down_1|CP023278.1_2661899_2664911_-	COG0474, MgtA, Cation transport ATPase [Inorganic ion transport and metabolism]	NA|568aa|down_2|CP023278.1_2665383_2667087_-	cd01948, EAL, EAL domain	NA|420aa|down_3|CP023278.1_2667169_2668429_-	cd19920, REC_PA4781-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase PA4781 and similar domains	NA|622aa|down_4|CP023278.1_2668590_2670456_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|775aa|down_5|CP023278.1_2671307_2673632_+	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|365aa|down_6|CP023278.1_2673975_2675070_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|93aa|down_7|CP023278.1_2675122_2675401_-	NA	NA|351aa|down_8|CP023278.1_2675462_2676515_-	NA	NA|220aa|down_9|CP023278.1_2676516_2677176_-	NA
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	14	3651650-3651777	13	CRISPRCasFinder	no		PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Orphan	CGGCTAAACGCCGCGCTACTGCTAACAGCACTCA	34	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|278aa|up_5|CP023278.1_3644927_3645761_-,NA	NA|744aa|up_9|CP023278.1_3639082_3641314_-	COG3206, GumC, Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane]	NA|361aa|up_8|CP023278.1_3641491_3642574_-	cd03804, GT4_WbaZ-like, mannosyltransferase WbaZ and similar proteins	NA|438aa|up_7|CP023278.1_3642857_3644171_-	COG4091, COG4091, Predicted homoserine dehydrogenase [Amino acid transport and metabolism]	NA|190aa|up_6|CP023278.1_3644282_3644852_-	cd00438, cupin_RmlC, RmlC carbohydrate epimerase, involved in dTDP-L-rhamnose production	NA|278aa|up_5|CP023278.1_3644927_3645761_-	NA	NA|350aa|up_4|CP023278.1_3645764_3646814_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|343aa|up_3|CP023278.1_3646916_3647945_-	cd08946, SDR_e, extended (e) SDRs	NA|314aa|up_2|CP023278.1_3648048_3648990_-	cd04194, GT8_A4GalT_like, A4GalT_like proteins catalyze the addition of galactose or glucose residues to the lipooligosaccharide (LOS) or lipopolysaccharide (LPS) of the bacterial cell surface	NA|406aa|up_1|CP023278.1_3649086_3650304_-	PRK11728, PRK11728, L-2-hydroxyglutarate oxidase	NA|259aa|up_0|CP023278.1_3650462_3651239_-	cd02524, G1P_cytidylyltransferase, G1P_cytidylyltransferase catalyzes the production of CDP-D-Glucose	NA|1977aa|down_0|CP023278.1_3651795_3657726_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|730aa|down_1|CP023278.1_3657912_3660102_-	COG4248, COG4248, Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains [General function prediction only]	NA|259aa|down_2|CP023278.1_3660218_3660995_-	pfam13672, PP2C_2, Protein phosphatase 2C	NA|225aa|down_3|CP023278.1_3661177_3661852_-	COG4245, TerY, Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [General function prediction only]	NA|464aa|down_4|CP023278.1_3662357_3663749_+	cd17602, REC_PatA-like, phosphoacceptor receiver (REC) domain of PatA and similar domains	NA|666aa|down_5|CP023278.1_3664035_3666033_-	pfam03068, PAD, Protein-arginine deiminase (PAD)	NA|274aa|down_6|CP023278.1_3666833_3667655_+	cd02146, NfsA-like, nitroreductase similar to Escherichia coli NfsA	NA|250aa|down_7|CP023278.1_3667742_3668492_-	cd08934, CAD_SDR_c, clavulanic acid dehydrogenase (CAD), classical (c) SDR	NA|200aa|down_8|CP023278.1_3668790_3669390_-	cd02969, PRX_like1, Peroxiredoxin (PRX)-like 1 family; hypothetical proteins that show sequence similarity to PRXs	NA|305aa|down_9|CP023278.1_3669462_3670377_-	PRK09553, tauD, taurine dioxygenase; Reviewed
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	15	3760342-3761400	10,14,8	PILER-CR,CRISPRCasFinder,CRT	no	c2c5_V-U5	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Type V-U5	GTTTCAACGACCATCCCGGCTAGGGGTGGGTTGAAAG,GTTTCAACGACCATCCCGGCTAGGGGTGGGTTGAAAG,GTTTCAACGACCATCCCGGCTAGGGGTGGGTTGAAAG	37,37,37	0	0	NA	NA	V-U5:V-U5:V-U5	13,14,14	14	TypeV-U5	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|226aa|up_7|CP023278.1_3749308_3749986_-,NA|135aa|up_6|CP023278.1_3749985_3750390_-,NA|113aa|up_4|CP023278.1_3753064_3753403_+,NA|106aa|up_3|CP023278.1_3753625_3753943_+,NA|105aa|up_2|CP023278.1_3754084_3754399_+,NA	NA|402aa|up_9|CP023278.1_3746290_3747496_-	cd17261, RMtype1_S_EcoKI-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit TRD-CR, similar to Escherichia coli str	NA|602aa|up_8|CP023278.1_3747492_3749298_-	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|226aa|up_7|CP023278.1_3749308_3749986_-	NA	NA|135aa|up_6|CP023278.1_3749985_3750390_-	NA	NA|798aa|up_5|CP023278.1_3750399_3752793_-	COG4096, HsdR, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|113aa|up_4|CP023278.1_3753064_3753403_+	NA	NA|106aa|up_3|CP023278.1_3753625_3753943_+	NA	NA|105aa|up_2|CP023278.1_3754084_3754399_+	NA	NA|436aa|up_1|CP023278.1_3755260_3756568_+	pfam13546, DDE_5, DDE superfamily endonuclease	c2c5_V-U5|636aa|up_0|CP023278.1_3757938_3759846_+	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|298aa|down_0|CP023278.1_3761927_3762821_+	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|306aa|down_1|CP023278.1_3762933_3763851_+	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|417aa|down_2|CP023278.1_3764481_3765732_+	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|152aa|down_3|CP023278.1_3766067_3766523_+	pfam01475, FUR, Ferric uptake regulator family	NA|413aa|down_4|CP023278.1_3766836_3768075_-	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|1360aa|down_5|CP023278.1_3768089_3772169_-	PRK05989, cobN, cobaltochelatase subunit CobN; Reviewed	NA|644aa|down_6|CP023278.1_3772261_3774193_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|231aa|down_7|CP023278.1_3774369_3775062_+	PRK01130, PRK01130, putative N-acetylmannosamine-6-phosphate 2-epimerase	NA|304aa|down_8|CP023278.1_3776562_3777474_+	TIGR04052, hypothetical_protein_MettrDRAFT_3899, AZL_007920/MXAN_0976 family protein	NA|387aa|down_9|CP023278.1_3777506_3778667_+	TIGR04039, MXAN_0977_Heme2, di-heme enzyme, MXAN_0977 family
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	16	3775407-3776476	11,15,9	PILER-CR,CRISPRCasFinder,CRT	no	c2c5_V-U5	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Type V-U5	TTTCAGTCCCCTTGCGGGGAAATAGTTGATGGAAAC,GTTTCCATCAACTATTTCCCCGCAAGGGGACTGAAAC,GTTTCCATCAACTATTTCCCCGCAAGGGGACTGAAAC	36,37,37	0	0	NA	NA	NA:NA:NA	14,14,14	14	TypeV-U5	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA,NA	NA|436aa|up_9|CP023278.1_3755260_3756568_+	pfam13546, DDE_5, DDE superfamily endonuclease	c2c5_V-U5|636aa|up_8|CP023278.1_3757938_3759846_+	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|298aa|up_7|CP023278.1_3761927_3762821_+	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|306aa|up_6|CP023278.1_3762933_3763851_+	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|417aa|up_5|CP023278.1_3764481_3765732_+	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|152aa|up_4|CP023278.1_3766067_3766523_+	pfam01475, FUR, Ferric uptake regulator family	NA|413aa|up_3|CP023278.1_3766836_3768075_-	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|1360aa|up_2|CP023278.1_3768089_3772169_-	PRK05989, cobN, cobaltochelatase subunit CobN; Reviewed	NA|644aa|up_1|CP023278.1_3772261_3774193_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|231aa|up_0|CP023278.1_3774369_3775062_+	PRK01130, PRK01130, putative N-acetylmannosamine-6-phosphate 2-epimerase	NA|304aa|down_0|CP023278.1_3776562_3777474_+	TIGR04052, hypothetical_protein_MettrDRAFT_3899, AZL_007920/MXAN_0976 family protein	NA|387aa|down_1|CP023278.1_3777506_3778667_+	TIGR04039, MXAN_0977_Heme2, di-heme enzyme, MXAN_0977 family	NA|222aa|down_2|CP023278.1_3778830_3779496_+	pfam05685, Uma2, Putative restriction endonuclease	NA|389aa|down_3|CP023278.1_3779896_3781063_+	TIGR01185, membrane_spanning_subunit, DevC protein	NA|491aa|down_4|CP023278.1_3781086_3782559_+	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|399aa|down_5|CP023278.1_3782722_3783919_-	TIGR02971, devB-like_secretion_protein, ABC exporter membrane fusion protein, DevB family	NA|238aa|down_6|CP023278.1_3784257_3784971_+	COG2138, COG2138, Sirohydrochlorin ferrochelatase [Inorganic ion transport and metabolism]	NA|264aa|down_7|CP023278.1_3784967_3785759_+	PRK06136, PRK06136, uroporphyrinogen-III C-methyltransferase	NA|230aa|down_8|CP023278.1_3787669_3788359_-	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|274aa|down_9|CP023278.1_3788478_3789300_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	17	4143778-4143906	16	CRISPRCasFinder	no		PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Orphan	TAAACGCCGCGCTACCTCTAACAGCACTCATTACT	35	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|75aa|up_8|CP023278.1_4129281_4129506_-,NA|69aa|up_7|CP023278.1_4129582_4129789_-,NA|82aa|up_6|CP023278.1_4129828_4130074_-,NA|167aa|down_1|CP023278.1_4144677_4145178_-,NA|51aa|down_3|CP023278.1_4146600_4146753_-,NA|240aa|down_8|CP023278.1_4151968_4152688_-	NA|394aa|up_9|CP023278.1_4128023_4129205_-	pfam01636, APH, Phosphotransferase enzyme family	NA|75aa|up_8|CP023278.1_4129281_4129506_-	NA	NA|69aa|up_7|CP023278.1_4129582_4129789_-	NA	NA|82aa|up_6|CP023278.1_4129828_4130074_-	NA	NA|251aa|up_5|CP023278.1_4130429_4131182_-	TIGR04500, PpiC_rel_mature, putative peptide maturation system protein	NA|508aa|up_4|CP023278.1_4131249_4132773_-	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|884aa|up_3|CP023278.1_4132808_4135460_-	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|242aa|up_2|CP023278.1_4135407_4136133_-	pfam13565, HTH_32, Homeodomain-like domain	NA|449aa|up_1|CP023278.1_4139930_4141277_-	PRK14360, glmU, bifunctional UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase GlmU	NA|689aa|up_0|CP023278.1_4141634_4143701_-	pfam00350, Dynamin_N, Dynamin family	NA|154aa|down_0|CP023278.1_4144159_4144621_+	cd07891, CYTH-like_CthTTM-like_1, CYTH-like Clostridium thermocellum TTM-like subgroup 1	NA|167aa|down_1|CP023278.1_4144677_4145178_-	NA	NA|231aa|down_2|CP023278.1_4145809_4146502_+	cd02588, HAD_L2-DEX, L-2-haloacid dehalogenase	NA|51aa|down_3|CP023278.1_4146600_4146753_-	NA	NA|457aa|down_4|CP023278.1_4147208_4148579_+	PRK09243, PRK09243, nicotinate phosphoribosyltransferase; Validated	NA|201aa|down_5|CP023278.1_4148673_4149276_+	PRK00071, nadD, nicotinate-nucleotide adenylyltransferase	NA|249aa|down_6|CP023278.1_4149236_4149983_+	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|571aa|down_7|CP023278.1_4150104_4151817_+	PRK13981, PRK13981, NAD synthetase; Provisional	NA|240aa|down_8|CP023278.1_4151968_4152688_-	NA	NA|272aa|down_9|CP023278.1_4152755_4153571_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	18	4425357-4425437	17	CRISPRCasFinder	no		PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Orphan	AAAATCCAAAATCCAAAATTCCAA	24	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|70aa|up_5|CP023278.1_4419580_4419790_+,NA|63aa|down_6|CP023278.1_4432617_4432806_+,NA|289aa|down_8|CP023278.1_4434321_4435188_+,NA|129aa|down_9|CP023278.1_4435269_4435656_+	NA|295aa|up_9|CP023278.1_4415928_4416813_+	COG0074, SucD, Succinyl-CoA synthetase, alpha subunit [Energy production and conversion]	NA|293aa|up_8|CP023278.1_4416911_4417790_-	PRK02259, PRK02259, aspartoacylase; Provisional	NA|147aa|up_7|CP023278.1_4417956_4418397_+	pfam02657, SufE, Fe-S metabolism associated domain	NA|300aa|up_6|CP023278.1_4418620_4419520_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|70aa|up_5|CP023278.1_4419580_4419790_+	NA	NA|230aa|up_4|CP023278.1_4419791_4420481_-	COG3599, DivIVA, Cell division initiation protein [Cell division and chromosome partitioning]	NA|191aa|up_3|CP023278.1_4420369_4420942_-	PRK00168, coaD, phosphopantetheine adenylyltransferase; Provisional	NA|292aa|up_2|CP023278.1_4421207_4422083_+	pfam01790, LGT, Prolipoprotein diacylglyceryl transferase	NA|259aa|up_1|CP023278.1_4422453_4423230_+	COG2875, CobM, Precorrin-4 methylase [Coenzyme metabolism]	NA|491aa|up_0|CP023278.1_4423508_4424981_+	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|447aa|down_0|CP023278.1_4425451_4426792_+	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|378aa|down_1|CP023278.1_4427017_4428151_+	cd08602, GDPD_ScGlpQ1_like, Glycerophosphodiester phosphodiesterase domain of Streptomycin coelicolor (GlpQ1) and similar proteins	NA|274aa|down_2|CP023278.1_4428447_4429269_-	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|571aa|down_3|CP023278.1_4429691_4431404_+	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|101aa|down_4|CP023278.1_4431483_4431786_+	PRK05580, PRK05580, primosome assembly protein PriA; Validated	NA|255aa|down_5|CP023278.1_4431845_4432610_+	COG0455, flhG, Antiactivator of flagellar biosynthesis FleN, an ATPase [Cell motility]	NA|63aa|down_6|CP023278.1_4432617_4432806_+	NA	NA|384aa|down_7|CP023278.1_4432826_4433978_+	cd06451, AGAT_like, Alanine-glyoxylate aminotransferase (AGAT) family	NA|289aa|down_8|CP023278.1_4434321_4435188_+	NA	NA|129aa|down_9|CP023278.1_4435269_4435656_+	NA
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	20	4534142-4534294	19	CRISPRCasFinder	no		PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Orphan	GTGCTGAGTAATAAGTAATGAGT	23	3	189	4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534165-4534186|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534210-4534228|4534252-4534271	CP023278.1_48163-48142|CP023278.1_149224-149245|CP023278.1_151378-151357|CP023278.1_269332-269311|CP023278.1_393276-393255|CP023278.1_422076-422097|CP023278.1_422107-422128|CP023278.1_662047-662026|CP023278.1_778522-778543|CP023278.1_1143150-1143129|CP023278.1_1200194-1200215|CP023278.1_1291587-1291608|CP023278.1_1305677-1305656|CP023278.1_1306558-1306579|CP023278.1_1462200-1462179|CP023278.1_1557972-1557993|CP023278.1_1620260-1620239|CP023278.1_1627080-1627101|CP023278.1_2089113-2089134|CP023278.1_2150089-2150068|CP023278.1_2205696-2205717|CP023278.1_2241887-2241866|CP023278.1_2287339-2287318|CP023278.1_2449226-2449247|CP023278.1_2535122-2535101|CP023278.1_2591354-2591333|CP023278.1_2673765-2673744|CP023278.1_2836675-2836654|CP023278.1_3134900-3134879|CP023278.1_3141001-3141022|CP023278.1_3171244-3171265|CP023278.1_3191220-3191241|CP023278.1_3350180-3350159|CP023278.1_3455357-3455336|CP023278.1_3512397-3512376|CP023278.1_3550964-3550985|CP023278.1_3557857-3557836|CP023278.1_3621457-3621478|CP023278.1_3996617-3996596|CP023278.1_4031784-4031805|CP023278.1_4045095-4045074|CP023278.1_4075672-4075693|CP023278.1_4111546-4111525|CP023278.1_4121413-4121434|CP023278.1_4188308-4188287|CP023278.1_4347281-4347260|CP023278.1_4469808-4469829|CP023278.1_4535470-4535491|CP023278.1_4604236-4604257|CP023278.1_4620922-4620943|CP023278.1_4673745-4673766|CP023278.1_4673783-4673804|CP023278.1_4685602-4685623|CP023278.1_4985426-4985405|CP023278.1_5070449-5070470|CP023278.1_5082398-5082419|CP023278.1_5109480-5109501|CP023278.1_5117380-5117401|CP023278.1_5179819-5179798|CP023278.1_5261908-5261929|CP023278.1_5291130-5291151|CP023278.1_5347665-5347644|CP023278.1_5402294-5402273|CP023278.1_5403086-5403107|CP023278.1_5406573-5406552|CP023278.1_5446473-5446452|CP023278.1_5539911-5539890|CP023278.1_5652045-5652024|CP023278.1_5871563-5871584|CP023278.1_5875347-5875326|CP023278.1_5975639-5975618|CP023278.1_6112077-6112098|CP023278.1_6180406-6180427|CP023278.1_6236714-6236693|CP023278.1_6287084-6287063|CP023278.1_6362575-6362554|CP023278.1_6518142-6518121|CP023278.1_6569335-6569314|CP023278.1_6569471-6569450|CP023278.1_6634470-6634449|CP023278.1_6712157-6712178|CP023278.1_6724976-6724997|CP023278.1_6737431-6737410|CP023278.1_260999-261020|CP023278.1_379868-379889|CP023278.1_1004291-1004270|CP023278.1_1040723-1040702|CP023278.1_1197680-1197701|CP023278.1_1336345-1336366|CP023278.1_1780731-1780752|CP023278.1_1844835-1844814|CP023278.1_2013134-2013113|CP023278.1_2301967-2301988|CP023278.1_2436123-2436144|CP023278.1_2436154-2436175|CP023278.1_2614931-2614910|CP023278.1_2967692-2967713|CP023278.1_3450159-3450180|CP023278.1_3618760-3618739|CP023278.1_3631151-3631130|CP023278.1_4188256-4188235|CP023278.1_4462989-4462968|CP023278.1_4673043-4673022|CP023278.1_4673074-4673053|CP023278.1_4871043-4871022|CP023278.1_5091029-5091050|CP023278.1_5446421-5446400|CP023278.1_5701769-5701790|CP023278.1_5730602-5730581|CP023278.1_5790413-5790392|CP023278.1_6128243-6128264|CP023278.1_6329213-6329234|CP023278.1_743324-743345|CP023278.1_768381-768360|CP023278.1_3063179-3063158|CP023278.1_3150571-3150592|CP023278.1_3171289-3171310|CP023278.1_3342215-3342194|CP023278.1_3570864-3570885|CP023278.1_3627312-3627291|CP023278.1_4035777-4035798|CP023278.1_4188211-4188190|CP023278.1_4856147-4856126|CP023278.1_5461726-5461747|CP023278.1_5461757-5461778|CP023278.1_5539973-5539952|CP023278.1_6514535-6514514|CP023278.1_6515251-6515272|CP023278.1_6712112-6712133|CP023278.1_6725070-6725091|CP023278.1_1273263-1273245|CP023278.1_1306634-1306616|CP023278.1_3022474-3022456|CP023278.1_3022488-3022470|CP023278.1_3450190-3450208|CP023278.1_4040306-4040324|CP023278.1_6726110-6726092|CP023278.1_48125-48107|CP023278.1_48132-48114|CP023278.1_1161447-1161465|CP023278.1_1273277-1273259|CP023278.1_1305601-1305619|CP023278.1_1306620-1306602|CP023278.1_2064877-2064895|CP023278.1_2064884-2064902|CP023278.1_2064891-2064909|CP023278.1_2287119-2287137|CP023278.1_2502261-2502279|CP023278.1_2502317-2502335|CP023278.1_2875059-2875077|CP023278.1_2967650-2967668|CP023278.1_2986325-2986343|CP023278.1_3022502-3022484|CP023278.1_3281737-3281755|CP023278.1_3450204-3450222|CP023278.1_3555218-3555200|CP023278.1_3903324-3903342|CP023278.1_4035836-4035854|CP023278.1_4035843-4035861|CP023278.1_4040327-4040345|CP023278.1_4206709-4206727|CP023278.1_4441528-4441546|CP023278.1_4468281-4468263|CP023278.1_4468295-4468277|CP023278.1_4697471-4697453|CP023278.1_5563969-5563951|CP023278.1_6461679-6461697|CP023278.1_6461686-6461704|CP023278.1_6725021-6725039|CP023278.1_6726124-6726106|CP023278.1_1273249-1273231|CP023278.1_1306627-1306609|CP023278.1_1686416-1686398|CP023278.1_2502289-2502307|CP023278.1_2716782-2716800|CP023278.1_3385687-3385669|CP023278.1_3414541-3414523|CP023278.1_3450197-3450215|CP023278.1_3555169-3555151|CP023278.1_3555260-3555242|CP023278.1_4040334-4040352|CP023278.1_4468288-4468270|CP023278.1_4828716-4828698|CP023278.1_5395701-5395719|CP023278.1_5611650-5611632|CP023278.1_5763834-5763852|CP023278.1_6461672-6461690|CP023278.1_6826871-6826889|CP023278.1_4035871-4035890	NA	3	3	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA,NA	NA|719aa|up_9|CP023278.1_4521820_4523977_+	PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase; Provisional	NA|88aa|up_8|CP023278.1_4524191_4524455_+	pfam11344, DUF3146, Protein of unknown function (DUF3146)	NA|145aa|up_7|CP023278.1_4524644_4525079_-	smart00732, YqgFc, Likely ribonuclease with RNase H fold	NA|471aa|up_6|CP023278.1_4525130_4526543_-	COG4372, COG4372, Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]	NA|224aa|up_5|CP023278.1_4526846_4527518_-	TIGR03697, NtcA_cyano, global nitrogen regulator NtcA, cyanobacterial	NA|259aa|up_4|CP023278.1_4528149_4528926_+	PRK07370, PRK07370, enoyl-[acyl-carrier-protein] reductase FabI	NA|210aa|up_3|CP023278.1_4529094_4529724_+	PRK00951, hisB, imidazoleglycerol-phosphate dehydratase HisB	NA|299aa|up_2|CP023278.1_4529931_4530828_+	COG0385, COG0385, Predicted Na+-dependent transporter [General function prediction only]	NA|328aa|up_1|CP023278.1_4531054_4532038_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|492aa|up_0|CP023278.1_4532601_4534077_+	COG1596, Wza, Periplasmic protein involved in polysaccharide export, contains    SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis]	NA|279aa|down_0|CP023278.1_4534560_4535397_+	COG0330, HflC, Membrane protease subunits, stomatin/prohibitin homologs [Posttranslational modification, protein turnover, chaperones]	NA|290aa|down_1|CP023278.1_4535667_4536537_-	cd00739, DHPS, DHPS subgroup of Pterin binding enzymes	NA|242aa|down_2|CP023278.1_4536715_4537441_-	PRK00042, tpiA, triosephosphate isomerase; Provisional	NA|292aa|down_3|CP023278.1_4539672_4540548_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|290aa|down_4|CP023278.1_4540640_4541510_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|282aa|down_5|CP023278.1_4544215_4545061_-	PRK10463, PRK10463, hydrogenase nickel incorporation protein HypB; Provisional	NA|114aa|down_6|CP023278.1_4545051_4545393_-	pfam01155, HypA, Hydrogenase/urease nickel incorporation, metallochaperone, hypA	NA|368aa|down_7|CP023278.1_4545411_4546515_-	TIGR02124, Hydrogenase_expression/formation_protein_HypE, hydrogenase expression/formation protein HypE	NA|72aa|down_8|CP023278.1_4546603_4546819_-	COG1942, COG1942, Uncharacterized protein, 4-oxalocrotonate tautomerase homolog [General function prediction only]	NA|378aa|down_9|CP023278.1_4546872_4548006_-	PRK15062, PRK15062, hydrogenase isoenzymes formation protein HypD; Provisional
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	21	4537835-4539431	12,20,10	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Orphan	GTTTCAGTCCCCTTGCGGGGAAATGGTTGATGGAAAC,GTTTCCATCAACCATTTCCCCGCAAGGGGACTGAAAC,GTTTCCATCAACCATTTCCCCGCAAGGGGACTGAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	21,21,21	21	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|71aa|up_3|CP023278.1_4534281_4534494_+,NA	NA|224aa|up_9|CP023278.1_4526846_4527518_-	TIGR03697, NtcA_cyano, global nitrogen regulator NtcA, cyanobacterial	NA|259aa|up_8|CP023278.1_4528149_4528926_+	PRK07370, PRK07370, enoyl-[acyl-carrier-protein] reductase FabI	NA|210aa|up_7|CP023278.1_4529094_4529724_+	PRK00951, hisB, imidazoleglycerol-phosphate dehydratase HisB	NA|299aa|up_6|CP023278.1_4529931_4530828_+	COG0385, COG0385, Predicted Na+-dependent transporter [General function prediction only]	NA|328aa|up_5|CP023278.1_4531054_4532038_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|492aa|up_4|CP023278.1_4532601_4534077_+	COG1596, Wza, Periplasmic protein involved in polysaccharide export, contains    SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis]	NA|71aa|up_3|CP023278.1_4534281_4534494_+	NA	NA|279aa|up_2|CP023278.1_4534560_4535397_+	COG0330, HflC, Membrane protease subunits, stomatin/prohibitin homologs [Posttranslational modification, protein turnover, chaperones]	NA|290aa|up_1|CP023278.1_4535667_4536537_-	cd00739, DHPS, DHPS subgroup of Pterin binding enzymes	NA|242aa|up_0|CP023278.1_4536715_4537441_-	PRK00042, tpiA, triosephosphate isomerase; Provisional	NA|292aa|down_0|CP023278.1_4539672_4540548_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|290aa|down_1|CP023278.1_4540640_4541510_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|282aa|down_2|CP023278.1_4544215_4545061_-	PRK10463, PRK10463, hydrogenase nickel incorporation protein HypB; Provisional	NA|114aa|down_3|CP023278.1_4545051_4545393_-	pfam01155, HypA, Hydrogenase/urease nickel incorporation, metallochaperone, hypA	NA|368aa|down_4|CP023278.1_4545411_4546515_-	TIGR02124, Hydrogenase_expression/formation_protein_HypE, hydrogenase expression/formation protein HypE	NA|72aa|down_5|CP023278.1_4546603_4546819_-	COG1942, COG1942, Uncharacterized protein, 4-oxalocrotonate tautomerase homolog [General function prediction only]	NA|378aa|down_6|CP023278.1_4546872_4548006_-	PRK15062, PRK15062, hydrogenase isoenzymes formation protein HypD; Provisional	NA|89aa|down_7|CP023278.1_4548171_4548438_-	pfam01455, HupF_HypC, HupF/HypC family	NA|808aa|down_8|CP023278.1_4548554_4550978_-	COG0068, HypF, Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]	NA|394aa|down_9|CP023278.1_4550967_4552149_-	cd05819, NHL, NHL repeat unit of beta-propeller proteins
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	22	5234117-5234212	21	CRISPRCasFinder	no		PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Orphan	TGAGTGGTGTTAGCGGTAGCGCGGCGTTTA	30	1	9	5234147-5234182|5234147-5234182|5234147-5234182|5234147-5234182|5234147-5234182|5234147-5234182|5234147-5234182|5234147-5234182|5234147-5234182	CP023278.1_4620940-4620975|CP023278.1_6569453-6569418|CP023278.1_1200212-1200247|CP023278.1_2298736-2298771|CP023278.1_5518565-5518600|CP023278.1_1273290-1273255|CP023278.1_3450177-3450212|CP023278.1_5875329-5875294|CP023278.1_6514517-6514482	NA	1	1	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA,NA|471aa|down_5|CP023278.1_5240135_5241548_+,NA|466aa|down_6|CP023278.1_5241646_5243044_-,NA|128aa|down_7|CP023278.1_5243159_5243543_-	NA|127aa|up_9|CP023278.1_5214576_5214957_-	pfam12680, SnoaL_2, SnoaL-like domain	NA|1069aa|up_8|CP023278.1_5215012_5218219_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|2182aa|up_7|CP023278.1_5218343_5224889_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|133aa|up_6|CP023278.1_5225257_5225656_-	PLN02895, PLN02895, phosphoacetylglucosamine mutase	NA|570aa|up_5|CP023278.1_5225840_5227550_-	cd01583, IPMI, 3-isopropylmalate dehydratase catalyzes the isomerization between 2-isopropylmalate and 3-isopropylmalate	NA|364aa|up_4|CP023278.1_5227533_5228625_-	PRK00772, PRK00772, 3-isopropylmalate dehydrogenase; Provisional	NA|334aa|up_3|CP023278.1_5229036_5230038_-	PRK14299, PRK14299, chaperone protein DnaJ; Provisional	NA|234aa|up_2|CP023278.1_5230713_5231415_-	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|389aa|up_1|CP023278.1_5231543_5232710_-	TIGR01185, membrane_spanning_subunit, DevC protein	NA|399aa|up_0|CP023278.1_5232889_5234086_-	TIGR02971, devB-like_secretion_protein, ABC exporter membrane fusion protein, DevB family	NA|206aa|down_0|CP023278.1_5234481_5235099_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|423aa|down_1|CP023278.1_5235237_5236506_+	TIGR02227, Inactive_signal_peptidase_IA	NA|205aa|down_2|CP023278.1_5236758_5237373_+	pfam11210, DUF2996, Protein of unknown function (DUF2996)	NA|171aa|down_3|CP023278.1_5237624_5238137_-	PHA01886, PHA01886, TM2 domain-containing protein	NA|359aa|down_4|CP023278.1_5238938_5240015_+	PRK13654, PRK13654, magnesium-protoporphyrin IX monomethyl ester cyclase; Provisional	NA|471aa|down_5|CP023278.1_5240135_5241548_+	NA	NA|466aa|down_6|CP023278.1_5241646_5243044_-	NA	NA|128aa|down_7|CP023278.1_5243159_5243543_-	NA	NA|172aa|down_8|CP023278.1_5244798_5245314_-	COG3685, COG3685, Uncharacterized protein conserved in bacteria [Function unknown]	NA|227aa|down_9|CP023278.1_5245604_5246285_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	23	6629058-6629529	22,11,13	CRISPRCasFinder,CRT,PILER-CR	no		PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Orphan	GTTTCCGTCCCCGTGAAGGGGAATAAAT,GTTTCCGTCCCCGTGAAGGGGAATAAATGGAAAAC,GTTTCCGTCCCCGTGAAGGGGAATAAATGGAAAAC	28,35,35	0	0	NA	NA	NA:NA:NA	6,6,6	6	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|197aa|up_9|CP023278.1_6617112_6617703_+,NA|64aa|up_4|CP023278.1_6623340_6623532_+,NA|138aa|up_3|CP023278.1_6623659_6624073_-,NA|220aa|down_6|CP023278.1_6641148_6641808_-,NA|122aa|down_9|CP023278.1_6645861_6646227_-	NA|197aa|up_9|CP023278.1_6617112_6617703_+	NA	NA|57aa|up_8|CP023278.1_6617871_6618042_-	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|56aa|up_7|CP023278.1_6619702_6619870_-	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|75aa|up_6|CP023278.1_6619856_6620081_-	COG0864, NikR, Predicted transcriptional regulators containing the CopG/Arc/MetJ DNA-binding domain and a metal-binding domain [Transcription]	NA|843aa|up_5|CP023278.1_6620310_6622839_-	cd05805, MPG1_transferase, GTP-mannose-1-phosphate guanyltransferase (MPG1 transferase), also known as GDP-mannose pyrophosphorylase, is a bifunctional enzyme with both phosphomannose isomerase (PMI) activity and GDP-mannose phosphorylase (GMP) activity	NA|64aa|up_4|CP023278.1_6623340_6623532_+	NA	NA|138aa|up_3|CP023278.1_6623659_6624073_-	NA	NA|90aa|up_2|CP023278.1_6624451_6624721_-	PRK12864, PRK12864, YciI-like protein; Reviewed	NA|191aa|up_1|CP023278.1_6624793_6625366_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|811aa|up_0|CP023278.1_6625609_6628042_-	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|684aa|down_0|CP023278.1_6629929_6631981_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|83aa|down_1|CP023278.1_6632234_6632483_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|467aa|down_2|CP023278.1_6632931_6634332_-	cd07136, ALDH_YwdH-P39616, Bacillus subtilis aldehyde dehydrogenase ywdH-like	NA|367aa|down_3|CP023278.1_6634494_6635595_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|628aa|down_4|CP023278.1_6638261_6640145_-	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|226aa|down_5|CP023278.1_6640290_6640968_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|220aa|down_6|CP023278.1_6641148_6641808_-	NA	NA|466aa|down_7|CP023278.1_6642055_6643453_-	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|724aa|down_8|CP023278.1_6643557_6645729_-	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|122aa|down_9|CP023278.1_6645861_6646227_-	NA
GCA_002896875.1_ASM289687v1	CP023278	Nostoc sp. CENA543 chromosome, complete genome	24	6636296-6638066	14,23,12,15	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no		PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	Orphan	GTTTCCGTCCCCTTGCGGGGAAAGGGTGGTGTTTAAC,GTTAAACACCACCCTTTCCCCGCAAGGGGACGGAAAC,GTTAAACACCACCCTTTCCCCGCAAGGGGACGGAAAC,GTTTCCGTCCCCTTGCGGGGAAAGGGTGGTGTTTAAC	37,37,37,37	0	0	NA	NA	NA:NA:NA:NA	21,22,22,21	22	Orphan	PD-DExK,c2c9_V-U4,DinG,Cas9_archaeal,cas14k,Cas14c_CAS-V-F,cas6,csa3,csx1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx3,WYL,RT,cas2,cas1,cas4,csc1gr5,csc2gr7,cas10d,cas3,c2c5_V-U5,cas8f,Cas14u_CAS-V,2OG_CAS,cas5,cas7,cas8b3	NA|64aa|up_8|CP023278.1_6623340_6623532_+,NA|138aa|up_7|CP023278.1_6623659_6624073_-,NA|220aa|down_2|CP023278.1_6641148_6641808_-,NA|122aa|down_5|CP023278.1_6645861_6646227_-,NA|71aa|down_6|CP023278.1_6646259_6646472_-,NA|75aa|down_7|CP023278.1_6646775_6647000_-,NA|143aa|down_8|CP023278.1_6647333_6647762_-	NA|843aa|up_9|CP023278.1_6620310_6622839_-	cd05805, MPG1_transferase, GTP-mannose-1-phosphate guanyltransferase (MPG1 transferase), also known as GDP-mannose pyrophosphorylase, is a bifunctional enzyme with both phosphomannose isomerase (PMI) activity and GDP-mannose phosphorylase (GMP) activity	NA|64aa|up_8|CP023278.1_6623340_6623532_+	NA	NA|138aa|up_7|CP023278.1_6623659_6624073_-	NA	NA|90aa|up_6|CP023278.1_6624451_6624721_-	PRK12864, PRK12864, YciI-like protein; Reviewed	NA|191aa|up_5|CP023278.1_6624793_6625366_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|811aa|up_4|CP023278.1_6625609_6628042_-	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|684aa|up_3|CP023278.1_6629929_6631981_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|83aa|up_2|CP023278.1_6632234_6632483_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|467aa|up_1|CP023278.1_6632931_6634332_-	cd07136, ALDH_YwdH-P39616, Bacillus subtilis aldehyde dehydrogenase ywdH-like	NA|367aa|up_0|CP023278.1_6634494_6635595_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|628aa|down_0|CP023278.1_6638261_6640145_-	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|226aa|down_1|CP023278.1_6640290_6640968_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|220aa|down_2|CP023278.1_6641148_6641808_-	NA	NA|466aa|down_3|CP023278.1_6642055_6643453_-	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|724aa|down_4|CP023278.1_6643557_6645729_-	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|122aa|down_5|CP023278.1_6645861_6646227_-	NA	NA|71aa|down_6|CP023278.1_6646259_6646472_-	NA	NA|75aa|down_7|CP023278.1_6646775_6647000_-	NA	NA|143aa|down_8|CP023278.1_6647333_6647762_-	NA	NA|186aa|down_9|CP023278.1_6647882_6648440_-	PRK05800, cobU, adenosylcobinamide kinase/adenosylcobinamide-phosphate guanylyltransferase; Validated
