assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	1	463747-463985	1,1	CRISPRCasFinder,PILER-CR	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	TAAGCAAACTGGAGTTTGAATAAGTCAAA,CATTTTGCTTGCACAAAACACTAAAATGAATACGCAAACT	29,40	0	0	NA	NA	NA:NA	1,2	2	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA,NA|75aa|down_1|NC_010628.1_464962_465187_-,NA|323aa|down_3|NC_010628.1_465520_466489_-,NA|141aa|down_9|NC_010628.1_474021_474444_+	NA|1219aa|up_9|NC_010628.1_443917_447574_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|1234aa|up_8|NC_010628.1_448135_451837_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|977aa|up_7|NC_010628.1_452587_455518_+	NF033203, entero_EhxA, enterohemolysin EhxA	NA|276aa|up_6|NC_010628.1_455678_456506_-	PRK10463, PRK10463, hydrogenase nickel incorporation protein HypB; Provisional	NA|114aa|up_5|NC_010628.1_456496_456838_-	pfam01155, HypA, Hydrogenase/urease nickel incorporation, metallochaperone, hypA	NA|352aa|up_4|NC_010628.1_456858_457914_-	TIGR02124, Hydrogenase_expression/formation_protein_HypE, hydrogenase expression/formation protein HypE	NA|393aa|up_3|NC_010628.1_458007_459186_-	PRK15062, PRK15062, hydrogenase isoenzymes formation protein HypD; Provisional	NA|87aa|up_2|NC_010628.1_459375_459636_-	pfam01455, HupF_HypC, HupF/HypC family	NA|782aa|up_1|NC_010628.1_459717_462063_-	COG0068, HypF, Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]	NA|396aa|up_0|NC_010628.1_462215_463403_-	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|282aa|down_0|NC_010628.1_464060_464906_-	pfam01106, NifU, NifU-like domain	NA|75aa|down_1|NC_010628.1_464962_465187_-	NA	NA|89aa|down_2|NC_010628.1_465195_465462_-	PRK00068, PRK00068, hypothetical protein; Validated	NA|323aa|down_3|NC_010628.1_465520_466489_-	NA	NA|321aa|down_4|NC_010628.1_467304_468267_+	COG1740, HyaA, Ni,Fe-hydrogenase I small subunit [Energy production and conversion]	NA|532aa|down_5|NC_010628.1_468459_470055_+	COG0374, HyaB, Ni,Fe-hydrogenase I large subunit [Energy production and conversion]	NA|898aa|down_6|NC_010628.1_470137_472831_-	COG3593, COG3593, Predicted ATP-dependent endonuclease of the OLD family [DNA replication, recombination, and repair]	NA|82aa|down_7|NC_010628.1_473166_473412_+	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|133aa|down_8|NC_010628.1_473408_473807_+	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|141aa|down_9|NC_010628.1_474021_474444_+	NA
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	2	509305-509417	2	CRISPRCasFinder	no	RT	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Unclear	ATATAGAGTAGCGTGTTGTAGAGTTGGGGCGAT	33	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|52aa|up_9|NC_010628.1_497077_497233_+,NA|193aa|up_4|NC_010628.1_502554_503133_+,NA|116aa|up_3|NC_010628.1_503258_503606_+,NA|101aa|up_0|NC_010628.1_508799_509102_+,NA	NA|52aa|up_9|NC_010628.1_497077_497233_+	NA	NA|396aa|up_8|NC_010628.1_497396_498584_+	NF033203, entero_EhxA, enterohemolysin EhxA	NA|295aa|up_7|NC_010628.1_498892_499777_+	pfam07335, Glyco_hydro_75, Fungal chitosanase of glycosyl hydrolase group 75	NA|319aa|up_6|NC_010628.1_500149_501106_-	pfam09150, Carot_N, Orange carotenoid protein, N-terminal	NA|222aa|up_5|NC_010628.1_501721_502387_-	pfam05857, TraX, TraX protein	NA|193aa|up_4|NC_010628.1_502554_503133_+	NA	NA|116aa|up_3|NC_010628.1_503258_503606_+	NA	RT|362aa|up_2|NC_010628.1_504298_505384_+	cd03487, RT_Bac_retron_II, RT_Bac_retron_II: Reverse transcriptases (RTs) in bacterial retrotransposons or retrons	NA|509aa|up_1|NC_010628.1_506288_507815_+	cd13438, SPFH_eoslipins_u2, Uncharacterized prokaryotic subgroup of the stomatin-like proteins (slipins) family; belonging to the SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|101aa|up_0|NC_010628.1_508799_509102_+	NA	NA|52aa|down_0|NC_010628.1_509598_509754_+	cd18745, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|268aa|down_1|NC_010628.1_510150_510954_+	pfam16289, DUF4935, Domain of unknown function (DUF4935)	NA|481aa|down_2|NC_010628.1_511302_512745_-	TIGR01282, Nitrogenase_molybdenum-iron_protein_alpha_chain, nitrogenase molybdenum-iron protein alpha chain	NA|298aa|down_3|NC_010628.1_512908_513802_-	PRK13236, PRK13236, nitrogenase reductase; Reviewed	NA|119aa|down_4|NC_010628.1_514150_514507_-	pfam01152, Bac_globin, Bacterial-like globin	NA|300aa|down_5|NC_010628.1_514731_515631_-	TIGR02000, Nitrogen_fixation_protein_NifU, Fe-S cluster assembly protein NifU	NA|403aa|down_6|NC_010628.1_515735_516944_-	TIGR03402, Cysteine_desulfurase_NifS, cysteine desulfurase NifS	NA|119aa|down_7|NC_010628.1_516978_517335_-	COG1149, COG1149, MinD superfamily P-loop ATPase containing an inserted ferredoxin domain [Energy production and conversion]	NA|484aa|down_8|NC_010628.1_517438_518890_-	TIGR01290, FeMo_cofactor_biosynthesis_protein_NifB, nitrogenase cofactor biosynthesis protein NifB	NA|533aa|down_9|NC_010628.1_519988_521587_-	COG2133, COG2133, Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	3	723990-724316	2,3,1	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	GTTGCAAAACACCTC-ATCCCTGATAGGG----------ATTCAAAC,GTTGCAAAACACCTCATCCCTGATAGGGATTCAA,ACCTCATCCCTGATAGGGATTCAAAC	47,34,26	0	0	NA	NA	NA:NA:NA	4,4,4	4	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|272aa|up_9|NC_010628.1_708865_709681_+,NA|101aa|up_8|NC_010628.1_709719_710022_+,NA|87aa|up_5|NC_010628.1_714604_714865_-,NA|261aa|up_3|NC_010628.1_717448_718231_-,NA|79aa|down_0|NC_010628.1_724446_724683_+,NA|169aa|down_6|NC_010628.1_732753_733260_+,NA|74aa|down_8|NC_010628.1_734518_734740_+	NA|272aa|up_9|NC_010628.1_708865_709681_+	NA	NA|101aa|up_8|NC_010628.1_709719_710022_+	NA	NA|692aa|up_7|NC_010628.1_711364_713440_+	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|219aa|up_6|NC_010628.1_713544_714201_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|87aa|up_5|NC_010628.1_714604_714865_-	NA	NA|692aa|up_4|NC_010628.1_715354_717430_+	sd00006, TPR, Tetratricopeptide repeat	NA|261aa|up_3|NC_010628.1_717448_718231_-	NA	NA|511aa|up_2|NC_010628.1_718362_719895_-	pfam07602, DUF1565, Protein of unknown function (DUF1565)	NA|767aa|up_1|NC_010628.1_720313_722614_+	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|292aa|up_0|NC_010628.1_722806_723682_-	pfam09992, NAGPA, Phosphodiester glycosidase	NA|79aa|down_0|NC_010628.1_724446_724683_+	NA	NA|723aa|down_1|NC_010628.1_724966_727135_-	PRK13557, PRK13557, histidine kinase; Provisional	NA|390aa|down_2|NC_010628.1_727417_728587_+	COG1028, FabG, Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) [Secondary metabolites biosynthesis, transport, and catabolism / General function prediction only]	NA|448aa|down_3|NC_010628.1_728895_730239_+	PRK01117, PRK01117, adenylosuccinate synthetase; Provisional	NA|200aa|down_4|NC_010628.1_730324_730924_+	PRK05618, PRK05618, 50S ribosomal protein L25/general stress protein Ctc; Reviewed	NA|510aa|down_5|NC_010628.1_731007_732537_+	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|169aa|down_6|NC_010628.1_732753_733260_+	NA	NA|264aa|down_7|NC_010628.1_733592_734384_+	cd06259, YdcF-like, YdcF-like	NA|74aa|down_8|NC_010628.1_734518_734740_+	NA	NA|74aa|down_9|NC_010628.1_734745_734967_-	pfam02941, FeThRed_A, Ferredoxin thioredoxin reductase variable alpha chain
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	4	1030435-1030529	4	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	CAATCAAATAATAACTTTCCCCA	23	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|257aa|up_4|NC_010628.1_1025185_1025956_-,NA|121aa|up_1|NC_010628.1_1029726_1030089_-,NA|92aa|up_0|NC_010628.1_1030133_1030409_+,NA|667aa|down_3|NC_010628.1_1033983_1035984_-,NA|94aa|down_6|NC_010628.1_1040712_1040994_-	NA|261aa|up_9|NC_010628.1_1019698_1020481_-	TIGR04500, PpiC_rel_mature, putative peptide maturation system protein	NA|282aa|up_8|NC_010628.1_1020881_1021727_-	COG1836, COG1836, Predicted membrane protein [Function unknown]	NA|144aa|up_7|NC_010628.1_1022335_1022767_+	COG3565, COG3565, Predicted dioxygenase of extradiol dioxygenase family [General function prediction only]	NA|211aa|up_6|NC_010628.1_1022826_1023459_-	COG1075, LipA, Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold [General function prediction only]	NA|155aa|up_5|NC_010628.1_1023860_1024325_+	PLN03237, PLN03237, DNA topoisomerase 2; Provisional	NA|257aa|up_4|NC_010628.1_1025185_1025956_-	NA	NA|574aa|up_3|NC_010628.1_1025952_1027674_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|464aa|up_2|NC_010628.1_1028332_1029724_-	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|121aa|up_1|NC_010628.1_1029726_1030089_-	NA	NA|92aa|up_0|NC_010628.1_1030133_1030409_+	NA	NA|150aa|down_0|NC_010628.1_1030537_1030987_-	COG2172, RsbW, Anti-sigma regulatory factor (Ser/Thr protein kinase) [Signal transduction mechanisms]	NA|469aa|down_1|NC_010628.1_1031378_1032785_-	TIGR00479, 23S_rRNA_uracil1939-C5-methyltransferase_RlmD, 23S rRNA (uracil-5-)-methyltransferase RumA	NA|162aa|down_2|NC_010628.1_1032959_1033445_+	cd12125, APC_alpha, Allophycocyanin alpha subunit of the phycobilisome core	NA|667aa|down_3|NC_010628.1_1033983_1035984_-	NA	NA|789aa|down_4|NC_010628.1_1036488_1038855_+	PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase subunit PurL	NA|500aa|down_5|NC_010628.1_1039028_1040528_+	PRK07349, PRK07349, amidophosphoribosyltransferase; Provisional	NA|94aa|down_6|NC_010628.1_1040712_1040994_-	NA	NA|349aa|down_7|NC_010628.1_1041045_1042092_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|173aa|down_8|NC_010628.1_1042165_1042684_-	PRK02304, PRK02304, adenine phosphoribosyltransferase; Provisional	NA|209aa|down_9|NC_010628.1_1043134_1043761_+	pfam11237, DUF3038, Protein of unknown function (DUF3038)
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	5	1228880-1229486	5,2,3	CRISPRCasFinder,CRT,PILER-CR	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	ATTGCAATTTATCAAAATCCCTATTAGGGATTGAAAC,ATTGCAATTTATCAAAATCCCTATTAGGGATTGAAAC,ATTGCAATTTATCAAAATCCCTATTAGGG----------ATTGAAAC	37,37,47	0	0	NA	NA	I-D,II-B:I-D,II-B	8,8,5	8	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|108aa|up_1|NC_010628.1_1228189_1228513_+,NA|61aa|up_0|NC_010628.1_1228673_1228856_+,NA|85aa|down_1|NC_010628.1_1230541_1230796_+,NA|147aa|down_2|NC_010628.1_1233271_1233712_-,NA|165aa|down_6|NC_010628.1_1237703_1238198_-	NA|52aa|up_9|NC_010628.1_1216243_1216399_-	pfam11103, DUF2887, Protein of unknown function (DUF2887)	NA|762aa|up_8|NC_010628.1_1216512_1218798_+	cd07389, MPP_PhoD, Bacillus subtilis PhoD and related proteins, metallophosphatase domain	NA|353aa|up_7|NC_010628.1_1219360_1220419_-	PRK12755, PRK12755, phospho-2-dehydro-3-deoxyheptonate aldolase; Provisional	NA|553aa|up_6|NC_010628.1_1220618_1222277_-	cd11350, AmyAc_4, Alpha amylase catalytic domain found in an uncharacterized protein family	NA|126aa|up_5|NC_010628.1_1222644_1223022_-	pfam08853, DUF1823, Domain of unknown function (DUF1823)	NA|298aa|up_4|NC_010628.1_1223140_1224034_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|101aa|up_3|NC_010628.1_1224243_1224546_+	PRK14423, PRK14423, acylphosphatase; Provisional	NA|60aa|up_2|NC_010628.1_1227717_1227897_+	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|108aa|up_1|NC_010628.1_1228189_1228513_+	NA	NA|61aa|up_0|NC_010628.1_1228673_1228856_+	NA	NA|298aa|down_0|NC_010628.1_1229539_1230433_-	PRK13236, PRK13236, nitrogenase reductase; Reviewed	NA|85aa|down_1|NC_010628.1_1230541_1230796_+	NA	NA|147aa|down_2|NC_010628.1_1233271_1233712_-	NA	NA|286aa|down_3|NC_010628.1_1233998_1234856_-	COG1801, COG1801, Uncharacterized conserved protein [Function unknown]	NA|202aa|down_4|NC_010628.1_1235497_1236103_-	pfam14273, DUF4360, Domain of unknown function (DUF4360)	NA|224aa|down_5|NC_010628.1_1236949_1237621_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|165aa|down_6|NC_010628.1_1237703_1238198_-	NA	NA|291aa|down_7|NC_010628.1_1239072_1239945_+	PRK13398, PRK13398, 3-deoxy-7-phosphoheptulonate synthase; Provisional	NA|474aa|down_8|NC_010628.1_1240307_1241729_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|1116aa|down_9|NC_010628.1_1241852_1245200_+	PRK10614, PRK10614, multidrug efflux system subunit MdtC; Provisional
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	6	1322778-1322866	6	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	CAACGAGAAGCATTGCCAGAGGA	23	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|186aa|up_7|NC_010628.1_1313938_1314496_-,NA|80aa|up_4|NC_010628.1_1318684_1318924_-,NA|49aa|up_0|NC_010628.1_1322312_1322459_+,NA|116aa|down_8|NC_010628.1_1333069_1333417_+,NA|128aa|down_9|NC_010628.1_1333693_1334077_+	NA|166aa|up_9|NC_010628.1_1311524_1312022_-	cd14741, PAAR_5, proline-alanine-alanine-arginine (PAAR) domain	NA|578aa|up_8|NC_010628.1_1312124_1313858_-	TIGR01646, conserved_hypothetical_protein, Rhs element Vgr protein	NA|186aa|up_7|NC_010628.1_1313938_1314496_-	NA	NA|1201aa|up_6|NC_010628.1_1314534_1318137_-	PRK10811, rne, ribonuclease E; Reviewed	NA|175aa|up_5|NC_010628.1_1318160_1318685_-	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19	NA|80aa|up_4|NC_010628.1_1318684_1318924_-	NA	NA|675aa|up_3|NC_010628.1_1319370_1321395_+	pfam13699, DUF4157, Domain of unknown function (DUF4157)	NA|112aa|up_2|NC_010628.1_1321541_1321877_-	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|139aa|up_1|NC_010628.1_1321864_1322281_-	pfam08814, XisH, XisH protein	NA|49aa|up_0|NC_010628.1_1322312_1322459_+	NA	NA|215aa|down_0|NC_010628.1_1324113_1324758_-	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|715aa|down_1|NC_010628.1_1324729_1326874_-	pfam12849, PBP_like_2, PBP superfamily domain	NA|227aa|down_2|NC_010628.1_1326917_1327598_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|374aa|down_3|NC_010628.1_1327652_1328774_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|280aa|down_4|NC_010628.1_1329271_1330111_+	pfam14065, DUF4255, Protein of unknown function (DUF4255)	NA|554aa|down_5|NC_010628.1_1330141_1331803_+	COG3497, COG3497, Phage tail sheath protein FI [General function prediction only]	NA|152aa|down_6|NC_010628.1_1331940_1332396_+	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19	NA|186aa|down_7|NC_010628.1_1332500_1333058_+	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19	NA|116aa|down_8|NC_010628.1_1333069_1333417_+	NA	NA|128aa|down_9|NC_010628.1_1333693_1334077_+	NA
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	7	1601537-1605868	3,4,7	CRT,PILER-CR,CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	GTTTCAATCCCTAATAGGGATTTTGAGAAATTGCAAT,GTTTCAATCCCTAATAGGGATTTTGAGAAATTGCAAT,GTTTCAATCCCTAATAGGGATTTTGAGAAATTGCAAT	37,37,37	0	0	NA	NA	N:A	59,58,58	59	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|47aa|up_9|NC_010628.1_1588519_1588660_+,NA|119aa|up_6|NC_010628.1_1592194_1592551_-,NA|102aa|up_5|NC_010628.1_1592900_1593206_-,NA|184aa|up_1|NC_010628.1_1598888_1599440_+,NA|71aa|down_3|NC_010628.1_1612170_1612383_+,NA|68aa|down_4|NC_010628.1_1612424_1612628_+	NA|47aa|up_9|NC_010628.1_1588519_1588660_+	NA	NA|972aa|up_8|NC_010628.1_1588836_1591752_+	PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated	NA|116aa|up_7|NC_010628.1_1591748_1592096_+	pfam07883, Cupin_2, Cupin domain	NA|119aa|up_6|NC_010628.1_1592194_1592551_-	NA	NA|102aa|up_5|NC_010628.1_1592900_1593206_-	NA	NA|743aa|up_4|NC_010628.1_1593476_1595705_-	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|270aa|up_3|NC_010628.1_1595710_1596520_-	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|427aa|up_2|NC_010628.1_1596787_1598068_+	PRK02427, PRK02427, 3-phosphoshikimate 1-carboxyvinyltransferase; Provisional	NA|184aa|up_1|NC_010628.1_1598888_1599440_+	NA	NA|526aa|up_0|NC_010628.1_1599665_1601243_+	COG3540, PhoD, Phosphodiesterase/alkaline phosphatase D [Inorganic ion transport and metabolism]	NA|277aa|down_0|NC_010628.1_1606357_1607188_+	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|314aa|down_1|NC_010628.1_1607198_1608140_-	COG3509, LpqC, Poly(3-hydroxybutyrate) depolymerase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|1039aa|down_2|NC_010628.1_1608426_1611543_-	COG3641, PfoR, Predicted membrane protein, putative toxin regulator [General function prediction only]	NA|71aa|down_3|NC_010628.1_1612170_1612383_+	NA	NA|68aa|down_4|NC_010628.1_1612424_1612628_+	NA	NA|446aa|down_5|NC_010628.1_1612716_1614054_-	cd01116, P_permease, Permease P (pink-eyed dilution)	NA|206aa|down_6|NC_010628.1_1614166_1614784_-	pfam00582, Usp, Universal stress protein family	NA|397aa|down_7|NC_010628.1_1618183_1619374_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|229aa|down_8|NC_010628.1_1619432_1620119_-	TIGR02191, Ribonuclease_3, ribonuclease III, bacterial	NA|266aa|down_9|NC_010628.1_1620259_1621057_-	pfam01936, NYN, NYN domain
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	8	2043772-2043887	8	CRISPRCasFinder	no	csa3	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Type I-A	GCTAACCCCGCGACGCTACGACTTACGGGTAAA	33	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|180aa|up_7|NC_010628.1_2039694_2040234_-,NA|64aa|up_5|NC_010628.1_2041762_2041954_-,NA|121aa|up_4|NC_010628.1_2041950_2042313_-,NA|58aa|up_3|NC_010628.1_2042421_2042595_-,NA|77aa|up_2|NC_010628.1_2042587_2042818_-,NA|123aa|up_1|NC_010628.1_2042891_2043260_-,NA|116aa|up_0|NC_010628.1_2043343_2043691_-,NA|138aa|down_0|NC_010628.1_2044127_2044541_+,NA|99aa|down_2|NC_010628.1_2045096_2045393_+,NA|114aa|down_4|NC_010628.1_2048665_2049007_+,NA|123aa|down_5|NC_010628.1_2049006_2049375_+,NA|205aa|down_6|NC_010628.1_2049337_2049952_+,NA|188aa|down_7|NC_010628.1_2049968_2050532_+,NA|111aa|down_8|NC_010628.1_2050533_2050866_+	NA|741aa|up_9|NC_010628.1_2036229_2038452_-	cd02767, MopB_ydeP, The MopB_ydeP CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative molybdopterin cofactor binding site	NA|395aa|up_8|NC_010628.1_2038523_2039708_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|180aa|up_7|NC_010628.1_2039694_2040234_-	NA	NA|417aa|up_6|NC_010628.1_2040519_2041770_-	pfam01935, DUF87, Domain of unknown function DUF87	NA|64aa|up_5|NC_010628.1_2041762_2041954_-	NA	NA|121aa|up_4|NC_010628.1_2041950_2042313_-	NA	NA|58aa|up_3|NC_010628.1_2042421_2042595_-	NA	NA|77aa|up_2|NC_010628.1_2042587_2042818_-	NA	NA|123aa|up_1|NC_010628.1_2042891_2043260_-	NA	NA|116aa|up_0|NC_010628.1_2043343_2043691_-	NA	NA|138aa|down_0|NC_010628.1_2044127_2044541_+	NA	NA|123aa|down_1|NC_010628.1_2044626_2044995_-	pfam08872, KGK, KGK domain	NA|99aa|down_2|NC_010628.1_2045096_2045393_+	NA	NA|912aa|down_3|NC_010628.1_2045553_2048289_+	COG5545, COG5545, Predicted P-loop ATPase and inactivated derivatives [General function prediction only]	NA|114aa|down_4|NC_010628.1_2048665_2049007_+	NA	NA|123aa|down_5|NC_010628.1_2049006_2049375_+	NA	NA|205aa|down_6|NC_010628.1_2049337_2049952_+	NA	NA|188aa|down_7|NC_010628.1_2049968_2050532_+	NA	NA|111aa|down_8|NC_010628.1_2050533_2050866_+	NA	NA|687aa|down_9|NC_010628.1_2051726_2053787_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	9	2047473-2047607	9	CRISPRCasFinder	no	csa3	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Type I-A	AGAAAACCAAAAAAGTTTGGGTAACAAGGCGGATCAAGGCAGATCACC	48	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|64aa|up_8|NC_010628.1_2041762_2041954_-,NA|121aa|up_7|NC_010628.1_2041950_2042313_-,NA|58aa|up_6|NC_010628.1_2042421_2042595_-,NA|77aa|up_5|NC_010628.1_2042587_2042818_-,NA|123aa|up_4|NC_010628.1_2042891_2043260_-,NA|116aa|up_3|NC_010628.1_2043343_2043691_-,NA|138aa|up_2|NC_010628.1_2044127_2044541_+,NA|99aa|up_0|NC_010628.1_2045096_2045393_+,NA|114aa|down_0|NC_010628.1_2048665_2049007_+,NA|123aa|down_1|NC_010628.1_2049006_2049375_+,NA|205aa|down_2|NC_010628.1_2049337_2049952_+,NA|188aa|down_3|NC_010628.1_2049968_2050532_+,NA|111aa|down_4|NC_010628.1_2050533_2050866_+,NA|294aa|down_6|NC_010628.1_2053783_2054665_+,NA|215aa|down_7|NC_010628.1_2054666_2055311_+,NA|119aa|down_8|NC_010628.1_2055508_2055865_+,NA|196aa|down_9|NC_010628.1_2055915_2056503_+	NA|417aa|up_9|NC_010628.1_2040519_2041770_-	pfam01935, DUF87, Domain of unknown function DUF87	NA|64aa|up_8|NC_010628.1_2041762_2041954_-	NA	NA|121aa|up_7|NC_010628.1_2041950_2042313_-	NA	NA|58aa|up_6|NC_010628.1_2042421_2042595_-	NA	NA|77aa|up_5|NC_010628.1_2042587_2042818_-	NA	NA|123aa|up_4|NC_010628.1_2042891_2043260_-	NA	NA|116aa|up_3|NC_010628.1_2043343_2043691_-	NA	NA|138aa|up_2|NC_010628.1_2044127_2044541_+	NA	NA|123aa|up_1|NC_010628.1_2044626_2044995_-	pfam08872, KGK, KGK domain	NA|99aa|up_0|NC_010628.1_2045096_2045393_+	NA	NA|114aa|down_0|NC_010628.1_2048665_2049007_+	NA	NA|123aa|down_1|NC_010628.1_2049006_2049375_+	NA	NA|205aa|down_2|NC_010628.1_2049337_2049952_+	NA	NA|188aa|down_3|NC_010628.1_2049968_2050532_+	NA	NA|111aa|down_4|NC_010628.1_2050533_2050866_+	NA	NA|687aa|down_5|NC_010628.1_2051726_2053787_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|294aa|down_6|NC_010628.1_2053783_2054665_+	NA	NA|215aa|down_7|NC_010628.1_2054666_2055311_+	NA	NA|119aa|down_8|NC_010628.1_2055508_2055865_+	NA	NA|196aa|down_9|NC_010628.1_2055915_2056503_+	NA
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	10	2051259-2051386	10	CRISPRCasFinder	no	csa3	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Type I-A	GTTTTAGAAGCTGAAATTGTGGGAATTGGGGCGACTGCTGGAAG	44	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|116aa|up_9|NC_010628.1_2043343_2043691_-,NA|138aa|up_8|NC_010628.1_2044127_2044541_+,NA|99aa|up_6|NC_010628.1_2045096_2045393_+,NA|114aa|up_4|NC_010628.1_2048665_2049007_+,NA|123aa|up_3|NC_010628.1_2049006_2049375_+,NA|205aa|up_2|NC_010628.1_2049337_2049952_+,NA|188aa|up_1|NC_010628.1_2049968_2050532_+,NA|111aa|up_0|NC_010628.1_2050533_2050866_+,NA|294aa|down_1|NC_010628.1_2053783_2054665_+,NA|215aa|down_2|NC_010628.1_2054666_2055311_+,NA|119aa|down_3|NC_010628.1_2055508_2055865_+,NA|196aa|down_4|NC_010628.1_2055915_2056503_+,NA|84aa|down_5|NC_010628.1_2056483_2056735_+,NA|111aa|down_6|NC_010628.1_2056734_2057067_+,NA|110aa|down_7|NC_010628.1_2057056_2057386_+,NA|104aa|down_8|NC_010628.1_2057462_2057774_-	NA|116aa|up_9|NC_010628.1_2043343_2043691_-	NA	NA|138aa|up_8|NC_010628.1_2044127_2044541_+	NA	NA|123aa|up_7|NC_010628.1_2044626_2044995_-	pfam08872, KGK, KGK domain	NA|99aa|up_6|NC_010628.1_2045096_2045393_+	NA	NA|912aa|up_5|NC_010628.1_2045553_2048289_+	COG5545, COG5545, Predicted P-loop ATPase and inactivated derivatives [General function prediction only]	NA|114aa|up_4|NC_010628.1_2048665_2049007_+	NA	NA|123aa|up_3|NC_010628.1_2049006_2049375_+	NA	NA|205aa|up_2|NC_010628.1_2049337_2049952_+	NA	NA|188aa|up_1|NC_010628.1_2049968_2050532_+	NA	NA|111aa|up_0|NC_010628.1_2050533_2050866_+	NA	NA|687aa|down_0|NC_010628.1_2051726_2053787_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|294aa|down_1|NC_010628.1_2053783_2054665_+	NA	NA|215aa|down_2|NC_010628.1_2054666_2055311_+	NA	NA|119aa|down_3|NC_010628.1_2055508_2055865_+	NA	NA|196aa|down_4|NC_010628.1_2055915_2056503_+	NA	NA|84aa|down_5|NC_010628.1_2056483_2056735_+	NA	NA|111aa|down_6|NC_010628.1_2056734_2057067_+	NA	NA|110aa|down_7|NC_010628.1_2057056_2057386_+	NA	NA|104aa|down_8|NC_010628.1_2057462_2057774_-	NA	NA|155aa|down_9|NC_010628.1_2058097_2058562_-	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	11	2595761-2595891	11	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	GCAATTGCTCAAATCGACACCTCGCCAAAAATTTTTT	37	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|73aa|up_9|NC_010628.1_2586153_2586372_-,NA|57aa|up_7|NC_010628.1_2590621_2590792_-,NA|51aa|up_5|NC_010628.1_2592699_2592852_+,NA	NA|73aa|up_9|NC_010628.1_2586153_2586372_-	NA	NA|715aa|up_8|NC_010628.1_2586633_2588778_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|57aa|up_7|NC_010628.1_2590621_2590792_-	NA	NA|559aa|up_6|NC_010628.1_2590887_2592564_+	COG3961, COG3961, Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes [Carbohydrate transport and metabolism / Coenzyme metabolism / General function prediction only]	NA|51aa|up_5|NC_010628.1_2592699_2592852_+	NA	NA|136aa|up_4|NC_010628.1_2592929_2593337_-	COG3607, COG3607, Predicted lactoylglutathione lyase [General function prediction only]	NA|137aa|up_3|NC_010628.1_2593414_2593825_-	cd06588, PhnB_like, Escherichia coli PhnB and similar proteins	NA|148aa|up_2|NC_010628.1_2593966_2594410_-	COG4319, COG4319, Ketosteroid isomerase homolog [Function unknown]	NA|264aa|up_1|NC_010628.1_2594485_2595277_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|119aa|up_0|NC_010628.1_2595362_2595719_-	COG3795, COG3795, Uncharacterized protein conserved in bacteria [Function unknown]	NA|418aa|down_0|NC_010628.1_2595902_2597156_+	COG4941, COG4941, Predicted RNA polymerase sigma factor containing a TPR repeat domain [Transcription]	NA|640aa|down_1|NC_010628.1_2601006_2602926_+	cd07122, ALDH_F20_ACDH, Coenzyme A acylating aldehyde dehydrogenase (ACDH), ALDH family 20-like	NA|981aa|down_2|NC_010628.1_2602930_2605873_+	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|387aa|down_3|NC_010628.1_2605859_2607020_+	TIGR01185, membrane_spanning_subunit, DevC protein	NA|395aa|down_4|NC_010628.1_2607022_2608207_+	TIGR01185, membrane_spanning_subunit, DevC protein	NA|441aa|down_5|NC_010628.1_2608203_2609526_+	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|574aa|down_6|NC_010628.1_2609527_2611249_+	cd05931, FAAL, Fatty acyl-AMP ligase (FAAL)	NA|87aa|down_7|NC_010628.1_2611307_2611568_+	smart00823, PKS_PP, Phosphopantetheine attachment site	NA|107aa|down_8|NC_010628.1_2612746_2613067_-	pfam16261, DUF4915, Domain of unknown function (DUF4915)	NA|1085aa|down_9|NC_010628.1_2614171_2617426_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	12	2679763-2679850	12	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	ACTTGTACTGAGCGGAGTCGAAGTA	25	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|63aa|up_0|NC_010628.1_2665674_2665863_+,NA|266aa|down_3|NC_010628.1_2703493_2704291_+	NA|397aa|up_9|NC_010628.1_2646737_2647928_-	TIGR02971, devB-like_secretion_protein, ABC exporter membrane fusion protein, DevB family	NA|352aa|up_8|NC_010628.1_2648281_2649337_-	cd03506, Delta6-FADS-like, The Delta6 Fatty Acid Desaturase (Delta6-FADS)-like CD includes the integral-membrane enzymes: delta-4, delta-5, delta-6, delta-8, delta-8-sphingolipid, and delta-11 desaturases found in vertebrates, higher plants, fungi, and bacteria	NA|346aa|up_7|NC_010628.1_2649563_2650601_-	cd03506, Delta6-FADS-like, The Delta6 Fatty Acid Desaturase (Delta6-FADS)-like CD includes the integral-membrane enzymes: delta-4, delta-5, delta-6, delta-8, delta-8-sphingolipid, and delta-11 desaturases found in vertebrates, higher plants, fungi, and bacteria	NA|497aa|up_6|NC_010628.1_2650669_2652160_-	PRK08063, PRK08063, enoyl-[acyl-carrier-protein] reductase FabL	NA|301aa|up_5|NC_010628.1_2652290_2653193_-	COG2091, Sfp, Phosphopantetheinyl transferase [Coenzyme metabolism]	NA|2231aa|up_4|NC_010628.1_2653240_2659933_-	cd00833, PKS, polyketide synthases (PKSs) polymerize simple fatty acids into a large variety of different products, called polyketides, by successive decarboxylating Claisen condensations	NA|96aa|up_3|NC_010628.1_2660768_2661056_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|472aa|up_2|NC_010628.1_2661347_2662763_-	pfam14516, AAA_35, AAA-like domain	NA|387aa|up_1|NC_010628.1_2663342_2664503_-	pfam14516, AAA_35, AAA-like domain	NA|63aa|up_0|NC_010628.1_2665674_2665863_+	NA	NA|1260aa|down_0|NC_010628.1_2680367_2684147_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|3884aa|down_1|NC_010628.1_2684198_2695850_+	PRK12467, PRK12467, peptide synthase; Provisional	NA|2536aa|down_2|NC_010628.1_2695846_2703454_+	PRK12467, PRK12467, peptide synthase; Provisional	NA|266aa|down_3|NC_010628.1_2703493_2704291_+	NA	NA|369aa|down_4|NC_010628.1_2704339_2705446_+	cd08231, MDR_TM0436_like, Hypothetical enzyme TM0436 resembles the zinc-dependent alcohol dehydrogenases (ADH)	NA|274aa|down_5|NC_010628.1_2705485_2706307_+	PRK11880, PRK11880, pyrroline-5-carboxylate reductase; Reviewed	NA|664aa|down_6|NC_010628.1_2706409_2708401_+	COG4178, COG4178, ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only]	NA|45aa|down_7|NC_010628.1_2708889_2709024_+	pfam12559, Inhibitor_I10, Serine endopeptidase inhibitors	NA|331aa|down_8|NC_010628.1_2709312_2710305_+	TIGR04185, RimK-like_ATP-grasp_domain_protein, ATP-grasp ribosomal peptide maturase, MvdC family	NA|326aa|down_9|NC_010628.1_2710379_2711357_+	TIGR04184, hypothetical_protein_HMPREF0204_12500, ATP-grasp ribosomal peptide maturase, MvdD family
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	13	3103015-3103126	13	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	TTTAAGGGGGGTTAGGGGGGATCTCCAATGACTATGCAC	39	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|109aa|up_9|NC_010628.1_3088834_3089161_-,NA|114aa|up_5|NC_010628.1_3093407_3093749_-,NA|117aa|up_4|NC_010628.1_3094421_3094772_+,NA|294aa|up_2|NC_010628.1_3095970_3096852_-,NA|209aa|down_6|NC_010628.1_3111725_3112352_+,NA|133aa|down_8|NC_010628.1_3113837_3114236_-	NA|109aa|up_9|NC_010628.1_3088834_3089161_-	NA	NA|285aa|up_8|NC_010628.1_3089458_3090313_+	COG0338, Dam, Site-specific DNA methylase [DNA replication, recombination, and repair]	NA|551aa|up_7|NC_010628.1_3090538_3092191_+	cd08519, PBP2_NikA_DppA_OppA_like_20, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|342aa|up_6|NC_010628.1_3092260_3093286_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|114aa|up_5|NC_010628.1_3093407_3093749_-	NA	NA|117aa|up_4|NC_010628.1_3094421_3094772_+	NA	NA|372aa|up_3|NC_010628.1_3094718_3095834_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|294aa|up_2|NC_010628.1_3095970_3096852_-	NA	NA|486aa|up_1|NC_010628.1_3097419_3098877_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|1061aa|up_0|NC_010628.1_3098983_3102166_+	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|129aa|down_0|NC_010628.1_3103171_3103558_+	cd01038, Endonuclease_DUF559, Domain of unknown function, appears to be related to a diverse group of endonucleases	NA|1101aa|down_1|NC_010628.1_3103999_3107302_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|531aa|down_2|NC_010628.1_3107620_3109213_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|501aa|down_3|NC_010628.1_3109505_3111008_-	PRK00139, murE, UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase; Provisional	NA|92aa|down_4|NC_010628.1_3111061_3111337_-	pfam05768, DUF836, Glutaredoxin-like domain (DUF836)	NA|63aa|down_5|NC_010628.1_3111434_3111623_+	pfam13318, DUF4089, Protein of unknown function (DUF4089)	NA|209aa|down_6|NC_010628.1_3111725_3112352_+	NA	NA|469aa|down_7|NC_010628.1_3112362_3113769_+	PRK09201, PRK09201, AtzE family amidohydrolase	NA|133aa|down_8|NC_010628.1_3113837_3114236_-	NA	NA|169aa|down_9|NC_010628.1_3115284_3115791_-	COG3153, COG3153, Predicted acetyltransferase [General function prediction only]
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	14	3128838-3128938	14	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	AAAAGCATGAAAAAACCCGGCTTAATCGGG	30	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA,NA|125aa|down_0|NC_010628.1_3128972_3129347_-	NA|170aa|up_9|NC_010628.1_3115790_3116300_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|338aa|up_8|NC_010628.1_3119428_3120442_+	cd08276, MDR7, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|214aa|up_7|NC_010628.1_3120670_3121312_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|346aa|up_6|NC_010628.1_3121334_3122372_-	cd19094, AKR_Tas-like, Escherichia coli Tas protein and similar proteins	NA|246aa|up_5|NC_010628.1_3123310_3124048_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|249aa|up_4|NC_010628.1_3124235_3124982_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|244aa|up_3|NC_010628.1_3125062_3125794_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|137aa|up_2|NC_010628.1_3125950_3126361_+	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains	NA|224aa|up_1|NC_010628.1_3126665_3127337_+	TIGR02869, Spore_cortex-lytic_enzyme, spore cortex-lytic enzyme	NA|376aa|up_0|NC_010628.1_3127593_3128721_-	TIGR00236, UDP-N-acetylglucosamine_2-epimerase, UDP-N-acetylglucosamine 2-epimerase	NA|125aa|down_0|NC_010628.1_3128972_3129347_-	NA	NA|425aa|down_1|NC_010628.1_3130076_3131351_+	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|283aa|down_2|NC_010628.1_3131566_3132415_+	cd02978, KaiB_like, KaiB-like family; composed of the circadian clock proteins, KaiB and the N-terminal KaiB-like sensory domain of SasA	NA|208aa|down_3|NC_010628.1_3132632_3133256_-	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|389aa|down_4|NC_010628.1_3133350_3134517_+	PLN02449, PLN02449, ferrochelatase	NA|180aa|down_5|NC_010628.1_3134718_3135258_-	pfam13548, DUF4126, Domain of unknown function (DUF4126)	NA|432aa|down_6|NC_010628.1_3135718_3137014_-	PRK07380, PRK07380, adenylosuccinate lyase; Provisional	NA|184aa|down_7|NC_010628.1_3137054_3137606_-	COG2323, COG2323, Predicted membrane protein [Function unknown]	NA|339aa|down_8|NC_010628.1_3137763_3138780_-	COG1295, Rbn, Ribonuclease BN family enzyme [Replication, recombination, and repair]	NA|510aa|down_9|NC_010628.1_3138902_3140432_-	COG0606, COG0606, Predicted ATPase with chaperone activity [Posttranslational modification, protein turnover, chaperones]
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	15	3338172-3341197	15,4,5	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Type I-D	GTTTCAATCCCTGATAGGGATTTTGATGAATTGCAAT,GTTTCAATCCCTGATAGGGATTTTGATGAATTGCAAT,GTTTCAATCCCTGATAGGGATTTTGATGAATTGCAAT	37,37,37	0	0	NA	NA	N:A	41,41,40	41	TypeI-D	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|149aa|up_4|NC_010628.1_3324861_3325308_+,NA	NA|1868aa|up_9|NC_010628.1_3306421_3312025_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|779aa|up_8|NC_010628.1_3312485_3314822_-	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]	NA|326aa|up_7|NC_010628.1_3315810_3316788_-	cd09763, DHRS1-like_SDR_c, human dehydrogenase/reductase (SDR family) member 1 (DHRS1) -like, classical (c) SDRs	NA|314aa|up_6|NC_010628.1_3317052_3317994_-	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|1245aa|up_5|NC_010628.1_3318846_3322581_+	PLN03241, PLN03241, magnesium chelatase subunit H; Provisional	NA|149aa|up_4|NC_010628.1_3324861_3325308_+	NA	NA|154aa|up_3|NC_010628.1_3325743_3326205_+	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|385aa|up_2|NC_010628.1_3326167_3327322_+	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|933aa|up_1|NC_010628.1_3327330_3330129_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|2199aa|up_0|NC_010628.1_3331544_3338141_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|437aa|down_0|NC_010628.1_3341371_3342682_+	COG5659, COG5659, FOG: Transposase [DNA replication, recombination, and repair]	cas2|96aa|down_1|NC_010628.1_3342858_3343146_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_2|NC_010628.1_3343185_3344190_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|198aa|down_3|NC_010628.1_3344339_3344933_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas6|282aa|down_4|NC_010628.1_3344964_3345810_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	2OG_CAS|207aa|down_5|NC_010628.1_3345799_3346420_-	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	cas3|760aa|down_6|NC_010628.1_3346496_3348776_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	csc1gr5|253aa|down_7|NC_010628.1_3348768_3349527_-	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	csc2gr7|343aa|down_8|NC_010628.1_3349529_3350558_-	pfam18320, Csc2, Csc2 Crispr	cas10d|914aa|down_9|NC_010628.1_3350575_3353317_-	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	16	3440637-3440818	16	CRISPRCasFinder	no	Cas9_archaeal	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Type II-A, Type II-B, or Type II-C?	CCCGACAGGATTTGAACCTGCAAAA	25	0	0	NA	NA	N:A	2	2	TypeII-A,TypeII-B,orTypeII-C?	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|182aa|up_9|NC_010628.1_3429376_3429922_+,NA|64aa|up_8|NC_010628.1_3430009_3430201_+,NA|67aa|up_7|NC_010628.1_3430216_3430417_+,NA|106aa|up_5|NC_010628.1_3432456_3432774_-,NA|282aa|up_4|NC_010628.1_3433211_3434057_+,NA|65aa|down_0|NC_010628.1_3442419_3442614_-,NA|130aa|down_2|NC_010628.1_3443686_3444076_-,NA|84aa|down_4|NC_010628.1_3444733_3444985_-,NA|112aa|down_5|NC_010628.1_3445056_3445392_+	NA|182aa|up_9|NC_010628.1_3429376_3429922_+	NA	NA|64aa|up_8|NC_010628.1_3430009_3430201_+	NA	NA|67aa|up_7|NC_010628.1_3430216_3430417_+	NA	NA|576aa|up_6|NC_010628.1_3430499_3432227_-	PRK05945, sdhA, succinate dehydrogenase/fumarate reductase flavoprotein subunit	NA|106aa|up_5|NC_010628.1_3432456_3432774_-	NA	NA|282aa|up_4|NC_010628.1_3433211_3434057_+	NA	NA|441aa|up_3|NC_010628.1_3434120_3435443_-	cd06346, PBP1_ABC_ligand_binding-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|311aa|up_2|NC_010628.1_3435880_3436813_-	PLN02632, PLN02632, phytoene synthase	NA|480aa|up_1|NC_010628.1_3436796_3438236_-	TIGR02731, Phytoene_dehydrogenase_chloroplastic/chromoplastic, phytoene desaturase	NA|363aa|up_0|NC_010628.1_3438596_3439685_+	COG1472, BglX, Beta-glucosidase-related glycosidases [Carbohydrate transport and metabolism]	NA|65aa|down_0|NC_010628.1_3442419_3442614_-	NA	NA|282aa|down_1|NC_010628.1_3442809_3443655_+	cd10917, CE4_NodB_like_6s_7s, Catalytic NodB homology domain of rhizobial NodB-like proteins	NA|130aa|down_2|NC_010628.1_3443686_3444076_-	NA	Cas9_archaeal|169aa|down_3|NC_010628.1_3444092_3444599_-	COG1403, McrA, Restriction endonuclease [Defense mechanisms]	NA|84aa|down_4|NC_010628.1_3444733_3444985_-	NA	NA|112aa|down_5|NC_010628.1_3445056_3445392_+	NA	NA|674aa|down_6|NC_010628.1_3446280_3448302_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|203aa|down_7|NC_010628.1_3448310_3448919_+	cd08866, SRPBCC_11, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|640aa|down_8|NC_010628.1_3449399_3451319_+	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|298aa|down_9|NC_010628.1_3451470_3452364_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	17	3441037-3441137	17	CRISPRCasFinder	no	Cas9_archaeal	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Type II-A, Type II-B, or Type II-C?	CCCGACAGGATTTGAACCTGCAAAA	25	0	0	NA	NA	N:A	1	1	TypeII-A,TypeII-B,orTypeII-C?	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|182aa|up_9|NC_010628.1_3429376_3429922_+,NA|64aa|up_8|NC_010628.1_3430009_3430201_+,NA|67aa|up_7|NC_010628.1_3430216_3430417_+,NA|106aa|up_5|NC_010628.1_3432456_3432774_-,NA|282aa|up_4|NC_010628.1_3433211_3434057_+,NA|65aa|down_0|NC_010628.1_3442419_3442614_-,NA|130aa|down_2|NC_010628.1_3443686_3444076_-,NA|84aa|down_4|NC_010628.1_3444733_3444985_-,NA|112aa|down_5|NC_010628.1_3445056_3445392_+	NA|182aa|up_9|NC_010628.1_3429376_3429922_+	NA	NA|64aa|up_8|NC_010628.1_3430009_3430201_+	NA	NA|67aa|up_7|NC_010628.1_3430216_3430417_+	NA	NA|576aa|up_6|NC_010628.1_3430499_3432227_-	PRK05945, sdhA, succinate dehydrogenase/fumarate reductase flavoprotein subunit	NA|106aa|up_5|NC_010628.1_3432456_3432774_-	NA	NA|282aa|up_4|NC_010628.1_3433211_3434057_+	NA	NA|441aa|up_3|NC_010628.1_3434120_3435443_-	cd06346, PBP1_ABC_ligand_binding-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|311aa|up_2|NC_010628.1_3435880_3436813_-	PLN02632, PLN02632, phytoene synthase	NA|480aa|up_1|NC_010628.1_3436796_3438236_-	TIGR02731, Phytoene_dehydrogenase_chloroplastic/chromoplastic, phytoene desaturase	NA|363aa|up_0|NC_010628.1_3438596_3439685_+	COG1472, BglX, Beta-glucosidase-related glycosidases [Carbohydrate transport and metabolism]	NA|65aa|down_0|NC_010628.1_3442419_3442614_-	NA	NA|282aa|down_1|NC_010628.1_3442809_3443655_+	cd10917, CE4_NodB_like_6s_7s, Catalytic NodB homology domain of rhizobial NodB-like proteins	NA|130aa|down_2|NC_010628.1_3443686_3444076_-	NA	Cas9_archaeal|169aa|down_3|NC_010628.1_3444092_3444599_-	COG1403, McrA, Restriction endonuclease [Defense mechanisms]	NA|84aa|down_4|NC_010628.1_3444733_3444985_-	NA	NA|112aa|down_5|NC_010628.1_3445056_3445392_+	NA	NA|674aa|down_6|NC_010628.1_3446280_3448302_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|203aa|down_7|NC_010628.1_3448310_3448919_+	cd08866, SRPBCC_11, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|640aa|down_8|NC_010628.1_3449399_3451319_+	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|298aa|down_9|NC_010628.1_3451470_3452364_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	18	3655866-3655980	18	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	TCCACGATGTTTTGTAAGTAAGCGTGGGTGTTTC	34	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|72aa|up_8|NC_010628.1_3646782_3646998_-,NA|104aa|up_7|NC_010628.1_3647284_3647596_+,NA|142aa|up_6|NC_010628.1_3647761_3648187_+,NA|63aa|up_5|NC_010628.1_3648358_3648547_+,NA|61aa|up_3|NC_010628.1_3650390_3650573_-,NA|90aa|down_2|NC_010628.1_3659225_3659495_-,NA|130aa|down_4|NC_010628.1_3661557_3661947_-,NA|63aa|down_7|NC_010628.1_3667698_3667887_-,NA|69aa|down_8|NC_010628.1_3668695_3668902_+	NA|143aa|up_9|NC_010628.1_3646361_3646790_-	pfam13650, Asp_protease_2, Aspartyl protease	NA|72aa|up_8|NC_010628.1_3646782_3646998_-	NA	NA|104aa|up_7|NC_010628.1_3647284_3647596_+	NA	NA|142aa|up_6|NC_010628.1_3647761_3648187_+	NA	NA|63aa|up_5|NC_010628.1_3648358_3648547_+	NA	NA|341aa|up_4|NC_010628.1_3648696_3649719_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|61aa|up_3|NC_010628.1_3650390_3650573_-	NA	NA|118aa|up_2|NC_010628.1_3650904_3651258_+	TIGR01617, Uncharacterized_protein_UU176, transcriptional regulator, Spx/MgsR family	NA|390aa|up_1|NC_010628.1_3651577_3652747_-	COG2942, COG2942, N-acyl-D-glucosamine 2-epimerase [Carbohydrate transport and metabolism]	NA|299aa|up_0|NC_010628.1_3652917_3653814_+	PRK02259, PRK02259, aspartoacylase; Provisional	NA|390aa|down_0|NC_010628.1_3656634_3657804_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|380aa|down_1|NC_010628.1_3657882_3659022_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|90aa|down_2|NC_010628.1_3659225_3659495_-	NA	NA|433aa|down_3|NC_010628.1_3660114_3661413_+	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|130aa|down_4|NC_010628.1_3661557_3661947_-	NA	NA|369aa|down_5|NC_010628.1_3662209_3663316_-	COG1748, LYS9, Saccharopine dehydrogenase and related proteins [Amino acid transport and metabolism]	NA|505aa|down_6|NC_010628.1_3663434_3664949_-	TIGR02733, similar_to_to_phytoene_dehydrogenase, C-3',4' desaturase CrtD	NA|63aa|down_7|NC_010628.1_3667698_3667887_-	NA	NA|69aa|down_8|NC_010628.1_3668695_3668902_+	NA	NA|171aa|down_9|NC_010628.1_3669594_3670107_+	PRK09267, PRK09267, flavodoxin FldA; Validated
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	19	4536246-4536322	19	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	TTAACGAAGCAAAAGCTAACCAG	23	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|47aa|up_7|NC_010628.1_4524219_4524360_+,NA	NA|221aa|up_9|NC_010628.1_4521345_4522008_-	PLN02476, PLN02476, O-methyltransferase	NA|622aa|up_8|NC_010628.1_4522281_4524147_-	pfam13424, TPR_12, Tetratricopeptide repeat	NA|47aa|up_7|NC_010628.1_4524219_4524360_+	NA	NA|1040aa|up_6|NC_010628.1_4524662_4527782_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|434aa|up_5|NC_010628.1_4528077_4529379_-	pfam00211, Guanylate_cyc, Adenylate and Guanylate cyclase catalytic domain	NA|542aa|up_4|NC_010628.1_4529490_4531116_+	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|405aa|up_3|NC_010628.1_4531045_4532260_+	pfam05626, DUF790, Protein of unknown function (DUF790)	NA|444aa|up_2|NC_010628.1_4532341_4533673_-	PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional	NA|163aa|up_1|NC_010628.1_4533910_4534399_+	PRK00028, infC, translation initiation factor IF-3; Reviewed	NA|205aa|up_0|NC_010628.1_4534579_4535194_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|392aa|down_0|NC_010628.1_4536798_4537974_+	TIGR01185, membrane_spanning_subunit, DevC protein	NA|235aa|down_1|NC_010628.1_4538040_4538745_+	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|394aa|down_2|NC_010628.1_4539175_4540357_+	smart00854, PGA_cap, Bacterial capsule synthesis protein PGA_cap	NA|535aa|down_3|NC_010628.1_4540434_4542039_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|443aa|down_4|NC_010628.1_4542706_4544035_-	pfam10923, DUF2791, P-loop Domain of unknown function (DUF2791)	NA|430aa|down_5|NC_010628.1_4544131_4545421_-	TIGR02169, chromosome_segregation_protein_related_ptotein, chromosome segregation protein SMC, primarily archaeal type	NA|452aa|down_6|NC_010628.1_4545722_4547078_-	pfam05673, DUF815, Protein of unknown function (DUF815)	NA|107aa|down_7|NC_010628.1_4547165_4547486_+	COG5548, COG5548, Small integral membrane protein [Function unknown]	NA|294aa|down_8|NC_010628.1_4547879_4548761_-	pfam03881, Fructosamin_kin, Fructosamine kinase	NA|442aa|down_9|NC_010628.1_4549018_4550344_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	20	4578003-4578241	20	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	AATGTCGTTAATGTCAAAATTGCTAATCAATCTCAACAGATCAGCAATCAAGT	53	0	0	NA	NA	N:A	2	2	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|47aa|up_9|NC_010628.1_4560673_4560814_-,NA|52aa|up_8|NC_010628.1_4561072_4561228_-,NA	NA|47aa|up_9|NC_010628.1_4560673_4560814_-	NA	NA|52aa|up_8|NC_010628.1_4561072_4561228_-	NA	NA|280aa|up_7|NC_010628.1_4561449_4562289_+	pfam04116, FA_hydroxylase, Fatty acid hydroxylase superfamily	NA|165aa|up_6|NC_010628.1_4562629_4563124_-	pfam04248, NTP_transf_9, Domain of unknown function (DUF427)	NA|446aa|up_5|NC_010628.1_4563364_4564702_-	TIGR04344, generic_methyltransferase, 5-histidylcysteine sulfoxide synthase	NA|325aa|up_4|NC_010628.1_4565046_4566021_-	TIGR03438, conserved_hypothetical_protein, dimethylhistidine N-methyltransferase	NA|719aa|up_3|NC_010628.1_4567034_4569191_-	PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase; Provisional	NA|182aa|up_2|NC_010628.1_4569292_4569838_-	COG0456, RimI, Acetyltransferases [General function prediction only]	NA|341aa|up_1|NC_010628.1_4572907_4573930_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|279aa|up_0|NC_010628.1_4574555_4575392_+	COG4279, COG4279, Uncharacterized conserved protein [Function unknown]	NA|62aa|down_0|NC_010628.1_4581645_4581831_-	pfam08369, PCP_red, Proto-chlorophyllide reductase 57 kD subunit	NA|464aa|down_1|NC_010628.1_4581961_4583353_-	CHL00035, psbC, photosystem II 44 kDa protein	NA|352aa|down_2|NC_010628.1_4583336_4584392_-	CHL00004, psbD, photosystem II protein D2	NA|190aa|down_3|NC_010628.1_4585073_4585643_+	PRK02542, PRK02542, photosystem I assembly protein Ycf4; Provisional	NA|262aa|down_4|NC_010628.1_4585806_4586592_+	cd01924, cyclophilin_TLP40_like, cyclophilin_TLP40_like: cyclophilin-type peptidylprolyl cis- trans isomerases (cyclophilins) similar ot the Spinach thylakoid lumen protein TLP40	NA|383aa|down_5|NC_010628.1_4586747_4587896_+	PRK05952, PRK05952, beta-ketoacyl-ACP synthase	NA|430aa|down_6|NC_010628.1_4588034_4589324_+	COG0334, GdhA, Glutamate dehydrogenase/leucine dehydrogenase [Amino acid transport and metabolism]	NA|374aa|down_7|NC_010628.1_4589496_4590618_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|163aa|down_8|NC_010628.1_4590716_4591205_-	cd12108, Hr-like, Hemerythrin-like domain	NA|172aa|down_9|NC_010628.1_4591723_4592239_+	pfam01724, DUF29, Domain of unknown function DUF29
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	21	4719432-4719692	6	PILER-CR	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	AGACGCAATAGCATCATACGATAAAGCGATCGCCATCAAACCCAACAAATATCAAGC	57	0	0	NA	NA	N:A	2	2	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|240aa|up_9|NC_010628.1_4704237_4704957_-,NA|84aa|up_6|NC_010628.1_4708159_4708411_-,NA|129aa|down_9|NC_010628.1_4735258_4735645_-	NA|240aa|up_9|NC_010628.1_4704237_4704957_-	NA	NA|644aa|up_8|NC_010628.1_4705361_4707293_-	cd07340, M48B_Htpx_like, Peptidase M48 subfamily B HtpX-like membrane-bound metallopeptidase	NA|196aa|up_7|NC_010628.1_4707299_4707887_-	pfam04011, LemA, LemA family	NA|84aa|up_6|NC_010628.1_4708159_4708411_-	NA	NA|198aa|up_5|NC_010628.1_4708438_4709032_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|738aa|up_4|NC_010628.1_4710117_4712331_-	COG3957, COG3957, Phosphoketolase [Carbohydrate transport and metabolism]	NA|461aa|up_3|NC_010628.1_4712520_4713903_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|459aa|up_2|NC_010628.1_4714390_4715767_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|253aa|up_1|NC_010628.1_4715869_4716628_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|565aa|up_0|NC_010628.1_4716732_4718427_+	COG1233, COG1233, Phytoene dehydrogenase and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|291aa|down_0|NC_010628.1_4720044_4720917_+	COG1091, RfbD, dTDP-4-dehydrorhamnose reductase [Cell envelope biogenesis, outer membrane]	NA|188aa|down_1|NC_010628.1_4722081_4722645_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|508aa|down_2|NC_010628.1_4723921_4725445_-	pfam14277, DUF4364, Domain of unknown function (DUF4364)	NA|980aa|down_3|NC_010628.1_4725968_4728908_-	PRK05367, PRK05367, aminomethyl-transferring glycine dehydrogenase	NA|130aa|down_4|NC_010628.1_4729079_4729469_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|379aa|down_5|NC_010628.1_4729594_4730731_-	PRK00389, gcvT, glycine cleavage system aminomethyltransferase GcvT	NA|166aa|down_6|NC_010628.1_4731073_4731571_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|589aa|down_7|NC_010628.1_4731759_4733526_-	pfam13424, TPR_12, Tetratricopeptide repeat	NA|443aa|down_8|NC_010628.1_4733890_4735219_+	COG0763, LpxB, Lipid A disaccharide synthetase [Cell envelope biogenesis, outer membrane]	NA|129aa|down_9|NC_010628.1_4735258_4735645_-	NA
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	22	4735794-4735902	21	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	AATGCATCGGCTGCATCGTTAATGCA	26	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|129aa|up_0|NC_010628.1_4735258_4735645_-,NA	NA|291aa|up_9|NC_010628.1_4720044_4720917_+	COG1091, RfbD, dTDP-4-dehydrorhamnose reductase [Cell envelope biogenesis, outer membrane]	NA|188aa|up_8|NC_010628.1_4722081_4722645_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|508aa|up_7|NC_010628.1_4723921_4725445_-	pfam14277, DUF4364, Domain of unknown function (DUF4364)	NA|980aa|up_6|NC_010628.1_4725968_4728908_-	PRK05367, PRK05367, aminomethyl-transferring glycine dehydrogenase	NA|130aa|up_5|NC_010628.1_4729079_4729469_-	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|379aa|up_4|NC_010628.1_4729594_4730731_-	PRK00389, gcvT, glycine cleavage system aminomethyltransferase GcvT	NA|166aa|up_3|NC_010628.1_4731073_4731571_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|589aa|up_2|NC_010628.1_4731759_4733526_-	pfam13424, TPR_12, Tetratricopeptide repeat	NA|443aa|up_1|NC_010628.1_4733890_4735219_+	COG0763, LpxB, Lipid A disaccharide synthetase [Cell envelope biogenesis, outer membrane]	NA|129aa|up_0|NC_010628.1_4735258_4735645_-	NA	NA|893aa|down_0|NC_010628.1_4735936_4738615_-	PRK05399, PRK05399, DNA mismatch repair protein MutS; Provisional	NA|485aa|down_1|NC_010628.1_4739691_4741146_+	TIGR03794, conserved_hypothetical_protein, NHLM bacteriocin system secretion protein	NA|1774aa|down_2|NC_010628.1_4741702_4747024_+	TIGR03796, ABC_transporter_related, NHLM bacteriocin system ABC transporter, peptidase/ATP-binding protein	NA|534aa|down_3|NC_010628.1_4747369_4748971_-	COG3540, PhoD, Phosphodiesterase/alkaline phosphatase D [Inorganic ion transport and metabolism]	NA|469aa|down_4|NC_010628.1_4749473_4750880_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|368aa|down_5|NC_010628.1_4750945_4752049_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|145aa|down_6|NC_010628.1_4752263_4752698_-	cd04682, Nudix_Hydrolase_23, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|139aa|down_7|NC_010628.1_4752722_4753139_-	PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional	NA|293aa|down_8|NC_010628.1_4753461_4754340_+	PRK14186, PRK14186, bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase FolD	NA|311aa|down_9|NC_010628.1_4754555_4755488_+	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	23	4939090-4939185	22	CRISPRCasFinder	no	csa3	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Type I-A	ATTGGGAATTGGGAATTGGGCAT	23	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA,NA|66aa|down_0|NC_010628.1_4939377_4939575_+,NA|82aa|down_1|NC_010628.1_4939649_4939895_+,NA|620aa|down_6|NC_010628.1_4945423_4947283_-,NA|218aa|down_7|NC_010628.1_4947347_4948001_-,NA|161aa|down_8|NC_010628.1_4948340_4948823_+,NA|108aa|down_9|NC_010628.1_4948916_4949240_+	NA|253aa|up_9|NC_010628.1_4930044_4930803_-	COG0204, PlsC, 1-acyl-sn-glycerol-3-phosphate acyltransferase [Lipid metabolism]	NA|162aa|up_8|NC_010628.1_4930916_4931402_-	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|105aa|up_7|NC_010628.1_4931647_4931962_-	TIGR02181, GRX_bact, Glutaredoxin, GrxC family	NA|191aa|up_6|NC_010628.1_4932009_4932582_-	pfam06168, DUF981, Protein of unknown function (DUF981)	NA|346aa|up_5|NC_010628.1_4932939_4933977_+	PRK09479, glpX, fructose 1,6-bisphosphatase II; Reviewed	NA|429aa|up_4|NC_010628.1_4934218_4935505_+	PRK00045, hemA, glutamyl-tRNA reductase; Reviewed	csa3|195aa|up_3|NC_010628.1_4935610_4936195_+	COG3860, COG3860, Uncharacterized protein conserved in bacteria [Function unknown]	NA|197aa|up_2|NC_010628.1_4936238_4936829_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|261aa|up_1|NC_010628.1_4937178_4937961_-	COG0411, LivG, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]	NA|378aa|up_0|NC_010628.1_4937950_4939084_-	COG4177, LivM, ABC-type branched-chain amino acid transport system, permease component [Amino acid transport and metabolism]	NA|66aa|down_0|NC_010628.1_4939377_4939575_+	NA	NA|82aa|down_1|NC_010628.1_4939649_4939895_+	NA	NA|529aa|down_2|NC_010628.1_4940006_4941593_+	PRK14096, pgi, glucose-6-phosphate isomerase; Provisional	NA|221aa|down_3|NC_010628.1_4941862_4942525_-	pfam07862, Nif11, Nif11 domain	NA|298aa|down_4|NC_010628.1_4943302_4944196_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|400aa|down_5|NC_010628.1_4944218_4945418_-	pfam08819, DUF1802, Domain of unknown function (DUF1802)	NA|620aa|down_6|NC_010628.1_4945423_4947283_-	NA	NA|218aa|down_7|NC_010628.1_4947347_4948001_-	NA	NA|161aa|down_8|NC_010628.1_4948340_4948823_+	NA	NA|108aa|down_9|NC_010628.1_4948916_4949240_+	NA
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	24	4949016-4949222	23	CRISPRCasFinder	no	csa3	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Type I-A	GAAGACCCTTACGGCGACCCCGCAGAT	27	0	0	NA	NA	N:A	3	3	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|66aa|up_8|NC_010628.1_4939377_4939575_+,NA|82aa|up_7|NC_010628.1_4939649_4939895_+,NA|620aa|up_2|NC_010628.1_4945423_4947283_-,NA|218aa|up_1|NC_010628.1_4947347_4948001_-,NA|161aa|up_0|NC_010628.1_4948340_4948823_+,NA	NA|378aa|up_9|NC_010628.1_4937950_4939084_-	COG4177, LivM, ABC-type branched-chain amino acid transport system, permease component [Amino acid transport and metabolism]	NA|66aa|up_8|NC_010628.1_4939377_4939575_+	NA	NA|82aa|up_7|NC_010628.1_4939649_4939895_+	NA	NA|529aa|up_6|NC_010628.1_4940006_4941593_+	PRK14096, pgi, glucose-6-phosphate isomerase; Provisional	NA|221aa|up_5|NC_010628.1_4941862_4942525_-	pfam07862, Nif11, Nif11 domain	NA|298aa|up_4|NC_010628.1_4943302_4944196_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|400aa|up_3|NC_010628.1_4944218_4945418_-	pfam08819, DUF1802, Domain of unknown function (DUF1802)	NA|620aa|up_2|NC_010628.1_4945423_4947283_-	NA	NA|218aa|up_1|NC_010628.1_4947347_4948001_-	NA	NA|161aa|up_0|NC_010628.1_4948340_4948823_+	NA	NA|328aa|down_0|NC_010628.1_4949755_4950739_-	cd01339, LDH-like_MDH, L-lactate dehydrogenase-like malate dehydrogenase proteins	NA|72aa|down_1|NC_010628.1_4950825_4951041_-	pfam11910, NdhO, Cyanobacterial and plant NDH-1 subunit O	NA|292aa|down_2|NC_010628.1_4951143_4952019_-	PRK13945, PRK13945, formamidopyrimidine-DNA glycosylase; Provisional	NA|72aa|down_3|NC_010628.1_4952263_4952479_-	pfam02427, PSI_PsaE, Photosystem I reaction centre subunit IV / PsaE	NA|195aa|down_4|NC_010628.1_4952763_4953348_-	pfam04755, PAP_fibrillin, PAP_fibrillin	NA|79aa|down_5|NC_010628.1_4953590_4953827_+	pfam11332, DUF3134, Protein of unknown function (DUF3134)	NA|364aa|down_6|NC_010628.1_4953906_4954998_+	PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional	NA|519aa|down_7|NC_010628.1_4955348_4956905_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|169aa|down_8|NC_010628.1_4957237_4957744_-	cd00886, MogA_MoaB, MogA_MoaB family	NA|114aa|down_9|NC_010628.1_4957845_4958187_-	PRK13612, PRK13612, photosystem II reaction center protein Psb28; Provisional
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	25	5588195-5588288	24	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	TCGCGGCTACACAGACGAAACCC	23	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|58aa|up_8|NC_010628.1_5578587_5578761_+,NA|317aa|up_2|NC_010628.1_5583670_5584621_+,NA|137aa|down_4|NC_010628.1_5594204_5594615_+,NA|64aa|down_5|NC_010628.1_5594774_5594966_+	NA|493aa|up_9|NC_010628.1_5576807_5578286_+	TIGR04095, type_III_restriction_protein_res_subunit, DNA phosphorothioation system restriction enzyme	NA|58aa|up_8|NC_010628.1_5578587_5578761_+	NA	NA|691aa|up_7|NC_010628.1_5578757_5580830_+	TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD	NA|113aa|up_6|NC_010628.1_5581031_5581370_+	pfam05685, Uma2, Putative restriction endonuclease	NA|184aa|up_5|NC_010628.1_5581406_5581958_+	pfam05685, Uma2, Putative restriction endonuclease	NA|153aa|up_4|NC_010628.1_5582126_5582585_+	TIGR04062, hypothetical_protein_CY0110_29519, dnd system-associated protein 4	NA|334aa|up_3|NC_010628.1_5582649_5583651_+	TIGR03187, hypothetical_protein, DGQHR domain	NA|317aa|up_2|NC_010628.1_5583670_5584621_+	NA	NA|533aa|up_1|NC_010628.1_5584752_5586351_-	TIGR03187, hypothetical_protein, DGQHR domain	NA|542aa|up_0|NC_010628.1_5586533_5588159_+	PRK06850, PRK06850, hypothetical protein; Provisional	NA|662aa|down_0|NC_010628.1_5588822_5590808_+	TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD	NA|129aa|down_1|NC_010628.1_5590865_5591252_+	pfam08870, DndE, DNA sulphur modification protein DndE	NA|486aa|down_2|NC_010628.1_5591317_5592775_+	TIGR04096, conserved_hypothetical_protein, DNA phosphorothioation-associated putative methyltransferase	NA|425aa|down_3|NC_010628.1_5592918_5594193_+	cd17486, MFS_AmpG_like, AmpG and similar transporters of the Major Facilitator Superfamily	NA|137aa|down_4|NC_010628.1_5594204_5594615_+	NA	NA|64aa|down_5|NC_010628.1_5594774_5594966_+	NA	NA|169aa|down_6|NC_010628.1_5595010_5595517_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|253aa|down_7|NC_010628.1_5595690_5596449_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|135aa|down_8|NC_010628.1_5596620_5597025_-	cd02213, cupin_PMI_typeII_C, Phosphomannose isomerase type II, C-terminal cupin domain	NA|448aa|down_9|NC_010628.1_5597140_5598484_-	COG2133, COG2133, Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	26	6775105-6775202	25	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	CTCCTTTCTCTACGAGAGGCTGCGC	25	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|82aa|up_6|NC_010628.1_6766554_6766800_+,NA|73aa|down_5|NC_010628.1_6778352_6778571_-,NA|88aa|down_9|NC_010628.1_6781768_6782032_-	NA|474aa|up_9|NC_010628.1_6763841_6765263_-	PRK07362, PRK07362, NADP-dependent isocitrate dehydrogenase	NA|243aa|up_8|NC_010628.1_6765535_6766264_+	pfam05419, GUN4, GUN4-like	NA|91aa|up_7|NC_010628.1_6766274_6766547_+	pfam04365, BrnT_toxin, Ribonuclease toxin, BrnT, of type II toxin-antitoxin system	NA|82aa|up_6|NC_010628.1_6766554_6766800_+	NA	NA|378aa|up_5|NC_010628.1_6766851_6767985_+	COG3839, MalK, ABC-type sugar transport systems, ATPase components [Carbohydrate transport and metabolism]	NA|260aa|up_4|NC_010628.1_6768095_6768875_+	COG0605, SodA, Superoxide dismutase [Inorganic ion transport and metabolism]	NA|937aa|up_3|NC_010628.1_6769143_6771954_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|440aa|up_2|NC_010628.1_6771972_6773292_+	cd19920, REC_PA4781-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase PA4781 and similar domains	NA|245aa|up_1|NC_010628.1_6773361_6774096_-	COG0357, GidB, Predicted S-adenosylmethionine-dependent methyltransferase involved in bacterial cell division [Cell envelope biogenesis, outer membrane]	NA|225aa|up_0|NC_010628.1_6774173_6774848_-	COG1122, CbiO, ABC-type cobalt transport system, ATPase component [Inorganic ion transport and metabolism]	NA|311aa|down_0|NC_010628.1_6775315_6776248_+	sd00006, TPR, Tetratricopeptide repeat	NA|151aa|down_1|NC_010628.1_6776339_6776792_+	pfam12049, DUF3531, Protein of unknown function (DUF3531)	NA|144aa|down_2|NC_010628.1_6776800_6777232_+	cd04688, Nudix_Hydrolase_29, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|127aa|down_3|NC_010628.1_6777269_6777650_-	pfam15919, HicB_lk_antitox, HicB_like antitoxin of bacterial toxin-antitoxin system	NA|92aa|down_4|NC_010628.1_6777615_6777891_-	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|73aa|down_5|NC_010628.1_6778352_6778571_-	NA	NA|360aa|down_6|NC_010628.1_6778764_6779844_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|437aa|down_7|NC_010628.1_6779977_6781288_+	COG5659, COG5659, FOG: Transposase [DNA replication, recombination, and repair]	NA|163aa|down_8|NC_010628.1_6781186_6781675_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|88aa|down_9|NC_010628.1_6781768_6782032_-	NA
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	27	6982665-6983554	26,5,7	CRISPRCasFinder,CRT,PILER-CR	no	c2c5_V-U5	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Type V-U5	CTTTCAACCCACCCAGTACCTGGAGGGTTGTTGCCAC,CTTTCAACCCACCCAGTACCTGGAGGGTTGTTGCCAC,CTTTCAACCCACCCAGTACCTGGAGGGTTGTTGCCAC	37,37,37	0	0	NA	NA	N:A	12,12,11	12	TypeV-U5	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|124aa|up_8|NC_010628.1_6966477_6966849_+,NA|63aa|up_3|NC_010628.1_6977499_6977688_-,c2c5_V-U5|640aa|down_0|NC_010628.1_6983999_6985919_-	NA|95aa|up_9|NC_010628.1_6965277_6965562_-	pfam04248, NTP_transf_9, Domain of unknown function (DUF427)	NA|124aa|up_8|NC_010628.1_6966477_6966849_+	NA	NA|732aa|up_7|NC_010628.1_6966994_6969190_-	cd13401, Slt70-like, 70kDa soluble lytic transglycosylase (Slt70) and similar proteins	NA|1810aa|up_6|NC_010628.1_6969865_6975295_+	pfam13385, Laminin_G_3, Concanavalin A-like lectin/glucanases superfamily	NA|207aa|up_5|NC_010628.1_6975425_6976046_+	pfam05685, Uma2, Putative restriction endonuclease	NA|420aa|up_4|NC_010628.1_6976049_6977309_-	PRK07364, PRK07364, FAD-dependent hydroxylase	NA|63aa|up_3|NC_010628.1_6977499_6977688_-	NA	NA|253aa|up_2|NC_010628.1_6977879_6978638_-	PRK00110, PRK00110, YebC/PmpR family DNA-binding transcriptional regulator	NA|449aa|up_1|NC_010628.1_6978785_6980132_+	cd14748, PBP2_UgpB, The periplasmic-binding component of ABC transport system specific for sn-glycerol-3-phosphate; possesses type 2 periplasmic binding fold	NA|524aa|up_0|NC_010628.1_6980661_6982233_-	cd07378, MPP_ACP5, Homo sapiens acid phosphatase 5 and related proteins, metallophosphatase domain	c2c5_V-U5|640aa|down_0|NC_010628.1_6983999_6985919_-	NA	NA|151aa|down_1|NC_010628.1_6985999_6986452_+	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|216aa|down_2|NC_010628.1_6986740_6987388_+	pfam08747, DUF1788, Domain of unknown function (DUF1788)	NA|1180aa|down_3|NC_010628.1_6987384_6990924_+	NF033441, BREX_BrxC, BREX system P-loop protein BrxC	NA|1179aa|down_4|NC_010628.1_6990988_6994525_+	NF033452, BREX_1_MTaseX, BREX-1 system adenine-specific DNA-methyltransferase PglX	NA|118aa|down_5|NC_010628.1_6994587_6994941_-	pfam08869, XisI, XisI protein	NA|139aa|down_6|NC_010628.1_6994928_6995345_-	pfam08814, XisH, XisH protein	NA|856aa|down_7|NC_010628.1_6995445_6998013_+	TIGR02687, conserved_hypothetical_protein, TIGR02687 family protein	NA|673aa|down_8|NC_010628.1_6998104_7000123_+	TIGR02653, putative_ATP-dependent_Lon_protease, conserved hypothetical protein	NA|201aa|down_9|NC_010628.1_7000119_7000722_-	pfam08849, DUF1819, Putative inner membrane protein (DUF1819)
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	28	7053084-7053208	27	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	TAGCCCGCTTTGTGAATGCGATCGCTTGCATTATGAATGCA	41	1	2	7053125-7053167|7053125-7053167	NC_010628.1_7053167-7053209|NC_010628.1_7053083-7053125	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|221aa|up_8|NC_010628.1_7041492_7042155_+,NA|54aa|up_5|NC_010628.1_7043825_7043987_+,NA|65aa|down_2|NC_010628.1_7054585_7054780_+,NA|52aa|down_6|NC_010628.1_7062598_7062754_-	NA|237aa|up_9|NC_010628.1_7040702_7041413_+	COG4300, CadD, Predicted permease, cadmium resistance protein [Inorganic ion transport and metabolism]	NA|221aa|up_8|NC_010628.1_7041492_7042155_+	NA	NA|143aa|up_7|NC_010628.1_7042624_7043053_+	pfam02656, DUF202, Domain of unknown function (DUF202)	NA|241aa|up_6|NC_010628.1_7043095_7043818_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|54aa|up_5|NC_010628.1_7043825_7043987_+	NA	NA|235aa|up_4|NC_010628.1_7044007_7044712_-	cd05386, TraL, transfer origin protein TraL	NA|1251aa|up_3|NC_010628.1_7045498_7049251_+	pfam12770, CHAT, CHAT domain	NA|239aa|up_2|NC_010628.1_7049994_7050711_-	cd02910, cupin_Yhhw_N, Escherichia coli YhhW and YhaK and related proteins, pirin-like bicupin, N-terminal cupin domain	NA|191aa|up_1|NC_010628.1_7051102_7051675_-	PRK09448, PRK09448, DNA starvation/stationary phase protection protein Dps; Provisional	NA|351aa|up_0|NC_010628.1_7051831_7052884_-	cd19080, AKR_AKR9A_9B, AKR9A and AKR9B families of aldo-keto reductase (AKR)	NA|202aa|down_0|NC_010628.1_7053308_7053914_+	pfam05685, Uma2, Putative restriction endonuclease	NA|136aa|down_1|NC_010628.1_7054007_7054415_+	COG3011, COG3011, Predicted thiol-disulfide oxidoreductase [General function    prediction only]	NA|65aa|down_2|NC_010628.1_7054585_7054780_+	NA	NA|616aa|down_3|NC_010628.1_7054776_7056624_+	TIGR02402, Malto-oligosyltrehalose_trehalohydrolase, malto-oligosyltrehalose trehalohydrolase	NA|933aa|down_4|NC_010628.1_7057177_7059976_+	COG3280, TreY, Maltooligosyl trehalose synthase [Carbohydrate transport and metabolism]	NA|506aa|down_5|NC_010628.1_7060116_7061634_+	COG1626, TreA, Neutral trehalase [Carbohydrate transport and metabolism]	NA|52aa|down_6|NC_010628.1_7062598_7062754_-	NA	NA|472aa|down_7|NC_010628.1_7063201_7064617_+	PRK09243, PRK09243, nicotinate phosphoribosyltransferase; Validated	NA|200aa|down_8|NC_010628.1_7064705_7065305_+	PRK00071, nadD, nicotinate-nucleotide adenylyltransferase	NA|249aa|down_9|NC_010628.1_7065265_7066012_+	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	29	7171867-7171961	28	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	AGGTTAGCATTTTCAAGATTGGCATTTGAAAGGTT	35	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|88aa|up_3|NC_010628.1_7169171_7169435_+,NA|122aa|up_1|NC_010628.1_7170667_7171033_+,NA|82aa|up_0|NC_010628.1_7171145_7171391_+,NA|139aa|down_0|NC_010628.1_7174497_7174914_-,NA|158aa|down_1|NC_010628.1_7175144_7175618_-,NA|49aa|down_8|NC_010628.1_7182648_7182795_-	NA|155aa|up_9|NC_010628.1_7160857_7161322_-	cd04586, CBS_pair_BON_assoc, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain	NA|137aa|up_8|NC_010628.1_7161579_7161990_-	cd03425, MutT_pyrophosphohydrolase, The MutT pyrophosphohydrolase is a prototypical Nudix hydrolase that catalyzes the hydrolysis of nucleoside and deoxynucleoside triphosphates (NTPs and dNTPs) by substitution at a beta-phosphorus to yield a nucleotide monophosphate (NMP) and inorganic pyrophosphate (PPi)	NA|115aa|up_7|NC_010628.1_7162238_7162583_-	pfam05542, DUF760, Protein of unknown function (DUF760)	NA|319aa|up_6|NC_010628.1_7162844_7163801_-	PRK07405, PRK07405, RNA polymerase sigma factor SigD; Validated	NA|1088aa|up_5|NC_010628.1_7164354_7167618_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|185aa|up_4|NC_010628.1_7168166_7168721_-	COG0783, Dps, DNA-binding ferritin-like protein (oxidative damage protectant) [Inorganic ion transport and metabolism]	NA|88aa|up_3|NC_010628.1_7169171_7169435_+	NA	NA|219aa|up_2|NC_010628.1_7169493_7170150_+	cd02980, TRX_Fd_family, Thioredoxin (TRX)-like [2Fe-2S] Ferredoxin (Fd) family; composed of [2Fe-2S] Fds with a TRX fold (TRX-like Fds) and proteins containing domains similar to TRX-like Fd including formate dehydrogenases, NAD-reducing hydrogenases and the subunit E of NADH:ubiquinone oxidoreductase (NuoE)	NA|122aa|up_1|NC_010628.1_7170667_7171033_+	NA	NA|82aa|up_0|NC_010628.1_7171145_7171391_+	NA	NA|139aa|down_0|NC_010628.1_7174497_7174914_-	NA	NA|158aa|down_1|NC_010628.1_7175144_7175618_-	NA	NA|265aa|down_2|NC_010628.1_7175702_7176497_-	TIGR01097, PhnE, phosphonate ABC transporter, permease protein PhnE	NA|245aa|down_3|NC_010628.1_7176600_7177335_-	COG3638, COG3638, ABC-type phosphate/phosphonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|325aa|down_4|NC_010628.1_7177487_7178462_-	COG3221, PhnD, ABC-type phosphate/phosphonate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|432aa|down_5|NC_010628.1_7178614_7179910_-	TIGR00004, RutC_family_protein, reactive intermediate/imine deaminase	NA|443aa|down_6|NC_010628.1_7180500_7181829_-	COG5659, COG5659, FOG: Transposase [DNA replication, recombination, and repair]	NA|144aa|down_7|NC_010628.1_7182097_7182529_-	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|49aa|down_8|NC_010628.1_7182648_7182795_-	NA	NA|503aa|down_9|NC_010628.1_7182823_7184332_+	PRK14508, PRK14508, 4-alpha-glucanotransferase; Provisional
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	30	7754830-7754926	29	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	TAACTTGTGTGTACACCGTAGCT	23	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|215aa|up_5|NC_010628.1_7747685_7748330_+,NA|231aa|up_4|NC_010628.1_7749195_7749888_+,NA|147aa|up_3|NC_010628.1_7749955_7750396_+,NA|139aa|up_2|NC_010628.1_7750508_7750925_+,NA|47aa|down_0|NC_010628.1_7755057_7755198_+	NA|166aa|up_9|NC_010628.1_7742776_7743274_-	pfam07799, DUF1643, Protein of unknown function (DUF1643)	NA|396aa|up_8|NC_010628.1_7743319_7744507_-	pfam18723, aGPT-Pplase1, alpha-glutamyl/putrescinyl thymine pyrophosphorylase clade 1	NA|227aa|up_7|NC_010628.1_7744680_7745361_-	sd00006, TPR, Tetratricopeptide repeat	NA|658aa|up_6|NC_010628.1_7745478_7747452_-	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|215aa|up_5|NC_010628.1_7747685_7748330_+	NA	NA|231aa|up_4|NC_010628.1_7749195_7749888_+	NA	NA|147aa|up_3|NC_010628.1_7749955_7750396_+	NA	NA|139aa|up_2|NC_010628.1_7750508_7750925_+	NA	NA|160aa|up_1|NC_010628.1_7750936_7751416_+	pfam00931, NB-ARC, NB-ARC domain	NA|511aa|up_0|NC_010628.1_7752597_7754130_+	sd00006, TPR, Tetratricopeptide repeat	NA|47aa|down_0|NC_010628.1_7755057_7755198_+	NA	NA|241aa|down_1|NC_010628.1_7755288_7756011_+	TIGR03716, R_switched_YkoY, integral membrane protein, YkoY family	NA|150aa|down_2|NC_010628.1_7756086_7756536_-	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|673aa|down_3|NC_010628.1_7756753_7758772_+	PRK05354, PRK05354, biosynthetic arginine decarboxylase	NA|390aa|down_4|NC_010628.1_7758958_7760128_-	COG1208, GCD1, Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis]	NA|277aa|down_5|NC_010628.1_7760359_7761190_-	COG1354, scpA, Rec8/ScpA/Scc1-like protein (kleisin family) [Replication,    recombination, and repair]	NA|362aa|down_6|NC_010628.1_7761517_7762603_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|215aa|down_7|NC_010628.1_7762729_7763374_+	TIGR00558, Pyridoxine/pyridoxamine_5'-phosphate_oxidase, pyridoxamine-phosphate oxidase	NA|206aa|down_8|NC_010628.1_7763468_7764086_-	COG1182, AcpD, Acyl carrier protein phosphodiesterase [Lipid metabolism]	NA|118aa|down_9|NC_010628.1_7764235_7764589_+	COG1733, COG1733, Predicted transcriptional regulators [Transcription]
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	31	7769881-7771303	8,30,6,9	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	GTTTCAATCCCTAATAGGGATTTTGATGAATTGCAAT,GTTTCAATCCCTAATAGGGATTTTGATGAATTGCAAT,GTTTCAATCCCTAATAGGGATTTTGATGAATTGCAAT,GTTTCAATCCCTAATAGGGATTTTGATGAATTGCAAT	37,37,37,37	0	0	NA	NA	N:A	17,19,19,17	19	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA,NA	NA|277aa|up_9|NC_010628.1_7760359_7761190_-	COG1354, scpA, Rec8/ScpA/Scc1-like protein (kleisin family) [Replication,    recombination, and repair]	NA|362aa|up_8|NC_010628.1_7761517_7762603_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|215aa|up_7|NC_010628.1_7762729_7763374_+	TIGR00558, Pyridoxine/pyridoxamine_5'-phosphate_oxidase, pyridoxamine-phosphate oxidase	NA|206aa|up_6|NC_010628.1_7763468_7764086_-	COG1182, AcpD, Acyl carrier protein phosphodiesterase [Lipid metabolism]	NA|118aa|up_5|NC_010628.1_7764235_7764589_+	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|295aa|up_4|NC_010628.1_7764720_7765605_-	pfam11353, DUF3153, Protein of unknown function (DUF3153)	NA|181aa|up_3|NC_010628.1_7765631_7766174_-	cd15830, BamD, BamD lipoprotein, a component of the beta-barrel assembly machinery	NA|299aa|up_2|NC_010628.1_7766904_7767801_-	cd04250, AAK_NAGK-C, AAK_NAGK-C: N-Acetyl-L-glutamate kinase - cyclic (NAGK-C) catalyzes the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of arginine biosynthesis found in some bacteria and photosynthetic organisms using the non-acetylated, cyclic route of ornithine biosynthesis	NA|127aa|up_1|NC_010628.1_7767902_7768283_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|182aa|up_0|NC_010628.1_7769112_7769658_-	PRK00131, aroK, shikimate kinase; Reviewed	NA|171aa|down_0|NC_010628.1_7772640_7773153_-	pfam13563, 2_5_RNA_ligase2, 2'-5' RNA ligase superfamily	NA|485aa|down_1|NC_010628.1_7774068_7775523_-	COG4775, COG4775, Outer membrane protein/protective antigen OMA87 [Cell envelope biogenesis, outer membrane]	NA|156aa|down_2|NC_010628.1_7776029_7776497_+	COG3296, COG3296, Uncharacterized protein conserved in bacteria [Function unknown]	NA|366aa|down_3|NC_010628.1_7776551_7777649_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|82aa|down_4|NC_010628.1_7777656_7777902_+	pfam14279, HNH_5, HNH endonuclease	NA|914aa|down_5|NC_010628.1_7778255_7780997_-	cd10797, GH57N_APU_like_1, N-terminal putative catalytic domain of mainly uncharacterized prokaryotic proteins similar to archaeal thermoactive amylopullulanases; glycoside hydrolase family 57 (GH57)	NA|364aa|down_6|NC_010628.1_7781523_7782615_+	TIGR00378, cax, calcium/proton exchanger (cax)	NA|379aa|down_7|NC_010628.1_7782795_7783932_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|477aa|down_8|NC_010628.1_7784099_7785530_-	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|246aa|down_9|NC_010628.1_7785747_7786485_+	pfam07444, Ycf66_N, Ycf66 protein N-terminus
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	32	7930349-7930453	31	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	ACCTAAACCTCCATAGAGTTTGTC	24	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|553aa|up_9|NC_010628.1_7916728_7918387_-,NA|114aa|up_7|NC_010628.1_7919286_7919628_-,NA|50aa|up_6|NC_010628.1_7920109_7920259_-,NA|175aa|up_3|NC_010628.1_7923801_7924326_-,NA	NA|553aa|up_9|NC_010628.1_7916728_7918387_-	NA	NA|293aa|up_8|NC_010628.1_7918392_7919271_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|114aa|up_7|NC_010628.1_7919286_7919628_-	NA	NA|50aa|up_6|NC_010628.1_7920109_7920259_-	NA	NA|650aa|up_5|NC_010628.1_7921301_7923251_+	cd17640, LC_FACS_like, Long-chain fatty acid CoA synthetase	NA|150aa|up_4|NC_010628.1_7923293_7923743_+	pfam11068, YlqD, YlqD protein	NA|175aa|up_3|NC_010628.1_7923801_7924326_-	NA	NA|434aa|up_2|NC_010628.1_7924731_7926033_+	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed	NA|305aa|up_1|NC_010628.1_7926194_7927109_+	COG5464, COG5464, Uncharacterized conserved protein [Function unknown]	NA|154aa|up_0|NC_010628.1_7927322_7927784_-	pfam08548, Peptidase_M10_C, Peptidase M10 serralysin C terminal	NA|349aa|down_0|NC_010628.1_7930860_7931908_-	cd04252, AAK_NAGK-fArgBP, AAK_NAGK-fArgBP: N-Acetyl-L-glutamate kinase (NAGK) of the fungal arginine-biosynthetic pathway (fArgBP)	NA|716aa|down_1|NC_010628.1_7932602_7934750_-	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|296aa|down_2|NC_010628.1_7935123_7936011_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|243aa|down_3|NC_010628.1_7936007_7936736_+	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|521aa|down_4|NC_010628.1_7937311_7938874_+	COG1233, COG1233, Phytoene dehydrogenase and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|444aa|down_5|NC_010628.1_7939050_7940382_+	COG2907, COG2907, Predicted NAD/FAD-binding protein [General function prediction only]	NA|357aa|down_6|NC_010628.1_7940447_7941518_-	TIGR01392, Homoserine_O-acetyltransferase, homoserine O-acetyltransferase	NA|435aa|down_7|NC_010628.1_7941662_7942967_-	TIGR01326, Includes:_O-acetylhomoserine_sulfhydrylase, OAH/OAS sulfhydrylase	NA|252aa|down_8|NC_010628.1_7943160_7943916_-	cd14953, NHL_like_1, Uncharacterized NHL-repeat domain in bacterial proteins	NA|93aa|down_9|NC_010628.1_7943893_7944172_-	cd14955, NHL_like_4, Uncharacterized NHL-repeat domain in bacterial and archaeal proteins
GCF_000020025.1_ASM2002v1	NC_010628	Nostoc punctiforme PCC 73102, complete sequence	33	7964655-7964732	32	CRISPRCasFinder	no		PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	Orphan	AGAAAAACCCGCAGCTAAAGCCGC	24	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|207aa|up_9|NC_010628.1_7952404_7953025_+,NA|63aa|up_8|NC_010628.1_7953343_7953532_-,NA|438aa|up_5|NC_010628.1_7956174_7957488_-,NA|268aa|up_2|NC_010628.1_7960975_7961779_+,NA|121aa|up_1|NC_010628.1_7962555_7962918_-,NA|169aa|down_2|NC_010628.1_7969057_7969564_+,NA|142aa|down_3|NC_010628.1_7969653_7970079_-,NA|87aa|down_4|NC_010628.1_7970819_7971080_+,NA|93aa|down_6|NC_010628.1_7972960_7973239_+,NA|54aa|down_7|NC_010628.1_7973320_7973482_+	NA|207aa|up_9|NC_010628.1_7952404_7953025_+	NA	NA|63aa|up_8|NC_010628.1_7953343_7953532_-	NA	NA|254aa|up_7|NC_010628.1_7953728_7954490_+	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|522aa|up_6|NC_010628.1_7954581_7956147_+	pfam01832, Glucosaminidase, Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase	NA|438aa|up_5|NC_010628.1_7956174_7957488_-	NA	NA|641aa|up_4|NC_010628.1_7957718_7959641_-	PRK05192, PRK05192, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis enzyme MnmG	NA|291aa|up_3|NC_010628.1_7960029_7960902_+	pfam02557, VanY, D-alanyl-D-alanine carboxypeptidase	NA|268aa|up_2|NC_010628.1_7960975_7961779_+	NA	NA|121aa|up_1|NC_010628.1_7962555_7962918_-	NA	NA|352aa|up_0|NC_010628.1_7963176_7964232_+	pfam00892, EamA, EamA-like transporter family	NA|165aa|down_0|NC_010628.1_7965199_7965694_-	PHA01886, PHA01886, TM2 domain-containing protein	NA|359aa|down_1|NC_010628.1_7966115_7967192_+	PRK13654, PRK13654, magnesium-protoporphyrin IX monomethyl ester cyclase; Provisional	NA|169aa|down_2|NC_010628.1_7969057_7969564_+	NA	NA|142aa|down_3|NC_010628.1_7969653_7970079_-	NA	NA|87aa|down_4|NC_010628.1_7970819_7971080_+	NA	NA|29aa|down_5|NC_010628.1_7971700_7971787_+	pfam13018, ESPR, Extended Signal Peptide of Type V secretion system	NA|93aa|down_6|NC_010628.1_7972960_7973239_+	NA	NA|54aa|down_7|NC_010628.1_7973320_7973482_+	NA	NA|325aa|down_8|NC_010628.1_7973922_7974897_-	PRK09375, PRK09375, quinolinate synthase NadA	NA|310aa|down_9|NC_010628.1_7975046_7975976_+	TIGR04168, Ser/Thr_protein_phosphatase_family_protein, TIGR04168 family protein
GCF_000020025.1_ASM2002v1	NC_010631	Nostoc punctiforme PCC 73102 plasmid pNPUN01, complete sequence	1	163417-163549	1,1	CRISPRCasFinder,PILER-CR	no			Orphan	GAAAAAATTTCATTTAGTCTGCAA,TTGCAGACTAAATGAAATTTTTTC	24,24	3	3	163441-163469|163494-163524|163501-163531	NC_010628.1_4036315-4036287|NC_010628.1_4036262-4036232|NC_010628.1_4036262-4036232	N:A	2,2	2	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|227aa|up_7|NC_010631.1_158407_159088_-,NA|156aa|up_6|NC_010631.1_159090_159558_-,NA|105aa|up_3|NC_010631.1_161337_161652_-,NA|71aa|down_9|NC_010631.1_180215_180428_-	NA|917aa|up_9|NC_010631.1_151681_154432_+	pfam12770, CHAT, CHAT domain	NA|1193aa|up_8|NC_010631.1_154487_158066_+	pfam12770, CHAT, CHAT domain	NA|227aa|up_7|NC_010631.1_158407_159088_-	NA	NA|156aa|up_6|NC_010631.1_159090_159558_-	NA	NA|141aa|up_5|NC_010631.1_159761_160184_-	cd16377, 23S_rRNA_IVP_like, 23S rRNA-intervening sequence protein and similar proteins	NA|336aa|up_4|NC_010631.1_160269_161277_-	TIGR02225, Tyrosine_recombinase_XerD, tyrosine recombinase XerD	NA|105aa|up_3|NC_010631.1_161337_161652_-	NA	NA|112aa|up_2|NC_010631.1_161795_162131_-	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|138aa|up_1|NC_010631.1_162118_162532_-	pfam08814, XisH, XisH protein	NA|152aa|up_0|NC_010631.1_162594_163050_-	PRK12275, PRK12275, hypothetical protein; Reviewed	NA|274aa|down_0|NC_010631.1_163649_164471_+	pfam08721, Tn7_Tnp_TnsA_C, TnsA endonuclease C terminal	NA|743aa|down_1|NC_010631.1_164492_166721_+	pfam00665, rve, Integrase core domain	NA|559aa|down_2|NC_010631.1_166704_168381_+	pfam13401, AAA_22, AAA domain	NA|446aa|down_3|NC_010631.1_168384_169722_+	pfam15978, TnsD, Tn7-like transposition protein D	NA|236aa|down_4|NC_010631.1_169976_170684_-	pfam06527, TniQ, TniQ	NA|121aa|down_5|NC_010631.1_170655_171018_-	TIGR03499, FlhF, flagellar biosynthetic protein FlhF	NA|870aa|down_6|NC_010631.1_174026_176636_+	pfam00665, rve, Integrase core domain	NA|380aa|down_7|NC_010631.1_176629_177769_+	pfam13401, AAA_22, AAA domain	NA|529aa|down_8|NC_010631.1_177758_179345_+	pfam06527, TniQ, TniQ	NA|71aa|down_9|NC_010631.1_180215_180428_-	NA
GCF_000020025.1_ASM2002v1	NC_010631	Nostoc punctiforme PCC 73102 plasmid pNPUN01, complete sequence	2	179693-179824	2,2	CRISPRCasFinder,PILER-CR	no			Orphan	GAAAAAATTTCATTTAGTCTGCAA,TGCAGACTAAATGAAATTTTTTC	24,23	3	3	179717-179745|179770-179800|179773-179804	NC_010628.1_4036315-4036287|NC_010628.1_4036262-4036232|NC_010628.1_4036263-4036232	N:A	2,2	2	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA,NA|71aa|down_0|NC_010631.1_180215_180428_-,NA|70aa|down_8|NC_010631.1_196637_196847_+	NA|152aa|up_9|NC_010631.1_162594_163050_-	PRK12275, PRK12275, hypothetical protein; Reviewed	NA|274aa|up_8|NC_010631.1_163649_164471_+	pfam08721, Tn7_Tnp_TnsA_C, TnsA endonuclease C terminal	NA|743aa|up_7|NC_010631.1_164492_166721_+	pfam00665, rve, Integrase core domain	NA|559aa|up_6|NC_010631.1_166704_168381_+	pfam13401, AAA_22, AAA domain	NA|446aa|up_5|NC_010631.1_168384_169722_+	pfam15978, TnsD, Tn7-like transposition protein D	NA|236aa|up_4|NC_010631.1_169976_170684_-	pfam06527, TniQ, TniQ	NA|121aa|up_3|NC_010631.1_170655_171018_-	TIGR03499, FlhF, flagellar biosynthetic protein FlhF	NA|870aa|up_2|NC_010631.1_174026_176636_+	pfam00665, rve, Integrase core domain	NA|380aa|up_1|NC_010631.1_176629_177769_+	pfam13401, AAA_22, AAA domain	NA|529aa|up_0|NC_010631.1_177758_179345_+	pfam06527, TniQ, TniQ	NA|71aa|down_0|NC_010631.1_180215_180428_-	NA	NA|449aa|down_1|NC_010631.1_181429_182776_-	cd19920, REC_PA4781-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase PA4781 and similar domains	NA|829aa|down_2|NC_010631.1_182810_185297_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|262aa|down_3|NC_010631.1_185327_186113_-	COG0725, ModA, ABC-type molybdate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|709aa|down_4|NC_010631.1_186442_188569_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|143aa|down_5|NC_010631.1_189770_190199_+	PRK02406, PRK02406, DNA polymerase IV; Validated	NA|721aa|down_6|NC_010631.1_190539_192702_+	cd01948, EAL, EAL domain	NA|610aa|down_7|NC_010631.1_193852_195682_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|70aa|down_8|NC_010631.1_196637_196847_+	NA	NA|1205aa|down_9|NC_010631.1_197543_201158_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]
GCF_000020025.1_ASM2002v1	NC_010632	Nostoc punctiforme PCC 73102 plasmid pNPUN02, complete sequence	1	11298-11406	1	CRISPRCasFinder	no		cas3,csa3	Orphan	ATAGGTAGGTGATTTTGACGACATCTCTGTGAGTGAG	37	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|81aa|up_8|NC_010632.1_243_486_+,NA|102aa|up_4|NC_010632.1_4063_4369_+,NA|83aa|up_3|NC_010632.1_4423_4672_+,NA|111aa|down_0|NC_010632.1_12222_12555_+,NA|63aa|down_1|NC_010632.1_12563_12752_-,NA|103aa|down_2|NC_010632.1_12744_13053_-,NA|68aa|down_3|NC_010632.1_13184_13388_+,NA|514aa|down_4|NC_010632.1_13384_14926_-,NA|96aa|down_5|NC_010632.1_14929_15217_-,NA|147aa|down_7|NC_010632.1_15637_16078_-,NA|107aa|down_8|NC_010632.1_16094_16415_-,NA|171aa|down_9|NC_010632.1_16480_16993_-	NA|NA	NA	NA|81aa|up_8|NC_010632.1_243_486_+	NA	NA|280aa|up_7|NC_010632.1_612_1452_+	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|375aa|up_6|NC_010632.1_1453_2578_+	TIGR04285, parB-like_partition_protein, nucleoid occlusion protein	NA|317aa|up_5|NC_010632.1_3099_4050_+	TIGR02997, RNA_polymerase_sigma_subunit_sigma70/sigma32, RNA polymerase sigma factor, cyanobacterial RpoD-like family	NA|102aa|up_4|NC_010632.1_4063_4369_+	NA	NA|83aa|up_3|NC_010632.1_4423_4672_+	NA	NA|403aa|up_2|NC_010632.1_4798_6007_-	pfam10592, AIPR, AIPR protein	NA|1404aa|up_1|NC_010632.1_6077_10289_-	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|50aa|up_0|NC_010632.1_10967_11117_+	cd10719, DnaJ_zf, Zinc finger domain of DnaJ and HSP40	NA|111aa|down_0|NC_010632.1_12222_12555_+	NA	NA|63aa|down_1|NC_010632.1_12563_12752_-	NA	NA|103aa|down_2|NC_010632.1_12744_13053_-	NA	NA|68aa|down_3|NC_010632.1_13184_13388_+	NA	NA|514aa|down_4|NC_010632.1_13384_14926_-	NA	NA|96aa|down_5|NC_010632.1_14929_15217_-	NA	NA|122aa|down_6|NC_010632.1_15233_15599_-	cd16377, 23S_rRNA_IVP_like, 23S rRNA-intervening sequence protein and similar proteins	NA|147aa|down_7|NC_010632.1_15637_16078_-	NA	NA|107aa|down_8|NC_010632.1_16094_16415_-	NA	NA|171aa|down_9|NC_010632.1_16480_16993_-	NA
GCF_000020025.1_ASM2002v1	NC_010632	Nostoc punctiforme PCC 73102 plasmid pNPUN02, complete sequence	2	230064-230164	2	CRISPRCasFinder	no		cas3,csa3	Orphan	GTGCAAAATAGCAGACATTTTGTA	24	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|261aa|up_9|NC_010632.1_218489_219272_+,NA|81aa|up_8|NC_010632.1_219268_219511_-,NA|203aa|up_5|NC_010632.1_221259_221868_-,NA|109aa|up_3|NC_010632.1_223976_224303_-,NA|167aa|down_0|NC_010632.1_230503_231004_+,NA|427aa|down_2|NC_010632.1_232287_233568_-,NA|64aa|down_3|NC_010632.1_233688_233880_+,NA|137aa|down_4|NC_010632.1_234163_234574_+,NA|125aa|down_5|NC_010632.1_234771_235146_+,NA|54aa|down_6|NC_010632.1_235198_235360_+,NA|168aa|down_7|NC_010632.1_235440_235944_-,NA|1727aa|down_8|NC_010632.1_236019_241200_+	NA|261aa|up_9|NC_010632.1_218489_219272_+	NA	NA|81aa|up_8|NC_010632.1_219268_219511_-	NA	NA|255aa|up_7|NC_010632.1_219494_220259_-	smart00387, HATPase_c, Histidine kinase-like ATPases	NA|218aa|up_6|NC_010632.1_220516_221170_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|203aa|up_5|NC_010632.1_221259_221868_-	NA	NA|229aa|up_4|NC_010632.1_222054_222741_-	pfam13500, AAA_26, AAA domain	NA|109aa|up_3|NC_010632.1_223976_224303_-	NA	NA|298aa|up_2|NC_010632.1_224931_225825_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|541aa|up_1|NC_010632.1_225979_227602_-	COG0464, SpoVK, ATPases of the AAA+ class [Posttranslational modification, protein turnover, chaperones]	NA|569aa|up_0|NC_010632.1_227709_229416_-	pfam01076, Mob_Pre, Plasmid recombination enzyme	NA|167aa|down_0|NC_010632.1_230503_231004_+	NA	NA|410aa|down_1|NC_010632.1_231006_232236_+	cd10227, ParM_like, Plasmid segregation protein ParM and similar proteins	NA|427aa|down_2|NC_010632.1_232287_233568_-	NA	NA|64aa|down_3|NC_010632.1_233688_233880_+	NA	NA|137aa|down_4|NC_010632.1_234163_234574_+	NA	NA|125aa|down_5|NC_010632.1_234771_235146_+	NA	NA|54aa|down_6|NC_010632.1_235198_235360_+	NA	NA|168aa|down_7|NC_010632.1_235440_235944_-	NA	NA|1727aa|down_8|NC_010632.1_236019_241200_+	NA	NA|255aa|down_9|NC_010632.1_241249_242014_+	sd00006, TPR, Tetratricopeptide repeat
GCF_000020025.1_ASM2002v1	NC_010630	Nostoc punctiforme PCC 73102 plasmid pNPUN03, complete sequence	1	61534-61662	1	CRISPRCasFinder	no			Orphan	CATTGCCAGAGCTAGAAAAGGAAGAAGAATTTATATTAT	39	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|142aa|up_9|NC_010630.1_45110_45536_+,NA|202aa|up_8|NC_010630.1_45660_46266_+,NA|188aa|up_7|NC_010630.1_46320_46884_+,NA|153aa|up_6|NC_010630.1_46886_47345_+,NA|155aa|up_5|NC_010630.1_47331_47796_+,NA|332aa|up_4|NC_010630.1_49093_50089_+,NA|76aa|up_3|NC_010630.1_50241_50469_+,NA|83aa|down_0|NC_010630.1_63297_63546_+,NA|67aa|down_2|NC_010630.1_64612_64813_+,NA|80aa|down_7|NC_010630.1_71890_72130_+,NA|123aa|down_8|NC_010630.1_72520_72889_+	NA|142aa|up_9|NC_010630.1_45110_45536_+	NA	NA|202aa|up_8|NC_010630.1_45660_46266_+	NA	NA|188aa|up_7|NC_010630.1_46320_46884_+	NA	NA|153aa|up_6|NC_010630.1_46886_47345_+	NA	NA|155aa|up_5|NC_010630.1_47331_47796_+	NA	NA|332aa|up_4|NC_010630.1_49093_50089_+	NA	NA|76aa|up_3|NC_010630.1_50241_50469_+	NA	NA|281aa|up_2|NC_010630.1_50700_51543_+	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|786aa|up_1|NC_010630.1_52260_54618_+	cd02872, GH18_chitolectin_chitotriosidase, This conserved domain family includes a large number of catalytically inactive chitinase-like lectins (chitolectins) including YKL-39, YKL-40 (HCGP39), YM1, oviductin, and AMCase (acidic mammalian chitinase), as well as catalytically active chitotriosidases	NA|875aa|up_0|NC_010630.1_54816_57441_-	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|83aa|down_0|NC_010630.1_63297_63546_+	NA	NA|276aa|down_1|NC_010630.1_63731_64559_+	COG3596, COG3596, Predicted GTPase [General function prediction only]	NA|67aa|down_2|NC_010630.1_64612_64813_+	NA	NA|795aa|down_3|NC_010630.1_65213_67598_-	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|402aa|down_4|NC_010630.1_67705_68911_-	pfam08852, DUF1822, Protein of unknown function (DUF1822)	NA|397aa|down_5|NC_010630.1_68936_70127_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|92aa|down_6|NC_010630.1_70824_71100_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|80aa|down_7|NC_010630.1_71890_72130_+	NA	NA|123aa|down_8|NC_010630.1_72520_72889_+	NA	NA|323aa|down_9|NC_010630.1_73672_74641_+	cd01195, INT_C_like_5, Uncharacterized site-specific tyrosine recombinase, C-terminal catalytic domain
GCF_000020025.1_ASM2002v1	NC_010630	Nostoc punctiforme PCC 73102 plasmid pNPUN03, complete sequence	2	78268-78406	2	CRISPRCasFinder	no			Orphan	CGAACCTTTAAGCAAAGCAGACTCAACAGAACAACAGACT	40	0	0	NA	NA	N:A	1	1	Orphan	PD-DExK,RT,csa3,DEDDh,cas14j,cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL,Cas9_archaeal,DinG,c2c5_V-U5	NA|80aa|up_7|NC_010630.1_71890_72130_+,NA|123aa|up_6|NC_010630.1_72520_72889_+,NA|129aa|up_4|NC_010630.1_74903_75290_+,NA|285aa|up_3|NC_010630.1_75425_76280_-,NA|137aa|up_2|NC_010630.1_76286_76697_-,NA|116aa|up_1|NC_010630.1_76852_77200_-,NA|70aa|down_0|NC_010630.1_78763_78973_+,NA|222aa|down_1|NC_010630.1_79345_80011_-,NA|136aa|down_6|NC_010630.1_107116_107524_+,NA|166aa|down_7|NC_010630.1_109189_109687_+,NA|138aa|down_8|NC_010630.1_110286_110700_-	NA|397aa|up_9|NC_010630.1_68936_70127_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|92aa|up_8|NC_010630.1_70824_71100_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|80aa|up_7|NC_010630.1_71890_72130_+	NA	NA|123aa|up_6|NC_010630.1_72520_72889_+	NA	NA|323aa|up_5|NC_010630.1_73672_74641_+	cd01195, INT_C_like_5, Uncharacterized site-specific tyrosine recombinase, C-terminal catalytic domain	NA|129aa|up_4|NC_010630.1_74903_75290_+	NA	NA|285aa|up_3|NC_010630.1_75425_76280_-	NA	NA|137aa|up_2|NC_010630.1_76286_76697_-	NA	NA|116aa|up_1|NC_010630.1_76852_77200_-	NA	NA|265aa|up_0|NC_010630.1_77292_78087_+	cd02042, ParAB_family, partition proteins ParAB family	NA|70aa|down_0|NC_010630.1_78763_78973_+	NA	NA|222aa|down_1|NC_010630.1_79345_80011_-	NA	NA|338aa|down_2|NC_010630.1_80387_81401_-	cd01846, fatty_acyltransferase_like, Fatty acyltransferase-like subfamily of the SGNH hydrolases, a diverse family of lipases and esterases	NA|4605aa|down_3|NC_010630.1_82033_95848_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|1854aa|down_4|NC_010630.1_95853_101415_-	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|1802aa|down_5|NC_010630.1_101411_106817_-	PRK05691, PRK05691, peptide synthase; Validated	NA|136aa|down_6|NC_010630.1_107116_107524_+	NA	NA|166aa|down_7|NC_010630.1_109189_109687_+	NA	NA|138aa|down_8|NC_010630.1_110286_110700_-	NA	NA|323aa|down_9|NC_010630.1_110812_111781_-	pfam13455, MUG113, Meiotically up-regulated gene 113
