assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	1	163826-165026	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	 Type III-C?,Type III-D,Type III-B,Type III-A,Type III-C	GTTTCCATTAATTCCGCTTCTAAAGAATAGAAGCGAC,GTTTCCATTAATTCCGCTTCTAAAGAATAGAAGCGAC,TTTCCATTAATTCCGCTTCTAAAGAATAGAAGCGAC	37,37,36	0	0	NA	NA	NA:NA:NA	16,16,16	16	TypeIII-C?,TypeIII-D,TypeIII-A,TypeIII-B,TypeIII-C	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|162aa|up_7|NC_019682.1_151098_151584_-,NA|326aa|up_0|NC_019682.1_159868_160846_-,cmr5gr11|142aa|down_3|NC_019682.1_169275_169701_+,NA|537aa|down_4|NC_019682.1_169690_171301_+,csx1|529aa|down_5|NC_019682.1_171310_172897_+,NA|47aa|down_6|NC_019682.1_172934_173075_+	NA|157aa|up_9|NC_019682.1_149315_149786_-	cd03033, ArsC_15kD, Arsenate Reductase (ArsC) family, 15kD protein subfamily; composed of proteins of unknown function with similarity to thioredoxin-fold arsenic reductases, ArsC	NA|204aa|up_8|NC_019682.1_150194_150806_-	COG5553, COG5553, Predicted metal-dependent enzyme of the double-stranded beta helix superfamily [General function prediction only]	NA|162aa|up_7|NC_019682.1_151098_151584_-	NA	NA|196aa|up_6|NC_019682.1_151737_152325_-	COG1845, CyoC, Heme/copper-type cytochrome/quinol oxidase, subunit 3 [Energy production and conversion]	NA|557aa|up_5|NC_019682.1_152483_154154_-	TIGR02891, Probable_cytochrome_c_oxidase_subunit_1-beta, cytochrome c oxidase, subunit I	NA|336aa|up_4|NC_019682.1_154174_155182_-	COG1622, CyoA, Heme/copper-type cytochrome/quinol oxidases, subunit 2 [Energy production and conversion]	NA|98aa|up_3|NC_019682.1_155743_156037_-	TIGR02936, fdxN_nitrog, ferredoxin III, nif-specific	NA|531aa|up_2|NC_019682.1_157379_158972_+	cd10549, MtMvhB_like, Uncharacterized polyferredoxin-like protein	NA|209aa|up_1|NC_019682.1_159047_159674_+	pfam05685, Uma2, Putative restriction endonuclease	NA|326aa|up_0|NC_019682.1_159868_160846_-	NA	cas10|519aa|down_0|NC_019682.1_165606_167163_+	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr3gr5|331aa|down_1|NC_019682.1_167286_168279_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|280aa|down_2|NC_019682.1_168280_169120_+	COG1336, COG1336, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cmr5gr11|142aa|down_3|NC_019682.1_169275_169701_+	NA	NA|537aa|down_4|NC_019682.1_169690_171301_+	NA	csx1|529aa|down_5|NC_019682.1_171310_172897_+	NA	NA|47aa|down_6|NC_019682.1_172934_173075_+	NA	NA|114aa|down_7|NC_019682.1_173130_173472_-	CHL00065, psaC, photosystem I subunit VII	NA|564aa|down_8|NC_019682.1_173558_175250_-	COG1053, SdhA, Succinate dehydrogenase/fumarate reductase, flavoprotein subunit [Energy production and conversion]	NA|103aa|down_9|NC_019682.1_175860_176169_-	pfam04255, DUF433, Protein of unknown function (DUF433)
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	2	237153-237277	2	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	GCGCAAAGGCGCAAAGGAAATTACATAAAATTGTCGCC	38	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|108aa|up_6|NC_019682.1_229385_229709_-,NA|138aa|up_4|NC_019682.1_230440_230854_-,NA|169aa|up_1|NC_019682.1_234118_234625_-,NA|170aa|up_0|NC_019682.1_234841_235351_-,NA	NA|236aa|up_9|NC_019682.1_223302_224010_-	pfam16258, DUF4912, Domain of unknown function (DUF4912)	NA|981aa|up_8|NC_019682.1_224407_227350_-	cd13653, PBP2_phosphate_like_1, Substrate binding domain of putative ABC-type phosphate transporter, a member of the type 2 periplasmic binding fold superfamily	NA|373aa|up_7|NC_019682.1_228031_229150_-	COG2082, CobH, Precorrin isomerase [Coenzyme metabolism]	NA|108aa|up_6|NC_019682.1_229385_229709_-	NA	NA|189aa|up_5|NC_019682.1_229766_230333_-	COG2340, COG2340, Uncharacterized protein with SCP/PR1 domains [Function unknown]	NA|138aa|up_4|NC_019682.1_230440_230854_-	NA	NA|462aa|up_3|NC_019682.1_231642_233028_-	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|148aa|up_2|NC_019682.1_233039_233483_-	cd16345, LMWP_ArsC, Arsenate reductase of the LMWP family	NA|169aa|up_1|NC_019682.1_234118_234625_-	NA	NA|170aa|up_0|NC_019682.1_234841_235351_-	NA	NA|2214aa|down_0|NC_019682.1_238360_245002_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|687aa|down_1|NC_019682.1_245293_247354_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|164aa|down_2|NC_019682.1_247365_247857_-	PRK13618, psbV, cytochrome c-550; Provisional	NA|49aa|down_3|NC_019682.1_248086_248233_-	pfam01701, PSI_PsaJ, Photosystem I reaction centre subunit IX / PsaJ	NA|160aa|down_4|NC_019682.1_248315_248795_-	CHL00132, psaF, photosystem I subunit III; Validated	NA|511aa|down_5|NC_019682.1_249150_250683_-	CHL00062, psbB, photosystem II 47 kDa protein	NA|469aa|down_6|NC_019682.1_251067_252474_-	CHL00035, psbC, photosystem II 44 kDa protein	NA|353aa|down_7|NC_019682.1_252736_253795_-	CHL00004, psbD, photosystem II protein D2	NA|174aa|down_8|NC_019682.1_253856_254378_-	cd12125, APC_alpha, Allophycocyanin alpha subunit of the phycobilisome core	NA|740aa|down_9|NC_019682.1_254567_256787_-	CHL00091, apcE, phycobillisome linker protein
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	3	398923-399009	3	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	GATTTTAAATTTAGGATTTTGGATT	25	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|142aa|up_3|NC_019682.1_389915_390341_-,NA	NA|359aa|up_9|NC_019682.1_383215_384292_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|386aa|up_8|NC_019682.1_384333_385491_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|42aa|up_7|NC_019682.1_385519_385645_-	cd03349, LbH_XAT, Xenobiotic acyltransferase (XAT): The XAT class of hexapeptide acyltransferases is composed of a large number of microbial enzymes that catalyze the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others	NA|279aa|up_6|NC_019682.1_387163_388000_-	COG1682, TagG, ABC-type polysaccharide/polyol phosphate export systems, permease component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|324aa|up_5|NC_019682.1_388001_388973_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|274aa|up_4|NC_019682.1_389049_389871_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|142aa|up_3|NC_019682.1_389915_390341_-	NA	NA|430aa|up_2|NC_019682.1_390553_391843_-	cd03794, GT4_WbuB-like, Escherichia coli WbuB and similar proteins	NA|734aa|up_1|NC_019682.1_393710_395912_+	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|862aa|up_0|NC_019682.1_396264_398850_-	cd07302, CHD, cyclase homology domain	NA|117aa|down_0|NC_019682.1_399371_399722_+	COG2146, {NirD}, Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases [Inorganic ion transport and metabolism / General function prediction only]	NA|1947aa|down_1|NC_019682.1_399864_405705_-	COG2373, COG2373, Large extracellular alpha-helical protein [General function prediction only]	NA|793aa|down_2|NC_019682.1_406149_408528_+	COG4953, PbpC, Membrane carboxypeptidase/penicillin-binding protein PbpC [Cell envelope biogenesis, outer membrane]	NA|534aa|down_3|NC_019682.1_408746_410348_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|679aa|down_4|NC_019682.1_410875_412912_+	pfam14516, AAA_35, AAA-like domain	NA|234aa|down_5|NC_019682.1_412949_413651_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|360aa|down_6|NC_019682.1_413897_414977_+	PRK09196, PRK09196, fructose-bisphosphate aldolase class II	NA|649aa|down_7|NC_019682.1_415741_417688_-	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|1056aa|down_8|NC_019682.1_418484_421652_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|510aa|down_9|NC_019682.1_422270_423800_-	TIGR02730, Carotenoid_isomerase, carotene isomerase
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	4	404223-404335	4	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	TATCAAGGGGCTGATTTTTTTTAGTCGAAACG	32	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|142aa|up_4|NC_019682.1_389915_390341_-,NA	NA|386aa|up_9|NC_019682.1_384333_385491_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|42aa|up_8|NC_019682.1_385519_385645_-	cd03349, LbH_XAT, Xenobiotic acyltransferase (XAT): The XAT class of hexapeptide acyltransferases is composed of a large number of microbial enzymes that catalyze the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others	NA|279aa|up_7|NC_019682.1_387163_388000_-	COG1682, TagG, ABC-type polysaccharide/polyol phosphate export systems, permease component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|324aa|up_6|NC_019682.1_388001_388973_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|274aa|up_5|NC_019682.1_389049_389871_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|142aa|up_4|NC_019682.1_389915_390341_-	NA	NA|430aa|up_3|NC_019682.1_390553_391843_-	cd03794, GT4_WbuB-like, Escherichia coli WbuB and similar proteins	NA|734aa|up_2|NC_019682.1_393710_395912_+	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|862aa|up_1|NC_019682.1_396264_398850_-	cd07302, CHD, cyclase homology domain	NA|117aa|up_0|NC_019682.1_399371_399722_+	COG2146, {NirD}, Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases [Inorganic ion transport and metabolism / General function prediction only]	NA|793aa|down_0|NC_019682.1_406149_408528_+	COG4953, PbpC, Membrane carboxypeptidase/penicillin-binding protein PbpC [Cell envelope biogenesis, outer membrane]	NA|534aa|down_1|NC_019682.1_408746_410348_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|679aa|down_2|NC_019682.1_410875_412912_+	pfam14516, AAA_35, AAA-like domain	NA|234aa|down_3|NC_019682.1_412949_413651_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|360aa|down_4|NC_019682.1_413897_414977_+	PRK09196, PRK09196, fructose-bisphosphate aldolase class II	NA|649aa|down_5|NC_019682.1_415741_417688_-	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|1056aa|down_6|NC_019682.1_418484_421652_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|510aa|down_7|NC_019682.1_422270_423800_-	TIGR02730, Carotenoid_isomerase, carotene isomerase	NA|261aa|down_8|NC_019682.1_423911_424694_+	cd03261, ABC_Org_Solvent_Resistant, ATP-binding cassette transport system involved in resistant to organic solvents	NA|461aa|down_9|NC_019682.1_424775_426158_+	PLN03094, PLN03094, Substrate binding subunit of ER-derived-lipid transporter; Provisional
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	5	730176-730255	5	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	CCATAGATAAATCTGCTTGCTCTG	24	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|176aa|up_6|NC_019682.1_719723_720251_-,NA|163aa|down_1|NC_019682.1_734535_735024_+,NA|134aa|down_2|NC_019682.1_735325_735727_+,NA|140aa|down_3|NC_019682.1_735950_736370_+,NA|47aa|down_4|NC_019682.1_736405_736546_+,NA|136aa|down_5|NC_019682.1_736623_737031_+,NA|142aa|down_6|NC_019682.1_737034_737460_+,NA|83aa|down_8|NC_019682.1_738165_738414_-	NA|437aa|up_9|NC_019682.1_715669_716980_-	TIGR03860, FMN_nitrolo, FMN-dependent oxidoreductase, nitrilotriacetate monooxygenase family	NA|208aa|up_8|NC_019682.1_717261_717885_+	COG1853, COG1853, Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family [General function prediction only]	NA|478aa|up_7|NC_019682.1_717922_719356_+	PRK06169, PRK06169, putative amidase; Provisional	NA|176aa|up_6|NC_019682.1_719723_720251_-	NA	NA|339aa|up_5|NC_019682.1_721714_722731_+	COG3491, PcbC, Isopenicillin N synthase and related dioxygenases [General function prediction only]	NA|323aa|up_4|NC_019682.1_722792_723761_-	PRK09553, tauD, taurine dioxygenase; Reviewed	NA|849aa|up_3|NC_019682.1_724239_726786_+	cd00198, vWFA, Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|92aa|up_2|NC_019682.1_727163_727439_+	pfam04341, DUF485, Protein of unknown function, DUF485	NA|513aa|up_1|NC_019682.1_727527_729066_+	PRK09395, actP, cation/acetate symporter ActP	NA|303aa|up_0|NC_019682.1_729174_730083_+	PRK02259, PRK02259, aspartoacylase; Provisional	NA|1236aa|down_0|NC_019682.1_730290_733998_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|163aa|down_1|NC_019682.1_734535_735024_+	NA	NA|134aa|down_2|NC_019682.1_735325_735727_+	NA	NA|140aa|down_3|NC_019682.1_735950_736370_+	NA	NA|47aa|down_4|NC_019682.1_736405_736546_+	NA	NA|136aa|down_5|NC_019682.1_736623_737031_+	NA	NA|142aa|down_6|NC_019682.1_737034_737460_+	NA	NA|140aa|down_7|NC_019682.1_737746_738166_-	cd18696, PIN_MtVapC26-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC26 and related proteins	NA|83aa|down_8|NC_019682.1_738165_738414_-	NA	NA|330aa|down_9|NC_019682.1_738519_739509_+	pfam01018, GTP1_OBG, GTP1/OBG
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	6	815670-817106	2,6,2	PILER-CR,CRISPRCasFinder,CRT	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	GTCACAATTAACTTAAATCCCTATTAGGGATTGAAAC,GTCACAATTAACTTAAATCCCTATTAGGGATTGAAAC,GTCACAATTAACTTAAATCCCTATTAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	19,19,19	19	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|80aa|up_4|NC_019682.1_809793_810033_+,NA|107aa|up_3|NC_019682.1_810188_810509_-,NA|113aa|down_5|NC_019682.1_824626_824965_+,NA|417aa|down_8|NC_019682.1_827391_828642_+	NA|103aa|up_9|NC_019682.1_803529_803838_-	cd07057, BMC_CcmK, Carbon dioxide concentrating mechanism (CcmK); Bacterial Micro-Compartment (BMC) domain	NA|619aa|up_8|NC_019682.1_804429_806286_+	PRK07390, PRK07390, NAD(P)H-quinone oxidoreductase subunit F; Validated	NA|499aa|up_7|NC_019682.1_806421_807918_+	PRK06473, PRK06473, NADH-quinone oxidoreductase subunit M	NA|377aa|up_6|NC_019682.1_807994_809125_+	pfam10216, ChpXY, CO2 hydration protein (ChpXY)	NA|71aa|up_5|NC_019682.1_809527_809740_+	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|80aa|up_4|NC_019682.1_809793_810033_+	NA	NA|107aa|up_3|NC_019682.1_810188_810509_-	NA	NA|438aa|up_2|NC_019682.1_811360_812674_-	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|164aa|up_1|NC_019682.1_813332_813824_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|392aa|up_0|NC_019682.1_813820_814996_+	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed	NA|55aa|down_0|NC_019682.1_817318_817483_+	pfam07878, RHH_5, CopG-like RHH_1 or ribbon-helix-helix domain, RHH_5	NA|421aa|down_1|NC_019682.1_817581_818844_+	sd00006, TPR, Tetratricopeptide repeat	NA|533aa|down_2|NC_019682.1_819522_821121_-	pfam13282, DUF4070, Domain of unknown function (DUF4070)	NA|631aa|down_3|NC_019682.1_821510_823403_+	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|346aa|down_4|NC_019682.1_823418_824456_-	COG2008, GLY1, Threonine aldolase [Amino acid transport and metabolism]	NA|113aa|down_5|NC_019682.1_824626_824965_+	NA	NA|92aa|down_6|NC_019682.1_825037_825313_-	cd17074, Ubl_CysO_like, ubiquitin-like (Ubl) domain found in Mycobacterium tuberculosis CysO and similar proteins	NA|435aa|down_7|NC_019682.1_825374_826679_-	PRK07591, PRK07591, threonine synthase; Validated	NA|417aa|down_8|NC_019682.1_827391_828642_+	NA	NA|237aa|down_9|NC_019682.1_828727_829438_+	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	7	980659-982029	3,7,3	PILER-CR,CRISPRCasFinder,CRT	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	GTTTTAATTCCTTTACCCCTCACGGGGATGGAAAC,GTTTTAATTCCTTTACCCCTCACGGGGATGGAAAC,GTTTTAATTCCTTTACCCCTCACGGGGATGGAAAC	35,35,35	0	0	NA	NA	NA:NA:NA	18,18,18	18	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|142aa|up_6|NC_019682.1_971530_971956_-,NA|61aa|down_0|NC_019682.1_982038_982221_+,NA|57aa|down_1|NC_019682.1_982273_982444_+,NA|70aa|down_2|NC_019682.1_982421_982631_+	NA|314aa|up_9|NC_019682.1_969228_970170_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|157aa|up_8|NC_019682.1_970117_970588_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|193aa|up_7|NC_019682.1_970896_971475_-	pfam05685, Uma2, Putative restriction endonuclease	NA|142aa|up_6|NC_019682.1_971530_971956_-	NA	NA|329aa|up_5|NC_019682.1_972090_973077_-	PRK13022, secF, protein translocase subunit SecF	NA|474aa|up_4|NC_019682.1_973073_974495_-	TIGR01129, Protein_translocase_subunit_SecD, protein-export membrane protein SecD	NA|289aa|up_3|NC_019682.1_974560_975427_-	TIGR01129, Protein_translocase_subunit_SecD, protein-export membrane protein SecD	NA|89aa|up_2|NC_019682.1_975826_976093_+	COG2002, AbrB, Regulators of stationary/sporulation gene expression [Transcription]	NA|127aa|up_1|NC_019682.1_976089_976470_+	cd18682, PIN_VapC-like, Uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|328aa|up_0|NC_019682.1_976595_977579_-	CHL00144, odpB, pyruvate dehydrogenase E1 component beta subunit; Validated	NA|61aa|down_0|NC_019682.1_982038_982221_+	NA	NA|57aa|down_1|NC_019682.1_982273_982444_+	NA	NA|70aa|down_2|NC_019682.1_982421_982631_+	NA	NA|274aa|down_3|NC_019682.1_982977_983799_-	COG0338, Dam, Site-specific DNA methylase [DNA replication, recombination, and repair]	NA|137aa|down_4|NC_019682.1_983907_984318_-	pfam11218, DUF3011, Protein of unknown function (DUF3011)	NA|327aa|down_5|NC_019682.1_984605_985586_-	PRK09283, PRK09283, porphobilinogen synthase	NA|123aa|down_6|NC_019682.1_985923_986292_-	pfam09685, DUF4870, Domain of unknown function (DUF4870)	NA|543aa|down_7|NC_019682.1_986425_988054_+	PRK00741, prfC, peptide chain release factor 3; Provisional	NA|756aa|down_8|NC_019682.1_988201_990469_+	cd07338, M48B_HtpX_like, Peptidase M48 subfamily B HtpX-like membrane-bound metallopeptidase	NA|165aa|down_9|NC_019682.1_991180_991675_+	smart00360, RRM, RNA recognition motif
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	8	1325066-1331254	4,8,4,5	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	WYL,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Type I-D	GTTTCAATCCCGTTGCCGGGAATCATTTATTTGAAAG,GTTTCAATCCCGTTGCCGGGAATCATTTATTTGAAAG,GTTTCAATCCCGTTGCCGGGAATCATTTATTTGAAAG,GTTTCAATCCCGTTGCCGGGAATCATTTATTTGAAAG	37,37,37,37	0	0	NA	NA	NA:NA:NA:NA	57,84,84,57	84	TypeI-D	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|95aa|up_0|NC_019682.1_1324742_1325027_+,NA	NA|1200aa|up_9|NC_019682.1_1305447_1309047_-	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|1106aa|up_8|NC_019682.1_1309368_1312686_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|158aa|up_7|NC_019682.1_1312792_1313266_-	COG0835, CheW, Chemotaxis signal transduction protein [Cell motility and secretion / Signal transduction mechanisms]	NA|122aa|up_6|NC_019682.1_1313387_1313753_-	cd17538, REC_D1_PleD-like, first (D1) phosphoacceptor receiver (REC) domain of response regulator PleD and similar domains	NA|410aa|up_5|NC_019682.1_1314111_1315341_-	cd17602, REC_PatA-like, phosphoacceptor receiver (REC) domain of PatA and similar domains	NA|309aa|up_4|NC_019682.1_1316346_1317273_-	PRK05441, murQ, N-acetylmuramic acid-6-phosphate etherase; Reviewed	NA|142aa|up_3|NC_019682.1_1317363_1317789_-	pfam11360, DUF3110, Protein of unknown function (DUF3110)	NA|1822aa|up_2|NC_019682.1_1318045_1323511_+	pfam04357, TamB, TamB, inner membrane protein subunit of TAM complex	WYL|329aa|up_1|NC_019682.1_1323512_1324499_-	pfam13280, WYL, WYL domain	NA|95aa|up_0|NC_019682.1_1324742_1325027_+	NA	cas2|98aa|down_0|NC_019682.1_1331470_1331764_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|326aa|down_1|NC_019682.1_1331760_1332738_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|195aa|down_2|NC_019682.1_1332741_1333326_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas6|275aa|down_3|NC_019682.1_1333334_1334159_-	pfam10040, CRISPR_Cas6, CRISPR-associated endoribonuclease Cas6	csc1gr5|264aa|down_4|NC_019682.1_1334127_1334919_-	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	csc2gr7|335aa|down_5|NC_019682.1_1335124_1336129_-	pfam18320, Csc2, Csc2 Crispr	cas10d|974aa|down_6|NC_019682.1_1336178_1339100_-	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	cas3|712aa|down_7|NC_019682.1_1339159_1341295_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	NA|348aa|down_8|NC_019682.1_1343467_1344511_-	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|348aa|down_9|NC_019682.1_1344701_1345745_-	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	9	1341653-1343376	9,5	CRISPRCasFinder,CRT	no	WYL,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Type I-D	CTTTCAATTAAATGAAACCCGGCAACGGGATTGAAAC,CTTTCAATTAAATGAAACCCGGCAACGGGATTGAAAC	37,37	0	0	NA	NA	NA:NA	21,23	23	TypeI-D	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|95aa|up_8|NC_019682.1_1324742_1325027_+,NA	WYL|329aa|up_9|NC_019682.1_1323512_1324499_-	pfam13280, WYL, WYL domain	NA|95aa|up_8|NC_019682.1_1324742_1325027_+	NA	cas2|98aa|up_7|NC_019682.1_1331470_1331764_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|326aa|up_6|NC_019682.1_1331760_1332738_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|195aa|up_5|NC_019682.1_1332741_1333326_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas6|275aa|up_4|NC_019682.1_1333334_1334159_-	pfam10040, CRISPR_Cas6, CRISPR-associated endoribonuclease Cas6	csc1gr5|264aa|up_3|NC_019682.1_1334127_1334919_-	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	csc2gr7|335aa|up_2|NC_019682.1_1335124_1336129_-	pfam18320, Csc2, Csc2 Crispr	cas10d|974aa|up_1|NC_019682.1_1336178_1339100_-	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	cas3|712aa|up_0|NC_019682.1_1339159_1341295_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	NA|348aa|down_0|NC_019682.1_1343467_1344511_-	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|348aa|down_1|NC_019682.1_1344701_1345745_-	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|369aa|down_2|NC_019682.1_1345920_1347027_-	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|274aa|down_3|NC_019682.1_1348126_1348948_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|357aa|down_4|NC_019682.1_1349015_1350086_-	TIGR01728, Putative_aliphatic_sulfonates-binding_protein, ABC transporter, substrate-binding protein, aliphatic sulfonates family	NA|110aa|down_5|NC_019682.1_1350169_1350499_-	COG1146, COG1146, Ferredoxin [Energy production and conversion]	NA|543aa|down_6|NC_019682.1_1350495_1352124_-	COG1053, SdhA, Succinate dehydrogenase/fumarate reductase, flavoprotein subunit [Energy production and conversion]	NA|578aa|down_7|NC_019682.1_1353485_1355219_+	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|276aa|down_8|NC_019682.1_1355539_1356367_+	PRK11365, ssuC, aliphatic sulfonate ABC transporter permease SsuC	NA|257aa|down_9|NC_019682.1_1356469_1357240_+	PRK11247, ssuB, aliphatic sulfonates transport ATP-binding subunit; Provisional
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	10	1878366-1878461	10	CRISPRCasFinder	no	cas14j	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Unclear	GTAGGGTGGGCACTGCCCACCACA	24	0	0	NA	NA	NA	1	1	TypeV	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|253aa|up_0|NC_019682.1_1877487_1878246_+,NA	NA|943aa|up_9|NC_019682.1_1867432_1870261_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|144aa|up_8|NC_019682.1_1870411_1870843_-	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|171aa|up_7|NC_019682.1_1870950_1871463_-	pfam09654, DUF2396, Protein of unknown function (DUF2396)	NA|175aa|up_6|NC_019682.1_1871702_1872227_+	cd04645, LbH_gamma_CA_like, Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E	NA|42aa|up_5|NC_019682.1_1872382_1872508_+	PRK13240, pbsY, photosystem II protein Y; Reviewed	NA|420aa|up_4|NC_019682.1_1872657_1873917_+	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|433aa|up_3|NC_019682.1_1874018_1875317_-	cd13585, PBP2_TMBP_like, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose and similar oligosaccharides; possess type 2 periplasmic binding fold	NA|369aa|up_2|NC_019682.1_1875611_1876718_+	COG4972, PilM, Tfp pilus assembly protein, ATPase PilM [Cell motility and secretion / Intracellular trafficking and secretion]	NA|256aa|up_1|NC_019682.1_1876723_1877491_+	COG3166, PilN, Tfp pilus assembly protein PilN [Cell motility and secretion / Intracellular trafficking and secretion]	NA|253aa|up_0|NC_019682.1_1877487_1878246_+	NA	NA|718aa|down_0|NC_019682.1_1878579_1880733_+	COG4796, HofQ, Type II secretory pathway, component HofQ [Intracellular trafficking and secretion]	cas14j|465aa|down_1|NC_019682.1_1880816_1882211_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|152aa|down_2|NC_019682.1_1882287_1882743_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|73aa|down_3|NC_019682.1_1883125_1883344_-	pfam11910, NdhO, Cyanobacterial and plant NDH-1 subunit O	NA|284aa|down_4|NC_019682.1_1883417_1884269_-	PRK13945, PRK13945, formamidopyrimidine-DNA glycosylase; Provisional	NA|71aa|down_5|NC_019682.1_1884494_1884707_-	pfam02427, PSI_PsaE, Photosystem I reaction centre subunit IV / PsaE	NA|195aa|down_6|NC_019682.1_1885009_1885594_-	pfam04755, PAP_fibrillin, PAP_fibrillin	NA|80aa|down_7|NC_019682.1_1885837_1886077_+	pfam11332, DUF3134, Protein of unknown function (DUF3134)	NA|366aa|down_8|NC_019682.1_1886156_1887254_+	PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional	NA|168aa|down_9|NC_019682.1_1887409_1887913_-	cd00886, MogA_MoaB, MogA_MoaB family
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	11	2019571-2019663	11	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	AAAAGGCTGAAATCAAGAAATCTGTAC	27	1	2	2019598-2019636|2019598-2019636	NC_019682.1_2967582-2967544|NC_019682.1_6564539-6564577	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|165aa|up_9|NC_019682.1_2007925_2008420_+,NA|48aa|up_7|NC_019682.1_2010159_2010303_+,NA|74aa|down_6|NC_019682.1_2026012_2026234_+	NA|165aa|up_9|NC_019682.1_2007925_2008420_+	NA	NA|466aa|up_8|NC_019682.1_2008541_2009939_-	COG0154, GatA, Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases [Translation, ribosomal structure and biogenesis]	NA|48aa|up_7|NC_019682.1_2010159_2010303_+	NA	NA|276aa|up_6|NC_019682.1_2010366_2011194_-	PRK12896, PRK12896, methionine aminopeptidase; Reviewed	NA|163aa|up_5|NC_019682.1_2011332_2011821_-	cd04210, Cupredoxin_like_1, Uncharacterized Cupredoxin-like subfamily	NA|400aa|up_4|NC_019682.1_2012097_2013297_-	cd13557, PBP2_SsuA, Substrate binding domain of sulfonate binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|420aa|up_3|NC_019682.1_2013649_2014909_+	PRK07364, PRK07364, FAD-dependent hydroxylase	NA|213aa|up_2|NC_019682.1_2015176_2015815_+	pfam05685, Uma2, Putative restriction endonuclease	NA|732aa|up_1|NC_019682.1_2016092_2018288_+	cd13401, Slt70-like, 70kDa soluble lytic transglycosylase (Slt70) and similar proteins	NA|279aa|up_0|NC_019682.1_2018719_2019556_+	CHL00182, tatC, Sec-independent translocase component C; Provisional	NA|112aa|down_0|NC_019682.1_2019729_2020065_+	PRK10089, PRK10089, chaperone CsaA	NA|103aa|down_1|NC_019682.1_2020100_2020409_-	pfam11267, DUF3067, Domain of unknown function (DUF3067)	NA|180aa|down_2|NC_019682.1_2020639_2021179_+	PRK13474, PRK13474, cytochrome b6-f complex iron-sulfur subunit; Provisional	NA|334aa|down_3|NC_019682.1_2021299_2022301_+	PRK02693, PRK02693, apocytochrome f; Reviewed	NA|530aa|down_4|NC_019682.1_2023312_2024902_-	COG1543, COG1543, Uncharacterized conserved protein [Function unknown]	NA|270aa|down_5|NC_019682.1_2025206_2026016_+	pfam11103, DUF2887, Protein of unknown function (DUF2887)	NA|74aa|down_6|NC_019682.1_2026012_2026234_+	NA	NA|253aa|down_7|NC_019682.1_2026195_2026954_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|360aa|down_8|NC_019682.1_2027002_2028082_-	PRK12289, PRK12289, small ribosomal subunit biogenesis GTPase RsgA	NA|85aa|down_9|NC_019682.1_2028081_2028336_-	cd00291, SirA_YedF_YeeD, SirA, YedF, and YeeD
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	12	2900656-2904492	12,6,6,7,8	CRISPRCasFinder,CRT,PILER-CR,PILER-CR,PILER-CR	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	GTTTCCATCCCCGTGAGGGGTAAAGGAATTAAAAC,GTTTCCATCCCCNNNNGGGGTAAAGGAANTAAAAC,GTTTTATTTCCTTTACCCCGTGAGGGGATGGAAAC,GTTTTAATTCCTTTACCCCTCACGGGGATGGAAAC,GTTTTAATTCCTTTACCCCTCACGGGGATGGAAACT	35,35,35,35,36	0	0	NA	NA	NA:NA:NA:NA:NA	52,52,48,48,48	52	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|90aa|up_7|NC_019682.1_2892439_2892709_+,NA|114aa|up_6|NC_019682.1_2892938_2893280_-,NA|93aa|up_4|NC_019682.1_2894121_2894400_+,NA|71aa|down_0|NC_019682.1_2906912_2907125_-,NA|620aa|down_3|NC_019682.1_2908434_2910294_-	NA|283aa|up_9|NC_019682.1_2889960_2890809_+	cd05640, M28_like, M28 Zn-peptidase; uncharacterized subfamily	NA|304aa|up_8|NC_019682.1_2890843_2891755_-	COG3781, COG3781, Predicted membrane protein [Function unknown]	NA|90aa|up_7|NC_019682.1_2892439_2892709_+	NA	NA|114aa|up_6|NC_019682.1_2892938_2893280_-	NA	NA|171aa|up_5|NC_019682.1_2893603_2894116_+	pfam18063, BB_PF, Beta barrel Pore-forming domain	NA|93aa|up_4|NC_019682.1_2894121_2894400_+	NA	NA|670aa|up_3|NC_019682.1_2894598_2896608_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|249aa|up_2|NC_019682.1_2897139_2897886_+	pfam04481, DUF561, Protein of unknown function (DUF561)	NA|383aa|up_1|NC_019682.1_2898106_2899255_-	cd00616, AHBA_syn, 3-amino-5-hydroxybenzoic acid synthase family (AHBA_syn)	NA|259aa|up_0|NC_019682.1_2899605_2900382_+	pfam03649, UPF0014, Uncharacterized protein family (UPF0014)	NA|71aa|down_0|NC_019682.1_2906912_2907125_-	NA	NA|125aa|down_1|NC_019682.1_2907506_2907881_-	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|92aa|down_2|NC_019682.1_2907877_2908153_-	COG0864, NikR, Predicted transcriptional regulators containing the CopG/Arc/MetJ DNA-binding domain and a metal-binding domain [Transcription]	NA|620aa|down_3|NC_019682.1_2908434_2910294_-	NA	NA|410aa|down_4|NC_019682.1_2911005_2912235_+	COG0045, SucC, Succinyl-CoA synthetase, beta subunit [Energy production and conversion]	NA|299aa|down_5|NC_019682.1_2912635_2913532_+	COG0074, SucD, Succinyl-CoA synthetase, alpha subunit [Energy production and conversion]	NA|142aa|down_6|NC_019682.1_2913632_2914058_-	cd18696, PIN_MtVapC26-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC26 and related proteins	NA|615aa|down_7|NC_019682.1_2914439_2916284_-	sd00006, TPR, Tetratricopeptide repeat	NA|908aa|down_8|NC_019682.1_2916372_2919096_-	COG0617, PcnB, tRNA nucleotidyltransferase/poly(A) polymerase [Translation, ribosomal structure and biogenesis]	NA|63aa|down_9|NC_019682.1_2919233_2919422_+	PRK02576, psbZ, photosystem II reaction center protein PsbZ
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	13	3051419-3053058	9,13,7,10,11	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas10,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,cas6,cas2,cas1	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Type III-A,Type III-B,Type III-C,Type III-D	GTTTTAATTCCTTTACCCCTCACGGGGATGGAAAC,GTTTCCATCCCCGTGAGGGGTAAAGGAATTAAAAC,GTTTCCATCCCCGTGAGGGGTAAAGGAATTAAAAC,GTTTTAATTCCTTTACCCCTCACGGGGATGGAAAC,GTTTTAATTCCTTTACCCCTCACGGGGATGGAAAC	35,35,35,35,35	0	0	NA	NA	NA:NA:NA:NA:NA	19,22,22,19,19	22	TypeIII-A,TypeIII-B,TypeIII-C,TypeIII-D	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	csx21|215aa|up_9|NC_019682.1_3044969_3045614_+,NA|273aa|up_8|NC_019682.1_3045625_3046444_-,NA|79aa|up_6|NC_019682.1_3047870_3048107_-,NA|101aa|up_5|NC_019682.1_3048231_3048534_-,NA|90aa|up_4|NC_019682.1_3048616_3048886_-,NA|134aa|up_0|NC_019682.1_3050923_3051325_-,NA|71aa|down_1|NC_019682.1_3055963_3056176_-,NA|144aa|down_2|NC_019682.1_3056394_3056826_+,NA|196aa|down_5|NC_019682.1_3062235_3062823_+,NA|52aa|down_9|NC_019682.1_3066975_3067131_-	csx21|215aa|up_9|NC_019682.1_3044969_3045614_+	NA	NA|273aa|up_8|NC_019682.1_3045625_3046444_-	NA	cas6|371aa|up_7|NC_019682.1_3046552_3047665_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|79aa|up_6|NC_019682.1_3047870_3048107_-	NA	NA|101aa|up_5|NC_019682.1_3048231_3048534_-	NA	NA|90aa|up_4|NC_019682.1_3048616_3048886_-	NA	cas2|93aa|up_3|NC_019682.1_3049016_3049295_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|347aa|up_2|NC_019682.1_3049294_3050335_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|142aa|up_1|NC_019682.1_3050498_3050924_-	cd00085, HNHc, HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins	NA|134aa|up_0|NC_019682.1_3050923_3051325_-	NA	cas1|674aa|down_0|NC_019682.1_3053444_3055466_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|71aa|down_1|NC_019682.1_3055963_3056176_-	NA	NA|144aa|down_2|NC_019682.1_3056394_3056826_+	NA	NA|249aa|down_3|NC_019682.1_3057218_3057965_+	pfam14326, DUF4384, Domain of unknown function (DUF4384)	NA|1266aa|down_4|NC_019682.1_3058185_3061983_+	pfam12770, CHAT, CHAT domain	NA|196aa|down_5|NC_019682.1_3062235_3062823_+	NA	NA|603aa|down_6|NC_019682.1_3063356_3065165_+	COG2885, OmpA, Outer membrane protein and related peptidoglycan-associated (lipo)proteins [Cell envelope biogenesis, outer membrane]	NA|167aa|down_7|NC_019682.1_3065179_3065680_+	cd00154, Rab, Ras-related in brain (Rab) family of small guanosine triphosphatases (GTPases)	NA|371aa|down_8|NC_019682.1_3065817_3066930_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|52aa|down_9|NC_019682.1_3066975_3067131_-	NA
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	14	3164923-3165026	14	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	TTGCAAAAAGCGATCGCTGGTAATCG	26	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|89aa|up_8|NC_019682.1_3155046_3155313_-,NA|53aa|up_7|NC_019682.1_3155712_3155871_-,NA	NA|431aa|up_9|NC_019682.1_3152627_3153920_-	cd17485, MFS_MFSD3, Major facilitator superfamily domain containing 3 protein	NA|89aa|up_8|NC_019682.1_3155046_3155313_-	NA	NA|53aa|up_7|NC_019682.1_3155712_3155871_-	NA	NA|391aa|up_6|NC_019682.1_3156495_3157668_+	PRK07406, PRK07406, RNA polymerase sigma factor RpoD; Validated	NA|60aa|up_5|NC_019682.1_3157784_3157964_-	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|847aa|up_4|NC_019682.1_3158421_3160962_-	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|135aa|up_3|NC_019682.1_3161189_3161594_+	cd17548, REC_DivK-like, phosphoacceptor receiver (REC) domain of DivK and similar proteins	NA|295aa|up_2|NC_019682.1_3161816_3162701_-	sd00006, TPR, Tetratricopeptide repeat	NA|215aa|up_1|NC_019682.1_3162880_3163525_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|181aa|up_0|NC_019682.1_3163704_3164247_+	cd09916, CpxP_like, CpxP component of the bacterial Cpx-two-component system and related proteins	NA|424aa|down_0|NC_019682.1_3166112_3167384_+	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|141aa|down_1|NC_019682.1_3167456_3167879_+	cd16377, 23S_rRNA_IVP_like, 23S rRNA-intervening sequence protein and similar proteins	NA|224aa|down_2|NC_019682.1_3167986_3168658_+	COG1136, SalX, ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms]	NA|474aa|down_3|NC_019682.1_3168703_3170125_+	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|491aa|down_4|NC_019682.1_3170521_3171994_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|406aa|down_5|NC_019682.1_3171990_3173208_+	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|241aa|down_6|NC_019682.1_3173571_3174294_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|221aa|down_7|NC_019682.1_3174372_3175035_-	PRK00122, rimM, 16S rRNA-processing protein RimM; Provisional	NA|428aa|down_8|NC_019682.1_3175048_3176332_-	PRK09440, avtA, valine--pyruvate transaminase; Provisional	NA|370aa|down_9|NC_019682.1_3176664_3177774_+	cd08300, alcohol_DH_class_III, class III alcohol dehydrogenases
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	15	3375025-3376306	15,8,12	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Type I-D	GTTTCAATCCCTGATAGGGATTTAAGTTAATTGGAAC,GTTTCAATCCCTGATAGGGATTTAAGTTAATTGGAAC,GTTCCAATTAACTTAAATCCCTATCAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	17,17,17	17	TypeI-D	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|85aa|up_8|NC_019682.1_3367187_3367442_+,NA|204aa|up_5|NC_019682.1_3368725_3369337_-,NA|204aa|up_1|NC_019682.1_3373610_3374222_+,NA	NA|257aa|up_9|NC_019682.1_3365863_3366634_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|85aa|up_8|NC_019682.1_3367187_3367442_+	NA	NA|108aa|up_7|NC_019682.1_3367805_3368129_-	TIGR00365, TIGR00365, monothiol glutaredoxin, Grx4 family	NA|86aa|up_6|NC_019682.1_3368223_3368481_-	COG0271, BolA, Stress-induced morphogen (activity unknown) [Signal transduction mechanisms]	NA|204aa|up_5|NC_019682.1_3368725_3369337_-	NA	NA|701aa|up_4|NC_019682.1_3369636_3371739_+	cd07498, Peptidases_S8_15, Peptidase S8 family domain, uncharacterized subfamily 15	NA|256aa|up_3|NC_019682.1_3371895_3372663_-	COG0204, PlsC, 1-acyl-sn-glycerol-3-phosphate acyltransferase [Lipid metabolism]	NA|161aa|up_2|NC_019682.1_3372777_3373260_-	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|204aa|up_1|NC_019682.1_3373610_3374222_+	NA	NA|126aa|up_0|NC_019682.1_3374602_3374980_-	cd06102, citrate_synt_like_2, Citrate synthase (CS) catalyzes the condensation of acetyl coenzyme A (AcCoA) and oxalacetate (OAA) to form citrate and coenzyme A (CoA), the first step in the oxidative citric acid cycle (TCA or Krebs cycle)	cas2|91aa|down_0|NC_019682.1_3376560_3376833_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_1|NC_019682.1_3376941_3377946_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|197aa|down_2|NC_019682.1_3378067_3378658_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|274aa|down_3|NC_019682.1_3378674_3379496_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	2OG_CAS|211aa|down_4|NC_019682.1_3379473_3380106_-	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	cas3|761aa|down_5|NC_019682.1_3380182_3382465_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	csc1gr5|253aa|down_6|NC_019682.1_3382457_3383216_-	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	csc2gr7|343aa|down_7|NC_019682.1_3383219_3384248_-	pfam18320, Csc2, Csc2 Crispr	cas10d|898aa|down_8|NC_019682.1_3384268_3386962_-	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	WYL|288aa|down_9|NC_019682.1_3387144_3388008_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	16	3390424-3390511	16	CRISPRCasFinder	no	cas2,cas1,cas4,cas6,2OG_CAS,cas3,csc1gr5,csc2gr7,cas10d,WYL	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Type I-D	GAGTTATGAGTACAACTTCTAAT	23	0	0	NA	NA	NA	1	1	TypeI-D	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|100aa|up_1|NC_019682.1_3388211_3388511_+,NA	cas4|197aa|up_9|NC_019682.1_3378067_3378658_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas6|274aa|up_8|NC_019682.1_3378674_3379496_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	2OG_CAS|211aa|up_7|NC_019682.1_3379473_3380106_-	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	cas3|761aa|up_6|NC_019682.1_3380182_3382465_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	csc1gr5|253aa|up_5|NC_019682.1_3382457_3383216_-	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	csc2gr7|343aa|up_4|NC_019682.1_3383219_3384248_-	pfam18320, Csc2, Csc2 Crispr	cas10d|898aa|up_3|NC_019682.1_3384268_3386962_-	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	WYL|288aa|up_2|NC_019682.1_3387144_3388008_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|100aa|up_1|NC_019682.1_3388211_3388511_+	NA	NA|430aa|up_0|NC_019682.1_3389112_3390402_+	PRK02862, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|336aa|down_0|NC_019682.1_3391162_3392170_+	PRK14299, PRK14299, chaperone protein DnaJ; Provisional	NA|251aa|down_1|NC_019682.1_3392250_3393003_-	PRK06172, PRK06172, SDR family oxidoreductase	NA|224aa|down_2|NC_019682.1_3393197_3393869_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|181aa|down_3|NC_019682.1_3394180_3394723_-	COG1225, Bcp, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|204aa|down_4|NC_019682.1_3395156_3395768_-	cd03015, PRX_Typ2cys, Peroxiredoxin (PRX) family, Typical 2-Cys PRX subfamily; PRXs are thiol-specific antioxidant (TSA) proteins, which confer a protective role in cells through its peroxidase activity by reducing hydrogen peroxide, peroxynitrite, and organic hydroperoxides	NA|694aa|down_5|NC_019682.1_3395962_3398044_+	pfam06202, GDE_C, Amylo-alpha-1,6-glucosidase	NA|738aa|down_6|NC_019682.1_3399203_3401417_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|154aa|down_7|NC_019682.1_3402215_3402677_-	cd18094, SpoU-like_TrmL, SAM-dependent tRNA methylase related to TrmL	NA|380aa|down_8|NC_019682.1_3402959_3404099_-	TIGR02048, gshA_cyano, glutamate--cysteine ligase, cyanobacterial, putative	NA|404aa|down_9|NC_019682.1_3404383_3405595_-	cd17243, RMtype1_S_AchA6I-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Arthrobacter chlorophenolicus A6 S subunit (S
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	17	3571237-3571320	17	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	CACTGACAGCAGTGGGTTTTTTAG	24	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|76aa|up_9|NC_019682.1_3558146_3558374_-,NA|77aa|up_8|NC_019682.1_3558480_3558711_+,NA|73aa|up_6|NC_019682.1_3559747_3559966_-,NA|1176aa|up_5|NC_019682.1_3559988_3563516_-,NA|263aa|down_2|NC_019682.1_3579558_3580347_+,NA|119aa|down_8|NC_019682.1_3588350_3588707_+,NA|465aa|down_9|NC_019682.1_3588773_3590168_-	NA|76aa|up_9|NC_019682.1_3558146_3558374_-	NA	NA|77aa|up_8|NC_019682.1_3558480_3558711_+	NA	NA|291aa|up_7|NC_019682.1_3558797_3559670_+	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|73aa|up_6|NC_019682.1_3559747_3559966_-	NA	NA|1176aa|up_5|NC_019682.1_3559988_3563516_-	NA	NA|306aa|up_4|NC_019682.1_3563598_3564516_-	pfam13182, DUF4007, Protein of unknown function (DUF4007)	NA|293aa|up_3|NC_019682.1_3565413_3566292_+	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|311aa|up_2|NC_019682.1_3566405_3567338_+	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|417aa|up_1|NC_019682.1_3568007_3569258_+	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|152aa|up_0|NC_019682.1_3569997_3570453_+	pfam01475, FUR, Ferric uptake regulator family	NA|656aa|down_0|NC_019682.1_3576047_3578015_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|406aa|down_1|NC_019682.1_3578204_3579422_+	PRK00180, PRK00180, acetate kinase A/propionate kinase 2; Reviewed	NA|263aa|down_2|NC_019682.1_3579558_3580347_+	NA	NA|262aa|down_3|NC_019682.1_3580405_3581191_+	COG0115, IlvE, Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase [Amino acid transport and metabolism / Coenzyme metabolism]	NA|613aa|down_4|NC_019682.1_3581308_3583147_+	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|176aa|down_5|NC_019682.1_3584487_3585015_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|188aa|down_6|NC_019682.1_3585011_3585575_+	COG5433, COG5433, Transposase [DNA replication, recombination, and repair]	NA|443aa|down_7|NC_019682.1_3585601_3586930_-	pfam02281, Dimer_Tnp_Tn5, Transposase Tn5 dimerization domain	NA|119aa|down_8|NC_019682.1_3588350_3588707_+	NA	NA|465aa|down_9|NC_019682.1_3588773_3590168_-	NA
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	18	3878625-3878722	18	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	AAGGCTACGGTGTACACACAAGT	23	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|52aa|up_9|NC_019682.1_3867410_3867566_-,NA|68aa|down_4|NC_019682.1_3885713_3885917_+,NA|252aa|down_7|NC_019682.1_3890926_3891682_-,NA|264aa|down_9|NC_019682.1_3892904_3893696_-	NA|52aa|up_9|NC_019682.1_3867410_3867566_-	NA	NA|467aa|up_8|NC_019682.1_3867638_3869039_+	sd00006, TPR, Tetratricopeptide repeat	NA|357aa|up_7|NC_019682.1_3869082_3870153_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|236aa|up_6|NC_019682.1_3870208_3870916_+	cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold	NA|258aa|up_5|NC_019682.1_3870917_3871691_+	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|238aa|up_4|NC_019682.1_3871767_3872481_-	cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold	NA|712aa|up_3|NC_019682.1_3872650_3874786_-	COG1835, COG1835, Predicted acyltransferases [Lipid metabolism]	NA|372aa|up_2|NC_019682.1_3875784_3876900_-	cd01918, HprK_C, HprK/P, the bifunctional histidine-containing protein kinase/phosphatase, controls the phosphorylation state of the phosphocarrier protein HPr and regulates the utilization of carbon sources by gram-positive bacteria	NA|350aa|up_1|NC_019682.1_3876892_3877942_-	pfam14907, NTP_transf_5, Uncharacterized nucleotidyltransferase	NA|149aa|up_0|NC_019682.1_3877948_3878395_-	pfam05402, PqqD, Coenzyme PQQ synthesis protein D (PqqD)	NA|785aa|down_0|NC_019682.1_3879880_3882235_+	TIGR04534, hypothetical_protein, ELWxxDGT repeat	NA|272aa|down_1|NC_019682.1_3883017_3883833_-	pfam12204, DUF3598, Domain of unknown function (DUF3598)	NA|300aa|down_2|NC_019682.1_3884064_3884964_-	COG2177, FtsX, Cell division protein [Cell division and chromosome partitioning]	NA|188aa|down_3|NC_019682.1_3885137_3885701_+	PRK00150, def, peptide deformylase; Reviewed	NA|68aa|down_4|NC_019682.1_3885713_3885917_+	NA	NA|1039aa|down_5|NC_019682.1_3886039_3889156_-	pfam12770, CHAT, CHAT domain	NA|492aa|down_6|NC_019682.1_3889280_3890756_-	COG5310, COG5310, Homospermidine synthase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|252aa|down_7|NC_019682.1_3890926_3891682_-	NA	NA|409aa|down_8|NC_019682.1_3891681_3892908_-	pfam03738, GSP_synth, Glutathionylspermidine synthase preATP-grasp	NA|264aa|down_9|NC_019682.1_3892904_3893696_-	NA
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	19	4053786-4053899	19	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	ACTTGGTGGTAAGTTACTCGGCGTTTG	27	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|186aa|up_9|NC_019682.1_4045789_4046347_+,NA|67aa|up_1|NC_019682.1_4052645_4052846_+,NA|174aa|down_0|NC_019682.1_4054458_4054980_-,NA|145aa|down_3|NC_019682.1_4058954_4059389_-,NA|96aa|down_9|NC_019682.1_4065401_4065689_+	NA|186aa|up_9|NC_019682.1_4045789_4046347_+	NA	NA|215aa|up_8|NC_019682.1_4046425_4047070_+	pfam11780, DUF3318, Protein of unknown function (DUF3318)	NA|266aa|up_7|NC_019682.1_4047171_4047969_-	COG0602, NrdG, Organic radical activating enzymes [Posttranslational modification, protein turnover, chaperones]	NA|350aa|up_6|NC_019682.1_4048167_4049217_+	pfam02397, Bac_transf, Bacterial sugar transferase	NA|159aa|up_5|NC_019682.1_4049622_4050099_+	pfam11909, NdhN, NADH-quinone oxidoreductase cyanobacterial subunit N	NA|292aa|up_4|NC_019682.1_4050652_4051528_+	COG1864, NUC1, DNA/RNA endonuclease G, NUC1 [Nucleotide transport and metabolism]	NA|137aa|up_3|NC_019682.1_4051533_4051944_-	pfam07924, NuiA, Nuclease A inhibitor-like protein	NA|157aa|up_2|NC_019682.1_4052163_4052634_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|67aa|up_1|NC_019682.1_4052645_4052846_+	NA	NA|180aa|up_0|NC_019682.1_4052919_4053459_+	pfam11371, DUF3172, Protein of unknown function (DUF3172)	NA|174aa|down_0|NC_019682.1_4054458_4054980_-	NA	NA|733aa|down_1|NC_019682.1_4054984_4057183_-	PLN03159, PLN03159, cation/H(+) antiporter 15; Provisional	NA|430aa|down_2|NC_019682.1_4057457_4058747_-	PLN03159, PLN03159, cation/H(+) antiporter 15; Provisional	NA|145aa|down_3|NC_019682.1_4058954_4059389_-	NA	NA|233aa|down_4|NC_019682.1_4059651_4060350_-	cd01835, SGNH_hydrolase_like_3, SGNH_hydrolase subfamily	NA|296aa|down_5|NC_019682.1_4060459_4061347_-	pfam02517, Abi, CAAX protease self-immunity	NA|654aa|down_6|NC_019682.1_4061489_4063451_-	COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms]	NA|193aa|down_7|NC_019682.1_4063614_4064193_-	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|222aa|down_8|NC_019682.1_4064320_4064986_-	PRK11617, PRK11617, deoxyribonuclease V	NA|96aa|down_9|NC_019682.1_4065401_4065689_+	NA
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	20	4106598-4106695	20	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	AGCGTAGTCGAAGTATCAGTTATCAGTTA	29	1	3	4106627-4106666|4106627-4106666|4106627-4106666	NC_019682.1_4100841-4100880|NC_019682.1_4104721-4104682|NC_019682.1_4104790-4104751	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|133aa|up_7|NC_019682.1_4099218_4099617_+,NA|77aa|up_2|NC_019682.1_4104328_4104559_-,NA|168aa|up_0|NC_019682.1_4106052_4106556_-,NA	NA|211aa|up_9|NC_019682.1_4096319_4096952_-	COG3932, COG3932, Uncharacterized ABC-type transport system, permease components [General function prediction only]	NA|295aa|up_8|NC_019682.1_4097109_4097994_-	PRK13057, PRK13057, lipid kinase	NA|133aa|up_7|NC_019682.1_4099218_4099617_+	NA	NA|326aa|up_6|NC_019682.1_4099819_4100797_-	COG0330, HflC, Membrane protease subunits, stomatin/prohibitin homologs [Posttranslational modification, protein turnover, chaperones]	NA|145aa|up_5|NC_019682.1_4100973_4101408_-	COG1585, COG1585, Membrane protein implicated in regulation of membrane protease activity [Posttranslational modification, protein turnover, chaperones / Intracellular trafficking and secretion]	NA|640aa|up_4|NC_019682.1_4101651_4103571_+	PRK14559, PRK14559, serine/threonine phosphatase	NA|91aa|up_3|NC_019682.1_4104059_4104332_+	pfam02941, FeThRed_A, Ferredoxin thioredoxin reductase variable alpha chain	NA|77aa|up_2|NC_019682.1_4104328_4104559_-	NA	NA|264aa|up_1|NC_019682.1_4104802_4105594_-	cd06259, YdcF-like, YdcF-like	NA|168aa|up_0|NC_019682.1_4106052_4106556_-	NA	NA|512aa|down_0|NC_019682.1_4106844_4108380_-	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|100aa|down_1|NC_019682.1_4108609_4108909_-	PRK05943, PRK05943, 50S ribosomal protein L25; Reviewed	NA|448aa|down_2|NC_019682.1_4109059_4110403_-	PRK01117, PRK01117, adenylosuccinate synthetase; Provisional	NA|839aa|down_3|NC_019682.1_4111355_4113872_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|348aa|down_4|NC_019682.1_4113963_4115007_+	COG1816, Add, Adenosine deaminase [Nucleotide transport and metabolism]	NA|82aa|down_5|NC_019682.1_4115179_4115425_+	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|134aa|down_6|NC_019682.1_4115421_4115823_+	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|207aa|down_7|NC_019682.1_4116865_4117486_+	pfam05685, Uma2, Putative restriction endonuclease	NA|112aa|down_8|NC_019682.1_4117755_4118091_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|560aa|down_9|NC_019682.1_4118268_4119948_+	PRK00095, mutL, DNA mismatch repair endonuclease MutL
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	21	4130743-4130829	21	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	AGGGCGCAGGGGGAAAGAGTTAGAG	25	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|62aa|up_8|NC_019682.1_4120156_4120342_+,NA|254aa|up_1|NC_019682.1_4128008_4128770_-,NA|73aa|down_6|NC_019682.1_4135073_4135292_+	NA|560aa|up_9|NC_019682.1_4118268_4119948_+	PRK00095, mutL, DNA mismatch repair endonuclease MutL	NA|62aa|up_8|NC_019682.1_4120156_4120342_+	NA	NA|99aa|up_7|NC_019682.1_4120326_4120623_+	pfam05016, ParE_toxin, ParE toxin of type II toxin-antitoxin system, parDE	NA|367aa|up_6|NC_019682.1_4120665_4121766_-	COG1472, BglX, Beta-glucosidase-related glycosidases [Carbohydrate transport and metabolism]	NA|412aa|up_5|NC_019682.1_4121911_4123147_-	COG0797, RlpA, Lipoproteins [Cell envelope biogenesis, outer membrane]	NA|480aa|up_4|NC_019682.1_4123786_4125226_+	TIGR02731, Phytoene_dehydrogenase_chloroplastic/chromoplastic, phytoene desaturase	NA|311aa|up_3|NC_019682.1_4125209_4126142_+	PLN02632, PLN02632, phytoene synthase	NA|442aa|up_2|NC_019682.1_4126558_4127884_+	cd06346, PBP1_ABC_ligand_binding-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|254aa|up_1|NC_019682.1_4128008_4128770_-	NA	NA|313aa|up_0|NC_019682.1_4129066_4130005_-	pfam13354, Beta-lactamase2, Beta-lactamase enzyme family	NA|338aa|down_0|NC_019682.1_4130869_4131883_+	PRK10714, PRK10714, undecaprenyl phosphate 4-deoxy-4-formamido-L-arabinose transferase; Provisional	NA|411aa|down_1|NC_019682.1_4131964_4133197_+	cd17489, MFS_YfcJ_like, Escherichia coli YfcJ, YhhS, and similar transporters of the Major Facilitator Superfamily	NA|172aa|down_2|NC_019682.1_4133346_4133862_+	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|66aa|down_3|NC_019682.1_4133863_4134061_-	pfam11387, DUF2795, Protein of unknown function (DUF2795)	NA|83aa|down_4|NC_019682.1_4134287_4134536_+	pfam13239, 2TM, 2TM domain	NA|103aa|down_5|NC_019682.1_4134655_4134964_+	pfam11378, DUF3181, Protein of unknown function (DUF3181)	NA|73aa|down_6|NC_019682.1_4135073_4135292_+	NA	NA|160aa|down_7|NC_019682.1_4135339_4135819_-	pfam14108, DUF4281, Domain of unknown function (DUF4281)	NA|345aa|down_8|NC_019682.1_4135873_4136908_-	TIGR02475, Probable_cobalamine_biosynthesis_protein, cobalamin biosynthesis protein CobW	NA|227aa|down_9|NC_019682.1_4137117_4137798_+	COG2085, COG2085, Predicted dinucleotide-binding enzymes [General function prediction only]
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	22	4392319-4392464	22	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	GTGCAACTTCTGTTGGCGTAGGTATGGGCGCTGGCTGTGTGCGGGTGGGTGTG	53	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|75aa|up_7|NC_019682.1_4384656_4384881_+,NA|88aa|down_2|NC_019682.1_4396275_4396539_+,NA|158aa|down_3|NC_019682.1_4398836_4399310_-	NA|64aa|up_9|NC_019682.1_4383267_4383459_+	pfam01679, Pmp3, Proteolipid membrane potential modulator	NA|267aa|up_8|NC_019682.1_4383640_4384441_-	TIGR02169, chromosome_segregation_protein_related_ptotein, chromosome segregation protein SMC, primarily archaeal type	NA|75aa|up_7|NC_019682.1_4384656_4384881_+	NA	NA|112aa|up_6|NC_019682.1_4385049_4385385_+	PRK13697, PRK13697, cytochrome c6; Provisional	NA|340aa|up_5|NC_019682.1_4385438_4386458_+	cd19101, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|92aa|up_4|NC_019682.1_4386613_4386889_+	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|129aa|up_3|NC_019682.1_4386892_4387279_+	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|118aa|up_2|NC_019682.1_4387359_4387713_-	pfam08869, XisI, XisI protein	NA|717aa|up_1|NC_019682.1_4387914_4390065_+	PRK01233, glyS, glycyl-tRNA synthetase subunit beta; Validated	NA|483aa|up_0|NC_019682.1_4390154_4391603_+	PRK02705, murD, UDP-N-acetylmuramoyl-L-alanine--D-glutamate ligase	NA|307aa|down_0|NC_019682.1_4392935_4393856_-	pfam06325, PrmA, Ribosomal protein L11 methyltransferase (PrmA)	NA|527aa|down_1|NC_019682.1_4394004_4395585_-	PRK13581, PRK13581, D-3-phosphoglycerate dehydrogenase; Provisional	NA|88aa|down_2|NC_019682.1_4396275_4396539_+	NA	NA|158aa|down_3|NC_019682.1_4398836_4399310_-	NA	NA|466aa|down_4|NC_019682.1_4399356_4400754_-	cd01298, ATZ_TRZ_like, TRZ/ATZ family contains enzymes from the atrazine degradation pathway and related hydrolases	NA|258aa|down_5|NC_019682.1_4400837_4401611_+	COG1335, PncA, Amidases related to nicotinamidase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|152aa|down_6|NC_019682.1_4401686_4402142_+	cd06987, cupin_MAE_RS03005, Microcystis aeruginosa MAE_RS03005 and related proteins, cupin domain	NA|121aa|down_7|NC_019682.1_4402038_4402401_-	pfam01844, HNH, HNH endonuclease	NA|721aa|down_8|NC_019682.1_4402680_4404843_-	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|249aa|down_9|NC_019682.1_4405080_4405827_+	pfam05685, Uma2, Putative restriction endonuclease
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	23	4570254-4570357	23	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	CTCCACCCACAATTCGATTGAGTTT	25	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|73aa|up_7|NC_019682.1_4556566_4556785_-,NA|143aa|up_6|NC_019682.1_4557233_4557662_-,NA	NA|150aa|up_9|NC_019682.1_4555602_4556052_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|144aa|up_8|NC_019682.1_4556138_4556570_-	cd09881, PIN_VapC4-5_FitB-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4 and VapC5, and Neisseria gonorrhoeae FitB and related proteins	NA|73aa|up_7|NC_019682.1_4556566_4556785_-	NA	NA|143aa|up_6|NC_019682.1_4557233_4557662_-	NA	NA|1079aa|up_5|NC_019682.1_4557850_4561087_-	COG1483, COG1483, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|473aa|up_4|NC_019682.1_4561432_4562851_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|499aa|up_3|NC_019682.1_4563024_4564521_-	PRK00139, murE, UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase; Provisional	NA|95aa|up_2|NC_019682.1_4564573_4564858_-	pfam05768, DUF836, Glutaredoxin-like domain (DUF836)	NA|749aa|up_1|NC_019682.1_4565031_4567278_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|273aa|up_0|NC_019682.1_4567855_4568674_+	cd05243, SDR_a5, atypical (a) SDRs, subgroup 5	NA|147aa|down_0|NC_019682.1_4571097_4571538_+	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|619aa|down_1|NC_019682.1_4571588_4573445_-	cd07302, CHD, cyclase homology domain	NA|118aa|down_2|NC_019682.1_4573539_4573893_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|111aa|down_3|NC_019682.1_4573899_4574232_-	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|1212aa|down_4|NC_019682.1_4574399_4578035_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|77aa|down_5|NC_019682.1_4578101_4578332_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|131aa|down_6|NC_019682.1_4578328_4578721_+	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|205aa|down_7|NC_019682.1_4578962_4579577_+	COG0400, COG0400, Predicted esterase [General function prediction only]	NA|71aa|down_8|NC_019682.1_4579732_4579945_+	pfam10742, DUF2555, Protein of unknown function (DUF2555)	NA|411aa|down_9|NC_019682.1_4580074_4581307_+	PRK05579, PRK05579, bifunctional phosphopantothenoylcysteine decarboxylase/phosphopantothenate synthase; Validated
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	24	4693226-4693351	24	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	AAGAATTTAGATCGCCTGTGATGTTAATGTTAT	33	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|81aa|up_5|NC_019682.1_4678937_4679180_-,NA|72aa|down_0|NC_019682.1_4694211_4694427_-,NA|63aa|down_9|NC_019682.1_4707508_4707697_-	NA|127aa|up_9|NC_019682.1_4675624_4676005_+	pfam05685, Uma2, Putative restriction endonuclease	NA|384aa|up_8|NC_019682.1_4676017_4677169_-	pfam08852, DUF1822, Protein of unknown function (DUF1822)	NA|391aa|up_7|NC_019682.1_4677255_4678428_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|130aa|up_6|NC_019682.1_4678551_4678941_-	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|81aa|up_5|NC_019682.1_4678937_4679180_-	NA	NA|1072aa|up_4|NC_019682.1_4679291_4682507_-	pfam05860, Haemagg_act, haemagglutination activity domain	NA|231aa|up_3|NC_019682.1_4682887_4683580_+	pfam04151, PPC, Bacterial pre-peptidase C-terminal domain	NA|249aa|up_2|NC_019682.1_4683940_4684687_+	COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase [Posttranslational modification, protein turnover, chaperones]	NA|977aa|up_1|NC_019682.1_4684917_4687848_+	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|501aa|up_0|NC_019682.1_4687910_4689413_+	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|72aa|down_0|NC_019682.1_4694211_4694427_-	NA	NA|796aa|down_1|NC_019682.1_4695249_4697637_+	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|267aa|down_2|NC_019682.1_4697682_4698483_+	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|372aa|down_3|NC_019682.1_4698531_4699647_-	COG0535, COG0535, Predicted Fe-S oxidoreductases [General function prediction only]	NA|339aa|down_4|NC_019682.1_4699750_4700767_-	TIGR03470, HpnH, hopanoid biosynthesis associated radical SAM protein HpnH	NA|329aa|down_5|NC_019682.1_4700904_4701891_-	TIGR03466, HpnA, hopanoid-associated sugar epimerase	NA|912aa|down_6|NC_019682.1_4701944_4704680_-	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|226aa|down_7|NC_019682.1_4705067_4705745_+	cd17877, NP_MTAN-like, nucleoside phosphorylases similar to 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidases	NA|442aa|down_8|NC_019682.1_4706074_4707400_-	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|63aa|down_9|NC_019682.1_4707508_4707697_-	NA
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	25	5067421-5070798	13,25,9	PILER-CR,CRISPRCasFinder,CRT	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	GTTCCAATTAACTTAAATCCCTATCAGGGATTGAAAC,GTTCCAATTAACTTAAATCCCTATCAGGGATTGAAAC,GTTCCAATTAACTTAAATCCCTATCAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	46,46,46	46	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|390aa|up_9|NC_019682.1_5049774_5050944_+,NA|47aa|up_0|NC_019682.1_5067078_5067219_-,NA	NA|390aa|up_9|NC_019682.1_5049774_5050944_+	NA	NA|483aa|up_8|NC_019682.1_5050943_5052392_+	pfam00931, NB-ARC, NB-ARC domain	NA|439aa|up_7|NC_019682.1_5052385_5053702_+	pfam13424, TPR_12, Tetratricopeptide repeat	NA|719aa|up_6|NC_019682.1_5054435_5056592_-	COG4529, COG4529, Uncharacterized protein conserved in bacteria [Function unknown]	NA|956aa|up_5|NC_019682.1_5058341_5061209_+	COG1554, ATH1, Trehalose and maltose hydrolases (possible phosphorylases) [Carbohydrate transport and metabolism]	NA|552aa|up_4|NC_019682.1_5061317_5062973_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|266aa|up_3|NC_019682.1_5063771_5064569_+	COG3638, COG3638, ABC-type phosphate/phosphonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|338aa|up_2|NC_019682.1_5064631_5065645_+	cd13575, PBP2_PnhD, Substrate binding domain of ABC-type phosphonate uptake system; contains the type 2 periplasmic binding fold	NA|258aa|up_1|NC_019682.1_5066208_5066982_+	TIGR01097, PhnE, phosphonate ABC transporter, permease protein PhnE	NA|47aa|up_0|NC_019682.1_5067078_5067219_-	NA	NA|153aa|down_0|NC_019682.1_5071671_5072130_+	COG3624, PhnG, Uncharacterized enzyme of phosphonate metabolism [Inorganic ion transport and metabolism]	NA|200aa|down_1|NC_019682.1_5072202_5072802_+	COG3625, PhnH, Uncharacterized enzyme of phosphonate metabolism [Inorganic ion transport and metabolism]	NA|154aa|down_2|NC_019682.1_5072791_5073253_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|393aa|down_3|NC_019682.1_5073314_5074493_+	COG3626, PhnI, Uncharacterized enzyme of phosphonate metabolism [Inorganic ion transport and metabolism]	NA|142aa|down_4|NC_019682.1_5074561_5074987_+	cd07244, FosA, fosfomycin resistant protein subfamily FosA	NA|399aa|down_5|NC_019682.1_5075172_5076369_+	COG3454, COG3454, Metal-dependent hydrolase involved in phosphonate metabolism [Inorganic ion transport and metabolism]	NA|182aa|down_6|NC_019682.1_5076854_5077400_+	TIGR03276, Phn-HD, phosphonate degradation operons associated HDIG domain protein	NA|279aa|down_7|NC_019682.1_5077463_5078300_+	pfam06007, PhnJ, Phosphonate metabolism protein PhnJ	NA|258aa|down_8|NC_019682.1_5078341_5079115_+	PRK11701, phnK, phosphonate C-P lyase system protein PhnK; Provisional	NA|228aa|down_9|NC_019682.1_5079171_5079855_+	COG4778, PhnL, ABC-type phosphonate transport system, ATPase component [Inorganic ion transport and metabolism]
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	26	5471137-5471283	26	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	TCTAATGATGATTAATTAAACAGGCGAGCGATGTCTACGACGGGCTACGCCTACG	55	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|94aa|up_2|NC_019682.1_5468993_5469275_+,NA|94aa|down_5|NC_019682.1_5481336_5481618_-,NA|121aa|down_6|NC_019682.1_5481734_5482097_+	NA|159aa|up_9|NC_019682.1_5460151_5460628_+	PRK00376, lspA, lipoprotein signal peptidase	NA|205aa|up_8|NC_019682.1_5461186_5461801_-	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|757aa|up_7|NC_019682.1_5462234_5464505_+	COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding protein) [Cell envelope biogenesis, outer membrane]	NA|487aa|up_6|NC_019682.1_5464622_5466083_-	pfam14233, DUF4335, Domain of unknown function (DUF4335)	NA|209aa|up_5|NC_019682.1_5466079_5466706_-	pfam11237, DUF3038, Protein of unknown function (DUF3038)	NA|173aa|up_4|NC_019682.1_5467257_5467776_+	PRK02304, PRK02304, adenine phosphoribosyltransferase; Provisional	NA|350aa|up_3|NC_019682.1_5467875_5468925_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|94aa|up_2|NC_019682.1_5468993_5469275_+	NA	NA|150aa|up_1|NC_019682.1_5469289_5469739_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|287aa|up_0|NC_019682.1_5469984_5470845_-	pfam14015, DUF4231, Protein of unknown function (DUF4231)	NA|500aa|down_0|NC_019682.1_5471994_5473494_-	PRK07349, PRK07349, amidophosphoribosyltransferase; Provisional	NA|1269aa|down_1|NC_019682.1_5473644_5477451_-	PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase subunit PurL	NA|162aa|down_2|NC_019682.1_5478288_5478774_-	cd12125, APC_alpha, Allophycocyanin alpha subunit of the phycobilisome core	NA|456aa|down_3|NC_019682.1_5479010_5480378_+	TIGR00479, 23S_rRNA_uracil1939-C5-methyltransferase_RlmD, 23S rRNA (uracil-5-)-methyltransferase RumA	NA|152aa|down_4|NC_019682.1_5480804_5481260_+	COG2172, RsbW, Anti-sigma regulatory factor (Ser/Thr protein kinase) [Signal transduction mechanisms]	NA|94aa|down_5|NC_019682.1_5481336_5481618_-	NA	NA|121aa|down_6|NC_019682.1_5481734_5482097_+	NA	NA|464aa|down_7|NC_019682.1_5482099_5483491_+	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|103aa|down_8|NC_019682.1_5484529_5484838_+	pfam09383, NIL, NIL domain	NA|226aa|down_9|NC_019682.1_5484905_5485583_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	27	5524559-5524664	27	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	TCCCTACATCACAAAGGCGATCGCA	25	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|70aa|up_3|NC_019682.1_5523062_5523272_+,NA|79aa|up_1|NC_019682.1_5523864_5524101_+,NA|125aa|down_2|NC_019682.1_5525546_5525921_-,NA|88aa|down_3|NC_019682.1_5526425_5526689_-,NA|90aa|down_5|NC_019682.1_5527903_5528173_-,NA|111aa|down_6|NC_019682.1_5528186_5528519_-,NA|57aa|down_7|NC_019682.1_5528554_5528725_-	NA|299aa|up_9|NC_019682.1_5517574_5518471_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|206aa|up_8|NC_019682.1_5518535_5519153_-	pfam05685, Uma2, Putative restriction endonuclease	NA|328aa|up_7|NC_019682.1_5519255_5520239_-	COG1725, COG1725, Predicted transcriptional regulators [Transcription]	NA|254aa|up_6|NC_019682.1_5520519_5521281_+	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|300aa|up_5|NC_019682.1_5521379_5522279_+	COG0539, RpsA, Ribosomal protein S1 [Translation, ribosomal structure and biogenesis]	NA|79aa|up_4|NC_019682.1_5522637_5522874_-	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|70aa|up_3|NC_019682.1_5523062_5523272_+	NA	NA|130aa|up_2|NC_019682.1_5523261_5523651_+	cd18753, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|79aa|up_1|NC_019682.1_5523864_5524101_+	NA	NA|133aa|up_0|NC_019682.1_5524104_5524503_+	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|84aa|down_0|NC_019682.1_5524672_5524924_-	COG1724, COG1724, Predicted RNA binding protein (dsRBD-like fold), HicA family    [General function prediction only]	NA|141aa|down_1|NC_019682.1_5525096_5525519_-	cd09881, PIN_VapC4-5_FitB-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4 and VapC5, and Neisseria gonorrhoeae FitB and related proteins	NA|125aa|down_2|NC_019682.1_5525546_5525921_-	NA	NA|88aa|down_3|NC_019682.1_5526425_5526689_-	NA	NA|84aa|down_4|NC_019682.1_5526924_5527176_-	COG1724, COG1724, Predicted RNA binding protein (dsRBD-like fold), HicA family    [General function prediction only]	NA|90aa|down_5|NC_019682.1_5527903_5528173_-	NA	NA|111aa|down_6|NC_019682.1_5528186_5528519_-	NA	NA|57aa|down_7|NC_019682.1_5528554_5528725_-	NA	NA|94aa|down_8|NC_019682.1_5528885_5529167_-	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|90aa|down_9|NC_019682.1_5529175_5529445_-	TIGR01552, Hypothetical_protein_Rv3357/MT3465/Mb3392
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	28	5781465-5781559	28	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	TTCTGAAAATAGATGTGGTTTAG	23	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|106aa|up_4|NC_019682.1_5776686_5777004_+,NA|48aa|up_3|NC_019682.1_5777053_5777197_+,NA|61aa|up_2|NC_019682.1_5777321_5777504_+,NA|102aa|down_5|NC_019682.1_5791626_5791932_+,NA|166aa|down_7|NC_019682.1_5792898_5793396_+	NA|277aa|up_9|NC_019682.1_5772383_5773214_+	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|74aa|up_8|NC_019682.1_5773744_5773966_-	COG4877, COG4877, Uncharacterized protein conserved in bacteria [Function unknown]	NA|309aa|up_7|NC_019682.1_5773969_5774896_-	cd03402, SPFH_like_u2, Uncharacterized family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|177aa|up_6|NC_019682.1_5774983_5775514_-	PRK08118, PRK08118, DNA topology modulation protein	NA|66aa|up_5|NC_019682.1_5775747_5775945_+	pfam04485, NblA, Phycobilisome degradation protein nblA	NA|106aa|up_4|NC_019682.1_5776686_5777004_+	NA	NA|48aa|up_3|NC_019682.1_5777053_5777197_+	NA	NA|61aa|up_2|NC_019682.1_5777321_5777504_+	NA	NA|536aa|up_1|NC_019682.1_5777977_5779585_+	pfam13282, DUF4070, Domain of unknown function (DUF4070)	NA|420aa|up_0|NC_019682.1_5779808_5781068_-	PRK05388, argJ, bifunctional glutamate N-acetyltransferase/amino-acid acetyltransferase ArgJ	NA|491aa|down_0|NC_019682.1_5782248_5783721_-	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|1049aa|down_1|NC_019682.1_5784610_5787757_+	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|291aa|down_2|NC_019682.1_5788033_5788906_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|343aa|down_3|NC_019682.1_5789017_5790046_+	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|157aa|down_4|NC_019682.1_5790289_5790760_-	cd14503, PTP-bact, bacterial tyrosine-protein phosphataseS similar to Neisseria NMA1982	NA|102aa|down_5|NC_019682.1_5791626_5791932_+	NA	NA|135aa|down_6|NC_019682.1_5792127_5792532_+	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]	NA|166aa|down_7|NC_019682.1_5792898_5793396_+	NA	NA|290aa|down_8|NC_019682.1_5793616_5794486_+	TIGR03709, PPK2_rel_1, polyphosphate:nucleotide phosphotransferase, PPK2 family	NA|208aa|down_9|NC_019682.1_5794633_5795257_+	cd16433, CheB, Chemotaxis response regulator protein-glutamate methylesterase, CheB
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	29	5904205-5904315	29	CRISPRCasFinder	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	CACACTCACCCTCGATAGGAGTGTGTGGAGTACGTCAC	38	0	0	NA	NA	NA	1	1	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|126aa|up_3|NC_019682.1_5900576_5900954_+,NA|168aa|up_2|NC_019682.1_5901184_5901688_+,NA|121aa|down_0|NC_019682.1_5904514_5904877_-,NA|181aa|down_9|NC_019682.1_5913185_5913728_-	NA|642aa|up_9|NC_019682.1_5888576_5890502_+	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|1129aa|up_8|NC_019682.1_5890565_5893952_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|194aa|up_7|NC_019682.1_5894938_5895520_+	pfam07688, KaiA, KaiA C-terminal domain	NA|105aa|up_6|NC_019682.1_5895690_5896005_+	PRK09301, PRK09301, circadian clock protein KaiB; Provisional	NA|521aa|up_5|NC_019682.1_5896089_5897652_+	TIGR02655, Circadian_clock_protein_kinase_KaiC, circadian clock protein KaiC	NA|731aa|up_4|NC_019682.1_5897757_5899950_+	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|126aa|up_3|NC_019682.1_5900576_5900954_+	NA	NA|168aa|up_2|NC_019682.1_5901184_5901688_+	NA	NA|260aa|up_1|NC_019682.1_5901851_5902631_+	TIGR03069, RNA-binding_S4_domain-containing_protein, photosystem II S4 domain protein	NA|259aa|up_0|NC_019682.1_5902687_5903464_-	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|121aa|down_0|NC_019682.1_5904514_5904877_-	NA	NA|361aa|down_1|NC_019682.1_5905167_5906250_+	PRK09250, PRK09250, class I fructose-bisphosphate aldolase	NA|125aa|down_2|NC_019682.1_5906617_5906992_+	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|131aa|down_3|NC_019682.1_5906988_5907381_+	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|609aa|down_4|NC_019682.1_5907476_5909303_-	COG1118, CysA, ABC-type sulfate/molybdate transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|226aa|down_5|NC_019682.1_5909501_5910179_-	PRK13014, PRK13014, methionine sulfoxide reductase A; Provisional	NA|483aa|down_6|NC_019682.1_5910652_5912101_+	COG5279, CYK3, Uncharacterized protein involved in cytokinesis, contains TGc (transglutaminase/protease-like) domain [Cell division and chromosome partitioning]	NA|144aa|down_7|NC_019682.1_5912237_5912669_+	cd01284, Riboflavin_deaminase-reductase, Riboflavin-specific deaminase	NA|123aa|down_8|NC_019682.1_5912727_5913096_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|181aa|down_9|NC_019682.1_5913185_5913728_-	NA
GCF_000316575.1_ASM31657v1	NC_019682	Calothrix sp. PCC 7507, complete sequence	30	6528939-6531966	14,30,10,15	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no		csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	Orphan	GTTGCAATTAACTTAAATCCCTATTAGGGATTGAAAC,GTTTCAATCCCTAATAGGGATTTAAGTTAATTGCAAC,GTTTCAATCCCTAATAGGGATTTAAGTTAATTGCAAC,GTTGCAATTAACTTAAATCCCTATTAGGGATTGAAAC	37,37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B:I-D,II-B	39,41,41,39	41	Orphan	csa3,PD-DExK,cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,WYL,c2c9_V-U4,cas3,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas14j,DEDDh,csm3gr7,csx10gr5,csm2gr11,csx19,csx21,DinG,2OG_CAS,Cas9_archaeal,Cas14u_CAS-V,RT	NA|72aa|up_8|NC_019682.1_6521075_6521291_-,NA|51aa|up_5|NC_019682.1_6523279_6523432_-,NA|313aa|up_4|NC_019682.1_6523580_6524519_-,NA|291aa|up_1|NC_019682.1_6527468_6528341_+,NA|56aa|down_0|NC_019682.1_6532498_6532666_+,NA|457aa|down_1|NC_019682.1_6532940_6534311_+,NA|458aa|down_2|NC_019682.1_6534604_6535978_+,NA|520aa|down_3|NC_019682.1_6536189_6537749_+,NA|180aa|down_4|NC_019682.1_6538431_6538971_+,NA|166aa|down_5|NC_019682.1_6539450_6539948_+,NA|155aa|down_6|NC_019682.1_6540057_6540522_+,NA|274aa|down_7|NC_019682.1_6540652_6541474_+	NA|164aa|up_9|NC_019682.1_6520473_6520965_-	PRK13618, psbV, cytochrome c-550; Provisional	NA|72aa|up_8|NC_019682.1_6521075_6521291_-	NA	NA|279aa|up_7|NC_019682.1_6521523_6522360_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|303aa|up_6|NC_019682.1_6522356_6523265_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|51aa|up_5|NC_019682.1_6523279_6523432_-	NA	NA|313aa|up_4|NC_019682.1_6523580_6524519_-	NA	NA|204aa|up_3|NC_019682.1_6524851_6525463_+	cd06170, LuxR_C_like, C-terminal DNA-binding domain of LuxR-like proteins	NA|426aa|up_2|NC_019682.1_6525577_6526855_-	cd14748, PBP2_UgpB, The periplasmic-binding component of ABC transport system specific for sn-glycerol-3-phosphate; possesses type 2 periplasmic binding fold	NA|291aa|up_1|NC_019682.1_6527468_6528341_+	NA	NA|138aa|up_0|NC_019682.1_6528482_6528896_-	PRK09256, PRK09256, aminoacyl-tRNA hydrolase	NA|56aa|down_0|NC_019682.1_6532498_6532666_+	NA	NA|457aa|down_1|NC_019682.1_6532940_6534311_+	NA	NA|458aa|down_2|NC_019682.1_6534604_6535978_+	NA	NA|520aa|down_3|NC_019682.1_6536189_6537749_+	NA	NA|180aa|down_4|NC_019682.1_6538431_6538971_+	NA	NA|166aa|down_5|NC_019682.1_6539450_6539948_+	NA	NA|155aa|down_6|NC_019682.1_6540057_6540522_+	NA	NA|274aa|down_7|NC_019682.1_6540652_6541474_+	NA	NA|1349aa|down_8|NC_019682.1_6541548_6545595_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|761aa|down_9|NC_019682.1_6546297_6548580_-	PRK13557, PRK13557, histidine kinase; Provisional
