assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	1	744885-746239	1,1,1,2	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Type III-A, Type III-D?,Type III-C,Type III-B,Type III-D	GTTACGCTTACACTTCCCCCGCAAGGGGATGGAAAC,GTTACGCTTACACTTCCCCCGCAAGGGGATGGAAAC,GTTACGCTTACACTTCCCCCGCAAGGGGATGGAAAC,GTTACGCTTACACTTCCCCCGCAAGGGGATGGAAAC	36,36,36,36	0	0	NA	NA	NA:NA:NA:NA	15,18,18,15	18	TypeIII-A,TypeIII-D?,TypeIII-C,TypeIII-B,TypeIII-D	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	csx21|147aa|up_9|NZ_AP018172.1_734943_735384_-,csx19|160aa|up_7|NZ_AP018172.1_736517_736997_-,PD-DExK|207aa|up_5|NZ_AP018172.1_737983_738604_-,csm2gr11|138aa|up_3|NZ_AP018172.1_739507_739921_-,NA|104aa|down_0|NZ_AP018172.1_746839_747151_+,NA|79aa|down_1|NZ_AP018172.1_747417_747654_+,NA|105aa|down_3|NZ_AP018172.1_749290_749605_-	csx21|147aa|up_9|NZ_AP018172.1_734943_735384_-	NA	csm3gr7|361aa|up_8|NZ_AP018172.1_735383_736466_-	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csx19|160aa|up_7|NZ_AP018172.1_736517_736997_-	NA	csm3gr7|323aa|up_6|NZ_AP018172.1_736984_737953_-	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	PD-DExK|207aa|up_5|NZ_AP018172.1_737983_738604_-	NA	csm3gr7|296aa|up_4|NZ_AP018172.1_738614_739502_-	cd09683, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|138aa|up_3|NZ_AP018172.1_739507_739921_-	NA	csx10gr5|429aa|up_2|NZ_AP018172.1_739917_741204_-	TIGR02674, cas_cyan_RAMP_2, CRISPR-associated RAMP protein, Csx10 family	csm3gr7|241aa|up_1|NZ_AP018172.1_741200_741923_-	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas10|796aa|up_0|NZ_AP018172.1_741919_744307_-	TIGR02577, thermophile-specific_DNA_repair_system, CRISPR-associated protein Cas10/Cmr2, subtype III-B	NA|104aa|down_0|NZ_AP018172.1_746839_747151_+	NA	NA|79aa|down_1|NZ_AP018172.1_747417_747654_+	NA	NA|439aa|down_2|NZ_AP018172.1_747979_749296_+	PRK05335, PRK05335, tRNA (uracil-5-)-methyltransferase Gid; Reviewed	NA|105aa|down_3|NZ_AP018172.1_749290_749605_-	NA	NA|400aa|down_4|NZ_AP018172.1_749805_751005_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|76aa|down_5|NZ_AP018172.1_751428_751656_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|73aa|down_6|NZ_AP018172.1_751648_751867_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|267aa|down_7|NZ_AP018172.1_752294_753095_-	COG1723, COG1723, Uncharacterized conserved protein [Function unknown]	NA|195aa|down_8|NZ_AP018172.1_753224_753809_-	pfam11937, DUF3455, Protein of unknown function (DUF3455)	NA|451aa|down_9|NZ_AP018172.1_753949_755302_-	COG1075, LipA, Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold [General function prediction only]
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	2	1028596-1029237	3,2,2	PILER-CR,CRISPRCasFinder,CRT	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	GTTTCCACGAATTATTACCCCGCAAGGGG-------ATTGAAAC,GTTTCCACGAATTATTACCCCGCAAGGGGATTGAAACATT,GTTTCCACGAATTATTACCCCGCAAGGGGATTGAAAC	44,40,37	0	0	NA	NA	NA:III-A:III-A	8,8,8	8	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|80aa|up_9|NZ_AP018172.1_1017693_1017933_+,NA|263aa|up_8|NZ_AP018172.1_1017948_1018737_+,NA|71aa|up_1|NZ_AP018172.1_1027112_1027325_+,NA|97aa|down_2|NZ_AP018172.1_1030961_1031252_-,NA|155aa|down_4|NZ_AP018172.1_1031888_1032353_-	NA|80aa|up_9|NZ_AP018172.1_1017693_1017933_+	NA	NA|263aa|up_8|NZ_AP018172.1_1017948_1018737_+	NA	NA|310aa|up_7|NZ_AP018172.1_1018832_1019762_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|473aa|up_6|NZ_AP018172.1_1020191_1021610_-	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|108aa|up_5|NZ_AP018172.1_1022045_1022369_-	PRK02237, PRK02237, YnfA family protein	NA|533aa|up_4|NZ_AP018172.1_1022504_1024103_-	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|173aa|up_3|NZ_AP018172.1_1024888_1025407_-	PLN02948, PLN02948, phosphoribosylaminoimidazole carboxylase	NA|392aa|up_2|NZ_AP018172.1_1025486_1026662_+	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|71aa|up_1|NZ_AP018172.1_1027112_1027325_+	NA	NA|182aa|up_0|NZ_AP018172.1_1027593_1028139_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|228aa|down_0|NZ_AP018172.1_1029318_1030002_-	COG2020, STE14, Putative protein-S-isoprenylcysteine methyltransferase [Posttranslational modification, protein turnover, chaperones]	NA|314aa|down_1|NZ_AP018172.1_1030023_1030965_-	TIGR02957, putative_sigma_factor, RNA polymerase sigma-70 factor, TIGR02957 family	NA|97aa|down_2|NZ_AP018172.1_1030961_1031252_-	NA	NA|160aa|down_3|NZ_AP018172.1_1031316_1031796_-	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|155aa|down_4|NZ_AP018172.1_1031888_1032353_-	NA	NA|228aa|down_5|NZ_AP018172.1_1032355_1033039_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|115aa|down_6|NZ_AP018172.1_1033154_1033499_-	pfam12487, DUF3703, Protein of unknown function (DUF3703)	NA|208aa|down_7|NZ_AP018172.1_1033514_1034138_-	cd03140, GATase1_PfpI_3, Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus	NA|599aa|down_8|NZ_AP018172.1_1034256_1036053_-	COG2909, MalT, ATP-dependent transcriptional regulator [Transcription]	NA|262aa|down_9|NZ_AP018172.1_1036152_1036938_-	COG2909, MalT, ATP-dependent transcriptional regulator [Transcription]
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	3	1362234-1362623	4,3,3	PILER-CR,CRISPRCasFinder,CRT	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	GTTTTCAATTAATTACCCCTCACGGGGATGGAAAC,GTTTTCAATTAATTACCCCTCACGGGGATGGAAAC,GTTTTCAATTAATTACCCCTCACGGGGATGGAAAC	35,35,35	0	0	NA	NA	NA:NA:NA	4,5,5	5	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|131aa|up_9|NZ_AP018172.1_1347371_1347764_+,NA|360aa|up_2|NZ_AP018172.1_1357756_1358836_-,NA|132aa|down_0|NZ_AP018172.1_1363178_1363574_-,NA|62aa|down_8|NZ_AP018172.1_1377227_1377413_+	NA|131aa|up_9|NZ_AP018172.1_1347371_1347764_+	NA	NA|232aa|up_8|NZ_AP018172.1_1348077_1348773_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|321aa|up_7|NZ_AP018172.1_1348868_1349831_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|450aa|up_6|NZ_AP018172.1_1349846_1351196_+	pfam01882, DUF58, Protein of unknown function DUF58	NA|673aa|up_5|NZ_AP018172.1_1351192_1353211_+	pfam01841, Transglut_core, Transglutaminase-like superfamily	NA|314aa|up_4|NZ_AP018172.1_1353243_1354185_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|906aa|up_3|NZ_AP018172.1_1354284_1357002_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|360aa|up_2|NZ_AP018172.1_1357756_1358836_-	NA	NA|510aa|up_1|NZ_AP018172.1_1359115_1360645_-	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|274aa|up_0|NZ_AP018172.1_1360673_1361495_-	cd08932, HetN_like_SDR_c, HetN oxidoreductase-like, classical (c) SDR	NA|132aa|down_0|NZ_AP018172.1_1363178_1363574_-	NA	NA|698aa|down_1|NZ_AP018172.1_1364276_1366370_-	TIGR02813, omega-3_polyunsaturated_fatty_acid_synthase_PfaA, polyketide-type polyunsaturated fatty acid synthase PfaA	NA|488aa|down_2|NZ_AP018172.1_1366676_1368140_+	cd07473, Peptidases_S8_Subtilisin_like, Peptidase S8 family domain in Subtilisin-like proteins	NA|499aa|down_3|NZ_AP018172.1_1368385_1369882_-	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|292aa|down_4|NZ_AP018172.1_1369971_1370847_-	cd05243, SDR_a5, atypical (a) SDRs, subgroup 5	NA|1276aa|down_5|NZ_AP018172.1_1371016_1374844_+	COG3920, COG3920, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|261aa|down_6|NZ_AP018172.1_1374869_1375652_-	cd09086, ExoIII-like_AP-endo, Escherichia coli exonuclease III (ExoIII) and Neisseria meningitides NExo-like subfamily of the ExoIII family purinic/apyrimidinic (AP) endonucleases	NA|356aa|down_7|NZ_AP018172.1_1376103_1377171_+	cd03802, GT4_AviGT4-like, UDP-Glc:tetrahydrobiopterin alpha-glucosyltransferase and similar proteins	NA|62aa|down_8|NZ_AP018172.1_1377227_1377413_+	NA	NA|343aa|down_9|NZ_AP018172.1_1377552_1378581_+	cd08235, iditol_2_DH_like, L-iditol 2-dehydrogenase
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	4	2019125-2019202	4	CRISPRCasFinder	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	TTTGATTGACTGGTACTCAACCAAAG	26	0	0	NA	NA	NA	1	1	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA,NA|63aa|down_0|NZ_AP018172.1_2019237_2019426_+,NA|181aa|down_3|NZ_AP018172.1_2021820_2022363_+,NA|161aa|down_4|NZ_AP018172.1_2022443_2022926_+,NA|86aa|down_7|NZ_AP018172.1_2028221_2028479_+,NA|79aa|down_8|NZ_AP018172.1_2028709_2028946_+,NA|155aa|down_9|NZ_AP018172.1_2029250_2029715_-	NA|292aa|up_9|NZ_AP018172.1_2010353_2011229_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|168aa|up_8|NZ_AP018172.1_2011323_2011827_-	pfam13924, Lipocalin_5, Lipocalin-like domain	NA|295aa|up_7|NZ_AP018172.1_2011870_2012755_-	COG2230, Cfa, Cyclopropane fatty acid synthase and related methyltransferases [Cell envelope biogenesis, outer membrane]	NA|104aa|up_6|NZ_AP018172.1_2012794_2013106_-	smart00823, PKS_PP, Phosphopantetheine attachment site	NA|278aa|up_5|NZ_AP018172.1_2013334_2014168_-	cd03506, Delta6-FADS-like, The Delta6 Fatty Acid Desaturase (Delta6-FADS)-like CD includes the integral-membrane enzymes: delta-4, delta-5, delta-6, delta-8, delta-8-sphingolipid, and delta-11 desaturases found in vertebrates, higher plants, fungi, and bacteria	NA|323aa|up_4|NZ_AP018172.1_2014408_2015377_-	COG1398, OLE1, Fatty-acid desaturase [Lipid metabolism]	NA|589aa|up_3|NZ_AP018172.1_2015403_2017170_-	cd05931, FAAL, Fatty acyl-AMP ligase (FAAL)	NA|247aa|up_2|NZ_AP018172.1_2017166_2017907_-	COG5031, COQ4, Uncharacterized protein involved in ubiquinone biosynthesis [Coenzyme metabolism]	NA|118aa|up_1|NZ_AP018172.1_2018158_2018512_+	pfam03551, PadR, Transcriptional regulator PadR-like family	NA|198aa|up_0|NZ_AP018172.1_2018516_2019110_+	pfam07767, Nop53, Nop53 (60S ribosomal biogenesis)	NA|63aa|down_0|NZ_AP018172.1_2019237_2019426_+	NA	NA|131aa|down_1|NZ_AP018172.1_2019600_2019993_-	cd17552, REC_RR468-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator RR468 and similar domains	NA|510aa|down_2|NZ_AP018172.1_2020098_2021628_-	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|181aa|down_3|NZ_AP018172.1_2021820_2022363_+	NA	NA|161aa|down_4|NZ_AP018172.1_2022443_2022926_+	NA	NA|73aa|down_5|NZ_AP018172.1_2023065_2023284_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|1566aa|down_6|NZ_AP018172.1_2023324_2028022_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|86aa|down_7|NZ_AP018172.1_2028221_2028479_+	NA	NA|79aa|down_8|NZ_AP018172.1_2028709_2028946_+	NA	NA|155aa|down_9|NZ_AP018172.1_2029250_2029715_-	NA
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	5	2447850-2448152	5	PILER-CR	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	TGCACTCCCACTGCGGCGTGCAGTGGGCGATAGACGTGGCGAAGCTACTACATTGACTAATAT	63	0	0	NA	NA	NA	2	2	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|72aa|up_2|NZ_AP018172.1_2441683_2441899_+,NA|164aa|down_4|NZ_AP018172.1_2455073_2455565_-	NA|184aa|up_9|NZ_AP018172.1_2430982_2431534_-	COG3575, COG3575, Uncharacterized protein conserved in bacteria [Function unknown]	NA|422aa|up_8|NZ_AP018172.1_2431683_2432949_+	pfam07929, PRiA4_ORF3, Plasmid pRiA4b ORF-3-like protein	NA|152aa|up_7|NZ_AP018172.1_2432986_2433442_-	pfam12158, DUF3592, Protein of unknown function (DUF3592)	NA|647aa|up_6|NZ_AP018172.1_2433486_2435427_-	PRK05667, dnaG, DNA primase; Validated	NA|451aa|up_5|NZ_AP018172.1_2435605_2436958_+	COG0860, AmiC, N-acetylmuramoyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]	NA|530aa|up_4|NZ_AP018172.1_2438437_2440027_+	PRK09424, pntA, Re/Si-specific NAD(P)(+) transhydrogenase subunit alpha	NA|483aa|up_3|NZ_AP018172.1_2440036_2441485_+	pfam02233, PNTB, NAD(P) transhydrogenase beta subunit	NA|72aa|up_2|NZ_AP018172.1_2441683_2441899_+	NA	NA|139aa|up_1|NZ_AP018172.1_2441891_2442308_+	cd00303, retropepsin_like, Retropepsins; pepsin-like aspartate proteases	NA|1067aa|up_0|NZ_AP018172.1_2442766_2445967_+	cd09914, RocCOR, Ras of complex proteins (Roc) C-terminal of Roc (COR) domain family	NA|210aa|down_0|NZ_AP018172.1_2450306_2450936_-	PRK00076, recR, recombination protein RecR; Reviewed	NA|198aa|down_1|NZ_AP018172.1_2451235_2451829_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|178aa|down_2|NZ_AP018172.1_2451967_2452501_+	PRK12703, PRK12703, tRNA 2'-O-methylase; Reviewed	NA|766aa|down_3|NZ_AP018172.1_2452560_2454858_-	cd06160, S2P-M50_like_2, Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|164aa|down_4|NZ_AP018172.1_2455073_2455565_-	NA	NA|306aa|down_5|NZ_AP018172.1_2455657_2456575_-	COG1176, PotB, ABC-type spermidine/putrescine transport system, permease component I [Amino acid transport and metabolism]	NA|362aa|down_6|NZ_AP018172.1_2456611_2457697_-	cd13590, PBP2_PotD_PotF_like, The periplasmic-binding component of ABC transporters involved in uptake of polyamines; possess the type 2 periplasmic binding fold	NA|376aa|down_7|NZ_AP018172.1_2458094_2459222_-	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|615aa|down_8|NZ_AP018172.1_2459434_2461279_+	TIGR03423, pbp2_mrdA, penicillin-binding protein 2	NA|607aa|down_9|NZ_AP018172.1_2461470_2463291_+	TIGR03423, pbp2_mrdA, penicillin-binding protein 2
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	6	2591266-2591387	5	CRISPRCasFinder	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	GGTGACATTACTATCACTTCAGGCTCTCTCTCCCTCAC	38	1	1	2591304-2591349	NZ_AP018172.1_2592100-2592145	NA	1	1	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA,NA|154aa|down_2|NZ_AP018172.1_2596068_2596530_+,NA|62aa|down_8|NZ_AP018172.1_2605990_2606176_-	NA|642aa|up_9|NZ_AP018172.1_2573754_2575680_+	COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding protein) [Cell envelope biogenesis, outer membrane]	NA|226aa|up_8|NZ_AP018172.1_2575721_2576399_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|248aa|up_7|NZ_AP018172.1_2576894_2577638_-	cd15244, 7tm_bacteriorhodopsin, light-driven outward proton pump bacteriorhodopsin, member of the seven-transmembrane GPCR superfamily	NA|246aa|up_6|NZ_AP018172.1_2577834_2578572_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|443aa|up_5|NZ_AP018172.1_2578584_2579913_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|868aa|up_4|NZ_AP018172.1_2580459_2583063_+	cd01031, EriC, ClC chloride channel EriC	NA|238aa|up_3|NZ_AP018172.1_2583286_2584000_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|629aa|up_2|NZ_AP018172.1_2584028_2585915_+	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|219aa|up_1|NZ_AP018172.1_2586385_2587042_+	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|951aa|up_0|NZ_AP018172.1_2587097_2589950_-	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|798aa|down_0|NZ_AP018172.1_2592112_2594506_+	COG3210, FhaB, Large exoproteins involved in heme utilization or adhesion [Intracellular trafficking and secretion]	NA|273aa|down_1|NZ_AP018172.1_2595131_2595950_+	cd05382, CAP_GAPR1-like, CAP (cysteine-rich secretory proteins, antigen 5, and pathogenesis-related 1 proteins) domain of Golgi-associated plant pathogenesis-related protein 1 and similar proteins	NA|154aa|down_2|NZ_AP018172.1_2596068_2596530_+	NA	NA|574aa|down_3|NZ_AP018172.1_2596859_2598581_+	COG2831, FhaC, Hemolysin activation/secretion protein [Intracellular trafficking and secretion]	NA|1229aa|down_4|NZ_AP018172.1_2598686_2602373_-	PLN03241, PLN03241, magnesium chelatase subunit H; Provisional	NA|676aa|down_5|NZ_AP018172.1_2602610_2604638_-	pfam11335, DUF3137, Protein of unknown function (DUF3137)	NA|181aa|down_6|NZ_AP018172.1_2604647_2605190_-	pfam04011, LemA, LemA family	NA|78aa|down_7|NZ_AP018172.1_2605638_2605872_+	pfam14119, DUF4288, Domain of unknown function (DUF4288)	NA|62aa|down_8|NZ_AP018172.1_2605990_2606176_-	NA	NA|260aa|down_9|NZ_AP018172.1_2606276_2607056_-	pfam07444, Ycf66_N, Ycf66 protein N-terminus
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	7	2730193-2730466	6,4	CRISPRCasFinder,CRT	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	TGTTTCAGTCCCCTTGCGGGGAAAT,TGTTTCAGTCCCCTTGCGGGGAAATTAGTTATGGAAAC	25,38	0	0	NA	NA	NA:NA	3,3	3	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|70aa|up_0|NZ_AP018172.1_2729539_2729749_+,NA|726aa|down_0|NZ_AP018172.1_2730750_2732928_-,NA|152aa|down_3|NZ_AP018172.1_2735917_2736373_+,NA|74aa|down_7|NZ_AP018172.1_2749610_2749832_-	NA|1607aa|up_9|NZ_AP018172.1_2714156_2718977_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|327aa|up_8|NZ_AP018172.1_2719124_2720105_-	cd19076, AKR_AKR13A_13D, AKR13A and AKR13D families of aldo-keto reductase (AKR)	NA|504aa|up_7|NZ_AP018172.1_2720252_2721764_-	cd07151, ALDH_HBenzADH, NADP+-dependent p-hydroxybenzaldehyde dehydrogenase-like	NA|239aa|up_6|NZ_AP018172.1_2721938_2722655_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|545aa|up_5|NZ_AP018172.1_2722789_2724424_+	PRK05290, PRK05290, hybrid cluster protein; Provisional	NA|407aa|up_4|NZ_AP018172.1_2724481_2725702_-	pfam13354, Beta-lactamase2, Beta-lactamase enzyme family	NA|113aa|up_3|NZ_AP018172.1_2726168_2726507_+	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|571aa|up_2|NZ_AP018172.1_2726637_2728350_-	COG0561, Cof, Predicted hydrolases of the HAD superfamily [General function prediction only]	NA|287aa|up_1|NZ_AP018172.1_2728680_2729541_-	TIGR03763, conserved_hypothetical_protein, cyanoexosortase A	NA|70aa|up_0|NZ_AP018172.1_2729539_2729749_+	NA	NA|726aa|down_0|NZ_AP018172.1_2730750_2732928_-	NA	NA|437aa|down_1|NZ_AP018172.1_2733015_2734326_+	PRK02507, PRK02507, proton extrusion protein PcxA; Provisional	NA|436aa|down_2|NZ_AP018172.1_2734377_2735685_+	COG2821, MltA, Membrane-bound lytic murein transglycosylase [Cell envelope biogenesis, outer membrane]	NA|152aa|down_3|NZ_AP018172.1_2735917_2736373_+	NA	NA|196aa|down_4|NZ_AP018172.1_2736470_2737058_+	pfam13023, HD_3, HD domain	NA|284aa|down_5|NZ_AP018172.1_2737136_2737988_+	TIGR02821, S-formylglutathione_hydrolase, S-formylglutathione hydrolase	NA|3427aa|down_6|NZ_AP018172.1_2738159_2748440_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|74aa|down_7|NZ_AP018172.1_2749610_2749832_-	NA	NA|215aa|down_8|NZ_AP018172.1_2750377_2751022_+	cd04182, GT_2_like_f, GT_2_like_f is a subfamily of the glycosyltransferase family 2 (GT-2) with unknown function	NA|2000aa|down_9|NZ_AP018172.1_2751327_2757327_+	COG3899, COG3899, Predicted ATPase [General function prediction only]
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	8	3086392-3086944	6,7,5	PILER-CR,CRISPRCasFinder,CRT	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	GTTTCTACCAACCATTTCCCCGCAAGGGG-------ACTGAAAC,GTTTCTACCAACCATTTCCCCGCAAGGGGACTGAAAC,GTTTCTACCAACCATTTCCCCGCAAGGGGACTGAAAC	44,37,37	0	0	NA	NA	NA:NA:NA	5,7,7	7	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA,NA|189aa|down_3|NZ_AP018172.1_3090850_3091417_-	NA|298aa|up_9|NZ_AP018172.1_3073898_3074792_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|258aa|up_8|NZ_AP018172.1_3074806_3075580_-	COG0842, COG0842, ABC-type multidrug transport system, permease component [Defense mechanisms]	NA|201aa|up_7|NZ_AP018172.1_3075591_3076194_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|648aa|up_6|NZ_AP018172.1_3076556_3078500_+	COG3349, COG3349, Uncharacterized conserved protein [Function unknown]	NA|107aa|up_5|NZ_AP018172.1_3078552_3078873_-	COG2076, EmrE, Membrane transporters of cations and cationic drugs [Inorganic ion transport and metabolism]	NA|373aa|up_4|NZ_AP018172.1_3078979_3080098_-	COG2205, KdpD, Osmosensitive K+ channel histidine kinase [Signal transduction mechanisms]	NA|792aa|up_3|NZ_AP018172.1_3080400_3082776_-	sd00006, TPR, Tetratricopeptide repeat	NA|302aa|up_2|NZ_AP018172.1_3082897_3083803_-	pfam18863, AbiJ_NTD4, AbiJ N-terminal domain 4	NA|306aa|up_1|NZ_AP018172.1_3084089_3085007_+	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|249aa|up_0|NZ_AP018172.1_3085248_3085995_+	PRK14832, PRK14832, undecaprenyl pyrophosphate synthase; Provisional	NA|501aa|down_0|NZ_AP018172.1_3087248_3088751_+	PLN02518, PLN02518, pheophorbide a oxygenase	NA|299aa|down_1|NZ_AP018172.1_3088942_3089839_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|141aa|down_2|NZ_AP018172.1_3090273_3090696_-	pfam11218, DUF3011, Protein of unknown function (DUF3011)	NA|189aa|down_3|NZ_AP018172.1_3090850_3091417_-	NA	NA|450aa|down_4|NZ_AP018172.1_3091440_3092790_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|630aa|down_5|NZ_AP018172.1_3092883_3094773_-	pfam00305, Lipoxygenase, Lipoxygenase	NA|128aa|down_6|NZ_AP018172.1_3095116_3095500_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|301aa|down_7|NZ_AP018172.1_3095649_3096552_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|503aa|down_8|NZ_AP018172.1_3096864_3098373_+	COG3349, COG3349, Uncharacterized conserved protein [Function unknown]	NA|472aa|down_9|NZ_AP018172.1_3098580_3099996_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	9	3739789-3739891	8	CRISPRCasFinder	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	CAGTTTTAAAGAATTATTGTCCA	23	0	0	NA	NA	NA	1	1	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|101aa|up_9|NZ_AP018172.1_3729788_3730091_+,NA|128aa|down_7|NZ_AP018172.1_3751502_3751886_-,NA|132aa|down_8|NZ_AP018172.1_3752000_3752396_-	NA|101aa|up_9|NZ_AP018172.1_3729788_3730091_+	NA	NA|458aa|up_8|NZ_AP018172.1_3730115_3731489_-	PRK09201, PRK09201, AtzE family amidohydrolase	NA|63aa|up_7|NZ_AP018172.1_3731485_3731674_-	pfam13318, DUF4089, Protein of unknown function (DUF4089)	NA|117aa|up_6|NZ_AP018172.1_3731711_3732062_-	COG3502, COG3502, Uncharacterized protein conserved in bacteria [Function unknown]	NA|506aa|up_5|NZ_AP018172.1_3732316_3733834_-	PLN02919, PLN02919, haloacid dehalogenase-like hydrolase family protein	NA|459aa|up_4|NZ_AP018172.1_3734044_3735421_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|294aa|up_3|NZ_AP018172.1_3735580_3736462_-	pfam14243, DUF4343, Domain of unknown function (DUF4343)	NA|394aa|up_2|NZ_AP018172.1_3736562_3737744_-	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|297aa|up_1|NZ_AP018172.1_3738045_3738936_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|242aa|up_0|NZ_AP018172.1_3738932_3739658_+	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|338aa|down_0|NZ_AP018172.1_3741238_3742252_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|111aa|down_1|NZ_AP018172.1_3742522_3742855_+	COG2852, COG2852, Very-short-patch-repair endonuclease [Replication, recombination,    and repair]	NA|277aa|down_2|NZ_AP018172.1_3743137_3743968_-	cd00325, chitinase_GH19, Glycoside hydrolase family 19, chitinase domain	NA|183aa|down_3|NZ_AP018172.1_3743990_3744539_-	cd14667, 3D_containing_proteins, Non-mltA associated 3D domain containing proteins, named for 3 conserved aspartate residues	NA|579aa|down_4|NZ_AP018172.1_3744685_3746422_-	cd14490, CBM6-CBM35-CBM36_like_1, uncharacterized members of the carbohydrate binding module 6 (CBM6) and CBM35_like superfamily	NA|491aa|down_5|NZ_AP018172.1_3746472_3747945_-	pfam11721, Malectin, Di-glucose binding within endoplasmic reticulum	NA|317aa|down_6|NZ_AP018172.1_3750109_3751060_-	cd05292, LDH_2, A subgroup of L-lactate dehydrogenases	NA|128aa|down_7|NZ_AP018172.1_3751502_3751886_-	NA	NA|132aa|down_8|NZ_AP018172.1_3752000_3752396_-	NA	NA|346aa|down_9|NZ_AP018172.1_3752520_3753558_-	pfam08894, DUF1838, Protein of unknown function (DUF1838)
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	10	3740250-3740506	6	CRT	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	CTTTCAACCCACCACCGAGCGNNNGGAGGGTTATTGAAA	39	0	0	NA	NA	NA	3	3	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|101aa|up_9|NZ_AP018172.1_3729788_3730091_+,NA|128aa|down_7|NZ_AP018172.1_3751502_3751886_-,NA|132aa|down_8|NZ_AP018172.1_3752000_3752396_-	NA|101aa|up_9|NZ_AP018172.1_3729788_3730091_+	NA	NA|458aa|up_8|NZ_AP018172.1_3730115_3731489_-	PRK09201, PRK09201, AtzE family amidohydrolase	NA|63aa|up_7|NZ_AP018172.1_3731485_3731674_-	pfam13318, DUF4089, Protein of unknown function (DUF4089)	NA|117aa|up_6|NZ_AP018172.1_3731711_3732062_-	COG3502, COG3502, Uncharacterized protein conserved in bacteria [Function unknown]	NA|506aa|up_5|NZ_AP018172.1_3732316_3733834_-	PLN02919, PLN02919, haloacid dehalogenase-like hydrolase family protein	NA|459aa|up_4|NZ_AP018172.1_3734044_3735421_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|294aa|up_3|NZ_AP018172.1_3735580_3736462_-	pfam14243, DUF4343, Domain of unknown function (DUF4343)	NA|394aa|up_2|NZ_AP018172.1_3736562_3737744_-	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|297aa|up_1|NZ_AP018172.1_3738045_3738936_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|242aa|up_0|NZ_AP018172.1_3738932_3739658_+	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|338aa|down_0|NZ_AP018172.1_3741238_3742252_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|111aa|down_1|NZ_AP018172.1_3742522_3742855_+	COG2852, COG2852, Very-short-patch-repair endonuclease [Replication, recombination,    and repair]	NA|277aa|down_2|NZ_AP018172.1_3743137_3743968_-	cd00325, chitinase_GH19, Glycoside hydrolase family 19, chitinase domain	NA|183aa|down_3|NZ_AP018172.1_3743990_3744539_-	cd14667, 3D_containing_proteins, Non-mltA associated 3D domain containing proteins, named for 3 conserved aspartate residues	NA|579aa|down_4|NZ_AP018172.1_3744685_3746422_-	cd14490, CBM6-CBM35-CBM36_like_1, uncharacterized members of the carbohydrate binding module 6 (CBM6) and CBM35_like superfamily	NA|491aa|down_5|NZ_AP018172.1_3746472_3747945_-	pfam11721, Malectin, Di-glucose binding within endoplasmic reticulum	NA|317aa|down_6|NZ_AP018172.1_3750109_3751060_-	cd05292, LDH_2, A subgroup of L-lactate dehydrogenases	NA|128aa|down_7|NZ_AP018172.1_3751502_3751886_-	NA	NA|132aa|down_8|NZ_AP018172.1_3752000_3752396_-	NA	NA|346aa|down_9|NZ_AP018172.1_3752520_3753558_-	pfam08894, DUF1838, Protein of unknown function (DUF1838)
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	11	3854048-3854449	7,7	PILER-CR,CRT	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	GTTTCCA---------TCCCCGTGAGGGGTAATGATTTGAAAAC,GTTTTCAAANCATTACCCCTCACGGGGATGGAAAC	44,35	0	0	NA	NA	NA:NA	5,5	5	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|346aa|up_3|NZ_AP018172.1_3850562_3851600_+,NA|394aa|up_2|NZ_AP018172.1_3851627_3852809_+,NA|86aa|up_1|NZ_AP018172.1_3852958_3853216_+,NA|128aa|up_0|NZ_AP018172.1_3853239_3853623_-,NA|120aa|down_0|NZ_AP018172.1_3854672_3855032_+,NA|121aa|down_9|NZ_AP018172.1_3861515_3861878_+	NA|194aa|up_9|NZ_AP018172.1_3843300_3843882_+	cd12130, Apl, Allophycocyanin-like globins	NA|247aa|up_8|NZ_AP018172.1_3844060_3844801_-	PRK02816, PRK02816, phycocyanobilin:ferredoxin oxidoreductase; Validated	NA|215aa|up_7|NZ_AP018172.1_3844889_3845534_-	pfam05685, Uma2, Putative restriction endonuclease	NA|648aa|up_6|NZ_AP018172.1_3845644_3847588_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|125aa|up_5|NZ_AP018172.1_3847596_3847971_-	cd17552, REC_RR468-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator RR468 and similar domains	NA|706aa|up_4|NZ_AP018172.1_3847980_3850098_-	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|346aa|up_3|NZ_AP018172.1_3850562_3851600_+	NA	NA|394aa|up_2|NZ_AP018172.1_3851627_3852809_+	NA	NA|86aa|up_1|NZ_AP018172.1_3852958_3853216_+	NA	NA|128aa|up_0|NZ_AP018172.1_3853239_3853623_-	NA	NA|120aa|down_0|NZ_AP018172.1_3854672_3855032_+	NA	NA|156aa|down_1|NZ_AP018172.1_3855300_3855768_-	pfam09150, Carot_N, Orange carotenoid protein, N-terminal	NA|149aa|down_2|NZ_AP018172.1_3856022_3856469_+	pfam10847, DUF2656, Protein of unknown function (DUF2656)	NA|235aa|down_3|NZ_AP018172.1_3856501_3857206_+	PHA03100, PHA03100, ankyrin repeat protein; Provisional	NA|287aa|down_4|NZ_AP018172.1_3857584_3858445_+	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|250aa|down_5|NZ_AP018172.1_3858484_3859234_+	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|253aa|down_6|NZ_AP018172.1_3859334_3860093_+	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|177aa|down_7|NZ_AP018172.1_3860190_3860721_+	cd19433, lipocalin_CpcS-CpeS, CpcS/CpeS phycobiliprotein lyase family	NA|208aa|down_8|NZ_AP018172.1_3860914_3861538_+	pfam06206, CpeT, CpeT/CpcT family (DUF1001)	NA|121aa|down_9|NZ_AP018172.1_3861515_3861878_+	NA
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	12	4587718-4587919	9	CRISPRCasFinder	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	GATTTACTGAAAGGAGAAAGTATTCAGGCATTTGAAGAAAGTGATATGCCTTTTT	55	0	0	NA	NA	NA	1	1	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA,NA|189aa|down_5|NZ_AP018172.1_4599162_4599729_-	NA|435aa|up_9|NZ_AP018172.1_4572593_4573898_+	pfam13433, Peripla_BP_5, Periplasmic binding protein domain	NA|387aa|up_8|NZ_AP018172.1_4574001_4575162_+	TIGR03409, urea_trans_UrtB, urea ABC transporter, permease protein UrtB	NA|374aa|up_7|NZ_AP018172.1_4575183_4576305_+	TIGR03408, urea_trans_UrtC, urea ABC transporter, permease protein UrtC	NA|249aa|up_6|NZ_AP018172.1_4576279_4577026_+	TIGR03411, urea_trans_UrtD, urea ABC transporter, ATP-binding protein UrtD	NA|233aa|up_5|NZ_AP018172.1_4577107_4577806_+	TIGR03410, urea_trans_UrtE, urea ABC transporter, ATP-binding protein UrtE	NA|596aa|up_4|NZ_AP018172.1_4577819_4579607_-	cd08500, PBP2_NikA_DppA_OppA_like_4, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|507aa|up_3|NZ_AP018172.1_4579828_4581349_-	COG1626, TreA, Neutral trehalase [Carbohydrate transport and metabolism]	NA|931aa|up_2|NZ_AP018172.1_4581489_4584282_-	COG3280, TreY, Maltooligosyl trehalose synthase [Carbohydrate transport and metabolism]	NA|616aa|up_1|NZ_AP018172.1_4584397_4586245_-	TIGR02402, Malto-oligosyltrehalose_trehalohydrolase, malto-oligosyltrehalose trehalohydrolase	NA|305aa|up_0|NZ_AP018172.1_4586519_4587434_+	COG3386, COG3386, Gluconolactonase [Carbohydrate transport and metabolism]	NA|920aa|down_0|NZ_AP018172.1_4588115_4590875_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|747aa|down_1|NZ_AP018172.1_4590933_4593174_-	pfam02026, RyR, RyR domain	NA|137aa|down_2|NZ_AP018172.1_4593188_4593599_-	COG3011, COG3011, Predicted thiol-disulfide oxidoreductase [General function    prediction only]	NA|304aa|down_3|NZ_AP018172.1_4593757_4594669_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|1409aa|down_4|NZ_AP018172.1_4594828_4599055_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|189aa|down_5|NZ_AP018172.1_4599162_4599729_-	NA	NA|355aa|down_6|NZ_AP018172.1_4599770_4600835_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|351aa|down_7|NZ_AP018172.1_4601210_4602263_+	cd19080, AKR_AKR9A_9B, AKR9A and AKR9B families of aldo-keto reductase (AKR)	NA|190aa|down_8|NZ_AP018172.1_4602393_4602963_+	PRK09448, PRK09448, DNA starvation/stationary phase protection protein Dps; Provisional	NA|239aa|down_9|NZ_AP018172.1_4603409_4604126_+	cd02910, cupin_Yhhw_N, Escherichia coli YhhW and YhaK and related proteins, pirin-like bicupin, N-terminal cupin domain
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	13	4959962-4960072	10	CRISPRCasFinder	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	ACTTTCCGATCACATCGCCCCGAAAGGGGATGGAAAC	37	0	0	NA	NA	NA	1	1	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|146aa|up_8|NZ_AP018172.1_4948127_4948565_-,NA|168aa|up_6|NZ_AP018172.1_4949188_4949692_-,NA|131aa|up_1|NZ_AP018172.1_4958862_4959255_-,NA|112aa|up_0|NZ_AP018172.1_4959366_4959702_+,NA|246aa|down_2|NZ_AP018172.1_4963441_4964179_-	NA|99aa|up_9|NZ_AP018172.1_4947834_4948131_-	pfam06967, Mo-nitro_C, Mo-dependent nitrogenase C-terminus	NA|146aa|up_8|NZ_AP018172.1_4948127_4948565_-	NA	NA|125aa|up_7|NZ_AP018172.1_4948757_4949132_-	cd17552, REC_RR468-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator RR468 and similar domains	NA|168aa|up_6|NZ_AP018172.1_4949188_4949692_-	NA	NA|276aa|up_5|NZ_AP018172.1_4949747_4950575_-	pfam09234, DUF1963, Domain of unknown function (DUF1963)	NA|1347aa|up_4|NZ_AP018172.1_4950602_4954643_-	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|808aa|up_3|NZ_AP018172.1_4954667_4957091_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|425aa|up_2|NZ_AP018172.1_4957435_4958710_+	COG1134, TagH, ABC-type polysaccharide/polyol phosphate transport system, ATPase component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|131aa|up_1|NZ_AP018172.1_4958862_4959255_-	NA	NA|112aa|up_0|NZ_AP018172.1_4959366_4959702_+	NA	NA|97aa|down_0|NZ_AP018172.1_4960530_4960821_-	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|209aa|down_1|NZ_AP018172.1_4962178_4962805_+	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|246aa|down_2|NZ_AP018172.1_4963441_4964179_-	NA	NA|700aa|down_3|NZ_AP018172.1_4964297_4966397_-	pfam07602, DUF1565, Protein of unknown function (DUF1565)	NA|154aa|down_4|NZ_AP018172.1_4966731_4967193_-	COG3837, COG3837, Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]	NA|146aa|down_5|NZ_AP018172.1_4967182_4967620_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|216aa|down_6|NZ_AP018172.1_4967798_4968446_+	pfam05685, Uma2, Putative restriction endonuclease	NA|298aa|down_7|NZ_AP018172.1_4968429_4969323_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|79aa|down_8|NZ_AP018172.1_4969536_4969773_+	pfam09003, Arm-DNA-bind_1, Bacteriophage lambda integrase, Arm DNA-binding domain	NA|223aa|down_9|NZ_AP018172.1_4969781_4970450_+	cd03194, GST_C_3, C-terminal, alpha helical domain of an unknown subfamily 3 of Glutathione S-transferases
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	14	4961296-4962048	11,8,8	CRISPRCasFinder,PILER-CR,CRT	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	ACTTTCCGATCACATCGCCCCGAAAGGGGATGGAAAC,CTTTCCGATCACATCGCCCCGAAAGGGGATGGAAAC,CTTTCCGATCACATCGCCCCGAAAGGGGATGGAAAC	37,36,36	0	0	NA	NA	NA:NA:NA	10,9,10	10	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|146aa|up_9|NZ_AP018172.1_4948127_4948565_-,NA|168aa|up_7|NZ_AP018172.1_4949188_4949692_-,NA|131aa|up_2|NZ_AP018172.1_4958862_4959255_-,NA|112aa|up_1|NZ_AP018172.1_4959366_4959702_+,NA|246aa|down_1|NZ_AP018172.1_4963441_4964179_-	NA|146aa|up_9|NZ_AP018172.1_4948127_4948565_-	NA	NA|125aa|up_8|NZ_AP018172.1_4948757_4949132_-	cd17552, REC_RR468-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator RR468 and similar domains	NA|168aa|up_7|NZ_AP018172.1_4949188_4949692_-	NA	NA|276aa|up_6|NZ_AP018172.1_4949747_4950575_-	pfam09234, DUF1963, Domain of unknown function (DUF1963)	NA|1347aa|up_5|NZ_AP018172.1_4950602_4954643_-	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|808aa|up_4|NZ_AP018172.1_4954667_4957091_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|425aa|up_3|NZ_AP018172.1_4957435_4958710_+	COG1134, TagH, ABC-type polysaccharide/polyol phosphate transport system, ATPase component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|131aa|up_2|NZ_AP018172.1_4958862_4959255_-	NA	NA|112aa|up_1|NZ_AP018172.1_4959366_4959702_+	NA	NA|97aa|up_0|NZ_AP018172.1_4960530_4960821_-	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|209aa|down_0|NZ_AP018172.1_4962178_4962805_+	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|246aa|down_1|NZ_AP018172.1_4963441_4964179_-	NA	NA|700aa|down_2|NZ_AP018172.1_4964297_4966397_-	pfam07602, DUF1565, Protein of unknown function (DUF1565)	NA|154aa|down_3|NZ_AP018172.1_4966731_4967193_-	COG3837, COG3837, Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]	NA|146aa|down_4|NZ_AP018172.1_4967182_4967620_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|216aa|down_5|NZ_AP018172.1_4967798_4968446_+	pfam05685, Uma2, Putative restriction endonuclease	NA|298aa|down_6|NZ_AP018172.1_4968429_4969323_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|79aa|down_7|NZ_AP018172.1_4969536_4969773_+	pfam09003, Arm-DNA-bind_1, Bacteriophage lambda integrase, Arm DNA-binding domain	NA|223aa|down_8|NZ_AP018172.1_4969781_4970450_+	cd03194, GST_C_3, C-terminal, alpha helical domain of an unknown subfamily 3 of Glutathione S-transferases	NA|137aa|down_9|NZ_AP018172.1_4970596_4971007_-	cd17580, REC_2_DhkD-like, second phosphoacceptor receiver (REC) domain of Dictyostelium discoideum hybrid signal transduction histidine kinase D and similar domains
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	15	5282895-5283220	9,12,9	CRT,CRISPRCasFinder,PILER-CR	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	GTTTCANTCCCCTTGCGGGGTAATAGNTAGTGGAAAC,AGTCCCCTTGCGGGGTAATAGTT,GTTTCCACTAGCTATTACCCCGCAAGGGG-------ATTGAAAC	37,23,44	0	0	NA	NA	NA:NA:NA	4,4,3	4	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|135aa|up_9|NZ_AP018172.1_5270141_5270546_+,NA|104aa|up_4|NZ_AP018172.1_5279538_5279850_+,NA|68aa|down_6|NZ_AP018172.1_5292830_5293034_-	NA|135aa|up_9|NZ_AP018172.1_5270141_5270546_+	NA	NA|217aa|up_8|NZ_AP018172.1_5270566_5271217_-	COG4339, COG4339, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1323aa|up_7|NZ_AP018172.1_5271660_5275629_+	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|597aa|up_6|NZ_AP018172.1_5276254_5278045_-	COG1217, TypA, Predicted membrane GTPase involved in stress response [Signal transduction mechanisms]	NA|379aa|up_5|NZ_AP018172.1_5278296_5279433_-	COG2159, COG2159, Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]	NA|104aa|up_4|NZ_AP018172.1_5279538_5279850_+	NA	NA|450aa|up_3|NZ_AP018172.1_5279917_5281267_-	COG0174, GlnA, Glutamine synthetase [Amino acid transport and metabolism]	NA|217aa|up_2|NZ_AP018172.1_5281430_5282081_+	cd00956, Transaldolase_FSA, Transaldolase-like fructose-6-phosphate aldolases (FSA) found in bacteria and archaea	NA|69aa|up_1|NZ_AP018172.1_5282190_5282397_+	pfam01954, DUF104, Protein of unknown function DUF104	NA|143aa|up_0|NZ_AP018172.1_5282413_5282842_+	COG2402, COG2402, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|454aa|down_0|NZ_AP018172.1_5283676_5285038_+	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|375aa|down_1|NZ_AP018172.1_5285099_5286224_-	cd17534, REC_DC-like, phosphoacceptor receiver (REC) domain of modulated diguanylate cyclase and similar domains	NA|743aa|down_2|NZ_AP018172.1_5286237_5288466_-	PRK13560, PRK13560, hypothetical protein; Provisional	NA|429aa|down_3|NZ_AP018172.1_5289259_5290546_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|452aa|down_4|NZ_AP018172.1_5290593_5291949_+	PRK11274, glcF, glycolate oxidase subunit GlcF	NA|167aa|down_5|NZ_AP018172.1_5292159_5292660_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|68aa|down_6|NZ_AP018172.1_5292830_5293034_-	NA	NA|615aa|down_7|NZ_AP018172.1_5293228_5295073_-	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|242aa|down_8|NZ_AP018172.1_5295426_5296152_-	COG1177, PotC, ABC-type spermidine/putrescine transport system, permease component II [Amino acid transport and metabolism]	NA|362aa|down_9|NZ_AP018172.1_5296710_5297796_+	cd03802, GT4_AviGT4-like, UDP-Glc:tetrahydrobiopterin alpha-glucosyltransferase and similar proteins
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	16	5529936-5530075	13	CRISPRCasFinder	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	GCTATGGGAGTTGGTGGTGCTATTGGTGGTGTAGCATTTGGTGTAGCAGATGG	53	0	0	NA	NA	NA	1	1	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA,NA|85aa|down_2|NZ_AP018172.1_5535281_5535536_-,NA|178aa|down_5|NZ_AP018172.1_5537771_5538305_-	NA|293aa|up_9|NZ_AP018172.1_5517132_5518011_+	pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region	NA|327aa|up_8|NZ_AP018172.1_5518135_5519116_+	cd12828, TmCorA-like_1, Thermotoga maritima CorA_like subfamily	NA|724aa|up_7|NZ_AP018172.1_5519119_5521291_-	PRK05443, PRK05443, polyphosphate kinase; Provisional	NA|348aa|up_6|NZ_AP018172.1_5521572_5522616_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|184aa|up_5|NZ_AP018172.1_5523165_5523717_+	COG2323, COG2323, Predicted membrane protein [Function unknown]	NA|432aa|up_4|NZ_AP018172.1_5523757_5525053_+	PRK07380, PRK07380, adenylosuccinate lyase; Provisional	NA|188aa|up_3|NZ_AP018172.1_5525286_5525850_+	pfam13548, DUF4126, Domain of unknown function (DUF4126)	NA|350aa|up_2|NZ_AP018172.1_5525918_5526968_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|388aa|up_1|NZ_AP018172.1_5527054_5528218_-	PLN02449, PLN02449, ferrochelatase	NA|208aa|up_0|NZ_AP018172.1_5528311_5528935_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|509aa|down_0|NZ_AP018172.1_5531613_5533140_+	CHL00076, chlB, photochlorophyllide reductase subunit B	NA|445aa|down_1|NZ_AP018172.1_5533280_5534615_-	pfam00931, NB-ARC, NB-ARC domain	NA|85aa|down_2|NZ_AP018172.1_5535281_5535536_-	NA	NA|257aa|down_3|NZ_AP018172.1_5536089_5536860_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|261aa|down_4|NZ_AP018172.1_5536933_5537716_-	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|178aa|down_5|NZ_AP018172.1_5537771_5538305_-	NA	NA|154aa|down_6|NZ_AP018172.1_5538450_5538912_-	pfam13301, DUF4079, Protein of unknown function (DUF4079)	NA|132aa|down_7|NZ_AP018172.1_5539054_5539450_-	pfam08865, DUF1830, Domain of unknown function (DUF1830)	NA|530aa|down_8|NZ_AP018172.1_5540026_5541616_+	COG1032, COG1032, Fe-S oxidoreductase [Energy production and conversion]	NA|464aa|down_9|NZ_AP018172.1_5542014_5543406_+	COG4250, COG4250, Predicted sensor protein/domain [Signal transduction mechanisms]
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	17	5916777-5916866	14	CRISPRCasFinder	no	PD-DExK,Cas9_archaeal	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Unclear	ACCTTTTTACTCGAAGTTTCCTGTTCT	27	0	0	NA	NA	NA	1	1	Unclear	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|89aa|up_9|NZ_AP018172.1_5905574_5905841_-,NA|130aa|down_1|NZ_AP018172.1_5918187_5918577_-,NA|83aa|down_3|NZ_AP018172.1_5919192_5919441_-,NA|112aa|down_4|NZ_AP018172.1_5919512_5919848_+	NA|89aa|up_9|NZ_AP018172.1_5905574_5905841_-	NA	NA|374aa|up_8|NZ_AP018172.1_5905983_5907106_+	PRK00578, prfB, peptide chain release factor 2; Validated	NA|52aa|up_7|NZ_AP018172.1_5907287_5907443_+	pfam11688, DUF3285, Protein of unknown function (DUF3285)	NA|176aa|up_6|NZ_AP018172.1_5907485_5908013_+	PRK00016, PRK00016, metal-binding heat shock protein; Provisional	NA|154aa|up_5|NZ_AP018172.1_5908133_5908595_+	cd14265, UDPK_IM_like, Integral membrane undecaprenol kinase and similar enzymes	NA|200aa|up_4|NZ_AP018172.1_5908691_5909291_+	PRK05670, PRK05670, anthranilate synthase component II; Provisional	NA|256aa|up_3|NZ_AP018172.1_5909294_5910062_+	pfam13483, Lactamase_B_3, Beta-lactamase superfamily domain	NA|437aa|up_2|NZ_AP018172.1_5910307_5911618_-	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|436aa|up_1|NZ_AP018172.1_5912261_5913569_+	PRK10879, PRK10879, proline aminopeptidase P II; Provisional	NA|880aa|up_0|NZ_AP018172.1_5913962_5916602_+	pfam13424, TPR_12, Tetratricopeptide repeat	NA|292aa|down_0|NZ_AP018172.1_5917249_5918125_+	cd10917, CE4_NodB_like_6s_7s, Catalytic NodB homology domain of rhizobial NodB-like proteins	NA|130aa|down_1|NZ_AP018172.1_5918187_5918577_-	NA	Cas9_archaeal|173aa|down_2|NZ_AP018172.1_5918593_5919112_-	COG1403, McrA, Restriction endonuclease [Defense mechanisms]	NA|83aa|down_3|NZ_AP018172.1_5919192_5919441_-	NA	NA|112aa|down_4|NZ_AP018172.1_5919512_5919848_+	NA	NA|678aa|down_5|NZ_AP018172.1_5920636_5922670_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|196aa|down_6|NZ_AP018172.1_5922662_5923250_+	cd08866, SRPBCC_11, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|640aa|down_7|NZ_AP018172.1_5923819_5925739_+	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|210aa|down_8|NZ_AP018172.1_5925705_5926335_-	COG1290, QcrB, Cytochrome b subunit of the bc complex [Energy production and conversion]	NA|232aa|down_9|NZ_AP018172.1_5926424_5927120_-	COG1075, LipA, Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold [General function prediction only]
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	18	6744087-6744174	15	CRISPRCasFinder	no	PD-DExK	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Unclear	GCGATCTCAGAAACTGTAACTTCAGTGG	28	0	0	NA	NA	NA	1	1	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	PD-DExK|198aa|up_8|NZ_AP018172.1_6733345_6733939_-,NA|232aa|up_6|NZ_AP018172.1_6734919_6735615_+,NA|93aa|up_4|NZ_AP018172.1_6736095_6736374_-,NA|138aa|down_1|NZ_AP018172.1_6746423_6746837_+,NA|273aa|down_4|NZ_AP018172.1_6750564_6751383_-	NA|525aa|up_9|NZ_AP018172.1_6731600_6733175_+	PRK02546, PRK02546, NAD(P)H-quinone oxidoreductase subunit 4; Provisional	PD-DExK|198aa|up_8|NZ_AP018172.1_6733345_6733939_-	NA	NA|240aa|up_7|NZ_AP018172.1_6734073_6734793_+	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|232aa|up_6|NZ_AP018172.1_6734919_6735615_+	NA	NA|76aa|up_5|NZ_AP018172.1_6735824_6736052_+	COG3905, COG3905, Predicted transcriptional regulator [Transcription]	NA|93aa|up_4|NZ_AP018172.1_6736095_6736374_-	NA	NA|476aa|up_3|NZ_AP018172.1_6736455_6737883_-	cd05800, PGM_like2, This PGM-like (phosphoglucomutase-like) protein of unknown function belongs to the alpha-D-phosphohexomutase superfamily and is found in both archaea and bacteria	NA|286aa|up_2|NZ_AP018172.1_6737988_6738846_+	pfam12705, PDDEXK_1, PD-(D/E)XK nuclease superfamily	NA|557aa|up_1|NZ_AP018172.1_6738921_6740592_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|954aa|up_0|NZ_AP018172.1_6740882_6743744_+	cd02089, P-type_ATPase_Ca_prok, prokaryotic P-type Ca(2+)-ATPase similar to Synechococcus elongatus sp	NA|378aa|down_0|NZ_AP018172.1_6744860_6745994_+	PRK00064, recF, recombination protein F; Reviewed	NA|138aa|down_1|NZ_AP018172.1_6746423_6746837_+	NA	NA|146aa|down_2|NZ_AP018172.1_6746885_6747323_-	COG1285, SapB, Uncharacterized membrane protein [Function unknown]	NA|801aa|down_3|NZ_AP018172.1_6748086_6750489_+	COG0574, PpsA, Phosphoenolpyruvate synthase/pyruvate phosphate dikinase [Carbohydrate transport and metabolism]	NA|273aa|down_4|NZ_AP018172.1_6750564_6751383_-	NA	NA|391aa|down_5|NZ_AP018172.1_6751434_6752607_-	pfam18848, baeRF_family6, Bacterial archaeo-eukaryotic release factor family 6	NA|219aa|down_6|NZ_AP018172.1_6752971_6753628_-	COG4330, COG4330, Predicted membrane protein [Function unknown]	NA|836aa|down_7|NZ_AP018172.1_6754083_6756591_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|151aa|down_8|NZ_AP018172.1_6756821_6757274_-	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|507aa|down_9|NZ_AP018172.1_6757276_6758797_-	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	19	7416249-7417846	10,16,10	PILER-CR,CRISPRCasFinder,CRT	no	cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas2,cas1,csx18,WYL	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Type III-A,Type III-B,Type III-D,Type III-C	GTTTCCATTCAATTAATTTCTCTAGCGAGTAGAGAC,GTTTCCATTCAATTAATTTCTCTAGCGAGTAGAGAC,GTTTCCATTCAATTANTTTCTCTAGCGAGTAGAGAC	36,36,36	0	0	NA	NA	NA:NA:NA	21,21,21	21	TypeIII-A,TypeIII-B,TypeIII-D,TypeIII-C	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|309aa|up_7|NZ_AP018172.1_7401359_7402286_-,NA|105aa|up_6|NZ_AP018172.1_7402285_7402600_-,NA|518aa|up_5|NZ_AP018172.1_7407036_7408590_-,cmr5gr11|166aa|up_3|NZ_AP018172.1_7410374_7410872_-,csx18|95aa|down_2|NZ_AP018172.1_7419313_7419598_-,NA|360aa|down_4|NZ_AP018172.1_7421134_7422214_-,NA|261aa|down_7|NZ_AP018172.1_7425738_7426521_-	NA|139aa|up_9|NZ_AP018172.1_7394606_7395023_+	cd00085, HNHc, HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins	NA|2107aa|up_8|NZ_AP018172.1_7395042_7401363_-	cd17923, DEXHc_Hrq1-like, DEAH-box helicase domain of Hrq1 and similar proteins	NA|309aa|up_7|NZ_AP018172.1_7401359_7402286_-	NA	NA|105aa|up_6|NZ_AP018172.1_7402285_7402600_-	NA	NA|518aa|up_5|NZ_AP018172.1_7407036_7408590_-	NA	NA|570aa|up_4|NZ_AP018172.1_7408665_7410375_-	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	cmr5gr11|166aa|up_3|NZ_AP018172.1_7410374_7410872_-	NA	cmr4gr7|294aa|up_2|NZ_AP018172.1_7410883_7411765_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr3gr5|380aa|up_1|NZ_AP018172.1_7411820_7412960_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cas10|991aa|up_0|NZ_AP018172.1_7412973_7415946_-	pfam12469, DUF3692, CRISPR-associated protein	cas2|93aa|down_0|NZ_AP018172.1_7418031_7418310_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|331aa|down_1|NZ_AP018172.1_7418315_7419308_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	csx18|95aa|down_2|NZ_AP018172.1_7419313_7419598_-	NA	WYL|399aa|down_3|NZ_AP018172.1_7419817_7421014_+	pfam13280, WYL, WYL domain	NA|360aa|down_4|NZ_AP018172.1_7421134_7422214_-	NA	NA|837aa|down_5|NZ_AP018172.1_7422662_7425173_-	pfam13481, AAA_25, AAA domain	NA|89aa|down_6|NZ_AP018172.1_7425259_7425526_-	pfam06806, DUF1233, Putative excisionase (DUF1233)	NA|261aa|down_7|NZ_AP018172.1_7425738_7426521_-	NA	NA|397aa|down_8|NZ_AP018172.1_7426690_7427881_-	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|346aa|down_9|NZ_AP018172.1_7428188_7429226_-	pfam05672, MAP7, MAP7 (E-MAP-115) family
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	20	8383640-8383879	17	CRISPRCasFinder	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	AGATTTTAATTTAAAATTACTTGA	24	0	0	NA	NA	NA	4	4	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|157aa|up_8|NZ_AP018172.1_8375176_8375647_-,NA|125aa|up_3|NZ_AP018172.1_8379685_8380060_+,NA|158aa|up_1|NZ_AP018172.1_8380566_8381040_-,NA|141aa|up_0|NZ_AP018172.1_8381232_8381655_-,NA|77aa|down_0|NZ_AP018172.1_8384872_8385103_+	NA|532aa|up_9|NZ_AP018172.1_8373503_8375099_+	COG0374, HyaB, Ni,Fe-hydrogenase I large subunit [Energy production and conversion]	NA|157aa|up_8|NZ_AP018172.1_8375176_8375647_-	NA	NA|70aa|up_7|NZ_AP018172.1_8375776_8375986_-	COG3585, MopI, Molybdopterin-binding protein [Coenzyme metabolism]	NA|193aa|up_6|NZ_AP018172.1_8376889_8377468_+	cd04630, CBS_pair_bac, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains present in bacteria	NA|155aa|up_5|NZ_AP018172.1_8377930_8378395_+	sd00045, ANK, ankyrin repeats	NA|429aa|up_4|NZ_AP018172.1_8378381_8379668_+	cd02142, McbC_SagB-like_oxidoreductase, oxidase similar to the microcin B17 processing protein McbC	NA|125aa|up_3|NZ_AP018172.1_8379685_8380060_+	NA	NA|157aa|up_2|NZ_AP018172.1_8380080_8380551_-	cd06063, H2MP_Cyano-H2up, This group of endopeptidases include HupW enzymes that are specific to the cyanobacterial hydrogenase and are involved in the C-terminal cleavage of the hydrogenase large subunit precursor protein	NA|158aa|up_1|NZ_AP018172.1_8380566_8381040_-	NA	NA|141aa|up_0|NZ_AP018172.1_8381232_8381655_-	NA	NA|77aa|down_0|NZ_AP018172.1_8384872_8385103_+	NA	NA|542aa|down_1|NZ_AP018172.1_8385125_8386751_+	sd00006, TPR, Tetratricopeptide repeat	NA|237aa|down_2|NZ_AP018172.1_8386818_8387529_-	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|227aa|down_3|NZ_AP018172.1_8387798_8388479_+	pfam00395, SLH, S-layer homology domain	NA|636aa|down_4|NZ_AP018172.1_8388984_8390892_+	PRK05444, PRK05444, 1-deoxy-D-xylulose-5-phosphate synthase; Provisional	NA|1216aa|down_5|NZ_AP018172.1_8391070_8394718_+	pfam05860, Haemagg_act, haemagglutination activity domain	NA|496aa|down_6|NZ_AP018172.1_8394919_8396407_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|395aa|down_7|NZ_AP018172.1_8396472_8397657_-	PRK05942, PRK05942, aspartate aminotransferase; Provisional	NA|397aa|down_8|NZ_AP018172.1_8397653_8398844_-	cd08550, GlyDH-like, Glycerol_dehydrogenase-like	NA|174aa|down_9|NZ_AP018172.1_8399500_8400022_-	pfam10726, DUF2518, Protein of function (DUF2518)
GCF_002368175.1_ASM236817v1	NZ_AP018172	Calothrix sp. NIES-2098 DNA, complete genome	21	8592827-8593165	11,18,11	PILER-CR,CRISPRCasFinder,CRT	no		2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	Orphan	GTTTCCATCAATCATTTCCCCGCAAGGGG-------ATTGAAAC,GTTTCCATCAATCATTTCCCCGCAAGGGGATTGAAAC,GTTTCCATCAATCATTTCCCCGCAAGGGGATTGAAAC	44,37,37	0	0	NA	NA	NA:NA:NA	4,4,4	4	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|96aa|up_7|NZ_AP018172.1_8581231_8581519_-,NA|125aa|up_3|NZ_AP018172.1_8585259_8585634_+,NA	NA|509aa|up_9|NZ_AP018172.1_8578855_8580382_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|226aa|up_8|NZ_AP018172.1_8580557_8581235_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|96aa|up_7|NZ_AP018172.1_8581231_8581519_-	NA	NA|231aa|up_6|NZ_AP018172.1_8582359_8583052_+	pfam02397, Bac_transf, Bacterial sugar transferase	NA|507aa|up_5|NZ_AP018172.1_8583072_8584593_-	COG1541, PaaK, Coenzyme F390 synthetase [Coenzyme metabolism]	NA|133aa|up_4|NZ_AP018172.1_8584705_8585104_+	COG5499, COG5499, Predicted transcription regulator containing HTH domain [Transcription]	NA|125aa|up_3|NZ_AP018172.1_8585259_8585634_+	NA	NA|774aa|up_2|NZ_AP018172.1_8585694_8588016_+	pfam13424, TPR_12, Tetratricopeptide repeat	NA|560aa|up_1|NZ_AP018172.1_8588551_8590231_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|718aa|up_0|NZ_AP018172.1_8590363_8592517_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|535aa|down_0|NZ_AP018172.1_8593447_8595052_+	COG0728, MviN, Uncharacterized membrane protein, putative virulence factor [General function prediction only]	NA|253aa|down_1|NZ_AP018172.1_8595487_8596246_+	PRK05910, PRK05910, type III secretion system protein; Validated	NA|211aa|down_2|NZ_AP018172.1_8596328_8596961_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|281aa|down_3|NZ_AP018172.1_8596957_8597800_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|333aa|down_4|NZ_AP018172.1_8597842_8598841_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|980aa|down_5|NZ_AP018172.1_8600318_8603258_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|989aa|down_6|NZ_AP018172.1_8604006_8606973_+	sd00006, TPR, Tetratricopeptide repeat	NA|173aa|down_7|NZ_AP018172.1_8606992_8607511_-	pfam14218, COP23, Circadian oscillating protein COP23	NA|1224aa|down_8|NZ_AP018172.1_8607653_8611325_+	pfam05860, Haemagg_act, haemagglutination activity domain	NA|580aa|down_9|NZ_AP018172.1_8611374_8613114_+	COG2831, FhaC, Hemolysin activation/secretion protein [Intracellular trafficking and secretion]
GCF_002368175.1_ASM236817v1	NZ_AP018173	Calothrix sp. NIES-2098 plasmid plasmid1 DNA, complete genome	1	8830-9451	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no		RT	Orphan	CTTGCCGATCACATTGCCCCGCAAGGGG-------ATGGAAAC,CTTGCCGATCACATTGCCCCGCAAGGGGATGGAAAC,CTTGCCGATCACATTGCCCCGCAAGGGGATGGAAAC	43,36,36	0	0	NA	NA	NA:NA:NA	6,8,8	8	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|146aa|up_9|NZ_AP018173.1_525_963_-,NA|192aa|up_8|NZ_AP018173.1_1882_2458_+,NA|200aa|up_5|NZ_AP018173.1_3829_4429_+,NA|74aa|up_3|NZ_AP018173.1_6604_6826_+,NA|62aa|up_1|NZ_AP018173.1_7088_7274_-,NA|89aa|up_0|NZ_AP018173.1_7928_8195_+,NA|85aa|down_1|NZ_AP018173.1_10448_10703_+,NA|184aa|down_3|NZ_AP018173.1_11914_12466_-,NA|321aa|down_5|NZ_AP018173.1_15414_16377_-,NA|72aa|down_6|NZ_AP018173.1_16466_16682_+	NA|146aa|up_9|NZ_AP018173.1_525_963_-	NA	NA|192aa|up_8|NZ_AP018173.1_1882_2458_+	NA	NA|88aa|up_7|NZ_AP018173.1_2734_2998_+	COG2345, COG2345, Predicted transcriptional regulator [Transcription]	NA|92aa|up_6|NZ_AP018173.1_3059_3335_-	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|200aa|up_5|NZ_AP018173.1_3829_4429_+	NA	NA|618aa|up_4|NZ_AP018173.1_4640_6494_-	pfam16684, Telomere_res, Telomere resolvase	NA|74aa|up_3|NZ_AP018173.1_6604_6826_+	NA	NA|64aa|up_2|NZ_AP018173.1_6895_7087_+	pfam08374, Protocadherin, Protocadherin	NA|62aa|up_1|NZ_AP018173.1_7088_7274_-	NA	NA|89aa|up_0|NZ_AP018173.1_7928_8195_+	NA	NA|211aa|down_0|NZ_AP018173.1_9815_10448_+	PHA02518, PHA02518, ParA-like protein; Provisional	NA|85aa|down_1|NZ_AP018173.1_10448_10703_+	NA	NA|313aa|down_2|NZ_AP018173.1_10899_11838_+	cd09279, RNase_HI_like, RNAse HI family that includes archaeal, some bacterial as well as plant RNase HI	NA|184aa|down_3|NZ_AP018173.1_11914_12466_-	NA	NA|230aa|down_4|NZ_AP018173.1_12473_13163_-	COG1525, COG1525, Micrococcal nuclease (thermonuclease) homologs [DNA replication, recombination, and repair]	NA|321aa|down_5|NZ_AP018173.1_15414_16377_-	NA	NA|72aa|down_6|NZ_AP018173.1_16466_16682_+	NA	NA|134aa|down_7|NZ_AP018173.1_16990_17392_+	pfam05973, Gp49, Phage derived protein Gp49-like (DUF891)	NA|104aa|down_8|NZ_AP018173.1_17375_17687_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|308aa|down_9|NZ_AP018173.1_17717_18641_-	TIGR04285, parB-like_partition_protein, nucleoid occlusion protein
GCF_002368175.1_ASM236817v1	NZ_AP018173	Calothrix sp. NIES-2098 plasmid plasmid1 DNA, complete genome	2	14870-15260	2,2	CRISPRCasFinder,PILER-CR	no		RT	Orphan	CTTTCCTAAACAACTACCCCTTACGGGGATGGAAAC,ACTTTCCTAAACAACTACCCCTTACGGGGATGGAAACCA	36,39	1	1	14906-14941	NZ_AP018173.1_213696-213661	NA:NA	5,2	5	Orphan	2OG_CAS,csx21,csm3gr7,csx19,PD-DExK,csm2gr11,csx10gr5,cas10,Cas9_archaeal,RT,csa3,cas3,DinG,cmr5gr11,cmr4gr7,cmr3gr5,cas2,cas1,csx18,WYL,cas6,DEDDh	NA|74aa|up_8|NZ_AP018173.1_6604_6826_+,NA|62aa|up_6|NZ_AP018173.1_7088_7274_-,NA|89aa|up_5|NZ_AP018173.1_7928_8195_+,NA|85aa|up_3|NZ_AP018173.1_10448_10703_+,NA|184aa|up_1|NZ_AP018173.1_11914_12466_-,NA|321aa|down_0|NZ_AP018173.1_15414_16377_-,NA|72aa|down_1|NZ_AP018173.1_16466_16682_+	NA|618aa|up_9|NZ_AP018173.1_4640_6494_-	pfam16684, Telomere_res, Telomere resolvase	NA|74aa|up_8|NZ_AP018173.1_6604_6826_+	NA	NA|64aa|up_7|NZ_AP018173.1_6895_7087_+	pfam08374, Protocadherin, Protocadherin	NA|62aa|up_6|NZ_AP018173.1_7088_7274_-	NA	NA|89aa|up_5|NZ_AP018173.1_7928_8195_+	NA	NA|211aa|up_4|NZ_AP018173.1_9815_10448_+	PHA02518, PHA02518, ParA-like protein; Provisional	NA|85aa|up_3|NZ_AP018173.1_10448_10703_+	NA	NA|313aa|up_2|NZ_AP018173.1_10899_11838_+	cd09279, RNase_HI_like, RNAse HI family that includes archaeal, some bacterial as well as plant RNase HI	NA|184aa|up_1|NZ_AP018173.1_11914_12466_-	NA	NA|230aa|up_0|NZ_AP018173.1_12473_13163_-	COG1525, COG1525, Micrococcal nuclease (thermonuclease) homologs [DNA replication, recombination, and repair]	NA|321aa|down_0|NZ_AP018173.1_15414_16377_-	NA	NA|72aa|down_1|NZ_AP018173.1_16466_16682_+	NA	NA|134aa|down_2|NZ_AP018173.1_16990_17392_+	pfam05973, Gp49, Phage derived protein Gp49-like (DUF891)	NA|104aa|down_3|NZ_AP018173.1_17375_17687_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|308aa|down_4|NZ_AP018173.1_17717_18641_-	TIGR04285, parB-like_partition_protein, nucleoid occlusion protein	NA|255aa|down_5|NZ_AP018173.1_18640_19405_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|103aa|down_6|NZ_AP018173.1_19751_20060_+	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|63aa|down_7|NZ_AP018173.1_20175_20364_+	PRK00270, rpsU, 30S ribosomal protein S21; Reviewed	NA|1638aa|down_8|NZ_AP018173.1_20571_25485_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|432aa|down_9|NZ_AP018173.1_25481_26777_-	pfam07693, KAP_NTPase, KAP family P-loop domain
