assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	1	391780-391877	1	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	GCGCGTTTGGCTTCGGCTTCTTT	23	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|142aa|up_8|NC_019738.1_378355_378781_+,NA|73aa|up_3|NC_019738.1_386124_386343_-,NA|63aa|up_2|NC_019738.1_386339_386528_-,NA|73aa|down_0|NC_019738.1_393964_394183_+,NA|70aa|down_6|NC_019738.1_403903_404113_-,NA|62aa|down_7|NC_019738.1_404111_404297_+	NA|235aa|up_9|NC_019738.1_377286_377991_+	COG1434, COG1434, Uncharacterized conserved protein [Function unknown]	NA|142aa|up_8|NC_019738.1_378355_378781_+	NA	NA|864aa|up_7|NC_019738.1_378986_381578_+	COG1221, PspF, Transcriptional regulators containing an AAA-type ATPase domain and a DNA-binding domain [Transcription / Signal transduction mechanisms]	NA|520aa|up_6|NC_019738.1_381881_383441_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|536aa|up_5|NC_019738.1_383775_385383_+	cd07302, CHD, cyclase homology domain	NA|143aa|up_4|NC_019738.1_385643_386072_-	COG1047, SlpA, FKBP-type peptidyl-prolyl cis-trans isomerases 2 [Posttranslational modification, protein turnover, chaperones]	NA|73aa|up_3|NC_019738.1_386124_386343_-	NA	NA|63aa|up_2|NC_019738.1_386339_386528_-	NA	NA|837aa|up_1|NC_019738.1_386656_389167_+	TIGR00644, recJ, single-stranded-DNA-specific exonuclease RecJ	NA|276aa|up_0|NC_019738.1_389186_390014_-	cd03023, DsbA_Com1_like, DsbA family, Com1-like subfamily; composed of proteins similar to Com1, a 27-kDa outer membrane-associated immunoreactive protein originally found in both acute and chronic disease strains of the pathogenic bacteria Coxiella burnetti	NA|73aa|down_0|NC_019738.1_393964_394183_+	NA	NA|399aa|down_1|NC_019738.1_394484_395681_-	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|558aa|down_2|NC_019738.1_396147_397821_-	COG2898, COG2898, Uncharacterized conserved protein [Function unknown]	NA|311aa|down_3|NC_019738.1_399583_400516_-	COG2382, Fes, Enterochelin esterase and related enzymes [Inorganic ion transport and metabolism]	NA|571aa|down_4|NC_019738.1_401659_403372_+	COG0433, COG0433,  HerA helicase [Replication, recombination, and repair]	NA|165aa|down_5|NC_019738.1_403379_403874_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|70aa|down_6|NC_019738.1_403903_404113_-	NA	NA|62aa|down_7|NC_019738.1_404111_404297_+	NA	NA|295aa|down_8|NC_019738.1_404324_405209_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|328aa|down_9|NC_019738.1_406582_407566_+	PRK05949, PRK05949, RNA polymerase sigma factor; Validated
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	2	819238-819334	2	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	CCCCCACGTCAGGCTCTCAACACTC	25	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|32aa|up_9|NC_019738.1_811132_811228_-,NA|84aa|up_7|NC_019738.1_813431_813683_+,NA|248aa|up_5|NC_019738.1_814505_815249_-,NA|68aa|up_1|NC_019738.1_817291_817495_+,NA|82aa|up_0|NC_019738.1_817701_817947_-,NA|104aa|down_0|NC_019738.1_820439_820751_-,NA|85aa|down_3|NC_019738.1_824313_824568_-,NA|121aa|down_5|NC_019738.1_826110_826473_+,NA|153aa|down_6|NC_019738.1_826774_827233_-	NA|32aa|up_9|NC_019738.1_811132_811228_-	NA	NA|367aa|up_8|NC_019738.1_811320_812421_+	PRK00002, aroB, 3-dehydroquinate synthase; Reviewed	NA|84aa|up_7|NC_019738.1_813431_813683_+	NA	NA|125aa|up_6|NC_019738.1_813995_814370_+	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|248aa|up_5|NC_019738.1_814505_815249_-	NA	NA|188aa|up_4|NC_019738.1_815513_816077_+	pfam05685, Uma2, Putative restriction endonuclease	NA|189aa|up_3|NC_019738.1_816232_816799_-	pfam05685, Uma2, Putative restriction endonuclease	NA|83aa|up_2|NC_019738.1_816897_817146_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|68aa|up_1|NC_019738.1_817291_817495_+	NA	NA|82aa|up_0|NC_019738.1_817701_817947_-	NA	NA|104aa|down_0|NC_019738.1_820439_820751_-	NA	NA|226aa|down_1|NC_019738.1_821068_821746_+	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|726aa|down_2|NC_019738.1_822015_824193_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|85aa|down_3|NC_019738.1_824313_824568_-	NA	NA|393aa|down_4|NC_019738.1_824651_825830_-	cd08014, M20_Acy1-like, M20 Peptidase aminoacylase 1 subfamily	NA|121aa|down_5|NC_019738.1_826110_826473_+	NA	NA|153aa|down_6|NC_019738.1_826774_827233_-	NA	NA|247aa|down_7|NC_019738.1_827619_828360_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|616aa|down_8|NC_019738.1_828383_830231_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|312aa|down_9|NC_019738.1_830377_831313_+	pfam17655, IRK_C, Inward rectifier potassium channel C-terminal domain
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	3	955855-956009	1	PILER-CR	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	AAATTGTCTAACTTTATTATTCC	23	0	0	NA	NA	NA	2	2	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|123aa|up_7|NC_019738.1_949866_950235_-,NA|94aa|up_5|NC_019738.1_950753_951035_+,NA|91aa|up_4|NC_019738.1_951048_951321_+,NA|211aa|up_3|NC_019738.1_951425_952058_+,NA|360aa|up_1|NC_019738.1_953872_954951_+,NA|137aa|up_0|NC_019738.1_955030_955441_+,NA|126aa|down_3|NC_019738.1_961184_961562_-,NA|72aa|down_5|NC_019738.1_967759_967975_-,NA|303aa|down_8|NC_019738.1_970702_971611_-	NA|740aa|up_9|NC_019738.1_945997_948217_-	pfam00498, FHA, FHA domain	NA|536aa|up_8|NC_019738.1_948235_949843_-	pfam12770, CHAT, CHAT domain	NA|123aa|up_7|NC_019738.1_949866_950235_-	NA	NA|82aa|up_6|NC_019738.1_950388_950634_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|94aa|up_5|NC_019738.1_950753_951035_+	NA	NA|91aa|up_4|NC_019738.1_951048_951321_+	NA	NA|211aa|up_3|NC_019738.1_951425_952058_+	NA	NA|548aa|up_2|NC_019738.1_952217_953861_+	pfam15978, TnsD, Tn7-like transposition protein D	NA|360aa|up_1|NC_019738.1_953872_954951_+	NA	NA|137aa|up_0|NC_019738.1_955030_955441_+	NA	NA|225aa|down_0|NC_019738.1_956429_957104_+	COG3292, COG3292, Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]	NA|438aa|down_1|NC_019738.1_957297_958611_+	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|844aa|down_2|NC_019738.1_958644_961176_-	pfam00656, Peptidase_C14, Caspase domain	NA|126aa|down_3|NC_019738.1_961184_961562_-	NA	NA|304aa|down_4|NC_019738.1_963829_964741_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|72aa|down_5|NC_019738.1_967759_967975_-	NA	NA|64aa|down_6|NC_019738.1_968180_968372_-	pfam11165, DUF2949, Protein of unknown function (DUF2949)	NA|641aa|down_7|NC_019738.1_968665_970588_-	COG0443, DnaK, Molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|303aa|down_8|NC_019738.1_970702_971611_-	NA	NA|304aa|down_9|NC_019738.1_971921_972833_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	4	1265226-1265320	3	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	AGTACCAGGAGTCGGTGCGGCTGAATTTCCTGGAG	35	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|263aa|up_6|NC_019738.1_1253933_1254722_-,NA|336aa|up_2|NC_019738.1_1260266_1261274_-,NA|68aa|down_2|NC_019738.1_1267645_1267849_+,NA|87aa|down_9|NC_019738.1_1275507_1275768_+	NA|403aa|up_9|NC_019738.1_1251298_1252507_+	COG3889, COG3889, Predicted solute binding protein [General function prediction only]	NA|59aa|up_8|NC_019738.1_1252853_1253030_-	COG1826, TatA, Sec-independent protein secretion pathway components [Intracellular trafficking and secretion]	NA|248aa|up_7|NC_019738.1_1253142_1253886_+	PRK02816, PRK02816, phycocyanobilin:ferredoxin oxidoreductase; Validated	NA|263aa|up_6|NC_019738.1_1253933_1254722_-	NA	NA|554aa|up_5|NC_019738.1_1254947_1256609_-	pfam08291, Peptidase_M15_3, Peptidase M15	NA|256aa|up_4|NC_019738.1_1256657_1257425_+	PRK01158, PRK01158, phosphoglycolate phosphatase; Provisional	NA|888aa|up_3|NC_019738.1_1257525_1260189_+	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|336aa|up_2|NC_019738.1_1260266_1261274_-	NA	NA|612aa|up_1|NC_019738.1_1262031_1263867_+	PRK07431, PRK07431, aspartate kinase; Provisional	NA|345aa|up_0|NC_019738.1_1264084_1265119_-	pfam03372, Exo_endo_phos, Endonuclease/Exonuclease/phosphatase family	NA|237aa|down_0|NC_019738.1_1266129_1266840_-	COG0811, TolQ, Biopolymer transport proteins [Intracellular trafficking and secretion]	NA|100aa|down_1|NC_019738.1_1267215_1267515_+	COG3339, COG3339, Uncharacterized conserved protein [Function unknown]	NA|68aa|down_2|NC_019738.1_1267645_1267849_+	NA	NA|1062aa|down_3|NC_019738.1_1267894_1271080_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|226aa|down_4|NC_019738.1_1271209_1271887_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|132aa|down_5|NC_019738.1_1272349_1272745_+	cd19920, REC_PA4781-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase PA4781 and similar domains	NA|496aa|down_6|NC_019738.1_1272770_1274258_-	pfam02696, UPF0061, Uncharacterized ACR, YdiU/UPF0061 family	NA|86aa|down_7|NC_019738.1_1274320_1274578_+	pfam12095, CRR7, Protein CHLORORESPIRATORY REDUCTION 7	NA|187aa|down_8|NC_019738.1_1274580_1275141_-	pfam11016, DUF2854, Protein of unknown function (DUF2854)	NA|87aa|down_9|NC_019738.1_1275507_1275768_+	NA
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	5	1574448-1574559	4	CRISPRCasFinder	no	csa3	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Type I-A	GTGAGCGCGAAGAGGTAGAGGATGAGGGTTGCTGGGT	37	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|232aa|up_9|NC_019738.1_1559661_1560357_+,NA|93aa|up_7|NC_019738.1_1561406_1561685_-,NA|109aa|down_1|NC_019738.1_1576377_1576704_-,NA|53aa|down_5|NC_019738.1_1582632_1582791_+,NA|129aa|down_7|NC_019738.1_1583500_1583887_+	NA|232aa|up_9|NC_019738.1_1559661_1560357_+	NA	NA|304aa|up_8|NC_019738.1_1560488_1561400_-	cd03401, SPFH_prohibitin, Prohibitin family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|93aa|up_7|NC_019738.1_1561406_1561685_-	NA	NA|211aa|up_6|NC_019738.1_1562128_1562761_-	PRK00116, ruvA, Holliday junction branch migration protein RuvA	NA|252aa|up_5|NC_019738.1_1562907_1563663_-	TIGR01485, putative_sucrose-phosphate_phosphatase, sucrose-6F-phosphate phosphohydrolase	NA|931aa|up_4|NC_019738.1_1563959_1566752_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|1076aa|up_3|NC_019738.1_1567191_1570419_+	pfam18181, SLATT_1, SMODS and SLOG-associating 2TM effector domain 1	NA|250aa|up_2|NC_019738.1_1570624_1571374_+	pfam18171, LSDAT_prok, SLOG in TRPM, prokaryote	NA|280aa|up_1|NC_019738.1_1571482_1572322_+	pfam14015, DUF4231, Protein of unknown function (DUF4231)	NA|421aa|up_0|NC_019738.1_1572397_1573660_-	COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer membrane]	NA|180aa|down_0|NC_019738.1_1575779_1576319_-	cd10450, GIY-YIG_AtGrxS16_like, GIY-YIG domain found in CAXIP1-like proteins, iron-sulfur cluster assembly proteins, and similar proteins	NA|109aa|down_1|NC_019738.1_1576377_1576704_-	NA	NA|834aa|down_2|NC_019738.1_1576763_1579265_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|665aa|down_3|NC_019738.1_1579896_1581891_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	csa3|140aa|down_4|NC_019738.1_1582158_1582578_-	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|53aa|down_5|NC_019738.1_1582632_1582791_+	NA	NA|55aa|down_6|NC_019738.1_1582808_1582973_-	pfam02069, Metallothio_Pro, Prokaryotic metallothionein	NA|129aa|down_7|NC_019738.1_1583500_1583887_+	NA	NA|118aa|down_8|NC_019738.1_1584158_1584512_+	COG4980, GvpP, Gas vesicle protein [General function prediction only]	NA|202aa|down_9|NC_019738.1_1584522_1585128_+	pfam06103, DUF948, Bacterial protein of unknown function (DUF948)
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	6	1787477-1787583	5	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	GGAGCGAACCTCACTAGAGCGAACCTCTTTTT	32	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|90aa|up_5|NC_019738.1_1778591_1778861_+,NA|149aa|up_1|NC_019738.1_1786147_1786594_-,NA|75aa|down_6|NC_019738.1_1794038_1794263_-	NA|237aa|up_9|NC_019738.1_1772574_1773285_-	PRK01130, PRK01130, putative N-acetylmannosamine-6-phosphate 2-epimerase	NA|763aa|up_8|NC_019738.1_1773608_1775897_+	pfam13598, DUF4139, Domain of unknown function (DUF4139)	NA|215aa|up_7|NC_019738.1_1775967_1776612_+	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|528aa|up_6|NC_019738.1_1776634_1778218_+	TIGR02231, Hypothetical_protein_ZK1055	NA|90aa|up_5|NC_019738.1_1778591_1778861_+	NA	NA|138aa|up_4|NC_019738.1_1779002_1779416_+	cd17538, REC_D1_PleD-like, first (D1) phosphoacceptor receiver (REC) domain of response regulator PleD and similar domains	NA|799aa|up_3|NC_019738.1_1779485_1781882_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|1328aa|up_2|NC_019738.1_1782047_1786031_+	PRK05989, cobN, cobaltochelatase subunit CobN; Reviewed	NA|149aa|up_1|NC_019738.1_1786147_1786594_-	NA	NA|114aa|up_0|NC_019738.1_1786757_1787099_-	CHL00134, petF, ferredoxin; Validated	NA|214aa|down_0|NC_019738.1_1788186_1788828_-	COG1075, LipA, Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold [General function prediction only]	NA|163aa|down_1|NC_019738.1_1789231_1789720_+	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|376aa|down_2|NC_019738.1_1789733_1790861_-	sd00006, TPR, Tetratricopeptide repeat	NA|458aa|down_3|NC_019738.1_1791261_1792635_+	PRK09243, PRK09243, nicotinate phosphoribosyltransferase; Validated	NA|197aa|down_4|NC_019738.1_1792697_1793288_+	PRK00071, nadD, nicotinate-nucleotide adenylyltransferase	NA|245aa|down_5|NC_019738.1_1793260_1793995_+	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|75aa|down_6|NC_019738.1_1794038_1794263_-	NA	NA|145aa|down_7|NC_019738.1_1794330_1794765_-	COG3755, COG3755, Uncharacterized protein conserved in bacteria [Function unknown]	NA|467aa|down_8|NC_019738.1_1794958_1796359_+	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|622aa|down_9|NC_019738.1_1796644_1798510_+	PRK07390, PRK07390, NAD(P)H-quinone oxidoreductase subunit F; Validated
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	7	2087079-2087334	2,6,1	PILER-CR,CRISPRCasFinder,CRT	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	AGTCTGAATTCCATATAATCCCTATCAGGGATTGAAACG,AGTCTGAATTCCATATAATCCCTATCAGGGATTGAAAC,AGTCTGAATTCCATATAATCCCTATCAGGGATTGAAACG	39,38,39	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	3,3,3	3	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|56aa|up_9|NC_019738.1_2074360_2074528_-,NA|107aa|up_3|NC_019738.1_2083822_2084143_+,NA|46aa|down_6|NC_019738.1_2095820_2095958_+,NA|66aa|down_8|NC_019738.1_2098054_2098252_+	NA|56aa|up_9|NC_019738.1_2074360_2074528_-	NA	NA|710aa|up_8|NC_019738.1_2074548_2076678_+	COG0514, RecQ, Superfamily II DNA helicase [DNA replication, recombination, and repair]	NA|397aa|up_7|NC_019738.1_2076992_2078183_+	pfam00145, DNA_methylase, C-5 cytosine-specific DNA methylase	NA|208aa|up_6|NC_019738.1_2078207_2078831_+	pfam09517, RE_Eco29kI, Eco29kI restriction endonuclease	NA|259aa|up_5|NC_019738.1_2078805_2079582_-	cd06259, YdcF-like, YdcF-like	NA|1198aa|up_4|NC_019738.1_2079841_2083435_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|107aa|up_3|NC_019738.1_2083822_2084143_+	NA	NA|74aa|up_2|NC_019738.1_2084378_2084600_+	pfam14217, DUF4327, Domain of unknown function (DUF4327)	NA|231aa|up_1|NC_019738.1_2084763_2085456_-	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|228aa|up_0|NC_019738.1_2085817_2086501_-	COG1926, COG1926, Predicted phosphoribosyltransferases [General function prediction only]	NA|624aa|down_0|NC_019738.1_2087549_2089421_-	COG1668, NatB, ABC-type Na+ efflux pump, permease component [Energy production and conversion / Inorganic ion transport and metabolism]	NA|317aa|down_1|NC_019738.1_2089504_2090455_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|227aa|down_2|NC_019738.1_2090821_2091502_+	pfam14206, Cys_rich_CPCC, Cysteine-rich CPCC	NA|782aa|down_3|NC_019738.1_2091503_2093849_-	pfam00211, Guanylate_cyc, Adenylate and Guanylate cyclase catalytic domain	NA|232aa|down_4|NC_019738.1_2094207_2094903_-	PRK05653, fabG, 3-oxoacyl-ACP reductase FabG	NA|249aa|down_5|NC_019738.1_2095018_2095765_-	pfam08291, Peptidase_M15_3, Peptidase M15	NA|46aa|down_6|NC_019738.1_2095820_2095958_+	NA	NA|144aa|down_7|NC_019738.1_2097620_2098052_+	cd00737, lyz_endolysin_autolysin, endolysin and autolysin	NA|66aa|down_8|NC_019738.1_2098054_2098252_+	NA	NA|339aa|down_9|NC_019738.1_2098664_2099681_+	pfam13413, HTH_25, Helix-turn-helix domain
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	8	2127069-2127163	7	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	GAACAAAGAGGATGCTACCAAGA	23	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA,NA|80aa|down_9|NC_019738.1_2139636_2139876_-	NA|595aa|up_9|NC_019738.1_2114074_2115859_-	pfam13354, Beta-lactamase2, Beta-lactamase enzyme family	NA|262aa|up_8|NC_019738.1_2116066_2116852_-	COG0565, LasT, rRNA methylase [Translation, ribosomal structure and biogenesis]	NA|51aa|up_7|NC_019738.1_2117016_2117169_+	pfam10013, DUF2256, Uncharacterized protein conserved in bacteria (DUF2256)	NA|315aa|up_6|NC_019738.1_2117303_2118248_+	cd14949, Asparaginase_2_like_3, Uncharacterized bacterial subfamily of the L-Asparaginase type 2-like enzymes, an Ntn-hydrolase family	NA|183aa|up_5|NC_019738.1_2118344_2118893_-	pfam05685, Uma2, Putative restriction endonuclease	NA|210aa|up_4|NC_019738.1_2119379_2120009_+	COG0400, COG0400, Predicted esterase [General function prediction only]	NA|1250aa|up_3|NC_019738.1_2120295_2124045_+	TIGR02169, chromosome_segregation_protein_related_ptotein, chromosome segregation protein SMC, primarily archaeal type	NA|339aa|up_2|NC_019738.1_2124119_2125136_+	pfam05239, PRC, PRC-barrel domain	NA|260aa|up_1|NC_019738.1_2125179_2125959_-	pfam10099, RskA, Anti-sigma-K factor rskA	NA|208aa|up_0|NC_019738.1_2125971_2126595_-	PRK12519, PRK12519, RNA polymerase sigma factor; Provisional	NA|203aa|down_0|NC_019738.1_2127467_2128076_+	COG4117, COG4117, Thiosulfate reductase cytochrome B subunit (membrane anchoring protein) [Energy production and conversion]	NA|241aa|down_1|NC_019738.1_2128085_2128808_+	cd02108, bact_SO_family_Moco, bacterial subgroup of the sulfite oxidase (SO) family of molybdopterin binding domains	NA|444aa|down_2|NC_019738.1_2128851_2130183_-	COG2821, MltA, Membrane-bound lytic murein transglycosylase [Cell envelope biogenesis, outer membrane]	NA|1214aa|down_3|NC_019738.1_2130391_2134033_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|144aa|down_4|NC_019738.1_2134290_2134722_+	pfam10990, DUF2809, Protein of unknown function (DUF2809)	NA|240aa|down_5|NC_019738.1_2134752_2135472_+	cd00956, Transaldolase_FSA, Transaldolase-like fructose-6-phosphate aldolases (FSA) found in bacteria and archaea	NA|369aa|down_6|NC_019738.1_2135519_2136626_+	COG0435, ECM4, Predicted glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|395aa|down_7|NC_019738.1_2136812_2137997_+	PRK07415, PRK07415, NAD(P)H-quinone oxidoreductase subunit H; Validated	NA|463aa|down_8|NC_019738.1_2138198_2139587_+	pfam00067, p450, Cytochrome P450	NA|80aa|down_9|NC_019738.1_2139636_2139876_-	NA
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	9	2818455-2818579	8	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	GGGTCAGCTTAGATAAAGACCCTAAC	26	0	0	NA	NA	NA	2	2	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|90aa|up_8|NC_019738.1_2807436_2807706_-,NA|136aa|up_2|NC_019738.1_2816089_2816497_-,NA|118aa|down_7|NC_019738.1_2830993_2831347_+,NA|144aa|down_9|NC_019738.1_2832476_2832908_+	NA|198aa|up_9|NC_019738.1_2806870_2807464_-	pfam11375, DUF3177, Protein of unknown function (DUF3177)	NA|90aa|up_8|NC_019738.1_2807436_2807706_-	NA	NA|76aa|up_7|NC_019738.1_2807680_2807908_-	pfam02672, CP12, CP12 domain	NA|416aa|up_6|NC_019738.1_2808281_2809529_+	COG4398, COG4398, Uncharacterized protein conserved in bacteria [Function unknown]	NA|368aa|up_5|NC_019738.1_2809816_2810920_-	COG1409, Icc, Predicted phosphohydrolases [General function prediction only]	NA|214aa|up_4|NC_019738.1_2811196_2811838_+	PRK00121, trmB, tRNA (guanine-N(7)-)-methyltransferase; Reviewed	NA|1354aa|up_3|NC_019738.1_2811921_2815983_-	pfam05860, Haemagg_act, haemagglutination activity domain	NA|136aa|up_2|NC_019738.1_2816089_2816497_-	NA	NA|149aa|up_1|NC_019738.1_2816754_2817201_+	TIGR03042, hypothetical_protein, photosystem II protein PsbQ	NA|369aa|up_0|NC_019738.1_2817206_2818313_+	pfam01266, DAO, FAD dependent oxidoreductase	NA|1121aa|down_0|NC_019738.1_2818613_2821976_-	PRK13558, PRK13558, bacterio-opsin activator; Provisional	NA|154aa|down_1|NC_019738.1_2822003_2822465_-	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|1179aa|down_2|NC_019738.1_2822574_2826111_-	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|406aa|down_3|NC_019738.1_2826436_2827654_+	PRK00366, ispG, flavodoxin-dependent (E)-4-hydroxy-3-methylbut-2-enyl-diphosphate synthase	NA|219aa|down_4|NC_019738.1_2827909_2828566_+	COG4245, TerY, Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [General function prediction only]	NA|260aa|down_5|NC_019738.1_2828583_2829363_+	pfam13672, PP2C_2, Protein phosphatase 2C	NA|510aa|down_6|NC_019738.1_2829404_2830934_+	COG4248, COG4248, Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains [General function prediction only]	NA|118aa|down_7|NC_019738.1_2830993_2831347_+	NA	NA|283aa|down_8|NC_019738.1_2831429_2832278_-	PLN03084, PLN03084, alpha/beta hydrolase fold protein; Provisional	NA|144aa|down_9|NC_019738.1_2832476_2832908_+	NA
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	10	2916270-2917173	3,9,2	PILER-CR,CRISPRCasFinder,CRT	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	GTTTAAATTCCACTTAATCCCTATCAGGGATTGAAAC,GTTTAAATTCCACTTAATCCCTATCAGGGATTGAAAC,GTTTAAATTCCACTTAATCCCTATCAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	12,12,12	12	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|137aa|up_6|NC_019738.1_2904605_2905016_-,NA|326aa|up_4|NC_019738.1_2905947_2906925_-,NA|89aa|down_2|NC_019738.1_2920866_2921133_+,NA|239aa|down_9|NC_019738.1_2929941_2930658_+	NA|682aa|up_9|NC_019738.1_2899102_2901148_-	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|80aa|up_8|NC_019738.1_2902540_2902780_-	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|454aa|up_7|NC_019738.1_2903194_2904556_-	cd13131, MATE_NorM_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Vibrio cholerae NorM	NA|137aa|up_6|NC_019738.1_2904605_2905016_-	NA	NA|189aa|up_5|NC_019738.1_2905079_2905646_-	pfam05685, Uma2, Putative restriction endonuclease	NA|326aa|up_4|NC_019738.1_2905947_2906925_-	NA	NA|1001aa|up_3|NC_019738.1_2907308_2910311_-	PRK00349, uvrA, excinuclease ABC subunit UvrA	NA|534aa|up_2|NC_019738.1_2910461_2912063_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|555aa|up_1|NC_019738.1_2912444_2914109_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|263aa|up_0|NC_019738.1_2914968_2915757_+	pfam12787, EcsC, EcsC protein family	NA|274aa|down_0|NC_019738.1_2918492_2919314_-	pfam14907, NTP_transf_5, Uncharacterized nucleotidyltransferase	NA|248aa|down_1|NC_019738.1_2919555_2920299_+	COG2129, COG2129, Predicted phosphoesterases, related to the Icc protein [General function prediction only]	NA|89aa|down_2|NC_019738.1_2920866_2921133_+	NA	NA|683aa|down_3|NC_019738.1_2921406_2923455_-	TIGR02442, Uncharacterized_protein_Rv2850c/MT2916, cobaltochelatase subunit	NA|396aa|down_4|NC_019738.1_2923656_2924844_+	pfam11294, DUF3095, Protein of unknown function (DUF3095)	NA|368aa|down_5|NC_019738.1_2924962_2926066_+	pfam03631, Virul_fac_BrkB, Virulence factor BrkB	NA|285aa|down_6|NC_019738.1_2926417_2927272_+	TIGR01183, Nitrate_transport_permease_protein_NrtB, nitrate ABC transporter, permease protein	NA|468aa|down_7|NC_019738.1_2927287_2928691_+	pfam13379, NMT1_2, NMT1-like family	NA|274aa|down_8|NC_019738.1_2928740_2929562_+	TIGR01184, Nitrate_transport_ATP-binding_protein_NrtC, nitrate transport ATP-binding subunits C and D	NA|239aa|down_9|NC_019738.1_2929941_2930658_+	NA
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	11	2962665-2962758	10	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	AGGTCGAACCCTACAACGGAGGCCACAACTAACA	34	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|242aa|up_9|NC_019738.1_2947048_2947774_-,NA|86aa|down_0|NC_019738.1_2964688_2964946_+,NA|61aa|down_1|NC_019738.1_2965225_2965408_+,NA|196aa|down_5|NC_019738.1_2968141_2968729_-,NA|75aa|down_8|NC_019738.1_2971302_2971527_+	NA|242aa|up_9|NC_019738.1_2947048_2947774_-	NA	NA|213aa|up_8|NC_019738.1_2948019_2948658_+	pfam14534, DUF4440, Domain of unknown function (DUF4440)	NA|408aa|up_7|NC_019738.1_2948738_2949962_+	cd06423, CESA_like, CESA_like is  the cellulose synthase superfamily	NA|306aa|up_6|NC_019738.1_2949954_2950872_+	cd10146, LabA_like_C, C-terminal domain of LabA_like proteins	NA|570aa|up_5|NC_019738.1_2950892_2952602_+	COG3653, COG3653, N-acyl-D-aspartate/D-glutamate deacylase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|146aa|up_4|NC_019738.1_2952615_2953053_+	pfam06127, DUF962, Protein of unknown function (DUF962)	NA|345aa|up_3|NC_019738.1_2953278_2954313_+	COG4638, HcaE, Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit [Inorganic ion transport and metabolism / General function prediction only]	NA|387aa|up_2|NC_019738.1_2955727_2956888_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|885aa|up_1|NC_019738.1_2956908_2959563_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|189aa|up_0|NC_019738.1_2959736_2960303_-	pfam05685, Uma2, Putative restriction endonuclease	NA|86aa|down_0|NC_019738.1_2964688_2964946_+	NA	NA|61aa|down_1|NC_019738.1_2965225_2965408_+	NA	NA|245aa|down_2|NC_019738.1_2965585_2966320_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|316aa|down_3|NC_019738.1_2966319_2967267_+	TIGR00005, Ribosomal_large_subunit_pseudouridine_synthase_D, pseudouridine synthase, RluA family	NA|250aa|down_4|NC_019738.1_2967246_2967996_-	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|196aa|down_5|NC_019738.1_2968141_2968729_-	NA	NA|206aa|down_6|NC_019738.1_2968685_2969303_-	pfam09367, CpeS, CpeS-like protein	NA|428aa|down_7|NC_019738.1_2969444_2970728_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|75aa|down_8|NC_019738.1_2971302_2971527_+	NA	NA|173aa|down_9|NC_019738.1_2972530_2973049_+	cd14768, PC_PEC_beta, Beta subunits of phycoerythrin and phycoerythrocyanin; phycobilisome rod components
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	12	3131505-3132053	4,11,3	PILER-CR,CRISPRCasFinder,CRT	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	GTTTAAATTCCACTTAATCCCTATCAGGGATTGAAAC,GTTTAAATTCCACTTAATCCCTATCAGGGATTGAAAC,GTTTAAATTCCACTTAATCCCTATCAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	6,7,7	7	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA,NA|412aa|down_1|NC_019738.1_3133294_3134530_-,NA|92aa|down_2|NC_019738.1_3134554_3134830_-,NA|158aa|down_3|NC_019738.1_3134962_3135436_+,NA|54aa|down_4|NC_019738.1_3135458_3135620_-,NA|104aa|down_9|NC_019738.1_3139231_3139543_-	NA|357aa|up_9|NC_019738.1_3115185_3116256_-	PRK12555, PRK12555, chemotaxis-specific protein-glutamate methyltransferase CheB	NA|780aa|up_8|NC_019738.1_3116256_3118596_-	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|507aa|up_7|NC_019738.1_3118631_3120152_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|398aa|up_6|NC_019738.1_3120259_3121453_-	cd00732, CheW, CheW, a small regulator protein, unique to the chemotaxis signalling in prokaryotes and archea	NA|387aa|up_5|NC_019738.1_3121598_3122759_-	cd17574, REC_OmpR, phosphoacceptor receiver (REC) domain of OmpR family response regulators	NA|116aa|up_4|NC_019738.1_3123644_3123992_-	pfam09685, DUF4870, Domain of unknown function (DUF4870)	NA|543aa|up_3|NC_019738.1_3124178_3125807_+	PRK00741, prfC, peptide chain release factor 3; Provisional	NA|674aa|up_2|NC_019738.1_3125919_3127941_-	cd07338, M48B_HtpX_like, Peptidase M48 subfamily B HtpX-like membrane-bound metallopeptidase	NA|290aa|up_1|NC_019738.1_3128703_3129573_+	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|435aa|up_0|NC_019738.1_3129934_3131239_+	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|241aa|down_0|NC_019738.1_3132545_3133268_-	cd04692, Nudix_Hydrolase_33, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|412aa|down_1|NC_019738.1_3133294_3134530_-	NA	NA|92aa|down_2|NC_019738.1_3134554_3134830_-	NA	NA|158aa|down_3|NC_019738.1_3134962_3135436_+	NA	NA|54aa|down_4|NC_019738.1_3135458_3135620_-	NA	NA|349aa|down_5|NC_019738.1_3136003_3137050_-	COG3367, COG3367, Uncharacterized conserved protein [Function unknown]	NA|359aa|down_6|NC_019738.1_3137033_3138110_-	cd03319, L-Ala-DL-Glu_epimerase, L-Ala-D/L-Glu epimerase catalyzes the epimerization of L-Ala-D/L-Glu and other dipeptides	NA|71aa|down_7|NC_019738.1_3138203_3138416_+	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|192aa|down_8|NC_019738.1_3138649_3139225_+	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|104aa|down_9|NC_019738.1_3139231_3139543_-	NA
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	13	3304124-3304229	12	CRISPRCasFinder	no	csx1	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	TCAACCCTCCCTTAATAGAGTGGGTTGAAAGA	32	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|46aa|up_3|NC_019738.1_3300807_3300945_+,csx1|147aa|up_1|NC_019738.1_3302696_3303137_-,NA|194aa|down_1|NC_019738.1_3306924_3307506_-,NA|154aa|down_6|NC_019738.1_3313830_3314292_+,NA|220aa|down_8|NC_019738.1_3315400_3316060_-	NA|704aa|up_9|NC_019738.1_3292471_3294583_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|259aa|up_8|NC_019738.1_3294742_3295519_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|210aa|up_7|NC_019738.1_3295610_3296240_-	pfam01027, Bax1-I, Inhibitor of apoptosis-promoting Bax1	NA|414aa|up_6|NC_019738.1_3296469_3297711_-	cd08021, M20_Acy1_YhaA-like, M20 Peptidase aminoacylase 1 subfamily, includes Bacillus subtilis YhaA and Staphylococcus aureus amidohydrolase, SACOL0085	NA|227aa|up_5|NC_019738.1_3297766_3298447_-	PRK00090, bioD, ATP-dependent dethiobiotin synthetase BioD	NA|680aa|up_4|NC_019738.1_3298465_3300505_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|46aa|up_3|NC_019738.1_3300807_3300945_+	NA	NA|344aa|up_2|NC_019738.1_3301489_3302521_+	PRK02812, PRK02812, ribose-phosphate pyrophosphokinase; Provisional	csx1|147aa|up_1|NC_019738.1_3302696_3303137_-	NA	NA|272aa|up_0|NC_019738.1_3303231_3304047_+	pfam18599, LCIB_C_CA, Limiting CO2-inducible proteins B/C beta carbonyic anhydrases	NA|188aa|down_0|NC_019738.1_3306350_3306914_+	pfam09988, DUF2227, Uncharacterized metal-binding protein (DUF2227)	NA|194aa|down_1|NC_019738.1_3306924_3307506_-	NA	NA|204aa|down_2|NC_019738.1_3307607_3308219_+	pfam01947, DUF98, Protein of unknown function (DUF98)	NA|504aa|down_3|NC_019738.1_3308423_3309935_-	PRK07349, PRK07349, amidophosphoribosyltransferase; Provisional	NA|784aa|down_4|NC_019738.1_3310118_3312470_-	PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase subunit PurL	NA|358aa|down_5|NC_019738.1_3312609_3313683_-	PRK11468, PRK11468, dihydroxyacetone kinase subunit DhaK; Provisional	NA|154aa|down_6|NC_019738.1_3313830_3314292_+	NA	NA|342aa|down_7|NC_019738.1_3314350_3315376_-	pfam00145, DNA_methylase, C-5 cytosine-specific DNA methylase	NA|220aa|down_8|NC_019738.1_3315400_3316060_-	NA	NA|124aa|down_9|NC_019738.1_3316177_3316549_-	TIGR02058, lin0512_fam, conserved hypothetical protein
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	14	3339612-3339680	13	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	TTTGGTCATCATCTTGGTCATCAT	24	1	1	3339636-3339656	NC_019738.1_3339681-3339701	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|1036aa|up_8|NC_019738.1_3328067_3331175_-,NA|127aa|up_3|NC_019738.1_3335872_3336253_-,NA|157aa|up_0|NC_019738.1_3338688_3339159_-,NA|147aa|down_8|NC_019738.1_3355231_3355672_+,NA|175aa|down_9|NC_019738.1_3355762_3356287_+	NA|658aa|up_9|NC_019738.1_3325999_3327973_-	cd10231, YegD_like, Escherichia coli YegD, a putative chaperone protein, and related proteins	NA|1036aa|up_8|NC_019738.1_3328067_3331175_-	NA	NA|236aa|up_7|NC_019738.1_3331756_3332464_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|174aa|up_6|NC_019738.1_3332633_3333155_+	pfam00582, Usp, Universal stress protein family	NA|237aa|up_5|NC_019738.1_3333450_3334161_+	COG4241, COG4241, Predicted membrane protein [Function unknown]	NA|397aa|up_4|NC_019738.1_3334396_3335587_+	PRK04447, PRK04447, hypothetical protein; Provisional	NA|127aa|up_3|NC_019738.1_3335872_3336253_-	NA	NA|215aa|up_2|NC_019738.1_3336728_3337373_+	pfam09367, CpeS, CpeS-like protein	NA|408aa|up_1|NC_019738.1_3337449_3338673_+	cd13661, PBP2_PotD_PotF_like_1, The periplasmic substrate-binding component of an uncharacterized active transport system closely related to spermidine and putrescine transporters; contains the type 2 periplasmic binding fold	NA|157aa|up_0|NC_019738.1_3338688_3339159_-	NA	NA|687aa|down_0|NC_019738.1_3342241_3344302_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|182aa|down_1|NC_019738.1_3344825_3345371_+	PRK00819, PRK00819, RNA 2'-phosphotransferase; Reviewed	NA|147aa|down_2|NC_019738.1_3345402_3345843_+	cd03428, Ap4A_hydrolase_human_like, Diadenosine tetraphosphate (Ap4A) hydrolase is a member of the Nudix hydrolase superfamily	NA|402aa|down_3|NC_019738.1_3345917_3347123_+	COG0465, HflB, ATP-dependent Zn proteases [Posttranslational modification, protein turnover, chaperones]	NA|299aa|down_4|NC_019738.1_3347269_3348166_-	pfam11251, DUF3050, Protein of unknown function (DUF3050)	NA|241aa|down_5|NC_019738.1_3349101_3349824_+	cd12131, HGbI-like, Hell's gate globin I (HGbI) from Methylacidophilum infernorum and related proteins	NA|927aa|down_6|NC_019738.1_3349893_3352674_-	PRK13558, PRK13558, bacterio-opsin activator; Provisional	NA|503aa|down_7|NC_019738.1_3353138_3354647_+	pfam13205, Big_5, Bacterial Ig-like domain	NA|147aa|down_8|NC_019738.1_3355231_3355672_+	NA	NA|175aa|down_9|NC_019738.1_3355762_3356287_+	NA
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	15	3557701-3557913	14	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	GTGTCGATTTGGGCAGTTGCCGTTCGACTCACTTGCAAA	39	0	0	NA	NA	NA	2	2	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|74aa|up_7|NC_019738.1_3546638_3546860_-,NA|69aa|down_7|NC_019738.1_3563961_3564168_-,NA|118aa|down_8|NC_019738.1_3564429_3564783_-	NA|855aa|up_9|NC_019738.1_3543709_3546274_+	COG0068, HypF, Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]	NA|97aa|up_8|NC_019738.1_3546285_3546576_+	pfam01455, HupF_HypC, HupF/HypC family	NA|74aa|up_7|NC_019738.1_3546638_3546860_-	NA	NA|1225aa|up_6|NC_019738.1_3546853_3550528_+	TIGR02176, pyruvate_flavodoxin/ferrodoxin_oxidoreductase, pyruvate:ferredoxin (flavodoxin) oxidoreductase, homodimeric	NA|349aa|up_5|NC_019738.1_3550759_3551806_+	PRK07565, PRK07565, dihydroorotate dehydrogenase-like protein	NA|379aa|up_4|NC_019738.1_3551811_3552948_+	PRK15062, PRK15062, hydrogenase isoenzymes formation protein HypD; Provisional	NA|355aa|up_3|NC_019738.1_3552966_3554031_+	cd02197, HypE, HypE (Hydrogenase expression/formation protein)	NA|361aa|up_2|NC_019738.1_3554431_3555514_+	PRK14071, PRK14071, ATP-dependent 6-phosphofructokinase	NA|142aa|up_1|NC_019738.1_3555946_3556372_-	cd07262, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|418aa|up_0|NC_019738.1_3556410_3557664_-	COG4941, COG4941, Predicted RNA polymerase sigma factor containing a TPR repeat domain [Transcription]	NA|119aa|down_0|NC_019738.1_3557925_3558282_+	COG3795, COG3795, Uncharacterized protein conserved in bacteria [Function unknown]	NA|144aa|down_1|NC_019738.1_3558376_3558808_+	COG3795, COG3795, Uncharacterized protein conserved in bacteria [Function unknown]	NA|137aa|down_2|NC_019738.1_3558867_3559278_+	cd06588, PhnB_like, Escherichia coli PhnB and similar proteins	NA|73aa|down_3|NC_019738.1_3559334_3559553_+	cd08894, SRPBCC_CalC_Aha1-like_1, Putative hydrophobic ligand-binding SRPBCC domain of an uncharacterized subgroup of CalC- and Aha1-like proteins	NA|114aa|down_4|NC_019738.1_3559467_3559809_+	pfam07617, DUF1579, Protein of unknown function (DUF1579)	NA|132aa|down_5|NC_019738.1_3560274_3560670_+	cd08359, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|870aa|down_6|NC_019738.1_3560962_3563572_+	PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated	NA|69aa|down_7|NC_019738.1_3563961_3564168_-	NA	NA|118aa|down_8|NC_019738.1_3564429_3564783_-	NA	NA|135aa|down_9|NC_019738.1_3565081_3565486_-	cd17552, REC_RR468-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator RR468 and similar domains
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	16	3653441-3653594	15	CRISPRCasFinder	no	PD-DExK	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Unclear	GCGATCGCTCATTTATTCCCTCCCAACTTTTCCAGCAACGCTTGCCACA	49	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|47aa|up_8|NC_019738.1_3646253_3646394_+,PD-DExK|201aa|up_5|NC_019738.1_3648236_3648839_+,NA|103aa|up_1|NC_019738.1_3651603_3651912_-,NA|117aa|down_1|NC_019738.1_3654121_3654472_+,NA|101aa|down_2|NC_019738.1_3656337_3656640_+,NA|105aa|down_3|NC_019738.1_3656740_3657055_+,NA|70aa|down_4|NC_019738.1_3657051_3657261_+,NA|192aa|down_7|NC_019738.1_3660715_3661291_+	NA|275aa|up_9|NC_019738.1_3644841_3645666_-	COG0613, COG0613, Predicted metal-dependent phosphoesterases (PHP family) [General function prediction only]	NA|47aa|up_8|NC_019738.1_3646253_3646394_+	NA	NA|193aa|up_7|NC_019738.1_3646393_3646972_+	pfam05685, Uma2, Putative restriction endonuclease	NA|359aa|up_6|NC_019738.1_3646972_3648049_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	PD-DExK|201aa|up_5|NC_019738.1_3648236_3648839_+	NA	NA|196aa|up_4|NC_019738.1_3648880_3649468_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|141aa|up_3|NC_019738.1_3649492_3649915_+	cd17287, RMtype1_S_EcoN10ORF171P_TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit TRD-CR, similar to Escherichia coli N10-0505 S subunit (S	NA|325aa|up_2|NC_019738.1_3649924_3650899_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|103aa|up_1|NC_019738.1_3651603_3651912_-	NA	NA|484aa|up_0|NC_019738.1_3651928_3653380_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|81aa|down_0|NC_019738.1_3653690_3653933_-	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|117aa|down_1|NC_019738.1_3654121_3654472_+	NA	NA|101aa|down_2|NC_019738.1_3656337_3656640_+	NA	NA|105aa|down_3|NC_019738.1_3656740_3657055_+	NA	NA|70aa|down_4|NC_019738.1_3657051_3657261_+	NA	NA|438aa|down_5|NC_019738.1_3657276_3658590_-	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed	NA|347aa|down_6|NC_019738.1_3658888_3659929_-	PRK07411, PRK07411, molybdopterin-synthase adenylyltransferase MoeB	NA|192aa|down_7|NC_019738.1_3660715_3661291_+	NA	NA|270aa|down_8|NC_019738.1_3661297_3662107_-	cd06259, YdcF-like, YdcF-like	NA|187aa|down_9|NC_019738.1_3662281_3662842_-	PRK07411, PRK07411, molybdopterin-synthase adenylyltransferase MoeB
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	17	3859458-3860565	5,16,4	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas6,PD-DExK,cas3,csc1gr5,csc2gr7,cas10d,WYL	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Type I-D	GTTTAAATTCCACTTAATCCCTATCAGGGATTGAAAC,GTTTCAATCCCTGATAGGGATTAAGTGGAATTTAAAC,GTTTCAATCCCTGATAGGGATTAAGTGGAATTTAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	15,15,15	15	TypeI-D	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA,PD-DExK|203aa|down_4|NC_019738.1_3864010_3864619_-,cas10d|904aa|down_8|NC_019738.1_3868774_3871486_-	NA|119aa|up_9|NC_019738.1_3846911_3847268_+	pfam12973, Cupin_7, ChrR Cupin-like domain	NA|226aa|up_8|NC_019738.1_3847369_3848047_+	cd00431, cysteine_hydrolases, Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle	NA|692aa|up_7|NC_019738.1_3848279_3850355_+	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|172aa|up_6|NC_019738.1_3850427_3850943_-	COG3153, COG3153, Predicted acetyltransferase [General function prediction only]	NA|152aa|up_5|NC_019738.1_3851105_3851561_+	pfam12049, DUF3531, Protein of unknown function (DUF3531)	NA|151aa|up_4|NC_019738.1_3851647_3852100_+	cd04688, Nudix_Hydrolase_29, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|743aa|up_3|NC_019738.1_3852160_3854389_-	cd07496, Peptidases_S8_13, Peptidase S8 family domain, uncharacterized subfamily 13	NA|119aa|up_2|NC_019738.1_3854538_3854895_-	pfam02152, FolB, Dihydroneopterin aldolase	NA|411aa|up_1|NC_019738.1_3855442_3856675_+	COG1566, EmrA, Multidrug resistance efflux pump [Defense mechanisms]	NA|772aa|up_0|NC_019738.1_3856922_3859238_+	TIGR03030, Cellulose_synthase_UDP-forming, cellulose synthase catalytic subunit (UDP-forming)	cas2|91aa|down_0|NC_019738.1_3860815_3861088_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_1|NC_019738.1_3861090_3862095_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|198aa|down_2|NC_019738.1_3862361_3862955_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas6|282aa|down_3|NC_019738.1_3863130_3863976_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	PD-DExK|203aa|down_4|NC_019738.1_3864010_3864619_-	NA	cas3|763aa|down_5|NC_019738.1_3864673_3866962_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	csc1gr5|264aa|down_6|NC_019738.1_3866954_3867746_-	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	csc2gr7|338aa|down_7|NC_019738.1_3867749_3868763_-	pfam18320, Csc2, Csc2 Crispr	cas10d|904aa|down_8|NC_019738.1_3868774_3871486_-	NA	WYL|291aa|down_9|NC_019738.1_3871674_3872547_+	pfam13280, WYL, WYL domain
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	18	4023938-4024049	17	CRISPRCasFinder	no	c2c5_V-U5	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Type V-U5	GTTTCATCACCCCTCCCGCCTTGGGATGGGTTGAAAG	37	0	0	NA	NA	V-U5	1	1	TypeV-U5	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|75aa|up_5|NC_019738.1_4018093_4018318_-,NA|97aa|up_3|NC_019738.1_4018749_4019040_+,NA|103aa|up_2|NC_019738.1_4019036_4019345_+,NA|59aa|down_6|NC_019738.1_4031765_4031942_-,NA|79aa|down_7|NC_019738.1_4032308_4032545_+,NA|88aa|down_8|NC_019738.1_4032656_4032920_-	NA|336aa|up_9|NC_019738.1_4010804_4011812_-	pfam14390, DUF4420, Putative PD-(D/E)XK family member, (DUF4420)	NA|918aa|up_8|NC_019738.1_4011808_4014562_-	pfam10593, Z1, Z1 domain	NA|500aa|up_7|NC_019738.1_4014580_4016080_-	pfam13589, HATPase_c_3, Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase	NA|412aa|up_6|NC_019738.1_4016386_4017622_-	COG0270, Dcm, Site-specific DNA methylase [DNA replication, recombination, and repair]	NA|75aa|up_5|NC_019738.1_4018093_4018318_-	NA	NA|63aa|up_4|NC_019738.1_4018399_4018588_-	pfam07878, RHH_5, CopG-like RHH_1 or ribbon-helix-helix domain, RHH_5	NA|97aa|up_3|NC_019738.1_4018749_4019040_+	NA	NA|103aa|up_2|NC_019738.1_4019036_4019345_+	NA	NA|614aa|up_1|NC_019738.1_4019474_4021316_+	COG3472, COG3472, Uncharacterized conserved protein [Function unknown]	c2c5_V-U5|638aa|up_0|NC_019738.1_4021457_4023371_+	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|290aa|down_0|NC_019738.1_4024488_4025358_+	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|304aa|down_1|NC_019738.1_4025413_4026325_+	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|418aa|down_2|NC_019738.1_4026729_4027983_+	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|383aa|down_3|NC_019738.1_4028078_4029227_-	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|397aa|down_4|NC_019738.1_4029545_4030736_-	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|211aa|down_5|NC_019738.1_4030964_4031597_+	pfam08239, SH3_3, Bacterial SH3 domain	NA|59aa|down_6|NC_019738.1_4031765_4031942_-	NA	NA|79aa|down_7|NC_019738.1_4032308_4032545_+	NA	NA|88aa|down_8|NC_019738.1_4032656_4032920_-	NA	NA|848aa|down_9|NC_019738.1_4033177_4035721_+	sd00006, TPR, Tetratricopeptide repeat
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	19	4084977-4085098	18	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	CTGCCCCACTTAATGACAGCCAATCAGGATTATTCTATCT	40	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|144aa|up_3|NC_019738.1_4080143_4080575_+,NA|494aa|down_7|NC_019738.1_4094627_4096109_-	NA|140aa|up_9|NC_019738.1_4071577_4071997_+	pfam02410, RsfS, Ribosomal silencing factor during starvation	NA|167aa|up_8|NC_019738.1_4072003_4072504_+	pfam06799, DUF1230, Protein of unknown function (DUF1230)	NA|318aa|up_7|NC_019738.1_4072586_4073540_+	pfam06089, Asparaginase_II, L-asparaginase II	NA|968aa|up_6|NC_019738.1_4073840_4076744_-	PRK05743, ileS, isoleucyl-tRNA synthetase; Reviewed	NA|301aa|up_5|NC_019738.1_4076900_4077803_-	pfam07444, Ycf66_N, Ycf66 protein N-terminus	NA|483aa|up_4|NC_019738.1_4078084_4079533_+	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|144aa|up_3|NC_019738.1_4080143_4080575_+	NA	NA|406aa|up_2|NC_019738.1_4080794_4082012_-	COG1835, COG1835, Predicted acyltransferases [Lipid metabolism]	NA|536aa|up_1|NC_019738.1_4082244_4083852_+	pfam18105, PGM1_C, PGM1 C-terminal domain	NA|330aa|up_0|NC_019738.1_4083876_4084866_-	cd06583, PGRP, Peptidoglycan recognition proteins (PGRPs) are pattern recognition receptors that bind, and in certain cases, hydrolyze peptidoglycans (PGNs) of bacterial cell walls	NA|105aa|down_0|NC_019738.1_4085433_4085748_-	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|193aa|down_1|NC_019738.1_4085756_4086335_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|1277aa|down_2|NC_019738.1_4086505_4090336_-	PLN03241, PLN03241, magnesium chelatase subunit H; Provisional	NA|291aa|down_3|NC_019738.1_4090481_4091354_+	TIGR02196, Gene_56_protein, Glutaredoxin-like protein, YruB-family	NA|289aa|down_4|NC_019738.1_4091396_4092263_+	pfam02683, DsbD, Cytochrome C biogenesis protein transmembrane region	NA|355aa|down_5|NC_019738.1_4092322_4093387_-	cd08152, y4iL_like, Catalase-like heme-binding proteins similar to the uncharacterized y4iL	NA|325aa|down_6|NC_019738.1_4093592_4094567_-	pfam12275, DUF3616, Protein of unknown function (DUF3616)	NA|494aa|down_7|NC_019738.1_4094627_4096109_-	NA	NA|257aa|down_8|NC_019738.1_4096305_4097076_-	pfam10030, DUF2272, Uncharacterized protein conserved in bacteria (DUF2272)	NA|294aa|down_9|NC_019738.1_4097166_4098048_-	cd14486, 3D_domain, 3D domain, named for 3 conserved aspartate residues, is found in mltA-like lytic transglycosylases and numerous other contexts
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	20	4158003-4158281	5	CRT	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	GAAGACCCCTACGGTGACCCAGCNGAT	27	0	0	NA	NA	NA	4	4	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|240aa|up_6|NC_019738.1_4149272_4149992_+,NA|75aa|up_4|NC_019738.1_4153179_4153404_-,NA|145aa|up_2|NC_019738.1_4155343_4155778_+,NA|178aa|up_1|NC_019738.1_4156089_4156623_+,NA|170aa|up_0|NC_019738.1_4157297_4157807_+,NA|67aa|down_1|NC_019738.1_4161031_4161232_+,NA|136aa|down_7|NC_019738.1_4169824_4170232_+	NA|621aa|up_9|NC_019738.1_4141645_4143508_+	COG1352, CheR, Methylase of chemotaxis methyl-accepting proteins [Cell motility and secretion / Signal transduction mechanisms]	NA|743aa|up_8|NC_019738.1_4144170_4146399_+	pfam14326, DUF4384, Domain of unknown function (DUF4384)	NA|698aa|up_7|NC_019738.1_4146794_4148888_-	PRK05354, PRK05354, biosynthetic arginine decarboxylase	NA|240aa|up_6|NC_019738.1_4149272_4149992_+	NA	NA|827aa|up_5|NC_019738.1_4150051_4152532_-	pfam12770, CHAT, CHAT domain	NA|75aa|up_4|NC_019738.1_4153179_4153404_-	NA	NA|589aa|up_3|NC_019738.1_4153400_4155167_-	cd07484, Peptidases_S8_Thermitase_like, Peptidase S8 family domain in Thermitase-like proteins	NA|145aa|up_2|NC_019738.1_4155343_4155778_+	NA	NA|178aa|up_1|NC_019738.1_4156089_4156623_+	NA	NA|170aa|up_0|NC_019738.1_4157297_4157807_+	NA	NA|658aa|down_0|NC_019738.1_4158505_4160479_-	TIGR02042, Sulfite_reductase, ferredoxin-sulfite reductase	NA|67aa|down_1|NC_019738.1_4161031_4161232_+	NA	NA|180aa|down_2|NC_019738.1_4161416_4161956_-	pfam13505, OMP_b-brl, Outer membrane protein beta-barrel domain	NA|436aa|down_3|NC_019738.1_4162563_4163871_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|603aa|down_4|NC_019738.1_4164378_4166187_-	TIGR03423, pbp2_mrdA, penicillin-binding protein 2	NA|613aa|down_5|NC_019738.1_4166514_4168353_-	TIGR03423, pbp2_mrdA, penicillin-binding protein 2	NA|377aa|down_6|NC_019738.1_4168689_4169820_+	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|136aa|down_7|NC_019738.1_4169824_4170232_+	NA	NA|379aa|down_8|NC_019738.1_4170356_4171493_+	cd13590, PBP2_PotD_PotF_like, The periplasmic-binding component of ABC transporters involved in uptake of polyamines; possess the type 2 periplasmic binding fold	NA|332aa|down_9|NC_019738.1_4171792_4172788_+	COG1176, PotB, ABC-type spermidine/putrescine transport system, permease component I [Amino acid transport and metabolism]
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	21	4580266-4580523	6,6	CRT,PILER-CR	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	GTTTCCATTCAATTAGTTTTCCCAGCGAGTGGGAAG,GTTTCCATTCAATTAGTTTTCCCAGCGAGTGGGAAG	36,36	0	0	NA	NA	NA:NA	3,3	3	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|140aa|up_4|NC_019738.1_4572326_4572746_+,NA	NA|730aa|up_9|NC_019738.1_4560495_4562685_+	PRK01233, glyS, glycyl-tRNA synthetase subunit beta; Validated	NA|471aa|up_8|NC_019738.1_4562901_4564314_+	PRK02705, murD, UDP-N-acetylmuramoyl-L-alanine--D-glutamate ligase	NA|1827aa|up_7|NC_019738.1_4564468_4569949_-	TIGR01901, Heme/hemopexin-binding_protein, filamentous hemagglutinin family N-terminal domain	NA|306aa|up_6|NC_019738.1_4570353_4571271_-	pfam05721, PhyH, Phytanoyl-CoA dioxygenase (PhyH)	NA|243aa|up_5|NC_019738.1_4571419_4572148_-	pfam05721, PhyH, Phytanoyl-CoA dioxygenase (PhyH)	NA|140aa|up_4|NC_019738.1_4572326_4572746_+	NA	NA|169aa|up_3|NC_019738.1_4572910_4573417_-	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor	NA|1044aa|up_2|NC_019738.1_4573998_4577130_+	COG2205, KdpD, Osmosensitive K+ channel histidine kinase [Signal transduction mechanisms]	NA|304aa|up_1|NC_019738.1_4577343_4578255_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|374aa|up_0|NC_019738.1_4578981_4580103_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|213aa|down_0|NC_019738.1_4580827_4581466_-	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|304aa|down_1|NC_019738.1_4581650_4582562_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|352aa|down_2|NC_019738.1_4582633_4583689_+	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|302aa|down_3|NC_019738.1_4583990_4584896_-	PLN02953, PLN02953, phosphatidate cytidylyltransferase	NA|201aa|down_4|NC_019738.1_4585153_4585756_-	PRK07402, PRK07402, precorrin-6Y C5,15-methyltransferase subunit CbiT	NA|499aa|down_5|NC_019738.1_4585834_4587331_-	COG1982, LdcC, Arginine/lysine/ornithine decarboxylases [Amino acid transport and metabolism]	NA|90aa|down_6|NC_019738.1_4587723_4587993_+	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|364aa|down_7|NC_019738.1_4588051_4589143_+	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|814aa|down_8|NC_019738.1_4589631_4592073_+	COG4953, PbpC, Membrane carboxypeptidase/penicillin-binding protein PbpC [Cell envelope biogenesis, outer membrane]	NA|609aa|down_9|NC_019738.1_4592289_4594116_-	NF033203, entero_EhxA, enterohemolysin EhxA
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	22	6181631-6181740	19	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	TGCGATCGCACTTAGAGCAAACTCAA	26	0	0	NA	NA	NA	2	2	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|282aa|up_7|NC_019738.1_6168104_6168950_+,NA|47aa|down_1|NC_019738.1_6183798_6183939_+,NA|123aa|down_8|NC_019738.1_6192032_6192401_-,NA|446aa|down_9|NC_019738.1_6192444_6193782_-	NA|321aa|up_9|NC_019738.1_6165596_6166559_-	pfam13489, Methyltransf_23, Methyltransferase domain	NA|456aa|up_8|NC_019738.1_6166564_6167932_-	cd13123, MATE_MurJ_like, MurJ/MviN, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|282aa|up_7|NC_019738.1_6168104_6168950_+	NA	NA|658aa|up_6|NC_019738.1_6169001_6170975_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|269aa|up_5|NC_019738.1_6171013_6171820_+	COG1682, TagG, ABC-type polysaccharide/polyol phosphate export systems, permease component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|245aa|up_4|NC_019738.1_6171820_6172555_+	COG1134, TagH, ABC-type polysaccharide/polyol phosphate transport system, ATPase component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|438aa|up_3|NC_019738.1_6172605_6173919_+	pfam04932, Wzy_C, O-Antigen ligase	NA|1278aa|up_2|NC_019738.1_6173931_6177765_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|282aa|up_1|NC_019738.1_6177818_6178664_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|309aa|up_0|NC_019738.1_6179731_6180658_+	cd02526, GT2_RfbF_like, RfbF is a putative dTDP-rhamnosyl transferase	NA|528aa|down_0|NC_019738.1_6182074_6183658_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|47aa|down_1|NC_019738.1_6183798_6183939_+	NA	NA|286aa|down_2|NC_019738.1_6184700_6185558_+	cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|403aa|down_3|NC_019738.1_6185568_6186777_+	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|324aa|down_4|NC_019738.1_6186917_6187889_+	cd05260, GDP_MD_SDR_e, GDP-mannose 4,6 dehydratase, extended (e) SDRs	NA|250aa|down_5|NC_019738.1_6188281_6189031_+	cd00383, trans_reg_C, DNA-binding effector domain of two-component system response regulators	NA|465aa|down_6|NC_019738.1_6189240_6190635_+	PRK00093, PRK00093, GTP-binding protein Der; Reviewed	NA|329aa|down_7|NC_019738.1_6190641_6191628_+	pfam02361, CbiQ, Cobalt transport protein	NA|123aa|down_8|NC_019738.1_6192032_6192401_-	NA	NA|446aa|down_9|NC_019738.1_6192444_6193782_-	NA
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	23	6219520-6219795	7	CRT	no	Cas14c_CAS-V-F	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Unclear	CGAATTCATCNTCCAGNCTNGGAG	24	1	1	6219670-6219687	NC_019739.1_16272-16255	NA	6	6	TypeV	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA,NA|109aa|down_4|NC_019738.1_6228626_6228953_+,NA|216aa|down_8|NC_019738.1_6234474_6235122_-,NA|146aa|down_9|NC_019738.1_6235176_6235614_-	NA|220aa|up_9|NC_019738.1_6206292_6206952_-	PRK14419, PRK14419, membrane protein; Provisional	NA|418aa|up_8|NC_019738.1_6207004_6208258_-	pfam11285, DUF3086, Protein of unknown function (DUF3086)	NA|130aa|up_7|NC_019738.1_6208291_6208681_-	pfam11317, DUF3119, Protein of unknown function (DUF3119)	NA|265aa|up_6|NC_019738.1_6208839_6209634_-	PLN03100, PLN03100, Permease subunit of ER-derived-lipid transporter; Provisional	NA|990aa|up_5|NC_019738.1_6209706_6212676_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|177aa|up_4|NC_019738.1_6212769_6213300_-	cd00732, CheW, CheW, a small regulator protein, unique to the chemotaxis signalling in prokaryotes and archea	NA|122aa|up_3|NC_019738.1_6213320_6213686_-	cd19937, REC_OmpR_BsPhoP-like, phosphoacceptor receiver (REC) domain of BsPhoP-like OmpR family response regulators	NA|436aa|up_2|NC_019738.1_6213930_6215238_-	cd17602, REC_PatA-like, phosphoacceptor receiver (REC) domain of PatA and similar domains	NA|588aa|up_1|NC_019738.1_6216243_6218007_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|350aa|up_0|NC_019738.1_6218043_6219093_-	cd01992, PP-ATPase, N-terminal domain of predicted ATPase of the PP-loop faimly implicated in cell cycle control [Cell division and chromosome partitioning]	NA|361aa|down_0|NC_019738.1_6220510_6221593_-	CHL00045, ccsA, cytochrome c biogenesis protein	NA|680aa|down_1|NC_019738.1_6221788_6223828_-	pfam11832, DUF3352, Protein of unknown function (DUF3352)	NA|881aa|down_2|NC_019738.1_6224040_6226683_+	pfam12770, CHAT, CHAT domain	NA|389aa|down_3|NC_019738.1_6226695_6227862_-	PRK07360, PRK07360, FO synthase subunit 2; Reviewed	NA|109aa|down_4|NC_019738.1_6228626_6228953_+	NA	NA|234aa|down_5|NC_019738.1_6229227_6229929_-	COG0398, COG0398, Uncharacterized conserved protein [Function unknown]	NA|430aa|down_6|NC_019738.1_6230678_6231968_+	cd06442, DPM1_like, DPM1_like represents putative enzymes similar to eukaryotic DPM1	NA|123aa|down_7|NC_019738.1_6233971_6234340_-	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor	NA|216aa|down_8|NC_019738.1_6234474_6235122_-	NA	NA|146aa|down_9|NC_019738.1_6235176_6235614_-	NA
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	24	6364351-6364433	20	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	ATCAGCTCCAACTAACTGTGCCC	23	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA,NA|117aa|down_6|NC_019738.1_6371140_6371491_+	NA|116aa|up_9|NC_019738.1_6352345_6352693_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|105aa|up_8|NC_019738.1_6352693_6353008_-	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|204aa|up_7|NC_019738.1_6353086_6353698_-	pfam05685, Uma2, Putative restriction endonuclease	NA|405aa|up_6|NC_019738.1_6353958_6355173_+	cd03818, GT4_ExpC-like, Rhizobium meliloti ExpC and similar proteins	NA|559aa|up_5|NC_019738.1_6355169_6356846_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|267aa|up_4|NC_019738.1_6357487_6358288_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|569aa|up_3|NC_019738.1_6358305_6360012_+	COG1178, ThiP, ABC-type Fe3+ transport system, permease component [Inorganic ion transport and metabolism]	NA|172aa|up_2|NC_019738.1_6360012_6360528_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|351aa|up_1|NC_019738.1_6360751_6361804_-	cd13542, PBP2_FutA1_ilke, Substrate binding domain of ferric iron-binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|361aa|up_0|NC_019738.1_6362189_6363272_-	cd13542, PBP2_FutA1_ilke, Substrate binding domain of ferric iron-binding protein, a member of the type 2 periplasmic binding fold superfamily	NA|144aa|down_0|NC_019738.1_6364916_6365348_-	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|503aa|down_1|NC_019738.1_6365520_6367029_+	PRK14508, PRK14508, 4-alpha-glucanotransferase; Provisional	NA|267aa|down_2|NC_019738.1_6367091_6367892_-	COG1426, COG1426, Predicted transcriptional regulator contains Xre-like HTH domain [Function unknown]	NA|264aa|down_3|NC_019738.1_6367888_6368680_-	COG1187, RsuA, 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases [Translation, ribosomal structure and biogenesis]	NA|248aa|down_4|NC_019738.1_6369138_6369882_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|226aa|down_5|NC_019738.1_6370207_6370885_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|117aa|down_6|NC_019738.1_6371140_6371491_+	NA	NA|505aa|down_7|NC_019738.1_6371576_6373091_+	CHL00195, ycf46, Ycf46; Provisional	NA|118aa|down_8|NC_019738.1_6373239_6373593_+	pfam06868, DUF1257, Protein of unknown function (DUF1257)	NA|480aa|down_9|NC_019738.1_6373748_6375188_-	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	25	6576970-6577047	21	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	CCAGGATGTAACAACACATCACCA	24	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|106aa|up_8|NC_019738.1_6563846_6564164_-,NA|275aa|up_2|NC_019738.1_6571148_6571973_+,NA|67aa|down_1|NC_019738.1_6582835_6583036_-,NA|158aa|down_2|NC_019738.1_6583056_6583530_-,NA|140aa|down_4|NC_019738.1_6585296_6585716_-,NA|73aa|down_8|NC_019738.1_6589970_6590189_-	NA|278aa|up_9|NC_019738.1_6562726_6563560_+	PRK07396, PRK07396, dihydroxynaphthoic acid synthetase; Validated	NA|106aa|up_8|NC_019738.1_6563846_6564164_-	NA	NA|482aa|up_7|NC_019738.1_6565133_6566579_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|254aa|up_6|NC_019738.1_6566675_6567437_+	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|257aa|up_5|NC_019738.1_6567869_6568640_+	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|116aa|up_4|NC_019738.1_6569005_6569353_+	PRK09381, trxA, thioredoxin TrxA	NA|398aa|up_3|NC_019738.1_6569559_6570753_+	PRK07366, PRK07366, LL-diaminopimelate aminotransferase	NA|275aa|up_2|NC_019738.1_6571148_6571973_+	NA	NA|602aa|up_1|NC_019738.1_6572142_6573948_-	cd09610, M3B_PepF, Peptidase family M3B, oligopeptidase F (PepF)	NA|669aa|up_0|NC_019738.1_6574470_6576477_+	PRK00558, uvrC, excinuclease ABC subunit UvrC	NA|1042aa|down_0|NC_019738.1_6577558_6580684_+	pfam12770, CHAT, CHAT domain	NA|67aa|down_1|NC_019738.1_6582835_6583036_-	NA	NA|158aa|down_2|NC_019738.1_6583056_6583530_-	NA	NA|483aa|down_3|NC_019738.1_6583725_6585174_-	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|140aa|down_4|NC_019738.1_6585296_6585716_-	NA	NA|491aa|down_5|NC_019738.1_6586202_6587675_-	PRK00654, glgA, glycogen synthase GlgA	NA|377aa|down_6|NC_019738.1_6587891_6589022_-	COG4240, COG4240, Predicted kinase [General function prediction only]	NA|119aa|down_7|NC_019738.1_6589026_6589383_-	pfam04483, DUF565, Protein of unknown function (DUF565)	NA|73aa|down_8|NC_019738.1_6589970_6590189_-	NA	NA|333aa|down_9|NC_019738.1_6590515_6591514_+	cd19100, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	26	6882717-6882875	22	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	AGGTTAGCACCGCTCAAATTCGCC	24	0	0	NA	NA	NA	2	2	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA,NA|136aa|down_2|NC_019738.1_6886505_6886913_-,NA|64aa|down_4|NC_019738.1_6891881_6892073_+	NA|504aa|up_9|NC_019738.1_6868169_6869681_+	TIGR02980, SigBFG, RNA polymerase sigma-70 factor, sigma-B/F/G subfamily	NA|461aa|up_8|NC_019738.1_6869702_6871085_+	pfam08852, DUF1822, Protein of unknown function (DUF1822)	NA|837aa|up_7|NC_019738.1_6871280_6873791_+	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|492aa|up_6|NC_019738.1_6873867_6875343_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|258aa|up_5|NC_019738.1_6875412_6876186_+	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|454aa|up_4|NC_019738.1_6876358_6877720_+	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|196aa|up_3|NC_019738.1_6877843_6878431_+	COG4278, COG4278, Uncharacterized conserved protein [Function unknown]	NA|479aa|up_2|NC_019738.1_6878544_6879981_-	cd03800, GT4_sucrose_synthase, sucrose-phosphate synthase and similar proteins	NA|333aa|up_1|NC_019738.1_6880293_6881292_-	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|147aa|up_0|NC_019738.1_6881686_6882127_-	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|114aa|down_0|NC_019738.1_6884712_6885054_-	pfam05542, DUF760, Protein of unknown function (DUF760)	NA|185aa|down_1|NC_019738.1_6885564_6886119_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|136aa|down_2|NC_019738.1_6886505_6886913_-	NA	NA|266aa|down_3|NC_019738.1_6888927_6889725_-	cd07402, MPP_GpdQ, Enterobacter aerogenes GpdQ and related proteins, metallophosphatase domain	NA|64aa|down_4|NC_019738.1_6891881_6892073_+	NA	NA|543aa|down_5|NC_019738.1_6893922_6895551_-	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|469aa|down_6|NC_019738.1_6897376_6898783_-	cd17253, RMtype1_S_Eco933I-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit TRD-CR, similar to Escherichia coli O157:H7 EDL933 S subunit (S	NA|270aa|down_7|NC_019738.1_6898796_6899606_-	pfam08463, EcoEI_R_C, EcoEI R protein C-terminal	NA|135aa|down_8|NC_019738.1_6899841_6900246_-	TIGR01862, Nitrogenase_molybdenum-iron_protein_alpha_chain, nitrogenase component I, alpha chain	NA|188aa|down_9|NC_019738.1_6900319_6900883_-	pfam06527, TniQ, TniQ
GCF_000317515.1_ASM31751v1	NC_019738	Microcoleus sp. PCC 7113, complete sequence	27	7016701-7016839	23	CRISPRCasFinder	no		c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS	Orphan	CACCCGCATCAGAGGAGGATAACTACTTTCAGCAATTGCA	40	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|117aa|up_2|NC_019738.1_7013036_7013387_+,NA|50aa|down_8|NC_019738.1_7032372_7032522_+,NA|192aa|down_9|NC_019738.1_7032565_7033141_-	NA|226aa|up_9|NC_019738.1_7006756_7007434_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|376aa|up_8|NC_019738.1_7007637_7008765_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|192aa|up_7|NC_019738.1_7009005_7009581_+	pfam14065, DUF4255, Protein of unknown function (DUF4255)	NA|546aa|up_6|NC_019738.1_7009624_7011262_+	COG3497, COG3497, Phage tail sheath protein FI [General function prediction only]	NA|158aa|up_5|NC_019738.1_7011373_7011847_+	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19	NA|158aa|up_4|NC_019738.1_7011902_7012376_+	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19	NA|161aa|up_3|NC_019738.1_7012547_7013030_+	pfam06841, Phage_T4_gp19, T4-like virus tail tube protein gp19	NA|117aa|up_2|NC_019738.1_7013036_7013387_+	NA	NA|127aa|up_1|NC_019738.1_7013436_7013817_+	pfam10109, Phage_TAC_7, Phage tail assembly chaperone proteins, E, or 41 or 14	NA|259aa|up_0|NC_019738.1_7014023_7014800_-	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|161aa|down_0|NC_019738.1_7017052_7017535_+	cd02215, cupin_QDO_N_C, quercetinase, N- and C-terminal cupin domains	NA|146aa|down_1|NC_019738.1_7019403_7019841_+	pfam04965, GPW_gp25, Gene 25-like lysozyme	NA|735aa|down_2|NC_019738.1_7019986_7022191_+	TIGR02243, hypothetical_protein_SCD8A	NA|349aa|down_3|NC_019738.1_7022384_7023431_+	TIGR02242, putative_secreted_protein, phage tail protein domain	NA|353aa|down_4|NC_019738.1_7025111_7026170_+	pfam11845, DUF3365, Protein of unknown function (DUF3365)	NA|707aa|down_5|NC_019738.1_7026175_7028296_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|972aa|down_6|NC_019738.1_7028492_7031408_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|191aa|down_7|NC_019738.1_7031472_7032045_-	pfam05685, Uma2, Putative restriction endonuclease	NA|50aa|down_8|NC_019738.1_7032372_7032522_+	NA	NA|192aa|down_9|NC_019738.1_7032565_7033141_-	NA
GCF_000317515.1_ASM31751v1	NC_019740	Microcoleus sp. PCC 7113 plasmid pMIC7113.03, complete sequence	1	8045-8232	1	PILER-CR	no	cas1,cas2,cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5	cas1,cas2,cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5,cas10,WYL,csx3,cas3,cas4	Type III-A	ATAGTTTCCGTCCCCGTGAAGGGGAAGTGAATTGAAACC	39	0	0	NA	NA	NA	2	2	TypeIII-A	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|163aa|up_6|NC_019740.1_957_1446_-,NA|99aa|up_5|NC_019740.1_1726_2023_-,NA|182aa|up_4|NC_019740.1_2019_2565_-,NA|81aa|up_3|NC_019740.1_2752_2995_+,NA|180aa|down_3|NC_019740.1_13461_14001_+,NA|101aa|down_4|NC_019740.1_14090_14393_+,NA|148aa|down_5|NC_019740.1_14670_15114_+,csx21|207aa|down_8|NC_019740.1_18578_19199_-	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|163aa|up_6|NC_019740.1_957_1446_-	NA	NA|99aa|up_5|NC_019740.1_1726_2023_-	NA	NA|182aa|up_4|NC_019740.1_2019_2565_-	NA	NA|81aa|up_3|NC_019740.1_2752_2995_+	NA	NA|90aa|up_2|NC_019740.1_3274_3544_-	pfam07878, RHH_5, CopG-like RHH_1 or ribbon-helix-helix domain, RHH_5	NA|198aa|up_1|NC_019740.1_3540_4134_-	cd02042, ParAB_family, partition proteins ParAB family	NA|1205aa|up_0|NC_019740.1_4300_7915_-	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	cas1|668aa|down_0|NC_019740.1_8789_10793_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas1|334aa|down_1|NC_019740.1_11987_12989_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|94aa|down_2|NC_019740.1_13028_13310_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|180aa|down_3|NC_019740.1_13461_14001_+	NA	NA|101aa|down_4|NC_019740.1_14090_14393_+	NA	NA|148aa|down_5|NC_019740.1_14670_15114_+	NA	NA|280aa|down_6|NC_019740.1_16430_17270_+	TIGR02710, conserved_hypothetical_protein, CRISPR-associated protein, TIGR02710 family	cas6|376aa|down_7|NC_019740.1_17270_18398_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csx21|207aa|down_8|NC_019740.1_18578_19199_-	NA	csm3gr7|369aa|down_9|NC_019740.1_19199_20306_-	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein
GCF_000317515.1_ASM31751v1	NC_019740	Microcoleus sp. PCC 7113 plasmid pMIC7113.03, complete sequence	2	11240-11789	2,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5,cas10	cas1,cas2,cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5,cas10,WYL,csx3,cas3,cas4	Type III-B,Type III-D,Type III-A,Type III-C	GTTTCCGTCCCCGTAAAGGGGAAGAGATTTGAAAC,GTTTCAAATCTCTTCCCCTTTACGGGGACGGAAAC,GTTTCAAATCTCTTCCCCTTTACGGGGACGGAAAC	35,35,35	0	0	NA	NA	NA:NA:NA	7,7,7	7	TypeIII-B,TypeIII-D,TypeIII-A,TypeIII-C	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|163aa|up_7|NC_019740.1_957_1446_-,NA|99aa|up_6|NC_019740.1_1726_2023_-,NA|182aa|up_5|NC_019740.1_2019_2565_-,NA|81aa|up_4|NC_019740.1_2752_2995_+,NA|180aa|down_2|NC_019740.1_13461_14001_+,NA|101aa|down_3|NC_019740.1_14090_14393_+,NA|148aa|down_4|NC_019740.1_14670_15114_+,csx21|207aa|down_7|NC_019740.1_18578_19199_-,csx19|147aa|down_9|NC_019740.1_20302_20743_-	NA|NA	NA	NA|NA	NA	NA|163aa|up_7|NC_019740.1_957_1446_-	NA	NA|99aa|up_6|NC_019740.1_1726_2023_-	NA	NA|182aa|up_5|NC_019740.1_2019_2565_-	NA	NA|81aa|up_4|NC_019740.1_2752_2995_+	NA	NA|90aa|up_3|NC_019740.1_3274_3544_-	pfam07878, RHH_5, CopG-like RHH_1 or ribbon-helix-helix domain, RHH_5	NA|198aa|up_2|NC_019740.1_3540_4134_-	cd02042, ParAB_family, partition proteins ParAB family	NA|1205aa|up_1|NC_019740.1_4300_7915_-	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	cas1|668aa|up_0|NC_019740.1_8789_10793_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas1|334aa|down_0|NC_019740.1_11987_12989_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|94aa|down_1|NC_019740.1_13028_13310_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|180aa|down_2|NC_019740.1_13461_14001_+	NA	NA|101aa|down_3|NC_019740.1_14090_14393_+	NA	NA|148aa|down_4|NC_019740.1_14670_15114_+	NA	NA|280aa|down_5|NC_019740.1_16430_17270_+	TIGR02710, conserved_hypothetical_protein, CRISPR-associated protein, TIGR02710 family	cas6|376aa|down_6|NC_019740.1_17270_18398_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csx21|207aa|down_7|NC_019740.1_18578_19199_-	NA	csm3gr7|369aa|down_8|NC_019740.1_19199_20306_-	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csx19|147aa|down_9|NC_019740.1_20302_20743_-	NA
GCF_000317515.1_ASM31751v1	NC_019740	Microcoleus sp. PCC 7113 plasmid pMIC7113.03, complete sequence	3	15495-16257	3,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5,cas10	cas1,cas2,cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5,cas10,WYL,csx3,cas3,cas4	Type III-B,Type III-D,Type III-A,Type III-C	GTTTCCGTCCCCGTAAAGGGGAAGTTAATTGAAAC,GTTTCAATTAACTTCCCCTTTACGGGGACGGAAAC,GTTTCAATTAACTTCCCCTTTACGGGGACGGAAAC	35,35,35	0	0	NA	NA	NA:NA:NA	10,10,10	10	TypeIII-B,TypeIII-D,TypeIII-A,TypeIII-C	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|81aa|up_9|NC_019740.1_2752_2995_+,NA|180aa|up_2|NC_019740.1_13461_14001_+,NA|101aa|up_1|NC_019740.1_14090_14393_+,NA|148aa|up_0|NC_019740.1_14670_15114_+,csx21|207aa|down_2|NC_019740.1_18578_19199_-,csx19|147aa|down_4|NC_019740.1_20302_20743_-,PD-DExK|207aa|down_6|NC_019740.1_21722_22343_-,PD-DExK|207aa|down_7|NC_019740.1_22370_22991_-	NA|81aa|up_9|NC_019740.1_2752_2995_+	NA	NA|90aa|up_8|NC_019740.1_3274_3544_-	pfam07878, RHH_5, CopG-like RHH_1 or ribbon-helix-helix domain, RHH_5	NA|198aa|up_7|NC_019740.1_3540_4134_-	cd02042, ParAB_family, partition proteins ParAB family	NA|1205aa|up_6|NC_019740.1_4300_7915_-	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	cas1|668aa|up_5|NC_019740.1_8789_10793_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas1|334aa|up_4|NC_019740.1_11987_12989_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|94aa|up_3|NC_019740.1_13028_13310_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|180aa|up_2|NC_019740.1_13461_14001_+	NA	NA|101aa|up_1|NC_019740.1_14090_14393_+	NA	NA|148aa|up_0|NC_019740.1_14670_15114_+	NA	NA|280aa|down_0|NC_019740.1_16430_17270_+	TIGR02710, conserved_hypothetical_protein, CRISPR-associated protein, TIGR02710 family	cas6|376aa|down_1|NC_019740.1_17270_18398_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csx21|207aa|down_2|NC_019740.1_18578_19199_-	NA	csm3gr7|369aa|down_3|NC_019740.1_19199_20306_-	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csx19|147aa|down_4|NC_019740.1_20302_20743_-	NA	csm3gr7|318aa|down_5|NC_019740.1_20739_21693_-	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	PD-DExK|207aa|down_6|NC_019740.1_21722_22343_-	NA	PD-DExK|207aa|down_7|NC_019740.1_22370_22991_-	NA	csm3gr7|298aa|down_8|NC_019740.1_22977_23871_-	cd09683, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm6|374aa|down_9|NC_019740.1_23908_25030_-	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6
GCF_000317515.1_ASM31751v1	NC_019740	Microcoleus sp. PCC 7113 plasmid pMIC7113.03, complete sequence	4	30328-30581	4,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2,cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5,cas10,WYL,csx3,cas3	cas1,cas2,cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5,cas10,WYL,csx3,cas3,cas4	Type III-B,Type III-D,Type III-A,Type III-C	GTTTACGCAAGCACTTCCCCGCAAGGGGATGGAAACG,GTTTACGCAAGCACTTCCCCGCAAGGGGATGGAAAC,GTTTACGCAAGCACTTCCCCGCAAGGGGATGGAAACG	37,36,37	0	0	NA	NA	NA:NA:NA	3,3,3	3	TypeIII-B,TypeIII-D,TypeIII-A,TypeIII-C	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	csx19|147aa|up_9|NC_019740.1_20302_20743_-,PD-DExK|207aa|up_7|NC_019740.1_21722_22343_-,PD-DExK|207aa|up_6|NC_019740.1_22370_22991_-,csm2gr11|138aa|up_3|NC_019740.1_25075_25489_-,NA|103aa|down_3|NC_019740.1_35682_35991_+,NA|72aa|down_8|NC_019740.1_39919_40135_+	csx19|147aa|up_9|NC_019740.1_20302_20743_-	NA	csm3gr7|318aa|up_8|NC_019740.1_20739_21693_-	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	PD-DExK|207aa|up_7|NC_019740.1_21722_22343_-	NA	PD-DExK|207aa|up_6|NC_019740.1_22370_22991_-	NA	csm3gr7|298aa|up_5|NC_019740.1_22977_23871_-	cd09683, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm6|374aa|up_4|NC_019740.1_23908_25030_-	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	csm2gr11|138aa|up_3|NC_019740.1_25075_25489_-	NA	csx10gr5|442aa|up_2|NC_019740.1_25485_26811_-	TIGR02674, cas_cyan_RAMP_2, CRISPR-associated RAMP protein, Csx10 family	csm3gr7|239aa|up_1|NC_019740.1_26807_27524_-	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas10|769aa|up_0|NC_019740.1_27528_29835_-	TIGR02577, thermophile-specific_DNA_repair_system, CRISPR-associated protein Cas10/Cmr2, subtype III-B	NA|796aa|down_0|NC_019740.1_30816_33204_+	pfam08722, Tn7_Tnp_TnsA_N, TnsA endonuclease N terminal	NA|342aa|down_1|NC_019740.1_33329_34355_+	pfam13401, AAA_22, AAA domain	NA|408aa|down_2|NC_019740.1_34344_35568_+	pfam06527, TniQ, TniQ	NA|103aa|down_3|NC_019740.1_35682_35991_+	NA	WYL|455aa|down_4|NC_019740.1_36113_37478_-	TIGR03985, hypothetical_protein_sll7078, CRISPR-associated protein, TIGR03985 family	csx3|110aa|down_5|NC_019740.1_37552_37882_+	pfam09620, Cas_csx3, CRISPR-associated protein (Cas_csx3)	csx3|301aa|down_6|NC_019740.1_37957_38860_+	pfam09620, Cas_csx3, CRISPR-associated protein (Cas_csx3)	NA|196aa|down_7|NC_019740.1_39173_39761_+	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|72aa|down_8|NC_019740.1_39919_40135_+	NA	NA|987aa|down_9|NC_019740.1_40537_43498_+	cd07477, Peptidases_S8_Subtilisin_subset, Peptidase S8 family domain in Subtilisin proteins
GCF_000317515.1_ASM31751v1	NC_019740	Microcoleus sp. PCC 7113 plasmid pMIC7113.03, complete sequence	5	35746-35912	4	CRISPRCasFinder	no	cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5,cas10,WYL,csx3,cas3,cas4,cas1	cas1,cas2,cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5,cas10,WYL,csx3,cas3,cas4	Type III-B,Type III-D,Type III-A,Type III-C	CTTCCCCGCAAGGGGATGGAAAC	23	0	0	NA	NA	NA	2	2	TypeIII-B,TypeIII-D,TypeIII-A,TypeIII-C	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	PD-DExK|207aa|up_9|NC_019740.1_22370_22991_-,csm2gr11|138aa|up_6|NC_019740.1_25075_25489_-,NA|72aa|down_4|NC_019740.1_39919_40135_+,PD-DExK|207aa|down_6|NC_019740.1_43728_44349_+	PD-DExK|207aa|up_9|NC_019740.1_22370_22991_-	NA	csm3gr7|298aa|up_8|NC_019740.1_22977_23871_-	cd09683, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm6|374aa|up_7|NC_019740.1_23908_25030_-	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	csm2gr11|138aa|up_6|NC_019740.1_25075_25489_-	NA	csx10gr5|442aa|up_5|NC_019740.1_25485_26811_-	TIGR02674, cas_cyan_RAMP_2, CRISPR-associated RAMP protein, Csx10 family	csm3gr7|239aa|up_4|NC_019740.1_26807_27524_-	COG1337, COG1337, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas10|769aa|up_3|NC_019740.1_27528_29835_-	TIGR02577, thermophile-specific_DNA_repair_system, CRISPR-associated protein Cas10/Cmr2, subtype III-B	NA|796aa|up_2|NC_019740.1_30816_33204_+	pfam08722, Tn7_Tnp_TnsA_N, TnsA endonuclease N terminal	NA|342aa|up_1|NC_019740.1_33329_34355_+	pfam13401, AAA_22, AAA domain	NA|408aa|up_0|NC_019740.1_34344_35568_+	pfam06527, TniQ, TniQ	WYL|455aa|down_0|NC_019740.1_36113_37478_-	TIGR03985, hypothetical_protein_sll7078, CRISPR-associated protein, TIGR03985 family	csx3|110aa|down_1|NC_019740.1_37552_37882_+	pfam09620, Cas_csx3, CRISPR-associated protein (Cas_csx3)	csx3|301aa|down_2|NC_019740.1_37957_38860_+	pfam09620, Cas_csx3, CRISPR-associated protein (Cas_csx3)	NA|196aa|down_3|NC_019740.1_39173_39761_+	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|72aa|down_4|NC_019740.1_39919_40135_+	NA	NA|987aa|down_5|NC_019740.1_40537_43498_+	cd07477, Peptidases_S8_Subtilisin_subset, Peptidase S8 family domain in Subtilisin proteins	PD-DExK|207aa|down_6|NC_019740.1_43728_44349_+	NA	NA|207aa|down_7|NC_019740.1_44487_45108_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	cas3|238aa|down_8|NC_019740.1_45150_45864_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas3|721aa|down_9|NC_019740.1_45960_48123_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]
GCF_000317515.1_ASM31751v1	NC_019740	Microcoleus sp. PCC 7113 plasmid pMIC7113.03, complete sequence	6	48978-49085	5	CRISPRCasFinder	no	WYL,csx3,PD-DExK,cas3,cas6,cas4,cas1	cas1,cas2,cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5,cas10,WYL,csx3,cas3,cas4	Unclear	GTTTCAGTCCCCTTACGGGGATTAAGTTCGTG	32	0	0	NA	NA	NA	1	1	Unclear	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|72aa|up_6|NC_019740.1_39919_40135_+,PD-DExK|207aa|up_4|NC_019740.1_43728_44349_+,NA|106aa|down_9|NC_019740.1_63280_63598_-	csx3|110aa|up_9|NC_019740.1_37552_37882_+	pfam09620, Cas_csx3, CRISPR-associated protein (Cas_csx3)	csx3|301aa|up_8|NC_019740.1_37957_38860_+	pfam09620, Cas_csx3, CRISPR-associated protein (Cas_csx3)	NA|196aa|up_7|NC_019740.1_39173_39761_+	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|72aa|up_6|NC_019740.1_39919_40135_+	NA	NA|987aa|up_5|NC_019740.1_40537_43498_+	cd07477, Peptidases_S8_Subtilisin_subset, Peptidase S8 family domain in Subtilisin proteins	PD-DExK|207aa|up_4|NC_019740.1_43728_44349_+	NA	NA|207aa|up_3|NC_019740.1_44487_45108_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	cas3|238aa|up_2|NC_019740.1_45150_45864_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas3|721aa|up_1|NC_019740.1_45960_48123_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|240aa|up_0|NC_019740.1_48205_48925_+	cd03039, GST_N_Sigma_like, GST_N family, Class Sigma_like; composed of GSTs belonging to class Sigma and similar proteins, including GSTs from class Mu, Pi and Alpha	cas6|280aa|down_0|NC_019740.1_49495_50335_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|436aa|down_1|NC_019740.1_50766_52074_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|90aa|down_2|NC_019740.1_52324_52594_-	pfam05635, 23S_rRNA_IVP, 23S rRNA-intervening sequence protein	cas4|195aa|down_3|NC_019740.1_52678_53263_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|336aa|down_4|NC_019740.1_53264_54272_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|796aa|down_5|NC_019740.1_55432_57820_+	pfam08722, Tn7_Tnp_TnsA_N, TnsA endonuclease N terminal	NA|342aa|down_6|NC_019740.1_57945_58971_+	pfam13401, AAA_22, AAA domain	NA|408aa|down_7|NC_019740.1_58960_60184_+	pfam06527, TniQ, TniQ	NA|737aa|down_8|NC_019740.1_60662_62873_-	TIGR01448, recD_rel, helicase, putative, RecD/TraA family	NA|106aa|down_9|NC_019740.1_63280_63598_-	NA
GCF_000317515.1_ASM31751v1	NC_019740	Microcoleus sp. PCC 7113 plasmid pMIC7113.03, complete sequence	7	54490-55198	5,6,4	PILER-CR,CRISPRCasFinder,CRT	no	WYL,csx3,PD-DExK,cas3,cas6,cas4,cas1	cas1,cas2,cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5,cas10,WYL,csx3,cas3,cas4	Unclear	CTTTCCGCGAACTTAATCCCCGCAAGGGGACTGAAAC,CTTTCCGCGAACTTAATCCCCGCAAGGGGACTGAAACA,CTTTCCGCGAACTTAATCCCCGCAAGGGGACTGAAAC	37,38,37	0	0	NA	NA	NA:NA:NA	9,9,9	9	Unclear	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	PD-DExK|207aa|up_9|NC_019740.1_43728_44349_+,NA|106aa|down_4|NC_019740.1_63280_63598_-,NA|65aa|down_6|NC_019740.1_65117_65312_+,NA|1130aa|down_8|NC_019740.1_67580_70970_+,NA|268aa|down_9|NC_019740.1_71196_72000_-	PD-DExK|207aa|up_9|NC_019740.1_43728_44349_+	NA	NA|207aa|up_8|NC_019740.1_44487_45108_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	cas3|238aa|up_7|NC_019740.1_45150_45864_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas3|721aa|up_6|NC_019740.1_45960_48123_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|240aa|up_5|NC_019740.1_48205_48925_+	cd03039, GST_N_Sigma_like, GST_N family, Class Sigma_like; composed of GSTs belonging to class Sigma and similar proteins, including GSTs from class Mu, Pi and Alpha	cas6|280aa|up_4|NC_019740.1_49495_50335_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|436aa|up_3|NC_019740.1_50766_52074_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|90aa|up_2|NC_019740.1_52324_52594_-	pfam05635, 23S_rRNA_IVP, 23S rRNA-intervening sequence protein	cas4|195aa|up_1|NC_019740.1_52678_53263_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|336aa|up_0|NC_019740.1_53264_54272_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|796aa|down_0|NC_019740.1_55432_57820_+	pfam08722, Tn7_Tnp_TnsA_N, TnsA endonuclease N terminal	NA|342aa|down_1|NC_019740.1_57945_58971_+	pfam13401, AAA_22, AAA domain	NA|408aa|down_2|NC_019740.1_58960_60184_+	pfam06527, TniQ, TniQ	NA|737aa|down_3|NC_019740.1_60662_62873_-	TIGR01448, recD_rel, helicase, putative, RecD/TraA family	NA|106aa|down_4|NC_019740.1_63280_63598_-	NA	NA|271aa|down_5|NC_019740.1_63749_64562_-	TIGR00675, Modification_methylase, DNA-methyltransferase (dcm)	NA|65aa|down_6|NC_019740.1_65117_65312_+	NA	NA|459aa|down_7|NC_019740.1_65674_67051_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|1130aa|down_8|NC_019740.1_67580_70970_+	NA	NA|268aa|down_9|NC_019740.1_71196_72000_-	NA
GCF_000317515.1_ASM31751v1	NC_019740	Microcoleus sp. PCC 7113 plasmid pMIC7113.03, complete sequence	8	60346-60607	6,7,5	PILER-CR,CRISPRCasFinder,CRT	no	PD-DExK,cas3,cas6,cas4,cas1	cas1,cas2,cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5,cas10,WYL,csx3,cas3,cas4	Unclear	GAACTTTCCGCGAACTTAATCCCCGCAAGGGGACTGAAAC,CTTTCCGCGAACTTAATCCCCGCAAGGGGACTGAAAC,CTTTCCGCGAACTTAATCCCCGCAAGGGGACTGAAAC	40,37,37	0	0	NA	NA	NA:NA:NA	3,3,3	3	Unclear	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA,NA|106aa|down_1|NC_019740.1_63280_63598_-,NA|65aa|down_3|NC_019740.1_65117_65312_+,NA|1130aa|down_5|NC_019740.1_67580_70970_+,NA|268aa|down_6|NC_019740.1_71196_72000_-,NA|133aa|down_7|NC_019740.1_72262_72661_+,NA|103aa|down_8|NC_019740.1_73080_73389_+,NA|85aa|down_9|NC_019740.1_73771_74026_-	cas3|721aa|up_9|NC_019740.1_45960_48123_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|240aa|up_8|NC_019740.1_48205_48925_+	cd03039, GST_N_Sigma_like, GST_N family, Class Sigma_like; composed of GSTs belonging to class Sigma and similar proteins, including GSTs from class Mu, Pi and Alpha	cas6|280aa|up_7|NC_019740.1_49495_50335_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|436aa|up_6|NC_019740.1_50766_52074_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|90aa|up_5|NC_019740.1_52324_52594_-	pfam05635, 23S_rRNA_IVP, 23S rRNA-intervening sequence protein	cas4|195aa|up_4|NC_019740.1_52678_53263_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|336aa|up_3|NC_019740.1_53264_54272_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|796aa|up_2|NC_019740.1_55432_57820_+	pfam08722, Tn7_Tnp_TnsA_N, TnsA endonuclease N terminal	NA|342aa|up_1|NC_019740.1_57945_58971_+	pfam13401, AAA_22, AAA domain	NA|408aa|up_0|NC_019740.1_58960_60184_+	pfam06527, TniQ, TniQ	NA|737aa|down_0|NC_019740.1_60662_62873_-	TIGR01448, recD_rel, helicase, putative, RecD/TraA family	NA|106aa|down_1|NC_019740.1_63280_63598_-	NA	NA|271aa|down_2|NC_019740.1_63749_64562_-	TIGR00675, Modification_methylase, DNA-methyltransferase (dcm)	NA|65aa|down_3|NC_019740.1_65117_65312_+	NA	NA|459aa|down_4|NC_019740.1_65674_67051_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|1130aa|down_5|NC_019740.1_67580_70970_+	NA	NA|268aa|down_6|NC_019740.1_71196_72000_-	NA	NA|133aa|down_7|NC_019740.1_72262_72661_+	NA	NA|103aa|down_8|NC_019740.1_73080_73389_+	NA	NA|85aa|down_9|NC_019740.1_73771_74026_-	NA
GCF_000317515.1_ASM31751v1	NC_019740	Microcoleus sp. PCC 7113 plasmid pMIC7113.03, complete sequence	9	63682-63769	8	CRISPRCasFinder	no	PD-DExK,cas3,cas6,cas4,cas1	cas1,cas2,cas6,csx21,csm3gr7,csx19,PD-DExK,csm6,csm2gr11,csx10gr5,cas10,WYL,csx3,cas3,cas4	Unclear	GCTTTTCACTTAGTTCTAGGGGATT	25	0	0	NA	NA	NA	1	1	Unclear	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA|106aa|up_0|NC_019740.1_63280_63598_-,NA|65aa|down_0|NC_019740.1_65117_65312_+,NA|1130aa|down_2|NC_019740.1_67580_70970_+,NA|268aa|down_3|NC_019740.1_71196_72000_-,NA|133aa|down_4|NC_019740.1_72262_72661_+,NA|103aa|down_5|NC_019740.1_73080_73389_+,NA|85aa|down_6|NC_019740.1_73771_74026_-,NA|68aa|down_9|NC_019740.1_76808_77012_-	cas6|280aa|up_9|NC_019740.1_49495_50335_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|436aa|up_8|NC_019740.1_50766_52074_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|90aa|up_7|NC_019740.1_52324_52594_-	pfam05635, 23S_rRNA_IVP, 23S rRNA-intervening sequence protein	cas4|195aa|up_6|NC_019740.1_52678_53263_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|336aa|up_5|NC_019740.1_53264_54272_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|796aa|up_4|NC_019740.1_55432_57820_+	pfam08722, Tn7_Tnp_TnsA_N, TnsA endonuclease N terminal	NA|342aa|up_3|NC_019740.1_57945_58971_+	pfam13401, AAA_22, AAA domain	NA|408aa|up_2|NC_019740.1_58960_60184_+	pfam06527, TniQ, TniQ	NA|737aa|up_1|NC_019740.1_60662_62873_-	TIGR01448, recD_rel, helicase, putative, RecD/TraA family	NA|106aa|up_0|NC_019740.1_63280_63598_-	NA	NA|65aa|down_0|NC_019740.1_65117_65312_+	NA	NA|459aa|down_1|NC_019740.1_65674_67051_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|1130aa|down_2|NC_019740.1_67580_70970_+	NA	NA|268aa|down_3|NC_019740.1_71196_72000_-	NA	NA|133aa|down_4|NC_019740.1_72262_72661_+	NA	NA|103aa|down_5|NC_019740.1_73080_73389_+	NA	NA|85aa|down_6|NC_019740.1_73771_74026_-	NA	NA|318aa|down_7|NC_019740.1_74027_74981_-	pfam17989, ALP_N, Actin like proteins N terminal domain	NA|402aa|down_8|NC_019740.1_75191_76397_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|68aa|down_9|NC_019740.1_76808_77012_-	NA
GCF_000317515.1_ASM31751v1	NC_019741	Microcoleus sp. PCC 7113 plasmid pMIC7113.05, complete sequence	1	36434-36542	1	CRISPRCasFinder	no			Orphan	GTCGAGGTCGAGGTCGTCGCGGGTGAGG	28	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14c_CAS-V-F,PD-DExK,RT,cas14j,csa3,Cas9_archaeal,DinG,c2c10_CAS-V-U3,csx3,WYL,csx1,cas10,csm3gr7,csx10gr5,csx19,c2c8_V-U2,Cas14u_CAS-V,cas2,cas1,cas4,cas6,cas3,csc1gr5,csc2gr7,cas10d,c2c5_V-U5,DEDDh,2OG_CAS,csx21,csm6,csm2gr11	NA,NA	NA|122aa|up_9|NC_019741.1_25739_26105_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|81aa|up_8|NC_019741.1_26444_26687_+	pfam01381, HTH_3, Helix-turn-helix	NA|384aa|up_7|NC_019741.1_26773_27925_+	cd09912, DLP_2, Dynamin-like protein including dynamins, mitofusins, and guanylate-binding proteins	NA|532aa|up_6|NC_019741.1_28060_29656_+	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|89aa|up_5|NC_019741.1_29664_29931_+	TIGR02606, Antitoxin_ParD, putative addiction module antidote protein, CC2985 family	NA|102aa|up_4|NC_019741.1_29927_30233_+	COG3668, ParE, Plasmid stabilization system protein [General function prediction only]	NA|106aa|up_3|NC_019741.1_30242_30560_+	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|444aa|up_2|NC_019741.1_30546_31878_+	cd17512, RMtype1_S_BceB55ORF5615P-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Bacillus cereus HuB5-5 S subunit (S	NA|296aa|up_1|NC_019741.1_31877_32765_+	pfam14355, Abi_C, Abortive infection C-terminus	NA|1055aa|up_0|NC_019741.1_32761_35926_+	COG0610, COG0610, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
