assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000317575.1_ASM31757v1	NC_019765	Stanieria cyanosphaera PCC 7437 plasmid pSTA7437.01, complete sequence	1	31939-32557	1,1,1	CRT,PILER-CR,CRISPRCasFinder	no	c2c9_V-U4,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL	c2c9_V-U4,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,cas14j,Cas14c_CAS-V-F	Type I-D	GTTTCAATTCCTCATAGGAATTATTAATAGTTTAAAC,GTTTCAATTCCTCATAGGAATTATTAATAGTTTAAAC,GTTTCAATTCCTCATAGGAATTATTAATAGTTTAAAC	37,37,37	0	0	NA	NA	NA:NA:NA	8,6,7	8	TypeI-D	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4,csc1gr5,csc2gr7,cas10d	NA|291aa|up_8|NC_019765.1_20009_20882_+,NA|178aa|up_6|NC_019765.1_21742_22276_-,NA|79aa|up_2|NC_019765.1_25897_26134_-,NA|1217aa|up_1|NC_019765.1_26326_29977_+,NA	NA|116aa|up_9|NC_019765.1_19185_19533_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|291aa|up_8|NC_019765.1_20009_20882_+	NA	NA|245aa|up_7|NC_019765.1_20854_21589_+	pfam14460, Prok-E2_D, Prokaryotic E2 family D	NA|178aa|up_6|NC_019765.1_21742_22276_-	NA	NA|336aa|up_5|NC_019765.1_22393_23401_-	pfam14239, RRXRR, RRXRR protein	NA|266aa|up_4|NC_019765.1_23648_24446_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|289aa|up_3|NC_019765.1_25053_25920_+	TIGR03736, PRTRC_ThiF, PRTRC system ThiF family protein	NA|79aa|up_2|NC_019765.1_25897_26134_-	NA	NA|1217aa|up_1|NC_019765.1_26326_29977_+	NA	NA|611aa|up_0|NC_019765.1_30093_31926_+	PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional	cas2|92aa|down_0|NC_019765.1_35426_35702_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|336aa|down_1|NC_019765.1_35705_36713_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|197aa|down_2|NC_019765.1_36754_37345_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas6|275aa|down_3|NC_019765.1_37350_38175_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csc1gr5|235aa|down_4|NC_019765.1_38191_38896_-	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	csc2gr7|334aa|down_5|NC_019765.1_38973_39975_-	pfam18320, Csc2, Csc2 Crispr	cas10d|1159aa|down_6|NC_019765.1_40057_43534_-	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	NA|60aa|down_7|NC_019765.1_43580_43760_-	pfam04255, DUF433, Protein of unknown function (DUF433)	cas3|727aa|down_8|NC_019765.1_44053_46234_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	WYL|307aa|down_9|NC_019765.1_46725_47646_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]
GCF_000317575.1_ASM31757v1	NC_019765	Stanieria cyanosphaera PCC 7437 plasmid pSTA7437.01, complete sequence	2	32592-35193	2,2,2,3,4	CRT,PILER-CR,CRISPRCasFinder,PILER-CR,PILER-CR	no	c2c9_V-U4,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL	c2c9_V-U4,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,cas14j,Cas14c_CAS-V-F	Type I-D	GTTTCAATTCCTCATAGGAATTATTAATAGTTTAAAC,GTTTCAATTCCTCATAGGAATTATTAATAGTTTAAAC,GTTTCAATTCCTCATAGGAATTATTAATAGTTTAAAC,GTTTCAATTCCTCATAGGAATTATTAATAGTTTAAAC,GTTTCAATTCCTCATAGGAATTATTAATAGTTTAAAC	37,37,37,37,37	0	0	NA	NA	NA:NA:NA:NA:NA	35,30,34,30,30	35	TypeI-D	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4,csc1gr5,csc2gr7,cas10d	NA|291aa|up_8|NC_019765.1_20009_20882_+,NA|178aa|up_6|NC_019765.1_21742_22276_-,NA|79aa|up_2|NC_019765.1_25897_26134_-,NA|1217aa|up_1|NC_019765.1_26326_29977_+,NA	NA|116aa|up_9|NC_019765.1_19185_19533_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|291aa|up_8|NC_019765.1_20009_20882_+	NA	NA|245aa|up_7|NC_019765.1_20854_21589_+	pfam14460, Prok-E2_D, Prokaryotic E2 family D	NA|178aa|up_6|NC_019765.1_21742_22276_-	NA	NA|336aa|up_5|NC_019765.1_22393_23401_-	pfam14239, RRXRR, RRXRR protein	NA|266aa|up_4|NC_019765.1_23648_24446_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|289aa|up_3|NC_019765.1_25053_25920_+	TIGR03736, PRTRC_ThiF, PRTRC system ThiF family protein	NA|79aa|up_2|NC_019765.1_25897_26134_-	NA	NA|1217aa|up_1|NC_019765.1_26326_29977_+	NA	NA|611aa|up_0|NC_019765.1_30093_31926_+	PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional	cas2|92aa|down_0|NC_019765.1_35426_35702_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|336aa|down_1|NC_019765.1_35705_36713_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|197aa|down_2|NC_019765.1_36754_37345_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas6|275aa|down_3|NC_019765.1_37350_38175_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csc1gr5|235aa|down_4|NC_019765.1_38191_38896_-	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	csc2gr7|334aa|down_5|NC_019765.1_38973_39975_-	pfam18320, Csc2, Csc2 Crispr	cas10d|1159aa|down_6|NC_019765.1_40057_43534_-	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	NA|60aa|down_7|NC_019765.1_43580_43760_-	pfam04255, DUF433, Protein of unknown function (DUF433)	cas3|727aa|down_8|NC_019765.1_44053_46234_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	WYL|307aa|down_9|NC_019765.1_46725_47646_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]
GCF_000317575.1_ASM31757v1	NC_019748	Stanieria cyanosphaera PCC 7437, complete sequence	1	534533-535100	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4	Type III-B	CTCTATCTTTTGTAGAGAAACTAATTGAATGGAAAC,CTCTATCTTTTGTAGAGAAACTAATTGAATGGAAAC,CTCTATCTTTTGTAGAGAAACTAATTGAATGGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	7,7,7	7	TypeIII-B	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4,csc1gr5,csc2gr7,cas10d	NA|154aa|up_9|NC_019748.1_521228_521690_+,csx1|496aa|up_5|NC_019748.1_524636_526124_-,cmr5gr11|144aa|up_3|NC_019748.1_528171_528603_-,NA|149aa|down_7|NC_019748.1_544161_544608_-,NA|48aa|down_9|NC_019748.1_544781_544925_+	NA|154aa|up_9|NC_019748.1_521228_521690_+	NA	WYL|418aa|up_8|NC_019748.1_521744_522998_-	pfam13280, WYL, WYL domain	csx18|94aa|up_7|NC_019748.1_523217_523499_+	PRK08238, PRK08238, UbiA family prenyltransferase	cas1|331aa|up_6|NC_019748.1_523606_524599_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	csx1|496aa|up_5|NC_019748.1_524636_526124_-	NA	cmr6gr7|655aa|up_4|NC_019748.1_526132_528097_-	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	cmr5gr11|144aa|up_3|NC_019748.1_528171_528603_-	NA	cmr4gr7|266aa|up_2|NC_019748.1_528651_529449_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr3gr5|391aa|up_1|NC_019748.1_529459_530632_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cas2|94aa|up_0|NC_019748.1_534061_534343_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|452aa|down_0|NC_019748.1_535145_536501_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|322aa|down_1|NC_019748.1_538024_538990_+	cd00884, beta_CA_cladeB, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|188aa|down_2|NC_019748.1_538997_539561_-	pfam14238, DUF4340, Domain of unknown function (DUF4340)	NA|47aa|down_3|NC_019748.1_539514_539655_+	pfam09907, HigB_toxin, HigB_toxin, RelE-like toxic component of a toxin-antitoxin system	NA|512aa|down_4|NC_019748.1_540387_541923_-	COG3225, GldG, ABC-type uncharacterized transport system involved in gliding motility, auxiliary component [Cell motility and secretion]	NA|264aa|down_5|NC_019748.1_542166_542958_-	TIGR03518, ABC_transporter_permease_protein, gliding motility-associated ABC transporter permease protein GldF	NA|328aa|down_6|NC_019748.1_543084_544068_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|149aa|down_7|NC_019748.1_544161_544608_-	NA	NA|30aa|down_8|NC_019748.1_544688_544778_+	pfam03742, PetN, PetN	NA|48aa|down_9|NC_019748.1_544781_544925_+	NA
GCF_000317575.1_ASM31757v1	NC_019748	Stanieria cyanosphaera PCC 7437, complete sequence	2	536594-537916	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no	WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4	Type III-B	CTCTATCTTTTGTAGAGAAACTAATTGAATGGAAAC,CTCTATCTTTTGTAGAGAAACTAATTGAATGGAAAC,CTCTATCTTTTGTAGAGAAACTAATTGAATGGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	17,17,17	17	TypeIII-B	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4,csc1gr5,csc2gr7,cas10d	csx1|496aa|up_6|NC_019748.1_524636_526124_-,cmr5gr11|144aa|up_4|NC_019748.1_528171_528603_-,NA|149aa|down_6|NC_019748.1_544161_544608_-,NA|48aa|down_8|NC_019748.1_544781_544925_+	WYL|418aa|up_9|NC_019748.1_521744_522998_-	pfam13280, WYL, WYL domain	csx18|94aa|up_8|NC_019748.1_523217_523499_+	PRK08238, PRK08238, UbiA family prenyltransferase	cas1|331aa|up_7|NC_019748.1_523606_524599_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	csx1|496aa|up_6|NC_019748.1_524636_526124_-	NA	cmr6gr7|655aa|up_5|NC_019748.1_526132_528097_-	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	cmr5gr11|144aa|up_4|NC_019748.1_528171_528603_-	NA	cmr4gr7|266aa|up_3|NC_019748.1_528651_529449_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr3gr5|391aa|up_2|NC_019748.1_529459_530632_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cas2|94aa|up_1|NC_019748.1_534061_534343_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|452aa|up_0|NC_019748.1_535145_536501_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|322aa|down_0|NC_019748.1_538024_538990_+	cd00884, beta_CA_cladeB, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|188aa|down_1|NC_019748.1_538997_539561_-	pfam14238, DUF4340, Domain of unknown function (DUF4340)	NA|47aa|down_2|NC_019748.1_539514_539655_+	pfam09907, HigB_toxin, HigB_toxin, RelE-like toxic component of a toxin-antitoxin system	NA|512aa|down_3|NC_019748.1_540387_541923_-	COG3225, GldG, ABC-type uncharacterized transport system involved in gliding motility, auxiliary component [Cell motility and secretion]	NA|264aa|down_4|NC_019748.1_542166_542958_-	TIGR03518, ABC_transporter_permease_protein, gliding motility-associated ABC transporter permease protein GldF	NA|328aa|down_5|NC_019748.1_543084_544068_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|149aa|down_6|NC_019748.1_544161_544608_-	NA	NA|30aa|down_7|NC_019748.1_544688_544778_+	pfam03742, PetN, PetN	NA|48aa|down_8|NC_019748.1_544781_544925_+	NA	NA|77aa|down_9|NC_019748.1_545160_545391_-	pfam11332, DUF3134, Protein of unknown function (DUF3134)
GCF_000317575.1_ASM31757v1	NC_019748	Stanieria cyanosphaera PCC 7437, complete sequence	3	1074051-1074159	3	CRISPRCasFinder	no	csa3	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4	Type I-A	GATTGAACTTGTTCTGCTATTTCTACCA	28	0	0	NA	NA	NA	1	1	Orphan	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4,csc1gr5,csc2gr7,cas10d	NA|103aa|up_2|NC_019748.1_1069585_1069894_+,NA|130aa|down_7|NC_019748.1_1081305_1081695_-,NA|153aa|down_8|NC_019748.1_1081966_1082425_+	NA|432aa|up_9|NC_019748.1_1063070_1064366_+	cd14750, PBP2_TMBP, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose; possesses type 2 periplasmic binding fold	NA|185aa|up_8|NC_019748.1_1064428_1064983_+	cd19433, lipocalin_CpcS-CpeS, CpcS/CpeS phycobiliprotein lyase family	NA|292aa|up_7|NC_019748.1_1065575_1066451_+	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|254aa|up_6|NC_019748.1_1066952_1067714_+	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|91aa|up_5|NC_019748.1_1067810_1068083_+	smart01094, CpcD, CpcD/allophycocyanin linker domain	NA|179aa|up_4|NC_019748.1_1068282_1068819_+	pfam09367, CpeS, CpeS-like protein	NA|209aa|up_3|NC_019748.1_1068903_1069530_+	pfam06206, CpeT, CpeT/CpcT family (DUF1001)	NA|103aa|up_2|NC_019748.1_1069585_1069894_+	NA	NA|278aa|up_1|NC_019748.1_1069989_1070823_-	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	NA|503aa|up_0|NC_019748.1_1070880_1072389_-	TIGR01488, Phosphoserine_phosphatase, Haloacid Dehalogenase superfamily, subfamily IB, phosphoserine phosphatase-like	NA|191aa|down_0|NC_019748.1_1074213_1074786_-	pfam06041, DUF924, Bacterial protein of unknown function (DUF924)	NA|199aa|down_1|NC_019748.1_1074847_1075444_+	cd05540, UreG, urease accessory protein UreG	NA|443aa|down_2|NC_019748.1_1075653_1076982_+	pfam13433, Peripla_BP_5, Periplasmic binding protein domain	NA|389aa|down_3|NC_019748.1_1077172_1078339_+	TIGR03409, urea_trans_UrtB, urea ABC transporter, permease protein UrtB	NA|390aa|down_4|NC_019748.1_1078406_1079576_+	TIGR03408, urea_trans_UrtC, urea ABC transporter, permease protein UrtC	NA|251aa|down_5|NC_019748.1_1079687_1080440_+	TIGR03411, urea_trans_UrtD, urea ABC transporter, ATP-binding protein UrtD	NA|234aa|down_6|NC_019748.1_1080558_1081260_+	TIGR03410, urea_trans_UrtE, urea ABC transporter, ATP-binding protein UrtE	NA|130aa|down_7|NC_019748.1_1081305_1081695_-	NA	NA|153aa|down_8|NC_019748.1_1081966_1082425_+	NA	NA|85aa|down_9|NC_019748.1_1082492_1082747_+	COG4095, COG4095, Uncharacterized conserved protein [Function unknown]
GCF_000317575.1_ASM31757v1	NC_019748	Stanieria cyanosphaera PCC 7437, complete sequence	4	2796866-2796970	4	CRISPRCasFinder	no	Cas14c_CAS-V-F	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4	Unclear	CCGACTCCTCCAATTAAAGTATCA	24	0	0	NA	NA	NA	1	1	TypeV	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4,csc1gr5,csc2gr7,cas10d	NA|431aa|up_4|NC_019748.1_2789099_2790392_+,NA|427aa|up_3|NC_019748.1_2790433_2791714_+,NA|157aa|up_2|NC_019748.1_2792171_2792642_+,NA|120aa|down_0|NC_019748.1_2801204_2801564_+,NA|195aa|down_9|NC_019748.1_2812214_2812799_-	NA|75aa|up_9|NC_019748.1_2783533_2783758_-	cd06187, O2ase_reductase_like, The oxygenase reductase FAD/NADH binding domain acts as part of the multi-component bacterial oxygenases which oxidize hydrocarbons using oxygen as the oxidant	NA|166aa|up_8|NC_019748.1_2783953_2784451_-	pfam13239, 2TM, 2TM domain	NA|374aa|up_7|NC_019748.1_2784632_2785754_+	cd06164, S2P-M50_SpoIVFB_CBS, SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation	NA|510aa|up_6|NC_019748.1_2785972_2787502_-	cd11642, SUMT, Uroporphyrin-III C-methyltransferase (also known as S-Adenosyl-L-methionine:uroporphyrinogen III methyltransferase, SUMT)	NA|449aa|up_5|NC_019748.1_2787489_2788836_-	TIGR01143, murF, UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase	NA|431aa|up_4|NC_019748.1_2789099_2790392_+	NA	NA|427aa|up_3|NC_019748.1_2790433_2791714_+	NA	NA|157aa|up_2|NC_019748.1_2792171_2792642_+	NA	NA|356aa|up_1|NC_019748.1_2792651_2793719_-	pfam02254, TrkA_N, TrkA-N domain	NA|395aa|up_0|NC_019748.1_2794005_2795190_+	PRK02627, PRK02627, acetylornithine aminotransferase; Provisional	NA|120aa|down_0|NC_019748.1_2801204_2801564_+	NA	NA|376aa|down_1|NC_019748.1_2801794_2802922_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|241aa|down_2|NC_019748.1_2803385_2804108_+	cd00737, lyz_endolysin_autolysin, endolysin and autolysin	NA|476aa|down_3|NC_019748.1_2804104_2805532_-	COG4372, COG4372, Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]	NA|226aa|down_4|NC_019748.1_2805691_2806369_-	TIGR03697, NtcA_cyano, global nitrogen regulator NtcA, cyanobacterial	NA|260aa|down_5|NC_019748.1_2806635_2807415_+	PRK07370, PRK07370, enoyl-[acyl-carrier-protein] reductase FabI	NA|217aa|down_6|NC_019748.1_2807896_2808547_-	PRK11081, PRK11081, tRNA guanosine-2'-O-methyltransferase; Provisional	NA|308aa|down_7|NC_019748.1_2808543_2809467_-	PRK14299, PRK14299, chaperone protein DnaJ; Provisional	NA|753aa|down_8|NC_019748.1_2809645_2811904_-	PRK13410, PRK13410, molecular chaperone DnaK; Provisional	NA|195aa|down_9|NC_019748.1_2812214_2812799_-	NA
GCF_000317575.1_ASM31757v1	NC_019748	Stanieria cyanosphaera PCC 7437, complete sequence	5	2872261-2872502	3	PILER-CR	no		RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4	Orphan	ATTAACCCAAGCATTAACATTATTTGGTTGTAATTCAATCACACGCTCAAAG	52	0	0	NA	NA	NA	2	2	Orphan	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4,csc1gr5,csc2gr7,cas10d	NA|186aa|up_4|NC_019748.1_2868181_2868739_-,NA|51aa|down_3|NC_019748.1_2876245_2876398_-,NA|100aa|down_6|NC_019748.1_2879259_2879559_-	NA|200aa|up_9|NC_019748.1_2861008_2861608_-	COG2018, COG2018, Uncharacterized distant relative of homeotic protein bithoraxoid [General function prediction only]	NA|181aa|up_8|NC_019748.1_2861717_2862260_-	COG2229, COG2229, Predicted GTPase [General function prediction only]	NA|295aa|up_7|NC_019748.1_2862259_2863144_-	pfam14332, DUF4388, Domain of unknown function (DUF4388)	NA|367aa|up_6|NC_019748.1_2863584_2864685_-	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|994aa|up_5|NC_019748.1_2864847_2867829_+	cd07124, ALDH_PutA-P5CDH-RocA, Delta(1)-pyrroline-5-carboxylate dehydrogenase, RocA	NA|186aa|up_4|NC_019748.1_2868181_2868739_-	NA	NA|454aa|up_3|NC_019748.1_2868861_2870223_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|75aa|up_2|NC_019748.1_2870382_2870607_-	sd00006, TPR, Tetratricopeptide repeat	NA|170aa|up_1|NC_019748.1_2870603_2871113_-	pfam14317, YcxB, YcxB-like protein	NA|188aa|up_0|NC_019748.1_2871409_2871973_-	sd00006, TPR, Tetratricopeptide repeat	NA|184aa|down_0|NC_019748.1_2874386_2874938_-	pfam14317, YcxB, YcxB-like protein	NA|189aa|down_1|NC_019748.1_2875016_2875583_-	pfam14317, YcxB, YcxB-like protein	NA|171aa|down_2|NC_019748.1_2875661_2876174_-	pfam14317, YcxB, YcxB-like protein	NA|51aa|down_3|NC_019748.1_2876245_2876398_-	NA	NA|58aa|down_4|NC_019748.1_2876378_2876552_-	pfam13414, TPR_11, TPR repeat	NA|843aa|down_5|NC_019748.1_2876695_2879224_-	NF033189, internalin_A, class 1 internalin InlA	NA|100aa|down_6|NC_019748.1_2879259_2879559_-	NA	NA|32aa|down_7|NC_019748.1_2879748_2879844_-	cd19588, serpin_miropin-like, serpin miropin and similar proteins	NA|98aa|down_8|NC_019748.1_2879803_2880097_-	cd19588, serpin_miropin-like, serpin miropin and similar proteins	NA|233aa|down_9|NC_019748.1_2880297_2880996_-	sd00006, TPR, Tetratricopeptide repeat
GCF_000317575.1_ASM31757v1	NC_019748	Stanieria cyanosphaera PCC 7437, complete sequence	6	3071472-3072157	3,4,5	CRT,PILER-CR,CRISPRCasFinder	no	cas2,cas1,cas5,cas7,cas8b3,cas3,cas6,WYL	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4	Unclear	GTGATCAACGCCTAACGGCATCTAAGGTTAGTACAC,GTGATCAACGCCTAACGGCATCTAAGGTTAGTACAC,GTGATCAACGCCTAACGGCATCTAAGGTTAGTACAC	36,36,36	0	0	NA	NA	I-A,I-B,II-B:I-A,I-B,II-B:I-A,I-B,II-B	9,8,8	9	Unclear	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4,csc1gr5,csc2gr7,cas10d	NA|122aa|up_5|NC_019748.1_3064192_3064558_+,NA|145aa|up_1|NC_019748.1_3070441_3070876_+,NA|76aa|up_0|NC_019748.1_3071103_3071331_-,NA	NA|710aa|up_9|NC_019748.1_3055571_3057701_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|178aa|up_8|NC_019748.1_3057697_3058231_-	pfam14218, COP23, Circadian oscillating protein COP23	NA|1349aa|up_7|NC_019748.1_3058476_3062523_+	pfam05860, Haemagg_act, haemagglutination activity domain	NA|459aa|up_6|NC_019748.1_3062506_3063883_-	sd00006, TPR, Tetratricopeptide repeat	NA|122aa|up_5|NC_019748.1_3064192_3064558_+	NA	NA|244aa|up_4|NC_019748.1_3064756_3065488_+	PRK00024, PRK00024, DNA repair protein RadC	NA|602aa|up_3|NC_019748.1_3065498_3067304_-	cd01948, EAL, EAL domain	NA|840aa|up_2|NC_019748.1_3067340_3069860_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|145aa|up_1|NC_019748.1_3070441_3070876_+	NA	NA|76aa|up_0|NC_019748.1_3071103_3071331_-	NA	cas2|98aa|down_0|NC_019748.1_3072410_3072704_-	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	cas1|553aa|down_1|NC_019748.1_3072712_3074371_-	TIGR03983, hypothetical_protein_LA3181, CRISPR-associated endonuclease Cas1, subtype MYXAN	cas5|213aa|down_2|NC_019748.1_3075658_3076297_-	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|287aa|down_3|NC_019748.1_3076296_3077157_-	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8b3|492aa|down_4|NC_019748.1_3077189_3078665_-	TIGR03485, hypothetical_protein_L8106_30105, CRISPR-associated protein Cas8a1/Csx13, MYXAN subtype	cas3|761aa|down_5|NC_019748.1_3078670_3080953_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas6|221aa|down_6|NC_019748.1_3080952_3081615_-	pfam09559, Cas6, Cas6 Crispr	NA|105aa|down_7|NC_019748.1_3081618_3081933_-	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|102aa|down_8|NC_019748.1_3081955_3082261_-	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	WYL|325aa|down_9|NC_019748.1_3082278_3083253_-	pfam13280, WYL, WYL domain
GCF_000317575.1_ASM31757v1	NC_019748	Stanieria cyanosphaera PCC 7437, complete sequence	7	3434760-3434873	6	CRISPRCasFinder	no		RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4	Orphan	TAAACCATTTAGTCAAAGTTTGATTAATCTGCTCTA	36	0	0	NA	NA	NA	1	1	Orphan	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4,csc1gr5,csc2gr7,cas10d	NA|60aa|up_7|NC_019748.1_3425911_3426091_-,NA|71aa|up_1|NC_019748.1_3432217_3432430_-,NA	NA|265aa|up_9|NC_019748.1_3423655_3424450_-	COG1589, FtsQ, Cell division septal protein [Cell envelope biogenesis, outer membrane]	NA|431aa|up_8|NC_019748.1_3424618_3425911_+	TIGR00225, Tail-specific_protease, C-terminal peptidase (prc)	NA|60aa|up_7|NC_019748.1_3425911_3426091_-	NA	NA|39aa|up_6|NC_019748.1_3426179_3426296_-	PRK02655, psbI, photosystem II reaction center protein I	NA|453aa|up_5|NC_019748.1_3426507_3427866_-	COG3429, COG3429, Glucose-6-P dehydrogenase subunit [Carbohydrate transport and metabolism]	NA|510aa|up_4|NC_019748.1_3427969_3429499_-	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|383aa|up_3|NC_019748.1_3429573_3430722_-	PRK03343, PRK03343, transaldolase; Validated	NA|357aa|up_2|NC_019748.1_3430774_3431845_-	PRK09293, PRK09293, class 1 fructose-bisphosphatase	NA|71aa|up_1|NC_019748.1_3432217_3432430_-	NA	NA|374aa|up_0|NC_019748.1_3432481_3433603_-	TIGR03169, selenide_water_dikinase_putative, pyridine nucleotide-disulfide oxidoreductase family protein	NA|411aa|down_0|NC_019748.1_3435065_3436298_-	pfam06838, Met_gamma_lyase, Methionine gamma-lyase	NA|278aa|down_1|NC_019748.1_3436543_3437377_+	COG1398, OLE1, Fatty-acid desaturase [Lipid metabolism]	NA|245aa|down_2|NC_019748.1_3437928_3438663_+	cd07992, LPLAT_AAK14816-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: Unknown AAK14816-like	NA|653aa|down_3|NC_019748.1_3438775_3440734_+	PRK14559, PRK14559, serine/threonine phosphatase	NA|383aa|down_4|NC_019748.1_3440730_3441879_-	cd06451, AGAT_like, Alanine-glyoxylate aminotransferase (AGAT) family	NA|228aa|down_5|NC_019748.1_3442372_3443056_+	COG0811, TolQ, Biopolymer transport proteins [Intracellular trafficking and secretion]	NA|199aa|down_6|NC_019748.1_3443079_3443676_+	COG0848, ExbD, Biopolymer transport protein [Intracellular trafficking and secretion]	NA|194aa|down_7|NC_019748.1_3443859_3444441_-	cd12130, Apl, Allophycocyanin-like globins	NA|219aa|down_8|NC_019748.1_3444562_3445219_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|181aa|down_9|NC_019748.1_3445365_3445908_+	cd19433, lipocalin_CpcS-CpeS, CpcS/CpeS phycobiliprotein lyase family
GCF_000317575.1_ASM31757v1	NC_019748	Stanieria cyanosphaera PCC 7437, complete sequence	8	3529794-3529902	7	CRISPRCasFinder	no		RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4	Orphan	CTTTAGCTTGTTTACTAGAAGATGCACC	28	0	0	NA	NA	NA	1	1	Orphan	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4,csc1gr5,csc2gr7,cas10d	NA|103aa|up_9|NC_019748.1_3519206_3519515_-,NA	NA|103aa|up_9|NC_019748.1_3519206_3519515_-	NA	NA|152aa|up_8|NC_019748.1_3519735_3520191_-	pfam09424, YqeY, Yqey-like protein	NA|206aa|up_7|NC_019748.1_3520390_3521008_-	PRK05426, PRK05426, peptidyl-tRNA hydrolase; Provisional	NA|73aa|up_6|NC_019748.1_3521018_3521237_-	pfam10929, DUF2811, Protein of unknown function (DUF2811)	NA|511aa|up_5|NC_019748.1_3521709_3523242_-	COG0063, COG0063, Predicted sugar kinase [Carbohydrate transport and metabolism]	NA|354aa|up_4|NC_019748.1_3523365_3524427_-	COG3839, MalK, ABC-type sugar transport systems, ATPase components [Carbohydrate transport and metabolism]	NA|98aa|up_3|NC_019748.1_3524711_3525005_+	PRK13019, clpS, ATP-dependent Clp protease adapter ClpS	NA|96aa|up_2|NC_019748.1_3525009_3525297_+	pfam09876, DUF2103, Predicted metal-binding protein (DUF2103)	NA|448aa|up_1|NC_019748.1_3525451_3526795_-	COG0769, MurE, UDP-N-acetylmuramyl tripeptide synthase [Cell envelope biogenesis, outer membrane]	NA|740aa|up_0|NC_019748.1_3526873_3529093_-	PRK10669, PRK10669, putative cation:proton antiport protein; Provisional	NA|296aa|down_0|NC_019748.1_3530750_3531638_+	COG2027, DacB, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) [Cell envelope biogenesis, outer membrane]	NA|116aa|down_1|NC_019748.1_3531764_3532112_+	pfam05534, HicB, HicB family	NA|504aa|down_2|NC_019748.1_3532179_3533691_+	CHL00195, ycf46, Ycf46; Provisional	NA|117aa|down_3|NC_019748.1_3533834_3534185_+	pfam06868, DUF1257, Protein of unknown function (DUF1257)	NA|227aa|down_4|NC_019748.1_3534248_3534929_-	COG1926, COG1926, Predicted phosphoribosyltransferases [General function prediction only]	NA|370aa|down_5|NC_019748.1_3535058_3536169_+	PRK00578, prfB, peptide chain release factor 2; Validated	NA|392aa|down_6|NC_019748.1_3536165_3537341_-	COG0438, RfaG, Glycosyltransferase [Cell envelope biogenesis, outer membrane]	NA|681aa|down_7|NC_019748.1_3537349_3539392_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|402aa|down_8|NC_019748.1_3539418_3540624_-	cd03794, GT4_WbuB-like, Escherichia coli WbuB and similar proteins	NA|258aa|down_9|NC_019748.1_3540940_3541714_+	TIGR03413, GSH_gloB, hydroxyacylglutathione hydrolase
GCF_000317575.1_ASM31757v1	NC_019748	Stanieria cyanosphaera PCC 7437, complete sequence	9	4485921-4486141	8	CRISPRCasFinder	no		RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4	Orphan	CCAGTAAATTGCGTCTCATTATTAAAATTGCCGTTAATTCCAATTAAACCAAG	53	0	0	NA	NA	NA	1	1	Orphan	RT,Cas14u_CAS-V,cas14k,WYL,csx18,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas2,c2c9_V-U4,cas14j,csa3,Cas14c_CAS-V-F,DEDDh,cas5,cas7,cas8b3,cas3,cas6,DinG,cas4,csc1gr5,csc2gr7,cas10d	NA|287aa|up_9|NC_019748.1_4474446_4475307_-,NA|137aa|up_6|NC_019748.1_4478715_4479126_+,NA|109aa|down_9|NC_019748.1_4496667_4496994_-	NA|287aa|up_9|NC_019748.1_4474446_4475307_-	NA	NA|373aa|up_8|NC_019748.1_4475812_4476931_+	PRK07406, PRK07406, RNA polymerase sigma factor RpoD; Validated	NA|370aa|up_7|NC_019748.1_4477200_4478310_-	TIGR00367, Uncharacterized_membrane_protein_MJ0091, K+-dependent Na+/Ca+ exchanger related-protein	NA|137aa|up_6|NC_019748.1_4478715_4479126_+	NA	NA|498aa|up_5|NC_019748.1_4479196_4480690_+	PRK00784, PRK00784, cobyric acid synthase	NA|80aa|up_4|NC_019748.1_4480697_4480937_+	cd00207, fer2, 2Fe-2S iron-sulfur cluster binding domain	NA|539aa|up_3|NC_019748.1_4481259_4482876_+	COG0443, DnaK, Molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|266aa|up_2|NC_019748.1_4483136_4483934_+	pfam03808, Glyco_tran_WecB, Glycosyl transferase WecB/TagA/CpsF family	NA|237aa|up_1|NC_019748.1_4483919_4484630_-	COG0830, UreF, Urease accessory protein UreF [Posttranslational modification, protein turnover, chaperones]	NA|149aa|up_0|NC_019748.1_4484604_4485051_-	PRK13261, ureE, urease accessory protein UreE; Provisional	NA|292aa|down_0|NC_019748.1_4487000_4487876_-	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|170aa|down_1|NC_019748.1_4488233_4488743_+	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|320aa|down_2|NC_019748.1_4488963_4489923_-	COG2006, COG2006, Uncharacterized conserved protein [Function unknown]	NA|418aa|down_3|NC_019748.1_4490177_4491431_-	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|256aa|down_4|NC_019748.1_4491507_4492275_-	COG0546, Gph, Predicted phosphatases [General function prediction only]	NA|333aa|down_5|NC_019748.1_4492368_4493367_-	PRK07400, PRK07400, 30S ribosomal protein S1; Reviewed	NA|179aa|down_6|NC_019748.1_4493793_4494330_-	PRK00464, nrdR, transcriptional repressor NrdR	NA|32aa|down_7|NC_019748.1_4494520_4494616_-	PRK11875, psbT, photosystem II reaction center protein T; Reviewed	NA|511aa|down_8|NC_019748.1_4494844_4496377_-	CHL00062, psbB, photosystem II 47 kDa protein	NA|109aa|down_9|NC_019748.1_4496667_4496994_-	NA
