assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000317555.1_ASM31755v1	NC_019745	Gloeocapsa sp. PCC 7428, complete genome	1	560388-560470	1	CRISPRCasFinder	no		csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	Orphan	AGTTATGCCGATTTGAGTCAAGC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	NA|61aa|up_5|NC_019745.1_553738_553921_-,NA	NA|1743aa|up_9|NC_019745.1_545169_550398_+	pfam04357, TamB, TamB, inner membrane protein subunit of TAM complex	NA|299aa|up_8|NC_019745.1_550489_551386_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|294aa|up_7|NC_019745.1_551517_552399_-	pfam03881, Fructosamin_kin, Fructosamine kinase	NA|425aa|up_6|NC_019745.1_552395_553670_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|61aa|up_5|NC_019745.1_553738_553921_-	NA	NA|252aa|up_4|NC_019745.1_554483_555239_-	TIGR03410, urea_trans_UrtE, urea ABC transporter, ATP-binding protein UrtE	NA|266aa|up_3|NC_019745.1_555267_556065_-	TIGR03411, urea_trans_UrtD, urea ABC transporter, ATP-binding protein UrtD	NA|381aa|up_2|NC_019745.1_556180_557323_-	TIGR03408, urea_trans_UrtC, urea ABC transporter, permease protein UrtC	NA|385aa|up_1|NC_019745.1_557434_558589_-	TIGR03409, urea_trans_UrtB, urea ABC transporter, permease protein UrtB	NA|440aa|up_0|NC_019745.1_558687_560007_-	pfam13433, Peripla_BP_5, Periplasmic binding protein domain	NA|225aa|down_0|NC_019745.1_560644_561319_-	cd00431, cysteine_hydrolases, Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle	NA|116aa|down_1|NC_019745.1_561309_561657_-	pfam12973, Cupin_7, ChrR Cupin-like domain	NA|523aa|down_2|NC_019745.1_561850_563419_-	COG3845, COG3845, ABC-type uncharacterized transport systems, ATPase components [General function prediction only]	NA|238aa|down_3|NC_019745.1_563434_564148_-	cd00431, cysteine_hydrolases, Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle	NA|345aa|down_4|NC_019745.1_564181_565216_-	PRK13287, amiF, formamidase; Provisional	NA|134aa|down_5|NC_019745.1_565288_565690_-	PRK08186, PRK08186, allophanate hydrolase; Provisional	NA|307aa|down_6|NC_019745.1_565726_566647_-	COG1079, COG1079, Uncharacterized ABC-type transport system, permease component [General function prediction only]	NA|359aa|down_7|NC_019745.1_566665_567742_-	COG4603, COG4603, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|398aa|down_8|NC_019745.1_567825_569019_-	cd19963, PBP1_BMP-like, periplasmic binding component of a basic membrane lipoprotein (BMP) from Brucella abortus and its close homologs in other bacteria	NA|1208aa|down_9|NC_019745.1_569407_573031_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment
GCF_000317555.1_ASM31755v1	NC_019745	Gloeocapsa sp. PCC 7428, complete genome	2	1342475-1342599	2	CRISPRCasFinder	no		csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	Orphan	TACAGAAACAAAATCTGTCTTCACCGGTTTCTTTAGCAC	39	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	NA|69aa|up_7|NC_019745.1_1336139_1336346_-,NA|114aa|up_5|NC_019745.1_1338057_1338399_-,NA|96aa|up_4|NC_019745.1_1338507_1338795_+,NA|70aa|down_1|NC_019745.1_1343726_1343936_+,NA|206aa|down_4|NC_019745.1_1345416_1346034_+,NA|104aa|down_6|NC_019745.1_1347128_1347440_-	NA|282aa|up_9|NC_019745.1_1333649_1334495_-	TIGR02139, permease_CysT, sulfate ABC transporter, permease protein CysT	NA|354aa|up_8|NC_019745.1_1334585_1335647_-	COG1613, Sbp, ABC-type sulfate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|69aa|up_7|NC_019745.1_1336139_1336346_-	NA	NA|464aa|up_6|NC_019745.1_1336542_1337934_-	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|114aa|up_5|NC_019745.1_1338057_1338399_-	NA	NA|96aa|up_4|NC_019745.1_1338507_1338795_+	NA	NA|150aa|up_3|NC_019745.1_1338815_1339265_-	COG2172, RsbW, Anti-sigma regulatory factor (Ser/Thr protein kinase) [Signal transduction mechanisms]	NA|218aa|up_2|NC_019745.1_1339703_1340357_+	pfam05685, Uma2, Putative restriction endonuclease	NA|465aa|up_1|NC_019745.1_1340475_1341870_-	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|162aa|up_0|NC_019745.1_1341958_1342444_+	cd12125, APC_alpha, Allophycocyanin alpha subunit of the phycobilisome core	NA|243aa|down_0|NC_019745.1_1342739_1343468_-	PRK02816, PRK02816, phycocyanobilin:ferredoxin oxidoreductase; Validated	NA|70aa|down_1|NC_019745.1_1343726_1343936_+	NA	NA|197aa|down_2|NC_019745.1_1343932_1344523_-	PRK00148, PRK00148, Maf-like protein; Reviewed	NA|181aa|down_3|NC_019745.1_1344681_1345224_-	pfam01789, PsbP, PsbP	NA|206aa|down_4|NC_019745.1_1345416_1346034_+	NA	NA|292aa|down_5|NC_019745.1_1346156_1347032_-	pfam00565, SNase, Staphylococcal nuclease homolog	NA|104aa|down_6|NC_019745.1_1347128_1347440_-	NA	NA|262aa|down_7|NC_019745.1_1347456_1348242_-	COG1691, COG1691, NCAIR mutase (PurE)-related proteins [General function prediction only]	NA|607aa|down_8|NC_019745.1_1348280_1350101_-	cd08500, PBP2_NikA_DppA_OppA_like_4, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|241aa|down_9|NC_019745.1_1350218_1350941_+	pfam05685, Uma2, Putative restriction endonuclease
GCF_000317555.1_ASM31755v1	NC_019745	Gloeocapsa sp. PCC 7428, complete genome	3	1535975-1536088	3	CRISPRCasFinder	no		csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	Orphan	ATCAAATATTAAAGAAACCCACGCCGGTGGGTTTTGTCT	39	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	NA|101aa|up_4|NC_019745.1_1531616_1531919_+,NA|325aa|up_3|NC_019745.1_1532396_1533371_+,NA|91aa|up_2|NC_019745.1_1533445_1533718_+,NA|76aa|down_5|NC_019745.1_1542247_1542475_+,NA|81aa|down_6|NC_019745.1_1543008_1543251_-,NA|74aa|down_7|NC_019745.1_1543477_1543699_+	NA|297aa|up_9|NC_019745.1_1526277_1527168_+	COG0614, FepB, ABC-type Fe3+-hydroxamate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|280aa|up_8|NC_019745.1_1527338_1528178_-	cd07326, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|140aa|up_7|NC_019745.1_1528177_1528597_-	COG3682, COG3682, Predicted transcriptional regulator [Transcription]	NA|112aa|up_6|NC_019745.1_1528944_1529280_+	PRK13697, PRK13697, cytochrome c6; Provisional	NA|699aa|up_5|NC_019745.1_1529475_1531572_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|101aa|up_4|NC_019745.1_1531616_1531919_+	NA	NA|325aa|up_3|NC_019745.1_1532396_1533371_+	NA	NA|91aa|up_2|NC_019745.1_1533445_1533718_+	NA	NA|419aa|up_1|NC_019745.1_1533714_1534971_-	pfam01384, PHO4, Phosphate transporter family	NA|295aa|up_0|NC_019745.1_1535003_1535888_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|457aa|down_0|NC_019745.1_1536210_1537581_-	TIGR01292, Thioredoxin_reductase, thioredoxin-disulfide reductase	NA|191aa|down_1|NC_019745.1_1537683_1538256_-	pfam05685, Uma2, Putative restriction endonuclease	NA|439aa|down_2|NC_019745.1_1538881_1540198_+	pfam13847, Methyltransf_31, Methyltransferase domain	NA|167aa|down_3|NC_019745.1_1540422_1540923_+	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|97aa|down_4|NC_019745.1_1541275_1541566_+	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|76aa|down_5|NC_019745.1_1542247_1542475_+	NA	NA|81aa|down_6|NC_019745.1_1543008_1543251_-	NA	NA|74aa|down_7|NC_019745.1_1543477_1543699_+	NA	NA|178aa|down_8|NC_019745.1_1543698_1544232_+	pfam13470, PIN_3, PIN domain	NA|77aa|down_9|NC_019745.1_1544387_1544618_+	pfam10047, DUF2281, Protein of unknown function (DUF2281)
GCF_000317555.1_ASM31755v1	NC_019745	Gloeocapsa sp. PCC 7428, complete genome	4	1666998-1667074	4	CRISPRCasFinder	no		csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	Orphan	AACTCCGCTTCTAAGCGCGCGAT	23	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	NA|129aa|up_9|NC_019745.1_1651682_1652069_+,NA|65aa|up_8|NC_019745.1_1652139_1652334_-,NA|50aa|down_1|NC_019745.1_1670281_1670431_-,NA|93aa|down_2|NC_019745.1_1670622_1670901_+	NA|129aa|up_9|NC_019745.1_1651682_1652069_+	NA	NA|65aa|up_8|NC_019745.1_1652139_1652334_-	NA	NA|130aa|up_7|NC_019745.1_1652882_1653272_+	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|872aa|up_6|NC_019745.1_1653373_1655989_+	TIGR03960, radical_SAM_domain_protein, radical SAM family uncharacterized protein	NA|675aa|up_5|NC_019745.1_1656627_1658652_+	TIGR00757, Ribonuclease_E/G-like_protein, ribonuclease, Rne/Rng family	NA|218aa|up_4|NC_019745.1_1658700_1659354_+	PRK13925, rnhB, ribonuclease HII; Provisional	NA|190aa|up_3|NC_019745.1_1659334_1659904_-	pfam09366, DUF1997, Protein of unknown function (DUF1997)	NA|1322aa|up_2|NC_019745.1_1660205_1664171_+	PLN03241, PLN03241, magnesium chelatase subunit H; Provisional	NA|230aa|up_1|NC_019745.1_1664322_1665012_-	TIGR02982, heterocyst_DevA, ABC exporter ATP-binding subunit, DevA family	NA|391aa|up_0|NC_019745.1_1665058_1666231_-	TIGR01185, membrane_spanning_subunit, DevC protein	NA|444aa|down_0|NC_019745.1_1668626_1669958_+	cd16409, ParB_N_like, ParB N-terminal-like domain of bacterial and plasmid parABS partitioning systems	NA|50aa|down_1|NC_019745.1_1670281_1670431_-	NA	NA|93aa|down_2|NC_019745.1_1670622_1670901_+	NA	NA|1087aa|down_3|NC_019745.1_1677784_1681045_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|553aa|down_4|NC_019745.1_1681080_1682739_+	cd04742, NPD_FabD, 2-Nitropropane dioxygenase (NPD)-like domain, associated with the (acyl-carrier-protein) S-malonyltransferase  FabD	NA|507aa|down_5|NC_019745.1_1682747_1684268_+	COG3320, COG3320, Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|99aa|down_6|NC_019745.1_1684380_1684677_+	pfam00550, PP-binding, Phosphopantetheine attachment site	NA|610aa|down_7|NC_019745.1_1684711_1686541_+	pfam18563, TubC_N, TubC N-terminal docking domain	NA|544aa|down_8|NC_019745.1_1686613_1688245_+	TIGR01194, ATP-binding_protein_SyrD, cyclic peptide transporter	NA|375aa|down_9|NC_019745.1_1688300_1689425_+	pfam13469, Sulfotransfer_3, Sulfotransferase family
GCF_000317555.1_ASM31755v1	NC_019745	Gloeocapsa sp. PCC 7428, complete genome	5	2248605-2250739	5,1,1	CRISPRCasFinder,CRT,PILER-CR	no	csb2gr5,cas7,cas8u1,cas3	csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	Unclear	GTTTCAATTTAGTCCCTTCAATAAAAGGGATTTCTAC,GTTTCAATTTAGTCCCTTCAATAAAAGGGATTTCTAC,GTTTCAATTTAGTCCCTTCAATAAAAGGGATTTCTAC	37,37,37	0	0	NA	NA	NA:NA:NA	29,29,29	29	Unclear	csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	NA|130aa|up_3|NC_019745.1_2245181_2245571_+,NA|54aa|down_4|NC_019745.1_2257850_2258012_-,NA|76aa|down_8|NC_019745.1_2262415_2262643_+	NA|176aa|up_9|NC_019745.1_2236916_2237444_+	pfam13505, OMP_b-brl, Outer membrane protein beta-barrel domain	NA|533aa|up_8|NC_019745.1_2237798_2239397_-	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|662aa|up_7|NC_019745.1_2240085_2242071_+	TIGR02042, Sulfite_reductase, ferredoxin-sulfite reductase	NA|237aa|up_6|NC_019745.1_2242091_2242802_+	PRK11713, PRK11713, 16S ribosomal RNA methyltransferase RsmE; Provisional	NA|358aa|up_5|NC_019745.1_2242830_2243904_+	sd00006, TPR, Tetratricopeptide repeat	NA|220aa|up_4|NC_019745.1_2243924_2244584_-	PRK10811, rne, ribonuclease E; Reviewed	NA|130aa|up_3|NC_019745.1_2245181_2245571_+	NA	NA|154aa|up_2|NC_019745.1_2246035_2246497_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|149aa|up_1|NC_019745.1_2246516_2246963_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|293aa|up_0|NC_019745.1_2247436_2248315_+	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	csb2gr5|511aa|down_0|NC_019745.1_2250906_2252439_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	cas7|330aa|down_1|NC_019745.1_2252383_2253373_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	cas8u1|680aa|down_2|NC_019745.1_2253365_2255405_-	TIGR04113, hypothetical_protein_AaLAA1DRAFT_1703, CRISPR-associated protein Csx17, subtype Dpsyc	cas3|728aa|down_3|NC_019745.1_2255388_2257572_-	cd09639, Cas3_I, CRISPR/Cas system-associated protein Cas3	NA|54aa|down_4|NC_019745.1_2257850_2258012_-	NA	NA|256aa|down_5|NC_019745.1_2258462_2259230_-	pfam13230, GATase_4, Glutamine amidotransferases class-II	NA|548aa|down_6|NC_019745.1_2259457_2261101_+	TIGR02734, Phytoene_desaturase_lycopene-forming, phytoene desaturase	NA|262aa|down_7|NC_019745.1_2261437_2262223_+	cd06259, YdcF-like, YdcF-like	NA|76aa|down_8|NC_019745.1_2262415_2262643_+	NA	NA|75aa|down_9|NC_019745.1_2262682_2262907_-	pfam02941, FeThRed_A, Ferredoxin thioredoxin reductase variable alpha chain
GCF_000317555.1_ASM31755v1	NC_019745	Gloeocapsa sp. PCC 7428, complete genome	6	3442926-3446127	2,6,2	PILER-CR,CRISPRCasFinder,CRT	no	WYL,cas6,cas3,cas8b3,cas7,cas1,cas2	csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	Unclear	GTGTTTTAACCTTAGATGCCACAAGGCGTTGATCAC,GTGTTTTAACCTTAGATGCCACAAGGCGTTGATCAC,GTGTTTTAACCTTAGATGCCACAAGGCGTTGATCAC	36,36,36	0	0	NA	NA	I-A,I-B,II-B:I-A,I-B,II-B:I-A,I-B,II-B	44,44,44	44	Unclear	csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	NA,NA|141aa|down_1|NC_019745.1_3447102_3447525_-	NA|227aa|up_9|NC_019745.1_3431384_3432065_+	COG1802, GntR, Transcriptional regulators [Transcription]	NA|224aa|up_8|NC_019745.1_3432057_3432729_+	cd02588, HAD_L2-DEX, L-2-haloacid dehalogenase	WYL|329aa|up_7|NC_019745.1_3433231_3434218_+	pfam13280, WYL, WYL domain	cas6|221aa|up_6|NC_019745.1_3434217_3434880_+	pfam09559, Cas6, Cas6 Crispr	cas3|797aa|up_5|NC_019745.1_3434879_3437270_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas8b3|562aa|up_4|NC_019745.1_3437262_3438948_+	TIGR04413, hypothetical_protein_LEP1GSC082_4029, CRISPR type MYXAN-associated protein Cmx8	cas7|297aa|up_3|NC_019745.1_3438975_3439866_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	NA|233aa|up_2|NC_019745.1_3439875_3440574_+	TIGR02593, CRISPR-associated_protein_Cas5, CRISPR-associated protein Cas5, N-terminal domain	cas1|557aa|up_1|NC_019745.1_3440698_3442369_+	TIGR03983, hypothetical_protein_LA3181, CRISPR-associated endonuclease Cas1, subtype MYXAN	cas2|98aa|up_0|NC_019745.1_3442378_3442672_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|273aa|down_0|NC_019745.1_3446274_3447093_-	COG3217, COG3217, Uncharacterized Fe-S protein [General function prediction only]	NA|141aa|down_1|NC_019745.1_3447102_3447525_-	NA	NA|472aa|down_2|NC_019745.1_3447540_3448956_-	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|439aa|down_3|NC_019745.1_3449742_3451059_-	cd02808, GltS_FMN, Glutamate synthase (GltS) FMN-binding domain	NA|234aa|down_4|NC_019745.1_3451328_3452030_-	COG0070, GltB, Glutamate synthase domain 3 [Amino acid transport and metabolism]	NA|312aa|down_5|NC_019745.1_3452121_3453057_-	cd01907, GlxB, Glutamine amidotransferases class-II (Gn-AT)_GlxB-type	NA|257aa|down_6|NC_019745.1_3453302_3454073_-	COG2188, PhnF, Transcriptional regulators [Transcription]	NA|401aa|down_7|NC_019745.1_3454232_3455435_+	COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism]	NA|117aa|down_8|NC_019745.1_3455841_3456192_+	pfam00395, SLH, S-layer homology domain	NA|459aa|down_9|NC_019745.1_3456267_3457644_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]
GCF_000317555.1_ASM31755v1	NC_019745	Gloeocapsa sp. PCC 7428, complete genome	7	3546270-3546388	7	CRISPRCasFinder	no		csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	Orphan	AAACAAAGTCCACCTGCGTGGACTACACCTAAATCTTGAACT	42	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	NA|119aa|up_5|NC_019745.1_3540229_3540586_+,NA|346aa|up_4|NC_019745.1_3540969_3542007_+,NA|67aa|up_2|NC_019745.1_3543477_3543678_+,NA|190aa|up_1|NC_019745.1_3544025_3544595_+,NA|73aa|down_1|NC_019745.1_3547198_3547417_+	NA|478aa|up_9|NC_019745.1_3536456_3537890_-	PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed	NA|167aa|up_8|NC_019745.1_3538415_3538916_+	pfam10882, bPH_5, Bacterial PH domain	NA|117aa|up_7|NC_019745.1_3539191_3539542_+	smart00924, MgtE_N, MgtE intracellular N domain	NA|137aa|up_6|NC_019745.1_3539487_3539898_+	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|119aa|up_5|NC_019745.1_3540229_3540586_+	NA	NA|346aa|up_4|NC_019745.1_3540969_3542007_+	NA	NA|332aa|up_3|NC_019745.1_3542036_3543032_+	COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane]	NA|67aa|up_2|NC_019745.1_3543477_3543678_+	NA	NA|190aa|up_1|NC_019745.1_3544025_3544595_+	NA	NA|493aa|up_0|NC_019745.1_3544734_3546213_+	cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2	NA|186aa|down_0|NC_019745.1_3546409_3546967_-	COG4446, COG4446, Uncharacterized protein conserved in bacteria [Function unknown]	NA|73aa|down_1|NC_019745.1_3547198_3547417_+	NA	NA|259aa|down_2|NC_019745.1_3547418_3548195_-	pfam06267, DUF1028, Family of unknown function (DUF1028)	NA|679aa|down_3|NC_019745.1_3548470_3550507_+	TIGR03108, eps_aminotran_1, exosortase A system-associated amidotransferase 1	NA|350aa|down_4|NC_019745.1_3550594_3551644_-	PRK11308, dppF, dipeptide transporter ATP-binding subunit; Provisional	NA|337aa|down_5|NC_019745.1_3551662_3552673_-	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|299aa|down_6|NC_019745.1_3552740_3553637_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|308aa|down_7|NC_019745.1_3553636_3554560_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|552aa|down_8|NC_019745.1_3554695_3556351_-	cd00995, PBP2_NikA_DppA_OppA_like, The substrate-binding domain of an ABC-type nickel/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|360aa|down_9|NC_019745.1_3557080_3558160_+	cd06853, GT_WecA_like, This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins
GCF_000317555.1_ASM31755v1	NC_019745	Gloeocapsa sp. PCC 7428, complete genome	8	4591331-4591458	8	CRISPRCasFinder	no		csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	Orphan	CGCTAGGAGTGTTCCTGCTGACAAAATTGGTAAATTACTCCCTGCAAA	48	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	NA|150aa|up_6|NC_019745.1_4583205_4583655_-,NA|77aa|down_0|NC_019745.1_4591808_4592039_-,NA|228aa|down_1|NC_019745.1_4592324_4593008_-	NA|105aa|up_9|NC_019745.1_4581603_4581918_+	pfam11267, DUF3067, Domain of unknown function (DUF3067)	NA|111aa|up_8|NC_019745.1_4581945_4582278_-	PRK10089, PRK10089, chaperone CsaA	NA|274aa|up_7|NC_019745.1_4582274_4583096_-	CHL00182, tatC, Sec-independent translocase component C; Provisional	NA|150aa|up_6|NC_019745.1_4583205_4583655_-	NA	NA|728aa|up_5|NC_019745.1_4583841_4586025_-	cd13401, Slt70-like, 70kDa soluble lytic transglycosylase (Slt70) and similar proteins	NA|130aa|up_4|NC_019745.1_4586107_4586497_-	cd10158, CsoR-like_DUF156_1, Uncharacterized family 1; belongs to a superfamily containing the transcriptional regulators CsoR (copper-sensitive operon repressor), RcnR, and FrmR, and related domains; this family was previously known as part of DUF156	NA|419aa|up_3|NC_019745.1_4586754_4588011_-	cd14748, PBP2_UgpB, The periplasmic-binding component of ABC transport system specific for sn-glycerol-3-phosphate; possesses type 2 periplasmic binding fold	NA|437aa|up_2|NC_019745.1_4588156_4589467_+	cd14748, PBP2_UgpB, The periplasmic-binding component of ABC transport system specific for sn-glycerol-3-phosphate; possesses type 2 periplasmic binding fold	NA|346aa|up_1|NC_019745.1_4589473_4590511_-	PRK00292, glk, glucokinase; Provisional	NA|216aa|up_0|NC_019745.1_4590632_4591280_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|77aa|down_0|NC_019745.1_4591808_4592039_-	NA	NA|228aa|down_1|NC_019745.1_4592324_4593008_-	NA	NA|273aa|down_2|NC_019745.1_4593037_4593856_-	COG1606, COG1606, ATP-utilizing enzymes of the PP-loop superfamily [General function prediction only]	NA|925aa|down_3|NC_019745.1_4594001_4596776_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|748aa|down_4|NC_019745.1_4596795_4599039_-	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|148aa|down_5|NC_019745.1_4599043_4599487_-	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|746aa|down_6|NC_019745.1_4599500_4601738_-	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|616aa|down_7|NC_019745.1_4601797_4603645_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|290aa|down_8|NC_019745.1_4604163_4605033_+	cd06582, TM_PBP1_LivH_like, Transmembrane subunit (TM) of Escherichia coli LivH and related proteins	NA|46aa|down_9|NC_019745.1_4605184_4605322_-	pfam08078, PsaX, PsaX family
GCF_000317555.1_ASM31755v1	NC_019745	Gloeocapsa sp. PCC 7428, complete genome	9	4675562-4675674	9	CRISPRCasFinder	no		csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	Orphan	GCCATCACCACCGTAGAGGGTGTCATAGCCAT	32	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	NA|112aa|up_8|NC_019745.1_4661157_4661493_-,NA|92aa|up_5|NC_019745.1_4667877_4668153_-,NA|93aa|up_2|NC_019745.1_4673276_4673555_-,NA|55aa|up_1|NC_019745.1_4673755_4673920_-,NA|147aa|up_0|NC_019745.1_4674286_4674727_-,NA|65aa|down_5|NC_019745.1_4685555_4685750_-	NA|258aa|up_9|NC_019745.1_4659343_4660117_+	COG1496, yfiH, Multicopper polyphenol oxidase (laccase) [Secondary metabolites biosynthesis, transport and catabolism]	NA|112aa|up_8|NC_019745.1_4661157_4661493_-	NA	NA|253aa|up_7|NC_019745.1_4661846_4662605_-	sd00006, TPR, Tetratricopeptide repeat	NA|1694aa|up_6|NC_019745.1_4662612_4667694_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|92aa|up_5|NC_019745.1_4667877_4668153_-	NA	NA|859aa|up_4|NC_019745.1_4668294_4670871_-	pfam05860, Haemagg_act, haemagglutination activity domain	NA|478aa|up_3|NC_019745.1_4671219_4672653_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|93aa|up_2|NC_019745.1_4673276_4673555_-	NA	NA|55aa|up_1|NC_019745.1_4673755_4673920_-	NA	NA|147aa|up_0|NC_019745.1_4674286_4674727_-	NA	NA|412aa|down_0|NC_019745.1_4676153_4677389_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|386aa|down_1|NC_019745.1_4677395_4678553_+	pfam08852, DUF1822, Protein of unknown function (DUF1822)	NA|904aa|down_2|NC_019745.1_4678691_4681403_-	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|371aa|down_3|NC_019745.1_4681320_4682433_-	pfam05860, Haemagg_act, haemagglutination activity domain	NA|875aa|down_4|NC_019745.1_4682556_4685181_-	PRK09776, PRK09776, putative diguanylate cyclase; Provisional	NA|65aa|down_5|NC_019745.1_4685555_4685750_-	NA	NA|92aa|down_6|NC_019745.1_4685749_4686025_-	cd15242, 7tm_Proteorhodopsin, green- and blue-light absorbing proteorhodopsins, member of the seven-transmembrane GPCR superfamily	NA|347aa|down_7|NC_019745.1_4686449_4687490_-	TIGR03609, S_layer_CsaB, polysaccharide pyruvyl transferase CsaB	NA|115aa|down_8|NC_019745.1_4687631_4687976_+	pfam10693, DUF2499, Protein of unknown function (DUF2499)	NA|107aa|down_9|NC_019745.1_4688096_4688417_+	pfam12159, DUF3593, Protein of unknown function (DUF3593)
GCF_000317555.1_ASM31755v1	NC_019745	Gloeocapsa sp. PCC 7428, complete genome	10	5170418-5172269	3,10,3,4	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	WYL,cas3,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2	csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	Type I-D	ATTTCAATTAACATAAATCCTTATCAGGGATTGAAAC,ATTTCAATTAACATAAATCCTTATCAGGGATTGAAAC,ATTTCAATTAACATAAATCCTTATCAGGGATTGAAAC,ATTTCAATTAACATAAATCCTTATCAGGGATTGAAAC	37,37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B:I-D,II-B	22,25,25,22	25	TypeI-D	csa3,DinG,Cas14u_CAS-V,csb2gr5,cas7,cas8u1,cas3,c2c9_V-U4,cas14j,WYL,cas6,cas8b3,cas1,cas2,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4	NA,NA|83aa|down_1|NC_019745.1_5173718_5173967_+,NA|79aa|down_2|NC_019745.1_5173994_5174231_+,NA|427aa|down_7|NC_019745.1_5178677_5179958_-	WYL|288aa|up_9|NC_019745.1_5158226_5159090_-	pfam13280, WYL, WYL domain	cas3|709aa|up_8|NC_019745.1_5159196_5161323_+	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	cas10d|1095aa|up_7|NC_019745.1_5161513_5164798_+	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	csc2gr7|340aa|up_6|NC_019745.1_5164833_5165853_+	pfam18320, Csc2, Csc2 Crispr	csc1gr5|235aa|up_5|NC_019745.1_5165858_5166563_+	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	2OG_CAS|206aa|up_4|NC_019745.1_5166821_5167439_+	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	cas6|273aa|up_3|NC_019745.1_5167428_5168247_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|198aa|up_2|NC_019745.1_5168257_5168851_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|335aa|up_1|NC_019745.1_5168868_5169873_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|90aa|up_0|NC_019745.1_5169886_5170156_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|274aa|down_0|NC_019745.1_5172477_5173299_-	pfam14256, YwiC, YwiC-like protein	NA|83aa|down_1|NC_019745.1_5173718_5173967_+	NA	NA|79aa|down_2|NC_019745.1_5173994_5174231_+	NA	NA|146aa|down_3|NC_019745.1_5174554_5174992_+	TIGR03562, osmo_induc_OsmC, peroxiredoxin, OsmC subfamily	NA|745aa|down_4|NC_019745.1_5175231_5177466_-	COG1449, COG1449, Alpha-amylase/alpha-mannosidase [Carbohydrate transport and metabolism]	NA|77aa|down_5|NC_019745.1_5177587_5177818_-	pfam01106, NifU, NifU-like domain	NA|219aa|down_6|NC_019745.1_5177901_5178558_-	pfam11866, DUF3386, Protein of unknown function (DUF3386)	NA|427aa|down_7|NC_019745.1_5178677_5179958_-	NA	NA|388aa|down_8|NC_019745.1_5180323_5181487_+	PRK09303, PRK09303, histidine kinase	NA|137aa|down_9|NC_019745.1_5181673_5182084_-	cd03425, MutT_pyrophosphohydrolase, The MutT pyrophosphohydrolase is a prototypical Nudix hydrolase that catalyzes the hydrolysis of nucleoside and deoxynucleoside triphosphates (NTPs and dNTPs) by substitution at a beta-phosphorus to yield a nucleotide monophosphate (NMP) and inorganic pyrophosphate (PPi)
