assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000021805.1_ASM2180v1	NC_011726	Rippkaea orientalis PCC 8801, complete sequence	1	514094-514711	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	Type I-D	GTTTCAATCCC-----------ATTACTAGGATTCATTAAAAAGAAAC,GTTTCAATCCCATTACTAGGATTCATTAAAAAGAAAC,GTTTCAATCCCATTACTAGGATTCATTAAAAAGAAAC	48,37,37	0	0	NA	NA	V-U2:V-U2:V-U2	8,8,8	8	TypeI-D	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	NA|92aa|up_6|NC_011726.1_505197_505473_+,NA|74aa|up_3|NC_011726.1_507463_507685_-,NA|83aa|down_8|NC_011726.1_524999_525248_-	NA|694aa|up_9|NC_011726.1_501281_503363_+	PRK13411, PRK13411, molecular chaperone DnaK; Provisional	NA|132aa|up_8|NC_011726.1_503393_503789_+	TIGR00004, RutC_family_protein, reactive intermediate/imine deaminase	NA|412aa|up_7|NC_011726.1_503855_505091_+	PRK07590, PRK07590, L,L-diaminopimelate aminotransferase; Validated	NA|92aa|up_6|NC_011726.1_505197_505473_+	NA	NA|283aa|up_5|NC_011726.1_505883_506732_+	cd03401, SPFH_prohibitin, Prohibitin family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|184aa|up_4|NC_011726.1_506896_507448_+	pfam11371, DUF3172, Protein of unknown function (DUF3172)	NA|74aa|up_3|NC_011726.1_507463_507685_-	NA	NA|1107aa|up_2|NC_011726.1_507896_511217_-	TIGR00915, Probable_aminoglycoside_efflux_pump, The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family	NA|219aa|up_1|NC_011726.1_511428_512085_+	pfam07885, Ion_trans_2, Ion channel	NA|271aa|up_0|NC_011726.1_513137_513950_+	COG1562, ERG9, Phytoene/squalene synthetase [Lipid metabolism]	cas2|98aa|down_0|NC_011726.1_514904_515198_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|326aa|down_1|NC_011726.1_515194_516172_-	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas4|197aa|down_2|NC_011726.1_516204_516795_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas6|270aa|down_3|NC_011726.1_516920_517730_-	pfam10040, CRISPR_Cas6, CRISPR-associated endoribonuclease Cas6	csc1gr5|259aa|down_4|NC_011726.1_517698_518475_-	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	csc2gr7|357aa|down_5|NC_011726.1_518617_519688_-	pfam18320, Csc2, Csc2 Crispr	cas10d|975aa|down_6|NC_011726.1_519760_522685_-	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	cas3|702aa|down_7|NC_011726.1_522742_524848_-	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	NA|83aa|down_8|NC_011726.1_524999_525248_-	NA	WYL|316aa|down_9|NC_011726.1_525444_526392_+	pfam13280, WYL, WYL domain
GCF_000021805.1_ASM2180v1	NC_011726	Rippkaea orientalis PCC 8801, complete sequence	2	604771-604971	2,2	PILER-CR,CRISPRCasFinder	no	cas14j	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	Unclear	GTTTCAATCCCTCATAGGGATTATTGCTTATTTTAACTGATAAAGAAT,GTTTCAATCCCTCATAGGGATTATTGCTTATTTTAACT	48,38	0	0	NA	NA	I-D,II-B:I-D,II-B	2,2	2	TypeV	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	NA|364aa|up_3|NC_011726.1_601206_602298_+,NA|75aa|down_3|NC_011726.1_608120_608345_+,NA|161aa|down_9|NC_011726.1_612623_613106_+	NA|157aa|up_9|NC_011726.1_591216_591687_+	pfam08846, DUF1816, Domain of unknown function (DUF1816)	NA|1265aa|up_8|NC_011726.1_591711_595506_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|553aa|up_7|NC_011726.1_595554_597213_-	pfam13676, TIR_2, TIR domain	NA|321aa|up_6|NC_011726.1_597565_598528_+	COG1622, CyoA, Heme/copper-type cytochrome/quinol oxidases, subunit 2 [Energy production and conversion]	NA|556aa|up_5|NC_011726.1_598556_600224_+	TIGR02891, Probable_cytochrome_c_oxidase_subunit_1-beta, cytochrome c oxidase, subunit I	NA|206aa|up_4|NC_011726.1_600322_600940_+	COG1845, CyoC, Heme/copper-type cytochrome/quinol oxidase, subunit 3 [Energy production and conversion]	NA|364aa|up_3|NC_011726.1_601206_602298_+	NA	NA|50aa|up_2|NC_011726.1_602294_602444_-	pfam01724, DUF29, Domain of unknown function DUF29	NA|493aa|up_1|NC_011726.1_602526_604005_-	PRK10867, PRK10867, signal recognition particle protein; Provisional	NA|216aa|up_0|NC_011726.1_604072_604720_+	COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]	NA|391aa|down_0|NC_011726.1_605270_606443_-	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed	NA|169aa|down_1|NC_011726.1_606447_606954_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|192aa|down_2|NC_011726.1_607218_607794_+	pfam16734, Pilin_GH, Type IV pilin-like G and H, putative	NA|75aa|down_3|NC_011726.1_608120_608345_+	NA	NA|532aa|down_4|NC_011726.1_608434_610030_+	cd07488, Peptidases_S8_2, Peptidase S8 family domain, uncharacterized subfamily 2	NA|171aa|down_5|NC_011726.1_610168_610681_+	pfam13523, Acetyltransf_8, Acetyltransferase (GNAT) domain	NA|288aa|down_6|NC_011726.1_610726_611590_-	cd01637, IMPase_like, Inositol-monophosphatase-like domains	NA|56aa|down_7|NC_011726.1_611653_611821_+	PRK14276, PRK14276, chaperone protein DnaJ; Provisional	NA|264aa|down_8|NC_011726.1_611804_612596_+	sd00006, TPR, Tetratricopeptide repeat	NA|161aa|down_9|NC_011726.1_612623_613106_+	NA
GCF_000021805.1_ASM2180v1	NC_011726	Rippkaea orientalis PCC 8801, complete sequence	3	827835-827932	3	CRISPRCasFinder	no		PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	Orphan	GTTGAGGAAGAAGAAGACATTGA	23	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	NA|168aa|up_9|NC_011726.1_818357_818861_+,NA|237aa|up_5|NC_011726.1_822525_823236_+,NA|66aa|up_0|NC_011726.1_827105_827303_+,NA|163aa|down_5|NC_011726.1_834513_835002_+,NA|159aa|down_8|NC_011726.1_836998_837475_+	NA|168aa|up_9|NC_011726.1_818357_818861_+	NA	NA|271aa|up_8|NC_011726.1_818922_819735_-	COG4577, CcmK, Carbon dioxide concentrating mechanism/carboxysome shell protein [Secondary metabolites biosynthesis, transport, and catabolism / Energy production and conversion]	NA|355aa|up_7|NC_011726.1_819956_821021_+	pfam18087, RuBisCo_chap_C, Rubisco Assembly chaperone C-terminal domain	NA|326aa|up_6|NC_011726.1_821330_822308_+	PRK09375, PRK09375, quinolinate synthase NadA	NA|237aa|up_5|NC_011726.1_822525_823236_+	NA	NA|467aa|up_4|NC_011726.1_823510_824911_-	CHL00073, chlN, photochlorophyllide reductase subunit N	NA|125aa|up_3|NC_011726.1_824915_825290_-	pfam17265, DUF5331, Family of unknown function (DUF5331)	NA|290aa|up_2|NC_011726.1_825451_826321_-	CHL00072, chlL, photochlorophyllide reductase subunit L	NA|40aa|up_1|NC_011726.1_826936_827056_-	COG2002, AbrB, Regulators of stationary/sporulation gene expression [Transcription]	NA|66aa|up_0|NC_011726.1_827105_827303_+	NA	NA|818aa|down_0|NC_011726.1_828511_830965_+	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|198aa|down_1|NC_011726.1_831048_831642_+	COG5381, COG5381, Uncharacterized protein conserved in bacteria [Function unknown]	NA|118aa|down_2|NC_011726.1_831701_832055_+	COG5439, COG5439, Uncharacterized conserved protein [Function unknown]	NA|282aa|down_3|NC_011726.1_832127_832973_-	COG0739, NlpD, Membrane proteins related to metalloendopeptidases [Cell envelope biogenesis, outer membrane]	NA|433aa|down_4|NC_011726.1_832978_834277_-	PRK06349, PRK06349, homoserine dehydrogenase; Provisional	NA|163aa|down_5|NC_011726.1_834513_835002_+	NA	NA|306aa|down_6|NC_011726.1_835021_835939_-	PLN02578, PLN02578, hydrolase	NA|177aa|down_7|NC_011726.1_836131_836662_+	PRK00028, infC, translation initiation factor IF-3; Reviewed	NA|159aa|down_8|NC_011726.1_836998_837475_+	NA	NA|314aa|down_9|NC_011726.1_837471_838413_+	COG0392, COG0392, Predicted integral membrane protein [Function unknown]
GCF_000021805.1_ASM2180v1	NC_011726	Rippkaea orientalis PCC 8801, complete sequence	4	911959-913021	3,4,2	PILER-CR,CRISPRCasFinder,CRT	no	csa3,cas14j	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	Type I-A	GTTTCAATCCC-----------ATTACTAGGATTCATTAATAAGAAAC,GTTTCTTATTAATGAATCCTAGTAATGGGATTGAAAC,GTTTCTTATTAATGAATCCTAGTAATGGGATTGAAAC	48,37,37	0	0	NA	NA	V-U2:V-U2:V-U2	13,14,14	14	TypeV,TypeI-A	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	NA|234aa|up_1|NC_011726.1_908815_909517_-,NA|392aa|down_3|NC_011726.1_916312_917488_-,NA|229aa|down_4|NC_011726.1_917833_918520_+,NA|147aa|down_8|NC_011726.1_922055_922496_-,NA|92aa|down_9|NC_011726.1_922699_922975_+	csa3|130aa|up_9|NC_011726.1_902306_902696_+	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|58aa|up_8|NC_011726.1_902692_902866_-	pfam02069, Metallothio_Pro, Prokaryotic metallothionein	NA|324aa|up_7|NC_011726.1_903033_904005_+	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|359aa|up_6|NC_011726.1_903992_905069_+	COG2319, COG2319, FOG: WD40 repeat [General function prediction only]	NA|215aa|up_5|NC_011726.1_905166_905811_+	PRK09347, folE, GTP cyclohydrolase I; Provisional	NA|347aa|up_4|NC_011726.1_905880_906921_+	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|370aa|up_3|NC_011726.1_907047_908157_-	PRK15062, PRK15062, hydrogenase isoenzymes formation protein HypD; Provisional	NA|147aa|up_2|NC_011726.1_908345_908786_+	pfam02657, SufE, Fe-S metabolism associated domain	NA|234aa|up_1|NC_011726.1_908815_909517_-	NA	NA|616aa|up_0|NC_011726.1_909864_911712_+	cd10918, CE4_NodB_like_5s_6s, Putative catalytic NodB homology domain of PgaB, IcaB, and similar proteins which consist of a deformed (beta/alpha)8 barrel fold with 5- or 6-strands	NA|170aa|down_0|NC_011726.1_913239_913749_-	pfam01475, FUR, Ferric uptake regulator family	NA|294aa|down_1|NC_011726.1_913945_914827_-	COG2027, DacB, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) [Cell envelope biogenesis, outer membrane]	NA|341aa|down_2|NC_011726.1_915002_916025_-	PRK14982, PRK14982, acyl-ACP reductase; Provisional	NA|392aa|down_3|NC_011726.1_916312_917488_-	NA	NA|229aa|down_4|NC_011726.1_917833_918520_+	NA	NA|361aa|down_5|NC_011726.1_918549_919632_-	PRK00082, hrcA, heat-inducible transcription repressor; Provisional	NA|120aa|down_6|NC_011726.1_919796_920156_-	cd01528, RHOD_2, Member of the Rhodanese Homology Domain superfamily, subgroup 2	NA|562aa|down_7|NC_011726.1_920332_922018_+	pfam11832, DUF3352, Protein of unknown function (DUF3352)	NA|147aa|down_8|NC_011726.1_922055_922496_-	NA	NA|92aa|down_9|NC_011726.1_922699_922975_+	NA
GCF_000021805.1_ASM2180v1	NC_011726	Rippkaea orientalis PCC 8801, complete sequence	5	1733348-1733534	5	CRISPRCasFinder	no	cas6,cas8b3,cas7,cas5	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	Unclear	TGTGATTCATGGCTTCATGCCGTTAGGCGTTGCTCAA	37	0	0	NA	NA	NA	2	2	Unclear	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	NA,NA	NA|92aa|up_9|NC_011726.1_1720462_1720738_+	PRK14857, tatA, TatA/E family twin arginine-targeting protein translocase	NA|206aa|up_8|NC_011726.1_1720812_1721430_+	pfam11833, CPP1-like, Protein CHAPERONE-LIKE PROTEIN OF POR1-like	NA|716aa|up_7|NC_011726.1_1721556_1723704_+	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|350aa|up_6|NC_011726.1_1723830_1724880_+	cd03469, Rieske_RO_Alpha_N, Rieske non-heme iron oxygenase (RO) family, N-terminal Rieske domain of the oxygenase alpha subunit; The RO family comprise a large class of aromatic ring-hydroxylating dioxygenases found predominantly in microorganisms	NA|682aa|up_5|NC_011726.1_1725123_1727169_+	PRK05354, PRK05354, biosynthetic arginine decarboxylase	NA|464aa|up_4|NC_011726.1_1727757_1729149_-	pfam06527, TniQ, TniQ	cas6|222aa|up_3|NC_011726.1_1729302_1729968_+	pfam09559, Cas6, Cas6 Crispr	cas8b3|516aa|up_2|NC_011726.1_1729964_1731512_+	cd09713, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas7|324aa|up_1|NC_011726.1_1731492_1732464_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|213aa|up_0|NC_011726.1_1732460_1733099_+	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	NA|1717aa|down_0|NC_011726.1_1733636_1738787_+	cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold	NA|330aa|down_1|NC_011726.1_1738828_1739818_-	pfam06527, TniQ, TniQ	NA|374aa|down_2|NC_011726.1_1739814_1740936_-	pfam13401, AAA_22, AAA domain	NA|923aa|down_3|NC_011726.1_1740929_1743698_-	pfam00665, rve, Integrase core domain	NA|65aa|down_4|NC_011726.1_1744065_1744260_-	COG1724, COG1724, Predicted RNA binding protein (dsRBD-like fold), HicA family    [General function prediction only]	NA|75aa|down_5|NC_011726.1_1744252_1744477_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|1006aa|down_6|NC_011726.1_1744613_1747631_-	COG4889, COG4889, Predicted helicase [General function prediction only]	NA|204aa|down_7|NC_011726.1_1748041_1748653_-	pfam05685, Uma2, Putative restriction endonuclease	NA|387aa|down_8|NC_011726.1_1748670_1749831_-	PRK00025, lpxB, lipid-A-disaccharide synthase; Reviewed	NA|277aa|down_9|NC_011726.1_1749853_1750684_-	PRK05289, PRK05289, acyl-ACP--UDP-N-acetylglucosamine O-acyltransferase
GCF_000021805.1_ASM2180v1	NC_011726	Rippkaea orientalis PCC 8801, complete sequence	6	2283960-2284149	4	PILER-CR	no	c2c5_V-U5	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	Type V-U5	AAACTTTCAACCTACCCCTTATCGGGATGGCGGTTGAAACCCAT	44	0	0	NA	NA	V-U5	2	2	TypeV-U5	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	NA|156aa|up_4|NC_011726.1_2278398_2278866_-,NA|64aa|up_1|NC_011726.1_2280868_2281060_-,c2c5_V-U5|613aa|down_0|NC_011726.1_2284683_2286522_-,NA|73aa|down_4|NC_011726.1_2289151_2289370_+,NA|87aa|down_7|NC_011726.1_2291046_2291307_+,NA|84aa|down_8|NC_011726.1_2291343_2291595_+	NA|218aa|up_9|NC_011726.1_2271763_2272417_+	PRK00058, PRK00058, peptide-methionine (S)-S-oxide reductase MsrA	NA|329aa|up_8|NC_011726.1_2272622_2273609_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|217aa|up_7|NC_011726.1_2273699_2274350_-	pfam05685, Uma2, Putative restriction endonuclease	NA|806aa|up_6|NC_011726.1_2274475_2276893_-	COG4354, COG4354, Predicted bile acid beta-glucosidase [Carbohydrate transport and metabolism]	NA|429aa|up_5|NC_011726.1_2277056_2278343_-	PRK07369, PRK07369, dihydroorotase; Provisional	NA|156aa|up_4|NC_011726.1_2278398_2278866_-	NA	NA|381aa|up_3|NC_011726.1_2279069_2280212_+	TIGR02048, gshA_cyano, glutamate--cysteine ligase, cyanobacterial, putative	NA|153aa|up_2|NC_011726.1_2280406_2280865_+	cd18094, SpoU-like_TrmL, SAM-dependent tRNA methylase related to TrmL	NA|64aa|up_1|NC_011726.1_2280868_2281060_-	NA	NA|728aa|up_0|NC_011726.1_2281325_2283509_+	pfam01551, Peptidase_M23, Peptidase family M23	c2c5_V-U5|613aa|down_0|NC_011726.1_2284683_2286522_-	NA	NA|144aa|down_1|NC_011726.1_2286590_2287022_+	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|523aa|down_2|NC_011726.1_2287036_2288605_+	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|123aa|down_3|NC_011726.1_2288707_2289076_+	cd01038, Endonuclease_DUF559, Domain of unknown function, appears to be related to a diverse group of endonucleases	NA|73aa|down_4|NC_011726.1_2289151_2289370_+	NA	NA|80aa|down_5|NC_011726.1_2289366_2289606_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|458aa|down_6|NC_011726.1_2289666_2291040_+	cd17287, RMtype1_S_EcoN10ORF171P_TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit TRD-CR, similar to Escherichia coli N10-0505 S subunit (S	NA|87aa|down_7|NC_011726.1_2291046_2291307_+	NA	NA|84aa|down_8|NC_011726.1_2291343_2291595_+	NA	NA|142aa|down_9|NC_011726.1_2291581_2292007_+	COG1569, COG1569, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]
GCF_000021805.1_ASM2180v1	NC_011726	Rippkaea orientalis PCC 8801, complete sequence	7	2521799-2521867	6	CRISPRCasFinder	no		PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	Orphan	AATTTACAAGGGGCTGATTTAACA	24	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	NA|70aa|up_7|NC_011726.1_2508831_2509041_+,NA|64aa|up_0|NC_011726.1_2519284_2519476_+,NA|485aa|down_2|NC_011726.1_2524851_2526306_-,NA|84aa|down_9|NC_011726.1_2534491_2534743_+	NA|71aa|up_9|NC_011726.1_2507848_2508061_+	cd01716, Hfq, bacterial Hfq-like	NA|224aa|up_8|NC_011726.1_2508128_2508800_+	cd19927, REC_Ycf29, phosphoacceptor receiver (REC) domain of probable transcriptional regulator Ycf29	NA|70aa|up_7|NC_011726.1_2508831_2509041_+	NA	NA|444aa|up_6|NC_011726.1_2509087_2510419_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|709aa|up_5|NC_011726.1_2511680_2513807_+	pfam01804, Penicil_amidase, Penicillin amidase	NA|338aa|up_4|NC_011726.1_2513980_2514994_+	pfam09994, DUF2235, Uncharacterized alpha/beta hydrolase domain (DUF2235)	NA|577aa|up_3|NC_011726.1_2515038_2516769_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|669aa|up_2|NC_011726.1_2516944_2518951_+	pfam01060, TTR-52, Transthyretin-like family	NA|89aa|up_1|NC_011726.1_2519046_2519313_+	pfam04365, BrnT_toxin, Ribonuclease toxin, BrnT, of type II toxin-antitoxin system	NA|64aa|up_0|NC_011726.1_2519284_2519476_+	NA	NA|393aa|down_0|NC_011726.1_2521997_2523176_-	COG4552, Eis, Predicted acetyltransferase involved in intracellular survival and related acetyltransferases [General function prediction only]	NA|547aa|down_1|NC_011726.1_2523204_2524845_-	COG5305, COG5305, Predicted membrane protein [Function unknown]	NA|485aa|down_2|NC_011726.1_2524851_2526306_-	NA	NA|1035aa|down_3|NC_011726.1_2526328_2529433_-	COG0383, AMS1, Alpha-mannosidase [Carbohydrate transport and metabolism]	NA|258aa|down_4|NC_011726.1_2529554_2530328_+	TIGR03413, GSH_gloB, hydroxyacylglutathione hydrolase	NA|270aa|down_5|NC_011726.1_2530333_2531143_+	pfam00520, Ion_trans, Ion transport protein	NA|256aa|down_6|NC_011726.1_2531217_2531985_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|344aa|down_7|NC_011726.1_2532036_2533068_+	TIGR02475, Probable_cobalamine_biosynthesis_protein, cobalamin biosynthesis protein CobW	NA|184aa|down_8|NC_011726.1_2533588_2534140_-	COG0783, Dps, DNA-binding ferritin-like protein (oxidative damage protectant) [Inorganic ion transport and metabolism]	NA|84aa|down_9|NC_011726.1_2534491_2534743_+	NA
GCF_000021805.1_ASM2180v1	NC_011726	Rippkaea orientalis PCC 8801, complete sequence	8	3093017-3093354	5,7,3	PILER-CR,CRISPRCasFinder,CRT	no	c2c5_V-U5	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	Type V-U5	AGTTTCATCAACCCTCCTGATGTGGGATGGGTTGAAAG,AGTTTCATCAACCCTCCTGATGTGGGATGGGTTGAAAG,AGTTTCATCAACCCTCCTGATGTGGGATGGGTTGAAAG	38,38,38	1	1	3093055-3093096	NC_011721.1_37373-37332	V-U5:V-U5:V-U5	3,4,4	4	TypeV-U5	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	NA|412aa|up_8|NC_011726.1_3078475_3079711_-,c2c5_V-U5|683aa|up_0|NC_011726.1_3090391_3092440_+,NA	NA|168aa|up_9|NC_011726.1_3077964_3078468_+	pfam06527, TniQ, TniQ	NA|412aa|up_8|NC_011726.1_3078475_3079711_-	NA	NA|155aa|up_7|NC_011726.1_3079714_3080179_-	TIGR04435, ABC_transporter, restriction system-associated AAA family ATPase	NA|331aa|up_6|NC_011726.1_3080458_3081451_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|288aa|up_5|NC_011726.1_3081522_3082386_-	TIGR04435, ABC_transporter, restriction system-associated AAA family ATPase	NA|693aa|up_4|NC_011726.1_3082403_3084482_-	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|387aa|up_3|NC_011726.1_3084484_3085645_-	cd17261, RMtype1_S_EcoKI-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit TRD-CR, similar to Escherichia coli str	NA|876aa|up_2|NC_011726.1_3087314_3089942_-	COG4096, HsdR, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|59aa|up_1|NC_011726.1_3090065_3090242_-	pfam07878, RHH_5, CopG-like RHH_1 or ribbon-helix-helix domain, RHH_5	c2c5_V-U5|683aa|up_0|NC_011726.1_3090391_3092440_+	NA	NA|665aa|down_0|NC_011726.1_3093696_3095691_+	pfam09299, Mu-transpos_C, Mu transposase, C-terminal	NA|297aa|down_1|NC_011726.1_3095697_3096588_+	pfam13401, AAA_22, AAA domain	NA|167aa|down_2|NC_011726.1_3096594_3097095_+	pfam06527, TniQ, TniQ	NA|1427aa|down_3|NC_011726.1_3097066_3101347_-	pfam13173, AAA_14, AAA domain	c2c5_V-U5|661aa|down_4|NC_011726.1_3101441_3103424_+	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|355aa|down_5|NC_011726.1_3104860_3105925_+	PLN02433, PLN02433, uroporphyrinogen decarboxylase	NA|380aa|down_6|NC_011726.1_3106018_3107158_-	cd13682, PBP2_TRAP_alpha-ketoacid, Substrate-binding component of an alpha-keto acid binding Tripartite ATP-independent Periplasmic transporter and related proteins; contains the type 2 periplasmic-binding protein fold	NA|198aa|down_7|NC_011726.1_3107233_3107827_+	COG4665, FcbT2, TRAP-type mannitol/chloroaromatic compound transport system, small permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|445aa|down_8|NC_011726.1_3107938_3109273_+	COG4664, FcbT3, TRAP-type mannitol/chloroaromatic compound transport system, large permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|100aa|down_9|NC_011726.1_3109442_3109742_+	pfam17195, DUF5132, Protein of unknown function (DUF5132)
GCF_000021805.1_ASM2180v1	NC_011726	Rippkaea orientalis PCC 8801, complete sequence	9	3103986-3104241	6,8,4	PILER-CR,CRISPRCasFinder,CRT	no	c2c5_V-U5,cas14j	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	Unclear	GTTTCATACAGGTTTTTGACCTCCCATTGATTGAAAGA,GTTTCATACAGGTTTTTGACCTCCCATTGATTGAAAG,GTTTCATACAGGTTTTTGACCTCCCATTGATTGAAAGA	38,37,38	0	0	NA	NA	NA:NA:NA	3,3,3	3	TypeV	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	c2c5_V-U5|683aa|up_5|NC_011726.1_3090391_3092440_+,NA	NA|693aa|up_9|NC_011726.1_3082403_3084482_-	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|387aa|up_8|NC_011726.1_3084484_3085645_-	cd17261, RMtype1_S_EcoKI-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit TRD-CR, similar to Escherichia coli str	NA|876aa|up_7|NC_011726.1_3087314_3089942_-	COG4096, HsdR, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|59aa|up_6|NC_011726.1_3090065_3090242_-	pfam07878, RHH_5, CopG-like RHH_1 or ribbon-helix-helix domain, RHH_5	c2c5_V-U5|683aa|up_5|NC_011726.1_3090391_3092440_+	NA	NA|665aa|up_4|NC_011726.1_3093696_3095691_+	pfam09299, Mu-transpos_C, Mu transposase, C-terminal	NA|297aa|up_3|NC_011726.1_3095697_3096588_+	pfam13401, AAA_22, AAA domain	NA|167aa|up_2|NC_011726.1_3096594_3097095_+	pfam06527, TniQ, TniQ	NA|1427aa|up_1|NC_011726.1_3097066_3101347_-	pfam13173, AAA_14, AAA domain	c2c5_V-U5|661aa|up_0|NC_011726.1_3101441_3103424_+	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|355aa|down_0|NC_011726.1_3104860_3105925_+	PLN02433, PLN02433, uroporphyrinogen decarboxylase	NA|380aa|down_1|NC_011726.1_3106018_3107158_-	cd13682, PBP2_TRAP_alpha-ketoacid, Substrate-binding component of an alpha-keto acid binding Tripartite ATP-independent Periplasmic transporter and related proteins; contains the type 2 periplasmic-binding protein fold	NA|198aa|down_2|NC_011726.1_3107233_3107827_+	COG4665, FcbT2, TRAP-type mannitol/chloroaromatic compound transport system, small permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|445aa|down_3|NC_011726.1_3107938_3109273_+	COG4664, FcbT3, TRAP-type mannitol/chloroaromatic compound transport system, large permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|100aa|down_4|NC_011726.1_3109442_3109742_+	pfam17195, DUF5132, Protein of unknown function (DUF5132)	NA|997aa|down_5|NC_011726.1_3109993_3112984_+	COG0474, MgtA, Cation transport ATPase [Inorganic ion transport and metabolism]	NA|411aa|down_6|NC_011726.1_3113119_3114352_+	PRK12459, PRK12459, S-adenosylmethionine synthetase; Provisional	NA|365aa|down_7|NC_011726.1_3114582_3115677_-	cd01156, IVD, Isovaleryl-CoA dehydrogenase	NA|111aa|down_8|NC_011726.1_3115685_3116018_-	cd11532, NTP-PPase_COG4997, Nucleoside Triphosphate Pyrophosphohydrolase (EC 3	NA|310aa|down_9|NC_011726.1_3116037_3116967_-	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional
GCF_000021805.1_ASM2180v1	NC_011726	Rippkaea orientalis PCC 8801, complete sequence	10	4328521-4331328	7,9,5	PILER-CR,CRISPRCasFinder,CRT	no	c2c8_V-U2	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	Type V-U2	GTTTCAATCCC-----------ATTGCTAGGATTCATTAATAAGAAAC,GTTTCAATCCCATTGCTAGGATTCATTAATAAGAAAC,GTTTCAATCCCATTGCTAGGATTCATTAATAAGAAAC	48,37,37	0	0	NA	NA	V-U2:V-U2:V-U2	38,38,38	38	TypeV-U2	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	NA|253aa|up_2|NC_011726.1_4325162_4325921_-,NA	NA|352aa|up_9|NC_011726.1_4317410_4318466_+	cd03785, GT28_MurG, undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase	NA|534aa|up_8|NC_011726.1_4318501_4320103_-	COG5305, COG5305, Predicted membrane protein [Function unknown]	NA|445aa|up_7|NC_011726.1_4320132_4321467_-	TIGR00665, DnaB, replicative DNA helicase	NA|73aa|up_6|NC_011726.1_4321537_4321756_-	PRK11409, PRK11409, YoeB-YefM toxin-antitoxin system antitoxin YefM	NA|70aa|up_5|NC_011726.1_4322025_4322235_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|448aa|up_4|NC_011726.1_4322320_4323664_-	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|480aa|up_3|NC_011726.1_4323660_4325100_-	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|253aa|up_2|NC_011726.1_4325162_4325921_-	NA	NA|457aa|up_1|NC_011726.1_4325949_4327320_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|268aa|up_0|NC_011726.1_4327593_4328397_+	cd00884, beta_CA_cladeB, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	c2c8_V-U2|495aa|down_0|NC_011726.1_4331769_4333254_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|182aa|down_1|NC_011726.1_4333439_4333985_-	TIGR03426, shape_MreD, rod shape-determining protein MreD	NA|250aa|down_2|NC_011726.1_4333981_4334731_-	PRK13922, PRK13922, rod shape-determining protein MreC; Provisional	NA|350aa|down_3|NC_011726.1_4334757_4335807_-	PRK13927, PRK13927, rod shape-determining protein MreB; Provisional	NA|123aa|down_4|NC_011726.1_4336080_4336449_+	PRK07459, PRK07459, single-stranded DNA-binding protein; Provisional	NA|418aa|down_5|NC_011726.1_4336453_4337707_-	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|300aa|down_6|NC_011726.1_4338264_4339164_+	PRK00091, miaA, tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed	NA|133aa|down_7|NC_011726.1_4339160_4339559_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|94aa|down_8|NC_011726.1_4339668_4339950_+	PRK05974, PRK05974, phosphoribosylformylglycinamidine synthase subunit PurS; Reviewed	NA|228aa|down_9|NC_011726.1_4339953_4340637_+	PRK03619, PRK03619, phosphoribosylformylglycinamidine synthase subunit PurQ
GCF_000021805.1_ASM2180v1	NC_011726	Rippkaea orientalis PCC 8801, complete sequence	11	4342782-4342890	10	CRISPRCasFinder	no	c2c8_V-U2,cas14j	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	Unclear	TTTTATTAAGAGAAATTAGTTGAATGGAAAC	31	0	0	NA	NA	NA	1	1	TypeV	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	NA,NA|139aa|down_3|NC_011726.1_4345932_4346349_+	NA|350aa|up_9|NC_011726.1_4334757_4335807_-	PRK13927, PRK13927, rod shape-determining protein MreB; Provisional	NA|123aa|up_8|NC_011726.1_4336080_4336449_+	PRK07459, PRK07459, single-stranded DNA-binding protein; Provisional	NA|418aa|up_7|NC_011726.1_4336453_4337707_-	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|300aa|up_6|NC_011726.1_4338264_4339164_+	PRK00091, miaA, tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed	NA|133aa|up_5|NC_011726.1_4339160_4339559_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|94aa|up_4|NC_011726.1_4339668_4339950_+	PRK05974, PRK05974, phosphoribosylformylglycinamidine synthase subunit PurS; Reviewed	NA|228aa|up_3|NC_011726.1_4339953_4340637_+	PRK03619, PRK03619, phosphoribosylformylglycinamidine synthase subunit PurQ	NA|318aa|up_2|NC_011726.1_4340644_4341598_-	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|198aa|up_1|NC_011726.1_4341624_4342218_-	COG0823, TolB, Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]	NA|97aa|up_0|NC_011726.1_4342201_4342492_-	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|497aa|down_0|NC_011726.1_4342897_4344388_-	TIGR03708, poly_P_AMP_trns, polyphosphate:AMP phosphotransferase	NA|333aa|down_1|NC_011726.1_4344525_4345524_+	COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane]	NA|92aa|down_2|NC_011726.1_4345648_4345924_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|139aa|down_3|NC_011726.1_4345932_4346349_+	NA	NA|40aa|down_4|NC_011726.1_4346497_4346617_-	PRK02565, PRK02565, photosystem II reaction center protein J; Provisional	NA|40aa|down_5|NC_011726.1_4346652_4346772_-	PRK00753, psbL, photosystem II reaction center L; Provisional	NA|45aa|down_6|NC_011726.1_4346781_4346916_-	PRK02561, psbF, cytochrome b559 subunit beta; Provisional	NA|82aa|down_7|NC_011726.1_4346942_4347188_-	PRK02557, psbE, cytochrome b559 subunit alpha; Provisional	NA|337aa|down_8|NC_011726.1_4347281_4348292_-	PRK13684, PRK13684, photosynthesis system II assembly factor Ycf48	NA|116aa|down_9|NC_011726.1_4348381_4348729_-	pfam00301, Rubredoxin, Rubredoxin
GCF_000021805.1_ASM2180v1	NC_011726	Rippkaea orientalis PCC 8801, complete sequence	12	4398255-4398335	11	CRISPRCasFinder	no	c2c9_V-U4	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	Type V-U4	TTGCTGTTTATTGGCTTCTTGAAAGGC	27	0	0	NA	NA	NA	1	1	TypeV-U4	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	NA|70aa|up_3|NC_011726.1_4394662_4394872_+,NA	NA|367aa|up_9|NC_011726.1_4387577_4388678_+	PRK00002, aroB, 3-dehydroquinate synthase; Reviewed	NA|582aa|up_8|NC_011726.1_4388774_4390520_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|311aa|up_7|NC_011726.1_4390863_4391796_-	pfam07313, DUF1460, Protein of unknown function (DUF1460)	NA|183aa|up_6|NC_011726.1_4391924_4392473_-	cd00293, USP_Like, Usp: Universal stress protein family	NA|89aa|up_5|NC_011726.1_4392575_4392842_+	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|509aa|up_4|NC_011726.1_4392899_4394426_-	cd02142, McbC_SagB-like_oxidoreductase, oxidase similar to the microcin B17 processing protein McbC	NA|70aa|up_3|NC_011726.1_4394662_4394872_+	NA	NA|276aa|up_2|NC_011726.1_4394928_4395756_-	COG0668, MscS, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|314aa|up_1|NC_011726.1_4395910_4396852_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|418aa|up_0|NC_011726.1_4396880_4398134_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|425aa|down_0|NC_011726.1_4398911_4400186_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|275aa|down_1|NC_011726.1_4400182_4401007_-	pfam12644, DUF3782, Protein of unknown function (DUF3782)	NA|422aa|down_2|NC_011726.1_4401685_4402951_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|62aa|down_3|NC_011726.1_4402901_4403087_-	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|437aa|down_4|NC_011726.1_4403240_4404551_-	COG4247, Phy, 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]	c2c9_V-U4|404aa|down_5|NC_011726.1_4404943_4406155_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|598aa|down_6|NC_011726.1_4406265_4408059_-	COG1217, TypA, Predicted membrane GTPase involved in stress response [Signal transduction mechanisms]	NA|650aa|down_7|NC_011726.1_4408480_4410430_+	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|142aa|down_8|NC_011726.1_4410430_4410856_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|810aa|down_9|NC_011726.1_4410955_4413385_+	COG0699, COG0699, Predicted GTPases (dynamin-related) [General function prediction only]
GCF_000021805.1_ASM2180v1	NC_011726	Rippkaea orientalis PCC 8801, complete sequence	13	4400686-4400797	12	CRISPRCasFinder	no	c2c9_V-U4	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	Type V-U4	TATTTTCATCCCATTTACGCGCTTGTTCTTCCCTATC	37	0	0	NA	NA	NA	1	1	TypeV-U4	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	NA|70aa|up_5|NC_011726.1_4394662_4394872_+,NA|117aa|up_1|NC_011726.1_4398176_4398527_-,NA|88aa|down_9|NC_011726.1_4414123_4414387_+	NA|311aa|up_9|NC_011726.1_4390863_4391796_-	pfam07313, DUF1460, Protein of unknown function (DUF1460)	NA|183aa|up_8|NC_011726.1_4391924_4392473_-	cd00293, USP_Like, Usp: Universal stress protein family	NA|89aa|up_7|NC_011726.1_4392575_4392842_+	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|509aa|up_6|NC_011726.1_4392899_4394426_-	cd02142, McbC_SagB-like_oxidoreductase, oxidase similar to the microcin B17 processing protein McbC	NA|70aa|up_5|NC_011726.1_4394662_4394872_+	NA	NA|276aa|up_4|NC_011726.1_4394928_4395756_-	COG0668, MscS, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|314aa|up_3|NC_011726.1_4395910_4396852_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|418aa|up_2|NC_011726.1_4396880_4398134_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|117aa|up_1|NC_011726.1_4398176_4398527_-	NA	NA|425aa|up_0|NC_011726.1_4398911_4400186_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|422aa|down_0|NC_011726.1_4401685_4402951_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|62aa|down_1|NC_011726.1_4402901_4403087_-	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|437aa|down_2|NC_011726.1_4403240_4404551_-	COG4247, Phy, 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]	c2c9_V-U4|404aa|down_3|NC_011726.1_4404943_4406155_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|598aa|down_4|NC_011726.1_4406265_4408059_-	COG1217, TypA, Predicted membrane GTPase involved in stress response [Signal transduction mechanisms]	NA|650aa|down_5|NC_011726.1_4408480_4410430_+	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|142aa|down_6|NC_011726.1_4410430_4410856_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|810aa|down_7|NC_011726.1_4410955_4413385_+	COG0699, COG0699, Predicted GTPases (dynamin-related) [General function prediction only]	NA|113aa|down_8|NC_011726.1_4413623_4413962_+	pfam08872, KGK, KGK domain	NA|88aa|down_9|NC_011726.1_4414123_4414387_+	NA
GCF_000021805.1_ASM2180v1	NC_011727	Rippkaea orientalis PCC 8801 plasmid pP880103, complete sequence	1	1056-1161	1	CRISPRCasFinder	no			Orphan	GACTCCTTGTGTACCAAGTCTGGGGGGACAG	31	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas14j,cas2,cas1,cas4,cas6,csc1gr5,csc2gr7,cas10d,cas3,WYL,2OG_CAS,csa3,c2c9_V-U4,c2c10_CAS-V-U3,DinG,RT,cas8b3,cas7,cas5,Cas14c_CAS-V-F,c2c5_V-U5,c2c8_V-U2	NA|136aa|up_0|NC_011727.1_302_710_+,NA|95aa|down_0|NC_011727.1_4566_4851_-,NA|298aa|down_2|NC_011727.1_9833_10727_+,NA|195aa|down_4|NC_011727.1_12340_12925_+	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|79aa|up_1|NC_011727.1_47_284_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|136aa|up_0|NC_011727.1_302_710_+	NA	NA|95aa|down_0|NC_011727.1_4566_4851_-	NA	NA|1486aa|down_1|NC_011727.1_5341_9799_+	COG3209, RhsA, Rhs family protein [Cell envelope biogenesis, outer membrane]	NA|298aa|down_2|NC_011727.1_9833_10727_+	NA	NA|315aa|down_3|NC_011727.1_11270_12215_+	TIGR02224, Tyrosine_recombinase_XerC, tyrosine recombinase XerC	NA|195aa|down_4|NC_011727.1_12340_12925_+	NA	NA|122aa|down_5|NC_011727.1_13419_13785_+	pfam05713, MobC, Bacterial mobilisation protein (MobC)	NA|919aa|down_6|NC_011727.1_13802_16559_+	pfam13155, Toprim_2, Toprim-like	NA|NA	NA	NA|NA	NA	NA|NA	NA
