assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000757845.1_ASM75784v1	NZ_CP007753	Prochlorococcus sp. MIT 0604, complete genome	1	904678-904794	1	CRISPRCasFinder	no		cas4,DEDDh,cas3,csa3,DinG	Orphan	TCAAAATAATTCTGCTCCTAAAATTGA	27	0	0	NA	NA	NA	1	1	Orphan	cas4,DEDDh,cas3,csa3,DinG	NA|26aa|up_5|NZ_CP007753.1_900551_900629_-,NA|147aa|up_0|NZ_CP007753.1_904064_904505_+,NA|123aa|down_6|NZ_CP007753.1_914789_915158_-	NA|585aa|up_9|NZ_CP007753.1_897398_899153_-	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|58aa|up_8|NZ_CP007753.1_899284_899458_+	CHL00152, rpl32, ribosomal protein L32; Validated	NA|109aa|up_7|NZ_CP007753.1_899461_899788_-	pfam04483, DUF565, Protein of unknown function (DUF565)	NA|254aa|up_6|NZ_CP007753.1_899774_900536_-	cd07528, HAD_CbbY-like, subfamily of beta-phosphoglucomutase-like family, similar to Rhodobacter sphaeroides xylulose-1,5-bisphosphate phosphatase CbbY	NA|26aa|up_5|NZ_CP007753.1_900551_900629_-	NA	NA|116aa|up_4|NZ_CP007753.1_900711_901059_+	COG0727, COG0727, Predicted Fe-S-cluster oxidoreductase [General function prediction only]	NA|109aa|up_3|NZ_CP007753.1_901055_901382_+	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|104aa|up_2|NZ_CP007753.1_901382_901694_+	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|742aa|up_1|NZ_CP007753.1_901766_903992_+	cd04471, S1_RNase_R, S1_RNase_R: RNase R C-terminal S1 domain	NA|147aa|up_0|NZ_CP007753.1_904064_904505_+	NA	NA|391aa|down_0|NZ_CP007753.1_905292_906465_+	PRK13654, PRK13654, magnesium-protoporphyrin IX monomethyl ester cyclase; Provisional	NA|475aa|down_1|NZ_CP007753.1_906496_907921_+	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|451aa|down_2|NZ_CP007753.1_907923_909276_+	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|329aa|down_3|NZ_CP007753.1_909272_910259_+	PRK00005, fmt, methionyl-tRNA formyltransferase; Reviewed	NA|237aa|down_4|NZ_CP007753.1_910255_910966_-	TIGR03716, R_switched_YkoY, integral membrane protein, YkoY family	NA|1171aa|down_5|NZ_CP007753.1_911040_914553_+	COG1197, Mfd, Transcription-repair coupling factor (superfamily II helicase) [DNA replication, recombination, and repair / Transcription]	NA|123aa|down_6|NZ_CP007753.1_914789_915158_-	NA	NA|195aa|down_7|NZ_CP007753.1_915278_915863_-	pfam14233, DUF4335, Domain of unknown function (DUF4335)	NA|161aa|down_8|NZ_CP007753.1_915867_916350_-	pfam11237, DUF3038, Protein of unknown function (DUF3038)	NA|66aa|down_9|NZ_CP007753.1_916431_916629_-	pfam11165, DUF2949, Protein of unknown function (DUF2949)
GCF_000757845.1_ASM75784v1	NZ_CP007753	Prochlorococcus sp. MIT 0604, complete genome	2	975383-975509	2	CRISPRCasFinder	no		cas4,DEDDh,cas3,csa3,DinG	Orphan	AGGTCATGATGATCATTCAGACCATGGAGGACATGACGATCAT	43	0	0	NA	NA	NA	1	1	Orphan	cas4,DEDDh,cas3,csa3,DinG	NA|125aa|up_9|NZ_CP007753.1_968667_969042_-,NA|69aa|up_8|NZ_CP007753.1_969222_969429_+,NA|81aa|up_7|NZ_CP007753.1_969710_969953_+,NA|95aa|up_5|NZ_CP007753.1_970717_971002_-,NA|83aa|up_3|NZ_CP007753.1_972381_972630_+,NA|78aa|down_3|NZ_CP007753.1_980081_980315_-,NA|79aa|down_4|NZ_CP007753.1_980346_980583_-,NA|158aa|down_8|NZ_CP007753.1_984138_984612_+	NA|125aa|up_9|NZ_CP007753.1_968667_969042_-	NA	NA|69aa|up_8|NZ_CP007753.1_969222_969429_+	NA	NA|81aa|up_7|NZ_CP007753.1_969710_969953_+	NA	NA|136aa|up_6|NZ_CP007753.1_970097_970505_+	pfam02566, OsmC, OsmC-like protein	NA|95aa|up_5|NZ_CP007753.1_970717_971002_-	NA	NA|128aa|up_4|NZ_CP007753.1_971141_971525_-	pfam07386, DUF1499, Protein of unknown function (DUF1499)	NA|83aa|up_3|NZ_CP007753.1_972381_972630_+	NA	NA|262aa|up_2|NZ_CP007753.1_972859_973645_-	COG1108, ZnuB, ABC-type Mn2+/Zn2+ transport systems, permease components [Inorganic ion transport and metabolism]	NA|145aa|up_1|NZ_CP007753.1_973662_974097_+	cd07153, Fur_like, Ferric uptake regulator(Fur) and related metalloregulatory proteins; typically iron-dependent, DNA-binding repressors and activators	NA|257aa|up_0|NZ_CP007753.1_974110_974881_-	cd03235, ABC_Metallic_Cations, ATP-binding cassette domain of the metal-type transporters	NA|450aa|down_0|NZ_CP007753.1_976636_977986_+	cd03112, CobW-like, cobalamin synthesis protein CobW	NA|358aa|down_1|NZ_CP007753.1_977973_979047_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|92aa|down_2|NZ_CP007753.1_979781_980057_+	smart00531, TFIIE, Transcription initiation factor IIE	NA|78aa|down_3|NZ_CP007753.1_980081_980315_-	NA	NA|79aa|down_4|NZ_CP007753.1_980346_980583_-	NA	NA|481aa|down_5|NZ_CP007753.1_980710_982153_+	pfam14403, CP_ATPgrasp_2, Circularly permuted ATP-grasp type 2	NA|314aa|down_6|NZ_CP007753.1_982154_983096_+	pfam04168, Alpha-E, A predicted alpha-helical domain with a conserved ER motif	NA|286aa|down_7|NZ_CP007753.1_983107_983965_+	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|158aa|down_8|NZ_CP007753.1_984138_984612_+	NA	NA|173aa|down_9|NZ_CP007753.1_984695_985214_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]
