assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_006351965.1_ASM635196v1	NZ_CP040932	Oceanicola sp. D3 chromosome, complete genome	1	387749-387977	1	CRISPRCasFinder	no		RT,DEDDh,csa3,cas3,WYL	Orphan	GCAGTGGCGGCGGTTCGACCGGCGGTGGCGGCGGCAGCACGGGCGGCGGAGGCGG	55	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,cas3,WYL	NA|272aa|up_1|NZ_CP040932.1_382534_383350_-,NA|67aa|down_5|NZ_CP040932.1_395327_395528_-	NA|220aa|up_9|NZ_CP040932.1_374653_375313_+	pfam02397, Bac_transf, Bacterial sugar transferase	NA|306aa|up_8|NZ_CP040932.1_375550_376468_+	cd05256, UDP_AE_SDR_e, UDP-N-acetylglucosamine 4-epimerase, extended (e) SDRs	NA|285aa|up_7|NZ_CP040932.1_376490_377345_+	pfam04321, RmlD_sub_bind, RmlD substrate binding domain	NA|190aa|up_6|NZ_CP040932.1_377406_377976_+	cd00438, cupin_RmlC, RmlC carbohydrate epimerase, involved in dTDP-L-rhamnose production	NA|346aa|up_5|NZ_CP040932.1_377972_379010_+	COG1088, RfbB, dTDP-D-glucose 4,6-dehydratase [Cell envelope biogenesis, outer membrane]	NA|291aa|up_4|NZ_CP040932.1_379208_380081_+	TIGR01207, Glucose-1-phosphate_thymidylyltransferase_1, glucose-1-phosphate thymidylyltransferase, short form	NA|390aa|up_3|NZ_CP040932.1_380169_381339_+	cd05255, SQD1_like_SDR_e, UDP_sulfoquinovose_synthase (Arabidopsis thaliana SQD1 and related proteins), extended (e) SDRs	NA|324aa|up_2|NZ_CP040932.1_381517_382489_+	cd04196, GT_2_like_d, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|272aa|up_1|NZ_CP040932.1_382534_383350_-	NA	NA|775aa|up_0|NZ_CP040932.1_383346_385671_-	cd04196, GT_2_like_d, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|240aa|down_0|NZ_CP040932.1_388362_389082_-	COG1682, TagG, ABC-type polysaccharide/polyol phosphate export systems, permease component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|357aa|down_1|NZ_CP040932.1_389475_390546_+	COG3524, KpsE, Capsule polysaccharide export protein [Cell envelope biogenesis, outer membrane]	NA|224aa|down_2|NZ_CP040932.1_390566_391238_+	cd03220, ABC_KpsT_Wzt, ATP-binding cassette component of polysaccharide transport system	NA|888aa|down_3|NZ_CP040932.1_391253_393917_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|369aa|down_4|NZ_CP040932.1_394190_395297_+	COG1596, Wza, Periplasmic protein involved in polysaccharide export, contains    SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis]	NA|67aa|down_5|NZ_CP040932.1_395327_395528_-	NA	NA|283aa|down_6|NZ_CP040932.1_395649_396498_+	COG2971, COG2971, Predicted N-acetylglucosamine kinase [Carbohydrate transport and metabolism]	NA|374aa|down_7|NZ_CP040932.1_396494_397616_+	COG1820, NagA, N-acetylglucosamine-6-phosphate deacetylase [Carbohydrate transport and metabolism]	NA|153aa|down_8|NZ_CP040932.1_397604_398063_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|345aa|down_9|NZ_CP040932.1_398171_399206_+	TIGR02272, gentisate_12-dioxygenase, gentisate 1,2-dioxygenase
GCF_006351965.1_ASM635196v1	NZ_CP040932	Oceanicola sp. D3 chromosome, complete genome	2	2434473-2434559	2	CRISPRCasFinder	no		RT,DEDDh,csa3,cas3,WYL	Orphan	GAAGCCGAAGCCGAAATGGCTGA	23	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,cas3,WYL	NA|104aa|up_2|NZ_CP040932.1_2432973_2433285_-,NA|162aa|down_9|NZ_CP040932.1_2444945_2445431_-	NA|452aa|up_9|NZ_CP040932.1_2426114_2427470_+	PRK05341, PRK05341, homogentisate 1,2-dioxygenase; Provisional	NA|419aa|up_8|NZ_CP040932.1_2427511_2428768_+	PLN02856, PLN02856, fumarylacetoacetase	NA|187aa|up_7|NZ_CP040932.1_2428764_2429325_-	PRK00944, PRK00944, hypothetical protein; Provisional	NA|317aa|up_6|NZ_CP040932.1_2429443_2430394_+	cd16282, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|541aa|up_5|NZ_CP040932.1_2430416_2432039_+	PRK08132, PRK08132, FAD-dependent oxidoreductase; Provisional	NA|64aa|up_4|NZ_CP040932.1_2432043_2432235_+	pfam10932, DUF2783, Protein of unknown function (DUF2783)	NA|243aa|up_3|NZ_CP040932.1_2432241_2432970_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|104aa|up_2|NZ_CP040932.1_2432973_2433285_-	NA	NA|111aa|up_1|NZ_CP040932.1_2433304_2433637_-	pfam05957, DUF883, Bacterial protein of unknown function (DUF883)	NA|57aa|up_0|NZ_CP040932.1_2433814_2433985_+	COG5487, COG5487, Small integral membrane protein [Function unknown]	NA|371aa|down_0|NZ_CP040932.1_2435548_2436661_+	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|563aa|down_1|NZ_CP040932.1_2436672_2438361_-	cd09105, PLDc_vPLD1_2_like_2, Catalytic domain, repeat 2, of vertebrate phospholipases, PLD1 and PLD2, and similar proteins	NA|618aa|down_2|NZ_CP040932.1_2438357_2440211_-	COG3920, COG3920, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|193aa|down_3|NZ_CP040932.1_2440227_2440806_-	PRK12546, PRK12546, RNA polymerase sigma factor; Provisional	NA|55aa|down_4|NZ_CP040932.1_2440802_2440967_-	pfam18557, NepR, Anti-sigma factor NepR	NA|272aa|down_5|NZ_CP040932.1_2441150_2441966_+	PRK09191, PRK09191, two-component response regulator; Provisional	NA|154aa|down_6|NZ_CP040932.1_2442004_2442466_-	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|149aa|down_7|NZ_CP040932.1_2442785_2443232_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|496aa|down_8|NZ_CP040932.1_2443324_2444812_-	cd07786, FGGY_EcGK_like, Escherichia coli glycerol kinase-like proteins; belongs to the FGGY family of carbohydrate kinases	NA|162aa|down_9|NZ_CP040932.1_2444945_2445431_-	NA
GCF_006351965.1_ASM635196v1	NZ_CP040932	Oceanicola sp. D3 chromosome, complete genome	3	2434851-2435068	3	CRISPRCasFinder	no		RT,DEDDh,csa3,cas3,WYL	Orphan	GAAGCCGAAGCCGAAATGGCTGA	23	0	0	NA	NA	NA	3	3	Orphan	RT,DEDDh,csa3,cas3,WYL	NA|104aa|up_2|NZ_CP040932.1_2432973_2433285_-,NA|162aa|down_9|NZ_CP040932.1_2444945_2445431_-	NA|452aa|up_9|NZ_CP040932.1_2426114_2427470_+	PRK05341, PRK05341, homogentisate 1,2-dioxygenase; Provisional	NA|419aa|up_8|NZ_CP040932.1_2427511_2428768_+	PLN02856, PLN02856, fumarylacetoacetase	NA|187aa|up_7|NZ_CP040932.1_2428764_2429325_-	PRK00944, PRK00944, hypothetical protein; Provisional	NA|317aa|up_6|NZ_CP040932.1_2429443_2430394_+	cd16282, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|541aa|up_5|NZ_CP040932.1_2430416_2432039_+	PRK08132, PRK08132, FAD-dependent oxidoreductase; Provisional	NA|64aa|up_4|NZ_CP040932.1_2432043_2432235_+	pfam10932, DUF2783, Protein of unknown function (DUF2783)	NA|243aa|up_3|NZ_CP040932.1_2432241_2432970_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|104aa|up_2|NZ_CP040932.1_2432973_2433285_-	NA	NA|111aa|up_1|NZ_CP040932.1_2433304_2433637_-	pfam05957, DUF883, Bacterial protein of unknown function (DUF883)	NA|57aa|up_0|NZ_CP040932.1_2433814_2433985_+	COG5487, COG5487, Small integral membrane protein [Function unknown]	NA|371aa|down_0|NZ_CP040932.1_2435548_2436661_+	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|563aa|down_1|NZ_CP040932.1_2436672_2438361_-	cd09105, PLDc_vPLD1_2_like_2, Catalytic domain, repeat 2, of vertebrate phospholipases, PLD1 and PLD2, and similar proteins	NA|618aa|down_2|NZ_CP040932.1_2438357_2440211_-	COG3920, COG3920, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|193aa|down_3|NZ_CP040932.1_2440227_2440806_-	PRK12546, PRK12546, RNA polymerase sigma factor; Provisional	NA|55aa|down_4|NZ_CP040932.1_2440802_2440967_-	pfam18557, NepR, Anti-sigma factor NepR	NA|272aa|down_5|NZ_CP040932.1_2441150_2441966_+	PRK09191, PRK09191, two-component response regulator; Provisional	NA|154aa|down_6|NZ_CP040932.1_2442004_2442466_-	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|149aa|down_7|NZ_CP040932.1_2442785_2443232_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|496aa|down_8|NZ_CP040932.1_2443324_2444812_-	cd07786, FGGY_EcGK_like, Escherichia coli glycerol kinase-like proteins; belongs to the FGGY family of carbohydrate kinases	NA|162aa|down_9|NZ_CP040932.1_2444945_2445431_-	NA
