assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000213255.1_ASM21325v1	NC_015520	Mahella australiensis 50-1 BON, complete sequence	1	271580-271668	1	CRISPRCasFinder	no	c2c4_V-U1	DEDDh,c2c4_V-U1,WYL,cas14k,RT,cas4,csa3,cas3,cas2,cas1,cas5,cas7b,cas8b1,cas6,Cas9_archaeal,csf3gr5,csf2gr7,csf1gr8	Type V-U1	GGTTTATATTAACAATGTGGGATG	24	0	0	NA	NA	NA	1	1	TypeV-U1	DEDDh,c2c4_V-U1,WYL,cas14k,RT,cas4,csa3,cas3,cas2,cas1,cas5,cas7b,cas8b1,cas6,Cas9_archaeal,csf3gr5,csf2gr7,csf1gr8	NA|119aa|up_1|NC_015520.1_270619_270976_+,NA|136aa|up_0|NC_015520.1_270988_271396_+,NA	NA|226aa|up_9|NC_015520.1_261638_262316_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|520aa|up_8|NC_015520.1_262416_263976_+	PRK05290, PRK05290, hybrid cluster protein; Provisional	NA|446aa|up_7|NC_015520.1_264678_266016_-	pfam13551, HTH_29, Winged helix-turn helix	NA|71aa|up_6|NC_015520.1_266169_266382_+	PRK00019, rpmE, 50S ribosomal protein L31; Reviewed	NA|309aa|up_5|NC_015520.1_266469_267396_+	pfam07136, DUF1385, Protein of unknown function (DUF1385)	NA|282aa|up_4|NC_015520.1_267386_268232_+	PRK09328, PRK09328, N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional	NA|357aa|up_3|NC_015520.1_268253_269324_+	PRK00591, prfA, peptide chain release factor 1; Validated	NA|397aa|up_2|NC_015520.1_269405_270596_+	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]	NA|119aa|up_1|NC_015520.1_270619_270976_+	NA	NA|136aa|up_0|NC_015520.1_270988_271396_+	NA	NA|244aa|down_0|NC_015520.1_271766_272498_+	PRK04201, PRK04201, zinc transporter ZupT; Provisional	NA|230aa|down_1|NC_015520.1_272500_273190_-	COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and modification]	NA|160aa|down_2|NC_015520.1_273552_274032_+	pfam02502, LacAB_rpiB, Ribose/Galactose Isomerase	NA|211aa|down_3|NC_015520.1_274021_274654_+	PRK00129, upp, uracil phosphoribosyltransferase; Reviewed	NA|428aa|down_4|NC_015520.1_274686_275970_+	cd01116, P_permease, Permease P (pink-eyed dilution)	NA|171aa|down_5|NC_015520.1_275982_276495_+	COG2131, ComEB, Deoxycytidylate deaminase [Nucleotide transport and metabolism]	NA|379aa|down_6|NC_015520.1_276536_277673_+	COG0381, WecB, UDP-N-acetylglucosamine 2-epimerase [Cell envelope biogenesis, outer membrane]	NA|284aa|down_7|NC_015520.1_277793_278645_+	PRK12857, PRK12857, class II fructose-1,6-bisphosphate aldolase	NA|217aa|down_8|NC_015520.1_278675_279326_+	PRK01362, PRK01362, fructose-6-phosphate aldolase	NA|602aa|down_9|NC_015520.1_279342_281148_+	PRK09376, rho, transcription termination factor Rho; Provisional
GCF_000213255.1_ASM21325v1	NC_015520	Mahella australiensis 50-1 BON, complete sequence	2	2336692-2342216	2,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas3,cas5,cas7b,cas8b1,cas6	DEDDh,c2c4_V-U1,WYL,cas14k,RT,cas4,csa3,cas3,cas2,cas1,cas5,cas7b,cas8b1,cas6,Cas9_archaeal,csf3gr5,csf2gr7,csf1gr8	Type I-B	GTTTCAATTCCTCATAGGTAGGCTAAAAAC,GTTTCAATTCCTCATAGGTAGGCTAAAAAC,GTTTCAATTCCTCATAGGTAGGCTAAAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	82,82,62	82	TypeI-B	DEDDh,c2c4_V-U1,WYL,cas14k,RT,cas4,csa3,cas3,cas2,cas1,cas5,cas7b,cas8b1,cas6,Cas9_archaeal,csf3gr5,csf2gr7,csf1gr8	NA|121aa|up_6|NC_015520.1_2330357_2330720_-,NA|126aa|up_4|NC_015520.1_2331125_2331503_-,NA	NA|243aa|up_9|NC_015520.1_2327100_2327829_+	COG4555, NatA, ABC-type Na+ transport system, ATPase component [Energy production and conversion / Inorganic ion transport and metabolism]	NA|389aa|up_8|NC_015520.1_2327825_2328992_+	COG1668, NatB, ABC-type Na+ efflux pump, permease component [Energy production and conversion / Inorganic ion transport and metabolism]	NA|411aa|up_7|NC_015520.1_2328988_2330221_-	TIGR03918, hydrogenase_maturation_GTPase_HydF, [FeFe] hydrogenase H-cluster maturation GTPase HydF	NA|121aa|up_6|NC_015520.1_2330357_2330720_-	NA	NA|124aa|up_5|NC_015520.1_2330735_2331107_-	pfam09862, DUF2089, Protein of unknown function (DUF2089)	NA|126aa|up_4|NC_015520.1_2331125_2331503_-	NA	NA|272aa|up_3|NC_015520.1_2331714_2332530_+	pfam08282, Hydrolase_3, haloacid dehalogenase-like hydrolase	cas2|88aa|up_2|NC_015520.1_2332557_2332821_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|332aa|up_1|NC_015520.1_2332824_2333820_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas4|167aa|up_0|NC_015520.1_2333834_2334335_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|816aa|down_0|NC_015520.1_2342439_2344887_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|256aa|down_1|NC_015520.1_2344907_2345675_-	cd09692, Cas5_I-B, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7b|306aa|down_2|NC_015520.1_2345719_2346637_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8b1|656aa|down_3|NC_015520.1_2346629_2348597_-	pfam09484, Cas_TM1802, CRISPR-associated protein TM1802 (cas_TM1802)	cas6|257aa|down_4|NC_015520.1_2348589_2349360_-	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|618aa|down_5|NC_015520.1_2349544_2351398_-	COG0433, COG0433,  HerA helicase [Replication, recombination, and repair]	NA|333aa|down_6|NC_015520.1_2351413_2352412_-	pfam09376, NurA, NurA domain	NA|93aa|down_7|NC_015520.1_2352709_2352988_+	TIGR02851, stage_V_sporulation_protein_T, stage V sporulation protein T	NA|292aa|down_8|NC_015520.1_2353006_2353882_-	COG0313, COG0313, Predicted methyltransferases [General function prediction only]	NA|253aa|down_9|NC_015520.1_2353850_2354609_-	COG4123, COG4123, Predicted O-methyltransferase [General function prediction only]
GCF_000213255.1_ASM21325v1	NC_015520	Mahella australiensis 50-1 BON, complete sequence	3	2645347-2645438	3	CRISPRCasFinder	no		DEDDh,c2c4_V-U1,WYL,cas14k,RT,cas4,csa3,cas3,cas2,cas1,cas5,cas7b,cas8b1,cas6,Cas9_archaeal,csf3gr5,csf2gr7,csf1gr8	Orphan	CTTGGAAGGGGTTAAGTTATTTTCGTAGTAA	31	1	1	2645378-2645407	NC_015520.1_2642578-2642607	NA	1	1	Orphan	DEDDh,c2c4_V-U1,WYL,cas14k,RT,cas4,csa3,cas3,cas2,cas1,cas5,cas7b,cas8b1,cas6,Cas9_archaeal,csf3gr5,csf2gr7,csf1gr8	NA|90aa|up_5|NC_015520.1_2637647_2637917_+,NA|206aa|down_3|NC_015520.1_2652134_2652752_-,NA|222aa|down_4|NC_015520.1_2652873_2653539_-,NA|245aa|down_5|NC_015520.1_2653544_2654279_-,NA|167aa|down_9|NC_015520.1_2658405_2658906_-	NA|339aa|up_9|NC_015520.1_2631151_2632168_-	cd03267, ABC_NatA_like, ATP-binding cassette domain of an uncharacterized transporter similar in sequence to NatA	NA|318aa|up_8|NC_015520.1_2632217_2633171_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|267aa|up_7|NC_015520.1_2635975_2636776_-	COG3694, COG3694, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|262aa|up_6|NC_015520.1_2636779_2637565_-	COG4587, COG4587, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|90aa|up_5|NC_015520.1_2637647_2637917_+	NA	NA|317aa|up_4|NC_015520.1_2638210_2639161_-	cd00688, ISOPREN_C2_like, This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two broadly specific proteinase inhibitors alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP) and, the C3 C4 and C5 components of vertebrate complement	NA|353aa|up_3|NC_015520.1_2639157_2640216_-	cd01918, HprK_C, HprK/P, the bifunctional histidine-containing protein kinase/phosphatase, controls the phosphorylation state of the phosphocarrier protein HPr and regulates the utilization of carbon sources by gram-positive bacteria	NA|367aa|up_2|NC_015520.1_2640226_2641327_-	PRK06827, PRK06827, phosphoribosylpyrophosphate synthetase; Provisional	NA|287aa|up_1|NC_015520.1_2641323_2642184_-	COG0535, COG0535, Predicted Fe-S oxidoreductases [General function prediction only]	NA|257aa|up_0|NC_015520.1_2642839_2643610_-	PRK09183, PRK09183, transposase/IS protein; Provisional	NA|290aa|down_0|NC_015520.1_2645543_2646413_-	COG0535, COG0535, Predicted Fe-S oxidoreductases [General function prediction only]	NA|359aa|down_1|NC_015520.1_2646469_2647546_-	COG0535, COG0535, Predicted Fe-S oxidoreductases [General function prediction only]	NA|464aa|down_2|NC_015520.1_2650523_2651915_-	pfam14239, RRXRR, RRXRR protein	NA|206aa|down_3|NC_015520.1_2652134_2652752_-	NA	NA|222aa|down_4|NC_015520.1_2652873_2653539_-	NA	NA|245aa|down_5|NC_015520.1_2653544_2654279_-	NA	NA|408aa|down_6|NC_015520.1_2654415_2655639_-	pfam13203, DUF2201_N, Putative metallopeptidase domain	NA|361aa|down_7|NC_015520.1_2655638_2656721_-	cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold	NA|249aa|down_8|NC_015520.1_2657507_2658254_+	pfam00520, Ion_trans, Ion transport protein	NA|167aa|down_9|NC_015520.1_2658405_2658906_-	NA
GCF_000213255.1_ASM21325v1	NC_015520	Mahella australiensis 50-1 BON, complete sequence	4	2671465-2671964	4,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas6,csf3gr5,csf2gr7,csf1gr8	DEDDh,c2c4_V-U1,WYL,cas14k,RT,cas4,csa3,cas3,cas2,cas1,cas5,cas7b,cas8b1,cas6,Cas9_archaeal,csf3gr5,csf2gr7,csf1gr8	Type IV-A	CTTTTAACCCCTCACAGAAGGTATCTGCAC,CTTTTAACCCCTCANAGAAGGTATCTGNAC,CTTTTAACCCCTCACAGAAGGTATCTGCACC	30,30,31	0	0	NA	NA	NA:NA:NA	7,7,3	7	TypeIV-A	DEDDh,c2c4_V-U1,WYL,cas14k,RT,cas4,csa3,cas3,cas2,cas1,cas5,cas7b,cas8b1,cas6,Cas9_archaeal,csf3gr5,csf2gr7,csf1gr8	NA|167aa|up_9|NC_015520.1_2658405_2658906_-,NA|413aa|up_6|NC_015520.1_2663052_2664291_-,NA|101aa|up_4|NC_015520.1_2666583_2666886_+,cas6|230aa|up_1|NC_015520.1_2669860_2670550_-,NA	NA|167aa|up_9|NC_015520.1_2658405_2658906_-	NA	NA|425aa|up_8|NC_015520.1_2660087_2661362_-	COG2333, ComEC, Predicted hydrolase (metallo-beta-lactamase superfamily) [General function prediction only]	NA|465aa|up_7|NC_015520.1_2661670_2663065_-	pfam10087, DUF2325, Uncharacterized protein conserved in bacteria (DUF2325)	NA|413aa|up_6|NC_015520.1_2663052_2664291_-	NA	NA|620aa|up_5|NC_015520.1_2664539_2666399_+	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|101aa|up_4|NC_015520.1_2666583_2666886_+	NA	NA|745aa|up_3|NC_015520.1_2667166_2669401_-	cd17933, DEXSc_RecD-like, DEXS-box helicase domain of RecD and similar proteins	NA|107aa|up_2|NC_015520.1_2669503_2669824_-	pfam14271, DUF4359, Domain of unknown function (DUF4359)	cas6|230aa|up_1|NC_015520.1_2669860_2670550_-	NA	csf3gr5|259aa|up_0|NC_015520.1_2670546_2671323_-	TIGR03116, cas5_csf3, CRISPR type IV/AFERR-associated protein Csf3	csf2gr7|335aa|down_0|NC_015520.1_2672070_2673075_-	pfam01905, DevR, CRISPR-associated negative auto-regulator DevR/Csa2	csf1gr8|239aa|down_1|NC_015520.1_2673071_2673788_-	TIGR03114, cas8u_csf1, CRISPR type AFERR-associated protein Csf1	NA|171aa|down_2|NC_015520.1_2673816_2674329_-	TIGR04342, hypothetical_protein, EXLDI protein	NA|1034aa|down_3|NC_015520.1_2674424_2677526_-	COG0610, COG0610, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|216aa|down_4|NC_015520.1_2677522_2678170_-	cd17268, RMtype1_S_Ara36733I_TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to S	NA|204aa|down_5|NC_015520.1_2678166_2678778_-	cd17274, RMtype1_S_Eco540ANI-TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to S	NA|522aa|down_6|NC_015520.1_2678777_2680343_-	TIGR00497, hsdM, type I restriction system adenine methylase (hsdM)	NA|293aa|down_7|NC_015520.1_2680494_2681373_-	COG0338, Dam, Site-specific DNA methylase [DNA replication, recombination, and repair]	NA|523aa|down_8|NC_015520.1_2681384_2682953_-	pfam10961, SelK_SelG, Selenoprotein SelK_SelG	NA|189aa|down_9|NC_015520.1_2686137_2686704_+	pfam01695, IstB_IS21, IstB-like ATP binding protein
