assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000332115.1_ASM33211v1	NC_020156	Nonlabens dokdonensis DSW-6, complete sequence	1	1027685-1027831	1	CRISPRCasFinder	no		cas3,csa3,DEDDh,PD-DExK,WYL	Orphan	CAACCTTCCCTCAAGGGAAGGAGTTTTGATTGTCATTCTGCTACAGAG	48	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,PD-DExK,WYL	NA|484aa|up_7|NC_020156.1_1016721_1018173_-,NA|416aa|up_6|NC_020156.1_1018452_1019700_+,NA|109aa|down_4|NC_020156.1_1035011_1035338_-	NA|640aa|up_9|NC_020156.1_1012813_1014733_-	pfam14280, DUF4365, Domain of unknown function (DUF4365)	NA|462aa|up_8|NC_020156.1_1015177_1016563_+	cd05680, M20_dipept_like, uncharacterized M20 dipeptidase	NA|484aa|up_7|NC_020156.1_1016721_1018173_-	NA	NA|416aa|up_6|NC_020156.1_1018452_1019700_+	NA	NA|122aa|up_5|NC_020156.1_1019845_1020211_+	pfam03965, Penicillinase_R, Penicillinase repressor	NA|655aa|up_4|NC_020156.1_1020213_1022178_+	cd07341, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|367aa|up_3|NC_020156.1_1022429_1023530_+	pfam14362, DUF4407, Domain of unknown function (DUF4407)	NA|343aa|up_2|NC_020156.1_1023944_1024973_-	COG4850, COG4850, Uncharacterized conserved protein [Function unknown]	NA|278aa|up_1|NC_020156.1_1025288_1026122_-	pfam12850, Metallophos_2, Calcineurin-like phosphoesterase superfamily domain	NA|450aa|up_0|NC_020156.1_1026103_1027453_-	cd10322, SLC5sbd, Solute carrier 5 family, sodium/glucose transporters and related proteins; solute-binding domain	NA|484aa|down_0|NC_020156.1_1028009_1029461_-	pfam12771, SusD-like_2, Starch-binding associating with outer membrane	NA|1070aa|down_1|NC_020156.1_1029635_1032845_-	TIGR04056, OMP_RagA_SusC, TonB-linked outer membrane protein, SusC/RagA family	NA|229aa|down_2|NC_020156.1_1032896_1033583_-	TIGR04282, hypothetical_protein, transferase 1, rSAM/selenodomain-associated	NA|355aa|down_3|NC_020156.1_1033586_1034651_-	TIGR04167, radical_SAM_domain_protein, radical SAM/Cys-rich domain protein	NA|109aa|down_4|NC_020156.1_1035011_1035338_-	NA	NA|112aa|down_5|NC_020156.1_1035528_1035864_-	TIGR04169, perox_w_seleSAM, alkylhydroperoxidase/carboxymuconolactone decarboxylase family protein	NA|476aa|down_6|NC_020156.1_1036011_1037439_-	cd11313, AmyAc_arch_bac_AmyA, Alpha amylase catalytic domain found in archaeal and bacterial Alpha-amylases (also called 1,4-alpha-D-glucan-4-glucanohydrolase)	NA|787aa|down_7|NC_020156.1_1037498_1039859_-	cd11339, AmyAc_bac_CMD_like_2, Alpha amylase catalytic domain found in bacterial cyclomaltodextrinases and related proteins	NA|660aa|down_8|NC_020156.1_1040193_1042173_-	cd11340, AmyAc_bac_CMD_like_3, Alpha amylase catalytic domain found in bacterial cyclomaltodextrinases and related proteins	NA|727aa|down_9|NC_020156.1_1042215_1044396_-	pfam10566, Glyco_hydro_97, Glycoside hydrolase 97
GCF_000332115.1_ASM33211v1	NC_020156	Nonlabens dokdonensis DSW-6, complete sequence	2	1101825-1102018	1	PILER-CR	no		cas3,csa3,DEDDh,PD-DExK,WYL	Orphan	TCATGTCAGTAACGTTTGAAACGTCCCAATTATTTAAAGGCTGAT	45	0	0	NA	NA	NA	2	2	Orphan	cas3,csa3,DEDDh,PD-DExK,WYL	NA|731aa|up_9|NC_020156.1_1086087_1088280_-,NA|56aa|up_6|NC_020156.1_1089766_1089934_-,NA|133aa|down_6|NC_020156.1_1112587_1112986_+,NA|220aa|down_8|NC_020156.1_1113975_1114635_+,NA|185aa|down_9|NC_020156.1_1114625_1115180_+	NA|731aa|up_9|NC_020156.1_1086087_1088280_-	NA	NA|185aa|up_8|NC_020156.1_1088408_1088963_-	PRK00083, frr, ribosome recycling factor; Reviewed	NA|236aa|up_7|NC_020156.1_1089005_1089713_-	cd04254, AAK_UMPK-PyrH-Ec, UMP kinase (UMPK)-Ec, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway)	NA|56aa|up_6|NC_020156.1_1089766_1089934_-	NA	NA|188aa|up_5|NC_020156.1_1090078_1090642_+	pfam11307, DUF3109, Protein of unknown function (DUF3109)	NA|331aa|up_4|NC_020156.1_1091004_1091997_+	PLN02492, PLN02492, ribonucleoside-diphosphate reductase	NA|869aa|up_3|NC_020156.1_1092146_1094753_+	PLN02437, PLN02437, ribonucleoside--diphosphate reductase large subunit	NA|149aa|up_2|NC_020156.1_1095043_1095490_+	pfam09537, DUF2383, Domain of unknown function (DUF2383)	NA|813aa|up_1|NC_020156.1_1095561_1098000_-	pfam03030, H_PPase, Inorganic H+ pyrophosphatase	NA|280aa|up_0|NC_020156.1_1098223_1099063_+	cd15242, 7tm_Proteorhodopsin, green- and blue-light absorbing proteorhodopsins, member of the seven-transmembrane GPCR superfamily	NA|849aa|down_0|NC_020156.1_1104262_1106809_-	pfam13715, CarbopepD_reg_2, CarboxypepD_reg-like domain	NA|327aa|down_1|NC_020156.1_1106988_1107969_-	COG0022, AcoB, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [Energy production and conversion]	NA|249aa|down_2|NC_020156.1_1108253_1109000_+	cd01714, ETF_beta, The electron transfer flavoprotein (ETF) serves as a specific electron acceptor for various mitochondrial dehydrogenases	NA|323aa|down_3|NC_020156.1_1109097_1110066_+	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|204aa|down_4|NC_020156.1_1110166_1110778_+	pfam02577, DNase-RNase, Bifunctional nuclease	NA|572aa|down_5|NC_020156.1_1110792_1112508_+	COG1972, NupC, Nucleoside permease [Nucleotide transport and metabolism]	NA|133aa|down_6|NC_020156.1_1112587_1112986_+	NA	NA|275aa|down_7|NC_020156.1_1113142_1113967_+	PRK01827, thyA, thymidylate synthase; Reviewed	NA|220aa|down_8|NC_020156.1_1113975_1114635_+	NA	NA|185aa|down_9|NC_020156.1_1114625_1115180_+	NA
GCF_000332115.1_ASM33211v1	NC_020156	Nonlabens dokdonensis DSW-6, complete sequence	3	2316163-2316242	2	CRISPRCasFinder	no	cas3	cas3,csa3,DEDDh,PD-DExK,WYL	Unclear	TTCCAGCCCCAACCGCCATTCCATGC	26	0	0	NA	NA	NA	1	1	Unclear	cas3,csa3,DEDDh,PD-DExK,WYL	NA|79aa|up_9|NC_020156.1_2302672_2302909_+,NA|117aa|up_4|NC_020156.1_2306151_2306502_-,NA|259aa|down_2|NC_020156.1_2319089_2319866_-,NA|349aa|down_3|NC_020156.1_2319991_2321038_-,NA|258aa|down_4|NC_020156.1_2321115_2321889_-,NA|142aa|down_7|NC_020156.1_2323695_2324121_-,NA|55aa|down_9|NC_020156.1_2326218_2326383_-	NA|79aa|up_9|NC_020156.1_2302672_2302909_+	NA	NA|135aa|up_8|NC_020156.1_2302985_2303390_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|121aa|up_7|NC_020156.1_2303696_2304059_+	pfam13380, CoA_binding_2, CoA binding domain	NA|492aa|up_6|NC_020156.1_2304033_2305509_-	cd10326, SLC5sbd_NIS-like, Na(+)/iodide (NIS) and Na(+)/multivitamin (SMVT) cotransporters, and related proteins; solute binding domain	NA|74aa|up_5|NC_020156.1_2305900_2306122_-	pfam01809, Haemolytic, Haemolytic domain	NA|117aa|up_4|NC_020156.1_2306151_2306502_-	NA	NA|496aa|up_3|NC_020156.1_2306919_2308407_-	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|224aa|up_2|NC_020156.1_2308555_2309227_-	pfam01227, GTP_cyclohydroI, GTP cyclohydrolase I	NA|1515aa|up_1|NC_020156.1_2309411_2313956_+	TIGR04131, conserved_hypothetical_protein, gliding motility-associated C-terminal domain	NA|503aa|up_0|NC_020156.1_2314171_2315680_-	pfam03349, Toluene_X, Outer membrane protein transport protein (OMPP1/FadL/TodX)	NA|493aa|down_0|NC_020156.1_2316926_2318405_+	PRK08661, PRK08661, prolyl-tRNA synthetase; Provisional	NA|85aa|down_1|NC_020156.1_2318599_2318854_+	PRK00239, rpsT, 30S ribosomal protein S20; Reviewed	NA|259aa|down_2|NC_020156.1_2319089_2319866_-	NA	NA|349aa|down_3|NC_020156.1_2319991_2321038_-	NA	NA|258aa|down_4|NC_020156.1_2321115_2321889_-	NA	NA|190aa|down_5|NC_020156.1_2321878_2322448_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|379aa|down_6|NC_020156.1_2322517_2323654_-	pfam07745, Glyco_hydro_53, Glycosyl hydrolase family 53	NA|142aa|down_7|NC_020156.1_2323695_2324121_-	NA	NA|595aa|down_8|NC_020156.1_2324264_2326049_+	COG0018, ArgS, Arginyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]	NA|55aa|down_9|NC_020156.1_2326218_2326383_-	NA
GCF_000332115.1_ASM33211v1	NC_020156	Nonlabens dokdonensis DSW-6, complete sequence	4	2636822-2636905	3	CRISPRCasFinder	no		cas3,csa3,DEDDh,PD-DExK,WYL	Orphan	TTGGGGAAATGTAAGACCAGTACT	24	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,PD-DExK,WYL	NA|267aa|up_7|NC_020156.1_2630212_2631013_-,NA|204aa|up_6|NC_020156.1_2631186_2631798_+,NA|209aa|up_5|NC_020156.1_2631797_2632424_+,NA|84aa|up_1|NC_020156.1_2634671_2634923_+,NA|55aa|down_0|NC_020156.1_2637007_2637172_-,NA|200aa|down_6|NC_020156.1_2641857_2642457_+,NA|233aa|down_9|NC_020156.1_2643821_2644520_+	NA|543aa|up_9|NC_020156.1_2626779_2628408_+	PLN02820, PLN02820, 3-methylcrotonyl-CoA carboxylase, beta chain	NA|489aa|up_8|NC_020156.1_2628652_2630119_-	TIGR04183, hypothetical_protein, Por secretion system C-terminal sorting domain	NA|267aa|up_7|NC_020156.1_2630212_2631013_-	NA	NA|204aa|up_6|NC_020156.1_2631186_2631798_+	NA	NA|209aa|up_5|NC_020156.1_2631797_2632424_+	NA	NA|253aa|up_4|NC_020156.1_2632434_2633193_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|222aa|up_3|NC_020156.1_2633209_2633875_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|267aa|up_2|NC_020156.1_2633878_2634679_+	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|84aa|up_1|NC_020156.1_2634671_2634923_+	NA	NA|558aa|up_0|NC_020156.1_2634982_2636656_+	pfam11832, DUF3352, Protein of unknown function (DUF3352)	NA|55aa|down_0|NC_020156.1_2637007_2637172_-	NA	NA|234aa|down_1|NC_020156.1_2637159_2637861_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|259aa|down_2|NC_020156.1_2638114_2638891_-	pfam13578, Methyltransf_24, Methyltransferase domain	NA|252aa|down_3|NC_020156.1_2638927_2639683_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|258aa|down_4|NC_020156.1_2640252_2641026_+	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|260aa|down_5|NC_020156.1_2641027_2641807_+	TIGR01314, gntK_FGGY, gluconate kinase, FGGY type	NA|200aa|down_6|NC_020156.1_2641857_2642457_+	NA	NA|258aa|down_7|NC_020156.1_2642480_2643254_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|57aa|down_8|NC_020156.1_2643364_2643535_-	pfam01679, Pmp3, Proteolipid membrane potential modulator	NA|233aa|down_9|NC_020156.1_2643821_2644520_+	NA
GCF_000332115.1_ASM33211v1	NC_020156	Nonlabens dokdonensis DSW-6, complete sequence	5	2757867-2757976	4	CRISPRCasFinder	no		cas3,csa3,DEDDh,PD-DExK,WYL	Orphan	AATATCAGTTCGAGCGCAGTCGAGA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,PD-DExK,WYL	NA|234aa|up_3|NC_020156.1_2752524_2753226_+,NA|146aa|up_1|NC_020156.1_2756845_2757283_+,NA|51aa|down_2|NC_020156.1_2759101_2759254_-,NA|141aa|down_3|NC_020156.1_2759467_2759890_+,NA|119aa|down_4|NC_020156.1_2759943_2760300_+,NA|425aa|down_6|NC_020156.1_2763842_2765117_-,NA|280aa|down_9|NC_020156.1_2769894_2770734_-	NA|799aa|up_9|NC_020156.1_2741972_2744369_-	TIGR03434, ADOP, Acidobacterial duplicated orphan permease	NA|234aa|up_8|NC_020156.1_2744568_2745270_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|418aa|up_7|NC_020156.1_2745475_2746729_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|448aa|up_6|NC_020156.1_2746937_2748281_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|448aa|up_5|NC_020156.1_2748270_2749614_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|530aa|up_4|NC_020156.1_2750511_2752101_-	PRK00029, PRK00029, YdiU family protein	NA|234aa|up_3|NC_020156.1_2752524_2753226_+	NA	NA|1074aa|up_2|NC_020156.1_2753387_2756609_+	NF033452, BREX_1_MTaseX, BREX-1 system adenine-specific DNA-methyltransferase PglX	NA|146aa|up_1|NC_020156.1_2756845_2757283_+	NA	NA|118aa|up_0|NC_020156.1_2757502_2757856_+	cd10456, GIY-YIG_UPF0213, The GIY-YIG domain of uncharacterized protein family UPF0213 related to structure-specific endonuclease SLX1	NA|184aa|down_0|NC_020156.1_2758094_2758646_+	pfam04248, NTP_transf_9, Domain of unknown function (DUF427)	NA|131aa|down_1|NC_020156.1_2758712_2759105_+	pfam17775, UPF0225, UPF0225 domain	NA|51aa|down_2|NC_020156.1_2759101_2759254_-	NA	NA|141aa|down_3|NC_020156.1_2759467_2759890_+	NA	NA|119aa|down_4|NC_020156.1_2759943_2760300_+	NA	NA|1082aa|down_5|NC_020156.1_2760424_2763670_+	cd07562, Peptidase_S41_TRI, Tricorn protease; serine protease family S41	NA|425aa|down_6|NC_020156.1_2763842_2765117_-	NA	NA|701aa|down_7|NC_020156.1_2765306_2767409_+	PLN03185, PLN03185, phosphatidylinositol phosphate kinase; Provisional	NA|739aa|down_8|NC_020156.1_2767647_2769864_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|280aa|down_9|NC_020156.1_2769894_2770734_-	NA
GCF_000332115.1_ASM33211v1	NC_020156	Nonlabens dokdonensis DSW-6, complete sequence	6	3501124-3501237	5	CRISPRCasFinder	no		cas3,csa3,DEDDh,PD-DExK,WYL	Orphan	ATTTATTACGCTTTTACGTAATAATATA	28	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,PD-DExK,WYL	NA|231aa|up_9|NC_020156.1_3488325_3489018_+,NA|175aa|up_7|NC_020156.1_3489587_3490112_+,NA|207aa|down_0|NC_020156.1_3501248_3501869_+,NA|123aa|down_2|NC_020156.1_3502556_3502925_+,NA|217aa|down_5|NC_020156.1_3505006_3505657_+,NA|224aa|down_6|NC_020156.1_3505674_3506346_+,NA|282aa|down_7|NC_020156.1_3506437_3507283_-,NA|136aa|down_9|NC_020156.1_3508106_3508514_-	NA|231aa|up_9|NC_020156.1_3488325_3489018_+	NA	NA|84aa|up_8|NC_020156.1_3489187_3489439_-	PRK01678, rpmE2, type B 50S ribosomal protein L31	NA|175aa|up_7|NC_020156.1_3489587_3490112_+	NA	NA|323aa|up_6|NC_020156.1_3490167_3491136_+	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|571aa|up_5|NC_020156.1_3491302_3493015_+	cd05799, PGM2, This CD includes PGM2 (phosphoglucomutase 2) and PGM2L1 (phosphoglucomutase 2-like 1)	NA|607aa|up_4|NC_020156.1_3493019_3494840_+	TIGR02203, Lipid_A_export_ATP-binding/permease_protein_msbA	NA|171aa|up_3|NC_020156.1_3494862_3495375_+	pfam13098, Thioredoxin_2, Thioredoxin-like domain	NA|694aa|up_2|NC_020156.1_3495395_3497477_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|558aa|up_1|NC_020156.1_3497653_3499327_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|486aa|up_0|NC_020156.1_3499412_3500870_+	pfam02646, RmuC, RmuC family	NA|207aa|down_0|NC_020156.1_3501248_3501869_+	NA	NA|236aa|down_1|NC_020156.1_3501852_3502560_+	pfam11750, DUF3307, Protein of unknown function (DUF3307)	NA|123aa|down_2|NC_020156.1_3502556_3502925_+	NA	NA|182aa|down_3|NC_020156.1_3503028_3503574_+	COG1607, COG1607, Acyl-CoA hydrolase [Lipid metabolism]	NA|445aa|down_4|NC_020156.1_3503570_3504905_-	PRK09084, PRK09084, aspartate kinase III; Validated	NA|217aa|down_5|NC_020156.1_3505006_3505657_+	NA	NA|224aa|down_6|NC_020156.1_3505674_3506346_+	NA	NA|282aa|down_7|NC_020156.1_3506437_3507283_-	NA	NA|256aa|down_8|NC_020156.1_3507332_3508100_-	cd08544, Reeler, Reeler, the N-terminal domain of reelin, F-spondin, and a variety of other proteins	NA|136aa|down_9|NC_020156.1_3508106_3508514_-	NA
