assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002356035.1_ASM235603v1	NZ_AP017900	Nocardia seriolae strain UTF1	1	2175560-2175633	1	CRISPRCasFinder	no		PrimPol,cas3,WYL,DinG,RT,PD-DExK,DEDDh,cas14j,c2c9_V-U4,csa3,c2c10_CAS-V-U3	Orphan	GCGCCCCAACCACAACCCGGCGC	23	0	0	NA	NA	NA	1	1	Orphan	PrimPol,cas3,WYL,DinG,RT,PD-DExK,DEDDh,cas14j,c2c9_V-U4,csa3,c2c10_CAS-V-U3	NA|179aa|up_9|NZ_AP017900.1_2163910_2164447_-,NA|139aa|up_8|NZ_AP017900.1_2164473_2164890_-,NA|260aa|up_2|NZ_AP017900.1_2171524_2172304_-,NA|71aa|down_1|NZ_AP017900.1_2178291_2178504_-,NA|278aa|down_7|NZ_AP017900.1_2182437_2183271_-,NA|305aa|down_8|NZ_AP017900.1_2183546_2184461_-	NA|179aa|up_9|NZ_AP017900.1_2163910_2164447_-	NA	NA|139aa|up_8|NZ_AP017900.1_2164473_2164890_-	NA	NA|217aa|up_7|NZ_AP017900.1_2165159_2165810_-	cd04887, ACT_MalLac-Enz, ACT_MalLac-Enz CD includes the N-terminal ACT domain of putative NAD-dependent malic enzyme 1, Bacillus subtilis YqkI and related domains	NA|100aa|up_6|NZ_AP017900.1_2166053_2166353_+	PRK00034, gatC, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC	NA|490aa|up_5|NZ_AP017900.1_2166349_2167819_+	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|664aa|up_4|NZ_AP017900.1_2168004_2169996_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|447aa|up_3|NZ_AP017900.1_2169956_2171297_-	COG4425, COG4425, Predicted membrane protein [Function unknown]	NA|260aa|up_2|NZ_AP017900.1_2171524_2172304_-	NA	NA|383aa|up_1|NZ_AP017900.1_2172519_2173668_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|220aa|up_0|NZ_AP017900.1_2173774_2174434_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|378aa|down_0|NZ_AP017900.1_2177087_2178221_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|71aa|down_1|NZ_AP017900.1_2178291_2178504_-	NA	NA|294aa|down_2|NZ_AP017900.1_2178849_2179731_+	pfam13560, HTH_31, Helix-turn-helix domain	NA|140aa|down_3|NZ_AP017900.1_2179727_2180147_+	pfam04149, DUF397, Domain of unknown function (DUF397)	NA|165aa|down_4|NZ_AP017900.1_2180098_2180593_-	TIGR01926, peroxid_rel, uncharacterized peroxidase-related enzyme	NA|194aa|down_5|NZ_AP017900.1_2180680_2181262_+	pfam11706, zf-CGNR, CGNR zinc finger	NA|344aa|down_6|NZ_AP017900.1_2181328_2182360_-	PRK03202, PRK03202, ATP-dependent 6-phosphofructokinase	NA|278aa|down_7|NZ_AP017900.1_2182437_2183271_-	NA	NA|305aa|down_8|NZ_AP017900.1_2183546_2184461_-	NA	NA|596aa|down_9|NZ_AP017900.1_2184489_2186277_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins
GCF_002356035.1_ASM235603v1	NZ_AP017900	Nocardia seriolae strain UTF1	2	2981920-2982001	2	CRISPRCasFinder	no		PrimPol,cas3,WYL,DinG,RT,PD-DExK,DEDDh,cas14j,c2c9_V-U4,csa3,c2c10_CAS-V-U3	Orphan	GGATGGCCAGTGGCGGATCGCCT	23	0	0	NA	NA	NA	1	1	Orphan	PrimPol,cas3,WYL,DinG,RT,PD-DExK,DEDDh,cas14j,c2c9_V-U4,csa3,c2c10_CAS-V-U3	NA,NA|124aa|down_8|NZ_AP017900.1_2994863_2995235_-	NA|195aa|up_9|NZ_AP017900.1_2967430_2968015_-	COG2306, COG2306, Predicted RNA-binding protein, associated with RNAses E/G family [General function prediction only]	NA|722aa|up_8|NZ_AP017900.1_2968316_2970482_+	PRK05298, PRK05298, excinuclease ABC subunit UvrB	NA|161aa|up_7|NZ_AP017900.1_2970621_2971104_-	cd07246, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|288aa|up_6|NZ_AP017900.1_2971160_2972024_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|151aa|up_5|NZ_AP017900.1_2972526_2972979_+	pfam00582, Usp, Universal stress protein family	NA|740aa|up_4|NZ_AP017900.1_2973092_2975312_-	COG3973, COG3973, Superfamily I DNA and RNA helicases [General function prediction only]	NA|444aa|up_3|NZ_AP017900.1_2975584_2976916_-	COG2199, COG2199, c-di-GMP synthetase (diguanylate cyclase, GGDEF domain) [Signal transduction mechanisms]	NA|382aa|up_2|NZ_AP017900.1_2977143_2978289_+	cd00834, KAS_I_II, Beta-ketoacyl-acyl carrier protein (ACP) synthase (KAS), type I and II	NA|578aa|up_1|NZ_AP017900.1_2978297_2980031_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|216aa|up_0|NZ_AP017900.1_2980102_2980750_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|643aa|down_0|NZ_AP017900.1_2985057_2986986_-	PRK10522, PRK10522, multidrug transporter membrane component/ATP-binding component; Provisional	NA|415aa|down_1|NZ_AP017900.1_2987025_2988270_-	COG3424, BcsA, Predicted naringenin-chalcone synthase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|514aa|down_2|NZ_AP017900.1_2988352_2989894_-	pfam00173, Cyt-b5, Cytochrome b5-like Heme/Steroid binding domain	NA|378aa|down_3|NZ_AP017900.1_2990163_2991297_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|278aa|down_4|NZ_AP017900.1_2991477_2992311_-	pfam07161, LppX_LprAFG, LppX_LprAFG lipoprotein	NA|243aa|down_5|NZ_AP017900.1_2992521_2993250_-	pfam12697, Abhydrolase_6, Alpha/beta hydrolase family	NA|279aa|down_6|NZ_AP017900.1_2993366_2994203_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|202aa|down_7|NZ_AP017900.1_2994209_2994815_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|124aa|down_8|NZ_AP017900.1_2994863_2995235_-	NA	NA|233aa|down_9|NZ_AP017900.1_2995580_2996279_+	PRK00028, infC, translation initiation factor IF-3; Reviewed
GCF_002356035.1_ASM235603v1	NZ_AP017900	Nocardia seriolae strain UTF1	3	5261369-5261483	3	CRISPRCasFinder	no		PrimPol,cas3,WYL,DinG,RT,PD-DExK,DEDDh,cas14j,c2c9_V-U4,csa3,c2c10_CAS-V-U3	Orphan	CCGACCGTGCACGCCTACAACCCGCAGG	28	0	0	NA	NA	NA	1	1	Orphan	PrimPol,cas3,WYL,DinG,RT,PD-DExK,DEDDh,cas14j,c2c9_V-U4,csa3,c2c10_CAS-V-U3	NA|173aa|up_7|NZ_AP017900.1_5252332_5252851_+,NA|58aa|up_0|NZ_AP017900.1_5260880_5261054_+,NA|116aa|down_0|NZ_AP017900.1_5262383_5262731_+,NA|266aa|down_7|NZ_AP017900.1_5268732_5269530_-	NA|209aa|up_9|NZ_AP017900.1_5250172_5250799_+	pfam11452, DUF3000, Protein of unknown function (DUF3000)	NA|415aa|up_8|NZ_AP017900.1_5250965_5252210_+	COG0349, Rnd, Ribonuclease D [Translation, ribosomal structure and biogenesis]	NA|173aa|up_7|NZ_AP017900.1_5252332_5252851_+	NA	NA|268aa|up_6|NZ_AP017900.1_5253001_5253805_-	PRK05864, PRK05864, enoyl-CoA hydratase; Provisional	NA|649aa|up_5|NZ_AP017900.1_5253924_5255871_-	PRK05444, PRK05444, 1-deoxy-D-xylulose-5-phosphate synthase; Provisional	NA|612aa|up_4|NZ_AP017900.1_5256014_5257850_+	pfam04960, Glutaminase, Glutaminase	NA|309aa|up_3|NZ_AP017900.1_5258042_5258969_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|140aa|up_2|NZ_AP017900.1_5259062_5259482_+	pfam08044, DUF1707, Domain of unknown function (DUF1707)	NA|378aa|up_1|NZ_AP017900.1_5259552_5260686_+	pfam13546, DDE_5, DDE superfamily endonuclease	NA|58aa|up_0|NZ_AP017900.1_5260880_5261054_+	NA	NA|116aa|down_0|NZ_AP017900.1_5262383_5262731_+	NA	NA|213aa|down_1|NZ_AP017900.1_5262735_5263374_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|191aa|down_2|NZ_AP017900.1_5263430_5264003_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|400aa|down_3|NZ_AP017900.1_5264002_5265202_+	cd02803, OYE_like_FMN_family, Old yellow enzyme (OYE)-like FMN binding domain	NA|248aa|down_4|NZ_AP017900.1_5265198_5265942_+	cd05341, 3beta-17beta-HSD_like_SDR_c, 3beta17beta hydroxysteroid dehydrogenase-like, classical (c) SDRs	NA|404aa|down_5|NZ_AP017900.1_5265946_5267158_-	PRK09265, PRK09265, aminotransferase AlaT; Validated	NA|271aa|down_6|NZ_AP017900.1_5267062_5267875_+	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|266aa|down_7|NZ_AP017900.1_5268732_5269530_-	NA	NA|667aa|down_8|NZ_AP017900.1_5271586_5273587_-	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|222aa|down_9|NZ_AP017900.1_5273980_5274646_+	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]
GCF_002356035.1_ASM235603v1	NZ_AP017900	Nocardia seriolae strain UTF1	4	6137688-6137788	4	CRISPRCasFinder	no		PrimPol,cas3,WYL,DinG,RT,PD-DExK,DEDDh,cas14j,c2c9_V-U4,csa3,c2c10_CAS-V-U3	Orphan	ACGGTCGGGGCGGGCGCGACCGA	23	0	0	NA	NA	NA	1	1	Orphan	PrimPol,cas3,WYL,DinG,RT,PD-DExK,DEDDh,cas14j,c2c9_V-U4,csa3,c2c10_CAS-V-U3	NA|86aa|up_3|NZ_AP017900.1_6132805_6133063_-,NA|105aa|up_2|NZ_AP017900.1_6133112_6133427_-,NA	NA|521aa|up_9|NZ_AP017900.1_6125849_6127412_-	pfam00743, FMO-like, Flavin-binding monooxygenase-like	NA|332aa|up_8|NZ_AP017900.1_6127562_6128558_-	cd03296, ABC_CysA_sulfate_importer, ATP-binding cassette domain of the sulfate transporter	NA|270aa|up_7|NZ_AP017900.1_6128554_6129364_-	COG4208, CysW, ABC-type sulfate transport system, permease component [Inorganic ion transport and metabolism]	NA|292aa|up_6|NZ_AP017900.1_6129363_6130239_-	TIGR02139, permease_CysT, sulfate ABC transporter, permease protein CysT	NA|346aa|up_5|NZ_AP017900.1_6130301_6131339_-	cd01005, PBP2_CysP, Substrate binding domain of an active sulfate transporter, a member of the type 2 periplasmic binding fold superfamily	NA|348aa|up_4|NZ_AP017900.1_6131765_6132809_+	cd07325, M48_Ste24p_like, M48 Ste24 endopeptidase-like, integral membrane metallopeptidase	NA|86aa|up_3|NZ_AP017900.1_6132805_6133063_-	NA	NA|105aa|up_2|NZ_AP017900.1_6133112_6133427_-	NA	NA|645aa|up_1|NZ_AP017900.1_6133629_6135564_+	COG3387, SGA1, Glucoamylase and related glycosyl hydrolases [Carbohydrate transport and metabolism]	NA|594aa|up_0|NZ_AP017900.1_6135638_6137420_-	cd09912, DLP_2, Dynamin-like protein including dynamins, mitofusins, and guanylate-binding proteins	NA|613aa|down_0|NZ_AP017900.1_6140073_6141912_-	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|194aa|down_1|NZ_AP017900.1_6142077_6142659_+	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|260aa|down_2|NZ_AP017900.1_6142655_6143435_-	PRK05420, PRK05420, aquaporin Z; Provisional	NA|281aa|down_3|NZ_AP017900.1_6143476_6144319_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|325aa|down_4|NZ_AP017900.1_6144389_6145364_-	COG2307, COG2307, Uncharacterized protein conserved in bacteria [Function unknown]	NA|555aa|down_5|NZ_AP017900.1_6145428_6147093_-	COG2308, COG2308, Uncharacterized conserved protein [Function unknown]	NA|88aa|down_6|NZ_AP017900.1_6147303_6147567_+	PRK00239, rpsT, 30S ribosomal protein S20; Reviewed	NA|248aa|down_7|NZ_AP017900.1_6147882_6148626_+	COG5479, COG5479, Uncharacterized protein potentially involved in peptidoglycan biosynthesis [Cell envelope biogenesis, outer membrane]	NA|325aa|down_8|NZ_AP017900.1_6148641_6149616_-	PRK07914, PRK07914, hypothetical protein; Reviewed	NA|386aa|down_9|NZ_AP017900.1_6149742_6150900_+	pfam00144, Beta-lactamase, Beta-lactamase
