assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001955695.1_ASM195569v1	NZ_CP019236	Rhodoferax koreense strain DCY-110 chromosome, complete genome	1	3558929-3559027	1	CRISPRCasFinder	no		DEDDh,WYL,csa3,RT,DinG,cas3	Orphan	TGCTTCGACAGGCTCAGCACGAACGG	26	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,csa3,RT,DinG,cas3	NA|56aa|up_9|NZ_CP019236.1_3550792_3550960_+,NA|87aa|up_4|NZ_CP019236.1_3554355_3554616_+,NA|213aa|down_6|NZ_CP019236.1_3566566_3567205_-,NA|188aa|down_7|NZ_CP019236.1_3567204_3567768_-,NA|118aa|down_8|NZ_CP019236.1_3568216_3568570_+	NA|56aa|up_9|NZ_CP019236.1_3550792_3550960_+	NA	NA|142aa|up_8|NZ_CP019236.1_3550956_3551382_+	pfam06252, DUF1018, Protein of unknown function (DUF1018)	NA|106aa|up_7|NZ_CP019236.1_3551462_3551780_+	pfam08765, Mor, Mor transcription activator family	NA|470aa|up_6|NZ_CP019236.1_3552133_3553543_-	COG0154, GatA, Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases [Translation, ribosomal structure and biogenesis]	NA|192aa|up_5|NZ_CP019236.1_3553646_3554222_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|87aa|up_4|NZ_CP019236.1_3554355_3554616_+	NA	NA|261aa|up_3|NZ_CP019236.1_3554619_3555402_-	pfam07589, VPEP, PEP-CTERM motif	NA|440aa|up_2|NZ_CP019236.1_3555590_3556910_-	COG3264, COG3264, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|318aa|up_1|NZ_CP019236.1_3556957_3557911_-	cd11599, HDAC_classII_2, Histone deacetylases and histone-like deacetylases, classII	NA|308aa|up_0|NZ_CP019236.1_3557952_3558876_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|327aa|down_0|NZ_CP019236.1_3559168_3560149_+	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|687aa|down_1|NZ_CP019236.1_3560142_3562203_+	pfam11992, DUF3488, Domain of unknown function (DUF3488)	NA|359aa|down_2|NZ_CP019236.1_3562199_3563276_+	pfam13406, SLT_2, Transglycosylase SLT domain	NA|569aa|down_3|NZ_CP019236.1_3563276_3564983_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|218aa|down_4|NZ_CP019236.1_3565113_3565767_-	pfam16036, Chalcone_3, Chalcone isomerase-like	NA|238aa|down_5|NZ_CP019236.1_3565802_3566516_-	pfam02107, FlgH, Flagellar L-ring protein	NA|213aa|down_6|NZ_CP019236.1_3566566_3567205_-	NA	NA|188aa|down_7|NZ_CP019236.1_3567204_3567768_-	NA	NA|118aa|down_8|NZ_CP019236.1_3568216_3568570_+	NA	NA|334aa|down_9|NZ_CP019236.1_3568698_3569700_+	TIGR03558, oxido_grp_1, luciferase family oxidoreductase, group 1
GCF_001955695.1_ASM195569v1	NZ_CP019236	Rhodoferax koreense strain DCY-110 chromosome, complete genome	2	3881846-3881966	2	CRISPRCasFinder	no		DEDDh,WYL,csa3,RT,DinG,cas3	Orphan	ACCAGACCCTTCGACAGGCTCAGGGCGAACGG	32	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,csa3,RT,DinG,cas3	NA|110aa|up_8|NZ_CP019236.1_3872602_3872932_+,NA|202aa|up_2|NZ_CP019236.1_3878358_3878964_+,NA	NA|232aa|up_9|NZ_CP019236.1_3871704_3872400_-	COG1853, COG1853, Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family [General function prediction only]	NA|110aa|up_8|NZ_CP019236.1_3872602_3872932_+	NA	NA|509aa|up_7|NZ_CP019236.1_3873051_3874578_+	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|174aa|up_6|NZ_CP019236.1_3874745_3875267_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|142aa|up_5|NZ_CP019236.1_3875931_3876357_+	pfam07883, Cupin_2, Cupin domain	NA|254aa|up_4|NZ_CP019236.1_3876390_3877152_+	COG3473, COG3473, Maleate cis-trans isomerase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|341aa|up_3|NZ_CP019236.1_3877152_3878175_-	cd13579, PBP2_Bug_NagM, Uncharacterized NagM-like protein of Bug (Bordetella uptake gene) protein family; contains the type 2 periplasmic binding fold	NA|202aa|up_2|NZ_CP019236.1_3878358_3878964_+	NA	NA|285aa|up_1|NZ_CP019236.1_3879313_3880168_+	COG1802, GntR, Transcriptional regulators [Transcription]	NA|525aa|up_0|NZ_CP019236.1_3880205_3881780_+	cd08498, PBP2_NikA_DppA_OppA_like_2, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|322aa|down_0|NZ_CP019236.1_3882030_3882996_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|317aa|down_1|NZ_CP019236.1_3882992_3883943_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|334aa|down_2|NZ_CP019236.1_3884076_3885078_+	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|336aa|down_3|NZ_CP019236.1_3885074_3886082_+	PRK11308, dppF, dipeptide transporter ATP-binding subunit; Provisional	NA|483aa|down_4|NZ_CP019236.1_3886092_3887541_+	cd01298, ATZ_TRZ_like, TRZ/ATZ family contains enzymes from the atrazine degradation pathway and related hydrolases	NA|212aa|down_5|NZ_CP019236.1_3887551_3888187_+	COG1853, COG1853, Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family [General function prediction only]	NA|398aa|down_6|NZ_CP019236.1_3888190_3889384_+	PRK05985, PRK05985, cytosine deaminase; Provisional	NA|490aa|down_7|NZ_CP019236.1_3889637_3891107_+	PRK06151, PRK06151, N-ethylammeline chlorohydrolase; Provisional	NA|352aa|down_8|NZ_CP019236.1_3891188_3892244_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|268aa|down_9|NZ_CP019236.1_3892256_3893060_-	cd01069, PBP2_PheC, Cyclohexadienyl dehydratase, a member of the type 2 periplasmic binding fold protein superfamily
GCF_001955695.1_ASM195569v1	NZ_CP019236	Rhodoferax koreense strain DCY-110 chromosome, complete genome	3	4152375-4152478	3	CRISPRCasFinder	no		DEDDh,WYL,csa3,RT,DinG,cas3	Orphan	GGCCCTTCGACAAGCTCAGGGCGAACGG	28	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,csa3,RT,DinG,cas3	NA,NA	NA|186aa|up_9|NZ_CP019236.1_4139893_4140451_-	pfam14279, HNH_5, HNH endonuclease	NA|337aa|up_8|NZ_CP019236.1_4140614_4141625_+	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|981aa|up_7|NZ_CP019236.1_4141732_4144675_+	PRK05743, ileS, isoleucyl-tRNA synthetase; Reviewed	NA|170aa|up_6|NZ_CP019236.1_4144674_4145184_+	PRK00376, lspA, lipoprotein signal peptidase	NA|553aa|up_5|NZ_CP019236.1_4145282_4146941_+	COG1283, NptA, Na+/phosphate symporter [Inorganic ion transport and metabolism]	NA|335aa|up_4|NZ_CP019236.1_4147257_4148262_-	pfam13593, SBF_like, SBF-like CPA transporter family (DUF4137)	NA|303aa|up_3|NZ_CP019236.1_4148354_4149263_+	cd08440, PBP2_LTTR_like_4, TThe C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|268aa|up_2|NZ_CP019236.1_4149491_4150295_-	cd03676, Nudix_hydrolase_3, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|330aa|up_1|NZ_CP019236.1_4150291_4151281_-	cd05286, QOR2, Quinone oxidoreductase (QOR)	NA|301aa|up_0|NZ_CP019236.1_4151459_4152362_+	pfam00892, EamA, EamA-like transporter family	NA|289aa|down_0|NZ_CP019236.1_4152486_4153353_-	COG1177, PotC, ABC-type spermidine/putrescine transport system, permease component II [Amino acid transport and metabolism]	NA|296aa|down_1|NZ_CP019236.1_4153363_4154251_-	COG1176, PotB, ABC-type spermidine/putrescine transport system, permease component I [Amino acid transport and metabolism]	NA|367aa|down_2|NZ_CP019236.1_4154252_4155353_-	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|356aa|down_3|NZ_CP019236.1_4155359_4156427_-	cd13587, PBP2_polyamine_2, The periplasmic-binding component of an uncharacterized ABC transporter involved in uptake of polyamines; contains the type 2 periplasmic binding fold	NA|167aa|down_4|NZ_CP019236.1_4156544_4157045_-	pfam04828, GFA, Glutathione-dependent formaldehyde-activating enzyme	NA|158aa|down_5|NZ_CP019236.1_4157149_4157623_-	pfam04972, BON, BON domain	NA|422aa|down_6|NZ_CP019236.1_4157745_4159011_+	cd03586, PolY_Pol_IV_kappa, DNA Polymerase IV/Kappa	NA|400aa|down_7|NZ_CP019236.1_4159118_4160318_+	COG3569, COG3569, Topoisomerase IB [DNA replication, recombination, and repair]	NA|356aa|down_8|NZ_CP019236.1_4160324_4161392_-	cd13589, PBP2_polyamine_RpCGA009, The periplasmic-binding component of an uncharacterized ABC transport system from Rhodopseudomonas palustris CGA009 and related proteins; contains the type 2 periplasmic-binding fold	NA|380aa|down_9|NZ_CP019236.1_4161444_4162584_-	COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism]
GCF_001955695.1_ASM195569v1	NZ_CP019236	Rhodoferax koreense strain DCY-110 chromosome, complete genome	4	4943776-4943884	4	CRISPRCasFinder	no		DEDDh,WYL,csa3,RT,DinG,cas3	Orphan	CCGTTCGGGCTGAGCCTGTCGAAGC	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,csa3,RT,DinG,cas3	NA|143aa|up_1|NZ_CP019236.1_4941795_4942224_+,NA|161aa|down_4|NZ_CP019236.1_4950749_4951232_-,NA|77aa|down_9|NZ_CP019236.1_4959266_4959497_+	NA|796aa|up_9|NZ_CP019236.1_4931148_4933536_+	PRK05261, PRK05261, phosphoketolase	NA|408aa|up_8|NZ_CP019236.1_4933552_4934776_+	COG0282, ackA, Acetate kinase [Energy production and conversion]	NA|253aa|up_7|NZ_CP019236.1_4935007_4935766_+	pfam11828, DUF3348, Protein of unknown function (DUF3348)	NA|724aa|up_6|NZ_CP019236.1_4935762_4937934_+	pfam05650, DUF802, Domain of unknown function (DUF802)	NA|221aa|up_5|NZ_CP019236.1_4937930_4938593_+	PRK09040, PRK09040, hypothetical protein; Provisional	NA|218aa|up_4|NZ_CP019236.1_4938589_4939243_+	pfam11445, DUF2894, Protein of unknown function (DUF2894)	NA|351aa|up_3|NZ_CP019236.1_4939394_4940447_+	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|323aa|up_2|NZ_CP019236.1_4940457_4941426_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|143aa|up_1|NZ_CP019236.1_4941795_4942224_+	NA	NA|455aa|up_0|NZ_CP019236.1_4942329_4943694_+	PRK14325, PRK14325, (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional	NA|471aa|down_0|NZ_CP019236.1_4943926_4945339_+	PRK11100, PRK11100, sensory histidine kinase CreC; Provisional	NA|463aa|down_1|NZ_CP019236.1_4946110_4947499_+	PRK15115, PRK15115, response regulator GlrR; Provisional	NA|687aa|down_2|NZ_CP019236.1_4947919_4949980_+	PRK05454, PRK05454, glucans biosynthesis glucosyltransferase MdoH	NA|214aa|down_3|NZ_CP019236.1_4950049_4950691_-	cd02968, SCO, SCO (an acronym for Synthesis of Cytochrome c Oxidase) family; composed of proteins similar to Sco1, a membrane-anchored protein possessing a soluble domain with a TRX fold	NA|161aa|down_4|NZ_CP019236.1_4950749_4951232_-	NA	NA|308aa|down_5|NZ_CP019236.1_4951231_4952155_-	cd09614, griffithsin_like, Jacalin-like lectin domain of griffithsin and related proteins	NA|1600aa|down_6|NZ_CP019236.1_4952006_4956806_-	cd02851, E_set_GO_C, C-terminal Early set domain associated with the catalytic domain of galactose oxidase	NA|458aa|down_7|NZ_CP019236.1_4957008_4958382_-	PRK10867, PRK10867, signal recognition particle protein; Provisional	NA|267aa|down_8|NZ_CP019236.1_4958469_4959270_+	COG4137, COG4137, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|77aa|down_9|NZ_CP019236.1_4959266_4959497_+	NA
GCF_001955695.1_ASM195569v1	NZ_CP019236	Rhodoferax koreense strain DCY-110 chromosome, complete genome	5	5231386-5231490	5	CRISPRCasFinder	no		DEDDh,WYL,csa3,RT,DinG,cas3	Orphan	CACTTCGACAGGCTCAGTGCGAACGG	26	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,csa3,RT,DinG,cas3	NA,NA|537aa|down_4|NZ_CP019236.1_5235004_5236615_-	NA|306aa|up_9|NZ_CP019236.1_5220170_5221088_+	cd01561, CBS_like, CBS_like: This subgroup includes Cystathionine beta-synthase (CBS) and Cysteine synthase	NA|380aa|up_8|NZ_CP019236.1_5221098_5222238_+	smart00271, DnaJ, DnaJ molecular chaperone homology domain	NA|100aa|up_7|NZ_CP019236.1_5222239_5222539_-	COG3668, ParE, Plasmid stabilization system protein [General function prediction only]	NA|95aa|up_6|NZ_CP019236.1_5222560_5222845_-	PRK11809, putA, trifunctional transcriptional regulator/proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase; Reviewed	NA|1185aa|up_5|NZ_CP019236.1_5222899_5226454_-	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|227aa|up_4|NZ_CP019236.1_5226450_5227131_-	COG2080, CoxS, Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]	NA|237aa|up_3|NZ_CP019236.1_5227140_5227851_-	cd01011, nicotinamidase, Nicotinamidase/pyrazinamidase (PZase)	NA|244aa|up_2|NZ_CP019236.1_5227874_5228606_-	cd03224, ABC_TM1139_LivF_branched, ATP-binding cassette domain of branched-chain amino acid transporter	NA|266aa|up_1|NZ_CP019236.1_5228602_5229400_-	cd03219, ABC_Mj1267_LivG_branched, ATP-binding cassette component of branched chain amino acids transport system	NA|637aa|up_0|NZ_CP019236.1_5229396_5231307_-	cd06581, TM_PBP1_LivM_like, Transmembrane subunit (TM) of Escherichia coli LivM and related proteins	NA|399aa|down_0|NZ_CP019236.1_5231511_5232708_-	cd06330, PBP1_As_SBP-like, periplasmic substrate-binding domain of active transport proteins	NA|195aa|down_1|NZ_CP019236.1_5232767_5233352_-	pfam06684, AA_synth, Amino acid synthesis	NA|214aa|down_2|NZ_CP019236.1_5233379_5234021_-	pfam06684, AA_synth, Amino acid synthesis	NA|316aa|down_3|NZ_CP019236.1_5234046_5234994_-	PRK04334, PRK04334, hypothetical protein; Provisional	NA|537aa|down_4|NZ_CP019236.1_5235004_5236615_-	NA	NA|229aa|down_5|NZ_CP019236.1_5236845_5237532_+	pfam17938, TetR_C_29, Tetracyclin repressor-like, C-terminal domain	NA|398aa|down_6|NZ_CP019236.1_5237562_5238756_+	cd01292, metallo-dependent_hydrolases, Superfamily of metallo-dependent hydrolases (also called amidohydrolase superfamily) is a large group of proteins that show conservation in their 3-dimensional fold (TIM barrel) and in details of their active site	NA|420aa|down_7|NZ_CP019236.1_5238835_5240095_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|286aa|down_8|NZ_CP019236.1_5240168_5241026_+	PRK06180, PRK06180, short chain dehydrogenase; Provisional	NA|193aa|down_9|NZ_CP019236.1_5241055_5241634_-	COG3224, COG3224, Uncharacterized protein conserved in bacteria [Function unknown]
