assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_013154915.1_ASM1315491v1	NZ_CP053553	Diaphorobacter sp. JS3050 chromosome, complete genome	1	65064-68533	1,1,1,2,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas2,cas1,cas9	DinG,DEDDh,cas2,cas1,cas9,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,csa3,PrimPol,Cas9_archaeal	Type II-C, or Type II-C?, Type II-B,Type II-A,Type II-B	GGTTGTAGCTCCCTCTCTCACCCCGGATAGCTACACT,GTTGTAGCTCCCTCTCTCACCCCGGATAGCTACACT,GTTGTAGCTCCCTCTCTCACCCCGGATAGCTACACT,GTTGTAGCTCCCTCTCTCACCCCGGATAGCTACACT,GTTGTAGCTCCCTCTCTCACCCCGGATAGCTACACT	37,36,36,36,36	0	0	NA	NA	NA:NA:NA:NA:NA	40,52,52,40,40	52	TypeII-C,orTypeII-C?,TypeII-B,TypeII-A,TypeII-B	DinG,DEDDh,cas2,cas1,cas9,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,csa3,PrimPol,Cas9_archaeal	NA|203aa|up_5|NZ_CP053553.1_59632_60241_-,NA	NA|266aa|up_9|NZ_CP053553.1_55742_56540_+	PRK08317, PRK08317, hypothetical protein; Provisional	NA|347aa|up_8|NZ_CP053553.1_56700_57741_+	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|110aa|up_7|NZ_CP053553.1_57896_58226_-	pfam13442, Cytochrome_CBB3, Cytochrome C oxidase, cbb3-type, subunit III	NA|407aa|up_6|NZ_CP053553.1_58332_59553_-	cd02110, SO_family_Moco_dimer, Subgroup of sulfite oxidase (SO) family molybdopterin binding domains that contains conserved dimerization domain	NA|203aa|up_5|NZ_CP053553.1_59632_60241_-	NA	NA|177aa|up_4|NZ_CP053553.1_60237_60768_-	PRK11924, PRK11924, RNA polymerase sigma factor; Provisional	NA|309aa|up_3|NZ_CP053553.1_60931_61858_-	cd06124, cupin_NimR-like_N, AraC/XylS family transcriptional regulators similar to NimR, N-terminal cupin domain	NA|416aa|up_2|NZ_CP053553.1_61917_63165_+	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|248aa|up_1|NZ_CP053553.1_63478_64222_+	COG3703, ChaC, Uncharacterized protein involved in cation transport [Inorganic ion transport and metabolism]	NA|201aa|up_0|NZ_CP053553.1_64360_64963_+	PRK03767, PRK03767, NAD(P)H:quinone oxidoreductase; Provisional	NA|362aa|down_0|NZ_CP053553.1_68614_69700_+	pfam13358, DDE_3, DDE superfamily endonuclease	cas2|103aa|down_1|NZ_CP053553.1_71940_72249_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|303aa|down_2|NZ_CP053553.1_72291_73200_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas9|1123aa|down_3|NZ_CP053553.1_73186_76555_-	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	NA|475aa|down_4|NZ_CP053553.1_76748_78173_-	PLN02805, PLN02805, D-lactate dehydrogenase [cytochrome]	NA|127aa|down_5|NZ_CP053553.1_78722_79103_-	cd15898, EFh_PI-PLC, EF-hand motif found in eukaryotic phosphoinositide-specific phospholipase C (PI-PLC, EC 3	NA|191aa|down_6|NZ_CP053553.1_79258_79831_+	pfam01923, Cob_adeno_trans, Cobalamin adenosyltransferase	NA|141aa|down_7|NZ_CP053553.1_80189_80612_-	TIGR00369, Putative_esterase, uncharacterized domain 1	NA|386aa|down_8|NZ_CP053553.1_80835_81993_+	cd06326, PBP1_ABC_ligand_binding-like, periplasmic ligand-binding domain of uncharacterized ABC-type transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|360aa|down_9|NZ_CP053553.1_82372_83452_-	pfam13358, DDE_3, DDE superfamily endonuclease
GCF_013154915.1_ASM1315491v1	NZ_CP053553	Diaphorobacter sp. JS3050 chromosome, complete genome	2	69729-71878	2,2,4,5	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas2,cas1,cas9	DinG,DEDDh,cas2,cas1,cas9,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,csa3,PrimPol,Cas9_archaeal	Type II-C, or Type II-C?, Type II-B,Type II-A,Type II-B	GTTGTAGCTCCCTCTCTCACCCCGGATAGCTACACT,GTTGTAGCTCCCTCTCTCACCCCGGATAGCTACACT,GTTGTAGCTCCCTCTCTCACCCCGGATAGCTACACT,GTTGTAGCTCCCTCTCTCACCCCGGATAGCTACACTCG	36,36,36,38	0	0	NA	NA	NA:NA:NA:NA	32,32,28,28	32	TypeII-C,orTypeII-C?,TypeII-B,TypeII-A,TypeII-B	DinG,DEDDh,cas2,cas1,cas9,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,csa3,PrimPol,Cas9_archaeal	NA|203aa|up_6|NZ_CP053553.1_59632_60241_-,NA	NA|347aa|up_9|NZ_CP053553.1_56700_57741_+	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|110aa|up_8|NZ_CP053553.1_57896_58226_-	pfam13442, Cytochrome_CBB3, Cytochrome C oxidase, cbb3-type, subunit III	NA|407aa|up_7|NZ_CP053553.1_58332_59553_-	cd02110, SO_family_Moco_dimer, Subgroup of sulfite oxidase (SO) family molybdopterin binding domains that contains conserved dimerization domain	NA|203aa|up_6|NZ_CP053553.1_59632_60241_-	NA	NA|177aa|up_5|NZ_CP053553.1_60237_60768_-	PRK11924, PRK11924, RNA polymerase sigma factor; Provisional	NA|309aa|up_4|NZ_CP053553.1_60931_61858_-	cd06124, cupin_NimR-like_N, AraC/XylS family transcriptional regulators similar to NimR, N-terminal cupin domain	NA|416aa|up_3|NZ_CP053553.1_61917_63165_+	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|248aa|up_2|NZ_CP053553.1_63478_64222_+	COG3703, ChaC, Uncharacterized protein involved in cation transport [Inorganic ion transport and metabolism]	NA|201aa|up_1|NZ_CP053553.1_64360_64963_+	PRK03767, PRK03767, NAD(P)H:quinone oxidoreductase; Provisional	NA|362aa|up_0|NZ_CP053553.1_68614_69700_+	pfam13358, DDE_3, DDE superfamily endonuclease	cas2|103aa|down_0|NZ_CP053553.1_71940_72249_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|303aa|down_1|NZ_CP053553.1_72291_73200_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas9|1123aa|down_2|NZ_CP053553.1_73186_76555_-	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	NA|475aa|down_3|NZ_CP053553.1_76748_78173_-	PLN02805, PLN02805, D-lactate dehydrogenase [cytochrome]	NA|127aa|down_4|NZ_CP053553.1_78722_79103_-	cd15898, EFh_PI-PLC, EF-hand motif found in eukaryotic phosphoinositide-specific phospholipase C (PI-PLC, EC 3	NA|191aa|down_5|NZ_CP053553.1_79258_79831_+	pfam01923, Cob_adeno_trans, Cobalamin adenosyltransferase	NA|141aa|down_6|NZ_CP053553.1_80189_80612_-	TIGR00369, Putative_esterase, uncharacterized domain 1	NA|386aa|down_7|NZ_CP053553.1_80835_81993_+	cd06326, PBP1_ABC_ligand_binding-like, periplasmic ligand-binding domain of uncharacterized ABC-type transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|360aa|down_8|NZ_CP053553.1_82372_83452_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|489aa|down_9|NZ_CP053553.1_83688_85155_-	TIGR03458, Propionyl-CoA:succinate_CoA_transferase, succinate CoA transferase
GCF_013154915.1_ASM1315491v1	NZ_CP053553	Diaphorobacter sp. JS3050 chromosome, complete genome	3	589349-589739	3,3,6	CRISPRCasFinder,CRT,PILER-CR	no	cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	DinG,DEDDh,cas2,cas1,cas9,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,csa3,PrimPol,Cas9_archaeal	Type I-F	TTTCTGAGCTGCCTATGCGGCAGTGAAC,TTTCTGAGCTGCCTATGCGGCAGTGAAC,TTTCTGAGCTGCCTATGCGGCAGTGAACA	28,28,29	0	0	NA	NA	I-F:I-F:I-F	6,6,2	6	TypeI-F	DinG,DEDDh,cas2,cas1,cas9,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,csa3,PrimPol,Cas9_archaeal	NA|224aa|up_5|NZ_CP053553.1_584271_584943_+,NA	NA|249aa|up_9|NZ_CP053553.1_579343_580090_+	COG1187, RsuA, 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases [Translation, ribosomal structure and biogenesis]	NA|651aa|up_8|NZ_CP053553.1_580436_582389_+	PRK05218, PRK05218, heat shock protein 90; Provisional	NA|331aa|up_7|NZ_CP053553.1_582592_583585_+	cd07012, PBP2_Bug_TTT, Bug (Bordetella uptake gene) protein family of periplasmic solute-binding receptors; contains the type 2 periplasmic binding fold	NA|171aa|up_6|NZ_CP053553.1_583732_584245_+	TIGR04177, conserved_hypothetical_protein_partial, exosortase H, IPTLxxWG-CTERM-specific	NA|224aa|up_5|NZ_CP053553.1_584271_584943_+	NA	NA|347aa|up_4|NZ_CP053553.1_584949_585990_-	PRK11768, PRK11768, serine/threonine protein kinase	NA|480aa|up_3|NZ_CP053553.1_586025_587465_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|253aa|up_2|NZ_CP053553.1_587507_588266_-	cd00838, MPP_superfamily, metallophosphatase superfamily, metallophosphatase domain	NA|139aa|up_1|NZ_CP053553.1_588396_588813_-	TIGR03561, organ_hyd_perox, peroxiredoxin, Ohr subfamily	NA|152aa|up_0|NZ_CP053553.1_588873_589329_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	cas6f|192aa|down_0|NZ_CP053553.1_593177_593753_-	pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4)	cas7f|342aa|down_1|NZ_CP053553.1_593766_594792_-	pfam09615, Cas_Csy3, CRISPR-associated protein (Cas_Csy3)	cas5f|359aa|down_2|NZ_CP053553.1_594848_595925_-	pfam09614, Cas_Csy2, CRISPR-associated protein (Cas_Csy2)	cas8f|455aa|down_3|NZ_CP053553.1_595921_597286_-	pfam09611, Cas_Csy1, CRISPR-associated protein (Cas_Csy1)	cas3-cas2|1171aa|down_4|NZ_CP053553.1_597646_601159_-	TIGR02562, conserved_hypothetical_protein, CRISPR-associated helicase Cas3, subtype I-F/YPEST	cas1|335aa|down_5|NZ_CP053553.1_601155_602160_-	TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype I-F/YPEST	NA|394aa|down_6|NZ_CP053553.1_602195_603377_-	cd06819, PLPDE_III_LS_D-TA, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Low Specificity D-Threonine Aldolase	NA|215aa|down_7|NZ_CP053553.1_603360_604005_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|155aa|down_8|NZ_CP053553.1_604027_604492_-	PRK11667, PRK11667, hypothetical protein; Provisional	NA|205aa|down_9|NZ_CP053553.1_604519_605134_-	pfam13590, DUF4136, Domain of unknown function (DUF4136)
GCF_013154915.1_ASM1315491v1	NZ_CP053553	Diaphorobacter sp. JS3050 chromosome, complete genome	4	589891-593039	7,4,4	PILER-CR,CRISPRCasFinder,CRT	no	cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	DinG,DEDDh,cas2,cas1,cas9,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,csa3,PrimPol,Cas9_archaeal	Type I-F	TTTCTGAGCTGCCTATGCGGCAGTGAAC,TTTCTGAGCTGCCTATGCGGCAGTGAAC,TTTCTGAGCTGCCTATGCGGCAGTGAAC	28,28,28	0	0	NA	NA	I-F:I-F:I-F	52,52,52	52	TypeI-F	DinG,DEDDh,cas2,cas1,cas9,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,csa3,PrimPol,Cas9_archaeal	NA|224aa|up_5|NZ_CP053553.1_584271_584943_+,NA	NA|249aa|up_9|NZ_CP053553.1_579343_580090_+	COG1187, RsuA, 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases [Translation, ribosomal structure and biogenesis]	NA|651aa|up_8|NZ_CP053553.1_580436_582389_+	PRK05218, PRK05218, heat shock protein 90; Provisional	NA|331aa|up_7|NZ_CP053553.1_582592_583585_+	cd07012, PBP2_Bug_TTT, Bug (Bordetella uptake gene) protein family of periplasmic solute-binding receptors; contains the type 2 periplasmic binding fold	NA|171aa|up_6|NZ_CP053553.1_583732_584245_+	TIGR04177, conserved_hypothetical_protein_partial, exosortase H, IPTLxxWG-CTERM-specific	NA|224aa|up_5|NZ_CP053553.1_584271_584943_+	NA	NA|347aa|up_4|NZ_CP053553.1_584949_585990_-	PRK11768, PRK11768, serine/threonine protein kinase	NA|480aa|up_3|NZ_CP053553.1_586025_587465_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|253aa|up_2|NZ_CP053553.1_587507_588266_-	cd00838, MPP_superfamily, metallophosphatase superfamily, metallophosphatase domain	NA|139aa|up_1|NZ_CP053553.1_588396_588813_-	TIGR03561, organ_hyd_perox, peroxiredoxin, Ohr subfamily	NA|152aa|up_0|NZ_CP053553.1_588873_589329_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	cas6f|192aa|down_0|NZ_CP053553.1_593177_593753_-	pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4)	cas7f|342aa|down_1|NZ_CP053553.1_593766_594792_-	pfam09615, Cas_Csy3, CRISPR-associated protein (Cas_Csy3)	cas5f|359aa|down_2|NZ_CP053553.1_594848_595925_-	pfam09614, Cas_Csy2, CRISPR-associated protein (Cas_Csy2)	cas8f|455aa|down_3|NZ_CP053553.1_595921_597286_-	pfam09611, Cas_Csy1, CRISPR-associated protein (Cas_Csy1)	cas3-cas2|1171aa|down_4|NZ_CP053553.1_597646_601159_-	TIGR02562, conserved_hypothetical_protein, CRISPR-associated helicase Cas3, subtype I-F/YPEST	cas1|335aa|down_5|NZ_CP053553.1_601155_602160_-	TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype I-F/YPEST	NA|394aa|down_6|NZ_CP053553.1_602195_603377_-	cd06819, PLPDE_III_LS_D-TA, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Low Specificity D-Threonine Aldolase	NA|215aa|down_7|NZ_CP053553.1_603360_604005_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|155aa|down_8|NZ_CP053553.1_604027_604492_-	PRK11667, PRK11667, hypothetical protein; Provisional	NA|205aa|down_9|NZ_CP053553.1_604519_605134_-	pfam13590, DUF4136, Domain of unknown function (DUF4136)
GCF_013154915.1_ASM1315491v1	NZ_CP053553	Diaphorobacter sp. JS3050 chromosome, complete genome	5	1959284-1959362	5	CRISPRCasFinder	no		DinG,DEDDh,cas2,cas1,cas9,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,csa3,PrimPol,Cas9_archaeal	Orphan	GCGCAGGTCGCCGCCGCAGTGCAGC	25	0	0	NA	NA	NA	1	1	Orphan	DinG,DEDDh,cas2,cas1,cas9,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,csa3,PrimPol,Cas9_archaeal	NA,NA|135aa|down_1|NZ_CP053553.1_1962410_1962815_-,NA|80aa|down_2|NZ_CP053553.1_1962840_1963080_-	NA|683aa|up_9|NZ_CP053553.1_1945644_1947693_-	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|147aa|up_8|NZ_CP053553.1_1947715_1948156_-	pfam06713, bPH_4, Bacterial PH domain	NA|511aa|up_7|NZ_CP053553.1_1948172_1949705_-	COG4799, COG4799, Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism]	NA|343aa|up_6|NZ_CP053553.1_1949730_1950759_-	PRK09435, PRK09435, methylmalonyl Co-A mutase-associated GTPase MeaB	NA|721aa|up_5|NZ_CP053553.1_1950755_1952918_-	PRK09426, PRK09426, methylmalonyl-CoA mutase; Reviewed	NA|212aa|up_4|NZ_CP053553.1_1953042_1953678_+	COG1802, GntR, Transcriptional regulators [Transcription]	NA|610aa|up_3|NZ_CP053553.1_1953845_1955675_+	COG1022, FAA1, Long-chain acyl-CoA synthetases (AMP-forming) [Lipid metabolism]	NA|390aa|up_2|NZ_CP053553.1_1955675_1956845_-	COG1748, LYS9, Saccharopine dehydrogenase and related proteins [Amino acid transport and metabolism]	NA|151aa|up_1|NZ_CP053553.1_1956982_1957435_+	COG1522, Lrp, Transcriptional regulators [Transcription]	NA|408aa|up_0|NZ_CP053553.1_1957499_1958723_-	TIGR00937, Chromate_transport_protein, chromate transporter, chromate ion transporter (CHR) family	NA|696aa|down_0|NZ_CP053553.1_1960146_1962234_-	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|135aa|down_1|NZ_CP053553.1_1962410_1962815_-	NA	NA|80aa|down_2|NZ_CP053553.1_1962840_1963080_-	NA	NA|182aa|down_3|NZ_CP053553.1_1963234_1963780_-	PRK14054, PRK14054, peptide-methionine (S)-S-oxide reductase	NA|221aa|down_4|NZ_CP053553.1_1963910_1964573_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|237aa|down_5|NZ_CP053553.1_1964620_1965331_+	COG2518, Pcm, Protein-L-isoaspartate carboxylmethyltransferase [Posttranslational modification, protein turnover, chaperones]	NA|113aa|down_6|NZ_CP053553.1_1965362_1965701_+	cd01528, RHOD_2, Member of the Rhodanese Homology Domain superfamily, subgroup 2	NA|194aa|down_7|NZ_CP053553.1_1967197_1967779_-	COG0262, FolA, Dihydrofolate reductase [Coenzyme metabolism]	NA|118aa|down_8|NZ_CP053553.1_1967823_1968177_-	PRK09272, PRK09272, hypothetical protein; Provisional	NA|902aa|down_9|NZ_CP053553.1_1968183_1970889_-	COG1042, COG1042, Acyl-CoA synthetase (NDP forming) [Energy production and conversion]
