assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000148685.1_ASM14868v1	NC_014539	Burkholderia sp. CCGE1003 chromosome 1, complete sequence	1	1795159-1795434	1	CRT	no		cas3,WYL,csa3,DinG,DEDDh,RT	Orphan	GCGGCAATCGAAGCGGCA	18	0	0	NA	NA	NA	6	6	Orphan	cas3,WYL,csa3,DinG,DEDDh,RT	NA,NA|101aa|down_2|NC_014539.1_1799924_1800227_+,NA|188aa|down_5|NC_014539.1_1802778_1803342_+,NA|164aa|down_9|NC_014539.1_1808051_1808543_-	NA|394aa|up_9|NC_014539.1_1783839_1785021_+	PRK05790, PRK05790, putative acyltransferase; Provisional	NA|247aa|up_8|NC_014539.1_1785228_1785969_+	PRK12938, PRK12938, 3-ketoacyl-ACP reductase	NA|202aa|up_7|NC_014539.1_1786097_1786703_+	COG5394, COG5394, Uncharacterized protein conserved in bacteria [Function unknown]	NA|464aa|up_6|NC_014539.1_1787116_1788508_+	PRK14862, rimO, 30S ribosomal protein S12 methylthiotransferase RimO	NA|312aa|up_5|NC_014539.1_1788511_1789447_+	cd01166, KdgK, 2-keto-3-deoxygluconate kinase (KdgK) phosphorylates 2-keto-3-deoxygluconate (KDG) to form 2-keto-3-deoxy-6-phosphogluconate (KDGP)	NA|395aa|up_4|NC_014539.1_1789522_1790707_+	PRK09051, PRK09051, beta-ketothiolase BktB	NA|226aa|up_3|NC_014539.1_1790754_1791432_+	PRK07994, PRK07994, DNA polymerase III subunits gamma and tau; Validated	NA|395aa|up_2|NC_014539.1_1791562_1792747_+	PRK07050, PRK07050, cystathionine beta-lyase; Provisional	NA|282aa|up_1|NC_014539.1_1792802_1793648_-	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|411aa|up_0|NC_014539.1_1793849_1795082_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|342aa|down_0|NC_014539.1_1795600_1796626_+	cd00789, KU_like, Ku-core domain, Ku-like subfamily; composed of prokaryotic homologs of the eukaryotic DNA binding protein Ku	NA|975aa|down_1|NC_014539.1_1796832_1799757_+	PRK05972, ligD, ATP-dependent DNA ligase; Reviewed	NA|101aa|down_2|NC_014539.1_1799924_1800227_+	NA	NA|321aa|down_3|NC_014539.1_1800404_1801367_-	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain	NA|339aa|down_4|NC_014539.1_1801561_1802578_+	cd08252, AL_MDR, Arginate lyase and other MDR family members	NA|188aa|down_5|NC_014539.1_1802778_1803342_+	NA	NA|269aa|down_6|NC_014539.1_1803500_1804307_-	pfam03649, UPF0014, Uncharacterized protein family (UPF0014)	NA|237aa|down_7|NC_014539.1_1804303_1805014_-	COG4619, COG4619, ABC-type uncharacterized transport system, ATPase component [General function prediction only]	NA|858aa|down_8|NC_014539.1_1805146_1807720_+	COG2982, AsmA, Uncharacterized protein involved in outer membrane biogenesis [Cell envelope biogenesis, outer membrane]	NA|164aa|down_9|NC_014539.1_1808051_1808543_-	NA
GCF_000148685.1_ASM14868v1	NC_014539	Burkholderia sp. CCGE1003 chromosome 1, complete sequence	2	3000779-3000871	1	CRISPRCasFinder	no	csa3,DEDDh	cas3,WYL,csa3,DinG,DEDDh,RT	Type I-A	GTTCGACTAACGACGAGCGCCTGT	24	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,DinG,DEDDh,RT	NA|94aa|up_4|NC_014539.1_2995975_2996257_-,NA	NA|277aa|up_9|NC_014539.1_2989772_2990603_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|255aa|up_8|NC_014539.1_2990652_2991417_+	COG1296, AzlC, Predicted branched-chain amino acid permease (azaleucine resistance) [Amino acid transport and metabolism]	NA|110aa|up_7|NC_014539.1_2991413_2991743_+	COG4392, COG4392, Predicted membrane protein [Function unknown]	NA|256aa|up_6|NC_014539.1_2992365_2993133_+	pfam13317, DUF4088, Protein of unknown function (DUF4088)	NA|782aa|up_5|NC_014539.1_2993293_2995639_-	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|94aa|up_4|NC_014539.1_2995975_2996257_-	NA	NA|158aa|up_3|NC_014539.1_2996355_2996829_-	PRK07994, PRK07994, DNA polymerase III subunits gamma and tau; Validated	NA|73aa|up_2|NC_014539.1_2997153_2997372_+	pfam13275, S4_2, S4 domain	NA|471aa|up_1|NC_014539.1_2997694_2999107_+	cd13131, MATE_NorM_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Vibrio cholerae NorM	NA|320aa|up_0|NC_014539.1_2999704_3000664_+	COG0385, COG0385, Predicted Na+-dependent transporter [General function prediction only]	NA|539aa|down_0|NC_014539.1_3001223_3002840_+	cd09113, PLDc_ymdC_like_2, Putative catalytic domain, repeat 2, of Escherichia coli uncharacterized protein ymdC and similar proteins	NA|467aa|down_1|NC_014539.1_3002853_3004254_+	PRK00485, fumC, fumarate hydratase; Reviewed	NA|169aa|down_2|NC_014539.1_3004383_3004890_-	COG1607, COG1607, Acyl-CoA hydrolase [Lipid metabolism]	NA|386aa|down_3|NC_014539.1_3004966_3006124_-	COG4977, COG4977, Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain [Transcription]	NA|405aa|down_4|NC_014539.1_3006123_3007338_-	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	csa3|238aa|down_5|NC_014539.1_3008111_3008825_+	COG0640, ArsR, Predicted transcriptional regulators [Transcription]	NA|934aa|down_6|NC_014539.1_3008989_3011791_+	pfam09770, PAT1, Topoisomerase II-associated protein PAT1	NA|324aa|down_7|NC_014539.1_3011884_3012856_-	PRK13821, thyA, thymidylate synthase; Provisional	NA|465aa|down_8|NC_014539.1_3013219_3014614_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|167aa|down_9|NC_014539.1_3015409_3015910_+	pfam00186, DHFR_1, Dihydrofolate reductase
GCF_000148685.1_ASM14868v1	NC_014540	Burkholderia sp. CCGE1003 chromosome 2, complete sequence	1	222374-222472	1	CRISPRCasFinder	no		DinG,cas3,csa3	Orphan	CGCGACCGTGGCCGGCGCTGTCGG	24	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,DinG,DEDDh,RT	NA|103aa|up_9|NC_014540.1_212793_213102_-,NA|79aa|up_8|NC_014540.1_213500_213737_+,NA|82aa|up_7|NC_014540.1_213812_214058_-,NA	NA|103aa|up_9|NC_014540.1_212793_213102_-	NA	NA|79aa|up_8|NC_014540.1_213500_213737_+	NA	NA|82aa|up_7|NC_014540.1_213812_214058_-	NA	NA|72aa|up_6|NC_014540.1_214297_214513_+	PRK09752, PRK09752, AIDA-I family autotransporter YfaL	NA|369aa|up_5|NC_014540.1_214667_215774_-	cd00342, gram_neg_porins, Porins form aqueous channels for the diffusion of small hydrophillic molecules across the outer membrane	NA|227aa|up_4|NC_014540.1_215965_216646_-	PRK10995, PRK10995, MarC family NAAT transporter	NA|72aa|up_3|NC_014540.1_217017_217233_+	pfam11755, DUF3311, Protein of unknown function (DUF3311)	NA|491aa|up_2|NC_014540.1_217229_218702_+	COG0591, PutP, Na+/proline symporter [Amino acid transport and metabolism / General function prediction only]	NA|534aa|up_1|NC_014540.1_218976_220578_+	PRK00654, glgA, glycogen synthase GlgA	NA|383aa|up_0|NC_014540.1_220779_221928_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|722aa|down_0|NC_014540.1_223543_225709_-	cd09601, M1_APN-Q_like, Peptidase M1 aminopeptidase N catalytic domain family which includes aminopeptidase N (APN), aminopeptidase Q (APQ), tricorn interacting factor F3, and endoplasmic reticulum aminopeptidase 1 (ERAP1)	NA|388aa|down_1|NC_014540.1_226297_227461_-	pfam06441, EHN, Epoxide hydrolase N-terminus	NA|128aa|down_2|NC_014540.1_227598_227982_+	pfam03788, LrgA, LrgA family	NA|247aa|down_3|NC_014540.1_227978_228719_+	pfam04172, LrgB, LrgB-like family	NA|568aa|down_4|NC_014540.1_229061_230765_+	PRK13272, treA, alpha,alpha-trehalase TreA	NA|213aa|down_5|NC_014540.1_230879_231518_+	pfam13532, 2OG-FeII_Oxy_2, 2OG-Fe(II) oxygenase superfamily	NA|212aa|down_6|NC_014540.1_231537_232173_-	cd00884, beta_CA_cladeB, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|91aa|down_7|NC_014540.1_232391_232664_-	pfam07369, DUF1488, Protein of unknown function (DUF1488)	NA|498aa|down_8|NC_014540.1_232848_234342_+	cd17502, MFS_Azr1_MDR_like, Saccharomyces cerevisiae Azole resistance protein 1 (Azr1p), and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|242aa|down_9|NC_014540.1_234413_235139_-	COG2875, CobM, Precorrin-4 methylase [Coenzyme metabolism]
GCF_000148685.1_ASM14868v1	NC_014540	Burkholderia sp. CCGE1003 chromosome 2, complete sequence	2	1113791-1113867	2	CRISPRCasFinder	no		DinG,cas3,csa3	Orphan	GTCGACCGGAGTTCGAGCGAAGC	23	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,DinG,DEDDh,RT	NA,NA|235aa|down_7|NC_014540.1_1121189_1121894_-,NA|96aa|down_9|NC_014540.1_1122819_1123107_+	NA|318aa|up_9|NC_014540.1_1103732_1104686_-	pfam00582, Usp, Universal stress protein family	NA|230aa|up_8|NC_014540.1_1104738_1105428_-	cd04586, CBS_pair_BON_assoc, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain	NA|342aa|up_7|NC_014540.1_1105648_1106674_+	cd08297, CAD3, Cinnamyl alcohol dehydrogenases (CAD)	NA|128aa|up_6|NC_014540.1_1106962_1107346_+	pfam12087, DUF3564, Protein of unknown function (DUF3564)	NA|64aa|up_5|NC_014540.1_1107386_1107578_-	pfam11177, DUF2964, Protein of unknown function (DUF2964)	NA|658aa|up_4|NC_014540.1_1107791_1109765_+	COG3284, AcoR, Transcriptional activator of acetoin/glycerol metabolism [Secondary metabolites biosynthesis, transport, and catabolism / Transcription]	NA|25aa|up_3|NC_014540.1_1110060_1110135_+	PRK00284, pqqA, pyrroloquinoline quinone precursor peptide PqqA	NA|94aa|up_2|NC_014540.1_1110266_1110548_+	TIGR03859, PQQ_PqqD, coenzyme PQQ biosynthesis protein PqqD	NA|417aa|up_1|NC_014540.1_1110708_1111959_-	smart00062, PBPb, Bacterial periplasmic substrate-binding proteins	NA|576aa|up_0|NC_014540.1_1112048_1113776_-	cd10277, PQQ_ADH_I, Ethanol dehydrogenase, a bacterial quinoprotein (PQQ-dependent type I alcohol dehydrogenase)	NA|299aa|down_0|NC_014540.1_1114219_1115116_-	cd08451, PBP2_BudR, The C-terminal substrate binding domain of LysR-type transcrptional regulator BudR, which is responsible for activation of the expression of the butanediol operon genes; contains the type 2 periplasmic binding fold	NA|139aa|down_1|NC_014540.1_1115334_1115751_+	cd03499, SQR_TypeC_SdhC, Succinate:quinone oxidoreductase (SQR) Type C subfamily, Succinate dehydrogenase C (SdhC) subunit; composed of bacterial SdhC and eukaryotic large cytochrome b binding (CybL) proteins	NA|123aa|down_2|NC_014540.1_1115755_1116124_+	TIGR02968, succ_dehyd_anc, succinate dehydrogenase, hydrophobic membrane anchor protein	NA|592aa|down_3|NC_014540.1_1116128_1117904_+	PRK07057, sdhA, succinate dehydrogenase flavoprotein subunit; Reviewed	NA|235aa|down_4|NC_014540.1_1117928_1118633_+	PRK05950, sdhB, succinate dehydrogenase iron-sulfur subunit; Reviewed	NA|479aa|down_5|NC_014540.1_1119185_1120622_+	cd15482, Sialidase_non-viral, Non-viral sialidases	NA|107aa|down_6|NC_014540.1_1120700_1121021_-	pfam06865, DUF1255, Protein of unknown function (DUF1255)	NA|235aa|down_7|NC_014540.1_1121189_1121894_-	NA	NA|154aa|down_8|NC_014540.1_1122116_1122578_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|96aa|down_9|NC_014540.1_1122819_1123107_+	NA
GCF_000148685.1_ASM14868v1	NC_014540	Burkholderia sp. CCGE1003 chromosome 2, complete sequence	3	1340109-1340205	3	CRISPRCasFinder	no		DinG,cas3,csa3	Orphan	GGCGTGCGGCGGTTTATGCGCGC	23	0	0	NA	NA	NA	2	2	Orphan	cas3,WYL,csa3,DinG,DEDDh,RT	NA|68aa|up_5|NC_014540.1_1329293_1329497_-,NA|88aa|up_4|NC_014540.1_1330040_1330304_-,NA|84aa|down_1|NC_014540.1_1341192_1341444_+,NA|93aa|down_4|NC_014540.1_1345010_1345289_+,NA|86aa|down_6|NC_014540.1_1347903_1348161_-	NA|346aa|up_9|NC_014540.1_1325346_1326384_-	cd19079, AKR_EcYajO-like, Escherichia coli YajO and similar proteins	NA|175aa|up_8|NC_014540.1_1326434_1326959_-	pfam08714, Fae, Formaldehyde-activating enzyme (Fae)	NA|97aa|up_7|NC_014540.1_1327610_1327901_-	pfam00816, Histone_HNS, H-NS histone family	NA|400aa|up_6|NC_014540.1_1328029_1329229_-	pfam00144, Beta-lactamase, Beta-lactamase	NA|68aa|up_5|NC_014540.1_1329293_1329497_-	NA	NA|88aa|up_4|NC_014540.1_1330040_1330304_-	NA	NA|500aa|up_3|NC_014540.1_1330392_1331892_-	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]	NA|467aa|up_2|NC_014540.1_1332219_1333620_+	COG2610, GntT, H+/gluconate symporter and related permeases [Carbohydrate transport and metabolism / Amino acid transport and metabolism]	NA|134aa|up_1|NC_014540.1_1333730_1334132_+	COG0251, TdcF, Putative translation initiation inhibitor, yjgF family [Translation, ribosomal structure and biogenesis]	NA|578aa|up_0|NC_014540.1_1338178_1339912_+	PRK15041, PRK15041, methyl-accepting chemotaxis protein	NA|201aa|down_0|NC_014540.1_1340253_1340856_+	COG2318, DinB, Uncharacterized protein conserved in bacteria [Function unknown]	NA|84aa|down_1|NC_014540.1_1341192_1341444_+	NA	NA|331aa|down_2|NC_014540.1_1341804_1342797_+	cd01444, GlpE_ST, GlpE sulfurtransferase (ST) and homologs are members of the Rhodanese Homology Domain superfamily	NA|572aa|down_3|NC_014540.1_1342882_1344598_+	COG0492, TrxB, Thioredoxin reductase [Posttranslational modification, protein turnover, chaperones]	NA|93aa|down_4|NC_014540.1_1345010_1345289_+	NA	NA|200aa|down_5|NC_014540.1_1347156_1347756_-	COG1280, RhtB, Putative threonine efflux protein [Amino acid transport and metabolism]	NA|86aa|down_6|NC_014540.1_1347903_1348161_-	NA	NA|308aa|down_7|NC_014540.1_1348356_1349280_-	PRK11272, PRK11272, putative DMT superfamily transporter inner membrane protein; Provisional	NA|307aa|down_8|NC_014540.1_1349456_1350377_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|1007aa|down_9|NC_014540.1_1350505_1353526_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]
GCF_000148685.1_ASM14868v1	NC_014540	Burkholderia sp. CCGE1003 chromosome 2, complete sequence	4	1367440-1367527	4	CRISPRCasFinder	no		DinG,cas3,csa3	Orphan	GCTCCCGAAAATCCTCACCTTTG	23	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,DinG,DEDDh,RT	NA,NA|181aa|down_1|NC_014540.1_1368365_1368908_-	NA|243aa|up_9|NC_014540.1_1357305_1358034_+	COG1802, GntR, Transcriptional regulators [Transcription]	NA|144aa|up_8|NC_014540.1_1358858_1359290_+	TIGR03357, hypothetical_protein_Atu4338, type VI secretion system lysozyme-like protein	NA|93aa|up_7|NC_014540.1_1359339_1359618_+	cd14744, PAAR_CT_2, proline-alanine-alanine-arginine (PAAR) domain with uncharacterized C-terminal extension	NA|446aa|up_6|NC_014540.1_1359811_1361149_-	cd17319, MFS_ExuT_GudP_like, Hexuronate transporter, Glucarate transporter, and similar transporters of the Major Facilitator Superfamily	NA|324aa|up_5|NC_014540.1_1361210_1362182_-	cd07938, DRE_TIM_HMGL, 3-hydroxy-3-methylglutaryl-CoA lyase, catalytic TIM barrel domain	NA|398aa|up_4|NC_014540.1_1362178_1363372_-	COG1804, CaiB, Predicted acyl-CoA transferases/carnitine dehydratase [Energy production and conversion]	NA|322aa|up_3|NC_014540.1_1363491_1364457_-	cd08421, PBP2_LTTR_like_1, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|213aa|up_2|NC_014540.1_1364529_1365168_-	TIGR03401, Uncharacterized_protein_YFL061W/YNL335W, HD domain protein, cyanamide hydratase family	NA|321aa|up_1|NC_014540.1_1365414_1366377_+	COG4977, COG4977, Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain [Transcription]	NA|253aa|up_0|NC_014540.1_1366459_1367218_+	PRK07023, PRK07023, SDR family oxidoreductase	NA|122aa|down_0|NC_014540.1_1367960_1368326_-	cd17562, REC_CheY4-like, phosphoacceptor receiver (REC) domain of chemotaxis response regulator CheY4 and similar CheY family proteins	NA|181aa|down_1|NC_014540.1_1368365_1368908_-	NA	NA|245aa|down_2|NC_014540.1_1368904_1369639_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|797aa|down_3|NC_014540.1_1369693_1372084_-	cd16916, HATPase_CheA-like, Histidine kinase-like ATPase domain of the chemotaxis protein histidine kinase CheA, and some hybrid sensor histidine kinases	NA|591aa|down_4|NC_014540.1_1372399_1374172_+	cd09173, PLDc_Nuc_like_unchar1_2, Putative catalytic domain, repeat 2, of uncharacterized hypothetical proteins similar to Nuc, an endonuclease from Salmonella typhimurium	NA|510aa|down_5|NC_014540.1_1374292_1375822_-	PRK09837, PRK09837, Cu(I)/Ag(I) efflux RND transporter outer membrane protein	NA|1077aa|down_6|NC_014540.1_1375818_1379049_-	TIGR00915, Probable_aminoglycoside_efflux_pump, The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family	NA|410aa|down_7|NC_014540.1_1379061_1380291_-	PRK15030, PRK15030, multidrug efflux RND transporter periplasmic adaptor subunit AcrA	NA|246aa|down_8|NC_014540.1_1380711_1381449_+	PRK09468, ompR, osmolarity response regulator; Provisional	NA|416aa|down_9|NC_014540.1_1381432_1382680_+	PRK09467, envZ, osmolarity sensor protein; Provisional
