assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000520895.1_ASM52089v1	NZ_CP004023	Burkholderia pseudomallei MSHR511 chromosome 1, complete sequence	1	3795643-3795722	1	CRISPRCasFinder	no		WYL,cas3,DEDDh,csa3,DinG	Orphan	ACGGAATGGGCCCCGGCATGATG	23	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,DEDDh,csa3,DinG	NA|258aa|up_9|NZ_CP004023.1_3785229_3786003_+,NA|95aa|up_6|NZ_CP004023.1_3788724_3789009_+,NA|155aa|up_4|NZ_CP004023.1_3790572_3791037_+,NA|183aa|down_0|NZ_CP004023.1_3796671_3797220_+,NA|116aa|down_7|NZ_CP004023.1_3804490_3804838_-,NA|57aa|down_9|NZ_CP004023.1_3806045_3806216_+	NA|258aa|up_9|NZ_CP004023.1_3785229_3786003_+	NA	NA|335aa|up_8|NZ_CP004023.1_3786707_3787712_+	pfam08065, K167R, K167R (NUC007) repeat	NA|161aa|up_7|NZ_CP004023.1_3788245_3788728_+	pfam10908, DUF2778, Protein of unknown function (DUF2778)	NA|95aa|up_6|NZ_CP004023.1_3788724_3789009_+	NA	NA|364aa|up_5|NZ_CP004023.1_3789082_3790174_-	PHA02536, Q, portal vertex protein; Provisional	NA|155aa|up_4|NZ_CP004023.1_3790572_3791037_+	NA	NA|292aa|up_3|NZ_CP004023.1_3791174_3792050_-	TIGR03381, putative_carbon-nitrogen_hydrolase, N-carbamoylputrescine amidase	NA|371aa|up_2|NZ_CP004023.1_3792060_3793173_-	cd09996, HDAC_classII_1, Histone deacetylases and histone-like deacetylases, classII	NA|365aa|up_1|NZ_CP004023.1_3793201_3794296_-	cd13659, PBP2_PotF, The periplasmic substrate-binding component of an ABC putrescine transport system and related proteins; contains the type 2 periplasmic-binding fold	NA|323aa|up_0|NZ_CP004023.1_3794438_3795407_+	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain	NA|183aa|down_0|NZ_CP004023.1_3796671_3797220_+	NA	NA|483aa|down_1|NZ_CP004023.1_3797483_3798932_+	TIGR02393, RNA_polymerase_sigma_factor_RpoD, RNA polymerase sigma factor RpoD, C-terminal domain	NA|538aa|down_2|NZ_CP004023.1_3798950_3800564_-	PRK02107, PRK02107, glutamate--cysteine ligase; Provisional	NA|314aa|down_3|NZ_CP004023.1_3800722_3801664_-	COG0122, AlkA, 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair]	NA|365aa|down_4|NZ_CP004023.1_3801660_3802755_-	PRK15435, PRK15435, bifunctional DNA-binding transcriptional regulator/O6-methylguanine-DNA methyltransferase Ada	NA|139aa|down_5|NZ_CP004023.1_3803156_3803573_+	cd07253, GLOD5, Human glyoxalase domain-containing protein 5 and similar proteins	NA|185aa|down_6|NZ_CP004023.1_3803720_3804275_-	pfam07152, YaeQ, YaeQ protein	NA|116aa|down_7|NZ_CP004023.1_3804490_3804838_-	NA	NA|187aa|down_8|NZ_CP004023.1_3805323_3805884_-	PRK15130, PRK15130, spermidine N1-acetyltransferase; Provisional	NA|57aa|down_9|NZ_CP004023.1_3806045_3806216_+	NA
GCF_000520895.1_ASM52089v1	NZ_CP004023	Burkholderia pseudomallei MSHR511 chromosome 1, complete sequence	2	3952970-3953237	1	CRT	no		WYL,cas3,DEDDh,csa3,DinG	Orphan	NAGCGCTGAAGCGCTGAC	18	0	0	NA	NA	NA	5	5	Orphan	WYL,cas3,DEDDh,csa3,DinG	NA,NA	NA|296aa|up_9|NZ_CP004023.1_3943568_3944456_+	TIGR04285, parB-like_partition_protein, nucleoid occlusion protein	NA|378aa|up_8|NZ_CP004023.1_3944463_3945597_+	cd01117, YbiR_permease, Putative anion permease YbiR	NA|177aa|up_7|NZ_CP004023.1_3945867_3946398_+	pfam03899, ATP-synt_I, ATP synthase I chain	NA|284aa|up_6|NZ_CP004023.1_3946510_3947362_+	PRK05815, PRK05815, F0F1 ATP synthase subunit A; Validated	NA|90aa|up_5|NZ_CP004023.1_3947438_3947708_+	PRK06876, PRK06876, F0F1 ATP synthase subunit C; Validated	NA|157aa|up_4|NZ_CP004023.1_3947835_3948306_+	PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated	NA|180aa|up_3|NZ_CP004023.1_3948308_3948848_+	PRK05758, PRK05758, F0F1 ATP synthase subunit delta; Validated	NA|514aa|up_2|NZ_CP004023.1_3948894_3950436_+	PRK09281, PRK09281, F0F1 ATP synthase subunit alpha; Validated	NA|292aa|up_1|NZ_CP004023.1_3950507_3951383_+	PRK05621, PRK05621, F0F1 ATP synthase subunit gamma; Validated	NA|465aa|up_0|NZ_CP004023.1_3951462_3952857_+	PRK09280, PRK09280, F0F1 ATP synthase subunit beta; Validated	NA|577aa|down_0|NZ_CP004023.1_3953851_3955582_+	PRK08315, PRK08315, AMP-binding domain protein; Validated	NA|45aa|down_1|NZ_CP004023.1_3956041_3956176_+	COG1832, COG1832, Predicted CoA-binding protein [General function prediction only]	NA|114aa|down_2|NZ_CP004023.1_3956141_3956483_+	COG1832, COG1832, Predicted CoA-binding protein [General function prediction only]	NA|262aa|down_3|NZ_CP004023.1_3956682_3957468_-	cd01069, PBP2_PheC, Cyclohexadienyl dehydratase, a member of the type 2 periplasmic binding fold protein superfamily	NA|365aa|down_4|NZ_CP004023.1_3957607_3958702_+	PRK00115, hemE, uroporphyrinogen decarboxylase; Validated	NA|756aa|down_5|NZ_CP004023.1_3959142_3961410_+	PRK05580, PRK05580, primosome assembly protein PriA; Validated	NA|57aa|down_6|NZ_CP004023.1_3961420_3961591_-	cd05928, MACS_euk, Eukaryotic Medium-chain acyl-CoA synthetase (MACS or ACSM)	NA|1310aa|down_7|NZ_CP004023.1_3961874_3965804_+	PRK11809, putA, trifunctional transcriptional regulator/proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase; Reviewed	NA|380aa|down_8|NZ_CP004023.1_3965888_3967028_+	cd06342, PBP1_ABC_LIVBP-like, type 1 periplasmic ligand-binding domain of ABC (Atpase Binding Cassette)-type active transport systems involved in the transport of all three branched chain aliphatic amino acids (leucine, isoleucine and valine)	NA|516aa|down_9|NZ_CP004023.1_3967275_3968823_-	cd17631, FACL_FadD13-like, fatty acyl-CoA synthetase, including FadD13
GCF_000520895.1_ASM52089v1	NZ_CP004024	Burkholderia pseudomallei MSHR511 chromosome 2, complete sequence	1	345451-345584	1	CRISPRCasFinder	no		cas3,csa3,DinG	Orphan	GCATCCCGATGATCGGGTATTCCG	24	0	0	NA	NA	NA	2	2	Orphan	WYL,cas3,DEDDh,csa3,DinG	NA|114aa|up_5|NZ_CP004024.1_335872_336214_+,NA|197aa|down_3|NZ_CP004024.1_351358_351949_-	NA|377aa|up_9|NZ_CP004024.1_332290_333421_+	cd00342, gram_neg_porins, Porins form aqueous channels for the diffusion of small hydrophillic molecules across the outer membrane	NA|145aa|up_8|NZ_CP004024.1_333827_334262_+	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|218aa|up_7|NZ_CP004024.1_334329_334983_-	cd02142, McbC_SagB-like_oxidoreductase, oxidase similar to the microcin B17 processing protein McbC	NA|150aa|up_6|NZ_CP004024.1_335426_335876_+	cd17775, CBS_pair_bact_arch, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains  present in bacteria and archaea	NA|114aa|up_5|NZ_CP004024.1_335872_336214_+	NA	NA|606aa|up_4|NZ_CP004024.1_336342_338160_-	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|344aa|up_3|NZ_CP004024.1_338156_339188_-	PRK07877, PRK07877, Rv1355c family protein	NA|604aa|up_2|NZ_CP004024.1_339719_341531_-	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|614aa|up_1|NZ_CP004024.1_341605_343447_-	PRK06948, PRK06948, ribonucleotide reductase-like protein; Provisional	NA|422aa|up_0|NZ_CP004024.1_343655_344921_-	TIGR01849, unnamed_protein_product, polyhydroxyalkanoate depolymerase, intracellular	NA|132aa|down_0|NZ_CP004024.1_346309_346705_+	COG1734, DksA, DnaK suppressor protein [Signal transduction mechanisms]	NA|758aa|down_1|NZ_CP004024.1_346837_349111_+	pfam07992, Pyr_redox_2, Pyridine nucleotide-disulphide oxidoreductase	NA|633aa|down_2|NZ_CP004024.1_349400_351299_-	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|197aa|down_3|NZ_CP004024.1_351358_351949_-	NA	NA|312aa|down_4|NZ_CP004024.1_351964_352900_-	cd08417, PBP2_Nitroaromatics_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators that involved in the catabolism of nitroaromatic/naphthalene compounds and that of related regulators; contains the type 2 periplasmic binding fold	NA|372aa|down_5|NZ_CP004024.1_353197_354313_+	pfam10604, Polyketide_cyc2, Polyketide cyclase / dehydrase and lipid transport	NA|147aa|down_6|NZ_CP004024.1_354614_355055_-	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|411aa|down_7|NZ_CP004024.1_356307_357540_+	COG1071, AcoA, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit [Energy production and conversion]	NA|348aa|down_8|NZ_CP004024.1_357545_358589_+	COG0022, AcoB, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [Energy production and conversion]	NA|484aa|down_9|NZ_CP004024.1_358590_360042_+	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed
GCF_000520895.1_ASM52089v1	NZ_CP004024	Burkholderia pseudomallei MSHR511 chromosome 2, complete sequence	2	791804-791896	2	CRISPRCasFinder	no		cas3,csa3,DinG	Orphan	TCGCCGCCGCGCGCGACGACGCGC	24	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,DEDDh,csa3,DinG	NA,NA|172aa|down_3|NZ_CP004024.1_796499_797015_+,NA|170aa|down_4|NZ_CP004024.1_797072_797582_-	NA|216aa|up_9|NZ_CP004024.1_780291_780939_+	COG2823, OsmY, Predicted periplasmic or secreted lipoprotein [General function prediction only]	NA|457aa|up_8|NZ_CP004024.1_780953_782324_+	cd01164, FruK_PfkB_like, 1-phosphofructokinase (FruK), minor 6-phosphofructokinase (pfkB) and related sugar kinases	NA|393aa|up_7|NZ_CP004024.1_783041_784220_-	PRK07058, PRK07058, acetate/propionate family kinase	NA|468aa|up_6|NZ_CP004024.1_784216_785620_-	PRK08190, PRK08190, bifunctional enoyl-CoA hydratase/phosphate acetyltransferase; Validated	NA|598aa|up_5|NZ_CP004024.1_785630_787424_-	TIGR01838, Poly-beta-hydroxybutyrate_polymerase, poly(R)-hydroxyalkanoic acid synthase, class I	NA|547aa|up_4|NZ_CP004024.1_787784_789425_+	PRK12597, PRK12597, F0F1 ATP synthase subunit beta; Provisional	NA|152aa|up_3|NZ_CP004024.1_789414_789870_+	PRK13447, PRK13447, F0F1 ATP synthase subunit epsilon; Provisional	NA|103aa|up_2|NZ_CP004024.1_790339_790648_+	TIGR03165, F1F0_chp_2, F1/F0 ATPase, Methanosarcina type, subunit 2	NA|232aa|up_1|NZ_CP004024.1_790644_791340_+	PRK13421, PRK13421, F0F1 ATP synthase subunit A; Provisional	NA|83aa|up_0|NZ_CP004024.1_791336_791585_+	PRK13468, PRK13468, F0F1 ATP synthase subunit C; Provisional	NA|669aa|down_0|NZ_CP004024.1_792331_794338_+	PRK13343, PRK13343, F0F1 ATP synthase subunit alpha; Provisional	NA|283aa|down_1|NZ_CP004024.1_794334_795183_+	pfam00231, ATP-synt, ATP synthase	NA|342aa|down_2|NZ_CP004024.1_795409_796435_+	cd08297, CAD3, Cinnamyl alcohol dehydrogenases (CAD)	NA|172aa|down_3|NZ_CP004024.1_796499_797015_+	NA	NA|170aa|down_4|NZ_CP004024.1_797072_797582_-	NA	NA|522aa|down_5|NZ_CP004024.1_798122_799688_+	PRK15048, PRK15048, methyl-accepting chemotaxis protein II; Provisional	NA|612aa|down_6|NZ_CP004024.1_800073_801909_-	PRK13557, PRK13557, histidine kinase; Provisional	NA|375aa|down_7|NZ_CP004024.1_802058_803183_-	COG0842, COG0842, ABC-type multidrug transport system, permease component [Defense mechanisms]	NA|1092aa|down_8|NZ_CP004024.1_803185_806461_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|358aa|down_9|NZ_CP004024.1_806457_807531_-	PRK03598, PRK03598, putative efflux pump membrane fusion protein; Provisional
