assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002752675.1_ASM275267v1	NZ_CP024608	Massilia violaceinigra strain B2 chromosome	1	182596-182701	1	CRISPRCasFinder	no		csa3,RT,DEDDh,DinG,WYL,cas3	Orphan	TTTATTGCTAGTCAGAGCTCTGACCCCGGT	30	0	0	NA	NA	NA	1	1	Orphan	csa3,RT,DEDDh,DinG,WYL,cas3,PD-DExK	NA,NA|162aa|down_4|NZ_CP024608.1_192871_193357_+,NA|231aa|down_7|NZ_CP024608.1_196292_196985_+,NA|162aa|down_9|NZ_CP024608.1_197494_197980_-	NA|838aa|up_9|NZ_CP024608.1_167126_169640_-	cd04277, ZnMc_serralysin_like, Zinc-dependent metalloprotease, serralysin_like subfamily	NA|819aa|up_8|NZ_CP024608.1_169904_172361_-	cd04277, ZnMc_serralysin_like, Zinc-dependent metalloprotease, serralysin_like subfamily	NA|147aa|up_7|NZ_CP024608.1_172526_172967_-	pfam17200, sCache_2, Single Cache domain 2	NA|687aa|up_6|NZ_CP024608.1_173088_175149_-	TIGR01074, ATP-dependent_DNA_helicase_Rep, ATP-dependent DNA helicase Rep	NA|331aa|up_5|NZ_CP024608.1_175280_176273_-	cd08252, AL_MDR, Arginate lyase and other MDR family members	NA|307aa|up_4|NZ_CP024608.1_176373_177294_+	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain	NA|179aa|up_3|NZ_CP024608.1_177290_177827_-	pfam06210, DUF1003, Protein of unknown function (DUF1003)	NA|175aa|up_2|NZ_CP024608.1_177971_178496_+	cd08863, SRPBCC_DUF1857, DUF1857, an uncharacterized ligand-binding domain of the SRPBCC domain superfamily	NA|481aa|up_1|NZ_CP024608.1_179878_181321_-	PRK06292, PRK06292, dihydrolipoamide dehydrogenase; Validated	NA|298aa|up_0|NZ_CP024608.1_181495_182389_+	PRK03635, PRK03635, ArgP/LysG family DNA-binding transcriptional regulator	NA|674aa|down_0|NZ_CP024608.1_182715_184737_-	PRK05580, PRK05580, primosome assembly protein PriA; Validated	NA|362aa|down_1|NZ_CP024608.1_185489_186575_-	PRK00115, hemE, uroporphyrinogen decarboxylase; Validated	NA|650aa|down_2|NZ_CP024608.1_186706_188656_+	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|1372aa|down_3|NZ_CP024608.1_188676_192792_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|162aa|down_4|NZ_CP024608.1_192871_193357_+	NA	NA|626aa|down_5|NZ_CP024608.1_193472_195350_+	pfam05960, DUF885, Bacterial protein of unknown function (DUF885)	NA|264aa|down_6|NZ_CP024608.1_195360_196152_+	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|231aa|down_7|NZ_CP024608.1_196292_196985_+	NA	NA|188aa|down_8|NZ_CP024608.1_196938_197502_-	COG2318, DinB, Uncharacterized protein conserved in bacteria [Function unknown]	NA|162aa|down_9|NZ_CP024608.1_197494_197980_-	NA
GCF_002752675.1_ASM275267v1	NZ_CP024608	Massilia violaceinigra strain B2 chromosome	2	2130610-2131369	1	CRT	no	DEDDh	csa3,RT,DEDDh,DinG,WYL,cas3	Unclear	GTGCACCATGCCGNCTTGG	19	2	2	2131019-2131038|2131097-2131116	NZ_CP024608.1_5178465-5178446|NZ_CP024608.1_4287762-4287781	NA	18	18	Orphan	csa3,RT,DEDDh,DinG,WYL,cas3,PD-DExK	NA|68aa|up_2|NZ_CP024608.1_2129230_2129434_+,NA|91aa|up_1|NZ_CP024608.1_2129555_2129828_+,NA|89aa|up_0|NZ_CP024608.1_2130014_2130281_+,NA|168aa|down_3|NZ_CP024608.1_2136401_2136905_+,NA|260aa|down_4|NZ_CP024608.1_2136907_2137687_+,NA|291aa|down_5|NZ_CP024608.1_2137704_2138577_+	NA|273aa|up_9|NZ_CP024608.1_2122833_2123652_+	cd01144, BtuF, Cobalamin binding protein BtuF	NA|122aa|up_8|NZ_CP024608.1_2123749_2124115_-	cd00552, RaiA, RaiA ("ribosome-associated inhibitor A", also known as Protein Y (PY), YfiA, and SpotY,  is a stress-response protein that binds the ribosomal subunit interface and arrests translation by interfering with aminoacyl-tRNA binding to the ribosomal A site	NA|165aa|up_7|NZ_CP024608.1_2124277_2124772_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|424aa|up_6|NZ_CP024608.1_2124768_2126040_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|204aa|up_5|NZ_CP024608.1_2126161_2126773_+	cd07988, LPLAT_ABO13168-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: Unknown ABO13168	NA|341aa|up_4|NZ_CP024608.1_2127093_2128116_-	PRK07609, PRK07609, CDP-6-deoxy-delta-3,4-glucoseen reductase; Validated	NA|296aa|up_3|NZ_CP024608.1_2128320_2129208_+	cd05266, SDR_a4, atypical (a) SDRs, subgroup 4	NA|68aa|up_2|NZ_CP024608.1_2129230_2129434_+	NA	NA|91aa|up_1|NZ_CP024608.1_2129555_2129828_+	NA	NA|89aa|up_0|NZ_CP024608.1_2130014_2130281_+	NA	NA|599aa|down_0|NZ_CP024608.1_2132044_2133841_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|586aa|down_1|NZ_CP024608.1_2134088_2135846_-	PRK05347, PRK05347, glutaminyl-tRNA synthetase; Provisional	NA|97aa|down_2|NZ_CP024608.1_2136031_2136322_+	TIGR02349, Chaperone_protein_DnaJ, chaperone protein DnaJ	NA|168aa|down_3|NZ_CP024608.1_2136401_2136905_+	NA	NA|260aa|down_4|NZ_CP024608.1_2136907_2137687_+	NA	NA|291aa|down_5|NZ_CP024608.1_2137704_2138577_+	NA	NA|299aa|down_6|NZ_CP024608.1_2138875_2139772_-	cd08414, PBP2_LTTR_aromatics_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators involved in the catabolism of aromatic compounds and that of other related regulators, contains type 2 periplasmic binding fold	NA|405aa|down_7|NZ_CP024608.1_2139877_2141092_+	cd17324, MFS_NepI_like, Purine ribonucleoside efflux pump NepI and similar transporters of the Major Facilitator Superfamily	NA|217aa|down_8|NZ_CP024608.1_2141203_2141854_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|117aa|down_9|NZ_CP024608.1_2142397_2142748_-	pfam09951, DUF2185, Protein of unknown function (DUF2185)
GCF_002752675.1_ASM275267v1	NZ_CP024608	Massilia violaceinigra strain B2 chromosome	3	2697941-2698032	2	CRISPRCasFinder	no		csa3,RT,DEDDh,DinG,WYL,cas3	Orphan	CTAATTAAAGCTCTGGCCCCGGTT	24	0	0	NA	NA	NA	1	1	Orphan	csa3,RT,DEDDh,DinG,WYL,cas3,PD-DExK	NA|109aa|up_0|NZ_CP024608.1_2697582_2697909_+,NA|287aa|down_2|NZ_CP024608.1_2701331_2702192_+,NA|733aa|down_8|NZ_CP024608.1_2717743_2719942_-	NA|106aa|up_9|NZ_CP024608.1_2688407_2688725_+	COG2863, COG2863, Cytochrome c553 [Energy production and conversion]	NA|114aa|up_8|NZ_CP024608.1_2688735_2689077_+	COG2863, COG2863, Cytochrome c553 [Energy production and conversion]	NA|388aa|up_7|NZ_CP024608.1_2689556_2690720_+	smart00342, HTH_ARAC, helix_turn_helix, arabinose operon control protein	NA|492aa|up_6|NZ_CP024608.1_2690770_2692246_+	cd01299, Met_dep_hydrolase_A, Metallo-dependent hydrolases, subgroup A is part of the superfamily of metallo-dependent hydrolases, a large group of proteins that show conservation in their 3-dimensional fold (TIM barrel) and in details of their active site	NA|342aa|up_5|NZ_CP024608.1_2692488_2693514_-	pfam05598, DUF772, Transposase domain (DUF772)	NA|142aa|up_4|NZ_CP024608.1_2693611_2694037_-	pfam08897, DUF1841, Domain of unknown function (DUF1841)	NA|203aa|up_3|NZ_CP024608.1_2694218_2694827_-	PRK00714, PRK00714, RNA pyrophosphohydrolase; Reviewed	NA|578aa|up_2|NZ_CP024608.1_2695030_2696764_+	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|202aa|up_1|NZ_CP024608.1_2696775_2697381_+	cd00254, LT-like, lytic transglycosylase(LT)-like domain	NA|109aa|up_0|NZ_CP024608.1_2697582_2697909_+	NA	NA|478aa|down_0|NZ_CP024608.1_2698152_2699586_+	COG1785, PhoA, Alkaline phosphatase [Inorganic ion transport and metabolism]	NA|462aa|down_1|NZ_CP024608.1_2699612_2700998_+	COG1785, PhoA, Alkaline phosphatase [Inorganic ion transport and metabolism]	NA|287aa|down_2|NZ_CP024608.1_2701331_2702192_+	NA	NA|3016aa|down_3|NZ_CP024608.1_2702188_2711236_+	smart00736, CADG, Dystroglycan-type cadherin-like domains	NA|138aa|down_4|NZ_CP024608.1_2711356_2711770_-	cd10567, SWIB-MDM2_like, SWIB/MDM2 domain found in SWIB/MDM2 homologous proteins	NA|457aa|down_5|NZ_CP024608.1_2712052_2713423_-	PRK10867, PRK10867, signal recognition particle protein; Provisional	NA|632aa|down_6|NZ_CP024608.1_2713543_2715439_-	cd14953, NHL_like_1, Uncharacterized NHL-repeat domain in bacterial proteins	NA|655aa|down_7|NZ_CP024608.1_2715646_2717611_-	cd14953, NHL_like_1, Uncharacterized NHL-repeat domain in bacterial proteins	NA|733aa|down_8|NZ_CP024608.1_2717743_2719942_-	NA	NA|325aa|down_9|NZ_CP024608.1_2720152_2721127_+	cd13553, PBP2_NrtA_CpmA_like, Substrate binding domain of ABC-type nitrate/bicarbonate transporters, a member of the type 2 periplasmic binding fold superfamily
GCF_002752675.1_ASM275267v1	NZ_CP024608	Massilia violaceinigra strain B2 chromosome	4	2737721-2737917	3	CRISPRCasFinder	no		csa3,RT,DEDDh,DinG,WYL,cas3	Orphan	GGCGGCAGGAACTGGGCCGGCTCGGCGACTTCGGGTTCGGCATGAAC	47	0	0	NA	NA	NA	1	1	Orphan	csa3,RT,DEDDh,DinG,WYL,cas3,PD-DExK	NA|299aa|up_6|NZ_CP024608.1_2733571_2734468_-,NA|74aa|up_5|NZ_CP024608.1_2734440_2734662_-,NA|97aa|down_0|NZ_CP024608.1_2738044_2738335_+,NA|211aa|down_6|NZ_CP024608.1_2743109_2743742_-	NA|482aa|up_9|NZ_CP024608.1_2725668_2727114_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|987aa|up_8|NZ_CP024608.1_2727690_2730651_+	PRK07207, PRK07207, ribonucleoside-diphosphate reductase subunit alpha	NA|375aa|up_7|NZ_CP024608.1_2730747_2731872_+	PRK07209, PRK07209, ribonucleotide-diphosphate reductase subunit beta; Validated	NA|299aa|up_6|NZ_CP024608.1_2733571_2734468_-	NA	NA|74aa|up_5|NZ_CP024608.1_2734440_2734662_-	NA	NA|177aa|up_4|NZ_CP024608.1_2734769_2735300_+	pfam02325, YGGT, YGGT family	NA|100aa|up_3|NZ_CP024608.1_2735296_2735596_+	pfam02594, DUF167, Uncharacterized ACR, YggU family COG1872	NA|76aa|up_2|NZ_CP024608.1_2735657_2735885_+	pfam02594, DUF167, Uncharacterized ACR, YggU family COG1872	NA|313aa|up_1|NZ_CP024608.1_2735965_2736904_-	cd01942, ribokinase_group_A, Ribokinase-like subgroup A	NA|152aa|up_0|NZ_CP024608.1_2736912_2737368_-	pfam11906, DUF3426, Protein of unknown function (DUF3426)	NA|97aa|down_0|NZ_CP024608.1_2738044_2738335_+	NA	NA|316aa|down_1|NZ_CP024608.1_2739055_2740003_-	pfam06325, PrmA, Ribosomal protein L11 methyltransferase (PrmA)	NA|467aa|down_2|NZ_CP024608.1_2739999_2741400_-	PRK08591, PRK08591, acetyl-CoA carboxylase biotin carboxylase subunit; Validated	NA|155aa|down_3|NZ_CP024608.1_2741409_2741874_-	PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier protein	NA|146aa|down_4|NZ_CP024608.1_2741967_2742405_-	PRK05395, PRK05395, type II 3-dehydroquinate dehydratase	NA|190aa|down_5|NZ_CP024608.1_2742543_2743113_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|211aa|down_6|NZ_CP024608.1_2743109_2743742_-	NA	NA|460aa|down_7|NZ_CP024608.1_2743825_2745205_+	TIGR01081, mpl, UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase	NA|196aa|down_8|NZ_CP024608.1_2745201_2745789_+	COG3150, COG3150, Predicted esterase [General function prediction only]	NA|200aa|down_9|NZ_CP024608.1_2745785_2746385_+	COG3161, UbiC, 4-hydroxybenzoate synthetase (chorismate lyase) [Coenzyme metabolism]
GCF_002752675.1_ASM275267v1	NZ_CP024608	Massilia violaceinigra strain B2 chromosome	5	4929947-4930036	4	CRISPRCasFinder	no		csa3,RT,DEDDh,DinG,WYL,cas3	Orphan	CGAAAGGGCGGAGCCTGCCCGGA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,RT,DEDDh,DinG,WYL,cas3,PD-DExK	NA|129aa|up_9|NZ_CP024608.1_4915926_4916313_-,NA|51aa|up_1|NZ_CP024608.1_4928640_4928793_+,NA|442aa|down_4|NZ_CP024608.1_4948115_4949441_+	NA|129aa|up_9|NZ_CP024608.1_4915926_4916313_-	NA	NA|683aa|up_8|NZ_CP024608.1_4916426_4918475_-	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|357aa|up_7|NZ_CP024608.1_4919288_4920359_+	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|253aa|up_6|NZ_CP024608.1_4920725_4921484_+	cd13653, PBP2_phosphate_like_1, Substrate binding domain of putative ABC-type phosphate transporter, a member of the type 2 periplasmic binding fold superfamily	NA|437aa|up_5|NZ_CP024608.1_4923873_4925184_-	pfam13701, DDE_Tnp_1_4, Transposase DDE domain group 1	NA|351aa|up_4|NZ_CP024608.1_4925622_4926675_+	cd03141, GATase1_Hsp31_like, Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Escherichia coli Hsp31 protein	NA|227aa|up_3|NZ_CP024608.1_4926683_4927364_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|412aa|up_2|NZ_CP024608.1_4927393_4928629_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|51aa|up_1|NZ_CP024608.1_4928640_4928793_+	NA	NA|168aa|up_0|NZ_CP024608.1_4929312_4929816_+	cd09893, NGN_SP_TaA, N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), TaA	NA|3594aa|down_0|NZ_CP024608.1_4930332_4941114_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|1139aa|down_1|NZ_CP024608.1_4941136_4944553_+	cd00833, PKS, polyketide synthases (PKSs) polymerize simple fatty acids into a large variety of different products, called polyketides, by successive decarboxylating Claisen condensations	NA|859aa|down_2|NZ_CP024608.1_4944589_4947166_+	COG1033, COG1033, Predicted exporters of the RND superfamily [General function prediction only]	NA|300aa|down_3|NZ_CP024608.1_4947203_4948103_+	cd16329, LolA_like, proteins similar to periplasmic molecular chaperone LolA, the outer membrane lipoprotein receptor LolB and the periplasmic protein RseB	NA|442aa|down_4|NZ_CP024608.1_4948115_4949441_+	NA	NA|427aa|down_5|NZ_CP024608.1_4949468_4950749_+	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|251aa|down_6|NZ_CP024608.1_4950792_4951545_+	cd08876, START_1, Uncharacterized subgroup of the steroidogenic acute regulatory protein (StAR)-related lipid transfer (START) domain family	NA|260aa|down_7|NZ_CP024608.1_4951562_4952342_+	cd08876, START_1, Uncharacterized subgroup of the steroidogenic acute regulatory protein (StAR)-related lipid transfer (START) domain family	NA|464aa|down_8|NZ_CP024608.1_4952353_4953745_+	cd04433, AFD_class_I, Adenylate forming domain, Class I, also known as the ANL superfamily	NA|246aa|down_9|NZ_CP024608.1_4953747_4954485_+	PRK05557, fabG, 3-ketoacyl-(acyl-carrier-protein) reductase; Validated
GCF_002752675.1_ASM275267v1	NZ_CP024608	Massilia violaceinigra strain B2 chromosome	6	5342282-5342391	5	CRISPRCasFinder	no	WYL	csa3,RT,DEDDh,DinG,WYL,cas3	Unclear	TCCATGGATTGCTCGTCCGACGACTC	26	0	0	NA	NA	NA	2	2	Orphan	csa3,RT,DEDDh,DinG,WYL,cas3,PD-DExK	NA|188aa|up_8|NZ_CP024608.1_5332396_5332960_+,NA|179aa|up_7|NZ_CP024608.1_5332925_5333462_-,NA|184aa|up_4|NZ_CP024608.1_5335423_5335975_+,NA|158aa|up_2|NZ_CP024608.1_5336241_5336715_-,NA|90aa|down_6|NZ_CP024608.1_5353275_5353545_-,NA|86aa|down_7|NZ_CP024608.1_5353548_5353806_-,NA|132aa|down_8|NZ_CP024608.1_5353805_5354201_-	NA|197aa|up_9|NZ_CP024608.1_5331819_5332410_+	COG3124, COG3124, Uncharacterized protein conserved in bacteria [Function unknown]	NA|188aa|up_8|NZ_CP024608.1_5332396_5332960_+	NA	NA|179aa|up_7|NZ_CP024608.1_5332925_5333462_-	NA	NA|134aa|up_6|NZ_CP024608.1_5333686_5334088_+	COG3791, COG3791, Uncharacterized conserved protein [Function unknown]	NA|246aa|up_5|NZ_CP024608.1_5334100_5334838_-	pfam04338, DUF481, Protein of unknown function, DUF481	NA|184aa|up_4|NZ_CP024608.1_5335423_5335975_+	NA	NA|54aa|up_3|NZ_CP024608.1_5335976_5336138_+	cd02419, Peptidase_C39C, A sub-family of peptidase family C39	NA|158aa|up_2|NZ_CP024608.1_5336241_5336715_-	NA	NA|751aa|up_1|NZ_CP024608.1_5336942_5339195_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|403aa|up_0|NZ_CP024608.1_5339252_5340461_+	pfam11062, DUF2863, Protein of unknown function (DUF2863)	NA|966aa|down_0|NZ_CP024608.1_5343357_5346255_-	PRK09776, PRK09776, putative diguanylate cyclase; Provisional	NA|817aa|down_1|NZ_CP024608.1_5346424_5348875_+	PRK11091, PRK11091, aerobic respiration control sensor protein ArcB; Provisional	NA|143aa|down_2|NZ_CP024608.1_5349002_5349431_+	cd03428, Ap4A_hydrolase_human_like, Diadenosine tetraphosphate (Ap4A) hydrolase is a member of the Nudix hydrolase superfamily	NA|779aa|down_3|NZ_CP024608.1_5349441_5351778_-	PRK15347, PRK15347, two component system sensor kinase	NA|180aa|down_4|NZ_CP024608.1_5352060_5352600_-	pfam14534, DUF4440, Domain of unknown function (DUF4440)	NA|103aa|down_5|NZ_CP024608.1_5352891_5353200_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|90aa|down_6|NZ_CP024608.1_5353275_5353545_-	NA	NA|86aa|down_7|NZ_CP024608.1_5353548_5353806_-	NA	NA|132aa|down_8|NZ_CP024608.1_5353805_5354201_-	NA	NA|231aa|down_9|NZ_CP024608.1_5354273_5354966_-	COG1280, RhtB, Putative threonine efflux protein [Amino acid transport and metabolism]
