assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_004136315.1_ASM413631v1	CP029562	Mesorhizobium sp. Pch-S chromosome, complete genome	3	1497623-1497733	3	CRISPRCasFinder	no		cas3,csa3,DEDDh,WYL	Orphan	GGCGCTTCTATCTCCCCCCTTGCGGGGGAG	30	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,WYL	NA|191aa|up_4|CP029562.1_1493069_1493642_+,NA|64aa|up_2|CP029562.1_1494694_1494886_-,NA|296aa|down_0|CP029562.1_1498351_1499239_+,NA|161aa|down_3|CP029562.1_1502556_1503039_-,NA|116aa|down_6|CP029562.1_1506567_1506915_-	NA|132aa|up_9|CP029562.1_1487111_1487507_-	COG3727, Vsr, DNA G:T-mismatch repair endonuclease [DNA replication, recombination, and repair]	NA|251aa|up_8|CP029562.1_1487538_1488291_-	pfam18737, HEPN_MAE_28990, MAE_28990/MAE_18760-like HEPN	NA|359aa|up_7|CP029562.1_1488287_1489364_-	pfam03235, DUF262, Protein of unknown function DUF262	NA|369aa|up_6|CP029562.1_1489385_1490492_-	COG0270, Dcm, Site-specific DNA methylase [DNA replication, recombination, and repair]	NA|560aa|up_5|CP029562.1_1491089_1492769_+	pfam13589, HATPase_c_3, Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase	NA|191aa|up_4|CP029562.1_1493069_1493642_+	NA	NA|301aa|up_3|CP029562.1_1493691_1494594_-	cd08480, PBP2_CrgA_like_10, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator CrgA-like, contains the type 2 periplasmic binding fold	NA|64aa|up_2|CP029562.1_1494694_1494886_-	NA	NA|275aa|up_1|CP029562.1_1494977_1495802_+	cd19140, AKR_AKR3F3, Sinorhizobium meliloti isatin reductase and similar proteins	NA|405aa|up_0|CP029562.1_1496318_1497533_+	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]	NA|296aa|down_0|CP029562.1_1498351_1499239_+	NA	NA|750aa|down_1|CP029562.1_1499264_1501514_+	cd07473, Peptidases_S8_Subtilisin_like, Peptidase S8 family domain in Subtilisin-like proteins	NA|338aa|down_2|CP029562.1_1501488_1502502_+	cd04280, ZnMc_astacin_like, Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site	NA|161aa|down_3|CP029562.1_1502556_1503039_-	NA	NA|451aa|down_4|CP029562.1_1503331_1504684_-	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|583aa|down_5|CP029562.1_1504680_1506429_-	COG4618, ArpD, ABC-type protease/lipase transport system, ATPase and permease components [General function prediction only]	NA|116aa|down_6|CP029562.1_1506567_1506915_-	NA	NA|1934aa|down_7|CP029562.1_1506908_1512710_-	pfam17210, SdrD_B, SdrD B-like domain	NA|379aa|down_8|CP029562.1_1513046_1514183_+	COG2771, CsgD, DNA-binding HTH domain-containing proteins [Transcription]	NA|287aa|down_9|CP029562.1_1514735_1515596_+	COG1319, CoxM, Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs [Energy production and conversion]
GCA_004136315.1_ASM413631v1	CP029562	Mesorhizobium sp. Pch-S chromosome, complete genome	4	1518730-1518833	4	CRISPRCasFinder	no		cas3,csa3,DEDDh,WYL	Orphan	TTCACCCAACGCCCCTCATCCTGAGG	26	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,WYL	NA|161aa|up_9|CP029562.1_1502556_1503039_-,NA|116aa|up_6|CP029562.1_1506567_1506915_-,NA	NA|161aa|up_9|CP029562.1_1502556_1503039_-	NA	NA|451aa|up_8|CP029562.1_1503331_1504684_-	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|583aa|up_7|CP029562.1_1504680_1506429_-	COG4618, ArpD, ABC-type protease/lipase transport system, ATPase and permease components [General function prediction only]	NA|116aa|up_6|CP029562.1_1506567_1506915_-	NA	NA|1934aa|up_5|CP029562.1_1506908_1512710_-	pfam17210, SdrD_B, SdrD B-like domain	NA|379aa|up_4|CP029562.1_1513046_1514183_+	COG2771, CsgD, DNA-binding HTH domain-containing proteins [Transcription]	NA|287aa|up_3|CP029562.1_1514735_1515596_+	COG1319, CoxM, Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs [Energy production and conversion]	NA|158aa|up_2|CP029562.1_1515595_1516069_+	COG2080, CoxS, Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]	NA|755aa|up_1|CP029562.1_1516073_1518338_+	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|82aa|up_0|CP029562.1_1518337_1518583_+	cd17040, Ubl_MoaD_like, ubiquitin-like (Ubl) domain found in a group of small sulfide carrier proteins	NA|233aa|down_0|CP029562.1_1519058_1519757_+	pfam05988, DUF899, Bacterial protein of unknown function (DUF899)	NA|469aa|down_1|CP029562.1_1519807_1521214_-	TIGR00937, Chromate_transport_protein, chromate transporter, chromate ion transporter (CHR) family	NA|275aa|down_2|CP029562.1_1521210_1522035_-	COG4275, COG4275, Uncharacterized conserved protein [Function unknown]	NA|243aa|down_3|CP029562.1_1522154_1522883_-	COG2215, COG2215, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|435aa|down_4|CP029562.1_1522933_1524238_-	TIGR00900, multidrug_transporter, H+ Antiporter protein	NA|92aa|down_5|CP029562.1_1524237_1524513_-	cd10154, NreA-like_DUF156, Alcaligenes xylosoxidans NreA and related domains; this domain family was previously known as part of DUF156	NA|69aa|down_6|CP029562.1_1524674_1524881_+	pfam11903, ParD_like, ParD-like antitoxin of type II bacterial toxin-antitoxin system	NA|256aa|down_7|CP029562.1_1524984_1525752_+	PRK05716, PRK05716, methionine aminopeptidase; Validated	NA|448aa|down_8|CP029562.1_1525974_1527318_+	cd01034, EriC_like, ClC chloride channel family	NA|295aa|down_9|CP029562.1_1527445_1528330_-	COG3449, COG3449, DNA gyrase inhibitor [DNA replication, recombination, and repair]
GCA_004136315.1_ASM413631v1	CP029562	Mesorhizobium sp. Pch-S chromosome, complete genome	5	1614320-1614403	5	CRISPRCasFinder	no	csa3	cas3,csa3,DEDDh,WYL	Type I-A	GGAAACGGAAACGCGCTGGTGAAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,WYL	NA|100aa|up_9|CP029562.1_1605978_1606278_-,NA|94aa|up_7|CP029562.1_1607861_1608143_-,NA|60aa|up_6|CP029562.1_1608474_1608654_-,NA|78aa|up_5|CP029562.1_1609470_1609704_-,NA|86aa|up_2|CP029562.1_1610882_1611140_-,NA|77aa|up_0|CP029562.1_1613991_1614222_-,NA|69aa|down_1|CP029562.1_1614963_1615170_+	NA|100aa|up_9|CP029562.1_1605978_1606278_-	NA	NA|249aa|up_8|CP029562.1_1606821_1607568_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|94aa|up_7|CP029562.1_1607861_1608143_-	NA	NA|60aa|up_6|CP029562.1_1608474_1608654_-	NA	NA|78aa|up_5|CP029562.1_1609470_1609704_-	NA	NA|107aa|up_4|CP029562.1_1610153_1610474_+	pfam11154, DUF2934, Protein of unknown function (DUF2934)	NA|122aa|up_3|CP029562.1_1610520_1610886_-	pfam06169, DUF982, Protein of unknown function (DUF982)	NA|86aa|up_2|CP029562.1_1610882_1611140_-	NA	NA|774aa|up_1|CP029562.1_1611445_1613767_-	TIGR03074, PQQ_membr_DH, membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family	NA|77aa|up_0|CP029562.1_1613991_1614222_-	NA	NA|70aa|down_0|CP029562.1_1614498_1614708_-	COG1278, CspC, Cold shock proteins [Transcription]	NA|69aa|down_1|CP029562.1_1614963_1615170_+	NA	NA|233aa|down_2|CP029562.1_1615234_1615933_-	COG2188, PhnF, Transcriptional regulators [Transcription]	NA|310aa|down_3|CP029562.1_1615929_1616859_-	COG1172, AraH, Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components [Carbohydrate transport and metabolism]	NA|503aa|down_4|CP029562.1_1616855_1618364_-	COG1129, MglA, ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism]	NA|303aa|down_5|CP029562.1_1618427_1619336_-	cd01536, PBP1_ABC_sugar_binding-like, periplasmic sugar-binding domain of active transport systems that are members of the type 1 periplasmic binding protein (PBP1) superfamily	NA|482aa|down_6|CP029562.1_1619509_1620955_-	cd07805, FGGY_XK_like_2, uncharacterized xylulose kinase-like proteins; a subgroup of the FGGY family of carbohydrate kinases	NA|328aa|down_7|CP029562.1_1621327_1622311_-	smart00342, HTH_ARAC, helix_turn_helix, arabinose operon control protein	NA|417aa|down_8|CP029562.1_1622505_1623756_+	PRK04346, PRK04346, tryptophan synthase subunit beta; Validated	NA|272aa|down_9|CP029562.1_1623757_1624573_+	PRK13111, trpA, tryptophan synthase subunit alpha; Provisional
GCA_004136315.1_ASM413631v1	CP029562	Mesorhizobium sp. Pch-S chromosome, complete genome	6	4421204-4421289	6	CRISPRCasFinder	no		cas3,csa3,DEDDh,WYL	Orphan	CAGAGCAATTCCAGGAAAAGTGCGT	25	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,WYL	NA,NA|133aa|down_1|CP029562.1_4424463_4424862_+,NA|65aa|down_3|CP029562.1_4425910_4426105_-,NA|130aa|down_6|CP029562.1_4427903_4428293_-	NA|201aa|up_9|CP029562.1_4413063_4413666_-	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|142aa|up_8|CP029562.1_4413734_4414160_+	cd08359, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|306aa|up_7|CP029562.1_4414281_4415199_+	cd11587, Arginase-like, Arginase types I and II and arginase-like family	NA|121aa|up_6|CP029562.1_4415204_4415567_-	pfam01638, HxlR, HxlR-like helix-turn-helix	NA|338aa|up_5|CP029562.1_4415674_4416688_+	cd08252, AL_MDR, Arginate lyase and other MDR family members	NA|318aa|up_4|CP029562.1_4416690_4417644_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|290aa|up_3|CP029562.1_4417812_4418682_+	COG0265, DegQ, Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [Posttranslational modification, protein turnover, chaperones]	NA|195aa|up_2|CP029562.1_4418690_4419275_+	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|289aa|up_1|CP029562.1_4419472_4420339_+	PRK08278, PRK08278, SDR family oxidoreductase	NA|174aa|up_0|CP029562.1_4420361_4420883_-	PRK05208, PRK05208, hypothetical protein; Provisional	NA|150aa|down_0|CP029562.1_4423818_4424268_+	PRK05170, PRK05170, YcgN family cysteine cluster protein	NA|133aa|down_1|CP029562.1_4424463_4424862_+	NA	NA|237aa|down_2|CP029562.1_4424970_4425681_-	COG2968, COG2968, Uncharacterized conserved protein [Function unknown]	NA|65aa|down_3|CP029562.1_4425910_4426105_-	NA	NA|200aa|down_4|CP029562.1_4426336_4426936_-	PRK03767, PRK03767, NAD(P)H:quinone oxidoreductase; Provisional	NA|124aa|down_5|CP029562.1_4427174_4427546_+	COG4766, EutQ, Ethanolamine utilization protein [Amino acid transport and metabolism]	NA|130aa|down_6|CP029562.1_4427903_4428293_-	NA	NA|165aa|down_7|CP029562.1_4428336_4428831_-	COG3631, COG3631, Ketosteroid isomerase-related protein [General function prediction only]	NA|138aa|down_8|CP029562.1_4428937_4429351_-	cd02228, cupin_EutQ, Clostridium difficile EutQ and related proteins, cupin domain	NA|299aa|down_9|CP029562.1_4429515_4430412_+	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain
GCA_004136315.1_ASM413631v1	CP029562	Mesorhizobium sp. Pch-S chromosome, complete genome	7	4809401-4809547	7	CRISPRCasFinder	no		cas3,csa3,DEDDh,WYL	Orphan	GAGCAATTCCAGGAAAAGTGTGGAACGGTTTTCCGCCCGGAATTG	45	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,WYL	NA,NA|103aa|down_4|CP029562.1_4812760_4813069_+,NA|394aa|down_5|CP029562.1_4813162_4814344_-,NA|94aa|down_6|CP029562.1_4814727_4815009_+,NA|177aa|down_8|CP029562.1_4817422_4817953_+,NA|105aa|down_9|CP029562.1_4817945_4818260_+	NA|436aa|up_9|CP029562.1_4796160_4797468_+	COG0849, ftsA, Cell division ATPase FtsA [Cell division and chromosome partitioning]	NA|562aa|up_8|CP029562.1_4797540_4799226_+	PRK09330, PRK09330, cell division protein FtsZ; Validated	NA|315aa|up_7|CP029562.1_4799495_4800440_+	PRK13186, lpxC, UDP-3-O-acyl-N-acetylglucosamine deacetylase	NA|291aa|up_6|CP029562.1_4800639_4801512_+	COG4105, ComL, DNA uptake lipoprotein [General function prediction only]	NA|557aa|up_5|CP029562.1_4801572_4803243_+	COG0497, RecN, ATPase involved in DNA repair [DNA replication, recombination, and repair]	NA|738aa|up_4|CP029562.1_4803341_4805555_+	COG0272, Lig, NAD-dependent DNA ligase (contains BRCT domain type II) [DNA replication, recombination, and repair]	NA|234aa|up_3|CP029562.1_4805554_4806256_+	COG5587, COG5587, Uncharacterized conserved protein [Function unknown]	NA|251aa|up_2|CP029562.1_4806427_4807180_+	COG1296, AzlC, Predicted branched-chain amino acid permease (azaleucine resistance) [Amino acid transport and metabolism]	NA|99aa|up_1|CP029562.1_4807176_4807473_+	COG4541, COG4541, Predicted membrane protein [Function unknown]	NA|609aa|up_0|CP029562.1_4807498_4809325_-	cd01085, APP, X-Prolyl Aminopeptidase 2	NA|291aa|down_0|CP029562.1_4809610_4810483_-	COG2264, PrmA, Ribosomal protein L11 methylase [Translation, ribosomal structure and biogenesis]	NA|193aa|down_1|CP029562.1_4810521_4811100_-	pfam02630, SCO1-SenC, SCO1/SenC	NA|168aa|down_2|CP029562.1_4811310_4811814_+	COG3045, CreA, Uncharacterized protein conserved in bacteria [Function unknown]	NA|213aa|down_3|CP029562.1_4811855_4812494_+	TIGR03308, phn_thr-fam, phosphonate metabolism protein, transferase hexapeptide repeat family	NA|103aa|down_4|CP029562.1_4812760_4813069_+	NA	NA|394aa|down_5|CP029562.1_4813162_4814344_-	NA	NA|94aa|down_6|CP029562.1_4814727_4815009_+	NA	NA|213aa|down_7|CP029562.1_4816787_4817426_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|177aa|down_8|CP029562.1_4817422_4817953_+	NA	NA|105aa|down_9|CP029562.1_4817945_4818260_+	NA
