assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_011046245.1_ASM1104624v1	NZ_CP049244	Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence	1	335555-335787	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	csa3	csa3,cas3HD	Type I-A	GTTTCGATCCACGCCTCCGCGAGGGAGGCGAC,GTTTCGATCCACGCCTCCGCGAGGGAGGCGAC,TGTTTCGATCCACGCCTCCGCGAGGGAGGCGAC	32,32,33	0	0	NA	NA	NA:NA:NA	3,3,2	3	Orphan	csa3,DEDDh,cas3,cas4,WYL,cas3HD	NA|109aa|up_4|NZ_CP049244.1_330892_331219_-,NA|49aa|down_2|NZ_CP049244.1_339940_340087_+	NA|286aa|up_9|NZ_CP049244.1_325890_326748_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|344aa|up_8|NZ_CP049244.1_326751_327783_+	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|307aa|up_7|NZ_CP049244.1_327779_328700_+	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|328aa|up_6|NZ_CP049244.1_328754_329738_+	cd08984, GH43-like, Glycosyl hydrolase family 43	NA|270aa|up_5|NZ_CP049244.1_329987_330797_-	pfam01226, Form_Nir_trans, Formate/nitrite transporter	NA|109aa|up_4|NZ_CP049244.1_330892_331219_-	NA	NA|833aa|up_3|NZ_CP049244.1_331221_333720_-	TIGR02891, Probable_cytochrome_c_oxidase_subunit_1-beta, cytochrome c oxidase, subunit I	NA|223aa|up_2|NZ_CP049244.1_333716_334385_-	cd04213, CuRO_CcO_Caa3_II, The cupredoxin domain of Caa3 type Cytochrome c oxidase subunit II	NA|167aa|up_1|NZ_CP049244.1_334442_334943_+	pfam09990, DUF2231, Predicted membrane protein (DUF2231)	NA|167aa|up_0|NZ_CP049244.1_334920_335421_-	pfam03653, UPF0093, Uncharacterized protein family (UPF0093)	NA|346aa|down_0|NZ_CP049244.1_336043_337081_+	COG3547, COG3547, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|379aa|down_1|NZ_CP049244.1_337313_338450_-	pfam00872, Transposase_mut, Transposase, Mutator family	NA|49aa|down_2|NZ_CP049244.1_339940_340087_+	NA	NA|649aa|down_3|NZ_CP049244.1_341301_343248_+	pfam01970, TctA, Tripartite tricarboxylate transporter TctA family	NA|358aa|down_4|NZ_CP049244.1_343261_344335_+	cd07012, PBP2_Bug_TTT, Bug (Bordetella uptake gene) protein family of periplasmic solute-binding receptors; contains the type 2 periplasmic binding fold	NA|292aa|down_5|NZ_CP049244.1_344390_345266_+	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|331aa|down_6|NZ_CP049244.1_345244_346237_-	PRK11139, PRK11139, DNA-binding transcriptional activator GcvA; Provisional	NA|317aa|down_7|NZ_CP049244.1_347757_348709_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|319aa|down_8|NZ_CP049244.1_349512_350469_-	PRK11139, PRK11139, DNA-binding transcriptional activator GcvA; Provisional	NA|658aa|down_9|NZ_CP049244.1_350599_352573_+	COG0021, TktA, Transketolase [Carbohydrate transport and metabolism]
GCF_011046245.1_ASM1104624v1	NZ_CP049244	Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence	2	409146-409316	2,2	PILER-CR,CRISPRCasFinder	no	cas3HD	csa3,cas3HD	Unclear	ATCGTTTCGATCCACGCCTCCGCGAGGGAGGCGACG,GTTTCGATCCACGCCTCCGCGAGGGAGGCGAC	36,32	0	0	NA	NA	NA:NA	2,2	2	Unclear	csa3,DEDDh,cas3,cas4,WYL,cas3HD	NA,NA|103aa|down_3|NZ_CP049244.1_417369_417678_+	NA|401aa|up_9|NZ_CP049244.1_397006_398209_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|304aa|up_8|NZ_CP049244.1_398380_399292_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|370aa|up_7|NZ_CP049244.1_399402_400512_+	COG1566, EmrA, Multidrug resistance efflux pump [Defense mechanisms]	NA|534aa|up_6|NZ_CP049244.1_400508_402110_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|764aa|up_5|NZ_CP049244.1_402415_404707_+	TIGR01701, Hypothetical_protein_Rv2900c/MT2968/Mb2924c	NA|194aa|up_4|NZ_CP049244.1_405016_405598_-	COG4567, COG4567, Response regulator consisting of a CheY-like receiver domain and a Fis-type HTH domain [Signal transduction mechanisms / Transcription]	NA|135aa|up_3|NZ_CP049244.1_405703_406108_-	cd17537, REC_FixJ, phosphoacceptor receiver (REC) domain of FixJ family response regulators	NA|155aa|up_2|NZ_CP049244.1_406149_406614_-	COG4566, TtrR, Response regulator [Signal transduction mechanisms]	NA|245aa|up_1|NZ_CP049244.1_406697_407432_-	pfam01695, IstB_IS21, IstB-like ATP binding protein	NA|500aa|up_0|NZ_CP049244.1_407431_408931_-	pfam00665, rve, Integrase core domain	NA|345aa|down_0|NZ_CP049244.1_411772_412807_+	pfam17038, CBP_BcsN, Cellulose biosynthesis protein BcsN	NA|721aa|down_1|NZ_CP049244.1_412835_414998_+	TIGR03030, Cellulose_synthase_UDP-forming, cellulose synthase catalytic subunit (UDP-forming)	NA|777aa|down_2|NZ_CP049244.1_415042_417373_+	pfam03170, BcsB, Bacterial cellulose synthase subunit	NA|103aa|down_3|NZ_CP049244.1_417369_417678_+	NA	NA|769aa|down_4|NZ_CP049244.1_417668_419975_+	sd00006, TPR, Tetratricopeptide repeat	NA|205aa|down_5|NZ_CP049244.1_420247_420862_+	COG1280, RhtB, Putative threonine efflux protein [Amino acid transport and metabolism]	NA|260aa|down_6|NZ_CP049244.1_420950_421730_-	PRK06523, PRK06523, short chain dehydrogenase; Provisional	NA|110aa|down_7|NZ_CP049244.1_421726_422056_-	pfam12680, SnoaL_2, SnoaL-like domain	NA|299aa|down_8|NZ_CP049244.1_422193_423090_+	cd08474, PBP2_CrgA_like_5, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator CrgA-like, contains the type 2 periplasmic binding fold	NA|422aa|down_9|NZ_CP049244.1_423207_424473_-	pfam12831, FAD_oxidored, FAD dependent oxidoreductase
GCF_011046245.1_ASM1104624v1	NZ_CP049241	Rhizobium pseudoryzae strain DSM 19479 chromosome, complete genome	1	2603042-2603127	1	CRISPRCasFinder	no		csa3,DEDDh,cas3,cas4,WYL	Orphan	GCCGCCCCGAAGAAGAAGGCTGC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,cas4,WYL,cas3HD	NA|248aa|up_6|NZ_CP049241.1_2596087_2596831_+,NA|73aa|up_4|NZ_CP049241.1_2598192_2598411_+,NA|75aa|up_2|NZ_CP049241.1_2599749_2599974_+,NA|87aa|down_6|NZ_CP049241.1_2616814_2617075_+	NA|565aa|up_9|NZ_CP049241.1_2592220_2593915_+	COG1178, ThiP, ABC-type Fe3+ transport system, permease component [Inorganic ion transport and metabolism]	NA|344aa|up_8|NZ_CP049241.1_2593902_2594934_+	COG3839, MalK, ABC-type sugar transport systems, ATPase components [Carbohydrate transport and metabolism]	NA|369aa|up_7|NZ_CP049241.1_2594984_2596091_+	cd10283, MnuA_DNase1-like, Mycoplasma pulmonis MnuA nuclease-like	NA|248aa|up_6|NZ_CP049241.1_2596087_2596831_+	NA	NA|306aa|up_5|NZ_CP049241.1_2596906_2597824_+	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|73aa|up_4|NZ_CP049241.1_2598192_2598411_+	NA	NA|313aa|up_3|NZ_CP049241.1_2598593_2599532_+	COG2070, COG2070, Dioxygenases related to 2-nitropropane dioxygenase [General function prediction only]	NA|75aa|up_2|NZ_CP049241.1_2599749_2599974_+	NA	NA|48aa|up_1|NZ_CP049241.1_2600172_2600316_-	COG5457, COG5457, Uncharacterized conserved small protein [Function unknown]	NA|49aa|up_0|NZ_CP049241.1_2600578_2600725_-	COG5457, COG5457, Uncharacterized conserved small protein [Function unknown]	NA|737aa|down_0|NZ_CP049241.1_2604620_2606831_-	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|1095aa|down_1|NZ_CP049241.1_2606827_2610112_-	TIGR02456, Trehalose_synthase, trehalose synthase	NA|1087aa|down_2|NZ_CP049241.1_2610136_2613397_-	cd11344, AmyAc_GlgE_like, Alpha amylase catalytic domain found in GlgE-like proteins	NA|74aa|down_3|NZ_CP049241.1_2613588_2613810_-	pfam06169, DUF982, Protein of unknown function (DUF982)	NA|632aa|down_4|NZ_CP049241.1_2613908_2615804_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|228aa|down_5|NZ_CP049241.1_2615999_2616683_+	COG1525, COG1525, Micrococcal nuclease (thermonuclease) homologs [DNA replication, recombination, and repair]	NA|87aa|down_6|NZ_CP049241.1_2616814_2617075_+	NA	NA|164aa|down_7|NZ_CP049241.1_2617630_2618122_+	PRK02487, PRK02487, heme-degrading domain-containing protein	NA|84aa|down_8|NZ_CP049241.1_2618199_2618451_-	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|833aa|down_9|NZ_CP049241.1_2618644_2621143_+	COG3264, COG3264, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]
