assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_007970185.1_ASM797018v1	NZ_CP042431	Pseudobacter ginsenosidimutans strain Gsoil 221 chromosome, complete genome	1	777383-777472	1	CRISPRCasFinder	no	cas2,cas1,cas4,cas3,cas7,cas6	cas2,cas1,cas4,cas3,cas7,cas6,csa3,WYL,DEDDh,PD-DExK	Unclear	ATTTCAATTCAACCTTTGTACGATTAACAG	30	0	0	NA	NA	NA	1	1	Unclear	cas2,cas1,cas4,cas3,cas7,cas6,csa3,WYL,DEDDh,PD-DExK	NA|100aa|up_7|NZ_CP042431.1_766832_767132_-,NA|81aa|up_4|NZ_CP042431.1_771489_771732_+,NA|666aa|down_6|NZ_CP042431.1_786661_788659_-	NA|268aa|up_9|NZ_CP042431.1_764856_765660_-	pfam16132, DUF4843, Domain of unknown function (DUF4843)	NA|357aa|up_8|NZ_CP042431.1_765672_766743_-	cd08977, SusD, starch binding outer membrane protein SusD	NA|100aa|up_7|NZ_CP042431.1_766832_767132_-	NA	NA|1118aa|up_6|NZ_CP042431.1_767142_770496_-	TIGR04056, OMP_RagA_SusC, TonB-linked outer membrane protein, SusC/RagA family	NA|207aa|up_5|NZ_CP042431.1_770757_771378_-	COG3712, FecR, periplasmic ferric-dicitrate binding protein FecR, regulates iron transport through sigma-19 [Inorganic ion transport and metabolism, Signal transduction mechanisms]	NA|81aa|up_4|NZ_CP042431.1_771489_771732_+	NA	NA|199aa|up_3|NZ_CP042431.1_772035_772632_-	TIGR02985, Sig70_bacteroi1, RNA polymerase sigma-70 factor, Bacteroides expansion family 1	NA|399aa|up_2|NZ_CP042431.1_772731_773928_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|797aa|up_1|NZ_CP042431.1_774044_776435_+	pfam18676, MBG_2, MBG domain (YGX type)	NA|121aa|up_0|NZ_CP042431.1_776440_776803_-	pfam14534, DUF4440, Domain of unknown function (DUF4440)	cas2|88aa|down_0|NZ_CP042431.1_780834_781098_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_1|NZ_CP042431.1_781099_782104_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas4|168aa|down_2|NZ_CP042431.1_782141_782645_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|767aa|down_3|NZ_CP042431.1_782648_784949_-	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	NA|226aa|down_4|NZ_CP042431.1_784945_785623_-	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas7|343aa|down_5|NZ_CP042431.1_785615_786644_-	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	NA|666aa|down_6|NZ_CP042431.1_786661_788659_-	NA	cas6|261aa|down_7|NZ_CP042431.1_788697_789480_-	cd09759, Cas6_I-A, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|462aa|down_8|NZ_CP042431.1_789974_791360_-	PRK05291, trmE, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis GTPase MnmE	NA|277aa|down_9|NZ_CP042431.1_791471_792302_-	pfam03544, TonB_C, Gram-negative bacterial TonB protein C-terminal
GCF_007970185.1_ASM797018v1	NZ_CP042431	Pseudobacter ginsenosidimutans strain Gsoil 221 chromosome, complete genome	2	777566-780622	2,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas3,cas7,cas6	cas2,cas1,cas4,cas3,cas7,cas6,csa3,WYL,DEDDh,PD-DExK	Unclear	ATTTCAATTCAACCTTTGTACGATTAACAG,ATTTCAATTCAACCTTTGTACGATTAACAG,ATTTCAATTCAACCTTTGTACGATTAACAG	30,30,30	0	0	NA	NA	NA:NA:NA	46,46,42	46	Unclear	cas2,cas1,cas4,cas3,cas7,cas6,csa3,WYL,DEDDh,PD-DExK	NA|100aa|up_7|NZ_CP042431.1_766832_767132_-,NA|81aa|up_4|NZ_CP042431.1_771489_771732_+,NA|666aa|down_6|NZ_CP042431.1_786661_788659_-	NA|268aa|up_9|NZ_CP042431.1_764856_765660_-	pfam16132, DUF4843, Domain of unknown function (DUF4843)	NA|357aa|up_8|NZ_CP042431.1_765672_766743_-	cd08977, SusD, starch binding outer membrane protein SusD	NA|100aa|up_7|NZ_CP042431.1_766832_767132_-	NA	NA|1118aa|up_6|NZ_CP042431.1_767142_770496_-	TIGR04056, OMP_RagA_SusC, TonB-linked outer membrane protein, SusC/RagA family	NA|207aa|up_5|NZ_CP042431.1_770757_771378_-	COG3712, FecR, periplasmic ferric-dicitrate binding protein FecR, regulates iron transport through sigma-19 [Inorganic ion transport and metabolism, Signal transduction mechanisms]	NA|81aa|up_4|NZ_CP042431.1_771489_771732_+	NA	NA|199aa|up_3|NZ_CP042431.1_772035_772632_-	TIGR02985, Sig70_bacteroi1, RNA polymerase sigma-70 factor, Bacteroides expansion family 1	NA|399aa|up_2|NZ_CP042431.1_772731_773928_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|797aa|up_1|NZ_CP042431.1_774044_776435_+	pfam18676, MBG_2, MBG domain (YGX type)	NA|121aa|up_0|NZ_CP042431.1_776440_776803_-	pfam14534, DUF4440, Domain of unknown function (DUF4440)	cas2|88aa|down_0|NZ_CP042431.1_780834_781098_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|335aa|down_1|NZ_CP042431.1_781099_782104_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas4|168aa|down_2|NZ_CP042431.1_782141_782645_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|767aa|down_3|NZ_CP042431.1_782648_784949_-	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	NA|226aa|down_4|NZ_CP042431.1_784945_785623_-	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas7|343aa|down_5|NZ_CP042431.1_785615_786644_-	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	NA|666aa|down_6|NZ_CP042431.1_786661_788659_-	NA	cas6|261aa|down_7|NZ_CP042431.1_788697_789480_-	cd09759, Cas6_I-A, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|462aa|down_8|NZ_CP042431.1_789974_791360_-	PRK05291, trmE, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis GTPase MnmE	NA|277aa|down_9|NZ_CP042431.1_791471_792302_-	pfam03544, TonB_C, Gram-negative bacterial TonB protein C-terminal
GCF_007970185.1_ASM797018v1	NZ_CP042431	Pseudobacter ginsenosidimutans strain Gsoil 221 chromosome, complete genome	3	1943451-1943549	3	CRISPRCasFinder	no		cas2,cas1,cas4,cas3,cas7,cas6,csa3,WYL,DEDDh,PD-DExK	Orphan	TGAATATTTTATAAAAATGAACA	23	0	0	NA	NA	NA	1	1	Orphan	cas2,cas1,cas4,cas3,cas7,cas6,csa3,WYL,DEDDh,PD-DExK	NA|805aa|up_1|NZ_CP042431.1_1939864_1942279_-,NA	NA|376aa|up_9|NZ_CP042431.1_1930865_1931993_+	cd09084, EEP-2, Exonuclease-Endonuclease-Phosphatase (EEP) domain superfamily; uncharacterized family 2	NA|498aa|up_8|NZ_CP042431.1_1932061_1933555_+	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|321aa|up_7|NZ_CP042431.1_1933597_1934560_+	pfam14870, PSII_BNR, Photosynthesis system II assembly factor YCF48	NA|226aa|up_6|NZ_CP042431.1_1934562_1935240_-	COG4121, COG4121, Uncharacterized conserved protein [Function unknown]	NA|346aa|up_5|NZ_CP042431.1_1935262_1936300_+	COG4447, COG4447, Uncharacterized protein related to plant photosystem II stability/assembly factor [General function prediction only]	NA|203aa|up_4|NZ_CP042431.1_1936303_1936912_+	COG0122, AlkA, 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair]	NA|162aa|up_3|NZ_CP042431.1_1936962_1937448_-	pfam06271, RDD, RDD family	NA|721aa|up_2|NZ_CP042431.1_1937572_1939735_-	pfam10459, Peptidase_S46, Peptidase S46	NA|805aa|up_1|NZ_CP042431.1_1939864_1942279_-	NA	NA|335aa|up_0|NZ_CP042431.1_1942363_1943368_-	PRK12299, obgE, GTPase CgtA; Reviewed	NA|260aa|down_0|NZ_CP042431.1_1944563_1945343_+	COG4849, COG4849, Predicted nucleotidyltransferase [General function prediction    only]	NA|198aa|down_1|NZ_CP042431.1_1945345_1945939_-	cd04745, LbH_paaY_like, paaY-like: This group is composed by uncharacterized proteins with similarity to the protein product of the E	NA|143aa|down_2|NZ_CP042431.1_1945967_1946396_-	TIGR02286, Acyl-coenzyme_A_thioesterase_PaaI, phenylacetic acid degradation protein PaaD	NA|259aa|down_3|NZ_CP042431.1_1946428_1947205_-	PRK08140, PRK08140, enoyl-CoA hydratase; Provisional	NA|168aa|down_4|NZ_CP042431.1_1947224_1947728_-	TIGR02159, Putative_12-phenylacetyl-CoA_epoxidase_subunit_D, phenylacetate-CoA oxygenase, PaaJ subunit	NA|257aa|down_5|NZ_CP042431.1_1947734_1948505_-	pfam05138, PaaA_PaaC, Phenylacetic acid catabolic protein	NA|121aa|down_6|NZ_CP042431.1_1948527_1948890_-	PRK13781, paaB, phenylacetate-CoA oxygenase subunit PaaB; Provisional	NA|317aa|down_7|NZ_CP042431.1_1948917_1949868_-	PRK13778, paaA, phenylacetate-CoA oxygenase subunit PaaA; Provisional	NA|491aa|down_8|NZ_CP042431.1_1950110_1951583_+	pfam02055, Glyco_hydro_30, Glycosyl hydrolase family 30 TIM-barrel domain	NA|361aa|down_9|NZ_CP042431.1_1951679_1952762_-	TIGR02160, 12-phenylacetyl-CoA_epoxidase_subunit_E, phenylacetate-CoA oxygenase/reductase, PaaK subunit
