assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001984035.1_ASM198403v1	NZ_CP019449	Sphingopyxis sp. QXT-31, complete genome	1	3737931-3738014	1	CRISPRCasFinder	no		csa3,DinG,WYL,cas3	Orphan	TAAATAGGGACAGTCCCTATTTA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,WYL,cas3	NA|442aa|up_1|NZ_CP019449.1_3731566_3732892_+,NA|88aa|down_1|NZ_CP019449.1_3738922_3739186_-,NA|102aa|down_2|NZ_CP019449.1_3739209_3739515_-,NA|200aa|down_8|NZ_CP019449.1_3747021_3747621_-	NA|345aa|up_9|NZ_CP019449.1_3717674_3718709_-	cd06295, PBP1_CelR, ligand binding domain of a transcription regulator of cellulose genes, CelR, which is highly homologous to the LacI-GalR family of bacterial transcription regulators	NA|955aa|up_8|NZ_CP019449.1_3719034_3721899_+	TIGR01782, TonB-dependent_receptor, TonB-dependent receptor	NA|500aa|up_7|NZ_CP019449.1_3721977_3723477_+	pfam04820, Trp_halogenase, Tryptophan halogenase	NA|604aa|up_6|NZ_CP019449.1_3723473_3725285_+	cd11320, AmyAc_AmyMalt_CGTase_like, Alpha amylase catalytic domain found in maltogenic amylases, cyclodextrin glycosyltransferase, and related proteins	NA|582aa|up_5|NZ_CP019449.1_3725193_3726939_+	cd11330, AmyAc_OligoGlu, Alpha amylase catalytic domain found in oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase) and related proteins	NA|680aa|up_4|NZ_CP019449.1_3726938_3728978_+	pfam10566, Glyco_hydro_97, Glycoside hydrolase 97	NA|480aa|up_3|NZ_CP019449.1_3729031_3730471_+	pfam04820, Trp_halogenase, Tryptophan halogenase	NA|233aa|up_2|NZ_CP019449.1_3730644_3731343_+	PRK07053, PRK07053, glutamine amidotransferase; Provisional	NA|442aa|up_1|NZ_CP019449.1_3731566_3732892_+	NA	NA|1573aa|up_0|NZ_CP019449.1_3733192_3737911_+	pfam05088, Bac_GDH, Bacterial NAD-glutamate dehydrogenase	NA|288aa|down_0|NZ_CP019449.1_3738027_3738891_+	COG0189, RimK, Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) [Coenzyme metabolism / Translation, ribosomal structure and biogenesis]	NA|88aa|down_1|NZ_CP019449.1_3738922_3739186_-	NA	NA|102aa|down_2|NZ_CP019449.1_3739209_3739515_-	NA	NA|79aa|down_3|NZ_CP019449.1_3739516_3739753_-	pfam13467, RHH_4, Ribbon-helix-helix domain	NA|241aa|down_4|NZ_CP019449.1_3740913_3741636_-	TIGR01829, Acetoacetyl-CoA_reductase, acetoacetyl-CoA reductase	NA|584aa|down_5|NZ_CP019449.1_3741729_3743481_-	pfam09234, DUF1963, Domain of unknown function (DUF1963)	NA|332aa|down_6|NZ_CP019449.1_3743551_3744547_-	PRK00035, hemH, ferrochelatase; Reviewed	NA|763aa|down_7|NZ_CP019449.1_3744543_3746832_-	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|200aa|down_8|NZ_CP019449.1_3747021_3747621_-	NA	NA|141aa|down_9|NZ_CP019449.1_3747810_3748233_-	COG1832, COG1832, Predicted CoA-binding protein [General function prediction only]
GCF_001984035.1_ASM198403v1	NZ_CP019449	Sphingopyxis sp. QXT-31, complete genome	2	3890895-3891084	1	PILER-CR	no		csa3,DinG,WYL,cas3	Orphan	GCCTATGTCCGCGAGATGCGCA	22	0	0	NA	NA	NA	2	2	Orphan	csa3,DinG,WYL,cas3	NA,NA|269aa|down_0|NZ_CP019449.1_3891434_3892241_+	NA|330aa|up_9|NZ_CP019449.1_3877479_3878469_-	pfam11199, DUF2891, Protein of unknown function (DUF2891)	NA|313aa|up_8|NZ_CP019449.1_3878629_3879568_-	pfam06166, DUF979, Protein of unknown function (DUF979)	NA|225aa|up_7|NZ_CP019449.1_3879564_3880239_-	pfam06149, DUF969, Protein of unknown function (DUF969)	NA|1198aa|up_6|NZ_CP019449.1_3880441_3884035_-	PLN02666, PLN02666, 5-oxoprolinase	NA|549aa|up_5|NZ_CP019449.1_3884021_3885668_-	COG3920, COG3920, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|216aa|up_4|NZ_CP019449.1_3885839_3886487_+	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|229aa|up_3|NZ_CP019449.1_3886366_3887053_+	cd07814, SRPBCC_CalC_Aha1-like, Putative hydrophobic ligand-binding SRPBCC domain of Micromonospora echinospora CalC, human Aha1, and related proteins	NA|324aa|up_2|NZ_CP019449.1_3887470_3888442_-	cd12830, MtCorA-like, Mycobacterium tuberculosis CorA-like subfamily	NA|147aa|up_1|NZ_CP019449.1_3888452_3888893_-	pfam07971, Glyco_hydro_92, Glycosyl hydrolase family 92	NA|126aa|up_0|NZ_CP019449.1_3889221_3889599_+	pfam03965, Penicillinase_R, Penicillinase repressor	NA|269aa|down_0|NZ_CP019449.1_3891434_3892241_+	NA	NA|696aa|down_1|NZ_CP019449.1_3892255_3894343_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|128aa|down_2|NZ_CP019449.1_3894533_3894917_+	cd09279, RNase_HI_like, RNAse HI family that includes archaeal, some bacterial as well as plant RNase HI	NA|347aa|down_3|NZ_CP019449.1_3895081_3896122_+	PRK13927, PRK13927, rod shape-determining protein MreB; Provisional	NA|226aa|down_4|NZ_CP019449.1_3896094_3896772_-	pfam14246, TetR_C_7, AefR-like transcriptional repressor, C-terminal region	NA|474aa|down_5|NZ_CP019449.1_3896825_3898247_+	TIGR01845, Outer_membrane_protein_OprM, efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family	NA|383aa|down_6|NZ_CP019449.1_3898246_3899395_+	TIGR00998, Probable_multidrug_resistance_protein_EmrK, efflux pump membrane protein (multidrug resistance protein A)	NA|515aa|down_7|NZ_CP019449.1_3899404_3900949_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|144aa|down_8|NZ_CP019449.1_3901041_3901473_+	PRK09776, PRK09776, putative diguanylate cyclase; Provisional	NA|342aa|down_9|NZ_CP019449.1_3901636_3902662_-	PRK09358, PRK09358, adenosine deaminase; Provisional
GCF_001984035.1_ASM198403v1	NZ_CP019449	Sphingopyxis sp. QXT-31, complete genome	3	3932358-3932443	2	CRISPRCasFinder	no		csa3,DinG,WYL,cas3	Orphan	CGAGCGCGGCGCTGCGTTTCCTC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,WYL,cas3	NA|133aa|up_6|NZ_CP019449.1_3923720_3924119_-,NA|203aa|up_3|NZ_CP019449.1_3928029_3928638_-,NA	NA|123aa|up_9|NZ_CP019449.1_3920357_3920726_-	PRK05179, rpsM, 30S ribosomal protein S13; Validated	NA|264aa|up_8|NZ_CP019449.1_3921408_3922200_-	cd01638, CysQ, CysQ, a 3'-Phosphoadenosine-5'-phosphosulfate (PAPS) 3'-phosphatase, is a bacterial member of the inositol monophosphatase family	NA|448aa|up_7|NZ_CP019449.1_3922203_3923547_-	COG0312, TldD, Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only]	NA|133aa|up_6|NZ_CP019449.1_3923720_3924119_-	NA	NA|651aa|up_5|NZ_CP019449.1_3924456_3926409_+	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|431aa|up_4|NZ_CP019449.1_3926507_3927800_+	cd00842, MPP_ASMase, acid sphingomyelinase and related proteins, metallophosphatase domain	NA|203aa|up_3|NZ_CP019449.1_3928029_3928638_-	NA	NA|173aa|up_2|NZ_CP019449.1_3928715_3929234_-	pfam03232, COQ7, Ubiquinone biosynthesis protein COQ7	NA|161aa|up_1|NZ_CP019449.1_3929230_3929713_-	pfam02600, DsbB, Disulfide bond formation protein DsbB	NA|466aa|up_0|NZ_CP019449.1_3929846_3931244_-	COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer membrane]	NA|143aa|down_0|NZ_CP019449.1_3932588_3933017_-	cd18081, RlmH-like, 23S-rRNA-pseudouridine1915-N3-methyltransferase RlmH	NA|131aa|down_1|NZ_CP019449.1_3933049_3933442_-	TIGR00090, rsfS_iojap_ybeB, ribosome silencing factor RsfS/YbeB/iojap	NA|218aa|down_2|NZ_CP019449.1_3933441_3934095_-	PRK00071, nadD, nicotinate-nucleotide adenylyltransferase	NA|96aa|down_3|NZ_CP019449.1_3934091_3934379_-	COG3668, ParE, Plasmid stabilization system protein [General function prediction only]	NA|88aa|down_4|NZ_CP019449.1_3934366_3934630_-	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|427aa|down_5|NZ_CP019449.1_3934661_3935942_-	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|374aa|down_6|NZ_CP019449.1_3936019_3937141_+	pfam12412, DUF3667, Protein of unknown function (DUF3667)	NA|544aa|down_7|NZ_CP019449.1_3937199_3938831_+	pfam09423, PhoD, PhoD-like phosphatase	NA|308aa|down_8|NZ_CP019449.1_3938942_3939866_-	PRK08260, PRK08260, enoyl-CoA hydratase; Provisional	NA|260aa|down_9|NZ_CP019449.1_3940247_3941027_+	PRK09362, PRK09362, phosphoribosylaminoimidazole-succinocarboxamide synthase; Reviewed
