assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003855095.1_ASM385509v1	CP033972	Gordonia sp. MMS17-SY073	1	490624-490705	1	CRISPRCasFinder	no	DinG	csa3,WYL,DEDDh,DinG,cas3,casR	Type IV-A	CGCTGGTCGAGTGCCGCGCGAGCGGAGC	28	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,DEDDh,DinG,cas3,casR	NA,NA|82aa|down_6|CP033972.1_495390_495636_+,NA|83aa|down_7|CP033972.1_495592_495841_-	NA|435aa|up_9|CP033972.1_482978_484283_-	PRK09243, PRK09243, nicotinate phosphoribosyltransferase; Validated	NA|74aa|up_8|CP033972.1_484462_484684_+	PRK00033, clpS, ATP-dependent Clp protease adaptor protein ClpS; Reviewed	NA|192aa|up_7|CP033972.1_484702_485278_+	pfam09438, DUF2017, Domain of unknown function (DUF2017)	NA|366aa|up_6|CP033972.1_485291_486389_+	cd02252, nylC_like, nylC-like family; composed of proteins with similarity to Flavobacterium endo-type 6-aminohexanoate-oligomer hydrolase (EIII), the product of the nylon oligomer degradation gene, nylC	NA|136aa|up_5|CP033972.1_486534_486942_+	cd08070, MPN_like, Mpr1p, Pad1p N-terminal (MPN) domains with catalytic isopeptidase activity (metal-binding)	NA|91aa|up_4|CP033972.1_486946_487219_+	cd17074, Ubl_CysO_like, ubiquitin-like (Ubl) domain found in Mycobacterium tuberculosis CysO and similar proteins	NA|329aa|up_3|CP033972.1_487222_488209_+	TIGR01136, Cysteine_synthase, cysteine synthase	NA|215aa|up_2|CP033972.1_488317_488962_+	pfam01694, Rhomboid, Rhomboid family	NA|253aa|up_1|CP033972.1_489000_489759_+	PRK00865, PRK00865, glutamate racemase; Provisional	NA|256aa|up_0|CP033972.1_489834_490602_+	cd07716, RNaseZ_short-form-like_MBL-fold, uncharacterized bacterial subgroup of Ribonuclease Z, short form; MBL-fold metallo-hydrolase domain	NA|268aa|down_0|CP033972.1_490743_491547_+	PRK00173, rph, ribonuclease PH; Reviewed	NA|206aa|down_1|CP033972.1_491543_492161_+	PRK00120, PRK00120, dITP/XTP pyrophosphatase; Reviewed	NA|384aa|down_2|CP033972.1_492137_493289_-	cd03814, GT4-like, glycosyltransferase family 4 proteins	NA|115aa|down_3|CP033972.1_493398_493743_-	pfam12823, DUF3817, Domain of unknown function (DUF3817)	NA|162aa|down_4|CP033972.1_493739_494225_-	COG4578, GutM, Glucitol operon activator [Transcription]	NA|200aa|down_5|CP033972.1_494386_494986_-	PRK04296, PRK04296, thymidine kinase; Provisional	NA|82aa|down_6|CP033972.1_495390_495636_+	NA	NA|83aa|down_7|CP033972.1_495592_495841_-	NA	NA|348aa|down_8|CP033972.1_495913_496957_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|292aa|down_9|CP033972.1_496962_497838_-	pfam01261, AP_endonuc_2, Xylose isomerase-like TIM barrel
GCA_003855095.1_ASM385509v1	CP033972	Gordonia sp. MMS17-SY073	2	1295889-1295964	2	CRISPRCasFinder	no		csa3,WYL,DEDDh,DinG,cas3,casR	Orphan	ACCCGTCCCAGCGCGGCGCCATCA	24	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,DEDDh,DinG,cas3,casR	NA,NA	NA|360aa|up_9|CP033972.1_1284417_1285497_-	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|228aa|up_8|CP033972.1_1285573_1286257_-	cd05341, 3beta-17beta-HSD_like_SDR_c, 3beta17beta hydroxysteroid dehydrogenase-like, classical (c) SDRs	NA|361aa|up_7|CP033972.1_1286435_1287518_+	COG3804, COG3804, Uncharacterized conserved protein related to dihydrodipicolinate reductase [Function unknown]	NA|469aa|up_6|CP033972.1_1287579_1288986_-	cd07809, FGGY_D-XK_1, D-xylulose kinases, subgroup 1; members of the FGGY family of carbohydrate kinases	NA|495aa|up_5|CP033972.1_1288982_1290467_-	COG0246, MtlD, Mannitol-1-phosphate/altronate dehydrogenases [Carbohydrate transport and metabolism]	NA|365aa|up_4|CP033972.1_1290477_1291572_-	cd05285, sorbitol_DH, Sorbitol dehydrogenase	NA|307aa|up_3|CP033972.1_1291718_1292639_+	COG2390, DeoR, Transcriptional regulator, contains sigma factor-related N-terminal domain [Transcription]	NA|463aa|up_2|CP033972.1_1292638_1294027_+	cd13585, PBP2_TMBP_like, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose and similar oligosaccharides; possess type 2 periplasmic binding fold	NA|320aa|up_1|CP033972.1_1294030_1294990_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|292aa|up_0|CP033972.1_1294989_1295865_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|368aa|down_0|CP033972.1_1296011_1297115_+	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|588aa|down_1|CP033972.1_1297179_1298943_+	COG4425, COG4425, Predicted membrane protein [Function unknown]	NA|150aa|down_2|CP033972.1_1298974_1299424_-	TIGR02096, putative_cyclase, conserved hypothetical protein, steroid delta-isomerase-related	NA|619aa|down_3|CP033972.1_1299413_1301270_-	COG3653, COG3653, N-acyl-D-aspartate/D-glutamate deacylase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|313aa|down_4|CP033972.1_1301377_1302316_+	smart00342, HTH_ARAC, helix_turn_helix, arabinose operon control protein	NA|187aa|down_5|CP033972.1_1302312_1302873_+	COG0262, FolA, Dihydrofolate reductase [Coenzyme metabolism]	NA|219aa|down_6|CP033972.1_1302914_1303571_-	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|171aa|down_7|CP033972.1_1303652_1304165_+	pfam02627, CMD, Carboxymuconolactone decarboxylase family	NA|276aa|down_8|CP033972.1_1304195_1305023_+	PRK06142, PRK06142, crotonase/enoyl-CoA hydratase family protein	NA|160aa|down_9|CP033972.1_1305038_1305518_-	TIGR00026, Hypothetical_protein_Rv1261c/MT1299/Mb1292c
GCA_003855095.1_ASM385509v1	CP033972	Gordonia sp. MMS17-SY073	3	1603387-1603466	3	CRISPRCasFinder	no		csa3,WYL,DEDDh,DinG,cas3,casR	Orphan	AGCGCACCACCCGCCCCCGCTGATC	25	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,DEDDh,DinG,cas3,casR	NA|109aa|up_2|CP033972.1_1600718_1601045_+,NA|140aa|down_7|CP033972.1_1610315_1610735_+	NA|128aa|up_9|CP033972.1_1594433_1594817_+	pfam13772, AIG2_2, AIG2-like family	NA|398aa|up_8|CP033972.1_1594830_1596024_-	cd08014, M20_Acy1-like, M20 Peptidase aminoacylase 1 subfamily	NA|233aa|up_7|CP033972.1_1596036_1596735_-	cd06558, crotonase-like, Crotonase/Enoyl-Coenzyme A (CoA) hydratase superfamily	NA|268aa|up_6|CP033972.1_1596770_1597574_+	PRK08202, PRK08202, purine nucleoside phosphorylase; Provisional	NA|544aa|up_5|CP033972.1_1597576_1599208_+	cd05799, PGM2, This CD includes PGM2 (phosphoglucomutase 2) and PGM2L1 (phosphoglucomutase 2-like 1)	NA|273aa|up_4|CP033972.1_1599214_1600033_+	COG1119, ModF, ABC-type molybdenum transport system, ATPase component/photorepair protein PhrA [Inorganic ion transport and metabolism]	NA|208aa|up_3|CP033972.1_1600045_1600669_-	PRK00129, upp, uracil phosphoribosyltransferase; Reviewed	NA|109aa|up_2|CP033972.1_1600718_1601045_+	NA	NA|312aa|up_1|CP033972.1_1601044_1601980_+	pfam00877, NLPC_P60, NlpC/P60 family	NA|466aa|up_0|CP033972.1_1601976_1603374_+	pfam10446, DUF2457, Protein of unknown function (DUF2457)	NA|380aa|down_0|CP033972.1_1603592_1604732_-	PRK09358, PRK09358, adenosine deaminase; Provisional	NA|457aa|down_1|CP033972.1_1604728_1606099_-	PRK05820, deoA, thymidine phosphorylase; Reviewed	NA|134aa|down_2|CP033972.1_1606095_1606497_-	PRK05578, PRK05578, cytidine deaminase; Validated	NA|136aa|down_3|CP033972.1_1606721_1607129_+	cd03501, SQR_TypeA_SdhC_like, Succinate:quinone oxidoreductase (SQR) Type A subfamily, Succinate dehydrogenase C (SdhC)-like subunit; SQR catalyzes the oxidation of succinate to fumarate coupled to the reduction of quinone to quinol	NA|153aa|down_4|CP033972.1_1607139_1607598_+	cd03500, SQR_TypeA_SdhD_like, Succinate:quinone oxidoreductase (SQR) Type A subfamily, Succinate dehydrogenase D (SdhD)-like subunit; SQR catalyzes the oxidation of succinate to fumarate coupled to the reduction of quinone to quinol	NA|585aa|down_5|CP033972.1_1607615_1609370_+	PRK08205, sdhA, succinate dehydrogenase flavoprotein subunit; Reviewed	NA|268aa|down_6|CP033972.1_1609369_1610173_+	PRK05950, sdhB, succinate dehydrogenase iron-sulfur subunit; Reviewed	NA|140aa|down_7|CP033972.1_1610315_1610735_+	NA	NA|502aa|down_8|CP033972.1_1610850_1612356_+	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|311aa|down_9|CP033972.1_1612345_1613278_+	cd00293, USP_Like, Usp: Universal stress protein family
GCA_003855095.1_ASM385509v1	CP033972	Gordonia sp. MMS17-SY073	4	2094443-2094587	4	CRISPRCasFinder	no		csa3,WYL,DEDDh,DinG,cas3,casR	Orphan	CCGCTGGTCGAGTAGTCGCGAACGAAGTGAGTGACGAATCGAGACCACACC	51	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,DEDDh,DinG,cas3,casR	NA,NA	NA|306aa|up_9|CP033972.1_2081715_2082633_+	TIGR03619, F420_Rv2161c, probable F420-dependent oxidoreductase, Rv2161c family	NA|292aa|up_8|CP033972.1_2082703_2083579_-	TIGR02438, catechol_12-dioxygenase, catechol 1,2-dioxygenase, Actinobacterial	NA|95aa|up_7|CP033972.1_2083634_2083919_-	TIGR03221, muco_delta, muconolactone delta-isomerase	NA|369aa|up_6|CP033972.1_2083924_2085031_-	cd03315, MLE_like, Muconate lactonizing enzyme (MLE) like subgroup of the enolase superfamily	NA|319aa|up_5|CP033972.1_2085027_2085984_-	PRK09986, PRK09986, LysR family transcriptional regulator	NA|452aa|up_4|CP033972.1_2086216_2087572_+	TIGR03229, benzo_1_2_benA, benzoate 1,2-dioxygenase, large subunit	NA|183aa|up_3|CP033972.1_2087773_2088322_+	TIGR03232, benzo_1_2_benB, benzoate 1,2-dioxygenase, small subunit	NA|932aa|up_2|CP033972.1_2088334_2091130_+	PRK12823, benD, 1,6-dihydroxycyclohexa-2,4-diene-1-carboxylate dehydrogenase; Provisional	NA|69aa|up_1|CP033972.1_2091358_2091565_+	COG1278, CspC, Cold shock proteins [Transcription]	NA|888aa|up_0|CP033972.1_2091671_2094335_+	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|96aa|down_0|CP033972.1_2094635_2094923_+	cd10456, GIY-YIG_UPF0213, The GIY-YIG domain of uncharacterized protein family UPF0213 related to structure-specific endonuclease SLX1	NA|219aa|down_1|CP033972.1_2095068_2095725_-	TIGR02428, 3-oxoadipate_CoA-transferase_subunit_B, 3-oxoacid CoA-transferase, B subunit	NA|266aa|down_2|CP033972.1_2095735_2096533_-	COG1788, AtoD, Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunit [Lipid metabolism]	NA|257aa|down_3|CP033972.1_2096529_2097300_-	TIGR02427, b-ketoadipate_enol-lactone_hydrolase, 3-oxoadipate enol-lactonase	NA|408aa|down_4|CP033972.1_2097296_2098520_-	PRK06205, PRK06205, acetyl-CoA C-acetyltransferase	NA|299aa|down_5|CP033972.1_2098593_2099490_+	PRK09986, PRK09986, LysR family transcriptional regulator	NA|553aa|down_6|CP033972.1_2099505_2101164_-	cd00322, FNR_like, Ferredoxin reductase (FNR), an FAD and NAD(P) binding protein, was intially identified as a chloroplast reductase activity, catalyzing the electron transfer from reduced iron-sulfur protein ferredoxin to NADP+ as the final step in the electron transport mechanism of photosystem I	NA|294aa|down_7|CP033972.1_2101160_2102042_-	pfam02424, ApbE, ApbE family	NA|157aa|down_8|CP033972.1_2102049_2102520_-	COG3976, COG3976, Uncharacterized protein conserved in bacteria [Function unknown]	NA|563aa|down_9|CP033972.1_2102612_2104301_-	cd01456, vWA_ywmD_type, VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)
