assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000247715.1_ASM24771v1	NC_016906	Gordonia polyisoprenivorans VH2, complete sequence	1	1389980-1390151	1	CRT	no	csa3	DEDDh,casR,csa3,cas3,WYL,DinG,cas4,cas2,cas1,csb2gr5,cas7,cas8u1,c2c9_V-U4,RT	Type I-A	NCGATGGTCGAGGTNCNNGNG	21	1	1	1390000-1390017	NC_016906.1_2251042-2251059	NA	4	4	Orphan	DEDDh,casR,csa3,cas3,WYL,DinG,cas4,cas2,cas1,csb2gr5,cas7,cas8u1,c2c9_V-U4,RT	NA,NA	NA|153aa|up_9|NC_016906.1_1373965_1374424_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|152aa|up_8|NC_016906.1_1374420_1374876_+	cd07263, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|1179aa|up_7|NC_016906.1_1376040_1379577_-	PRK13030, PRK13030, indolepyruvate ferredoxin oxidoreductase family protein	NA|514aa|up_6|NC_016906.1_1379859_1381401_+	PLN02820, PLN02820, 3-methylcrotonyl-CoA carboxylase, beta chain	NA|702aa|up_5|NC_016906.1_1381490_1383596_+	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|382aa|up_4|NC_016906.1_1383636_1384782_+	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|173aa|up_3|NC_016906.1_1384778_1385297_+	cd03451, FkbR2, FkbR2 is a Streptomyces hygroscopicus protein with a hot dog fold that belongs to a conserved family of proteins found in prokaryotes and archaea but not in eukaryotes	NA|299aa|up_2|NC_016906.1_1385354_1386251_+	COG2301, CitE, Citrate lyase beta subunit [Carbohydrate transport and metabolism]	NA|559aa|up_1|NC_016906.1_1386282_1387959_+	PRK08315, PRK08315, AMP-binding domain protein; Validated	NA|668aa|up_0|NC_016906.1_1387955_1389959_+	PRK03584, PRK03584, acetoacetate--CoA ligase	NA|415aa|down_0|NC_016906.1_1390420_1391665_+	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|556aa|down_1|NC_016906.1_1391819_1393487_+	pfam10101, DUF2339, Predicted membrane protein (DUF2339)	NA|212aa|down_2|NC_016906.1_1393584_1394220_+	PRK05647, purN, phosphoribosylglycinamide formyltransferase; Reviewed	NA|535aa|down_3|NC_016906.1_1394216_1395821_+	PRK00881, purH, bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase; Provisional	NA|249aa|down_4|NC_016906.1_1395953_1396700_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|910aa|down_5|NC_016906.1_1396745_1399475_-	PRK13800, PRK13800, fumarate reductase/succinate dehydrogenase flavoprotein subunit	NA|76aa|down_6|NC_016906.1_1399530_1399758_-	COG1146, COG1146, Ferredoxin [Energy production and conversion]	NA|254aa|down_7|NC_016906.1_1399769_1400531_-	COG2188, PhnF, Transcriptional regulators [Transcription]	NA|289aa|down_8|NC_016906.1_1400888_1401755_+	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]	NA|353aa|down_9|NC_016906.1_1401761_1402820_+	cd13560, PBP2_taurine, Taurine-binding periplasmic protein; the type 2 periplasmic binding protein fold
GCF_000247715.1_ASM24771v1	NC_016906	Gordonia polyisoprenivorans VH2, complete sequence	2	3058967-3059060	1	CRISPRCasFinder	no		DEDDh,casR,csa3,cas3,WYL,DinG,cas4,cas2,cas1,csb2gr5,cas7,cas8u1,c2c9_V-U4,RT	Orphan	CCGCTCACCGGGCGCGCCACGCCTGCACGA	30	0	0	NA	NA	NA	1	1	Orphan	DEDDh,casR,csa3,cas3,WYL,DinG,cas4,cas2,cas1,csb2gr5,cas7,cas8u1,c2c9_V-U4,RT	NA,NA	NA|273aa|up_9|NC_016906.1_3049108_3049927_+	PRK06940, PRK06940, short chain dehydrogenase; Provisional	NA|315aa|up_8|NC_016906.1_3049930_3050875_+	smart00849, Lactamase_B, Metallo-beta-lactamase superfamily	NA|392aa|up_7|NC_016906.1_3050820_3051996_-	pfam09995, DUF2236, Uncharacterized protein conserved in bacteria (DUF2236)	NA|211aa|up_6|NC_016906.1_3052039_3052672_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|111aa|up_5|NC_016906.1_3052766_3053099_+	PRK07062, PRK07062, SDR family oxidoreductase	NA|302aa|up_4|NC_016906.1_3053122_3054028_-	TIGR02957, putative_sigma_factor, RNA polymerase sigma-70 factor, TIGR02957 family	NA|399aa|up_3|NC_016906.1_3054024_3055221_-	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|235aa|up_2|NC_016906.1_3055381_3056086_+	pfam06197, DUF998, Protein of unknown function (DUF998)	NA|611aa|up_1|NC_016906.1_3056043_3057876_-	COG1022, FAA1, Long-chain acyl-CoA synthetases (AMP-forming) [Lipid metabolism]	NA|293aa|up_0|NC_016906.1_3058025_3058904_-	COG1946, TesB, Acyl-CoA thioesterase [Lipid metabolism]	NA|478aa|down_0|NC_016906.1_3059154_3060588_-	PRK06247, PRK06247, pyruvate kinase; Provisional	NA|502aa|down_1|NC_016906.1_3060788_3062294_-	PRK12810, gltD, glutamate synthase subunit beta; Reviewed	NA|1524aa|down_2|NC_016906.1_3062286_3066858_-	PRK11750, gltB, glutamate synthase subunit alpha; Provisional	NA|615aa|down_3|NC_016906.1_3067042_3068887_-	PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase; Reviewed	NA|274aa|down_4|NC_016906.1_3068961_3069783_-	PRK13111, trpA, tryptophan synthase subunit alpha; Provisional	NA|459aa|down_5|NC_016906.1_3069779_3071156_-	PRK04346, PRK04346, tryptophan synthase subunit beta; Validated	NA|273aa|down_6|NC_016906.1_3071215_3072034_-	PRK00278, trpC, indole-3-glycerol phosphate synthase TrpC	NA|252aa|down_7|NC_016906.1_3072144_3072900_-	pfam09534, Trp_oprn_chp, Tryptophan-associated transmembrane protein (Trp_oprn_chp)	NA|529aa|down_8|NC_016906.1_3072892_3074479_-	PRK13571, PRK13571, anthranilate synthase component I; Provisional	NA|60aa|down_9|NC_016906.1_3074475_3074655_-	COG1942, COG1942, Uncharacterized protein, 4-oxalocrotonate tautomerase homolog [General function prediction only]
GCF_000247715.1_ASM24771v1	NC_016906	Gordonia polyisoprenivorans VH2, complete sequence	3	3982226-3983952	2,2,1	CRT,CRISPRCasFinder,PILER-CR	no	cas2,cas1,csb2gr5,cas7,cas8u1,cas3	DEDDh,casR,csa3,cas3,WYL,DinG,cas4,cas2,cas1,csb2gr5,cas7,cas8u1,c2c9_V-U4,RT	Unclear	GCTGCAATGGNAACCCGGCCGTGAAGACCGGGAGCACN,GCTGCAATGGAACCCGGCCGTGAAGACCGGGAGCAC,CAATGGAACCCGGCCGTGAAGACCGGGAGCAC	38,36,32	0	0	NA	NA	NA:NA:NA	23,21,21	23	Unclear	DEDDh,casR,csa3,cas3,WYL,DinG,cas4,cas2,cas1,csb2gr5,cas7,cas8u1,c2c9_V-U4,RT	NA|254aa|up_2|NC_016906.1_3980121_3980883_-,NA|162aa|down_7|NC_016906.1_3995025_3995511_-	NA|441aa|up_9|NC_016906.1_3975487_3976810_-	PRK09204, secY, preprotein translocase subunit SecY; Reviewed	NA|147aa|up_8|NC_016906.1_3977144_3977585_-	PRK05592, rplO, 50S ribosomal protein L15; Reviewed	NA|62aa|up_7|NC_016906.1_3977581_3977767_-	PRK05611, rpmD, 50S ribosomal protein L30; Reviewed	NA|216aa|up_6|NC_016906.1_3977772_3978420_-	PRK00550, rpsE, 30S ribosomal protein S5; Validated	NA|119aa|up_5|NC_016906.1_3978463_3978820_-	PRK05593, rplR, 50S ribosomal protein L18; Reviewed	NA|179aa|up_4|NC_016906.1_3978871_3979408_-	PRK05498, rplF, 50S ribosomal protein L6; Validated	NA|133aa|up_3|NC_016906.1_3979424_3979823_-	PRK00136, rpsH, 30S ribosomal protein S8; Validated	NA|254aa|up_2|NC_016906.1_3980121_3980883_-	NA	NA|254aa|up_1|NC_016906.1_3980920_3981682_+	pfam09900, DUF2127, Predicted membrane protein (DUF2127)	NA|110aa|up_0|NC_016906.1_3981762_3982092_+	cd02230, cupin_HP0902-like, Helicobacter pylori HP0902 and related proteins, cupin domain	cas2|96aa|down_0|NC_016906.1_3984127_3984415_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|552aa|down_1|NC_016906.1_3984421_3986077_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	csb2gr5|493aa|down_2|NC_016906.1_3986194_3987673_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	cas7|310aa|down_3|NC_016906.1_3987679_3988609_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	cas8u1|751aa|down_4|NC_016906.1_3988592_3990845_-	TIGR04113, hypothetical_protein_AaLAA1DRAFT_1703, CRISPR-associated protein Csx17, subtype Dpsyc	cas3|797aa|down_5|NC_016906.1_3990841_3993232_-	TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc	NA|460aa|down_6|NC_016906.1_3993609_3994989_+	pfam03008, DUF234, Archaea bacterial proteins of unknown function	NA|162aa|down_7|NC_016906.1_3995025_3995511_-	NA	NA|286aa|down_8|NC_016906.1_3995507_3996365_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|984aa|down_9|NC_016906.1_3996389_3999341_-	pfam10593, Z1, Z1 domain
GCF_000247715.1_ASM24771v1	NC_016906	Gordonia polyisoprenivorans VH2, complete sequence	4	4578974-4579069	3	CRISPRCasFinder	no	c2c9_V-U4	DEDDh,casR,csa3,cas3,WYL,DinG,cas4,cas2,cas1,csb2gr5,cas7,cas8u1,c2c9_V-U4,RT	Type V-U4	CGTCGCTAACGCTCCTCGACACCTCGACCA	30	0	0	NA	NA	NA	1	1	TypeV-U4	DEDDh,casR,csa3,cas3,WYL,DinG,cas4,cas2,cas1,csb2gr5,cas7,cas8u1,c2c9_V-U4,RT	NA|106aa|up_9|NC_016906.1_4567785_4568103_-,NA|127aa|up_7|NC_016906.1_4570009_4570390_-,NA|225aa|up_4|NC_016906.1_4572683_4573358_-,NA|269aa|up_1|NC_016906.1_4576332_4577139_+,NA	NA|106aa|up_9|NC_016906.1_4567785_4568103_-	NA	NA|345aa|up_8|NC_016906.1_4568859_4569894_-	PRK07877, PRK07877, Rv1355c family protein	NA|127aa|up_7|NC_016906.1_4570009_4570390_-	NA	NA|94aa|up_6|NC_016906.1_4570806_4571088_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	c2c9_V-U4|421aa|up_5|NC_016906.1_4571299_4572562_-	pfam07282, OrfB_Zn_ribbon, Putative transposase DNA-binding domain	NA|225aa|up_4|NC_016906.1_4572683_4573358_-	NA	NA|216aa|up_3|NC_016906.1_4573558_4574206_+	COG5606, COG5606, Uncharacterized conserved small protein [Function unknown]	NA|523aa|up_2|NC_016906.1_4574338_4575907_-	pfam01446, Rep_1, Replication protein	NA|269aa|up_1|NC_016906.1_4576332_4577139_+	NA	NA|554aa|up_0|NC_016906.1_4577251_4578913_+	pfam02720, DUF222, Domain of unknown function (DUF222)	NA|312aa|down_0|NC_016906.1_4579169_4580105_-	pfam09678, Caa3_CtaG, Cytochrome c oxidase caa3 assembly factor (Caa3_CtaG)	NA|284aa|down_1|NC_016906.1_4580227_4581079_+	TIGR03707, PPK2_P_aer, polyphosphate kinase 2, PA0141 family	NA|225aa|down_2|NC_016906.1_4581136_4581811_+	cd03426, CoAse, Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes	NA|184aa|down_3|NC_016906.1_4581959_4582511_+	pfam04264, YceI, YceI-like domain	NA|303aa|down_4|NC_016906.1_4583018_4583927_-	TIGR02427, b-ketoadipate_enol-lactone_hydrolase, 3-oxoadipate enol-lactonase	NA|525aa|down_5|NC_016906.1_4584000_4585575_-	pfam05977, MFS_3, Transmembrane secretion effector	NA|316aa|down_6|NC_016906.1_4585634_4586582_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|388aa|down_7|NC_016906.1_4586591_4587755_-	pfam02515, CoA_transf_3, CoA-transferase family III	NA|354aa|down_8|NC_016906.1_4588030_4589092_+	cd07725, TTHA1429-like_MBL-fold, uncharacterized Thermus thermophilus TTHA1429 and related proteins; MBL-fold metallo hydrolase domain	NA|254aa|down_9|NC_016906.1_4589088_4589850_+	PRK08252, PRK08252, crotonase/enoyl-CoA hydratase family protein
