assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_902386185.1_UHGG_MGYG-HGUT-02352	NZ_LR698982	Gordonibacter urolithinfaciens isolate MGYG-HGUT-02352 chromosome 1	1	66184-66266	1	CRISPRCasFinder	no		cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,WYL,DinG,PD-DExK,csa3	Orphan	GGACAGGGGGACGGTTCTCTTGTCCC	26	0	0	NA	NA	NA	1	1	Orphan	cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,WYL,DinG,PD-DExK,csa3	NA,NA	NA|269aa|up_9|NZ_LR698982.1_52475_53282_-	PRK00117, recX, recombination regulator RecX; Reviewed	NA|353aa|up_8|NZ_LR698982.1_53283_54342_-	PRK09354, recA, recombinase A; Provisional	NA|424aa|up_7|NZ_LR698982.1_54627_55899_-	pfam02464, CinA, Competence-damaged protein	NA|443aa|up_6|NZ_LR698982.1_55908_57237_-	COG0621, MiaB, 2-methylthioadenine synthetase [Translation, ribosomal structure and biogenesis]	NA|166aa|up_5|NZ_LR698982.1_57245_57743_-	cd11740, YajQ_like, Proteins similar to Escherichia coli YajQ	NA|374aa|up_4|NZ_LR698982.1_58164_59286_-	pfam13413, HTH_25, Helix-turn-helix domain	NA|852aa|up_3|NZ_LR698982.1_59299_61855_-	COG1674, FtsK, DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning]	NA|634aa|up_2|NZ_LR698982.1_61949_63851_-	COG0595, COG0595, mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair]	NA|300aa|up_1|NZ_LR698982.1_64066_64966_-	PRK03170, PRK03170, dihydrodipicolinate synthase; Provisional	NA|256aa|up_0|NZ_LR698982.1_65076_65844_-	PRK00048, PRK00048, dihydrodipicolinate reductase; Provisional	NA|750aa|down_0|NZ_LR698982.1_66398_68648_-	PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase; Provisional	NA|96aa|down_1|NZ_LR698982.1_68920_69208_-	PRK05626, rpsO, 30S ribosomal protein S15; Reviewed	NA|152aa|down_2|NZ_LR698982.1_69448_69904_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|384aa|down_3|NZ_LR698982.1_70072_71224_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|272aa|down_4|NZ_LR698982.1_71287_72103_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|247aa|down_5|NZ_LR698982.1_72280_73021_+	pfam13529, Peptidase_C39_2, Peptidase_C39 like family	NA|83aa|down_6|NZ_LR698982.1_73005_73254_-	pfam12637, TSCPD, TSCPD domain	NA|257aa|down_7|NZ_LR698982.1_73335_74106_-	PRK00085, recO, DNA repair protein RecO; Reviewed	NA|525aa|down_8|NZ_LR698982.1_74111_75686_-	COG2815, COG2815, Uncharacterized protein conserved in bacteria [Function unknown]	NA|306aa|down_9|NZ_LR698982.1_75845_76763_-	PRK00089, era, GTPase Era; Reviewed
GCF_902386185.1_UHGG_MGYG-HGUT-02352	NZ_LR698982	Gordonibacter urolithinfaciens isolate MGYG-HGUT-02352 chromosome 1	2	397600-398159	1,1,2	CRT,PILER-CR,CRISPRCasFinder	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,WYL,DinG,PD-DExK,csa3	 Type I-U?,Type I-C,Type I-U	NGTTTCAACCCGCGAGCCCGCGAGGGCTCGAC,GTTTCAACCCGCGAGCCCGCGA--GGGCTCGAC,GTTTCAACCCGCGAGCCCGCGAGGGCTCGAC	32,33,31	0	0	NA	NA	NA:NA:NA	8,7,7	8	TypeI-U?,TypeI-C,TypeI-U	cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,WYL,DinG,PD-DExK,csa3	NA|94aa|up_5|NZ_LR698982.1_392152_392434_-,NA|182aa|up_3|NZ_LR698982.1_395070_395616_-,NA	NA|484aa|up_9|NZ_LR698982.1_386860_388312_-	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|248aa|up_8|NZ_LR698982.1_389485_390229_-	TIGR00927, retinal_rod, K+-dependent Na+/Ca+ exchanger	NA|155aa|up_7|NZ_LR698982.1_390281_390746_-	pfam01668, SmpB, SmpB protein	NA|400aa|up_6|NZ_LR698982.1_390751_391951_-	cd05656, M42_Frv, M42 Peptidase, endoglucanases	NA|94aa|up_5|NZ_LR698982.1_392152_392434_-	NA	NA|783aa|up_4|NZ_LR698982.1_392611_394960_+	PRK14501, PRK14501, putative bifunctional trehalose-6-phosphate synthase/HAD hydrolase subfamily IIB; Provisional	NA|182aa|up_3|NZ_LR698982.1_395070_395616_-	NA	NA|152aa|up_2|NZ_LR698982.1_395612_396068_-	pfam04074, DUF386, Domain of unknown function (DUF386)	NA|134aa|up_1|NZ_LR698982.1_396185_396587_-	cd10456, GIY-YIG_UPF0213, The GIY-YIG domain of uncharacterized protein family UPF0213 related to structure-specific endonuclease SLX1	NA|215aa|up_0|NZ_LR698982.1_396577_397222_-	smart00318, SNc, Staphylococcal nuclease homologues	cas2|97aa|down_0|NZ_LR698982.1_398350_398641_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|342aa|down_1|NZ_LR698982.1_398643_399669_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|220aa|down_2|NZ_LR698982.1_399628_400288_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas7|302aa|down_3|NZ_LR698982.1_400291_401197_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|609aa|down_4|NZ_LR698982.1_401206_403033_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|222aa|down_5|NZ_LR698982.1_403029_403695_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|766aa|down_6|NZ_LR698982.1_403749_406047_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|415aa|down_7|NZ_LR698982.1_406203_407448_-	cd17792, CtkA, serine/threonine-protein kinase CtkA and similar proteins	NA|743aa|down_8|NZ_LR698982.1_408654_410883_+	cd02759, MopB_Acetylene-hydratase, The MopB_Acetylene-hydratase CD contains acetylene hydratase (Ahy) and other related proteins	NA|73aa|down_9|NZ_LR698982.1_411043_411262_+	pfam14229, DUF4332, Domain of unknown function (DUF4332)
GCF_902386185.1_UHGG_MGYG-HGUT-02352	NZ_LR698982	Gordonibacter urolithinfaciens isolate MGYG-HGUT-02352 chromosome 1	3	407623-408130	2,2,3	CRT,PILER-CR,CRISPRCasFinder	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,WYL,DinG,PD-DExK,csa3	 Type I-U?,Type I-C,Type I-U	GTTTCAACCCACGCGCCCCGCGAGGGGCGCGAC,GTTTCAACCCACGCGCCCCGCGAGGGGCGCGAC,GTTTCAACCCACGCGCCCCGCGAGGGGCGCGAC	33,33,33	0	0	NA	NA	NA:NA:NA	7,6,6	7	TypeI-U?,TypeI-C,TypeI-U	cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,WYL,DinG,PD-DExK,csa3	NA,NA|48aa|down_5|NZ_LR698982.1_415233_415377_+	NA|134aa|up_9|NZ_LR698982.1_396185_396587_-	cd10456, GIY-YIG_UPF0213, The GIY-YIG domain of uncharacterized protein family UPF0213 related to structure-specific endonuclease SLX1	NA|215aa|up_8|NZ_LR698982.1_396577_397222_-	smart00318, SNc, Staphylococcal nuclease homologues	cas2|97aa|up_7|NZ_LR698982.1_398350_398641_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|342aa|up_6|NZ_LR698982.1_398643_399669_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|220aa|up_5|NZ_LR698982.1_399628_400288_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas7|302aa|up_4|NZ_LR698982.1_400291_401197_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|609aa|up_3|NZ_LR698982.1_401206_403033_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|222aa|up_2|NZ_LR698982.1_403029_403695_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|766aa|up_1|NZ_LR698982.1_403749_406047_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|415aa|up_0|NZ_LR698982.1_406203_407448_-	cd17792, CtkA, serine/threonine-protein kinase CtkA and similar proteins	NA|743aa|down_0|NZ_LR698982.1_408654_410883_+	cd02759, MopB_Acetylene-hydratase, The MopB_Acetylene-hydratase CD contains acetylene hydratase (Ahy) and other related proteins	NA|73aa|down_1|NZ_LR698982.1_411043_411262_+	pfam14229, DUF4332, Domain of unknown function (DUF4332)	NA|260aa|down_2|NZ_LR698982.1_411636_412416_+	TIGR02366, conserved_hypothetical_protein, probable dihydroxyacetone kinase regulator	NA|86aa|down_3|NZ_LR698982.1_412397_412655_+	PRK14559, PRK14559, serine/threonine phosphatase	NA|387aa|down_4|NZ_LR698982.1_413899_415060_-	pfam04463, DUF523, Protein of unknown function (DUF523)	NA|48aa|down_5|NZ_LR698982.1_415233_415377_+	NA	NA|76aa|down_6|NZ_LR698982.1_415575_415803_-	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|74aa|down_7|NZ_LR698982.1_416102_416324_-	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|73aa|down_8|NZ_LR698982.1_416436_416655_+	COG3655, COG3655, Predicted transcriptional regulator [Transcription]	NA|191aa|down_9|NZ_LR698982.1_416721_417294_+	cd03674, Nudix_Hydrolase_1, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X
GCF_902386185.1_UHGG_MGYG-HGUT-02352	NZ_LR698982	Gordonibacter urolithinfaciens isolate MGYG-HGUT-02352 chromosome 1	4	412770-413801	4,3,3	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,WYL,DinG,PD-DExK,csa3	 Type I-U?,Type I-C,Type I-U	GTTTCAACCCACGCGCCCCTATGGGGCGCGAC,GTTTCAACCCACGCGCCCCTATGGGGCGCGAC,GTTTCAACCCACGCGCCCC-TATGGGGCGCGAC	32,32,33	0	0	NA	NA	I-C:I-C:NA	15,15,15	15	TypeI-U?,TypeI-C,TypeI-U	cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,WYL,DinG,PD-DExK,csa3	NA,NA|48aa|down_1|NZ_LR698982.1_415233_415377_+	cas4|220aa|up_9|NZ_LR698982.1_399628_400288_-	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas7|302aa|up_8|NZ_LR698982.1_400291_401197_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|609aa|up_7|NZ_LR698982.1_401206_403033_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|222aa|up_6|NZ_LR698982.1_403029_403695_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|766aa|up_5|NZ_LR698982.1_403749_406047_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|415aa|up_4|NZ_LR698982.1_406203_407448_-	cd17792, CtkA, serine/threonine-protein kinase CtkA and similar proteins	NA|743aa|up_3|NZ_LR698982.1_408654_410883_+	cd02759, MopB_Acetylene-hydratase, The MopB_Acetylene-hydratase CD contains acetylene hydratase (Ahy) and other related proteins	NA|73aa|up_2|NZ_LR698982.1_411043_411262_+	pfam14229, DUF4332, Domain of unknown function (DUF4332)	NA|260aa|up_1|NZ_LR698982.1_411636_412416_+	TIGR02366, conserved_hypothetical_protein, probable dihydroxyacetone kinase regulator	NA|86aa|up_0|NZ_LR698982.1_412397_412655_+	PRK14559, PRK14559, serine/threonine phosphatase	NA|387aa|down_0|NZ_LR698982.1_413899_415060_-	pfam04463, DUF523, Protein of unknown function (DUF523)	NA|48aa|down_1|NZ_LR698982.1_415233_415377_+	NA	NA|76aa|down_2|NZ_LR698982.1_415575_415803_-	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|74aa|down_3|NZ_LR698982.1_416102_416324_-	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|73aa|down_4|NZ_LR698982.1_416436_416655_+	COG3655, COG3655, Predicted transcriptional regulator [Transcription]	NA|191aa|down_5|NZ_LR698982.1_416721_417294_+	cd03674, Nudix_Hydrolase_1, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|247aa|down_6|NZ_LR698982.1_417382_418123_-	pfam08239, SH3_3, Bacterial SH3 domain	NA|1101aa|down_7|NZ_LR698982.1_418451_421754_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|855aa|down_8|NZ_LR698982.1_421895_424460_-	PRK05654, PRK05654, acetyl-CoA carboxylase carboxyltransferase subunit beta	NA|152aa|down_9|NZ_LR698982.1_424498_424954_-	PRK00006, fabZ, 3-hydroxyacyl-ACP dehydratase FabZ
GCF_902386185.1_UHGG_MGYG-HGUT-02352	NZ_LR698982	Gordonibacter urolithinfaciens isolate MGYG-HGUT-02352 chromosome 1	5	1648429-1648548	4	PILER-CR	no	DinG	cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,WYL,DinG,PD-DExK,csa3	Type IV-A	ATGTTTCACGTGAAACA	17	0	0	NA	NA	NA	2	2	Orphan	cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,WYL,DinG,PD-DExK,csa3	NA,NA|214aa|down_0|NZ_LR698982.1_1648559_1649201_-,NA|51aa|down_4|NZ_LR698982.1_1657400_1657553_+,NA|105aa|down_9|NZ_LR698982.1_1661044_1661359_+	NA|110aa|up_9|NZ_LR698982.1_1640674_1641004_+	pfam00825, Ribonuclease_P, Ribonuclease P	NA|81aa|up_8|NZ_LR698982.1_1641014_1641257_+	pfam01809, Haemolytic, Haemolytic domain	NA|257aa|up_7|NZ_LR698982.1_1641305_1642076_+	pfam02096, 60KD_IMP, 60Kd inner membrane protein	NA|178aa|up_6|NZ_LR698982.1_1642167_1642701_+	COG1847, Jag, Predicted RNA-binding protein [General function prediction only]	NA|218aa|up_5|NZ_LR698982.1_1642809_1643463_+	PRK00107, gidB, 16S rRNA (guanine(527)-N(7))-methyltransferase RsmG	NA|134aa|up_4|NZ_LR698982.1_1643533_1643935_-	cd01038, Endonuclease_DUF559, Domain of unknown function, appears to be related to a diverse group of endonucleases	NA|250aa|up_3|NZ_LR698982.1_1644424_1645174_+	pfam13614, AAA_31, AAA domain	NA|354aa|up_2|NZ_LR698982.1_1645166_1646228_+	TIGR04285, parB-like_partition_protein, nucleoid occlusion protein	NA|350aa|up_1|NZ_LR698982.1_1646321_1647371_-	PRK14454, PRK14454, 23S rRNA (adenine(2503)-C(2))-methyltransferase RlmN	NA|316aa|up_0|NZ_LR698982.1_1647452_1648400_+	PRK01372, ddl, D-alanine--D-alanine ligase; Reviewed	NA|214aa|down_0|NZ_LR698982.1_1648559_1649201_-	NA	DinG|1032aa|down_1|NZ_LR698982.1_1649344_1652440_+	PRK08074, PRK08074, bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon; Validated	NA|464aa|down_2|NZ_LR698982.1_1652468_1653860_+	PRK05035, PRK05035, electron transport complex protein RnfC; Provisional	NA|1082aa|down_3|NZ_LR698982.1_1654010_1657256_+	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|51aa|down_4|NZ_LR698982.1_1657400_1657553_+	NA	NA|222aa|down_5|NZ_LR698982.1_1657841_1658507_+	COG0546, Gph, Predicted phosphatases [General function prediction only]	NA|162aa|down_6|NZ_LR698982.1_1658545_1659031_+	pfam04892, VanZ, VanZ like family	NA|81aa|down_7|NZ_LR698982.1_1659049_1659292_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|367aa|down_8|NZ_LR698982.1_1659969_1661070_+	COG1316, LytR, Transcriptional regulator [Transcription]	NA|105aa|down_9|NZ_LR698982.1_1661044_1661359_+	NA
