assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_902386045.1_UHGG_MGYG-HGUT-02340	NZ_LR698981	Clostridiales bacterium isolate MGYG-HGUT-02340 chromosome 1	1	570028-570153	1	CRISPRCasFinder	no		cas14j,csa3,cas3,WYL,DEDDh,cas3HD,cas2,cas1,cas7,cas8c,cas5	Orphan	GGCCCCTCCCGCGGCGCAGGAGAAGGCCATTGAGGCGGCCAAG	43	0	0	NA	NA	NA	1	1	Orphan	cas14j,csa3,cas3,WYL,DEDDh,cas3HD,cas2,cas1,cas7,cas8c,cas5	NA,NA|64aa|down_8|NZ_LR698981.1_580656_580848_+	NA|325aa|up_9|NZ_LR698981.1_556848_557823_+	COG0618, COG0618, Exopolyphosphatase-related proteins [General function prediction only]	NA|292aa|up_8|NZ_LR698981.1_557840_558716_+	cd02573, PseudoU_synth_EcTruB, Pseudouridine synthase, Escherichia coli TruB like	NA|299aa|up_7|NZ_LR698981.1_558751_559648_+	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|466aa|up_6|NZ_LR698981.1_559673_561071_+	pfam07613, DUF1576, Protein of unknown function (DUF1576)	NA|788aa|up_5|NZ_LR698981.1_561137_563501_-	TIGR02865, Stage_II_sporulation_protein_E, stage II sporulation protein E	NA|324aa|up_4|NZ_LR698981.1_563660_564632_+	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|199aa|up_3|NZ_LR698981.1_564695_565292_+	PRK09652, PRK09652, RNA polymerase sigma factor RpoE; Provisional	NA|119aa|up_2|NZ_LR698981.1_565345_565702_+	pfam13788, DUF4180, Domain of unknown function (DUF4180)	NA|294aa|up_1|NZ_LR698981.1_565783_566665_+	pfam13490, zf-HC2, Putative zinc-finger	NA|265aa|up_0|NZ_LR698981.1_566996_567791_-	pfam07833, Cu_amine_oxidN1, Copper amine oxidase N-terminal domain	NA|259aa|down_0|NZ_LR698981.1_570294_571071_+	PRK05809, PRK05809, short-chain-enoyl-CoA hydratase	NA|284aa|down_1|NZ_LR698981.1_571156_572008_-	PRK05808, PRK05808, 3-hydroxybutyryl-CoA dehydrogenase; Validated	NA|731aa|down_2|NZ_LR698981.1_572121_574314_-	PRK09426, PRK09426, methylmalonyl-CoA mutase; Reviewed	NA|630aa|down_3|NZ_LR698981.1_574313_576203_-	pfam01642, MM_CoA_mutase, Methylmalonyl-CoA mutase	NA|448aa|down_4|NZ_LR698981.1_576305_577649_-	pfam03977, OAD_beta, Na+-transporting oxaloacetate decarboxylase beta subunit	NA|112aa|down_5|NZ_LR698981.1_578122_578458_-	pfam04277, OAD_gamma, Oxaloacetate decarboxylase, gamma chain	NA|514aa|down_6|NZ_LR698981.1_578476_580018_-	TIGR01117, methylmalonyl-CoA_decarboxylase_alpha-subunit, methylmalonyl-CoA decarboxylase alpha subunit	NA|136aa|down_7|NZ_LR698981.1_580043_580451_-	TIGR03081, Methylmalonyl-CoA_epimerase_mitochondrial, methylmalonyl-CoA epimerase	NA|64aa|down_8|NZ_LR698981.1_580656_580848_+	NA	NA|400aa|down_9|NZ_LR698981.1_580844_582044_-	cd00751, thiolase, Thiolase are ubiquitous enzymes that catalyze the reversible thiolytic cleavage of 3-ketoacyl-CoA into acyl-CoA and acetyl-CoA, a 2-step reaction involving a covalent intermediate formed with a catalytic cysteine
GCF_902386045.1_UHGG_MGYG-HGUT-02340	NZ_LR698981	Clostridiales bacterium isolate MGYG-HGUT-02340 chromosome 1	2	3073963-3077986	1,1,2	CRT,PILER-CR,CRISPRCasFinder	no	cas2,cas1,cas7,cas8c,cas5,cas3	cas14j,csa3,cas3,WYL,DEDDh,cas3HD,cas2,cas1,cas7,cas8c,cas5	 Type I-U?,Type I-C,Type I-U	ATTTCAATCCACGCCCCCCGTGTGGGGGGCGACN,ATTTCAATCCACGCCCCCCGTGTGGGGGGCGAC,ATTTCAATCCACGCCCCCCGTGTGGGGGGCGAC	34,33,33	2	2	3076024-3076056|3076090-3076128	NZ_LR698981.1_1653795-1653763|NZ_LR698981.1_2482380-2482342	I-C:I-C:I-C	59,58,58	59	TypeI-U?,TypeI-C,TypeI-U	cas14j,csa3,cas3,WYL,DEDDh,cas3HD,cas2,cas1,cas7,cas8c,cas5	NA,NA|57aa|down_7|NZ_LR698981.1_3088035_3088206_+,NA|113aa|down_9|NZ_LR698981.1_3088529_3088868_+	NA|326aa|up_9|NZ_LR698981.1_3065569_3066547_+	cd07304, Chorismate_synthase, Chorismase synthase, the enzyme catalyzing the final step of the shikimate pathway	NA|397aa|up_8|NZ_LR698981.1_3066556_3067747_+	cd13631, PBP2_Ct-PDT_like, Catalytic domain of prephenate dehydratase from Chlorobium tepidum and similar proteins, subgroup 2; the type 2 periplasmic binding protein fold	NA|164aa|up_7|NZ_LR698981.1_3067743_3068235_+	PRK00131, aroK, shikimate kinase; Reviewed	NA|144aa|up_6|NZ_LR698981.1_3068231_3068663_+	pfam01220, DHquinase_II, Dehydroquinase class II	NA|284aa|up_5|NZ_LR698981.1_3068647_3069499_+	PRK00258, aroE, shikimate 5-dehydrogenase; Reviewed	NA|83aa|up_4|NZ_LR698981.1_3069652_3069901_+	pfam12116, SpoIIID, Stage III sporulation protein D	NA|45aa|up_3|NZ_LR698981.1_3070012_3070147_-	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|365aa|up_2|NZ_LR698981.1_3070244_3071339_-	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|451aa|up_1|NZ_LR698981.1_3071600_3072953_+	COG2031, AtoE, Short chain fatty acids transporter [Lipid metabolism]	NA|147aa|up_0|NZ_LR698981.1_3073039_3073480_-	cd06471, ACD_LpsHSP_like, Group of bacterial proteins containing an alpha crystallin domain (ACD) similar to Lactobacillus plantarum (Lp) small heat shock proteins (sHsp) HSP 18	cas2|97aa|down_0|NZ_LR698981.1_3078156_3078447_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|NZ_LR698981.1_3078451_3079483_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas7|284aa|down_2|NZ_LR698981.1_3080130_3080982_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|589aa|down_3|NZ_LR698981.1_3080982_3082749_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|243aa|down_4|NZ_LR698981.1_3082748_3083477_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|811aa|down_5|NZ_LR698981.1_3083481_3085914_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|189aa|down_6|NZ_LR698981.1_3086586_3087153_-	COG4430, COG4430, Uncharacterized protein conserved in bacteria [Function unknown]	NA|57aa|down_7|NZ_LR698981.1_3088035_3088206_+	NA	NA|61aa|down_8|NZ_LR698981.1_3088198_3088381_+	TIGR01764, Probable_excisionase, DNA binding domain, excisionase family	NA|113aa|down_9|NZ_LR698981.1_3088529_3088868_+	NA
