assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000724665.2_ASM72466v3	NZ_CP036542	Bacteroides fragilis strain DCMOUH0018B chromosome, complete genome	1	1762972-1763117	1	PILER-CR	no		cas3,PrimPol,RT,DEDDh,PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cmr1gr7,cas6,csa3,WYL	Orphan	GAAATTCCCAATATATTGTGAATTTGA	27	0	0	NA	NA	NA	2	2	Orphan	cas3,PrimPol,RT,DEDDh,PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cmr1gr7,cas6,csa3,WYL	NA,NA|77aa|down_2|NZ_CP036542.1_1764920_1765151_-,NA|64aa|down_3|NZ_CP036542.1_1765164_1765356_+	NA|363aa|up_9|NZ_CP036542.1_1754109_1755198_+	pfam07610, DUF1573, Protein of unknown function (DUF1573)	NA|364aa|up_8|NZ_CP036542.1_1755206_1756298_+	PRK09435, PRK09435, methylmalonyl Co-A mutase-associated GTPase MeaB	NA|303aa|up_7|NZ_CP036542.1_1756407_1757316_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|276aa|up_6|NZ_CP036542.1_1757407_1758235_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|339aa|up_5|NZ_CP036542.1_1758256_1759273_+	pfam14491, DUF4435, Protein of unknown function (DUF4435)	NA|416aa|up_4|NZ_CP036542.1_1759244_1760492_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|298aa|up_3|NZ_CP036542.1_1760533_1761427_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|280aa|up_2|NZ_CP036542.1_1761429_1762269_-	pfam07804, HipA_C, HipA-like C-terminal domain	NA|110aa|up_1|NZ_CP036542.1_1762432_1762762_-	TIGR03071, couple_hipA, HipA N-terminal domain	NA|71aa|up_0|NZ_CP036542.1_1762758_1762971_-	TIGR03070, couple_hipB, transcriptional regulator, y4mF family	NA|291aa|down_0|NZ_CP036542.1_1763460_1764333_-	pfam14297, DUF4373, Domain of unknown function (DUF4373)	NA|116aa|down_1|NZ_CP036542.1_1764474_1764822_-	pfam10902, WYL_2, WYL_2, Sm-like SH3 beta-barrel fold	NA|77aa|down_2|NZ_CP036542.1_1764920_1765151_-	NA	NA|64aa|down_3|NZ_CP036542.1_1765164_1765356_+	NA	NA|179aa|down_4|NZ_CP036542.1_1765869_1766406_+	cd09895, NGN_SP_UpxY, N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), UpxY	NA|162aa|down_5|NZ_CP036542.1_1766425_1766911_+	pfam06603, UpxZ, UpxZ family of transcription anti-terminator antagonists	NA|403aa|down_6|NZ_CP036542.1_1766977_1768186_+	cd05237, UDP_invert_4-6DH_SDR_e, UDP-Glcnac (UDP-linked N-acetylglucosamine) inverting 4,6-dehydratase, extended (e) SDRs	NA|381aa|down_7|NZ_CP036542.1_1768438_1769581_+	TIGR04181, DegT/DnrJ/EryC1/StrS_aminotransferase, aminotransferase, LLPSF_NHT_00031 family	NA|216aa|down_8|NZ_CP036542.1_1769598_1770246_+	TIGR03570, NeuD_NnaD, sugar O-acyltransferase, sialic acid O-acetyltransferase NeuD family	NA|339aa|down_9|NZ_CP036542.1_1770238_1771255_+	TIGR03569, ORF_8_similar_to_NeuB_family, N-acetylneuraminate synthase
GCF_000724665.2_ASM72466v3	NZ_CP036542	Bacteroides fragilis strain DCMOUH0018B chromosome, complete genome	2	2313742-2313869	1	CRISPRCasFinder	no		cas3,PrimPol,RT,DEDDh,PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cmr1gr7,cas6,csa3,WYL	Orphan	GTTTTCAGACAAAAGAACAAAAGAACACATAAAGAAT	37	0	0	NA	NA	NA	1	1	Orphan	cas3,PrimPol,RT,DEDDh,PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cmr1gr7,cas6,csa3,WYL	NA,NA|108aa|down_1|NZ_CP036542.1_2315117_2315441_-,NA|180aa|down_2|NZ_CP036542.1_2315902_2316442_+,NA|89aa|down_5|NZ_CP036542.1_2318868_2319135_-,NA|443aa|down_6|NZ_CP036542.1_2319982_2321311_+,NA|884aa|down_7|NZ_CP036542.1_2321444_2324096_+,NA|787aa|down_8|NZ_CP036542.1_2324226_2326587_+	NA|443aa|up_9|NZ_CP036542.1_2298116_2299445_-	TIGR04456, hypothetical_protein_ACD_77C00477G0043, LruC domain	NA|430aa|up_8|NZ_CP036542.1_2299715_2301005_-	pfam16130, DUF4842, Domain of unknown function (DUF4842)	NA|425aa|up_7|NZ_CP036542.1_2301234_2302509_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|1012aa|up_6|NZ_CP036542.1_2302535_2305571_-	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|351aa|up_5|NZ_CP036542.1_2305713_2306766_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|772aa|up_4|NZ_CP036542.1_2306913_2309229_-	PRK13807, PRK13807, maltose phosphorylase; Provisional	NA|159aa|up_3|NZ_CP036542.1_2309495_2309972_-	pfam18291, HU-HIG, HU domain fused to wHTH, Ig, or Glycine-rich motif	NA|82aa|up_2|NZ_CP036542.1_2310346_2310592_+	pfam14053, DUF4248, Domain of unknown function (DUF4248)	NA|623aa|up_1|NZ_CP036542.1_2310610_2312479_-	cd01122, GP4d_helicase, GP4d_helicase is a homohexameric 5'-3' helicases	NA|305aa|up_0|NZ_CP036542.1_2312705_2313620_-	pfam12784, PDDEXK_2, PD-(D/E)XK nuclease family transposase	NA|326aa|down_0|NZ_CP036542.1_2314024_2315002_-	pfam14297, DUF4373, Domain of unknown function (DUF4373)	NA|108aa|down_1|NZ_CP036542.1_2315117_2315441_-	NA	NA|180aa|down_2|NZ_CP036542.1_2315902_2316442_+	NA	NA|424aa|down_3|NZ_CP036542.1_2316444_2317716_+	pfam12099, DUF3575, Protein of unknown function (DUF3575)	NA|339aa|down_4|NZ_CP036542.1_2317712_2318729_+	pfam08842, Mfa2, Fimbrillin-A associated anchor proteins Mfa1 and Mfa2	NA|89aa|down_5|NZ_CP036542.1_2318868_2319135_-	NA	NA|443aa|down_6|NZ_CP036542.1_2319982_2321311_+	NA	NA|884aa|down_7|NZ_CP036542.1_2321444_2324096_+	NA	NA|787aa|down_8|NZ_CP036542.1_2324226_2326587_+	NA	NA|778aa|down_9|NZ_CP036542.1_2326632_2328966_+	pfam16249, DUF4906, Domain of unknown function (DUF4906)
GCF_000724665.2_ASM72466v3	NZ_CP036542	Bacteroides fragilis strain DCMOUH0018B chromosome, complete genome	3	3409741-3410268	2,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cmr1gr7,cas6	cas3,PrimPol,RT,DEDDh,PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cmr1gr7,cas6,csa3,WYL	Type III-D,Type III-A,Type III-C,Type III-B	TGTCTTAATCCTTATTATACTGGAATACATCTACAT,TGTCTTAATCCTTATTATACTGGAATACATCTACAT,TGTCTTAATCCTTATTATACTGGAATACATCTACAT	36,36,36	0	0	NA	NA	NA:NA:NA	7,7,7	7	TypeIII-D,TypeIII-A,TypeIII-C,TypeIII-B	cas3,PrimPol,RT,DEDDh,PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cmr1gr7,cas6,csa3,WYL	NA|94aa|up_9|NZ_CP036542.1_3400353_3400635_-,cmr1gr7|467aa|down_7|NZ_CP036542.1_3418457_3419858_-,NA|160aa|down_8|NZ_CP036542.1_3419871_3420351_-	NA|94aa|up_9|NZ_CP036542.1_3400353_3400635_-	NA	NA|265aa|up_8|NZ_CP036542.1_3400641_3401436_-	pfam03649, UPF0014, Uncharacterized protein family (UPF0014)	NA|207aa|up_7|NZ_CP036542.1_3401440_3402061_-	cd03257, ABC_NikE_OppD_transporters, ATP-binding cassette domain of nickel/oligopeptides specific transporters	NA|177aa|up_6|NZ_CP036542.1_3402155_3402686_+	COG4739, COG4739, Uncharacterized protein containing a ferredoxin domain [Function unknown]	NA|372aa|up_5|NZ_CP036542.1_3402760_3403876_+	pfam04371, PAD_porph, Porphyromonas-type peptidyl-arginine deiminase	NA|295aa|up_4|NZ_CP036542.1_3403888_3404773_+	cd07573, CPA, N-carbamoylputrescine amidohydrolase (CPA) (class 11 nitrilases)	NA|128aa|up_3|NZ_CP036542.1_3404855_3405239_-	pfam04138, GtrA, GtrA-like protein	NA|586aa|up_2|NZ_CP036542.1_3405239_3406997_-	PRK00476, aspS, aspartyl-tRNA synthetase; Validated	NA|348aa|up_1|NZ_CP036542.1_3407080_3408124_-	COG1597, LCB5, Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only]	NA|395aa|up_0|NZ_CP036542.1_3408358_3409543_+	cd06454, KBL_like, KBL_like; this family belongs to the pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I)	cas2|97aa|down_0|NZ_CP036542.1_3410715_3411006_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|757aa|down_1|NZ_CP036542.1_3410999_3413270_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	cmr6gr7|315aa|down_2|NZ_CP036542.1_3413289_3414234_-	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	cmr5gr11|137aa|down_3|NZ_CP036542.1_3414236_3414647_-	pfam09701, Cas_Cmr5, CRISPR-associated protein (Cas_Cmr5)	cmr4gr7|280aa|down_4|NZ_CP036542.1_3414659_3415499_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr3gr5|379aa|down_5|NZ_CP036542.1_3415518_3416655_-	TIGR01888, Hypothetical_protein_SSO1730, CRISPR type III-B/RAMP module-associated protein Cmr3	cas10|601aa|down_6|NZ_CP036542.1_3416647_3418450_-	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr1gr7|467aa|down_7|NZ_CP036542.1_3418457_3419858_-	NA	NA|160aa|down_8|NZ_CP036542.1_3419871_3420351_-	NA	NA|492aa|down_9|NZ_CP036542.1_3420347_3421823_-	cd12822, TmCorA-like, Thermotoga maritima CorA-like family
