assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000473305.1_ASM47330v1	NZ_CP006772	Bacteroidales bacterium CF chromosome, complete genome	1	157013-157101	1	CRISPRCasFinder	no		DEDDh,cas3,cas4,csa3,cas9,cas1,cas2	Orphan	ATCTTATTTGACGGTTATTTGATA	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,cas4,csa3,cas9,cas1,cas2	NA|244aa|up_8|NZ_CP006772.1_143009_143741_-,NA|215aa|up_2|NZ_CP006772.1_152950_153595_+,NA	NA|163aa|up_9|NZ_CP006772.1_142233_142722_-	cd09011, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|244aa|up_8|NZ_CP006772.1_143009_143741_-	NA	NA|798aa|up_7|NZ_CP006772.1_144589_146983_+	pfam01835, A2M_N, MG2 domain	NA|486aa|up_6|NZ_CP006772.1_147067_148525_-	COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|70aa|up_5|NZ_CP006772.1_148773_148983_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|1069aa|up_4|NZ_CP006772.1_148986_152193_+	cd18011, DEXDc_RapA, DEXH-box helicase domain of RapA	NA|249aa|up_3|NZ_CP006772.1_152189_152936_+	pfam14335, DUF4391, Domain of unknown function (DUF4391)	NA|215aa|up_2|NZ_CP006772.1_152950_153595_+	NA	NA|648aa|up_1|NZ_CP006772.1_153603_155547_+	COG2189, COG2189, Adenine specific DNA methylase Mod [DNA replication, recombination, and repair]	NA|484aa|up_0|NZ_CP006772.1_155558_157010_+	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|1029aa|down_0|NZ_CP006772.1_157124_160211_+	COG3587, COG3587, Restriction endonuclease [Defense mechanisms]	NA|390aa|down_1|NZ_CP006772.1_160249_161419_-	pfam14294, DUF4372, Domain of unknown function (DUF4372)	NA|188aa|down_2|NZ_CP006772.1_161610_162174_+	TIGR02985, Sig70_bacteroi1, RNA polymerase sigma-70 factor, Bacteroides expansion family 1	NA|318aa|down_3|NZ_CP006772.1_162214_163168_+	COG3712, FecR, periplasmic ferric-dicitrate binding protein FecR, regulates iron transport through sigma-19 [Inorganic ion transport and metabolism, Signal transduction mechanisms]	NA|1172aa|down_4|NZ_CP006772.1_163241_166757_+	TIGR04056, OMP_RagA_SusC, TonB-linked outer membrane protein, SusC/RagA family	NA|499aa|down_5|NZ_CP006772.1_166764_168261_+	cd08977, SusD, starch binding outer membrane protein SusD	NA|269aa|down_6|NZ_CP006772.1_168271_169078_+	pfam14717, DUF4465, Domain of unknown function (DUF4465)	NA|372aa|down_7|NZ_CP006772.1_169083_170199_+	pfam13098, Thioredoxin_2, Thioredoxin-like domain	NA|419aa|down_8|NZ_CP006772.1_170396_171653_-	pfam00162, PGK, Phosphoglycerate kinase	NA|310aa|down_9|NZ_CP006772.1_171762_172692_+	COG1092, COG1092, Predicted SAM-dependent methyltransferases [General function prediction only]
GCF_000473305.1_ASM47330v1	NZ_CP006772	Bacteroidales bacterium CF chromosome, complete genome	2	1375777-1375915	1,2	PILER-CR,CRISPRCasFinder	no		DEDDh,cas3,cas4,csa3,cas9,cas1,cas2	Orphan	ATCTTTTCCGGAGCATTGTCGGCCAAT,TTTCCGGAGCATTGTCGGCCAGT	27,23	0	0	NA	NA	NA:NA	2,2	2	Orphan	DEDDh,cas3,cas4,csa3,cas9,cas1,cas2	NA|122aa|up_4|NZ_CP006772.1_1369131_1369497_+,NA|375aa|down_2|NZ_CP006772.1_1378386_1379511_-	NA|391aa|up_9|NZ_CP006772.1_1360771_1361944_+	pfam14294, DUF4372, Domain of unknown function (DUF4372)	NA|691aa|up_8|NZ_CP006772.1_1361974_1364047_+	cd01029, TOPRIM_primases, TOPRIM_primases: The topoisomerase-primase (TORPIM) nucleotidyl transferase/hydrolase domain found in the active site regions of bacterial DnaG-type primases and their homologs	NA|245aa|up_7|NZ_CP006772.1_1364307_1365042_-	pfam01695, IstB_IS21, IstB-like ATP binding protein	NA|520aa|up_6|NZ_CP006772.1_1365069_1366629_-	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|397aa|up_5|NZ_CP006772.1_1367584_1368775_-	cd17370, MFS_MJ1317_like, MJ1317 and similar transporters of the Major Facilitator Superfamily	NA|122aa|up_4|NZ_CP006772.1_1369131_1369497_+	NA	NA|119aa|up_3|NZ_CP006772.1_1369489_1369846_+	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|539aa|up_2|NZ_CP006772.1_1369919_1371536_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|138aa|up_1|NZ_CP006772.1_1372208_1372622_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|849aa|up_0|NZ_CP006772.1_1372907_1375454_-	pfam13715, CarbopepD_reg_2, CarboxypepD_reg-like domain	NA|520aa|down_0|NZ_CP006772.1_1375976_1377536_+	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|228aa|down_1|NZ_CP006772.1_1377586_1378270_+	pfam01695, IstB_IS21, IstB-like ATP binding protein	NA|375aa|down_2|NZ_CP006772.1_1378386_1379511_-	NA	NA|266aa|down_3|NZ_CP006772.1_1379511_1380309_-	cd05374, 17beta-HSD-like_SDR_c, 17beta hydroxysteroid dehydrogenase-like, classical (c) SDRs	NA|397aa|down_4|NZ_CP006772.1_1380336_1381527_-	cd06454, KBL_like, KBL_like; this family belongs to the pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I)	NA|152aa|down_5|NZ_CP006772.1_1381540_1381996_-	pfam04138, GtrA, GtrA-like protein	NA|228aa|down_6|NZ_CP006772.1_1382032_1382716_-	pfam01066, CDP-OH_P_transf, CDP-alcohol phosphatidyltransferase	NA|290aa|down_7|NZ_CP006772.1_1382768_1383638_-	cd07228, Pat_NTE_like_bacteria, Bacterial patatin-like phospholipase domain containing protein 6	NA|92aa|down_8|NZ_CP006772.1_1383776_1384052_-	cd07988, LPLAT_ABO13168-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: Unknown ABO13168	NA|57aa|down_9|NZ_CP006772.1_1384203_1384374_-	pfam08989, DUF1896, Domain of unknown function (DUF1896)
GCF_000473305.1_ASM47330v1	NZ_CP006772	Bacteroidales bacterium CF chromosome, complete genome	3	1713813-1713911	3	CRISPRCasFinder	no		DEDDh,cas3,cas4,csa3,cas9,cas1,cas2	Orphan	TTTATTTTTAGTGGCACTGATTTTT	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,cas4,csa3,cas9,cas1,cas2	NA,NA	NA|210aa|up_9|NZ_CP006772.1_1701817_1702447_+	cd03673, Ap6A_hydrolase, Diadenosine hexaphosphate (Ap6A) hydrolase is a member of the Nudix hydrolase superfamily	NA|229aa|up_8|NZ_CP006772.1_1702731_1703418_-	cd16325, LolA, LolA, a periplasmic chaperone	NA|393aa|up_7|NZ_CP006772.1_1703817_1704996_+	pfam14294, DUF4372, Domain of unknown function (DUF4372)	NA|792aa|up_6|NZ_CP006772.1_1705332_1707708_-	COG1674, FtsK, DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning]	NA|442aa|up_5|NZ_CP006772.1_1707860_1709186_-	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|218aa|up_4|NZ_CP006772.1_1709295_1709949_-	pfam14123, DUF4290, Domain of unknown function (DUF4290)	NA|173aa|up_3|NZ_CP006772.1_1709950_1710469_-	cd09895, NGN_SP_UpxY, N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), UpxY	NA|425aa|up_2|NZ_CP006772.1_1710524_1711799_-	pfam02350, Epimerase_2, UDP-N-acetylglucosamine 2-epimerase	NA|93aa|up_1|NZ_CP006772.1_1711909_1712188_-	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|103aa|up_0|NZ_CP006772.1_1712213_1712522_-	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|266aa|down_0|NZ_CP006772.1_1714005_1714803_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|192aa|down_1|NZ_CP006772.1_1715158_1715734_-	PRK05920, PRK05920, aromatic acid decarboxylase; Validated	NA|662aa|down_2|NZ_CP006772.1_1715907_1717893_-	pfam13360, PQQ_2, PQQ-like domain	NA|424aa|down_3|NZ_CP006772.1_1718370_1719642_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|242aa|down_4|NZ_CP006772.1_1719657_1720383_+	pfam02719, Polysacc_synt_2, Polysaccharide biosynthesis protein	NA|260aa|down_5|NZ_CP006772.1_1720418_1721198_+	COG1596, Wza, Periplasmic protein involved in polysaccharide export, contains    SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis]	NA|793aa|down_6|NZ_CP006772.1_1721208_1723587_+	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|352aa|down_7|NZ_CP006772.1_1723830_1724886_+	TIGR04130, L-fucosamine_synthetase, UDP-N-acetylglucosamine 4,6-dehydratase/5-epimerase	NA|396aa|down_8|NZ_CP006772.1_1725153_1726341_+	pfam13173, AAA_14, AAA domain	NA|118aa|down_9|NZ_CP006772.1_1726428_1726782_+	TIGR02436, S23_ribosomal_protein, four helix bundle protein
GCF_000473305.1_ASM47330v1	NZ_CP006772	Bacteroidales bacterium CF chromosome, complete genome	4	2162869-2165140	4,2	CRISPRCasFinder,PILER-CR	no	cas9,cas1,cas2	DEDDh,cas3,cas4,csa3,cas9,cas1,cas2	Type II-A,Type II-C, Type II-B, or Type II-C?,Type II-B	ACTGTTTCTGATATGTCAAAGATAAAATTTTGAAAGCAAATCACAAC,CTGTTTCTGATATGTCAAAGATAAAATTTTGAAAGCAAATCACAAC	47,46	0	0	NA	NA	NA:NA	29,22	29	TypeII-A,TypeII-B,TypeII-C,orTypeII-C?,TypeII-B	DEDDh,cas3,cas4,csa3,cas9,cas1,cas2	NA|133aa|up_8|NZ_CP006772.1_2150899_2151298_-,NA|255aa|up_7|NZ_CP006772.1_2151342_2152107_-,NA|202aa|up_4|NZ_CP006772.1_2154864_2155470_-,NA|204aa|up_3|NZ_CP006772.1_2155903_2156515_+,NA|148aa|down_0|NZ_CP006772.1_2165697_2166141_+,NA|237aa|down_4|NZ_CP006772.1_2171441_2172152_-	NA|206aa|up_9|NZ_CP006772.1_2150291_2150909_+	TIGR00558, Pyridoxine/pyridoxamine_5'-phosphate_oxidase, pyridoxamine-phosphate oxidase	NA|133aa|up_8|NZ_CP006772.1_2150899_2151298_-	NA	NA|255aa|up_7|NZ_CP006772.1_2151342_2152107_-	NA	NA|408aa|up_6|NZ_CP006772.1_2152133_2153357_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|358aa|up_5|NZ_CP006772.1_2153804_2154878_-	pfam08843, AbiEii, Nucleotidyl transferase AbiEii toxin, Type IV TA system	NA|202aa|up_4|NZ_CP006772.1_2154864_2155470_-	NA	NA|204aa|up_3|NZ_CP006772.1_2155903_2156515_+	NA	cas9|1449aa|up_2|NZ_CP006772.1_2157085_2161432_+	pfam18541, RuvC_III, RuvC endonuclease subdomain 3	cas1|306aa|up_1|NZ_CP006772.1_2161458_2162376_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas2|102aa|up_0|NZ_CP006772.1_2162402_2162708_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|148aa|down_0|NZ_CP006772.1_2165697_2166141_+	NA	NA|243aa|down_1|NZ_CP006772.1_2166133_2166862_+	pfam08843, AbiEii, Nucleotidyl transferase AbiEii toxin, Type IV TA system	NA|642aa|down_2|NZ_CP006772.1_2167167_2169093_-	pfam12969, DUF3857, Domain of Unknown Function with PDB structure (DUF3857)	NA|658aa|down_3|NZ_CP006772.1_2169110_2171084_-	pfam12969, DUF3857, Domain of Unknown Function with PDB structure (DUF3857)	NA|237aa|down_4|NZ_CP006772.1_2171441_2172152_-	NA	NA|657aa|down_5|NZ_CP006772.1_2172307_2174278_-	pfam00912, Transgly, Transglycosylase	NA|1271aa|down_6|NZ_CP006772.1_2174376_2178189_+	sd00036, LRR_3, leucine-rich repeats	NA|429aa|down_7|NZ_CP006772.1_2178465_2179752_-	PRK12391, PRK12391, TrpB-like pyridoxal phosphate-dependent enzyme	NA|780aa|down_8|NZ_CP006772.1_2179991_2182331_-	cd06595, GH31_u1, glycosyl hydrolase family 31 (GH31); uncharacterized subgroup	NA|376aa|down_9|NZ_CP006772.1_2182398_2183526_-	pfam02156, Glyco_hydro_26, Glycosyl hydrolase family 26
