assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_007164725.1_ASM716472v1	NZ_AP019697	Dialister hominis strain 5BBH33	1	143017-143121	1	CRISPRCasFinder	no		csa3,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DEDDh,cas3	Orphan	CTAATTCCCCACTGGGAAATCCTGAA	26	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DEDDh,cas3	NA,NA|73aa|down_4|NZ_AP019697.1_147179_147398_-,NA|282aa|down_8|NZ_AP019697.1_150775_151621_+	NA|435aa|up_9|NZ_AP019697.1_132076_133381_+	COG1757, NhaC, Na+/H+ antiporter [Energy production and conversion]	NA|186aa|up_8|NZ_AP019697.1_133493_134051_+	pfam10050, DUF2284, Predicted metal-binding protein (DUF2284)	NA|292aa|up_7|NZ_AP019697.1_134229_135105_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|211aa|up_6|NZ_AP019697.1_135166_135799_-	pfam13518, HTH_28, Helix-turn-helix domain	NA|289aa|up_5|NZ_AP019697.1_136058_136925_-	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]	NA|108aa|up_4|NZ_AP019697.1_136936_137260_-	COG2076, EmrE, Membrane transporters of cations and cationic drugs [Inorganic ion transport and metabolism]	NA|465aa|up_3|NZ_AP019697.1_137341_138736_-	COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide transport and    metabolism]	NA|309aa|up_2|NZ_AP019697.1_138946_139873_+	COG1313, PflX, Uncharacterized Fe-S protein PflX, homolog of pyruvate formate lyase activating proteins [General function prediction only]	NA|132aa|up_1|NZ_AP019697.1_140545_140941_-	COG5496, COG5496, Predicted thioesterase [General function prediction only]	NA|408aa|up_0|NZ_AP019697.1_141289_142513_-	pfam01548, DEDD_Tnp_IS110, Transposase	NA|330aa|down_0|NZ_AP019697.1_143581_144571_+	pfam02618, YceG, YceG-like family	NA|203aa|down_1|NZ_AP019697.1_144554_145163_+	COG4122, COG4122, Predicted O-methyltransferase [General function prediction only]	NA|409aa|down_2|NZ_AP019697.1_145166_146393_+	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|81aa|down_3|NZ_AP019697.1_146389_146632_+	pfam16256, DUF4911, Domain of unknown function (DUF4911)	NA|73aa|down_4|NZ_AP019697.1_147179_147398_-	NA	NA|136aa|down_5|NZ_AP019697.1_147515_147923_+	COG5015, COG5015, Uncharacterized conserved protein [Function unknown]	NA|482aa|down_6|NZ_AP019697.1_148084_149530_+	PRK09243, PRK09243, nicotinate phosphoribosyltransferase; Validated	NA|313aa|down_7|NZ_AP019697.1_149840_150779_+	COG1754, COG1754, Uncharacterized C-terminal domain of topoisomerase IA [General function prediction only]	NA|282aa|down_8|NZ_AP019697.1_150775_151621_+	NA	NA|358aa|down_9|NZ_AP019697.1_152124_153198_-	pfam01548, DEDD_Tnp_IS110, Transposase
GCF_007164725.1_ASM716472v1	NZ_AP019697	Dialister hominis strain 5BBH33	2	265680-272986	2,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f	csa3,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DEDDh,cas3	Type I-F	GATCACTGCCGCATAGGCAGCTTAGAAA,GATCACTGCCGCATAGGCAGCTTAGAAA,GATCACTGCCGCATAGGCAGCTTAGAAA	28,28,28	4	4	266309-266340|266911-266942|267513-267544|268115-268146	NZ_AP019697.1_401215-401246|NZ_AP019697.1_401215-401246|NZ_AP019697.1_401215-401246|NZ_AP019697.1_401215-401246	I-F:I-F:I-F	121,121,118	121	TypeI-F	csa3,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DEDDh,cas3	NA,NA|371aa|down_0|NZ_AP019697.1_273381_274494_+	NA|510aa|up_9|NZ_AP019697.1_253052_254582_+	pfam01235, Na_Ala_symp, Sodium:alanine symporter family	NA|194aa|up_8|NZ_AP019697.1_255054_255636_+	cd03139, GATase1_PfpI_2, Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus	NA|224aa|up_7|NZ_AP019697.1_255700_256372_-	pfam14358, DUF4405, Domain of unknown function (DUF4405)	NA|214aa|up_6|NZ_AP019697.1_256532_257174_-	pfam14358, DUF4405, Domain of unknown function (DUF4405)	cas1|322aa|up_5|NZ_AP019697.1_257423_258389_+	TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype I-F/YPEST	cas3-cas2|1126aa|up_4|NZ_AP019697.1_258385_261763_+	cd09673, Cas3_Cas2_I-F, CRISPR/Cas system-associated protein Cas3/Cas2	cas8f|408aa|up_3|NZ_AP019697.1_261784_263008_+	pfam09611, Cas_Csy1, CRISPR-associated protein (Cas_Csy1)	cas5f|303aa|up_2|NZ_AP019697.1_263004_263913_+	pfam09614, Cas_Csy2, CRISPR-associated protein (Cas_Csy2)	cas7f|344aa|up_1|NZ_AP019697.1_263930_264962_+	pfam09615, Cas_Csy3, CRISPR-associated protein (Cas_Csy3)	cas6f|193aa|up_0|NZ_AP019697.1_264968_265547_+	pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4)	NA|371aa|down_0|NZ_AP019697.1_273381_274494_+	NA	NA|390aa|down_1|NZ_AP019697.1_274577_275747_-	PRK02627, PRK02627, acetylornithine aminotransferase; Provisional	NA|286aa|down_2|NZ_AP019697.1_275773_276631_-	PRK00942, PRK00942, acetylglutamate kinase; Provisional	NA|325aa|down_3|NZ_AP019697.1_276692_277667_-	PRK11863, PRK11863, N-acetyl-gamma-glutamyl-phosphate reductase; Provisional	NA|338aa|down_4|NZ_AP019697.1_277685_278699_-	PRK02102, PRK02102, ornithine carbamoyltransferase; Validated	NA|393aa|down_5|NZ_AP019697.1_278737_279916_-	cd03886, M20_Acy1, M20 Peptidase Aminoacylase 1 family	NA|681aa|down_6|NZ_AP019697.1_280203_282246_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|518aa|down_7|NZ_AP019697.1_282770_284324_+	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|94aa|down_8|NZ_AP019697.1_284479_284761_+	pfam00462, Glutaredoxin, Glutaredoxin	NA|296aa|down_9|NZ_AP019697.1_285032_285920_-	COG0583, LysR, Transcriptional regulator [Transcription]
GCF_007164725.1_ASM716472v1	NZ_AP019697	Dialister hominis strain 5BBH33	3	1846762-1846845	3	CRISPRCasFinder	no		csa3,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DEDDh,cas3	Orphan	TTCACCGTCTTCATGGCAGTGGCA	24	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DEDDh,cas3	NA|207aa|up_8|NZ_AP019697.1_1834926_1835547_-,NA	NA|355aa|up_9|NZ_AP019697.1_1833730_1834795_+	cd08171, GlyDH-like, Glycerol dehydrogenase-like	NA|207aa|up_8|NZ_AP019697.1_1834926_1835547_-	NA	NA|526aa|up_7|NZ_AP019697.1_1836722_1838300_+	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|328aa|up_6|NZ_AP019697.1_1838455_1839439_-	TIGR00433, biotin_synthase, biotin synthase	NA|334aa|up_5|NZ_AP019697.1_1839508_1840510_-	TIGR00433, biotin_synthase, biotin synthase	NA|333aa|up_4|NZ_AP019697.1_1840631_1841630_-	TIGR00433, biotin_synthase, biotin synthase	NA|142aa|up_3|NZ_AP019697.1_1842096_1842522_-	cd06471, ACD_LpsHSP_like, Group of bacterial proteins containing an alpha crystallin domain (ACD) similar to Lactobacillus plantarum (Lp) small heat shock proteins (sHsp) HSP 18	NA|306aa|up_2|NZ_AP019697.1_1843093_1844011_-	cd01017, AdcA, Metal binding protein AdcA	NA|264aa|up_1|NZ_AP019697.1_1844104_1844896_-	COG1108, ZnuB, ABC-type Mn2+/Zn2+ transport systems, permease components [Inorganic ion transport and metabolism]	NA|221aa|up_0|NZ_AP019697.1_1844888_1845551_-	COG1121, ZnuC, ABC-type Mn/Zn transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|257aa|down_0|NZ_AP019697.1_1847242_1848013_-	COG1691, COG1691, NCAIR mutase (PurE)-related proteins [General function prediction only]	NA|277aa|down_1|NZ_AP019697.1_1848028_1848859_-	cd01990, Alpha_ANH_like_I, This is a subfamily of Adenine nucleotide alpha hydrolases superfamily	NA|89aa|down_2|NZ_AP019697.1_1849225_1849492_-	pfam12669, P12, Virus attachment protein p12 family	NA|280aa|down_3|NZ_AP019697.1_1849771_1850611_+	pfam03331, LpxC, UDP-3-O-acyl N-acetylglycosamine deacetylase	NA|156aa|down_4|NZ_AP019697.1_1850658_1851126_+	PRK00006, fabZ, 3-hydroxyacyl-ACP dehydratase FabZ	NA|271aa|down_5|NZ_AP019697.1_1851265_1852078_+	PRK05289, PRK05289, acyl-ACP--UDP-N-acetylglucosamine O-acyltransferase	NA|271aa|down_6|NZ_AP019697.1_1852335_1853148_+	PRK05289, PRK05289, acyl-ACP--UDP-N-acetylglucosamine O-acyltransferase	NA|292aa|down_7|NZ_AP019697.1_1853321_1854197_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|233aa|down_8|NZ_AP019697.1_1854193_1854892_-	pfam13518, HTH_28, Helix-turn-helix domain	NA|476aa|down_9|NZ_AP019697.1_1855148_1856576_-	cd17321, MFS_MMR_MDR_like, Methylenomycin A resistance protein (also called MMR peptide) and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily
GCF_007164725.1_ASM716472v1	NZ_AP019697	Dialister hominis strain 5BBH33	4	2345430-2345522	4	CRISPRCasFinder	no		csa3,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DEDDh,cas3	Orphan	GGATAGGGTTAATTACCGACAGCCG	25	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DEDDh,cas3	NA|87aa|up_7|NZ_AP019697.1_2335926_2336187_+,NA|184aa|up_6|NZ_AP019697.1_2336526_2337078_-,NA|162aa|up_0|NZ_AP019697.1_2344655_2345141_-,NA|166aa|down_0|NZ_AP019697.1_2345752_2346250_+,NA|57aa|down_4|NZ_AP019697.1_2348757_2348928_-,NA|175aa|down_9|NZ_AP019697.1_2359265_2359790_-	NA|268aa|up_9|NZ_AP019697.1_2333106_2333910_+	cd17767, UP_EcUdp-like, uridine phosphorylases similar to Escherichia coli Udp and related phosphorylases	NA|526aa|up_8|NZ_AP019697.1_2334026_2335604_+	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|87aa|up_7|NZ_AP019697.1_2335926_2336187_+	NA	NA|184aa|up_6|NZ_AP019697.1_2336526_2337078_-	NA	NA|393aa|up_5|NZ_AP019697.1_2337138_2338317_-	pfam06965, Na_H_antiport_1, Na+/H+ antiporter 1	NA|297aa|up_4|NZ_AP019697.1_2338585_2339476_-	smart00342, HTH_ARAC, helix_turn_helix, arabinose operon control protein	NA|460aa|up_3|NZ_AP019697.1_2339733_2341113_+	PRK05291, trmE, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis GTPase MnmE	NA|624aa|up_2|NZ_AP019697.1_2341656_2343528_+	PRK05192, PRK05192, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis enzyme MnmG	NA|237aa|up_1|NZ_AP019697.1_2343576_2344287_+	PRK00107, gidB, 16S rRNA (guanine(527)-N(7))-methyltransferase RsmG	NA|162aa|up_0|NZ_AP019697.1_2344655_2345141_-	NA	NA|166aa|down_0|NZ_AP019697.1_2345752_2346250_+	NA	NA|196aa|down_1|NZ_AP019697.1_2346246_2346834_+	cd06414, GH25_LytC-like, The LytC lysozyme of Streptococcus pneumoniae is a bacterial cell wall hydrolase that cleaves the beta1-4-glycosydic bond located between the N-acetylmuramoyl-N-glucosaminyl residues of the cell wall polysaccharide chains	NA|292aa|down_2|NZ_AP019697.1_2347073_2347949_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|233aa|down_3|NZ_AP019697.1_2347945_2348644_-	pfam13518, HTH_28, Helix-turn-helix domain	NA|57aa|down_4|NZ_AP019697.1_2348757_2348928_-	NA	NA|191aa|down_5|NZ_AP019697.1_2349032_2349605_+	pfam12645, HTH_16, Helix-turn-helix domain	NA|69aa|down_6|NZ_AP019697.1_2349722_2349929_+	pfam11148, DUF2922, Protein of unknown function (DUF2922)	NA|76aa|down_7|NZ_AP019697.1_2349965_2350193_+	pfam07872, DUF1659, Protein of unknown function (DUF1659)	NA|351aa|down_8|NZ_AP019697.1_2350810_2351863_-	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|175aa|down_9|NZ_AP019697.1_2359265_2359790_-	NA
