assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002076915.1_ASM207691v1	NZ_CP020468	Actinomyces gaoshouyii strain pika_114 chromosome, complete genome	1	33531-33638	1	CRISPRCasFinder	no		cas3,csa3,DinG,WYL,cas4,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	Orphan	GGTGGGTGCGCTGGCCGTCGTCGC	24	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DinG,WYL,cas4,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	NA|114aa|up_6|NZ_CP020468.1_25241_25583_-,NA|71aa|down_0|NZ_CP020468.1_33861_34074_+,NA|291aa|down_7|NZ_CP020468.1_42226_43099_+	NA|485aa|up_9|NZ_CP020468.1_22133_23588_-	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|166aa|up_8|NZ_CP020468.1_23584_24082_-	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|241aa|up_7|NZ_CP020468.1_24078_24801_-	pfam12401, DUF3662, Protein of unknown function (DUF2662)	NA|114aa|up_6|NZ_CP020468.1_25241_25583_-	NA	NA|68aa|up_5|NZ_CP020468.1_25990_26194_-	pfam04328, Sel_put, Selenoprotein, putative	NA|798aa|up_4|NZ_CP020468.1_26222_28616_-	PRK15015, PRK15015, carbon starvation protein CstA	NA|313aa|up_3|NZ_CP020468.1_28739_29678_-	cd08420, PBP2_CysL_like, C-terminal substrate binding domain of LysR-type transcriptional regulator CysL, which activates the transcription of the cysJI operon encoding sulfite reductase, contains the type 2 periplasmic binding fold	NA|363aa|up_2|NZ_CP020468.1_29739_30828_+	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|485aa|up_1|NZ_CP020468.1_30881_32336_-	PRK09285, PRK09285, adenylosuccinate lyase; Provisional	NA|137aa|up_0|NZ_CP020468.1_32392_32803_-	pfam04020, Phage_holin_4_2, Mycobacterial 4 TMS phage holin, superfamily IV	NA|71aa|down_0|NZ_CP020468.1_33861_34074_+	NA	NA|671aa|down_1|NZ_CP020468.1_34070_36083_+	PHA03378, PHA03378, EBNA-3B; Provisional	NA|379aa|down_2|NZ_CP020468.1_36191_37328_+	PRK03321, PRK03321, putative aminotransferase; Provisional	NA|311aa|down_3|NZ_CP020468.1_37474_38407_-	pfam14256, YwiC, YwiC-like protein	NA|560aa|down_4|NZ_CP020468.1_38551_40231_-	PRK07564, PRK07564, phosphoglucomutase; Validated	NA|132aa|down_5|NZ_CP020468.1_40509_40905_+	TIGR00004, RutC_family_protein, reactive intermediate/imine deaminase	NA|378aa|down_6|NZ_CP020468.1_40975_42109_+	pfam17173, DUF5129, Domain of unknown function (DUF5129)	NA|291aa|down_7|NZ_CP020468.1_42226_43099_+	NA	NA|392aa|down_8|NZ_CP020468.1_43234_44410_-	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|591aa|down_9|NZ_CP020468.1_44525_46298_-	COG1178, ThiP, ABC-type Fe3+ transport system, permease component [Inorganic ion transport and metabolism]
GCF_002076915.1_ASM207691v1	NZ_CP020468	Actinomyces gaoshouyii strain pika_114 chromosome, complete genome	2	351384-351472	2	CRISPRCasFinder	no		cas3,csa3,DinG,WYL,cas4,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	Orphan	AGGTAGTACCAGGAGGAGCCGTCCT	25	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DinG,WYL,cas4,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	NA|74aa|up_7|NZ_CP020468.1_341860_342082_-,NA|231aa|down_0|NZ_CP020468.1_353356_354049_+,NA|187aa|down_2|NZ_CP020468.1_355498_356059_+	NA|474aa|up_9|NZ_CP020468.1_339393_340815_-	TIGR00237, exodeoxyribonuclease_VII_large_subunit, exodeoxyribonuclease VII, large subunit	NA|341aa|up_8|NZ_CP020468.1_340789_341812_+	PRK01045, ispH, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed	NA|74aa|up_7|NZ_CP020468.1_341860_342082_-	NA	NA|624aa|up_6|NZ_CP020468.1_342135_344007_+	pfam00756, Esterase, Putative esterase	NA|367aa|up_5|NZ_CP020468.1_344140_345241_+	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|300aa|up_4|NZ_CP020468.1_345522_346422_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|324aa|up_3|NZ_CP020468.1_346922_347894_+	TIGR01921, Meso-diaminopimelate_D-dehydrogenase, diaminopimelate dehydrogenase	NA|204aa|up_2|NZ_CP020468.1_347884_348496_-	pfam13424, TPR_12, Tetratricopeptide repeat	NA|391aa|up_1|NZ_CP020468.1_348674_349847_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|391aa|up_0|NZ_CP020468.1_350019_351192_-	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|231aa|down_0|NZ_CP020468.1_353356_354049_+	NA	NA|378aa|down_1|NZ_CP020468.1_354083_355217_-	cd07325, M48_Ste24p_like, M48 Ste24 endopeptidase-like, integral membrane metallopeptidase	NA|187aa|down_2|NZ_CP020468.1_355498_356059_+	NA	NA|241aa|down_3|NZ_CP020468.1_356124_356847_+	pfam01981, PTH2, Peptidyl-tRNA hydrolase PTH2	NA|200aa|down_4|NZ_CP020468.1_356893_357493_-	pfam04138, GtrA, GtrA-like protein	NA|457aa|down_5|NZ_CP020468.1_357775_359146_+	PRK02813, PRK02813, putative aminopeptidase 2; Provisional	NA|139aa|down_6|NZ_CP020468.1_359142_359559_+	COG3012, COG3012, Uncharacterized protein conserved in bacteria [Function unknown]	NA|391aa|down_7|NZ_CP020468.1_359668_360841_-	cd02932, OYE_YqiM_FMN, Old yellow enzyme (OYE) YqjM-like FMN binding domain	NA|315aa|down_8|NZ_CP020468.1_361037_361982_+	cd07208, Pat_hypo_Ecoli_yjju_like, Hypothetical patatin similar to yjju protein of Escherichia coli	NA|188aa|down_9|NZ_CP020468.1_362020_362584_-	cd04683, Nudix_Hydrolase_24, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X
GCF_002076915.1_ASM207691v1	NZ_CP020468	Actinomyces gaoshouyii strain pika_114 chromosome, complete genome	3	351636-351720	3	CRISPRCasFinder	no		cas3,csa3,DinG,WYL,cas4,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	Orphan	AGGTAGTACCAGGAGGAGCCGTCCT	25	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DinG,WYL,cas4,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	NA|74aa|up_7|NZ_CP020468.1_341860_342082_-,NA|231aa|down_0|NZ_CP020468.1_353356_354049_+,NA|187aa|down_2|NZ_CP020468.1_355498_356059_+	NA|474aa|up_9|NZ_CP020468.1_339393_340815_-	TIGR00237, exodeoxyribonuclease_VII_large_subunit, exodeoxyribonuclease VII, large subunit	NA|341aa|up_8|NZ_CP020468.1_340789_341812_+	PRK01045, ispH, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed	NA|74aa|up_7|NZ_CP020468.1_341860_342082_-	NA	NA|624aa|up_6|NZ_CP020468.1_342135_344007_+	pfam00756, Esterase, Putative esterase	NA|367aa|up_5|NZ_CP020468.1_344140_345241_+	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|300aa|up_4|NZ_CP020468.1_345522_346422_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|324aa|up_3|NZ_CP020468.1_346922_347894_+	TIGR01921, Meso-diaminopimelate_D-dehydrogenase, diaminopimelate dehydrogenase	NA|204aa|up_2|NZ_CP020468.1_347884_348496_-	pfam13424, TPR_12, Tetratricopeptide repeat	NA|391aa|up_1|NZ_CP020468.1_348674_349847_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|391aa|up_0|NZ_CP020468.1_350019_351192_-	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|231aa|down_0|NZ_CP020468.1_353356_354049_+	NA	NA|378aa|down_1|NZ_CP020468.1_354083_355217_-	cd07325, M48_Ste24p_like, M48 Ste24 endopeptidase-like, integral membrane metallopeptidase	NA|187aa|down_2|NZ_CP020468.1_355498_356059_+	NA	NA|241aa|down_3|NZ_CP020468.1_356124_356847_+	pfam01981, PTH2, Peptidyl-tRNA hydrolase PTH2	NA|200aa|down_4|NZ_CP020468.1_356893_357493_-	pfam04138, GtrA, GtrA-like protein	NA|457aa|down_5|NZ_CP020468.1_357775_359146_+	PRK02813, PRK02813, putative aminopeptidase 2; Provisional	NA|139aa|down_6|NZ_CP020468.1_359142_359559_+	COG3012, COG3012, Uncharacterized protein conserved in bacteria [Function unknown]	NA|391aa|down_7|NZ_CP020468.1_359668_360841_-	cd02932, OYE_YqiM_FMN, Old yellow enzyme (OYE) YqjM-like FMN binding domain	NA|315aa|down_8|NZ_CP020468.1_361037_361982_+	cd07208, Pat_hypo_Ecoli_yjju_like, Hypothetical patatin similar to yjju protein of Escherichia coli	NA|188aa|down_9|NZ_CP020468.1_362020_362584_-	cd04683, Nudix_Hydrolase_24, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X
GCF_002076915.1_ASM207691v1	NZ_CP020468	Actinomyces gaoshouyii strain pika_114 chromosome, complete genome	4	1526035-1529725	1,4,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	cas3,csa3,DinG,WYL,cas4,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	Type I-E	GTGCACCCCGCGTATGCGGGGATGATCC,GTGCACCCCGCGTATGCGGGGATGATCC,GTGCACCCCGCGTATGCGGGGATGATCCN	28,28,29	0	0	NA	NA	NA:NA:NA	59,60,60	60	TypeI-E	cas3,csa3,DinG,WYL,cas4,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	NA,NA|46aa|down_1|NZ_CP020468.1_1532320_1532458_-	NA|384aa|up_9|NZ_CP020468.1_1513680_1514832_-	PRK00241, nudC, NAD(+) diphosphatase	NA|433aa|up_8|NZ_CP020468.1_1515057_1516356_+	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	cas3|986aa|up_7|NZ_CP020468.1_1516806_1519764_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|553aa|up_6|NZ_CP020468.1_1519760_1521419_+	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cse2gr11|234aa|up_5|NZ_CP020468.1_1521415_1522117_+	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas7|377aa|up_4|NZ_CP020468.1_1522118_1523249_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|238aa|up_3|NZ_CP020468.1_1523248_1523962_+	cd09756, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|230aa|up_2|NZ_CP020468.1_1523952_1524642_+	pfam08798, CRISPR_assoc, CRISPR associated protein	cas1|316aa|up_1|NZ_CP020468.1_1524644_1525592_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|131aa|up_0|NZ_CP020468.1_1525585_1525978_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|386aa|down_0|NZ_CP020468.1_1530725_1531883_+	PRK00143, mnmA, tRNA-specific 2-thiouridylase MnmA; Reviewed	NA|46aa|down_1|NZ_CP020468.1_1532320_1532458_-	NA	NA|329aa|down_2|NZ_CP020468.1_1532806_1533793_-	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|254aa|down_3|NZ_CP020468.1_1533886_1534648_-	cd01714, ETF_beta, The electron transfer flavoprotein (ETF) serves as a specific electron acceptor for various mitochondrial dehydrogenases	NA|779aa|down_4|NZ_CP020468.1_1534768_1537105_+	TIGR02100, Glycogen_operon_protein_GlgX_homolog, glycogen debranching enzyme GlgX	NA|382aa|down_5|NZ_CP020468.1_1537312_1538458_+	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	NA|685aa|down_6|NZ_CP020468.1_1538565_1540620_+	cd11344, AmyAc_GlgE_like, Alpha amylase catalytic domain found in GlgE-like proteins	NA|656aa|down_7|NZ_CP020468.1_1540616_1542584_+	TIGR02456, Trehalose_synthase, trehalose synthase	NA|521aa|down_8|NZ_CP020468.1_1542580_1544143_+	PRK14705, PRK14705, glycogen branching enzyme; Provisional	NA|736aa|down_9|NZ_CP020468.1_1544278_1546486_+	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB
