assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_902381755.1_UHGG_MGYG-HGUT-01490	NZ_LR698966	Bifidobacterium catenulatum isolate MGYG-HGUT-01490 chromosome 1	1	10754-10856	1	CRISPRCasFinder	no		DEDDh,c2c9_V-U4,cas3,csa3,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Orphan	AACAGCCCTGTGGGCTGTTAATGGCAGCGAGCAGCG	36	0	0	NA	NA	NA	1	1	Orphan	DEDDh,c2c9_V-U4,cas3,csa3,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA,NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|492aa|up_5|NZ_LR698966.1_0_1476_+	PRK00149, dnaA, chromosomal replication initiator protein DnaA	NA|375aa|up_4|NZ_LR698966.1_2036_3161_+	PRK07761, PRK07761, DNA polymerase III subunit beta; Validated	NA|401aa|up_3|NZ_LR698966.1_3325_4528_+	COG1195, RecF, Recombinational DNA repair ATPase (RecF pathway) [DNA replication, recombination, and repair]	NA|162aa|up_2|NZ_LR698966.1_4527_5013_+	pfam05258, DUF721, Protein of unknown function (DUF721)	NA|692aa|up_1|NZ_LR698966.1_5326_7402_+	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|895aa|up_0|NZ_LR698966.1_7455_10140_+	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|449aa|down_0|NZ_LR698966.1_11126_12473_-	PRK09414, PRK09414, NADP-specific glutamate dehydrogenase	NA|148aa|down_1|NZ_LR698966.1_12714_13158_+	pfam01741, MscL, Large-conductance mechanosensitive channel, MscL	NA|189aa|down_2|NZ_LR698966.1_13819_14386_+	pfam01863, DUF45, Protein of unknown function DUF45	NA|328aa|down_3|NZ_LR698966.1_14651_15635_+	cd05291, HicDH_like, L-2-hydroxyisocapronate dehydrogenases and some bacterial L-lactate dehydrogenases	NA|444aa|down_4|NZ_LR698966.1_15810_17142_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|232aa|down_5|NZ_LR698966.1_17290_17986_-	cd03378, beta_CA_cladeC, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|919aa|down_6|NZ_LR698966.1_18309_21066_-	pfam00311, PEPcase, Phosphoenolpyruvate carboxylase	NA|610aa|down_7|NZ_LR698966.1_21326_23156_+	pfam06738, ThrE, Putative threonine/serine exporter	NA|543aa|down_8|NZ_LR698966.1_23464_25093_+	cd11475, SLC5sbd_PutP, Na(+)/proline cotransporter PutP and related proteins; solute binding domain	NA|367aa|down_9|NZ_LR698966.1_25215_26316_-	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed
GCF_902381755.1_UHGG_MGYG-HGUT-01490	NZ_LR698966	Bifidobacterium catenulatum isolate MGYG-HGUT-01490 chromosome 1	2	322704-322894	2	CRISPRCasFinder	no		DEDDh,c2c9_V-U4,cas3,csa3,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Orphan	CGTGATTATGATGATCGTCCGCGTCGTCGTCGTTCCGATCGCGA	44	0	0	NA	NA	NA	1	1	Orphan	DEDDh,c2c9_V-U4,cas3,csa3,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA|700aa|up_9|NZ_LR698966.1_299031_301131_+,NA|225aa|up_1|NZ_LR698966.1_319085_319760_+,NA	NA|700aa|up_9|NZ_LR698966.1_299031_301131_+	NA	NA|310aa|up_8|NZ_LR698966.1_301192_302122_+	COG0340, BirA, Biotin-(acetyl-CoA carboxylase) ligase [Coenzyme metabolism]	NA|194aa|up_7|NZ_LR698966.1_302147_302729_-	pfam02632, BioY, BioY family	NA|632aa|up_6|NZ_LR698966.1_302970_304866_+	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|544aa|up_5|NZ_LR698966.1_304916_306548_+	COG4799, COG4799, Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism]	NA|3111aa|up_4|NZ_LR698966.1_306645_315978_+	COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein]	NA|153aa|up_3|NZ_LR698966.1_316298_316757_+	PRK00070, acpS, 4'-phosphopantetheinyl transferase; Provisional	NA|606aa|up_2|NZ_LR698966.1_316954_318772_+	cd11324, AmyAc_Amylosucrase, Alpha amylase catalytic domain found in Amylosucrase	NA|225aa|up_1|NZ_LR698966.1_319085_319760_+	NA	NA|90aa|up_0|NZ_LR698966.1_319892_320162_+	PRK05626, rpsO, 30S ribosomal protein S15; Reviewed	NA|187aa|down_0|NZ_LR698966.1_323254_323815_+	pfam04011, LemA, LemA family	NA|738aa|down_1|NZ_LR698966.1_323912_326126_+	pfam09972, DUF2207, Predicted membrane protein (DUF2207)	NA|288aa|down_2|NZ_LR698966.1_326232_327096_+	cd19088, AKR_AKR13B1, AKR13B family of aldo-keto reductase (AKR)	NA|513aa|down_3|NZ_LR698966.1_327259_328798_+	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|339aa|down_4|NZ_LR698966.1_328881_329898_+	pfam02578, Cu-oxidase_4, Multi-copper polyphenol oxidoreductase laccase	NA|276aa|down_5|NZ_LR698966.1_329985_330813_+	cd02516, CDP-ME_synthetase, CDP-ME synthetase is involved in mevalonate-independent isoprenoid production	NA|230aa|down_6|NZ_LR698966.1_330854_331544_+	cd00501, Peptidase_C15, Pyroglutamyl peptidase (PGP) type I, also known as pyrrolidone carboxyl peptidase (pcp) type I:  Enzymes responsible for cleaving pyroglutamate (pGlu) from the N-terminal end of specialized proteins	NA|493aa|down_7|NZ_LR698966.1_331692_333171_+	PTZ00121, PTZ00121, MAEBL; Provisional	NA|447aa|down_8|NZ_LR698966.1_333163_334504_+	cd00585, Peptidase_C1B, Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC)	NA|359aa|down_9|NZ_LR698966.1_334737_335814_+	PRK09261, PRK09261, phospho-2-dehydro-3-deoxyheptonate aldolase; Validated
GCF_902381755.1_UHGG_MGYG-HGUT-01490	NZ_LR698966	Bifidobacterium catenulatum isolate MGYG-HGUT-01490 chromosome 1	3	1508552-1510594	1,3,1,2	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	DEDDh,c2c9_V-U4,cas3,csa3,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Type I-E	GGGATCATCCCCGCGTATGCGGGGAACAC,GGGATCATCCCCGCGTATGCGGGGAACAC,GGGATCATCCCCGCGTATGCGGGGAACAC,GGATCATCCCCGCGTATGCGGGGAACAC	29,29,29,28	0	0	NA	NA	I-E:I-E:I-E:I-E	29,33,33,29	33	TypeI-E	DEDDh,c2c9_V-U4,cas3,csa3,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA|67aa|up_0|NZ_LR698966.1_1508192_1508393_-,NA	NA|222aa|up_9|NZ_LR698966.1_1497734_1498400_+	cd12922, VKOR_5, Vitamin K epoxide reductase family in bacteria	NA|300aa|up_8|NZ_LR698966.1_1498554_1499454_-	PRK00450, dapF, diaminopimelate epimerase; Provisional	NA|259aa|up_7|NZ_LR698966.1_1499552_1500329_+	PRK00865, PRK00865, glutamate racemase; Provisional	NA|558aa|up_6|NZ_LR698966.1_1500695_1502369_+	cd00995, PBP2_NikA_DppA_OppA_like, The substrate-binding domain of an ABC-type nickel/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|313aa|up_5|NZ_LR698966.1_1502497_1503436_-	pfam01112, Asparaginase_2, Asparaginase	NA|283aa|up_4|NZ_LR698966.1_1503589_1504438_+	cd07208, Pat_hypo_Ecoli_yjju_like, Hypothetical patatin similar to yjju protein of Escherichia coli	NA|319aa|up_3|NZ_LR698966.1_1504511_1505468_+	cd08423, PBP2_LTTR_like_6, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|409aa|up_2|NZ_LR698966.1_1505652_1506879_+	COG1168, MalY, Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities [Amino acid transport and metabolism]	NA|219aa|up_1|NZ_LR698966.1_1507015_1507672_+	COG0400, COG0400, Predicted esterase [General function prediction only]	NA|67aa|up_0|NZ_LR698966.1_1508192_1508393_-	NA	cas2|120aa|down_0|NZ_LR698966.1_1510649_1511009_-	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	cas1|316aa|down_1|NZ_LR698966.1_1511002_1511950_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|234aa|down_2|NZ_LR698966.1_1512039_1512741_-	pfam08798, CRISPR_assoc, CRISPR associated protein	cas5|250aa|down_3|NZ_LR698966.1_1512763_1513513_-	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|372aa|down_4|NZ_LR698966.1_1513531_1514647_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|216aa|down_5|NZ_LR698966.1_1514696_1515344_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|551aa|down_6|NZ_LR698966.1_1515381_1517034_-	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cas3|1040aa|down_7|NZ_LR698966.1_1517254_1520374_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|1104aa|down_8|NZ_LR698966.1_1520659_1523971_-	PRK06039, ileS, isoleucyl-tRNA synthetase; Reviewed	NA|337aa|down_9|NZ_LR698966.1_1524324_1525335_-	cd01544, PBP1_GalR, ligand-binding domain of DNA transcription repressor GalR which is one of two regulatory proteins involved in galactose transport and metabolism
GCF_902381755.1_UHGG_MGYG-HGUT-01490	NZ_LR698966	Bifidobacterium catenulatum isolate MGYG-HGUT-01490 chromosome 1	4	1869171-1869301	4	CRISPRCasFinder	no		DEDDh,c2c9_V-U4,cas3,csa3,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Orphan	GTCCCAAATTCCTCCACTCACTGGA	25	0	0	NA	NA	NA	2	2	Orphan	DEDDh,c2c9_V-U4,cas3,csa3,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA|65aa|up_5|NZ_LR698966.1_1860249_1860444_+,NA|121aa|up_3|NZ_LR698966.1_1863575_1863938_-,NA|789aa|down_0|NZ_LR698966.1_1875507_1877874_-	NA|677aa|up_9|NZ_LR698966.1_1853949_1855980_-	pfam13641, Glyco_tranf_2_3, Glycosyltransferase like family 2	NA|305aa|up_8|NZ_LR698966.1_1856166_1857081_+	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|382aa|up_7|NZ_LR698966.1_1857151_1858297_-	pfam09913, DUF2142, Predicted membrane protein (DUF2142)	NA|331aa|up_6|NZ_LR698966.1_1858625_1859618_-	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|65aa|up_5|NZ_LR698966.1_1860249_1860444_+	NA	NA|605aa|up_4|NZ_LR698966.1_1860465_1862280_-	cd06414, GH25_LytC-like, The LytC lysozyme of Streptococcus pneumoniae is a bacterial cell wall hydrolase that cleaves the beta1-4-glycosydic bond located between the N-acetylmuramoyl-N-glucosaminyl residues of the cell wall polysaccharide chains	NA|121aa|up_3|NZ_LR698966.1_1863575_1863938_-	NA	NA|306aa|up_2|NZ_LR698966.1_1864862_1865780_-	pfam16280, DUF4928, Domain of unknown function (DUF4928)	NA|393aa|up_1|NZ_LR698966.1_1865792_1866971_-	pfam00145, DNA_methylase, C-5 cytosine-specific DNA methylase	NA|254aa|up_0|NZ_LR698966.1_1867068_1867830_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|789aa|down_0|NZ_LR698966.1_1875507_1877874_-	NA	NA|442aa|down_1|NZ_LR698966.1_1878082_1879408_+	pfam02687, FtsX, FtsX-like permease family	NA|373aa|down_2|NZ_LR698966.1_1879404_1880523_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|264aa|down_3|NZ_LR698966.1_1880640_1881432_-	TIGR00295, Uncharacterized_protein_MJ0778, TIGR00295 family protein	NA|295aa|down_4|NZ_LR698966.1_1881557_1882442_-	cd01908, YafJ, Glutamine amidotransferases class-II (Gn-AT)_YafJ-type	NA|726aa|down_5|NZ_LR698966.1_1882642_1884820_-	pfam02705, K_trans, K+ potassium transporter	NA|319aa|down_6|NZ_LR698966.1_1885043_1886000_-	cd01310, TatD_DNAse, TatD like proteins;  E	NA|817aa|down_7|NZ_LR698966.1_1886092_1888543_-	cd14791, GH36, glycosyl hydrolase family 36 (GH36)	NA|719aa|down_8|NZ_LR698966.1_1888863_1891020_-	COG3533, COG3533, Uncharacterized protein conserved in bacteria [Function unknown]	NA|596aa|down_9|NZ_LR698966.1_1891912_1893700_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]
