assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000008945.1_ASM894v1	NC_010816	Bifidobacterium longum DJO10A, complete sequence	1	276473-276553	1	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3,RT,c2c9_V-U4,casR,cas9,cas1,cas2	Orphan	ATATGAGACGGCTTCACTGTGCG	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,RT,c2c9_V-U4,casR,cas9,cas1,cas2	NA|284aa|up_0|NC_010816.1_274720_275572_-,NA|107aa|down_5|NC_010816.1_285875_286196_+	NA|375aa|up_9|NC_010816.1_262533_263658_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|271aa|up_8|NC_010816.1_263995_264808_+	PRK00443, nagB, glucosamine-6-phosphate deaminase; Provisional	NA|428aa|up_7|NC_010816.1_264863_266147_+	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|545aa|up_6|NC_010816.1_266426_268061_+	cd08519, PBP2_NikA_DppA_OppA_like_20, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|364aa|up_5|NC_010816.1_268226_269318_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|390aa|up_4|NC_010816.1_269319_270489_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|570aa|up_3|NC_010816.1_270492_272202_+	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|174aa|up_2|NC_010816.1_272251_272773_-	cd04676, Nudix_Hydrolase_17, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|532aa|up_1|NC_010816.1_272821_274417_-	cd01087, Prolidase, Prolidase	NA|284aa|up_0|NC_010816.1_274720_275572_-	NA	NA|532aa|down_0|NC_010816.1_276826_278422_+	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|1226aa|down_1|NC_010816.1_278482_282160_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|332aa|down_2|NC_010816.1_282278_283274_-	pfam13480, Acetyltransf_6, Acetyltransferase (GNAT) domain	NA|518aa|down_3|NC_010816.1_283395_284949_-	PRK00139, murE, UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase; Provisional	NA|263aa|down_4|NC_010816.1_285087_285876_+	PRK09652, PRK09652, RNA polymerase sigma factor RpoE; Provisional	NA|107aa|down_5|NC_010816.1_285875_286196_+	NA	NA|315aa|down_6|NC_010816.1_286461_287406_-	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|319aa|down_7|NC_010816.1_287530_288487_-	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|353aa|down_8|NC_010816.1_288711_289770_+	PRK01045, ispH, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed	NA|170aa|down_9|NC_010816.1_289809_290319_-	cd00002, YbaK_deacylase, This CD includes cysteinyl-tRNA(Pro) deacylases from Haemophilus influenzae and Escherichia coli and other related bacterial proteins
GCF_000008945.1_ASM894v1	NC_010816	Bifidobacterium longum DJO10A, complete sequence	2	1335011-1335182	1	PILER-CR	no	RT	DEDDh,WYL,cas3,csa3,RT,c2c9_V-U4,casR,cas9,cas1,cas2	Unclear	CCAGACGCTCGTCCATACGCCGCTGATGGCGTTCAGGACG	40	0	0	NA	NA	NA	2	2	Orphan	DEDDh,WYL,cas3,csa3,RT,c2c9_V-U4,casR,cas9,cas1,cas2	NA|108aa|up_8|NC_010816.1_1324458_1324782_-,NA|58aa|up_7|NC_010816.1_1324909_1325083_-,NA|527aa|up_5|NC_010816.1_1326731_1328312_-,NA|241aa|up_4|NC_010816.1_1328328_1329051_-,NA|56aa|up_3|NC_010816.1_1329037_1329205_-,NA|199aa|up_2|NC_010816.1_1329204_1329801_-,NA|261aa|up_0|NC_010816.1_1333549_1334332_-,NA|107aa|down_1|NC_010816.1_1337824_1338145_-,NA|197aa|down_2|NC_010816.1_1338204_1338795_-,NA|122aa|down_3|NC_010816.1_1338825_1339191_-,NA|92aa|down_4|NC_010816.1_1339187_1339463_-,NA|112aa|down_5|NC_010816.1_1339474_1339810_-,NA|105aa|down_7|NC_010816.1_1340270_1340585_-	NA|412aa|up_9|NC_010816.1_1323165_1324401_-	cd06417, GH25_LysA-like, LysA is a cell wall endolysin produced by Lactobacillus fermentum, which degrades bacterial cell walls by catalyzing the hydrolysis of 1,4-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues	NA|108aa|up_8|NC_010816.1_1324458_1324782_-	NA	NA|58aa|up_7|NC_010816.1_1324909_1325083_-	NA	RT|415aa|up_6|NC_010816.1_1325070_1326315_-	cd01646, RT_Bac_retron_I, RT_Bac_retron_I: Reverse transcriptases (RTs) in bacterial retrotransposons or retrons	NA|527aa|up_5|NC_010816.1_1326731_1328312_-	NA	NA|241aa|up_4|NC_010816.1_1328328_1329051_-	NA	NA|56aa|up_3|NC_010816.1_1329037_1329205_-	NA	NA|199aa|up_2|NC_010816.1_1329204_1329801_-	NA	NA|1240aa|up_1|NC_010816.1_1329810_1333530_-	pfam06605, Prophage_tail, Prophage endopeptidase tail	NA|261aa|up_0|NC_010816.1_1333549_1334332_-	NA	NA|131aa|down_0|NC_010816.1_1337420_1337813_-	pfam17318, DUF5361, Family of unknown function (DUF5361)	NA|107aa|down_1|NC_010816.1_1337824_1338145_-	NA	NA|197aa|down_2|NC_010816.1_1338204_1338795_-	NA	NA|122aa|down_3|NC_010816.1_1338825_1339191_-	NA	NA|92aa|down_4|NC_010816.1_1339187_1339463_-	NA	NA|112aa|down_5|NC_010816.1_1339474_1339810_-	NA	NA|151aa|down_6|NC_010816.1_1339806_1340259_-	pfam09355, Phage_Gp19, Phage protein Gp19/Gp15/Gp42	NA|105aa|down_7|NC_010816.1_1340270_1340585_-	NA	NA|302aa|down_8|NC_010816.1_1340584_1341490_-	pfam05065, Phage_capsid, Phage capsid family	NA|178aa|down_9|NC_010816.1_1341535_1342069_-	pfam14265, DUF4355, Domain of unknown function (DUF4355)
GCF_000008945.1_ASM894v1	NC_010816	Bifidobacterium longum DJO10A, complete sequence	3	1488581-1488949	2	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3,RT,c2c9_V-U4,casR,cas9,cas1,cas2	Orphan	TGCGGTTGATTCCACGGGTCACCG	24	0	0	NA	NA	NA	6	6	Orphan	DEDDh,WYL,cas3,csa3,RT,c2c9_V-U4,casR,cas9,cas1,cas2	NA|57aa|up_8|NC_010816.1_1480447_1480618_-,NA|207aa|up_7|NC_010816.1_1480691_1481312_-,NA|99aa|up_2|NC_010816.1_1485913_1486210_-,NA|201aa|down_5|NC_010816.1_1494766_1495369_-	NA|489aa|up_9|NC_010816.1_1478890_1480357_-	cd07383, MPP_Dcr2, Saccharomyces cerevisiae DCR2 phosphatase and related proteins, metallophosphatase domain	NA|57aa|up_8|NC_010816.1_1480447_1480618_-	NA	NA|207aa|up_7|NC_010816.1_1480691_1481312_-	NA	NA|365aa|up_6|NC_010816.1_1481479_1482574_+	PRK08664, PRK08664, aspartate-semialdehyde dehydrogenase; Reviewed	NA|189aa|up_5|NC_010816.1_1482655_1483222_-	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|255aa|up_4|NC_010816.1_1483277_1484042_-	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|428aa|up_3|NC_010816.1_1484402_1485686_-	pfam13635, DUF4143, Domain of unknown function (DUF4143)	NA|99aa|up_2|NC_010816.1_1485913_1486210_-	NA	NA|378aa|up_1|NC_010816.1_1486572_1487706_+	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|201aa|up_0|NC_010816.1_1487702_1488305_-	PRK00076, recR, recombination protein RecR; Reviewed	NA|200aa|down_0|NC_010816.1_1491408_1492008_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|377aa|down_1|NC_010816.1_1492205_1493336_-	PRK11914, PRK11914, diacylglycerol kinase; Reviewed	NA|126aa|down_2|NC_010816.1_1493384_1493762_-	TIGR03816, tadE_like_DECH, helicase/secretion neighborhood TadE-like protein	NA|126aa|down_3|NC_010816.1_1493811_1494189_-	pfam07811, TadE, TadE-like protein	NA|96aa|down_4|NC_010816.1_1494194_1494482_-	pfam14029, DUF4244, Protein of unknown function (DUF4244)	NA|201aa|down_5|NC_010816.1_1494766_1495369_-	NA	NA|219aa|down_6|NC_010816.1_1495365_1496022_-	COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]	NA|356aa|down_7|NC_010816.1_1496021_1497089_-	TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase	NA|317aa|down_8|NC_010816.1_1497095_1498046_-	TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein	NA|242aa|down_9|NC_010816.1_1498268_1498994_-	PRK10847, PRK10847, DedA family protein
GCF_000008945.1_ASM894v1	NC_010816	Bifidobacterium longum DJO10A, complete sequence	4	1923647-1923721	3	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3,RT,c2c9_V-U4,casR,cas9,cas1,cas2	Orphan	TTCTGCTGTCCCGGTTCGTCATTT	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,RT,c2c9_V-U4,casR,cas9,cas1,cas2	NA,NA|59aa|down_0|NC_010816.1_1924563_1924740_-	NA|442aa|up_9|NC_010816.1_1909942_1911268_+	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|418aa|up_8|NC_010816.1_1911316_1912570_-	COG1168, MalY, Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities [Amino acid transport and metabolism]	NA|311aa|up_7|NC_010816.1_1912663_1913596_-	cd08423, PBP2_LTTR_like_6, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|426aa|up_6|NC_010816.1_1913682_1914960_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|222aa|up_5|NC_010816.1_1914956_1915622_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|621aa|up_4|NC_010816.1_1915835_1917698_+	COG0018, ArgS, Arginyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]	NA|531aa|up_3|NC_010816.1_1917700_1919293_+	cd06828, PLPDE_III_DapDC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Diaminopimelate Decarboxylase	NA|439aa|up_2|NC_010816.1_1919453_1920770_+	PRK06349, PRK06349, homoserine dehydrogenase; Provisional	NA|371aa|up_1|NC_010816.1_1920864_1921977_+	PRK01212, PRK01212, homoserine kinase; Provisional	NA|485aa|up_0|NC_010816.1_1922113_1923568_+	PRK00148, PRK00148, Maf-like protein; Reviewed	NA|59aa|down_0|NC_010816.1_1924563_1924740_-	NA	NA|365aa|down_1|NC_010816.1_1925189_1926284_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|483aa|down_2|NC_010816.1_1926280_1927729_+	COG3127, COG3127, Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|315aa|down_3|NC_010816.1_1927904_1928849_-	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|402aa|down_4|NC_010816.1_1928955_1930161_+	cd05647, M20_DapE_actinobac, M20 Peptidase actinobacterial DapE encoded N-succinyl-L,L-diaminopimelic acid desuccinylase	NA|1023aa|down_5|NC_010816.1_1930474_1933543_+	TIGR00757, Ribonuclease_E/G-like_protein, ribonuclease, Rne/Rng family	NA|103aa|down_6|NC_010816.1_1933694_1934003_+	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|83aa|down_7|NC_010816.1_1934025_1934274_+	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|564aa|down_8|NC_010816.1_1934342_1936034_+	PRK12296, obgE, GTPase CgtA; Reviewed	NA|378aa|down_9|NC_010816.1_1936034_1937168_+	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional
GCF_000008945.1_ASM894v1	NC_010816	Bifidobacterium longum DJO10A, complete sequence	5	2080259-2080344	4	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3,RT,c2c9_V-U4,casR,cas9,cas1,cas2	Orphan	CTCCGCGCCCGCACGCTCAGGGC	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,RT,c2c9_V-U4,casR,cas9,cas1,cas2	NA|243aa|up_1|NC_010816.1_2077450_2078179_-,NA|211aa|down_4|NC_010816.1_2087342_2087975_+,NA|30aa|down_5|NC_010816.1_2088166_2088256_+	NA|133aa|up_9|NC_010816.1_2066874_2067273_+	PRK05309, PRK05309, 30S ribosomal protein S11; Validated	NA|332aa|up_8|NC_010816.1_2067353_2068349_+	PRK05182, PRK05182, DNA-directed RNA polymerase subunit alpha; Provisional	NA|184aa|up_7|NC_010816.1_2068448_2069000_+	PRK05591, rplQ, 50S ribosomal protein L17; Validated	NA|304aa|up_6|NC_010816.1_2069081_2069993_-	cd02570, PseudoU_synth_EcTruA, Eukaryotic and bacterial pseudouridine synthases similar to E	NA|727aa|up_5|NC_010816.1_2070268_2072449_+	pfam14403, CP_ATPgrasp_2, Circularly permuted ATP-grasp type 2	NA|267aa|up_4|NC_010816.1_2072655_2073456_+	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|523aa|up_3|NC_010816.1_2074265_2075834_+	COG3534, AbfA, Alpha-L-arabinofuranosidase [Carbohydrate transport and metabolism]	NA|348aa|up_2|NC_010816.1_2076366_2077410_+	COG1609, PurR, Transcriptional regulators [Transcription]	NA|243aa|up_1|NC_010816.1_2077450_2078179_-	NA	NA|356aa|up_0|NC_010816.1_2078386_2079454_+	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|158aa|down_0|NC_010816.1_2082801_2083275_+	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|388aa|down_1|NC_010816.1_2083276_2084440_+	PRK03287, truB, tRNA pseudouridine synthase B; Provisional	NA|375aa|down_2|NC_010816.1_2084537_2085662_+	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|513aa|down_3|NC_010816.1_2085676_2087215_-	PRK11823, PRK11823, DNA repair protein RadA; Provisional	NA|211aa|down_4|NC_010816.1_2087342_2087975_+	NA	NA|30aa|down_5|NC_010816.1_2088166_2088256_+	NA	NA|233aa|down_6|NC_010816.1_2088408_2089107_-	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|297aa|down_7|NC_010816.1_2089237_2090128_-	cd09278, RNase_HI_prokaryote_like, RNase HI family found mainly in prokaryotes	NA|539aa|down_8|NC_010816.1_2090521_2092138_-	PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed	NA|141aa|down_9|NC_010816.1_2092271_2092694_-	pfam02082, Rrf2, Transcriptional regulator
GCF_000008945.1_ASM894v1	NC_010816	Bifidobacterium longum DJO10A, complete sequence	6	2262845-2265578	5,1,2,3,4,5	CRISPRCasFinder,CRT,PILER-CR,PILER-CR,PILER-CR,PILER-CR	no	cas9,cas1,cas2,WYL	DEDDh,WYL,cas3,csa3,RT,c2c9_V-U4,casR,cas9,cas1,cas2	Type II-B,Type II-A,,Type II-C	CAAGCTTATCAAGAAGGGTGAATGCTAATTCCCAGC,CAAGCTTATCAAGAAGGGTGAATGCTAATTCCCAGC,CAAGCTTATCAAGAAGGGTGAATGCTAATTCCCAGC,CAAGCTTATCAAGAAGGGTGAATGCTAATTCCCAGC,CAAGCTTATCAAGAAGGGTGAATGCTAATTCCCAGC,CAAGCTTATCAAGAAGGGTGAATGCTAATTCCCAGC	36,36,36,36,36,36	1	1	2263330-2263357	NC_010816.1_602075-602102	NA:NA:NA:NA:NA:NA	42,42,35,35,35,35	42	TypeII-B,,TypeII-A,TypeII-C	DEDDh,WYL,cas3,csa3,RT,c2c9_V-U4,casR,cas9,cas1,cas2	NA,NA|97aa|down_0|NC_010816.1_2265598_2265889_+	NA|235aa|up_9|NC_010816.1_2250372_2251077_+	COG0410, LivF, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]	NA|226aa|up_8|NC_010816.1_2251208_2251886_-	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|225aa|up_7|NC_010816.1_2252060_2252735_+	COG0009, SUA5, Putative translation factor (SUA5) [Translation, ribosomal structure and biogenesis]	NA|428aa|up_6|NC_010816.1_2252731_2254015_+	cd06853, GT_WecA_like, This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins	NA|518aa|up_5|NC_010816.1_2254067_2255621_+	pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain	NA|217aa|up_4|NC_010816.1_2255788_2256439_+	PRK05359, PRK05359, oligoribonuclease; Provisional	NA|473aa|up_3|NC_010816.1_2256487_2257906_+	cd18037, DEXSc_Pif1_like, DEAD-box helicase domain of Pif1	cas9|1139aa|up_2|NC_010816.1_2258139_2261556_+	pfam18470, Cas9_a, Cas9 alpha-helical lobe domain	cas1|302aa|up_1|NC_010816.1_2261559_2262465_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas2|111aa|up_0|NC_010816.1_2262461_2262794_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|97aa|down_0|NC_010816.1_2265598_2265889_+	NA	NA|605aa|down_1|NC_010816.1_2265969_2267784_+	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|246aa|down_2|NC_010816.1_2268059_2268797_+	cd04496, SSB_OBF, SSB_OBF: A subfamily of OB folds similar to the OB fold of ssDNA-binding protein (SSB)	NA|725aa|down_3|NC_010816.1_2268879_2271054_-	COG3590, PepO, Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]	NA|327aa|down_4|NC_010816.1_2271230_2272211_+	COG3247, HdeD, Uncharacterized conserved protein [Function unknown]	NA|261aa|down_5|NC_010816.1_2272356_2273139_+	cd01086, MetAP1, Methionine Aminopeptidase 1	NA|431aa|down_6|NC_010816.1_2273412_2274705_+	cd06114, EcCS_like, Escherichia coli (Ec) citrate synthase (CS) GltA_like	NA|340aa|down_7|NC_010816.1_2274918_2275938_-	TIGR03535, DapD_actino, 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase	WYL|362aa|down_8|NC_010816.1_2276171_2277257_-	pfam13280, WYL, WYL domain	NA|1395aa|down_9|NC_010816.1_2277334_2281519_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]
