assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000196575.1_ASM19657v1	NC_015052	Bifidobacterium longum subsp. infantis 157F, complete genome	1	164530-164898	1	CRISPRCasFinder	no		cas3,RT,c2c9_V-U4,WYL,DEDDh,casR	Orphan	CGATGACCCGTGGAATCAACCGCA	24	0	0	NA	NA	NA	6	6	Orphan	cas3,RT,c2c9_V-U4,WYL,DEDDh,casR	NA|201aa|up_5|NC_015052.1_158078_158681_+,NA|207aa|down_6|NC_015052.1_172169_172790_+	NA|242aa|up_9|NC_015052.1_154453_155179_+	PRK10847, PRK10847, DedA family protein	NA|317aa|up_8|NC_015052.1_155401_156352_+	TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein	NA|356aa|up_7|NC_015052.1_156358_157426_+	TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase	NA|219aa|up_6|NC_015052.1_157425_158082_+	COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]	NA|201aa|up_5|NC_015052.1_158078_158681_+	NA	NA|96aa|up_4|NC_015052.1_158965_159253_+	pfam14029, DUF4244, Protein of unknown function (DUF4244)	NA|126aa|up_3|NC_015052.1_159258_159636_+	pfam07811, TadE, TadE-like protein	NA|126aa|up_2|NC_015052.1_159686_160064_+	TIGR03816, tadE_like_DECH, helicase/secretion neighborhood TadE-like protein	NA|377aa|up_1|NC_015052.1_160112_161243_+	PRK11914, PRK11914, diacylglycerol kinase; Reviewed	NA|200aa|up_0|NC_015052.1_161440_162040_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|201aa|down_0|NC_015052.1_165173_165776_+	PRK00076, recR, recombination protein RecR; Reviewed	NA|378aa|down_1|NC_015052.1_165772_166906_-	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|428aa|down_2|NC_015052.1_167795_169079_+	pfam13635, DUF4143, Domain of unknown function (DUF4143)	NA|255aa|down_3|NC_015052.1_169439_170204_+	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|189aa|down_4|NC_015052.1_170259_170826_+	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|365aa|down_5|NC_015052.1_170907_172002_-	PRK08664, PRK08664, aspartate-semialdehyde dehydrogenase; Reviewed	NA|207aa|down_6|NC_015052.1_172169_172790_+	NA	NA|527aa|down_7|NC_015052.1_172792_174373_-	cd07383, MPP_Dcr2, Saccharomyces cerevisiae DCR2 phosphatase and related proteins, metallophosphatase domain	NA|639aa|down_8|NC_015052.1_175061_176978_+	PRK03739, PRK03739, 2-isopropylmalate synthase; Validated	NA|728aa|down_9|NC_015052.1_177049_179233_-	COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding protein) [Cell envelope biogenesis, outer membrane]
GCF_000196575.1_ASM19657v1	NC_015052	Bifidobacterium longum subsp. infantis 157F, complete genome	2	284506-284591	2	CRISPRCasFinder	no		cas3,RT,c2c9_V-U4,WYL,DEDDh,casR	Orphan	GACAGCTCCCGCCAGCGGGAGCACTT	26	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,c2c9_V-U4,WYL,DEDDh,casR	NA|210aa|up_5|NC_015052.1_277483_278113_+,NA|323aa|up_3|NC_015052.1_279489_280458_-,NA|695aa|down_6|NC_015052.1_293047_295132_+	NA|245aa|up_9|NC_015052.1_272101_272836_+	COG0580, GlpF, Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) [Carbohydrate transport and metabolism]	NA|903aa|up_8|NC_015052.1_272919_275628_-	cd02079, P-type_ATPase_HM, P-type heavy metal-transporting ATPase	NA|94aa|up_7|NC_015052.1_275738_276020_+	COG1937, COG1937, Uncharacterized protein conserved in bacteria [Function unknown]	NA|475aa|up_6|NC_015052.1_276062_277487_+	pfam02646, RmuC, RmuC family	NA|210aa|up_5|NC_015052.1_277483_278113_+	NA	NA|283aa|up_4|NC_015052.1_278206_279055_+	cd18096, SpoU-like, SAM-dependent rRNA or tRNA methylase related to SpoU	NA|323aa|up_3|NC_015052.1_279489_280458_-	NA	NA|100aa|up_2|NC_015052.1_281040_281340_+	PRK00034, gatC, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC	NA|514aa|up_1|NC_015052.1_281343_282885_+	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|500aa|up_0|NC_015052.1_282910_284410_+	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|355aa|down_0|NC_015052.1_284705_285770_+	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|105aa|down_1|NC_015052.1_285780_286095_+	pfam10611, DUF2469, Protein of unknown function (DUF2469)	NA|618aa|down_2|NC_015052.1_286268_288122_+	cd07061, HP_HAP_like, Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction	NA|538aa|down_3|NC_015052.1_288340_289954_+	PRK07208, PRK07208, hypothetical protein; Provisional	NA|690aa|down_4|NC_015052.1_290159_292229_+	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|130aa|down_5|NC_015052.1_292536_292926_+	PRK09239, PRK09239, chorismate mutase; Provisional	NA|695aa|down_6|NC_015052.1_293047_295132_+	NA	NA|912aa|down_7|NC_015052.1_295357_298093_-	NF000540, alt_ValS, valine--tRNA ligase	NA|509aa|down_8|NC_015052.1_298236_299763_-	cd08494, PBP2_NikA_DppA_OppA_like_6, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|229aa|down_9|NC_015052.1_299824_300511_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]
GCF_000196575.1_ASM19657v1	NC_015052	Bifidobacterium longum subsp. infantis 157F, complete genome	3	1492972-1493052	3	CRISPRCasFinder	no		cas3,RT,c2c9_V-U4,WYL,DEDDh,casR	Orphan	CGCACAGTGAAACCGTCTCATAT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,c2c9_V-U4,WYL,DEDDh,casR	NA|107aa|up_5|NC_015052.1_1483313_1483634_-,NA|284aa|down_0|NC_015052.1_1493952_1494804_+	NA|170aa|up_9|NC_015052.1_1479190_1479700_+	cd00002, YbaK_deacylase, This CD includes cysteinyl-tRNA(Pro) deacylases from Haemophilus influenzae and Escherichia coli and other related bacterial proteins	NA|353aa|up_8|NC_015052.1_1479739_1480798_-	PRK01045, ispH, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed	NA|319aa|up_7|NC_015052.1_1481022_1481979_+	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|315aa|up_6|NC_015052.1_1482103_1483048_+	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|107aa|up_5|NC_015052.1_1483313_1483634_-	NA	NA|263aa|up_4|NC_015052.1_1483633_1484422_-	PRK09652, PRK09652, RNA polymerase sigma factor RpoE; Provisional	NA|518aa|up_3|NC_015052.1_1484560_1486114_+	PRK00139, murE, UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase; Provisional	NA|332aa|up_2|NC_015052.1_1486235_1487231_+	pfam13480, Acetyltransf_6, Acetyltransferase (GNAT) domain	NA|1222aa|up_1|NC_015052.1_1487349_1491015_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|541aa|up_0|NC_015052.1_1491075_1492698_-	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|284aa|down_0|NC_015052.1_1493952_1494804_+	NA	NA|532aa|down_1|NC_015052.1_1495107_1496703_+	cd01087, Prolidase, Prolidase	NA|174aa|down_2|NC_015052.1_1496751_1497273_+	cd04676, Nudix_Hydrolase_17, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|570aa|down_3|NC_015052.1_1497322_1499032_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|390aa|down_4|NC_015052.1_1499035_1500205_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|364aa|down_5|NC_015052.1_1500206_1501298_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|545aa|down_6|NC_015052.1_1501462_1503097_-	cd08519, PBP2_NikA_DppA_OppA_like_20, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|428aa|down_7|NC_015052.1_1503376_1504660_-	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|271aa|down_8|NC_015052.1_1504715_1505528_-	PRK00443, nagB, glucosamine-6-phosphate deaminase; Provisional	NA|375aa|down_9|NC_015052.1_1505865_1506990_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]
GCF_000196575.1_ASM19657v1	NC_015052	Bifidobacterium longum subsp. infantis 157F, complete genome	4	2006736-2006821	4	CRISPRCasFinder	no		cas3,RT,c2c9_V-U4,WYL,DEDDh,casR	Orphan	GGCCCTGAGCGTGCGGGCGCGGA	23	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,c2c9_V-U4,WYL,DEDDh,casR	NA|99aa|up_8|NC_015052.1_1996478_1996775_-,NA|30aa|up_5|NC_015052.1_1998824_1998914_-,NA|211aa|up_4|NC_015052.1_1999105_1999738_-,NA|243aa|down_1|NC_015052.1_2008901_2009630_+	NA|539aa|up_9|NC_015052.1_1994703_1996320_+	PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed	NA|99aa|up_8|NC_015052.1_1996478_1996775_-	NA	NA|297aa|up_7|NC_015052.1_1996952_1997843_+	cd09278, RNase_HI_prokaryote_like, RNase HI family found mainly in prokaryotes	NA|233aa|up_6|NC_015052.1_1997973_1998672_+	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|30aa|up_5|NC_015052.1_1998824_1998914_-	NA	NA|211aa|up_4|NC_015052.1_1999105_1999738_-	NA	NA|513aa|up_3|NC_015052.1_1999865_2001404_+	PRK11823, PRK11823, DNA repair protein RadA; Provisional	NA|375aa|up_2|NC_015052.1_2001418_2002543_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|388aa|up_1|NC_015052.1_2002640_2003804_-	PRK03287, truB, tRNA pseudouridine synthase B; Provisional	NA|158aa|up_0|NC_015052.1_2003805_2004279_-	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|356aa|down_0|NC_015052.1_2007626_2008694_-	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|243aa|down_1|NC_015052.1_2008901_2009630_+	NA	NA|348aa|down_2|NC_015052.1_2009670_2010714_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|523aa|down_3|NC_015052.1_2011246_2012815_-	COG3534, AbfA, Alpha-L-arabinofuranosidase [Carbohydrate transport and metabolism]	NA|267aa|down_4|NC_015052.1_2013624_2014425_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|727aa|down_5|NC_015052.1_2014631_2016812_-	pfam14403, CP_ATPgrasp_2, Circularly permuted ATP-grasp type 2	NA|304aa|down_6|NC_015052.1_2017033_2017945_+	cd02570, PseudoU_synth_EcTruA, Eukaryotic and bacterial pseudouridine synthases similar to E	NA|181aa|down_7|NC_015052.1_2018026_2018569_-	PRK05591, rplQ, 50S ribosomal protein L17; Validated	NA|332aa|down_8|NC_015052.1_2018668_2019664_-	PRK05182, PRK05182, DNA-directed RNA polymerase subunit alpha; Provisional	NA|133aa|down_9|NC_015052.1_2019744_2020143_-	PRK05309, PRK05309, 30S ribosomal protein S11; Validated
GCF_000196575.1_ASM19657v1	NC_015052	Bifidobacterium longum subsp. infantis 157F, complete genome	5	2144667-2144740	5	CRISPRCasFinder	no		cas3,RT,c2c9_V-U4,WYL,DEDDh,casR	Orphan	AAATGACGAACCGGGACAGCGAA	23	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,c2c9_V-U4,WYL,DEDDh,casR	NA|59aa|up_0|NC_015052.1_2143647_2143824_+,NA	NA|378aa|up_9|NC_015052.1_2131233_2132367_-	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|564aa|up_8|NC_015052.1_2132367_2134059_-	PRK12296, obgE, GTPase CgtA; Reviewed	NA|83aa|up_7|NC_015052.1_2134127_2134376_-	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|103aa|up_6|NC_015052.1_2134398_2134707_-	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|1023aa|up_5|NC_015052.1_2134846_2137915_-	TIGR00757, Ribonuclease_E/G-like_protein, ribonuclease, Rne/Rng family	NA|402aa|up_4|NC_015052.1_2138227_2139433_-	cd05647, M20_DapE_actinobac, M20 Peptidase actinobacterial DapE encoded N-succinyl-L,L-diaminopimelic acid desuccinylase	NA|315aa|up_3|NC_015052.1_2139539_2140484_+	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|483aa|up_2|NC_015052.1_2140659_2142108_-	COG3127, COG3127, Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|365aa|up_1|NC_015052.1_2142104_2143199_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|59aa|up_0|NC_015052.1_2143647_2143824_+	NA	NA|485aa|down_0|NC_015052.1_2144819_2146274_-	PRK00148, PRK00148, Maf-like protein; Reviewed	NA|371aa|down_1|NC_015052.1_2146410_2147523_-	PRK01212, PRK01212, homoserine kinase; Provisional	NA|439aa|down_2|NC_015052.1_2147617_2148934_-	PRK06349, PRK06349, homoserine dehydrogenase; Provisional	NA|531aa|down_3|NC_015052.1_2149094_2150687_-	cd06828, PLPDE_III_DapDC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Diaminopimelate Decarboxylase	NA|621aa|down_4|NC_015052.1_2150689_2152552_-	COG0018, ArgS, Arginyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]	NA|222aa|down_5|NC_015052.1_2152765_2153431_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|426aa|down_6|NC_015052.1_2153427_2154705_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|311aa|down_7|NC_015052.1_2154791_2155724_+	cd08423, PBP2_LTTR_like_6, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|418aa|down_8|NC_015052.1_2155817_2157071_+	COG1168, MalY, Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities [Amino acid transport and metabolism]	NA|442aa|down_9|NC_015052.1_2157116_2158442_-	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated
