assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000020425.1_ASM2042v1	NC_011593	Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 = DSM 20088, complete sequence	1	932425-932581	1	CRISPRCasFinder	no	WYL	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	Unclear	GAGCGTACCCGCTCCGAGGCCGACG	25	0	0	NA	NA	NA	2	2	Orphan	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	NA,NA|433aa|down_8|NC_011593.1_947736_949035_+	NA|126aa|up_9|NC_011593.1_919473_919851_-	pfam01910, Thiamine_BP, Thiamine-binding protein	NA|270aa|up_8|NC_011593.1_919914_920724_-	pfam08543, Phos_pyr_kin, Phosphomethylpyrimidine kinase	NA|403aa|up_7|NC_011593.1_920850_922059_+	COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit [Posttranslational modification, protein turnover, chaperones]	NA|918aa|up_6|NC_011593.1_922095_924849_-	PRK09284, PRK09284, thiamine biosynthesis protein ThiC; Provisional	NA|315aa|up_5|NC_011593.1_924931_925876_-	cd01170, THZ_kinase, 4-methyl-5-beta-hydroxyethylthiazole (Thz) kinase catalyzes the phosphorylation of the hydroxylgroup of Thz	NA|491aa|up_4|NC_011593.1_926311_927784_+	PRK04173, PRK04173, glycyl-tRNA synthetase; Provisional	NA|415aa|up_3|NC_011593.1_927928_929173_+	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|404aa|up_2|NC_011593.1_929284_930496_+	PRK09330, PRK09330, cell division protein FtsZ; Validated	NA|160aa|up_1|NC_011593.1_930508_930988_+	pfam04472, SepF, Cell division protein SepF	NA|101aa|up_0|NC_011593.1_931109_931412_+	pfam02325, YGGT, YGGT family	NA|183aa|down_0|NC_011593.1_932952_933501_+	PRK14771, PRK14771, lipoprotein signal peptidase; Provisional	NA|321aa|down_1|NC_011593.1_933500_934463_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|267aa|down_2|NC_011593.1_934984_935785_+	pfam03631, Virul_fac_BrkB, Virulence factor BrkB	WYL|349aa|down_3|NC_011593.1_935812_936859_-	pfam13280, WYL, WYL domain	NA|879aa|down_4|NC_011593.1_936942_939579_+	PRK05673, dnaE, DNA polymerase III subunit alpha; Validated	NA|1099aa|down_5|NC_011593.1_939716_943013_+	TIGR02773, ATP-dependent_helicase/deoxyribonuclease_subunit_B, helicase-exonuclease AddAB, AddB subunit	NA|1372aa|down_6|NC_011593.1_943006_947122_+	TIGR02785, ATP-dependent_helicase/nuclease_subunit_A, helicase-exonuclease AddAB, AddA subunit, Firmicutes type	NA|143aa|down_7|NC_011593.1_947266_947695_+	cd02201, FtsZ_type1, Filamenting temperature sensitive mutant Z, type 1	NA|433aa|down_8|NC_011593.1_947736_949035_+	NA	NA|646aa|down_9|NC_011593.1_949831_951769_-	pfam03235, DUF262, Protein of unknown function DUF262
GCF_000020425.1_ASM2042v1	NC_011593	Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 = DSM 20088, complete sequence	2	1257541-1257628	2	CRISPRCasFinder	no		WYL,c2c9_V-U4,DEDDh,cas14j,cas3	Orphan	CCCGCTGGCGGGGGCTCCCGCGCAGC	26	0	0	NA	NA	NA	1	1	Orphan	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	NA|284aa|up_2|NC_011593.1_1254251_1255103_+,NA|132aa|up_1|NC_011593.1_1256881_1257277_+,NA	NA|137aa|up_9|NC_011593.1_1246972_1247383_+	PRK00051, hisI, phosphoribosyl-AMP cyclohydrolase; Reviewed	NA|519aa|up_8|NC_011593.1_1247449_1249006_+	PRK13571, PRK13571, anthranilate synthase component I; Provisional	NA|216aa|up_7|NC_011593.1_1249070_1249718_-	PRK13197, PRK13197, pyrrolidone-carboxylate peptidase; Provisional	NA|318aa|up_6|NC_011593.1_1249765_1250719_-	pfam06166, DUF979, Protein of unknown function (DUF979)	NA|243aa|up_5|NC_011593.1_1250720_1251449_-	pfam06149, DUF969, Protein of unknown function (DUF969)	NA|534aa|up_4|NC_011593.1_1251682_1253284_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|236aa|up_3|NC_011593.1_1253419_1254127_+	pfam00106, adh_short, short chain dehydrogenase	NA|284aa|up_2|NC_011593.1_1254251_1255103_+	NA	NA|132aa|up_1|NC_011593.1_1256881_1257277_+	NA	NA|82aa|up_0|NC_011593.1_1257277_1257523_+	PRK11578, PRK11578, macrolide transporter subunit MacA; Provisional	NA|218aa|down_0|NC_011593.1_1257740_1258394_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|542aa|down_1|NC_011593.1_1258549_1260175_+	pfam11855, DUF3375, Protein of unknown function (DUF3375)	NA|221aa|down_2|NC_011593.1_1260171_1260834_+	pfam13835, DUF4194, Domain of unknown function (DUF4194)	NA|1185aa|down_3|NC_011593.1_1260830_1264385_+	COG4913, COG4913, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1012aa|down_4|NC_011593.1_1264580_1267616_+	PRK00349, uvrA, excinuclease ABC subunit UvrA	NA|789aa|down_5|NC_011593.1_1267766_1270133_+	PRK00558, uvrC, excinuclease ABC subunit UvrC	NA|324aa|down_6|NC_011593.1_1270241_1271213_+	PRK00258, aroE, shikimate 5-dehydrogenase; Reviewed	NA|329aa|down_7|NC_011593.1_1271212_1272199_+	PRK05416, PRK05416, RNase adapter RapZ	NA|317aa|down_8|NC_011593.1_1272397_1273348_+	TIGR00647, DNA_bind_WhiA, DNA-binding protein WhiA	NA|402aa|down_9|NC_011593.1_1273516_1274722_+	PRK00073, pgk, phosphoglycerate kinase; Provisional
GCF_000020425.1_ASM2042v1	NC_011593	Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 = DSM 20088, complete sequence	3	1989985-1990067	3	CRISPRCasFinder	no	cas3	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	Unclear	GCGATCACCACGGACAAGCTGGC	23	0	0	NA	NA	NA	1	1	Unclear	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	NA|115aa|up_7|NC_011593.1_1981279_1981624_+,NA|92aa|up_6|NC_011593.1_1981627_1981903_+,NA|126aa|up_5|NC_011593.1_1981902_1982280_+,NA|200aa|up_4|NC_011593.1_1982295_1982895_+,NA|122aa|up_3|NC_011593.1_1983054_1983420_+,NA|150aa|up_2|NC_011593.1_1983416_1983866_+,NA|326aa|up_0|NC_011593.1_1986115_1987093_+,NA|191aa|down_0|NC_011593.1_1991253_1991826_+,NA|57aa|down_1|NC_011593.1_1991829_1992000_+,NA|170aa|down_2|NC_011593.1_1992600_1993110_+,NA|152aa|down_3|NC_011593.1_1993135_1993591_+,NA|132aa|down_4|NC_011593.1_1993617_1994013_+	NA|188aa|up_9|NC_011593.1_1980298_1980862_+	pfam11133, Phage_head_fibr, Head fiber protein	NA|136aa|up_8|NC_011593.1_1980875_1981283_+	pfam09355, Phage_Gp19, Phage protein Gp19/Gp15/Gp42	NA|115aa|up_7|NC_011593.1_1981279_1981624_+	NA	NA|92aa|up_6|NC_011593.1_1981627_1981903_+	NA	NA|126aa|up_5|NC_011593.1_1981902_1982280_+	NA	NA|200aa|up_4|NC_011593.1_1982295_1982895_+	NA	NA|122aa|up_3|NC_011593.1_1983054_1983420_+	NA	NA|150aa|up_2|NC_011593.1_1983416_1983866_+	NA	NA|737aa|up_1|NC_011593.1_1983890_1986101_+	TIGR02675, Mu-like_prophage_FluMu_protein_gp42, tape measure domain	NA|326aa|up_0|NC_011593.1_1986115_1987093_+	NA	NA|191aa|down_0|NC_011593.1_1991253_1991826_+	NA	NA|57aa|down_1|NC_011593.1_1991829_1992000_+	NA	NA|170aa|down_2|NC_011593.1_1992600_1993110_+	NA	NA|152aa|down_3|NC_011593.1_1993135_1993591_+	NA	NA|132aa|down_4|NC_011593.1_1993617_1994013_+	NA	NA|411aa|down_5|NC_011593.1_1994073_1995306_+	cd06417, GH25_LysA-like, LysA is a cell wall endolysin produced by Lactobacillus fermentum, which degrades bacterial cell walls by catalyzing the hydrolysis of 1,4-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues	NA|73aa|down_6|NC_011593.1_1995431_1995650_+	pfam16938, Phage_holin_Dp1, Putative phage holin Dp-1	NA|334aa|down_7|NC_011593.1_1996167_1997169_-	COG0248, GppA, Exopolyphosphatase [Nucleotide transport and metabolism / Inorganic ion transport and metabolism]	NA|189aa|down_8|NC_011593.1_1997231_1997798_-	pfam04417, DUF501, Protein of unknown function (DUF501)	NA|200aa|down_9|NC_011593.1_1997794_1998394_-	pfam04977, DivIC, Septum formation initiator
GCF_000020425.1_ASM2042v1	NC_011593	Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 = DSM 20088, complete sequence	4	2438564-2438649	4	CRISPRCasFinder	no		WYL,c2c9_V-U4,DEDDh,cas14j,cas3	Orphan	GGCCCTGAGCGTGCGGGCGCGGA	23	0	0	NA	NA	NA	1	1	Orphan	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	NA|58aa|up_8|NC_011593.1_2428304_2428478_-,NA|30aa|up_5|NC_011593.1_2430649_2430739_-,NA|211aa|up_4|NC_011593.1_2430930_2431563_-,NA|243aa|down_1|NC_011593.1_2440762_2441491_+,NA|40aa|down_5|NC_011593.1_2446211_2446331_-	NA|548aa|up_9|NC_011593.1_2426488_2428132_+	PRK08010, PRK08010, pyridine nucleotide-disulfide oxidoreductase; Provisional	NA|58aa|up_8|NC_011593.1_2428304_2428478_-	NA	NA|297aa|up_7|NC_011593.1_2428777_2429668_+	cd09278, RNase_HI_prokaryote_like, RNase HI family found mainly in prokaryotes	NA|233aa|up_6|NC_011593.1_2429798_2430497_+	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|30aa|up_5|NC_011593.1_2430649_2430739_-	NA	NA|211aa|up_4|NC_011593.1_2430930_2431563_-	NA	NA|513aa|up_3|NC_011593.1_2431690_2433229_+	PRK11823, PRK11823, DNA repair protein RadA; Provisional	NA|375aa|up_2|NC_011593.1_2433243_2434368_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|388aa|up_1|NC_011593.1_2434465_2435629_-	PRK03287, truB, tRNA pseudouridine synthase B; Provisional	NA|159aa|up_0|NC_011593.1_2435630_2436107_-	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|356aa|down_0|NC_011593.1_2439487_2440555_-	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|243aa|down_1|NC_011593.1_2440762_2441491_+	NA	NA|461aa|down_2|NC_011593.1_2442410_2443793_-	cd13585, PBP2_TMBP_like, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose and similar oligosaccharides; possess type 2 periplasmic binding fold	NA|326aa|down_3|NC_011593.1_2443906_2444884_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|304aa|down_4|NC_011593.1_2444883_2445795_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|40aa|down_5|NC_011593.1_2446211_2446331_-	NA	NA|437aa|down_6|NC_011593.1_2446670_2447981_-	pfam13635, DUF4143, Domain of unknown function (DUF4143)	NA|267aa|down_7|NC_011593.1_2448349_2449150_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|727aa|down_8|NC_011593.1_2449354_2451535_-	pfam14403, CP_ATPgrasp_2, Circularly permuted ATP-grasp type 2	NA|304aa|down_9|NC_011593.1_2451755_2452667_+	cd02570, PseudoU_synth_EcTruA, Eukaryotic and bacterial pseudouridine synthases similar to E
GCF_000020425.1_ASM2042v1	NC_011593	Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 = DSM 20088, complete sequence	5	2555486-2555568	5	CRISPRCasFinder	no		WYL,c2c9_V-U4,DEDDh,cas14j,cas3	Orphan	GCGAACCGGGCGGTCCTCGTCAT	23	0	0	NA	NA	NA	1	1	Orphan	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	NA|107aa|up_9|NC_011593.1_2542866_2543187_-,NA	NA|107aa|up_9|NC_011593.1_2542866_2543187_-	NA	NA|231aa|up_8|NC_011593.1_2545363_2546056_-	PRK05424, rplA, 50S ribosomal protein L1; Validated	NA|144aa|up_7|NC_011593.1_2546071_2546503_-	PRK00140, rplK, 50S ribosomal protein L11; Validated	NA|298aa|up_6|NC_011593.1_2546763_2547657_-	PRK05609, nusG, transcription antitermination protein NusG; Validated	NA|76aa|up_5|NC_011593.1_2547686_2547914_-	PRK07597, secE, preprotein translocase subunit SecE; Reviewed	NA|402aa|up_4|NC_011593.1_2548179_2549385_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|378aa|up_3|NC_011593.1_2549475_2550609_-	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|564aa|up_2|NC_011593.1_2550609_2552301_-	PRK12296, obgE, GTPase CgtA; Reviewed	NA|83aa|up_1|NC_011593.1_2552369_2552618_-	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|103aa|up_0|NC_011593.1_2552640_2552949_-	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|402aa|down_0|NC_011593.1_2556369_2557575_-	cd05647, M20_DapE_actinobac, M20 Peptidase actinobacterial DapE encoded N-succinyl-L,L-diaminopimelic acid desuccinylase	NA|315aa|down_1|NC_011593.1_2557681_2558626_+	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|149aa|down_2|NC_011593.1_2558877_2559324_+	COG4154, FucU, Fucose dissimilation pathway protein FucU [Carbohydrate transport and metabolism]	NA|256aa|down_3|NC_011593.1_2559570_2560338_-	COG3618, COG3618, Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]	NA|428aa|down_4|NC_011593.1_2560339_2561623_-	cd17394, MFS_FucP_like, Fucose permease and similar proteins of the Major Facilitator Superfamily of transporters	NA|264aa|down_5|NC_011593.1_2561852_2562644_-	PRK08628, PRK08628, SDR family oxidoreductase	NA|428aa|down_6|NC_011593.1_2562784_2564068_-	cd03324, rTSbeta_L-fuconate_dehydratase, Human rTS beta is encoded by the rTS gene which, through alternative RNA splicing, also encodes rTS alpha whose mRNA is complementary to thymidylate synthase mRNA	NA|343aa|down_7|NC_011593.1_2564247_2565276_+	cd06296, PBP1_CatR-like, ligand-binding domain of a LacI-like transcriptional regulator, CatR which is involved in catechol degradation	NA|483aa|down_8|NC_011593.1_2565257_2566706_-	COG3127, COG3127, Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|317aa|down_9|NC_011593.1_2566702_2567653_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein
