assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000024925.1_ASM2492v1	NC_013521	Sanguibacter keddieii DSM 10542, complete sequence	1	291042-291425	1	CRISPRCasFinder	no		WYL,cas3,csa3,DEDDh,cas4,DinG,PD-DExK	Orphan	CCGTCGCCGTAGTTGTTGATGGTG	24	0	0	NA	NA	NA	7	7	Orphan	WYL,cas3,csa3,DEDDh,cas4,DinG,PD-DExK	NA|178aa|up_3|NC_013521.1_287499_288033_+,NA|213aa|down_6|NC_013521.1_298356_298995_+,NA|161aa|down_7|NC_013521.1_299044_299527_-	NA|310aa|up_9|NC_013521.1_280825_281755_+	cd08434, PBP2_GltC_like, The substrate binding domain of LysR-type transcriptional regulator GltC, which activates gltA expression of glutamate synthase operon, contains type 2 periplasmic binding fold	NA|346aa|up_8|NC_013521.1_281788_282826_-	cd08983, GH43_Bt3655-like, Glycosyl hydrolase family 43 protein such as Bacteroides thetaiotaomicron VPI-5482 arabinofuranosidase Bt3655	NA|241aa|up_7|NC_013521.1_282995_283718_-	pfam13462, Thioredoxin_4, Thioredoxin	NA|342aa|up_6|NC_013521.1_283914_284940_+	cd03268, ABC_BcrA_bacitracin_resist, ATP-binding cassette domain of the bacitracin-resistance transporter	NA|329aa|up_5|NC_013521.1_284936_285923_+	COG1277, NosY, ABC-type transport system involved in multi-copper enzyme maturation, permease component [General function prediction only]	NA|390aa|up_4|NC_013521.1_286058_287228_-	pfam07510, DUF1524, Protein of unknown function (DUF1524)	NA|178aa|up_3|NC_013521.1_287499_288033_+	NA	NA|331aa|up_2|NC_013521.1_288099_289092_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|254aa|up_1|NC_013521.1_289088_289850_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|218aa|up_0|NC_013521.1_289846_290500_-	pfam11259, DUF3060, Protein of unknown function (DUF3060)	NA|170aa|down_0|NC_013521.1_292257_292767_+	pfam09990, DUF2231, Predicted membrane protein (DUF2231)	NA|408aa|down_1|NC_013521.1_292861_294085_-	TIGR03356, BGL, beta-galactosidase	NA|429aa|down_2|NC_013521.1_294226_295513_-	cd17313, MFS_SLC45_SUC, Solute carrier family 45 and similar sugar transporters of the Major Facilitator Superfamily of transporters	NA|212aa|down_3|NC_013521.1_295640_296276_+	TIGR03384, betaine_BetI, transcriptional repressor BetI	NA|348aa|down_4|NC_013521.1_296446_297490_+	TIGR03858, LLM_2I7G, probable oxidoreductase, LLM family	NA|145aa|down_5|NC_013521.1_297922_298357_+	cd04279, ZnMc_MMP_like_1, Zinc-dependent metalloprotease; MMP_like sub-family 1	NA|213aa|down_6|NC_013521.1_298356_298995_+	NA	NA|161aa|down_7|NC_013521.1_299044_299527_-	NA	NA|223aa|down_8|NC_013521.1_299676_300345_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|164aa|down_9|NC_013521.1_300341_300833_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family
GCF_000024925.1_ASM2492v1	NC_013521	Sanguibacter keddieii DSM 10542, complete sequence	2	1096750-1096839	2	CRISPRCasFinder	no		WYL,cas3,csa3,DEDDh,cas4,DinG,PD-DExK	Orphan	CGGACCAGCATGCGCCCCGGAGC	23	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,csa3,DEDDh,cas4,DinG,PD-DExK	NA|181aa|up_7|NC_013521.1_1089812_1090355_-,NA|163aa|up_6|NC_013521.1_1090743_1091232_+,NA|193aa|up_2|NC_013521.1_1094735_1095314_-,NA	NA|721aa|up_9|NC_013521.1_1084357_1086520_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|932aa|up_8|NC_013521.1_1086662_1089458_+	PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed	NA|181aa|up_7|NC_013521.1_1089812_1090355_-	NA	NA|163aa|up_6|NC_013521.1_1090743_1091232_+	NA	NA|267aa|up_5|NC_013521.1_1091361_1092162_-	TIGR02067, his_9_HisN, histidinol-phosphatase, inositol monophosphatase family	NA|345aa|up_4|NC_013521.1_1092220_1093255_-	cd01854, YjeQ_EngC, Ribosomal interacting GTPase YjeQ/EngC, a circularly permuted subfamily of the Ras GTPases	NA|449aa|up_3|NC_013521.1_1093260_1094607_-	PRK02427, PRK02427, 3-phosphoshikimate 1-carboxyvinyltransferase; Provisional	NA|193aa|up_2|NC_013521.1_1094735_1095314_-	NA	NA|213aa|up_1|NC_013521.1_1095628_1096267_+	TIGR02947, putative_RNA_polymerase_sigma_factor, RNA polymerase sigma-70 factor, TIGR02947 family	NA|71aa|up_0|NC_013521.1_1096293_1096506_+	TIGR02949, putative_anti-sigma_factor, anti-sigma factor, TIGR02949 family	NA|363aa|down_0|NC_013521.1_1096888_1097977_+	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|285aa|down_1|NC_013521.1_1098163_1099018_+	pfam13349, DUF4097, Putative adhesin	NA|237aa|down_2|NC_013521.1_1099165_1099876_-	cd01835, SGNH_hydrolase_like_3, SGNH_hydrolase subfamily	NA|1296aa|down_3|NC_013521.1_1099971_1103859_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|273aa|down_4|NC_013521.1_1104178_1104997_-	pfam12679, ABC2_membrane_2, ABC-2 family transporter protein	NA|320aa|down_5|NC_013521.1_1104993_1105953_-	cd03268, ABC_BcrA_bacitracin_resist, ATP-binding cassette domain of the bacitracin-resistance transporter	NA|383aa|down_6|NC_013521.1_1105993_1107142_-	PRK10549, PRK10549, two-component system sensor histidine kinase BaeS	NA|245aa|down_7|NC_013521.1_1107158_1107893_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|444aa|down_8|NC_013521.1_1108163_1109495_+	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|347aa|down_9|NC_013521.1_1109491_1110532_+	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]
GCF_000024925.1_ASM2492v1	NC_013521	Sanguibacter keddieii DSM 10542, complete sequence	3	2612416-2612519	3	CRISPRCasFinder	no		WYL,cas3,csa3,DEDDh,cas4,DinG,PD-DExK	Orphan	GGCATGCCCTGGCTCGTCGCGAAGGGGTTGTT	32	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,csa3,DEDDh,cas4,DinG,PD-DExK	NA|534aa|up_4|NC_013521.1_2605036_2606638_-,NA	NA|239aa|up_9|NC_013521.1_2599500_2600217_-	PRK00048, PRK00048, dihydrodipicolinate reductase; Provisional	NA|149aa|up_8|NC_013521.1_2600332_2600779_+	pfam11593, Med3, Mediator complex subunit 3 fungal	NA|442aa|up_7|NC_013521.1_2600840_2602166_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|736aa|up_6|NC_013521.1_2602167_2604375_-	TIGR02696, polyribonucleotide_nucleotidyltransferase, guanosine pentaphosphate synthetase I/polynucleotide phosphorylase	NA|90aa|up_5|NC_013521.1_2604598_2604868_-	PRK05626, rpsO, 30S ribosomal protein S15; Reviewed	NA|534aa|up_4|NC_013521.1_2605036_2606638_-	NA	NA|229aa|up_3|NC_013521.1_2606641_2607328_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|330aa|up_2|NC_013521.1_2607346_2608336_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|344aa|up_1|NC_013521.1_2608476_2609508_-	PRK03287, truB, tRNA pseudouridine synthase B; Provisional	NA|155aa|up_0|NC_013521.1_2609530_2609995_-	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|99aa|down_0|NC_013521.1_2613075_2613372_-	pfam04296, DUF448, Protein of unknown function (DUF448)	NA|366aa|down_1|NC_013521.1_2613493_2614591_-	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|182aa|down_2|NC_013521.1_2614592_2615138_-	PRK00092, PRK00092, ribosome maturation protein RimP; Reviewed	NA|401aa|down_3|NC_013521.1_2615329_2616532_+	pfam14530, DUF4439, Domain of unknown function (DUF4439)	NA|592aa|down_4|NC_013521.1_2616640_2618416_-	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|269aa|down_5|NC_013521.1_2618470_2619277_-	pfam13312, DUF4081, Domain of unknown function (DUF4081)	NA|378aa|down_6|NC_013521.1_2619440_2620574_-	PRK00366, ispG, flavodoxin-dependent (E)-4-hydroxy-3-methylbut-2-enyl-diphosphate synthase	NA|439aa|down_7|NC_013521.1_2620680_2621997_-	cd06163, S2P-M50_PDZ_RseP-like, RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|396aa|down_8|NC_013521.1_2622114_2623302_-	PRK05447, PRK05447, 1-deoxy-D-xylulose 5-phosphate reductoisomerase; Provisional	NA|184aa|down_9|NC_013521.1_2623291_2623843_-	TIGR03543, divI1A_rptt_fam, DivIVA domain repeat protein
GCF_000024925.1_ASM2492v1	NC_013521	Sanguibacter keddieii DSM 10542, complete sequence	4	4114984-4115070	4	CRISPRCasFinder	no	csa3	WYL,cas3,csa3,DEDDh,cas4,DinG,PD-DExK	Type I-A	CCACGACCACTCGGACGAGGCCAC	24	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,csa3,DEDDh,cas4,DinG,PD-DExK	NA|170aa|up_1|NC_013521.1_4112315_4112825_-,NA|171aa|up_0|NC_013521.1_4112914_4113427_-,NA	NA|314aa|up_9|NC_013521.1_4106012_4106954_-	COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]	csa3|145aa|up_8|NC_013521.1_4106950_4107385_-	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|180aa|up_7|NC_013521.1_4107664_4108204_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|410aa|up_6|NC_013521.1_4108222_4109452_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|231aa|up_5|NC_013521.1_4109559_4110252_-	pfam12840, HTH_20, Helix-turn-helix domain	NA|159aa|up_4|NC_013521.1_4110325_4110802_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|278aa|up_3|NC_013521.1_4110866_4111700_+	pfam12695, Abhydrolase_5, Alpha/beta hydrolase family	NA|168aa|up_2|NC_013521.1_4111775_4112279_+	cd01110, HTH_SoxR, Helix-Turn-Helix DNA binding domain of the SoxR transcription regulator	NA|170aa|up_1|NC_013521.1_4112315_4112825_-	NA	NA|171aa|up_0|NC_013521.1_4112914_4113427_-	NA	NA|289aa|down_0|NC_013521.1_4115220_4116087_+	pfam01297, ZnuA, Zinc-uptake complex component A periplasmic	NA|218aa|down_1|NC_013521.1_4116319_4116973_-	pfam04234, CopC, CopC domain	NA|640aa|down_2|NC_013521.1_4117152_4119072_-	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|317aa|down_3|NC_013521.1_4119193_4120144_+	TIGR01249, Putative_proline_iminopeptidase, proline iminopeptidase, Neisseria-type subfamily	NA|289aa|down_4|NC_013521.1_4120219_4121086_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|217aa|down_5|NC_013521.1_4121082_4121733_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|291aa|down_6|NC_013521.1_4121890_4122763_-	cd13690, PBP2_GluB, Substrate binding domain of ABC glutamate transporter; the type 2 periplasmic binding protein fold	NA|258aa|down_7|NC_013521.1_4122897_4123671_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|274aa|down_8|NC_013521.1_4123956_4124778_-	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|361aa|down_9|NC_013521.1_4124728_4125811_+	pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region
