assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000189775.2_ASM18977v3	NC_015555	Thermoanaerobacterium xylanolyticum LX-11, complete genome	1	890521-890731	1	PILER-CR	no		RT,cas3,cas4,csa3,DEDDh,DinG,cas2,cas1,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,cas8b2	Orphan	ATAGCTCAGTTGGTAGAGCAACACCCTGCCAAGCTG	36	0	0	NA	NA	NA	2	2	Orphan	RT,cas3,cas4,csa3,DEDDh,DinG,cas2,cas1,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,cas8b2	NA|501aa|up_3|NC_015555.1_887555_889058_+,NA	NA|259aa|up_9|NC_015555.1_883316_884093_+	PRK00052, PRK00052, prolipoprotein diacylglyceryl transferase; Reviewed	NA|169aa|up_8|NC_015555.1_884145_884652_+	COG1956, COG1956, GAF domain-containing protein [Signal transduction mechanisms]	NA|190aa|up_7|NC_015555.1_884648_885218_+	pfam04011, LemA, LemA family	NA|268aa|up_6|NC_015555.1_885230_886034_+	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|341aa|up_5|NC_015555.1_886030_887053_-	cd02932, OYE_YqiM_FMN, Old yellow enzyme (OYE) YqjM-like FMN binding domain	NA|74aa|up_4|NC_015555.1_887223_887445_+	pfam01106, NifU, NifU-like domain	NA|501aa|up_3|NC_015555.1_887555_889058_+	NA	NA|49aa|up_2|NC_015555.1_889069_889216_+	pfam00269, SASP, Small, acid-soluble spore proteins, alpha/beta type	NA|252aa|up_1|NC_015555.1_889258_890014_+	cd01413, SIR2_Af2, SIR2_Af2: Archaeal and prokaryotic group which includes Archaeoglobus fulgidus Sir2-Af2, Sulfolobus solfataricus ssSir2, and several bacterial homologs; and are members of the SIR2 family of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation	NA|68aa|up_0|NC_015555.1_890090_890294_+	COG1278, CspC, Cold shock proteins [Transcription]	NA|169aa|down_0|NC_015555.1_891578_892085_+	PRK15238, PRK15238, inner membrane transporter YjeM; Provisional	NA|355aa|down_1|NC_015555.1_892185_893250_+	cd01092, APP-like, Similar to Prolidase and Aminopeptidase P	NA|313aa|down_2|NC_015555.1_893360_894299_+	cd08964, L-asparaginase_II, Type II (periplasmic) bacterial L-asparaginase	NA|512aa|down_3|NC_015555.1_894344_895880_+	pfam01293, PEPCK_ATP, Phosphoenolpyruvate carboxykinase	NA|397aa|down_4|NC_015555.1_895898_897089_+	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|477aa|down_5|NC_015555.1_897123_898554_+	PRK15238, PRK15238, inner membrane transporter YjeM; Provisional	NA|467aa|down_6|NC_015555.1_898682_900083_+	pfam09992, NAGPA, Phosphodiester glycosidase	NA|114aa|down_7|NC_015555.1_900149_900491_+	pfam05402, PqqD, Coenzyme PQQ synthesis protein D (PqqD)	NA|151aa|down_8|NC_015555.1_900492_900945_-	pfam06210, DUF1003, Protein of unknown function (DUF1003)	NA|100aa|down_9|NC_015555.1_901024_901324_-	pfam02627, CMD, Carboxymuconolactone decarboxylase family
GCF_000189775.2_ASM18977v3	NC_015555	Thermoanaerobacterium xylanolyticum LX-11, complete genome	2	1096957-1097053	1	CRISPRCasFinder	no		RT,cas3,cas4,csa3,DEDDh,DinG,cas2,cas1,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,cas8b2	Orphan	GTTGAAAAGTATTGATATTATGTCGAGAAGG	31	0	0	NA	NA	NA	1	1	Orphan	RT,cas3,cas4,csa3,DEDDh,DinG,cas2,cas1,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,cas8b2	NA|245aa|up_4|NC_015555.1_1090055_1090790_+,NA	NA|113aa|up_9|NC_015555.1_1084473_1084812_+	COG0347, GlnK, Nitrogen regulatory protein PII [Amino acid transport and metabolism]	NA|440aa|up_8|NC_015555.1_1084954_1086274_+	PRK08032, fliD, flagellar capping protein; Reviewed	NA|614aa|up_7|NC_015555.1_1086460_1088302_+	TIGR01536, Asparagine_synthetase_1, asparagine synthase (glutamine-hydrolyzing)	NA|68aa|up_6|NC_015555.1_1088418_1088622_+	cd03110, SIMIBI_bact_arch, bacterial and archaeal subfamily of SIMIBI	NA|311aa|up_5|NC_015555.1_1089080_1090013_+	COG3706, PleD, Response regulator containing a CheY-like receiver domain and a GGDEF domain [Signal transduction mechanisms]	NA|245aa|up_4|NC_015555.1_1090055_1090790_+	NA	NA|292aa|up_3|NC_015555.1_1090844_1091720_+	pfam08812, YtxC, YtxC-like family	NA|200aa|up_2|NC_015555.1_1091797_1092397_+	COG4399, COG4399, Uncharacterized protein conserved in bacteria [Function unknown]	NA|635aa|up_1|NC_015555.1_1092675_1094580_+	PRK00413, thrS, threonyl-tRNA synthetase; Reviewed	NA|414aa|up_0|NC_015555.1_1094681_1095923_+	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|171aa|down_0|NC_015555.1_1097227_1097740_+	PRK00028, infC, translation initiation factor IF-3; Reviewed	NA|66aa|down_1|NC_015555.1_1097757_1097955_+	PRK00172, rpmI, 50S ribosomal protein L35; Reviewed	NA|120aa|down_2|NC_015555.1_1097974_1098334_+	PRK05185, rplT, 50S ribosomal protein L20; Provisional	NA|257aa|down_3|NC_015555.1_1098389_1099160_+	COG0566, SpoU, rRNA methylases [Translation, ribosomal structure and biogenesis]	NA|340aa|down_4|NC_015555.1_1099456_1100476_+	PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha; Validated	NA|794aa|down_5|NC_015555.1_1100488_1102870_+	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|125aa|down_6|NC_015555.1_1102967_1103342_+	pfam05164, ZapA, Cell division protein ZapA	NA|782aa|down_7|NC_015555.1_1103395_1105741_+	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|787aa|down_8|NC_015555.1_1105740_1108101_+	PRK00409, PRK00409, recombination and DNA strand exchange inhibitor protein; Reviewed	NA|237aa|down_9|NC_015555.1_1108152_1108863_+	TIGR02832, conserved_hypothetical_protein, sporulation protein YunB
GCF_000189775.2_ASM18977v3	NC_015555	Thermoanaerobacterium xylanolyticum LX-11, complete genome	3	2388534-2394161	2,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas3,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20	RT,cas3,cas4,csa3,DEDDh,DinG,cas2,cas1,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,cas8b2	Type III-B,Type III-A,Type III-C,Type I-B,Type III-D	GTTTCAATTCCTTATAGGTAGGCTAAAAAC,GTTTCAATTCCTTATAGGTAGGCTAAAAAC,GTTTCAATTCCTTATAGGTAGGCTAAAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	84,84,84	84	TypeIII-B,TypeIII-A,TypeIII-C,TypeI-B,TypeIII-D	RT,cas3,cas4,csa3,DEDDh,DinG,cas2,cas1,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,cas8b2	NA|102aa|up_0|NC_015555.1_2388185_2388491_-,csm2gr11|124aa|down_5|NC_015555.1_2401019_2401391_-,csm2gr11|157aa|down_9|NC_015555.1_2403772_2404243_-	NA|563aa|up_9|NC_015555.1_2377338_2379027_+	TIGR03710, OAFO_sf, 2-oxoacid:acceptor oxidoreductase, alpha subunit	NA|283aa|up_8|NC_015555.1_2379047_2379896_+	PRK11867, PRK11867, 2-oxoglutarate ferredoxin oxidoreductase subunit beta; Reviewed	NA|624aa|up_7|NC_015555.1_2379935_2381807_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|742aa|up_6|NC_015555.1_2381806_2384032_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|144aa|up_5|NC_015555.1_2384025_2384457_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|523aa|up_4|NC_015555.1_2384643_2386212_-	cd01031, EriC, ClC chloride channel EriC	cas2|88aa|up_3|NC_015555.1_2386380_2386644_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|331aa|up_2|NC_015555.1_2386656_2387649_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas4|168aa|up_1|NC_015555.1_2387645_2388149_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	NA|102aa|up_0|NC_015555.1_2388185_2388491_-	NA	cas3|788aa|down_0|NC_015555.1_2394324_2396688_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|238aa|down_1|NC_015555.1_2396665_2397379_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7|304aa|down_2|NC_015555.1_2397443_2398355_-	TIGR02590, hypothetical_protein_MM_0563, CRISPR-associated protein Cas7/Csh2, subtype I-B/HMARI	cas8b1|586aa|down_3|NC_015555.1_2398354_2400112_-	pfam09484, Cas_TM1802, CRISPR-associated protein TM1802 (cas_TM1802)	cas6|247aa|down_4|NC_015555.1_2400126_2400867_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	csm2gr11|124aa|down_5|NC_015555.1_2401019_2401391_-	NA	csm3gr7|285aa|down_6|NC_015555.1_2401404_2402259_-	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csm3gr7|260aa|down_7|NC_015555.1_2402245_2403025_-	cd09683, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm3gr7|250aa|down_8|NC_015555.1_2403017_2403767_-	cd09683, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|157aa|down_9|NC_015555.1_2403772_2404243_-	NA
GCF_000189775.2_ASM18977v3	NC_015555	Thermoanaerobacterium xylanolyticum LX-11, complete genome	4	2457993-2458851	3,3,2	CRISPRCasFinder,PILER-CR,CRT	no	cas6,cas2,cas1,cas4,cas3,cas5,cas7,cas8b2	RT,cas3,cas4,csa3,DEDDh,DinG,cas2,cas1,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,cas8b2	Unclear	GTTTCAATTCCACTATGGTTAGATTAAATC,GTTTCAATTCCACTATGGTTAGATTAAATC,GTTTCAATTCCACTATGGTTAGATTAAATC	30,30,30	0	0	NA	NA	I-A,II-B,III-A:I-A,II-B,III-A:I-A,II-B,III-A	12,9,9	12	Unclear	RT,cas3,cas4,csa3,DEDDh,DinG,cas2,cas1,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,cas8b2	NA|85aa|up_0|NC_015555.1_2457728_2457983_-,NA	NA|232aa|up_9|NC_015555.1_2448294_2448990_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|272aa|up_8|NC_015555.1_2448996_2449812_-	pfam13485, Peptidase_MA_2, Peptidase MA superfamily	NA|402aa|up_7|NC_015555.1_2449945_2451151_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|324aa|up_6|NC_015555.1_2451332_2452304_+	pfam07261, DnaB_2, Replication initiation and membrane attachment	NA|327aa|up_5|NC_015555.1_2452296_2453277_+	PRK06835, PRK06835, DNA replication protein DnaC; Validated	NA|481aa|up_4|NC_015555.1_2453273_2454716_-	cd09604, M1_APN_like, Peptidase M1 family similar to aminopeptidase N catalytic domain	NA|204aa|up_3|NC_015555.1_2454855_2455467_+	TIGR02840, conserved_hypothetical_protein, putative sporulation protein YtaF	NA|147aa|up_2|NC_015555.1_2456871_2457312_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|60aa|up_1|NC_015555.1_2457496_2457676_-	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|85aa|up_0|NC_015555.1_2457728_2457983_-	NA	NA|458aa|down_0|NC_015555.1_2458949_2460323_-	pfam13546, DDE_5, DDE superfamily endonuclease	cas6|252aa|down_1|NC_015555.1_2469952_2470708_-	cd09759, Cas6_I-A, CRISPR/Cas system-associated RAMP superfamily protein Cas6	cas2|88aa|down_2|NC_015555.1_2470718_2470982_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|331aa|down_3|NC_015555.1_2470983_2471976_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas4|164aa|down_4|NC_015555.1_2471987_2472479_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|739aa|down_5|NC_015555.1_2472497_2474714_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|216aa|down_6|NC_015555.1_2474685_2475333_-	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas7|339aa|down_7|NC_015555.1_2475339_2476356_-	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8b2|460aa|down_8|NC_015555.1_2476374_2477754_-	cd09665, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	NA|351aa|down_9|NC_015555.1_2478081_2479134_+	pfam07854, DUF1646, Protein of unknown function (DUF1646)
GCF_000189775.2_ASM18977v3	NC_015555	Thermoanaerobacterium xylanolyticum LX-11, complete genome	5	2460483-2469714	4,3,4	CRISPRCasFinder,CRT,PILER-CR	no	cas6,cas2,cas1,cas4,cas3,cas5,cas7,cas8b2	RT,cas3,cas4,csa3,DEDDh,DinG,cas2,cas1,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,cas8b2	Unclear	GTTTCAATTCCACTATGGTTAGATTAAATC,GTTTCAATTCCACTATGGTTAGATTAAATC,GTTTCAATTCCACTATGGTTAGATTAAATC	30,30,30	0	0	NA	NA	I-A,II-B,III-A:I-A,II-B,III-A:I-A,II-B,III-A	139,139,134	139	Unclear	RT,cas3,cas4,csa3,DEDDh,DinG,cas2,cas1,cas5,cas7,cas8b1,cas6,csm2gr11,csm3gr7,csx10gr5,cas10,csx1,csx20,cas8b2	NA|85aa|up_1|NC_015555.1_2457728_2457983_-,NA|224aa|down_9|NC_015555.1_2479197_2479869_-	NA|272aa|up_9|NC_015555.1_2448996_2449812_-	pfam13485, Peptidase_MA_2, Peptidase MA superfamily	NA|402aa|up_8|NC_015555.1_2449945_2451151_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|324aa|up_7|NC_015555.1_2451332_2452304_+	pfam07261, DnaB_2, Replication initiation and membrane attachment	NA|327aa|up_6|NC_015555.1_2452296_2453277_+	PRK06835, PRK06835, DNA replication protein DnaC; Validated	NA|481aa|up_5|NC_015555.1_2453273_2454716_-	cd09604, M1_APN_like, Peptidase M1 family similar to aminopeptidase N catalytic domain	NA|204aa|up_4|NC_015555.1_2454855_2455467_+	TIGR02840, conserved_hypothetical_protein, putative sporulation protein YtaF	NA|147aa|up_3|NC_015555.1_2456871_2457312_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|60aa|up_2|NC_015555.1_2457496_2457676_-	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|85aa|up_1|NC_015555.1_2457728_2457983_-	NA	NA|458aa|up_0|NC_015555.1_2458949_2460323_-	pfam13546, DDE_5, DDE superfamily endonuclease	cas6|252aa|down_0|NC_015555.1_2469952_2470708_-	cd09759, Cas6_I-A, CRISPR/Cas system-associated RAMP superfamily protein Cas6	cas2|88aa|down_1|NC_015555.1_2470718_2470982_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|331aa|down_2|NC_015555.1_2470983_2471976_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas4|164aa|down_3|NC_015555.1_2471987_2472479_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|739aa|down_4|NC_015555.1_2472497_2474714_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|216aa|down_5|NC_015555.1_2474685_2475333_-	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas7|339aa|down_6|NC_015555.1_2475339_2476356_-	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8b2|460aa|down_7|NC_015555.1_2476374_2477754_-	cd09665, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	NA|351aa|down_8|NC_015555.1_2478081_2479134_+	pfam07854, DUF1646, Protein of unknown function (DUF1646)	NA|224aa|down_9|NC_015555.1_2479197_2479869_-	NA
