assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001682135.1_ASM168213v1	NZ_CP007121	Thermosipho sp. 1070 chromosome, complete genome	1	125564-126342	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	Type III-A,Type III-D,Type III-C,Type III-B	GTTTCCATTCCTCATAGGTATGTTCTAAAC,GTTTCCATTCCTCATAGGTATGTTCTAAAC,GTTTCCATTCCTCATAGGTATGTTCTAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	11,11,10	11	TypeIII-A,TypeIII-D,TypeIII-C,TypeIII-B	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	NA|251aa|up_5|NZ_CP007121.1_122029_122782_-,NA|134aa|up_3|NZ_CP007121.1_123482_123884_-,NA	csm2gr11|138aa|up_9|NZ_CP007121.1_116527_116941_-	pfam03750, Csm2_III-A, Csm2 Type III-A	cas10|756aa|up_8|NZ_CP007121.1_116942_119210_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csx1|454aa|up_7|NZ_CP007121.1_119225_120587_-	cd09728, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	NA|481aa|up_6|NZ_CP007121.1_120583_122026_-	pfam18145, SAVED, SMODS-associated and fused to various effectors sensor domain	NA|251aa|up_5|NZ_CP007121.1_122029_122782_-	NA	NA|223aa|up_4|NZ_CP007121.1_122806_123475_-	TIGR02943, RNA_polymerase_sigma-24_subunit_RpoE, RNA polymerase sigma-70 factor, TIGR02943 family	NA|134aa|up_3|NZ_CP007121.1_123482_123884_-	NA	cas2|90aa|up_2|NZ_CP007121.1_123932_124202_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|329aa|up_1|NZ_CP007121.1_124206_125193_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas2|88aa|up_0|NZ_CP007121.1_125203_125467_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|263aa|down_0|NZ_CP007121.1_127133_127922_-	cd07713, DHPS-like_MBL-fold, Methanocaldococcus jannaschii dihydropteroate synthase, Thermoanaerobacter tengcongensis Tflp, and related proteins; MBL-fold metallo hydrolase domain	NA|461aa|down_1|NZ_CP007121.1_127969_129352_-	PRK06828, PRK06828, amidase; Provisional	NA|280aa|down_2|NZ_CP007121.1_129378_130218_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|294aa|down_3|NZ_CP007121.1_130230_131112_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|413aa|down_4|NZ_CP007121.1_131163_132402_-	cd14750, PBP2_TMBP, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose; possesses type 2 periplasmic binding fold	NA|404aa|down_5|NZ_CP007121.1_132734_133946_+	cd03792, GT4_trehalose_phosphorylase, trehalose phosphorylase and similar proteins	NA|372aa|down_6|NZ_CP007121.1_133945_135061_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|313aa|down_7|NZ_CP007121.1_135041_135980_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|225aa|down_8|NZ_CP007121.1_135966_136641_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|215aa|down_9|NZ_CP007121.1_136637_137282_-	TIGR02135, Uncharacterized_protein, phosphate transport system regulatory protein PhoU
GCF_001682135.1_ASM168213v1	NZ_CP007121	Thermosipho sp. 1070 chromosome, complete genome	2	306563-310795	2,2,2	CRT,PILER-CR,CRISPRCasFinder	no	WYL,cas10d,csc2gr7,csc1gr5,cas3,cas1,cas2	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	Type I-D	GTTAAAAAACCTAATTCCATAAAATGGAATTCAAAC,GTTAAAAAACCTAATTCCATAAAATGGAATTCAAAC,GTTAAAAAACCTAATTCCATAAAATGGAATTCAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	58,57,57	58	TypeI-D	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	NA,NA|363aa|down_9|NZ_CP007121.1_321903_322992_+	NA|335aa|up_9|NZ_CP007121.1_294970_295975_-	PRK00147, queA, S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional	NA|85aa|up_8|NZ_CP007121.1_296032_296287_-	PRK05626, rpsO, 30S ribosomal protein S15; Reviewed	NA|244aa|up_7|NZ_CP007121.1_296676_297408_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	WYL|320aa|up_6|NZ_CP007121.1_297400_298360_+	pfam13280, WYL, WYL domain	cas10d|790aa|up_5|NZ_CP007121.1_298498_300868_+	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	csc2gr7|324aa|up_4|NZ_CP007121.1_300860_301832_+	pfam18320, Csc2, Csc2 Crispr	csc1gr5|231aa|up_3|NZ_CP007121.1_301880_302573_+	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	cas3|678aa|up_2|NZ_CP007121.1_302562_304596_+	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	cas1|517aa|up_1|NZ_CP007121.1_304611_306162_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|92aa|up_0|NZ_CP007121.1_306167_306443_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|564aa|down_0|NZ_CP007121.1_311314_313006_-	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|470aa|down_1|NZ_CP007121.1_313002_314412_-	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|474aa|down_2|NZ_CP007121.1_314416_315838_-	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|403aa|down_3|NZ_CP007121.1_315994_317203_+	PRK10720, PRK10720, uracil transporter; Provisional	NA|324aa|down_4|NZ_CP007121.1_317213_318185_-	COG3594, NolL, Fucose 4-O-acetylase and related acetyltransferases [Carbohydrate transport and metabolism]	NA|431aa|down_5|NZ_CP007121.1_318294_319587_+	PRK14330, PRK14330, (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional	NA|288aa|down_6|NZ_CP007121.1_319600_320464_+	COG1284, COG1284, Uncharacterized conserved protein [Function unknown]	NA|177aa|down_7|NZ_CP007121.1_320473_321004_+	cd00431, cysteine_hydrolases, Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle	NA|311aa|down_8|NZ_CP007121.1_321005_321938_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|363aa|down_9|NZ_CP007121.1_321903_322992_+	NA
GCF_001682135.1_ASM168213v1	NZ_CP007121	Thermosipho sp. 1070 chromosome, complete genome	3	625089-626267	3,3,3	PILER-CR,CRISPRCasFinder,CRT	no	DEDDh	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	Unclear	TTTCCATTCCTCATAGGTATGTTCTAAAC,GTTTAGAACATACCTATGAGGAATGGAAAC,GTTTAGAACATACCTATGAGGAATGGAAAC	29,30,30	0	0	NA	NA	NA:NA:NA	17,17,17	17	Orphan	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	NA,NA	NA|191aa|up_9|NZ_CP007121.1_612164_612737_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|665aa|up_8|NZ_CP007121.1_612794_614789_+	cd11335, AmyAc_MTase_N, Alpha amylase catalytic domain found in maltosyltransferase	NA|844aa|up_7|NZ_CP007121.1_614804_617336_+	TIGR02104, pulA_typeI, pullulanase, type I	NA|120aa|up_6|NZ_CP007121.1_617379_617739_+	cd17554, REC_TrrA-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator TrrA and similar domains	NA|909aa|up_5|NZ_CP007121.1_617823_620550_+	PRK05743, ileS, isoleucyl-tRNA synthetase; Reviewed	NA|167aa|up_4|NZ_CP007121.1_620587_621088_+	TIGR00725, TIGR00725, TIGR00725 family protein	NA|290aa|up_3|NZ_CP007121.1_621084_621954_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|336aa|up_2|NZ_CP007121.1_621940_622948_+	PRK13930, PRK13930, rod shape-determining protein MreB; Provisional	NA|234aa|up_1|NZ_CP007121.1_622957_623659_+	COG4786, FlgG, Flagellar basal body rod protein [Cell motility and secretion]	NA|262aa|up_0|NZ_CP007121.1_623673_624459_+	PRK12693, flgG, flagellar basal body rod protein FlgG; Provisional	NA|241aa|down_0|NZ_CP007121.1_626439_627162_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|217aa|down_1|NZ_CP007121.1_627161_627812_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|246aa|down_2|NZ_CP007121.1_627824_628562_-	cd13624, PBP2_Arg_Lys_His, Substrate binding domain of the arginine-, lysine-, histidine-binding protein ArtJ; the type 2 periplasmic binding protein fold	NA|602aa|down_3|NZ_CP007121.1_628675_630481_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|334aa|down_4|NZ_CP007121.1_630449_631451_-	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|61aa|down_5|NZ_CP007121.1_631453_631636_-	PRK12286, rpmF, 50S ribosomal protein L32; Reviewed	NA|183aa|down_6|NZ_CP007121.1_631654_632203_-	pfam02620, DUF177, Uncharacterized ACR, COG1399	NA|320aa|down_7|NZ_CP007121.1_632251_633211_+	cd00254, LT-like, lytic transglycosylase(LT)-like domain	NA|337aa|down_8|NZ_CP007121.1_634831_635842_+	PRK05686, fliG, flagellar motor switch protein G; Validated	DEDDh|188aa|down_9|NZ_CP007121.1_635849_636413_+	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family
GCF_001682135.1_ASM168213v1	NZ_CP007121	Thermosipho sp. 1070 chromosome, complete genome	4	923880-925386	4,4,4	PILER-CR,CRISPRCasFinder,CRT	no	cas3HD	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	Unclear	GTTTCCATTCCTCATAGGTATGTTCTAAAC,GTTTCCATTCCTCATAGGTATGTTCTAAAC,GTTTCCATTCCTCATAGGTATGTTCTAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	22,22,22	22	Unclear	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	NA,NA	NA|516aa|up_9|NZ_CP007121.1_911985_913533_+	COG1031, COG1031, Uncharacterized Fe-S oxidoreductase [Energy production and conversion]	NA|319aa|up_8|NZ_CP007121.1_913528_914485_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	cas3HD|410aa|up_7|NZ_CP007121.1_914488_915718_+	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms]	NA|156aa|up_6|NZ_CP007121.1_915730_916198_+	pfam12850, Metallophos_2, Calcineurin-like phosphoesterase superfamily domain	NA|456aa|up_5|NZ_CP007121.1_916200_917568_+	cd11316, AmyAc_bac2_AmyA, Alpha amylase catalytic domain found in bacterial Alpha-amylases (also called 1,4-alpha-D-glucan-4-glucanohydrolase)	NA|553aa|up_4|NZ_CP007121.1_917567_919226_+	PRK00095, mutL, DNA mismatch repair endonuclease MutL	NA|272aa|up_3|NZ_CP007121.1_919149_919965_-	pfam12850, Metallophos_2, Calcineurin-like phosphoesterase superfamily domain	NA|295aa|up_2|NZ_CP007121.1_920013_920898_-	cd18614, GH130, Glycosyl hydrolase family 130; uncharacterized	NA|325aa|up_1|NZ_CP007121.1_920913_921888_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|418aa|up_0|NZ_CP007121.1_922100_923354_-	PRK12391, PRK12391, TrpB-like pyridoxal phosphate-dependent enzyme	NA|203aa|down_0|NZ_CP007121.1_926469_927078_+	PRK13197, PRK13197, pyrrolidone-carboxylate peptidase; Provisional	NA|274aa|down_1|NZ_CP007121.1_927052_927874_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|155aa|down_2|NZ_CP007121.1_927878_928343_-	cd03018, PRX_AhpE_like, Peroxiredoxin (PRX) family, AhpE-like subfamily; composed of proteins similar to Mycobacterium tuberculosis AhpE	NA|125aa|down_3|NZ_CP007121.1_928444_928819_+	COG1832, COG1832, Predicted CoA-binding protein [General function prediction only]	NA|572aa|down_4|NZ_CP007121.1_928823_930539_+	cd05799, PGM2, This CD includes PGM2 (phosphoglucomutase 2) and PGM2L1 (phosphoglucomutase 2-like 1)	NA|223aa|down_5|NZ_CP007121.1_930580_931249_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|398aa|down_6|NZ_CP007121.1_931309_932503_+	COG2461, COG2461, Uncharacterized conserved protein [Function unknown]	NA|553aa|down_7|NZ_CP007121.1_932515_934174_+	PRK05290, PRK05290, hybrid cluster protein; Provisional	NA|438aa|down_8|NZ_CP007121.1_934398_935712_+	cd14748, PBP2_UgpB, The periplasmic-binding component of ABC transport system specific for sn-glycerol-3-phosphate; possesses type 2 periplasmic binding fold	NA|289aa|down_9|NZ_CP007121.1_935753_936620_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]
