assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001941365.1_ASM194136v1	NZ_CP007223	Thermosipho sp. 1063, complete genome	1	125563-126130	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	Type III-B,Type III-C,Type III-D,Type III-A	GTTTCCATTCCTCATAGGTATGTTCTAAAC,GTTTCCATTCCTCATAGGTATGTTCTAAAC,GTTTCCATTCCTCATAGGTATGTTCTAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	8,8,7	8	TypeIII-B,TypeIII-C,TypeIII-D,TypeIII-A	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	NA|251aa|up_5|NZ_CP007223.1_122028_122781_-,NA|223aa|up_4|NZ_CP007223.1_122805_123474_-,NA|134aa|up_3|NZ_CP007223.1_123481_123883_-,NA	csm2gr11|138aa|up_9|NZ_CP007223.1_116526_116940_-	pfam03750, Csm2_III-A, Csm2 Type III-A	cas10|756aa|up_8|NZ_CP007223.1_116941_119209_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csx1|454aa|up_7|NZ_CP007223.1_119224_120586_-	cd09728, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	NA|481aa|up_6|NZ_CP007223.1_120582_122025_-	pfam18145, SAVED, SMODS-associated and fused to various effectors sensor domain	NA|251aa|up_5|NZ_CP007223.1_122028_122781_-	NA	NA|223aa|up_4|NZ_CP007223.1_122805_123474_-	NA	NA|134aa|up_3|NZ_CP007223.1_123481_123883_-	NA	cas2|90aa|up_2|NZ_CP007223.1_123931_124201_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|329aa|up_1|NZ_CP007223.1_124205_125192_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas2|88aa|up_0|NZ_CP007223.1_125202_125466_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|263aa|down_0|NZ_CP007223.1_126921_127710_-	cd07713, DHPS-like_MBL-fold, Methanocaldococcus jannaschii dihydropteroate synthase, Thermoanaerobacter tengcongensis Tflp, and related proteins; MBL-fold metallo hydrolase domain	NA|461aa|down_1|NZ_CP007223.1_127757_129140_-	PRK06828, PRK06828, amidase; Provisional	NA|280aa|down_2|NZ_CP007223.1_129166_130006_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|294aa|down_3|NZ_CP007223.1_130018_130900_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|413aa|down_4|NZ_CP007223.1_130951_132190_-	cd14750, PBP2_TMBP, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose; possesses type 2 periplasmic binding fold	NA|404aa|down_5|NZ_CP007223.1_132522_133734_+	cd03792, GT4_trehalose_phosphorylase, trehalose phosphorylase and similar proteins	NA|372aa|down_6|NZ_CP007223.1_133733_134849_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|313aa|down_7|NZ_CP007223.1_134829_135768_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|225aa|down_8|NZ_CP007223.1_135754_136429_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|215aa|down_9|NZ_CP007223.1_136425_137070_-	TIGR02135, Uncharacterized_protein, phosphate transport system regulatory protein PhoU
GCF_001941365.1_ASM194136v1	NZ_CP007223	Thermosipho sp. 1063, complete genome	2	306496-310809	2,2,2,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	WYL,cas10d,csc2gr7,csc1gr5,cas3,cas1,cas2	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	Type I-D	GTTAAAAAACCTAATTCCATAAATGGAATTCAAAC,GTTAAAAAACCTAATTCCATAAATGGAATTCAAAC,GTTAAAAAACCTAATTCCATAAATGGAATTCAAAC,GTTAAAAAACCTAATTCCATAAATGGAATTCAAAC	35,35,35,35	0	0	NA	NA	NA:NA:NA:NA	58,60,60,58	60	TypeI-D	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	NA,NA|369aa|down_9|NZ_CP007223.1_321899_323006_+	NA|335aa|up_9|NZ_CP007223.1_294834_295839_-	PRK00147, queA, S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional	NA|85aa|up_8|NZ_CP007223.1_295896_296151_-	PRK05626, rpsO, 30S ribosomal protein S15; Reviewed	NA|244aa|up_7|NZ_CP007223.1_296540_297272_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	WYL|320aa|up_6|NZ_CP007223.1_297264_298224_+	pfam13280, WYL, WYL domain	cas10d|790aa|up_5|NZ_CP007223.1_298362_300732_+	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	csc2gr7|324aa|up_4|NZ_CP007223.1_300724_301696_+	pfam18320, Csc2, Csc2 Crispr	csc1gr5|231aa|up_3|NZ_CP007223.1_301744_302437_+	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	cas3|678aa|up_2|NZ_CP007223.1_302426_304460_+	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	cas1|517aa|up_1|NZ_CP007223.1_304475_306026_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|92aa|up_0|NZ_CP007223.1_306031_306307_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|564aa|down_0|NZ_CP007223.1_311328_313020_-	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|470aa|down_1|NZ_CP007223.1_313016_314426_-	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|474aa|down_2|NZ_CP007223.1_314430_315852_-	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|403aa|down_3|NZ_CP007223.1_316008_317217_+	PRK10720, PRK10720, uracil transporter; Provisional	NA|324aa|down_4|NZ_CP007223.1_317227_318199_-	COG3594, NolL, Fucose 4-O-acetylase and related acetyltransferases [Carbohydrate transport and metabolism]	NA|431aa|down_5|NZ_CP007223.1_318308_319601_+	PRK14330, PRK14330, (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional	NA|288aa|down_6|NZ_CP007223.1_319614_320478_+	COG1284, COG1284, Uncharacterized conserved protein [Function unknown]	NA|177aa|down_7|NZ_CP007223.1_320487_321018_+	cd00431, cysteine_hydrolases, Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle	NA|311aa|down_8|NZ_CP007223.1_321019_321952_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|369aa|down_9|NZ_CP007223.1_321899_323006_+	NA
GCF_001941365.1_ASM194136v1	NZ_CP007223	Thermosipho sp. 1063, complete genome	3	624605-625388	4,3,3	PILER-CR,CRISPRCasFinder,CRT	no	DEDDh	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	Unclear	GTTTCCATTCCTCATAGGTATGTTCTAAAC,GTTTAGAACATACCTATGAGGAATGGAAAC,GTTTAGAACATACCTATGAGGAATGGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	11,11,11	11	Orphan	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	NA,NA	NA|191aa|up_9|NZ_CP007223.1_611680_612253_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|665aa|up_8|NZ_CP007223.1_612310_614305_+	cd11335, AmyAc_MTase_N, Alpha amylase catalytic domain found in maltosyltransferase	NA|844aa|up_7|NZ_CP007223.1_614320_616852_+	TIGR02104, pulA_typeI, pullulanase, type I	NA|120aa|up_6|NZ_CP007223.1_616895_617255_+	cd17554, REC_TrrA-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator TrrA and similar domains	NA|909aa|up_5|NZ_CP007223.1_617339_620066_+	PRK05743, ileS, isoleucyl-tRNA synthetase; Reviewed	NA|167aa|up_4|NZ_CP007223.1_620103_620604_+	TIGR00725, TIGR00725, TIGR00725 family protein	NA|290aa|up_3|NZ_CP007223.1_620600_621470_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|336aa|up_2|NZ_CP007223.1_621456_622464_+	PRK13930, PRK13930, rod shape-determining protein MreB; Provisional	NA|234aa|up_1|NZ_CP007223.1_622473_623175_+	COG4786, FlgG, Flagellar basal body rod protein [Cell motility and secretion]	NA|262aa|up_0|NZ_CP007223.1_623189_623975_+	PRK12693, flgG, flagellar basal body rod protein FlgG; Provisional	NA|242aa|down_0|NZ_CP007223.1_625560_626286_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|217aa|down_1|NZ_CP007223.1_626282_626933_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|246aa|down_2|NZ_CP007223.1_626945_627683_-	cd13624, PBP2_Arg_Lys_His, Substrate binding domain of the arginine-, lysine-, histidine-binding protein ArtJ; the type 2 periplasmic binding protein fold	NA|602aa|down_3|NZ_CP007223.1_627796_629602_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|334aa|down_4|NZ_CP007223.1_629570_630572_-	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|61aa|down_5|NZ_CP007223.1_630574_630757_-	PRK12286, rpmF, 50S ribosomal protein L32; Reviewed	NA|183aa|down_6|NZ_CP007223.1_630775_631324_-	pfam02620, DUF177, Uncharacterized ACR, COG1399	NA|320aa|down_7|NZ_CP007223.1_631372_632332_+	cd00254, LT-like, lytic transglycosylase(LT)-like domain	NA|535aa|down_8|NZ_CP007223.1_632343_633948_+	PRK06007, fliF, flagellar basal body M-ring protein FliF	NA|337aa|down_9|NZ_CP007223.1_633952_634963_+	PRK05686, fliG, flagellar motor switch protein G; Validated
GCF_001941365.1_ASM194136v1	NZ_CP007223	Thermosipho sp. 1063, complete genome	4	909396-911235	5,4,4	PILER-CR,CRISPRCasFinder,CRT	no	cas3HD	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	Unclear	GTTTCCATTCCTCATAGGTATGTTCTAAAC,GTTTCCATTCCTCATAGGTATGTTCTAAAC,GTTTCCATTCCTCATAGGTATGTTCTAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	27,27,27	27	Unclear	cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,cas2,cas1,cas3,WYL,cas10d,csc2gr7,csc1gr5,DEDDh,cmr6gr7,cmr5gr11,cmr4gr7,csm6,cmr3gr5,cmr1gr7,csa3,cas3HD,PrimPol	NA,NA	NA|516aa|up_9|NZ_CP007223.1_897501_899049_+	COG1031, COG1031, Uncharacterized Fe-S oxidoreductase [Energy production and conversion]	NA|302aa|up_8|NZ_CP007223.1_899095_900001_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	cas3HD|410aa|up_7|NZ_CP007223.1_900004_901234_+	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms]	NA|156aa|up_6|NZ_CP007223.1_901246_901714_+	pfam12850, Metallophos_2, Calcineurin-like phosphoesterase superfamily domain	NA|456aa|up_5|NZ_CP007223.1_901716_903084_+	cd11316, AmyAc_bac2_AmyA, Alpha amylase catalytic domain found in bacterial Alpha-amylases (also called 1,4-alpha-D-glucan-4-glucanohydrolase)	NA|553aa|up_4|NZ_CP007223.1_903083_904742_+	PRK00095, mutL, DNA mismatch repair endonuclease MutL	NA|272aa|up_3|NZ_CP007223.1_904665_905481_-	pfam12850, Metallophos_2, Calcineurin-like phosphoesterase superfamily domain	NA|295aa|up_2|NZ_CP007223.1_905529_906414_-	cd18614, GH130, Glycosyl hydrolase family 130; uncharacterized	NA|325aa|up_1|NZ_CP007223.1_906429_907404_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|418aa|up_0|NZ_CP007223.1_907616_908870_-	PRK12391, PRK12391, TrpB-like pyridoxal phosphate-dependent enzyme	NA|203aa|down_0|NZ_CP007223.1_912318_912927_+	PRK13197, PRK13197, pyrrolidone-carboxylate peptidase; Provisional	NA|274aa|down_1|NZ_CP007223.1_912901_913723_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|155aa|down_2|NZ_CP007223.1_913727_914192_-	cd03018, PRX_AhpE_like, Peroxiredoxin (PRX) family, AhpE-like subfamily; composed of proteins similar to Mycobacterium tuberculosis AhpE	NA|125aa|down_3|NZ_CP007223.1_914293_914668_+	COG1832, COG1832, Predicted CoA-binding protein [General function prediction only]	NA|572aa|down_4|NZ_CP007223.1_914672_916388_+	cd05799, PGM2, This CD includes PGM2 (phosphoglucomutase 2) and PGM2L1 (phosphoglucomutase 2-like 1)	NA|223aa|down_5|NZ_CP007223.1_916429_917098_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|398aa|down_6|NZ_CP007223.1_917158_918352_+	COG2461, COG2461, Uncharacterized conserved protein [Function unknown]	NA|553aa|down_7|NZ_CP007223.1_918364_920023_+	PRK05290, PRK05290, hybrid cluster protein; Provisional	NA|438aa|down_8|NZ_CP007223.1_920247_921561_+	cd14748, PBP2_UgpB, The periplasmic-binding component of ABC transport system specific for sn-glycerol-3-phosphate; possesses type 2 periplasmic binding fold	NA|289aa|down_9|NZ_CP007223.1_921602_922469_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]
