assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001399775.1_ASM139977v1	NZ_CP010822	Thermus aquaticus Y51MC23, complete genome	1	81689-82168	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,csx1	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5	Unclear	GTCGCAATCCCCTTACGGGGCTAAGTGGTTTGCAAC,GTCGCAATCCCCTTACGGGGCTAAGTGGTTTGCAAC,GTCGCAATCCCCTTACGGGGCTAAGTGGTTTGCAAC	36,36,36	0	0	NA	NA	III-A:III-A:III-A	6,6,6	6	Unclear	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5,DinG	NA,NA	NA|282aa|up_9|NZ_CP010822.1_73139_73985_+	TIGR02144, lysine_biosynthesis_enzyme, Lysine biosynthesis enzyme LysX	NA|345aa|up_8|NZ_CP010822.1_73977_75012_+	TIGR01850, N-acetyl-gamma-glutamyl-phosphate_reductase, N-acetyl-gamma-glutamyl-phosphate reductase, common form	NA|270aa|up_7|NZ_CP010822.1_75029_75839_+	PRK14058, PRK14058, [LysW]-aminoadipate/[LysW]-glutamate kinase	NA|532aa|up_6|NZ_CP010822.1_75843_77439_+	cd10816, GH57N_BE_TK1436_like, N-terminal catalytic domain of Gh57 branching enzyme TK 1436 and similar proteins	NA|292aa|up_5|NZ_CP010822.1_77443_78319_+	cd07586, nitrilase_8, Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases)	NA|282aa|up_4|NZ_CP010822.1_78315_79161_+	PRK13980, PRK13980, NAD synthetase; Provisional	NA|141aa|up_3|NZ_CP010822.1_79157_79580_+	COG1832, COG1832, Predicted CoA-binding protein [General function prediction only]	NA|334aa|up_2|NZ_CP010822.1_79581_80583_+	COG1194, MutY, A/G-specific DNA glycosylase [DNA replication, recombination, and repair]	NA|233aa|up_1|NZ_CP010822.1_80475_81174_-	COG0445, GidA, Flavin-dependent tRNA uridine 5-carboxymethylaminomethyl modification enzyme GidA    [Cell cycle control, cell division, chromosome partitioning]	cas2|91aa|up_0|NZ_CP010822.1_81387_81660_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|317aa|down_0|NZ_CP010822.1_82526_83477_-	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	csx1|402aa|down_1|NZ_CP010822.1_83476_84682_-	TIGR02710, conserved_hypothetical_protein, CRISPR-associated protein, TIGR02710 family	NA|205aa|down_2|NZ_CP010822.1_84713_85328_+	pfam00877, NLPC_P60, NlpC/P60 family	NA|307aa|down_3|NZ_CP010822.1_85324_86245_-	cd02146, NfsA-like, nitroreductase similar to Escherichia coli NfsA	NA|168aa|down_4|NZ_CP010822.1_86261_86765_-	pfam11159, DUF2939, Protein of unknown function (DUF2939)	NA|356aa|down_5|NZ_CP010822.1_86845_87913_-	pfam06750, DiS_P_DiS, Bacterial Peptidase A24 N-terminal domain	NA|512aa|down_6|NZ_CP010822.1_87971_89507_+	cd06460, M32_Taq, Peptidase family M32, which includes thermostable carboxypeptidases TaqCP, PfuCP and FisCP	NA|99aa|down_7|NZ_CP010822.1_89618_89915_-	cd11531, NTP-PPase_BsYpjD, Nucleoside Triphosphate Pyrophosphohydrolase (EC 3	NA|344aa|down_8|NZ_CP010822.1_89944_90976_-	cd05656, M42_Frv, M42 Peptidase, endoglucanases	NA|127aa|down_9|NZ_CP010822.1_90998_91379_+	PRK05234, mgsA, methylglyoxal synthase; Validated
GCF_001399775.1_ASM139977v1	NZ_CP010822	Thermus aquaticus Y51MC23, complete genome	2	147410-147507	2	CRISPRCasFinder	no		cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5	Orphan	ATCATCTGACTACCTGACTACCG	23	0	0	NA	NA	NA	1	1	Orphan	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5,DinG	NA|123aa|up_7|NZ_CP010822.1_142764_143133_-,NA|101aa|up_3|NZ_CP010822.1_144732_145035_+,NA|192aa|up_1|NZ_CP010822.1_145411_145987_+,NA|226aa|down_1|NZ_CP010822.1_149183_149861_+,NA|56aa|down_3|NZ_CP010822.1_150990_151158_+,NA|78aa|down_7|NZ_CP010822.1_153600_153834_+	NA|76aa|up_9|NZ_CP010822.1_141212_141440_+	PRK06870, secG, preprotein translocase subunit SecG; Reviewed	NA|378aa|up_8|NZ_CP010822.1_141629_142763_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|123aa|up_7|NZ_CP010822.1_142764_143133_-	NA	NA|221aa|up_6|NZ_CP010822.1_143532_144195_-	cd06529, S24_LexA-like, Peptidase S24 LexA-like proteins are involved in the SOS response leading to the repair of single-stranded DNA within the bacterial cell	NA|81aa|up_5|NZ_CP010822.1_144249_144492_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|78aa|up_4|NZ_CP010822.1_144484_144718_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|101aa|up_3|NZ_CP010822.1_144732_145035_+	NA	NA|127aa|up_2|NZ_CP010822.1_145027_145408_+	pfam00684, DnaJ_CXXCXGXG, DnaJ central domain	NA|192aa|up_1|NZ_CP010822.1_145411_145987_+	NA	NA|209aa|up_0|NZ_CP010822.1_145995_146622_+	smart00942, PriCT_1, Primase C terminal 1 (PriCT-1)	NA|543aa|down_0|NZ_CP010822.1_147558_149187_+	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|226aa|down_1|NZ_CP010822.1_149183_149861_+	NA	NA|371aa|down_2|NZ_CP010822.1_149874_150987_+	PRK14945, PRK14945, DNA polymerase III subunit beta; Provisional	NA|56aa|down_3|NZ_CP010822.1_150990_151158_+	NA	NA|119aa|down_4|NZ_CP010822.1_151343_151700_+	PRK09786, PRK09786, endodeoxyribonuclease RUS; Reviewed	NA|107aa|down_5|NZ_CP010822.1_151725_152046_+	COG3604, FhlA, Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains [Transcription / Signal transduction mechanisms]	NA|427aa|down_6|NZ_CP010822.1_152042_153323_+	PHA02533, 17, large terminase protein; Provisional	NA|78aa|down_7|NZ_CP010822.1_153600_153834_+	NA	NA|494aa|down_8|NZ_CP010822.1_153809_155291_+	pfam06074, DUF935, Protein of unknown function (DUF935)	NA|481aa|down_9|NZ_CP010822.1_155271_156714_+	pfam04233, Phage_Mu_F, Phage Mu protein F like protein
GCF_001399775.1_ASM139977v1	NZ_CP010822	Thermus aquaticus Y51MC23, complete genome	3	550253-550403	3	CRISPRCasFinder	no		cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5	Orphan	GGGCCATCCCCACGTGTGTGGGGACT	26	0	0	NA	NA	I-E,II-B	2	2	Orphan	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5,DinG	NA|238aa|up_0|NZ_CP010822.1_549537_550251_+,NA|147aa|down_2|NZ_CP010822.1_552527_552968_-	NA|216aa|up_9|NZ_CP010822.1_539559_540207_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|209aa|up_8|NZ_CP010822.1_540255_540882_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|236aa|up_7|NZ_CP010822.1_540896_541604_+	cd17765, PNP_ThPNP_like, purine nucleoside phosphorylases similar to Thermus thermophiles PNP	NA|272aa|up_6|NZ_CP010822.1_541605_542421_+	PRK07658, PRK07658, enoyl-CoA hydratase; Provisional	NA|291aa|up_5|NZ_CP010822.1_542459_543332_+	COG3294, COG3294, HD supefamily hydrolase [General function prediction only]	NA|543aa|up_4|NZ_CP010822.1_543470_545099_-	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|318aa|up_3|NZ_CP010822.1_545543_546497_-	cd07725, TTHA1429-like_MBL-fold, uncharacterized Thermus thermophilus TTHA1429 and related proteins; MBL-fold metallo hydrolase domain	NA|298aa|up_2|NZ_CP010822.1_546493_547387_-	cd05800, PGM_like2, This PGM-like (phosphoglucomutase-like) protein of unknown function belongs to the alpha-D-phosphohexomutase superfamily and is found in both archaea and bacteria	NA|702aa|up_1|NZ_CP010822.1_547421_549527_+	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|238aa|up_0|NZ_CP010822.1_549537_550251_+	NA	NA|141aa|down_0|NZ_CP010822.1_550934_551357_-	PRK03113, PRK03113, putative disulfide oxidoreductase; Provisional	NA|305aa|down_1|NZ_CP010822.1_551613_552528_-	PRK10416, PRK10416, signal recognition particle-docking protein FtsY; Provisional	NA|147aa|down_2|NZ_CP010822.1_552527_552968_-	NA	NA|469aa|down_3|NZ_CP010822.1_553000_554407_-	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|243aa|down_4|NZ_CP010822.1_554460_555189_+	cd03218, ABC_YhbG, ATP-binding cassette component of YhbG transport system	NA|463aa|down_5|NZ_CP010822.1_555189_556578_+	COG4372, COG4372, Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]	NA|273aa|down_6|NZ_CP010822.1_556524_557343_-	PRK09328, PRK09328, N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional	NA|196aa|down_7|NZ_CP010822.1_557305_557893_-	COG1847, Jag, Predicted RNA-binding protein [General function prediction only]	NA|431aa|down_8|NZ_CP010822.1_557902_559195_-	TIGR03592, yidC_oxa1_cterm, membrane protein insertase, YidC/Oxa1 family, C-terminal domain	NA|83aa|down_9|NZ_CP010822.1_559191_559440_-	pfam01809, Haemolytic, Haemolytic domain
GCF_001399775.1_ASM139977v1	NZ_CP010822	Thermus aquaticus Y51MC23, complete genome	4	550599-550813	2,4,2	CRT,CRISPRCasFinder,PILER-CR	no		cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5	Orphan	CGGGCCATCCCCACGTGCGTGGGGACTAC,GGGCCATCCCCACGTGTGTGGGGACT,ACGGGCCATCCCCACGTGCGTGGGGACTACGG	29,26,32	0	0	NA	NA	I-B,III-A,III-B:I-E,II-B:I-B,III-A,III-B	3,3,2	3	Orphan	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5,DinG	NA|238aa|up_0|NZ_CP010822.1_549537_550251_+,NA|147aa|down_2|NZ_CP010822.1_552527_552968_-	NA|216aa|up_9|NZ_CP010822.1_539559_540207_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|209aa|up_8|NZ_CP010822.1_540255_540882_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|236aa|up_7|NZ_CP010822.1_540896_541604_+	cd17765, PNP_ThPNP_like, purine nucleoside phosphorylases similar to Thermus thermophiles PNP	NA|272aa|up_6|NZ_CP010822.1_541605_542421_+	PRK07658, PRK07658, enoyl-CoA hydratase; Provisional	NA|291aa|up_5|NZ_CP010822.1_542459_543332_+	COG3294, COG3294, HD supefamily hydrolase [General function prediction only]	NA|543aa|up_4|NZ_CP010822.1_543470_545099_-	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|318aa|up_3|NZ_CP010822.1_545543_546497_-	cd07725, TTHA1429-like_MBL-fold, uncharacterized Thermus thermophilus TTHA1429 and related proteins; MBL-fold metallo hydrolase domain	NA|298aa|up_2|NZ_CP010822.1_546493_547387_-	cd05800, PGM_like2, This PGM-like (phosphoglucomutase-like) protein of unknown function belongs to the alpha-D-phosphohexomutase superfamily and is found in both archaea and bacteria	NA|702aa|up_1|NZ_CP010822.1_547421_549527_+	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|238aa|up_0|NZ_CP010822.1_549537_550251_+	NA	NA|141aa|down_0|NZ_CP010822.1_550934_551357_-	PRK03113, PRK03113, putative disulfide oxidoreductase; Provisional	NA|305aa|down_1|NZ_CP010822.1_551613_552528_-	PRK10416, PRK10416, signal recognition particle-docking protein FtsY; Provisional	NA|147aa|down_2|NZ_CP010822.1_552527_552968_-	NA	NA|469aa|down_3|NZ_CP010822.1_553000_554407_-	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|243aa|down_4|NZ_CP010822.1_554460_555189_+	cd03218, ABC_YhbG, ATP-binding cassette component of YhbG transport system	NA|463aa|down_5|NZ_CP010822.1_555189_556578_+	COG4372, COG4372, Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]	NA|273aa|down_6|NZ_CP010822.1_556524_557343_-	PRK09328, PRK09328, N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional	NA|196aa|down_7|NZ_CP010822.1_557305_557893_-	COG1847, Jag, Predicted RNA-binding protein [General function prediction only]	NA|431aa|down_8|NZ_CP010822.1_557902_559195_-	TIGR03592, yidC_oxa1_cterm, membrane protein insertase, YidC/Oxa1 family, C-terminal domain	NA|83aa|down_9|NZ_CP010822.1_559191_559440_-	pfam01809, Haemolytic, Haemolytic domain
GCF_001399775.1_ASM139977v1	NZ_CP010822	Thermus aquaticus Y51MC23, complete genome	5	869567-871070	3,5,3	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas5,cas7,cas8b1,cas6,cas3,cas4,cas1	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5	Type I-B	CTTTTGACCGTACCTATGAGGGTTTGAAAC,CTTTTGACCGTACCTATGAGGGTTTGAAAC,CTTTTGACCGTACCTATGAGGGTTTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	22,22,22	22	TypeI-B	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5,DinG	NA,NA	NA|314aa|up_9|NZ_CP010822.1_857948_858890_-	PRK00236, xerC, site-specific tyrosine recombinase XerC; Reviewed	NA|141aa|up_8|NZ_CP010822.1_859046_859469_-	cd09881, PIN_VapC4-5_FitB-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4 and VapC5, and Neisseria gonorrhoeae FitB and related proteins	cas2|87aa|up_7|NZ_CP010822.1_861103_861364_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas5|242aa|up_6|NZ_CP010822.1_861367_862093_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7|319aa|up_5|NZ_CP010822.1_862089_863046_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8b1|603aa|up_4|NZ_CP010822.1_863038_864847_-	cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas6|243aa|up_3|NZ_CP010822.1_864888_865617_-	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	cas3|750aa|up_2|NZ_CP010822.1_865657_867907_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas4|174aa|up_1|NZ_CP010822.1_867882_868404_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|328aa|up_0|NZ_CP010822.1_868396_869380_+	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	NA|195aa|down_0|NZ_CP010822.1_871241_871826_+	pfam17932, TetR_C_24, Tetracyclin repressor-like, C-terminal domain	NA|324aa|down_1|NZ_CP010822.1_871812_872784_+	PRK13778, paaA, phenylacetate-CoA oxygenase subunit PaaA; Provisional	NA|174aa|down_2|NZ_CP010822.1_872793_873315_+	pfam06243, PaaB, Phenylacetic acid degradation B	NA|253aa|down_3|NZ_CP010822.1_873307_874066_+	TIGR02158, 12-phenylacetyl-CoA_epoxidase_subunit_C, phenylacetate-CoA oxygenase, PaaI subunit	NA|171aa|down_4|NZ_CP010822.1_874002_874515_+	TIGR02159, Putative_12-phenylacetyl-CoA_epoxidase_subunit_D, phenylacetate-CoA oxygenase, PaaJ subunit	NA|664aa|down_5|NZ_CP010822.1_874523_876515_+	PRK11563, PRK11563, bifunctional aldehyde dehydrogenase/enoyl-CoA hydratase; Provisional	NA|173aa|down_6|NZ_CP010822.1_876486_877005_-	pfam14595, Thioredoxin_9, Thioredoxin	NA|154aa|down_7|NZ_CP010822.1_877020_877482_-	COG5496, COG5496, Predicted thioesterase [General function prediction only]	NA|446aa|down_8|NZ_CP010822.1_877485_878823_-	cd05913, PaaK, Phenylacetate-CoA ligase (also known as PaaK)	NA|123aa|down_9|NZ_CP010822.1_878819_879188_-	TIGR02286, Acyl-coenzyme_A_thioesterase_PaaI, phenylacetic acid degradation protein PaaD
GCF_001399775.1_ASM139977v1	NZ_CP010822	Thermus aquaticus Y51MC23, complete genome	6	1009153-1009266	6	CRISPRCasFinder	no		cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5	Orphan	GTCGCAATCCCCTTACGGGGCTAAGTGGTTTGCAAC	36	0	0	NA	NA	III-A	1	1	Orphan	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5,DinG	NA,NA|80aa|down_1|NZ_CP010822.1_1011231_1011471_-	NA|165aa|up_9|NZ_CP010822.1_999847_1000342_-	PRK13829, rimM, 16S rRNA-processing protein RimM; Provisional	NA|73aa|up_8|NZ_CP010822.1_1000341_1000560_-	pfam13083, KH_4, KH domain	NA|88aa|up_7|NZ_CP010822.1_1000617_1000881_-	PRK00040, rpsP, 30S ribosomal protein S16; Reviewed	NA|431aa|up_6|NZ_CP010822.1_1000888_1002181_-	PRK10867, PRK10867, signal recognition particle protein; Provisional	NA|449aa|up_5|NZ_CP010822.1_1002389_1003736_+	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|132aa|up_4|NZ_CP010822.1_1003738_1004134_+	COG0221, Ppa, Inorganic pyrophosphatase [Energy production and conversion]	NA|346aa|up_3|NZ_CP010822.1_1004100_1005138_-	PRK00147, queA, S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional	NA|180aa|up_2|NZ_CP010822.1_1005141_1005681_-	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|858aa|up_1|NZ_CP010822.1_1005754_1008328_-	PRK00009, PRK00009, phosphoenolpyruvate carboxylase; Reviewed	NA|252aa|up_0|NZ_CP010822.1_1008324_1009080_-	cd07382, MPP_DR1281, Deinococcus radiodurans DR1281 and related proteins, metallophosphatase domain	NA|318aa|down_0|NZ_CP010822.1_1009502_1010456_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|80aa|down_1|NZ_CP010822.1_1011231_1011471_-	NA	NA|222aa|down_2|NZ_CP010822.1_1011494_1012160_+	COG1381, RecO, Recombinational DNA repair protein (RecF pathway) [DNA replication, recombination, and repair]	NA|157aa|down_3|NZ_CP010822.1_1012146_1012617_-	TIGR01462, Transcription_elongation_factor_GreA, transcription elongation factor GreA	NA|249aa|down_4|NZ_CP010822.1_1012673_1013420_-	pfam01063, Aminotran_4, Amino-transferase class IV	NA|617aa|down_5|NZ_CP010822.1_1013416_1015267_-	COG0147, TrpE, Anthranilate/para-aminobenzoate synthases component I [Amino acid transport and metabolism / Coenzyme metabolism]	NA|174aa|down_6|NZ_CP010822.1_1015289_1015811_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|1186aa|down_7|NZ_CP010822.1_1015891_1019449_-	TIGR02082, Methionine_synthase, 5-methyltetrahydrofolate--homocysteine methyltransferase	NA|508aa|down_8|NZ_CP010822.1_1019491_1021015_+	PRK09243, PRK09243, nicotinate phosphoribosyltransferase; Validated	NA|397aa|down_9|NZ_CP010822.1_1021094_1022285_-	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX
GCF_001399775.1_ASM139977v1	NZ_CP010822	Thermus aquaticus Y51MC23, complete genome	7	1010468-1010734	7,4	CRISPRCasFinder,PILER-CR	no		cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5	Orphan	GTCGCAATCCCCTTACGGGGCTAAGTGGTTTGCAAC,AGTCGCAATCCCCTTACGGGGCTAAGTGGTTTGCAAC	36,37	0	0	NA	NA	III-A:III-A	3,2	3	Orphan	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5,DinG	NA|64aa|up_1|NZ_CP010822.1_1009194_1009386_+,NA|80aa|down_0|NZ_CP010822.1_1011231_1011471_-	NA|88aa|up_9|NZ_CP010822.1_1000617_1000881_-	PRK00040, rpsP, 30S ribosomal protein S16; Reviewed	NA|431aa|up_8|NZ_CP010822.1_1000888_1002181_-	PRK10867, PRK10867, signal recognition particle protein; Provisional	NA|449aa|up_7|NZ_CP010822.1_1002389_1003736_+	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|132aa|up_6|NZ_CP010822.1_1003738_1004134_+	COG0221, Ppa, Inorganic pyrophosphatase [Energy production and conversion]	NA|346aa|up_5|NZ_CP010822.1_1004100_1005138_-	PRK00147, queA, S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional	NA|180aa|up_4|NZ_CP010822.1_1005141_1005681_-	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|858aa|up_3|NZ_CP010822.1_1005754_1008328_-	PRK00009, PRK00009, phosphoenolpyruvate carboxylase; Reviewed	NA|252aa|up_2|NZ_CP010822.1_1008324_1009080_-	cd07382, MPP_DR1281, Deinococcus radiodurans DR1281 and related proteins, metallophosphatase domain	NA|64aa|up_1|NZ_CP010822.1_1009194_1009386_+	NA	NA|318aa|up_0|NZ_CP010822.1_1009502_1010456_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|80aa|down_0|NZ_CP010822.1_1011231_1011471_-	NA	NA|222aa|down_1|NZ_CP010822.1_1011494_1012160_+	COG1381, RecO, Recombinational DNA repair protein (RecF pathway) [DNA replication, recombination, and repair]	NA|157aa|down_2|NZ_CP010822.1_1012146_1012617_-	TIGR01462, Transcription_elongation_factor_GreA, transcription elongation factor GreA	NA|249aa|down_3|NZ_CP010822.1_1012673_1013420_-	pfam01063, Aminotran_4, Amino-transferase class IV	NA|617aa|down_4|NZ_CP010822.1_1013416_1015267_-	COG0147, TrpE, Anthranilate/para-aminobenzoate synthases component I [Amino acid transport and metabolism / Coenzyme metabolism]	NA|174aa|down_5|NZ_CP010822.1_1015289_1015811_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|1186aa|down_6|NZ_CP010822.1_1015891_1019449_-	TIGR02082, Methionine_synthase, 5-methyltetrahydrofolate--homocysteine methyltransferase	NA|508aa|down_7|NZ_CP010822.1_1019491_1021015_+	PRK09243, PRK09243, nicotinate phosphoribosyltransferase; Validated	NA|397aa|down_8|NZ_CP010822.1_1021094_1022285_-	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX	NA|195aa|down_9|NZ_CP010822.1_1022271_1022856_-	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed
GCF_001399775.1_ASM139977v1	NZ_CP010822	Thermus aquaticus Y51MC23, complete genome	8	1232917-1233872	8,4,5	CRISPRCasFinder,CRT,PILER-CR	no	cas3,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx1,cas6	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5	Type III-D,Type III-B,Type III-A,Type III-C	GTCGCAATCCCCTTACGGGGCTAAGTGG,GTCGCAATCCCCTTACGGGGCTAAGTGG,GTCGCAATCCCCTTACGGGGCTAAGTGGCTTGCAAC	28,28,36	0	0	NA	NA	III-A:III-A:III-A	12,12,11	12	TypeIII-D,TypeIII-B,TypeIII-A,TypeIII-C	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5,DinG	NA|251aa|up_1|NZ_CP010822.1_1231085_1231838_+,NA|95aa|down_5|NZ_CP010822.1_1240099_1240384_+,NA|317aa|down_6|NZ_CP010822.1_1240346_1241297_+,NA|238aa|down_7|NZ_CP010822.1_1241293_1242007_+	NA|379aa|up_9|NZ_CP010822.1_1220998_1222135_+	PRK00696, sucC, ADP-forming succinate--CoA ligase subunit beta	NA|289aa|up_8|NZ_CP010822.1_1222131_1222998_+	PRK05678, PRK05678, succinyl-CoA synthetase subunit alpha; Validated	NA|501aa|up_7|NZ_CP010822.1_1223046_1224549_-	PRK14508, PRK14508, 4-alpha-glucanotransferase; Provisional	NA|291aa|up_6|NZ_CP010822.1_1224601_1225474_-	PRK05808, PRK05808, 3-hydroxybutyryl-CoA dehydrogenase; Validated	NA|286aa|up_5|NZ_CP010822.1_1225489_1226347_-	TIGR00950, Uncharacterized_inner_membrane_transporter_YicL, Carboxylate/Amino Acid/Amine Transporter	NA|407aa|up_4|NZ_CP010822.1_1226336_1227557_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|404aa|up_3|NZ_CP010822.1_1227549_1228761_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	cas3|768aa|up_2|NZ_CP010822.1_1228776_1231080_+	PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional	NA|251aa|up_1|NZ_CP010822.1_1231085_1231838_+	NA	NA|330aa|up_0|NZ_CP010822.1_1231821_1232811_-	pfam03283, PAE, Pectinacetylesterase	cas10|806aa|down_0|NZ_CP010822.1_1234381_1236799_+	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csm2gr11|135aa|down_1|NZ_CP010822.1_1236799_1237204_+	TIGR01870, CRISPR_type_III-associated_protein_Csm2, CRISPR type III-A/MTUBE-associated protein Csm2	csm3gr7|244aa|down_2|NZ_CP010822.1_1237212_1237944_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|289aa|down_3|NZ_CP010822.1_1237945_1238812_+	cd09663, Csm4_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm4	csm5gr7|376aa|down_4|NZ_CP010822.1_1238808_1239936_+	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	NA|95aa|down_5|NZ_CP010822.1_1240099_1240384_+	NA	NA|317aa|down_6|NZ_CP010822.1_1240346_1241297_+	NA	NA|238aa|down_7|NZ_CP010822.1_1241293_1242007_+	NA	csx1|462aa|down_8|NZ_CP010822.1_1242553_1243939_+	pfam09670, Cas_Cas02710, CRISPR-associated protein (Cas_Cas02710)	cas6|248aa|down_9|NZ_CP010822.1_1243938_1244682_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6
GCF_001399775.1_ASM139977v1	NZ_CP010822	Thermus aquaticus Y51MC23, complete genome	9	1731966-1732521	9,5,6	CRISPRCasFinder,CRT,PILER-CR	no	csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5,cas10	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5	Type III-D,Type III-C,Type III-A,Type III-B, Type III-C?	GTCGCAATCCCCTTACGGGGCTAAGTGGTTTGCAAC,GTCGCAATCCCCTTACGGGGCTAAGTGGTTTGCAAC,GTCGCAATCCCCTTACGGGGCTAAGTGGTTTGCAAC	36,36,36	0	0	NA	NA	III-A:III-A:III-A	7,7,6	7	TypeIII-D,TypeIII-C,TypeIII-B,TypeIII-A,TypeIII-C?	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5,DinG	NA,NA	NA|514aa|up_9|NZ_CP010822.1_1719837_1721379_-	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|405aa|up_8|NZ_CP010822.1_1721452_1722667_-	cd00887, MoeA, MoeA family	NA|378aa|up_7|NZ_CP010822.1_1722672_1723806_-	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|221aa|up_6|NZ_CP010822.1_1723903_1724566_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|124aa|up_5|NZ_CP010822.1_1724580_1724952_+	cd06554, ASCH_ASC-1_like, ASC-1 homology domain, ASC-1-like subfamily	NA|577aa|up_4|NZ_CP010822.1_1724948_1726679_-	COG2401, COG2401, ABC-type ATPase fused to a predicted acetyltransferase domain [General function prediction only]	NA|478aa|up_3|NZ_CP010822.1_1727544_1728978_-	PRK08661, PRK08661, prolyl-tRNA synthetase; Provisional	NA|407aa|up_2|NZ_CP010822.1_1729010_1730231_+	PRK04135, PRK04135, 2,3-bisphosphoglycerate-independent phosphoglycerate mutase	NA|184aa|up_1|NZ_CP010822.1_1730243_1730795_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|326aa|up_0|NZ_CP010822.1_1730772_1731750_-	COG0618, COG0618, Exopolyphosphatase-related proteins [General function prediction only]	csx1|419aa|down_0|NZ_CP010822.1_1732995_1734252_+	pfam09455, Cas_DxTHG, CRISPR-associated (Cas) DxTHG family	cmr6gr7|350aa|down_1|NZ_CP010822.1_1734253_1735303_-	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	cmr5gr11|115aa|down_2|NZ_CP010822.1_1735319_1735664_-	pfam09701, Cas_Cmr5, CRISPR-associated protein (Cas_Cmr5)	cmr4gr7|287aa|down_3|NZ_CP010822.1_1735660_1736521_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr1gr7|399aa|down_4|NZ_CP010822.1_1736534_1737731_-	COG1367, COG1367, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cmr3gr5|369aa|down_5|NZ_CP010822.1_1737727_1738834_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cas10|585aa|down_6|NZ_CP010822.1_1738823_1740578_-	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	NA|627aa|down_7|NZ_CP010822.1_1740629_1742510_-	pfam09002, DUF1887, Domain of unknown function (DUF1887)	NA|80aa|down_8|NZ_CP010822.1_1744689_1744929_-	pfam13591, MerR_2, MerR HTH family regulatory protein	NA|277aa|down_9|NZ_CP010822.1_1744915_1745746_-	PRK14299, PRK14299, chaperone protein DnaJ; Provisional
GCF_001399775.1_ASM139977v1	NZ_CP010822	Thermus aquaticus Y51MC23, complete genome	10	1742839-1743245	10,6,7	CRISPRCasFinder,CRT,PILER-CR	no	csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5,cas10	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5	Type III-D,Type III-C,Type III-A,Type III-B, Type III-C?	GTCGCAATCCCCTTACGGGGCTAAGTGGTTTGCAAC,GTCGCAATCCCCTTACGGGGCTAAGTGGTTTGCAAC,GTCGCAATCCCCTTACGGGGCTAAGTGGTTTGCAAC	36,36,36	0	0	NA	NA	III-A:III-A:III-A	5,5,4	5	TypeIII-D,TypeIII-C,TypeIII-B,TypeIII-A,TypeIII-C?	cas2,cas1,csx1,Cas9_archaeal,Cas14b_CAS-V-F,c2c9_V-U4,csa3,cas5,cas7,cas8b1,cas6,cas3,cas4,DEDDh,RT,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cmr6gr7,cmr5gr11,cmr4gr7,cmr1gr7,cmr3gr5,DinG	NA,NA	NA|184aa|up_9|NZ_CP010822.1_1730243_1730795_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|326aa|up_8|NZ_CP010822.1_1730772_1731750_-	COG0618, COG0618, Exopolyphosphatase-related proteins [General function prediction only]	csx1|419aa|up_7|NZ_CP010822.1_1732995_1734252_+	pfam09455, Cas_DxTHG, CRISPR-associated (Cas) DxTHG family	cmr6gr7|350aa|up_6|NZ_CP010822.1_1734253_1735303_-	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	cmr5gr11|115aa|up_5|NZ_CP010822.1_1735319_1735664_-	pfam09701, Cas_Cmr5, CRISPR-associated protein (Cas_Cmr5)	cmr4gr7|287aa|up_4|NZ_CP010822.1_1735660_1736521_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr1gr7|399aa|up_3|NZ_CP010822.1_1736534_1737731_-	COG1367, COG1367, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cmr3gr5|369aa|up_2|NZ_CP010822.1_1737727_1738834_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cas10|585aa|up_1|NZ_CP010822.1_1738823_1740578_-	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	NA|627aa|up_0|NZ_CP010822.1_1740629_1742510_-	pfam09002, DUF1887, Domain of unknown function (DUF1887)	NA|80aa|down_0|NZ_CP010822.1_1744689_1744929_-	pfam13591, MerR_2, MerR HTH family regulatory protein	NA|277aa|down_1|NZ_CP010822.1_1744915_1745746_-	PRK14299, PRK14299, chaperone protein DnaJ; Provisional	NA|180aa|down_2|NZ_CP010822.1_1745749_1746289_-	pfam01025, GrpE, GrpE	NA|617aa|down_3|NZ_CP010822.1_1746347_1748198_-	PRK00290, dnaK, molecular chaperone DnaK; Provisional	NA|625aa|down_4|NZ_CP010822.1_1748323_1750198_+	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|98aa|down_5|NZ_CP010822.1_1750241_1750535_-	cd13831, HU, histone-like DNA-binding protein HU	NA|210aa|down_6|NZ_CP010822.1_1750657_1751287_-	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like	NA|227aa|down_7|NZ_CP010822.1_1751292_1751973_-	cd06559, Endonuclease_V, Endonuclease_V, a DNA repair enzyme that initiates repair of nitrosative deaminated purine bases	NA|291aa|down_8|NZ_CP010822.1_1751973_1752846_-	pfam18353, PG_isomerase_N, Phosphoglucose isomerase N-terminal domain	NA|262aa|down_9|NZ_CP010822.1_1752865_1753651_+	cd07010, cupin_PMI_type_I_N_bac, Phosphomannose isomerase in bacteria and archaea, N-terminal cupin domain
