assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_007165645.1_ASM716564v1	NZ_AP019793	Thermus thermophilus strain AA2-20 plasmid pAA220, complete sequence	1	29250-30936	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	Cas14u_CAS-V,WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	Cas14u_CAS-V,WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,c2c9_V-U4	Type I-E	CGGTCCATCCCCACGTGCGTGGGGACTAC,CGGTCCATCCCCACGTGCGTGGGGACTAC,CGGTCCATCCCCACGTGCGTGGGGACTAC	29,29,29	0	0	NA	NA	I-E,II-B:I-E,II-B:I-E,II-B	27,27,27	27	TypeI-E	csa3,cas2,DEDDh,cas3,Cas9_archaeal,Cas14u_CAS-V,WYL,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,c2c9_V-U4	NA|222aa|up_1|NZ_AP019793.1_27246_27912_-,NA	NA|134aa|up_9|NZ_AP019793.1_18453_18855_+	cd18682, PIN_VapC-like, Uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|283aa|up_8|NZ_AP019793.1_19037_19886_-	cd05233, SDR_c, classical (c) SDRs	NA|163aa|up_7|NZ_AP019793.1_19896_20385_-	cd00667, ring_hydroxylating_dioxygenases_beta, Ring hydroxylating dioxygenase beta subunit	NA|442aa|up_6|NZ_AP019793.1_20391_21717_-	TIGR03229, benzo_1_2_benA, benzoate 1,2-dioxygenase, large subunit	NA|367aa|up_5|NZ_AP019793.1_21781_22882_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|411aa|up_4|NZ_AP019793.1_23745_24978_-	cd14748, PBP2_UgpB, The periplasmic-binding component of ABC transport system specific for sn-glycerol-3-phosphate; possesses type 2 periplasmic binding fold	NA|327aa|up_3|NZ_AP019793.1_25045_26026_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|386aa|up_2|NZ_AP019793.1_26096_27254_-	cd01465, vWA_subgroup, VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|222aa|up_1|NZ_AP019793.1_27246_27912_-	NA	NA|315aa|up_0|NZ_AP019793.1_28118_29063_-	pfam01555, N6_N4_Mtase, DNA methylase	WYL|330aa|down_0|NZ_AP019793.1_31123_32113_+	pfam13280, WYL, WYL domain	cas3|920aa|down_1|NZ_AP019793.1_32109_34869_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|494aa|down_2|NZ_AP019793.1_34918_36400_+	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1	cse2gr11|164aa|down_3|NZ_AP019793.1_36396_36888_+	PRK13921, PRK13921, CRISPR-associated Cse2 family protein; Provisional	cas7|372aa|down_4|NZ_AP019793.1_36891_38007_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|225aa|down_5|NZ_AP019793.1_38008_38683_+	pfam09704, Cas_Cas5d, CRISPR-associated protein (Cas_Cas5)	cas6e|212aa|down_6|NZ_AP019793.1_38669_39305_+	cd09664, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas1|326aa|down_7|NZ_AP019793.1_39314_40292_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|126aa|down_8|NZ_AP019793.1_40245_40623_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|251aa|down_9|NZ_AP019793.1_41699_42452_-	pfam03746, LamB_YcsF, LamB/YcsF family
GCF_007165645.1_ASM716564v1	NZ_AP019793	Thermus thermophilus strain AA2-20 plasmid pAA220, complete sequence	2	40684-41696	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no	WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	Cas14u_CAS-V,WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,c2c9_V-U4	Type I-E	GTAGTCCCCACGCGTGTGGGGATGGACCG,GTAGTCCCCACGCGTGTGGGGATGGACCG,GTAGTCCCCACGCGTGTGGGGATGGACCG	29,29,29	0	0	NA	NA	I-E,II-B:I-E,II-B:I-E,II-B	13,16,16	16	TypeI-E	csa3,cas2,DEDDh,cas3,Cas9_archaeal,Cas14u_CAS-V,WYL,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,c2c9_V-U4	NA,NA	NA|315aa|up_9|NZ_AP019793.1_28118_29063_-	pfam01555, N6_N4_Mtase, DNA methylase	WYL|330aa|up_8|NZ_AP019793.1_31123_32113_+	pfam13280, WYL, WYL domain	cas3|920aa|up_7|NZ_AP019793.1_32109_34869_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|494aa|up_6|NZ_AP019793.1_34918_36400_+	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1	cse2gr11|164aa|up_5|NZ_AP019793.1_36396_36888_+	PRK13921, PRK13921, CRISPR-associated Cse2 family protein; Provisional	cas7|372aa|up_4|NZ_AP019793.1_36891_38007_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|225aa|up_3|NZ_AP019793.1_38008_38683_+	pfam09704, Cas_Cas5d, CRISPR-associated protein (Cas_Cas5)	cas6e|212aa|up_2|NZ_AP019793.1_38669_39305_+	cd09664, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas1|326aa|up_1|NZ_AP019793.1_39314_40292_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|126aa|up_0|NZ_AP019793.1_40245_40623_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|251aa|down_0|NZ_AP019793.1_41699_42452_-	pfam03746, LamB_YcsF, LamB/YcsF family	NA|229aa|down_1|NZ_AP019793.1_42464_43151_-	cd07729, AHL_lactonase_MBL-fold, quorum-quenching N-acyl-homoserine lactonase, MBL-fold metallo-hydrolase domain	NA|434aa|down_2|NZ_AP019793.1_43153_44455_-	TIGR00786, TRAP_transporter_permease_protein_SiaT, TRAP transporter, DctM subunit	NA|152aa|down_3|NZ_AP019793.1_44451_44907_-	pfam04290, DctQ, Tripartite ATP-independent periplasmic transporters, DctQ component	NA|319aa|down_4|NZ_AP019793.1_44906_45863_-	cd13602, PBP2_TRAP_BpDctp6_7, Substrate-binding domain of a pyroglutamic acid binding DctP subfamily of the tripartite ATP-independent periplasmic transporters; contains the type 2 periplasmic binding protein fold	NA|259aa|down_5|NZ_AP019793.1_46881_47658_-	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]	NA|258aa|down_6|NZ_AP019793.1_47644_48418_-	COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|270aa|down_7|NZ_AP019793.1_48716_49526_-	COG1878, COG1878, Kynurenine formamidase [Amino acid transport and metabolism]	NA|278aa|down_8|NZ_AP019793.1_49624_50458_-	PRK00724, PRK00724, formate dehydrogenase accessory sulfurtransferase FdhD	NA|762aa|down_9|NZ_AP019793.1_50454_52740_-	cd02767, MopB_ydeP, The MopB_ydeP CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative molybdopterin cofactor binding site
GCF_007165645.1_ASM716564v1	NZ_AP019793	Thermus thermophilus strain AA2-20 plasmid pAA220, complete sequence	3	59957-60297	3,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas2	Cas14u_CAS-V,WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,c2c9_V-U4	Unclear	GTTGCAAGGGATTGAGCCCCGTAAGGGGATTGCGAC,GTTGCAAGGGATTGAGCCCCGTAAGGGGATTGCGAC,GTTGCAAGGGATTGAGCCCCGTAAGGGGATTGCGAC	36,36,36	0	0	NA	NA	III-A:III-A:III-A	3,4,4	4	Unclear	csa3,cas2,DEDDh,cas3,Cas9_archaeal,Cas14u_CAS-V,WYL,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,c2c9_V-U4	NA|60aa|up_0|NZ_AP019793.1_59725_59905_-,NA|67aa|down_7|NZ_AP019793.1_68577_68778_+	NA|762aa|up_9|NZ_AP019793.1_50454_52740_-	cd02767, MopB_ydeP, The MopB_ydeP CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative molybdopterin cofactor binding site	NA|444aa|up_8|NZ_AP019793.1_53611_54943_-	cd05379, CAP_bacterial, Bacterial CAP (cysteine-rich secretory proteins, antigen 5, and pathogenesis-related 1 proteins) domain proteins	NA|407aa|up_7|NZ_AP019793.1_55064_56285_+	pfam00872, Transposase_mut, Transposase, Mutator family	NA|132aa|up_6|NZ_AP019793.1_56329_56725_-	cd18683, PIN_VapC-like, Uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|87aa|up_5|NZ_AP019793.1_56721_56982_-	TIGR01439, Uncharacterized_protein_Mb2626, looped-hinge helix DNA binding domain, AbrB family	NA|135aa|up_4|NZ_AP019793.1_57472_57877_-	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|77aa|up_3|NZ_AP019793.1_57873_58104_-	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|81aa|up_2|NZ_AP019793.1_58435_58678_+	TIGR01439, Uncharacterized_protein_Mb2626, looped-hinge helix DNA binding domain, AbrB family	NA|152aa|up_1|NZ_AP019793.1_58658_59114_+	pfam13470, PIN_3, PIN domain	NA|60aa|up_0|NZ_AP019793.1_59725_59905_-	NA	NA|424aa|down_0|NZ_AP019793.1_61059_62331_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|220aa|down_1|NZ_AP019793.1_62327_62987_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|151aa|down_2|NZ_AP019793.1_62987_63440_-	pfam00034, Cytochrom_C, Cytochrome c	NA|308aa|down_3|NZ_AP019793.1_63546_64470_+	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|375aa|down_4|NZ_AP019793.1_64479_65604_+	cd10917, CE4_NodB_like_6s_7s, Catalytic NodB homology domain of rhizobial NodB-like proteins	NA|619aa|down_5|NZ_AP019793.1_65678_67535_+	pfam13520, AA_permease_2, Amino acid permease	NA|309aa|down_6|NZ_AP019793.1_67531_68458_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|67aa|down_7|NZ_AP019793.1_68577_68778_+	NA	NA|274aa|down_8|NZ_AP019793.1_70031_70853_+	pfam01972, SDH_sah, Serine dehydrogenase proteinase	NA|461aa|down_9|NZ_AP019793.1_70947_72330_+	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated
GCF_007165645.1_ASM716564v1	NZ_AP019792	Thermus thermophilus strain AA2-20	1	1195947-1196024	1	CRISPRCasFinder	no		csa3,cas2,DEDDh,cas3,Cas9_archaeal	Orphan	ATCCAAAGCCTGCGGCAGGAGATG	24	0	0	NA	NA	NA	1	1	Orphan	csa3,cas2,DEDDh,cas3,Cas9_archaeal,Cas14u_CAS-V,WYL,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,c2c9_V-U4	NA,NA|54aa|down_4|NZ_AP019792.1_1200154_1200316_+	NA|338aa|up_9|NZ_AP019792.1_1186463_1187477_-	PRK05479, PRK05479, ketol-acid reductoisomerase; Provisional	NA|171aa|up_8|NZ_AP019792.1_1187473_1187986_-	PRK11895, ilvH, acetolactate synthase 3 regulatory subunit; Reviewed	NA|563aa|up_7|NZ_AP019792.1_1187982_1189671_-	TIGR00118, Probable_acetolactate_synthase_large_subunit, acetolactate synthase, large subunit, biosynthetic type	NA|327aa|up_6|NZ_AP019792.1_1189826_1190807_-	TIGR04018, thioredoxin_reductase, putative bacillithiol system oxidoreductase, YpdA family	NA|157aa|up_5|NZ_AP019792.1_1190972_1191443_+	pfam12019, GspH, Type II transport protein GspH	NA|158aa|up_4|NZ_AP019792.1_1191498_1191972_+	pfam12019, GspH, Type II transport protein GspH	NA|194aa|up_3|NZ_AP019792.1_1191968_1192550_+	COG2165, PulG, Type II secretory pathway, pseudopilin PulG [Cell motility and secretion / Intracellular trafficking and secretion]	NA|234aa|up_2|NZ_AP019792.1_1192546_1193248_+	COG4795, PulJ, Type II secretory pathway, component PulJ [Intracellular trafficking and secretion]	NA|559aa|up_1|NZ_AP019792.1_1193258_1194935_+	pfam14341, PilX_N, PilX N-terminal	NA|117aa|up_0|NZ_AP019792.1_1195155_1195506_+	COG2165, PulG, Type II secretory pathway, pseudopilin PulG [Cell motility and secretion / Intracellular trafficking and secretion]	NA|181aa|down_0|NZ_AP019792.1_1196210_1196753_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|258aa|down_1|NZ_AP019792.1_1196774_1197548_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|367aa|down_2|NZ_AP019792.1_1197625_1198726_+	pfam13546, DDE_5, DDE superfamily endonuclease	NA|407aa|down_3|NZ_AP019792.1_1198785_1200006_+	pfam00872, Transposase_mut, Transposase, Mutator family	NA|54aa|down_4|NZ_AP019792.1_1200154_1200316_+	NA	NA|186aa|down_5|NZ_AP019792.1_1200414_1200972_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|219aa|down_6|NZ_AP019792.1_1200990_1201647_-	COG1354, scpA, Rec8/ScpA/Scc1-like protein (kleisin family) [Replication, recombination, and repair]	NA|338aa|down_7|NZ_AP019792.1_1201643_1202657_-	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	NA|473aa|down_8|NZ_AP019792.1_1202981_1204400_+	PRK05478, PRK05478, 3-isopropylmalate dehydratase large subunit	NA|202aa|down_9|NZ_AP019792.1_1204413_1205019_+	PRK01641, leuD, 3-isopropylmalate dehydratase small subunit
GCF_007165645.1_ASM716564v1	NZ_AP019792	Thermus thermophilus strain AA2-20	2	1754932-1755022	2	CRISPRCasFinder	no		csa3,cas2,DEDDh,cas3,Cas9_archaeal	Orphan	TCCTAAAGGGGGGTAAAGGGGGG	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas2,DEDDh,cas3,Cas9_archaeal,Cas14u_CAS-V,WYL,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,c2c9_V-U4	NA|94aa|up_4|NZ_AP019792.1_1750095_1750377_+,NA|164aa|up_3|NZ_AP019792.1_1750491_1750983_+,NA|50aa|down_1|NZ_AP019792.1_1755949_1756099_-	NA|76aa|up_9|NZ_AP019792.1_1747433_1747661_-	PRK06870, secG, preprotein translocase subunit SecG; Reviewed	NA|396aa|up_8|NZ_AP019792.1_1747880_1749068_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|127aa|up_7|NZ_AP019792.1_1749079_1749460_-	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|85aa|up_6|NZ_AP019792.1_1749563_1749818_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|94aa|up_5|NZ_AP019792.1_1749804_1750086_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|94aa|up_4|NZ_AP019792.1_1750095_1750377_+	NA	NA|164aa|up_3|NZ_AP019792.1_1750491_1750983_+	NA	NA|407aa|up_2|NZ_AP019792.1_1751098_1752319_-	pfam00872, Transposase_mut, Transposase, Mutator family	NA|289aa|up_1|NZ_AP019792.1_1752335_1753202_+	pfam13362, Toprim_3, Toprim domain	NA|494aa|up_0|NZ_AP019792.1_1753188_1754670_+	pfam13148, DUF3987, Protein of unknown function (DUF3987)	NA|149aa|down_0|NZ_AP019792.1_1755506_1755953_-	cd09874, PIN_MT3492-like, VapC-like PIN domain of the hypothetical protein MT3492 of Mycobacterium tuberculosis CDC1551 and other uncharacterized, annotated PilT protein domain proteins	NA|50aa|down_1|NZ_AP019792.1_1755949_1756099_-	NA	NA|623aa|down_2|NZ_AP019792.1_1756407_1758276_+	pfam07728, AAA_5, AAA domain (dynein-related subfamily)	NA|959aa|down_3|NZ_AP019792.1_1758586_1761463_+	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|488aa|down_4|NZ_AP019792.1_1761597_1763061_+	pfam09992, NAGPA, Phosphodiester glycosidase	NA|307aa|down_5|NZ_AP019792.1_1763077_1763998_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|110aa|down_6|NZ_AP019792.1_1763975_1764305_-	cd00562, NifX_NifB, This CD represents a family of iron-molybdenum cluster-binding proteins that includes NifB, NifX, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme	NA|158aa|down_7|NZ_AP019792.1_1764333_1764807_-	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|200aa|down_8|NZ_AP019792.1_1764816_1765416_-	COG2860, COG2860, Predicted membrane protein [Function unknown]	NA|153aa|down_9|NZ_AP019792.1_1765408_1765867_-	pfam02542, YgbB, YgbB family
