assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002005165.1_ASM200516v1	NZ_CP019699	Novibacillus thermophilus strain SG-1 chromosome, complete genome	1	1589808-1589975	1,1	CRISPRCasFinder,PILER-CR	no		cas3,cas4,RT,csa3,cas6,cas8b1,cas7b,cas5,cas1,cas2,csm6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,WYL,DEDDh,csx1	Orphan	GTTTCAATTCCTTATAGTTAAGATAAAAAC,GTTTCAATTCCTTATAGTTAAGATAAAAACAC	30,32	0	0	NA	NA	NA:NA	2,2	2	Orphan	cas3,cas4,RT,csa3,cas6,cas8b1,cas7b,cas5,cas1,cas2,csm6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,WYL,DEDDh,csx1	NA|306aa|up_7|NZ_CP019699.1_1580983_1581901_+,NA|186aa|up_6|NZ_CP019699.1_1581969_1582527_+,NA|63aa|up_3|NZ_CP019699.1_1585273_1585462_+,NA|153aa|down_5|NZ_CP019699.1_1595546_1596005_-	NA|176aa|up_9|NZ_CP019699.1_1579906_1580434_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|74aa|up_8|NZ_CP019699.1_1580564_1580786_+	COG3462, COG3462, Predicted membrane protein [Function unknown]	NA|306aa|up_7|NZ_CP019699.1_1580983_1581901_+	NA	NA|186aa|up_6|NZ_CP019699.1_1581969_1582527_+	NA	NA|128aa|up_5|NZ_CP019699.1_1582771_1583155_-	cd13913, ba3_CcO_II_C, C-terminal cupredoxin domain of Ba3-like heme-copper oxidase subunit II	NA|528aa|up_4|NZ_CP019699.1_1583394_1584978_+	COG2132, SufI, Putative multicopper oxidases [Secondary metabolites biosynthesis, transport, and catabolism]	NA|63aa|up_3|NZ_CP019699.1_1585273_1585462_+	NA	NA|291aa|up_2|NZ_CP019699.1_1585625_1586498_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|89aa|up_1|NZ_CP019699.1_1587714_1587981_-	pfam14283, DUF4366, Domain of unknown function (DUF4366)	NA|409aa|up_0|NZ_CP019699.1_1588061_1589288_-	pfam00872, Transposase_mut, Transposase, Mutator family	NA|529aa|down_0|NZ_CP019699.1_1590312_1591899_+	COG2132, SufI, Putative multicopper oxidases [Secondary metabolites biosynthesis, transport, and catabolism]	NA|731aa|down_1|NZ_CP019699.1_1592111_1594304_+	cd02094, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to human copper-transporting ATPases, ATP7A and ATP7B	NA|99aa|down_2|NZ_CP019699.1_1594397_1594694_+	cd10148, CsoR-like_DUF156, Transcriptional regulators CsoR (copper-sensitive operon repressor), RcnR, and FrmR, and related domains; this domain superfamily was previously known as DUF156	NA|69aa|down_3|NZ_CP019699.1_1594724_1594931_+	COG2608, CopZ, Copper chaperone [Inorganic ion transport and metabolism]	NA|116aa|down_4|NZ_CP019699.1_1595116_1595464_-	cd08026, DUF326, Cysteine-rich 4 helical bundle widely conserved in bacteria	NA|153aa|down_5|NZ_CP019699.1_1595546_1596005_-	NA	NA|200aa|down_6|NZ_CP019699.1_1596435_1597035_+	pfam07563, DUF1541, Protein of unknown function (DUF1541)	NA|239aa|down_7|NZ_CP019699.1_1597075_1597792_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|473aa|down_8|NZ_CP019699.1_1597788_1599207_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|85aa|down_9|NZ_CP019699.1_1599456_1599711_+	TIGR01973, NADH-quinone_oxidoreductase_subunit_G, NADH-quinone oxidoreductase, chain G
GCF_002005165.1_ASM200516v1	NZ_CP019699	Novibacillus thermophilus strain SG-1 chromosome, complete genome	2	2026822-2028995	2,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas8b1,cas7b,cas5,cas3,cas4,cas1,cas2,csm6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7	cas3,cas4,RT,csa3,cas6,cas8b1,cas7b,cas5,cas1,cas2,csm6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,WYL,DEDDh,csx1	Type I-B,Type III-C,Type III-D,Type III-B,Type III-A	GTTTGTAGCTTACCTATAAGGAATGGAAAC,GTTTGTAGCTTACCTATAAGGAATGGAAAC,GTTTGTAGCTTACCTATAAGGAATGGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	29,32,32	32	TypeI-B,TypeIII-C,TypeIII-D,TypeIII-B,TypeIII-A	cas3,cas4,RT,csa3,cas6,cas8b1,cas7b,cas5,cas1,cas2,csm6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,WYL,DEDDh,csx1	NA|70aa|up_8|NZ_CP019699.1_2018673_2018883_-,NA	NA|393aa|up_9|NZ_CP019699.1_2016146_2017325_+	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|70aa|up_8|NZ_CP019699.1_2018673_2018883_-	NA	NA|139aa|up_7|NZ_CP019699.1_2018988_2019405_+	COG4768, COG4768, Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]	NA|123aa|up_6|NZ_CP019699.1_2019404_2019773_+	pfam12732, YtxH, YtxH-like protein	NA|495aa|up_5|NZ_CP019699.1_2019908_2021393_+	cd06460, M32_Taq, Peptidase family M32, which includes thermostable carboxypeptidases TaqCP, PfuCP and FisCP	NA|180aa|up_4|NZ_CP019699.1_2021632_2022172_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|248aa|up_3|NZ_CP019699.1_2022203_2022947_+	pfam13490, zf-HC2, Putative zinc-finger	NA|364aa|up_2|NZ_CP019699.1_2023026_2024118_+	PRK12595, PRK12595, bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase; Reviewed	NA|338aa|up_1|NZ_CP019699.1_2024350_2025364_+	TIGR01481, catabolite_control_protein_A, catabolite control protein A	NA|347aa|up_0|NZ_CP019699.1_2025465_2026506_-	TIGR03858, LLM_2I7G, probable oxidoreductase, LLM family	cas6|258aa|down_0|NZ_CP019699.1_2029210_2029984_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas8b1|700aa|down_1|NZ_CP019699.1_2029973_2032073_+	pfam09484, Cas_TM1802, CRISPR-associated protein TM1802 (cas_TM1802)	cas7b|321aa|down_2|NZ_CP019699.1_2032092_2033055_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas5|235aa|down_3|NZ_CP019699.1_2033079_2033784_+	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas3|807aa|down_4|NZ_CP019699.1_2033777_2036198_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas4|168aa|down_5|NZ_CP019699.1_2036238_2036742_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|331aa|down_6|NZ_CP019699.1_2036773_2037766_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|88aa|down_7|NZ_CP019699.1_2037821_2038085_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	csm6|408aa|down_8|NZ_CP019699.1_2041226_2042450_+	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	cmr1gr7|319aa|down_9|NZ_CP019699.1_2042475_2043432_+	COG1367, COG1367, CRISPR system related protein, RAMP superfamily [Defense mechanisms]
GCF_002005165.1_ASM200516v1	NZ_CP019699	Novibacillus thermophilus strain SG-1 chromosome, complete genome	3	2038294-2040862	3,3,2	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas8b1,cas7b,cas5,cas3,cas4,cas1,cas2,csm6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7	cas3,cas4,RT,csa3,cas6,cas8b1,cas7b,cas5,cas1,cas2,csm6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,WYL,DEDDh,csx1	Type I-B,Type III-C,Type III-D,Type III-B,Type III-A	GTTTGTAGCTTACCTATGAGGAATGGAAAC,GTTTGTAGCTTACCTATGAGGAATGGAAAC,GTTTGTAGCTTACCTANTGAGGAATGGAAAC	30,30,31	0	0	NA	NA	NA:NA:NA	37,37,38	38	TypeI-B,TypeIII-C,TypeIII-D,TypeIII-B,TypeIII-A	cas3,cas4,RT,csa3,cas6,cas8b1,cas7b,cas5,cas1,cas2,csm6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,WYL,DEDDh,csx1	NA,NA	NA|338aa|up_9|NZ_CP019699.1_2024350_2025364_+	TIGR01481, catabolite_control_protein_A, catabolite control protein A	NA|347aa|up_8|NZ_CP019699.1_2025465_2026506_-	TIGR03858, LLM_2I7G, probable oxidoreductase, LLM family	cas6|258aa|up_7|NZ_CP019699.1_2029210_2029984_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas8b1|700aa|up_6|NZ_CP019699.1_2029973_2032073_+	pfam09484, Cas_TM1802, CRISPR-associated protein TM1802 (cas_TM1802)	cas7b|321aa|up_5|NZ_CP019699.1_2032092_2033055_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas5|235aa|up_4|NZ_CP019699.1_2033079_2033784_+	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas3|807aa|up_3|NZ_CP019699.1_2033777_2036198_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas4|168aa|up_2|NZ_CP019699.1_2036238_2036742_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|331aa|up_1|NZ_CP019699.1_2036773_2037766_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|88aa|up_0|NZ_CP019699.1_2037821_2038085_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	csm6|408aa|down_0|NZ_CP019699.1_2041226_2042450_+	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	cmr1gr7|319aa|down_1|NZ_CP019699.1_2042475_2043432_+	COG1367, COG1367, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cas10|579aa|down_2|NZ_CP019699.1_2043428_2045165_+	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr3gr5|381aa|down_3|NZ_CP019699.1_2045136_2046279_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|302aa|down_4|NZ_CP019699.1_2046308_2047214_+	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr5gr11|133aa|down_5|NZ_CP019699.1_2047259_2047658_+	pfam09701, Cas_Cmr5, CRISPR-associated protein (Cas_Cmr5)	cmr6gr7|403aa|down_6|NZ_CP019699.1_2047665_2048874_+	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	NA|197aa|down_7|NZ_CP019699.1_2049137_2049728_+	pfam11611, DUF4352, Domain of unknown function (DUF4352)	NA|1151aa|down_8|NZ_CP019699.1_2049915_2053368_+	PRK12999, PRK12999, pyruvate carboxylase; Reviewed	NA|229aa|down_9|NZ_CP019699.1_2053745_2054432_+	PRK00507, PRK00507, deoxyribose-phosphate aldolase; Provisional
GCF_002005165.1_ASM200516v1	NZ_CP019699	Novibacillus thermophilus strain SG-1 chromosome, complete genome	4	3284703-3284828	4	CRISPRCasFinder	no	csa3	cas3,cas4,RT,csa3,cas6,cas8b1,cas7b,cas5,cas1,cas2,csm6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,WYL,DEDDh,csx1	Type I-A	CTCCGATGTCTGCTCGACGTTGTGGAAATGACCATAGAGGTTG	43	0	0	NA	NA	NA	1	1	Orphan	cas3,cas4,RT,csa3,cas6,cas8b1,cas7b,cas5,cas1,cas2,csm6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,WYL,DEDDh,csx1	NA|70aa|up_3|NZ_CP019699.1_3278971_3279181_+,NA	NA|178aa|up_9|NZ_CP019699.1_3272331_3272865_+	COG2353, COG2353, Uncharacterized conserved protein [Function unknown]	NA|358aa|up_8|NZ_CP019699.1_3273071_3274145_+	TIGR03858, LLM_2I7G, probable oxidoreductase, LLM family	NA|245aa|up_7|NZ_CP019699.1_3274141_3274876_+	COG0431, COG0431, Predicted flavoprotein [General function prediction only]	NA|140aa|up_6|NZ_CP019699.1_3276646_3277066_-	PRK13530, PRK13530, arsenate reductase (thioredoxin)	NA|432aa|up_5|NZ_CP019699.1_3277157_3278453_-	PRK15445, PRK15445, arsenical efflux pump membrane protein ArsB	csa3|115aa|up_4|NZ_CP019699.1_3278467_3278812_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|70aa|up_3|NZ_CP019699.1_3278971_3279181_+	NA	NA|158aa|up_2|NZ_CP019699.1_3279825_3280299_-	pfam01668, SmpB, SmpB protein	NA|761aa|up_1|NZ_CP019699.1_3280377_3282660_-	TIGR02063, Ribonuclease_R, ribonuclease R	NA|78aa|up_0|NZ_CP019699.1_3282785_3283019_-	PRK06870, secG, preprotein translocase subunit SecG; Reviewed	NA|514aa|down_0|NZ_CP019699.1_3286131_3287673_-	PRK05434, PRK05434, 2,3-bisphosphoglycerate-independent phosphoglycerate mutase	NA|252aa|down_1|NZ_CP019699.1_3287683_3288439_-	PRK00042, tpiA, triosephosphate isomerase; Provisional	NA|394aa|down_2|NZ_CP019699.1_3288698_3289880_-	PRK00073, pgk, phosphoglycerate kinase; Provisional	NA|335aa|down_3|NZ_CP019699.1_3290025_3291030_-	COG0057, GapA, Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase [Carbohydrate transport and metabolism]	NA|357aa|down_4|NZ_CP019699.1_3291007_3292078_-	COG2390, DeoR, Transcriptional regulator, contains sigma factor-related N-terminal domain [Transcription]	NA|444aa|down_5|NZ_CP019699.1_3292211_3293543_-	PRK05932, PRK05932, RNA polymerase factor sigma-54; Reviewed	NA|198aa|down_6|NZ_CP019699.1_3294003_3294597_+	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|89aa|down_7|NZ_CP019699.1_3294661_3294928_-	pfam00381, PTS-HPr, PTS HPr component phosphorylation site	NA|315aa|down_8|NZ_CP019699.1_3294981_3295926_-	TIGR00647, DNA_bind_WhiA, DNA-binding protein WhiA	NA|305aa|down_9|NZ_CP019699.1_3295969_3296884_-	TIGR01826, Putative_gluconeogenesis_factor, conserved hypothetical protein, cofD-related
