assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001273775.1_ASM127377v1	NZ_CP011801	Nitrospira moscoviensis strain NSP M-1 chromosome, complete genome	1	1417623-1420297	1,1,1,2,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	WYL,cas6,cas3,cas8b3,cas7,cas5,cas1,cas2	cas3,WYL,cas6,cas8b3,cas7,cas5,cas1,cas2,DEDDh,csa3,csx1	Unclear	GTGATGACCTATGAGATGCCGAAAGGCGTTGAGCAC,GTGATGACCTATGAGATGCCGAAAGGCGTTGAGCAC,GTGATGACCTATGAGATGCCGAAAGGCGTTGAGCAC,TGGGTGATGACCTATGAGATGCCGAAAGGCGTTGAGCACATCT,GTGATGACCTATGAGATGCCGAAAGGCGTTGAGCAC	36,36,36,43,36	0	0	NA	NA	I-A,I-B,II-B:I-A,I-B,II-B:I-A,I-B,II-B:I-A,I-B,II-B:I-A,I-B,II-B	33,37,37,33,33	37	Unclear	cas3,WYL,cas6,cas8b3,cas7,cas5,cas1,cas2,DEDDh,csa3,csx1	NA,NA|62aa|down_2|NZ_CP011801.1_1423654_1423840_-,NA|347aa|down_8|NZ_CP011801.1_1429505_1430546_-	NA|231aa|up_9|NZ_CP011801.1_1407492_1408185_+	cd00198, vWFA, Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|119aa|up_8|NZ_CP011801.1_1408200_1408557_+	PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2); Provisional	NA|190aa|up_7|NZ_CP011801.1_1408592_1409162_+	pfam10543, ORF6N, ORF6N domain	cas6|198aa|up_6|NZ_CP011801.1_1409420_1410014_+	pfam09559, Cas6, Cas6 Crispr	cas3|741aa|up_5|NZ_CP011801.1_1410034_1412257_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas8b3|524aa|up_4|NZ_CP011801.1_1412244_1413816_+	TIGR03485, hypothetical_protein_L8106_30105, CRISPR-associated protein Cas8a1/Csx13, MYXAN subtype	cas7|315aa|up_3|NZ_CP011801.1_1413812_1414757_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|213aa|up_2|NZ_CP011801.1_1414768_1415407_+	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas1|557aa|up_1|NZ_CP011801.1_1415385_1417056_+	TIGR03983, hypothetical_protein_LA3181, CRISPR-associated endonuclease Cas1, subtype MYXAN	cas2|98aa|up_0|NZ_CP011801.1_1417059_1417353_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|513aa|down_0|NZ_CP011801.1_1420575_1422114_+	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|255aa|down_1|NZ_CP011801.1_1422113_1422878_+	pfam01695, IstB_IS21, IstB-like ATP binding protein	NA|62aa|down_2|NZ_CP011801.1_1423654_1423840_-	NA	NA|160aa|down_3|NZ_CP011801.1_1423845_1424325_-	COG3153, COG3153, Predicted acetyltransferase [General function prediction only]	NA|331aa|down_4|NZ_CP011801.1_1424414_1425407_-	PRK08224, ligC, ATP-dependent DNA ligase; Reviewed	NA|321aa|down_5|NZ_CP011801.1_1425348_1426311_-	cd04862, PaeLigD_Pol_like, PaeLigD_Pol_like: Polymerase (Pol) domain of bacterial LigD proteins similar to Pseudomonas aeruginosa (Pae) LigD	NA|446aa|down_6|NZ_CP011801.1_1426385_1427723_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|513aa|down_7|NZ_CP011801.1_1427844_1429383_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|347aa|down_8|NZ_CP011801.1_1429505_1430546_-	NA	NA|390aa|down_9|NZ_CP011801.1_1430722_1431892_+	pfam12727, PBP_like, PBP superfamily domain
GCF_001273775.1_ASM127377v1	NZ_CP011801	Nitrospira moscoviensis strain NSP M-1 chromosome, complete genome	2	1422980-1423371	4,2,2	PILER-CR,CRISPRCasFinder,CRT	no	WYL,cas6,cas3,cas8b3,cas7,cas5,cas1,cas2	cas3,WYL,cas6,cas8b3,cas7,cas5,cas1,cas2,DEDDh,csa3,csx1	Unclear	GTGATGACCTATGAGATGCCGAAAGGCGTTGAGCAC,GTGATGACCTATGAGATGCCGAAAGGCGTTGAGCAC,GTGATGACCTATGAGATGCCGAAAGGCGTTGAGCAC	36,36,36	0	0	NA	NA	I-A,I-B,II-B:I-A,I-B,II-B:I-A,I-B,II-B	5,5,5	5	Unclear	cas3,WYL,cas6,cas8b3,cas7,cas5,cas1,cas2,DEDDh,csa3,csx1	NA,NA|62aa|down_0|NZ_CP011801.1_1423654_1423840_-,NA|347aa|down_6|NZ_CP011801.1_1429505_1430546_-,NA|221aa|down_9|NZ_CP011801.1_1432781_1433444_+	NA|190aa|up_9|NZ_CP011801.1_1408592_1409162_+	pfam10543, ORF6N, ORF6N domain	cas6|198aa|up_8|NZ_CP011801.1_1409420_1410014_+	pfam09559, Cas6, Cas6 Crispr	cas3|741aa|up_7|NZ_CP011801.1_1410034_1412257_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas8b3|524aa|up_6|NZ_CP011801.1_1412244_1413816_+	TIGR03485, hypothetical_protein_L8106_30105, CRISPR-associated protein Cas8a1/Csx13, MYXAN subtype	cas7|315aa|up_5|NZ_CP011801.1_1413812_1414757_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|213aa|up_4|NZ_CP011801.1_1414768_1415407_+	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas1|557aa|up_3|NZ_CP011801.1_1415385_1417056_+	TIGR03983, hypothetical_protein_LA3181, CRISPR-associated endonuclease Cas1, subtype MYXAN	cas2|98aa|up_2|NZ_CP011801.1_1417059_1417353_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|513aa|up_1|NZ_CP011801.1_1420575_1422114_+	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|255aa|up_0|NZ_CP011801.1_1422113_1422878_+	pfam01695, IstB_IS21, IstB-like ATP binding protein	NA|62aa|down_0|NZ_CP011801.1_1423654_1423840_-	NA	NA|160aa|down_1|NZ_CP011801.1_1423845_1424325_-	COG3153, COG3153, Predicted acetyltransferase [General function prediction only]	NA|331aa|down_2|NZ_CP011801.1_1424414_1425407_-	PRK08224, ligC, ATP-dependent DNA ligase; Reviewed	NA|321aa|down_3|NZ_CP011801.1_1425348_1426311_-	cd04862, PaeLigD_Pol_like, PaeLigD_Pol_like: Polymerase (Pol) domain of bacterial LigD proteins similar to Pseudomonas aeruginosa (Pae) LigD	NA|446aa|down_4|NZ_CP011801.1_1426385_1427723_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|513aa|down_5|NZ_CP011801.1_1427844_1429383_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|347aa|down_6|NZ_CP011801.1_1429505_1430546_-	NA	NA|390aa|down_7|NZ_CP011801.1_1430722_1431892_+	pfam12727, PBP_like, PBP superfamily domain	NA|274aa|down_8|NZ_CP011801.1_1431976_1432798_+	COG0725, ModA, ABC-type molybdate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|221aa|down_9|NZ_CP011801.1_1432781_1433444_+	NA
GCF_001273775.1_ASM127377v1	NZ_CP011801	Nitrospira moscoviensis strain NSP M-1 chromosome, complete genome	3	2411168-2411270	3	CRISPRCasFinder	no		cas3,WYL,cas6,cas8b3,cas7,cas5,cas1,cas2,DEDDh,csa3,csx1	Orphan	CGGACCCGCAGCCGATTCCGCCGGC	25	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,cas6,cas8b3,cas7,cas5,cas1,cas2,DEDDh,csa3,csx1	NA|116aa|up_9|NZ_CP011801.1_2401555_2401903_-,NA|84aa|up_7|NZ_CP011801.1_2403068_2403320_-,NA|55aa|up_4|NZ_CP011801.1_2405455_2405620_+,NA|324aa|up_0|NZ_CP011801.1_2409999_2410971_+,NA|361aa|down_7|NZ_CP011801.1_2416637_2417720_-,NA|230aa|down_9|NZ_CP011801.1_2418743_2419433_-	NA|116aa|up_9|NZ_CP011801.1_2401555_2401903_-	NA	NA|252aa|up_8|NZ_CP011801.1_2402205_2402961_+	cd07331, M48C_Oma1_like, Peptidase M48C, integral membrane endopeptidase	NA|84aa|up_7|NZ_CP011801.1_2403068_2403320_-	NA	NA|220aa|up_6|NZ_CP011801.1_2403580_2404240_+	COG1750, COG1750, Archaeal serine proteases [General function prediction only]	NA|395aa|up_5|NZ_CP011801.1_2404254_2405439_+	pfam13489, Methyltransf_23, Methyltransferase domain	NA|55aa|up_4|NZ_CP011801.1_2405455_2405620_+	NA	NA|532aa|up_3|NZ_CP011801.1_2406050_2407646_+	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|452aa|up_2|NZ_CP011801.1_2407646_2409002_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|144aa|up_1|NZ_CP011801.1_2409345_2409777_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|324aa|up_0|NZ_CP011801.1_2409999_2410971_+	NA	NA|93aa|down_0|NZ_CP011801.1_2411334_2411613_-	pfam03413, PepSY, Peptidase propeptide and YPEB domain	NA|573aa|down_1|NZ_CP011801.1_2411708_2413427_-	COG0561, Cof, Predicted hydrolases of the HAD superfamily [General function prediction only]	NA|191aa|down_2|NZ_CP011801.1_2413634_2414207_-	pfam03843, Slp, Outer membrane lipoprotein Slp family	NA|86aa|down_3|NZ_CP011801.1_2414223_2414481_-	pfam04972, BON, BON domain	NA|168aa|down_4|NZ_CP011801.1_2414616_2415120_-	PRK02710, PRK02710, plastocyanin; Provisional	NA|146aa|down_5|NZ_CP011801.1_2415154_2415592_-	pfam04972, BON, BON domain	NA|208aa|down_6|NZ_CP011801.1_2415962_2416586_-	pfam16785, SMBP, Small metal-binding protein	NA|361aa|down_7|NZ_CP011801.1_2416637_2417720_-	NA	NA|82aa|down_8|NZ_CP011801.1_2417737_2417983_-	pfam04972, BON, BON domain	NA|230aa|down_9|NZ_CP011801.1_2418743_2419433_-	NA
GCF_001273775.1_ASM127377v1	NZ_CP011801	Nitrospira moscoviensis strain NSP M-1 chromosome, complete genome	4	2669610-2669736	4	CRISPRCasFinder	no	cas3	cas3,WYL,cas6,cas8b3,cas7,cas5,cas1,cas2,DEDDh,csa3,csx1	Unclear	TGGTTGTCGAACCTACCCCGCTACCCCCTACCCC	34	0	0	NA	NA	NA	1	1	Unclear	cas3,WYL,cas6,cas8b3,cas7,cas5,cas1,cas2,DEDDh,csa3,csx1	NA|199aa|up_9|NZ_CP011801.1_2659251_2659848_+,NA|94aa|up_3|NZ_CP011801.1_2666665_2666947_-,NA|393aa|down_3|NZ_CP011801.1_2673616_2674795_-	NA|199aa|up_9|NZ_CP011801.1_2659251_2659848_+	NA	NA|182aa|up_8|NZ_CP011801.1_2659963_2660509_+	pfam13628, DUF4142, Domain of unknown function (DUF4142)	NA|329aa|up_7|NZ_CP011801.1_2660599_2661586_-	COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane]	NA|743aa|up_6|NZ_CP011801.1_2661684_2663913_-	cd05254, dTDP_HR_like_SDR_e, dTDP-6-deoxy-L-lyxo-4-hexulose reductase and related proteins, extended (e) SDRs	NA|387aa|up_5|NZ_CP011801.1_2663872_2665033_-	COG0562, Glf, UDP-galactopyranose mutase [Cell envelope biogenesis, outer membrane]	NA|398aa|up_4|NZ_CP011801.1_2665037_2666231_-	cd04950, GT4_TuaH-like, teichuronic acid biosynthesis glycosyltransferase TuaH and similar proteins	NA|94aa|up_3|NZ_CP011801.1_2666665_2666947_-	NA	NA|194aa|up_2|NZ_CP011801.1_2667019_2667601_-	pfam14417, MEDS, MEDS: MEthanogen/methylotroph, DcmR Sensory domain	NA|182aa|up_1|NZ_CP011801.1_2667624_2668170_-	pfam14417, MEDS, MEDS: MEthanogen/methylotroph, DcmR Sensory domain	NA|381aa|up_0|NZ_CP011801.1_2668379_2669522_-	COG0153, GalK, Galactokinase [Carbohydrate transport and metabolism]	NA|374aa|down_0|NZ_CP011801.1_2669739_2670861_-	cd00608, GalT, Galactose-1-phosphate uridyl transferase (GalT): This enzyme plays a key role in galactose metabolism by catalysing the transfer of a uridine 5'-phosphoryl group from UDP-galactose 1-phosphate	NA|612aa|down_1|NZ_CP011801.1_2670881_2672717_-	PRK10150, PRK10150, beta-D-glucuronidase; Provisional	NA|242aa|down_2|NZ_CP011801.1_2672743_2673469_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|393aa|down_3|NZ_CP011801.1_2673616_2674795_-	NA	cas3|507aa|down_4|NZ_CP011801.1_2674935_2676456_-	COG0514, RecQ, Superfamily II DNA helicase [DNA replication, recombination, and repair]	NA|412aa|down_5|NZ_CP011801.1_2676794_2678030_+	cd08283, FDH_like_1, Glutathione-dependent formaldehyde dehydrogenase related proteins, child 1	NA|386aa|down_6|NZ_CP011801.1_2678064_2679222_-	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|267aa|down_7|NZ_CP011801.1_2679685_2680486_+	pfam07885, Ion_trans_2, Ion channel	NA|132aa|down_8|NZ_CP011801.1_2680810_2681206_+	pfam04982, HPP, HPP family	NA|959aa|down_9|NZ_CP011801.1_2681298_2684175_+	cd02754, MopB_Nitrate-R-NapA-like, Nitrate reductases, NapA (Nitrate-R-NapA), NasA, and NarB catalyze the reduction of nitrate to nitrite
