assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009258225.1_ASM925822v1	NZ_CP043617	Sulfurimonas sp. GYSZ_1 chromosome, complete genome	1	13990-14093	1	CRISPRCasFinder	no		DEDDh,RT,WYL,csa3,cas3,cas9,cas1,cas2	Orphan	AATAAGCAGTTATTATTTGCTTTAC	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,WYL,csa3,cas3,cas9,cas1,cas2	NA|113aa|up_2|NZ_CP043617.1_8735_9074_-,NA|82aa|down_0|NZ_CP043617.1_14573_14819_-	NA|415aa|up_9|NZ_CP043617.1_1567_2812_+	COG3706, PleD, Response regulator containing a CheY-like receiver domain and a GGDEF domain [Signal transduction mechanisms]	NA|165aa|up_8|NZ_CP043617.1_2821_3316_+	PRK00039, ruvC, Holliday junction resolvase; Reviewed	NA|580aa|up_7|NZ_CP043617.1_3312_5052_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|426aa|up_6|NZ_CP043617.1_5077_6355_-	PRK05614, gltA, citrate synthase	NA|141aa|up_5|NZ_CP043617.1_6416_6839_-	PRK10996, PRK10996, thioredoxin 2; Provisional	NA|460aa|up_4|NZ_CP043617.1_6838_8218_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|128aa|up_3|NZ_CP043617.1_8331_8715_-	PRK13282, PRK13282, flagellar assembly protein FliW; Provisional	NA|113aa|up_2|NZ_CP043617.1_8735_9074_-	NA	NA|789aa|up_1|NZ_CP043617.1_9075_11442_-	PRK08447, PRK08447, ribonucleoside-diphosphate reductase subunit alpha	NA|443aa|up_0|NZ_CP043617.1_11650_12979_-	PRK08470, PRK08470, adenylosuccinate lyase; Provisional	NA|82aa|down_0|NZ_CP043617.1_14573_14819_-	NA	NA|359aa|down_1|NZ_CP043617.1_14978_16055_-	PRK14462, PRK14462, 23S rRNA (adenine(2503)-C(2))-methyltransferase RlmN	NA|181aa|down_2|NZ_CP043617.1_16051_16594_-	pfam01048, PNP_UDP_1, Phosphorylase superfamily	NA|253aa|down_3|NZ_CP043617.1_16715_17474_-	PRK02083, PRK02083, imidazole glycerol phosphate synthase subunit HisF; Provisional	NA|155aa|down_4|NZ_CP043617.1_17655_18120_+	pfam13682, CZB, Chemoreceptor zinc-binding domain	NA|262aa|down_5|NZ_CP043617.1_18162_18948_+	PRK00274, ksgA, 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))-dimethyltransferase RsmA	NA|650aa|down_6|NZ_CP043617.1_18962_20912_+	COG0595, COG0595, mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair]	NA|195aa|down_7|NZ_CP043617.1_21002_21587_+	PRK09347, folE, GTP cyclohydrolase I; Provisional	NA|435aa|down_8|NZ_CP043617.1_21592_22897_+	PRK08472, fliI, flagellar protein export ATPase FliI	NA|706aa|down_9|NZ_CP043617.1_22900_25018_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment
GCF_009258225.1_ASM925822v1	NZ_CP043617	Sulfurimonas sp. GYSZ_1 chromosome, complete genome	2	553654-553739	2	CRISPRCasFinder	no		DEDDh,RT,WYL,csa3,cas3,cas9,cas1,cas2	Orphan	ATTATTCTCATAATAATTTATTT	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,WYL,csa3,cas3,cas9,cas1,cas2	NA|113aa|up_3|NZ_CP043617.1_551019_551358_+,NA|146aa|down_1|NZ_CP043617.1_554956_555394_+,NA|322aa|down_3|NZ_CP043617.1_556062_557028_+	NA|404aa|up_9|NZ_CP043617.1_546347_547559_+	TIGR04247, nitrous_oxide_maturation_protein_NosD, nitrous oxide reductase family maturation protein NosD	NA|247aa|up_8|NZ_CP043617.1_547561_548302_+	cd16373, DMSOR_beta_like, uncharacterized subfamily of DMSO Reductase beta subunit family	NA|204aa|up_7|NZ_CP043617.1_548276_548888_+	COG2863, COG2863, Cytochrome c553 [Energy production and conversion]	NA|160aa|up_6|NZ_CP043617.1_548889_549369_+	COG2863, COG2863, Cytochrome c553 [Energy production and conversion]	NA|302aa|up_5|NZ_CP043617.1_549378_550284_+	TIGR02163, Ferredoxin-type_protein_NapH_homolog, ferredoxin-type protein, NapH/MauN family	NA|213aa|up_4|NZ_CP043617.1_550286_550925_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|113aa|up_3|NZ_CP043617.1_551019_551358_+	NA	NA|276aa|up_2|NZ_CP043617.1_551357_552185_+	pfam12679, ABC2_membrane_2, ABC-2 family transporter protein	NA|157aa|up_1|NZ_CP043617.1_552193_552664_+	pfam05573, NosL, NosL	NA|175aa|up_0|NZ_CP043617.1_552665_553190_+	pfam08447, PAS_3, PAS fold	NA|146aa|down_0|NZ_CP043617.1_554415_554853_-	pfam05573, NosL, NosL	NA|146aa|down_1|NZ_CP043617.1_554956_555394_+	NA	NA|221aa|down_2|NZ_CP043617.1_555403_556066_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|322aa|down_3|NZ_CP043617.1_556062_557028_+	NA	NA|361aa|down_4|NZ_CP043617.1_557029_558112_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|225aa|down_5|NZ_CP043617.1_558115_558790_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|566aa|down_6|NZ_CP043617.1_558776_560474_-	COG0155, CysI, Sulfite reductase, beta subunit (hemoprotein) [Inorganic ion transport and metabolism]	NA|148aa|down_7|NZ_CP043617.1_560607_561051_+	cd16343, LMWPTP, Low molecular weight protein tyrosine phosphatase	NA|310aa|down_8|NZ_CP043617.1_561062_561992_+	TIGR01136, Cysteine_synthase, cysteine synthase	NA|199aa|down_9|NZ_CP043617.1_561981_562578_+	PRK07101, PRK07101, hypothetical protein; Provisional
GCF_009258225.1_ASM925822v1	NZ_CP043617	Sulfurimonas sp. GYSZ_1 chromosome, complete genome	3	819344-819949	3	CRISPRCasFinder	no		DEDDh,RT,WYL,csa3,cas3,cas9,cas1,cas2	Orphan	ACAACGGATGAAGCTACGGTAGATGAAGTTCCTGAAAGTACTGTAGATGAA	51	0	0	NA	NA	NA	4	4	Orphan	DEDDh,RT,WYL,csa3,cas3,cas9,cas1,cas2	NA|103aa|up_3|NZ_CP043617.1_814915_815224_+,NA|370aa|up_2|NZ_CP043617.1_815273_816383_+,NA|373aa|up_0|NZ_CP043617.1_816914_818033_-,NA	NA|227aa|up_9|NZ_CP043617.1_806351_807032_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|289aa|up_8|NZ_CP043617.1_807019_807886_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|175aa|up_7|NZ_CP043617.1_807991_808516_+	TIGR03344, VI_effect_Hcp1, type VI secretion system effector, Hcp1 family	NA|975aa|up_6|NZ_CP043617.1_808605_811530_+	TIGR03361, VI_Rhs_Vgr, type VI secretion system Vgr family protein	NA|390aa|up_5|NZ_CP043617.1_811516_812686_+	sd00010, SLR, Sel1-like repeat	NA|746aa|up_4|NZ_CP043617.1_812675_814913_+	pfam09994, DUF2235, Uncharacterized alpha/beta hydrolase domain (DUF2235)	NA|103aa|up_3|NZ_CP043617.1_814915_815224_+	NA	NA|370aa|up_2|NZ_CP043617.1_815273_816383_+	NA	NA|155aa|up_1|NZ_CP043617.1_816390_816855_-	pfam13682, CZB, Chemoreceptor zinc-binding domain	NA|373aa|up_0|NZ_CP043617.1_816914_818033_-	NA	NA|202aa|down_0|NZ_CP043617.1_823600_824206_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|447aa|down_1|NZ_CP043617.1_824210_825551_-	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|725aa|down_2|NZ_CP043617.1_825534_827709_-	TIGR03375, type_I_sec_LssB, type I secretion system ATPase, LssB family	NA|621aa|down_3|NZ_CP043617.1_827727_829590_-	TIGR01844, Proteases_secretion_protein_PrtF, type I secretion outer membrane protein, TolC family	NA|259aa|down_4|NZ_CP043617.1_829606_830383_-	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|1010aa|down_5|NZ_CP043617.1_830576_833606_+	COG0249, MutS, Mismatch repair ATPase (MutS family) [DNA replication, recombination, and repair]	NA|274aa|down_6|NZ_CP043617.1_833602_834424_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|366aa|down_7|NZ_CP043617.1_835230_836328_+	PRK13009, PRK13009, succinyl-diaminopimelate desuccinylase; Reviewed	NA|143aa|down_8|NZ_CP043617.1_836329_836758_+	pfam13899, Thioredoxin_7, Thioredoxin-like	NA|126aa|down_9|NZ_CP043617.1_836754_837132_+	TIGR00004, RutC_family_protein, reactive intermediate/imine deaminase
GCF_009258225.1_ASM925822v1	NZ_CP043617	Sulfurimonas sp. GYSZ_1 chromosome, complete genome	4	2137265-2139407	1,4,1,2,3,4	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR,PILER-CR	no	cas9,cas1,cas2	DEDDh,RT,WYL,csa3,cas3,cas9,cas1,cas2	 or Type II-C?,Type II-C,Type II-A, Type II-B,Type II-B	ATTGTATCAAATGGGGATTTGAGAGTAGCTGAAGACCAAA,ATTGTATCAAATGGGGATTTGAGAGTAGCTGAAGAC,ATTGTATCAAATGGGGATTTGAGAGTAGCTGAAGAC,ATTGTATCAAATGGGGATTTGAGAGTAGCTGAAGAC,ATTGTATCAAATGGGGATTTGAGAGTAGCTGAAGAC,ATTGTATCAAATGGGGATTTGAGAGTAGCTGAAGAC	40,36,36,36,36,36	1	1	2137697-2137726	NZ_CP043617.1_587998-587969	NA:NA:NA:NA:NA:NA	27,31,32,27,27,27	32	orTypeII-C?,TypeII-C,TypeII-A,TypeII-B,TypeII-B	DEDDh,RT,WYL,csa3,cas3,cas9,cas1,cas2	NA|401aa|up_8|NZ_CP043617.1_2127575_2128778_+,NA|378aa|down_2|NZ_CP043617.1_2142559_2143693_-,NA|247aa|down_9|NZ_CP043617.1_2150652_2151393_-	NA|671aa|up_9|NZ_CP043617.1_2125443_2127456_-	pfam02028, BCCT, BCCT, betaine/carnitine/choline family transporter	NA|401aa|up_8|NZ_CP043617.1_2127575_2128778_+	NA	NA|302aa|up_7|NZ_CP043617.1_2128755_2129661_-	pfam06719, AraC_N, AraC-type transcriptional regulator N-terminus	NA|113aa|up_6|NZ_CP043617.1_2129872_2130211_+	COG4925, COG4925, Uncharacterized conserved protein [Function unknown]	NA|245aa|up_5|NZ_CP043617.1_2130222_2130957_+	cd02910, cupin_Yhhw_N, Escherichia coli YhhW and YhaK and related proteins, pirin-like bicupin, N-terminal cupin domain	NA|154aa|up_4|NZ_CP043617.1_2130968_2131430_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|248aa|up_3|NZ_CP043617.1_2131512_2132256_+	cd05359, ChcA_like_SDR_c, 1-cyclohexenylcarbonyl_coenzyme A_reductase (ChcA)_like, classical (c) SDRs	cas9|1158aa|up_2|NZ_CP043617.1_2132425_2135899_+	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	cas1|291aa|up_1|NZ_CP043617.1_2136013_2136886_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas2|111aa|up_0|NZ_CP043617.1_2136882_2137215_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|384aa|down_0|NZ_CP043617.1_2139676_2140828_+	cd05283, CAD1, Cinnamyl alcohol dehydrogenases (CAD)	NA|535aa|down_1|NZ_CP043617.1_2140817_2142422_-	PRK03659, PRK03659, glutathione-regulated potassium-efflux system protein KefB; Provisional	NA|378aa|down_2|NZ_CP043617.1_2142559_2143693_-	NA	NA|335aa|down_3|NZ_CP043617.1_2143857_2144862_-	PRK10918, PRK10918, phosphate ABC transporter substrate-binding protein PstS	NA|286aa|down_4|NZ_CP043617.1_2145003_2145861_+	COG0573, PstC, ABC-type phosphate transport system, permease component [Inorganic ion transport and metabolism]	NA|280aa|down_5|NZ_CP043617.1_2145860_2146700_+	PRK11268, pstA, phosphate ABC transporter permease PstA	NA|262aa|down_6|NZ_CP043617.1_2146795_2147581_+	COG1117, PstB, ABC-type phosphate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|224aa|down_7|NZ_CP043617.1_2147595_2148267_+	TIGR02135, Uncharacterized_protein, phosphate transport system regulatory protein PhoU	NA|756aa|down_8|NZ_CP043617.1_2148339_2150607_+	pfam03924, CHASE, CHASE domain	NA|247aa|down_9|NZ_CP043617.1_2150652_2151393_-	NA
