assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002355455.1_ASM235545v1	NZ_AP017375	Stanieria sp. NIES-3757 DNA, complete genome	1	66380-66495	1	CRISPRCasFinder	no		cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG	Orphan	TTTACCTCTGAAGAGGCTAGAGAAGCTGGACGCAAAGGTGG	41	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG,cas14j	NA|558aa|up_9|NZ_AP017375.1_53651_55325_+,NA|203aa|up_1|NZ_AP017375.1_65051_65660_+,NA|242aa|down_1|NZ_AP017375.1_69993_70719_+,NA|71aa|down_2|NZ_AP017375.1_70828_71041_-,NA|174aa|down_8|NZ_AP017375.1_79628_80150_-,NA|144aa|down_9|NZ_AP017375.1_80210_80642_-	NA|558aa|up_9|NZ_AP017375.1_53651_55325_+	NA	NA|354aa|up_8|NZ_AP017375.1_56339_57401_+	TIGR04070, photo_TT_lyase, spore photoproduct lyase	NA|195aa|up_7|NZ_AP017375.1_57485_58070_-	cd12130, Apl, Allophycocyanin-like globins	NA|220aa|up_6|NZ_AP017375.1_58191_58851_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|181aa|up_5|NZ_AP017375.1_58993_59536_+	cd19433, lipocalin_CpcS-CpeS, CpcS/CpeS phycobiliprotein lyase family	NA|274aa|up_4|NZ_AP017375.1_59697_60519_+	pfam14218, COP23, Circadian oscillating protein COP23	NA|242aa|up_3|NZ_AP017375.1_60625_61351_-	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|1038aa|up_2|NZ_AP017375.1_61558_64672_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|203aa|up_1|NZ_AP017375.1_65051_65660_+	NA	NA|69aa|up_0|NZ_AP017375.1_65710_65917_+	PRK09752, PRK09752, AIDA-I family autotransporter YfaL	NA|161aa|down_0|NZ_AP017375.1_69414_69897_+	cd07727, YmaE-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis YmaE and related proteins; MBL-fold metallo hydrolase domain	NA|242aa|down_1|NZ_AP017375.1_69993_70719_+	NA	NA|71aa|down_2|NZ_AP017375.1_70828_71041_-	NA	NA|649aa|down_3|NZ_AP017375.1_71155_73102_-	pfam07693, KAP_NTPase, KAP family P-loop domain	NA|902aa|down_4|NZ_AP017375.1_73308_76014_-	PRK13557, PRK13557, histidine kinase; Provisional	NA|501aa|down_5|NZ_AP017375.1_76401_77904_+	cd07786, FGGY_EcGK_like, Escherichia coli glycerol kinase-like proteins; belongs to the FGGY family of carbohydrate kinases	NA|297aa|down_6|NZ_AP017375.1_78058_78949_+	COG2084, MmsB, 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases [Lipid metabolism]	NA|201aa|down_7|NZ_AP017375.1_79014_79617_+	PRK05986, PRK05986, cob(I)yrinic acid a,c-diamide adenosyltransferase	NA|174aa|down_8|NZ_AP017375.1_79628_80150_-	NA	NA|144aa|down_9|NZ_AP017375.1_80210_80642_-	NA
GCF_002355455.1_ASM235545v1	NZ_AP017375	Stanieria sp. NIES-3757 DNA, complete genome	2	731807-731918	2	CRISPRCasFinder	no	csa3,c2c5_V-U5	cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG	 Type V-U5?,Type I-A	TTTCAACATCTCCCACTTAGGGAGAGTCGAAGCAAC	36	0	0	NA	NA	NA	1	1	TypeV-U5?,TypeI-A	cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG,cas14j	NA,c2c5_V-U5|79aa|down_3|NZ_AP017375.1_735579_735816_+,c2c5_V-U5|107aa|down_4|NZ_AP017375.1_736130_736451_+	NA|295aa|up_9|NZ_AP017375.1_724456_725341_+	PRK00091, miaA, tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed	NA|108aa|up_8|NZ_AP017375.1_725375_725699_-	PRK02237, PRK02237, YnfA family protein	NA|38aa|up_7|NZ_AP017375.1_725901_726015_-	pfam08041, PetM, PetM family of cytochrome b6f complex subunit 7	NA|350aa|up_6|NZ_AP017375.1_726171_727221_+	PRK02746, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase PdxA	NA|138aa|up_5|NZ_AP017375.1_727217_727631_-	COG1837, COG1837, Predicted RNA-binding protein (contains KH domain) [General function prediction only]	NA|83aa|up_4|NZ_AP017375.1_727623_727872_-	CHL00005, rps16, ribosomal protein S16	NA|474aa|up_3|NZ_AP017375.1_727969_729391_-	PRK10867, PRK10867, signal recognition particle protein; Provisional	NA|309aa|up_2|NZ_AP017375.1_729775_730702_-	cd02647, nuc_hydro_TvIAG, nuc_hydro_ TvIAG:  Nucleoside hydrolases similar to the Inosine-adenosine-guanosine-preferring nucleoside hydrolase from Trypanosoma vivax	csa3|131aa|up_1|NZ_AP017375.1_730727_731120_-	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|54aa|up_0|NZ_AP017375.1_731185_731347_-	pfam02069, Metallothio_Pro, Prokaryotic metallothionein	NA|169aa|down_0|NZ_AP017375.1_733531_734038_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|89aa|down_1|NZ_AP017375.1_734025_734292_-	COG4453, COG4453, Uncharacterized protein conserved in bacteria [Function unknown]	NA|252aa|down_2|NZ_AP017375.1_734419_735175_+	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	c2c5_V-U5|79aa|down_3|NZ_AP017375.1_735579_735816_+	NA	c2c5_V-U5|107aa|down_4|NZ_AP017375.1_736130_736451_+	NA	NA|168aa|down_5|NZ_AP017375.1_737780_738284_+	pfam02481, DNA_processg_A, DNA recombination-mediator protein A	NA|720aa|down_6|NZ_AP017375.1_738643_740803_+	pfam00759, Glyco_hydro_9, Glycosyl hydrolase family 9	NA|975aa|down_7|NZ_AP017375.1_741138_744063_+	NF033092, HK_WalK, cell wall metabolism sensor histidine kinase WalK	NA|455aa|down_8|NZ_AP017375.1_744289_745654_+	cd07100, ALDH_SSADH1_GabD1, Mycobacterium tuberculosis succinate-semialdehyde dehydrogenase 1-like	NA|233aa|down_9|NZ_AP017375.1_745800_746499_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]
GCF_002355455.1_ASM235545v1	NZ_AP017375	Stanieria sp. NIES-3757 DNA, complete genome	3	1999408-1999509	3	CRISPRCasFinder	no		cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG	Orphan	AAATACTTATCAATCTCCTAATCA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG,cas14j	NA,NA|113aa|down_1|NZ_AP017375.1_2002805_2003144_+	NA|160aa|up_9|NZ_AP017375.1_1990503_1990983_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|296aa|up_8|NZ_AP017375.1_1991074_1991962_-	COG2602, COG2602, Beta-lactamase class D [Defense mechanisms]	NA|389aa|up_7|NZ_AP017375.1_1992043_1993210_-	pfam03739, YjgP_YjgQ, Predicted permease YjgP/YjgQ family	NA|243aa|up_6|NZ_AP017375.1_1993251_1993980_-	COG1137, YhbG, ABC-type (unclassified) transport system, ATPase component [General function prediction only]	NA|164aa|up_5|NZ_AP017375.1_1994010_1994502_-	COG1934, COG1934, Uncharacterized protein conserved in bacteria [Function unknown]	NA|132aa|up_4|NZ_AP017375.1_1994626_1995022_-	pfam03745, DUF309, Domain of unknown function (DUF309)	NA|119aa|up_3|NZ_AP017375.1_1995028_1995385_-	CHL00165, ftrB, ferredoxin thioreductase subunit beta; Validated	NA|221aa|up_2|NZ_AP017375.1_1995579_1996242_+	COG1040, ComFC, Predicted amidophosphoribosyltransferases [General function prediction only]	NA|224aa|up_1|NZ_AP017375.1_1996309_1996981_+	pfam06967, Mo-nitro_C, Mo-dependent nitrogenase C-terminus	NA|340aa|up_0|NZ_AP017375.1_1997062_1998082_+	PRK12299, obgE, GTPase CgtA; Reviewed	NA|395aa|down_0|NZ_AP017375.1_2001266_2002451_+	PRK07415, PRK07415, NAD(P)H-quinone oxidoreductase subunit H; Validated	NA|113aa|down_1|NZ_AP017375.1_2002805_2003144_+	NA	NA|174aa|down_2|NZ_AP017375.1_2003353_2003875_-	pfam13490, zf-HC2, Putative zinc-finger	NA|219aa|down_3|NZ_AP017375.1_2003991_2004648_-	PRK09652, PRK09652, RNA polymerase sigma factor RpoE; Provisional	NA|182aa|down_4|NZ_AP017375.1_2004726_2005272_-	pfam10719, ComFB, Late competence development protein ComFB	NA|285aa|down_5|NZ_AP017375.1_2005432_2006287_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|692aa|down_6|NZ_AP017375.1_2006612_2008688_+	COG4250, COG4250, Predicted sensor protein/domain [Signal transduction mechanisms]	NA|143aa|down_7|NZ_AP017375.1_2008787_2009216_+	pfam02531, PsaD, PsaD	NA|510aa|down_8|NZ_AP017375.1_2009315_2010845_+	COG0147, TrpE, Anthranilate/para-aminobenzoate synthases component I [Amino acid transport and metabolism / Coenzyme metabolism]	NA|212aa|down_9|NZ_AP017375.1_2010860_2011496_-	COG0009, SUA5, Putative translation factor (SUA5) [Translation, ribosomal structure and biogenesis]
GCF_002355455.1_ASM235545v1	NZ_AP017375	Stanieria sp. NIES-3757 DNA, complete genome	4	2424223-2424325	4	CRISPRCasFinder	no		cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG	Orphan	AAGCCCAAAGTCAGTTAGATGAATT	25	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG,cas14j	NA|230aa|up_4|NZ_AP017375.1_2417395_2418085_+,NA|250aa|up_0|NZ_AP017375.1_2423057_2423807_+,NA	NA|232aa|up_9|NZ_AP017375.1_2411050_2411746_-	COG0170, SEC59, Dolichol kinase [Lipid metabolism]	NA|81aa|up_8|NZ_AP017375.1_2411830_2412073_+	pfam01809, Haemolytic, Haemolytic domain	NA|367aa|up_7|NZ_AP017375.1_2412762_2413863_+	PRK13396, PRK13396, 3-deoxy-7-phosphoheptulonate synthase; Provisional	NA|564aa|up_6|NZ_AP017375.1_2414012_2415704_+	pfam00221, Lyase_aromatic, Aromatic amino acid lyase	NA|503aa|up_5|NZ_AP017375.1_2415837_2417346_+	cd05936, FC-FACS_FadD_like, Prokaryotic long-chain fatty acid CoA synthetases similar to Escherichia coli FadD	NA|230aa|up_4|NZ_AP017375.1_2417395_2418085_+	NA	NA|550aa|up_3|NZ_AP017375.1_2418114_2419764_+	pfam01565, FAD_binding_4, FAD binding domain	NA|626aa|up_2|NZ_AP017375.1_2419705_2421583_+	cd13149, MATE_like_2, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins	NA|400aa|up_1|NZ_AP017375.1_2421649_2422849_+	PRK11199, tyrA, bifunctional chorismate mutase/prephenate dehydrogenase; Provisional	NA|250aa|up_0|NZ_AP017375.1_2423057_2423807_+	NA	NA|234aa|down_0|NZ_AP017375.1_2424636_2425338_-	TIGR03410, urea_trans_UrtE, urea ABC transporter, ATP-binding protein UrtE	NA|251aa|down_1|NZ_AP017375.1_2425407_2426160_-	TIGR03411, urea_trans_UrtD, urea ABC transporter, ATP-binding protein UrtD	NA|390aa|down_2|NZ_AP017375.1_2426309_2427479_-	TIGR03408, urea_trans_UrtC, urea ABC transporter, permease protein UrtC	NA|389aa|down_3|NZ_AP017375.1_2427554_2428721_-	TIGR03409, urea_trans_UrtB, urea ABC transporter, permease protein UrtB	NA|443aa|down_4|NZ_AP017375.1_2428897_2430226_-	pfam13433, Peripla_BP_5, Periplasmic binding protein domain	NA|199aa|down_5|NZ_AP017375.1_2430434_2431031_-	cd05540, UreG, urease accessory protein UreG	NA|191aa|down_6|NZ_AP017375.1_2431092_2431665_+	pfam06041, DUF924, Bacterial protein of unknown function (DUF924)	NA|367aa|down_7|NZ_AP017375.1_2431670_2432771_+	PRK00292, glk, glucokinase; Provisional	NA|503aa|down_8|NZ_AP017375.1_2433809_2435318_+	pfam12710, HAD, haloacid dehalogenase-like hydrolase	NA|278aa|down_9|NZ_AP017375.1_2435375_2436209_+	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily
GCF_002355455.1_ASM235545v1	NZ_AP017375	Stanieria sp. NIES-3757 DNA, complete genome	5	2431704-2431828	5	CRISPRCasFinder	no		cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG	Orphan	AAAACTATCTTGCGTCTGATCGAAGTAGCAGAGCAAGTTCAATC	44	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG,cas14j	NA|250aa|up_7|NZ_AP017375.1_2423057_2423807_+,NA|112aa|down_2|NZ_AP017375.1_2436649_2436985_-	NA|626aa|up_9|NZ_AP017375.1_2419705_2421583_+	cd13149, MATE_like_2, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins	NA|400aa|up_8|NZ_AP017375.1_2421649_2422849_+	PRK11199, tyrA, bifunctional chorismate mutase/prephenate dehydrogenase; Provisional	NA|250aa|up_7|NZ_AP017375.1_2423057_2423807_+	NA	NA|234aa|up_6|NZ_AP017375.1_2424636_2425338_-	TIGR03410, urea_trans_UrtE, urea ABC transporter, ATP-binding protein UrtE	NA|251aa|up_5|NZ_AP017375.1_2425407_2426160_-	TIGR03411, urea_trans_UrtD, urea ABC transporter, ATP-binding protein UrtD	NA|390aa|up_4|NZ_AP017375.1_2426309_2427479_-	TIGR03408, urea_trans_UrtC, urea ABC transporter, permease protein UrtC	NA|389aa|up_3|NZ_AP017375.1_2427554_2428721_-	TIGR03409, urea_trans_UrtB, urea ABC transporter, permease protein UrtB	NA|443aa|up_2|NZ_AP017375.1_2428897_2430226_-	pfam13433, Peripla_BP_5, Periplasmic binding protein domain	NA|199aa|up_1|NZ_AP017375.1_2430434_2431031_-	cd05540, UreG, urease accessory protein UreG	NA|191aa|up_0|NZ_AP017375.1_2431092_2431665_+	pfam06041, DUF924, Bacterial protein of unknown function (DUF924)	NA|503aa|down_0|NZ_AP017375.1_2433809_2435318_+	pfam12710, HAD, haloacid dehalogenase-like hydrolase	NA|278aa|down_1|NZ_AP017375.1_2435375_2436209_+	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	NA|112aa|down_2|NZ_AP017375.1_2436649_2436985_-	NA	NA|209aa|down_3|NZ_AP017375.1_2437012_2437639_-	pfam06206, CpeT, CpeT/CpcT family (DUF1001)	NA|179aa|down_4|NZ_AP017375.1_2437726_2438263_-	pfam09367, CpeS, CpeS-like protein	NA|64aa|down_5|NZ_AP017375.1_2438526_2438718_-	smart01094, CpcD, CpcD/allophycocyanin linker domain	NA|256aa|down_6|NZ_AP017375.1_2438799_2439567_-	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|292aa|down_7|NZ_AP017375.1_2440004_2440880_-	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|185aa|down_8|NZ_AP017375.1_2441433_2441988_-	cd19433, lipocalin_CpcS-CpeS, CpcS/CpeS phycobiliprotein lyase family	NA|427aa|down_9|NZ_AP017375.1_2442050_2443331_-	cd14750, PBP2_TMBP, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose; possesses type 2 periplasmic binding fold
GCF_002355455.1_ASM235545v1	NZ_AP017375	Stanieria sp. NIES-3757 DNA, complete genome	6	2942802-2943281	1,6,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG	Type I-C,Type I-U, Type I-U?	ATTGCGATCGTCCTTCGGGGCGATCGAGGATTGAAACT,ATTGCGATCGTCCTTCGGGGCGATCGAGGATTGAAAC,ATTGCGATCGTCCTTCGGGGCGATCGAGGATT	38,37,32	0	0	NA	NA	NA:NA:NA	4,4,6	6	TypeI-C,TypeI-U,TypeI-U?	cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG,cas14j	NA|106aa|up_7|NZ_AP017375.1_2936055_2936373_+,NA|99aa|up_3|NZ_AP017375.1_2940334_2940631_+,NA|178aa|down_1|NZ_AP017375.1_2946091_2946625_-,NA|174aa|down_7|NZ_AP017375.1_2953461_2953983_-,NA|216aa|down_8|NZ_AP017375.1_2954252_2954900_+	NA|71aa|up_9|NZ_AP017375.1_2933396_2933609_+	pfam10742, DUF2555, Protein of unknown function (DUF2555)	cas3|738aa|up_8|NZ_AP017375.1_2933741_2935955_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	NA|106aa|up_7|NZ_AP017375.1_2936055_2936373_+	NA	cas5|229aa|up_6|NZ_AP017375.1_2936627_2937314_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|576aa|up_5|NZ_AP017375.1_2937310_2939038_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas7|379aa|up_4|NZ_AP017375.1_2939103_2940240_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	NA|99aa|up_3|NZ_AP017375.1_2940334_2940631_+	NA	cas4|207aa|up_2|NZ_AP017375.1_2940673_2941294_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|345aa|up_1|NZ_AP017375.1_2941296_2942331_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|NZ_AP017375.1_2942334_2942625_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|759aa|down_0|NZ_AP017375.1_2943445_2945722_+	pfam03772, Competence, Competence protein	NA|178aa|down_1|NZ_AP017375.1_2946091_2946625_-	NA	NA|168aa|down_2|NZ_AP017375.1_2947640_2948144_+	pfam04116, FA_hydroxylase, Fatty acid hydroxylase superfamily	NA|306aa|down_3|NZ_AP017375.1_2949227_2950145_+	TIGR04262, possible_ABC_transporter_solute_binding_protein, extracellular substrate-binding orphan protein, GRRM family	NA|119aa|down_4|NZ_AP017375.1_2950212_2950569_+	TIGR04260, hypothetical_protein, rSAM-associated Gly-rich repeat protein	NA|402aa|down_5|NZ_AP017375.1_2950632_2951838_+	TIGR04261, putative_arylsulfatase_regulatory_protein, radical SAM/SPASM domain protein, GRRM system	NA|328aa|down_6|NZ_AP017375.1_2952202_2953186_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|174aa|down_7|NZ_AP017375.1_2953461_2953983_-	NA	NA|216aa|down_8|NZ_AP017375.1_2954252_2954900_+	NA	NA|473aa|down_9|NZ_AP017375.1_2955180_2956599_+	TIGR00992, chloroplast_import-associated_channel_homolog, chloroplast envelope protein translocase, IAP75 family
GCF_002355455.1_ASM235545v1	NZ_AP017375	Stanieria sp. NIES-3757 DNA, complete genome	7	3139253-3139508	7,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas5,cas7,cas8b3,cas6	cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG	Unclear	TGTTCACAACGCCTGACGACATCTAATCTGGTTGCCC,GTTCACAACGCCTGACGACATCTAATCTGGTTGCCC,GTTCACAACGCCTGACGACATCTAATCTGGTTGCCCTT	37,36,38	0	0	NA	NA	NA:NA:NA	3,3,2	3	Unclear	cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG,cas14j	NA,NA	NA|456aa|up_9|NZ_AP017375.1_3114135_3115503_-	pfam06527, TniQ, TniQ	NA|397aa|up_8|NZ_AP017375.1_3115492_3116683_-	pfam05621, TniB, Bacterial TniB protein	NA|912aa|up_7|NZ_AP017375.1_3116685_3119421_-	pfam08722, Tn7_Tnp_TnsA_N, TnsA endonuclease N terminal	NA|68aa|up_6|NZ_AP017375.1_3121129_3121333_+	pfam11211, DUF2997, Protein of unknown function (DUF2997)	NA|707aa|up_5|NZ_AP017375.1_3121400_3123521_-	pfam00580, UvrD-helicase, UvrD/REP helicase N-terminal domain	NA|2114aa|up_4|NZ_AP017375.1_3123562_3129904_-	cd17923, DEXHc_Hrq1-like, DEAH-box helicase domain of Hrq1 and similar proteins	NA|122aa|up_3|NZ_AP017375.1_3129949_3130315_-	TIGR02436, S23_ribosomal_protein, four helix bundle protein	NA|1583aa|up_2|NZ_AP017375.1_3130329_3135078_-	COG1002, COG1002, Type II restriction enzyme, methylase subunits [Defense mechanisms]	NA|103aa|up_1|NZ_AP017375.1_3135232_3135541_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|972aa|up_0|NZ_AP017375.1_3136064_3138980_-	cd18011, DEXDc_RapA, DEXH-box helicase domain of RapA	cas5|209aa|down_0|NZ_AP017375.1_3139768_3140395_-	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|337aa|down_1|NZ_AP017375.1_3140398_3141409_-	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8b3|566aa|down_2|NZ_AP017375.1_3141412_3143110_-	cd09713, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas6|217aa|down_3|NZ_AP017375.1_3143121_3143772_-	pfam09559, Cas6, Cas6 Crispr	NA|341aa|down_4|NZ_AP017375.1_3143937_3144960_-	pfam06527, TniQ, TniQ	NA|258aa|down_5|NZ_AP017375.1_3145873_3146647_+	COG1127, Ttg2A, ABC-type transport system involved in resistance to organic solvents, ATPase component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|435aa|down_6|NZ_AP017375.1_3146702_3148007_+	PLN03094, PLN03094, Substrate binding subunit of ER-derived-lipid transporter; Provisional	NA|347aa|down_7|NZ_AP017375.1_3148003_3149044_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|227aa|down_8|NZ_AP017375.1_3149052_3149733_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|460aa|down_9|NZ_AP017375.1_3150277_3151657_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit
GCF_002355455.1_ASM235545v1	NZ_AP017375	Stanieria sp. NIES-3757 DNA, complete genome	8	4882387-4882643	3,8,3	PILER-CR,CRISPRCasFinder,CRT	no		cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG	Orphan	ATTGCGATCGTCCTTCGGGGCGA--GAGGTTTGCTTCGCGT,ATTGCGATCGTCCTTCGGGGCGAGAGGTTTGCTTCGCGT,ATTGCGATCGTCCTTCGGGGCGAGAGGTTTGCTTCGCGT	41,39,39	0	0	NA	NA	NA:NA:NA	3,3,3	3	Orphan	cas3,csa3,c2c5_V-U5,DEDDh,RT,cas5,cas8c,cas7,cas4,cas1,cas2,cas8b3,cas6,DinG,cas14j	NA|60aa|up_4|NZ_AP017375.1_4879014_4879194_+,NA	NA|281aa|up_9|NZ_AP017375.1_4873816_4874659_-	cd07326, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|140aa|up_8|NZ_AP017375.1_4874658_4875078_-	pfam03965, Penicillinase_R, Penicillinase repressor	NA|290aa|up_7|NZ_AP017375.1_4875530_4876400_-	cd09025, Aldose_epim_Slr1438, Aldose 1-epimerase, similar to Synechocystis Slr1438	NA|325aa|up_6|NZ_AP017375.1_4876443_4877418_-	PRK06245, cofG, FO synthase subunit 1; Reviewed	NA|361aa|up_5|NZ_AP017375.1_4877555_4878638_+	TIGR01151, Photosystem_QB_protein, photosystem II, DI subunit (also called Q(B))	NA|60aa|up_4|NZ_AP017375.1_4879014_4879194_+	NA	NA|183aa|up_3|NZ_AP017375.1_4879323_4879872_+	pfam01789, PsbP, PsbP	NA|199aa|up_2|NZ_AP017375.1_4879881_4880478_+	PRK00148, PRK00148, Maf-like protein; Reviewed	NA|278aa|up_1|NZ_AP017375.1_4880515_4881349_+	COG4279, COG4279, Uncharacterized conserved protein [Function unknown]	NA|228aa|up_0|NZ_AP017375.1_4881454_4882138_-	COG0705, COG0705, Membrane associated serine protease [Amino acid transport and metabolism]	NA|160aa|down_0|NZ_AP017375.1_4882678_4883158_+	pfam01625, PMSR, Peptide methionine sulfoxide reductase	NA|235aa|down_1|NZ_AP017375.1_4883224_4883929_-	COG0398, COG0398, Uncharacterized conserved protein [Function unknown]	NA|139aa|down_2|NZ_AP017375.1_4884478_4884895_+	cd17548, REC_DivK-like, phosphoacceptor receiver (REC) domain of DivK and similar proteins	NA|770aa|down_3|NZ_AP017375.1_4884880_4887190_-	cd07389, MPP_PhoD, Bacillus subtilis PhoD and related proteins, metallophosphatase domain	NA|352aa|down_4|NZ_AP017375.1_4887325_4888381_+	pfam12275, DUF3616, Protein of unknown function (DUF3616)	NA|88aa|down_5|NZ_AP017375.1_4888710_4888974_-	COG4095, COG4095, Uncharacterized conserved protein [Function unknown]	NA|186aa|down_6|NZ_AP017375.1_4889621_4890179_+	pfam04115, Ureidogly_lyase, Ureidoglycolate lyase	NA|313aa|down_7|NZ_AP017375.1_4890179_4891118_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|333aa|down_8|NZ_AP017375.1_4891358_4892357_-	cd07331, M48C_Oma1_like, Peptidase M48C, integral membrane endopeptidase	NA|215aa|down_9|NZ_AP017375.1_4892569_4893214_-	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like
