CRISPRimmunity

Please click to download your results

Overview of predicted results

Overview of the results

Contig_ID	Contig_def	CRISPR array number	Contig Signature genes	Self targeting spacer number	Target MGE spacer number	Prophage number	Anti-CRISPR protein number
NZ_CP016770	Candidatus Planktophila dulcis isolate MMS-21-155 chromosome, complete genome	2 crisprs	DinG,WYL,cas4,DEDDh,cas3	0	0	4	0

Results visualization

Click the left colored region to show detailed information

CRISPR-Cas detection and classification

Crispr_ID: NZ_CP016770_1

CRISPR_ID

CRISPR_location

CRISPR_type

Repeat_type

Spacer_info

Cas_protein_info

CRISPR-Cas_info

NZ_CP016770_1

301955-302094

Orphan

Consensus_repeat	Method
GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC	CRISPRCasFinder

1 spacers

The CRISPR arrays of NZ_CP016770_1

>merge|NZ_CP016770|1|301955-302094|CRISPRCasFinder
GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCCCGCACCGCTTAATTAAATATGCAATAATTGGACTCCGCTTGTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC

>NZ_CP016770|1|1|301955-302094|CRISPRCasFinder
GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC	CGCACCGCTTAATTAAATATGCAATAATTGGACTCCGCTT
GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC

Protein	Signature genes	Signature genes Name	Protein_function
NZ_CP016770.1\|WP_095696028.1\|311164_312655_-\|CoA-acylating-methylmalonate-semialdehyde-dehydrogenase	unknown	unknown	gnl\|CDD\|143404
NZ_CP016770.1\|WP_095675846.1\|304239_305247_-\|Gfo/Idh/MocA-family-oxidoreductase	unknown	unknown	gnl\|CDD\|275173
NZ_CP016770.1\|WP_095696026.1\|308320_310234_-\|3D-(3,5/4)-trihydroxycyclohexane-1,2-dione-acylhydrolase-(decyclizing)	unknown	unknown	gnl\|CDD\|275170
NZ_CP016770.1\|WP_095696018.1\|298599_299013_+\|Rieske-2Fe-2S-domain-containing-protein	unknown	unknown	gnl\|CDD\|223795
NZ_CP016770.1\|WP_095675847.1\|305283_306306_-\|Gfo/Idh/MocA-family-oxidoreductase	unknown	unknown	gnl\|CDD\|223745
NZ_CP016770.1\|WP_095696017.1\|296337_296832_+\|site-specific-integrase	unknown	unknown	gnl\|CDD\|271189
NZ_CP016770.1\|WP_095696019.1\|299046_299586_+\|hypothetical-protein	unknown	unknown	unknown
NZ_CP016770.1\|WP_095696025.1\|306315_307392_-\|transaldolase-family-protein	unknown	unknown	gnl\|CDD\|376418
NZ_CP016770.1\|WP_095675835.1\|297978_298590_+\|hypothetical-protein	unknown	unknown	unknown
NZ_CP016770.1\|WP_095696027.1\|310235_311156_-\|5-deoxy-glucuronate-isomerase	unknown	unknown	gnl\|CDD\|377429
NZ_CP016770.1\|WP_095675849.1\|307396_308311_-\|TIM-barrel-protein	unknown	unknown	gnl\|CDD\|275172
NZ_CP016770.1\|WP_095675834.1\|297424_297967_+\|DUF305-domain-containing-protein	unknown	unknown	gnl\|CDD\|367619
NZ_CP016770.1\|WP_095696021.1\|300306_300639_+\|TipAS-antibiotic-recognition-domain-containing-protein	unknown	unknown	gnl\|CDD\|377906
NZ_CP016770.1\|WP_095696024.1\|302381_302864_+\|SRPBCC-family-protein	unknown	unknown	gnl\|CDD\|176867
NZ_CP016770.1\|WP_095696015.1\|294315_295320_+\|NAD(P)-dependent-alcohol-dehydrogenase	unknown	unknown	gnl\|CDD\|176188
NZ_CP016770.1\|WP_095696022.1\|300738_301407_+\|hypothetical-protein	unknown	unknown	unknown
NZ_CP016770.1\|WP_190286211.1\|303335_304322_-\|DMT-family-transporter	unknown	unknown	gnl\|CDD\|223769
NZ_CP016770.1\|WP_095675844.1\|302873_303197_+\|hypothetical-protein	unknown	unknown	gnl\|CDD\|226318
NZ_CP016770.1\|WP_095696023.1\|301415_301883_+\|hypothetical-protein	unknown	unknown	unknown
NZ_CP016770.1\|WP_150121963.1\|295873_296434_-\|GIY-YIG-nuclease-family-protein	unknown	unknown	gnl\|CDD\|366699

Protein	Function_ID	Function_description	E-value
NZ_CP016770.1\|WP_095696028.1\|311164_312655_-\|CoA-acylating-methylmalonate-semialdehyde-dehydrogenase	gnl\|CDD\|143404	cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2. Methylmalonate semialdehyde dehydrogenase (MMSDH, EC=1.2.1.27) [acylating] from Bacillus subtilis is involved in valine metabolism and catalyses the NAD+- and CoA-dependent oxidation of methylmalonate semialdehyde into propionyl-CoA. Mitochondrial human MMSDH ALDH6A1 and Arabidopsis MMSDH ALDH6B2 are also present in this CD.	0
NZ_CP016770.1\|WP_095675846.1\|304239_305247_-\|Gfo/Idh/MocA-family-oxidoreductase	gnl\|CDD\|275173	TIGR04380, hypothetical_protein_HOLDEFILI_04020, inositol 2-dehydrogenase. All members of the seed alignment for this model are known or predicted inositol 2-dehydrogenase sequences co-clustered with other enzymes for catabolism of myo-inositol or closely related compounds. Inositol 2-dehydrogenase catalyzes the first step in inositol catabolism. Members of this family may vary somewhat in their ranges of acceptable substrates and some may act on analogs to myo-inositol rather than myo-inositol per se. [Energy metabolism, Sugars].	1.0527e-121
NZ_CP016770.1\|WP_095696026.1\|308320_310234_-\|3D-(3,5/4)-trihydroxycyclohexane-1,2-dione-acylhydrolase-(decyclizing)	gnl\|CDD\|275170	TIGR04377, 3D-35/4-trihydroxycyclohexane-12-dione_hydrolase, 3,5/4-trihydroxycyclohexa-1,2-dione hydrolase. Members of this protein family, 3,5/4-trihydroxycyclohexa-1,2-dione hydrolase (iolD), represent one of eight enzymes in a pathway converting myo-inositol to acetyl-CoA. IolD hydrolyzes the cyclic molecule 3D-(3,5/4)-trihydroxycyclohexane-1,2-dione to yield 5-deoxy-D-glucuronic acid. TPP is a cofactor. [Energy metabolism, Sugars].	0
NZ_CP016770.1\|WP_095696018.1\|298599_299013_+\|Rieske-2Fe-2S-domain-containing-protein	gnl\|CDD\|223795	COG0723, QcrA, Rieske Fe-S protein [Energy production and conversion].	3.45535e-09
NZ_CP016770.1\|WP_095675847.1\|305283_306306_-\|Gfo/Idh/MocA-family-oxidoreductase	gnl\|CDD\|223745	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only].	2.70501e-57
NZ_CP016770.1\|WP_095696017.1\|296337_296832_+\|site-specific-integrase	gnl\|CDD\|271189	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons. This family of tyrosine based site-specific integrases is has origins in bacterial phages and conjugate transposons. One member is the integrase from Bacillus subtilis conjugative transposon ICEBs1. ICEBs1 can be excised and transfered to various recipients in response to DNA damage or high concentrations of potential mating partners. The family belongs to the superfamily of DNA breaking-rejoining enzymes, which share the same fold in their catalytic domain and the overall reaction mechanism. The catalytic domain contains six conserved active site residues. Their overall reaction mechanism involves cleavage of a single strand of a DNA duplex by nucleophilic attack of a conserved tyrosine to give a 3' phosphotyrosyl protein-DNA adduct. In the second rejoining step, a terminal 5' hydroxyl attacks the covalent adduct to release the enzyme and generate duplex DNA.	2.68065e-13
NZ_CP016770.1\|WP_095696025.1\|306315_307392_-\|transaldolase-family-protein	gnl\|CDD\|376418	pfam00923, TAL_FSA, Transaldolase/Fructose-6-phosphate aldolase. Transaldolase (TAL) is an enzyme of the pentose phosphate pathway (PPP) found almost ubiquitously in the three domains of life (Archaea, Bacteria, and Eukarya). TAL shares a high degree of structural similarity and sequence identity with fructose-6-phosphate aldolase (FSA). They both belong to the class I aldolase family. Their protein structures have been revealed.	3.92059e-34
NZ_CP016770.1\|WP_095696027.1\|310235_311156_-\|5-deoxy-glucuronate-isomerase	gnl\|CDD\|377429	pfam04962, KduI, KduI/IolB family. This family includes the 5-keto 4-deoxyuronate isomerase enzyme EC:5.3.1.17 that is involved in pectin degradation. This family aldo includes bacterial Myo-inositol catabolism (IolB) proteins. The Bacillus subtilis inositol operon (iolABCDEFGHIJ) is involved in myo-inositol catabolism. Glucose repression of the iol operon induced by inositol is exerted through catabolite repression mediated by CcpA and the iol induction system mediated by IolR. The exact function of IolB is unknown. Members of this family possess a Cupin like structure.	1.16375e-80
NZ_CP016770.1\|WP_095675834.1\|297424_297967_+\|DUF305-domain-containing-protein	gnl\|CDD\|367619	pfam03713, DUF305, Domain of unknown function (DUF305). Domain found in small family of bacterial secreted proteins with no known function. Also found in Paramecium bursaria chlorella virus 1. This domain is short and found in one or two copies. The domain has a conserved HH motif that may be functionally important. This domain belongs to the ferritin superfamily. It contains two sequence similar repeats each of which is composed of two alpha helices.	9.15705e-39
NZ_CP016770.1\|WP_095696021.1\|300306_300639_+\|TipAS-antibiotic-recognition-domain-containing-protein	gnl\|CDD\|377906	pfam07739, TipAS, TipAS antibiotic-recognition domain. This domain is found at the C-terminus of some MerR family transcription factors. The domain has an alpha-helical globin-like fold. The family includes Mta a central regulator of multidrug resistance in Bacillus subtilis.	1.58948e-16
NZ_CP016770.1\|WP_095696024.1\|302381_302864_+\|SRPBCC-family-protein	gnl\|CDD\|176867	cd07825, SRPBCC_7, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins. Uncharacterized group of the SRPBCC (START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC) domain superfamily. SRPBCC domains have a deep hydrophobic ligand-binding pocket and they bind diverse ligands. SRPBCC domains include the steroidogenic acute regulatory protein (StAR)-related lipid transfer (START) domains of mammalian STARD1-STARD15, the C-terminal catalytic domains of the alpha oxygenase subunit of Rieske-type non-heme iron aromatic ring-hydroxylating oxygenases (RHOs_alpha_C), Class I and II phosphatidylinositol transfer proteins (PITPs), Bet v 1 (the major pollen allergen of white birch, Betula verrucosa), CoxG, CalC, and related proteins. Other members of the superfamily include PYR/PYL/RCAR plant proteins, the aromatase/cyclase (ARO/CYC) domains of proteins such as Streptomyces glaucescens tetracenomycin, and the SRPBCC domains of Streptococcus mutans Smu.440 and related proteins.	1.06357e-37
NZ_CP016770.1\|WP_095696015.1\|294315_295320_+\|NAD(P)-dependent-alcohol-dehydrogenase	gnl\|CDD\|176188	cd05285, sorbitol_DH, Sorbitol dehydrogenase. Sorbitol and aldose reductase are NAD(+) binding proteins of the polyol pathway, which interconverts glucose and fructose. Sorbitol dehydrogenase is tetrameric and has a single catalytic zinc per subunit. Aldose reductase catalyzes the NADP(H)-dependent conversion of glucose to sorbital, and SDH uses NAD(H) in the conversion of sorbitol to fructose. NAD(P)(H)-dependent oxidoreductases are the major enzymes in the interconversion of alcohols and aldehydes, or ketones. The medium chain alcohol dehydrogenase family (MDR) have a NAD(P)(H)-binding domain in a Rossmann fold of a beta-alpha form. The N-terminal region typically has an all-beta catalytic domain. These proteins typically form dimers (typically higher plants, mammals) or tetramers (yeast, bacteria), and have 2 tightly bound zinc atoms per subunit.	2.86829e-165
NZ_CP016770.1\|WP_095675849.1\|307396_308311_-\|TIM-barrel-protein	gnl\|CDD\|275172	TIGR04379, myo-inositol_catabolism_protein, myo-inosose-2 dehydratase. Members of this family include the enzyme myo-inosose-2 dehydratase, product of the gene iolE, as found in inositol utilization cassettes in many species. [Energy metabolism, Sugars].	2.97727e-35
NZ_CP016770.1\|WP_190286211.1\|303335_304322_-\|DMT-family-transporter	gnl\|CDD\|223769	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only].	6.37403e-11
NZ_CP016770.1\|WP_095675844.1\|302873_303197_+\|hypothetical-protein	gnl\|CDD\|226318	COG3795, COG3795, Uncharacterized protein conserved in bacteria [Function unknown].	5.5313e-09
NZ_CP016770.1\|WP_150121963.1\|295873_296434_-\|GIY-YIG-nuclease-family-protein	gnl\|CDD\|366699	pfam01541, GIY-YIG, GIY-YIG catalytic domain. This domain called GIY-YIG is found in the amino terminal region of excinuclease abc subunit c (uvrC), bacteriophage T4 endonucleases segA, segB, segC, segD and segE; it is also found in putative endonucleases encoded by group I introns of fungi and phage. The structure of I-TevI a GIY-YIG endonuclease, reveals a novel alpha/beta-fold with a central three-stranded antiparallel beta-sheet flanked by three helices. The most conserved and putative catalytic residues are located on a shallow, concave surface and include a metal coordination site.	0.00782016

>NZ_CP016770.1|WP_095696023.1|301415_301883_+|hypothetical-protein
MKFDIKKVFPENPSKFEGFRIIRLIAALYMSVMVARSCIHLFAPDGGAQSIAGIDTSVEGGNNIIAIFHQWGAIQLILAILLIVLFFRYPGLTPLILLTLTLDPVLRFVAGQQMSLTTTGTPPGEALNGVSLYLLLVLFLGSLWNKKAKLDLSGL
>NZ_CP016770.1|WP_095696022.1|300738_301407_+|hypothetical-protein
MKIKSVAISATAFVLLGGVLGVQQYISSQITSKVQREMPNASGISASVPLADVPSNLTSDSIKSADINIKSFALKESGTKTSLNISASSISKAKPTLVGSLEITATIPASTITKSSEFNDAQIVGNTLQVSAGAGGMGTAILIPKYSNSQLYFELQSISLLGNQIPASSLPSDLQNQIKSRSQRSLTPPKGLKVKSVSLSSKGLSVKMFGNNIQLGNLGSGL
>NZ_CP016770.1|WP_095696021.1|300306_300639_+|TipAS-antibiotic-recognition-domain-containing-protein
MKFGIHSEQYQNNFSKEETQKFTEVFGELTQEFAEKLNTGVPSSDESVQVLVRRHYEFCLQFWTPTKEAYKSLAMTYILPSPYRDAYEEVAEGLGKYHYDAVCIWADKNL
>NZ_CP016770.1|WP_095696019.1|299046_299586_+|hypothetical-protein
MRKFLIVTIVSTLIIIVTYFLPSGVWAEFGELPAHPLIIHGVVVLLPLLAILLLAGLFWKNLLKKLHLPLIGALALSVVGVLAAKSSGYSLSAVVGLPKSHAQWGNYLVLLAIALVSSFVLFSYFSFYKKSKIASSSLGVLMAFLAVSAIGMTYVVGHSGAESVWKYRYEIGKDQQGLP
>NZ_CP016770.1|WP_095696018.1|298599_299013_+|Rieske-2Fe-2S-domain-containing-protein
MEPISRRSFIAGVCAVVALGGSEVPAAANTAVKKLPGGRLSVDLKAVPALAKVGGATRIGSVKGVPVAIARTGTSKYIAFNLLCPHQKVTVTQNEKGWVCNAHGSEFESDGDLALGPATTGLARVPMKISKGLATIG
>NZ_CP016770.1|WP_095675835.1|297978_298590_+|hypothetical-protein
MRKVIVSIATVIGLVLSSNVAFADSAKPGQSMTHMKTAAGVASTLEAAGVILYVQGGATSAVIGDNLSAATSQVVFHIPVTANKAGVQHVGSNIIFFNTANNQYLTLKNPVIDLAKGVVSATVPQAGDAKVDILTITNASTAKPKITNDKKAKQRTTAYAGTTLVLAPGVAATIASVLGLPAGSLPDGLAFGTADVTLYSKLK
>NZ_CP016770.1|WP_095675834.1|297424_297967_+|DUF305-domain-containing-protein
MLKKISVLLLAVGVILIPSGANASTHAKSLQNLGMNEIMFAQMMIPHHEQAISMSETALKKSRNQAILKLSNQIKSLQGTEKSQLAYWLKATDSSMTMDHDMQMSGMLTTKELASLKRLTGTQFDRTFLQLMIKHHQGAIEMLDLISDSRNAEAKALAKAIKSAQSKEITSMKLLLNKLK
>NZ_CP016770.1|WP_095696017.1|296337_296832_+|site-specific-integrase
MRPSLAIPHFETFVLLRFSDQVFEKHLKTAKTYRTIPFPVGLQDLITNHVNRFGLGPHGLLLQNRSGKIWRYKDASAMFREVARPLGLDKGEGLHQLRHTFVSVLIQLNLNAKQIQEWLGHESILETMDRYGHLFPNSLNQASQLLDSHVVLALQNKAEARMLA
>NZ_CP016770.1|WP_150121963.1|295873_296434_-|GIY-YIG-nuclease-family-protein
MSLQSSNAFQILDLRNVAKQKFQNAGLLMTGEYLTARIPVSCICQKCGAKTKQTLNGVMNGKTCKYCYHVGIKYGESAYLYLIIHKEFSSIKVGISNHEANLNRLEAHKKNGWELYKSFDFDTANEAEWFETKLLNWLRRDRQLGVHLVRELMPQGGFSETVDGNEISILEIEQKFLELLEIGMTD
>NZ_CP016770.1|WP_095696015.1|294315_295320_+|NAD(P)-dependent-alcohol-dehydrogenase
MKASFLNKDKNIYVEEIDVPTLDADQVLIRVESVGICGSDVHYYKHGAIGPYVVEKPIILGHELSGVITAVGKDVEKNRIGARVAVEPQRACKVCKQCKAGRYNLCPDIEFYATPPIDGAFCEFVKIQSSFAYDIPANISFDAAALIEPMSVCIWAAQKAGIESGSTVLIAGAGPIGVIMAQVAKAFGAKDVVVTDVIEKRLAFVKGFGATRTINSTTESVGSEKFDVFIDACGVPSAVYAGIKSTGPAGRVLLVGLGSDDMSLPVSHIQNNEILVTGVFRYANTWPIGIDLLASGKVNLDAIVTHHFALNDVEHGLRATASPDAMKVIIHPNN
>NZ_CP016770.1|WP_095696024.1|302381_302864_+|SRPBCC-family-protein
MTSEKVRSEIFDTGNPKIKSARIIVEASPSTIFAILSNPKSHRDIDGSATVTANVSGPEALVLGSKFGMKMRLGITYWITNTVVEYKKDELIAWRHLGRWRWRYELTTLGNGSTQVTESFDGTYAPAVAQVWLNFRKAYPWTQLAVAKTLVRLKAVAEAS
>NZ_CP016770.1|WP_095675844.1|302873_303197_+|hypothetical-protein
MKFLISVIDDLSNSGTPAEMVAIDAFNDQLRANGQWIFAWGMQAPETATVIDNRGGANSETGHPLFDSKEHYSGLWLIEAADAATAKKLAFDASNACNRKVELRPLH
>NZ_CP016770.1|WP_190286211.1|303335_304322_-|DMT-family-transporter
MDELPSFLPMQLMNQLTQVNQSKLISSKYMAVALSKTQRSGLLFAFLGIFAFSLSLPFTKLALKSFDPFFTAFARPVIAAVIAIPLMMIAKVPMLPRNLWKPTAFTAAGAVFGWPILIALALQRTTSAHVSVIAAVMPLVTAIIAVIKHKKHPGLSFWVASSLGTVLLVAFSITRGGGTNADLKTDLLIIGAVIASSYCYVEGAALTSHMPGWQVISWVVVVSLPIALPAAAFVYAQTNADYSFHGDALFGLLAIGLSSMYLGFFAWYRGLRDFGVAHGSQVQQLQAIMTLGWSALLLGETVTLTMALSAIGIVLCVLWALSNVNRVK
>NZ_CP016770.1|WP_095675846.1|304239_305247_-|Gfo/Idh/MocA-family-oxidoreductase
MTQKLRIAIIGAGRIGYVHAGSVNDTPELELVYVVDPFEENAKKVTAAFGGKVSNDPSAVIASGEIDAVIIGSPTATHIPLLRECIAAGVHALCEKPIDLDVKNVEEFRALANSAKTNITLGFNRRQDPQYKALKAKVASGAIGTVEQVILTSRDPGPAPQGYIAVSGGIFRDMTIHDFDMARNFVPDIVEVTAFGANSFCDYIKEEGDFDNISVIMKGSNNELITVVNSRHAAFGYDQRAEIFGDKGMLQISNLSDTTVKSFTKDGTTAGEPFMDFFLERYADSYRNELKLFIEGIKTGKVLGSTYDDGRAALILADAAHESAHTGKSIKVNLK
>NZ_CP016770.1|WP_095675847.1|305283_306306_-|Gfo/Idh/MocA-family-oxidoreductase
MSALPKPHIFTAAESKPLRWGIFGAGWISEAMVKTAQLNSNQQFVAVASRTPGKAEAFAQKWNIDSFHNSYEELAARDDIDAIYLGTLPSDRLEVALVAINAGKHVLIEKPITMDYDEAQQIYAAAKAKKVLAMEAMWTRFLPQMDIARQLVADGALGDVELVVSNFCQNNLGVTRLFTLGGGNPIIDMGIYPAALSQQFLGNPDEIHAFGKLHPNDIDEETHTFMRFANGSRSNFVLSARTTLPHWAGVSGSKGAITFGTPWFTPSSITFHESTFNGAQSTWVDDLGIPEHFGLIYQVHAFAQYVDQGLLEGPLYTHHDSLSNIKTVLEIGNLIGTRYK
>NZ_CP016770.1|WP_095696025.1|306315_307392_-|transaldolase-family-protein
MTQSPFLYMKENSPTVLWNDSADPKELKDALNWGIVGATCNPVIALTAIKADAPHWVSRIKEYAKSHPAATEDEIGWAMVKELSTNAAKLLEGEFEKYNGRNGRLSIQTDPRNFRNAKALAEQAVEFSRLAKNMIVKIPVTTEAISAFEEATYQGVSLNATVSFSVAQTVAVAEAIERGLKRREAEGLDISTMGPVCTIMVGRVDDWVKVSAEKLGSKVDPEILEWSGVAVFRNAHKIYQERGYRTRLLSAAFRNHMHWSEILGGDSVISPPYAWQVKINEMGITPNLNSVNEPIEARILDPLLENFPEFRKMYDVDGLAVEDFTNFGGTLRTLRGFLQSVNDLESFVRDVTVPNPDK
>NZ_CP016770.1|WP_095675849.1|307396_308311_-|TIM-barrel-protein
MTAQIRVGTAPDSWGVWFPSEPHQVPWDRFLDEVVEAGYHWIELGPYGYLPTDPKQLEDELGKRNLKMTAGTVFTGFHKEDESQWQRAWDQALAVANLVSKLGVEHLVVIPDLWRDDKTGQARESRTLSNEQWKRLAAGHNKLGKALLEEFGIHQQFHSHADSHIGTYQEVERYLQETDPKYSNLCLDTGHFAYYLGDNLKLMNAYPERIGYLHLKQVHPDILAETLKNDVPFGDAVAKGVMTEPGFEGVPKFAPIIERALEINPEIFAIIEQDMYGCPVDMPFPIAQRTREHILAATRAARVK
>NZ_CP016770.1|WP_095696026.1|308320_310234_-|3D-(3,5/4)-trihydroxycyclohexane-1,2-dione-acylhydrolase-(decyclizing)
MATRKMTVSQAVVEFLSHQYTVDGDHRERTIQGVFGIFGHGNVAGIGQALKQLSVENPSLMPYYQARNEQAMVHESSAFARMKRRRATFACTASVGPGATNMLTGAAVATTNHLPVLLLPSDTFANRASDPVLQQLEMPHDATLSVNDAFKPLSRFFDRVQRPEQLFSALMGAMRVLTDPVETGAVTICLPEDVQAEMIDVPEEFLADRDWHIRRPRAEAAQLAEVARVITSSKRPFIVAGGGVIYSDAHDALQTFVEQTKIPVGTSQAGVGSLNWDHPQLLGSVGATGTTAANRAAKEADVVIGIGTRYSDFTTSSRTAFQNPDVRFININIASFDAFKHGSALPVVADARESLKELTALLTTFATTSDYQSKYTQEKSEWDAVVDAAFVDQKRALPSQTEIIHAVQSASDATDTLICAAGSLPGDLHKLWRVRSPLGYHVEYAFSCMGYEIAAGLGAARAGATPIVMVGDGSYLMMHTEIVSAVAEGLKVIIVLIQNHGYASIGHLSESIGSERFGTQYRFKDQAGNNFESGEKLPVDLAANAASLGITVIDIKQTTSAIADLHAAVKKAKQSSTSTLIHINSDPLLYSPDGEGWWDVPIAPISTLKSTQDAYAQYKDEISLQRPLLGNGTKDKK
>NZ_CP016770.1|WP_095696027.1|310235_311156_-|5-deoxy-glucuronate-isomerase
MSSADKWYFRHGELSRDGWDVLLDPQSPPVAGWKYTGLRIGTLTESKSLTLPADTNERIIFPLEGQEFLVEYTHDGNSSSQILHGRTSVFHGPADFIYLPINTSATISGVGRIAVGQTPATKVKAVRYVAKEDVSISLRGAGRETRQVHNLGMPETLDADRMIVCEVIVPASNWSGSPSHKHDVYIPGKESELEEIYYFQSAVTRGAKTPPSSLPFGYFRGTSADSRPYDVNEEVHSGDVALVPYGWHGPAAAGPGYDLYFFNVMAGPDPDRAWNATDHPDQVWIRDSWQSQQSDPRLPYGSTERI
>NZ_CP016770.1|WP_095696028.1|311164_312655_-|CoA-acylating-methylmalonate-semialdehyde-dehydrogenase
MSTIVNHWINGAEFVSTSGRTSPVYDPALGVETKRVALANQAEIDAAIKAAKDAFPAWRDESLAKRQQIIFTFRELLNSRKGELAEIITSEHGKVLSDALGEITRGQEVVEFATGIPHLLKGFYSENVSTGVDVYSTRQPLGVVGIISPFNFPAMVPMWFFPIAIAAGNTVVIKPSEKDPSASMWVAKLWKEAGLPDGVFNVLNGDKESVDGLLNSPDVESISFVGSTPIAKYIYESASRTGKRVQALGGAKNHMLVLPDADLELVADSAINAGFGSAGERCMAISVVVAVEPVADKLIPKIVERMGKLRTGDGRRGCDMGPLVTREHRDKVASYIDIAEKDGATVVVDGRNPQVDGDANGFWLAPTLVDKVPTTSKVYTEEIFGPVLSIVRVKSYDEGVALINSGAFGNGTAIFTNDGGAARRFQNEIQVGMVGINVPIPVPVAYYSFGGWKQSLFGDTKAHGVEGVHFFTRGKAITSRWLDPSHGGINLGFPQN

You can click texts colored in the table to view more detailed information

Click the colored protein region to show detailed information

Crispr_ID: NZ_CP016770_2

CRISPR_ID

CRISPR_location

CRISPR_type

Repeat_type

Spacer_info

Cas_protein_info

CRISPR-Cas_info

NZ_CP016770_2

1006759-1006855

Unclear

Consensus_repeat	Method
TGCAGATGTTCTTGAAGAGATGGAT	CRISPRCasFinder

1 spacers

cas3

The CRISPR arrays of NZ_CP016770_2

>merge|NZ_CP016770|2|1006759-1006855|CRISPRCasFinder
TGCAGATGTTCTTGAAGAGATGGATGAGTCCGAGCGCGTTGCACTGATGGCAGAACTTGAAGGCGAACGTGCTGCAGATATTCTTGAAGAGATGGAT

>NZ_CP016770|2|2|1006759-1006855|CRISPRCasFinder
TGCAGATGTTCTTGAAGAGATGGAT	GAGTCCGAGCGCGTTGCACTGATGGCAGAACTTGAAGGCGAACGTGC
TGCAGATATTCTTGAAGAGATGGAT

Protein	Signature genes	Signature genes Name	Protein_function
NZ_CP016770.1\|WP_095696374.1\|1011107_1012589_-\|leucyl-aminopeptidase-family-protein	unknown	unknown	gnl\|CDD\|238247
NZ_CP016770.1\|WP_095676482.1\|1013149_1013599_-\|SRPBCC-family-protein	unknown	unknown	gnl\|CDD\|176854
NZ_CP016770.1\|WP_095696373.1\|1009353_1010472_-\|trypsin-like-peptidase-domain-containing-protein	unknown	unknown	gnl\|CDD\|273938
NZ_CP016770.1\|WP_095696371.1\|1004346_1005192_-\|PHP-domain-containing-protein	unknown	unknown	gnl\|CDD\|223686
NZ_CP016770.1\|WP_095696367.1\|997470_999054_+\|cysteine--tRNA-ligase	unknown	unknown	gnl\|CDD\|234705
NZ_CP016770.1\|WP_095676481.1\|1012896_1013097_-\|sigma-70-family-RNA-polymerase-sigma-factor	unknown	unknown	gnl\|CDD\|274357
NZ_CP016770.1\|WP_095692708.1\|1002376_1003738_+\|DEAD/DEAH-box-helicase	cas3	cd09639_cas3_CAS-I	gnl\|CDD\|223587
NZ_CP016770.1\|WP_095676479.1\|1010484_1011111_-\|class-I-SAM-dependent-methyltransferase	unknown	unknown	gnl\|CDD\|226607
NZ_CP016770.1\|WP_020045748.1\|1012604_1012784_-\|DUF3117-domain-containing-protein	unknown	unknown	gnl\|CDD\|371461
NZ_CP016770.1\|WP_095696370.1\|1002124_1002376_+\|DUF3107-domain-containing-protein	unknown	unknown	gnl\|CDD\|378634
NZ_CP016770.1\|WP_095696369.1\|999366_1000236_-\|N-acetyl-1-D-myo-inositol-2-amino-2-deoxy-alpha--D-glucopyranoside-deacetylase	unknown	unknown	gnl\|CDD\|274584
NZ_CP016770.1\|WP_095676468.1\|1000235_1001417_-\|adenylyltransferase/sulfurtransferase-MoeZ	unknown	unknown	gnl\|CDD\|181156
NZ_CP016770.1\|WP_095676471.1\|1003738_1004350_-\|MarC-family-protein	unknown	unknown	gnl\|CDD\|225006
NZ_CP016770.1\|WP_095696372.1\|1005188_1006100_-\|DMT-family-transporter	unknown	unknown	gnl\|CDD\|223769
NZ_CP016770.1\|WP_095696368.1\|999055_999367_-\|hypothetical-protein	unknown	unknown	unknown
NZ_CP016770.1\|WP_095676477.1\|1009025_1009334_-\|Sec-independent-protein-secretion-pathway-component	unknown	unknown	gnl\|CDD\|179287
NZ_CP016770.1\|WP_095676476.1\|1007910_1009029_-\|Mrp/NBP35-family-ATP-binding-protein	unknown	unknown	gnl\|CDD\|378455
NZ_CP016770.1\|WP_095676475.1\|1007445_1007940_+\|DUF1003-domain-containing-protein	unknown	unknown	gnl\|CDD\|377629
NZ_CP016770.1\|WP_095676483.1\|1013601_1014141_-\|TIGR00730-family-Rossman-fold-protein	unknown	unknown	gnl\|CDD\|129813
NZ_CP016770.1\|WP_095676864.1\|1001472_1002108_+\|TetR/AcrR-family-transcriptional-regulator	unknown	unknown	gnl\|CDD\|366102

Protein	Function_ID	Function_description	E-value
NZ_CP016770.1\|WP_095696374.1\|1011107_1012589_-\|leucyl-aminopeptidase-family-protein	gnl\|CDD\|238247	cd00433, Peptidase_M17, Cytosol aminopeptidase family, N-terminal and catalytic domains. Family M17 contains zinc- and manganese-dependent exopeptidases ( EC 3.4.11.1), including leucine aminopeptidase. They catalyze removal of amino acids from the N-terminus of a protein and play a key role in protein degradation and in the metabolism of biologically active peptides. They do not contain HEXXH motif (which is used as one of the signature patterns to group the peptidase families) in the metal-binding site. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase. The enzyme is a hexamer, with the catalytic domains clustered around the three-fold axis, and the two trimers related to one another by a two-fold rotation. The N-terminal domain is structurally similar to the ADP-ribose binding Macro domain. This family includes proteins from bacteria, archaea, animals and plants.	8.19164e-147
NZ_CP016770.1\|WP_095676482.1\|1013149_1013599_-\|SRPBCC-family-protein	gnl\|CDD\|176854	cd07812, SRPBCC, START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC (SRPBCC) ligand-binding domain superfamily. SRPBCC domains have a deep hydrophobic ligand-binding pocket; they bind diverse ligands. Included in this superfamily are the steroidogenic acute regulatory protein (StAR)-related lipid transfer (START) domains of mammalian STARD1-STARD15, and the C-terminal catalytic domains of the alpha oxygenase subunit of Rieske-type non-heme iron aromatic ring-hydroxylating oxygenases (RHOs_alpha_C), as well as the SRPBCC domains of phosphatidylinositol transfer proteins (PITPs), Bet v 1 (the major pollen allergen of white birch, Betula verrucosa), CoxG, CalC, and related proteins. Other members of this superfamily include PYR/PYL/RCAR plant proteins, the aromatase/cyclase (ARO/CYC) domains of proteins such as Streptomyces glaucescens tetracenomycin, and the SRPBCC domains of Streptococcus mutans Smu.440 and related proteins.	2.19497e-06
NZ_CP016770.1\|WP_095696373.1\|1009353_1010472_-\|trypsin-like-peptidase-domain-containing-protein	gnl\|CDD\|273938	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family. This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures. [Protein fate, Protein folding and stabilization, Protein fate, Degradation of proteins, peptides, and glycopeptides].	4.13388e-92
NZ_CP016770.1\|WP_095696371.1\|1004346_1005192_-\|PHP-domain-containing-protein	gnl\|CDD\|223686	COG0613, COG0613, Predicted metal-dependent phosphoesterases (PHP family) [General function prediction only].	3.25602e-36
NZ_CP016770.1\|WP_095696367.1\|997470_999054_+\|cysteine--tRNA-ligase	gnl\|CDD\|234705	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated.	1.39743e-162
NZ_CP016770.1\|WP_095676481.1\|1012896_1013097_-\|sigma-70-family-RNA-polymerase-sigma-factor	gnl\|CDD\|274357	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family. This model encompasses all varieties of the sigma-70 type sigma factors including the ECF subfamily. A number of sigma factors have names with a different number than 70 (i.e. sigma-38), but in fact, all except for the Sigma-54 family (TIGR02395) are included within this family. Several Pfam models hit segments of these sequences including Sigma-70 region 2 (pfam04542) and Sigma-70, region 4 (pfam04545), but not always above their respective trusted cutoffs.	6.22266e-08
NZ_CP016770.1\|WP_095692708.1\|1002376_1003738_+\|DEAD/DEAH-box-helicase	gnl\|CDD\|223587	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis].	1.80003e-147
NZ_CP016770.1\|WP_095676479.1\|1010484_1011111_-\|class-I-SAM-dependent-methyltransferase	gnl\|CDD\|226607	COG4122, COG4122, Predicted O-methyltransferase [General function prediction only].	4.6079e-31
NZ_CP016770.1\|WP_020045748.1\|1012604_1012784_-\|DUF3117-domain-containing-protein	gnl\|CDD\|371461	pfam11314, DUF3117, Protein of unknown function (DUF3117). This family of proteins with unknown function appears to be restricted to Actinobacteria.	5.34468e-23
NZ_CP016770.1\|WP_095696370.1\|1002124_1002376_+\|DUF3107-domain-containing-protein	gnl\|CDD\|378634	pfam11305, DUF3107, Protein of unknown function (DUF3107). Some members in this family of proteins are annotated as ATP-binding proteins however this cannot be confirmed. Currently no function is known.	1.70428e-20
NZ_CP016770.1\|WP_095696369.1\|999366_1000236_-\|N-acetyl-1-D-myo-inositol-2-amino-2-deoxy-alpha--D-glucopyranoside-deacetylase	gnl\|CDD\|274584	TIGR03445, mycothiol_MshB, N-acetyl-1-D-myo-inositol-2-amino-2-deoxy-alpha-D-glucopyranoside deacetylase. Members of this protein family are N-acetyl-1-D-myo-inositol-2-amino-2-deoxy-alpha-D-glucopyranoside deacetylase, also called 1D-myo-inosityl-2-acetamido-2-deoxy-alpha-D-glucopyranoside deacetylase, the MshB protein of mycothiol biosynthesis in Mycobacterium tuberculosis and related species. [Cellular processes, Detoxification].	9.61357e-121
NZ_CP016770.1\|WP_095676468.1\|1000235_1001417_-\|adenylyltransferase/sulfurtransferase-MoeZ	gnl\|CDD\|181156	PRK07878, PRK07878, molybdopterin biosynthesis-like protein MoeZ; Validated.	0
NZ_CP016770.1\|WP_095676471.1\|1003738_1004350_-\|MarC-family-protein	gnl\|CDD\|225006	COG2095, MarC, Multiple antibiotic transporter [Intracellular trafficking and secretion].	5.92741e-36
NZ_CP016770.1\|WP_095696372.1\|1005188_1006100_-\|DMT-family-transporter	gnl\|CDD\|223769	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only].	5.71813e-11
NZ_CP016770.1\|WP_095676477.1\|1009025_1009334_-\|Sec-independent-protein-secretion-pathway-component	gnl\|CDD\|179287	PRK01371, PRK01371, Sec-independent protein translocase protein TatB.	1.39549e-27
NZ_CP016770.1\|WP_095676476.1\|1007910_1009029_-\|Mrp/NBP35-family-ATP-binding-protein	gnl\|CDD\|378455	pfam10609, ParA, NUBPL iron-transfer P-loop NTPase. This family contains ATPases involved in plasmid partitioning. It also contains the cytosolic Fe-S cluster assembling factor NBP35 which is required for biogenesis and export of both ribosomal subunits.	2.42966e-119
NZ_CP016770.1\|WP_095676475.1\|1007445_1007940_+\|DUF1003-domain-containing-protein	gnl\|CDD\|377629	pfam06210, DUF1003, Protein of unknown function (DUF1003). This family consists of several hypothetical bacterial proteins of unknown function.	1.69535e-44
NZ_CP016770.1\|WP_095676483.1\|1013601_1014141_-\|TIGR00730-family-Rossman-fold-protein	gnl\|CDD\|129813	TIGR00730, LOG_family_protein_YJL055W, TIGR00730 family protein. This model represents one branch of a subfamily of proteins of unknown function. Both PSI-BLAST and weak hits by this model show a low level of similarity to and suggest an evolutionary relationship of the subfamily to the DprA/Smf family of DNA-processing proteins involved in chromosomal transformation with foreign DNA. Both Aquifex aeolicus and Mycobacterium leprae have one member in each of two branches of this subfamily, suggesting that the branches may have distinct functions. [Hypothetical proteins, Conserved].	3.6514e-53
NZ_CP016770.1\|WP_095676864.1\|1001472_1002108_+\|TetR/AcrR-family-transcriptional-regulator	gnl\|CDD\|366102	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family.	5.63466e-15

>NZ_CP016770.1|WP_095696372.1|1005188_1006100_-|DMT-family-transporter
MSEAQKATVHNHTELPARPDLIRLIIGIFGIGSSGPLIALSAMPVPTLIFWRNLGGSLMTLPFALRHKLDRTGVKWAVLAGIVLAVHFVGFFLSMRMTSVTAGTAIVATQPIFAAFFVKLTGGHIPTKAWLGMLISFTGVLVVTGIDLQLDRRSFLGDLAALISGALAAAYMLIGSRAQQTLATTSYTTICYFVCAMTALPMALLSGYDIVGFALREWWILLGLIIGAQILGHTMFNITLKRVSPAVVSMIVFFEVPVAAIVSLVFDIGKQPTLSIIPGVILILLGCILVVLRTRPESVMTEQ
>NZ_CP016770.1|WP_095696371.1|1004346_1005192_-|PHP-domain-containing-protein
MIDLHTHTICSDGTDAPFALVKKALAAGITTLAITDHDSTAGWEEAVSAIQPQIELVLGAEISCLTTDGISVHMLGLLFDGKNSEMQQMLSDSRDTRVPRMRKMVELMSADGINISLDDVYRATPEGATVGRPHLADALVANGVVATRDEAFLDLLNNESKYYVTHAAPTPVDAIEVIRKAGGVAVIAHPFASRRGQIITASTFTDLVAAGLNGIEVHHRDQSADEQSTLTAIAQELNLVITGSSDYHGTGKLNGLAENTTHQAQWEQLESLADARRVVKK
>NZ_CP016770.1|WP_095676471.1|1003738_1004350_-|MarC-family-protein
MNSLGAVTFATQAFVTLFVIMDPPGATPIFLGLVGDKSPRERVRLAWQAAGVSLFVIASFALFGRFILDYMNVSIEALQAAGGLLLLYVALQLLTGNKNTGTENASDNIGMVPLGTPLLAGPGAIVATMIYVQKADTNAQILGLVIAILAVHLIIGTVLMASTKIVGLIKDSGVTLLASIAGLLLAAIAVQMLANAIKAFAAS
>NZ_CP016770.1|WP_095692708.1|1002376_1003738_+|DEAD/DEAH-box-helicase
MSLTFADLPLRKETIDALHEHGFTSPFPIQEMVMPIALADGDVIGQAKTGTGKTLAFGIPVIERVIAPNDADWAQLPNQGKPQVLIVVPTRELCVQVTKDVEELSFNRGIRTLAVYGGRAFEPQIEALNNGVEIVVGTPGRLLDLYRQGQLTLKFVSRVVLDEADEMLDLGFLPDVEKIFTSTPARQQTMLFSATMPGDIIALARRFMNQPVHIRTQDNEDEGAVVSRIEQHVIRAHAMDKIEMLARILQADGRGPTIVFCRTKRTAQKTSDDLFERGFRAATIHGDLGQSAREKALNDFKAGKSDVLIATDVAARGIDIDGITHVINYQCPEDEKTYVHRIGRTARAGAAGIAVTFVDWDDLARWKMIDTALVLGLPEPVETYSSSEHLFEMLNIPAGSSGRMTKKSAAAVDKPKTDRPKSDRPRSEKAVEPKKPAADRIKRERTRTKRISE
>NZ_CP016770.1|WP_095696370.1|1002124_1002376_+|DUF3107-domain-containing-protein
MSSKKSEKAAKVRISIINVGSELSFDCHSTPAEIKSAVTAALTAQTPLSLQDVQGHEIIVPADKIGYVEIGEPAERRVGFGVV
>NZ_CP016770.1|WP_095676864.1|1001472_1002108_+|TetR/AcrR-family-transcriptional-regulator
MTTESATANNSRSDKSRLPRDERRAILLSAALEVFTAAGYHSAAMDEIADRANVSKPVLYQHFPSKLDLYLAVLDLHIDSLVFEIQKAISSTPDNERRVHVTIEAYFNFIENEGEAFRLLFESDMSVEPQVRERLNRMTYDCAAAVSGVISNDTGLPKEAAMMLGVGLIGYVQVTARHWLERDSKLTRQQAMDLVENLMWRGISGFPRTDS
>NZ_CP016770.1|WP_095676468.1|1000235_1001417_-|adenylyltransferase/sulfurtransferase-MoeZ
MKTPPLVTPGPALTVDEVRRYSRHLIIPDVAMAGQQRLMNAKVLCVGAGGLGSPALMYLAAAGVGTLGIVEFDTVDESNLQRQIIHGQSDIGKSKALSAKEKIAEINPYVNVILHETRLDNSNVMEIFSQYDIIVDGTDNFATRYLVNDACVLLKKPYVWGSIYRFDGQASVFWAEYGPCYRCLYPEPPPPGMVPSCAEGGVLGVLCATIGSIQTTEAIKVLTGVGEPLIGSLMVYDALDMTFRKIKVRKDPNCPLCSENATQTALLPDYEAFCGTLSEAAQEASSGSTITVQDLKAKIDNKDNFYLIDVREPSEYEIVNIPTAHLIPKQGFIDGSVLASLPQDKPIVLHCKSGVRSAECLAILKNAGFADASHVFGGVIAWAKQIDTTLPVY
>NZ_CP016770.1|WP_095696369.1|999366_1000236_-|N-acetyl-1-D-myo-inositol-2-amino-2-deoxy-alpha--D-glucopyranoside-deacetylase
MLSSYKGYRMLLVHAHPDDETINNGSTMAMYAALGADVTLVTCTRGEEGEVLVKDLAHLAAHETDSLGEHRVGELADAMKALGISDHRFLGEGEKKYRDSGMMGTEPNNRPDVFWQADLEEASSELVKIMDEVKPHVLITYDEIGGYGHPDHIQAHRVAMRASEKSSWNIEKIYWNVMPRSVIQEGIDAMKKLGSDFMGAEKAEDLPFAKDDSFVHAMVDGNAYVEKKMDAMRAHSTQIEVDGPFFALSNNLGLQVWGNEYYTLVKGEKSEPLDSRGHEMDLFAGVTPS
>NZ_CP016770.1|WP_095696368.1|999055_999367_-|hypothetical-protein
MQFLSSLLFGAMIAVSATLVHQTLPPVGVSVGIFATYLGIWYVGRHYGKRRYKLIALSAWLAVISIAGSFGVGEELLIQGDNQGSALLTIGFVAGVVAVLRNP
>NZ_CP016770.1|WP_095696367.1|997470_999054_+|cysteine--tRNA-ligase
MASMSLRTQIAQALGKRATIRLRDSDGGLRDIVGVLQSETELINRRGEVVNFNPDEAVAFRVIPVFNRRDVSTGSLSIYDTKSKSLHTIAGTDGVVRIYCCGPTVYRDAHVGNLRTFLLSDLISRTLQMTGLDVSLVQNITDVGHMADDFEEDKMLAESAKTKVDPFQIARTFEDRFHIDLERLNIEPAASYPRASEKMAEMITAIEKLIAMKRAYVGSDGSVYFDATSFPTYGALSGNKLDSLQPGHRYEFTDEGGKRFHADWALWKLAGARTQMIWDSPWGAGFPGWHIECSAMSIELLDAHVDLHLGGIDLRFPHHENERAQSNSLTGNETVDTWVHGEHLLFEGRKMSKSAGNVVLLQDVIDRGLDPLSLRFALLENRYRSQMDLSWASLEAAHSTLKRWRQLLSNAGTSAEMKFDQEVSDALTTDLDTPRAMQRIRTIEKDSTIGALDKRALFLFADQVFGLDLDRGVEQREVSSEIQALLDARITARAEKNWSLSDSLRDQLTNAGLEINDGAEGQSWSWK
>NZ_CP016770.1|WP_095676475.1|1007445_1007940_+|DUF1003-domain-containing-protein
MARNFGLDTPRETRRSLRGNIDPETFGRLSERFARFLGTARFLVYMTAFVLTWVLWNTLAPRDIRFDNYPFIFLTLILSLQASYAAPLILLAQNRQADRDRIALNEDRAQNARSIADTEYLTRELASLRIALGDVATRDYLRNELGDMAKEIVVELRKPESDAK
>NZ_CP016770.1|WP_095676476.1|1007910_1009029_-|Mrp/NBP35-family-ATP-binding-protein
MTTLESVHAALATVQDPELHRALPELGMVKSVEIKGSIAHLEILLTISGCPMRDRLQKDIESAVTAVEGISAIELTFGVMDEEQRANIKKLLRNGRESFISFAQKDSLTRVIGVASGKGGVGKSSLTANLAVSSAQKGLRVGILDADVYGHSIPRLMGLIGQRPTAIDQMFIPLESFGVKTVSMEMFKPERSDAIAYRGPLLHRVLEQLLSDAYWGDLDLLYIDLPPGTGDLAISLGQLVPSSEIVVVTTPQVAAAEVAERAGRIAHQIHQRVIGVIENMSAYPCAKCGELTSLFGEGGGEETSRRLSQLVGSDVPLLGKIPFSPDLREGGDAGAPVVISAPDSPSAKAIEAIVSQLIVREKSLLGVRLGLA
>NZ_CP016770.1|WP_095676477.1|1009025_1009334_-|Sec-independent-protein-secretion-pathway-component
MFFDFGAGELVGLAILAMILIGPERLPNLAVDAAKFVKRIREMASKATEELKDNLGPGFEDLKPTDLNPKSFIKKQLSSVLDDDDSTPATSKRTSTIDPDLL
>NZ_CP016770.1|WP_095696373.1|1009353_1010472_-|trypsin-like-peptidase-domain-containing-protein
MSINNGGPWWDAPSKSGLGKNITLRSAIVLALVVGVIAGAFGASSSGSLFGRSVNLVKSTSAIERPAGSVAEIAQRVLPSVVSIEAKSSNGGSTGSGFVIDSSGYILTNNHVIAASVTSGGDITVRLNDGSSFDAKVVGRDSSYDLAVLKIVGASLKALQFGDSDKVAVGDSVIAIGSPLGLSGTVTLGIISAKDRAVTAGESNSENSFINALQTDAAINPGNSGGPLVDATGSVIGVNSAIASLGSSFSSQTGSIGLGFAIPINQARKTADQLIKNGKATYPVMGISVDMNFSGDGAMIAKNAQAILPGGPAAKAGLKSGDIITAIDGRPITSPEELIVTIRSLNIGDSVVVTYKRGSESKSATLTLTASK
>NZ_CP016770.1|WP_095676479.1|1010484_1011111_-|class-I-SAM-dependent-methyltransferase
MNNNPHSYAESFIAEDAVKIAARARGLELGTLDASQGTGAYLRHLAHLLDAQSVVEIGTGSGVGSLWILEGMIASGTLTSIDDEMEHTSIAKLAMADADIAQSRYRFITNSVMDVMTKLTDRAYDLVVYRHNPEDLSFAISEAHRILRSGGVFVIDNFFGGSKVHDPAQRDPKTIALREAGKLIKGDTDSWVSSLIPTGDGLLLATKL
>NZ_CP016770.1|WP_095696374.1|1011107_1012589_-|leucyl-aminopeptidase-family-protein
MLHTVAPDLEALISADVLALGFTKKNDENIELVGSARLISSLEKYFGINLIDEIAFFAPSGKAGELFEIPVLHKDSTVDRLYLVAVGDGSLASLRAAGASLGRKVRGKAIELISLVCQSRAEIRAHGASILLGAYTWNLKTGKPAEIATIAIATKDGASVSEAGVIARALYTARDLIHTPANIKNPLWMAQEAKKIAEEKGLSISVLAGKELAQFGGLRAVGNSSPKPGPRFIEITYIPKGKARSAAALPHVVIVGKGITFDTGGISLKRPYEFMTAMKSDMAGAAAALATISALPDLQPQVKVTVLMMCAENSLSGTSQRPSDVITQYGGTTVEIINTDAEGRLVLADGLAYAVENLDPDYLFDIATLTGSATLGLGRQYAALYTRDEKLAKELVSVGESSGERVWHMPLIDDYQDSLESDVADLNHAADKGDYSAGSVTAALYLEHFVGDSRWVHLDIAGTGRSETDSGENAKGGTGFGVRFFIDWILSLS
>NZ_CP016770.1|WP_020045748.1|1012604_1012784_-|DUF3117-domain-containing-protein
MAAMKPRTGDGPMEVTKEARSLVMRIPLEGGGRLVVELNPQEANNLSAALEAAVALIKK
>NZ_CP016770.1|WP_095676481.1|1012896_1013097_-|sigma-70-family-RNA-polymerase-sigma-factor
MSSSSNPQTLAELLASLPEEERIILTLHYLRSKSSGEIATLLSVPERAVIVVIESGKTRLKAILGL
>NZ_CP016770.1|WP_095676482.1|1013149_1013599_-|SRPBCC-family-protein
MSSNTLSISLTIDAPREVVWKKIADWKSQGDWMLQTKVWVTSNQVEGVGTSIAAFTGPLHKFYPRLKSLGLLDLMVVTQWQPPHRCDVDHVGKVLKGSGSFQLSEINGSSTRFDWSETIVAPKAIFLLAAPFLYVGVRISLARFARSFT
>NZ_CP016770.1|WP_095676483.1|1013601_1014141_-|TIGR00730-family-Rossman-fold-protein
MRIAVFCSSSPTIDSKFIDLAFELGAGIAQSGAELVSGGGHISAMGAISRGARSQGGRTIGIIPQKLVDIEFADHDSDELIVVDSMRTRKAKIEDLSDAFITLPGGLGTLEELFEIWVGRYLKFHDKPVIILDPHGVFQPLHALVEHLENENFVKPGMRDLLHWTTTVEEAVAIAYGKK

You can click texts colored in the table to view more detailed information

Click the colored protein region to show detailed information

Self-targeting detection

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_ID	Protospacer_location	Mismatch	Identity

MGE targeting detection<

CRISPR_ID	Spacer_Info	Spacer_region	Spacer_length	Hit_phage_ID	Hit_phage_def	Protospacer_location	Mismatch	Identity

Prophage detection

Region

Region Position

Protein_number

Hit_taxonomy

Key_proteins

Att_site

Prophage annotation

DBSCAN-SWA_1

615120 : 623049

Cedratvirus(16.67%)

The bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_095676118.1\|615120_615873_+	Fe-S cluster assembly ATPase SufC	A0A285PWH2	Cedratvirus	1.0e-10	30.5
WP_095696176.1\|615875_617123_+	cysteine desulfurase	Q2XUY6	environmental_halophage	5.0e-108	46.7
WP_095696177.1\|617122_617566_+	SUF system NifU family Fe-S cluster assembly protein	A0A2P1CJL8	Mycobacterium_phage	3.8e-18	41.3
WP_095531491.1\|617586_617898_+	metal-sulfur cluster assembly factor	NA	NA	NA	NA
WP_095696178.1\|617917_619783_+	ABC transporter ATP-binding protein	W8CYL7	Bacillus_phage	1.9e-42	27.5
WP_095696179.1\|619744_620467_-	SURF1 family protein	NA	NA	NA	NA
WP_095696180.1\|620459_620732_-	DUF3099 domain-containing protein	NA	NA	NA	NA
WP_095676124.1\|620759_621473_+	3-oxoacyl-ACP reductase FabG	Q06VL0	Trichoplusia_ni_ascovirus	6.8e-09	27.6
WP_095676125.1\|621475_622243_+	enoyl-ACP reductase FabI	NA	NA	NA	NA
WP_190286215.1\|622422_623049_+	translation initiation factor IF-3	A0A2L0UZ54	Agrobacterium_phage	1.2e-12	31.3

DBSCAN-SWA_2

1055486 : 1067480

uncultured_virus(22.22%)

tRNA

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_095696405.1\|1055486_1057730_-	DNA helicase PcrA	A7KV33	Bacillus_phage	2.3e-127	39.5
WP_095676523.1\|1057758_1058484_+	PIG-L family deacetylase	NA	NA	NA	NA
WP_095676867.1\|1058485_1059166_-	peptidylprolyl isomerase	A0A1V0SCU1	Indivirus	1.6e-07	34.1
WP_095676524.1\|1059177_1060728_-	glutamine-hydrolyzing GMP synthase	A0A1V0SH76	Hokovirus	5.4e-11	26.7
WP_095696406.1\|1060717_1061830_-	GuaB3 family IMP dehydrogenase-related protein	A0A0N9Q9A5	Chrysochromulina_ericina_virus	3.6e-09	31.1
WP_095676526.1\|1061832_1062945_-	IMP dehydrogenase	A0A1V0SHK8	Klosneuvirus	4.2e-58	31.8
WP_095676527.1\|1062990_1063890_-	MerR family transcriptional regulator	NA	NA	NA	NA
WP_095676528.1\|1064033_1064336_+	WhiB family transcriptional regulator	A0A0R8V0E7	Thermobifida_phage	2.8e-12	42.3
WP_095676529.1\|1064398_1066033_-	chaperonin GroEL	A0A240F779	uncultured_virus	6.1e-154	56.0
WP_095676530.1\|1066036_1066330_-	co-chaperone GroES	A0A221S3C8	uncultured_virus	3.5e-20	57.1
WP_095676531.1\|1066451_1067480_-\|tRNA	tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex transferase subunit TsaD	A0A0R6PI74	Moraxella_phage	3.4e-62	43.8

DBSCAN-SWA_3

1194968 : 1204376

Acanthocystis_turfacea_Chlorella_virus(16.67%)

protease,tRNA

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_095696456.1\|1194968_1196579_-	L-aspartate oxidase	M1I0Y3	Acanthocystis_turfacea_Chlorella_virus	1.1e-25	30.4
WP_095696457.1\|1196575_1197346_-	pantoate--beta-alanine ligase	NA	NA	NA	NA
WP_095696458.1\|1197396_1197843_-	2-amino-4-hydroxy-6- hydroxymethyldihydropteridine diphosphokinase	NA	NA	NA	NA
WP_095696459.1\|1197839_1198196_-	dihydroneopterin aldolase	NA	NA	NA	NA
WP_095696460.1\|1198192_1198948_-	dihydropteroate synthase	A0A0B5J4J5	Pandoravirus	8.2e-21	29.2
WP_095676658.1\|1198954_1199542_-	GTP cyclohydrolase I FolE	E7DN69	Pneumococcus_phage	2.9e-42	51.4
WP_095696461.1\|1199558_1201622_-\|protease	ATP-dependent zinc metalloprotease FtsH	G9E4U6	Ostreococcus_lucimarinus_virus	4.5e-106	45.2
WP_095676660.1\|1201630_1202182_-	hypoxanthine phosphoribosyltransferase	A0A2K9L634	Tupanvirus	7.6e-08	21.3
WP_095696462.1\|1202201_1203164_-\|tRNA	tRNA lysidine(34) synthetase TilS	NA	NA	NA	NA
WP_095696463.1\|1203173_1204376_-	C40 family peptidase	A0A2L1IW19	Streptomyces_phage	3.4e-13	37.0

DBSCAN-SWA_4

1326937 : 1335996

Bacillus_virus(33.33%)

Protein_ID	Protein_Def	Hit_ID	Hit_Def	E-value	Identity
WP_095696537.1\|1326937_1329436_-	DNA gyrase subunit A	G3M9Z5	Bacillus_virus	4.5e-100	32.3
WP_190286201.1\|1329435_1331406_-	DNA topoisomerase (ATP-hydrolyzing) subunit B	G3M9Z3	Bacillus_virus	7.1e-133	43.6
WP_095696538.1\|1331705_1332428_+	response regulator transcription factor	W8CYM9	Bacillus_phage	1.4e-33	36.0
WP_095696539.1\|1332433_1333903_+	HAMP domain-containing histidine kinase	W8CYF6	Bacillus_phage	2.5e-29	32.1
WP_095676775.1\|1333879_1334803_-	M48 family metallopeptidase	NA	NA	NA	NA
WP_095676776.1\|1334811_1335360_-	LemA family protein	A0A1X9IGG1	Lactococcus_phage	3.6e-10	34.2
WP_095676777.1\|1335417_1335996_-	dCTP deaminase	I4AZP2	Saccharomonospora_phage	4.1e-73	67.7

Anti-CRISPR protein detection

Acr ID	Acr position	Acr size	Homology with known anti	Neighbor HTH/AcRanker	Neighbor Aca	In prophage	Protospacer in prophage

Overview of predicted results

Overview of the results

Cas Category Instructions

Results visualization

1. NZ_CP016770

Click the left colored region to show detailed information

CRISPR-Cas detection and classification

Click the colored protein region to show detailed information

Click the colored protein region to show detailed information

Self-targeting detection

MGE targeting detection<

Prophage detection

Anti-CRISPR protein detection