Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   NWAT_RS08825 Genome accession   NC_014315
Coordinates   1991345..1992862 (-) Length   505 a.a.
NCBI ID   WP_013220752.1    Uniprot ID   D8K6Y3
Organism   Nitrosococcus watsonii C-113     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 1986345..1997862
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NWAT_RS08810 (Nwat_1798) - 1986582..1987160 (+) 579 WP_013220749.1 cytochrome c5 family protein -
  NWAT_RS08815 (Nwat_1799) ilvD 1987275..1989128 (-) 1854 WP_013220750.1 dihydroxy-acid dehydratase -
  NWAT_RS08820 (Nwat_1800) rep 1989355..1991358 (-) 2004 WP_013220751.1 DNA helicase Rep -
  NWAT_RS08825 (Nwat_1801) comM 1991345..1992862 (-) 1518 WP_013220752.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  NWAT_RS08830 (Nwat_1802) ubiK 1993089..1993388 (-) 300 WP_013220753.1 ubiquinone biosynthesis accessory factor UbiK -
  NWAT_RS08835 (Nwat_1803) - 1993583..1994758 (+) 1176 WP_013220754.1 porin -
  NWAT_RS08840 (Nwat_1805) - 1995278..1995727 (+) 450 WP_232420083.1 VOC family protein -
  NWAT_RS08845 (Nwat_1806) - 1996178..1996858 (-) 681 WP_013220756.1 FkbM family methyltransferase -
  NWAT_RS08850 (Nwat_1807) - 1996880..1997761 (-) 882 WP_049772969.1 hypothetical protein -

Sequence


Protein


Download         Length: 505 a.a.        Molecular weight: 54945.10 Da        Isoelectric Point: 8.4905

>NTDB_id=37776 NWAT_RS08825 WP_013220752.1 1991345..1992862(-) (comM) [Nitrosococcus watsonii C-113]
MSLAIAYSRAQASLDAPLVTVEVHLSNGLPAFSIVGLPETAVKESRERVRGALLNCHFEFPARRITVNLAPADLPKEGGR
FDLAIALGILAASGQISSSELKAYEFAGELALSGKVRSIRGVLPVALQTAKAGRSLVVAEENAPEAVLVSKVEVLGVSHL
LEICQHLRGESRLTPFTPNPLKVVTDKKGDIADIRGQYHAKRALEVAAAGAHNLLMIGPPGTGKTMLASRLPGLLPEMAE
AEALESATVQSISSQGFDVSRWRQRPFRAPHHTASGVALVGGGGQPRPGEVSLAHHGVLFLDELPEFERRVLEVLREPLE
SGWIIISRAAQQAEFPARVQLVAAMNPCPCGYLGDSKGRCRCTMEQVQRYRARISGPLLDRIDIQIEVPPVPLHQLRTES
ENGMETSCQVQARVEAARERQLARSGQPNSRLSNREVEQICRLGEKDYQLLERALEQLGLSARAYHRILKVARTIADLEG
SDTIRTPHLSEAIGYRRLDRSLVKS

Nucleotide


Download         Length: 1518 bp        

>NTDB_id=37776 NWAT_RS08825 WP_013220752.1 1991345..1992862(-) (comM) [Nitrosococcus watsonii C-113]
ATGTCGCTAGCAATTGCCTATAGCCGGGCCCAGGCGAGTCTGGATGCCCCTTTAGTGACCGTGGAAGTCCACCTCTCAAA
TGGCCTCCCTGCTTTCTCTATTGTGGGCCTGCCGGAAACTGCTGTTAAGGAGAGCAGAGAGAGGGTGCGGGGTGCGCTGC
TTAACTGTCATTTTGAATTTCCAGCGCGTCGTATTACGGTAAATCTGGCACCTGCGGATCTGCCCAAGGAAGGGGGGCGC
TTTGATTTGGCGATTGCTTTGGGTATTTTAGCGGCCTCGGGGCAGATTTCTTCCTCTGAGTTAAAGGCTTATGAATTTGC
CGGCGAGCTTGCCTTAAGTGGAAAGGTGCGAAGTATCCGTGGAGTGTTACCCGTCGCATTGCAAACGGCAAAAGCGGGTC
GTAGCCTGGTGGTTGCTGAAGAAAATGCTCCCGAAGCGGTCCTCGTGTCCAAGGTTGAAGTATTAGGGGTTTCTCATCTG
TTAGAGATTTGCCAACATCTTCGAGGTGAGTCGCGATTAACTCCCTTTACTCCCAATCCCCTTAAAGTGGTTACTGATAA
AAAGGGGGATATTGCAGATATTCGGGGCCAGTACCATGCCAAACGAGCGCTGGAGGTGGCAGCCGCAGGGGCTCATAATC
TGTTAATGATTGGTCCACCGGGAACTGGCAAGACCATGCTGGCCAGCCGTCTGCCAGGGCTTTTGCCTGAGATGGCCGAG
GCCGAAGCCCTGGAAAGCGCTACTGTGCAATCTATCAGCAGCCAAGGTTTTGATGTTAGCCGCTGGCGCCAACGGCCTTT
CCGAGCGCCCCATCATACGGCTTCCGGGGTAGCCTTGGTAGGCGGGGGCGGGCAGCCACGGCCAGGAGAGGTATCTTTGG
CCCATCATGGCGTGCTCTTTCTCGATGAGTTGCCAGAATTTGAGCGGCGGGTACTGGAAGTCCTTAGGGAGCCTCTGGAA
TCGGGCTGGATTATCATTTCCCGCGCGGCCCAGCAAGCCGAGTTTCCGGCCCGAGTTCAGTTGGTAGCCGCCATGAATCC
TTGCCCCTGTGGCTATTTAGGCGATTCTAAAGGCCGTTGCCGATGCACCATGGAGCAAGTACAACGTTACCGAGCGCGGA
TTTCCGGGCCTTTATTAGATCGCATCGATATACAAATCGAGGTGCCGCCCGTACCCTTGCATCAGTTGCGAACCGAAAGT
GAGAATGGGATGGAAACGAGTTGCCAGGTTCAAGCCCGGGTGGAAGCAGCGCGGGAGCGCCAGCTAGCCCGTTCTGGGCA
ACCTAACAGCAGGTTAAGCAACCGGGAAGTAGAACAGATTTGCCGCCTTGGAGAGAAGGATTATCAGCTATTGGAGCGGG
CCTTGGAGCAGTTAGGGCTTTCGGCTCGTGCCTACCATCGTATATTGAAAGTTGCCCGGACGATTGCCGATTTGGAAGGA
AGCGACACTATTCGCACGCCCCATCTTTCCGAAGCGATTGGTTACCGGCGATTAGACCGTTCCCTTGTCAAATCTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB D8K6Y3

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

57.968

99.406

0.576

  comM Haemophilus influenzae Rd KW20

56.89

100

0.572

  comM Vibrio campbellii strain DS40M4

56.773

99.406

0.564

  comM Glaesserella parasuis strain SC1401

54.743

100

0.549

  comM Legionella pneumophila str. Paris

50.704

98.416

0.499

  comM Legionella pneumophila strain ERS1305867

50.704

98.416

0.499

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.289

100

0.469


Multiple sequence alignment