Detailed information    

in silico: bioinformatically predicted

Overview


Name   comM
Type   Machinery gene
Locus tag   H7A79_RS08575
Genome accession   NZ_CP060414
Coordinates   1648211..1649701 (+)
Length   496 a.a.
NCBI ID   WP_135036235.1
UniProt ID   -
Organism   Neisseria musculi strain NW831
Function   ssDNA binding (predicted from homology)
DNA processing

Genomic Context


Location: 1643211..1654701
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  H7A79_RS08555 (H7A79_1582) - 1645052..1645702 (-) 651 WP_186999953.1 hypothetical protein -
  H7A79_RS08560 (H7A79_1583) - 1646333..1647133 (+) 801 WP_135036244.1 NUDIX domain-containing protein -
  H7A79_RS08565 (H7A79_1584) - 1647149..1647586 (+) 438 WP_187001671.1 hypothetical protein -
  H7A79_RS08570 (H7A79_1585) - 1647909..1648184 (+) 276 WP_186999954.1 accessory factor UbiK family protein -
  H7A79_RS08575 (H7A79_1586) comM 1648211..1649701 (+) 1491 WP_135036235.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  H7A79_RS08580 (H7A79_1587) - 1649805..1650635 (+) 831 WP_186999955.1 SPOR domain-containing protein -
  H7A79_RS08585 (H7A79_1588) dsbA2 1650655..1651356 (+) 702 WP_186999956.1 thiol:disulfide interchange protein DsbA/DsbL Machinery gene
  H7A79_RS08590 (H7A79_1589) - 1651556..1652383 (+) 828 WP_135036226.1 undecaprenyl-diphosphate phosphatase -
  H7A79_RS08595 (H7A79_1590) - 1652635..1653324 (-) 690 WP_186999957.1 SprT family zinc-dependent metalloprotease -
  H7A79_RS08600 (H7A79_1591) - 1653454..1654188 (-) 735 WP_186999958.1 lysophospholipid acyltransferase family protein -
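
The coordinates in this record are 1-based and inclusive, so the sizes can be checked with simple arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and the 5,000 bp flank is an assumption inferred from the numbers shown (the Location range equals the comM coordinates extended by 5,000 bp on each side), not a documented parameter of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive interval start..end."""
    return end - start + 1


def flanking_window(start: int, end: int, flank: int = 5000) -> tuple:
    """Extend an interval by `flank` bp on each side (assumed flank size)."""
    return max(1, start - flank), end + flank


# comM coordinates as listed in the record.
COMM_START, COMM_END = 1648211, 1649701

gene_bp = span_length(COMM_START, COMM_END)
# One codon per residue plus one stop codon.
protein_aa = gene_bp // 3 - 1
window = flanking_window(COMM_START, COMM_END)

print(gene_bp, protein_aa, window)
# → 1491 496 (1643211, 1654701)
```

The same `span_length` check reproduces the Size (bp) column for the neighbouring genes, for example 1645052..1645702 gives 651.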

Sequence


Protein


Length: 496 a.a.        Molecular weight: 53209.25 Da        Isoelectric Point: 9.2825

>NTDB_id=475834 H7A79_RS08575 WP_135036235.1 1648211..1649701(+) (comM) [Neisseria musculi strain NW831]
MSLALVYSRALSGMNAPLVEVEAHLANGLPAFNIVGLPDTEVKESRDRVRAAIIQSGFEFPAKKITVNLAPADLPKESGR
FDLPIALGILAASGQMMADKLAQYEFAGELALSGTLRPVRGALAMAWQGMKAGRAFILPQQNAEQAAIINGITVYGAGSL
GQVAAHLNAVEPLAQTRTGIGQRPSENRSLPDLKDVKGQHTARLALEIAAAGGHSLLMMGPPGTGKSMLAQRLPSILPPL
TDNELIEVWALRSLLPNHRQELHHSRPFQSPHHTSSPVAVVGGSSGPGEISLAHNGVLFFDELPEFDRKVLEVLREPLES
GEIHISRAMHKAVYPAKFQLVAAMNPCPCGYLGHPAKPCRCTPESIARYRGKISGPLLDRIDLTIEVPALSAAELMQQQA
GESSAEVSARVRAARERQYARQGKINAALSVTELDETAAVSKEAHEALGSLLEKLSLSARSYHRIMRVARTLADLNGDKS
VGRAHVLRAVSFRRAL
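
The FASTA header above packs several fields into one line. A minimal sketch of how it might be split into named fields follows; the regular expression and the `parse_header` helper are hypothetical and assume that every header follows exactly the layout seen in this record.

```python
import re
from typing import Optional

# Pattern modelled on the single header shown in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> Optional[dict]:
    """Return the header fields as a dict, or None if the line does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        return None
    record = match.groupdict()
    # Coordinates are easier to work with as integers.
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (">NTDB_id=475834 H7A79_RS08575 WP_135036235.1 "
          "1648211..1649701(+) (comM) [Neisseria musculi strain NW831]")
fields = parse_header(header)
print(fields["gene"], fields["start"], fields["end"], fields["strand"])
# → comM 1648211 1649701 +
```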

Nucleotide


Length: 1491 bp

>NTDB_id=475834 H7A79_RS08575 WP_135036235.1 1648211..1649701(+) (comM) [Neisseria musculi strain NW831]
ATGTCGTTAGCCCTTGTTTACAGCCGCGCCTTAAGCGGCATGAATGCGCCGTTGGTGGAGGTGGAAGCCCATCTTGCCAA
CGGCCTGCCTGCGTTTAATATTGTCGGCCTGCCCGACACCGAAGTCAAAGAAAGCCGCGACCGCGTGCGCGCGGCCATTA
TCCAGAGCGGCTTTGAATTTCCCGCCAAAAAAATCACCGTCAACCTCGCCCCCGCCGACCTGCCCAAAGAATCCGGCCGC
TTCGACCTGCCGATTGCCCTGGGCATACTGGCCGCCTCGGGGCAGATGATGGCCGACAAACTGGCGCAATACGAATTTGC
AGGCGAGCTGGCCTTATCGGGCACGCTGCGCCCCGTGCGGGGTGCGCTGGCAATGGCCTGGCAGGGCATGAAAGCCGGCC
GCGCCTTTATTCTGCCGCAGCAAAATGCCGAACAGGCCGCCATCATCAACGGTATCACGGTTTACGGCGCCGGCAGTTTG
GGGCAGGTGGCGGCACATCTGAATGCCGTTGAACCGTTGGCACAAACCCGAACCGGTATCGGGCAGAGGCCGTCTGAAAA
CCGCAGCCTGCCCGATTTGAAAGACGTGAAAGGCCAGCACACCGCCCGCCTGGCCTTGGAAATTGCCGCCGCCGGCGGCC
ACAGCCTGCTGATGATGGGGCCGCCCGGCACGGGAAAATCCATGCTCGCCCAACGCCTGCCCTCCATTCTGCCGCCCCTA
ACCGACAACGAACTCATCGAAGTATGGGCGCTGCGCTCGCTGCTGCCCAACCACCGGCAGGAATTACACCACAGCCGTCC
GTTCCAGTCGCCCCATCATACTTCCAGCCCCGTGGCAGTGGTAGGCGGAAGCTCCGGCCCCGGCGAAATCTCGCTGGCCC
ACAACGGCGTTTTGTTTTTTGACGAGCTGCCCGAGTTTGACCGCAAAGTACTGGAAGTGTTGCGCGAACCGCTGGAAAGC
GGCGAAATCCACATTTCCCGCGCGATGCACAAAGCCGTTTATCCGGCCAAATTCCAATTGGTAGCCGCCATGAACCCCTG
CCCCTGCGGCTATCTCGGCCATCCCGCCAAACCCTGCCGCTGCACACCCGAAAGCATCGCCCGCTACAGGGGCAAAATAT
CCGGCCCGCTGCTCGACCGCATCGATCTGACCATCGAAGTGCCCGCACTTTCTGCCGCCGAGCTGATGCAGCAGCAAGCC
GGCGAAAGCAGCGCCGAAGTGTCGGCGCGCGTACGGGCTGCACGCGAAAGGCAGTATGCGCGGCAGGGCAAAATCAATGC
GGCTCTGAGTGTTACGGAACTGGACGAAACAGCCGCGGTCTCTAAAGAAGCGCACGAAGCACTGGGCAGCCTGCTCGAAA
AACTCTCGCTTTCCGCCCGCAGCTACCACCGCATCATGCGCGTGGCGCGCACGCTGGCCGATCTGAACGGCGATAAAAGT
GTCGGCCGCGCCCATGTGTTACGCGCAGTAAGTTTTCGTCGTGCTTTGTAA
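
The nucleotide and protein sequences can be cross-checked by translating the coding sequence. The sketch below is one way to do this with the Python standard library only. It assumes the standard codon assignments, and the `translate` helper is hypothetical rather than part of any tool named on this page. Only the first 78 nucleotides are translated here to keep the example short.

```python
from itertools import product

# Codon table in TCAG order; '*' marks a stop codon (standard assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 78 nt (26 codons) of the comM sequence above.
first_codons = (
    "ATGTCGTTAGCCCTTGTTTACAGCCGCGCCTTAAGCGGC"
    "ATGAATGCGCCGTTGGTGGAGGTGGAAGCCCATCTTGCC"
)
print(translate(first_codons))
# → MSLALVYSRALSGMNAPLVEVEAHLA
```

The output matches the start of the protein sequence listed under Protein. Applied to the full 1491 bp sequence, the same function would be expected to stop at the terminal TAA codon.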


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552 52.695 100 0.532
  comM Vibrio campbellii strain DS40M4 51.6 100 0.52
  comM Haemophilus influenzae Rd KW20 50.696 100 0.514
  comM Glaesserella parasuis strain SC1401 50.199 100 0.508
  comM Legionella pneumophila str. Paris 47.431 100 0.484
  comM Legionella pneumophila strain ERS1305867 47.431 100 0.484
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 44.732 100 0.454