Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   GH742_RS04210 Genome accession   NZ_CP045732
Coordinates   889854..891365 (-) Length   503 a.a.
NCBI ID   WP_203456237.1    Uniprot ID   -
Organism   Legionella sp. MW5194     
Function   DNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 884854..896365
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GH742_RS04175 (GH742_04160) - 885360..885581 (-) 222 WP_203456230.1 cold-shock protein -
  GH742_RS04180 (GH742_04165) - 885830..886792 (-) 963 WP_203456231.1 metal-dependent hydrolase -
  GH742_RS04185 (GH742_04170) - 886911..887336 (+) 426 WP_203456232.1 HIT domain-containing protein -
  GH742_RS04190 (GH742_04175) - 887442..888005 (+) 564 WP_203456233.1 YqgE/AlgH family protein -
  GH742_RS04195 (GH742_04180) ruvX 888038..888457 (+) 420 WP_203456234.1 Holliday junction resolvase RuvX -
  GH742_RS04200 (GH742_04185) - 888468..889388 (+) 921 WP_203456235.1 aspartate carbamoyltransferase catalytic subunit -
  GH742_RS04205 (GH742_04190) - 889371..889781 (-) 411 WP_203456236.1 YidH family protein -
  GH742_RS04210 (GH742_04195) comM 889854..891365 (-) 1512 WP_203456237.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  GH742_RS04215 (GH742_04200) - 891440..891691 (-) 252 WP_203456238.1 accessory factor UbiK family protein -
  GH742_RS04220 (GH742_04205) - 891791..892132 (+) 342 WP_058525665.1 P-II family nitrogen regulator -
  GH742_RS04225 (GH742_04210) - 892129..892596 (-) 468 WP_203456239.1 EVE domain-containing protein -
  GH742_RS04230 (GH742_04215) - 892596..893168 (-) 573 WP_203456240.1 5-formyltetrahydrofolate cyclo-ligase -
  GH742_RS04235 (GH742_04220) - 893362..893556 (+) 195 WP_108292377.1 PA3496 family putative envelope integrity protein -
  GH742_RS04240 (GH742_04225) - 893559..894371 (+) 813 WP_203456241.1 aminotransferase class IV -
  GH742_RS04245 (GH742_04230) - 894804..895049 (-) 246 WP_203456242.1 hypothetical protein -
  GH742_RS04250 (GH742_04235) - 895321..895593 (+) 273 WP_239005275.1 hypothetical protein -
  GH742_RS04255 (GH742_04240) - 895594..895974 (+) 381 WP_203456243.1 hypothetical protein -
  GH742_RS04260 (GH742_04245) - 895946..896263 (+) 318 WP_203456244.1 hypothetical protein -

Sequence


Protein


Download         Length: 503 a.a.        Molecular weight: 55077.52 Da        Isoelectric Point: 8.5586

>NTDB_id=396424 GH742_RS04210 WP_203456237.1 889854..891365(-) (comM) [Legionella sp. MW5194]
MNLALTYTRSAQGIHARAVHVEVHLSNGLPQFTIVGLAETAVKESKDRVRSAIINSQFEFPCRKITVNLAPADLPKTGSG
FDLPIALGILAASGQIPSEGLTSHEFISELALSGELRGHTPIIPSVMAARREQRRLIIAEANAREAALTAYDQVFSAGNL
RQVCDYLLHQTPLNAMPALPPLPAVEPGLDWSDVKGQYHAKQAMAIAACGGHSLLLSGPPGSGKTMLAKRFTTLLPELSE
TQALECAAIHSLRGKVPPYEHWRIPPFRSPHHTASPVALVGGGSPPKPGEISLAHHGILFLDELPEFPKQVLETLRQPLE
SGLISISRAAMQTDFPAEFQFIAAMNPCPCGQWGNPRANCLCTPERIKRYLGKLSAPLLDRIDMQVNVQALSQQELLKAS
PTREGESQRIRLQVQESRAVQISRQGKLNARLDSKTCEQVCYLGREEKVFLSQVLDTLQLSARAYHRFLKVARTIADMNG
EEQVKRPALQQALSFKQCLQMPQ

Nucleotide


Download         Length: 1512 bp        

>NTDB_id=396424 GH742_RS04210 WP_203456237.1 889854..891365(-) (comM) [Legionella sp. MW5194]
ATGAATCTCGCGTTAACCTATACCCGTAGCGCACAGGGCATTCATGCCAGGGCGGTACATGTCGAAGTGCATTTATCCAA
TGGTCTGCCGCAATTTACCATCGTCGGTCTTGCTGAAACTGCCGTGAAGGAAAGCAAGGATCGCGTTCGCAGCGCCATCA
TTAACAGTCAATTCGAATTTCCCTGCCGTAAAATCACCGTCAATCTTGCGCCGGCTGATTTACCCAAAACCGGCAGCGGC
TTTGATTTACCCATTGCCTTGGGCATCCTGGCTGCATCCGGGCAAATTCCCTCGGAAGGATTGACCTCCCATGAGTTTAT
CAGTGAGTTGGCGTTAAGCGGTGAATTACGCGGCCATACACCCATTATTCCCAGCGTGATGGCTGCCCGCCGCGAGCAGC
GGCGCTTGATCATTGCCGAAGCAAATGCACGTGAAGCCGCGCTTACCGCCTACGATCAGGTATTCAGTGCGGGCAACCTG
CGGCAGGTGTGTGATTATCTTCTTCACCAGACACCGCTTAATGCGATGCCGGCTCTGCCTCCCCTTCCTGCCGTTGAGCC
GGGATTGGATTGGTCAGATGTTAAAGGGCAGTACCACGCCAAACAGGCCATGGCCATCGCAGCCTGCGGCGGGCATAGTC
TTTTATTAAGCGGTCCACCGGGCAGCGGTAAAACCATGCTGGCGAAACGCTTCACCACCCTGCTTCCAGAATTGAGCGAA
ACGCAGGCTTTAGAATGCGCTGCGATCCATTCGCTTCGCGGCAAAGTGCCACCCTATGAACACTGGCGCATTCCTCCTTT
CCGCTCCCCTCATCATACAGCCTCACCCGTCGCGCTGGTGGGCGGAGGCAGTCCGCCCAAACCCGGGGAAATTTCATTAG
CACACCACGGCATCCTTTTTCTTGATGAGTTGCCTGAGTTCCCTAAACAGGTGTTGGAGACCCTGCGCCAACCCCTTGAA
TCAGGCCTTATTTCCATTTCACGTGCCGCCATGCAGACTGATTTTCCGGCCGAGTTCCAGTTCATTGCCGCCATGAATCC
CTGCCCCTGTGGCCAATGGGGCAATCCCAGGGCCAATTGCCTGTGTACCCCGGAACGAATTAAACGCTACCTCGGGAAAT
TGTCCGCTCCGCTTCTCGATCGCATTGACATGCAGGTGAATGTGCAGGCATTGTCACAACAGGAATTACTCAAGGCGAGC
CCCACCCGGGAAGGCGAAAGTCAGCGCATTCGGCTTCAGGTGCAAGAGTCACGCGCCGTGCAAATCAGTCGGCAGGGCAA
GCTCAATGCCCGGCTTGACAGCAAAACCTGTGAACAGGTTTGTTATCTTGGCAGGGAAGAAAAGGTGTTCCTGTCACAGG
TACTGGACACACTGCAACTGTCTGCCCGCGCTTACCATCGCTTTTTGAAGGTAGCCCGTACCATTGCCGACATGAACGGT
GAAGAACAGGTTAAACGGCCCGCCCTGCAACAAGCCCTGTCGTTTAAACAGTGTTTGCAGATGCCGCAGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Legionella pneumophila str. Paris

70.775

100

0.708

  comM Legionella pneumophila strain ERS1305867

70.775

100

0.708

  comM Vibrio cholerae strain A1552

51.509

98.807

0.509

  comM Haemophilus influenzae Rd KW20

49.704

100

0.501

  comM Vibrio campbellii strain DS40M4

50.602

99.006

0.501

  comM Glaesserella parasuis strain SC1401

49.9

99.205

0.495

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.083

100

0.433


Multiple sequence alignment