Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: clem_RS03375
Genome accession: NZ_CP016397
Coordinates: 767974..769485 (-)
Length: 503 a.a.
NCBI ID: WP_094090325.1
UniProt ID: A0A222P0D1
Organism: Legionella clemsonensis strain CDC-D5610
Function: DNA binding (predicted from homology)
DNA binding and uptake
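
As a quick consistency check on this record, the coordinate span, the nucleotide length, and the protein length can be related arithmetically. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon (the nucleotide sequence in this record ends in TGA); the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS of cds_bp bases, assuming one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length(767974, 769485)
print(cds_bp)                  # 1512
print(protein_length(cds_bp))  # 503
```

Both values agree with the lengths reported for the nucleotide and protein sequences in this record.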

Genomic Context


Location: 762974..774485
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
clem_RS03350 (clem_03410) | - | 763884..764309 (+) | 426 | WP_094090320.1 | HIT domain-containing protein | -
clem_RS03355 (clem_03415) | - | 764283..765815 (-) | 1533 | WP_094090321.1 | FMN-binding glutamate synthase family protein | -
clem_RS03360 (clem_03420) | - | 766054..766617 (+) | 564 | WP_094090322.1 | YqgE/AlgH family protein | -
clem_RS03365 (clem_03425) | ruvX | 766640..767059 (+) | 420 | WP_094090323.1 | Holliday junction resolvase RuvX | -
clem_RS03370 (clem_03430) | - | 767063..767959 (+) | 897 | WP_094090324.1 | aspartate carbamoyltransferase catalytic subunit | -
clem_RS03375 (clem_03435) | comM | 767974..769485 (-) | 1512 | WP_094090325.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene
clem_RS03380 (clem_03440) | ubiK | 769571..769825 (-) | 255 | WP_094090326.1 | ubiquinone biosynthesis accessory factor UbiK | -
clem_RS03385 (clem_03445) | - | 769919..770257 (+) | 339 | WP_094090327.1 | P-II family nitrogen regulator | -
clem_RS03390 (clem_03450) | - | 770263..770733 (-) | 471 | WP_232505548.1 | EVE domain-containing protein | -
clem_RS03395 (clem_03455) | - | 770730..771302 (-) | 573 | WP_094090329.1 | 5-formyltetrahydrofolate cyclo-ligase | -
clem_RS03400 (clem_03460) | - | 771496..771675 (+) | 180 | WP_094090330.1 | PA3496 family putative envelope integrity protein | -
clem_RS03405 (clem_03465) | - | 771678..772496 (+) | 819 | WP_094090331.1 | aminotransferase class IV | -
clem_RS03410 (clem_03470) | - | 772438..773130 (-) | 693 | WP_094090332.1 | TVP38/TMEM64 family protein | -
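
The Location range given for the genomic context is numerically consistent with a 5,000 bp flank on each side of the comM coordinates. This is an observation from the numbers in this record, not a documented database parameter, so the `flank` default below is an assumption and the function name is illustrative.

```python
def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a 1-based inclusive range by `flank` bp on both sides.

    The lower bound is clamped at 1 so the window never starts before
    the first base of the replicon.
    """
    return max(1, start - flank), end + flank


print(context_window(767974, 769485))  # (762974, 774485)
```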

Sequence


Protein


Length: 503 a.a.; Molecular weight: 54818.04 Da; Isoelectric point: 8.4888

>NTDB_id=187963 clem_RS03375 WP_094090325.1 767974..769485(-) (comM) [Legionella clemsonensis strain CDC-D5610]
MNLAFSKTRSTVGILAQSVSVEVHLSNGLPSFTIVGLAETAVKESKDRVRSAIINSQFEFPCRKITVNLAPADLPKSGSG
FDLPIAVGILAASGQLPVDKLATHEFISELALSGNLRGVSAIIPAVLAVRRDNQKLVIATANAAEASLAGYNDVFSANNL
REVCSYLCQNTPLKVLPARPETTYVNGKMDWSDIKGQYHAKRAMEIAACGGHSILLSGPPGSGKTMLAKRFATLLPDLSE
TQALECAAIKSIRGRLPDFNSWRSPPFRSPHHTASQVALVGGGNPPKPGEISLAHNGVLFLDELPEFHKQVLETLREPLE
SGNIWISRAATQIEFPAQFQLVAAMNPCPCGQWGNPQASCLCSPERITRYLAKLSAPLLDRIDMQITLQALTQEELIKPN
LTTAGESKRIRQTVEQVRARQLSRQNCINAQLDAKDCEEFCQLSQAEQGFLSEVMNQLKLSARAYHRLLKVARTIADMNN
LEKVDLSALQQALSFRQNLQLPK

Nucleotide


Length: 1512 bp

>NTDB_id=187963 clem_RS03375 WP_094090325.1 767974..769485(-) (comM) [Legionella clemsonensis strain CDC-D5610]
ATGAATCTCGCTTTTAGCAAAACGCGTAGTACTGTAGGTATACTCGCGCAGTCTGTTTCTGTCGAAGTCCATTTATCCAA
TGGCTTGCCCAGCTTCACAATTGTGGGGCTTGCCGAAACTGCTGTTAAGGAAAGCAAAGACAGAGTTCGTAGTGCAATCA
TTAATAGTCAATTTGAGTTTCCCTGTCGTAAAATCACAGTCAATCTTGCTCCTGCCGATTTACCCAAATCAGGGAGTGGT
TTTGATTTACCTATTGCCGTAGGTATTCTTGCAGCTTCAGGCCAGCTACCTGTAGATAAGTTAGCTACACATGAATTTAT
TAGTGAACTCGCCTTGAGTGGTAATTTGCGTGGTGTATCCGCTATCATTCCTGCAGTCCTGGCTGTGCGGCGGGATAATC
AAAAATTAGTAATTGCTACAGCTAATGCTGCAGAAGCCTCACTGGCAGGCTATAATGACGTGTTTAGTGCCAATAACTTG
CGCGAGGTGTGCAGTTATCTATGTCAAAATACACCGCTTAAAGTCCTACCTGCGCGTCCTGAAACTACCTATGTAAATGG
GAAAATGGATTGGTCTGATATTAAGGGTCAGTATCATGCAAAGCGAGCGATGGAAATTGCGGCTTGTGGAGGTCATAGTA
TTTTATTAAGCGGACCTCCCGGGAGTGGTAAAACCATGTTGGCCAAACGCTTCGCTACCCTCCTTCCAGATCTTAGCGAA
ACTCAAGCACTTGAATGTGCTGCCATTAAGTCCATTCGTGGACGGCTTCCAGATTTTAATAGCTGGCGCTCTCCACCATT
TCGTTCGCCACATCACACAGCCTCCCAAGTTGCACTAGTAGGTGGAGGTAATCCACCAAAGCCAGGGGAGATTTCACTGG
CCCATAATGGCGTATTATTTCTTGATGAATTGCCTGAGTTTCATAAGCAAGTACTGGAAACCTTACGTGAACCCCTGGAA
TCAGGGAATATCTGGATTTCCCGGGCAGCGACTCAAATTGAATTTCCTGCCCAATTTCAACTTGTTGCTGCGATGAATCC
TTGCCCTTGTGGTCAGTGGGGGAATCCTCAAGCAAGCTGTCTTTGTAGTCCTGAACGCATTACTCGTTATTTGGCAAAAT
TATCAGCTCCACTGCTTGACAGAATTGATATGCAAATAACCTTGCAAGCATTAACACAAGAGGAATTAATTAAACCCAAT
CTTACTACTGCAGGAGAAAGCAAACGGATCAGACAAACTGTTGAACAAGTTAGAGCGCGTCAGCTAAGCCGACAAAATTG
TATTAATGCCCAACTTGACGCTAAAGATTGTGAAGAATTCTGTCAATTAAGCCAGGCAGAACAAGGGTTTTTAAGTGAAG
TCATGAACCAGCTTAAATTATCGGCACGTGCCTACCACCGTCTTCTGAAAGTGGCAAGAACTATTGCCGATATGAATAAT
CTGGAGAAAGTAGACTTAAGTGCCCTGCAGCAAGCTTTATCGTTCAGGCAAAATTTACAACTACCGAAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source: AlphaFold DB
ID: A0A222P0D1

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comM | Legionella pneumophila str. Paris | 75.746 | 100 | 0.757
comM | Legionella pneumophila strain ERS1305867 | 75.746 | 100 | 0.757
comM | Vibrio cholerae strain A1552 | 51.509 | 98.807 | 0.509
comM | Haemophilus influenzae Rd KW20 | 50.198 | 100 | 0.505
comM | Vibrio campbellii strain DS40M4 | 50 | 99.404 | 0.497
comM | Glaesserella parasuis strain SC1401 | 48.6 | 99.404 | 0.483
RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 43.227 | 99.801 | 0.431


Multiple sequence alignment