Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   HH212_RS04610 Genome accession   NZ_CP051685
Coordinates   1069118..1070635 (+) Length   505 a.a.
NCBI ID   WP_169434299.1    Uniprot ID   A0A7Z2VV03
Organism   Massilia forsythiae strain GN2-R2     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1064118..1075635
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  HH212_RS04585 (HH212_04585) - 1065432..1066982 (-) 1551 WP_169434294.1 ammonium transporter -
  HH212_RS04590 (HH212_04590) - 1066993..1067331 (-) 339 WP_169434295.1 P-II family nitrogen regulator -
  HH212_RS04595 (HH212_04595) - 1067344..1068090 (-) 747 WP_169434296.1 TorF family putative porin -
  HH212_RS04600 (HH212_04600) - 1068357..1068605 (+) 249 WP_169434297.1 accessory factor UbiK family protein -
  HH212_RS04605 (HH212_04605) - 1068680..1068976 (-) 297 WP_169434298.1 hypothetical protein -
  HH212_RS04610 (HH212_04610) comM 1069118..1070635 (+) 1518 WP_169434299.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  HH212_RS04615 (HH212_04615) - 1070692..1071294 (+) 603 WP_229217570.1 hypothetical protein -
  HH212_RS04620 (HH212_04620) - 1071299..1072027 (+) 729 WP_169434300.1 ABC transporter ATP-binding protein -
  HH212_RS04625 (HH212_04625) - 1072024..1073565 (+) 1542 WP_169434301.1 hypothetical protein -
  HH212_RS04630 (HH212_04630) - 1073679..1074020 (+) 342 WP_169434302.1 DUF1840 domain-containing protein -

Sequence


Protein


Download         Length: 505 a.a.        Molecular weight: 54046.91 Da        Isoelectric Point: 8.4075

>NTDB_id=440456 HH212_RS04610 WP_169434299.1 1069118..1070635(+) (comM) [Massilia forsythiae strain GN2-R2]
MSLAVVKSRALAGMQAPAVSVEVHLANGLPGMHIVGLPDTEVREAKDRVRAALQNSGYEVPNRRITINLAPADLPKESGR
FDLPIALGIMAASEQIPARQLERYEFAGELSLSGELRPVRGALAMSFAMHRAGAGRPCAFVLPAANADEAALVADAEIYP
ARTLIEVCSHLAAKGNDALLARHRPPTPDAAPRYPDFADVKGQQHVKRALEVAAAGLHSVLLIGPPGAGKSMLASRFPGL
LPPMREEEALETAAIQSLAGSFSAAQWRRRPYRTPHHTSSGVALVGGGSNPRPGEVSLAHHGVLFLDELPEFDRKVLEVL
REPLESGEITISRAARQADFPASFQLVAAMNPCPCGWLGHPSGRCRCTPDAVQRYQDRISGPLLDRIDIQIPVAAMAPDA
MALLADGEASATVAARVGRAHALQLARQGKANQRLGTREIDLHCELDAAAGHLLREAMQKLHWSARAYHRVLRVARTVAD
LAASSQVQAQHVAEAIQYRRGLSGQ

Nucleotide


Download         Length: 1518 bp        

>NTDB_id=440456 HH212_RS04610 WP_169434299.1 1069118..1070635(+) (comM) [Massilia forsythiae strain GN2-R2]
ATGAGTCTCGCCGTCGTCAAGAGCCGCGCACTGGCCGGCATGCAGGCGCCGGCCGTCAGCGTCGAGGTGCACCTCGCCAA
CGGCTTGCCGGGCATGCACATCGTCGGGCTGCCGGACACCGAGGTGCGCGAAGCCAAGGACCGGGTGCGCGCGGCGCTGC
AGAACTCCGGCTACGAGGTGCCCAACCGGCGCATCACCATCAATCTGGCGCCCGCCGACCTGCCGAAAGAGTCGGGCCGC
TTCGACCTGCCGATCGCCCTCGGCATCATGGCCGCGTCCGAGCAGATTCCGGCCAGGCAATTGGAGCGCTACGAGTTCGC
CGGAGAATTGTCGCTGTCGGGCGAACTGCGGCCGGTGCGCGGCGCCCTGGCGATGAGCTTCGCCATGCACCGCGCCGGGG
CCGGCCGCCCTTGCGCCTTCGTGCTGCCGGCAGCCAACGCCGACGAGGCCGCGCTGGTGGCCGATGCGGAAATCTATCCC
GCGCGCACCCTGATCGAGGTCTGCTCCCACCTCGCCGCCAAGGGCAACGATGCGCTGCTGGCGCGCCACCGTCCGCCCAC
GCCGGATGCCGCGCCGCGCTATCCGGATTTTGCCGACGTCAAGGGCCAGCAGCACGTCAAGCGCGCACTGGAAGTGGCGG
CGGCGGGGCTGCACTCGGTGCTGCTGATCGGCCCGCCGGGCGCCGGCAAGAGCATGCTGGCCTCGCGCTTTCCCGGGCTG
CTGCCGCCGATGCGCGAGGAAGAAGCGCTGGAAACGGCGGCGATCCAGTCGCTGGCCGGCAGCTTTTCGGCCGCCCAGTG
GCGCCGCCGTCCCTACCGCACGCCGCACCACACCAGTTCCGGCGTGGCCCTGGTCGGCGGCGGCAGCAATCCACGTCCGG
GGGAAGTGTCGCTGGCGCACCATGGCGTGCTGTTCCTGGATGAGTTGCCGGAGTTCGACCGCAAGGTGCTCGAGGTACTG
CGCGAACCGCTGGAATCCGGCGAGATCACGATCTCGCGCGCGGCGCGCCAGGCCGACTTTCCCGCCAGTTTCCAATTGGT
GGCGGCGATGAATCCCTGTCCCTGCGGCTGGCTGGGCCATCCGTCGGGCCGCTGCCGCTGCACGCCGGACGCGGTGCAAC
GCTACCAGGACCGCATTTCCGGCCCCCTGCTCGACCGCATCGACATCCAGATCCCGGTGGCGGCGATGGCGCCCGACGCC
ATGGCGCTGCTGGCCGACGGCGAAGCCAGCGCCACGGTCGCCGCGCGCGTCGGACGGGCGCACGCCCTGCAGCTGGCGCG
CCAGGGCAAGGCCAACCAGCGCCTCGGCACGCGCGAGATCGACCTGCATTGCGAGCTCGACGCCGCCGCCGGCCATCTAT
TGCGCGAAGCGATGCAAAAGTTGCACTGGTCGGCGCGCGCGTATCACCGGGTATTGCGGGTGGCGCGTACGGTGGCCGAC
CTGGCTGCCTCCAGCCAGGTGCAGGCGCAGCACGTGGCGGAAGCGATCCAGTACCGGCGCGGCTTGTCCGGACAGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7Z2VV03

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

52.695

99.208

0.523

  comM Vibrio cholerae strain A1552

52.295

99.208

0.519

  comM Glaesserella parasuis strain SC1401

50.69

100

0.509

  comM Legionella pneumophila str. Paris

50.493

100

0.507

  comM Legionella pneumophila strain ERS1305867

50.493

100

0.507

  comM Haemophilus influenzae Rd KW20

49.405

99.802

0.493

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

47.129

100

0.471