Detailed information    

In silico: bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   E5P3_RS01785 Genome accession   NZ_LR594662
Coordinates   389349..390884 (-) Length   511 a.a.
NCBI ID   WP_162584429.1    Uniprot ID   A0A6P2DYQ9
Organism   Variovorax sp. RA8     
Function   required for natural transformation (predicted from homology)   
Unclear
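
The coordinates and the two lengths listed above agree with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The helper name `gene_lengths` is hypothetical and not part of the database.

```python
import re
from typing import Tuple


def gene_lengths(coords: str) -> Tuple[int, int]:
    """Return (length in bp, protein length in a.a.) for 'start..end (strand)'."""
    match = re.match(r"\s*(\d+)\.\.(\d+)", coords)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {coords!r}")
    start, end = int(match.group(1)), int(match.group(2))
    length_bp = end - start + 1      # assumption: inclusive, 1-based coordinates
    length_aa = length_bp // 3 - 1   # assumption: the last codon is a stop codon
    return length_bp, length_aa


print(gene_lengths("389349..390884 (-)"))  # (1536, 511)
```

The result matches the 511 a.a. given in the overview and the 1536 bp given for the nucleotide sequence.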

Genomic Context


Location: 384349..395884
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E5P3_RS01765 (G3W91_RS01765) - 385155..385964 (+) 810 WP_162584425.1 helix-turn-helix transcriptional regulator -
  E5P3_RS01770 (G3W91_RS01770) - 386058..387077 (+) 1020 WP_162584426.1 aromatic ring-hydroxylating dioxygenase subunit alpha -
  E5P3_RS01775 (G3W91_RS01775) - 387126..388109 (+) 984 WP_162584427.1 tripartite tricarboxylate transporter substrate-binding protein -
  E5P3_RS01780 (G3W91_RS01780) - 388239..389348 (+) 1110 WP_162584428.1 ABC transporter substrate-binding protein -
  E5P3_RS01785 (G3W91_RS01785) comM 389349..390884 (-) 1536 WP_162584429.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  E5P3_RS01790 (G3W91_RS01790) - 391044..391865 (+) 822 WP_162589481.1 TorF family putative porin -
  E5P3_RS01795 (G3W91_RS01795) glnK 391932..392270 (+) 339 WP_068686416.1 P-II family nitrogen regulator -
  E5P3_RS01800 (G3W91_RS01800) amt 392297..393823 (+) 1527 WP_174263023.1 ammonium transporter -
  E5P3_RS01805 (G3W91_RS01805) - 393977..394900 (+) 924 WP_162584430.1 SMP-30/gluconolactonase/LRE family protein -
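
The listed location 384349..395884 is numerically the comM interval extended by 5000 bp on each side. The source does not state this rule, so the flank size below is an inference from the numbers. A minimal sketch of that window calculation, with hypothetical helper names, using coordinates copied from the table above:

```python
def flanking_window(start, end, flank=5000):
    """Extend a gene interval by `flank` bp on both sides (flank size is an assumption)."""
    return max(1, start - flank), end + flank


def genes_in_window(genes, window):
    """Keep the locus tags of genes that lie entirely inside the window."""
    low, high = window
    return [tag for tag, start, end in genes if start >= low and end <= high]


# Coordinates taken from the table above (first, central, and last rows).
genes = [
    ("E5P3_RS01765", 385155, 385964),
    ("E5P3_RS01785", 389349, 390884),
    ("E5P3_RS01805", 393977, 394900),
]

window = flanking_window(389349, 390884)
print(window)                          # (384349, 395884)
print(genes_in_window(genes, window))  # all three locus tags
```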

Sequence


Protein


Length: 511 a.a.        Molecular weight: 53401.16 Da        Isoelectric Point: 8.5296

>NTDB_id=1128179 E5P3_RS01785 WP_162584429.1 389349..390884(-) (comM) [Variovorax sp. RA8]
MSLSLVQSRALLGLEAASVTVEVHLANGLPSFTLVGLADVEVKEARERVRCAIQNAGLEFPSNKRITVNLAPADLPKDSG
RFDLPIALGILAAAGQIEAARLAGHEFAGELSLSGHLRPVRGALAMALALHGRGVATKLVLPAESAREAALVPGAEIYGA
AHLLDVVRQFLPGGPAPGDAAEDGWHRAQAAAAGPAAAEADLADVKGHAGARRALEIAAAGQHSLLMVGPPGSGKSMLAQ
RFAGLLPSMSVDEALESAAVASLHGRFAVERWRLRPTCSPHHSASAVALVGGGSPPRPGEISLAHNGVLFLDEFPEFQRA
ALEALREPLETGSITIARAARRAEFPARFQLVAAMNPCPCGHLGSSLKPCRCTPDQVARYQGKLSGPLLDRIDLHIEVPA
VPATQLLETPTGEASAEVRARVVEARERALRRQGKANQALQGAEIDRHAQPGAAALQFLQAAATRLGWSARGTHRTLKLA
RTIADLAGAGTVQAAHVAEAVQYRRALQKVE
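
The FASTA header above packs several fields into one line. The sketch below shows one way to split it into named fields. The field layout is inferred from this single record, so the pattern is an assumption and may not fit every entry in the database. The names `HEADER_RE` and `parse_header` are hypothetical.

```python
import re

# Layout assumed from this record:
# >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line):
    """Split a header line into a dict of named fields; coordinates become ints."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"header does not match the assumed layout: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=1128179 E5P3_RS01785 WP_162584429.1 "
    "389349..390884(-) (comM) [Variovorax sp. RA8]"
)
print(parse_header(header))
```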

Nucleotide


Length: 1536 bp

>NTDB_id=1128179 E5P3_RS01785 WP_162584429.1 389349..390884(-) (comM) [Variovorax sp. RA8]
ATGAGCTTATCTTTGGTGCAGAGCCGTGCTTTGCTGGGCCTGGAAGCGGCAAGCGTCACGGTCGAGGTGCATCTGGCCAA
CGGGCTGCCCAGCTTCACGCTGGTGGGATTGGCCGACGTGGAGGTGAAGGAAGCCCGGGAGCGGGTGCGTTGCGCCATCC
AGAACGCCGGCCTCGAATTCCCGAGCAACAAGCGGATCACGGTCAACCTGGCGCCGGCCGACCTGCCGAAGGACTCGGGC
CGCTTCGACCTGCCGATTGCCCTGGGCATCCTGGCGGCGGCCGGGCAGATCGAGGCGGCCCGGCTGGCGGGCCACGAATT
CGCGGGGGAGCTCTCGCTTTCAGGGCACCTGAGGCCCGTGCGTGGTGCGCTCGCGATGGCGCTGGCGCTGCATGGCCGCG
GTGTCGCGACCAAGCTGGTGCTGCCGGCAGAGAGTGCGAGGGAGGCCGCCCTGGTGCCGGGCGCCGAAATCTACGGTGCA
GCCCACCTGCTCGATGTGGTGCGGCAGTTCCTGCCGGGCGGCCCGGCACCCGGCGATGCGGCGGAAGATGGCTGGCATCG
TGCGCAGGCCGCCGCCGCCGGCCCGGCGGCGGCGGAGGCCGACCTGGCGGACGTCAAGGGCCACGCGGGCGCCAGGCGCG
CGCTCGAGATCGCGGCGGCCGGCCAGCACAGCCTGCTGATGGTGGGCCCGCCGGGGTCCGGCAAGTCGATGCTGGCCCAG
CGCTTCGCCGGCCTGCTGCCGTCGATGAGCGTGGACGAGGCGCTGGAAAGCGCCGCCGTCGCCAGCCTGCACGGCCGCTT
CGCCGTCGAGCGCTGGCGCCTGCGGCCGACCTGCAGCCCGCACCACAGCGCCAGTGCGGTAGCGCTGGTGGGCGGCGGCT
CGCCGCCGCGGCCGGGCGAAATCTCGCTGGCGCACAACGGCGTGCTGTTCCTGGACGAGTTCCCGGAGTTCCAGCGCGCC
GCGCTCGAAGCGCTGCGCGAGCCGCTGGAGACCGGCAGCATCACCATCGCGCGGGCCGCACGGCGTGCCGAGTTCCCGGC
CCGCTTCCAGTTGGTCGCGGCCATGAACCCCTGCCCTTGCGGGCACCTGGGCTCCTCGCTCAAGCCCTGCCGCTGCACGC
CGGACCAGGTGGCCCGCTACCAGGGCAAGCTCAGCGGGCCGCTGCTGGACCGCATCGACCTGCACATCGAGGTACCTGCG
GTGCCGGCCACCCAGTTGCTGGAGACACCCACCGGCGAAGCCAGCGCCGAGGTTCGCGCGCGCGTGGTCGAGGCGCGCGA
GCGCGCCCTGCGGCGCCAGGGCAAGGCCAACCAGGCGCTGCAGGGCGCGGAGATCGACCGCCACGCGCAGCCCGGTGCCG
CGGCCCTGCAGTTCCTGCAGGCCGCCGCGACGCGGCTGGGCTGGTCGGCGCGCGGGACGCATCGCACGCTCAAGCTGGCC
CGCACGATCGCGGACCTGGCCGGCGCCGGCACGGTGCAGGCGGCGCATGTGGCGGAGGCGGTGCAGTACCGGAGAGCGCT
GCAGAAGGTCGAGTGA
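
Although the gene lies on the minus strand, the nucleotide sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation. The sketch below translates short fragments of it with the standard genetic code to show that they reproduce the start and end of the protein sequence. The use of the standard code is an assumption, the function name `translate` is hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumption: it applies here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 nucleotides and last 15 nucleotides of the sequence above.
print(translate("ATGAGCTTATCTTTGGTGCAGAGCCGTGCT"))  # MSLSLVQSRA
print(translate("CAGAAGGTCGAGTGA"))                 # QKVE*
```

Both fragments match the corresponding ends of the 511 a.a. protein sequence, with the trailing `*` marking the TGA stop codon.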


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6P2DYQ9

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20 51.456 100 0.519
  comM Glaesserella parasuis strain SC1401 50.677 100 0.513
  comM Vibrio cholerae strain A1552 51.272 100 0.513
  comM Vibrio campbellii strain DS40M4 49.902 100 0.499
  comM Legionella pneumophila str. Paris 46.899 100 0.474
  comM Legionella pneumophila strain ERS1305867 46.899 100 0.474
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 43.418 99.609 0.432


Multiple sequence alignment