Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   E5P1_RS17830 Genome accession   NZ_LR594689
Coordinates   3682471..3684006 (-) Length   511 a.a.
NCBI ID   WP_068686418.1    Uniprot ID   A0A2J7W672
Organism   Variovorax sp. WDL1     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 3677471..3689006
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E5P1_RS17805 (G3W93_RS17805) - 3677670..3678164 (+) 495 WP_162487148.1 CesT family type III secretion system chaperone -
  E5P1_RS17810 (G3W93_RS17810) - 3678284..3679093 (+) 810 WP_068686424.1 helix-turn-helix transcriptional regulator -
  E5P1_RS17815 (G3W93_RS17815) - 3679187..3680206 (+) 1020 WP_068686422.1 aromatic ring-hydroxylating dioxygenase subunit alpha -
  E5P1_RS17820 (G3W93_RS17820) - 3680249..3681232 (+) 984 WP_232081203.1 tripartite tricarboxylate transporter substrate-binding protein -
  E5P1_RS17825 (G3W93_RS17825) - 3681361..3682470 (+) 1110 WP_068686420.1 ABC transporter substrate-binding protein -
  E5P1_RS17830 (G3W93_RS17830) comM 3682471..3684006 (-) 1536 WP_068686418.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  E5P1_RS17835 (G3W93_RS17835) - 3684163..3684987 (+) 825 WP_083944970.1 TorF family putative porin -
  E5P1_RS17840 (G3W93_RS17840) glnK 3685054..3685392 (+) 339 WP_068686416.1 P-II family nitrogen regulator -
  E5P1_RS17845 (G3W93_RS17845) amt 3685419..3686948 (+) 1530 WP_174262780.1 ammonium transporter -
  E5P1_RS17850 (G3W93_RS17850) glcE 3687120..3688214 (+) 1095 WP_068686414.1 glycolate oxidase subunit GlcE -

Sequence


Protein


Download         Length: 511 a.a.        Molecular weight: 53550.41 Da        Isoelectric Point: 8.5260

>NTDB_id=1128269 E5P1_RS17830 WP_068686418.1 3682471..3684006(-) (comM) [Variovorax sp. WDL1]
MSLSLVQSRALLGLEAASVTVEVHLANGLPSFTLVGLADVEVKEARERVRCAIQNAGLEFPSNKRITVNLAPADLPKDSG
RFDLPIALGILAAAGQIEAARLAGHEFAGELSLSGHLRPVRGALAMALALHGRRVATRLVLPAESAREAALVPGAEIYGA
AHLLDVVRQFLPGGPAAGDAAEDGWHRALPAAAGPAAAAADLADVKGHAGARRALEIAAAGQHSLLMVGPPGSGKSMLAQ
RFAGLLPAMSVDEALESAAVASLHGRFAVERWRMRPTCSPHHSASAVALVGGGSPPRPGEISLAHNGVLFLDEFPEFQRA
ALEALREPLETGSITIARAARRAEFPARFQLVAAMNPCPCGYLGSTLKPCRCTPDQVARYQGKLSGPLLDRIDLHIEVPA
VPATHLLETPAGEASAEVRARVVEARERALQRQGKANQALQGAEIDRHAQLDAAALRFLQAAATRLGWSARGTHRTLKLA
RTIADLAGAGTVQAAHVAEAVQYRRALQQVE

Nucleotide


Download         Length: 1536 bp        

>NTDB_id=1128269 E5P1_RS17830 WP_068686418.1 3682471..3684006(-) (comM) [Variovorax sp. WDL1]
ATGAGCTTATCTTTGGTGCAAAGCCGTGCTTTGCTGGGCCTGGAGGCGGCAAGCGTCACGGTCGAGGTGCATCTGGCCAA
CGGGCTGCCCAGCTTCACGCTGGTGGGACTGGCGGACGTGGAGGTGAAGGAGGCCCGCGAGCGCGTGCGCTGCGCCATCC
AGAACGCCGGCCTCGAATTTCCGAGCAACAAGCGGATCACGGTCAACCTGGCGCCGGCAGATCTCCCGAAGGACTCGGGC
CGCTTCGACCTGCCGATCGCGCTGGGCATCCTGGCTGCGGCCGGGCAGATCGAGGCGGCGCGGCTGGCAGGCCACGAATT
CGCGGGGGAGCTCTCCCTTTCTGGGCACCTGAGGCCCGTGCGTGGTGCGCTCGCGATGGCGCTGGCGCTGCATGGCCGCC
GTGTCGCGACCAGGCTGGTGTTGCCGGCAGAGAGTGCGAGGGAGGCGGCCCTGGTGCCCGGCGCCGAAATCTACGGTGCA
GCCCACCTGCTCGATGTGGTGCGGCAATTCCTGCCGGGCGGCCCGGCAGCGGGCGATGCGGCTGAAGATGGCTGGCACCG
TGCGCTGCCCGCCGCCGCAGGCCCGGCGGCGGCCGCGGCCGACCTGGCGGATGTCAAGGGCCACGCGGGCGCGAGGCGTG
CGCTCGAGATCGCGGCGGCCGGCCAGCACAGCCTGCTGATGGTCGGTCCGCCCGGTTCCGGCAAGTCGATGCTGGCCCAG
CGCTTCGCCGGCCTGCTGCCGGCGATGAGCGTGGACGAGGCGCTGGAAAGCGCCGCCGTCGCGAGCCTGCACGGCCGCTT
CGCCGTCGAGCGCTGGCGCATGCGGCCGACCTGCAGCCCGCACCATAGCGCCAGTGCGGTGGCGCTGGTGGGCGGCGGCT
CGCCGCCACGGCCGGGTGAGATCTCGCTGGCGCACAACGGCGTGCTGTTCCTAGACGAATTTCCGGAGTTCCAGCGCGCC
GCGCTCGAAGCGCTGCGCGAGCCGCTGGAGACCGGCAGCATCACCATCGCACGCGCCGCGCGGCGTGCCGAGTTCCCGGC
CCGCTTCCAACTGGTGGCGGCCATGAACCCCTGCCCGTGCGGCTACCTGGGCTCAACGCTCAAGCCCTGCCGATGCACGC
CGGACCAGGTGGCACGCTACCAGGGCAAGCTCAGCGGACCGCTGCTGGACCGCATCGACCTGCACATCGAAGTGCCCGCG
GTACCGGCCACCCATTTGCTGGAAACACCCGCCGGCGAAGCCAGCGCCGAGGTCCGTGCGCGCGTGGTCGAGGCGCGCGA
GCGCGCCCTGCAGCGGCAGGGCAAGGCCAACCAGGCGCTACAGGGTGCGGAGATCGACCGCCATGCGCAGCTGGATGCCG
CGGCTTTGCGTTTCCTGCAGGCCGCGGCGACGCGGCTGGGCTGGTCGGCCCGGGGGACGCATCGCACGCTCAAACTGGCC
CGCACGATCGCGGACCTGGCCGGCGCCGGCACGGTGCAGGCGGCGCATGTGGCGGAGGCGGTGCAGTACCGGAGAGCGCT
GCAGCAGGTCGAGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A2J7W672

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

51.158

100

0.519

  comM Vibrio cholerae strain A1552

51.456

100

0.519

  comM Glaesserella parasuis strain SC1401

50.385

100

0.513

  comM Vibrio campbellii strain DS40M4

50.096

100

0.509

  comM Legionella pneumophila str. Paris

46.318

100

0.468

  comM Legionella pneumophila strain ERS1305867

46.318

100

0.468

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.333

99.804

0.432


Multiple sequence alignment