Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   ABL847_RS21055 Genome accession   NZ_CP157616
Coordinates   4557906..4559441 (-) Length   511 a.a.
NCBI ID   WP_077000248.1    Uniprot ID   -
Organism   Variovorax sp. KK3     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4552906..4564441
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ABL847_RS21035 (ABL847_21000) - 4553442..4554554 (+) 1113 WP_077000251.1 ABC transporter substrate-binding protein -
  ABL847_RS21040 (ABL847_21005) - 4554680..4555843 (+) 1164 WP_077000250.1 saccharopine dehydrogenase family protein -
  ABL847_RS21045 (ABL847_21010) - 4555861..4557366 (+) 1506 WP_077000249.1 aldehyde dehydrogenase family protein -
  ABL847_RS21050 (ABL847_21015) - 4557423..4557884 (+) 462 WP_077000299.1 Lrp/AsnC family transcriptional regulator -
  ABL847_RS21055 (ABL847_21020) comM 4557906..4559441 (-) 1536 WP_077000248.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  ABL847_RS21060 (ABL847_21025) - 4559599..4560432 (+) 834 WP_077000247.1 TorF family putative porin -
  ABL847_RS21065 (ABL847_21030) glnK 4560503..4560841 (+) 339 WP_077000246.1 P-II family nitrogen regulator -
  ABL847_RS21070 (ABL847_21035) - 4560868..4562397 (+) 1530 WP_077000245.1 ammonium transporter -
  ABL847_RS21075 (ABL847_21040) - 4562634..4563533 (+) 900 WP_077000298.1 SMP-30/gluconolactonase/LRE family protein -

Sequence


Protein


Download         Length: 511 a.a.        Molecular weight: 53439.14 Da        Isoelectric Point: 7.8527

>NTDB_id=1007726 ABL847_RS21055 WP_077000248.1 4557906..4559441(-) (comM) [Variovorax sp. KK3]
MSLSLVQSRALIGLEAADVTVEVHLANGLPSFTLVGLADVEVKEARERVRSALQNAGLEFPSNKRITVNLAPADLPKDSG
RFDLPIALGILAASGQIEAARLAGHEFAGELSLSGHLRPVRGALAMALALHGRGVATRLVLPAESAQEAALVPGAEIYGA
AHLLDVVRQFVPGGPAPAGADDGWHRAVQAVAAEPSEALADLADVKGHAGARRVLEIAAAGQHSLLMVGPPGAGKSMLAQ
RFAGLLPQMSVDEALEAAAVASLQGRFAVGRWRQRPTCSPHHSASAVALVGGGSPPRPGEISLAHNGVLFLDEFPEFQRS
ALEALREPLETGSITIARAARRAEFPARFQLIAAMNPCPCGYLGSTLKACRCSPDQVTRYQGKLSGPLLDRIDLQIEVPA
VPTTELLDVPAGEASATVRERVAEARGRALERQGKANQALQGAEIDRHARPEAAALQLLHGAAARLGWSARGIHRALKVA
RTIADLAATDTVQAAHVAEAVQYRRALRAAA

Nucleotide


Download         Length: 1536 bp        

>NTDB_id=1007726 ABL847_RS21055 WP_077000248.1 4557906..4559441(-) (comM) [Variovorax sp. KK3]
ATGAGTTTGTCTTTGGTGCAAAGCCGTGCGCTGATCGGCTTGGAGGCGGCCGATGTCACGGTCGAGGTGCATCTGGCCAA
CGGCCTGCCCAGCTTCACGCTGGTCGGGTTGGCCGATGTGGAAGTCAAGGAAGCGCGCGAACGGGTGCGCTCGGCCCTCC
AGAACGCCGGCCTCGAATTCCCCAGCAACAAGCGCATCACGGTCAACCTGGCGCCGGCCGACCTGCCCAAGGATTCCGGC
CGCTTCGACCTGCCGATCGCGCTGGGCATCCTGGCGGCCAGCGGCCAGATCGAGGCGGCGCGGCTCGCTGGCCACGAATT
CGCGGGCGAGCTCTCGCTTTCGGGGCATCTGCGGCCCGTGCGTGGTGCACTGGCGATGGCGCTGGCCCTGCATGGCCGCG
GCGTCGCCACGCGGCTGGTGCTGCCGGCCGAGAGCGCACAGGAAGCCGCGCTGGTGCCTGGCGCCGAAATCTACGGCGCA
GCGCACCTGCTCGACGTGGTGCGCCAGTTCGTGCCGGGCGGCCCCGCGCCGGCCGGGGCCGACGATGGCTGGCACCGGGC
CGTGCAGGCCGTCGCCGCCGAGCCGTCCGAAGCGCTGGCCGACCTGGCCGACGTCAAGGGCCATGCCGGCGCACGGCGCG
TGCTCGAGATCGCCGCCGCCGGCCAGCACAGCCTGCTGATGGTCGGGCCGCCGGGCGCCGGCAAGTCGATGCTGGCCCAG
CGCTTCGCCGGCCTGCTGCCGCAGATGAGCGTGGACGAAGCGTTGGAAGCCGCGGCGGTGGCCAGCCTGCAAGGCCGGTT
CGCCGTCGGCCGATGGCGCCAGCGGCCGACCTGCAGCCCGCACCACAGCGCGAGCGCGGTCGCGCTGGTGGGTGGCGGCA
GTCCGCCGCGGCCCGGCGAGATCTCGCTGGCGCACAACGGCGTGCTGTTCCTCGACGAGTTTCCCGAGTTCCAGCGCTCG
GCCCTCGAGGCCCTGCGCGAGCCCCTGGAGACCGGCAGCATCACCATCGCGCGGGCTGCCCGGCGCGCCGAATTTCCGGC
GCGTTTCCAGCTGATCGCGGCGATGAACCCCTGCCCTTGCGGCTACCTGGGCTCGACGCTCAAGGCCTGCCGCTGCTCGC
CCGACCAGGTCACCCGATATCAAGGAAAGCTCAGCGGCCCGCTGCTGGACCGCATCGACCTGCAGATCGAGGTGCCGGCC
GTGCCCACCACTGAGCTGCTCGACGTGCCGGCCGGCGAAGCCAGCGCCACGGTGCGCGAGCGCGTGGCCGAGGCGCGCGG
CCGGGCCCTGGAGCGCCAGGGCAAGGCCAACCAGGCCTTGCAGGGCGCAGAGATCGACCGCCACGCCCGGCCCGAAGCGG
CTGCCTTGCAGCTGCTGCACGGCGCCGCAGCGCGGCTGGGCTGGTCGGCGCGCGGCATCCACCGGGCGCTGAAGGTCGCG
CGAACCATTGCGGACCTGGCCGCCACCGACACGGTGCAGGCGGCGCACGTGGCGGAGGCGGTGCAGTACCGCCGGGCACT
GCGCGCAGCAGCCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

51.663

100

0.517

  comM Vibrio cholerae strain A1552

51.272

100

0.513

  comM Glaesserella parasuis strain SC1401

51.282

99.217

0.509

  comM Vibrio campbellii strain DS40M4

49.902

99.609

0.497

  comM Legionella pneumophila str. Paris

46.693

100

0.47

  comM Legionella pneumophila strain ERS1305867

46.693

100

0.47

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

42.717

99.413

0.425


Multiple sequence alignment