Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   RG643_RS22055 Genome accession   NZ_CP133753
Coordinates   4706277..4707770 (+) Length   497 a.a.
NCBI ID   WP_003457843.1    Uniprot ID   -
Organism   Pseudomonas aeruginosa strain ZYPA187     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 4701277..4712770
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  RG643_RS22025 (RG643_22020) - 4701427..4702341 (-) 915 WP_003118059.1 fimbrial protein -
  RG643_RS22030 (RG643_22025) sutA 4702749..4703066 (-) 318 WP_003098240.1 transcriptional regulator SutA -
  RG643_RS22035 (RG643_22030) - 4703144..4703569 (-) 426 WP_015649825.1 secondary thiamine-phosphate synthase enzyme YjbQ -
  RG643_RS22040 (RG643_22035) - 4703830..4705158 (-) 1329 WP_003098243.1 ammonium transporter -
  RG643_RS22045 (RG643_22040) glnK 4705198..4705536 (-) 339 WP_003096476.1 P-II family nitrogen regulator -
  RG643_RS22050 (RG643_22045) - 4705976..4706236 (+) 261 WP_003096478.1 accessory factor UbiK family protein -
  RG643_RS22055 (RG643_22050) comM 4706277..4707770 (+) 1494 WP_003457843.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  RG643_RS22060 (RG643_22055) betT 4707895..4709880 (+) 1986 WP_003096496.1 choline BCCT transporter BetT -
  RG643_RS22065 (RG643_22060) pchP 4709923..4710972 (-) 1050 WP_003110458.1 phosphorylcholine phosphatase -
  RG643_RS22070 (RG643_22065) - 4711123..4712040 (-) 918 WP_003098255.1 LysR substrate-binding domain-containing protein -

Sequence


Protein


Download         Length: 497 a.a.        Molecular weight: 53083.02 Da        Isoelectric Point: 7.9587

>NTDB_id=876310 RG643_RS22055 WP_003457843.1 4706277..4707770(+) (comM) [Pseudomonas aeruginosa strain ZYPA187]
MSLAIVHSRAQVGVEAPCVSVEAHLANGLPSLTLVGLPETAVRESKDRVRSALLNAGFDFPARRITLNLAPADLPKDGGR
FDLAIALGILAASGQLPGTALDGLECLGELALSGAIRPVRGVLPAALAARDARRVLVVPKENAEEASLASGLTVFAVDHL
LEIAGHLSGQAPLLPYQARGLLRAPFPYPDLAEVQGQAAAKRALLVAAAGAHNLLLSGPPGTGKTLLASRLPGLLPALDE
DEALEVAAIHSVASHVPLRHWPQRPFRQPHHSASAPALVGGGSRPQPGEITLAHQGVLFLDELPEFERKVLEVLREPLES
GEIVIARANGRVRFPARFQLVAAMNPCPCGYLGDPSGRCRCTPEQVQRYRGKLSGPLLDRIDLHVSVLRESTSLQPGHGE
TATAEISERVGAARQRQLARQGCANAHLDLQAMHRNCALAEADRRWLEAAGERLELSLRALHRILKVARTLADLERIDAI
ERRHLAEALQYRATTST

Nucleotide


Download         Length: 1494 bp        

>NTDB_id=876310 RG643_RS22055 WP_003457843.1 4706277..4707770(+) (comM) [Pseudomonas aeruginosa strain ZYPA187]
ATGTCCCTGGCGATTGTCCACAGCCGAGCCCAGGTCGGCGTCGAAGCCCCCTGCGTCAGCGTCGAGGCGCACCTGGCCAA
CGGCCTGCCTTCGCTGACCCTGGTCGGCCTGCCGGAAACCGCGGTGCGCGAGAGCAAGGACCGCGTGCGCAGCGCCCTGC
TCAATGCCGGTTTCGACTTCCCCGCGCGGCGCATCACCCTCAACCTCGCCCCCGCCGACCTGCCCAAGGACGGCGGTCGC
TTCGACCTGGCCATCGCACTCGGCATCCTCGCCGCCAGCGGCCAGTTGCCCGGCACCGCCCTCGACGGCCTGGAGTGCCT
TGGCGAACTGGCCCTGTCCGGGGCGATCCGGCCAGTGCGAGGCGTATTGCCGGCCGCGCTGGCGGCGCGCGACGCAAGGC
GCGTTCTGGTGGTACCGAAGGAAAATGCCGAAGAGGCCAGCCTGGCCAGCGGGCTGACGGTGTTCGCCGTGGACCACCTG
CTGGAGATCGCCGGACACCTCTCCGGCCAGGCCCCGCTGCTGCCCTACCAGGCCCGCGGCCTGCTCCGCGCGCCCTTCCC
TTATCCAGACCTGGCCGAGGTCCAGGGCCAGGCCGCCGCCAAGCGCGCCCTGCTGGTGGCCGCCGCCGGCGCGCACAACC
TGTTGCTCAGCGGCCCGCCGGGCACCGGCAAGACCCTCCTGGCCAGCCGCCTGCCCGGCCTGCTGCCGGCGCTCGACGAG
GACGAGGCCCTGGAGGTCGCAGCGATCCATTCGGTGGCCAGCCACGTCCCCCTCAGGCACTGGCCGCAGCGACCGTTCCG
CCAGCCGCACCACTCCGCCTCCGCGCCGGCCCTGGTCGGCGGCGGCAGCCGCCCGCAGCCGGGCGAGATCACCCTGGCGC
ACCAGGGCGTGCTGTTCCTCGACGAACTGCCGGAGTTCGAGCGCAAGGTCCTGGAGGTCCTGCGCGAGCCGCTGGAAAGC
GGCGAGATCGTCATTGCCCGGGCCAACGGCCGGGTACGTTTCCCGGCGCGCTTCCAACTGGTGGCGGCGATGAATCCCTG
TCCCTGTGGCTACCTCGGCGATCCCAGCGGCCGCTGCCGCTGCACCCCGGAACAGGTCCAGCGCTACCGGGGCAAGCTGT
CCGGACCGCTGCTCGATCGCATCGACCTGCACGTCAGCGTGCTCCGCGAAAGCACCAGCCTGCAGCCAGGACACGGCGAA
ACCGCTACCGCCGAGATCAGCGAACGGGTTGGCGCCGCACGGCAACGGCAACTGGCCCGCCAGGGCTGCGCCAATGCCCA
TCTCGACCTCCAGGCGATGCACCGCAATTGTGCACTCGCCGAAGCGGACCGCCGCTGGCTGGAGGCTGCCGGAGAGCGCC
TGGAACTTTCCTTGCGCGCCTTGCATCGCATACTCAAGGTGGCCCGGACGCTGGCCGACCTGGAGCGCATCGATGCCATC
GAACGCCGGCACCTGGCGGAAGCCCTGCAGTATCGGGCAACGACCTCCACGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

56.452

99.799

0.563

  comM Haemophilus influenzae Rd KW20

55.467

100

0.561

  comM Vibrio cholerae strain A1552

56.048

99.799

0.559

  comM Glaesserella parasuis strain SC1401

54.691

100

0.551

  comM Legionella pneumophila str. Paris

49.901

100

0.507

  comM Legionella pneumophila strain ERS1305867

49.901

100

0.507

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.154

100

0.471