Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comM
Type: Machinery gene
Locus tag: R5029_RS31350
Genome accession: NZ_CP137522
Coordinates: 6620051..6621544 (+)
Length: 497 a.a.
NCBI ID: WP_014604121.1
UniProt ID: -
Organism: Pseudomonas aeruginosa strain HPA0118
Function: DNA uptake (predicted from homology)
DNA binding and uptake
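
The coordinates and the protein length reported for this record are linked by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the helper name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is excluded from the protein length.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_len, nt_len // 3 - 1


# Coordinates of comM (R5029_RS31350) as listed in this record.
nt_len, aa_len = cds_lengths(6620051, 6621544)
print(nt_len, aa_len)  # 1494 497
```

The result agrees with the 1494 bp and 497 a.a. lengths given elsewhere in this record.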

Genomic Context


Location: 6615051..6626544
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
R5029_RS31320 (R5029_31320) | - | 6615172..6616086 (-) | 915 | WP_003096438.1 | fimbrial protein | -
R5029_RS31325 (R5029_31325) | sutA | 6616523..6616840 (-) | 318 | WP_003096440.1 | transcriptional regulator SutA | -
R5029_RS31330 (R5029_31330) | - | 6616918..6617343 (-) | 426 | WP_003096441.1 | secondary thiamine-phosphate synthase enzyme YjbQ | -
R5029_RS31335 (R5029_31335) | - | 6617604..6618932 (-) | 1329 | WP_003109815.1 | ammonium transporter | -
R5029_RS31340 (R5029_31340) | glnK | 6618972..6619310 (-) | 339 | WP_003096476.1 | P-II family nitrogen regulator | -
R5029_RS31345 (R5029_31345) | - | 6619750..6620010 (+) | 261 | WP_003096478.1 | accessory factor UbiK family protein | -
R5029_RS31350 (R5029_31350) | comM | 6620051..6621544 (+) | 1494 | WP_014604121.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene
R5029_RS31355 (R5029_31355) | betT | 6621669..6623654 (+) | 1986 | WP_003096496.1 | choline BCCT transporter BetT | -
R5029_RS31360 (R5029_31360) | pchP | 6623697..6624746 (-) | 1050 | WP_003096497.1 | phosphorylcholine phosphatase | -
R5029_RS31365 (R5029_31365) | - | 6624897..6625814 (-) | 918 | WP_014604122.1 | LysR substrate-binding domain-containing protein | -
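
The spacing between comM and its immediate neighbors can be derived from the coordinates in the table above. The sketch below is illustrative only: the tuple layout and the helper name `intergenic_gaps` are assumptions, and the coordinates are taken to be 1-based and inclusive.

```python
# (locus tag, start, end, strand) copied from the table above
# for comM and the genes directly upstream and downstream of it.
genes = [
    ("R5029_RS31345", 6619750, 6620010, "+"),  # UbiK family protein
    ("R5029_RS31350", 6620051, 6621544, "+"),  # comM
    ("R5029_RS31355", 6621669, 6623654, "+"),  # betT
]


def intergenic_gaps(records):
    """Yield (upstream tag, downstream tag, gap in bp) for adjacent genes.

    Assumes records are sorted by start position and use 1-based inclusive
    coordinates; a negative gap would indicate overlapping genes.
    """
    for (tag_a, _, end_a, _), (tag_b, start_b, _, _) in zip(records, records[1:]):
        yield tag_a, tag_b, start_b - end_a - 1


for upstream, downstream, gap in intergenic_gaps(genes):
    print(f"{upstream} -> {downstream}: {gap} bp")
```

With these coordinates, the gap is 40 bp upstream of comM and 124 bp downstream of it.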

Sequence


Protein


Length: 497 a.a.
Molecular weight: 53113.05 Da
Isoelectric point: 7.9587

>NTDB_id=898770 R5029_RS31350 WP_014604121.1 6620051..6621544(+) (comM) [Pseudomonas aeruginosa strain HPA0118]
MSLAIVHSRAQVGVEAPCVSVEAHLANGLPSLTLVGLPETAVRESKDRVRSALLNAGFDFPARRITLNLAPADLPKDGGR
FDLAIALGILAASGQLPGTTLDGLECLGELALSGAIRPVRGVLPAALAARDARRVLVVPKENAEEASLASGLTVFAVDHL
LEIAGHLSGQAPLLPYQARGLLRAPFPYPDLAEVQGQAAAKRALLVAAAGAHNLLLSGPPGTGKTLLASRLPGLLPALDE
DEALEVAAIHSVASHVPLRHWPQRPFRQPHHSASAPALVGGGSRPQPGEITLAHQGVLFLDELPEFERKVLEVLREPLES
GEIVIARANGRVRFPARFQLVAAMNPCPCGYLGDPSGRCRCTPEQVQRYRGKLSGPLLDRIDLHVSVLRESTSLQPGHGE
TATAEISERVGAARQRQLARQGCANAHLDLQAMHRNCALAEADRRWLEAAGERLELSLRALHRILKVARTLADLERIDAI
ERRHLAEALQYRATTST
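
The FASTA header above packs several fields into one line. The following sketch splits it into named fields. The pattern is inferred from this single record, so it is an assumption rather than a documented format, and `HEADER_RE` and `parse_header` are hypothetical names.

```python
import re

# Pattern inferred from the one header shown in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into its fields; start and end become integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=898770 R5029_RS31350 WP_014604121.1 "
    "6620051..6621544(+) (comM) [Pseudomonas aeruginosa strain HPA0118]"
)
fields = parse_header(header)
print(fields["gene"], fields["start"], fields["end"], fields["strand"])
```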

Nucleotide


Length: 1494 bp

>NTDB_id=898770 R5029_RS31350 WP_014604121.1 6620051..6621544(+) (comM) [Pseudomonas aeruginosa strain HPA0118]
ATGTCCCTGGCGATTGTCCACAGCCGAGCCCAGGTCGGCGTCGAAGCCCCCTGCGTCAGCGTCGAGGCGCACCTGGCCAA
CGGCCTGCCTTCGCTGACCCTGGTCGGCCTGCCGGAAACCGCGGTGCGCGAGAGCAAGGACCGCGTGCGCAGCGCCCTGC
TCAATGCCGGTTTCGACTTCCCCGCGCGGCGCATCACCCTCAACCTCGCCCCCGCCGACCTGCCCAAGGACGGCGGTCGC
TTCGACCTGGCCATCGCACTCGGCATCCTCGCCGCCAGCGGCCAGTTGCCCGGCACCACCCTCGACGGCCTGGAGTGCCT
TGGCGAACTGGCCCTGTCCGGGGCGATCCGGCCAGTGCGAGGCGTATTGCCGGCCGCGCTGGCGGCGCGCGACGCAAGGC
GCGTTCTGGTGGTACCGAAGGAAAATGCCGAAGAGGCCAGCCTGGCCAGCGGGCTGACGGTGTTCGCCGTGGACCACCTG
CTGGAGATCGCCGGACACCTCTCCGGCCAGGCCCCGCTGCTGCCCTACCAGGCCCGCGGCCTGCTCCGCGCGCCCTTCCC
TTATCCAGACCTGGCCGAGGTCCAGGGCCAGGCCGCCGCCAAGCGCGCCCTGCTGGTGGCCGCCGCCGGCGCGCACAACC
TGTTGCTCAGCGGCCCGCCGGGCACCGGCAAGACCCTCCTGGCCAGCCGCCTGCCCGGCCTGCTGCCGGCGCTCGACGAG
GACGAGGCCCTGGAGGTCGCAGCGATCCATTCGGTGGCCAGCCACGTCCCCCTCAGGCACTGGCCGCAGCGACCGTTCCG
CCAGCCGCACCACTCCGCCTCCGCGCCGGCCCTGGTCGGCGGCGGCAGCCGCCCGCAGCCGGGCGAGATCACCCTGGCGC
ACCAGGGCGTGCTGTTCCTCGACGAACTGCCGGAGTTCGAGCGCAAGGTCCTGGAGGTCCTGCGCGAGCCGCTGGAAAGC
GGCGAGATCGTCATTGCCCGGGCCAACGGCCGGGTACGTTTCCCGGCGCGCTTCCAACTGGTGGCGGCGATGAATCCCTG
TCCCTGTGGCTACCTCGGCGATCCCAGCGGCCGCTGCCGCTGCACCCCGGAACAGGTCCAGCGCTACCGGGGCAAGCTGT
CCGGACCGCTGCTCGATCGCATCGACCTGCACGTCAGCGTGCTCCGCGAAAGCACCAGCCTGCAGCCAGGACACGGCGAA
ACCGCTACCGCCGAGATCAGCGAACGGGTTGGCGCCGCACGGCAACGGCAACTGGCCCGCCAGGGCTGCGCCAATGCCCA
TCTCGACCTCCAGGCGATGCACCGCAATTGTGCACTCGCCGAAGCGGACCGCCGCTGGCTGGAGGCTGCCGGAGAGCGCC
TGGAACTTTCCTTGCGCGCCTTGCATCGCATACTCAAGGTGGCCCGGACGCTGGCCGACCTGGAGCGCATCGATGCCATC
GAACGCCGGCACCTGGCGGAAGCCCTGCAGTATCGGGCAACGACCTCCACGTGA
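
The nucleotide and protein sequences in this record should agree under the standard genetic code. The sketch below translates a coding sequence and checks two in-frame fragments copied from the record (its first 21 bases and its final line). The codon table is built from the standard amino acid string in TCAG order; `translate` is a hypothetical helper that assumes an in-frame, uppercase-convertible DNA string with no ambiguous bases.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, dropping one trailing stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 21 bases of the nucleotide sequence above.
print(translate("ATGTCCCTGGCGATTGTCCAC"))  # MSLAIVH
# Final line of the nucleotide sequence above, which ends with the TGA stop codon.
print(translate("GAACGCCGGCACCTGGCGGAAGCCCTGCAGTATCGGGCAACGACCTCCACGTGA"))
```

Both fragments match the start (MSLAIVH) and end (ERRHLAEALQYRATTST) of the protein sequence listed in this record.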


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comM | Vibrio campbellii strain DS40M4 | 56.452 | 99.799 | 0.563
comM | Haemophilus influenzae Rd KW20 | 55.467 | 100 | 0.561
comM | Vibrio cholerae strain A1552 | 56.048 | 99.799 | 0.559
comM | Glaesserella parasuis strain SC1401 | 54.691 | 100 | 0.551
comM | Legionella pneumophila str. Paris | 49.901 | 100 | 0.507
comM | Legionella pneumophila strain ERS1305867 | 49.901 | 100 | 0.507
RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 46.154 | 100 | 0.471