Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: MM141_RS27975
Genome accession: NZ_CP093012
Coordinates: 6022317..6023810 (+)
Length: 497 a.a.
NCBI ID: WP_003457843.1
UniProt ID: -
Organism: Pseudomonas aeruginosa strain H20
Function: DNA uptake (predicted from homology)
DNA binding and uptake
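
The coordinates and the two lengths reported in this record are mutually consistent, which can be checked with a few lines of Python. This is only a sketch: the helper names `feature_length_bp` and `protein_length_aa` are hypothetical, and it assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon.

```python
def feature_length_bp(coords: str) -> int:
    """Length in bp of a feature given as 'start..end' (1-based, inclusive)."""
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


def protein_length_aa(coords: str) -> int:
    """Protein length, assuming the feature is a CDS ending in one stop codon."""
    length = feature_length_bp(coords)
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length // 3 - 1  # drop the stop codon


print(feature_length_bp("6022317..6023810"))  # 1494
print(protein_length_aa("6022317..6023810"))  # 497
```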

Genomic Context


Location: 6017317..6028810
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MM141_RS27945 (MM141_27955) - 6017460..6018374 (-) 915 WP_003118059.1 fimbrial protein -
  MM141_RS27950 (MM141_27960) sutA 6018789..6019106 (-) 318 WP_003098240.1 transcriptional regulator SutA -
  MM141_RS27955 (MM141_27965) - 6019184..6019609 (-) 426 WP_003096441.1 secondary thiamine-phosphate synthase enzyme YjbQ -
  MM141_RS27960 (MM141_27970) - 6019870..6021198 (-) 1329 WP_003109815.1 ammonium transporter -
  MM141_RS27965 (MM141_27975) glnK 6021238..6021576 (-) 339 WP_003096476.1 P-II family nitrogen regulator -
  MM141_RS27970 (MM141_27980) - 6022016..6022276 (+) 261 WP_003096478.1 accessory factor UbiK family protein -
  MM141_RS27975 (MM141_27985) comM 6022317..6023810 (+) 1494 WP_003457843.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  MM141_RS27980 (MM141_27990) betT 6023935..6025920 (+) 1986 WP_003096496.1 choline BCCT transporter BetT -
  MM141_RS27985 (MM141_27995) pchP 6025963..6027012 (-) 1050 WP_009877249.1 phosphorylcholine phosphatase -
  MM141_RS27990 (MM141_28000) - 6027163..6028080 (-) 918 WP_003098255.1 LysR substrate-binding domain-containing protein -
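
The table above can also be read numerically. The sketch below uses hypothetical helper names and assumes 1-based inclusive coordinates. It computes the spacing between comM and its same-strand neighbours, and shows that the listed location matches the gene coordinates extended by 5000 bp on each side. That 5000 bp flank is inferred from the numbers in this record, not stated by the source.

```python
def parse_coords(text: str) -> tuple[int, int]:
    """Split 'start..end' into a pair of integers."""
    start, end = text.split("..")
    return int(start), int(end)


def intergenic_gap(upstream: str, downstream: str) -> int:
    """Bases strictly between two features; a negative value means overlap."""
    _, up_end = parse_coords(upstream)
    down_start, _ = parse_coords(downstream)
    return down_start - up_end - 1


def flanked_window(coords: str, flank: int) -> tuple[int, int]:
    """Extend a feature by `flank` bp on both sides."""
    start, end = parse_coords(coords)
    return start - flank, end + flank


comM = "6022317..6023810"
print(intergenic_gap("6022016..6022276", comM))  # 40  (MM141_RS27970 to comM)
print(intergenic_gap(comM, "6023935..6025920"))  # 124 (comM to betT)
print(flanked_window(comM, 5000))                # (6017317, 6028810)
```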

Sequence


Protein


Length: 497 a.a.        Molecular weight: 53083.02 Da        Isoelectric Point: 7.9587

>NTDB_id=661345 MM141_RS27975 WP_003457843.1 6022317..6023810(+) (comM) [Pseudomonas aeruginosa strain H20]
MSLAIVHSRAQVGVEAPCVSVEAHLANGLPSLTLVGLPETAVRESKDRVRSALLNAGFDFPARRITLNLAPADLPKDGGR
FDLAIALGILAASGQLPGTALDGLECLGELALSGAIRPVRGVLPAALAARDARRVLVVPKENAEEASLASGLTVFAVDHL
LEIAGHLSGQAPLLPYQARGLLRAPFPYPDLAEVQGQAAAKRALLVAAAGAHNLLLSGPPGTGKTLLASRLPGLLPALDE
DEALEVAAIHSVASHVPLRHWPQRPFRQPHHSASAPALVGGGSRPQPGEITLAHQGVLFLDELPEFERKVLEVLREPLES
GEIVIARANGRVRFPARFQLVAAMNPCPCGYLGDPSGRCRCTPEQVQRYRGKLSGPLLDRIDLHVSVLRESTSLQPGHGE
TATAEISERVGAARQRQLARQGCANAHLDLQAMHRNCALAEADRRWLEAAGERLELSLRALHRILKVARTLADLERIDAI
ERRHLAEALQYRATTST

Nucleotide


Length: 1494 bp

>NTDB_id=661345 MM141_RS27975 WP_003457843.1 6022317..6023810(+) (comM) [Pseudomonas aeruginosa strain H20]
ATGTCCCTGGCGATTGTCCACAGCCGAGCCCAGGTCGGCGTCGAAGCCCCCTGCGTCAGCGTCGAGGCGCACCTGGCCAA
CGGCCTGCCTTCGCTGACCCTGGTCGGCCTGCCGGAAACCGCGGTGCGCGAGAGCAAGGACCGCGTGCGCAGCGCCCTGC
TCAATGCCGGTTTCGACTTCCCCGCGCGGCGCATCACCCTCAACCTCGCCCCCGCCGACCTGCCCAAGGACGGCGGTCGC
TTCGACCTGGCCATCGCACTCGGCATCCTCGCCGCCAGCGGCCAGTTGCCCGGCACCGCCCTCGACGGCCTGGAGTGCCT
TGGCGAACTGGCCCTGTCCGGGGCGATCCGGCCAGTGCGAGGCGTATTGCCGGCCGCGCTGGCGGCGCGCGACGCAAGGC
GCGTTCTGGTGGTACCGAAGGAAAATGCCGAAGAGGCCAGCCTGGCCAGCGGGCTGACGGTGTTCGCCGTGGACCACCTG
CTGGAGATCGCCGGACACCTCTCCGGCCAGGCCCCGCTGCTGCCCTACCAGGCCCGCGGCCTGCTCCGCGCGCCCTTCCC
TTATCCAGACCTGGCCGAGGTCCAGGGCCAGGCCGCCGCCAAGCGCGCCCTGCTGGTGGCCGCCGCCGGCGCGCACAACC
TGTTGCTCAGCGGCCCGCCGGGCACCGGCAAGACCCTCCTGGCCAGCCGCCTGCCCGGCCTGCTGCCGGCGCTCGACGAG
GACGAGGCCCTGGAGGTCGCAGCGATCCATTCGGTGGCCAGCCACGTCCCCCTCAGGCACTGGCCGCAGCGACCGTTCCG
CCAGCCGCACCACTCCGCCTCCGCGCCGGCCCTGGTCGGCGGCGGCAGCCGCCCGCAGCCGGGCGAGATCACCCTGGCGC
ACCAGGGCGTGCTGTTCCTCGACGAACTGCCGGAGTTCGAGCGCAAGGTCCTGGAGGTCCTGCGCGAGCCGCTGGAAAGC
GGCGAGATCGTCATTGCCCGGGCCAACGGCCGGGTACGTTTCCCGGCGCGCTTCCAACTGGTGGCGGCGATGAATCCCTG
TCCCTGTGGCTACCTCGGCGATCCCAGCGGCCGCTGCCGCTGCACCCCGGAACAGGTCCAGCGCTACCGGGGCAAGCTGT
CCGGACCGCTGCTCGATCGCATCGACCTGCACGTCAGCGTGCTCCGCGAAAGCACCAGCCTGCAGCCAGGACACGGCGAA
ACCGCTACCGCCGAGATCAGCGAACGGGTTGGCGCCGCACGGCAACGGCAACTGGCCCGCCAGGGCTGCGCCAATGCCCA
TCTCGACCTCCAGGCGATGCACCGCAATTGTGCACTCGCCGAAGCGGACCGCCGCTGGCTGGAGGCTGCCGGAGAGCGCC
TGGAACTTTCCTTGCGCGCCTTGCATCGCATACTCAAGGTGGCCCGGACGCTGGCCGACCTGGAGCGCATCGATGCCATC
GAACGCCGGCACCTGGCGGAAGCCCTGCAGTATCGGGCAACGACCTCCACGTGA
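
The nucleotide and protein records can be cross-checked by translating one into the other. The following is a minimal sketch with a hypothetical `translate` helper. It assumes the standard codon assignments (which the bacterial translation table shares for elongation codons) and stops at the first in-frame stop codon. Only short fragments copied from the start and end of the sequence above are shown.

```python
BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGTCCCTGGCGATTGTCCAC"))  # MSLAIVH (first seven residues)
print(translate("ACGACCTCCACGTGA"))        # TTST (last four residues, then TGA stop)
```

Applied to the full 1494 bp sequence, the same function should yield the 497-residue protein listed above, provided the record is internally consistent.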


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4 56.452 99.799 0.563
  comM Haemophilus influenzae Rd KW20 55.467 100 0.561
  comM Vibrio cholerae strain A1552 56.048 99.799 0.559
  comM Glaesserella parasuis strain SC1401 54.691 100 0.551
  comM Legionella pneumophila str. Paris 49.901 100 0.507
  comM Legionella pneumophila strain ERS1305867 49.901 100 0.507
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 46.154 100 0.471