Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: R6K63_RS31595
Genome accession: NZ_CP137556
Coordinates: 6707729..6709222 (+)
Length: 497 a.a.
NCBI ID: WP_058201325.1
Uniprot ID: -
Organism: Pseudomonas aeruginosa strain CDC1270
Function: DNA uptake (predicted from homology)
DNA binding and uptake
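
The coordinates and the protein length reported in the Overview are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, not part of the record itself: the variable names are illustrative, and it assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon.

```python
# Values copied from the Overview; variable names are illustrative only.
start, end = 6707729, 6709222  # assumed to be 1-based and inclusive

# Inclusive coordinates: length is end - start + 1.
gene_length_bp = end - start + 1

# Assumption: one codon per residue plus one terminal stop codon.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1494 497
```

Under these assumptions the gene spans 1494 bp, which corresponds to 497 residues plus a stop codon.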

Genomic Context


Location: 6702729..6714222
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  R6K63_RS31565 - 6702791..6703705 (-) 915 WP_023107397.1 fimbrial protein -
  R6K63_RS31570 sutA 6704201..6704518 (-) 318 WP_003096440.1 transcriptional regulator SutA -
  R6K63_RS31575 - 6704596..6705021 (-) 426 WP_003096441.1 secondary thiamine-phosphate synthase enzyme YjbQ -
  R6K63_RS31580 - 6705282..6706610 (-) 1329 WP_058201324.1 ammonium transporter -
  R6K63_RS31585 glnK 6706650..6706988 (-) 339 WP_003096476.1 P-II family nitrogen regulator -
  R6K63_RS31590 - 6707428..6707688 (+) 261 WP_003096478.1 accessory factor UbiK family protein -
  R6K63_RS31595 comM 6707729..6709222 (+) 1494 WP_058201325.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  R6K63_RS31600 betT 6709347..6711332 (+) 1986 WP_003096496.1 choline BCCT transporter BetT -
  R6K63_RS31605 pchP 6711375..6712424 (-) 1050 WP_003110458.1 phosphorylcholine phosphatase -
  R6K63_RS31610 - 6712575..6713492 (-) 918 WP_023434403.1 LysR substrate-binding domain-containing protein -
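
Two regularities in the table above can be checked directly: each Size (bp) value equals end - start + 1, and the Location window matches the comM coordinates extended by 5000 bp on each side. The sketch below is illustrative only; the 5000 bp flank is inferred from the numbers and is not stated anywhere in the record, and the helper name `size_bp` is hypothetical.

```python
# Rows copied from the Genomic Context table: (locus tag, start, end, size in bp).
rows = [
    ("R6K63_RS31565", 6702791, 6703705, 915),
    ("R6K63_RS31570", 6704201, 6704518, 318),
    ("R6K63_RS31575", 6704596, 6705021, 426),
    ("R6K63_RS31580", 6705282, 6706610, 1329),
    ("R6K63_RS31585", 6706650, 6706988, 339),
    ("R6K63_RS31590", 6707428, 6707688, 261),
    ("R6K63_RS31595", 6707729, 6709222, 1494),
    ("R6K63_RS31600", 6709347, 6711332, 1986),
    ("R6K63_RS31605", 6711375, 6712424, 1050),
    ("R6K63_RS31610", 6712575, 6713492, 918),
]

comm_start, comm_end = 6707729, 6709222
window = (6702729, 6714222)  # the "Location" line
flank = 5000                 # assumption inferred from the coordinates


def size_bp(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


# Locus tags whose listed size disagrees with their coordinates.
mismatches = [tag for tag, s, e, size in rows if size_bp(s, e) != size]

# Does the window equal the comM interval padded by the assumed flank?
window_matches = (comm_start - flank, comm_end + flank) == window

print(mismatches, window_matches)  # prints: [] True
```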

Sequence


Protein


Length: 497 a.a.        Molecular weight: 53129.11 Da        Isoelectric Point: 7.9587

>NTDB_id=899084 R6K63_RS31595 WP_058201325.1 6707729..6709222(+) (comM) [Pseudomonas aeruginosa strain CDC1270]
MSLAIVHSRAQVGVEAPCVSVEAHLANGLPSLTLVGLPETAVRESKDRVRSALLNAGFDFPARRITLNLAPADLPKDGGR
FDLAIALGILAASGQLPSTALDGLECLGELALSGAIRPVRGVLPAALAARDARRVLVVPKENAEEASLASGLTVFAVDHL
LEIAGHLSGQAPLLPYQARGLLRAPFPYPDLAEVQGQAAAKRALLVAAAGAHNLLLSGPPGTGKTLLASRLPGLLPALDE
DEALEVAAIHSVASHVPLRHWPQRPFRQPHHSASAPALVGGGSRPQPGEITLAHQGVLFLDELPEFERKVLEVLREPLES
GEIVIARANGRVRFPARFQLVAAMNPCPCGYLGDPSGRCRCTPEQVQRYRGKLSGPLLDRIDLHVSVLRESTSLQPGHGE
TATAEVSERVGAARQRQLARQGCANAHLDLQAMHRNCALAEADRRWLEAAGERLELSLRALHRILKVARTLADLERIDAI
ERRHLAEALQYRAMTST
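
The FASTA header above packs several fields into one line. The following Python sketch splits it into named fields with a regular expression. The field layout is inferred from this single header, so the pattern is an assumption and may not fit other records; `HEADER_RE` and `parse_header` are hypothetical names.

```python
import re

header = (">NTDB_id=899084 R6K63_RS31595 WP_058201325.1 "
          "6707729..6709222(+) (comM) [Pseudomonas aeruginosa strain CDC1270]")

# Layout assumed from the header above:
# >NTDB_id=<id> <locus tag> <protein id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line):
    """Return the header fields as a dict, with coordinates as integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


record = parse_header(header)
print(record["gene"], record["start"], record["end"], record["strand"])
```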

Nucleotide


Length: 1494 bp

>NTDB_id=899084 R6K63_RS31595 WP_058201325.1 6707729..6709222(+) (comM) [Pseudomonas aeruginosa strain CDC1270]
ATGTCCCTGGCGATTGTCCACAGCCGAGCCCAGGTCGGCGTCGAAGCCCCCTGCGTCAGCGTCGAGGCGCATCTGGCCAA
TGGCCTGCCTTCGCTGACCCTGGTCGGCCTGCCGGAAACCGCGGTGCGCGAGAGCAAGGACCGCGTGCGCAGCGCCCTGC
TCAATGCCGGTTTCGACTTCCCCGCGCGGCGCATCACCCTCAACCTCGCCCCCGCCGACCTGCCCAAGGACGGCGGTCGC
TTCGACCTGGCCATCGCACTCGGCATCCTCGCCGCCAGCGGCCAGTTGCCCAGCACCGCCCTCGACGGCCTGGAGTGCCT
TGGCGAACTGGCCCTGTCCGGGGCGATCCGGCCAGTGCGAGGCGTATTGCCGGCCGCGCTGGCGGCGCGCGACGCAAGGC
GCGTTCTGGTGGTACCGAAGGAAAATGCCGAAGAGGCCAGCCTGGCCAGCGGGCTGACGGTGTTCGCCGTGGACCACCTG
CTGGAGATCGCCGGACACCTCTCCGGCCAGGCCCCGCTGCTGCCCTACCAGGCCCGCGGCCTGCTCCGCGCGCCCTTCCC
TTATCCAGACCTGGCCGAGGTCCAGGGCCAGGCCGCCGCCAAGCGCGCCCTGCTGGTGGCCGCCGCCGGCGCGCACAACC
TGTTGCTCAGCGGCCCGCCGGGCACCGGCAAGACCCTCCTGGCCAGCCGCCTGCCCGGCCTGCTGCCGGCGCTCGACGAG
GACGAGGCCCTGGAGGTCGCGGCGATCCATTCGGTGGCCAGCCACGTCCCCCTCAGGCACTGGCCGCAGCGACCGTTCCG
CCAGCCGCACCACTCCGCCTCCGCGCCGGCCCTGGTCGGCGGCGGCAGCCGCCCGCAGCCGGGCGAGATCACCCTGGCGC
ACCAGGGCGTGCTGTTCCTCGACGAACTGCCGGAGTTCGAGCGCAAGGTCCTGGAGGTCCTGCGCGAGCCGCTGGAAAGC
GGCGAGATCGTCATTGCCCGGGCCAACGGCCGGGTACGTTTCCCGGCGCGCTTCCAACTGGTGGCGGCGATGAATCCCTG
TCCCTGTGGCTACCTCGGCGATCCCAGCGGCCGCTGCCGCTGCACCCCGGAACAGGTCCAGCGCTACCGGGGCAAGCTGT
CCGGACCGCTGCTCGATCGCATCGACCTGCACGTCAGCGTGCTCCGCGAAAGCACCAGCCTGCAGCCAGGACACGGCGAA
ACCGCTACCGCCGAGGTCAGCGAACGGGTTGGCGCCGCACGGCAACGGCAACTGGCCCGCCAGGGCTGCGCCAATGCCCA
TCTCGACCTCCAGGCGATGCACCGCAATTGTGCACTCGCCGAAGCGGACCGCCGCTGGCTGGAGGCTGCCGGAGAGCGCC
TGGAACTTTCCTTGCGCGCCTTGCATCGCATACTCAAGGTGGCCCGGACGCTGGCCGACCTGGAGCGCATCGATGCCATC
GAACGCCGGCACCTGGCGGAAGCCCTGCAGTATCGGGCAATGACCTCCACGTGA
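
The nucleotide and protein sequences can be cross-checked by translating codons. The sketch below translates only the first 30 and last 18 nucleotides of the sequence above, assuming the standard codon assignments; `translate` is a hypothetical helper, not a tool used by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 18 nt of the sequence above (both in frame).
head = translate("ATGTCCCTGGCGATTGTCCACAGCCGAGCC")
tail = translate("GCAATGACCTCCACGTGA")

print(head, tail)  # prints: MSLAIVHSRA AMTST
```

Both fragments agree with the start (MSLAIVHSRA) and end (AMTST) of the protein sequence listed earlier, and the final codon TGA is a stop.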


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4 56.74 100 0.567
  comM Haemophilus influenzae Rd KW20 55.644 100 0.565
  comM Vibrio cholerae strain A1552 56.338 100 0.563
  comM Glaesserella parasuis strain SC1401 54.98 100 0.555
  comM Legionella pneumophila str. Paris 49.703 100 0.505
  comM Legionella pneumophila strain ERS1305867 49.703 100 0.505
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 45.866 100 0.469