Detailed information
Overview
| Name | comA | Type | Machinery gene |
| Locus tag | WLF14_RS18960 | Genome accession | NZ_CP151816 |
| Coordinates | 4191533..4193746 (-) | Length | 737 a.a. |
| NCBI ID | WP_342309910.1 | Uniprot ID | - |
| Organism | Pseudomonas fluorescens strain IMGN2 | ||
| Function | ssDNA transport through the inner membrane (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 4186533..4198746
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| WLF14_RS18925 (WLF14_18925) | murB | 4186902..4187921 (-) | 1020 | WP_102625486.1 | UDP-N-acetylmuramate dehydrogenase | - |
| WLF14_RS18930 (WLF14_18930) | - | 4187918..4188382 (-) | 465 | WP_102625485.1 | low molecular weight protein-tyrosine-phosphatase | - |
| WLF14_RS18935 (WLF14_18935) | kdsB | 4188382..4189146 (-) | 765 | WP_024076129.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
| WLF14_RS18940 (WLF14_18940) | - | 4189143..4189328 (-) | 186 | WP_003174668.1 | Trm112 family protein | - |
| WLF14_RS18945 (WLF14_18945) | lpxK | 4189353..4190363 (-) | 1011 | WP_102625484.1 | tetraacyldisaccharide 4'-kinase | - |
| WLF14_RS18950 (WLF14_18950) | - | 4190363..4190791 (-) | 429 | WP_102625483.1 | biopolymer transporter ExbD | - |
| WLF14_RS18955 (WLF14_18955) | exbB | 4190788..4191423 (-) | 636 | WP_024076132.1 | MotA/TolQ/ExbB proton channel family protein | Machinery gene |
| WLF14_RS18960 (WLF14_18960) | comA | 4191533..4193746 (-) | 2214 | WP_342309910.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| WLF14_RS18965 (WLF14_18965) | - | 4193894..4194409 (+) | 516 | WP_024076134.1 | DUF2062 domain-containing protein | - |
| WLF14_RS18970 (WLF14_18970) | - | 4194406..4195782 (-) | 1377 | WP_102625481.1 | sensor histidine kinase | - |
| WLF14_RS18975 (WLF14_18975) | - | 4195769..4196497 (-) | 729 | WP_102625480.1 | response regulator transcription factor | - |
Sequence
Protein
Download Length: 737 a.a. Molecular weight: 79848.76 Da Isoelectric Point: 10.6361
>NTDB_id=981002 WLF14_RS18960 WP_342309910.1 4191533..4193746(-) (comA) [Pseudomonas fluorescens strain IMGN2]
MFALALGLLALRFLPALPPAAGVLAMCVLALMLLPFRSHPLAFFLLGLSWACISAQWALDDRLQPALDGQTRWVEGRVTG
LPQQTSTGVRFELTDGRSPKARLPKRIRVAWHGGPPVRSGERWRLAVTLKRPSGVLNFHGFDHEAWLLAQRIGATGSVKD
GERLAPARNAWRDAVRQRLMAVDAQGREAGLAALVLGDGSGLAAEDWRVLQDTGTVHLLVISGQHIGLLAGLIYGLVTLL
ARYGCWPPAWPWLPWACGLAFAAALGYGLLAGFGVPVQRACVMVGLVLLWRMRFRHLGAWWPLLLAFNGVLILEPLASLQ
PGFWLSFAAVAVLVLAFGGRLGRWSAWQAWTRPQWLIAIGLFPLLLVLGLPISLSAPLANLFAVPWISLVVLPLALLGTV
LVAVPFVGEGLLWLAGGALDGLFRALALLAGHLPAWIPAEVPMGYWLVSVAGAVLLLLPRGVPLRPLGWPMLLLAVFPPR
EPVPHGQVEVVQLDVGQGQALVLRTRHHTLLYDAGPRSGEVDLGARVVLPSLRKLGVEVLDMMLLSHADADHAGGAAAIA
RGLPIRRVVGGETRGLPAFLGTEPCVSGEQWEWDGVSFELWQWPDAISGNPKSCVLQVQAKGERMLLTGDIDRAAERAFL
ASPLAVRTDWLQAPHHGSRSSSSWPFVQRLAPTAVLISRGRGNAFGHPHPQVMERYRALGTQVYDSAEQGAVRVQLGAFL
PPTVARSQRRFWREALP
MFALALGLLALRFLPALPPAAGVLAMCVLALMLLPFRSHPLAFFLLGLSWACISAQWALDDRLQPALDGQTRWVEGRVTG
LPQQTSTGVRFELTDGRSPKARLPKRIRVAWHGGPPVRSGERWRLAVTLKRPSGVLNFHGFDHEAWLLAQRIGATGSVKD
GERLAPARNAWRDAVRQRLMAVDAQGREAGLAALVLGDGSGLAAEDWRVLQDTGTVHLLVISGQHIGLLAGLIYGLVTLL
ARYGCWPPAWPWLPWACGLAFAAALGYGLLAGFGVPVQRACVMVGLVLLWRMRFRHLGAWWPLLLAFNGVLILEPLASLQ
PGFWLSFAAVAVLVLAFGGRLGRWSAWQAWTRPQWLIAIGLFPLLLVLGLPISLSAPLANLFAVPWISLVVLPLALLGTV
LVAVPFVGEGLLWLAGGALDGLFRALALLAGHLPAWIPAEVPMGYWLVSVAGAVLLLLPRGVPLRPLGWPMLLLAVFPPR
EPVPHGQVEVVQLDVGQGQALVLRTRHHTLLYDAGPRSGEVDLGARVVLPSLRKLGVEVLDMMLLSHADADHAGGAAAIA
RGLPIRRVVGGETRGLPAFLGTEPCVSGEQWEWDGVSFELWQWPDAISGNPKSCVLQVQAKGERMLLTGDIDRAAERAFL
ASPLAVRTDWLQAPHHGSRSSSSWPFVQRLAPTAVLISRGRGNAFGHPHPQVMERYRALGTQVYDSAEQGAVRVQLGAFL
PPTVARSQRRFWREALP
Nucleotide
Download Length: 2214 bp
>NTDB_id=981002 WLF14_RS18960 WP_342309910.1 4191533..4193746(-) (comA) [Pseudomonas fluorescens strain IMGN2]
ATGTTCGCGCTCGCGCTGGGGCTGCTCGCCTTGCGTTTTTTACCCGCGTTGCCGCCTGCCGCGGGGGTGTTGGCCATGTG
CGTGCTGGCGTTGATGCTGCTGCCGTTTCGCAGCCACCCACTGGCGTTCTTTCTGCTGGGCTTGAGTTGGGCGTGCATCA
GCGCGCAGTGGGCGCTGGATGACCGCCTGCAACCGGCCCTCGATGGCCAGACGCGCTGGGTGGAGGGCCGGGTGACCGGG
TTGCCGCAGCAGACCAGCACGGGCGTGCGTTTCGAACTGACCGACGGCCGGTCCCCCAAGGCGCGTTTACCCAAGCGTAT
TCGCGTGGCCTGGCATGGCGGGCCACCGGTGCGCAGTGGGGAGCGCTGGCGCTTGGCCGTGACGCTCAAGCGGCCATCCG
GCGTGCTCAACTTCCATGGCTTTGATCATGAGGCCTGGCTGTTGGCCCAGCGCATCGGTGCCACCGGCTCGGTGAAAGAT
GGTGAGCGTTTGGCGCCGGCGCGCAACGCTTGGCGCGACGCGGTCCGCCAACGACTGATGGCTGTGGATGCCCAAGGCCG
CGAGGCCGGGCTGGCCGCCTTGGTGCTGGGCGATGGCTCGGGGCTGGCCGCCGAGGATTGGCGTGTGTTGCAGGACACCG
GTACGGTGCACCTGCTGGTGATATCCGGCCAGCATATTGGCTTGCTGGCGGGGTTGATCTATGGGCTGGTCACACTGCTG
GCGCGCTACGGTTGCTGGCCCCCCGCCTGGCCGTGGCTGCCTTGGGCGTGTGGCCTGGCATTTGCTGCCGCGCTGGGCTA
CGGCCTGCTGGCGGGGTTTGGAGTGCCGGTGCAAAGGGCCTGCGTGATGGTGGGGCTGGTATTGCTGTGGCGTATGCGTT
TTCGCCATTTGGGCGCCTGGTGGCCATTGTTACTGGCATTTAATGGCGTGCTGATCCTTGAGCCTTTGGCCAGCCTGCAG
CCGGGGTTCTGGTTGTCGTTCGCGGCTGTCGCGGTATTGGTGCTGGCATTCGGTGGGCGTTTGGGGCGGTGGAGCGCCTG
GCAAGCCTGGACCCGTCCCCAGTGGTTGATCGCGATCGGCCTGTTTCCGCTGTTGCTGGTGCTGGGGTTGCCCATCAGTC
TCAGCGCGCCGTTGGCTAACCTGTTTGCCGTGCCATGGATCAGCCTGGTGGTGTTGCCCTTGGCATTGTTGGGCACTGTG
CTGGTGGCGGTGCCCTTTGTGGGGGAGGGGCTGTTATGGCTGGCGGGTGGGGCGCTGGATGGGTTGTTCCGCGCCTTGGC
GCTGCTGGCCGGGCATCTGCCGGCATGGATACCGGCCGAGGTGCCGATGGGCTATTGGCTGGTGAGCGTCGCGGGTGCAG
TGTTACTGCTGTTGCCCAGGGGCGTACCGTTACGACCGCTGGGGTGGCCGATGCTGTTGTTGGCGGTGTTTCCGCCGCGG
GAGCCGGTGCCCCATGGGCAGGTTGAGGTGGTGCAACTGGATGTCGGTCAGGGGCAGGCGCTGGTGCTGCGCACCCGTCA
TCACACCTTGCTCTACGATGCGGGTCCGCGCTCGGGGGAAGTCGATCTTGGCGCGCGTGTGGTATTGCCGTCATTGAGAA
AACTCGGGGTGGAGGTATTGGACATGATGCTGCTCAGCCACGCCGACGCCGACCATGCCGGCGGTGCGGCGGCTATTGCC
CGTGGGCTGCCGATCAGACGGGTAGTCGGAGGCGAAACACGAGGCTTGCCGGCGTTTCTCGGCACTGAACCCTGTGTCAG
CGGTGAACAGTGGGAGTGGGATGGGGTGTCATTTGAACTGTGGCAGTGGCCAGATGCTATTAGTGGTAACCCGAAATCCT
GTGTATTACAGGTCCAGGCCAAGGGTGAGCGCATGTTGCTCACGGGGGATATTGATCGCGCCGCCGAGCGGGCTTTTCTT
GCCTCGCCCTTGGCTGTGCGGACCGATTGGTTGCAGGCGCCCCATCATGGCAGCCGCAGTTCTTCGTCCTGGCCCTTTGT
GCAGCGGCTGGCGCCCACGGCGGTGCTGATTTCCAGGGGGCGAGGCAACGCGTTCGGCCACCCCCACCCCCAGGTGATGG
AACGCTACCGGGCACTGGGTACCCAGGTTTATGACAGTGCTGAACAAGGGGCTGTGCGTGTGCAATTGGGGGCGTTCCTG
CCACCGACTGTTGCGCGCAGTCAACGCCGTTTCTGGCGCGAAGCGTTACCGTAA
ATGTTCGCGCTCGCGCTGGGGCTGCTCGCCTTGCGTTTTTTACCCGCGTTGCCGCCTGCCGCGGGGGTGTTGGCCATGTG
CGTGCTGGCGTTGATGCTGCTGCCGTTTCGCAGCCACCCACTGGCGTTCTTTCTGCTGGGCTTGAGTTGGGCGTGCATCA
GCGCGCAGTGGGCGCTGGATGACCGCCTGCAACCGGCCCTCGATGGCCAGACGCGCTGGGTGGAGGGCCGGGTGACCGGG
TTGCCGCAGCAGACCAGCACGGGCGTGCGTTTCGAACTGACCGACGGCCGGTCCCCCAAGGCGCGTTTACCCAAGCGTAT
TCGCGTGGCCTGGCATGGCGGGCCACCGGTGCGCAGTGGGGAGCGCTGGCGCTTGGCCGTGACGCTCAAGCGGCCATCCG
GCGTGCTCAACTTCCATGGCTTTGATCATGAGGCCTGGCTGTTGGCCCAGCGCATCGGTGCCACCGGCTCGGTGAAAGAT
GGTGAGCGTTTGGCGCCGGCGCGCAACGCTTGGCGCGACGCGGTCCGCCAACGACTGATGGCTGTGGATGCCCAAGGCCG
CGAGGCCGGGCTGGCCGCCTTGGTGCTGGGCGATGGCTCGGGGCTGGCCGCCGAGGATTGGCGTGTGTTGCAGGACACCG
GTACGGTGCACCTGCTGGTGATATCCGGCCAGCATATTGGCTTGCTGGCGGGGTTGATCTATGGGCTGGTCACACTGCTG
GCGCGCTACGGTTGCTGGCCCCCCGCCTGGCCGTGGCTGCCTTGGGCGTGTGGCCTGGCATTTGCTGCCGCGCTGGGCTA
CGGCCTGCTGGCGGGGTTTGGAGTGCCGGTGCAAAGGGCCTGCGTGATGGTGGGGCTGGTATTGCTGTGGCGTATGCGTT
TTCGCCATTTGGGCGCCTGGTGGCCATTGTTACTGGCATTTAATGGCGTGCTGATCCTTGAGCCTTTGGCCAGCCTGCAG
CCGGGGTTCTGGTTGTCGTTCGCGGCTGTCGCGGTATTGGTGCTGGCATTCGGTGGGCGTTTGGGGCGGTGGAGCGCCTG
GCAAGCCTGGACCCGTCCCCAGTGGTTGATCGCGATCGGCCTGTTTCCGCTGTTGCTGGTGCTGGGGTTGCCCATCAGTC
TCAGCGCGCCGTTGGCTAACCTGTTTGCCGTGCCATGGATCAGCCTGGTGGTGTTGCCCTTGGCATTGTTGGGCACTGTG
CTGGTGGCGGTGCCCTTTGTGGGGGAGGGGCTGTTATGGCTGGCGGGTGGGGCGCTGGATGGGTTGTTCCGCGCCTTGGC
GCTGCTGGCCGGGCATCTGCCGGCATGGATACCGGCCGAGGTGCCGATGGGCTATTGGCTGGTGAGCGTCGCGGGTGCAG
TGTTACTGCTGTTGCCCAGGGGCGTACCGTTACGACCGCTGGGGTGGCCGATGCTGTTGTTGGCGGTGTTTCCGCCGCGG
GAGCCGGTGCCCCATGGGCAGGTTGAGGTGGTGCAACTGGATGTCGGTCAGGGGCAGGCGCTGGTGCTGCGCACCCGTCA
TCACACCTTGCTCTACGATGCGGGTCCGCGCTCGGGGGAAGTCGATCTTGGCGCGCGTGTGGTATTGCCGTCATTGAGAA
AACTCGGGGTGGAGGTATTGGACATGATGCTGCTCAGCCACGCCGACGCCGACCATGCCGGCGGTGCGGCGGCTATTGCC
CGTGGGCTGCCGATCAGACGGGTAGTCGGAGGCGAAACACGAGGCTTGCCGGCGTTTCTCGGCACTGAACCCTGTGTCAG
CGGTGAACAGTGGGAGTGGGATGGGGTGTCATTTGAACTGTGGCAGTGGCCAGATGCTATTAGTGGTAACCCGAAATCCT
GTGTATTACAGGTCCAGGCCAAGGGTGAGCGCATGTTGCTCACGGGGGATATTGATCGCGCCGCCGAGCGGGCTTTTCTT
GCCTCGCCCTTGGCTGTGCGGACCGATTGGTTGCAGGCGCCCCATCATGGCAGCCGCAGTTCTTCGTCCTGGCCCTTTGT
GCAGCGGCTGGCGCCCACGGCGGTGCTGATTTCCAGGGGGCGAGGCAACGCGTTCGGCCACCCCCACCCCCAGGTGATGG
AACGCTACCGGGCACTGGGTACCCAGGTTTATGACAGTGCTGAACAAGGGGCTGTGCGTGTGCAATTGGGGGCGTTCCTG
CCACCGACTGTTGCGCGCAGTCAACGCCGTTTCTGGCGCGAAGCGTTACCGTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA | Pseudomonas stutzeri DSM 10701 |
60.083 |
97.558 |
0.586 |
| comA | Ralstonia pseudosolanacearum GMI1000 |
35.714 |
100 |
0.38 |