Detailed information
Overview
| Name | comA | Type | Machinery gene |
| Locus tag | IF189_RS20695 | Genome accession | NZ_CP062251 |
| Coordinates | 4624845..4627088 (-) | Length | 747 a.a. |
| NCBI ID | WP_192552189.1 | Uniprot ID | - |
| Organism | Pseudomonas sp. IzPS59 | ||
| Function | ssDNA transport through the inner membrane (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 4619845..4632088
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| IF189_RS20660 | murB | 4620245..4621264 (-) | 1020 | WP_134827763.1 | UDP-N-acetylmuramate dehydrogenase | - |
| IF189_RS20665 | - | 4621261..4621725 (-) | 465 | WP_134827764.1 | low molecular weight protein-tyrosine-phosphatase | - |
| IF189_RS20670 | kdsB | 4621725..4622489 (-) | 765 | WP_134827765.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
| IF189_RS20675 | - | 4622486..4622671 (-) | 186 | WP_003179363.1 | Trm112 family protein | - |
| IF189_RS20680 | lpxK | 4622696..4623706 (-) | 1011 | WP_192552188.1 | tetraacyldisaccharide 4'-kinase | - |
| IF189_RS20685 | - | 4623706..4624134 (-) | 429 | WP_085710944.1 | ExbD/TolR family protein | - |
| IF189_RS20690 | exbB | 4624131..4624766 (-) | 636 | WP_162803506.1 | MotA/TolQ/ExbB proton channel family protein | Machinery gene |
| IF189_RS20695 | comA | 4624845..4627088 (-) | 2244 | WP_192552189.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| IF189_RS20700 | - | 4627225..4627743 (+) | 519 | WP_085710946.1 | DUF2062 domain-containing protein | - |
| IF189_RS20705 | - | 4627798..4628577 (-) | 780 | WP_085710947.1 | ABC transporter permease | - |
| IF189_RS20710 | - | 4628574..4629506 (-) | 933 | WP_134827768.1 | ABC transporter ATP-binding protein | - |
| IF189_RS20715 | - | 4629638..4630261 (-) | 624 | WP_134827769.1 | glutathione S-transferase family protein | - |
| IF189_RS20720 | - | 4630305..4630964 (-) | 660 | WP_192552190.1 | transglutaminase-like domain-containing protein | - |
Sequence
Protein
Download Length: 747 a.a. Molecular weight: 81914.92 Da Isoelectric Point: 9.9145
>NTDB_id=487335 IF189_RS20695 WP_192552189.1 4624845..4627088(-) (comA) [Pseudomonas sp. IzPS59]
MRTGMMALAVGLLAPVFLPALPPVGWVLWLPVVALMLLPFRSYPLAFLLFGFAWSCIGAQWALDERLPAELDGETRWVEG
RVFGLPQSGDGVVRFELADARSRHGKLPSLMRLAWYDGPPVNSGERWRLAVKLKRPGGLLNPDAFDYEAWLLAQRIGATG
TVKDGNRLAAAQWAWRDGIRKRLLEVDTQGRGAALAALVLGDGSGLSREDWQILQDTGTVHLLVISGQHIGLLAAVMYGL
VAGLARFGVWPLRWPWLPWACGLAFAAALGYGLLAGFDVPVRRACVMVALVLLWRLRFRHLGAWWPLLLAFNGVLLMDPL
ASLRPGLWLSFAAVAVLIFTFGGRLGPWRWWQTWTRAQWLIAIGLGPVLLILGLPISLSGPLANLLAVPWISFAVLPPAL
LGTLLLPIPYVGEDLLWLAGGLVDGLFRALALIAGRWPAWVAPLMPGWILALGCIGAVLLLLPRGIPLRPLGWPLLLVLA
FPPRERLAENVADVWQLDVGQGLAILIRTRHHTLLYDTGPRFGDFDVGERVVLPALHKLGVERLDLMLLSHADADHAGGA
LAVANGLPVSRVISGDPPGLPEALNAETCESGRQWQWDGVAFHLWQWGDAQDSNQRSCVLQIEANGERLLLTGDIDTGAE
RDLLNSALAVPTQWLQAPHHGSRSSSSMALLKVLQPRDVLISRGQGNSFGHPHPTVIARYRKQGLRIHDSAEQGAIHLQL
GRFQPARSMRLQRRFWRDPPQPGATHR
MRTGMMALAVGLLAPVFLPALPPVGWVLWLPVVALMLLPFRSYPLAFLLFGFAWSCIGAQWALDERLPAELDGETRWVEG
RVFGLPQSGDGVVRFELADARSRHGKLPSLMRLAWYDGPPVNSGERWRLAVKLKRPGGLLNPDAFDYEAWLLAQRIGATG
TVKDGNRLAAAQWAWRDGIRKRLLEVDTQGRGAALAALVLGDGSGLSREDWQILQDTGTVHLLVISGQHIGLLAAVMYGL
VAGLARFGVWPLRWPWLPWACGLAFAAALGYGLLAGFDVPVRRACVMVALVLLWRLRFRHLGAWWPLLLAFNGVLLMDPL
ASLRPGLWLSFAAVAVLIFTFGGRLGPWRWWQTWTRAQWLIAIGLGPVLLILGLPISLSGPLANLLAVPWISFAVLPPAL
LGTLLLPIPYVGEDLLWLAGGLVDGLFRALALIAGRWPAWVAPLMPGWILALGCIGAVLLLLPRGIPLRPLGWPLLLVLA
FPPRERLAENVADVWQLDVGQGLAILIRTRHHTLLYDTGPRFGDFDVGERVVLPALHKLGVERLDLMLLSHADADHAGGA
LAVANGLPVSRVISGDPPGLPEALNAETCESGRQWQWDGVAFHLWQWGDAQDSNQRSCVLQIEANGERLLLTGDIDTGAE
RDLLNSALAVPTQWLQAPHHGSRSSSSMALLKVLQPRDVLISRGQGNSFGHPHPTVIARYRKQGLRIHDSAEQGAIHLQL
GRFQPARSMRLQRRFWRDPPQPGATHR
Nucleotide
Download Length: 2244 bp
>NTDB_id=487335 IF189_RS20695 WP_192552189.1 4624845..4627088(-) (comA) [Pseudomonas sp. IzPS59]
ATGCGCACAGGGATGATGGCGCTGGCAGTCGGTCTGCTGGCCCCGGTTTTTTTGCCGGCCTTGCCGCCGGTCGGGTGGGT
GCTGTGGCTGCCGGTGGTGGCACTGATGCTATTGCCGTTTCGCAGTTATCCGCTGGCGTTTTTGCTGTTCGGCTTCGCCT
GGTCGTGTATCGGTGCGCAGTGGGCGCTGGATGAGCGACTGCCAGCGGAGCTGGACGGCGAGACGCGCTGGGTCGAAGGG
CGGGTGTTCGGCCTGCCGCAGAGCGGCGACGGGGTCGTGCGCTTTGAGCTGGCGGATGCCCGTTCGCGTCACGGGAAACT
GCCTTCGCTGATGCGCCTGGCCTGGTACGACGGGCCACCGGTCAACAGCGGCGAGCGCTGGCGGCTGGCGGTCAAACTCA
AACGTCCCGGCGGGCTGCTCAACCCCGATGCCTTCGATTACGAGGCGTGGCTGTTGGCGCAACGCATCGGCGCCACCGGC
ACGGTGAAAGACGGCAACCGGCTGGCGGCGGCGCAATGGGCCTGGCGGGACGGCATCCGAAAGCGTCTGCTGGAGGTGGA
TACACAGGGCCGGGGTGCTGCGCTGGCGGCATTGGTGCTGGGCGATGGCTCGGGGCTCAGTCGCGAGGACTGGCAGATCC
TGCAGGACACCGGCACCGTGCACTTGTTGGTGATTTCCGGACAGCACATCGGTCTGCTGGCGGCGGTGATGTACGGGCTG
GTCGCCGGGCTGGCGCGATTCGGTGTGTGGCCGTTGCGTTGGCCATGGCTGCCGTGGGCCTGTGGTCTGGCGTTCGCGGC
AGCGTTGGGGTACGGGCTGCTGGCCGGGTTCGATGTGCCGGTGCGACGGGCCTGTGTGATGGTTGCGCTGGTGTTGCTGT
GGCGATTGCGCTTTCGCCATCTGGGTGCCTGGTGGCCGTTGTTGCTGGCCTTCAATGGGGTGTTGTTGATGGACCCGCTG
GCCAGCCTGCGTCCGGGATTGTGGTTGTCCTTCGCGGCGGTGGCAGTGCTGATTTTTACCTTCGGCGGTCGTCTGGGGCC
GTGGCGCTGGTGGCAGACCTGGACGCGAGCCCAATGGCTGATCGCGATCGGTTTAGGCCCGGTGTTGCTGATCCTGGGCC
TGCCGATCAGCCTCAGCGGGCCATTGGCCAATCTGCTGGCGGTGCCGTGGATCAGTTTTGCGGTGCTCCCGCCGGCATTG
CTCGGCACTTTGTTGTTGCCGATTCCCTATGTCGGCGAAGACCTGTTGTGGCTGGCCGGCGGACTGGTCGATGGATTGTT
CCGGGCTCTGGCCTTGATTGCAGGGCGCTGGCCGGCATGGGTTGCGCCATTGATGCCTGGATGGATCCTGGCGCTGGGCT
GCATCGGTGCCGTGTTGTTATTGCTGCCCCGAGGCATTCCATTGCGTCCCCTGGGCTGGCCGTTGTTGCTGGTGCTGGCA
TTTCCGCCCCGGGAGCGGCTGGCCGAAAACGTGGCGGATGTCTGGCAACTCGATGTCGGCCAGGGGCTGGCGATTCTGAT
CCGCACCCGCCATCACACATTGCTGTACGACACCGGGCCGCGCTTCGGCGATTTCGATGTCGGCGAGCGAGTGGTGCTGC
CAGCCTTGCACAAACTGGGAGTGGAGCGACTCGATCTGATGTTGCTCAGTCATGCCGACGCCGACCATGCCGGTGGGGCG
CTGGCCGTGGCAAACGGCTTGCCGGTCAGTCGGGTGATCAGCGGCGATCCGCCGGGGCTGCCCGAAGCGCTGAACGCTGA
GACCTGTGAAAGCGGTCGGCAATGGCAGTGGGACGGCGTTGCCTTTCATCTATGGCAGTGGGGGGACGCTCAGGACAGTA
ACCAGCGTTCCTGTGTCCTGCAGATCGAAGCCAACGGCGAGCGATTGCTGTTGACCGGCGATATCGACACCGGCGCCGAA
CGGGATTTGCTCAACAGCGCGCTGGCGGTGCCCACCCAGTGGTTGCAGGCCCCGCACCATGGCAGTCGCAGCTCCTCATC
GATGGCGTTGCTCAAGGTTTTGCAGCCTCGGGACGTGCTGATCTCCCGGGGGCAGGGCAATTCGTTCGGGCATCCGCATC
CGACCGTCATCGCCCGTTACCGCAAGCAAGGTTTGCGCATCCATGACAGCGCCGAACAGGGTGCCATTCATCTGCAACTG
GGCCGATTTCAGCCGGCCCGGTCGATGCGTCTGCAACGCCGGTTCTGGCGCGACCCGCCGCAGCCGGGGGCGACGCACCG
TTGA
ATGCGCACAGGGATGATGGCGCTGGCAGTCGGTCTGCTGGCCCCGGTTTTTTTGCCGGCCTTGCCGCCGGTCGGGTGGGT
GCTGTGGCTGCCGGTGGTGGCACTGATGCTATTGCCGTTTCGCAGTTATCCGCTGGCGTTTTTGCTGTTCGGCTTCGCCT
GGTCGTGTATCGGTGCGCAGTGGGCGCTGGATGAGCGACTGCCAGCGGAGCTGGACGGCGAGACGCGCTGGGTCGAAGGG
CGGGTGTTCGGCCTGCCGCAGAGCGGCGACGGGGTCGTGCGCTTTGAGCTGGCGGATGCCCGTTCGCGTCACGGGAAACT
GCCTTCGCTGATGCGCCTGGCCTGGTACGACGGGCCACCGGTCAACAGCGGCGAGCGCTGGCGGCTGGCGGTCAAACTCA
AACGTCCCGGCGGGCTGCTCAACCCCGATGCCTTCGATTACGAGGCGTGGCTGTTGGCGCAACGCATCGGCGCCACCGGC
ACGGTGAAAGACGGCAACCGGCTGGCGGCGGCGCAATGGGCCTGGCGGGACGGCATCCGAAAGCGTCTGCTGGAGGTGGA
TACACAGGGCCGGGGTGCTGCGCTGGCGGCATTGGTGCTGGGCGATGGCTCGGGGCTCAGTCGCGAGGACTGGCAGATCC
TGCAGGACACCGGCACCGTGCACTTGTTGGTGATTTCCGGACAGCACATCGGTCTGCTGGCGGCGGTGATGTACGGGCTG
GTCGCCGGGCTGGCGCGATTCGGTGTGTGGCCGTTGCGTTGGCCATGGCTGCCGTGGGCCTGTGGTCTGGCGTTCGCGGC
AGCGTTGGGGTACGGGCTGCTGGCCGGGTTCGATGTGCCGGTGCGACGGGCCTGTGTGATGGTTGCGCTGGTGTTGCTGT
GGCGATTGCGCTTTCGCCATCTGGGTGCCTGGTGGCCGTTGTTGCTGGCCTTCAATGGGGTGTTGTTGATGGACCCGCTG
GCCAGCCTGCGTCCGGGATTGTGGTTGTCCTTCGCGGCGGTGGCAGTGCTGATTTTTACCTTCGGCGGTCGTCTGGGGCC
GTGGCGCTGGTGGCAGACCTGGACGCGAGCCCAATGGCTGATCGCGATCGGTTTAGGCCCGGTGTTGCTGATCCTGGGCC
TGCCGATCAGCCTCAGCGGGCCATTGGCCAATCTGCTGGCGGTGCCGTGGATCAGTTTTGCGGTGCTCCCGCCGGCATTG
CTCGGCACTTTGTTGTTGCCGATTCCCTATGTCGGCGAAGACCTGTTGTGGCTGGCCGGCGGACTGGTCGATGGATTGTT
CCGGGCTCTGGCCTTGATTGCAGGGCGCTGGCCGGCATGGGTTGCGCCATTGATGCCTGGATGGATCCTGGCGCTGGGCT
GCATCGGTGCCGTGTTGTTATTGCTGCCCCGAGGCATTCCATTGCGTCCCCTGGGCTGGCCGTTGTTGCTGGTGCTGGCA
TTTCCGCCCCGGGAGCGGCTGGCCGAAAACGTGGCGGATGTCTGGCAACTCGATGTCGGCCAGGGGCTGGCGATTCTGAT
CCGCACCCGCCATCACACATTGCTGTACGACACCGGGCCGCGCTTCGGCGATTTCGATGTCGGCGAGCGAGTGGTGCTGC
CAGCCTTGCACAAACTGGGAGTGGAGCGACTCGATCTGATGTTGCTCAGTCATGCCGACGCCGACCATGCCGGTGGGGCG
CTGGCCGTGGCAAACGGCTTGCCGGTCAGTCGGGTGATCAGCGGCGATCCGCCGGGGCTGCCCGAAGCGCTGAACGCTGA
GACCTGTGAAAGCGGTCGGCAATGGCAGTGGGACGGCGTTGCCTTTCATCTATGGCAGTGGGGGGACGCTCAGGACAGTA
ACCAGCGTTCCTGTGTCCTGCAGATCGAAGCCAACGGCGAGCGATTGCTGTTGACCGGCGATATCGACACCGGCGCCGAA
CGGGATTTGCTCAACAGCGCGCTGGCGGTGCCCACCCAGTGGTTGCAGGCCCCGCACCATGGCAGTCGCAGCTCCTCATC
GATGGCGTTGCTCAAGGTTTTGCAGCCTCGGGACGTGCTGATCTCCCGGGGGCAGGGCAATTCGTTCGGGCATCCGCATC
CGACCGTCATCGCCCGTTACCGCAAGCAAGGTTTGCGCATCCATGACAGCGCCGAACAGGGTGCCATTCATCTGCAACTG
GGCCGATTTCAGCCGGCCCGGTCGATGCGTCTGCAACGCCGGTTCTGGCGCGACCCGCCGCAGCCGGGGGCGACGCACCG
TTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA | Pseudomonas stutzeri DSM 10701 |
62.17 |
96.252 |
0.598 |
| comA | Ralstonia pseudosolanacearum GMI1000 |
34.694 |
100 |
0.364 |