Detailed information
Overview
| Name | comA | Type | Machinery gene |
| Locus tag | E5221_RS08560 | Genome accession | NZ_CP039127 |
| Coordinates | 1832992..1835202 (+) | Length | 736 a.a. |
| NCBI ID | WP_247847124.1 | Uniprot ID | - |
| Organism | Pseudomonas sp. A2 | | |
| Function | ssDNA transport through the inner membrane (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 1827992..1840202
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| E5221_RS08540 (E5221_08805) | - | 1829857..1830480 (+) | 624 | WP_182328560.1 | glutathione S-transferase | - |
| E5221_RS08545 (E5221_08810) | - | 1830625..1831557 (+) | 933 | WP_013971560.1 | ABC transporter ATP-binding protein | - |
| E5221_RS08550 (E5221_08815) | - | 1831554..1832333 (+) | 780 | WP_015269397.1 | ABC transporter permease | - |
| E5221_RS08555 (E5221_08820) | - | 1832343..1832849 (-) | 507 | Protein_1664 | DUF2062 domain-containing protein | - |
| E5221_RS08560 (E5221_08825) | comA | 1832992..1835202 (+) | 2211 | WP_247847124.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| E5221_RS08565 (E5221_08830) | exbB | 1835618..1836253 (+) | 636 | WP_003259970.1 | MotA/TolQ/ExbB proton channel family protein | Machinery gene |
| E5221_RS08570 (E5221_08835) | - | 1836250..1836684 (+) | 435 | WP_013971564.1 | biopolymer transporter ExbD | - |
| E5221_RS08575 (E5221_08840) | lpxK | 1836684..1837685 (+) | 1002 | WP_247847126.1 | tetraacyldisaccharide 4'-kinase | - |
| E5221_RS08580 (E5221_08845) | - | 1837730..1837915 (+) | 186 | WP_011532874.1 | Trm112 family protein | - |
| E5221_RS08585 (E5221_08850) | kdsB | 1837912..1838676 (+) | 765 | WP_182328556.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
| E5221_RS08590 (E5221_08855) | - | 1838676..1839140 (+) | 465 | WP_003259973.1 | low molecular weight protein-tyrosine-phosphatase | - |
| E5221_RS08595 (E5221_08860) | murB | 1839137..1840156 (+) | 1020 | WP_061552362.1 | UDP-N-acetylmuramate dehydrogenase | - |
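
The coordinates, sizes, and protein length in this entry are related by simple arithmetic, which the following minimal Python sketch checks for comA. The helper names (`parse_coords`, `span_bp`, `protein_length`) are hypothetical and not part of any database API. The 5000 bp flank is an assumption inferred from the numbers above (the Location window extends 5000 bp beyond each end of the gene), not a documented setting.

```python
import re


def parse_coords(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    m = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    return int(m.group(1)), int(m.group(2)), m.group(3)


def span_bp(start, end):
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp):
    """Residues encoded by a CDS of cds_bp bases, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Values copied from the comA row of the table above.
start, end, strand = parse_coords("1832992..1835202 (+)")
cds_bp = span_bp(start, end)
print(cds_bp, protein_length(cds_bp))  # 2211 736

# Assumed flank size (inferred from this entry, not documented).
FLANK = 5000
print(f"{start - FLANK}..{end + FLANK}")  # 1827992..1840202
```

The same check applies to any row of the table: the Size (bp) column should equal `end - start + 1` for the listed coordinates.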
Sequence
Protein
Length: 736 a.a.; Molecular weight: 79420.41 Da; Isoelectric point: 10.5462
>NTDB_id=357183 E5221_RS08560 WP_247847124.1 1832992..1835202(+) (comA) [Pseudomonas sp. A2]
MRTGMFALALGLLCLGFLPALPSVGWLVLLAVAGGLGLFTRLWPLGCFLLGLCWACWSAQQALDDRLATGLDGRTLWLEG
RVVGLPTRTAQGVRFELDAARSRRAELPQRLQLSWFDGPPLRAGEQWRLAVTLQRPAGLLNPHGPDREAQLLARRVGATG
TVKAGQLLAPVVGGWRDTLRQRLLAVEANGRQAALVALVLGDGAGLAREDWQTLQATGTVHLLVVSGQHIGLVAGLLYGL
VAGLARWGLWPARLPWVPWACGLAMAAALAYGWLAGGGVPVQRACLMLAVVLLWRLRFRQLGATLPLLLALVAVLLLEPL
AALLPGFWLSFAAVATLVYCFSARLGSWRPWQAWTRAQWVIAIGLLPVLLATGLPVSLSAPLANLVAVPWVSLAVLPLAL
LGALLLPLGGVGEALLWLAGGLLDVLFRLLALAAQQPAWLPAALPLWAWLLVCLGALLVLMPRGVPLRGLGGVMLLALWV
PRETVPFGQVEVWQLDVGQGLAVLLRTRHHNLLYDAGPARGESDLGERVVLPTLRKLGVGSLDLMLVSHAHADHAGGAGA
IMRGLPVERVIGGELLDDLQLQPCVSGEQWMWDGVRFSMWRWADGQSSNDRSCVLLVEAQGERLLLAGDMEAGAERAWLA
ATEAPRIDWLQSPHHGSRSSSSEAFIRATAPRGVLISRGRNNSFGHPHSQVVERYRRHGVVMHDTAEQGALRLVLGRLGE
VEGVRGQRRFWRVRVQ
Nucleotide
Length: 2211 bp
>NTDB_id=357183 E5221_RS08560 WP_247847124.1 1832992..1835202(+) (comA) [Pseudomonas sp. A2]
ATGCGCACAGGGATGTTTGCGCTCGCGCTCGGGCTGTTGTGTCTGGGCTTTCTCCCCGCATTGCCATCGGTCGGATGGCT
GGTGCTGCTGGCTGTAGCCGGTGGCCTCGGCCTGTTTACCCGGCTGTGGCCACTGGGCTGTTTTCTGTTGGGCCTGTGTT
GGGCCTGCTGGTCTGCGCAGCAGGCGCTGGATGACCGCCTGGCAACCGGGCTGGATGGCCGCACCTTGTGGCTTGAGGGC
CGGGTGGTCGGCCTGCCGACCCGGACTGCGCAGGGTGTGCGCTTCGAGCTGGACGCGGCCCGCTCGCGGCGGGCAGAGCT
GCCGCAGCGCCTGCAACTGAGCTGGTTCGACGGCCCGCCGCTGCGCGCGGGCGAGCAGTGGCGGCTGGCGGTGACCTTGC
AGCGCCCGGCCGGGCTGCTCAACCCGCATGGGCCGGACCGTGAGGCGCAATTGCTGGCGCGGCGGGTCGGTGCGACCGGT
ACGGTCAAGGCTGGGCAGTTGCTGGCGCCGGTTGTCGGCGGCTGGCGCGACACGCTGCGCCAGCGCCTGCTGGCAGTCGA
GGCCAATGGCCGGCAGGCAGCGTTGGTGGCGCTGGTGCTTGGCGACGGGGCGGGCCTGGCCCGGGAGGACTGGCAAACGT
TGCAGGCCACCGGCACGGTGCACTTGCTGGTGGTCTCTGGCCAGCACATCGGCCTGGTTGCCGGCTTGCTGTATGGCCTG
GTTGCCGGGCTGGCGCGTTGGGGGCTGTGGCCCGCTCGTCTGCCGTGGGTGCCCTGGGCGTGTGGCCTGGCCATGGCTGC
GGCACTGGCCTACGGTTGGTTGGCTGGTGGCGGTGTACCGGTGCAGCGTGCCTGCCTGATGCTGGCCGTGGTGTTGCTCT
GGCGCCTGCGTTTTCGCCAATTGGGTGCGACCTTGCCGTTGTTGCTGGCGCTGGTTGCCGTACTGTTGCTGGAACCGCTG
GCCGCGCTGCTGCCCGGGTTCTGGCTGTCGTTCGCGGCCGTGGCCACGCTTGTCTATTGCTTCAGTGCCCGCCTGGGCAG
CTGGCGGCCCTGGCAGGCCTGGACGCGGGCGCAATGGGTGATTGCGATCGGCTTGCTGCCGGTACTGCTGGCTACGGGCT
TGCCGGTGAGCTTGAGTGCGCCGCTGGCCAACCTTGTTGCGGTGCCGTGGGTCAGCTTGGCAGTGCTGCCATTGGCGTTG
CTGGGCGCTTTGCTGCTGCCGTTGGGCGGGGTAGGGGAGGCGCTGCTCTGGCTGGCGGGCGGTTTGCTTGATGTGCTTTT
CCGGCTGCTGGCGCTGGCGGCGCAGCAGCCGGCGTGGCTGCCAGCAGCCCTGCCGTTGTGGGCCTGGCTGCTGGTGTGCC
TGGGCGCGCTGCTGGTTTTGATGCCCCGTGGCGTTCCACTGCGCGGGTTGGGTGGCGTAATGCTGCTGGCGCTGTGGGTA
CCCCGGGAGACGGTGCCGTTCGGCCAGGTCGAGGTCTGGCAGCTGGACGTCGGCCAGGGGCTGGCAGTGCTTTTGCGCAC
CCGGCATCACAACCTGCTGTACGACGCCGGGCCGGCCAGGGGAGAGAGCGACCTGGGCGAGCGGGTGGTGTTGCCGACCT
TGCGCAAGCTGGGGGTGGGCAGCCTGGACCTGATGCTGGTCAGCCACGCCCATGCCGACCACGCCGGTGGTGCTGGCGCA
ATCATGCGTGGCCTGCCGGTCGAACGGGTGATCGGCGGCGAATTGCTGGATGACCTGCAACTGCAACCTTGCGTAAGTGG
CGAGCAGTGGATGTGGGATGGCGTGCGCTTTTCGATGTGGCGCTGGGCCGATGGGCAGAGCAGCAATGACCGCTCCTGTG
TGTTGTTGGTCGAGGCGCAGGGTGAGCGGTTGTTGCTGGCGGGAGACATGGAGGCCGGTGCCGAACGGGCCTGGCTGGCG
GCCACGGAAGCACCGCGGATCGACTGGTTGCAGTCGCCGCATCACGGCAGCCGCAGTTCGTCCAGCGAGGCCTTCATCCG
CGCCACGGCGCCGCGCGGGGTACTGATTTCGCGCGGGCGCAACAACAGCTTCGGGCACCCGCACTCGCAAGTGGTCGAAC
GTTATCGGCGGCATGGGGTGGTGATGCATGACACGGCGGAGCAGGGGGCGTTGCGGTTGGTGCTGGGGAGGCTGGGGGAG
GTGGAGGGTGTGCGGGGGCAGAGGCGGTTTTGGCGGGTTCGGGTGCAGTAA
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA | Pseudomonas stutzeri DSM 10701 | 59.444 | 97.826 | 0.582 |
| comA | Ralstonia pseudosolanacearum GMI1000 | 34.783 | 100 | 0.37 |