Detailed information
Overview
| Name | comA | Type | Machinery gene |
| Locus tag | AVIN_RS06735 | Genome accession | NC_012560 |
| Coordinates | 1454808..1457018 (+) | Length | 736 a.a. |
| NCBI ID | WP_156483617.1 | Uniprot ID | - |
| Organism | Azotobacter vinelandii DJ | ||
| Function | ssDNA transport through the inner membrane (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1449808..1462018
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| AVIN_RS06710 (Avin_14680) | - | 1449962..1450684 (-) | 723 | WP_012700103.1 | glycerophosphodiester phosphodiesterase | - |
| AVIN_RS06715 (Avin_14690) | - | 1450932..1452182 (+) | 1251 | WP_012700104.1 | lipoprotein-releasing ABC transporter permease subunit | - |
| AVIN_RS06720 (Avin_14700) | lolD | 1452175..1452873 (+) | 699 | WP_012700105.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| AVIN_RS06725 (Avin_14710) | - | 1452876..1454120 (+) | 1245 | WP_012700106.1 | lipoprotein-releasing ABC transporter permease subunit | - |
| AVIN_RS06730 (Avin_14720) | - | 1454145..1454672 (-) | 528 | WP_012700107.1 | DUF2062 domain-containing protein | - |
| AVIN_RS06735 (Avin_14730) | comA | 1454808..1457018 (+) | 2211 | WP_156483617.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| AVIN_RS06740 (Avin_14740) | - | 1457090..1457722 (+) | 633 | WP_041807047.1 | MotA/TolQ/ExbB proton channel family protein | - |
| AVIN_RS06745 (Avin_14750) | - | 1457719..1458147 (+) | 429 | WP_012700110.1 | biopolymer transporter ExbD | - |
| AVIN_RS06750 (Avin_14760) | lpxK | 1458147..1459148 (+) | 1002 | WP_012700111.1 | tetraacyldisaccharide 4'-kinase | - |
| AVIN_RS06755 (Avin_14770) | - | 1459209..1459394 (+) | 186 | WP_012700112.1 | Trm112 family protein | - |
| AVIN_RS06760 (Avin_14780) | kdsB | 1459391..1460155 (+) | 765 | WP_012700113.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
| AVIN_RS06765 (Avin_14790) | - | 1460155..1460619 (+) | 465 | WP_012700114.1 | low molecular weight protein-tyrosine-phosphatase | - |
| AVIN_RS06770 (Avin_14800) | murB | 1460616..1461635 (+) | 1020 | WP_012700115.1 | UDP-N-acetylmuramate dehydrogenase | - |
Sequence
Protein
Download Length: 736 a.a. Molecular weight: 79919.77 Da Isoelectric Point: 10.2397
>NTDB_id=33769 AVIN_RS06735 WP_156483617.1 1454808..1457018(+) (comA) [Azotobacter vinelandii DJ]
MLALTAGLLVLRFLPALPPWWLWSPMALGAAILLAARRYSPALFLFGLGWACLSAHWALEERLPVELDGRTLWLEGLVVG
LPARIDGTLHFQLEEASSRRAELPGRLRLAWHAGPEVRAGERWRLAVSLKRPRGLVNPQGFDYEAWLLAQRIGATGTVKA
GERLGTPENADGWRDSLRQRLLQVDAHGREGALAALVMGDASGLSVADWKLLQDTGTVHLMVISGQHVGLLAGLVYGLVV
LLARFGLWPGFLPWLPCACGLAFATALGYGWLAGFGVPVQRACAMLAVVLFWRLRFRHLGLWLPILLALDGVLLLEPLAS
LQPGFWLSFGAVVILVLAFGGRLGAWSWRQTLWRAQWTSALGLLPLLLALGLPISLSGPLANLVAVPWVGFAVVPLALLG
TLLLPLPAMGEGLLWLAGALLETLFRLLGEIAGAVPAWLPHAVPVWGWLLALLGTLLILLPAGVPLRVPGLALLLPLAFP
PQERIPQARADVWLLDVGQGLAVLVRTRGHDLLYDAGPRFGDFDLGERVVLPSLRNLGVGRLDRLLLSHADGDHAGGALA
VRRALPVGEVVAGEAQAQSAALAAQPCARRAWQWDGVRFATWHWTAVQEGNRASCVLLVEAAGERLLLTGDIDAAAERAL
LDSHPEWRADWLLAPHHGSRSSSSPALLKALAPRAVLISRGWNNGFGHPHAQVVERYRKLPAVIHDTARQGALRFRLGDW
GRARGLREEPRFWREK
MLALTAGLLVLRFLPALPPWWLWSPMALGAAILLAARRYSPALFLFGLGWACLSAHWALEERLPVELDGRTLWLEGLVVG
LPARIDGTLHFQLEEASSRRAELPGRLRLAWHAGPEVRAGERWRLAVSLKRPRGLVNPQGFDYEAWLLAQRIGATGTVKA
GERLGTPENADGWRDSLRQRLLQVDAHGREGALAALVMGDASGLSVADWKLLQDTGTVHLMVISGQHVGLLAGLVYGLVV
LLARFGLWPGFLPWLPCACGLAFATALGYGWLAGFGVPVQRACAMLAVVLFWRLRFRHLGLWLPILLALDGVLLLEPLAS
LQPGFWLSFGAVVILVLAFGGRLGAWSWRQTLWRAQWTSALGLLPLLLALGLPISLSGPLANLVAVPWVGFAVVPLALLG
TLLLPLPAMGEGLLWLAGALLETLFRLLGEIAGAVPAWLPHAVPVWGWLLALLGTLLILLPAGVPLRVPGLALLLPLAFP
PQERIPQARADVWLLDVGQGLAVLVRTRGHDLLYDAGPRFGDFDLGERVVLPSLRNLGVGRLDRLLLSHADGDHAGGALA
VRRALPVGEVVAGEAQAQSAALAAQPCARRAWQWDGVRFATWHWTAVQEGNRASCVLLVEAAGERLLLTGDIDAAAERAL
LDSHPEWRADWLLAPHHGSRSSSSPALLKALAPRAVLISRGWNNGFGHPHAQVVERYRKLPAVIHDTARQGALRFRLGDW
GRARGLREEPRFWREK
Nucleotide
Download Length: 2211 bp
>NTDB_id=33769 AVIN_RS06735 WP_156483617.1 1454808..1457018(+) (comA) [Azotobacter vinelandii DJ]
ATGTTGGCGCTCACCGCAGGTCTGCTCGTCCTGCGTTTCCTTCCGGCCCTACCGCCCTGGTGGCTGTGGTCGCCGATGGC
GCTGGGCGCTGCGATCCTTCTGGCGGCGCGTCGCTATTCGCCGGCGCTCTTTCTCTTCGGTCTCGGCTGGGCCTGCCTGT
CGGCGCACTGGGCGCTGGAGGAACGGTTGCCTGTCGAACTCGATGGTCGCACCCTGTGGCTGGAGGGGCTGGTGGTCGGT
CTGCCGGCGCGCATCGACGGCACGCTGCATTTCCAACTGGAGGAGGCCTCTTCCCGGCGCGCCGAACTGCCCGGGCGACT
GCGCCTAGCGTGGCACGCCGGGCCGGAGGTCCGCGCCGGGGAGCGCTGGCGCCTGGCGGTCAGCCTCAAGCGTCCGCGCG
GCCTGGTCAACCCGCAGGGTTTCGATTACGAGGCCTGGCTGCTGGCCCAGCGGATCGGCGCCACCGGGACGGTGAAAGCG
GGAGAGCGACTCGGAACGCCGGAAAACGCCGACGGTTGGCGCGATTCCCTGCGCCAGCGCCTGCTGCAGGTCGATGCCCA
TGGCCGTGAGGGCGCGCTCGCCGCGCTGGTGATGGGCGACGCGTCCGGGCTGAGCGTGGCGGACTGGAAGCTCCTGCAGG
ATACCGGCACCGTGCATCTGATGGTGATCTCCGGCCAGCATGTCGGCCTGCTTGCCGGCCTGGTCTACGGGCTGGTGGTC
CTGCTGGCGAGATTTGGCCTGTGGCCGGGTTTTCTGCCCTGGTTGCCCTGTGCCTGCGGCCTGGCCTTCGCCACCGCGCT
CGGTTATGGCTGGCTGGCCGGCTTCGGGGTACCAGTACAGCGGGCCTGCGCCATGCTCGCCGTGGTGCTGTTCTGGCGCC
TGCGTTTCCGCCACCTGGGTCTCTGGCTACCCATCCTGCTGGCGCTGGACGGCGTACTGCTGCTCGAGCCCCTGGCCAGC
CTGCAGCCGGGGTTCTGGCTGTCGTTCGGTGCGGTGGTGATCCTCGTCCTGGCCTTCGGCGGCCGGCTGGGTGCCTGGTC
GTGGCGGCAGACCCTGTGGCGAGCGCAGTGGACCAGTGCGCTGGGACTGCTACCGTTGTTGCTGGCCTTGGGCCTGCCGA
TCAGTCTCAGCGGTCCGTTGGCCAATCTGGTCGCGGTACCCTGGGTCGGTTTCGCGGTGGTCCCGCTGGCTCTGCTCGGA
ACCCTGCTGCTGCCCTTGCCGGCAATGGGCGAGGGCCTTCTCTGGCTGGCCGGCGCCTTGCTGGAGACGCTGTTTCGGCT
GCTCGGCGAGATCGCCGGCGCCGTACCGGCCTGGCTGCCCCACGCGGTGCCGGTCTGGGGCTGGCTGCTGGCGCTGCTCG
GGACCCTGCTGATCCTGCTGCCGGCGGGAGTGCCGCTGCGTGTCCCGGGACTGGCGCTGCTGCTGCCCCTGGCATTTCCG
CCGCAGGAGCGAATCCCGCAGGCACGGGCCGATGTCTGGCTGCTGGATGTCGGGCAGGGCCTTGCCGTGCTTGTGCGTAC
CCGCGGGCACGACCTGCTCTATGATGCTGGGCCGCGTTTCGGCGATTTCGATCTGGGCGAGCGCGTGGTCCTGCCTTCGC
TGCGCAATCTCGGCGTGGGCCGCCTGGATCGCCTGCTGCTCAGCCATGCCGATGGCGACCACGCCGGTGGCGCCCTGGCC
GTGCGGCGCGCTCTGCCGGTGGGCGAGGTCGTCGCCGGCGAGGCGCAGGCGCAATCGGCGGCGCTCGCCGCGCAGCCTTG
CGCCCGTCGCGCCTGGCAGTGGGATGGTGTGCGTTTCGCCACCTGGCACTGGACGGCCGTGCAAGAGGGCAATCGGGCTT
CCTGCGTGCTGCTGGTCGAGGCCGCCGGCGAGCGCCTGCTGCTGACCGGCGATATCGATGCCGCAGCCGAGCGGGCACTG
CTCGACAGCCACCCGGAGTGGCGCGCCGACTGGCTGCTGGCGCCTCACCACGGCAGCCGCAGTTCGTCTTCGCCGGCTCT
GCTCAAGGCCCTGGCGCCGCGCGCGGTGCTGATCTCGCGCGGCTGGAACAACGGCTTCGGCCATCCCCATGCGCAGGTCG
TGGAGCGTTACCGGAAGCTGCCGGCCGTGATTCACGATACTGCGCGCCAGGGGGCCCTGCGGTTTCGCCTGGGCGACTGG
GGCCGGGCGCGCGGGCTGCGCGAAGAGCCCCGCTTCTGGCGGGAAAAATGA
ATGTTGGCGCTCACCGCAGGTCTGCTCGTCCTGCGTTTCCTTCCGGCCCTACCGCCCTGGTGGCTGTGGTCGCCGATGGC
GCTGGGCGCTGCGATCCTTCTGGCGGCGCGTCGCTATTCGCCGGCGCTCTTTCTCTTCGGTCTCGGCTGGGCCTGCCTGT
CGGCGCACTGGGCGCTGGAGGAACGGTTGCCTGTCGAACTCGATGGTCGCACCCTGTGGCTGGAGGGGCTGGTGGTCGGT
CTGCCGGCGCGCATCGACGGCACGCTGCATTTCCAACTGGAGGAGGCCTCTTCCCGGCGCGCCGAACTGCCCGGGCGACT
GCGCCTAGCGTGGCACGCCGGGCCGGAGGTCCGCGCCGGGGAGCGCTGGCGCCTGGCGGTCAGCCTCAAGCGTCCGCGCG
GCCTGGTCAACCCGCAGGGTTTCGATTACGAGGCCTGGCTGCTGGCCCAGCGGATCGGCGCCACCGGGACGGTGAAAGCG
GGAGAGCGACTCGGAACGCCGGAAAACGCCGACGGTTGGCGCGATTCCCTGCGCCAGCGCCTGCTGCAGGTCGATGCCCA
TGGCCGTGAGGGCGCGCTCGCCGCGCTGGTGATGGGCGACGCGTCCGGGCTGAGCGTGGCGGACTGGAAGCTCCTGCAGG
ATACCGGCACCGTGCATCTGATGGTGATCTCCGGCCAGCATGTCGGCCTGCTTGCCGGCCTGGTCTACGGGCTGGTGGTC
CTGCTGGCGAGATTTGGCCTGTGGCCGGGTTTTCTGCCCTGGTTGCCCTGTGCCTGCGGCCTGGCCTTCGCCACCGCGCT
CGGTTATGGCTGGCTGGCCGGCTTCGGGGTACCAGTACAGCGGGCCTGCGCCATGCTCGCCGTGGTGCTGTTCTGGCGCC
TGCGTTTCCGCCACCTGGGTCTCTGGCTACCCATCCTGCTGGCGCTGGACGGCGTACTGCTGCTCGAGCCCCTGGCCAGC
CTGCAGCCGGGGTTCTGGCTGTCGTTCGGTGCGGTGGTGATCCTCGTCCTGGCCTTCGGCGGCCGGCTGGGTGCCTGGTC
GTGGCGGCAGACCCTGTGGCGAGCGCAGTGGACCAGTGCGCTGGGACTGCTACCGTTGTTGCTGGCCTTGGGCCTGCCGA
TCAGTCTCAGCGGTCCGTTGGCCAATCTGGTCGCGGTACCCTGGGTCGGTTTCGCGGTGGTCCCGCTGGCTCTGCTCGGA
ACCCTGCTGCTGCCCTTGCCGGCAATGGGCGAGGGCCTTCTCTGGCTGGCCGGCGCCTTGCTGGAGACGCTGTTTCGGCT
GCTCGGCGAGATCGCCGGCGCCGTACCGGCCTGGCTGCCCCACGCGGTGCCGGTCTGGGGCTGGCTGCTGGCGCTGCTCG
GGACCCTGCTGATCCTGCTGCCGGCGGGAGTGCCGCTGCGTGTCCCGGGACTGGCGCTGCTGCTGCCCCTGGCATTTCCG
CCGCAGGAGCGAATCCCGCAGGCACGGGCCGATGTCTGGCTGCTGGATGTCGGGCAGGGCCTTGCCGTGCTTGTGCGTAC
CCGCGGGCACGACCTGCTCTATGATGCTGGGCCGCGTTTCGGCGATTTCGATCTGGGCGAGCGCGTGGTCCTGCCTTCGC
TGCGCAATCTCGGCGTGGGCCGCCTGGATCGCCTGCTGCTCAGCCATGCCGATGGCGACCACGCCGGTGGCGCCCTGGCC
GTGCGGCGCGCTCTGCCGGTGGGCGAGGTCGTCGCCGGCGAGGCGCAGGCGCAATCGGCGGCGCTCGCCGCGCAGCCTTG
CGCCCGTCGCGCCTGGCAGTGGGATGGTGTGCGTTTCGCCACCTGGCACTGGACGGCCGTGCAAGAGGGCAATCGGGCTT
CCTGCGTGCTGCTGGTCGAGGCCGCCGGCGAGCGCCTGCTGCTGACCGGCGATATCGATGCCGCAGCCGAGCGGGCACTG
CTCGACAGCCACCCGGAGTGGCGCGCCGACTGGCTGCTGGCGCCTCACCACGGCAGCCGCAGTTCGTCTTCGCCGGCTCT
GCTCAAGGCCCTGGCGCCGCGCGCGGTGCTGATCTCGCGCGGCTGGAACAACGGCTTCGGCCATCCCCATGCGCAGGTCG
TGGAGCGTTACCGGAAGCTGCCGGCCGTGATTCACGATACTGCGCGCCAGGGGGCCCTGCGGTTTCGCCTGGGCGACTGG
GGCCGGGCGCGCGGGCTGCGCGAAGAGCCCCGCTTCTGGCGGGAAAAATGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA | Pseudomonas stutzeri DSM 10701 |
65.912 |
96.06 |
0.633 |
| comA | Ralstonia pseudosolanacearum GMI1000 |
34.814 |
100 |
0.394 |