Detailed information
Overview
| Name | comA | Type | Machinery gene |
| Locus tag | HH212_RS23215 | Genome accession | NZ_CP051685 |
| Coordinates | 5536437..5538923 (-) | Length | 828 a.a. |
| NCBI ID | WP_170204653.1 | Uniprot ID | A0A7Z2W196 |
| Organism | Massilia forsythiae strain GN2-R2 | ||
| Function | DNA uptake (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 5531437..5543923
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| HH212_RS23200 (HH212_23200) | - | 5531830..5535090 (-) | 3261 | WP_170204651.1 | DEAD/DEAH box helicase | - |
| HH212_RS23205 (HH212_23205) | - | 5535282..5535641 (+) | 360 | WP_211172400.1 | hypothetical protein | - |
| HH212_RS23210 (HH212_23210) | - | 5535719..5536390 (-) | 672 | WP_170204652.1 | hypothetical protein | - |
| HH212_RS23215 (HH212_23215) | comA | 5536437..5538923 (-) | 2487 | WP_170204653.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| HH212_RS27860 (HH212_23220) | - | 5539012..5540961 (-) | 1950 | WP_370663889.1 | T6SS immunity protein Tli4 family protein | - |
| HH212_RS23225 (HH212_23225) | - | 5540981..5542651 (-) | 1671 | WP_170204655.1 | triacylglycerol lipase | - |
Sequence
Protein
Download Length: 828 a.a. Molecular weight: 87445.82 Da Isoelectric Point: 9.2228
>NTDB_id=440470 HH212_RS23215 WP_170204653.1 5536437..5538923(-) (comA) [Massilia forsythiae strain GN2-R2]
MRCLILGFAAGVFWLQQASSLPAALSLAGCAAAALALSILAAFVRRAAPSRLRGAAAVVLMLAAAGGLSGYAWAALLAQR
ALAPMLAATDEGRDLAIVGVVDNLPASFEQGVRFNFLVERTLTAGAAAPPRVALSWYANPRGASGRAAAASSVSAQDALP
EIEPGQRWQLTVRLQRPHGNANPGGFDYEAWLLEQGVRATGYVRTGRAAAGVPAAVLLDEFAPSLPGVVERCRAWLRERI
LRALAGRQYAGVIVALVIGDQRGIDQADWQVFNRTGIGHLVSISGLHITMIAGLAALGASALWRRSFFTDAQLPLLLPAQ
KVAALAGAVTALLYVLLAGFGVPAQRTLYMLSVVALALWSGRLTAVSHVLCAALGVVLLLDPWAVLWPGFWLSFGAVAMI
LFAGHGRINPPLRGLCGTLLGAGHTQWAVTLGLVPLTMLLFGQVSLVSPLANAVAIPLVSFVVTPLALAGSLLPDPLCGW
LLALAHAAVAALAWLLGWMAGLPLAVWRAPAPQAWVFLLALGGTLWLLMPRGWPLRWSGAIAWLPLLLHLPDHPPAGSVR
VTAFDVGQGMALLVETAGHRLLYDTGPAYAPGADAGSRVILPYLRMRGIGALDGIVVSHGDLDHTGGALALLGELEVGWL
ASSLGEEHAIARAAPRHLHCMAGQRWEWDGIRFEMLHPAPSSYGDAGLKANARSCVLRIVNATHALLLAGDIEAAQEAGL
VADRAQALRADVLLAPHHGSGTSSTPAFLQAVRPSIGIFQVGYRNRYRHPKAEVYERYRMLGIERLRTDTSGAVTFDVDA
AVTVEAYRSAHARYWYAGGMTADHLQSY
MRCLILGFAAGVFWLQQASSLPAALSLAGCAAAALALSILAAFVRRAAPSRLRGAAAVVLMLAAAGGLSGYAWAALLAQR
ALAPMLAATDEGRDLAIVGVVDNLPASFEQGVRFNFLVERTLTAGAAAPPRVALSWYANPRGASGRAAAASSVSAQDALP
EIEPGQRWQLTVRLQRPHGNANPGGFDYEAWLLEQGVRATGYVRTGRAAAGVPAAVLLDEFAPSLPGVVERCRAWLRERI
LRALAGRQYAGVIVALVIGDQRGIDQADWQVFNRTGIGHLVSISGLHITMIAGLAALGASALWRRSFFTDAQLPLLLPAQ
KVAALAGAVTALLYVLLAGFGVPAQRTLYMLSVVALALWSGRLTAVSHVLCAALGVVLLLDPWAVLWPGFWLSFGAVAMI
LFAGHGRINPPLRGLCGTLLGAGHTQWAVTLGLVPLTMLLFGQVSLVSPLANAVAIPLVSFVVTPLALAGSLLPDPLCGW
LLALAHAAVAALAWLLGWMAGLPLAVWRAPAPQAWVFLLALGGTLWLLMPRGWPLRWSGAIAWLPLLLHLPDHPPAGSVR
VTAFDVGQGMALLVETAGHRLLYDTGPAYAPGADAGSRVILPYLRMRGIGALDGIVVSHGDLDHTGGALALLGELEVGWL
ASSLGEEHAIARAAPRHLHCMAGQRWEWDGIRFEMLHPAPSSYGDAGLKANARSCVLRIVNATHALLLAGDIEAAQEAGL
VADRAQALRADVLLAPHHGSGTSSTPAFLQAVRPSIGIFQVGYRNRYRHPKAEVYERYRMLGIERLRTDTSGAVTFDVDA
AVTVEAYRSAHARYWYAGGMTADHLQSY
Nucleotide
Download Length: 2487 bp
>NTDB_id=440470 HH212_RS23215 WP_170204653.1 5536437..5538923(-) (comA) [Massilia forsythiae strain GN2-R2]
ATGCGCTGCCTGATCCTGGGGTTTGCCGCCGGCGTGTTCTGGCTGCAGCAGGCGTCGTCCCTGCCGGCGGCACTGTCGCT
GGCCGGCTGCGCCGCAGCCGCACTGGCGCTGTCGATATTGGCCGCGTTTGTCCGCCGCGCAGCCCCGTCTCGTTTGCGTG
GCGCCGCCGCTGTGGTCCTGATGCTCGCGGCCGCCGGCGGCTTGTCCGGTTACGCATGGGCCGCGCTGCTGGCGCAGCGC
GCATTGGCGCCAATGCTGGCGGCCACCGACGAAGGGCGCGACCTGGCCATCGTCGGCGTGGTCGACAACCTGCCGGCCAG
CTTCGAGCAGGGCGTGCGTTTCAACTTCCTGGTCGAGCGCACGCTCACGGCGGGCGCGGCCGCGCCGCCGCGCGTGGCGC
TGTCCTGGTATGCCAACCCGCGCGGCGCGTCCGGCCGCGCCGCTGCGGCGTCCAGCGTGTCCGCGCAAGACGCGCTGCCC
GAGATCGAGCCCGGCCAGCGCTGGCAGTTGACGGTGCGCCTGCAGCGCCCGCACGGCAACGCCAATCCCGGCGGCTTCGA
CTACGAAGCCTGGCTGCTGGAACAGGGCGTGCGCGCCACCGGCTACGTGCGCACCGGGCGCGCTGCGGCAGGAGTGCCGG
CGGCGGTCCTGCTGGACGAGTTCGCGCCCAGCCTGCCGGGCGTGGTCGAGCGCTGCCGCGCCTGGCTGCGCGAACGCATC
CTGCGCGCGCTGGCGGGGCGCCAGTACGCCGGCGTCATCGTCGCGCTGGTGATCGGCGACCAGCGCGGCATCGACCAGGC
CGACTGGCAGGTGTTCAACCGCACCGGCATCGGCCACCTGGTTTCGATCTCGGGATTGCACATCACGATGATCGCCGGGC
TTGCGGCACTCGGCGCCTCGGCGCTGTGGCGGCGCTCGTTCTTCACCGATGCCCAGTTGCCGCTGCTGCTGCCGGCGCAG
AAGGTGGCGGCGCTGGCCGGCGCCGTCACCGCGCTGCTGTACGTGCTGCTGGCCGGCTTCGGGGTGCCGGCCCAGCGCAC
CTTGTACATGCTGTCGGTGGTGGCGCTGGCGCTGTGGAGCGGACGGCTGACGGCGGTCTCGCACGTGCTGTGCGCGGCGC
TCGGCGTAGTGCTGCTGCTGGACCCCTGGGCGGTGTTGTGGCCGGGCTTCTGGCTGTCGTTCGGCGCGGTGGCGATGATC
CTGTTCGCCGGCCATGGCCGCATCAATCCGCCGCTGCGCGGCCTGTGCGGCACGCTGCTGGGGGCGGGGCACACGCAGTG
GGCGGTGACGCTCGGCCTGGTGCCGCTGACGATGCTGCTGTTCGGCCAGGTATCGCTGGTCAGTCCGCTGGCGAATGCGG
TGGCGATCCCGCTGGTGAGCTTCGTGGTCACGCCGCTGGCACTGGCCGGCAGCCTGCTGCCCGATCCGTTGTGCGGCTGG
CTGCTGGCGCTGGCGCACGCGGCGGTCGCGGCGCTGGCCTGGCTGCTGGGCTGGATGGCGGGACTGCCGCTGGCGGTATG
GCGCGCACCGGCGCCGCAGGCCTGGGTATTCCTGCTGGCGCTGGGCGGCACGCTGTGGCTGTTGATGCCGCGCGGCTGGC
CGCTGCGCTGGAGCGGCGCGATCGCCTGGCTGCCCTTGCTGCTGCACCTGCCCGATCACCCGCCGGCAGGCAGCGTGCGC
GTCACCGCCTTCGACGTCGGCCAGGGCATGGCGCTGCTGGTCGAGACCGCGGGCCACCGCCTGCTGTACGACACCGGCCC
GGCCTATGCGCCCGGCGCGGATGCCGGCAGCCGCGTGATCCTGCCGTACCTGCGCATGCGCGGCATCGGGGCGCTGGACG
GCATCGTGGTCAGCCATGGCGACCTCGATCACACCGGCGGCGCCCTGGCGCTGCTGGGGGAACTCGAGGTCGGCTGGCTG
GCGTCGTCGCTCGGTGAAGAGCACGCGATCGCGCGGGCGGCGCCGCGCCACCTGCATTGCATGGCCGGCCAGCGCTGGGA
GTGGGACGGCATCCGCTTCGAGATGCTGCATCCGGCGCCGTCGAGTTATGGCGACGCCGGCCTGAAGGCGAATGCGCGCA
GCTGCGTGCTGCGCATCGTCAATGCCACGCATGCGTTGCTGCTGGCGGGCGACATCGAGGCGGCGCAGGAAGCCGGCCTG
GTGGCGGATCGAGCGCAGGCGCTGCGCGCCGACGTGCTGCTGGCGCCGCACCACGGCAGCGGCACCTCGTCCACGCCGGC
CTTTTTGCAGGCGGTGCGGCCGTCGATCGGCATCTTCCAGGTGGGGTACCGCAACCGTTACCGCCATCCCAAGGCCGAGG
TCTACGAGCGTTACCGGATGCTGGGCATCGAGCGCCTGCGCACCGACACCAGCGGCGCCGTGACGTTCGACGTGGATGCA
GCCGTGACGGTGGAAGCCTACCGCAGCGCGCACGCCCGTTACTGGTATGCGGGGGGCATGACGGCAGACCATTTACAAAG
TTATTGA
ATGCGCTGCCTGATCCTGGGGTTTGCCGCCGGCGTGTTCTGGCTGCAGCAGGCGTCGTCCCTGCCGGCGGCACTGTCGCT
GGCCGGCTGCGCCGCAGCCGCACTGGCGCTGTCGATATTGGCCGCGTTTGTCCGCCGCGCAGCCCCGTCTCGTTTGCGTG
GCGCCGCCGCTGTGGTCCTGATGCTCGCGGCCGCCGGCGGCTTGTCCGGTTACGCATGGGCCGCGCTGCTGGCGCAGCGC
GCATTGGCGCCAATGCTGGCGGCCACCGACGAAGGGCGCGACCTGGCCATCGTCGGCGTGGTCGACAACCTGCCGGCCAG
CTTCGAGCAGGGCGTGCGTTTCAACTTCCTGGTCGAGCGCACGCTCACGGCGGGCGCGGCCGCGCCGCCGCGCGTGGCGC
TGTCCTGGTATGCCAACCCGCGCGGCGCGTCCGGCCGCGCCGCTGCGGCGTCCAGCGTGTCCGCGCAAGACGCGCTGCCC
GAGATCGAGCCCGGCCAGCGCTGGCAGTTGACGGTGCGCCTGCAGCGCCCGCACGGCAACGCCAATCCCGGCGGCTTCGA
CTACGAAGCCTGGCTGCTGGAACAGGGCGTGCGCGCCACCGGCTACGTGCGCACCGGGCGCGCTGCGGCAGGAGTGCCGG
CGGCGGTCCTGCTGGACGAGTTCGCGCCCAGCCTGCCGGGCGTGGTCGAGCGCTGCCGCGCCTGGCTGCGCGAACGCATC
CTGCGCGCGCTGGCGGGGCGCCAGTACGCCGGCGTCATCGTCGCGCTGGTGATCGGCGACCAGCGCGGCATCGACCAGGC
CGACTGGCAGGTGTTCAACCGCACCGGCATCGGCCACCTGGTTTCGATCTCGGGATTGCACATCACGATGATCGCCGGGC
TTGCGGCACTCGGCGCCTCGGCGCTGTGGCGGCGCTCGTTCTTCACCGATGCCCAGTTGCCGCTGCTGCTGCCGGCGCAG
AAGGTGGCGGCGCTGGCCGGCGCCGTCACCGCGCTGCTGTACGTGCTGCTGGCCGGCTTCGGGGTGCCGGCCCAGCGCAC
CTTGTACATGCTGTCGGTGGTGGCGCTGGCGCTGTGGAGCGGACGGCTGACGGCGGTCTCGCACGTGCTGTGCGCGGCGC
TCGGCGTAGTGCTGCTGCTGGACCCCTGGGCGGTGTTGTGGCCGGGCTTCTGGCTGTCGTTCGGCGCGGTGGCGATGATC
CTGTTCGCCGGCCATGGCCGCATCAATCCGCCGCTGCGCGGCCTGTGCGGCACGCTGCTGGGGGCGGGGCACACGCAGTG
GGCGGTGACGCTCGGCCTGGTGCCGCTGACGATGCTGCTGTTCGGCCAGGTATCGCTGGTCAGTCCGCTGGCGAATGCGG
TGGCGATCCCGCTGGTGAGCTTCGTGGTCACGCCGCTGGCACTGGCCGGCAGCCTGCTGCCCGATCCGTTGTGCGGCTGG
CTGCTGGCGCTGGCGCACGCGGCGGTCGCGGCGCTGGCCTGGCTGCTGGGCTGGATGGCGGGACTGCCGCTGGCGGTATG
GCGCGCACCGGCGCCGCAGGCCTGGGTATTCCTGCTGGCGCTGGGCGGCACGCTGTGGCTGTTGATGCCGCGCGGCTGGC
CGCTGCGCTGGAGCGGCGCGATCGCCTGGCTGCCCTTGCTGCTGCACCTGCCCGATCACCCGCCGGCAGGCAGCGTGCGC
GTCACCGCCTTCGACGTCGGCCAGGGCATGGCGCTGCTGGTCGAGACCGCGGGCCACCGCCTGCTGTACGACACCGGCCC
GGCCTATGCGCCCGGCGCGGATGCCGGCAGCCGCGTGATCCTGCCGTACCTGCGCATGCGCGGCATCGGGGCGCTGGACG
GCATCGTGGTCAGCCATGGCGACCTCGATCACACCGGCGGCGCCCTGGCGCTGCTGGGGGAACTCGAGGTCGGCTGGCTG
GCGTCGTCGCTCGGTGAAGAGCACGCGATCGCGCGGGCGGCGCCGCGCCACCTGCATTGCATGGCCGGCCAGCGCTGGGA
GTGGGACGGCATCCGCTTCGAGATGCTGCATCCGGCGCCGTCGAGTTATGGCGACGCCGGCCTGAAGGCGAATGCGCGCA
GCTGCGTGCTGCGCATCGTCAATGCCACGCATGCGTTGCTGCTGGCGGGCGACATCGAGGCGGCGCAGGAAGCCGGCCTG
GTGGCGGATCGAGCGCAGGCGCTGCGCGCCGACGTGCTGCTGGCGCCGCACCACGGCAGCGGCACCTCGTCCACGCCGGC
CTTTTTGCAGGCGGTGCGGCCGTCGATCGGCATCTTCCAGGTGGGGTACCGCAACCGTTACCGCCATCCCAAGGCCGAGG
TCTACGAGCGTTACCGGATGCTGGGCATCGAGCGCCTGCGCACCGACACCAGCGGCGCCGTGACGTTCGACGTGGATGCA
GCCGTGACGGTGGAAGCCTACCGCAGCGCGCACGCCCGTTACTGGTATGCGGGGGGCATGACGGCAGACCATTTACAAAG
TTATTGA
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA | Ralstonia pseudosolanacearum GMI1000 |
49.11 |
100 |
0.5 |