Detailed information
Overview
| Name | comA | Type | Machinery gene |
| Locus tag | KLP38_RS05005 | Genome accession | NZ_CP076060 |
| Coordinates | 1069772..1071475 (+) | Length | 567 a.a. |
| NCBI ID | WP_370649100.1 | Uniprot ID | - |
| Organism | Cupriavidus sp. EM10 | ||
| Function | DNA uptake (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Prophage | 1061471..1072065 | 1069772..1071475 | within | 0 |
Gene organization within MGE regions
Location: 1061471..1072065
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| KLP38_RS04970 (KLP38_04970) | prfB | 1061471..1062574 (-) | 1104 | WP_215529668.1 | peptide chain release factor 2 | - |
| KLP38_RS04975 (KLP38_04975) | - | 1062816..1063550 (+) | 735 | WP_370649113.1 | peptidoglycan DD-metalloendopeptidase family protein | - |
| KLP38_RS04980 (KLP38_04980) | recJ | 1063576..1065274 (-) | 1699 | Protein_997 | single-stranded-DNA-specific exonuclease RecJ | - |
| KLP38_RS04985 (KLP38_04985) | - | 1065305..1066426 (-) | 1122 | WP_215529670.1 | hypothetical protein | - |
| KLP38_RS04990 (KLP38_04990) | - | 1066528..1067777 (+) | 1250 | Protein_999 | lipoprotein-releasing ABC transporter permease subunit | - |
| KLP38_RS04995 (KLP38_04995) | lolD | 1067770..1068510 (+) | 741 | WP_215529671.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| KLP38_RS05000 (KLP38_05000) | - | 1068527..1069357 (+) | 831 | WP_215529672.1 | TatD family hydrolase | - |
| KLP38_RS05005 (KLP38_05005) | comA | 1069772..1071475 (+) | 1704 | WP_370649100.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| KLP38_RS32890 | - | 1071463..1072065 (+) | 603 | WP_370649101.1 | ComEC/Rec2 family competence protein | - |
Sequence
Protein
Download Length: 567 a.a. Molecular weight: 60611.88 Da Isoelectric Point: 11.2229
>NTDB_id=571743 KLP38_RS05005 WP_370649100.1 1069772..1071475(+) (comA) [Cupriavidus sp. EM10]
MAGRNLVVEGMVAGLPDTAPHGTRFLFRVERWEDGGTGQSLPDRLLLTWRDEVSGTGEAAESQQAAALQPGQRYRLTVRL
RRPRGLANPHGFDYAYWLLAEGVHATGYVRSGTALASGGPLPWPSWRSWPVRIAMWRAGLRDHLRAALPADARYGPVLIA
LVIGDQRGIAQGDWDVFRRTGISHLVSISGLHITMIAGAAGALARLGWRHSFGLGRRLRRPFPLIWPAQQAALVVTVLAA
LGYGCIAGMQIPALRTVAMLVVAALALWSGRAPPLSVVLAWAAGIAVAIDPWAVMSPGFWLSFGAVAVIFLHARDDPEGQ
RKKQGGQSGQGERGRMARLRAVVVDAARTQWAVTIGLVPLTLLLFGQVSVVSCLANAVAIPVVSLFVTPLALAGAALPVQ
LATPVLAVAHGAMGWLVDGLAWLAAPAWAVWEAAHPGWWRTALAVVGVWLLLVPRRPILPPDIWPRIKYPIPRWLGAGLM
VPMLMTARTAVAEGEMRVTALDVGQGAAVLVETRRHALLYDAGPAYVSGTSAGGQVVVPFLRASGIRRLDMLMVSHEDAD
HAGACAM
MAGRNLVVEGMVAGLPDTAPHGTRFLFRVERWEDGGTGQSLPDRLLLTWRDEVSGTGEAAESQQAAALQPGQRYRLTVRL
RRPRGLANPHGFDYAYWLLAEGVHATGYVRSGTALASGGPLPWPSWRSWPVRIAMWRAGLRDHLRAALPADARYGPVLIA
LVIGDQRGIAQGDWDVFRRTGISHLVSISGLHITMIAGAAGALARLGWRHSFGLGRRLRRPFPLIWPAQQAALVVTVLAA
LGYGCIAGMQIPALRTVAMLVVAALALWSGRAPPLSVVLAWAAGIAVAIDPWAVMSPGFWLSFGAVAVIFLHARDDPEGQ
RKKQGGQSGQGERGRMARLRAVVVDAARTQWAVTIGLVPLTLLLFGQVSVVSCLANAVAIPVVSLFVTPLALAGAALPVQ
LATPVLAVAHGAMGWLVDGLAWLAAPAWAVWEAAHPGWWRTALAVVGVWLLLVPRRPILPPDIWPRIKYPIPRWLGAGLM
VPMLMTARTAVAEGEMRVTALDVGQGAAVLVETRRHALLYDAGPAYVSGTSAGGQVVVPFLRASGIRRLDMLMVSHEDAD
HAGACAM
Nucleotide
Download Length: 1704 bp
>NTDB_id=571743 KLP38_RS05005 WP_370649100.1 1069772..1071475(+) (comA) [Cupriavidus sp. EM10]
ATGGCCGGACGCAATCTGGTGGTGGAGGGCATGGTGGCGGGGCTGCCAGACACCGCGCCGCATGGCACCCGGTTCCTGTT
TCGTGTCGAGCGCTGGGAGGACGGTGGGACAGGGCAATCGTTGCCGGACCGGCTGCTGCTGACATGGCGCGACGAAGTAT
CAGGTACCGGCGAGGCCGCTGAATCCCAACAAGCCGCCGCGCTGCAGCCGGGCCAGCGCTATCGCCTGACCGTCCGGCTG
CGCCGTCCGCGCGGGCTGGCGAATCCGCATGGCTTCGACTATGCCTACTGGCTGCTGGCGGAAGGGGTGCATGCAACGGG
CTATGTGCGCAGTGGCACCGCGCTGGCCTCCGGCGGTCCACTTCCGTGGCCCTCGTGGCGCTCGTGGCCCGTACGCATCG
CGATGTGGCGCGCCGGCCTGCGCGACCATCTGCGCGCCGCCTTGCCGGCCGACGCCCGCTATGGCCCCGTGCTGATCGCG
CTGGTGATCGGCGACCAGCGCGGCATCGCGCAGGGGGACTGGGACGTTTTCCGGCGTACCGGCATCTCGCATCTGGTCAG
CATCTCGGGGCTGCATATCACGATGATCGCGGGCGCGGCCGGGGCGTTGGCGAGGCTGGGATGGCGGCACTCGTTCGGGC
TCGGGCGGCGGCTGCGCCGGCCGTTCCCGCTGATATGGCCGGCCCAGCAGGCCGCGCTGGTGGTCACCGTGCTGGCCGCG
CTGGGCTATGGCTGCATCGCCGGCATGCAGATTCCGGCGTTGCGCACGGTTGCAATGCTGGTCGTGGCCGCGCTGGCGCT
GTGGAGCGGCCGGGCGCCGCCGCTTTCCGTGGTGCTGGCGTGGGCGGCCGGCATTGCCGTGGCCATCGATCCCTGGGCGG
TCATGTCACCGGGGTTCTGGTTGTCCTTCGGCGCGGTGGCGGTGATCTTCCTTCACGCGCGCGACGATCCGGAGGGGCAG
CGAAAGAAGCAGGGCGGGCAGAGCGGGCAGGGTGAACGGGGCCGGATGGCGCGGCTGCGCGCCGTCGTCGTCGACGCCGC
ACGCACGCAGTGGGCGGTGACCATCGGGCTTGTGCCGCTGACGCTGCTGCTGTTTGGCCAGGTGTCGGTGGTGTCGTGTT
TGGCCAATGCGGTGGCGATCCCGGTGGTGAGCCTGTTCGTGACGCCGCTGGCACTGGCCGGGGCTGCATTGCCGGTGCAG
CTTGCCACGCCGGTGCTGGCCGTGGCGCACGGCGCGATGGGCTGGCTGGTGGACGGGCTGGCGTGGTTGGCCGCGCCGGC
GTGGGCCGTGTGGGAGGCCGCCCATCCGGGATGGTGGCGGACGGCCCTGGCCGTCGTCGGGGTGTGGCTTCTGCTTGTGC
CGCGACGGCCGATATTGCCGCCCGACATCTGGCCGCGCATCAAGTATCCGATTCCACGCTGGCTAGGCGCCGGCCTGATG
GTGCCAATGTTGATGACCGCCCGCACTGCAGTGGCTGAAGGCGAAATGCGCGTGACGGCGCTCGATGTCGGGCAGGGGGC
GGCGGTGCTGGTGGAAACGCGGCGGCATGCGCTGCTCTACGACGCCGGGCCGGCCTACGTATCCGGGACGAGTGCCGGCG
GGCAGGTCGTCGTGCCATTCCTGCGGGCCTCGGGCATCCGGAGGCTCGATATGCTGATGGTCAGCCACGAGGACGCCGAC
CATGCCGGGGCGTGCGCGATGTGA
ATGGCCGGACGCAATCTGGTGGTGGAGGGCATGGTGGCGGGGCTGCCAGACACCGCGCCGCATGGCACCCGGTTCCTGTT
TCGTGTCGAGCGCTGGGAGGACGGTGGGACAGGGCAATCGTTGCCGGACCGGCTGCTGCTGACATGGCGCGACGAAGTAT
CAGGTACCGGCGAGGCCGCTGAATCCCAACAAGCCGCCGCGCTGCAGCCGGGCCAGCGCTATCGCCTGACCGTCCGGCTG
CGCCGTCCGCGCGGGCTGGCGAATCCGCATGGCTTCGACTATGCCTACTGGCTGCTGGCGGAAGGGGTGCATGCAACGGG
CTATGTGCGCAGTGGCACCGCGCTGGCCTCCGGCGGTCCACTTCCGTGGCCCTCGTGGCGCTCGTGGCCCGTACGCATCG
CGATGTGGCGCGCCGGCCTGCGCGACCATCTGCGCGCCGCCTTGCCGGCCGACGCCCGCTATGGCCCCGTGCTGATCGCG
CTGGTGATCGGCGACCAGCGCGGCATCGCGCAGGGGGACTGGGACGTTTTCCGGCGTACCGGCATCTCGCATCTGGTCAG
CATCTCGGGGCTGCATATCACGATGATCGCGGGCGCGGCCGGGGCGTTGGCGAGGCTGGGATGGCGGCACTCGTTCGGGC
TCGGGCGGCGGCTGCGCCGGCCGTTCCCGCTGATATGGCCGGCCCAGCAGGCCGCGCTGGTGGTCACCGTGCTGGCCGCG
CTGGGCTATGGCTGCATCGCCGGCATGCAGATTCCGGCGTTGCGCACGGTTGCAATGCTGGTCGTGGCCGCGCTGGCGCT
GTGGAGCGGCCGGGCGCCGCCGCTTTCCGTGGTGCTGGCGTGGGCGGCCGGCATTGCCGTGGCCATCGATCCCTGGGCGG
TCATGTCACCGGGGTTCTGGTTGTCCTTCGGCGCGGTGGCGGTGATCTTCCTTCACGCGCGCGACGATCCGGAGGGGCAG
CGAAAGAAGCAGGGCGGGCAGAGCGGGCAGGGTGAACGGGGCCGGATGGCGCGGCTGCGCGCCGTCGTCGTCGACGCCGC
ACGCACGCAGTGGGCGGTGACCATCGGGCTTGTGCCGCTGACGCTGCTGCTGTTTGGCCAGGTGTCGGTGGTGTCGTGTT
TGGCCAATGCGGTGGCGATCCCGGTGGTGAGCCTGTTCGTGACGCCGCTGGCACTGGCCGGGGCTGCATTGCCGGTGCAG
CTTGCCACGCCGGTGCTGGCCGTGGCGCACGGCGCGATGGGCTGGCTGGTGGACGGGCTGGCGTGGTTGGCCGCGCCGGC
GTGGGCCGTGTGGGAGGCCGCCCATCCGGGATGGTGGCGGACGGCCCTGGCCGTCGTCGGGGTGTGGCTTCTGCTTGTGC
CGCGACGGCCGATATTGCCGCCCGACATCTGGCCGCGCATCAAGTATCCGATTCCACGCTGGCTAGGCGCCGGCCTGATG
GTGCCAATGTTGATGACCGCCCGCACTGCAGTGGCTGAAGGCGAAATGCGCGTGACGGCGCTCGATGTCGGGCAGGGGGC
GGCGGTGCTGGTGGAAACGCGGCGGCATGCGCTGCTCTACGACGCCGGGCCGGCCTACGTATCCGGGACGAGTGCCGGCG
GGCAGGTCGTCGTGCCATTCCTGCGGGCCTCGGGCATCCGGAGGCTCGATATGCTGATGGTCAGCCACGAGGACGCCGAC
CATGCCGGGGCGTGCGCGATGTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA | Ralstonia pseudosolanacearum GMI1000 |
48.7 |
100 |
0.496 |