Detailed information
Overview
| Name | comA | Type | Machinery gene |
| Locus tag | IV454_RS22615 | Genome accession | NZ_CP065053 |
| Coordinates | 5083649..5086006 (+) | Length | 785 a.a. |
| NCBI ID | WP_206087944.1 | Uniprot ID | - |
| Organism | Massilia antarctica strain P8398 | ||
| Function | DNA uptake (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 5078649..5091006
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| IV454_RS22590 (IV454_22570) | - | 5078870..5080135 (+) | 1266 | WP_206087939.1 | lipoprotein-releasing ABC transporter permease subunit | - |
| IV454_RS22595 (IV454_22575) | lolD | 5080128..5080835 (+) | 708 | WP_206087940.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | - |
| IV454_RS22600 (IV454_22580) | - | 5080838..5081632 (+) | 795 | WP_206087941.1 | TatD family hydrolase | - |
| IV454_RS22605 (IV454_22585) | - | 5081650..5082405 (-) | 756 | WP_206087942.1 | SDR family NAD(P)-dependent oxidoreductase | - |
| IV454_RS22610 (IV454_22590) | - | 5082539..5083426 (+) | 888 | WP_206087943.1 | AraC family transcriptional regulator | - |
| IV454_RS22615 (IV454_22595) | comA | 5083649..5086006 (+) | 2358 | WP_206087944.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| IV454_RS22620 (IV454_22600) | - | 5086055..5086567 (+) | 513 | WP_206087945.1 | hypothetical protein | - |
| IV454_RS22625 (IV454_22605) | - | 5086545..5087975 (-) | 1431 | WP_206087946.1 | ATP-binding protein | - |
| IV454_RS22630 (IV454_22610) | - | 5087979..5088665 (-) | 687 | WP_206087947.1 | response regulator transcription factor | - |
| IV454_RS22635 (IV454_22615) | - | 5088804..5089274 (+) | 471 | WP_206087948.1 | Spy/CpxP family protein refolding chaperone | - |
| IV454_RS22640 (IV454_22620) | - | 5089292..5089885 (+) | 594 | WP_206087949.1 | hypothetical protein | - |
| IV454_RS22645 (IV454_22625) | - | 5089908..5090762 (+) | 855 | WP_206087950.1 | intradiol ring-cleavage dioxygenase | - |
Sequence
Protein
Download Length: 785 a.a. Molecular weight: 84551.69 Da Isoelectric Point: 9.5275
>NTDB_id=506270 IV454_RS22615 WP_206087944.1 5083649..5086006(+) (comA) [Massilia antarctica strain P8398]
MRAAILGFACGAGLLQIQPVLPSPSTMAACAAIAVLLCCLRGVARLAMAGALAGFCWAALAAHLALAPALAKAVEGRDIT
VVGIVDSLPFRFDDGVRFNFKVERVLGEPVVVPPRVSLAWYAGYRDAAQAIGDVQPGERWQLVVRLQRPHGNMNPYGFDY
EAWLLEQGVRATGYVRPEGGTTRRLDGFVFSLSNLVEHCRATLRERILRALPGKEYAGVIVALVVGDQRAIGQADWDVFN
RTGIGHLISISGLHITMVAGLFASLASLLWRRSFFTDAQLPLLMPAPKVAALTGAAVALLYVLLAGFGVPAQRTLYMLTV
VAAALWFGRLTQVSHVLCVALGVVVVLDPWAIASPGFWLSFGAVAAILFATTGRTVVRQPRWRGVLLVAAHTQYVVTLAL
VPLTMLLFSQVSIVSPLANAVAIPVVSFVVTPLALAGSMLPAPLSTLLLNAAHYAVQGLAWALAWCSGLRFAVWSAPAPE
PWLFVFAVVGTLWMLAPRGWPHRWTGLAAWLPLLTAQPVSPPQGEVFVTAFDVGQGMALLIETGTHRLLYDTGPAYTRES
NGANRVILPYLKARGIGFLDGVVVSHSDIDHAGGARTLLGALKVGWVSSSLWFDHPIVKAAPRHARCSGGQQWTWDGVRF
EMLHPSVESYADASLKPNARGCVLRITAGVHSILLAADIEAAQEAKLVAGSAQLLRAEVLLAPHHGSGTSSTPVFLAAVQ
PRLVLFQVGYRNRYHHPKTEVVERYEKLGIERLRSDESGAIMLDSASGFAPVEYRREHARYWYGK
MRAAILGFACGAGLLQIQPVLPSPSTMAACAAIAVLLCCLRGVARLAMAGALAGFCWAALAAHLALAPALAKAVEGRDIT
VVGIVDSLPFRFDDGVRFNFKVERVLGEPVVVPPRVSLAWYAGYRDAAQAIGDVQPGERWQLVVRLQRPHGNMNPYGFDY
EAWLLEQGVRATGYVRPEGGTTRRLDGFVFSLSNLVEHCRATLRERILRALPGKEYAGVIVALVVGDQRAIGQADWDVFN
RTGIGHLISISGLHITMVAGLFASLASLLWRRSFFTDAQLPLLMPAPKVAALTGAAVALLYVLLAGFGVPAQRTLYMLTV
VAAALWFGRLTQVSHVLCVALGVVVVLDPWAIASPGFWLSFGAVAAILFATTGRTVVRQPRWRGVLLVAAHTQYVVTLAL
VPLTMLLFSQVSIVSPLANAVAIPVVSFVVTPLALAGSMLPAPLSTLLLNAAHYAVQGLAWALAWCSGLRFAVWSAPAPE
PWLFVFAVVGTLWMLAPRGWPHRWTGLAAWLPLLTAQPVSPPQGEVFVTAFDVGQGMALLIETGTHRLLYDTGPAYTRES
NGANRVILPYLKARGIGFLDGVVVSHSDIDHAGGARTLLGALKVGWVSSSLWFDHPIVKAAPRHARCSGGQQWTWDGVRF
EMLHPSVESYADASLKPNARGCVLRITAGVHSILLAADIEAAQEAKLVAGSAQLLRAEVLLAPHHGSGTSSTPVFLAAVQ
PRLVLFQVGYRNRYHHPKTEVVERYEKLGIERLRSDESGAIMLDSASGFAPVEYRREHARYWYGK
Nucleotide
Download Length: 2358 bp
>NTDB_id=506270 IV454_RS22615 WP_206087944.1 5083649..5086006(+) (comA) [Massilia antarctica strain P8398]
ATGCGCGCCGCCATCCTGGGATTTGCCTGTGGCGCCGGCCTGCTGCAAATCCAGCCGGTGCTGCCATCACCATCGACCAT
GGCCGCATGCGCCGCCATCGCGGTCCTGCTTTGCTGTTTGCGCGGTGTTGCGAGGCTGGCCATGGCCGGTGCGCTCGCCG
GTTTCTGCTGGGCCGCGCTGGCCGCCCATCTGGCCTTGGCGCCGGCGCTGGCCAAAGCCGTCGAGGGCCGCGACATCACG
GTCGTCGGCATCGTCGACAGCCTGCCGTTCCGCTTCGATGACGGCGTGCGCTTCAATTTCAAGGTCGAAAGAGTGCTGGG
CGAGCCGGTGGTGGTGCCGCCGCGGGTGTCGCTGGCGTGGTATGCCGGCTACCGCGACGCCGCGCAAGCGATCGGCGACG
TGCAGCCCGGCGAACGCTGGCAGCTGGTCGTGCGCCTGCAACGCCCGCATGGAAACATGAACCCGTACGGCTTCGATTAC
GAGGCCTGGCTGCTGGAGCAGGGCGTGCGCGCCACCGGCTACGTGCGGCCGGAAGGCGGCACGACCCGGCGCCTGGACGG
TTTCGTCTTCAGCTTGTCGAACCTGGTCGAACATTGCCGGGCCACCTTGCGCGAGCGTATCCTGCGCGCGCTGCCGGGCA
AGGAATACGCCGGCGTGATCGTCGCCCTGGTCGTCGGCGACCAGCGCGCCATCGGGCAGGCCGACTGGGACGTGTTCAAC
CGCACCGGAATCGGCCACCTCATTTCCATCTCCGGCCTGCATATCACCATGGTGGCCGGACTGTTCGCGTCGCTCGCCTC
CTTGCTGTGGCGGCGCTCGTTTTTCACCGATGCGCAACTGCCTTTGCTGATGCCGGCGCCGAAGGTGGCCGCCTTGACGG
GCGCCGCGGTCGCGCTGTTGTACGTGCTGCTGGCCGGCTTCGGCGTGCCGGCGCAGCGCACTCTGTATATGTTGACCGTG
GTCGCCGCCGCGCTCTGGTTCGGCCGGCTCACGCAGGTGTCGCACGTGCTGTGCGTGGCGCTCGGCGTGGTCGTCGTGCT
CGATCCGTGGGCGATCGCTTCGCCCGGTTTCTGGCTGTCGTTCGGGGCCGTGGCGGCGATCCTGTTCGCGACCACCGGCC
GCACCGTTGTCAGGCAGCCGCGCTGGCGCGGCGTGCTGCTGGTGGCCGCGCATACGCAGTATGTGGTCACGCTCGCGCTG
GTTCCCCTGACGATGCTGCTGTTCTCGCAGGTGTCCATCGTCAGTCCGCTGGCCAACGCGGTGGCGATCCCGGTGGTCAG
CTTCGTCGTCACGCCGCTCGCGCTGGCCGGCAGCATGCTGCCGGCACCCTTGTCCACCTTGCTGCTCAATGCCGCGCACT
ATGCCGTGCAGGGACTGGCCTGGGCGCTCGCATGGTGCTCGGGATTGCGTTTTGCGGTGTGGAGCGCACCGGCTCCCGAG
CCGTGGCTGTTCGTGTTCGCGGTCGTCGGCACCTTGTGGATGCTCGCCCCGCGCGGCTGGCCGCACCGCTGGACCGGGCT
GGCCGCGTGGCTGCCGCTGCTCACCGCCCAGCCTGTGTCTCCGCCGCAGGGCGAGGTGTTCGTGACCGCATTCGATGTCG
GGCAGGGCATGGCGCTGTTGATCGAAACCGGCACGCACCGGCTGCTGTACGACACCGGACCGGCCTACACCCGCGAGTCG
AACGGTGCCAACCGGGTGATCCTGCCGTACCTCAAGGCGCGCGGCATCGGCTTTCTCGACGGCGTGGTCGTCAGCCACAG
CGACATTGACCATGCCGGCGGCGCGCGCACCTTGCTCGGCGCGCTCAAGGTCGGCTGGGTGTCGTCGTCGCTGTGGTTCG
ACCATCCGATCGTCAAGGCCGCGCCACGCCATGCCCGCTGCAGCGGCGGGCAGCAGTGGACCTGGGATGGCGTGCGCTTC
GAGATGCTGCATCCGAGCGTGGAAAGCTATGCGGATGCCAGCCTCAAACCGAACGCGCGCGGCTGCGTGCTGCGGATCAC
GGCGGGCGTCCATTCCATCCTGCTGGCGGCCGATATCGAGGCCGCGCAGGAGGCGAAGCTGGTCGCCGGATCGGCGCAGC
TGCTGCGCGCCGAGGTTTTGCTGGCGCCGCATCATGGCAGCGGTACCTCGTCGACACCGGTCTTCCTGGCCGCTGTGCAG
CCACGCCTGGTGCTGTTCCAAGTCGGTTATCGCAACCGCTACCATCACCCCAAGACGGAGGTGGTGGAGCGCTACGAAAA
ACTAGGCATCGAACGCTTGCGCTCGGACGAATCGGGAGCGATCATGCTCGATTCGGCCAGCGGCTTCGCGCCGGTCGAAT
ACCGGCGAGAACATGCGCGTTACTGGTACGGAAAGTAG
ATGCGCGCCGCCATCCTGGGATTTGCCTGTGGCGCCGGCCTGCTGCAAATCCAGCCGGTGCTGCCATCACCATCGACCAT
GGCCGCATGCGCCGCCATCGCGGTCCTGCTTTGCTGTTTGCGCGGTGTTGCGAGGCTGGCCATGGCCGGTGCGCTCGCCG
GTTTCTGCTGGGCCGCGCTGGCCGCCCATCTGGCCTTGGCGCCGGCGCTGGCCAAAGCCGTCGAGGGCCGCGACATCACG
GTCGTCGGCATCGTCGACAGCCTGCCGTTCCGCTTCGATGACGGCGTGCGCTTCAATTTCAAGGTCGAAAGAGTGCTGGG
CGAGCCGGTGGTGGTGCCGCCGCGGGTGTCGCTGGCGTGGTATGCCGGCTACCGCGACGCCGCGCAAGCGATCGGCGACG
TGCAGCCCGGCGAACGCTGGCAGCTGGTCGTGCGCCTGCAACGCCCGCATGGAAACATGAACCCGTACGGCTTCGATTAC
GAGGCCTGGCTGCTGGAGCAGGGCGTGCGCGCCACCGGCTACGTGCGGCCGGAAGGCGGCACGACCCGGCGCCTGGACGG
TTTCGTCTTCAGCTTGTCGAACCTGGTCGAACATTGCCGGGCCACCTTGCGCGAGCGTATCCTGCGCGCGCTGCCGGGCA
AGGAATACGCCGGCGTGATCGTCGCCCTGGTCGTCGGCGACCAGCGCGCCATCGGGCAGGCCGACTGGGACGTGTTCAAC
CGCACCGGAATCGGCCACCTCATTTCCATCTCCGGCCTGCATATCACCATGGTGGCCGGACTGTTCGCGTCGCTCGCCTC
CTTGCTGTGGCGGCGCTCGTTTTTCACCGATGCGCAACTGCCTTTGCTGATGCCGGCGCCGAAGGTGGCCGCCTTGACGG
GCGCCGCGGTCGCGCTGTTGTACGTGCTGCTGGCCGGCTTCGGCGTGCCGGCGCAGCGCACTCTGTATATGTTGACCGTG
GTCGCCGCCGCGCTCTGGTTCGGCCGGCTCACGCAGGTGTCGCACGTGCTGTGCGTGGCGCTCGGCGTGGTCGTCGTGCT
CGATCCGTGGGCGATCGCTTCGCCCGGTTTCTGGCTGTCGTTCGGGGCCGTGGCGGCGATCCTGTTCGCGACCACCGGCC
GCACCGTTGTCAGGCAGCCGCGCTGGCGCGGCGTGCTGCTGGTGGCCGCGCATACGCAGTATGTGGTCACGCTCGCGCTG
GTTCCCCTGACGATGCTGCTGTTCTCGCAGGTGTCCATCGTCAGTCCGCTGGCCAACGCGGTGGCGATCCCGGTGGTCAG
CTTCGTCGTCACGCCGCTCGCGCTGGCCGGCAGCATGCTGCCGGCACCCTTGTCCACCTTGCTGCTCAATGCCGCGCACT
ATGCCGTGCAGGGACTGGCCTGGGCGCTCGCATGGTGCTCGGGATTGCGTTTTGCGGTGTGGAGCGCACCGGCTCCCGAG
CCGTGGCTGTTCGTGTTCGCGGTCGTCGGCACCTTGTGGATGCTCGCCCCGCGCGGCTGGCCGCACCGCTGGACCGGGCT
GGCCGCGTGGCTGCCGCTGCTCACCGCCCAGCCTGTGTCTCCGCCGCAGGGCGAGGTGTTCGTGACCGCATTCGATGTCG
GGCAGGGCATGGCGCTGTTGATCGAAACCGGCACGCACCGGCTGCTGTACGACACCGGACCGGCCTACACCCGCGAGTCG
AACGGTGCCAACCGGGTGATCCTGCCGTACCTCAAGGCGCGCGGCATCGGCTTTCTCGACGGCGTGGTCGTCAGCCACAG
CGACATTGACCATGCCGGCGGCGCGCGCACCTTGCTCGGCGCGCTCAAGGTCGGCTGGGTGTCGTCGTCGCTGTGGTTCG
ACCATCCGATCGTCAAGGCCGCGCCACGCCATGCCCGCTGCAGCGGCGGGCAGCAGTGGACCTGGGATGGCGTGCGCTTC
GAGATGCTGCATCCGAGCGTGGAAAGCTATGCGGATGCCAGCCTCAAACCGAACGCGCGCGGCTGCGTGCTGCGGATCAC
GGCGGGCGTCCATTCCATCCTGCTGGCGGCCGATATCGAGGCCGCGCAGGAGGCGAAGCTGGTCGCCGGATCGGCGCAGC
TGCTGCGCGCCGAGGTTTTGCTGGCGCCGCATCATGGCAGCGGTACCTCGTCGACACCGGTCTTCCTGGCCGCTGTGCAG
CCACGCCTGGTGCTGTTCCAAGTCGGTTATCGCAACCGCTACCATCACCCCAAGACGGAGGTGGTGGAGCGCTACGAAAA
ACTAGGCATCGAACGCTTGCGCTCGGACGAATCGGGAGCGATCATGCTCGATTCGGCCAGCGGCTTCGCGCCGGTCGAAT
ACCGGCGAGAACATGCGCGTTACTGGTACGGAAAGTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA | Ralstonia pseudosolanacearum GMI1000 |
45.258 |
100 |
0.48 |
| comA | Pseudomonas stutzeri DSM 10701 |
38.021 |
97.834 |
0.372 |