Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | RJW56_RS03040 | Genome accession | NZ_CP134472 |
| Coordinates | 604113..606350 (+) | Length | 745 a.a. |
| NCBI ID | WP_024394380.1 | Uniprot ID | - |
| Organism | Streptococcus suis strain 1522228 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
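
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The helper names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database tooling.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residues encoded by a CDS that ends in exactly one stop codon (assumed)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


# Coordinates of RJW56_RS03040 as listed in this record
cds_bp = gene_length_bp(604113, 606350)
print(cds_bp, protein_length_aa(cds_bp))  # prints "2238 745"
```

Under these assumptions the result agrees with the 2238 bp nucleotide length and the 745 a.a. protein length reported for this gene.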
Genomic Context
Location: 599113..611350
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| RJW56_RS03015 (RJW56_03015) | - | 599185..600114 (-) | 930 | WP_009909659.1 | ABC transporter substrate-binding protein | - |
| RJW56_RS03020 (RJW56_03020) | - | 600111..600911 (-) | 801 | WP_013730319.1 | ABC transporter ATP-binding protein | - |
| RJW56_RS03025 (RJW56_03025) | - | 601063..602508 (-) | 1446 | WP_013730320.1 | UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase | - |
| RJW56_RS03030 (RJW56_03030) | - | 602654..603400 (+) | 747 | WP_014735985.1 | 1-acyl-sn-glycerol-3-phosphate acyltransferase | - |
| RJW56_RS03035 (RJW56_03035) | - | 603466..604129 (+) | 664 | Protein_551 | helix-hairpin-helix domain-containing protein | - |
| RJW56_RS03040 (RJW56_03040) | comEC/celB | 604113..606350 (+) | 2238 | WP_024394380.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| RJW56_RS03045 (RJW56_03045) | - | 606359..606481 (+) | 123 | Protein_553 | IS982 family transposase | - |
| RJW56_RS03050 (RJW56_03050) | - | 606664..607320 (+) | 657 | WP_024394381.1 | CBS domain-containing protein | - |
| RJW56_RS03055 (RJW56_03055) | - | 607414..608241 (+) | 828 | WP_023370707.1 | hypothetical protein | - |
| RJW56_RS03060 (RJW56_03060) | - | 608247..609518 (+) | 1272 | WP_002935355.1 | toxic anion resistance protein | - |
| RJW56_RS03065 (RJW56_03065) | - | 609555..610421 (-) | 867 | WP_002935357.1 | YitT family protein | - |
| RJW56_RS03070 (RJW56_03070) | tmk | 610596..611234 (+) | 639 | WP_002935358.1 | dTMP kinase | - |
Sequence
Protein
Length: 745 a.a.; Molecular weight: 84107.06 Da; Isoelectric point: 7.6331
>NTDB_id=880105 RJW56_RS03040 WP_024394380.1 604113..606350(+) (comEC/celB) [Streptococcus suis strain 1522228]
MSRLSKLPCQPIHLAVLAVAVYFAVHSFSLLKMGLLSLLLLVFGFRQGKAVFLKTLPLLALCGMFFGYQKVQWERADQSA
PEQVTTVQVIPDTIDINGDSLSFRGRADGQPYQIFYKLTSQEEQAYFQKLADLVQLEVEVEVSLPAGQRNFKGFDYQAYL
KTQGIYRTVKLTAIKKIVPVQSWNVFDWLSTWRRQALVYVKTNFPAPMSHYMTGLLFGELDSEFDQMSDLYSSLGIIHLF
ALSGMQVGFFIDKFRWILLRLGLTKEAVDKLQIPFSLFYAGLTGFSVSVVRSLVQKILGNLGLRKLDNFAVTVFVCLLIL
PRFLLTAGGVLTFTYAFLLTVFDFEDLGQVKKAAVESLSISLGILPVLMTYFYAFQPLSILLTFAFSFVFDVLLLPGLSV
IFLLSPIVKITWVNGFFVFMEKIIVWVADLGLRPWILGKPSGLVLLFLLVSLFLLYDFHRKKNWLLGLSLVLALLFFITK
HPLENEVTVVDIGQGDSIFLRDMRGRTVLIDVGGRVDFAAKEEWQERSSQANAERTLIPYLHSRGVDRIDSLVLTHTDTD
HVGDVLEVAKQVQIGRIYVSPGSLTVPDFVATLKEINVPVHVVKVGDRLPIFDSYLEVLYPDGTGDGGNNDSIVLYGRLL
ETNFLFTGDLEQGELDLIKAYPNLPVDVLKAGHHGSKGSSYPEFLDHIGAKIALISAGENNRYKHPHQETLDRFDSRNIQ
VYRTDQQGAIRFRGWKEWRIETVRR
Nucleotide
Length: 2238 bp
>NTDB_id=880105 RJW56_RS03040 WP_024394380.1 604113..606350(+) (comEC/celB) [Streptococcus suis strain 1522228]
ATGTCACGGTTGAGTAAGCTCCCCTGCCAGCCCATTCACCTGGCAGTCTTGGCGGTGGCAGTTTACTTTGCAGTCCATTC
TTTTTCCCTCTTGAAAATGGGTCTGCTGAGTTTGTTGTTGCTGGTCTTTGGTTTTCGGCAAGGTAAGGCAGTTTTCCTCA
AAACGCTGCCGCTTCTAGCCCTGTGCGGTATGTTTTTCGGCTACCAAAAGGTCCAATGGGAGCGGGCAGACCAGTCAGCC
CCAGAGCAAGTCACAACTGTGCAGGTCATTCCAGATACCATAGACATCAATGGCGACAGCTTGTCTTTTCGTGGTCGGGC
TGACGGACAACCCTATCAGATTTTCTATAAATTAACCAGTCAGGAGGAGCAGGCTTATTTTCAAAAGCTGGCAGACTTAG
TCCAGCTAGAAGTGGAAGTAGAGGTCAGCTTGCCAGCTGGTCAGCGGAATTTCAAGGGCTTTGACTATCAGGCCTATCTG
AAAACACAGGGCATCTATCGGACAGTCAAGCTAACGGCGATTAAGAAGATTGTCCCTGTCCAGTCTTGGAATGTTTTTGA
CTGGCTGTCAACTTGGCGGAGGCAGGCCTTGGTCTATGTCAAAACCAATTTTCCAGCACCAATGAGTCACTACATGACAG
GACTCTTGTTTGGGGAATTGGATAGTGAGTTTGACCAGATGAGTGACCTCTATTCCAGTTTGGGTATCATTCATCTCTTT
GCCCTGTCTGGTATGCAGGTGGGCTTTTTCATCGACAAGTTTCGTTGGATTTTGTTGCGTCTGGGCTTGACCAAGGAAGC
CGTAGATAAGTTGCAGATTCCCTTTTCACTTTTTTATGCTGGTTTGACAGGCTTTTCTGTGTCGGTGGTGCGGTCCTTGG
TCCAGAAAATTCTGGGTAATCTGGGCTTGCGGAAGCTGGATAACTTTGCGGTGACAGTTTTTGTCTGTCTGCTGATTCTG
CCCCGATTTTTACTGACAGCAGGCGGAGTGCTGACTTTTACCTATGCTTTTTTGCTGACGGTTTTTGATTTTGAGGACTT
GGGGCAGGTCAAAAAGGCTGCGGTGGAGAGTCTCAGTATTTCGCTGGGAATTTTGCCGGTGCTTATGACCTATTTCTATG
CCTTTCAGCCCTTATCTATCCTCCTGACCTTTGCCTTTTCTTTTGTCTTTGATGTCCTACTCTTGCCAGGTTTGTCAGTC
ATTTTTCTCCTGTCGCCAATCGTCAAGATTACTTGGGTTAACGGATTTTTTGTCTTTATGGAAAAAATCATTGTCTGGGT
GGCGGATTTGGGATTGCGACCTTGGATACTGGGCAAGCCGTCTGGGTTGGTCCTCTTGTTCTTGCTGGTCAGCCTTTTCT
TGCTCTACGATTTCCACAGGAAGAAAAACTGGCTCCTAGGACTAAGCCTGGTCCTCGCTCTGCTATTTTTCATCACCAAA
CACCCGCTGGAAAACGAGGTGACGGTGGTGGACATCGGGCAGGGGGACAGCATCTTTTTGCGGGATATGCGGGGGCGGAC
GGTCCTTATTGATGTGGGCGGTCGGGTCGATTTTGCGGCCAAGGAGGAGTGGCAGGAGCGGTCTTCGCAGGCAAATGCAG
AGCGGACTTTGATTCCCTATCTGCACAGTCGAGGTGTGGATAGGATTGATAGTTTGGTCCTGACCCACACCGACACAGAC
CATGTGGGCGATGTGCTGGAAGTAGCTAAGCAGGTGCAAATCGGTCGGATTTACGTGTCGCCAGGAAGCCTGACGGTACC
TGACTTTGTAGCAACCTTGAAAGAAATCAATGTCCCTGTTCATGTGGTGAAAGTGGGCGACCGCTTGCCGATTTTCGATT
CTTATCTGGAAGTCCTTTATCCAGACGGGACAGGCGACGGTGGTAATAATGATTCTATTGTTCTCTACGGTCGCTTGTTG
GAAACCAATTTCCTCTTTACAGGGGACTTGGAGCAGGGGGAACTGGACTTGATAAAAGCCTATCCCAACCTGCCAGTCGA
TGTTCTCAAAGCTGGTCACCATGGTTCAAAAGGCTCTTCCTATCCCGAATTTTTAGACCATATCGGTGCCAAGATTGCCT
TGATATCTGCTGGAGAAAATAACCGCTACAAGCATCCTCATCAAGAAACCTTGGACCGCTTTGACAGTCGGAATATCCAA
GTCTATCGTACCGACCAGCAAGGAGCCATCCGCTTCCGAGGTTGGAAGGAGTGGAGGATTGAAACGGTGAGGAGGTAA
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 | 53.701 | 99.732 | 0.536 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 52.925 | 98.658 | 0.522 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 52.038 | 98.792 | 0.514 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 51.482 | 99.597 | 0.513 |
| comEC/celB | Streptococcus pneumoniae D39 | 51.482 | 99.597 | 0.513 |
| comEC/celB | Streptococcus pneumoniae R6 | 51.482 | 99.597 | 0.513 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 49.067 | 100 | 0.494 |