Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | R5H38_RS03180 | Genome accession | NZ_CP137602 |
| Coordinates | 612438..614675 (+) | Length | 745 a.a. |
| NCBI ID | WP_318150846.1 | Uniprot ID | - |
| Organism | Streptococcus parasuis strain 221006 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 607438..619675
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| R5H38_RS03150 | - | 608709..609115 (+) | 407 | Protein_582 | GNAT family N-acetyltransferase | - |
| R5H38_RS03155 | - | 609236..609847 (+) | 612 | WP_277836757.1 | FMN-dependent NADH-azoreductase | - |
| R5H38_RS03160 | - | 609865..610137 (-) | 273 | WP_318150845.1 | GIY-YIG nuclease family protein | - |
| R5H38_RS03165 | - | 610127..610876 (-) | 750 | WP_130554335.1 | tRNA1(Val) (adenine(37)-N6)-methyltransferase | - |
| R5H38_RS03170 | - | 610968..611711 (+) | 744 | WP_217374832.1 | lysophospholipid acyltransferase family protein | - |
| R5H38_RS03175 | - | 611777..612454 (+) | 678 | WP_274504678.1 | helix-hairpin-helix domain-containing protein | - |
| R5H38_RS03180 | comEC/celB | 612438..614675 (+) | 2238 | WP_318150846.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| R5H38_RS03185 | holA | 614739..615770 (+) | 1032 | WP_318150847.1 | DNA polymerase III subunit delta | - |
| R5H38_RS03190 | sodA | 615847..616452 (+) | 606 | WP_217374836.1 | superoxide dismutase | - |
| R5H38_RS03195 | rplS | 616654..617001 (+) | 348 | WP_011921928.1 | 50S ribosomal protein L19 | - |
| R5H38_RS03200 | - | 617274..619013 (+) | 1740 | WP_318150848.1 | ABC transporter ATP-binding protein | - |
Sequence
Protein
Download Length: 745 a.a. Molecular weight: 84507.79 Da Isoelectric Point: 6.9943
>NTDB_id=899231 R5H38_RS03180 WP_318150846.1 612438..614675(+) (comEC/celB) [Streptococcus parasuis strain 221006]
MSRLIRLPCQPIHFAVLAVLAYFAVHSFSLLTMSLLSLLLAVFRLRQGKVVFIRTLPLLALCGLFFGCQKIQWERTNQWA
PEQVTTVQVIPDTIDVNGDSLSFRGRAEGQVFQVFYKVASQEEQTYFHELTDLVQLEVDAEVCQPAGQRNFNGFDYQAYL
KTQGIYRTVKISTINNILPVHSWNIFDWLSTWRRQALVYIKSHFPAPMSHYMTGLLLGELDSDFDQMSDLYSSLGIIHLF
ALSGMQVGFFIDKFRWLLLRLGLTKETVDKLQIPFSLIYAGLTGFSVSVVRSLVQKILGNLGLRKLDNFAATVFVCLLLM
PRFLLTAGGVLTFTYALLLTVFDFEELGQLKKIAVESLSISIGILPVLMTYFFAFQPLSILLTFVFSFVFDVLLLPGLSV
ILLLSPFIKITWVNGFFILMEKIIVWVAELGIRPWILGKPTGLVFLLLLFCLFLLYDFHREKKWLLGLSLILVLLFFITK
HPLENEVTVVDVGQGDSIFLRDIRGRTVLIDVGGRVDFAAKEAWRERAREANAERTLIPYLHSRGVDRIDDLVLTHTDAD
HVGDVLELAKQIQIGKIYVSPGSLTVPDFVATLRRINVPVHVVNPGDRLPIFDSYLEVLYPNRIGDGGNNDSIVLYGRLL
KMNFLFTGDLEQGELDLITSYPQLPVDVLKAGHHGSKGSSYPEFLDHIGAKIALISAGEDNRYQHPHKETLERLDSQNMQ
VYRTDLQGAIRFRGWKQWSIETVKE
MSRLIRLPCQPIHFAVLAVLAYFAVHSFSLLTMSLLSLLLAVFRLRQGKVVFIRTLPLLALCGLFFGCQKIQWERTNQWA
PEQVTTVQVIPDTIDVNGDSLSFRGRAEGQVFQVFYKVASQEEQTYFHELTDLVQLEVDAEVCQPAGQRNFNGFDYQAYL
KTQGIYRTVKISTINNILPVHSWNIFDWLSTWRRQALVYIKSHFPAPMSHYMTGLLLGELDSDFDQMSDLYSSLGIIHLF
ALSGMQVGFFIDKFRWLLLRLGLTKETVDKLQIPFSLIYAGLTGFSVSVVRSLVQKILGNLGLRKLDNFAATVFVCLLLM
PRFLLTAGGVLTFTYALLLTVFDFEELGQLKKIAVESLSISIGILPVLMTYFFAFQPLSILLTFVFSFVFDVLLLPGLSV
ILLLSPFIKITWVNGFFILMEKIIVWVAELGIRPWILGKPTGLVFLLLLFCLFLLYDFHREKKWLLGLSLILVLLFFITK
HPLENEVTVVDVGQGDSIFLRDIRGRTVLIDVGGRVDFAAKEAWRERAREANAERTLIPYLHSRGVDRIDDLVLTHTDAD
HVGDVLELAKQIQIGKIYVSPGSLTVPDFVATLRRINVPVHVVNPGDRLPIFDSYLEVLYPNRIGDGGNNDSIVLYGRLL
KMNFLFTGDLEQGELDLITSYPQLPVDVLKAGHHGSKGSSYPEFLDHIGAKIALISAGEDNRYQHPHKETLERLDSQNMQ
VYRTDLQGAIRFRGWKQWSIETVKE
Nucleotide
Download Length: 2238 bp
>NTDB_id=899231 R5H38_RS03180 WP_318150846.1 612438..614675(+) (comEC/celB) [Streptococcus parasuis strain 221006]
ATGTCACGGTTGATTAGACTCCCCTGTCAGCCCATTCACTTTGCAGTTTTGGCGGTGTTAGCCTACTTTGCCGTTCACTC
TTTTTCCCTTTTGACAATGAGCCTGCTGAGTCTGTTACTAGCAGTCTTTAGGCTTCGGCAAGGAAAGGTGGTCTTCATCA
GAACGCTACCGCTTTTAGCCTTATGTGGTCTCTTCTTCGGATGTCAGAAGATACAATGGGAGCGGACAAATCAATGGGCT
CCAGAGCAAGTGACAACTGTGCAGGTTATTCCTGATACCATTGATGTCAACGGAGACAGTCTATCTTTTCGTGGTCGGGC
TGAAGGTCAAGTTTTTCAGGTTTTCTATAAAGTTGCAAGTCAGGAAGAACAAACCTACTTTCATGAGCTTACGGACTTGG
TGCAGTTAGAGGTAGATGCAGAAGTTTGCCAACCAGCAGGTCAACGTAATTTCAATGGTTTTGATTATCAGGCTTATCTC
AAAACCCAGGGCATCTATCGGACAGTAAAAATAAGTACCATTAACAATATTCTACCTGTTCATTCTTGGAATATCTTTGA
CTGGTTGTCAACCTGGCGGAGGCAGGCTCTCGTTTATATCAAATCTCATTTTCCTGCTCCCATGAGCCACTACATGACTG
GATTACTATTGGGAGAGTTAGATAGTGACTTTGACCAAATGAGTGATCTCTATTCTAGTTTAGGGATCATTCATCTTTTT
GCCCTGTCTGGGATGCAGGTTGGTTTTTTCATTGACAAATTTCGCTGGCTTTTATTGCGTTTGGGTTTAACAAAGGAAAC
TGTCGATAAACTTCAAATTCCGTTTTCTCTTATTTATGCAGGATTAACAGGATTTTCAGTATCAGTCGTGCGGTCCTTGG
TCCAGAAAATTCTTGGTAATCTCGGGCTACGAAAATTGGATAATTTTGCAGCAACTGTCTTTGTTTGTCTCTTGCTTATG
CCACGTTTTCTTCTGACAGCAGGAGGTGTGCTGACATTTACCTATGCTTTGTTATTGACAGTCTTTGATTTTGAAGAGTT
AGGGCAGCTAAAAAAGATAGCAGTGGAGAGTCTGAGTATTTCTATTGGGATTTTACCGGTCTTGATGACCTATTTTTTTG
CCTTTCAGCCCTTATCTATCCTTTTAACGTTTGTTTTTTCCTTTGTTTTTGATGTGTTGTTGTTACCTGGGCTATCTGTC
ATTCTTTTACTATCGCCCTTCATTAAAATTACGTGGGTCAACGGATTCTTTATCCTTATGGAAAAGATTATTGTTTGGGT
GGCAGAATTGGGGATTCGACCTTGGATTTTAGGAAAACCTACGGGCCTTGTCTTTTTGCTCTTGCTGTTCTGCCTTTTCT
TGCTTTATGATTTTCACAGAGAGAAGAAATGGCTCCTTGGATTGAGTCTGATCCTTGTTCTGCTATTTTTCATAACCAAA
CACCCGCTGGAAAATGAGGTGACGGTGGTAGACGTAGGGCAGGGGGATAGTATCTTTTTGCGGGACATTCGGGGGCGGAC
GGTTCTGATTGATGTGGGTGGTCGGGTTGACTTTGCTGCAAAGGAAGCTTGGCGGGAGCGGGCTAGGGAAGCAAATGCGG
AGCGAACACTGATTCCTTACCTGCATAGTCGAGGTGTGGATCGGATTGATGATTTGGTTCTGACCCATACCGATGCAGAT
CATGTGGGTGATGTGCTAGAATTGGCTAAGCAGATTCAAATAGGTAAGATTTACGTTTCTCCAGGTAGTTTGACTGTACC
AGATTTTGTTGCGACTTTGAGGAGAATAAATGTCCCTGTTCATGTTGTAAATCCTGGAGATCGATTGCCCATTTTTGATT
CCTATCTAGAAGTTCTATATCCCAATAGAATCGGAGATGGAGGCAATAATGACTCAATTGTACTCTATGGTCGTTTGTTA
AAAATGAATTTTCTCTTTACCGGTGACTTGGAGCAAGGGGAATTAGATTTAATCACTTCTTATCCGCAGCTACCAGTCGA
TGTGCTGAAAGCAGGTCACCATGGTTCCAAGGGCTCTTCATATCCAGAATTTTTAGACCATATTGGAGCAAAAATTGCTC
TGATTTCTGCTGGTGAAGATAATCGCTATCAACATCCACATAAGGAAACTCTGGAACGTCTTGACAGTCAAAATATGCAG
GTTTACCGAACGGATCTGCAAGGAGCAATCCGTTTCCGAGGTTGGAAACAGTGGAGTATTGAAACGGTAAAAGAGTGA
ATGTCACGGTTGATTAGACTCCCCTGTCAGCCCATTCACTTTGCAGTTTTGGCGGTGTTAGCCTACTTTGCCGTTCACTC
TTTTTCCCTTTTGACAATGAGCCTGCTGAGTCTGTTACTAGCAGTCTTTAGGCTTCGGCAAGGAAAGGTGGTCTTCATCA
GAACGCTACCGCTTTTAGCCTTATGTGGTCTCTTCTTCGGATGTCAGAAGATACAATGGGAGCGGACAAATCAATGGGCT
CCAGAGCAAGTGACAACTGTGCAGGTTATTCCTGATACCATTGATGTCAACGGAGACAGTCTATCTTTTCGTGGTCGGGC
TGAAGGTCAAGTTTTTCAGGTTTTCTATAAAGTTGCAAGTCAGGAAGAACAAACCTACTTTCATGAGCTTACGGACTTGG
TGCAGTTAGAGGTAGATGCAGAAGTTTGCCAACCAGCAGGTCAACGTAATTTCAATGGTTTTGATTATCAGGCTTATCTC
AAAACCCAGGGCATCTATCGGACAGTAAAAATAAGTACCATTAACAATATTCTACCTGTTCATTCTTGGAATATCTTTGA
CTGGTTGTCAACCTGGCGGAGGCAGGCTCTCGTTTATATCAAATCTCATTTTCCTGCTCCCATGAGCCACTACATGACTG
GATTACTATTGGGAGAGTTAGATAGTGACTTTGACCAAATGAGTGATCTCTATTCTAGTTTAGGGATCATTCATCTTTTT
GCCCTGTCTGGGATGCAGGTTGGTTTTTTCATTGACAAATTTCGCTGGCTTTTATTGCGTTTGGGTTTAACAAAGGAAAC
TGTCGATAAACTTCAAATTCCGTTTTCTCTTATTTATGCAGGATTAACAGGATTTTCAGTATCAGTCGTGCGGTCCTTGG
TCCAGAAAATTCTTGGTAATCTCGGGCTACGAAAATTGGATAATTTTGCAGCAACTGTCTTTGTTTGTCTCTTGCTTATG
CCACGTTTTCTTCTGACAGCAGGAGGTGTGCTGACATTTACCTATGCTTTGTTATTGACAGTCTTTGATTTTGAAGAGTT
AGGGCAGCTAAAAAAGATAGCAGTGGAGAGTCTGAGTATTTCTATTGGGATTTTACCGGTCTTGATGACCTATTTTTTTG
CCTTTCAGCCCTTATCTATCCTTTTAACGTTTGTTTTTTCCTTTGTTTTTGATGTGTTGTTGTTACCTGGGCTATCTGTC
ATTCTTTTACTATCGCCCTTCATTAAAATTACGTGGGTCAACGGATTCTTTATCCTTATGGAAAAGATTATTGTTTGGGT
GGCAGAATTGGGGATTCGACCTTGGATTTTAGGAAAACCTACGGGCCTTGTCTTTTTGCTCTTGCTGTTCTGCCTTTTCT
TGCTTTATGATTTTCACAGAGAGAAGAAATGGCTCCTTGGATTGAGTCTGATCCTTGTTCTGCTATTTTTCATAACCAAA
CACCCGCTGGAAAATGAGGTGACGGTGGTAGACGTAGGGCAGGGGGATAGTATCTTTTTGCGGGACATTCGGGGGCGGAC
GGTTCTGATTGATGTGGGTGGTCGGGTTGACTTTGCTGCAAAGGAAGCTTGGCGGGAGCGGGCTAGGGAAGCAAATGCGG
AGCGAACACTGATTCCTTACCTGCATAGTCGAGGTGTGGATCGGATTGATGATTTGGTTCTGACCCATACCGATGCAGAT
CATGTGGGTGATGTGCTAGAATTGGCTAAGCAGATTCAAATAGGTAAGATTTACGTTTCTCCAGGTAGTTTGACTGTACC
AGATTTTGTTGCGACTTTGAGGAGAATAAATGTCCCTGTTCATGTTGTAAATCCTGGAGATCGATTGCCCATTTTTGATT
CCTATCTAGAAGTTCTATATCCCAATAGAATCGGAGATGGAGGCAATAATGACTCAATTGTACTCTATGGTCGTTTGTTA
AAAATGAATTTTCTCTTTACCGGTGACTTGGAGCAAGGGGAATTAGATTTAATCACTTCTTATCCGCAGCTACCAGTCGA
TGTGCTGAAAGCAGGTCACCATGGTTCCAAGGGCTCTTCATATCCAGAATTTTTAGACCATATTGGAGCAAAAATTGCTC
TGATTTCTGCTGGTGAAGATAATCGCTATCAACATCCACATAAGGAAACTCTGGAACGTCTTGACAGTCAAAATATGCAG
GTTTACCGAACGGATCTGCAAGGAGCAATCCGTTTCCGAGGTTGGAAACAGTGGAGTATTGAAACGGTAAAAGAGTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
53.815 |
100 |
0.54 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
52.561 |
99.597 |
0.523 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
51.817 |
99.732 |
0.517 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
51.279 |
99.732 |
0.511 |
| comEC/celB | Streptococcus pneumoniae D39 |
51.279 |
99.732 |
0.511 |
| comEC/celB | Streptococcus pneumoniae R6 |
51.279 |
99.732 |
0.511 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
47.13 |
100 |
0.474 |