Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | DP111_RS08835 | Genome accession | NZ_CP030124 |
| Coordinates | 1740313..1742550 (-) | Length | 745 a.a. |
| NCBI ID | WP_043033064.1 | Uniprot ID | - |
| Organism | Streptococcus suis strain SH1510 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
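The coordinates and the protein length in the Overview can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the gene is a stop codon; the helper name `span_length` is illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a closed, 1-based interval start..end."""
    return end - start + 1


# Coordinates of comEC/celB as listed in the Overview (minus strand).
start, end = 1740313, 1742550

gene_bp = span_length(start, end)   # length of the coding region in bp
codons = gene_bp // 3               # number of complete codons
protein_aa = codons - 1             # assumption: the last codon is a stop

print(gene_bp, codons, protein_aa)
```

Under these assumptions the result agrees with the listed length of 745 a.a.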
Genomic Context
Location: 1735313..1747550
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| DP111_RS08810 (DP111_08810) | tmk | 1735419..1736057 (-) | 639 | WP_002935358.1 | dTMP kinase | - |
| DP111_RS08815 (DP111_08815) | - | 1736232..1737098 (+) | 867 | WP_002935357.1 | YitT family protein | - |
| DP111_RS08820 (DP111_08820) | - | 1737135..1738406 (-) | 1272 | WP_002935355.1 | toxic anion resistance protein | - |
| DP111_RS08825 (DP111_08825) | - | 1738412..1739239 (-) | 828 | WP_002935354.1 | hypothetical protein | - |
| DP111_RS08830 (DP111_08830) | - | 1739333..1739989 (-) | 657 | WP_002935353.1 | CBS domain-containing protein | - |
| DP111_RS11215 | - | 1740171..1740296 (-) | 126 | Protein_1668 | IS982 family transposase | - |
| DP111_RS08835 (DP111_08835) | comEC/celB | 1740313..1742550 (-) | 2238 | WP_043033064.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| DP111_RS08840 (DP111_08840) | comEA/celA/cilE | 1742534..1743196 (-) | 663 | WP_079394055.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| DP111_RS08845 (DP111_08845) | - | 1743267..1744013 (-) | 747 | WP_014735985.1 | 1-acyl-sn-glycerol-3-phosphate acyltransferase | - |
| DP111_RS08850 (DP111_08850) | - | 1744159..1745604 (+) | 1446 | WP_114876557.1 | UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase | - |
| DP111_RS08855 (DP111_08855) | - | 1745756..1746556 (+) | 801 | WP_013730319.1 | ABC transporter ATP-binding protein | - |
| DP111_RS08860 (DP111_08860) | - | 1746553..1747482 (+) | 930 | WP_009909659.1 | ABC transporter substrate-binding protein | - |
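The coordinate strings in the table above can be parsed to examine how neighbouring genes relate to each other. The following is a minimal sketch, not the database's own tooling: `parse_coords`, `overlap_bp`, and `FLANK` are hypothetical names, and the 5000 bp flank is inferred from the Location line rather than documented.

```python
import re


def parse_coords(text: str) -> tuple[int, int, str]:
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    m = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    return int(m.group(1)), int(m.group(2)), m.group(3)


def overlap_bp(a: tuple[int, int, str], b: tuple[int, int, str]) -> int:
    """Number of positions shared by two closed, 1-based intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


comec = parse_coords("1740313..1742550 (-)")
comea = parse_coords("1742534..1743196 (-)")

# Assumed flank size: the Location window matches the gene +/- 5000 bp.
FLANK = 5000
window = (comec[0] - FLANK, comec[1] + FLANK)

print(overlap_bp(comec, comea), window)
```

With the listed coordinates, comEC/celB and comEA/celA/cilE share 17 bp, and the computed window equals the Location range 1735313..1747550.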
Sequence
Protein
Length: 745 a.a.; Molecular weight: 84157.32 Da; Isoelectric point: 8.1369
>NTDB_id=299444 DP111_RS08835 WP_043033064.1 1740313..1742550(-) (comEC/celB) [Streptococcus suis strain SH1510]
MLRSIKLPCQPIHLALLAVAAYFVVRSFSLLTMGLLSLVLAVFGVRQGKAVFLRTLPLLALCGLFFGFQKLQWNRADQSA
PEQVTTVQVIPDTIEVNGDSLSFRGRADGQTYQVFYKLASQEEQTYFQELADLVQLEVEAEVSLPAGQRNFKGFDYQAYL
KTQGIYRTVKITVIKKIVPVQSRNVFDWLSSWRRQALVYIKTNFPAPMSHYMTGLLFGELDSDFDQMSDLYSSLGIIHLF
ALSGMQVGFFIDKFRWILLRLGLTKEAVDKLQIPFSIVYAGLTGFSVSVVRSLVQKILGNLGLRKLDNFAVTVFVCFLLL
PRFLLTAGGVLTFTYAFLLTVFDFEDLGQVKKAAVESLSISLGILPVLMIYFFAFQPLSILLTFVFSFVFDVLLLPGLSV
IFLLSPIIKITWVNGFFVLMEKIIVWVADLGLRPWILGKPSVVILILLLVCLFLLYDFHRKKKWLLGLSLVLALLFFITK
HPLENEVTVVDIGQGDSIFMRDMRGRTVLIDVGGRVDFAAKEAWQERSRQANAERTLIPYLHSRGVDRIDSLVLTHTDTD
HVGDVLEVAKQVHIGRIYVSPGSLTVPDFVATLKEINVPVHVVKVGDRLPIFDSYLEVLYPDGTGDGGNNDSIVLYGRLL
KTNFLFTGDLEQGELDLIESYPNLPVDVLKAGHHGSKGSSYPEFLDHIEAKIALVSAGENNRYKHPHQETLERFDSRNIQ
VYRTDQQGAIRFRGWREWRIETVRR
Nucleotide
Length: 2238 bp
>NTDB_id=299444 DP111_RS08835 WP_043033064.1 1740313..1742550(-) (comEC/celB) [Streptococcus suis strain SH1510]
ATGTTACGGTCGATTAAGCTCCCCTGCCAGCCCATTCACCTGGCTCTTTTGGCGGTGGCAGCCTACTTTGTAGTCCGCTC
TTTTTCCCTCTTGACAATGGGCCTGCTCAGTTTGGTGTTAGCTGTCTTTGGTGTTCGGCAAGGTAAGGCAGTTTTCCTCA
GAACGCTGCCGCTTCTAGCCCTGTGTGGTCTCTTTTTCGGATTTCAAAAGCTTCAATGGAATCGAGCAGACCAGTCAGCC
CCAGAGCAAGTCACAACTGTTCAGGTCATTCCAGATACCATTGAAGTCAACGGGGACAGTCTGTCCTTTCGCGGTCGGGC
TGACGGACAAACCTATCAGGTTTTCTACAAATTAGCCAGTCAGGAGGAGCAGACTTATTTTCAAGAGCTGGCAGACCTGG
TCCAGCTAGAAGTGGAAGCAGAGGTCAGCCTGCCAGCTGGTCAGAGGAATTTCAAGGGCTTTGACTATCAGGCCTATCTG
AAAACACAGGGCATCTATCGGACAGTCAAGATTACGGTAATCAAGAAGATTGTCCCTGTCCAGTCTAGGAATGTTTTTGA
CTGGTTGTCAAGCTGGCGGAGGCAGGCCTTGGTCTATATCAAAACCAATTTTCCAGCTCCTATGAGTCACTACATGACAG
GGCTTTTGTTTGGGGAGCTGGATAGTGATTTCGACCAGATGAGTGACCTCTATTCTAGTCTAGGAATCATTCATCTCTTT
GCCCTGTCTGGTATGCAGGTTGGATTTTTCATCGATAAGTTTCGCTGGATTTTACTGCGTTTGGGCTTGACCAAGGAAGC
CGTAGATAAGTTGCAGATTCCCTTTTCAATTGTTTATGCTGGTTTGACAGGCTTTTCTGTGTCGGTGGTGCGGTCCTTGG
TCCAAAAAATTCTGGGTAATCTGGGCTTGCGGAAGCTGGATAACTTTGCGGTGACAGTTTTTGTCTGTTTCTTGTTGCTG
CCCCGATTTTTACTGACAGCAGGAGGAGTGCTGACTTTTACCTATGCTTTCTTGCTGACGGTTTTTGATTTTGAGGACTT
GGGGCAGGTCAAAAAGGCTGCGGTGGAGAGCCTCAGTATTTCGCTGGGAATTTTGCCGGTGCTGATGATCTATTTCTTTG
CCTTCCAGCCCCTGTCTATTCTCTTGACTTTTGTATTTTCCTTTGTCTTTGATGTCCTGCTCTTGCCAGGCTTGTCGGTC
ATTTTTCTCTTATCACCAATTATCAAGATTACCTGGGTCAACGGATTTTTCGTCTTGATGGAAAAAATTATTGTCTGGGT
GGCAGATTTGGGATTGCGACCTTGGATACTGGGCAAGCCGTCTGTGGTAATTCTTATTCTCTTGCTGGTCTGTCTCTTCT
TACTTTATGATTTCCACAGAAAAAAGAAATGGCTCTTAGGACTGAGCCTGGTCCTTGCTCTGCTATTTTTCATCACCAAA
CACCCGCTGGAAAACGAGGTGACGGTGGTGGACATCGGGCAGGGTGACAGCATCTTTATGCGGGATATGCGGGGGCGGAC
GGTTCTGATTGATGTGGGCGGACGAGTCGATTTCGCGGCCAAGGAAGCGTGGCAGGAGCGGTCTCGGCAGGCAAATGCAG
AGCGGACTTTGATTCCCTATCTGCATAGTCGGGGCGTGGACAGGATTGATAGTTTGGTTCTGACACACACCGACACAGAC
CATGTGGGCGATGTGCTGGAAGTGGCTAAGCAGGTCCATATTGGTCGAATTTACGTGTCGCCGGGAAGTCTGACGGTGCC
TGACTTTGTAGCAACCTTGAAAGAAATTAATGTGCCTGTCCATGTGGTCAAAGTGGGCGACCGCTTGCCGATTTTTGACT
CCTATCTGGAGGTCCTCTATCCAGACGGGACAGGAGACGGTGGCAATAATGACTCCATTGTCCTTTATGGTCGCTTGCTG
AAAACTAATTTTCTCTTTACAGGGGACTTGGAGCAGGGGGAGTTGGACTTGATAGAATCTTATCCTAACCTGCCTGTCGA
TGTCCTCAAAGCCGGTCACCACGGCTCCAAAGGTTCTTCCTATCCTGAATTTTTGGACCATATCGAAGCCAAGATTGCCT
TGGTATCTGCTGGAGAAAACAACCGCTACAAGCATCCTCACCAAGAAACCTTGGAACGATTTGATAGTCGCAATATCCAA
GTCTATCGTACAGACCAGCAAGGAGCCATTCGCTTCCGAGGTTGGAGGGAGTGGAGGATTGAAACGGTGAGGAGGTAA
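The nucleotide record is given in coding orientation (it starts with ATG and ends with TAA), so it can be translated directly and compared with the protein record. The sketch below is one way to do that with the standard library only. It assumes the standard codon assignments, which to my knowledge are shared by the bacterial translation table 11; `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 and last 12 bases of the nucleotide record above.
print(translate("ATGTTACGGTCGATTAAG"))
print(translate("GTGAGGAGGTAA"))
```

The two fragments translate to MLRSIK and VRR, matching the start and end of the protein sequence. Joining the full record into a single string and passing it to `translate` would allow a complete comparison.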
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 | 54.24 | 99.732 | 0.541 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 53.414 | 100 | 0.536 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 52.463 | 100 | 0.529 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 52.197 | 100 | 0.526 |
| comEC/celB | Streptococcus pneumoniae D39 | 52.197 | 100 | 0.526 |
| comEC/celB | Streptococcus pneumoniae R6 | 52.197 | 100 | 0.526 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 49.332 | 100 | 0.495 |