Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | SAGCMC97051_RS04755 | Genome accession | NZ_AP020310 |
| Coordinates | 858431..860668 (+) | Length | 745 a.a. |
| NCBI ID | WP_000939903.1 | Uniprot ID | A0AAV3JP21 |
| Organism | Streptococcus agalactiae strain GCMC97051 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 853431..865668
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| SAGCMC97051_RS04730 (SAGCMC97051_08570) | - | 854025..855611 (+) | 1587 | WP_000673089.1 | DEAD/DEAH box helicase | - |
| SAGCMC97051_RS04735 (SAGCMC97051_08580) | - | 855796..856062 (-) | 267 | WP_000598736.1 | GIY-YIG nuclease family protein | - |
| SAGCMC97051_RS04740 (SAGCMC97051_08590) | - | 856055..856819 (-) | 765 | WP_000567425.1 | tRNA1(Val) (adenine(37)-N6)-methyltransferase | - |
| SAGCMC97051_RS04745 (SAGCMC97051_08600) | - | 856954..857694 (+) | 741 | WP_000500220.1 | lysophospholipid acyltransferase family protein | - |
| SAGCMC97051_RS04750 (SAGCMC97051_08610) | - | 857794..858447 (+) | 654 | WP_000461744.1 | helix-hairpin-helix domain-containing protein | - |
| SAGCMC97051_RS04755 (SAGCMC97051_08620) | comEC/celB | 858431..860668 (+) | 2238 | WP_000939903.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| SAGCMC97051_RS04760 (SAGCMC97051_08630) | - | 860794..861603 (+) | 810 | WP_000153219.1 | Cof-type HAD-IIB family hydrolase | - |
| SAGCMC97051_RS04765 (SAGCMC97051_08640) | - | 861614..862558 (+) | 945 | WP_000200830.1 | LacI family DNA-binding transcriptional regulator | - |
| SAGCMC97051_RS04770 (SAGCMC97051_08650) | - | 862617..863609 (-) | 993 | WP_000800998.1 | alpha/beta hydrolase fold domain-containing protein | - |
| SAGCMC97051_RS04775 (SAGCMC97051_08660) | - | 863785..864513 (+) | 729 | WP_000468966.1 | methyltransferase domain-containing protein | - |
| SAGCMC97051_RS04780 (SAGCMC97051_08670) | holA | 864567..865604 (+) | 1038 | WP_237395053.1 | DNA polymerase III subunit delta | - |
Sequence
Protein
Download Length: 745 a.a. Molecular weight: 85440.37 Da Isoelectric Point: 9.8459
>NTDB_id=74547 SAGCMC97051_RS04755 WP_000939903.1 858431..860668(+) (comEC/celB) [Streptococcus agalactiae strain GCMC97051]
MLQLTKYFPLKPIYLALLVFQIYLLVFSWTMLGCAFLLFSFIFLIYQYDRETIFKTIAIVIFFLFYFLWQNHNMNVQYQR
VPNHISQIKVRIDTISINGDVLSFQADASGNTYQAFYTLKNKSEKDYFQNLDNNIMIIADIKLEEAEERRHFNGFDYRQY
LKRHGIYRIAKVTKIKQIRLFQHRSFFALMSKWRRSAIVISQTFPNPMRHYMSGLLFGYLDKTFDDMSDLYSSLGIIHLF
ALSGMQVGFFLGIFRYICLRIGLRLDHVWLLQIPFSLIYAGLTGFSISVVRALIQSLLSHSGVKKDENFALCLLICLISL
PHSLLTTGGVLSFAYAFILTMTSFDHFSSIKKVAIESLTVSVGILPILTYYFSGFQPISIILTALLSFAFDIIFLPLLTV
IFVLSPIVKLSCINSLFEILEVLLKWTGQLFPRPLIFGKPSLFLLIVMIIILGLLYDYYHSKCFRYCSLLIIFTLFFITK
NPITNEVAILDVGQGDSILVRDWLGKTILIDTGGRVRFEQPEEWKQKVNQSNAKRTLIPYLKSRGISKIDDLVITHTDTD
HMGDMEVISKHFKVARLITSSGSLTNSQYVKHLSKIGVAVKSIEAGDKLAVMGSYLQVLYPWHKGDGKNNDSIVLYGHLL
GKGFLFTGDLEEEGEKQLLEAYPNLSVDILKAGHHGSKGSSSLSFLKKLSPSVVLVSAGKNNRYQHPHQETLQRFQKIKS
KIFRTDQSGTIRLTGWWKWHIQTVR
MLQLTKYFPLKPIYLALLVFQIYLLVFSWTMLGCAFLLFSFIFLIYQYDRETIFKTIAIVIFFLFYFLWQNHNMNVQYQR
VPNHISQIKVRIDTISINGDVLSFQADASGNTYQAFYTLKNKSEKDYFQNLDNNIMIIADIKLEEAEERRHFNGFDYRQY
LKRHGIYRIAKVTKIKQIRLFQHRSFFALMSKWRRSAIVISQTFPNPMRHYMSGLLFGYLDKTFDDMSDLYSSLGIIHLF
ALSGMQVGFFLGIFRYICLRIGLRLDHVWLLQIPFSLIYAGLTGFSISVVRALIQSLLSHSGVKKDENFALCLLICLISL
PHSLLTTGGVLSFAYAFILTMTSFDHFSSIKKVAIESLTVSVGILPILTYYFSGFQPISIILTALLSFAFDIIFLPLLTV
IFVLSPIVKLSCINSLFEILEVLLKWTGQLFPRPLIFGKPSLFLLIVMIIILGLLYDYYHSKCFRYCSLLIIFTLFFITK
NPITNEVAILDVGQGDSILVRDWLGKTILIDTGGRVRFEQPEEWKQKVNQSNAKRTLIPYLKSRGISKIDDLVITHTDTD
HMGDMEVISKHFKVARLITSSGSLTNSQYVKHLSKIGVAVKSIEAGDKLAVMGSYLQVLYPWHKGDGKNNDSIVLYGHLL
GKGFLFTGDLEEEGEKQLLEAYPNLSVDILKAGHHGSKGSSSLSFLKKLSPSVVLVSAGKNNRYQHPHQETLQRFQKIKS
KIFRTDQSGTIRLTGWWKWHIQTVR
Nucleotide
Download Length: 2238 bp
>NTDB_id=74547 SAGCMC97051_RS04755 WP_000939903.1 858431..860668(+) (comEC/celB) [Streptococcus agalactiae strain GCMC97051]
ATGTTACAATTGACTAAGTATTTTCCTCTAAAACCTATTTATTTAGCATTGTTGGTCTTCCAAATTTACTTACTAGTGTT
TTCTTGGACAATGCTTGGTTGTGCCTTTCTTTTATTTTCTTTTATTTTTCTGATTTATCAATATGATCGTGAAACTATTT
TTAAAACAATAGCAATAGTAATTTTTTTCTTATTTTATTTTTTATGGCAAAATCACAATATGAATGTCCAATATCAAAGA
GTACCGAATCATATTAGCCAGATTAAAGTGCGTATTGATACTATTTCTATCAATGGTGATGTTTTATCATTCCAGGCAGA
TGCTTCAGGTAACACTTATCAAGCTTTTTACACATTAAAAAATAAAAGTGAGAAAGATTATTTTCAAAATCTTGATAATA
ATATAATGATAATTGCAGATATCAAACTTGAAGAAGCAGAGGAGAGAAGGCATTTTAATGGCTTTGATTATCGTCAGTAT
TTAAAAAGACATGGAATTTATCGTATCGCCAAAGTGACAAAGATAAAACAGATACGCTTATTTCAACATAGGTCTTTCTT
TGCTCTTATGTCTAAGTGGCGTAGAAGTGCAATTGTTATTAGTCAAACTTTTCCAAATCCTATGCGTCACTATATGTCAG
GGCTTTTGTTTGGATATCTAGATAAGACCTTTGATGACATGTCCGATTTATATAGTAGTCTAGGTATTATACATTTATTT
GCTTTGTCAGGTATGCAAGTAGGTTTTTTTCTCGGTATTTTTCGTTATATCTGTCTACGTATTGGCTTACGTCTAGACCA
TGTTTGGTTACTTCAAATACCATTCTCGCTAATTTATGCTGGTTTAACAGGCTTTAGTATCTCAGTCGTTAGGGCACTTA
TTCAATCTTTATTATCACATAGCGGTGTCAAGAAAGATGAGAACTTTGCTCTCTGCTTGTTAATTTGTCTTATCTCCCTC
CCCCACTCACTTTTGACTACGGGAGGAGTTCTTAGCTTTGCTTATGCTTTTATACTTACGATGACCTCCTTTGATCATTT
TTCGAGTATAAAAAAAGTAGCTATCGAATCTTTGACAGTCTCTGTAGGAATTCTTCCCATACTAACCTACTATTTTTCGG
GTTTTCAACCAATATCAATTATATTAACAGCACTTTTATCTTTTGCATTTGATATTATATTTTTGCCTTTATTAACTGTT
ATATTTGTCTTATCGCCTATCGTTAAATTAAGTTGTATTAATAGTTTGTTTGAAATCCTAGAAGTGTTATTAAAATGGAC
TGGGCAACTGTTTCCAAGGCCACTTATTTTTGGAAAGCCCAGCCTTTTTCTTTTAATAGTCATGATTATAATTTTGGGAT
TACTTTATGATTATTATCATTCTAAATGTTTTCGTTATTGCTCCCTTCTTATTATCTTTACCTTGTTTTTTATCACTAAG
AATCCAATTACTAACGAGGTTGCGATTTTAGATGTTGGACAGGGAGATAGTATTTTAGTGAGGGATTGGTTAGGAAAAAC
AATTTTAATTGATACTGGGGGAAGGGTGAGATTTGAACAGCCTGAAGAATGGAAACAAAAAGTAAATCAGTCTAATGCTA
AGAGAACGCTCATTCCTTACTTGAAAAGCAGAGGTATTAGCAAGATAGATGATTTAGTGATAACTCATACCGATACAGAT
CATATGGGGGATATGGAAGTTATCTCAAAGCATTTTAAAGTTGCACGTTTGATTACAAGTTCAGGTTCTTTAACGAATTC
GCAGTACGTTAAGCATTTATCAAAGATAGGTGTAGCGGTAAAATCTATAGAAGCCGGTGATAAACTTGCTGTCATGGGAA
GTTATTTACAAGTACTTTACCCATGGCACAAGGGTGATGGAAAAAATAATGATTCAATTGTTTTATATGGACATTTATTA
GGAAAAGGCTTCTTATTTACCGGTGATTTGGAGGAAGAGGGAGAAAAGCAGTTATTAGAAGCTTATCCTAATTTATCAGT
AGATATCCTTAAAGCAGGACATCATGGTTCTAAGGGCTCATCAAGTCTATCCTTTCTGAAAAAGTTGTCTCCTAGTGTGG
TTCTAGTTTCAGCTGGTAAAAATAATCGTTACCAGCATCCTCATCAAGAGACTTTACAAAGGTTCCAAAAGATTAAAAGC
AAGATTTTCCGAACGGATCAATCAGGTACAATTAGGCTAACAGGATGGTGGAAGTGGCATATTCAGACAGTTCGTTGA
ATGTTACAATTGACTAAGTATTTTCCTCTAAAACCTATTTATTTAGCATTGTTGGTCTTCCAAATTTACTTACTAGTGTT
TTCTTGGACAATGCTTGGTTGTGCCTTTCTTTTATTTTCTTTTATTTTTCTGATTTATCAATATGATCGTGAAACTATTT
TTAAAACAATAGCAATAGTAATTTTTTTCTTATTTTATTTTTTATGGCAAAATCACAATATGAATGTCCAATATCAAAGA
GTACCGAATCATATTAGCCAGATTAAAGTGCGTATTGATACTATTTCTATCAATGGTGATGTTTTATCATTCCAGGCAGA
TGCTTCAGGTAACACTTATCAAGCTTTTTACACATTAAAAAATAAAAGTGAGAAAGATTATTTTCAAAATCTTGATAATA
ATATAATGATAATTGCAGATATCAAACTTGAAGAAGCAGAGGAGAGAAGGCATTTTAATGGCTTTGATTATCGTCAGTAT
TTAAAAAGACATGGAATTTATCGTATCGCCAAAGTGACAAAGATAAAACAGATACGCTTATTTCAACATAGGTCTTTCTT
TGCTCTTATGTCTAAGTGGCGTAGAAGTGCAATTGTTATTAGTCAAACTTTTCCAAATCCTATGCGTCACTATATGTCAG
GGCTTTTGTTTGGATATCTAGATAAGACCTTTGATGACATGTCCGATTTATATAGTAGTCTAGGTATTATACATTTATTT
GCTTTGTCAGGTATGCAAGTAGGTTTTTTTCTCGGTATTTTTCGTTATATCTGTCTACGTATTGGCTTACGTCTAGACCA
TGTTTGGTTACTTCAAATACCATTCTCGCTAATTTATGCTGGTTTAACAGGCTTTAGTATCTCAGTCGTTAGGGCACTTA
TTCAATCTTTATTATCACATAGCGGTGTCAAGAAAGATGAGAACTTTGCTCTCTGCTTGTTAATTTGTCTTATCTCCCTC
CCCCACTCACTTTTGACTACGGGAGGAGTTCTTAGCTTTGCTTATGCTTTTATACTTACGATGACCTCCTTTGATCATTT
TTCGAGTATAAAAAAAGTAGCTATCGAATCTTTGACAGTCTCTGTAGGAATTCTTCCCATACTAACCTACTATTTTTCGG
GTTTTCAACCAATATCAATTATATTAACAGCACTTTTATCTTTTGCATTTGATATTATATTTTTGCCTTTATTAACTGTT
ATATTTGTCTTATCGCCTATCGTTAAATTAAGTTGTATTAATAGTTTGTTTGAAATCCTAGAAGTGTTATTAAAATGGAC
TGGGCAACTGTTTCCAAGGCCACTTATTTTTGGAAAGCCCAGCCTTTTTCTTTTAATAGTCATGATTATAATTTTGGGAT
TACTTTATGATTATTATCATTCTAAATGTTTTCGTTATTGCTCCCTTCTTATTATCTTTACCTTGTTTTTTATCACTAAG
AATCCAATTACTAACGAGGTTGCGATTTTAGATGTTGGACAGGGAGATAGTATTTTAGTGAGGGATTGGTTAGGAAAAAC
AATTTTAATTGATACTGGGGGAAGGGTGAGATTTGAACAGCCTGAAGAATGGAAACAAAAAGTAAATCAGTCTAATGCTA
AGAGAACGCTCATTCCTTACTTGAAAAGCAGAGGTATTAGCAAGATAGATGATTTAGTGATAACTCATACCGATACAGAT
CATATGGGGGATATGGAAGTTATCTCAAAGCATTTTAAAGTTGCACGTTTGATTACAAGTTCAGGTTCTTTAACGAATTC
GCAGTACGTTAAGCATTTATCAAAGATAGGTGTAGCGGTAAAATCTATAGAAGCCGGTGATAAACTTGCTGTCATGGGAA
GTTATTTACAAGTACTTTACCCATGGCACAAGGGTGATGGAAAAAATAATGATTCAATTGTTTTATATGGACATTTATTA
GGAAAAGGCTTCTTATTTACCGGTGATTTGGAGGAAGAGGGAGAAAAGCAGTTATTAGAAGCTTATCCTAATTTATCAGT
AGATATCCTTAAAGCAGGACATCATGGTTCTAAGGGCTCATCAAGTCTATCCTTTCTGAAAAAGTTGTCTCCTAGTGTGG
TTCTAGTTTCAGCTGGTAAAAATAATCGTTACCAGCATCCTCATCAAGAGACTTTACAAAGGTTCCAAAAGATTAAAAGC
AAGATTTTCCGAACGGATCAATCAGGTACAATTAGGCTAACAGGATGGTGGAAGTGGCATATTCAGACAGTTCGTTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis NCTC 12261 |
48.4 |
100 |
0.487 |
| comEC/celB | Streptococcus mitis SK321 |
48.529 |
100 |
0.487 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
47.059 |
100 |
0.472 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
46.586 |
100 |
0.467 |
| comEC/celB | Streptococcus pneumoniae D39 |
46.586 |
100 |
0.467 |
| comEC/celB | Streptococcus pneumoniae R6 |
46.586 |
100 |
0.467 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
43.338 |
99.732 |
0.432 |