Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | AGP54_RS04090 | Genome accession | NZ_CP038809 |
| Coordinates | 759478..761715 (+) | Length | 745 a.a. |
| NCBI ID | WP_000939903.1 | Uniprot ID | A0AAV3JP21 |
| Organism | Streptococcus agalactiae strain SS1168 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 754478..766715
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| AGP54_RS04065 (AGP54_04195) | - | 755072..756658 (+) | 1587 | WP_000673089.1 | DEAD/DEAH box helicase | - |
| AGP54_RS04070 (AGP54_04200) | - | 756843..757109 (-) | 267 | WP_047199681.1 | GIY-YIG nuclease family protein | - |
| AGP54_RS04075 (AGP54_04205) | - | 757102..757866 (-) | 765 | WP_000567425.1 | tRNA1(Val) (adenine(37)-N6)-methyltransferase | - |
| AGP54_RS04080 (AGP54_04210) | - | 758001..758741 (+) | 741 | WP_000500220.1 | 1-acyl-sn-glycerol-3-phosphate acyltransferase | - |
| AGP54_RS04085 (AGP54_04215) | - | 758841..759494 (+) | 654 | WP_000461744.1 | helix-hairpin-helix domain-containing protein | - |
| AGP54_RS04090 (AGP54_04220) | comEC/celB | 759478..761715 (+) | 2238 | WP_000939903.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| AGP54_RS04095 (AGP54_04225) | - | 761841..762650 (+) | 810 | WP_000153219.1 | Cof-type HAD-IIB family hydrolase | - |
| AGP54_RS04100 (AGP54_04230) | - | 762661..763605 (+) | 945 | WP_000200830.1 | LacI family DNA-binding transcriptional regulator | - |
| AGP54_RS04105 (AGP54_04235) | - | 763664..764656 (-) | 993 | WP_000800998.1 | alpha/beta hydrolase fold domain-containing protein | - |
| AGP54_RS04110 (AGP54_04240) | - | 764832..765560 (+) | 729 | WP_000468966.1 | methyltransferase domain-containing protein | - |
| AGP54_RS04115 (AGP54_04245) | holA | 765614..766651 (+) | 1038 | WP_000560292.1 | DNA polymerase III subunit delta | - |
Sequence
Protein
Download Length: 745 a.a. Molecular weight: 85440.37 Da Isoelectric Point: 9.8459
>NTDB_id=355668 AGP54_RS04090 WP_000939903.1 759478..761715(+) (comEC/celB) [Streptococcus agalactiae strain SS1168]
MLQLTKYFPLKPIYLALLVFQIYLLVFSWTMLGCAFLLFSFIFLIYQYDRETIFKTIAIVIFFLFYFLWQNHNMNVQYQR
VPNHISQIKVRIDTISINGDVLSFQADASGNTYQAFYTLKNKSEKDYFQNLDNNIMIIADIKLEEAEERRHFNGFDYRQY
LKRHGIYRIAKVTKIKQIRLFQHRSFFALMSKWRRSAIVISQTFPNPMRHYMSGLLFGYLDKTFDDMSDLYSSLGIIHLF
ALSGMQVGFFLGIFRYICLRIGLRLDHVWLLQIPFSLIYAGLTGFSISVVRALIQSLLSHSGVKKDENFALCLLICLISL
PHSLLTTGGVLSFAYAFILTMTSFDHFSSIKKVAIESLTVSVGILPILTYYFSGFQPISIILTALLSFAFDIIFLPLLTV
IFVLSPIVKLSCINSLFEILEVLLKWTGQLFPRPLIFGKPSLFLLIVMIIILGLLYDYYHSKCFRYCSLLIIFTLFFITK
NPITNEVAILDVGQGDSILVRDWLGKTILIDTGGRVRFEQPEEWKQKVNQSNAKRTLIPYLKSRGISKIDDLVITHTDTD
HMGDMEVISKHFKVARLITSSGSLTNSQYVKHLSKIGVAVKSIEAGDKLAVMGSYLQVLYPWHKGDGKNNDSIVLYGHLL
GKGFLFTGDLEEEGEKQLLEAYPNLSVDILKAGHHGSKGSSSLSFLKKLSPSVVLVSAGKNNRYQHPHQETLQRFQKIKS
KIFRTDQSGTIRLTGWWKWHIQTVR
MLQLTKYFPLKPIYLALLVFQIYLLVFSWTMLGCAFLLFSFIFLIYQYDRETIFKTIAIVIFFLFYFLWQNHNMNVQYQR
VPNHISQIKVRIDTISINGDVLSFQADASGNTYQAFYTLKNKSEKDYFQNLDNNIMIIADIKLEEAEERRHFNGFDYRQY
LKRHGIYRIAKVTKIKQIRLFQHRSFFALMSKWRRSAIVISQTFPNPMRHYMSGLLFGYLDKTFDDMSDLYSSLGIIHLF
ALSGMQVGFFLGIFRYICLRIGLRLDHVWLLQIPFSLIYAGLTGFSISVVRALIQSLLSHSGVKKDENFALCLLICLISL
PHSLLTTGGVLSFAYAFILTMTSFDHFSSIKKVAIESLTVSVGILPILTYYFSGFQPISIILTALLSFAFDIIFLPLLTV
IFVLSPIVKLSCINSLFEILEVLLKWTGQLFPRPLIFGKPSLFLLIVMIIILGLLYDYYHSKCFRYCSLLIIFTLFFITK
NPITNEVAILDVGQGDSILVRDWLGKTILIDTGGRVRFEQPEEWKQKVNQSNAKRTLIPYLKSRGISKIDDLVITHTDTD
HMGDMEVISKHFKVARLITSSGSLTNSQYVKHLSKIGVAVKSIEAGDKLAVMGSYLQVLYPWHKGDGKNNDSIVLYGHLL
GKGFLFTGDLEEEGEKQLLEAYPNLSVDILKAGHHGSKGSSSLSFLKKLSPSVVLVSAGKNNRYQHPHQETLQRFQKIKS
KIFRTDQSGTIRLTGWWKWHIQTVR
Nucleotide
Download Length: 2238 bp
>NTDB_id=355668 AGP54_RS04090 WP_000939903.1 759478..761715(+) (comEC/celB) [Streptococcus agalactiae strain SS1168]
ATGTTACAATTGACTAAGTATTTTCCTCTAAAACCTATTTATTTAGCATTGTTGGTCTTCCAAATTTACTTACTAGTGTT
TTCTTGGACAATGCTTGGTTGTGCCTTTCTTTTATTTTCTTTTATTTTTCTGATTTATCAATATGATCGTGAAACTATTT
TTAAAACAATAGCAATAGTAATTTTTTTCTTATTTTATTTTTTATGGCAAAATCACAATATGAATGTCCAATATCAAAGA
GTACCGAATCATATTAGCCAGATTAAAGTGCGTATTGATACTATTTCTATCAATGGTGATGTTTTATCATTCCAGGCAGA
TGCTTCAGGTAACACTTATCAAGCTTTTTACACATTAAAAAATAAAAGTGAGAAAGATTATTTTCAAAATCTTGATAATA
ATATAATGATAATTGCAGATATCAAACTTGAAGAAGCAGAGGAGAGAAGGCATTTTAATGGCTTTGATTATCGTCAGTAT
TTAAAAAGACATGGAATTTATCGTATCGCCAAAGTGACAAAGATAAAACAGATACGCTTATTTCAACATAGGTCTTTCTT
TGCTCTTATGTCTAAGTGGCGTAGAAGTGCAATTGTTATTAGTCAAACTTTTCCAAATCCTATGCGTCACTATATGTCAG
GGCTTTTGTTTGGATATCTAGATAAGACCTTTGATGACATGTCCGATTTATATAGTAGTCTAGGTATTATACATTTATTT
GCTTTGTCAGGTATGCAAGTAGGTTTTTTTCTCGGTATTTTTCGTTATATCTGTCTACGTATTGGCTTACGTCTAGACCA
TGTTTGGTTACTTCAAATACCATTCTCGCTAATTTATGCTGGTTTAACAGGCTTTAGTATCTCAGTCGTTAGGGCACTTA
TTCAATCTTTATTATCACATAGCGGTGTCAAGAAAGATGAGAACTTTGCTCTCTGCTTGTTAATTTGTCTTATCTCCCTC
CCCCACTCACTTTTGACTACGGGAGGAGTTCTTAGCTTTGCTTATGCTTTTATACTTACGATGACCTCCTTTGATCATTT
TTCGAGTATAAAAAAAGTAGCTATCGAATCTTTGACAGTCTCTGTAGGAATTCTTCCCATACTAACCTACTATTTTTCGG
GTTTTCAACCAATATCAATTATATTAACAGCACTTTTATCTTTTGCATTTGATATTATATTTTTGCCTTTATTAACTGTT
ATATTTGTCTTATCGCCTATCGTTAAATTAAGTTGTATTAATAGTTTGTTTGAAATCCTAGAAGTGTTATTAAAATGGAC
TGGGCAACTGTTTCCAAGGCCACTTATTTTTGGAAAGCCCAGCCTTTTTCTTTTAATAGTCATGATTATAATTTTGGGAT
TACTTTATGATTATTATCATTCTAAATGTTTTCGTTATTGCTCCCTTCTTATTATCTTTACCTTGTTTTTTATCACTAAG
AATCCAATTACTAACGAGGTTGCGATTTTAGATGTTGGACAGGGAGATAGTATTTTAGTGAGGGATTGGTTAGGAAAAAC
AATTTTAATTGATACTGGGGGAAGGGTGAGATTTGAACAGCCTGAAGAATGGAAACAAAAAGTAAATCAGTCTAATGCTA
AGAGAACGCTCATTCCTTACTTGAAAAGCAGAGGTATTAGCAAGATAGATGATTTAGTGATAACTCATACCGATACAGAT
CATATGGGGGATATGGAAGTTATCTCAAAGCATTTTAAAGTTGCACGTTTGATTACAAGTTCAGGTTCTTTAACGAATTC
GCAGTACGTTAAGCATTTATCAAAGATAGGTGTAGCGGTAAAATCTATAGAAGCCGGTGATAAACTTGCTGTCATGGGAA
GTTATTTACAAGTACTTTACCCATGGCACAAGGGTGATGGAAAAAATAATGATTCAATTGTTTTATATGGACATTTATTA
GGAAAAGGCTTCTTATTTACCGGTGATTTGGAGGAAGAGGGAGAAAAGCAGTTATTAGAAGCTTATCCTAATTTATCAGT
AGATATCCTTAAAGCAGGACATCATGGTTCTAAGGGCTCATCAAGTCTATCCTTTCTGAAAAAGTTGTCTCCTAGTGTGG
TTCTAGTTTCAGCTGGTAAAAATAATCGTTACCAGCATCCTCATCAAGAGACTTTACAAAGGTTCCAAAAGATTAAAAGC
AAGATTTTCCGAACGGATCAATCAGGTACAATTAGGCTAACAGGATGGTGGAAGTGGCATATTCAGACAGTTCGTTGA
ATGTTACAATTGACTAAGTATTTTCCTCTAAAACCTATTTATTTAGCATTGTTGGTCTTCCAAATTTACTTACTAGTGTT
TTCTTGGACAATGCTTGGTTGTGCCTTTCTTTTATTTTCTTTTATTTTTCTGATTTATCAATATGATCGTGAAACTATTT
TTAAAACAATAGCAATAGTAATTTTTTTCTTATTTTATTTTTTATGGCAAAATCACAATATGAATGTCCAATATCAAAGA
GTACCGAATCATATTAGCCAGATTAAAGTGCGTATTGATACTATTTCTATCAATGGTGATGTTTTATCATTCCAGGCAGA
TGCTTCAGGTAACACTTATCAAGCTTTTTACACATTAAAAAATAAAAGTGAGAAAGATTATTTTCAAAATCTTGATAATA
ATATAATGATAATTGCAGATATCAAACTTGAAGAAGCAGAGGAGAGAAGGCATTTTAATGGCTTTGATTATCGTCAGTAT
TTAAAAAGACATGGAATTTATCGTATCGCCAAAGTGACAAAGATAAAACAGATACGCTTATTTCAACATAGGTCTTTCTT
TGCTCTTATGTCTAAGTGGCGTAGAAGTGCAATTGTTATTAGTCAAACTTTTCCAAATCCTATGCGTCACTATATGTCAG
GGCTTTTGTTTGGATATCTAGATAAGACCTTTGATGACATGTCCGATTTATATAGTAGTCTAGGTATTATACATTTATTT
GCTTTGTCAGGTATGCAAGTAGGTTTTTTTCTCGGTATTTTTCGTTATATCTGTCTACGTATTGGCTTACGTCTAGACCA
TGTTTGGTTACTTCAAATACCATTCTCGCTAATTTATGCTGGTTTAACAGGCTTTAGTATCTCAGTCGTTAGGGCACTTA
TTCAATCTTTATTATCACATAGCGGTGTCAAGAAAGATGAGAACTTTGCTCTCTGCTTGTTAATTTGTCTTATCTCCCTC
CCCCACTCACTTTTGACTACGGGAGGAGTTCTTAGCTTTGCTTATGCTTTTATACTTACGATGACCTCCTTTGATCATTT
TTCGAGTATAAAAAAAGTAGCTATCGAATCTTTGACAGTCTCTGTAGGAATTCTTCCCATACTAACCTACTATTTTTCGG
GTTTTCAACCAATATCAATTATATTAACAGCACTTTTATCTTTTGCATTTGATATTATATTTTTGCCTTTATTAACTGTT
ATATTTGTCTTATCGCCTATCGTTAAATTAAGTTGTATTAATAGTTTGTTTGAAATCCTAGAAGTGTTATTAAAATGGAC
TGGGCAACTGTTTCCAAGGCCACTTATTTTTGGAAAGCCCAGCCTTTTTCTTTTAATAGTCATGATTATAATTTTGGGAT
TACTTTATGATTATTATCATTCTAAATGTTTTCGTTATTGCTCCCTTCTTATTATCTTTACCTTGTTTTTTATCACTAAG
AATCCAATTACTAACGAGGTTGCGATTTTAGATGTTGGACAGGGAGATAGTATTTTAGTGAGGGATTGGTTAGGAAAAAC
AATTTTAATTGATACTGGGGGAAGGGTGAGATTTGAACAGCCTGAAGAATGGAAACAAAAAGTAAATCAGTCTAATGCTA
AGAGAACGCTCATTCCTTACTTGAAAAGCAGAGGTATTAGCAAGATAGATGATTTAGTGATAACTCATACCGATACAGAT
CATATGGGGGATATGGAAGTTATCTCAAAGCATTTTAAAGTTGCACGTTTGATTACAAGTTCAGGTTCTTTAACGAATTC
GCAGTACGTTAAGCATTTATCAAAGATAGGTGTAGCGGTAAAATCTATAGAAGCCGGTGATAAACTTGCTGTCATGGGAA
GTTATTTACAAGTACTTTACCCATGGCACAAGGGTGATGGAAAAAATAATGATTCAATTGTTTTATATGGACATTTATTA
GGAAAAGGCTTCTTATTTACCGGTGATTTGGAGGAAGAGGGAGAAAAGCAGTTATTAGAAGCTTATCCTAATTTATCAGT
AGATATCCTTAAAGCAGGACATCATGGTTCTAAGGGCTCATCAAGTCTATCCTTTCTGAAAAAGTTGTCTCCTAGTGTGG
TTCTAGTTTCAGCTGGTAAAAATAATCGTTACCAGCATCCTCATCAAGAGACTTTACAAAGGTTCCAAAAGATTAAAAGC
AAGATTTTCCGAACGGATCAATCAGGTACAATTAGGCTAACAGGATGGTGGAAGTGGCATATTCAGACAGTTCGTTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis NCTC 12261 |
48.4 |
100 |
0.487 |
| comEC/celB | Streptococcus mitis SK321 |
48.529 |
100 |
0.487 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
47.059 |
100 |
0.472 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
46.586 |
100 |
0.467 |
| comEC/celB | Streptococcus pneumoniae D39 |
46.586 |
100 |
0.467 |
| comEC/celB | Streptococcus pneumoniae R6 |
46.586 |
100 |
0.467 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
43.338 |
99.732 |
0.432 |