Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | HQ616_RS04370 | Genome accession | NZ_CP053891 |
| Coordinates | 815318..817555 (+) | Length | 745 a.a. |
| NCBI ID | WP_000939903.1 | Uniprot ID | A0AAV3JP21 |
| Organism | Streptococcus agalactiae strain GBS20 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 810318..822555
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| HQ616_RS04345 (HQ616_04320) | - | 810912..812498 (+) | 1587 | WP_000673089.1 | DEAD/DEAH box helicase | - |
| HQ616_RS04350 (HQ616_04325) | - | 812683..812949 (-) | 267 | WP_000598736.1 | GIY-YIG nuclease family protein | - |
| HQ616_RS04355 (HQ616_04330) | - | 812942..813706 (-) | 765 | WP_000567425.1 | tRNA1(Val) (adenine(37)-N6)-methyltransferase | - |
| HQ616_RS04360 (HQ616_04335) | - | 813841..814581 (+) | 741 | WP_000500220.1 | 1-acyl-sn-glycerol-3-phosphate acyltransferase | - |
| HQ616_RS04365 (HQ616_04340) | - | 814681..815334 (+) | 654 | WP_000461744.1 | helix-hairpin-helix domain-containing protein | - |
| HQ616_RS04370 (HQ616_04345) | comEC/celB | 815318..817555 (+) | 2238 | WP_000939903.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| HQ616_RS04375 (HQ616_04350) | - | 817681..818490 (+) | 810 | WP_000153219.1 | Cof-type HAD-IIB family hydrolase | - |
| HQ616_RS04380 (HQ616_04355) | - | 818501..819445 (+) | 945 | WP_000200822.1 | LacI family DNA-binding transcriptional regulator | - |
| HQ616_RS04385 (HQ616_04360) | - | 819504..820496 (-) | 993 | WP_000800998.1 | alpha/beta hydrolase fold domain-containing protein | - |
| HQ616_RS04390 (HQ616_04365) | - | 820672..821400 (+) | 729 | WP_000468966.1 | methyltransferase domain-containing protein | - |
| HQ616_RS04395 (HQ616_04370) | holA | 821454..822491 (+) | 1038 | WP_000560292.1 | DNA polymerase III subunit delta | - |
Sequence
Protein
Download Length: 745 a.a. Molecular weight: 85440.37 Da Isoelectric Point: 9.8459
>NTDB_id=448750 HQ616_RS04370 WP_000939903.1 815318..817555(+) (comEC/celB) [Streptococcus agalactiae strain GBS20]
MLQLTKYFPLKPIYLALLVFQIYLLVFSWTMLGCAFLLFSFIFLIYQYDRETIFKTIAIVIFFLFYFLWQNHNMNVQYQR
VPNHISQIKVRIDTISINGDVLSFQADASGNTYQAFYTLKNKSEKDYFQNLDNNIMIIADIKLEEAEERRHFNGFDYRQY
LKRHGIYRIAKVTKIKQIRLFQHRSFFALMSKWRRSAIVISQTFPNPMRHYMSGLLFGYLDKTFDDMSDLYSSLGIIHLF
ALSGMQVGFFLGIFRYICLRIGLRLDHVWLLQIPFSLIYAGLTGFSISVVRALIQSLLSHSGVKKDENFALCLLICLISL
PHSLLTTGGVLSFAYAFILTMTSFDHFSSIKKVAIESLTVSVGILPILTYYFSGFQPISIILTALLSFAFDIIFLPLLTV
IFVLSPIVKLSCINSLFEILEVLLKWTGQLFPRPLIFGKPSLFLLIVMIIILGLLYDYYHSKCFRYCSLLIIFTLFFITK
NPITNEVAILDVGQGDSILVRDWLGKTILIDTGGRVRFEQPEEWKQKVNQSNAKRTLIPYLKSRGISKIDDLVITHTDTD
HMGDMEVISKHFKVARLITSSGSLTNSQYVKHLSKIGVAVKSIEAGDKLAVMGSYLQVLYPWHKGDGKNNDSIVLYGHLL
GKGFLFTGDLEEEGEKQLLEAYPNLSVDILKAGHHGSKGSSSLSFLKKLSPSVVLVSAGKNNRYQHPHQETLQRFQKIKS
KIFRTDQSGTIRLTGWWKWHIQTVR
MLQLTKYFPLKPIYLALLVFQIYLLVFSWTMLGCAFLLFSFIFLIYQYDRETIFKTIAIVIFFLFYFLWQNHNMNVQYQR
VPNHISQIKVRIDTISINGDVLSFQADASGNTYQAFYTLKNKSEKDYFQNLDNNIMIIADIKLEEAEERRHFNGFDYRQY
LKRHGIYRIAKVTKIKQIRLFQHRSFFALMSKWRRSAIVISQTFPNPMRHYMSGLLFGYLDKTFDDMSDLYSSLGIIHLF
ALSGMQVGFFLGIFRYICLRIGLRLDHVWLLQIPFSLIYAGLTGFSISVVRALIQSLLSHSGVKKDENFALCLLICLISL
PHSLLTTGGVLSFAYAFILTMTSFDHFSSIKKVAIESLTVSVGILPILTYYFSGFQPISIILTALLSFAFDIIFLPLLTV
IFVLSPIVKLSCINSLFEILEVLLKWTGQLFPRPLIFGKPSLFLLIVMIIILGLLYDYYHSKCFRYCSLLIIFTLFFITK
NPITNEVAILDVGQGDSILVRDWLGKTILIDTGGRVRFEQPEEWKQKVNQSNAKRTLIPYLKSRGISKIDDLVITHTDTD
HMGDMEVISKHFKVARLITSSGSLTNSQYVKHLSKIGVAVKSIEAGDKLAVMGSYLQVLYPWHKGDGKNNDSIVLYGHLL
GKGFLFTGDLEEEGEKQLLEAYPNLSVDILKAGHHGSKGSSSLSFLKKLSPSVVLVSAGKNNRYQHPHQETLQRFQKIKS
KIFRTDQSGTIRLTGWWKWHIQTVR
Nucleotide
Download Length: 2238 bp
>NTDB_id=448750 HQ616_RS04370 WP_000939903.1 815318..817555(+) (comEC/celB) [Streptococcus agalactiae strain GBS20]
ATGTTACAATTGACTAAGTATTTTCCTCTAAAACCTATTTATTTAGCATTGTTGGTCTTCCAAATTTACTTACTAGTGTT
TTCTTGGACAATGCTTGGTTGTGCCTTTCTTTTATTTTCTTTTATTTTTCTGATTTATCAATATGATCGTGAAACTATTT
TTAAAACAATAGCAATAGTAATTTTTTTCTTATTTTATTTTTTATGGCAAAATCACAATATGAATGTCCAATATCAAAGA
GTACCGAATCATATTAGCCAGATTAAAGTGCGTATTGATACTATTTCTATCAATGGTGATGTTTTATCATTCCAGGCAGA
TGCTTCAGGTAACACTTATCAAGCTTTTTACACATTAAAAAATAAAAGTGAGAAAGATTATTTTCAAAATCTTGATAATA
ATATAATGATAATTGCAGATATCAAACTTGAAGAAGCAGAGGAGAGAAGGCATTTTAATGGCTTTGATTATCGTCAGTAT
TTAAAAAGACATGGAATTTATCGTATCGCCAAAGTGACAAAGATAAAACAGATACGCTTATTTCAACATAGGTCTTTCTT
TGCTCTTATGTCTAAGTGGCGTAGAAGTGCAATTGTTATTAGTCAAACTTTTCCAAATCCTATGCGTCACTATATGTCAG
GGCTTTTGTTTGGATATCTAGATAAGACCTTTGATGACATGTCCGATTTATATAGTAGTCTAGGTATTATACATTTATTT
GCTTTGTCAGGTATGCAAGTAGGTTTTTTTCTCGGTATTTTTCGTTATATCTGTCTACGTATTGGCTTACGTCTAGACCA
TGTTTGGTTACTTCAAATACCATTCTCGCTAATTTATGCTGGTTTAACAGGCTTTAGTATCTCAGTCGTTAGGGCACTTA
TTCAATCTTTATTATCACATAGCGGTGTCAAGAAAGATGAGAACTTTGCTCTCTGCTTGTTAATTTGTCTTATCTCCCTC
CCCCACTCACTTTTGACTACGGGAGGAGTTCTTAGCTTTGCTTATGCTTTTATACTTACGATGACCTCCTTTGATCATTT
TTCGAGTATAAAAAAAGTAGCTATCGAATCTTTGACAGTCTCTGTAGGAATTCTTCCCATACTAACCTACTATTTTTCGG
GTTTTCAACCAATATCAATTATATTAACAGCACTTTTATCTTTTGCATTTGATATTATATTTTTGCCTTTATTAACTGTT
ATATTTGTCTTATCGCCTATCGTTAAATTAAGTTGTATTAATAGTTTGTTTGAAATCCTAGAAGTGTTATTAAAATGGAC
TGGGCAACTGTTTCCAAGGCCACTTATTTTTGGAAAGCCCAGCCTTTTTCTTTTAATAGTCATGATTATAATTTTGGGAT
TACTTTATGATTATTATCATTCTAAATGTTTTCGTTATTGCTCCCTTCTTATTATCTTTACCTTGTTTTTTATCACTAAG
AATCCAATTACTAACGAGGTTGCGATTTTAGATGTTGGACAGGGAGATAGTATTTTAGTGAGGGATTGGTTAGGAAAAAC
AATTTTAATTGATACTGGGGGAAGGGTGAGATTTGAACAGCCTGAAGAATGGAAACAAAAAGTAAATCAGTCTAATGCTA
AGAGAACGCTCATTCCTTACTTGAAAAGCAGAGGTATTAGCAAGATAGATGATTTAGTGATAACTCATACCGATACAGAT
CATATGGGGGATATGGAAGTTATCTCAAAGCATTTTAAAGTTGCACGTTTGATTACAAGTTCAGGTTCTTTAACGAATTC
GCAGTACGTTAAGCATTTATCAAAGATAGGTGTAGCGGTAAAATCTATAGAAGCCGGTGATAAACTTGCTGTCATGGGAA
GTTATTTACAAGTACTTTACCCATGGCACAAGGGTGATGGAAAAAATAATGATTCAATTGTTTTATATGGACATTTATTA
GGAAAAGGCTTCTTATTTACCGGTGATTTGGAGGAAGAGGGAGAAAAGCAGTTATTAGAAGCTTATCCTAATTTATCAGT
AGATATCCTTAAAGCAGGACATCATGGTTCTAAGGGCTCATCAAGTCTATCCTTTCTGAAAAAGTTGTCTCCTAGTGTGG
TTCTAGTTTCAGCTGGTAAAAATAATCGTTACCAGCATCCTCATCAAGAGACTTTACAAAGGTTCCAAAAGATTAAAAGC
AAGATTTTCCGAACGGATCAATCAGGTACAATTAGGCTAACAGGATGGTGGAAGTGGCATATTCAGACAGTTCGTTGA
ATGTTACAATTGACTAAGTATTTTCCTCTAAAACCTATTTATTTAGCATTGTTGGTCTTCCAAATTTACTTACTAGTGTT
TTCTTGGACAATGCTTGGTTGTGCCTTTCTTTTATTTTCTTTTATTTTTCTGATTTATCAATATGATCGTGAAACTATTT
TTAAAACAATAGCAATAGTAATTTTTTTCTTATTTTATTTTTTATGGCAAAATCACAATATGAATGTCCAATATCAAAGA
GTACCGAATCATATTAGCCAGATTAAAGTGCGTATTGATACTATTTCTATCAATGGTGATGTTTTATCATTCCAGGCAGA
TGCTTCAGGTAACACTTATCAAGCTTTTTACACATTAAAAAATAAAAGTGAGAAAGATTATTTTCAAAATCTTGATAATA
ATATAATGATAATTGCAGATATCAAACTTGAAGAAGCAGAGGAGAGAAGGCATTTTAATGGCTTTGATTATCGTCAGTAT
TTAAAAAGACATGGAATTTATCGTATCGCCAAAGTGACAAAGATAAAACAGATACGCTTATTTCAACATAGGTCTTTCTT
TGCTCTTATGTCTAAGTGGCGTAGAAGTGCAATTGTTATTAGTCAAACTTTTCCAAATCCTATGCGTCACTATATGTCAG
GGCTTTTGTTTGGATATCTAGATAAGACCTTTGATGACATGTCCGATTTATATAGTAGTCTAGGTATTATACATTTATTT
GCTTTGTCAGGTATGCAAGTAGGTTTTTTTCTCGGTATTTTTCGTTATATCTGTCTACGTATTGGCTTACGTCTAGACCA
TGTTTGGTTACTTCAAATACCATTCTCGCTAATTTATGCTGGTTTAACAGGCTTTAGTATCTCAGTCGTTAGGGCACTTA
TTCAATCTTTATTATCACATAGCGGTGTCAAGAAAGATGAGAACTTTGCTCTCTGCTTGTTAATTTGTCTTATCTCCCTC
CCCCACTCACTTTTGACTACGGGAGGAGTTCTTAGCTTTGCTTATGCTTTTATACTTACGATGACCTCCTTTGATCATTT
TTCGAGTATAAAAAAAGTAGCTATCGAATCTTTGACAGTCTCTGTAGGAATTCTTCCCATACTAACCTACTATTTTTCGG
GTTTTCAACCAATATCAATTATATTAACAGCACTTTTATCTTTTGCATTTGATATTATATTTTTGCCTTTATTAACTGTT
ATATTTGTCTTATCGCCTATCGTTAAATTAAGTTGTATTAATAGTTTGTTTGAAATCCTAGAAGTGTTATTAAAATGGAC
TGGGCAACTGTTTCCAAGGCCACTTATTTTTGGAAAGCCCAGCCTTTTTCTTTTAATAGTCATGATTATAATTTTGGGAT
TACTTTATGATTATTATCATTCTAAATGTTTTCGTTATTGCTCCCTTCTTATTATCTTTACCTTGTTTTTTATCACTAAG
AATCCAATTACTAACGAGGTTGCGATTTTAGATGTTGGACAGGGAGATAGTATTTTAGTGAGGGATTGGTTAGGAAAAAC
AATTTTAATTGATACTGGGGGAAGGGTGAGATTTGAACAGCCTGAAGAATGGAAACAAAAAGTAAATCAGTCTAATGCTA
AGAGAACGCTCATTCCTTACTTGAAAAGCAGAGGTATTAGCAAGATAGATGATTTAGTGATAACTCATACCGATACAGAT
CATATGGGGGATATGGAAGTTATCTCAAAGCATTTTAAAGTTGCACGTTTGATTACAAGTTCAGGTTCTTTAACGAATTC
GCAGTACGTTAAGCATTTATCAAAGATAGGTGTAGCGGTAAAATCTATAGAAGCCGGTGATAAACTTGCTGTCATGGGAA
GTTATTTACAAGTACTTTACCCATGGCACAAGGGTGATGGAAAAAATAATGATTCAATTGTTTTATATGGACATTTATTA
GGAAAAGGCTTCTTATTTACCGGTGATTTGGAGGAAGAGGGAGAAAAGCAGTTATTAGAAGCTTATCCTAATTTATCAGT
AGATATCCTTAAAGCAGGACATCATGGTTCTAAGGGCTCATCAAGTCTATCCTTTCTGAAAAAGTTGTCTCCTAGTGTGG
TTCTAGTTTCAGCTGGTAAAAATAATCGTTACCAGCATCCTCATCAAGAGACTTTACAAAGGTTCCAAAAGATTAAAAGC
AAGATTTTCCGAACGGATCAATCAGGTACAATTAGGCTAACAGGATGGTGGAAGTGGCATATTCAGACAGTTCGTTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis NCTC 12261 |
48.4 |
100 |
0.487 |
| comEC/celB | Streptococcus mitis SK321 |
48.529 |
100 |
0.487 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
47.059 |
100 |
0.472 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
46.586 |
100 |
0.467 |
| comEC/celB | Streptococcus pneumoniae D39 |
46.586 |
100 |
0.467 |
| comEC/celB | Streptococcus pneumoniae R6 |
46.586 |
100 |
0.467 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
43.338 |
99.732 |
0.432 |