Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | FGL23_RS07910 | Genome accession | NZ_LR594049 |
| Coordinates | 1642274..1644514 (-) | Length | 746 a.a. |
| NCBI ID | WP_061603130.1 | Uniprot ID | - |
| Organism | Streptococcus gordonii strain NCTC10231 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1637274..1649514
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| FGL23_RS07880 (NCTC10231_01571) | - | 1637546..1638043 (-) | 498 | WP_045635087.1 | DUF1648 domain-containing protein | - |
| FGL23_RS07885 (NCTC10231_01572) | - | 1638030..1638311 (-) | 282 | WP_012130660.1 | autorepressor SdpR family transcription factor | - |
| FGL23_RS07890 (NCTC10231_01573) | - | 1638289..1639077 (-) | 789 | WP_061603132.1 | YhfC family intramembrane metalloprotease | - |
| FGL23_RS07895 (NCTC10231_01574) | - | 1639283..1640356 (-) | 1074 | WP_138115456.1 | serine hydrolase | - |
| FGL23_RS07900 (NCTC10231_01575) | sodA | 1640468..1641073 (-) | 606 | WP_045635085.1 | superoxide dismutase SodA | - |
| FGL23_RS07905 (NCTC10231_01576) | holA | 1641146..1642183 (-) | 1038 | WP_061603131.1 | DNA polymerase III subunit delta | - |
| FGL23_RS07910 (NCTC10231_01577) | comEC/celB | 1642274..1644514 (-) | 2241 | WP_061603130.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| FGL23_RS07915 (NCTC10231_01578) | comEA/celA/cilE | 1644498..1645163 (-) | 666 | WP_060554135.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| FGL23_RS07920 (NCTC10231_01579) | - | 1645260..1645787 (-) | 528 | WP_060554134.1 | HXXEE domain-containing protein | - |
| FGL23_RS07925 (NCTC10231_01580) | - | 1645826..1646566 (-) | 741 | WP_060554133.1 | lysophospholipid acyltransferase family protein | - |
| FGL23_RS07930 (NCTC10231_01581) | - | 1646696..1649065 (+) | 2370 | WP_060554132.1 | cation-translocating P-type ATPase | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 85074.98 Da Isoelectric Point: 9.8605
>NTDB_id=1127974 FGL23_RS07910 WP_061603130.1 1642274..1644514(-) (comEC/celB) [Streptococcus gordonii strain NCTC10231]
MSQWIKKLPLSPIYLCFLLVWLYFAIYSGEKLAYLGVFLLIARLIWHYPRKKWLPTLAILACFAIFFYARRELTERTFQS
QPAPARQVLVLPDTIKVNGDSLSFRGRIAGRLYQLYYKLASPREKKTFQKLTDLVTLEIEAEFNLAEGQRNFSGFDYQAY
LKSQGIYRTVKVSGIISSRSSQSANPFDWLSVWRRKALVFIKSTFPSPMSHYMTGLLFGDLDIDFAEMNDLYSSLGIIHL
FALSGMQVGFFMDVFRKILLRIGLRMETVDWLQFPLSFIYAGLTGFSVSVVRSLVQKLLSQFGVRRLDNFAMTMLVLMLL
MPSFLLTTGGVLSCAYAFIITMLDFKDSSGFRKAVLESLSISLGILPILIYYFAEFQPWSILLTFLFSIVFDVFMLPLLS
LIFVLSPLFAIIQVNIFFQWLEIVIRWVAGLSTRPIVLGQPNLTLLILLLLALALLYDFRKQKKIKAGLSLLAILLFFLS
KYPLQNEITVVDIGQGDSIFLRDIRGRTIVIDTGGRVEIGKKEAWQERVRKSNAETTLIPYLKSRGVDHLDQLVLTHTDT
DHMGDMLDLAKHFSIREIYVSKGSMTQSDFVGKLKKMKAKVHVVEVGDRLPIFDSALEVLYPLTQGDGGNDDSIVLYGEF
FRTKFLFTGDLEAPGEGQMVTAYPDLRVDVLKAGHHGSKGSSSPEFLEHIKPKLALISAGKNNRYQHPHQETLERFEKIQ
TKILRTDEQGAIRFKGWSSWKIETVR
MSQWIKKLPLSPIYLCFLLVWLYFAIYSGEKLAYLGVFLLIARLIWHYPRKKWLPTLAILACFAIFFYARRELTERTFQS
QPAPARQVLVLPDTIKVNGDSLSFRGRIAGRLYQLYYKLASPREKKTFQKLTDLVTLEIEAEFNLAEGQRNFSGFDYQAY
LKSQGIYRTVKVSGIISSRSSQSANPFDWLSVWRRKALVFIKSTFPSPMSHYMTGLLFGDLDIDFAEMNDLYSSLGIIHL
FALSGMQVGFFMDVFRKILLRIGLRMETVDWLQFPLSFIYAGLTGFSVSVVRSLVQKLLSQFGVRRLDNFAMTMLVLMLL
MPSFLLTTGGVLSCAYAFIITMLDFKDSSGFRKAVLESLSISLGILPILIYYFAEFQPWSILLTFLFSIVFDVFMLPLLS
LIFVLSPLFAIIQVNIFFQWLEIVIRWVAGLSTRPIVLGQPNLTLLILLLLALALLYDFRKQKKIKAGLSLLAILLFFLS
KYPLQNEITVVDIGQGDSIFLRDIRGRTIVIDTGGRVEIGKKEAWQERVRKSNAETTLIPYLKSRGVDHLDQLVLTHTDT
DHMGDMLDLAKHFSIREIYVSKGSMTQSDFVGKLKKMKAKVHVVEVGDRLPIFDSALEVLYPLTQGDGGNDDSIVLYGEF
FRTKFLFTGDLEAPGEGQMVTAYPDLRVDVLKAGHHGSKGSSSPEFLEHIKPKLALISAGKNNRYQHPHQETLERFEKIQ
TKILRTDEQGAIRFKGWSSWKIETVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=1127974 FGL23_RS07910 WP_061603130.1 1642274..1644514(-) (comEC/celB) [Streptococcus gordonii strain NCTC10231]
ATGTCACAGTGGATTAAAAAACTTCCCCTTTCACCAATCTATTTGTGCTTTCTTTTGGTTTGGCTTTACTTCGCTATTTA
TAGTGGGGAAAAGCTCGCTTATCTGGGAGTTTTTCTGCTTATAGCTCGCCTAATCTGGCATTATCCAAGAAAGAAATGGT
TGCCAACTTTGGCTATCCTAGCCTGTTTTGCTATCTTTTTTTATGCTAGACGCGAGTTGACGGAGCGAACCTTTCAGTCT
CAACCAGCTCCAGCAAGACAAGTCTTAGTTTTACCAGATACAATTAAAGTAAATGGAGACTCCCTGTCATTCCGTGGTAG
AATAGCTGGCAGACTCTATCAGCTCTACTACAAATTAGCAAGTCCAAGAGAGAAAAAAACTTTTCAAAAACTAACTGACT
TGGTCACTTTAGAGATAGAAGCGGAGTTCAATCTAGCAGAAGGACAGCGCAATTTCTCTGGCTTTGACTATCAGGCTTAT
TTAAAAAGTCAGGGAATCTATCGGACGGTTAAGGTCAGTGGGATTATCTCTAGTCGTTCTAGTCAGTCTGCTAATCCATT
TGATTGGCTGTCTGTTTGGCGTAGGAAGGCCTTGGTTTTCATTAAGTCTACTTTTCCAAGCCCGATGAGTCACTACATGA
CAGGACTCTTGTTTGGTGATTTAGATATTGATTTTGCAGAGATGAATGACTTGTACTCAAGTTTAGGAATTATCCATCTT
TTTGCTTTATCAGGAATGCAAGTTGGCTTTTTTATGGATGTCTTTCGAAAAATTCTTCTACGCATTGGCTTAAGAATGGA
AACAGTAGATTGGTTGCAATTCCCTTTGTCCTTTATTTATGCTGGTTTGACCGGATTTTCTGTGTCAGTAGTGAGAAGTT
TAGTGCAAAAGTTGTTGTCTCAATTTGGAGTGAGGCGCTTGGATAATTTTGCGATGACCATGCTGGTCTTAATGCTTCTT
ATGCCAAGCTTTCTCCTGACAACAGGCGGAGTCCTATCTTGCGCTTATGCCTTTATCATCACGATGTTGGACTTTAAAGA
TTCTAGTGGTTTTCGCAAAGCAGTGCTAGAGAGTTTAAGCATCTCGCTAGGTATTTTACCAATTCTTATCTATTATTTTG
CAGAATTCCAGCCTTGGTCTATCCTCTTGACCTTCCTTTTTTCAATCGTTTTTGATGTATTTATGTTGCCTCTCTTGAGT
CTAATTTTTGTCCTTTCGCCTTTGTTTGCCATCATTCAAGTTAATATCTTTTTCCAATGGCTGGAAATAGTTATACGTTG
GGTAGCTGGTCTGTCAACAAGGCCAATAGTTTTAGGTCAGCCCAATCTGACTTTGCTTATTCTTCTTCTACTAGCTTTAG
CCTTACTCTATGATTTTAGAAAACAAAAAAAGATTAAGGCTGGTCTGAGTCTGTTAGCAATCTTGCTATTTTTCCTAAGT
AAATATCCCTTGCAAAATGAAATCACAGTGGTTGATATTGGACAAGGAGATAGTATATTTCTGAGAGATATCAGAGGCCG
AACCATTGTAATTGATACAGGAGGACGGGTGGAGATTGGTAAAAAAGAGGCTTGGCAAGAGCGCGTGAGAAAAAGCAATG
CGGAAACGACCTTAATCCCTTATTTAAAAAGTCGAGGAGTGGATCACTTGGATCAATTAGTCCTGACTCACACAGATACA
GACCATATGGGAGATATGTTGGACTTAGCCAAGCATTTTTCTATCCGAGAAATTTATGTTTCCAAAGGAAGTATGACTCA
GTCTGATTTTGTAGGCAAGTTAAAGAAGATGAAAGCTAAAGTTCATGTAGTCGAGGTGGGAGATCGGCTTCCTATTTTTG
ATTCGGCTCTTGAAGTGCTCTACCCGCTTACTCAAGGAGATGGAGGTAATGATGATTCAATTGTCTTGTATGGTGAATTT
TTCCGGACCAAGTTTCTTTTCACAGGAGATTTAGAAGCTCCAGGTGAAGGTCAGATGGTGACAGCCTATCCAGATTTAAG
AGTAGATGTGCTCAAAGCTGGGCATCATGGTTCTAAAGGATCTTCTAGTCCAGAATTTCTAGAGCATATTAAGCCTAAGT
TGGCCTTGATTTCAGCTGGTAAAAACAATCGCTACCAGCATCCGCATCAGGAAACTTTAGAAAGATTTGAAAAAATCCAG
ACCAAGATTTTACGGACAGATGAGCAGGGTGCTATTCGATTTAAAGGATGGAGTTCTTGGAAGATAGAAACAGTTCGCTA
G
ATGTCACAGTGGATTAAAAAACTTCCCCTTTCACCAATCTATTTGTGCTTTCTTTTGGTTTGGCTTTACTTCGCTATTTA
TAGTGGGGAAAAGCTCGCTTATCTGGGAGTTTTTCTGCTTATAGCTCGCCTAATCTGGCATTATCCAAGAAAGAAATGGT
TGCCAACTTTGGCTATCCTAGCCTGTTTTGCTATCTTTTTTTATGCTAGACGCGAGTTGACGGAGCGAACCTTTCAGTCT
CAACCAGCTCCAGCAAGACAAGTCTTAGTTTTACCAGATACAATTAAAGTAAATGGAGACTCCCTGTCATTCCGTGGTAG
AATAGCTGGCAGACTCTATCAGCTCTACTACAAATTAGCAAGTCCAAGAGAGAAAAAAACTTTTCAAAAACTAACTGACT
TGGTCACTTTAGAGATAGAAGCGGAGTTCAATCTAGCAGAAGGACAGCGCAATTTCTCTGGCTTTGACTATCAGGCTTAT
TTAAAAAGTCAGGGAATCTATCGGACGGTTAAGGTCAGTGGGATTATCTCTAGTCGTTCTAGTCAGTCTGCTAATCCATT
TGATTGGCTGTCTGTTTGGCGTAGGAAGGCCTTGGTTTTCATTAAGTCTACTTTTCCAAGCCCGATGAGTCACTACATGA
CAGGACTCTTGTTTGGTGATTTAGATATTGATTTTGCAGAGATGAATGACTTGTACTCAAGTTTAGGAATTATCCATCTT
TTTGCTTTATCAGGAATGCAAGTTGGCTTTTTTATGGATGTCTTTCGAAAAATTCTTCTACGCATTGGCTTAAGAATGGA
AACAGTAGATTGGTTGCAATTCCCTTTGTCCTTTATTTATGCTGGTTTGACCGGATTTTCTGTGTCAGTAGTGAGAAGTT
TAGTGCAAAAGTTGTTGTCTCAATTTGGAGTGAGGCGCTTGGATAATTTTGCGATGACCATGCTGGTCTTAATGCTTCTT
ATGCCAAGCTTTCTCCTGACAACAGGCGGAGTCCTATCTTGCGCTTATGCCTTTATCATCACGATGTTGGACTTTAAAGA
TTCTAGTGGTTTTCGCAAAGCAGTGCTAGAGAGTTTAAGCATCTCGCTAGGTATTTTACCAATTCTTATCTATTATTTTG
CAGAATTCCAGCCTTGGTCTATCCTCTTGACCTTCCTTTTTTCAATCGTTTTTGATGTATTTATGTTGCCTCTCTTGAGT
CTAATTTTTGTCCTTTCGCCTTTGTTTGCCATCATTCAAGTTAATATCTTTTTCCAATGGCTGGAAATAGTTATACGTTG
GGTAGCTGGTCTGTCAACAAGGCCAATAGTTTTAGGTCAGCCCAATCTGACTTTGCTTATTCTTCTTCTACTAGCTTTAG
CCTTACTCTATGATTTTAGAAAACAAAAAAAGATTAAGGCTGGTCTGAGTCTGTTAGCAATCTTGCTATTTTTCCTAAGT
AAATATCCCTTGCAAAATGAAATCACAGTGGTTGATATTGGACAAGGAGATAGTATATTTCTGAGAGATATCAGAGGCCG
AACCATTGTAATTGATACAGGAGGACGGGTGGAGATTGGTAAAAAAGAGGCTTGGCAAGAGCGCGTGAGAAAAAGCAATG
CGGAAACGACCTTAATCCCTTATTTAAAAAGTCGAGGAGTGGATCACTTGGATCAATTAGTCCTGACTCACACAGATACA
GACCATATGGGAGATATGTTGGACTTAGCCAAGCATTTTTCTATCCGAGAAATTTATGTTTCCAAAGGAAGTATGACTCA
GTCTGATTTTGTAGGCAAGTTAAAGAAGATGAAAGCTAAAGTTCATGTAGTCGAGGTGGGAGATCGGCTTCCTATTTTTG
ATTCGGCTCTTGAAGTGCTCTACCCGCTTACTCAAGGAGATGGAGGTAATGATGATTCAATTGTCTTGTATGGTGAATTT
TTCCGGACCAAGTTTCTTTTCACAGGAGATTTAGAAGCTCCAGGTGAAGGTCAGATGGTGACAGCCTATCCAGATTTAAG
AGTAGATGTGCTCAAAGCTGGGCATCATGGTTCTAAAGGATCTTCTAGTCCAGAATTTCTAGAGCATATTAAGCCTAAGT
TGGCCTTGATTTCAGCTGGTAAAAACAATCGCTACCAGCATCCGCATCAGGAAACTTTAGAAAGATTTGAAAAAATCCAG
ACCAAGATTTTACGGACAGATGAGCAGGGTGCTATTCGATTTAAAGGATGGAGTTCTTGGAAGATAGAAACAGTTCGCTA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
58.902 |
100 |
0.59 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
58.099 |
100 |
0.582 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
57.641 |
100 |
0.576 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
57.162 |
100 |
0.572 |
| comEC/celB | Streptococcus pneumoniae D39 |
57.162 |
100 |
0.572 |
| comEC/celB | Streptococcus pneumoniae R6 |
57.162 |
100 |
0.572 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
50.538 |
99.732 |
0.504 |