Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | FD735_RS03835 | Genome accession | NZ_CP040231 |
| Coordinates | 702283..704514 (+) | Length | 743 a.a. |
| NCBI ID | WP_139658523.1 | Uniprot ID | A0A5B7Y5H2 |
| Organism | Streptococcus sp. 1643 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 697283..709514
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| FD735_RS03800 (FD735_03800) | - | 697789..698394 (+) | 606 | WP_042902395.1 | GrpB family protein | - |
| FD735_RS03805 (FD735_03805) | - | 698391..698606 (+) | 216 | WP_001232084.1 | YozE family protein | - |
| FD735_RS03810 (FD735_03810) | - | 698687..699673 (+) | 987 | WP_139658518.1 | PhoH family protein | - |
| FD735_RS03815 (FD735_03815) | ald | 699724..700836 (-) | 1113 | WP_139658520.1 | alanine dehydrogenase | - |
| FD735_RS03825 (FD735_03825) | - | 701013..701582 (+) | 570 | WP_139658521.1 | GNAT family N-acetyltransferase | - |
| FD735_RS03830 (FD735_03830) | comEA/celA/cilE | 701649..702299 (+) | 651 | WP_139658522.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| FD735_RS03835 (FD735_03835) | comEC/celB | 702283..704514 (+) | 2232 | WP_139658523.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| FD735_RS09910 | - | 704664..704878 (+) | 215 | Protein_691 | hypothetical protein | - |
| FD735_RS03840 (FD735_03840) | - | 704911..705504 (+) | 594 | WP_139658524.1 | ATP-binding cassette domain-containing protein | - |
| FD735_RS03845 (FD735_03845) | - | 705501..706685 (+) | 1185 | WP_139658525.1 | hypothetical protein | - |
| FD735_RS03850 (FD735_03850) | infC | 706991..707521 (+) | 531 | WP_000848184.1 | translation initiation factor IF-3 | - |
| FD735_RS03855 (FD735_03855) | rpmI | 707554..707754 (+) | 201 | WP_001125942.1 | 50S ribosomal protein L35 | - |
| FD735_RS03860 (FD735_03860) | rplT | 707806..708165 (+) | 360 | WP_000124830.1 | 50S ribosomal protein L20 | - |
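The Size (bp) values in the table above can be cross-checked against the coordinates, assuming the ranges are 1-based and inclusive and that a complete CDS ends in one stop codon. The following is a minimal Python sketch of that check; the function names `span_length` and `protein_length` are illustrative and are not part of the database.

```python
def span_length(coords: str) -> int:
    """Length in bp of an inclusive, 1-based 'start..end' range."""
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# comEC/celB (FD735_RS03835), coordinates taken from the table above
cds_bp = span_length("702283..704514")
print(cds_bp, protein_length(cds_bp))  # prints: 2232 743
```

Under these assumptions the recomputed values agree with the 2232 bp and 743 a.a. reported for comEC/celB. The check applies only to complete coding sequences: FD735_RS09910 (215 bp) is not a multiple of 3, so `protein_length` would raise for it.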
Sequence
Protein
Length: 743 a.a.; Molecular weight: 84542.70 Da; Isoelectric point: 8.7152
>NTDB_id=362640 FD735_RS03835 WP_139658523.1 702283..704514(+) (comEC/celB) [Streptococcus sp. 1643]
MSQWIKNFPIPLIYLSFLLLWLYYAIFGASYLALLGFVFLLVCLFFQFPWKSAGKVLAICGVFGFWFLFQTWQQSQASQN
LVDSVERVRILPDTIKVNGDSLSFRGKAEAHTFQVYYKLQSEEEKELFQTLTDLHEIELEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLTIKSIQSMKQVSSWDIRENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDAFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGIKGLDNFALTVLVLFII
MPNFFLTAGGVLSCAYAFILTMTSKEGDGIKVVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDVVFLPLLSI
LFILSFIYPVTQFNFVFEWLENIIRLVSQLASRPLVFGQPNAWLLILLLVLLALVYDMRKNIKRLAGFSLFIVGLFFLTK
HPLENEITLLDVGQGESIFLRDMAGKTILIDVGGKAESDKKIQAWQEKATTSNAQRTLIPYLKSRGVDKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSRGSLTQKEFVAELEASQNKVRSVTAGENFPIFGSYLEVLSPRQIGDEDRDGSLVLYGKL
LDKHFLFTGNLKEKDLLKQYPDLEVDVLKAGQHGAKTSSNPTFLEKLKPEITLISVGKNNRAKLPHQETLTRLEIIKSKI
YRTDQQGAIRFTGWNSWQIESVR
Nucleotide
Length: 2232 bp
>NTDB_id=362640 FD735_RS03835 WP_139658523.1 702283..704514(+) (comEC/celB) [Streptococcus sp. 1643]
ATGTCACAGTGGATTAAGAATTTCCCTATCCCCCTAATCTATCTGAGTTTTCTATTGCTCTGGCTTTACTACGCCATTTT
TGGAGCGTCCTATCTCGCACTGCTAGGTTTTGTTTTTTTGCTCGTCTGTCTCTTTTTCCAATTTCCTTGGAAATCGGCTG
GAAAAGTTCTAGCGATTTGTGGAGTTTTTGGATTTTGGTTTCTGTTTCAAACTTGGCAACAAAGTCAAGCTAGTCAAAAC
CTAGTGGATTCTGTTGAAAGGGTACGGATTTTGCCTGACACTATTAAGGTCAATGGAGATAGTCTATCCTTTCGGGGCAA
GGCTGAGGCTCACACCTTCCAAGTTTACTATAAACTCCAGTCCGAGGAAGAGAAAGAGCTCTTTCAGACCTTAACAGACC
TTCATGAGATTGAACTAGAAGGAAAACTTTCGGAACCCGAAGGGCAGAGGAATTTTGGTGGCTTTAACTACCAAGCCTAT
CTGAAGACTCAAGGAATTTACCAAACTTTGACTATCAAGAGTATCCAGTCAATGAAACAGGTTAGCAGTTGGGATATAAG
AGAAAATCTGTCTAGTTTACGTCGAAAGGCTGTAGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAACTACATGA
CAGGTCTTCTTTTAGGACATTTGGACACGGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTTGGAATTATCCATCTA
TTTGCCTTGTCAGGTATGCAAGTGGGCTTCTTTATGGATGCCTTTAAGAAACTCCTCTTGCGATTGGGCTTGACCCAAGA
GAAGTTGAAGTGGCTAACTTATCCCTTTTCTCTTATCTATGCAGGTCTGACAGGATTTTCAGCTTCAGTCATTCGCAGTC
TCTTGCAAAAGTTACTGGCCCAACATGGTATTAAGGGTTTGGATAATTTTGCCTTGACGGTCCTTGTCCTTTTTATCATT
ATGCCAAACTTTTTCCTAACTGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCTTGACCATGACTAGCAAAGAAGG
CGATGGGATCAAGGTAGTTGCCAGAGAAAGTTTGGTCATTTCTTTGGGAATATTGCCCATTCTATCCTTCTATTTTGCAG
AATTTCAGCCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTGTTTGATGTGGTTTTCTTGCCGCTTTTGTCCATC
TTATTCATTCTGTCTTTTATTTACCCAGTCACTCAGTTTAACTTTGTCTTTGAGTGGTTGGAGAACATCATTCGCTTGGT
ATCGCAGCTGGCAAGCAGGCCTCTGGTCTTTGGTCAACCCAACGCATGGCTGTTGATTCTACTGTTAGTTTTATTAGCCT
TGGTCTATGACATGAGGAAAAACATCAAAAGACTAGCAGGATTCAGTCTCTTTATTGTGGGGCTCTTTTTCCTGACCAAA
CACCCACTGGAAAATGAAATCACCTTGCTGGATGTAGGGCAAGGTGAAAGCATTTTTCTACGTGATATGGCTGGTAAAAC
CATTCTCATAGATGTGGGAGGTAAAGCAGAATCTGACAAGAAAATCCAAGCTTGGCAGGAAAAGGCGACGACCAGCAATG
CCCAGCGTACCTTGATTCCCTATCTTAAAAGTCGAGGAGTAGATAAGATTGACCAGCTAATTTTGACCAATACAGACAAG
GAACATGTTGGTGATTTGCTGGAGGTGACCAAGGCTTTCCATGTTGGGGAGATTTTAGTATCAAGAGGAAGTCTGACACA
GAAGGAATTTGTAGCGGAACTAGAAGCAAGTCAAAACAAGGTACGTAGTGTGACAGCTGGGGAGAATTTCCCGATTTTTG
GCAGTTACTTAGAAGTCCTATCTCCAAGGCAGATTGGAGATGAGGATCGTGATGGTTCTCTGGTTCTTTATGGAAAACTT
TTGGATAAGCACTTTCTCTTCACAGGAAATTTGAAAGAGAAGGATCTTCTAAAGCAATACCCTGACTTAGAGGTGGATGT
CCTGAAAGCAGGCCAACATGGTGCTAAAACATCATCAAATCCAACTTTCCTAGAAAAACTCAAACCAGAAATTACTCTTA
TTTCAGTTGGAAAGAACAATCGTGCGAAACTCCCCCATCAGGAAACCTTGACACGACTAGAAATCATCAAGAGTAAGATT
TACCGAACTGACCAGCAAGGGGCTATCCGCTTTACAGGGTGGAATAGTTGGCAGATTGAAAGTGTTCGTTAG
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 | 87.534 | 100 | 0.879 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 86.577 | 100 | 0.868 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 85.791 | 100 | 0.861 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 85.255 | 100 | 0.856 |
| comEC/celB | Streptococcus pneumoniae D39 | 85.255 | 100 | 0.856 |
| comEC/celB | Streptococcus pneumoniae R6 | 85.255 | 100 | 0.856 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 44.818 | 100 | 0.448 |