Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | GOM46_RS05055 | Genome accession | NZ_CP046525 |
| Coordinates | 1009126..1011336 (-) | Length | 736 a.a. |
| NCBI ID | WP_235083305.1 | Uniprot ID | - |
| Organism | Streptococcus infantis strain SO | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1004126..1016336
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| GOM46_RS05025 (GOM46_05015) | - | 1005163..1005699 (-) | 537 | WP_006150931.1 | F0F1 ATP synthase subunit delta | - |
| GOM46_RS05030 (GOM46_05020) | atpF | 1005699..1006193 (-) | 495 | WP_006150923.1 | F0F1 ATP synthase subunit B | - |
| GOM46_RS05035 (GOM46_05025) | atpB | 1006212..1006928 (-) | 717 | WP_006150950.1 | F0F1 ATP synthase subunit A | - |
| GOM46_RS05040 (GOM46_05030) | - | 1006961..1007161 (-) | 201 | WP_001054554.1 | F0F1 ATP synthase subunit C | - |
| GOM46_RS05045 (GOM46_05035) | - | 1007392..1007703 (-) | 312 | WP_025170042.1 | CHY zinc finger protein | - |
| GOM46_RS05050 (GOM46_05040) | - | 1007717..1009003 (-) | 1287 | WP_006150948.1 | peptidase U32 family protein | - |
| GOM46_RS05055 (GOM46_05045) | comEC/celB | 1009126..1011336 (-) | 2211 | WP_235083305.1 | ComEC/Rec2 family competence protein | Machinery gene |
| GOM46_RS05060 (GOM46_05050) | comEA/celA/cilE | 1011320..1011961 (-) | 642 | WP_006150927.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| GOM46_RS05065 (GOM46_05055) | - | 1012027..1012596 (-) | 570 | WP_006150922.1 | GNAT family N-acetyltransferase | - |
| GOM46_RS05070 (GOM46_05060) | ald | 1012762..1013874 (+) | 1113 | WP_006150941.1 | alanine dehydrogenase | - |
| GOM46_RS05075 (GOM46_05065) | - | 1013921..1014907 (-) | 987 | WP_025170041.1 | PhoH family protein | - |
| GOM46_RS05080 (GOM46_05070) | - | 1014992..1015207 (-) | 216 | WP_001232087.1 | YozE family protein | - |
| GOM46_RS05085 (GOM46_05075) | cvfB | 1015216..1016070 (-) | 855 | WP_025170040.1 | RNA-binding virulence regulatory protein CvfB | - |
Sequence
Protein
Download Length: 736 a.a. Molecular weight: 84800.53 Da Isoelectric Point: 9.7348
>NTDB_id=405072 GOM46_RS05055 WP_235083305.1 1009126..1011336(-) (comEC/celB) [Streptococcus infantis strain SO]
MLQWIRRFPIPKIYLSFLLLWLYYAIFSANFLALFGFVFLLVCLFFQFPWKTVTKVLLVCSLFGSWFVFQKWQQEEASQH
LVNTVDTVRILPDTIKVNGDSLSFRGKADGRLFQVYYKLQSEAEKKKFQDLSELHEMAVKGKLASPQGANNFAGFDYRNY
LKTQGIYQTLTISEIVELKQTSSWDIGENLSSLRRKAVVWIKRNFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDAFKKLLLRLGLTQEKFKWLAHPFSLVYAGLTGFSASVIRSLLQKLLAQHGFKSLDNFALTILILFLI
MPNFFLTAGGVLSCAYAFILTMTGEEVAGIKGLVRESCIISLGILPILSFYFSEFQPWSILLTFFFSFLFDMVLLPLLSI
LFCLSFIYPITQFNFLFEWLENIIRYISQLSTRPLVFGQPSLWILILFLIILALIYDYRKNLKKISMLVIVAISLFLVTK
HPLENEITVLDMEQGRSIFLRDMTGKTILLDVGEKSERDKKEAWQERISSSNAERSLIPYLKSRGVAKIDQLVLTTSEPK
QLDHVLEMSKSFELEEILVSEETLSKREFMDKLKKSKIKVATIKTGQQLFIFGSSLEAFTSQNGDKKDSMVLYGKLLNQT
FLVTGNLEEKFLTKSYPKLQADIVITHQQASKKKTDVEVFKNLQPKTTVISVDKKKKFKEKNEESNQELGNSIYKTDQKG
AIRFKGWSTWRIETVR
MLQWIRRFPIPKIYLSFLLLWLYYAIFSANFLALFGFVFLLVCLFFQFPWKTVTKVLLVCSLFGSWFVFQKWQQEEASQH
LVNTVDTVRILPDTIKVNGDSLSFRGKADGRLFQVYYKLQSEAEKKKFQDLSELHEMAVKGKLASPQGANNFAGFDYRNY
LKTQGIYQTLTISEIVELKQTSSWDIGENLSSLRRKAVVWIKRNFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDAFKKLLLRLGLTQEKFKWLAHPFSLVYAGLTGFSASVIRSLLQKLLAQHGFKSLDNFALTILILFLI
MPNFFLTAGGVLSCAYAFILTMTGEEVAGIKGLVRESCIISLGILPILSFYFSEFQPWSILLTFFFSFLFDMVLLPLLSI
LFCLSFIYPITQFNFLFEWLENIIRYISQLSTRPLVFGQPSLWILILFLIILALIYDYRKNLKKISMLVIVAISLFLVTK
HPLENEITVLDMEQGRSIFLRDMTGKTILLDVGEKSERDKKEAWQERISSSNAERSLIPYLKSRGVAKIDQLVLTTSEPK
QLDHVLEMSKSFELEEILVSEETLSKREFMDKLKKSKIKVATIKTGQQLFIFGSSLEAFTSQNGDKKDSMVLYGKLLNQT
FLVTGNLEEKFLTKSYPKLQADIVITHQQASKKKTDVEVFKNLQPKTTVISVDKKKKFKEKNEESNQELGNSIYKTDQKG
AIRFKGWSTWRIETVR
Nucleotide
Download Length: 2211 bp
>NTDB_id=405072 GOM46_RS05055 WP_235083305.1 1009126..1011336(-) (comEC/celB) [Streptococcus infantis strain SO]
ATGTTACAGTGGATTAGACGCTTTCCCATTCCAAAAATCTATCTCAGTTTTCTTTTGCTATGGCTCTACTATGCAATCTT
TTCAGCAAATTTCTTAGCACTTTTTGGCTTTGTCTTTTTACTAGTCTGCCTTTTTTTCCAATTTCCATGGAAGACTGTGA
CCAAAGTCCTATTGGTTTGCAGCCTATTTGGGAGTTGGTTTGTATTTCAAAAATGGCAGCAAGAAGAAGCGAGCCAACAT
CTAGTCAACACGGTTGATACAGTTCGAATCTTGCCTGATACCATCAAGGTTAATGGTGATAGCTTGTCTTTTCGTGGCAA
GGCTGATGGTAGACTCTTTCAGGTCTATTATAAGCTCCAGTCAGAAGCCGAAAAGAAAAAATTTCAAGACTTATCTGAAC
TACATGAAATGGCTGTGAAAGGCAAGTTAGCTAGTCCTCAAGGGGCAAACAATTTTGCTGGTTTTGATTATCGAAACTAT
CTCAAAACGCAAGGAATCTATCAGACCTTAACTATATCCGAAATTGTTGAGCTAAAGCAAACAAGTAGTTGGGATATCGG
AGAAAATCTATCAAGCTTGAGGAGAAAAGCAGTTGTCTGGATTAAACGGAATTTTCCCGATCCCATGCGCAACTATATGA
CCGGTCTTTTATTGGGTCATTTAGATACGGACTTTGAGGAAATGAATGAACTCTACTCGAGTCTGGGGATTATTCACCTT
TTTGCTTTATCTGGAATGCAAGTCGGATTTTTCATGGATGCTTTCAAAAAGTTACTTCTGCGTTTGGGTTTGACGCAAGA
GAAGTTCAAATGGTTAGCCCACCCCTTCTCCTTAGTCTATGCAGGTTTGACAGGATTTTCAGCCTCTGTTATTCGTAGTC
TTCTACAAAAGCTATTGGCTCAGCATGGCTTTAAGTCACTGGATAATTTTGCTTTAACGATTCTTATTCTATTTCTTATC
ATGCCGAACTTTTTCTTAACTGCTGGGGGAGTTCTTTCCTGTGCTTATGCCTTTATCCTGACCATGACTGGTGAGGAAGT
AGCAGGGATAAAAGGTCTGGTTAGAGAGAGTTGCATTATTTCCTTGGGAATTCTGCCAATTTTATCATTTTATTTTTCAG
AATTTCAACCGTGGTCCATTCTCTTGACATTTTTCTTTTCATTCTTATTTGATATGGTCCTGTTACCGCTCTTATCGATT
CTCTTCTGTCTGTCATTTATATATCCCATCACCCAGTTCAACTTTCTCTTTGAATGGCTTGAAAACATCATTCGCTATAT
TTCTCAGCTATCTACGAGGCCTCTTGTTTTTGGTCAGCCGAGTCTTTGGATTTTAATCCTTTTTTTGATTATTTTAGCTC
TCATCTACGATTATCGAAAGAATTTGAAAAAAATATCTATGCTTGTCATAGTTGCTATCTCGCTCTTTTTAGTAACCAAA
CATCCACTGGAGAATGAAATCACAGTCCTAGATATGGAACAGGGGCGGAGCATTTTCCTAAGAGACATGACAGGAAAGAC
CATACTACTGGATGTCGGAGAAAAATCTGAACGTGACAAGAAAGAAGCCTGGCAGGAGAGGATTTCCTCTAGCAATGCCG
AACGGAGTTTAATCCCTTATCTAAAAAGTCGAGGAGTTGCAAAGATTGATCAGCTTGTGCTAACGACTAGTGAACCTAAG
CAACTAGACCATGTGTTAGAAATGAGTAAATCCTTCGAGCTTGAAGAGATTCTAGTAAGTGAAGAAACTTTATCTAAAAG
AGAATTTATGGATAAGCTGAAGAAAAGCAAAATCAAGGTAGCTACTATTAAAACAGGGCAGCAGTTGTTTATTTTTGGCA
GTAGTTTGGAAGCATTCACTAGCCAAAATGGCGATAAAAAGGATTCAATGGTCTTGTATGGAAAGTTACTAAACCAAACC
TTTCTAGTTACTGGAAATTTAGAGGAAAAGTTCTTAACCAAGTCTTATCCGAAACTCCAAGCAGATATTGTGATAACTCA
TCAGCAAGCATCGAAGAAAAAGACAGATGTAGAAGTCTTCAAAAACTTACAACCTAAAACCACTGTCATTTCGGTAGACA
AGAAGAAAAAATTCAAAGAAAAAAATGAGGAAAGTAACCAAGAACTTGGGAATTCGATTTATAAAACGGACCAAAAGGGT
GCCATTCGTTTCAAAGGTTGGAGTACTTGGCGAATAGAAACAGTTCGCTGA
ATGTTACAGTGGATTAGACGCTTTCCCATTCCAAAAATCTATCTCAGTTTTCTTTTGCTATGGCTCTACTATGCAATCTT
TTCAGCAAATTTCTTAGCACTTTTTGGCTTTGTCTTTTTACTAGTCTGCCTTTTTTTCCAATTTCCATGGAAGACTGTGA
CCAAAGTCCTATTGGTTTGCAGCCTATTTGGGAGTTGGTTTGTATTTCAAAAATGGCAGCAAGAAGAAGCGAGCCAACAT
CTAGTCAACACGGTTGATACAGTTCGAATCTTGCCTGATACCATCAAGGTTAATGGTGATAGCTTGTCTTTTCGTGGCAA
GGCTGATGGTAGACTCTTTCAGGTCTATTATAAGCTCCAGTCAGAAGCCGAAAAGAAAAAATTTCAAGACTTATCTGAAC
TACATGAAATGGCTGTGAAAGGCAAGTTAGCTAGTCCTCAAGGGGCAAACAATTTTGCTGGTTTTGATTATCGAAACTAT
CTCAAAACGCAAGGAATCTATCAGACCTTAACTATATCCGAAATTGTTGAGCTAAAGCAAACAAGTAGTTGGGATATCGG
AGAAAATCTATCAAGCTTGAGGAGAAAAGCAGTTGTCTGGATTAAACGGAATTTTCCCGATCCCATGCGCAACTATATGA
CCGGTCTTTTATTGGGTCATTTAGATACGGACTTTGAGGAAATGAATGAACTCTACTCGAGTCTGGGGATTATTCACCTT
TTTGCTTTATCTGGAATGCAAGTCGGATTTTTCATGGATGCTTTCAAAAAGTTACTTCTGCGTTTGGGTTTGACGCAAGA
GAAGTTCAAATGGTTAGCCCACCCCTTCTCCTTAGTCTATGCAGGTTTGACAGGATTTTCAGCCTCTGTTATTCGTAGTC
TTCTACAAAAGCTATTGGCTCAGCATGGCTTTAAGTCACTGGATAATTTTGCTTTAACGATTCTTATTCTATTTCTTATC
ATGCCGAACTTTTTCTTAACTGCTGGGGGAGTTCTTTCCTGTGCTTATGCCTTTATCCTGACCATGACTGGTGAGGAAGT
AGCAGGGATAAAAGGTCTGGTTAGAGAGAGTTGCATTATTTCCTTGGGAATTCTGCCAATTTTATCATTTTATTTTTCAG
AATTTCAACCGTGGTCCATTCTCTTGACATTTTTCTTTTCATTCTTATTTGATATGGTCCTGTTACCGCTCTTATCGATT
CTCTTCTGTCTGTCATTTATATATCCCATCACCCAGTTCAACTTTCTCTTTGAATGGCTTGAAAACATCATTCGCTATAT
TTCTCAGCTATCTACGAGGCCTCTTGTTTTTGGTCAGCCGAGTCTTTGGATTTTAATCCTTTTTTTGATTATTTTAGCTC
TCATCTACGATTATCGAAAGAATTTGAAAAAAATATCTATGCTTGTCATAGTTGCTATCTCGCTCTTTTTAGTAACCAAA
CATCCACTGGAGAATGAAATCACAGTCCTAGATATGGAACAGGGGCGGAGCATTTTCCTAAGAGACATGACAGGAAAGAC
CATACTACTGGATGTCGGAGAAAAATCTGAACGTGACAAGAAAGAAGCCTGGCAGGAGAGGATTTCCTCTAGCAATGCCG
AACGGAGTTTAATCCCTTATCTAAAAAGTCGAGGAGTTGCAAAGATTGATCAGCTTGTGCTAACGACTAGTGAACCTAAG
CAACTAGACCATGTGTTAGAAATGAGTAAATCCTTCGAGCTTGAAGAGATTCTAGTAAGTGAAGAAACTTTATCTAAAAG
AGAATTTATGGATAAGCTGAAGAAAAGCAAAATCAAGGTAGCTACTATTAAAACAGGGCAGCAGTTGTTTATTTTTGGCA
GTAGTTTGGAAGCATTCACTAGCCAAAATGGCGATAAAAAGGATTCAATGGTCTTGTATGGAAAGTTACTAAACCAAACC
TTTCTAGTTACTGGAAATTTAGAGGAAAAGTTCTTAACCAAGTCTTATCCGAAACTCCAAGCAGATATTGTGATAACTCA
TCAGCAAGCATCGAAGAAAAAGACAGATGTAGAAGTCTTCAAAAACTTACAACCTAAAACCACTGTCATTTCGGTAGACA
AGAAGAAAAAATTCAAAGAAAAAAATGAGGAAAGTAACCAAGAACTTGGGAATTCGATTTATAAAACGGACCAAAAGGGT
GCCATTCGTTTCAAAGGTTGGAGTACTTGGCGAATAGAAACAGTTCGCTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
66.756 |
100 |
0.677 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
66.577 |
100 |
0.674 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
66.354 |
100 |
0.673 |
| comEC/celB | Streptococcus pneumoniae D39 |
66.354 |
100 |
0.673 |
| comEC/celB | Streptococcus pneumoniae R6 |
66.354 |
100 |
0.673 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
65.818 |
100 |
0.667 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
40.27 |
100 |
0.405 |