Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | NBW44_RS04160 | Genome accession | NZ_OW724079 |
| Coordinates | 826719..828929 (-) | Length | 736 a.a. |
| NCBI ID | WP_250307319.1 | Uniprot ID | - |
| Organism | Streptococcus sp. Marseille-Q3533 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 821719..833929
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| NBW44_RS04130 | - | 822757..823293 (-) | 537 | WP_075567718.1 | F0F1 ATP synthase subunit delta | - |
| NBW44_RS04135 | atpF | 823293..823787 (-) | 495 | WP_250307316.1 | F0F1 ATP synthase subunit B | - |
| NBW44_RS04140 | atpB | 823806..824522 (-) | 717 | WP_004253312.1 | F0F1 ATP synthase subunit A | - |
| NBW44_RS04145 | - | 824555..824755 (-) | 201 | WP_004253309.1 | F0F1 ATP synthase subunit C | - |
| NBW44_RS04150 | - | 824984..825295 (-) | 312 | WP_250307317.1 | CHY zinc finger protein | - |
| NBW44_RS04155 | - | 825310..826596 (-) | 1287 | WP_250307318.1 | U32 family peptidase | - |
| NBW44_RS04160 | comEC/celB | 826719..828929 (-) | 2211 | WP_250307319.1 | ComEC/Rec2 family competence protein | Machinery gene |
| NBW44_RS04165 | comEA/celA/cilE | 828913..829554 (-) | 642 | WP_250307320.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| NBW44_RS04170 | - | 829620..830189 (-) | 570 | WP_250307321.1 | GNAT family N-acetyltransferase | - |
| NBW44_RS04175 | ald | 830396..831508 (+) | 1113 | WP_250307322.1 | alanine dehydrogenase | - |
| NBW44_RS04180 | - | 831549..832535 (-) | 987 | WP_250307323.1 | PhoH family protein | - |
| NBW44_RS04185 | - | 832620..832835 (-) | 216 | WP_075567726.1 | YozE family protein | - |
| NBW44_RS04190 | cvfB | 832844..833698 (-) | 855 | WP_250307325.1 | RNA-binding virulence regulatory protein CvfB | - |
Sequence
Protein
Download Length: 736 a.a. Molecular weight: 84983.88 Da Isoelectric Point: 9.7562
>NTDB_id=1152597 NBW44_RS04160 WP_250307319.1 826719..828929(-) (comEC/celB) [Streptococcus sp. Marseille-Q3533]
MSQWISRFPIPKIYLSFLLLWLYYAIFSASFLALLGFVFLLLCLFFQFPWKNVVKILFVCSFFGSWFIFQKWQQEEVSQH
LVDSVNTVRILPDTIKFNGDSLFFRGKAEGRLFQVYYKFQSESEKERFKELSELHEIVVKGKLAIPQGANNFAGFDYRNY
LKTQGIYQTLTISEIVELKKTYSWDIGENLSSLRRKAVVWIKRKFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHI
FALSGMQVGFFMNAFKKFFLRLGMSQENLKCLVYPFSLVYAGLTGFSASVIRSLLQKLLAQHGFKGLDNFALTVLFLFIV
MPNFFLTAGGVLSCAYAFILTMTGEEVAGIKGLVRESFIISLGILPILSFYFSEFQPWSILLTFVFSFLFDMVLLPLLSI
LFCLSWIYPITQFNFLFEWLENIIRYVSQLSTRPFVFGQPSLWVLVFLLISLAIVYDYRKNLKKTQIIALFVLALFLVTK
HPLENEITVLDMEQGRSIFLRDMTGKTILLDVGEKLAVEKKEAWQEKVITSNAKRSLIPYIKSRGVAKIDQLVLTTSQPK
QLDHLLEISKSFNLGEILVTEETLSKREFMDKLKESNLKVRPIKTGEQLFIFGSSLEVIENQNSDSKSSIVMYGKLLNQT
FLVTGNIEEKFLNKSYPKIQADVVLTHQQASKKKTDVKVFEIFQPKITVISVDKKKKFKEKNGEINQELGNSIYKTDQKG
AIRFKGWSAWQIETVR
MSQWISRFPIPKIYLSFLLLWLYYAIFSASFLALLGFVFLLLCLFFQFPWKNVVKILFVCSFFGSWFIFQKWQQEEVSQH
LVDSVNTVRILPDTIKFNGDSLFFRGKAEGRLFQVYYKFQSESEKERFKELSELHEIVVKGKLAIPQGANNFAGFDYRNY
LKTQGIYQTLTISEIVELKKTYSWDIGENLSSLRRKAVVWIKRKFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHI
FALSGMQVGFFMNAFKKFFLRLGMSQENLKCLVYPFSLVYAGLTGFSASVIRSLLQKLLAQHGFKGLDNFALTVLFLFIV
MPNFFLTAGGVLSCAYAFILTMTGEEVAGIKGLVRESFIISLGILPILSFYFSEFQPWSILLTFVFSFLFDMVLLPLLSI
LFCLSWIYPITQFNFLFEWLENIIRYVSQLSTRPFVFGQPSLWVLVFLLISLAIVYDYRKNLKKTQIIALFVLALFLVTK
HPLENEITVLDMEQGRSIFLRDMTGKTILLDVGEKLAVEKKEAWQEKVITSNAKRSLIPYIKSRGVAKIDQLVLTTSQPK
QLDHLLEISKSFNLGEILVTEETLSKREFMDKLKESNLKVRPIKTGEQLFIFGSSLEVIENQNSDSKSSIVMYGKLLNQT
FLVTGNIEEKFLNKSYPKIQADVVLTHQQASKKKTDVKVFEIFQPKITVISVDKKKKFKEKNGEINQELGNSIYKTDQKG
AIRFKGWSAWQIETVR
Nucleotide
Download Length: 2211 bp
>NTDB_id=1152597 NBW44_RS04160 WP_250307319.1 826719..828929(-) (comEC/celB) [Streptococcus sp. Marseille-Q3533]
ATGTCACAGTGGATTAGTCGATTTCCCATCCCAAAAATCTATCTCAGTTTTCTTTTATTATGGCTCTACTATGCTATTTT
TTCAGCAAGCTTTTTAGCACTTTTGGGCTTTGTCTTTTTACTGCTCTGCCTCTTTTTTCAATTTCCATGGAAGAATGTGG
TCAAGATTCTCTTTGTTTGTAGTTTTTTTGGAAGCTGGTTTATATTTCAAAAATGGCAACAAGAAGAAGTGAGTCAACAT
CTTGTTGACTCCGTTAATACGGTACGAATCTTGCCTGATACTATCAAGTTCAATGGAGATAGTTTATTCTTTCGTGGGAA
AGCTGAGGGAAGACTTTTTCAGGTTTATTATAAATTCCAGTCAGAGTCTGAAAAGGAAAGGTTTAAAGAGTTATCTGAAC
TGCATGAGATAGTAGTAAAAGGAAAACTAGCTATTCCTCAAGGAGCAAATAACTTTGCTGGATTTGATTATCGAAACTAT
TTAAAAACACAGGGAATTTATCAGACCTTAACTATATCGGAAATAGTCGAATTAAAGAAAACATACAGCTGGGATATAGG
AGAAAATCTATCCAGCTTGCGGAGAAAAGCAGTTGTCTGGATAAAAAGGAAATTTCCTGATCCCATGCGTAATTATATGA
CAGGTCTTTTATTAGGGCACTTGGACACAGATTTTGAGGAAATGAATGAACTCTACTCAAGTTTAGGAATTATTCATATT
TTTGCACTGTCAGGAATGCAAGTAGGATTCTTCATGAATGCTTTTAAGAAGTTCTTTTTGCGACTGGGGATGAGCCAAGA
AAACTTAAAATGTCTTGTCTATCCATTTTCCTTGGTTTATGCAGGTTTGACTGGATTTTCAGCTTCTGTTATTAGAAGCC
TGTTACAAAAACTTTTGGCTCAACATGGTTTCAAGGGGCTGGATAATTTTGCTTTGACAGTCCTTTTCTTGTTTATCGTG
ATGCCGAATTTCTTCCTAACTGCTGGTGGAGTGCTCTCCTGTGCCTATGCCTTTATCCTGACCATGACTGGTGAAGAAGT
AGCAGGGATAAAAGGTCTGGTGAGAGAAAGTTTCATTATTTCCTTAGGAATTCTGCCAATTTTATCATTCTATTTTTCTG
AATTTCAACCTTGGTCCATACTCTTAACCTTTGTTTTTTCTTTTTTATTTGATATGGTCTTGCTACCGCTTTTATCAATA
CTCTTCTGTCTGTCATGGATATATCCTATTACCCAGTTTAACTTTCTCTTTGAATGGCTAGAAAATATCATACGCTATGT
ATCTCAGCTATCTACTAGGCCTTTTGTTTTTGGTCAACCGAGTCTTTGGGTTCTGGTGTTTCTTTTAATTTCTTTGGCTA
TAGTCTATGACTATCGTAAAAATTTGAAAAAAACACAAATAATTGCACTATTTGTCCTAGCTCTCTTTTTAGTAACCAAA
CATCCACTGGAAAATGAAATCACAGTCCTAGACATGGAGCAGGGACGGAGCATTTTCCTAAGAGACATGACAGGAAAGAC
AATATTACTGGATGTCGGTGAAAAGTTAGCAGTTGAGAAAAAAGAAGCCTGGCAGGAGAAGGTCATAACAAGCAATGCCA
AACGTAGTTTAATCCCCTATATAAAAAGTAGAGGAGTCGCAAAGATTGATCAGCTTGTACTAACGACCAGTCAACCTAAG
CAACTAGACCATCTACTAGAAATTAGTAAATCCTTCAATCTTGGAGAGATTCTAGTAACTGAAGAGACTCTATCTAAAAG
AGAATTTATGGATAAATTAAAGGAAAGTAACTTGAAAGTACGTCCTATTAAAACAGGAGAGCAGTTATTTATTTTTGGGA
GCAGTTTAGAAGTAATCGAAAATCAAAATAGTGATAGTAAATCCTCAATAGTAATGTATGGAAAGCTACTAAATCAAACT
TTTCTAGTCACTGGAAATATAGAGGAGAAGTTCTTAAACAAGTCTTATCCAAAAATCCAAGCAGATGTAGTGCTAACTCA
TCAGCAAGCATCGAAGAAAAAGACAGATGTCAAAGTCTTCGAAATCTTTCAGCCTAAAATCACTGTCATTTCTGTAGACA
AGAAGAAAAAATTTAAAGAAAAAAATGGGGAGATTAACCAAGAACTTGGGAATTCGATTTACAAAACGGATCAAAAGGGG
GCCATTCGTTTTAAAGGTTGGAGTGCTTGGCAAATAGAAACAGTTCGCTGA
ATGTCACAGTGGATTAGTCGATTTCCCATCCCAAAAATCTATCTCAGTTTTCTTTTATTATGGCTCTACTATGCTATTTT
TTCAGCAAGCTTTTTAGCACTTTTGGGCTTTGTCTTTTTACTGCTCTGCCTCTTTTTTCAATTTCCATGGAAGAATGTGG
TCAAGATTCTCTTTGTTTGTAGTTTTTTTGGAAGCTGGTTTATATTTCAAAAATGGCAACAAGAAGAAGTGAGTCAACAT
CTTGTTGACTCCGTTAATACGGTACGAATCTTGCCTGATACTATCAAGTTCAATGGAGATAGTTTATTCTTTCGTGGGAA
AGCTGAGGGAAGACTTTTTCAGGTTTATTATAAATTCCAGTCAGAGTCTGAAAAGGAAAGGTTTAAAGAGTTATCTGAAC
TGCATGAGATAGTAGTAAAAGGAAAACTAGCTATTCCTCAAGGAGCAAATAACTTTGCTGGATTTGATTATCGAAACTAT
TTAAAAACACAGGGAATTTATCAGACCTTAACTATATCGGAAATAGTCGAATTAAAGAAAACATACAGCTGGGATATAGG
AGAAAATCTATCCAGCTTGCGGAGAAAAGCAGTTGTCTGGATAAAAAGGAAATTTCCTGATCCCATGCGTAATTATATGA
CAGGTCTTTTATTAGGGCACTTGGACACAGATTTTGAGGAAATGAATGAACTCTACTCAAGTTTAGGAATTATTCATATT
TTTGCACTGTCAGGAATGCAAGTAGGATTCTTCATGAATGCTTTTAAGAAGTTCTTTTTGCGACTGGGGATGAGCCAAGA
AAACTTAAAATGTCTTGTCTATCCATTTTCCTTGGTTTATGCAGGTTTGACTGGATTTTCAGCTTCTGTTATTAGAAGCC
TGTTACAAAAACTTTTGGCTCAACATGGTTTCAAGGGGCTGGATAATTTTGCTTTGACAGTCCTTTTCTTGTTTATCGTG
ATGCCGAATTTCTTCCTAACTGCTGGTGGAGTGCTCTCCTGTGCCTATGCCTTTATCCTGACCATGACTGGTGAAGAAGT
AGCAGGGATAAAAGGTCTGGTGAGAGAAAGTTTCATTATTTCCTTAGGAATTCTGCCAATTTTATCATTCTATTTTTCTG
AATTTCAACCTTGGTCCATACTCTTAACCTTTGTTTTTTCTTTTTTATTTGATATGGTCTTGCTACCGCTTTTATCAATA
CTCTTCTGTCTGTCATGGATATATCCTATTACCCAGTTTAACTTTCTCTTTGAATGGCTAGAAAATATCATACGCTATGT
ATCTCAGCTATCTACTAGGCCTTTTGTTTTTGGTCAACCGAGTCTTTGGGTTCTGGTGTTTCTTTTAATTTCTTTGGCTA
TAGTCTATGACTATCGTAAAAATTTGAAAAAAACACAAATAATTGCACTATTTGTCCTAGCTCTCTTTTTAGTAACCAAA
CATCCACTGGAAAATGAAATCACAGTCCTAGACATGGAGCAGGGACGGAGCATTTTCCTAAGAGACATGACAGGAAAGAC
AATATTACTGGATGTCGGTGAAAAGTTAGCAGTTGAGAAAAAAGAAGCCTGGCAGGAGAAGGTCATAACAAGCAATGCCA
AACGTAGTTTAATCCCCTATATAAAAAGTAGAGGAGTCGCAAAGATTGATCAGCTTGTACTAACGACCAGTCAACCTAAG
CAACTAGACCATCTACTAGAAATTAGTAAATCCTTCAATCTTGGAGAGATTCTAGTAACTGAAGAGACTCTATCTAAAAG
AGAATTTATGGATAAATTAAAGGAAAGTAACTTGAAAGTACGTCCTATTAAAACAGGAGAGCAGTTATTTATTTTTGGGA
GCAGTTTAGAAGTAATCGAAAATCAAAATAGTGATAGTAAATCCTCAATAGTAATGTATGGAAAGCTACTAAATCAAACT
TTTCTAGTCACTGGAAATATAGAGGAGAAGTTCTTAAACAAGTCTTATCCAAAAATCCAAGCAGATGTAGTGCTAACTCA
TCAGCAAGCATCGAAGAAAAAGACAGATGTCAAAGTCTTCGAAATCTTTCAGCCTAAAATCACTGTCATTTCTGTAGACA
AGAAGAAAAAATTTAAAGAAAAAAATGGGGAGATTAACCAAGAACTTGGGAATTCGATTTACAAAACGGATCAAAAGGGG
GCCATTCGTTTTAAAGGTTGGAGTGCTTGGCAAATAGAAACAGTTCGCTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae D39 |
65.818 |
100 |
0.667 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
65.818 |
100 |
0.667 |
| comEC/celB | Streptococcus pneumoniae R6 |
65.818 |
100 |
0.667 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
65.772 |
100 |
0.666 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
65.416 |
100 |
0.663 |
| comEC/celB | Streptococcus mitis SK321 |
65.282 |
100 |
0.662 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
38.275 |
100 |
0.386 |