Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | GO995_RS03165 | Genome accession | NZ_CP046624 |
| Coordinates | 593213..595447 (+) | Length | 744 a.a. |
| NCBI ID | WP_202913550.1 | Uniprot ID | - |
| Organism | Streptococcus ruminicola strain CNU_G3 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
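The coordinates and the protein length in the Overview can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, although it agrees with the 2235 bp size listed for this locus) and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Overview table.
start, end = 593213, 595447      # assumed 1-based, inclusive, (+) strand
protein_length_aa = 744

# An inclusive span covers end - start + 1 positions.
gene_length_bp = end - start + 1

# Each codon is 3 nt; the count includes the terminal stop codon.
codons = gene_length_bp // 3

assert gene_length_bp % 3 == 0
assert codons - 1 == protein_length_aa
print(gene_length_bp, codons)
```

Under these assumptions the 2235 bp span holds 745 codons: 744 amino acids plus one stop codon.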
Genomic Context
Location: 588213..600447
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| GO995_RS03140 (GO995_03140) | - | 588958..590532 (+) | 1575 | WP_157628821.1 | DEAD/DEAH box helicase | - |
| GO995_RS03145 (GO995_03145) | - | 590572..590847 (-) | 276 | WP_157628822.1 | GIY-YIG nuclease family protein | - |
| GO995_RS03150 (GO995_03150) | - | 590822..591583 (-) | 762 | WP_157628823.1 | tRNA1(Val) (adenine(37)-N6)-methyltransferase | - |
| GO995_RS03155 (GO995_03155) | - | 591700..592446 (+) | 747 | WP_074559556.1 | lysophospholipid acyltransferase family protein | - |
| GO995_RS03160 (GO995_03160) | - | 592543..593223 (+) | 681 | WP_157628824.1 | helix-hairpin-helix domain-containing protein | - |
| GO995_RS03165 (GO995_03165) | comEC/celB | 593213..595447 (+) | 2235 | WP_202913550.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| GO995_RS03170 (GO995_03170) | holA | 595512..596549 (+) | 1038 | WP_157628826.1 | DNA polymerase III subunit delta | - |
| GO995_RS03175 (GO995_03175) | sodA | 596646..597257 (+) | 612 | WP_039698404.1 | superoxide dismutase SodA | - |
| GO995_RS03180 (GO995_03180) | - | 597368..598093 (+) | 726 | WP_024344280.1 | YebC/PmpR family DNA-binding transcriptional regulator | - |
| GO995_RS03185 (GO995_03185) | queA | 598145..599173 (-) | 1029 | WP_157628827.1 | tRNA preQ1(34) S-adenosylmethionine ribosyltransferase-isomerase QueA | - |
| GO995_RS03190 (GO995_03190) | - | 599337..599843 (+) | 507 | WP_033152227.1 | VIT1/CCC1 transporter family protein | - |
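The table above can be checked for internal consistency, and the spacing between neighbouring genes can be derived from it. The following is a minimal sketch, assuming 1-based inclusive coordinates; the `genes` list and the variable names are illustrative and simply restate the table rows.

```python
# (locus tag, start, end, strand, listed size in bp), copied from the table.
genes = [
    ("GO995_RS03140", 588958, 590532, "+", 1575),
    ("GO995_RS03145", 590572, 590847, "-", 276),
    ("GO995_RS03150", 590822, 591583, "-", 762),
    ("GO995_RS03155", 591700, 592446, "+", 747),
    ("GO995_RS03160", 592543, 593223, "+", 681),
    ("GO995_RS03165", 593213, 595447, "+", 2235),
    ("GO995_RS03170", 595512, 596549, "+", 1038),
    ("GO995_RS03175", 596646, 597257, "+", 612),
    ("GO995_RS03180", 597368, 598093, "+", 726),
    ("GO995_RS03185", 598145, 599173, "-", 1029),
    ("GO995_RS03190", 599337, 599843, "+", 507),
]

# Assumption: inclusive coordinates, so size = end - start + 1.
for tag, start, end, strand, size in genes:
    assert end - start + 1 == size, tag

# Intergenic gap (positive) or overlap (negative) between consecutive genes.
spacing = {
    (a[0], b[0]): b[1] - a[2] - 1
    for a, b in zip(genes, genes[1:])
}

# Keep only overlapping neighbours, reported as a positive overlap in bp.
overlaps = {pair: -gap for pair, gap in spacing.items() if gap < 0}
print(overlaps)
```

With the listed coordinates, comEC/celB (GO995_RS03165) overlaps the upstream helix-hairpin-helix gene (GO995_RS03160) by 11 bp, and GO995_RS03145 overlaps GO995_RS03150 by 26 bp.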
Sequence
Protein
Length: 744 a.a.; Molecular weight: 84221.43 Da; Isoelectric point: 8.2166
>NTDB_id=406161 GO995_RS03165 WP_202913550.1 593213..595447(+) (comEC/celB) [Streptococcus ruminicola strain CNU_G3]
MLIKHFPVKPIQLAFLLTLLYYFCFEKVVLCLILLLLALLFLWQQYGWKAWCQVSLCLAFFAIFFLTKSNQTEQAYQAAP
AQLSKIQMVPDTISVNGDLLSFRGKEAGQTYQVFYTLKSEKEQKFFKNLNQTMLLSGQVDLEEATPQRNFGGFDYRTYLK
HEGIYRIANLSSITEMRQQSSLSFFERLHELRRQAIVSIQRNFPAPMQHYMTGLLFGYLDKSFDEMSDVYTSLGIIHLFA
LSGMQVGFFVGVFRYFFLRLGLRRDYVDILQIPFSLVYAGLTGFSVSVVRSLIQSAFANLGIKKIDNLAFTLMTLFILMP
NFLLTTGGILSFTYAFILSFIDFDELSHYRKILTECLAISLGSLPVLLYFFSVFQPLSLLLTAIFSLAFDTLILPVLTFV
FLLSPLVKITALNPFFVFLETIIKFSKSLVGTSLVFGRPSLGILLLLLLTIGLLYDCYRRKKVVIFLLSLIALLFFQIKH
PLENEVTVVDIGQGDSIFVRDVKGHTLLIDVGGKVSFNQKEGWQERLSDSNADRTLIPYLNSRGVGKIDQLVLTHTDTDH
MGDMLEVAKELKIGQVLVSPGSLTKPDFVVKLRQMKTPVRAVSAGDNLPIMGSHLQVLYPIEVGDGSNNDSVVLYGNLLG
KNFLFTGDLEEEGEREVMAAYPNLPVDVLKAGHHGSKGSSSPEFLEHISPQVALISAGQNNRYKHPHQETLERFQAQNMT
IYRTDEQGAIRFRGLNHWKIETVR
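The FASTA header used in this record packs several fields into one line. The sketch below shows one way to split it into named fields. The regular expression and the `parse_header` helper are hypothetical: they are inferred from this single header only, and other records may follow a different layout.

```python
import re

header = (
    ">NTDB_id=406161 GO995_RS03165 WP_202913550.1 593213..595447(+) "
    "(comEC/celB) [Streptococcus ruminicola strain CNU_G3]"
)

# Hypothetical pattern inferred from the header above.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; coordinates become integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


record = parse_header(header)
print(record["locus_tag"], record["gene"], record["strand"])
```

Raising an error on a non-matching line is a deliberate choice here, so that a header in an unexpected layout is noticed rather than silently skipped.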
Nucleotide
Length: 2235 bp
>NTDB_id=406161 GO995_RS03165 WP_202913550.1 593213..595447(+) (comEC/celB) [Streptococcus ruminicola strain CNU_G3]
CTGTTGATTAAGCATTTCCCTGTCAAGCCAATTCAATTGGCTTTTTTGCTCACTTTACTTTATTATTTTTGTTTTGAAAA
AGTGGTGTTATGTCTGATTTTACTCCTGTTAGCCTTGCTGTTTTTATGGCAACAGTATGGTTGGAAGGCTTGGTGTCAGG
TTTCACTGTGCTTGGCTTTCTTTGCGATATTTTTCCTGACTAAGTCTAATCAGACAGAACAGGCTTATCAAGCGGCGCCA
GCTCAGCTATCAAAAATCCAGATGGTTCCAGATACGATTTCGGTTAATGGTGATTTGTTATCTTTTCGAGGGAAGGAGGC
TGGGCAGACTTACCAAGTTTTTTATACCTTGAAGTCAGAAAAAGAGCAGAAATTTTTTAAAAATCTTAATCAAACAATGC
TTTTATCAGGGCAGGTGGATTTAGAAGAGGCAACGCCACAGCGGAATTTTGGTGGTTTTGATTACCGCACTTACCTTAAA
CATGAAGGCATTTATCGGATTGCTAATCTTTCTTCGATTACTGAAATGAGACAGCAGAGCTCTTTAAGTTTTTTTGAACG
ACTACATGAACTTAGAAGACAGGCGATTGTCAGTATTCAAAGGAATTTTCCTGCCCCAATGCAACATTATATGACTGGCC
TTTTGTTTGGATATTTGGATAAGTCTTTTGATGAGATGAGTGATGTATACACGAGTTTAGGGATTATTCATCTTTTTGCT
CTGTCTGGTATGCAGGTTGGCTTTTTTGTTGGTGTTTTTCGCTATTTCTTTTTGCGACTTGGGTTAAGGCGTGATTATGT
GGATATACTCCAAATCCCGTTTTCACTTGTATATGCTGGCTTGACAGGCTTTTCGGTTTCAGTTGTTCGGAGCCTTATTC
AGTCTGCTTTTGCAAATCTTGGCATTAAAAAAATAGATAATCTTGCCTTTACTTTAATGACTCTTTTTATTTTGATGCCA
AATTTCCTGCTGACGACGGGAGGTATTCTTTCGTTTACTTACGCTTTTATCCTGAGTTTTATTGATTTTGATGAGCTTAG
TCATTATCGAAAAATCCTTACAGAATGTTTGGCGATTTCGCTGGGAAGTTTACCGGTTTTGCTATATTTCTTCTCAGTAT
TTCAACCCTTGTCGCTGCTTTTAACAGCTATTTTTTCATTGGCATTTGATACGCTTATCTTACCTGTTTTGACCTTTGTT
TTTCTTTTATCACCTCTGGTTAAAATTACTGCGTTAAATCCATTTTTTGTTTTTCTAGAAACTATCATCAAATTTAGCAA
GTCGCTGGTTGGTACCTCACTCGTTTTTGGCAGACCAAGTCTAGGAATTTTGTTGTTATTACTTTTGACAATAGGGCTTC
TCTATGACTGTTATCGACGTAAAAAAGTGGTCATTTTCCTTCTTAGCTTAATAGCTCTGCTATTTTTCCAAATCAAGCAT
CCGCTGGAAAACGAAGTGACGGTGGTTGACATTGGGCAGGGAGATAGTATTTTTGTGCGTGATGTTAAAGGGCATACGCT
GTTAATTGATGTCGGAGGAAAAGTGAGTTTTAACCAAAAAGAAGGCTGGCAGGAGCGTTTGTCTGACAGCAATGCAGATC
GAACCTTGATTCCTTATCTCAATAGCCGTGGTGTTGGAAAAATTGACCAGCTGGTTTTGACGCACACGGATACCGATCAT
ATGGGTGATATGCTGGAAGTCGCAAAAGAGCTGAAAATTGGTCAAGTCCTAGTGAGTCCAGGAAGTCTCACAAAGCCAGA
TTTTGTGGTTAAACTTCGACAAATGAAGACGCCAGTTCGCGCAGTTTCTGCAGGAGATAACTTGCCAATTATGGGAAGTC
ACTTGCAGGTGCTCTATCCAATTGAAGTTGGTGATGGTTCTAATAATGATTCTGTGGTGCTTTATGGCAATCTTCTAGGT
AAAAATTTTTTGTTTACAGGAGATTTGGAAGAAGAAGGTGAGCGTGAGGTGATGGCGGCTTATCCTAATCTGCCAGTTGA
TGTTTTAAAAGCAGGACACCATGGTTCAAAAGGGTCATCAAGCCCAGAATTTCTAGAGCATATTTCACCGCAAGTTGCTC
TGATTTCCGCAGGACAAAATAACCGCTACAAGCATCCTCATCAAGAAACCCTCGAACGCTTTCAAGCACAAAACATGACC
ATTTACCGTACTGATGAACAAGGTGCTATCCGCTTTAGAGGATTAAACCACTGGAAAATTGAAACGGTGAGATGA
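The nucleotide sequence begins with CTG rather than ATG, while the protein sequence begins with M. CTG is listed as an alternative initiation codon in NCBI translation table 11 (bacterial), and an initiation codon is read as methionine. The sketch below translates the first 30 nt and the last 75 nt of the sequence above to illustrate this. The `translate` helper and its `is_start` flag are illustrative assumptions, and the codon table is the standard one, which for these codons gives the same residues as table 11.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, is_start: bool = False) -> str:
    """Translate in frame 1; '*' marks a stop codon.

    If is_start is True, the first codon is assumed to be a valid
    initiation codon and is reported as M.
    """
    usable = len(dna) - len(dna) % 3          # ignore a trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = "".join(CODON_TABLE[c] for c in codons)
    if is_start and protein:
        protein = "M" + protein[1:]
    return protein


# First 30 nt and last 75 nt of the nucleotide sequence in this record.
first_30 = "CTGTTGATTAAGCATTTCCCTGTCAAGCCA"
last_75 = (
    "ATTTACCGTACTGATGAACAAGGTGCTATCCGCTTTAGAGGATTAAACCACTGGAAAATT"
    "GAAACGGTGAGATGA"
)

n_term = translate(first_30, is_start=True)
c_term = translate(last_75)
print(n_term, c_term)
```

The two translated fragments agree with the start and end of the protein sequence given in this record; the trailing `*` corresponds to the TGA stop codon.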
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis NCTC 12261 | 50.943 | 99.731 | 0.508 |
| comEC/celB | Streptococcus mitis SK321 | 50.539 | 99.731 | 0.504 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 49.798 | 99.866 | 0.497 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 49.26 | 99.866 | 0.492 |
| comEC/celB | Streptococcus pneumoniae D39 | 49.26 | 99.866 | 0.492 |
| comEC/celB | Streptococcus pneumoniae R6 | 49.26 | 99.866 | 0.492 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 47.664 | 100 | 0.48 |