Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | DQN42_RS05170 | Genome accession | NZ_LS483436 |
| Coordinates | 1047180..1049414 (-) | Length | 744 a.a. |
| NCBI ID | WP_014829690.1 | Uniprot ID | - |
| Organism | Streptococcus intermedius strain NCTC11324 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1042180..1054414
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| DQN42_RS05145 (NCTC11324_01037) | - | 1042506..1043780 (-) | 1275 | WP_003076495.1 | dihydroorotase | - |
| DQN42_RS05150 (NCTC11324_01038) | - | 1043792..1044262 (-) | 471 | WP_003072602.1 | 8-oxo-dGTP diphosphatase | - |
| DQN42_RS05155 (NCTC11324_01039) | - | 1044289..1044942 (-) | 654 | WP_003072601.1 | uracil-DNA glycosylase | - |
| DQN42_RS05160 (NCTC11324_01040) | - | 1044967..1045953 (-) | 987 | WP_003076968.1 | Gfo/Idh/MocA family protein | - |
| DQN42_RS05165 (NCTC11324_01042) | - | 1046389..1047102 (-) | 714 | WP_003076424.1 | DUF805 domain-containing protein | - |
| DQN42_RS05170 (NCTC11324_01043) | comEC/celB | 1047180..1049414 (-) | 2235 | WP_014829690.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| DQN42_RS05175 (NCTC11324_01044) | comEA/celA/cilE | 1049398..1050102 (-) | 705 | WP_003076279.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| DQN42_RS05180 (NCTC11324_01045) | smpB | 1050270..1050737 (-) | 468 | WP_003076815.1 | SsrA-binding protein SmpB | - |
| DQN42_RS05185 (NCTC11324_01046) | rnr | 1050700..1053039 (-) | 2340 | WP_003076716.1 | ribonuclease R | - |
| DQN42_RS05190 (NCTC11324_01047) | secG | 1053130..1053363 (-) | 234 | WP_003024744.1 | preprotein translocase subunit SecG | - |
| DQN42_RS05195 | rpmG | 1053403..1053552 (-) | 150 | WP_003034401.1 | 50S ribosomal protein L33 | - |
Sequence
Protein
Download Length: 744 a.a. Molecular weight: 85290.35 Da Isoelectric Point: 10.2431
>NTDB_id=1141445 DQN42_RS05170 WP_014829690.1 1047180..1049414(-) (comEC/celB) [Streptococcus intermedius strain NCTC11324]
MSQWIKIFPIKPIYIAFLLVWLYFAIYQSNWLAGVGLIFLLIRLSRIYSLKEWFTTLMILACFAVFFLVRRELANRKIKV
EAPPVRQVAVLPDTIKVNGDSLSFRGKAKGQTYQIYYKLKSKKEKLAFQNLSSLVTLTVEGEFESPEKQRNFSGFDYQAY
LKTQGIYRILKVDQILSSQDRVSLQPFEWLSSWRRKALVFIKRNFPNPMSNYMTGFLFGALDTDFGEMNNLYSSLGIIHL
FALSGMQVGFFMEGFRKSLLRLGLTQEIVHKCQYPFSFFYAGMTGFSVSVVRSLIQKLLSQHGITKLDNFALTIMVLSLI
MPSFLLTAGGVLSCAYAFIISVLDFKGLTPYKKIIIESIVISLGILPILIFYFGEFQPWSILLTFVFSLIFDIVMLPGLT
IIFLVSPFIKLTQVNFLFECLESSIRWLASMFSRPVVLGKPNPLLLIAMLLVLAILYDIRQNKKWLIFLSLFLSLLFFVA
KFPLQNEITMIDVGKGDSIFMRDWRGSTVLIDVGGREEIRKKESWQERISSSNAERTLIPYLKSRGVDTIDTLVLTNPNS
DYAGDVLEVAKKFSIKKIFISRSSLNDADFLNKLKETRAFVHVVKQGDKLPIFDHHLQVLSGTNKNDQSLVLYGQFFRTR
FLFMSNLTEEDEVKLMQLYPKLKTDVLKVGQHGSQNSSSSKFLQQVRPVIALISTGKNNSSKSLSQETIERFNRLNTKIY
RTDKQGAIKFSGWTTWQLETVQQP
MSQWIKIFPIKPIYIAFLLVWLYFAIYQSNWLAGVGLIFLLIRLSRIYSLKEWFTTLMILACFAVFFLVRRELANRKIKV
EAPPVRQVAVLPDTIKVNGDSLSFRGKAKGQTYQIYYKLKSKKEKLAFQNLSSLVTLTVEGEFESPEKQRNFSGFDYQAY
LKTQGIYRILKVDQILSSQDRVSLQPFEWLSSWRRKALVFIKRNFPNPMSNYMTGFLFGALDTDFGEMNNLYSSLGIIHL
FALSGMQVGFFMEGFRKSLLRLGLTQEIVHKCQYPFSFFYAGMTGFSVSVVRSLIQKLLSQHGITKLDNFALTIMVLSLI
MPSFLLTAGGVLSCAYAFIISVLDFKGLTPYKKIIIESIVISLGILPILIFYFGEFQPWSILLTFVFSLIFDIVMLPGLT
IIFLVSPFIKLTQVNFLFECLESSIRWLASMFSRPVVLGKPNPLLLIAMLLVLAILYDIRQNKKWLIFLSLFLSLLFFVA
KFPLQNEITMIDVGKGDSIFMRDWRGSTVLIDVGGREEIRKKESWQERISSSNAERTLIPYLKSRGVDTIDTLVLTNPNS
DYAGDVLEVAKKFSIKKIFISRSSLNDADFLNKLKETRAFVHVVKQGDKLPIFDHHLQVLSGTNKNDQSLVLYGQFFRTR
FLFMSNLTEEDEVKLMQLYPKLKTDVLKVGQHGSQNSSSSKFLQQVRPVIALISTGKNNSSKSLSQETIERFNRLNTKIY
RTDKQGAIKFSGWTTWQLETVQQP
Nucleotide
Download Length: 2235 bp
>NTDB_id=1141445 DQN42_RS05170 WP_014829690.1 1047180..1049414(-) (comEC/celB) [Streptococcus intermedius strain NCTC11324]
ATGTCACAGTGGATTAAGATATTTCCGATCAAACCAATTTACATTGCTTTTTTACTTGTCTGGTTATATTTTGCAATCTA
TCAAAGTAATTGGTTGGCAGGAGTGGGATTGATCTTTCTGTTAATTCGTCTTTCCCGCATATATTCGCTAAAAGAATGGT
TTACAACTTTAATGATTCTCGCTTGTTTTGCTGTTTTTTTCCTTGTTCGTAGGGAACTGGCGAATCGGAAGATAAAAGTA
GAAGCTCCTCCTGTAAGACAAGTTGCGGTTTTACCTGATACAATTAAGGTAAATGGAGATTCGCTTTCTTTTCGTGGTAA
AGCAAAAGGACAGACATATCAAATTTACTACAAATTGAAATCAAAAAAAGAAAAGTTGGCTTTTCAAAATCTATCCAGCC
TTGTCACATTAACTGTTGAGGGGGAATTTGAATCTCCTGAAAAGCAGCGCAATTTTTCTGGTTTTGATTATCAAGCCTAT
CTAAAAACACAAGGGATTTATCGAATTTTAAAGGTGGATCAAATTTTGTCTAGTCAAGATAGGGTTAGCCTCCAACCGTT
TGAGTGGCTGTCTAGCTGGCGAAGAAAAGCATTGGTTTTCATTAAGAGAAATTTCCCGAATCCAATGAGCAATTATATGA
CAGGGTTTTTGTTCGGAGCTTTGGATACAGATTTTGGTGAAATGAATAACCTTTATTCGAGCTTGGGAATTATTCATTTA
TTTGCATTGTCTGGTATGCAAGTTGGTTTTTTTATGGAAGGGTTTCGTAAGTCACTGTTGAGATTGGGGCTTACGCAGGA
AATAGTTCATAAGTGTCAATATCCATTTTCTTTTTTTTATGCTGGAATGACAGGTTTTTCAGTATCCGTTGTACGGAGTT
TAATCCAGAAATTATTGTCACAACACGGCATTACTAAGTTAGATAATTTTGCTTTAACAATAATGGTGTTGTCCTTGATT
ATGCCCTCCTTTCTTTTAACAGCAGGAGGAGTACTTTCTTGTGCGTATGCTTTTATCATTAGCGTTTTAGATTTTAAAGG
CCTGACTCCTTATAAAAAGATTATTATAGAGAGTATTGTCATTTCGCTTGGCATTTTACCAATTTTAATCTTTTATTTCG
GAGAATTTCAGCCCTGGTCTATTTTATTGACATTTGTTTTTTCGTTGATTTTCGATATAGTGATGTTACCGGGGCTAACG
ATAATTTTTCTTGTTTCCCCTTTCATAAAACTCACTCAAGTTAATTTTCTATTTGAATGCTTAGAAAGTAGCATTCGCTG
GTTAGCAAGTATGTTTAGCAGACCAGTCGTTCTTGGCAAACCTAATCCACTTTTGCTAATCGCTATGTTGCTTGTATTAG
CTATCTTGTATGATATTCGGCAAAATAAAAAATGGCTGATATTTCTTAGTCTGTTTCTTTCATTACTCTTTTTTGTAGCT
AAATTTCCTTTACAAAATGAAATCACAATGATTGATGTCGGGAAGGGAGATAGTATTTTTATGCGAGACTGGAGAGGGAG
CACTGTATTGATTGATGTTGGCGGACGTGAAGAAATTAGAAAAAAAGAAAGTTGGCAAGAACGTATAAGTAGTTCAAACG
CAGAGAGAACGCTGATTCCATATCTTAAAAGTCGTGGTGTAGATACGATTGATACTTTAGTCTTAACAAATCCAAATTCA
GATTATGCAGGAGATGTATTAGAAGTTGCTAAAAAGTTTTCGATAAAGAAAATTTTTATTTCCAGAAGTAGTTTGAATGA
TGCAGACTTTTTAAATAAATTAAAGGAGACAAGGGCATTTGTTCATGTCGTAAAACAAGGAGACAAACTTCCTATTTTTG
ATCATCATTTGCAAGTTCTTTCTGGTACGAATAAGAATGATCAATCGCTAGTTTTATATGGTCAATTTTTTCGTACAAGG
TTTTTATTTATGAGCAATTTAACAGAAGAGGATGAAGTAAAGCTAATGCAACTTTATCCAAAACTAAAAACAGATGTCTT
GAAGGTTGGGCAACATGGATCCCAAAATTCATCAAGTTCTAAGTTCTTACAGCAAGTAAGACCAGTGATTGCCCTCATCT
CTACTGGGAAAAACAATTCATCCAAATCGCTAAGTCAAGAAACAATTGAGCGATTCAATCGATTAAATACAAAGATCTAT
CGAACAGATAAACAGGGGGCTATTAAGTTTTCGGGTTGGACAACATGGCAATTAGAAACTGTTCAGCAACCATAG
ATGTCACAGTGGATTAAGATATTTCCGATCAAACCAATTTACATTGCTTTTTTACTTGTCTGGTTATATTTTGCAATCTA
TCAAAGTAATTGGTTGGCAGGAGTGGGATTGATCTTTCTGTTAATTCGTCTTTCCCGCATATATTCGCTAAAAGAATGGT
TTACAACTTTAATGATTCTCGCTTGTTTTGCTGTTTTTTTCCTTGTTCGTAGGGAACTGGCGAATCGGAAGATAAAAGTA
GAAGCTCCTCCTGTAAGACAAGTTGCGGTTTTACCTGATACAATTAAGGTAAATGGAGATTCGCTTTCTTTTCGTGGTAA
AGCAAAAGGACAGACATATCAAATTTACTACAAATTGAAATCAAAAAAAGAAAAGTTGGCTTTTCAAAATCTATCCAGCC
TTGTCACATTAACTGTTGAGGGGGAATTTGAATCTCCTGAAAAGCAGCGCAATTTTTCTGGTTTTGATTATCAAGCCTAT
CTAAAAACACAAGGGATTTATCGAATTTTAAAGGTGGATCAAATTTTGTCTAGTCAAGATAGGGTTAGCCTCCAACCGTT
TGAGTGGCTGTCTAGCTGGCGAAGAAAAGCATTGGTTTTCATTAAGAGAAATTTCCCGAATCCAATGAGCAATTATATGA
CAGGGTTTTTGTTCGGAGCTTTGGATACAGATTTTGGTGAAATGAATAACCTTTATTCGAGCTTGGGAATTATTCATTTA
TTTGCATTGTCTGGTATGCAAGTTGGTTTTTTTATGGAAGGGTTTCGTAAGTCACTGTTGAGATTGGGGCTTACGCAGGA
AATAGTTCATAAGTGTCAATATCCATTTTCTTTTTTTTATGCTGGAATGACAGGTTTTTCAGTATCCGTTGTACGGAGTT
TAATCCAGAAATTATTGTCACAACACGGCATTACTAAGTTAGATAATTTTGCTTTAACAATAATGGTGTTGTCCTTGATT
ATGCCCTCCTTTCTTTTAACAGCAGGAGGAGTACTTTCTTGTGCGTATGCTTTTATCATTAGCGTTTTAGATTTTAAAGG
CCTGACTCCTTATAAAAAGATTATTATAGAGAGTATTGTCATTTCGCTTGGCATTTTACCAATTTTAATCTTTTATTTCG
GAGAATTTCAGCCCTGGTCTATTTTATTGACATTTGTTTTTTCGTTGATTTTCGATATAGTGATGTTACCGGGGCTAACG
ATAATTTTTCTTGTTTCCCCTTTCATAAAACTCACTCAAGTTAATTTTCTATTTGAATGCTTAGAAAGTAGCATTCGCTG
GTTAGCAAGTATGTTTAGCAGACCAGTCGTTCTTGGCAAACCTAATCCACTTTTGCTAATCGCTATGTTGCTTGTATTAG
CTATCTTGTATGATATTCGGCAAAATAAAAAATGGCTGATATTTCTTAGTCTGTTTCTTTCATTACTCTTTTTTGTAGCT
AAATTTCCTTTACAAAATGAAATCACAATGATTGATGTCGGGAAGGGAGATAGTATTTTTATGCGAGACTGGAGAGGGAG
CACTGTATTGATTGATGTTGGCGGACGTGAAGAAATTAGAAAAAAAGAAAGTTGGCAAGAACGTATAAGTAGTTCAAACG
CAGAGAGAACGCTGATTCCATATCTTAAAAGTCGTGGTGTAGATACGATTGATACTTTAGTCTTAACAAATCCAAATTCA
GATTATGCAGGAGATGTATTAGAAGTTGCTAAAAAGTTTTCGATAAAGAAAATTTTTATTTCCAGAAGTAGTTTGAATGA
TGCAGACTTTTTAAATAAATTAAAGGAGACAAGGGCATTTGTTCATGTCGTAAAACAAGGAGACAAACTTCCTATTTTTG
ATCATCATTTGCAAGTTCTTTCTGGTACGAATAAGAATGATCAATCGCTAGTTTTATATGGTCAATTTTTTCGTACAAGG
TTTTTATTTATGAGCAATTTAACAGAAGAGGATGAAGTAAAGCTAATGCAACTTTATCCAAAACTAAAAACAGATGTCTT
GAAGGTTGGGCAACATGGATCCCAAAATTCATCAAGTTCTAAGTTCTTACAGCAAGTAAGACCAGTGATTGCCCTCATCT
CTACTGGGAAAAACAATTCATCCAAATCGCTAAGTCAAGAAACAATTGAGCGATTCAATCGATTAAATACAAAGATCTAT
CGAACAGATAAACAGGGGGCTATTAAGTTTTCGGGTTGGACAACATGGCAATTAGAAACTGTTCAGCAACCATAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
55.689 |
100 |
0.559 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
55.496 |
100 |
0.556 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
54.485 |
100 |
0.547 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
53.949 |
100 |
0.542 |
| comEC/celB | Streptococcus pneumoniae D39 |
53.949 |
100 |
0.542 |
| comEC/celB | Streptococcus pneumoniae R6 |
53.949 |
100 |
0.542 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
46.03 |
99.866 |
0.46 |