Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | BZG42_RS06490 | Genome accession | NZ_CP019557 |
| Coordinates | 1374215..1376449 (-) | Length | 744 a.a. |
| NCBI ID | WP_155995986.1 | Uniprot ID | - |
| Organism | Streptococcus sp. DAT741 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
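The coordinates and the two lengths reported above are mutually consistent, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates (the usual convention in GenBank-style annotation) and that the stop codon counts toward the nucleotide length but not the protein length; the function names are illustrative and not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS of cds_bp bases, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates of BZG42_RS06490 as listed in the Overview table.
gene_bp = span_length(1374215, 1376449)
print(gene_bp, protein_length(gene_bp))
```

Under these assumptions the range 1374215..1376449 spans 2235 bp, that is 745 codons including the stop, or 744 residues.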
Genomic Context
Location: 1369215..1381449
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| BZG42_RS06460 | - | 1369921..1370085 (-) | 165 | WP_006627244.1 | transposon-encoded TnpW family protein | - |
| BZG42_RS06465 (BZG42_06540) | - | 1370212..1370835 (-) | 624 | WP_155995982.1 | helix-turn-helix transcriptional regulator | - |
| BZG42_RS10255 (BZG42_06545) | - | 1371081..1371277 (-) | 197 | Protein_1234 | recombinase | - |
| BZG42_RS06475 (BZG42_06550) | sodA | 1371463..1372074 (-) | 612 | WP_024532641.1 | superoxide dismutase SodA | - |
| BZG42_RS06480 (BZG42_06555) | holA | 1372171..1373202 (-) | 1032 | WP_120172488.1 | DNA polymerase III subunit delta | - |
| BZG42_RS06485 (BZG42_06560) | - | 1373458..1374069 (-) | 612 | WP_155995984.1 | hypothetical protein | - |
| BZG42_RS06490 (BZG42_06565) | comEC/celB | 1374215..1376449 (-) | 2235 | WP_155995986.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| BZG42_RS06495 (BZG42_06570) | comEA | 1376433..1377089 (-) | 657 | WP_024532637.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| BZG42_RS06500 (BZG42_06575) | - | 1377174..1377917 (-) | 744 | WP_024532636.1 | 1-acyl-sn-glycerol-3-phosphate acyltransferase | - |
| BZG42_RS06505 (BZG42_06580) | - | 1378107..1379552 (+) | 1446 | WP_155995988.1 | UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase | - |
| BZG42_RS06510 (BZG42_06585) | - | 1379911..1380699 (+) | 789 | WP_155995990.1 | ABC transporter ATP-binding protein | - |
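The Location range appears to be the gene's own coordinates extended by 5,000 bp on each side. That is an inference from the numbers, not something the record states. The sketch below reproduces the window and also computes the spacing between comEC/celB and its two immediate neighbours from the table; `FLANK`, `context_window` and `gap` are hypothetical names chosen for illustration.

```python
# Coordinates copied from the Genomic Context table (1-based, inclusive).
genes = [
    ("BZG42_RS06485", 1373458, 1374069),  # hypothetical protein
    ("BZG42_RS06490", 1374215, 1376449),  # comEC/celB
    ("BZG42_RS06495", 1376433, 1377089),  # comEA
]

FLANK = 5000  # assumed flank size, inferred from the Location range


def context_window(start, end, flank=FLANK):
    """Extend a gene's range by `flank` bp on both sides (not below 1)."""
    return max(1, start - flank), end + flank


def gap(left, right):
    """Bases between two genes; a negative value means they overlap."""
    return right[1] - left[2] - 1


print(context_window(1374215, 1376449))
for left, right in zip(genes, genes[1:]):
    print(left[0], right[0], gap(left, right))
```

With these coordinates the comEC/celB and comEA annotations overlap by 17 bp (1376433..1376449), while 145 bp separate comEC/celB from BZG42_RS06485.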
Sequence
Protein
Length: 744 a.a.; Molecular weight: 84934.80 Da; Isoelectric point: 9.7221
>NTDB_id=216226 BZG42_RS06490 WP_155995986.1 1374215..1376449(-) (comEC/celB) [Streptococcus sp. DAT741]
MLRLDKLPCQPIHLAILAVVTYFTVHRFSMLTMSLLILLLVIYWRRQGKLTFFKMLPLLVLCGLFFGCQKLKWKQEEQAV
SKQVTAVEIVPDTININGDQLSFRARTDGQIYQGFYKLTSQKEQEYFQKLTALVQVEVKAETSLPSGQRNFNGFDYQAYL
KTQGIYRILTITTINKIVPIHSWNPMDWLSIWRRKALVFIKSNFPSPMSHYMTGLLFGELDSDFDQMSALYSSLGIIHLF
ALSGMQVGFFIDKFRWILLRLGLTKETVDQLQVPFSLVYAGLTGFSVSVVRSLIQKILSNLGLRKLDNFAATLFICFLVQ
PRFLLTVGGVLTFTYAFLLTVFDFEDLKQIKKVVVESLSISLGILPILMTYFFAFQPLSILLTFFFSFVFDVLLLPSLSV
IFILSPLVKITWVNSFFVLMEKMIIWVSDLGFRPLILGKPSDFILLVLLLGFFMLYDFHRQKKWLLGLSLLLTLLFFITK
HPLENEVTVVDIGQGDSIFLRDMQGRTVLIDVGGRVDFTVKEGWKARSRQANAERTLIPYLHSRGVDKIDSLVLTHTDTD
HVGDVLEVAKNFKIGKIYVSSGSLTVPSFVTTLKKINVPVHVVQVGDRLPIVDSYLEVLYPSRSGDGSNNDSIVLYGRLL
GINFLFTGDLEEGELELIKTYPKLSVDVLKAGHHGSKGSSYPEFLDHIGAKIALISAGEKNRYKHPHQETLERFEQKNMQ
IYRTDRQGAIRFRGWRSWSIETVR
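The FASTA header above packs several of the Overview fields into one line. A minimal sketch of pulling them apart with a regular expression follows; the pattern is inferred from this single record, so the field order and the field names used here are assumptions rather than a documented format.

```python
import re

# Pattern inferred from the header in this record only (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header of the form shown above into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the expected layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=216226 BZG42_RS06490 WP_155995986.1 "
    "1374215..1376449(-) (comEC/celB) [Streptococcus sp. DAT741]"
)
print(parse_header(header))
```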
Nucleotide
Length: 2235 bp
>NTDB_id=216226 BZG42_RS06490 WP_155995986.1 1374215..1376449(-) (comEC/celB) [Streptococcus sp. DAT741]
ATGTTACGGTTGGATAAGCTCCCCTGTCAGCCTATACATCTAGCCATTTTGGCGGTCGTGACCTACTTTACGGTTCACCG
TTTTTCGATGTTGACAATGAGCCTTCTGATATTATTATTGGTAATCTATTGGCGGCGTCAGGGGAAACTTACTTTTTTTA
AAATGCTGCCGCTTCTAGTCTTATGCGGTCTATTCTTCGGCTGCCAAAAGCTCAAATGGAAACAGGAAGAGCAGGCTGTT
TCAAAGCAAGTGACTGCTGTAGAGATTGTGCCGGATACTATAAATATCAATGGCGATCAGTTATCTTTCCGTGCTAGGAC
TGATGGTCAGATTTATCAAGGTTTCTATAAGCTAACTAGTCAGAAAGAACAGGAGTATTTCCAAAAGCTAACAGCTCTTG
TTCAGGTGGAGGTGAAGGCAGAGACGAGTCTCCCATCAGGTCAGCGGAATTTCAATGGTTTTGATTATCAAGCCTATTTA
AAAACACAGGGAATCTATCGAATCCTCACCATAACGACTATCAATAAGATTGTACCTATTCACTCTTGGAACCCTATGGA
TTGGTTGTCAATTTGGCGGAGAAAAGCCTTGGTTTTCATCAAATCTAATTTTCCTTCCCCCATGAGCCATTACATGACAG
GGTTGCTTTTTGGAGAGTTGGACAGTGACTTTGATCAGATGAGCGCCCTTTACTCCAGTCTGGGGATTATCCATCTCTTT
GCTCTATCAGGAATGCAGGTAGGTTTTTTCATTGATAAGTTTCGCTGGATTTTACTGCGTTTAGGCTTGACAAAGGAGAC
AGTTGATCAATTACAAGTCCCCTTTTCTTTGGTCTATGCTGGTTTGACAGGCTTTTCTGTATCAGTCGTCCGGTCTTTGA
TTCAGAAAATTTTGAGCAACCTGGGGCTTCGGAAGCTGGATAATTTTGCTGCAACACTCTTTATTTGTTTCTTGGTGCAG
CCTCGTTTTCTCCTGACAGTTGGAGGTGTCTTGACATTTACCTATGCTTTTTTGTTGACAGTCTTTGATTTTGAGGATTT
GAAGCAAATCAAAAAGGTTGTAGTGGAAAGTCTGAGTATTTCTCTGGGGATTCTGCCAATTCTGATGACCTATTTTTTCG
CTTTTCAGCCTCTATCTATTCTACTAACCTTCTTCTTTTCCTTTGTTTTTGATGTATTGCTTTTACCGAGTCTATCTGTA
ATTTTTATTTTATCTCCTTTGGTCAAGATTACTTGGGTCAATTCTTTCTTTGTTTTGATGGAGAAAATGATTATTTGGGT
ATCGGATTTGGGTTTTCGACCTTTGATACTAGGGAAACCGTCTGACTTCATCTTGTTAGTCTTGTTACTGGGATTTTTCA
TGCTTTACGATTTTCATAGACAGAAAAAATGGCTGTTAGGACTGAGTTTGCTCCTTACTCTGCTATTTTTCATTACAAAA
CATCCATTAGAAAATGAAGTAACTGTGGTAGATATCGGACAGGGAGATAGTATCTTTTTGCGAGATATGCAGGGGCGTAC
GGTTCTAATTGACGTTGGTGGACGGGTGGATTTTACGGTTAAGGAAGGCTGGAAGGCGCGATCCCGTCAGGCTAATGCAG
AGAGGACCTTGATTCCTTATCTACATAGTCGAGGTGTGGACAAGATTGACAGTCTGGTGTTGACGCACACCGATACTGAC
CATGTGGGCGATGTTCTGGAAGTAGCTAAGAATTTTAAGATCGGAAAAATCTATGTGTCTTCAGGTAGTTTGACAGTCCC
TAGCTTTGTAACCACCTTGAAAAAAATCAATGTACCTGTTCATGTGGTGCAGGTGGGTGATCGCCTACCGATCGTTGATT
CCTATTTGGAAGTGCTCTATCCGAGCCGATCGGGTGATGGTAGCAACAATGATTCTATTGTCCTCTATGGACGGCTCCTT
GGAATCAATTTTCTGTTTACGGGTGATCTGGAGGAGGGGGAATTAGAGCTGATAAAAACCTATCCCAAATTGTCAGTTGA
TGTTCTAAAAGCCGGACACCATGGTTCAAAAGGTTCTTCTTATCCTGAATTTTTAGACCATATTGGAGCTAAGATTGCCT
TAATCTCGGCTGGCGAGAAAAATCGCTACAAGCATCCCCATCAAGAAACCTTGGAGCGTTTTGAACAGAAGAATATGCAA
ATCTATCGGACAGACCGGCAAGGAGCTATTCGCTTTAGAGGTTGGCGAAGTTGGTCAATTGAAACGGTGAGATGA
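Although the gene lies on the minus strand, the nucleotide sequence as displayed begins with ATG and ends with TGA, so it appears to be given in coding orientation. The sketch below translates short excerpts to check that the nucleotide and protein records agree; it assumes the standard codon assignments and builds the codon table from the conventional TCAG ordering. `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0; unknown codons become 'X', stops '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 and last 15 bases of the nucleotide record above.
print(translate("ATGTTACGGTTGGATAAGCTCCCCTGTCAG"))
print(translate("GAAACGGTGAGATGA"))
```

The two excerpts should correspond to the first ten residues of the protein record and to its last four residues followed by the stop codon.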
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 | 53.414 | 100 | 0.536 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 52.83 | 99.731 | 0.527 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 51.682 | 99.866 | 0.516 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 51.144 | 99.866 | 0.511 |
| comEC/celB | Streptococcus pneumoniae D39 | 51.144 | 99.866 | 0.511 |
| comEC/celB | Streptococcus pneumoniae R6 | 51.144 | 99.866 | 0.511 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 47.233 | 99.597 | 0.47 |