Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | V6U66_RS08345 | Genome accession | NZ_CP145864 |
| Coordinates | 1799910..1802150 (-) | Length | 746 a.a. |
| NCBI ID | WP_155213929.1 | Uniprot ID | - |
| Organism | Streptococcus salivarius strain KSS7 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1794910..1807150
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| V6U66_RS08325 (V6U66_08315) | - | 1795774..1796634 (-) | 861 | WP_004183173.1 | methionyl aminopeptidase | - |
| V6U66_RS08330 (V6U66_08320) | spxR | 1796678..1797958 (-) | 1281 | WP_410534456.1 | CBS-HotDog domain-containing transcription factor SpxR | - |
| V6U66_RS08335 (V6U66_08325) | - | 1797951..1798496 (-) | 546 | WP_013991031.1 | GNAT family N-acetyltransferase | - |
| V6U66_RS08340 (V6U66_08330) | - | 1798562..1799824 (-) | 1263 | WP_013991032.1 | UDP-N-acetylglucosamine 1-carboxyvinyltransferase | - |
| V6U66_RS08345 (V6U66_08335) | comEC/celB | 1799910..1802150 (-) | 2241 | WP_155213929.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| V6U66_RS08350 (V6U66_08340) | comEA | 1802140..1802838 (-) | 699 | WP_084870986.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| V6U66_RS08355 (V6U66_08345) | - | 1802947..1803702 (-) | 756 | WP_084870987.1 | lysophospholipid acyltransferase family protein | - |
| V6U66_RS08360 (V6U66_08350) | - | 1803832..1804767 (+) | 936 | WP_118172091.1 | polysaccharide deacetylase family protein | - |
| V6U66_RS08365 (V6U66_08355) | - | 1804817..1805581 (+) | 765 | WP_118172087.1 | tRNA1(Val) (adenine(37)-N6)-methyltransferase | - |
| V6U66_RS08370 (V6U66_08360) | - | 1805585..1805854 (+) | 270 | WP_118172086.1 | GIY-YIG nuclease family protein | - |
| V6U66_RS08375 (V6U66_08365) | - | 1805946..1806332 (-) | 387 | WP_002886743.1 | IS110 family transposase | - |
| V6U66_RS08380 (V6U66_08370) | - | 1806352..1807113 (-) | 762 | WP_002886742.1 | IS110 family transposase | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84524.52 Da Isoelectric Point: 9.9425
>NTDB_id=940594 V6U66_RS08345 WP_155213929.1 1799910..1802150(-) (comEC/celB) [Streptococcus salivarius strain KSS7]
MWLKKAPINLFSLALLIAALYFTIFVTNVYAMGAFAFLLVCFLKHHWKNKAALKLVGLVGGFFLIYFLFLHHRASIQDKQ
APAEINQVTLVADTLSVNGEQLSAIGKAKGQTYQVFYRLKYEKEQHFFKTTSQTLVLKGKIKLSPATGQRNFQGFNYQSY
LASQGIYRMAQIEHLDHVVPQKSLSPLAFLHQLRRRALVHIQTHFPNPMRHYMTGLLFGYLDKEFDEQSQLYTSLGIIHL
FALSGMQVGFFLGWFRYGLLRLGLPKDYLLIILLPFSLCYGLMTGWTASVLRSLIQSLLAEFGIKKLDNMGITLLLLFLF
LPHFLLTVGGVLSCSYAFLLCLFDFEEMSSLKKSICTSLVLSLGILPFLTYYYGTFQPVSLILTAMFSIVFDSFLLPVLT
VFFVLSGLVIFSQINPLFEWMETFLTWIQSWIGQPLILGKPSLFQFGLMIAVLVMLFDFWKKPQFRVCLLMIFSLLMVWV
KHPLTNEVTMVDVGQGDSIFLRSMKGDTILIDVGGKVTFGSKEKWQEASQTSNAEKALIPYLQARGVSQIDHLVLTHTDT
DHIGDLEEVTKRFKIKEICVSQGALTKPSFVKRLRTLKRPVRTLKAGDNLPMMGSKLQVLYPNKIGDGGNNDSIVLYGKL
LGSSFLFTGDLEKEGEEELMVSYPNLKASVLKAGHHGSKGSSSEAFLDQLHPSLALVSAGENNRYKHPNDETLKRFKKRH
IKVLRTDQNGAIRFKGWFKWSSETVR
MWLKKAPINLFSLALLIAALYFTIFVTNVYAMGAFAFLLVCFLKHHWKNKAALKLVGLVGGFFLIYFLFLHHRASIQDKQ
APAEINQVTLVADTLSVNGEQLSAIGKAKGQTYQVFYRLKYEKEQHFFKTTSQTLVLKGKIKLSPATGQRNFQGFNYQSY
LASQGIYRMAQIEHLDHVVPQKSLSPLAFLHQLRRRALVHIQTHFPNPMRHYMTGLLFGYLDKEFDEQSQLYTSLGIIHL
FALSGMQVGFFLGWFRYGLLRLGLPKDYLLIILLPFSLCYGLMTGWTASVLRSLIQSLLAEFGIKKLDNMGITLLLLFLF
LPHFLLTVGGVLSCSYAFLLCLFDFEEMSSLKKSICTSLVLSLGILPFLTYYYGTFQPVSLILTAMFSIVFDSFLLPVLT
VFFVLSGLVIFSQINPLFEWMETFLTWIQSWIGQPLILGKPSLFQFGLMIAVLVMLFDFWKKPQFRVCLLMIFSLLMVWV
KHPLTNEVTMVDVGQGDSIFLRSMKGDTILIDVGGKVTFGSKEKWQEASQTSNAEKALIPYLQARGVSQIDHLVLTHTDT
DHIGDLEEVTKRFKIKEICVSQGALTKPSFVKRLRTLKRPVRTLKAGDNLPMMGSKLQVLYPNKIGDGGNNDSIVLYGKL
LGSSFLFTGDLEKEGEEELMVSYPNLKASVLKAGHHGSKGSSSEAFLDQLHPSLALVSAGENNRYKHPNDETLKRFKKRH
IKVLRTDQNGAIRFKGWFKWSSETVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=940594 V6U66_RS08345 WP_155213929.1 1799910..1802150(-) (comEC/celB) [Streptococcus salivarius strain KSS7]
ATGTGGCTTAAAAAAGCTCCAATCAATCTTTTTTCCTTGGCTCTGTTAATAGCTGCTCTTTATTTTACGATTTTTGTCAC
TAATGTTTATGCTATGGGGGCTTTTGCTTTTCTTTTAGTATGTTTTTTGAAGCATCATTGGAAGAATAAGGCAGCCCTAA
AGTTGGTGGGGCTTGTAGGTGGGTTCTTTTTGATTTATTTTTTATTCTTGCATCACAGAGCTAGCATACAAGATAAACAA
GCTCCTGCTGAAATCAATCAGGTGACTCTGGTTGCTGATACGCTATCGGTTAATGGTGAGCAATTATCAGCTATTGGAAA
GGCAAAGGGACAAACTTATCAGGTCTTTTACCGACTCAAATATGAGAAGGAGCAGCATTTTTTTAAGACTACTAGTCAAA
CGCTAGTACTAAAAGGAAAAATAAAGTTATCCCCAGCAACTGGTCAACGTAATTTTCAAGGGTTTAATTATCAGTCTTAT
CTAGCCAGTCAAGGCATTTATCGAATGGCTCAGATTGAGCACTTGGACCATGTGGTGCCTCAAAAATCTCTATCTCCCCT
AGCTTTTTTACATCAACTGAGGAGGAGGGCATTGGTTCATATCCAGACGCACTTTCCTAATCCTATGAGACACTATATGA
CAGGCCTGCTCTTTGGGTATTTGGATAAGGAGTTTGATGAGCAGAGTCAGCTTTACACAAGCTTAGGTATTATTCATCTA
TTCGCACTTTCAGGGATGCAAGTCGGCTTTTTTCTGGGATGGTTTCGCTATGGACTCCTACGCTTGGGCCTTCCTAAAGA
TTATCTATTAATTATCTTGCTACCTTTTTCCTTATGCTATGGCTTAATGACAGGTTGGACAGCTTCAGTCCTACGTTCCT
TGATTCAAAGTTTGTTGGCGGAGTTTGGTATTAAAAAACTGGATAATATGGGTATAACCTTGCTTTTATTGTTTCTCTTT
TTACCTCATTTTCTTTTGACAGTGGGAGGTGTTTTAAGTTGTTCCTATGCCTTCTTGTTGTGTTTGTTTGATTTTGAGGA
GATGTCATCTCTTAAAAAGTCAATCTGTACGAGCTTGGTATTGAGTCTTGGGATTTTGCCCTTTCTAACTTACTATTATG
GGACCTTTCAACCGGTGAGTTTAATTCTGACGGCGATGTTTTCGATAGTCTTTGATAGCTTTCTCTTACCTGTCTTGACG
GTCTTCTTTGTCCTTTCAGGACTAGTAATCTTTTCTCAAATCAACCCACTTTTTGAATGGATGGAGACCTTTTTGACTTG
GATACAATCCTGGATAGGCCAGCCTTTGATTTTAGGGAAACCTAGTTTGTTTCAGTTTGGCTTGATGATAGCTGTATTGG
TTATGCTCTTTGATTTTTGGAAAAAGCCCCAGTTTAGGGTTTGTCTTTTGATGATTTTTAGTCTTTTGATGGTCTGGGTC
AAACATCCTTTAACCAATGAGGTGACTATGGTTGACGTTGGTCAGGGAGATAGTATTTTTCTGAGGAGTATGAAAGGCGA
TACCATCCTAATTGATGTGGGTGGCAAGGTGACTTTCGGCTCAAAGGAAAAATGGCAAGAGGCTAGTCAGACGAGCAATG
CGGAGAAAGCCTTGATTCCCTATCTACAGGCTAGGGGAGTGTCTCAAATTGACCATCTGGTCCTGACTCATACGGACACA
GACCATATTGGTGATTTGGAAGAAGTGACCAAGCGGTTTAAGATTAAGGAAATCTGTGTCAGTCAAGGAGCTTTGACTAA
GCCTAGTTTTGTGAAACGACTTCGAACTTTAAAACGCCCAGTTCGCACTCTAAAGGCTGGAGACAACTTACCCATGATGG
GAAGTAAGCTACAGGTTCTTTATCCAAATAAAATTGGTGATGGTGGTAACAATGATTCGATAGTTCTTTACGGAAAACTA
TTAGGAAGCAGTTTTCTGTTTACTGGTGATTTGGAAAAAGAAGGAGAGGAGGAACTGATGGTCAGCTATCCTAATTTAAA
GGCCAGTGTCCTCAAAGCCGGACACCACGGTTCAAAAGGGTCATCGTCTGAAGCTTTTTTGGACCAGCTGCATCCCTCCC
TTGCACTTGTTTCAGCCGGTGAGAACAATCGTTATAAACATCCAAATGATGAAACTTTGAAGCGTTTTAAGAAACGTCAC
ATTAAGGTTTTACGAACAGACCAGAACGGCGCTATCCGTTTTAAGGGGTGGTTTAAGTGGTCAAGTGAAACTGTCCGATA
A
ATGTGGCTTAAAAAAGCTCCAATCAATCTTTTTTCCTTGGCTCTGTTAATAGCTGCTCTTTATTTTACGATTTTTGTCAC
TAATGTTTATGCTATGGGGGCTTTTGCTTTTCTTTTAGTATGTTTTTTGAAGCATCATTGGAAGAATAAGGCAGCCCTAA
AGTTGGTGGGGCTTGTAGGTGGGTTCTTTTTGATTTATTTTTTATTCTTGCATCACAGAGCTAGCATACAAGATAAACAA
GCTCCTGCTGAAATCAATCAGGTGACTCTGGTTGCTGATACGCTATCGGTTAATGGTGAGCAATTATCAGCTATTGGAAA
GGCAAAGGGACAAACTTATCAGGTCTTTTACCGACTCAAATATGAGAAGGAGCAGCATTTTTTTAAGACTACTAGTCAAA
CGCTAGTACTAAAAGGAAAAATAAAGTTATCCCCAGCAACTGGTCAACGTAATTTTCAAGGGTTTAATTATCAGTCTTAT
CTAGCCAGTCAAGGCATTTATCGAATGGCTCAGATTGAGCACTTGGACCATGTGGTGCCTCAAAAATCTCTATCTCCCCT
AGCTTTTTTACATCAACTGAGGAGGAGGGCATTGGTTCATATCCAGACGCACTTTCCTAATCCTATGAGACACTATATGA
CAGGCCTGCTCTTTGGGTATTTGGATAAGGAGTTTGATGAGCAGAGTCAGCTTTACACAAGCTTAGGTATTATTCATCTA
TTCGCACTTTCAGGGATGCAAGTCGGCTTTTTTCTGGGATGGTTTCGCTATGGACTCCTACGCTTGGGCCTTCCTAAAGA
TTATCTATTAATTATCTTGCTACCTTTTTCCTTATGCTATGGCTTAATGACAGGTTGGACAGCTTCAGTCCTACGTTCCT
TGATTCAAAGTTTGTTGGCGGAGTTTGGTATTAAAAAACTGGATAATATGGGTATAACCTTGCTTTTATTGTTTCTCTTT
TTACCTCATTTTCTTTTGACAGTGGGAGGTGTTTTAAGTTGTTCCTATGCCTTCTTGTTGTGTTTGTTTGATTTTGAGGA
GATGTCATCTCTTAAAAAGTCAATCTGTACGAGCTTGGTATTGAGTCTTGGGATTTTGCCCTTTCTAACTTACTATTATG
GGACCTTTCAACCGGTGAGTTTAATTCTGACGGCGATGTTTTCGATAGTCTTTGATAGCTTTCTCTTACCTGTCTTGACG
GTCTTCTTTGTCCTTTCAGGACTAGTAATCTTTTCTCAAATCAACCCACTTTTTGAATGGATGGAGACCTTTTTGACTTG
GATACAATCCTGGATAGGCCAGCCTTTGATTTTAGGGAAACCTAGTTTGTTTCAGTTTGGCTTGATGATAGCTGTATTGG
TTATGCTCTTTGATTTTTGGAAAAAGCCCCAGTTTAGGGTTTGTCTTTTGATGATTTTTAGTCTTTTGATGGTCTGGGTC
AAACATCCTTTAACCAATGAGGTGACTATGGTTGACGTTGGTCAGGGAGATAGTATTTTTCTGAGGAGTATGAAAGGCGA
TACCATCCTAATTGATGTGGGTGGCAAGGTGACTTTCGGCTCAAAGGAAAAATGGCAAGAGGCTAGTCAGACGAGCAATG
CGGAGAAAGCCTTGATTCCCTATCTACAGGCTAGGGGAGTGTCTCAAATTGACCATCTGGTCCTGACTCATACGGACACA
GACCATATTGGTGATTTGGAAGAAGTGACCAAGCGGTTTAAGATTAAGGAAATCTGTGTCAGTCAAGGAGCTTTGACTAA
GCCTAGTTTTGTGAAACGACTTCGAACTTTAAAACGCCCAGTTCGCACTCTAAAGGCTGGAGACAACTTACCCATGATGG
GAAGTAAGCTACAGGTTCTTTATCCAAATAAAATTGGTGATGGTGGTAACAATGATTCGATAGTTCTTTACGGAAAACTA
TTAGGAAGCAGTTTTCTGTTTACTGGTGATTTGGAAAAAGAAGGAGAGGAGGAACTGATGGTCAGCTATCCTAATTTAAA
GGCCAGTGTCCTCAAAGCCGGACACCACGGTTCAAAAGGGTCATCGTCTGAAGCTTTTTTGGACCAGCTGCATCCCTCCC
TTGCACTTGTTTCAGCCGGTGAGAACAATCGTTATAAACATCCAAATGATGAAACTTTGAAGCGTTTTAAGAAACGTCAC
ATTAAGGTTTTACGAACAGACCAGAACGGCGCTATCCGTTTTAAGGGGTGGTTTAAGTGGTCAAGTGAAACTGTCCGATA
A
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis NCTC 12261 |
48.525 |
100 |
0.485 |
| comEC/celB | Streptococcus mitis SK321 |
48.327 |
100 |
0.484 |
| comEC/celB | Streptococcus pneumoniae R6 |
47.523 |
100 |
0.476 |
| comEC/celB | Streptococcus pneumoniae D39 |
47.523 |
100 |
0.476 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
47.523 |
100 |
0.476 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
47.256 |
100 |
0.473 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
43.243 |
99.196 |
0.429 |