Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | HSISS4_RS07510 | Genome accession | NZ_CP013216 |
| Coordinates | 1661957..1664197 (-) | Length | 746 a.a. |
| NCBI ID | WP_021143744.1 | Uniprot ID | - |
| Organism | Streptococcus salivarius strain HSISS4 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 1656957..1669197
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| HSISS4_RS07490 (HSISS4_01462) | - | 1657821..1658681 (-) | 861 | WP_004183173.1 | methionyl aminopeptidase | - |
| HSISS4_RS07495 (HSISS4_01463) | spxR | 1658725..1660005 (-) | 1281 | WP_002884454.1 | CBS-HotDog domain-containing transcription factor SpxR | - |
| HSISS4_RS07500 (HSISS4_01464) | - | 1659998..1660543 (-) | 546 | WP_013991031.1 | GNAT family N-acetyltransferase | - |
| HSISS4_RS07505 (HSISS4_01465) | - | 1660609..1661871 (-) | 1263 | WP_021143745.1 | UDP-N-acetylglucosamine 1-carboxyvinyltransferase | - |
| HSISS4_RS07510 (HSISS4_01466) | comEC/celB | 1661957..1664197 (-) | 2241 | WP_021143744.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| HSISS4_RS07515 (HSISS4_01467) | comEA | 1664187..1664882 (-) | 696 | WP_021143743.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| HSISS4_RS07520 (HSISS4_01468) | - | 1664990..1665745 (-) | 756 | WP_021143742.1 | lysophospholipid acyltransferase family protein | - |
| HSISS4_RS07525 (HSISS4_01469) | - | 1665874..1666809 (+) | 936 | WP_021143741.1 | polysaccharide deacetylase family protein | - |
| HSISS4_RS07530 (HSISS4_01470) | - | 1666859..1667623 (+) | 765 | WP_059749235.1 | tRNA1(Val) (adenine(37)-N6)-methyltransferase | - |
| HSISS4_RS07535 (HSISS4_01471) | - | 1667627..1667896 (+) | 270 | WP_021143739.1 | GIY-YIG nuclease family protein | - |
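The coordinates in the table above can be cross-checked with a little arithmetic. The sketch below is only an illustration: the helper names (`parse_coords`, `span_length`, `overlap`) are hypothetical and not part of any database tooling, and it assumes coordinates are 1-based and inclusive, which is consistent with the sizes listed in the table. It recomputes the comEC/celB size, the flanks between the gene and the "Location" window, and the overlap between comEC/celB and comEA.

```python
import re

def parse_coords(text):
    """Parse 'start..end' or 'start..end (strand)' into (start, end, strand)."""
    m = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*(?:\(([+-])\))?\s*", text)
    if m is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    return int(m.group(1)), int(m.group(2)), m.group(3)

def span_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1

def overlap(a, b):
    """Number of positions shared by two (start, end, strand) features."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

# Values copied from the record above.
comec = parse_coords("1661957..1664197 (-)")
comea = parse_coords("1664187..1664882 (-)")
window = parse_coords("1656957..1669197")

gene_bp = span_length(comec[0], comec[1])   # size of comEC/celB
flank_up = comec[0] - window[0]             # bp shown before the gene
flank_down = window[1] - comec[1]           # bp shown after the gene
shared_bp = overlap(comec, comea)           # bp shared by comEC/celB and comEA

print(gene_bp, flank_up, flank_down, shared_bp)
```

Under these assumptions the computed size matches the 2241 bp in the table, the displayed window extends 5000 bp on each side of the gene, and the comEC/celB and comEA coordinate ranges share 11 bp.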
Sequence
Protein
Length: 746 a.a. Molecular weight: 84596.50 Da. Isoelectric point: 9.9132
>NTDB_id=160715 HSISS4_RS07510 WP_021143744.1 1661957..1664197(-) (comEC/celB) [Streptococcus salivarius strain HSISS4]
MWLKKAPISLFSLSLLIAALYFTIFVTNVYAIGTFVFLMVCFLKHHWKNKAALKLVGLVGSFFLIYFLFLQHRATIQDKQ
APTAINQVTLVADTLSVNGEQLSAIGKAKGQTYQVFYRLKSEKEQYFFKTTSQTLVLKGKINLSPVTGQRNFQGFNYQSY
LASQGIYRMAQIERLDHVVSQKNTSPLAFFHQLRRRALVHIQTHFPNPMRHYMTGLLFGYLDKEFDEQSQLYTSLGIIHL
FALSGMQVGFFLGWFRYGLLRLGLPKDYLFIILLPFSLCYGLMTGWTASVLRSLIQSLLAEFGIKKLDNMGITLLLMFLF
LPHFLLTVGGVLSCSYAFLLCLFDFEEMSSLKKSICTSLVLSLGILPFLTYYYGTFQPVSLILTAMFSIVFDSFLLPVLT
VFFVLSGLVIFSQINPLFEWMETFLTWIQSWIGQPLILGKPSLFQFGLMIAVLVLLFDFWKKPQFRICLLMIFGLLMVWV
KHPLTNEVTMVDVGQGDSIFLRSMKGDTILIDVGGKVTFGLKEKWQEASQTSNAEKTLIPYLQARGVSQIDHMVLTHTDT
DHIGDLEEVAKRFKIKEICVSQGALTKPSFVKRLRTLKRPVRTLKAGDNLPMMGSKLQVLYPNKVGDGGNNDSIVLYGKL
LGSSFLFTGDLEKEGEEELMASYPNLKAGILKAGHHGSKGSSSEAFLDQLQPSLALVSAGENNRYKHPNDETLKRFKERH
IKVLRTDQNGAIRFKGWFKWSSETVR
Nucleotide
Length: 2241 bp
>NTDB_id=160715 HSISS4_RS07510 WP_021143744.1 1661957..1664197(-) (comEC/celB) [Streptococcus salivarius strain HSISS4]
ATGTGGCTTAAAAAAGCGCCAATCAGTCTTTTTTCCTTGTCTCTTTTAATAGCTGCCCTTTATTTTACGATTTTTGTCAC
TAATGTTTATGCTATTGGAACTTTTGTCTTTCTTATGGTCTGTTTTTTGAAGCATCATTGGAAGAATAAGGCAGCCCTAA
AGTTGGTGGGGCTTGTAGGTAGTTTCTTTTTGATTTACTTTTTATTCTTGCAACACAGAGCTACCATACAAGATAAACAA
GCTCCTACTGCAATCAATCAAGTGACTCTGGTTGCTGATACGCTATCGGTTAATGGTGAGCAATTATCAGCTATTGGAAA
GGCAAAGGGACAAACCTATCAGGTTTTTTACCGACTCAAATCTGAGAAGGAGCAGTATTTTTTTAAGACTACTAGTCAAA
CCTTGGTATTAAAAGGGAAAATAAACTTATCCCCAGTAACTGGTCAACGTAATTTTCAAGGGTTTAATTATCAGTCTTAT
CTAGCCAGTCAAGGCATTTATCGAATGGCTCAGATTGAGCGCTTGGATCATGTGGTGTCTCAAAAAAACACATCTCCCCT
AGCTTTTTTCCATCAACTGAGGAGGAGGGCTTTGGTTCATATTCAGACGCATTTTCCTAATCCTATGAGACACTATATGA
CAGGTCTGCTCTTTGGGTATTTGGACAAGGAGTTTGATGAGCAAAGTCAGCTTTACACAAGCTTAGGTATTATTCATCTA
TTCGCACTTTCGGGGATGCAAGTGGGCTTTTTTCTGGGATGGTTTCGCTACGGACTCCTACGCTTGGGCCTTCCTAAAGA
TTATCTATTTATTATCTTGCTGCCTTTTTCCTTATGCTATGGCTTAATGACAGGTTGGACAGCTTCTGTCCTACGTTCCT
TGATTCAAAGTTTGTTGGCTGAGTTTGGTATTAAAAAACTGGACAATATGGGAATAACCTTGCTTCTAATGTTTCTCTTT
TTACCTCATTTTCTTTTGACAGTGGGAGGTGTTTTAAGTTGTTCCTATGCCTTTTTGTTGTGTTTGTTTGATTTTGAGGA
GATGTCATCTCTTAAAAAGTCAATCTGTACGAGCTTGGTATTGAGTCTTGGGATTTTGCCCTTTCTAACTTACTATTATG
GGACCTTTCAACCGGTGAGTTTAATTCTGACAGCGATGTTTTCGATAGTCTTTGATAGCTTTCTCTTACCTGTCTTGACG
GTCTTCTTTGTCCTTTCAGGGTTGGTAATCTTTTCTCAAATCAACCCACTTTTTGAATGGATGGAGACCTTTTTGACTTG
GATACAATCCTGGATAGGCCAGCCTTTGATTCTAGGGAAACCTAGTTTGTTTCAGTTTGGCTTGATGATAGCTGTATTGG
TTCTGCTCTTTGATTTTTGGAAAAAGCCTCAGTTTAGGATTTGCCTTTTGATGATTTTTGGTCTTTTGATGGTCTGGGTC
AAACATCCTTTAACCAATGAGGTGACTATGGTTGACGTTGGTCAGGGAGATAGTATTTTTCTGAGGAGTATGAAAGGCGA
TACCATCCTGATTGATGTGGGTGGCAAGGTGACTTTCGGCCTAAAGGAAAAATGGCAAGAGGCTAGTCAGACGAGCAATG
CGGAGAAAACCTTGATTCCCTATCTACAGGCTAGGGGAGTGTCTCAAATTGACCATATGGTCCTGACTCATACGGATACA
GACCATATTGGTGATTTGGAAGAAGTAGCCAAGCGGTTTAAGATTAAGGAAATCTGTGTCAGTCAAGGAGCTTTGACTAA
GCCTAGTTTTGTGAAACGACTTAGGACTTTAAAACGCCCAGTTCGTACTCTAAAGGCTGGAGACAACTTACCCATGATGG
GAAGCAAGCTACAGGTTCTTTATCCAAATAAAGTTGGTGATGGTGGTAATAATGATTCGATAGTGCTCTATGGAAAATTA
TTAGGAAGCAGTTTTCTGTTTACGGGTGATCTGGAAAAGGAAGGAGAGGAGGAACTGATGGCCAGTTATCCCAATTTAAA
AGCAGGCATCCTCAAAGCTGGACACCACGGTTCAAAAGGGTCATCGTCTGAAGCGTTTTTGGACCAGTTGCAGCCCTCCC
TTGCCCTTGTTTCAGCTGGTGAGAATAATCGCTATAAACATCCAAATGATGAAACTTTGAAGCGTTTTAAGGAACGTCAC
ATTAAGGTTTTACGAACAGACCAGAACGGTGCTATCCGTTTTAAAGGGTGGTTTAAGTGGTCAAGTGAAACTGTCCGATA
A
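The protein and nucleotide lengths reported in this record are related by 2241 = 3 × (746 + 1): 746 sense codons plus one stop codon. The sketch below illustrates that relationship and translates the first and last few codons of the sequence above. It is a minimal example that assumes the standard genetic code; the `translate` helper and the `CODON_TABLE` name are hypothetical and not taken from the database.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG order (an assumption
# of this sketch; the record itself does not state a translation table).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

protein_len = 746      # a.a., from the Protein section
nucleotide_len = 2241  # bp, from the Nucleotide section
expected_bp = 3 * (protein_len + 1)  # sense codons plus one stop codon

head = translate("ATGTGGCTTAAAAAAGCG")  # first 18 nt of the sequence
tail = translate("GAAACTGTCCGATAA")     # last 15 nt of the sequence

print(expected_bp == nucleotide_len, head, tail)
```

The translated fragments agree with the start (MWLKKA) and end (ETVR, followed by a stop) of the protein sequence listed above.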
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis NCTC 12261 | 48.465 | 100 | 0.487 |
| comEC/celB | Streptococcus mitis SK321 | 48.133 | 100 | 0.484 |
| comEC/celB | Streptococcus pneumoniae R6 | 47.333 | 100 | 0.476 |
| comEC/celB | Streptococcus pneumoniae D39 | 47.333 | 100 | 0.476 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 47.333 | 100 | 0.476 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 47.2 | 100 | 0.475 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 42.973 | 99.196 | 0.426 |