Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | KX728_RS05780 | Genome accession | NZ_CP079724 |
| Coordinates | 1168782..1171022 (-) | Length | 746 a.a. |
| NCBI ID | WP_215804577.1 | Uniprot ID | - |
| Organism | Streptococcus oralis strain 34 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
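The coordinates and the two length fields in the overview are related by simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the coding sequence is a stop codon; both assumptions are inferred from the numbers on this page, not stated by it. The function names are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


# Coordinates of comEC/celB as listed in the overview table.
cds_bp = span_length(1168782, 1171022)
print(cds_bp, protein_length(cds_bp))
```

Under these assumptions the result agrees with the 2241 bp size and 746 a.a. length reported for this gene.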
Genomic Context
Location: 1163782..1176022
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| KX728_RS05755 (KX728_05750) | rplT | 1165133..1165492 (-) | 360 | WP_000124830.1 | 50S ribosomal protein L20 | - |
| KX728_RS05760 (KX728_05755) | rpmI | 1165544..1165744 (-) | 201 | WP_001125942.1 | 50S ribosomal protein L35 | - |
| KX728_RS05765 (KX728_05760) | infC | 1165777..1166307 (-) | 531 | WP_000848184.1 | translation initiation factor IF-3 | - |
| KX728_RS05770 (KX728_05765) | - | 1166612..1167796 (-) | 1185 | WP_042902389.1 | membrane protein | - |
| KX728_RS05775 (KX728_05770) | - | 1167793..1168386 (-) | 594 | WP_042902390.1 | ATP-binding cassette domain-containing protein | - |
| KX728_RS09360 | - | 1168419..1168624 (-) | 206 | Protein_1100 | hypothetical protein | - |
| KX728_RS05780 (KX728_05775) | comEC/celB | 1168782..1171022 (-) | 2241 | WP_215804577.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| KX728_RS05785 (KX728_05780) | comEA/celA/cilE | 1171006..1171656 (-) | 651 | WP_042902392.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| KX728_RS05790 (KX728_05785) | - | 1171723..1172292 (-) | 570 | WP_049485310.1 | GNAT family N-acetyltransferase | - |
| KX728_RS05795 (KX728_05790) | ald | 1172469..1173581 (+) | 1113 | WP_084921952.1 | alanine dehydrogenase | - |
| KX728_RS05800 (KX728_05795) | - | 1173632..1174618 (-) | 987 | WP_084921953.1 | PhoH family protein | - |
| KX728_RS05805 (KX728_05800) | - | 1174699..1174914 (-) | 216 | WP_001232084.1 | YozE family protein | - |
| KX728_RS05810 (KX728_05805) | - | 1174938..1175516 (-) | 579 | WP_215804576.1 | GrpB family protein | - |
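The Location range above appears to be the comEC/celB coordinates extended by 5000 bp on each side. That flank size is an inference from the numbers, not something the page states. The sketch below reproduces the window and also measures how many bases two neighbouring genes share, using hypothetical helper names.

```python
FLANK_BP = 5000  # assumption: inferred from the coordinates, not stated on the page


def context_window(start, end, flank=FLANK_BP):
    """Return the (start, end) window that extends a gene by `flank` bp per side."""
    return max(1, start - flank), end + flank


def overlap_bp(a, b):
    """Bases shared by two inclusive (start, end) ranges; 0 if they are disjoint."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


com_ec = (1168782, 1171022)  # comEC/celB
com_ea = (1171006, 1171656)  # comEA/celA/cilE

window = context_window(*com_ec)
shared = overlap_bp(com_ec, com_ea)
print(window, shared)
```

With the coordinates from the table, the window matches the listed Location, and the comEC/celB and comEA/celA/cilE ranges share 17 bp.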
Sequence
Protein
Length: 746 a.a.; Molecular weight: 84723.06 Da; Isoelectric point: 9.3972
>NTDB_id=588983 KX728_RS05780 WP_215804577.1 1168782..1171022(-) (comEC/celB) [Streptococcus oralis strain 34]
MSQWIKNFPIPLIYLSFLLLWPYYAIFSVSYLALLGFVFLLICLFFQFPWKSAGRVLAICGVFGIWFLFQNWQQTQASQN
LADSVERVRILPDTIKVNGDSLSFRGKAEGRTFQVYYKLQSEEEKELFQALTDLHEIEIEGKLSEPEVQRNFGGFNYQAY
LKTQGIYQILTIKSIQSMKQVRSWDIGENLSGLRRKAVVWIKMRFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDAFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTILGLFII
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDVVFLPLLSI
LFILSFVYPVTQFNFVFEWLENIIRLVSQLASRPLVFGQPNAWLLILLLVSLALVYDMRKNIKRLAGFSLFIVGLFFLTK
HPLENEIIMLDVGQGESIFLRDVTGKTILIDVGGKAESDKKIQAWQEKATTNNAQRTLIPYLKSRGVDKIDQLILTNTDK
EHIGDLLEVTKAFHVGEILVSKGSLTQKEFVAELEASQNKVRSVVAGENFPIFGSFLEVLSPRKIGDGNRDDSLVLYGKL
LDKHFLFTGNLKEKGEKDLLKQYPGLEVDVLKAGQHGAKTSSNPAFLEKIKPEITLISVGKSNRAKLPHQETLTRLESIK
SNIYRTDQQGAIRFTGWNSWRIETVR
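The FASTA header above packs several fields into one line. A minimal parsing sketch follows; the field layout is inferred from this single header, so the pattern and the field names (`ntdb_id`, `locus_tag`, and so on) are assumptions and not a documented format.

```python
import re

# Pattern inferred from the one header shown on this page (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]*)\) \[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; coordinates become integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not follow the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=588983 KX728_RS05780 WP_215804577.1 "
    "1168782..1171022(-) (comEC/celB) [Streptococcus oralis strain 34]"
)
record = parse_header(header)
print(record["locus_tag"], record["strand"], record["gene"])
```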
Nucleotide
Length: 2241 bp
>NTDB_id=588983 KX728_RS05780 WP_215804577.1 1168782..1171022(-) (comEC/celB) [Streptococcus oralis strain 34]
ATGTCACAGTGGATTAAGAATTTCCCTATCCCCCTAATCTATCTGAGTTTTCTGTTACTCTGGCCTTACTATGCCATTTT
CTCAGTATCTTATCTTGCTTTACTAGGCTTTGTTTTTCTACTCATCTGTCTCTTTTTTCAATTTCCTTGGAAATCGGCCG
GTAGAGTTCTAGCGATTTGTGGAGTTTTTGGAATTTGGTTTTTGTTTCAAAATTGGCAACAGACACAAGCAAGTCAAAAC
CTAGCGGATTCTGTTGAGAGGGTACGGATTTTACCAGATACCATCAAAGTCAATGGAGACAGTCTGTCCTTTCGGGGCAA
GGCTGAGGGCCGCACCTTCCAAGTTTATTATAAGCTACAGTCCGAGGAAGAGAAAGAGCTCTTTCAGGCCTTAACAGACC
TTCATGAGATAGAGATAGAAGGAAAACTTTCAGAGCCTGAAGTTCAGAGGAATTTTGGTGGCTTTAACTACCAAGCCTAT
CTGAAGACTCAAGGAATTTACCAAATTTTGACTATCAAGAGTATCCAGTCAATGAAACAGGTTAGAAGTTGGGATATAGG
AGAAAATCTGTCTGGTTTACGTCGGAAGGCTGTAGTTTGGATCAAGATGCGCTTTCCAGATCCTATGCGCAATTATATGA
CAGGTCTTCTTTTAGGACATTTGGACACGGACTTTGAGGAGATGAATGAGCTCTATTCCAGTCTTGGAATTATCCATCTC
TTTGCCTTGTCGGGTATGCAGGTAGGCTTTTTCATGGATGCCTTTAAGAAACTCCTCTTGCGATTGGGCTTAACTCAAGA
AAAGTTGAAGTGGCTAACTTATCCCTTTTCTCTTATCTATGCAGGTCTGACAGGATTTTCAGCATCTGTTATTCGCAGTC
TCTTGCAAAAGTTACTGGCCCAACATGGTGTTAAGGGTTTGGATAATTTTGCCTTGACGATCCTTGGCCTCTTTATCATC
ATGCCCAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCATACGCTTTTATCTTGACCATGACTAGTAAAGAAGG
GGAGGGACTCAAGGCTGTTGCTAGAGAAAGTCTGGTCATTTCCTTGGGAATATTGCCTATTCTATCCTTCTATTTTGCAG
AATTTCAGCCTTGGTCTATCCTTTTGACCTTTGTATTTTCCTTTCTGTTTGATGTGGTCTTTTTGCCACTTTTGTCCATC
TTATTTATTCTGTCTTTTGTTTACCCAGTCACTCAGTTTAACTTTGTCTTTGAGTGGTTGGAGAACATCATTCGCTTGGT
ATCGCAGCTGGCAAGCAGGCCTCTGGTCTTTGGTCAACCCAATGCATGGCTGTTGATTCTACTGTTAGTTTCATTAGCCT
TGGTCTATGACATGAGGAAAAACATCAAAAGACTAGCAGGATTTAGTCTCTTTATTGTGGGTCTCTTTTTCCTGACCAAG
CATCCACTTGAAAATGAAATCATCATGTTGGATGTGGGGCAAGGTGAAAGTATTTTCCTACGGGATGTAACTGGTAAAAC
CATTCTCATAGATGTGGGTGGTAAGGCAGAATCTGACAAGAAAATCCAAGCTTGGCAAGAAAAGGCGACAACCAACAATG
CCCAGAGAACCTTGATTCCCTATCTTAAAAGTCGAGGAGTAGACAAGATTGACCAGCTGATTTTGACAAATACAGATAAG
GAGCATATTGGCGATTTGCTGGAGGTAACCAAGGCTTTTCATGTCGGAGAGATTTTAGTATCAAAAGGAAGTCTGACACA
GAAGGAATTTGTAGCGGAACTAGAAGCAAGTCAAAACAAGGTACGTAGTGTGGTAGCTGGGGAGAATTTCCCGATTTTTG
GCAGTTTCTTAGAAGTCCTATCTCCAAGGAAGATTGGAGATGGGAATCGTGATGATTCTCTGGTTCTTTATGGAAAACTT
TTGGATAAGCACTTTCTCTTCACAGGAAATTTGAAAGAGAAGGGAGAGAAGGATCTTCTAAAGCAATACCCTGGCTTAGA
GGTGGATGTCCTGAAAGCAGGCCAACATGGTGCTAAAACCTCATCAAATCCAGCTTTCCTAGAAAAAATCAAACCAGAAA
TTACTCTCATTTCAGTCGGAAAGAGCAATCGTGCGAAACTCCCCCATCAGGAAACCTTGACACGACTGGAAAGTATCAAG
AGTAATATCTACCGAACTGACCAGCAAGGGGCTATCCGCTTTACAGGGTGGAATAGTTGGAGAATTGAAACGGTTCGTTA
G
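The nucleotide record is given in coding orientation (it starts with ATG and ends with TAG), so it can be translated directly and compared with the protein record. The sketch below builds the standard codon table in plain Python and translates only the first and last codons of the sequence; the function name `translate` is illustrative, and the use of the standard genetic code is an assumption.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (an assumption
# that this code applies; the page does not name a translation table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_codons = "ATGTCACAGTGGATTAAGAATTTCCCTATC"  # first 30 nt of the record
last_codons = "TGGAGAATTGAAACGGTTCGTTAG"         # last 24 nt of the record
print(translate(first_codons), translate(last_codons))
```

The two fragments translate to the first ten and last seven residues of the protein sequence listed above, which is consistent with the nucleotide and protein records describing the same gene.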
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 | 87.668 | 100 | 0.877 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 86.711 | 99.866 | 0.866 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 86.193 | 100 | 0.862 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 85.657 | 100 | 0.857 |
| comEC/celB | Streptococcus pneumoniae D39 | 85.657 | 100 | 0.857 |
| comEC/celB | Streptococcus pneumoniae R6 | 85.657 | 100 | 0.857 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 44.804 | 99.33 | 0.445 |
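The page does not define the Ha-value. The listed values are, however, consistent with the product of identity and coverage expressed as fractions and rounded to three decimals. The sketch below treats that relationship as an assumption and checks it against rows of the table; `ha_value` is a hypothetical helper name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100), 3 decimals."""
    return round(identity_pct * coverage_pct / 10000, 3)


# (identity %, coverage %, listed Ha-value) taken from the table above.
rows = [
    (87.668, 100, 0.877),
    (86.711, 99.866, 0.866),
    (86.193, 100, 0.862),
    (85.657, 100, 0.857),
    (44.804, 99.33, 0.445),
]

for identity, coverage, listed in rows:
    print(identity, coverage, ha_value(identity, coverage), listed)
```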