Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | STO1_RS05965 | Genome accession | NZ_AP018338 |
| Coordinates | 1179238..1181478 (-) | Length | 746 a.a. |
| NCBI ID | WP_096422408.1 | Uniprot ID | - |
| Organism | Streptococcus oralis subsp. tigurinus strain osk_001 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1174238..1186478
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| STO1_RS05935 (STO1_011560) | rplT | 1175569..1175928 (-) | 360 | WP_000124834.1 | 50S ribosomal protein L20 | - |
| STO1_RS05940 (STO1_011570) | rpmI | 1175980..1176180 (-) | 201 | WP_001125942.1 | 50S ribosomal protein L35 | - |
| STO1_RS05945 (STO1_011580) | infC | 1176213..1176743 (-) | 531 | WP_007521573.1 | translation initiation factor IF-3 | - |
| STO1_RS05950 (STO1_011590) | - | 1177058..1178241 (-) | 1184 | Protein_1164 | hypothetical protein | - |
| STO1_RS05955 (STO1_011610) | - | 1178238..1178831 (-) | 594 | WP_061588100.1 | ATP-binding cassette domain-containing protein | - |
| STO1_RS09920 | - | 1178863..1179090 (-) | 228 | WP_061588099.1 | hypothetical protein | - |
| STO1_RS05965 (STO1_011630) | comEC/celB | 1179238..1181478 (-) | 2241 | WP_096422408.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| STO1_RS05970 (STO1_011640) | comEA/celA/cilE | 1181462..1182112 (-) | 651 | WP_061587623.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| STO1_RS05975 (STO1_011650) | - | 1182179..1182748 (-) | 570 | WP_084939084.1 | GNAT family N-acetyltransferase | - |
| STO1_RS05985 (STO1_011660) | ald | 1182925..1184037 (+) | 1113 | WP_045616943.1 | alanine dehydrogenase | - |
| STO1_RS05990 (STO1_011670) | - | 1184087..1185073 (-) | 987 | WP_000658201.1 | PhoH family protein | - |
| STO1_RS05995 (STO1_011680) | - | 1185154..1185369 (-) | 216 | WP_001232084.1 | YozE family protein | - |
| STO1_RS06000 (STO1_011690) | - | 1185393..1185971 (-) | 579 | WP_061588095.1 | GrpB family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84838.92 Da Isoelectric Point: 9.3510
>NTDB_id=69066 STO1_RS05965 WP_096422408.1 1179238..1181478(-) (comEC/celB) [Streptococcus oralis subsp. tigurinus strain osk_001]
MSQWIKNFPIPLIYLSFLLLWLYYAIFGVSYLALLGFVFLLICLFFQFSWKSAGKILAICGVFGFWFLFHNWQQTQANQN
LMDSVERVRILPDTIKVNGDSLSFRGKADGRTFQIYYKLQSEEEKEQFQALTDLYEIELEGKLSEPEGQRNFGGFDYQAY
LKTQGIYQTLTIKSIQSLKKVSSWDIGENLSSLRRKAIVWIKSHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKHLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLFQKLLAQHGYKGLDNFALTVLVLFII
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFAFSFVYPAVQFNLIFEWLEGVIRLVSQLATRPLVFGQPNAWLLILLLVSLALVYDMRKNIKRLAGFSLFIVGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESDKKIQAWQQKSTASNAQRTLIPYLKSRGVDKIDQLILTNTEK
EHVGDLLEVTRAFQVGEILVSKGSLTQKEFVAELQATQTKVRSVTGGENLPIFGSYLEVLSTRKMGDGSHDDSLFLYGRL
LDKHFLFTGNLKEKGEKDLLKQYSTLEVDVLKVGQHGSKTSSNLAFLEKLKPEITLISVGKNNRTKLPHQEALTRLETIN
SKIYRTDQNGAIRFKGWKSWQIESVR
MSQWIKNFPIPLIYLSFLLLWLYYAIFGVSYLALLGFVFLLICLFFQFSWKSAGKILAICGVFGFWFLFHNWQQTQANQN
LMDSVERVRILPDTIKVNGDSLSFRGKADGRTFQIYYKLQSEEEKEQFQALTDLYEIELEGKLSEPEGQRNFGGFDYQAY
LKTQGIYQTLTIKSIQSLKKVSSWDIGENLSSLRRKAIVWIKSHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKHLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLFQKLLAQHGYKGLDNFALTVLVLFII
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFAFSFVYPAVQFNLIFEWLEGVIRLVSQLATRPLVFGQPNAWLLILLLVSLALVYDMRKNIKRLAGFSLFIVGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESDKKIQAWQQKSTASNAQRTLIPYLKSRGVDKIDQLILTNTEK
EHVGDLLEVTRAFQVGEILVSKGSLTQKEFVAELQATQTKVRSVTGGENLPIFGSYLEVLSTRKMGDGSHDDSLFLYGRL
LDKHFLFTGNLKEKGEKDLLKQYSTLEVDVLKVGQHGSKTSSNLAFLEKLKPEITLISVGKNNRTKLPHQEALTRLETIN
SKIYRTDQNGAIRFKGWKSWQIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=69066 STO1_RS05965 WP_096422408.1 1179238..1181478(-) (comEC/celB) [Streptococcus oralis subsp. tigurinus strain osk_001]
ATGTCACAGTGGATTAAGAATTTTCCTATCCCCCTAATCTACCTGAGCTTTCTGTTGCTCTGGCTTTACTATGCCATTTT
TGGAGTGTCCTATCTAGCACTGCTAGGTTTTGTTTTTTTGCTCATCTGTCTCTTTTTCCAATTTTCTTGGAAATCGGCTG
GTAAAATTTTAGCGATTTGTGGAGTTTTTGGATTTTGGTTTTTGTTTCATAACTGGCAACAGACACAAGCTAATCAAAAC
CTAATGGATTCTGTTGAGAGGGTACGGATTTTACCGGATACTATCAAAGTCAATGGAGATAGTCTGTCCTTTCGGGGCAA
GGCTGACGGCCGTACCTTCCAAATTTACTATAAACTCCAGTCCGAGGAAGAGAAAGAGCAATTTCAAGCCTTGACAGACC
TCTATGAGATAGAACTGGAAGGAAAACTGTCAGAGCCAGAAGGCCAGAGAAATTTTGGTGGATTTGACTACCAAGCCTAT
CTGAAAACTCAAGGGATTTACCAGACACTGACTATCAAGAGCATCCAGTCACTTAAAAAAGTTAGCAGTTGGGATATAGG
TGAAAACCTATCGAGTCTACGTAGAAAGGCTATTGTTTGGATTAAGAGCCATTTTCCCGACCCTATGCGTAACTACATGA
CGGGGCTCTTGCTGGGACATCTGGATACGGACTTCGAGGAGATGAATGAGCTCTATTCCAGTCTTGGAATTATCCATCTC
TTTGCCTTGTCGGGTATGCAGGTGGGTTTCTTTATGGATGGTTTTAAGAAACACCTCTTGCGATTGGGCTTGACTCAAGA
AAAGTTGAAGTGGCTGACCTATCCCTTTTCCCTTATCTATGCAGGTCTGACAGGATTTTCAGCATCGGTCATTCGTAGTC
TCTTCCAAAAGTTACTGGCCCAACATGGCTACAAGGGTTTGGACAATTTTGCCTTGACGGTGCTTGTTCTCTTTATCATC
ATGCCCAACTTTTTCCTAACTGCAGGAGGAGTCTTGTCCTGCGCCTACGCCTTTATCTTGACCATGACTAGTAAAGAAGG
CGAGGGGCTCAAGGCTGTTGCCAGAGAAAGTCTGGTCATTTCCTTGGGAATATTGCCCATTCTTTCCTTTTATTTTGCAG
AATTTCAGCCTTGGTCAATTCTTTTAACATTTGTCTTTTCCTTTCTTTTTGACTTGGTTTTCTTACCACTTTTGTCTATC
TTGTTTGCCTTTTCTTTTGTCTATCCTGCCGTTCAATTTAACCTTATCTTTGAGTGGTTGGAGGGCGTCATTCGCTTGGT
TTCACAGCTAGCAACTAGGCCCTTGGTCTTTGGACAACCCAACGCATGGCTTTTGATTCTTCTCTTAGTTTCATTAGCCT
TGGTCTATGACATGAGGAAAAACATCAAAAGACTAGCAGGATTCAGTCTCTTTATCGTAGGACTCTTTTTCTTGACCAAG
CATCCGCTGGAAAATGAAATCACCATGCTGGATGTAGGGCAGGGAGAAAGCATTTTTCTAAGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTGGGTGGAAAGGCAGAATCCGACAAAAAAATCCAAGCTTGGCAGCAAAAGTCGACGGCTAGCAATG
CCCAGCGAACCTTGATTCCCTATCTTAAAAGTCGTGGAGTAGACAAGATTGACCAGCTGATTTTGACCAACACAGAGAAA
GAGCATGTTGGAGATTTACTAGAGGTGACCAGGGCTTTCCAAGTCGGGGAGATTTTAGTATCAAAAGGAAGTTTGACACA
GAAAGAATTTGTGGCAGAACTACAAGCGACTCAAACCAAGGTACGCAGTGTGACAGGGGGGGAGAATTTACCGATTTTTG
GCAGTTACTTAGAAGTCCTATCTACAAGGAAGATGGGAGATGGAAGTCATGATGATTCTCTTTTTCTTTATGGGAGGCTT
TTGGATAAGCACTTTCTCTTTACTGGAAACTTGAAGGAGAAGGGAGAAAAGGATCTTCTAAAGCAATACTCTACCTTAGA
GGTGGATGTCTTAAAAGTCGGTCAACATGGTTCTAAAACCTCATCAAATCTAGCTTTCCTAGAAAAACTCAAACCAGAAA
TTACTCTCATTTCAGTTGGAAAGAACAATCGTACGAAGCTACCCCATCAGGAAGCCTTGACACGACTGGAAACTATCAAT
AGTAAAATTTACCGAACTGACCAGAACGGAGCTATTCGCTTTAAAGGTTGGAAAAGTTGGCAAATCGAAAGTGTTCGTTA
G
ATGTCACAGTGGATTAAGAATTTTCCTATCCCCCTAATCTACCTGAGCTTTCTGTTGCTCTGGCTTTACTATGCCATTTT
TGGAGTGTCCTATCTAGCACTGCTAGGTTTTGTTTTTTTGCTCATCTGTCTCTTTTTCCAATTTTCTTGGAAATCGGCTG
GTAAAATTTTAGCGATTTGTGGAGTTTTTGGATTTTGGTTTTTGTTTCATAACTGGCAACAGACACAAGCTAATCAAAAC
CTAATGGATTCTGTTGAGAGGGTACGGATTTTACCGGATACTATCAAAGTCAATGGAGATAGTCTGTCCTTTCGGGGCAA
GGCTGACGGCCGTACCTTCCAAATTTACTATAAACTCCAGTCCGAGGAAGAGAAAGAGCAATTTCAAGCCTTGACAGACC
TCTATGAGATAGAACTGGAAGGAAAACTGTCAGAGCCAGAAGGCCAGAGAAATTTTGGTGGATTTGACTACCAAGCCTAT
CTGAAAACTCAAGGGATTTACCAGACACTGACTATCAAGAGCATCCAGTCACTTAAAAAAGTTAGCAGTTGGGATATAGG
TGAAAACCTATCGAGTCTACGTAGAAAGGCTATTGTTTGGATTAAGAGCCATTTTCCCGACCCTATGCGTAACTACATGA
CGGGGCTCTTGCTGGGACATCTGGATACGGACTTCGAGGAGATGAATGAGCTCTATTCCAGTCTTGGAATTATCCATCTC
TTTGCCTTGTCGGGTATGCAGGTGGGTTTCTTTATGGATGGTTTTAAGAAACACCTCTTGCGATTGGGCTTGACTCAAGA
AAAGTTGAAGTGGCTGACCTATCCCTTTTCCCTTATCTATGCAGGTCTGACAGGATTTTCAGCATCGGTCATTCGTAGTC
TCTTCCAAAAGTTACTGGCCCAACATGGCTACAAGGGTTTGGACAATTTTGCCTTGACGGTGCTTGTTCTCTTTATCATC
ATGCCCAACTTTTTCCTAACTGCAGGAGGAGTCTTGTCCTGCGCCTACGCCTTTATCTTGACCATGACTAGTAAAGAAGG
CGAGGGGCTCAAGGCTGTTGCCAGAGAAAGTCTGGTCATTTCCTTGGGAATATTGCCCATTCTTTCCTTTTATTTTGCAG
AATTTCAGCCTTGGTCAATTCTTTTAACATTTGTCTTTTCCTTTCTTTTTGACTTGGTTTTCTTACCACTTTTGTCTATC
TTGTTTGCCTTTTCTTTTGTCTATCCTGCCGTTCAATTTAACCTTATCTTTGAGTGGTTGGAGGGCGTCATTCGCTTGGT
TTCACAGCTAGCAACTAGGCCCTTGGTCTTTGGACAACCCAACGCATGGCTTTTGATTCTTCTCTTAGTTTCATTAGCCT
TGGTCTATGACATGAGGAAAAACATCAAAAGACTAGCAGGATTCAGTCTCTTTATCGTAGGACTCTTTTTCTTGACCAAG
CATCCGCTGGAAAATGAAATCACCATGCTGGATGTAGGGCAGGGAGAAAGCATTTTTCTAAGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTGGGTGGAAAGGCAGAATCCGACAAAAAAATCCAAGCTTGGCAGCAAAAGTCGACGGCTAGCAATG
CCCAGCGAACCTTGATTCCCTATCTTAAAAGTCGTGGAGTAGACAAGATTGACCAGCTGATTTTGACCAACACAGAGAAA
GAGCATGTTGGAGATTTACTAGAGGTGACCAGGGCTTTCCAAGTCGGGGAGATTTTAGTATCAAAAGGAAGTTTGACACA
GAAAGAATTTGTGGCAGAACTACAAGCGACTCAAACCAAGGTACGCAGTGTGACAGGGGGGGAGAATTTACCGATTTTTG
GCAGTTACTTAGAAGTCCTATCTACAAGGAAGATGGGAGATGGAAGTCATGATGATTCTCTTTTTCTTTATGGGAGGCTT
TTGGATAAGCACTTTCTCTTTACTGGAAACTTGAAGGAGAAGGGAGAAAAGGATCTTCTAAAGCAATACTCTACCTTAGA
GGTGGATGTCTTAAAAGTCGGTCAACATGGTTCTAAAACCTCATCAAATCTAGCTTTCCTAGAAAAACTCAAACCAGAAA
TTACTCTCATTTCAGTTGGAAAGAACAATCGTACGAAGCTACCCCATCAGGAAGCCTTGACACGACTGGAAACTATCAAT
AGTAAAATTTACCGAACTGACCAGAACGGAGCTATTCGCTTTAAAGGTTGGAAAAGTTGGCAAATCGAAAGTGTTCGTTA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
87.802 |
100 |
0.878 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
87.114 |
99.866 |
0.87 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
86.059 |
100 |
0.861 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
85.523 |
100 |
0.855 |
| comEC/celB | Streptococcus pneumoniae D39 |
85.523 |
100 |
0.855 |
| comEC/celB | Streptococcus pneumoniae R6 |
85.523 |
100 |
0.855 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
45.209 |
99.33 |
0.449 |