Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | BSR20_RS07940 | Genome accession | NZ_CP018189 |
| Coordinates | 1731712..1733952 (-) | Length | 746 a.a. |
| NCBI ID | WP_013991033.1 | Uniprot ID | A0AAP3Q6Q7 |
| Organism | Streptococcus salivarius strain ICDC3 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1726712..1738952
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| BSR20_RS07920 (BSR20_07990) | - | 1727579..1728439 (-) | 861 | WP_004183173.1 | methionyl aminopeptidase | - |
| BSR20_RS07925 (BSR20_07995) | spxR | 1728480..1729760 (-) | 1281 | WP_002884454.1 | CBS-HotDog domain-containing transcription factor SpxR | - |
| BSR20_RS07930 (BSR20_08000) | - | 1729753..1730298 (-) | 546 | WP_013991031.1 | GNAT family N-acetyltransferase | - |
| BSR20_RS07935 (BSR20_08005) | - | 1730364..1731626 (-) | 1263 | WP_013991032.1 | UDP-N-acetylglucosamine 1-carboxyvinyltransferase | - |
| BSR20_RS07940 (BSR20_08010) | comEC/celB | 1731712..1733952 (-) | 2241 | WP_013991033.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| BSR20_RS07945 (BSR20_08015) | comEA | 1733942..1734640 (-) | 699 | WP_013991034.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| BSR20_RS07950 (BSR20_08020) | - | 1734748..1735503 (-) | 756 | WP_013991035.1 | lysophospholipid acyltransferase family protein | - |
| BSR20_RS07955 (BSR20_08025) | - | 1735633..1736568 (+) | 936 | WP_045002039.1 | polysaccharide deacetylase family protein | - |
| BSR20_RS07960 (BSR20_08030) | - | 1736618..1737382 (+) | 765 | WP_013991037.1 | tRNA1(Val) (adenine(37)-N6)-methyltransferase | - |
| BSR20_RS07965 (BSR20_08035) | - | 1737386..1737655 (+) | 270 | WP_013991038.1 | GIY-YIG nuclease family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84531.46 Da Isoelectric Point: 9.8651
>NTDB_id=206931 BSR20_RS07940 WP_013991033.1 1731712..1733952(-) (comEC/celB) [Streptococcus salivarius strain ICDC3]
MWLKKAPINLFSLALLIAALYFTVFVTNVYAIGTFVFLMVCFLKHHWKNKAALKLVGIVGSFFLVYFLFLHHKATIQDKQ
APAEINQVTLVADTLSVNGEQLSAIGKAKGQTYQVFYRLKSEKEQHFFKTTSQTLVLKGKIKLSPATGQRNFQGFNYQSY
LASQGIYRMAQIERLDYVVSQKSLSPLAFFHQLRRRALVHIQTHFPSPMRHYMTGLLFGYLDKEFDEQSQLYTSLGIIHL
FALSGMQVGFFLGWFRYGLLRLGLPKDYLFIILLPFSLCYGLMTGWTASVLRSLIQSLLAEFGIKKLDNMGITLLLLFLF
LPHFLLTVGGVLSCSYAFLLCLFDFEEMSSLKKSICMSLVLSLGILPFLTYYYGTFQPVSLILTAIFSIVFDNFLLPVLT
VFFALSGLVIFSQINPLFEWMETFLTWIQSWIGQPLILGKPSLLQFSLMIAVLVMLFDFWKKPQFRICLLMIFGLLMVWV
KHPLTNEVTMVDVGQGDSIFLRSMKGDTILIDVGGKVTFGSKEKWQEGSQTSNAEKTLIPYLQARGVSQIDHLVLTHTDT
DHIGDLEEVAKRFKIKEICVSQGALTKPSFVKRLRTLKRPVHTLKAGDKLPMMGSKLQVLYPNKIGDGGNNDSIVLYGKL
LGSSFLFTGDLEKEGEEELMASYPTLRASVLKAGHHGSKGSSSEAFLDQLHPSLALVSAGENNRYKHPNDETLERFKQRH
IKVLRTDKDGAIRFKGWFKWSSETVR
MWLKKAPINLFSLALLIAALYFTVFVTNVYAIGTFVFLMVCFLKHHWKNKAALKLVGIVGSFFLVYFLFLHHKATIQDKQ
APAEINQVTLVADTLSVNGEQLSAIGKAKGQTYQVFYRLKSEKEQHFFKTTSQTLVLKGKIKLSPATGQRNFQGFNYQSY
LASQGIYRMAQIERLDYVVSQKSLSPLAFFHQLRRRALVHIQTHFPSPMRHYMTGLLFGYLDKEFDEQSQLYTSLGIIHL
FALSGMQVGFFLGWFRYGLLRLGLPKDYLFIILLPFSLCYGLMTGWTASVLRSLIQSLLAEFGIKKLDNMGITLLLLFLF
LPHFLLTVGGVLSCSYAFLLCLFDFEEMSSLKKSICMSLVLSLGILPFLTYYYGTFQPVSLILTAIFSIVFDNFLLPVLT
VFFALSGLVIFSQINPLFEWMETFLTWIQSWIGQPLILGKPSLLQFSLMIAVLVMLFDFWKKPQFRICLLMIFGLLMVWV
KHPLTNEVTMVDVGQGDSIFLRSMKGDTILIDVGGKVTFGSKEKWQEGSQTSNAEKTLIPYLQARGVSQIDHLVLTHTDT
DHIGDLEEVAKRFKIKEICVSQGALTKPSFVKRLRTLKRPVHTLKAGDKLPMMGSKLQVLYPNKIGDGGNNDSIVLYGKL
LGSSFLFTGDLEKEGEEELMASYPTLRASVLKAGHHGSKGSSSEAFLDQLHPSLALVSAGENNRYKHPNDETLERFKQRH
IKVLRTDKDGAIRFKGWFKWSSETVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=206931 BSR20_RS07940 WP_013991033.1 1731712..1733952(-) (comEC/celB) [Streptococcus salivarius strain ICDC3]
ATGTGGCTTAAAAAGGCTCCAATCAATCTTTTTTCCTTGGCTCTGTTAATAGCTGCTCTTTATTTTACGGTTTTTGTCAC
TAATGTTTATGCTATTGGGACTTTTGTCTTTCTTATGGTCTGTTTTTTGAAACATCATTGGAAGAATAAGGCTGCCCTAA
AGTTGGTGGGAATTGTAGGTAGTTTCTTTTTGGTCTACTTTTTATTCTTACACCACAAAGCTACCATACAAGATAAACAA
GCTCCTGCTGAAATCAATCAGGTGACTCTGGTTGCTGATACGTTATCGGTTAATGGTGAGCAATTATCAGCTATCGGAAA
GGCAAAGGGACAAACTTATCAGGTCTTTTACCGACTCAAATCTGAGAAGGAGCAGCATTTTTTTAAGACTACTAGCCAAA
CGCTGGTATTAAAGGGGAAAATAAAGTTATCCCCAGCAACTGGTCAACGTAATTTTCAAGGGTTCAATTATCAGTCTTAT
CTAGCCAGCCAAGGTATTTATCGAATGGCTCAGATTGAGCGCTTGGATTATGTGGTGTCTCAAAAATCTCTATCTCCCTT
AGCTTTTTTCCATCAATTGAGGAGAAGGGCTTTGGTTCATATCCAGACACATTTTCCTAGTCCTATGAGACACTATATGA
CAGGCCTACTCTTTGGGTATTTGGATAAGGAGTTCGATGAGCAGAGTCAGCTTTACACAAGCCTAGGTATTATTCATCTA
TTCGCACTTTCGGGGATGCAAGTGGGCTTTTTTCTGGGATGGTTTCGCTATGGACTCCTACGCTTGGGCCTTCCTAAAGA
TTATCTATTTATTATCTTGCTACCTTTTTCCTTATGCTATGGCTTAATGACAGGTTGGACAGCTTCAGTCCTACGTTCCC
TGATTCAAAGTTTGTTGGCGGAGTTTGGTATTAAAAAACTGGATAATATGGGAATAACCTTACTTCTATTGTTTCTCTTT
TTACCTCATTTTCTTTTAACAGTGGGAGGTGTTTTAAGTTGTTCCTATGCTTTTTTGTTGTGTTTGTTTGATTTTGAGGA
GATGTCATCTCTTAAAAAGTCAATCTGTATGAGCTTGGTATTGAGTCTTGGGATTTTGCCTTTTCTAACTTACTATTATG
GGACCTTTCAACCGGTGAGTTTAATTCTGACGGCGATATTTTCCATAGTCTTTGATAACTTTCTCTTGCCTGTCTTGACG
GTCTTCTTTGCCCTTTCAGGACTGGTAATCTTTTCCCAAATCAACCCACTTTTTGAATGGATGGAGACCTTTTTGACTTG
GATACAATCTTGGATAGGCCAGCCATTAATTTTAGGGAAACCTAGTTTGTTACAGTTTAGCTTGATGATAGCTGTATTGG
TTATGCTCTTTGATTTTTGGAAAAAGCCTCAGTTTAGGATTTGCCTTTTGATGATTTTTGGTCTTTTGATGGTCTGGGTC
AAACATCCTTTAACCAATGAGGTGACTATGGTTGACGTTGGTCAGGGAGATAGTATTTTTCTGAGGAGTATGAAAGGCGA
TACCATCCTAATTGATGTGGGTGGCAAGGTGACTTTCGGTTCAAAGGAAAAATGGCAGGAGGGGAGCCAGACGAGCAATG
CAGAGAAAACCTTGATTCCCTATCTACAGGCTAGAGGAGTGTCTCAAATTGATCACCTGGTCCTGACGCATACGGACACA
GACCATATTGGTGATTTGGAAGAAGTGGCCAAGAGGTTTAAGATTAAGGAAATCTGTGTCAGTCAGGGGGCTTTGACTAA
ACCTAGTTTTGTTAAAAGACTTCGGACCTTAAAACGACCAGTTCACACTCTAAAGGCTGGCGACAAATTACCCATGATGG
GAAGTAAGCTACAGGTTCTTTATCCCAATAAAATTGGTGATGGTGGTAACAACGATTCGATAGTTCTTTACGGAAAACTA
TTAGGAAGCAGTTTTTTGTTTACTGGTGATTTGGAAAAAGAAGGAGAGGAGGAGCTGATGGCCAGCTATCCAACTTTAAG
GGCAAGCGTCCTCAAAGCTGGACACCACGGTTCAAAAGGCTCATCGTCTGAAGCTTTTTTGGATCAGTTGCACCCCTCCC
TTGCACTTGTTTCAGCTGGTGAGAACAATCGTTATAAACATCCAAATGATGAAACATTAGAACGTTTTAAGCAACGTCAC
ATTAAGGTTTTACGAACAGATAAAGACGGAGCCATACGTTTTAAGGGTTGGTTTAAATGGTCAAGTGAAACTGTCCGATA
A
ATGTGGCTTAAAAAGGCTCCAATCAATCTTTTTTCCTTGGCTCTGTTAATAGCTGCTCTTTATTTTACGGTTTTTGTCAC
TAATGTTTATGCTATTGGGACTTTTGTCTTTCTTATGGTCTGTTTTTTGAAACATCATTGGAAGAATAAGGCTGCCCTAA
AGTTGGTGGGAATTGTAGGTAGTTTCTTTTTGGTCTACTTTTTATTCTTACACCACAAAGCTACCATACAAGATAAACAA
GCTCCTGCTGAAATCAATCAGGTGACTCTGGTTGCTGATACGTTATCGGTTAATGGTGAGCAATTATCAGCTATCGGAAA
GGCAAAGGGACAAACTTATCAGGTCTTTTACCGACTCAAATCTGAGAAGGAGCAGCATTTTTTTAAGACTACTAGCCAAA
CGCTGGTATTAAAGGGGAAAATAAAGTTATCCCCAGCAACTGGTCAACGTAATTTTCAAGGGTTCAATTATCAGTCTTAT
CTAGCCAGCCAAGGTATTTATCGAATGGCTCAGATTGAGCGCTTGGATTATGTGGTGTCTCAAAAATCTCTATCTCCCTT
AGCTTTTTTCCATCAATTGAGGAGAAGGGCTTTGGTTCATATCCAGACACATTTTCCTAGTCCTATGAGACACTATATGA
CAGGCCTACTCTTTGGGTATTTGGATAAGGAGTTCGATGAGCAGAGTCAGCTTTACACAAGCCTAGGTATTATTCATCTA
TTCGCACTTTCGGGGATGCAAGTGGGCTTTTTTCTGGGATGGTTTCGCTATGGACTCCTACGCTTGGGCCTTCCTAAAGA
TTATCTATTTATTATCTTGCTACCTTTTTCCTTATGCTATGGCTTAATGACAGGTTGGACAGCTTCAGTCCTACGTTCCC
TGATTCAAAGTTTGTTGGCGGAGTTTGGTATTAAAAAACTGGATAATATGGGAATAACCTTACTTCTATTGTTTCTCTTT
TTACCTCATTTTCTTTTAACAGTGGGAGGTGTTTTAAGTTGTTCCTATGCTTTTTTGTTGTGTTTGTTTGATTTTGAGGA
GATGTCATCTCTTAAAAAGTCAATCTGTATGAGCTTGGTATTGAGTCTTGGGATTTTGCCTTTTCTAACTTACTATTATG
GGACCTTTCAACCGGTGAGTTTAATTCTGACGGCGATATTTTCCATAGTCTTTGATAACTTTCTCTTGCCTGTCTTGACG
GTCTTCTTTGCCCTTTCAGGACTGGTAATCTTTTCCCAAATCAACCCACTTTTTGAATGGATGGAGACCTTTTTGACTTG
GATACAATCTTGGATAGGCCAGCCATTAATTTTAGGGAAACCTAGTTTGTTACAGTTTAGCTTGATGATAGCTGTATTGG
TTATGCTCTTTGATTTTTGGAAAAAGCCTCAGTTTAGGATTTGCCTTTTGATGATTTTTGGTCTTTTGATGGTCTGGGTC
AAACATCCTTTAACCAATGAGGTGACTATGGTTGACGTTGGTCAGGGAGATAGTATTTTTCTGAGGAGTATGAAAGGCGA
TACCATCCTAATTGATGTGGGTGGCAAGGTGACTTTCGGTTCAAAGGAAAAATGGCAGGAGGGGAGCCAGACGAGCAATG
CAGAGAAAACCTTGATTCCCTATCTACAGGCTAGAGGAGTGTCTCAAATTGATCACCTGGTCCTGACGCATACGGACACA
GACCATATTGGTGATTTGGAAGAAGTGGCCAAGAGGTTTAAGATTAAGGAAATCTGTGTCAGTCAGGGGGCTTTGACTAA
ACCTAGTTTTGTTAAAAGACTTCGGACCTTAAAACGACCAGTTCACACTCTAAAGGCTGGCGACAAATTACCCATGATGG
GAAGTAAGCTACAGGTTCTTTATCCCAATAAAATTGGTGATGGTGGTAACAACGATTCGATAGTTCTTTACGGAAAACTA
TTAGGAAGCAGTTTTTTGTTTACTGGTGATTTGGAAAAAGAAGGAGAGGAGGAGCTGATGGCCAGCTATCCAACTTTAAG
GGCAAGCGTCCTCAAAGCTGGACACCACGGTTCAAAAGGCTCATCGTCTGAAGCTTTTTTGGATCAGTTGCACCCCTCCC
TTGCACTTGTTTCAGCTGGTGAGAACAATCGTTATAAACATCCAAATGATGAAACATTAGAACGTTTTAAGCAACGTCAC
ATTAAGGTTTTACGAACAGATAAAGACGGAGCCATACGTTTTAAGGGTTGGTTTAAATGGTCAAGTGAAACTGTCCGATA
A
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
48.529 |
100 |
0.487 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
48.193 |
100 |
0.483 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
46.658 |
100 |
0.468 |
| comEC/celB | Streptococcus pneumoniae D39 |
46.658 |
100 |
0.468 |
| comEC/celB | Streptococcus pneumoniae R6 |
46.658 |
100 |
0.468 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
46.524 |
100 |
0.466 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
43.649 |
99.196 |
0.433 |