Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | NSQ79_RS08000 | Genome accession | NZ_CP150200 |
| Coordinates | 1740480..1742720 (-) | Length | 746 a.a. |
| NCBI ID | WP_339313492.1 | Uniprot ID | - |
| Organism | Streptococcus sp. FSL W7-1342 | ||
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake |
Genomic Context
Location: 1735480..1747720
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| NSQ79_RS07980 (NSQ79_07980) | - | 1736348..1737208 (-) | 861 | WP_004183173.1 | methionyl aminopeptidase | - |
| NSQ79_RS07985 (NSQ79_07985) | spxR | 1737248..1738528 (-) | 1281 | WP_002884454.1 | CBS-HotDog domain-containing transcription factor SpxR | - |
| NSQ79_RS07990 (NSQ79_07990) | - | 1738521..1739066 (-) | 546 | WP_164332044.1 | GNAT family N-acetyltransferase | - |
| NSQ79_RS07995 (NSQ79_07995) | - | 1739132..1740394 (-) | 1263 | WP_013991032.1 | UDP-N-acetylglucosamine 1-carboxyvinyltransferase | - |
| NSQ79_RS08000 (NSQ79_08000) | comEC/celB | 1740480..1742720 (-) | 2241 | WP_339313492.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| NSQ79_RS08005 (NSQ79_08005) | comEA | 1742710..1743405 (-) | 696 | WP_004183181.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| NSQ79_RS08010 (NSQ79_08010) | - | 1743514..1744269 (-) | 756 | WP_048790321.1 | 1-acyl-sn-glycerol-3-phosphate acyltransferase | - |
| NSQ79_RS08015 (NSQ79_08015) | - | 1744400..1745335 (+) | 936 | WP_129852028.1 | polysaccharide deacetylase family protein | - |
| NSQ79_RS08020 (NSQ79_08020) | - | 1745385..1746149 (+) | 765 | WP_037599202.1 | tRNA1(Val) (adenine(37)-N6)-methyltransferase | - |
| NSQ79_RS08025 (NSQ79_08025) | - | 1746153..1746422 (+) | 270 | WP_084870988.1 | GIY-YIG nuclease family protein | - |
| NSQ79_RS08030 (NSQ79_08030) | - | 1746514..1746900 (-) | 387 | WP_045768432.1 | IS110 family transposase | - |
| NSQ79_RS08035 (NSQ79_08035) | - | 1747022..1747681 (-) | 660 | WP_339313495.1 | transposase | - |
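The coordinates in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration, assuming 1-based inclusive coordinates (which matches the sizes listed in the table); the helper names `parse_location`, `feature_length_bp` and `overlap_bp` are hypothetical and not part of any database API. It shows that the comEC/celB span gives 2241 bp, that 2241 bp corresponds to 746 residues plus a stop codon, that the Genomic Context window equals the gene span extended by 5000 bp on each side, and that the comEC/celB and comEA annotations overlap by 11 bp.

```python
from typing import Tuple


def parse_location(location: str) -> Tuple[int, int, str]:
    """Parse a string such as '1740480..1742720 (-)' into (start, end, strand)."""
    coords, _, strand = location.partition(" ")
    start, end = (int(x) for x in coords.split(".."))
    return start, end, strand.strip("()")


def feature_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def overlap_bp(a: Tuple[int, int], b: Tuple[int, int]) -> int:
    """Number of positions shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Values copied from the record above.
start, end, strand = parse_location("1740480..1742720 (-)")
length_bp = feature_length_bp(start, end)

# Protein length: one codon per residue, minus the terminal stop codon.
protein_aa = length_bp // 3 - 1

# Window with an assumed 5000 bp flank on each side of the gene.
flank = 5000
window = (start - flank, end + flank)

# Overlap between comEC/celB and the adjacent comEA annotation.
comea = (1742710, 1743405)
shared = overlap_bp((start, end), comea)

print(length_bp, protein_aa, window, shared)
```

The same helpers apply to any other row of the table, for example `feature_length_bp(1742710, 1743405)` for comEA.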
Sequence
Protein
Length: 746 a.a.; Molecular weight: 84461.44 Da; Isoelectric point: 10.0200
>NTDB_id=966114 NSQ79_RS08000 WP_339313492.1 1740480..1742720(-) (comEC/celB) [Streptococcus sp. FSL W7-1342]
MWLKKAPINLFSLTLLIAALYFTIFVTNVYAMGAFAFLLGCFLKHHWKNKAALKLVGLVGSFFLVYFLFLHHRAIIQDKQ
APAEINQVTLVADTLSVNGEQLSAIGKAKGQTYQVFYRLKSEKEQHFYKTTSQTLVLKGKIKLSPATGQRNFQGFNYQSY
LSSQGIYRMAQIERLDHVVPQKTTSPLAFFHQLRRKALVHIQTHFPNPMRHYMTGLLFGYLDKEFDEQSQLYTSLGIIHL
FALSGMQVGFFLGWFRYGLLRLGLPKDYLFIVLLPFSLCYGLMTGWTASVLRSLIQSLLAEFGIKKLDNMGITLLLLFLF
LPHFLLTVGGVLSCSYAFLLCLFDFEEMPSLKKSICTSLVLSLGILPFLTFYYGTFQPVSLILTAMFSIVFDSFLLPVLT
VFFALSGLVIFSQINPLFEWMEAFLTWIQSWIGQPLILGKPSLFQFGLMIAVLVLLFDFWKKPQFRICLLMIFGLLMVWV
KHPLTNEVTMVDVGQGDSIFLRSMKGDTILIDVGGKVTFGSKEKWQEASQTSNAEKTLIPYLQARGVSQIDHLVLTHTDT
DHIGDLEEVAKRFKIKEVCVSQGALTKPSFVKRLRTLKRPVRTLKAGDKLPMMGSKLQVLYPNKIGDGGNNDSIVLYGKL
LGNSFLFTGDLEKEGEEELMVSYPNLKAGILKAGHHGSKGSSSEAFLDQLQPSLALVSAGENNRYKHPNDETLKRFKKRH
IKVLRTDQNGAIRFKGWFKWSSETVR
Nucleotide
Length: 2241 bp
>NTDB_id=966114 NSQ79_RS08000 WP_339313492.1 1740480..1742720(-) (comEC/celB) [Streptococcus sp. FSL W7-1342]
ATGTGGCTTAAAAAAGCTCCCATCAATCTTTTTTCCTTGACTCTTTTAATAGCTGCTCTTTATTTTACGATTTTTGTCAC
TAATGTTTATGCTATGGGGGCTTTTGCTTTTCTTTTAGGATGTTTTTTGAAGCATCATTGGAAGAATAAGGCAGCCCTAA
AGTTGGTGGGACTTGTAGGTAGTTTCTTTTTGGTCTACTTTTTATTCTTGCATCACAGAGCTATCATACAAGATAAACAA
GCTCCTGCTGAAATTAATCAGGTGACTCTGGTTGCTGATACGCTGTCGGTTAATGGTGAGCAATTATCAGCTATCGGAAA
GGCAAAGGGACAAACCTATCAGGTCTTTTACCGACTTAAATCTGAGAAGGAGCAGCATTTTTATAAGACTACTAGCCAAA
CGCTGGTATTAAAAGGGAAAATAAAGTTATCCCCAGCAACTGGTCAACGTAATTTTCAAGGGTTTAATTATCAGTCTTAT
CTATCCAGTCAAGGTATTTATCGAATGGCTCAGATTGAGCGCTTGGACCATGTGGTGCCTCAAAAAACCACATCTCCCCT
AGCTTTTTTTCATCAACTGAGGAGGAAGGCTTTGGTTCATATCCAGACGCACTTTCCTAATCCTATGAGACACTATATGA
CAGGCCTACTCTTTGGGTATTTGGATAAGGAGTTTGATGAGCAAAGTCAGCTTTACACAAGTTTAGGGATTATTCATCTA
TTCGCACTTTCAGGGATGCAAGTGGGCTTTTTTCTGGGATGGTTTCGCTACGGACTCCTACGCTTGGGCCTTCCTAAAGA
TTATCTATTTATTGTTTTACTGCCTTTTTCCTTATGCTATGGCTTAATGACAGGCTGGACAGCTTCAGTCCTACGTTCCC
TGATTCAAAGTTTGCTGGCGGAGTTTGGTATTAAAAAACTGGACAATATGGGGATAACCTTGCTTCTATTGTTTCTCTTT
TTACCTCATTTTCTTTTAACAGTGGGAGGTGTTTTAAGTTGTTCCTACGCCTTCTTGTTGTGTTTGTTTGATTTTGAGGA
GATGCCATCTCTTAAAAAGTCAATCTGTACGAGCTTAGTGTTGAGTCTTGGGATTTTGCCTTTTCTAACTTTCTATTATG
GGACCTTTCAACCGGTGAGTTTAATTCTGACGGCGATGTTTTCGATAGTATTTGATAGCTTTCTCTTGCCTGTATTGACA
GTCTTCTTTGCTCTTTCAGGACTGGTAATCTTTTCTCAAATCAACCCACTTTTTGAATGGATGGAGGCCTTTTTGACTTG
GATACAATCCTGGATAGGCCAGCCTTTGATTTTAGGGAAACCTAGTTTGTTTCAGTTTGGCTTGATGATAGCTGTATTGG
TTCTGCTCTTTGATTTTTGGAAAAAGCCCCAGTTTAGGATTTGCCTTTTGATGATTTTTGGTCTTTTGATGGTCTGGGTC
AAACATCCTTTAACCAATGAGGTGACTATGGTTGACGTTGGTCAGGGAGATAGTATTTTTCTGAGGAGTATGAAAGGCGA
TACCATCCTAATTGATGTGGGTGGCAAGGTGACTTTCGGCTCAAAGGAAAAATGGCAAGAGGCTAGTCAGACGAGCAATG
CGGAGAAAACCTTGATTCCCTATCTACAGGCTAGGGGAGTGTCTCAAATTGATCATCTGGTCCTGACTCATACGGACACA
GACCATATTGGTGATTTGGAAGAAGTAGCCAAGCGGTTTAAGATTAAGGAAGTCTGTGTCAGTCAGGGGGCTTTGACTAA
GCCTAGTTTTGTGAAACGACTTCGGACTTTAAAACGCCCAGTTCGCACTCTAAAAGCTGGTGATAAACTGCCTATGATGG
GAAGTAAGCTACAGGTTCTTTATCCAAATAAAATTGGTGATGGTGGTAACAATGATTCGATAGTTCTTTACGGAAAACTA
TTAGGAAACAGTTTTTTGTTTACGGGTGATTTGGAAAAGGAAGGAGAGGAGGAACTGATGGTCAGTTATCCCAATTTAAA
GGCAGGCATCCTCAAAGCTGGGCACCACGGTTCAAAAGGGTCATCGTCTGAAGCGTTTTTGGACCAGTTGCAGCCCTCCC
TTGCCCTTGTTTCAGCCGGTGAGAACAATCGTTATAAACATCCAAATGATGAAACTTTGAAGCGTTTTAAGAAACGTCAC
ATTAAGGTTTTACGAACAGACCAGAACGGTGCTATCCGTTTTAAGGGGTGGTTTAAGTGGTCAAGTGAAACTGTCCGATA
A
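The nucleotide sequence is listed in coding orientation even though the gene lies on the minus strand: it begins with ATG and ends with the stop codon TAA. A minimal sketch of a sanity check for such a coding sequence follows. The function `check_cds` is a hypothetical helper, and the short `toy` string is only a stand-in built from the first 12 and last 9 nucleotides of the record, not the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(seq: str) -> dict:
    """Basic consistency checks for a coding sequence given 5'->3' on the coding strand."""
    # Remove line breaks and other whitespace from FASTA-style wrapped text.
    seq = "".join(seq.split()).upper()
    return {
        "length_bp": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
        # Residue count is only meaningful if the last codon is a stop codon.
        "residues": len(seq) // 3 - 1,
    }


# Shortened stand-in: first 12 nt and last 9 nt of the sequence above.
toy = "ATGTGGCTTAAA" + "GTCCGATAA"
report = check_cds(toy)
print(report)
```

Run on the full 2241 bp sequence, the same check would be expected to report 746 residues, in line with the protein length given in the Overview.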
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis NCTC 12261 | 48.327 | 100 | 0.484 |
| comEC/celB | Streptococcus mitis SK321 | 47.995 | 100 | 0.481 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 46.791 | 100 | 0.469 |
| comEC/celB | Streptococcus pneumoniae D39 | 46.791 | 100 | 0.469 |
| comEC/celB | Streptococcus pneumoniae R6 | 46.791 | 100 | 0.469 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 46.452 | 100 | 0.465 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 44.189 | 99.196 | 0.438 |