Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | NWE21_RS03670 | Genome accession | NZ_CP102748 |
| Coordinates | 720204..722441 (+) | Length | 745 a.a. |
| NCBI ID | WP_012775040.1 | Uniprot ID | - |
| Organism | Streptococcus suis strain Transconjugant cSFJ45 |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake |
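The coordinates, nucleotide length and protein length in the Overview can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Overview table.
start, end = 720204, 722441   # assumed 1-based, inclusive coordinates
protein_len = 745             # residues (a.a.)

gene_len = end - start + 1               # inclusive span in bp
codons, remainder = divmod(gene_len, 3)  # whole codons and leftover bases

# Assumption: the final codon is a stop codon and adds no residue.
expected_protein_len = codons - 1

print(gene_len, codons, remainder, expected_protein_len)
# prints: 2238 746 0 745
```

Under these assumptions the 2238 bp span corresponds to 746 codons, that is 745 residues plus one stop codon, which agrees with the lengths reported for this entry.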
Genomic Context
Location: 715204..727441
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| NWE21_RS03645 (NWE21_03655) | - | 715275..716204 (-) | 930 | WP_012775038.1 | ABC transporter substrate-binding protein | - |
| NWE21_RS03650 (NWE21_03660) | - | 716217..717002 (-) | 786 | WP_011922762.1 | ABC transporter ATP-binding protein | - |
| NWE21_RS03655 (NWE21_03665) | - | 717154..718599 (-) | 1446 | WP_011922188.1 | UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase | - |
| NWE21_RS03660 (NWE21_03670) | - | 718746..719495 (+) | 750 | WP_012775039.1 | lysophospholipid acyltransferase family protein | - |
| NWE21_RS03665 (NWE21_03675) | - | 719558..720220 (+) | 663 | WP_011922190.1 | helix-hairpin-helix domain-containing protein | - |
| NWE21_RS03670 (NWE21_03680) | comEC/celB | 720204..722441 (+) | 2238 | WP_012775040.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| NWE21_RS03675 (NWE21_03685) | - | 722458..722583 (+) | 126 | Protein_680 | IS982 family transposase | - |
| NWE21_RS03680 (NWE21_03690) | - | 722766..723422 (+) | 657 | WP_011922763.1 | CBS domain-containing protein | - |
| NWE21_RS03685 (NWE21_03695) | - | 723516..724343 (+) | 828 | WP_011922194.1 | hypothetical protein | - |
| NWE21_RS03690 (NWE21_03700) | - | 724349..725620 (+) | 1272 | WP_011922195.1 | toxic anion resistance protein | - |
| NWE21_RS03695 (NWE21_03705) | - | 725657..726523 (-) | 867 | WP_002935357.1 | YitT family protein | - |
| NWE21_RS03700 (NWE21_03710) | tmk | 726698..727336 (+) | 639 | WP_012775041.1 | dTMP kinase | - |
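The coordinate strings in the table can be parsed to see how comEC/celB sits relative to its immediate neighbours. The following is a minimal sketch, not part of the database: `parse_coords` and `gap` are hypothetical helper names, the three rows are copied from the table above, and coordinates are assumed to be 1-based and inclusive.

```python
import re

# Rows copied from the Genomic Context table: (locus tag, coordinates).
rows = [
    ("NWE21_RS03665", "719558..720220 (+)"),
    ("NWE21_RS03670", "720204..722441 (+)"),  # comEC/celB
    ("NWE21_RS03675", "722458..722583 (+)"),
]

def parse_coords(text):
    """Return (start, end, strand) from a string such as '720204..722441 (+)'."""
    m = re.fullmatch(r"(\d+)\.\.(\d+)\s*\(([+-])\)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    return int(m.group(1)), int(m.group(2)), m.group(3)

def gap(upstream, downstream):
    """Bases between two features; a negative value means they overlap."""
    return downstream[0] - upstream[1] - 1

coords = {tag: parse_coords(c) for tag, c in rows}
gene = coords["NWE21_RS03670"]

upstream_gap = gap(coords["NWE21_RS03665"], gene)    # -17: a 17 bp overlap
downstream_gap = gap(gene, coords["NWE21_RS03675"])  # 16 bp intergenic gap

# The displayed Location matches the gene extended by 5000 bp on each side.
window = (gene[0] - 5000, gene[1] + 5000)

print(upstream_gap, downstream_gap, window)
# prints: -17 16 (715204, 727441)
```

With these rows, the upstream helix-hairpin-helix gene overlaps the start of comEC/celB by 17 bp, and the IS982 transposase fragment begins 16 bp after its end.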
Sequence
Protein
Length: 745 a.a.; Molecular weight: 83965.92 Da; Isoelectric point: 7.8195
>NTDB_id=718748 NWE21_RS03670 WP_012775040.1 720204..722441(+) (comEC/celB) [Streptococcus suis strain Transconjugant cSFJ45]
MSRLSKLPCQPVHLAVLAVAAYFAVHSFSLLTMCLLSLVLAVFGVRQGKTVFLKTLPLLTLCGLFFGCQKVQWERAAQSA
PEQVTTVQVIPDTIDINGDSLSFRGRADGQNYQVFYKLTSQEEQTYFQNLADLVQLEVEAEVSLPAGQRNFKGFDYQAYL
KTQGIYRTVKLTAIKKIVPVQSWNVFDWLSTWRRQALVYVKTNFPAPMSHYMTGLLFGELDSDFDQMSDLYSSLGIIHLF
ALSGMQVGFFIDKFRWILLRLGLTKETVDKLQIPFSLVYAGLTGFSVSVVRSLVQKILGNLGLRKLDNFAVTVFICLLIL
PRFLLTAGGVLTFTYAFLLTVFDFEDLGQVKKAAVESLSISLGILPVLMTYFYAFQPLSILLTFVFSFVFDVLLLPGLSV
IFLLSPVVKMTWVNGFFVFMEKIIVWVADLGLRPWILGKPSELVLLLLLVSLFLLYDFHRRKKWLLGLSLVLALLFFITK
HPLENEVTVLDIGQGDSIFLRDMRGRTVLIDVGGRVDFAAKEAWQERSRQANAERTLIPYLHSRGVDRIDSLVLTHTDTD
HVGDVLEVAKQVQIGRIVVSPGSLTVPDFVATLKKINVPVHVVKVGDRLPIFDSYLEVLYPDGTGDGGNNDSIVLYGRLL
ETNFLFTGDLEQGELDLIETYPNLPVDVLKAGHHGSKGSSYPEFLDHIGAKIALVSAGENNRYKHPHQETLERFDSRNIQ
VYRTDQQGAIRFRGWKEWKIETVRR
Nucleotide
Length: 2238 bp
>NTDB_id=718748 NWE21_RS03670 WP_012775040.1 720204..722441(+) (comEC/celB) [Streptococcus suis strain Transconjugant cSFJ45]
ATGTCACGGTTGAGTAAGCTCCCCTGCCAGCCCGTTCACCTGGCAGTCTTGGCGGTGGCAGCTTACTTTGCAGTCCACTC
TTTTTCCCTCTTGACAATGTGCCTGCTCAGCTTGGTGTTAGCTGTCTTTGGTGTTCGGCAAGGTAAGACAGTTTTCCTCA
AAACGCTGCCGCTTCTAACCCTGTGTGGTCTCTTTTTCGGATGCCAAAAGGTCCAATGGGAGCGGGCAGCTCAGTCAGCC
CCAGAGCAAGTGACGACTGTGCAGGTCATTCCAGATACCATAGACATCAATGGCGACAGCTTGTCTTTTCGTGGTCGGGC
TGATGGACAAAACTATCAGGTTTTCTACAAATTAACCAGTCAGGAGGAGCAGACCTATTTTCAAAATCTGGCAGACCTGG
TCCAGCTAGAAGTGGAAGCAGAGGTCAGCCTGCCAGCTGGTCAGCGGAATTTCAAGGGCTTTGACTATCAGGCCTATCTG
AAAACACAGGGCATCTATCGGACTGTCAAGCTAACGGCGATTAAGAAGATTGTCCCTGTCCAGTCTTGGAATGTTTTTGA
CTGGCTGTCAACTTGGCGGAGGCAGGCCTTGGTCTATGTCAAAACCAATTTTCCGGCTCCCATGAGTCACTACATGACAG
GGCTTTTGTTTGGGGAGCTGGATAGTGATTTTGACCAGATGAGTGACCTCTATTCCAGTCTAGGAATCATTCATCTCTTT
GCCCTGTCTGGTATGCAGGTGGGCTTTTTCATCGACAAGTTTCGTTGGATTTTGTTGCGTCTGGGCTTGACCAAGGAAAC
AGTGGATAAGTTACAAATTCCCTTTTCCCTTGTCTATGCTGGTTTGACAGGCTTTTCTGTGTCGGTGGTGCGGTCCTTGG
TCCAAAAAATTCTGGGTAATCTAGGCTTGCGGAAGTTGGATAATTTTGCGGTGACGGTCTTTATCTGTCTGTTGATACTG
CCCCGATTTTTACTGACAGCAGGAGGAGTGCTGACTTTTACTTATGCTTTCTTGCTGACGGTCTTTGATTTTGAGGATTT
GGGGCAGGTCAAAAAGGCTGCGGTGGAGAGCCTCAGTATTTCACTGGGAATTTTGCCGGTGCTTATGACCTATTTCTATG
CCTTTCAGCCTCTATCTATCCTCTTGACCTTTGTCTTTTCCTTTGTCTTTGATGTCCTACTCTTGCCAGGTTTGTCAGTC
ATTTTTCTCCTGTCGCCAGTTGTCAAGATGACTTGGGTCAACGGATTTTTCGTCTTTATGGAGAAAATCATTGTCTGGGT
GGCTGATTTGGGATTGCGGCCTTGGATACTGGGCAAGCCGTCTGAGCTGGTCCTCTTGCTTTTGCTGGTCAGCCTCTTCT
TGCTCTACGATTTCCACAGGAGAAAGAAATGGCTTCTAGGACTGAGCTTGGTCCTCGCTCTGCTATTTTTCATCACCAAA
CACCCGCTGGAAAATGAGGTGACGGTGCTGGACATCGGGCAGGGGGACAGCATCTTTTTGCGGGATATGCGGGGGCGGAC
GGTTCTGATTGATGTAGGCGGACGAGTCGATTTCGCGGCCAAGGAAGCGTGGCAGGAGCGGTCTCGGCAGGCAAATGCAG
AGCGGACTTTGATTCCCTATCTGCATAGTCGGGGTGTGGATAGGATTGATAGTTTAGTCCTGACCCACACCGACACAGAC
CATGTGGGGGATGTGCTGGAAGTGGCGAAGCAGGTCCAGATTGGTCGGATTGTCGTGTCGCCGGGAAGTCTGACGGTGCC
TGACTTTGTAGCGACCTTGAAAAAAATCAATGTTCCTGTCCATGTGGTAAAAGTGGGCGACCGCCTACCAATTTTTGACT
CCTATCTGGAGGTCCTCTATCCAGACGGGACAGGCGACGGTGGCAATAATGACTCCATTGTCCTTTACGGTCGCTTGTTG
GAAACTAACTTTCTCTTTACAGGAGACCTGGAGCAAGGGGAGTTGGACTTGATAGAAACCTATCCCAACCTGCCTGTCGA
TGTCCTCAAAGCCGGTCACCATGGCTCCAAAGGCTCTTCTTATCCTGAATTTTTAGACCATATCGGTGCCAAGATTGCCT
TGGTATCTGCTGGAGAAAACAACCGCTACAAGCATCCTCACCAAGAAACCTTGGAACGATTTGACAGCCGGAATATCCAA
GTCTATCGCACAGACCAGCAAGGAGCCATTCGCTTCCGAGGTTGGAAGGAGTGGAAGATTGAGACGGTGAGGAGGTAA
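As a quick consistency check, the two ends of the nucleotide record can be translated and compared with the protein record. This is a minimal sketch, assuming the standard genetic code (whose codon assignments are the same as in the bacterial translation table 11); the `translate` function and variable names are hypothetical, and only the first 30 and last 18 bases of the sequence above are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

first_30 = "ATGTCACGGTTGAGTAAGCTCCCCTGCCAG"  # start of the nucleotide record
last_18 = "GAGACGGTGAGGAGGTAA"              # end of the nucleotide record

print(translate(first_30))  # prints: MSRLSKLPCQ
print(translate(last_18))   # prints: ETVRR*
```

Both fragments match the protein record, which starts with MSRLSKLPCQ and ends with ETVRR, followed here by the TAA stop codon.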
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 | 53.774 | 99.597 | 0.536 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 52.782 | 98.926 | 0.522 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 51.973 | 98.658 | 0.513 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 51.701 | 98.658 | 0.51 |
| comEC/celB | Streptococcus pneumoniae D39 | 51.701 | 98.658 | 0.51 |
| comEC/celB | Streptococcus pneumoniae R6 | 51.701 | 98.658 | 0.51 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 49.399 | 100 | 0.497 |