Detailed information
Overview
| Field | Value |
|---|---|
| Name | comEC/celB |
| Type | Machinery gene |
| Locus tag | APQ97_RS03750 |
| Genome accession | NZ_CP012911 |
| Coordinates | 761709..763946 (+) |
| Length | 745 a.a. |
| NCBI ID | WP_002935350.1 |
| Uniprot ID | - |
| Organism | Streptococcus suis strain NSUI060 |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake |
Genomic Context
Location: 756709..768946
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| APQ97_RS03725 (APQ97_03715) | - | 756779..757708 (-) | 930 | WP_002937175.1 | ABC transporter substrate-binding protein | - |
| APQ97_RS03730 (APQ97_03720) | - | 757721..758506 (-) | 786 | WP_014636900.1 | ABC transporter ATP-binding protein | - |
| APQ97_RS03735 (APQ97_03725) | - | 758658..760103 (-) | 1446 | WP_024390938.1 | UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase | - |
| APQ97_RS03740 (APQ97_03730) | - | 760250..760996 (+) | 747 | WP_014636901.1 | 1-acyl-sn-glycerol-3-phosphate acyltransferase | - |
| APQ97_RS03745 (APQ97_03735) | comEA/celA/cilE | 761063..761725 (+) | 663 | WP_002935348.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| APQ97_RS03750 (APQ97_03740) | comEC/celB | 761709..763946 (+) | 2238 | WP_002935350.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| APQ97_RS12435 (APQ97_03745) | - | 763963..764088 (+) | 126 | Protein_708 | IS982 family transposase | - |
| APQ97_RS03760 (APQ97_03750) | - | 764270..764926 (+) | 657 | WP_002935353.1 | CBS domain-containing protein | - |
| APQ97_RS03765 (APQ97_03755) | - | 765020..765847 (+) | 828 | WP_002935354.1 | hypothetical protein | - |
| APQ97_RS03770 (APQ97_03760) | - | 765853..767124 (+) | 1272 | WP_002935355.1 | toxic anion resistance protein | - |
| APQ97_RS03775 (APQ97_03765) | - | 767161..768027 (-) | 867 | WP_002935357.1 | YitT family protein | - |
| APQ97_RS03780 (APQ97_03770) | tmk | 768202..768840 (+) | 639 | WP_002935358.1 | dTMP kinase | - |
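
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable and function names are illustrative and not part of the database. It also shows that the "Location" window extends 5000 bp on each side of the gene, which is an observation from the listed numbers rather than a documented rule.

```python
# Values copied from the Overview and Genomic Context sections of this record.
GENE_START, GENE_END = 761709, 763946        # comEC/celB coordinates (+ strand)
WINDOW_START, WINDOW_END = 756709, 768946    # "Location" of the context window


def span_length(start: int, end: int) -> int:
    """Length in bp of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


gene_bp = span_length(GENE_START, GENE_END)

# Assumption: the final codon is a stop codon and encodes no amino acid.
protein_aa = gene_bp // 3 - 1

# Distance from each window edge to the nearest gene boundary.
upstream_flank = GENE_START - WINDOW_START
downstream_flank = WINDOW_END - GENE_END

print(gene_bp, protein_aa, upstream_flank, downstream_flank)
# 2238 745 5000 5000
```

The computed 2238 bp and 745 a.a. agree with the size and length reported for APQ97_RS03750.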
Sequence
Protein
Length: 745 a.a.; Molecular weight: 84195.14 Da; Isoelectric point: 7.3813
>NTDB_id=158208 APQ97_RS03750 WP_002935350.1 761709..763946(+) (comEC/celB) [Streptococcus suis strain NSUI060]
MSRLSKLPCQPVHLAVLAVAAYFAVHSFSLLTMGLLSLLLLVFGFRQGKAVFIKTLPLLALCGLFFGCQKVQWERADQLA
PEQVTTMQVIPDTIEVNGDSLSFRGRADGQTYQVFYKLTSQEEQTYFQKLTGLVQLEVEAEISLPAGQRNFKGFDYQAYL
KTQGIYRTVKITAIKKIVPVQSWNVFDWLSNWRRQALVYVKTNFPAPMSHYMTGLLFGELDSEFDQMSDLYSSLGIIHLF
ALSGMQVGFVIDKFRWILLRLGLTKETVDKLQIPFSLVYASLTGFSVSVVRSLVQKILGNLGLRKLDNFAVTVFVCLLIL
PRFLLTAGGVLTFTYAFLLTVFDFEDLGQVKKAAVESLSISLGILPVLMTYFYAFQPLSILLTFAFSFVFDVLLLPGLSV
IFLLSPIVKITWVNGFFVFMEKIIVWVADLGLRPWILGKPSELVLLLLLVSLFLLYDFHRRKKWLLGLSLVLALLFFITK
HPLENEVTVVDIGQGDSIFLRDMRGRTVLIDVGGRVDFAAKEAWQERSRQANAERTLIPYLHSRGVDRIDSLVLTHTDTD
HVGDVLEVAKQVQIGRIYVSPGSLTVPDFVATLKEINVPVHVVEVGERLPIFDSYLEVLYPDGTGDGGNNDSIVLYGRLL
KTNFLFTGDLEQGELDLIESYPNLPVDVLKAGHHGSKGSSYPEFLDHIEAKIALVSAGENNRYKHPHQETLERFDSRNIQ
VYRTDQQGAIRFRGWREWRIETVRR
Nucleotide
Length: 2238 bp
>NTDB_id=158208 APQ97_RS03750 WP_002935350.1 761709..763946(+) (comEC/celB) [Streptococcus suis strain NSUI060]
ATGTCACGGTTGAGTAAGCTCCCCTGCCAGCCCGTTCACCTGGCAGTCTTGGCGGTGGCAGCTTACTTTGCAGTCCACTC
TTTTTCCCTCTTGACAATGGGCCTGCTGAGTTTGTTGTTGCTGGTCTTTGGATTTCGGCAAGGTAAGGCAGTTTTCATCA
AAACGCTGCCGCTTTTAGCCCTATGTGGTCTGTTTTTCGGATGCCAAAAGGTCCAATGGGAGCGGGCAGACCAGTTAGCC
CCAGAGCAAGTGACGACTATGCAGGTCATTCCAGATACGATTGAAGTCAACGGGGATAGCCTGTCCTTCCGCGGTCGGGC
TGACGGACAAACCTATCAGGTTTTCTATAAATTAACCAGTCAGGAGGAACAGACCTATTTTCAAAAGCTGACAGGTTTGG
TCCAGCTTGAAGTGGAAGCAGAGATCAGCCTGCCAGCAGGTCAGCGGAATTTCAAGGGCTTTGACTATCAGGCCTATCTG
AAAACACAGGGTATTTATCGGACGGTCAAGATTACGGCAATCAAGAAGATTGTCCCAGTCCAGTCTTGGAATGTCTTTGA
CTGGTTGTCAAACTGGCGGAGGCAGGCCTTGGTCTATGTCAAAACCAATTTTCCGGCTCCCATGAGCCACTACATGACAG
GGCTTTTGTTTGGGGAGCTGGATAGTGAGTTTGACCAGATGAGTGACCTCTATTCCAGTTTGGGTATCATTCATCTCTTT
GCCCTGTCTGGTATGCAGGTGGGCTTTGTCATCGACAAGTTTCGCTGGATTTTACTGCGTTTGGGCTTGACCAAAGAAAC
GGTTGATAAATTACAAATTCCCTTTTCCCTTGTCTATGCGAGCTTGACAGGTTTTTCCGTATCGGTGGTGCGGTCTTTGG
TTCAAAAAATTTTGGGTAATCTGGGCTTGCGGAAGTTGGATAACTTTGCGGTGACAGTTTTTGTCTGTCTGTTGATACTG
CCCCGATTTCTGCTGACAGCTGGTGGTGTCCTGACCTTTACCTATGCTTTTTTGCTGACGGTTTTTGATTTTGAGGACTT
GGGGCAGGTCAAAAAGGCTGCGGTGGAAAGTCTCAGTATTTCACTTGGGATTTTGCCAGTGCTCATGACCTATTTCTATG
CCTTTCAGCCCCTGTCTATCCTCTTGACCTTTGCCTTTTCTTTTGTCTTTGATGTCCTACTCTTGCCAGGCTTGTCAGTC
ATTTTTCTCCTGTCGCCAATCGTCAAGATTACTTGGGTCAACGGATTTTTTGTCTTTATGGAAAAAATCATTGTCTGGGT
GGCGGATTTGGGATTGCGACCTTGGATACTGGGCAAGCCGTCTGAGCTGGTCCTTTTGCTCTTGCTGGTCAGCCTCTTCT
TACTCTACGATTTCCACAGGAGAAAGAAATGGCTTCTAGGACTGAGCTTGGTCCTCGCTCTGCTATTTTTCATAACCAAA
CACCCGCTGGAAAACGAGGTGACGGTGGTGGACATCGGGCAGGGGGACAGCATCTTTTTGCGGGATATGCGGGGGCGGAC
GGTCCTGATTGATGTGGGCGGACGAGTTGATTTTGCGGCCAAGGAAGCGTGGCAGGAGCGGTCTCGGCAGGCAAATGCAG
AGCGGACTTTGATTCCCTATCTGCATAGTCGGGGTGTGGATAGGATTGATAGTTTAGTCCTGACCCACACCGACACAGAC
CATGTGGGGGATGTGCTGGAAGTGGCGAAGCAGGTCCAGATTGGTCGGATTTACGTGTCGCCGGGAAGCCTGACGGTGCC
TGACTTTGTAGCCACCTTGAAAGAAATTAATGTCCCTGTTCATGTTGTTGAGGTGGGCGAACGCTTACCAATTTTTGACT
CTTACCTGGAAGTCCTTTATCCAGATGGGACAGGAGACGGTGGCAATAATGACTCCATTGTCCTTTATGGTCGCTTGCTG
AAAACTAATTTTCTCTTTACAGGGGACTTGGAGCAGGGGGAGTTGGACTTGATAGAATCTTATCCTAACCTGCCTGTCGA
TGTCCTCAAAGCCGGTCACCACGGCTCCAAAGGTTCTTCCTATCCTGAATTTTTGGACCATATCGAAGCCAAGATTGCCT
TGGTATCTGCTGGAGAAAACAACCGCTACAAGCATCCTCACCAAGAAACCTTGGAACGATTTGATAGTCGCAATATCCAA
GTCTATCGTACAGACCAGCAAGGAGCCATTCGCTTCCGAGGTTGGAGGGAGTGGAGGATTGAAACGGTGAGGAGGTAA
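
As a quick consistency check between the nucleotide and protein records, the first and last codons of the coding sequence can be translated by hand or with a short script. The sketch below builds the standard genetic code table from the conventional TCAG ordering; the `translate` helper is illustrative and is not a tool provided by the database. The two test strings are copied from the start and end of the nucleotide sequence above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_codons = "ATGTCACGGTTGAGTAAGCTCCCCTGCCAG"   # first 30 nt of the gene
last_codons = "GAAACGGTGAGGAGGTAA"                # last 18 nt of the gene

print(translate(first_codons))  # MSRLSKLPCQ
print(translate(last_codons))   # ETVRR*
```

Both results match the protein record, which begins with MSRLSKLPCQ and ends with ETVRR; the `*` marks the TAA stop codon.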
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 | 53.836 | 99.732 | 0.537 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 52.632 | 99.463 | 0.523 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 51.752 | 99.597 | 0.515 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 51.482 | 99.597 | 0.513 |
| comEC/celB | Streptococcus pneumoniae D39 | 51.482 | 99.597 | 0.513 |
| comEC/celB | Streptococcus pneumoniae R6 | 51.482 | 99.597 | 0.513 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 48.594 | 100 | 0.487 |
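
This record does not define the Ha-value. As an observation only, the listed values lie close to the product of the identity and coverage fractions. The sketch below compares that product with each listed Ha-value; the formula is an assumption inferred from the numbers in the table, not a documented definition, and the last row differs from it by about 0.001.

```python
# (organism, identities %, coverage %, listed Ha-value), copied from the table.
ROWS = [
    ("Streptococcus mitis SK321", 53.836, 99.732, 0.537),
    ("Streptococcus mitis NCTC 12261", 52.632, 99.463, 0.523),
    ("Streptococcus pneumoniae TIGR4", 51.752, 99.597, 0.515),
    ("Streptococcus pneumoniae Rx1", 51.482, 99.597, 0.513),
    ("Streptococcus pneumoniae D39", 51.482, 99.597, 0.513),
    ("Streptococcus pneumoniae R6", 51.482, 99.597, 0.513),
    ("Lactococcus lactis subsp. cremoris KW2", 48.594, 100.0, 0.487),
]


def identity_times_coverage(identity_pct: float, coverage_pct: float) -> float:
    """Assumed score: product of identity and coverage, both as fractions."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Absolute difference between the assumed score and the listed Ha-value.
differences = [
    abs(identity_times_coverage(identity, coverage) - listed)
    for _, identity, coverage, listed in ROWS
]

for (organism, *_), diff in zip(ROWS, differences):
    print(f"{organism}: difference {diff:.4f}")
```

If the database documents a different formula, that definition takes precedence over this approximation.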