Detailed information
Overview
| Field | Value |
|---|---|
| Name | comEC/celB |
| Type | Machinery gene |
| Locus tag | K6971_RS03450 |
| Genome accession | NZ_CP082200 |
| Coordinates | 691995..694232 (+) |
| Length | 745 a.a. |
| NCBI ID | WP_012775040.1 |
| Uniprot ID | - |
| Organism | Streptococcus suis strain Transconjugant cDY107 |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake |
Genomic Context
Location: 686995..699232
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| K6971_RS03425 (K6971_03425) | - | 687066..687995 (-) | 930 | WP_012775038.1 | ABC transporter substrate-binding protein | - |
| K6971_RS03430 (K6971_03430) | - | 688008..688793 (-) | 786 | WP_011922762.1 | ABC transporter ATP-binding protein | - |
| K6971_RS03435 (K6971_03435) | - | 688945..690390 (-) | 1446 | WP_011922188.1 | UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase | - |
| K6971_RS03440 (K6971_03440) | - | 690537..691286 (+) | 750 | WP_012775039.1 | 1-acyl-sn-glycerol-3-phosphate acyltransferase | - |
| K6971_RS03445 (K6971_03445) | - | 691349..692011 (+) | 663 | WP_011922190.1 | helix-hairpin-helix domain-containing protein | - |
| K6971_RS03450 (K6971_03450) | comEC/celB | 691995..694232 (+) | 2238 | WP_012775040.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| K6971_RS03455 (K6971_03455) | - | 694249..694374 (+) | 126 | Protein_639 | IS982 family transposase | - |
| K6971_RS03460 (K6971_03460) | - | 694557..695213 (+) | 657 | WP_011922763.1 | CBS domain-containing protein | - |
| K6971_RS03465 (K6971_03465) | - | 695307..696134 (+) | 828 | WP_011922194.1 | hypothetical protein | - |
| K6971_RS03470 (K6971_03470) | - | 696140..697411 (+) | 1272 | WP_011922195.1 | toxic anion resistance protein | - |
| K6971_RS03475 (K6971_03475) | - | 697448..698314 (-) | 867 | WP_002935357.1 | YitT family protein | - |
| K6971_RS03480 (K6971_03480) | tmk | 698489..699127 (+) | 639 | WP_012775041.1 | dTMP kinase | - |
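The coordinates in the tables above are 1-based and inclusive, so the listed sizes follow directly from them. The sketch below re-derives the comEC/celB gene length, the protein length, and the displayed context window, and measures the overlap with the upstream gene K6971_RS03445. The `Feature` class, the `overlap_bp` helper, and the 5000 bp flank are illustrative assumptions inferred from the numbers shown here, not part of the database.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    """A genomic feature with 1-based, inclusive coordinates (hypothetical helper)."""
    locus_tag: str
    start: int
    end: int
    strand: str

    @property
    def length_bp(self) -> int:
        # Inclusive coordinates: both end points count.
        return self.end - self.start + 1


def overlap_bp(a: Feature, b: Feature) -> int:
    """Number of positions shared by two features (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


# Values copied from the Genomic Context table.
COMEC = Feature("K6971_RS03450", 691995, 694232, "+")
UPSTREAM = Feature("K6971_RS03445", 691349, 692011, "+")

# Assumption: the context window is the gene extended by 5000 bp on each side.
FLANK = 5000

codons = COMEC.length_bp // 3          # includes the stop codon
residues = codons - 1                  # stop codon is not translated
window = (COMEC.start - FLANK, COMEC.end + FLANK)

print(COMEC.length_bp)                 # 2238
print(residues)                        # 745
print(window)                          # (686995, 699232)
print(overlap_bp(UPSTREAM, COMEC))     # 17
```

The 17 bp overlap means the annotated start of comEC/celB lies inside the end of the upstream helix-hairpin-helix gene on the same strand.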
Sequence
Protein
Length: 745 a.a.; Molecular weight: 83965.92 Da; Isoelectric point: 7.8195
>NTDB_id=599774 K6971_RS03450 WP_012775040.1 691995..694232(+) (comEC/celB) [Streptococcus suis strain Transconjugant cDY107]
MSRLSKLPCQPVHLAVLAVAAYFAVHSFSLLTMCLLSLVLAVFGVRQGKTVFLKTLPLLTLCGLFFGCQKVQWERAAQSA
PEQVTTVQVIPDTIDINGDSLSFRGRADGQNYQVFYKLTSQEEQTYFQNLADLVQLEVEAEVSLPAGQRNFKGFDYQAYL
KTQGIYRTVKLTAIKKIVPVQSWNVFDWLSTWRRQALVYVKTNFPAPMSHYMTGLLFGELDSDFDQMSDLYSSLGIIHLF
ALSGMQVGFFIDKFRWILLRLGLTKETVDKLQIPFSLVYAGLTGFSVSVVRSLVQKILGNLGLRKLDNFAVTVFICLLIL
PRFLLTAGGVLTFTYAFLLTVFDFEDLGQVKKAAVESLSISLGILPVLMTYFYAFQPLSILLTFVFSFVFDVLLLPGLSV
IFLLSPVVKMTWVNGFFVFMEKIIVWVADLGLRPWILGKPSELVLLLLLVSLFLLYDFHRRKKWLLGLSLVLALLFFITK
HPLENEVTVLDIGQGDSIFLRDMRGRTVLIDVGGRVDFAAKEAWQERSRQANAERTLIPYLHSRGVDRIDSLVLTHTDTD
HVGDVLEVAKQVQIGRIVVSPGSLTVPDFVATLKKINVPVHVVKVGDRLPIFDSYLEVLYPDGTGDGGNNDSIVLYGRLL
ETNFLFTGDLEQGELDLIETYPNLPVDVLKAGHHGSKGSSYPEFLDHIGAKIALVSAGENNRYKHPHQETLERFDSRNIQ
VYRTDQQGAIRFRGWKEWKIETVRR
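The FASTA header of this record packs the database ID, locus tag, protein accession, coordinates, strand, gene name, and organism into one line. A minimal sketch of how such a header could be split into fields is shown below; the regular expression and the `parse_header` function are assumptions based only on the header shown in this entry, and other records may use a different layout.

```python
import re

# Header copied from this record.
HEADER = (
    ">NTDB_id=599774 K6971_RS03450 WP_012775040.1 691995..694232(+) "
    "(comEC/celB) [Streptococcus suis strain Transconjugant cDY107]"
)

# Hypothetical pattern, written to match the header layout seen above.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise ValueError if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    fields = match.groupdict()
    # Convert the numeric fields so they can be used in arithmetic.
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


record = parse_header(HEADER)
print(record["locus_tag"], record["start"], record["end"], record["strand"])
```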
Nucleotide
Length: 2238 bp
>NTDB_id=599774 K6971_RS03450 WP_012775040.1 691995..694232(+) (comEC/celB) [Streptococcus suis strain Transconjugant cDY107]
ATGTCACGGTTGAGTAAGCTCCCCTGCCAGCCCGTTCACCTGGCAGTCTTGGCGGTGGCAGCTTACTTTGCAGTCCACTC
TTTTTCCCTCTTGACAATGTGCCTGCTCAGCTTGGTGTTAGCTGTCTTTGGTGTTCGGCAAGGTAAGACAGTTTTCCTCA
AAACGCTGCCGCTTCTAACCCTGTGTGGTCTCTTTTTCGGATGCCAAAAGGTCCAATGGGAGCGGGCAGCTCAGTCAGCC
CCAGAGCAAGTGACGACTGTGCAGGTCATTCCAGATACCATAGACATCAATGGCGACAGCTTGTCTTTTCGTGGTCGGGC
TGATGGACAAAACTATCAGGTTTTCTACAAATTAACCAGTCAGGAGGAGCAGACCTATTTTCAAAATCTGGCAGACCTGG
TCCAGCTAGAAGTGGAAGCAGAGGTCAGCCTGCCAGCTGGTCAGCGGAATTTCAAGGGCTTTGACTATCAGGCCTATCTG
AAAACACAGGGCATCTATCGGACTGTCAAGCTAACGGCGATTAAGAAGATTGTCCCTGTCCAGTCTTGGAATGTTTTTGA
CTGGCTGTCAACTTGGCGGAGGCAGGCCTTGGTCTATGTCAAAACCAATTTTCCGGCTCCCATGAGTCACTACATGACAG
GGCTTTTGTTTGGGGAGCTGGATAGTGATTTTGACCAGATGAGTGACCTCTATTCCAGTCTAGGAATCATTCATCTCTTT
GCCCTGTCTGGTATGCAGGTGGGCTTTTTCATCGACAAGTTTCGTTGGATTTTGTTGCGTCTGGGCTTGACCAAGGAAAC
AGTGGATAAGTTACAAATTCCCTTTTCCCTTGTCTATGCTGGTTTGACAGGCTTTTCTGTGTCGGTGGTGCGGTCCTTGG
TCCAAAAAATTCTGGGTAATCTAGGCTTGCGGAAGTTGGATAATTTTGCGGTGACGGTCTTTATCTGTCTGTTGATACTG
CCCCGATTTTTACTGACAGCAGGAGGAGTGCTGACTTTTACTTATGCTTTCTTGCTGACGGTCTTTGATTTTGAGGATTT
GGGGCAGGTCAAAAAGGCTGCGGTGGAGAGCCTCAGTATTTCACTGGGAATTTTGCCGGTGCTTATGACCTATTTCTATG
CCTTTCAGCCTCTATCTATCCTCTTGACCTTTGTCTTTTCCTTTGTCTTTGATGTCCTACTCTTGCCAGGTTTGTCAGTC
ATTTTTCTCCTGTCGCCAGTTGTCAAGATGACTTGGGTCAACGGATTTTTCGTCTTTATGGAGAAAATCATTGTCTGGGT
GGCTGATTTGGGATTGCGGCCTTGGATACTGGGCAAGCCGTCTGAGCTGGTCCTCTTGCTTTTGCTGGTCAGCCTCTTCT
TGCTCTACGATTTCCACAGGAGAAAGAAATGGCTTCTAGGACTGAGCTTGGTCCTCGCTCTGCTATTTTTCATCACCAAA
CACCCGCTGGAAAATGAGGTGACGGTGCTGGACATCGGGCAGGGGGACAGCATCTTTTTGCGGGATATGCGGGGGCGGAC
GGTTCTGATTGATGTAGGCGGACGAGTCGATTTCGCGGCCAAGGAAGCGTGGCAGGAGCGGTCTCGGCAGGCAAATGCAG
AGCGGACTTTGATTCCCTATCTGCATAGTCGGGGTGTGGATAGGATTGATAGTTTAGTCCTGACCCACACCGACACAGAC
CATGTGGGGGATGTGCTGGAAGTGGCGAAGCAGGTCCAGATTGGTCGGATTGTCGTGTCGCCGGGAAGTCTGACGGTGCC
TGACTTTGTAGCGACCTTGAAAAAAATCAATGTTCCTGTCCATGTGGTAAAAGTGGGCGACCGCCTACCAATTTTTGACT
CCTATCTGGAGGTCCTCTATCCAGACGGGACAGGCGACGGTGGCAATAATGACTCCATTGTCCTTTACGGTCGCTTGTTG
GAAACTAACTTTCTCTTTACAGGAGACCTGGAGCAAGGGGAGTTGGACTTGATAGAAACCTATCCCAACCTGCCTGTCGA
TGTCCTCAAAGCCGGTCACCATGGCTCCAAAGGCTCTTCTTATCCTGAATTTTTAGACCATATCGGTGCCAAGATTGCCT
TGGTATCTGCTGGAGAAAACAACCGCTACAAGCATCCTCACCAAGAAACCTTGGAACGATTTGACAGCCGGAATATCCAA
GTCTATCGCACAGACCAGCAAGGAGCCATTCGCTTCCGAGGTTGGAAGGAGTGGAAGATTGAGACGGTGAGGAGGTAA
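As a consistency check between the two sequences, the end of the nucleotide record can be translated and compared with the end of the protein record. The sketch below does this for the final 78 nt line only, which starts in frame because the 27 preceding lines of 80 nt total 2160 nt, a multiple of 3. The `translate` helper is illustrative and uses the standard codon assignments; it is not the database's own tooling.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Last line of the nucleotide record and last line of the protein record.
LAST_NT_LINE = (
    "GTCTATCGCACAGACCAGCAAGGAGCCATTCGCTTCCGAGGTTGGAAGGAGTGGAAGATTGAGACGGTGAGGAGGTAA"
)
LAST_AA_LINE = "VYRTDQQGAIRFRGWKEWKIETVRR"

tail = translate(LAST_NT_LINE)
print(tail)  # VYRTDQQGAIRFRGWKEWKIETVRR*
print(tail.rstrip("*") == LAST_AA_LINE)  # True
```

The translated tail ends in a single stop codon (TAA) and otherwise matches the last 25 residues of the protein, consistent with 2238 bp encoding 745 amino acids plus a stop.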
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 | 53.774 | 99.597 | 0.536 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 52.782 | 98.926 | 0.522 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 51.973 | 98.658 | 0.513 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 51.701 | 98.658 | 0.51 |
| comEC/celB | Streptococcus pneumoniae D39 | 51.701 | 98.658 | 0.51 |
| comEC/celB | Streptococcus pneumoniae R6 | 51.701 | 98.658 | 0.51 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 49.399 | 100 | 0.497 |