Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | ZY05719_RS03195 | Genome accession | NZ_CP007497 |
| Coordinates | 635995..638232 (+) | Length | 745 a.a. |
| NCBI ID | WP_012775040.1 | Uniprot ID | - |
| Organism | Streptococcus suis strain ZY05719 isolate diseased pig | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 630995..643232
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| ZY05719_RS03170 (ZY05719_03160) | - | 631066..631995 (-) | 930 | WP_012775038.1 | ABC transporter substrate-binding protein | - |
| ZY05719_RS03175 (ZY05719_03165) | - | 632008..632793 (-) | 786 | WP_011922762.1 | ABC transporter ATP-binding protein | - |
| ZY05719_RS03180 (ZY05719_03170) | - | 632945..634390 (-) | 1446 | WP_011922188.1 | UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L- lysine ligase | - |
| ZY05719_RS03185 (ZY05719_03175) | - | 634537..635286 (+) | 750 | WP_012775039.1 | lysophospholipid acyltransferase family protein | - |
| ZY05719_RS03190 (ZY05719_03180) | - | 635349..636011 (+) | 663 | WP_011922190.1 | helix-hairpin-helix domain-containing protein | - |
| ZY05719_RS03195 (ZY05719_03185) | comEC/celB | 635995..638232 (+) | 2238 | WP_012775040.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| ZY05719_RS10725 (ZY05719_03190) | - | 638249..638374 (+) | 126 | Protein_577 | IS982 family transposase | - |
| ZY05719_RS03205 (ZY05719_03195) | - | 638557..639213 (+) | 657 | WP_011922763.1 | CBS domain-containing protein | - |
| ZY05719_RS03210 (ZY05719_03200) | - | 639307..640134 (+) | 828 | WP_011922194.1 | hypothetical protein | - |
| ZY05719_RS03215 (ZY05719_03205) | - | 640140..641411 (+) | 1272 | WP_011922195.1 | toxic anion resistance protein | - |
| ZY05719_RS03220 (ZY05719_03210) | - | 641448..642314 (-) | 867 | WP_002935357.1 | YitT family protein | - |
| ZY05719_RS03225 (ZY05719_03215) | tmk | 642489..643127 (+) | 639 | WP_012775041.1 | dTMP kinase | - |
Sequence
Protein
Download Length: 745 a.a. Molecular weight: 83965.92 Da Isoelectric Point: 7.8195
>NTDB_id=119962 ZY05719_RS03195 WP_012775040.1 635995..638232(+) (comEC/celB) [Streptococcus suis strain ZY05719 isolate diseased pig]
MSRLSKLPCQPVHLAVLAVAAYFAVHSFSLLTMCLLSLVLAVFGVRQGKTVFLKTLPLLTLCGLFFGCQKVQWERAAQSA
PEQVTTVQVIPDTIDINGDSLSFRGRADGQNYQVFYKLTSQEEQTYFQNLADLVQLEVEAEVSLPAGQRNFKGFDYQAYL
KTQGIYRTVKLTAIKKIVPVQSWNVFDWLSTWRRQALVYVKTNFPAPMSHYMTGLLFGELDSDFDQMSDLYSSLGIIHLF
ALSGMQVGFFIDKFRWILLRLGLTKETVDKLQIPFSLVYAGLTGFSVSVVRSLVQKILGNLGLRKLDNFAVTVFICLLIL
PRFLLTAGGVLTFTYAFLLTVFDFEDLGQVKKAAVESLSISLGILPVLMTYFYAFQPLSILLTFVFSFVFDVLLLPGLSV
IFLLSPVVKMTWVNGFFVFMEKIIVWVADLGLRPWILGKPSELVLLLLLVSLFLLYDFHRRKKWLLGLSLVLALLFFITK
HPLENEVTVLDIGQGDSIFLRDMRGRTVLIDVGGRVDFAAKEAWQERSRQANAERTLIPYLHSRGVDRIDSLVLTHTDTD
HVGDVLEVAKQVQIGRIVVSPGSLTVPDFVATLKKINVPVHVVKVGDRLPIFDSYLEVLYPDGTGDGGNNDSIVLYGRLL
ETNFLFTGDLEQGELDLIETYPNLPVDVLKAGHHGSKGSSYPEFLDHIGAKIALVSAGENNRYKHPHQETLERFDSRNIQ
VYRTDQQGAIRFRGWKEWKIETVRR
MSRLSKLPCQPVHLAVLAVAAYFAVHSFSLLTMCLLSLVLAVFGVRQGKTVFLKTLPLLTLCGLFFGCQKVQWERAAQSA
PEQVTTVQVIPDTIDINGDSLSFRGRADGQNYQVFYKLTSQEEQTYFQNLADLVQLEVEAEVSLPAGQRNFKGFDYQAYL
KTQGIYRTVKLTAIKKIVPVQSWNVFDWLSTWRRQALVYVKTNFPAPMSHYMTGLLFGELDSDFDQMSDLYSSLGIIHLF
ALSGMQVGFFIDKFRWILLRLGLTKETVDKLQIPFSLVYAGLTGFSVSVVRSLVQKILGNLGLRKLDNFAVTVFICLLIL
PRFLLTAGGVLTFTYAFLLTVFDFEDLGQVKKAAVESLSISLGILPVLMTYFYAFQPLSILLTFVFSFVFDVLLLPGLSV
IFLLSPVVKMTWVNGFFVFMEKIIVWVADLGLRPWILGKPSELVLLLLLVSLFLLYDFHRRKKWLLGLSLVLALLFFITK
HPLENEVTVLDIGQGDSIFLRDMRGRTVLIDVGGRVDFAAKEAWQERSRQANAERTLIPYLHSRGVDRIDSLVLTHTDTD
HVGDVLEVAKQVQIGRIVVSPGSLTVPDFVATLKKINVPVHVVKVGDRLPIFDSYLEVLYPDGTGDGGNNDSIVLYGRLL
ETNFLFTGDLEQGELDLIETYPNLPVDVLKAGHHGSKGSSYPEFLDHIGAKIALVSAGENNRYKHPHQETLERFDSRNIQ
VYRTDQQGAIRFRGWKEWKIETVRR
Nucleotide
Download Length: 2238 bp
>NTDB_id=119962 ZY05719_RS03195 WP_012775040.1 635995..638232(+) (comEC/celB) [Streptococcus suis strain ZY05719 isolate diseased pig]
ATGTCACGGTTGAGTAAGCTCCCCTGCCAGCCCGTTCACCTGGCAGTCTTGGCGGTGGCAGCTTACTTTGCAGTCCACTC
TTTTTCCCTCTTGACAATGTGCCTGCTCAGCTTGGTGTTAGCTGTCTTTGGTGTTCGGCAAGGTAAGACAGTTTTCCTCA
AAACGCTGCCGCTTCTAACCCTGTGTGGTCTCTTTTTCGGATGCCAAAAGGTCCAATGGGAGCGGGCAGCTCAGTCAGCC
CCAGAGCAAGTGACGACTGTGCAGGTCATTCCAGATACCATAGACATCAATGGCGACAGCTTGTCTTTTCGTGGTCGGGC
TGATGGACAAAACTATCAGGTTTTCTACAAATTAACCAGTCAGGAGGAGCAGACCTATTTTCAAAATCTGGCAGACCTGG
TCCAGCTAGAAGTGGAAGCAGAGGTCAGCCTGCCAGCTGGTCAGCGGAATTTCAAGGGCTTTGACTATCAGGCCTATCTG
AAAACACAGGGCATCTATCGGACTGTCAAGCTAACGGCGATTAAGAAGATTGTCCCTGTCCAGTCTTGGAATGTTTTTGA
CTGGCTGTCAACTTGGCGGAGGCAGGCCTTGGTCTATGTCAAAACCAATTTTCCGGCTCCCATGAGTCACTACATGACAG
GGCTTTTGTTTGGGGAGCTGGATAGTGATTTTGACCAGATGAGTGACCTCTATTCCAGTCTAGGAATCATTCATCTCTTT
GCCCTGTCTGGTATGCAGGTGGGCTTTTTCATCGACAAGTTTCGTTGGATTTTGTTGCGTCTGGGCTTGACCAAGGAAAC
AGTGGATAAGTTACAAATTCCCTTTTCCCTTGTCTATGCTGGTTTGACAGGCTTTTCTGTGTCGGTGGTGCGGTCCTTGG
TCCAAAAAATTCTGGGTAATCTAGGCTTGCGGAAGTTGGATAATTTTGCGGTGACGGTCTTTATCTGTCTGTTGATACTG
CCCCGATTTTTACTGACAGCAGGAGGAGTGCTGACTTTTACTTATGCTTTCTTGCTGACGGTCTTTGATTTTGAGGATTT
GGGGCAGGTCAAAAAGGCTGCGGTGGAGAGCCTCAGTATTTCACTGGGAATTTTGCCGGTGCTTATGACCTATTTCTATG
CCTTTCAGCCTCTATCTATCCTCTTGACCTTTGTCTTTTCCTTTGTCTTTGATGTCCTACTCTTGCCAGGTTTGTCAGTC
ATTTTTCTCCTGTCGCCAGTTGTCAAGATGACTTGGGTCAACGGATTTTTCGTCTTTATGGAGAAAATCATTGTCTGGGT
GGCTGATTTGGGATTGCGGCCTTGGATACTGGGCAAGCCGTCTGAGCTGGTCCTCTTGCTTTTGCTGGTCAGCCTCTTCT
TGCTCTACGATTTCCACAGGAGAAAGAAATGGCTTCTAGGACTGAGCTTGGTCCTCGCTCTGCTATTTTTCATCACCAAA
CACCCGCTGGAAAATGAGGTGACGGTGCTGGACATCGGGCAGGGGGACAGCATCTTTTTGCGGGATATGCGGGGGCGGAC
GGTTCTGATTGATGTAGGCGGACGAGTCGATTTCGCGGCCAAGGAAGCGTGGCAGGAGCGGTCTCGGCAGGCAAATGCAG
AGCGGACTTTGATTCCCTATCTGCATAGTCGGGGTGTGGATAGGATTGATAGTTTAGTCCTGACCCACACCGACACAGAC
CATGTGGGGGATGTGCTGGAAGTGGCGAAGCAGGTCCAGATTGGTCGGATTGTCGTGTCGCCGGGAAGTCTGACGGTGCC
TGACTTTGTAGCGACCTTGAAAAAAATCAATGTTCCTGTCCATGTGGTAAAAGTGGGCGACCGCCTACCAATTTTTGACT
CCTATCTGGAGGTCCTCTATCCAGACGGGACAGGCGACGGTGGCAATAATGACTCCATTGTCCTTTACGGTCGCTTGTTG
GAAACTAACTTTCTCTTTACAGGAGACCTGGAGCAAGGGGAGTTGGACTTGATAGAAACCTATCCCAACCTGCCTGTCGA
TGTCCTCAAAGCCGGTCACCATGGCTCCAAAGGCTCTTCTTATCCTGAATTTTTAGACCATATCGGTGCCAAGATTGCCT
TGGTATCTGCTGGAGAAAACAACCGCTACAAGCATCCTCACCAAGAAACCTTGGAACGATTTGACAGCCGGAATATCCAA
GTCTATCGCACAGACCAGCAAGGAGCCATTCGCTTCCGAGGTTGGAAGGAGTGGAAGATTGAGACGGTGAGGAGGTAA
ATGTCACGGTTGAGTAAGCTCCCCTGCCAGCCCGTTCACCTGGCAGTCTTGGCGGTGGCAGCTTACTTTGCAGTCCACTC
TTTTTCCCTCTTGACAATGTGCCTGCTCAGCTTGGTGTTAGCTGTCTTTGGTGTTCGGCAAGGTAAGACAGTTTTCCTCA
AAACGCTGCCGCTTCTAACCCTGTGTGGTCTCTTTTTCGGATGCCAAAAGGTCCAATGGGAGCGGGCAGCTCAGTCAGCC
CCAGAGCAAGTGACGACTGTGCAGGTCATTCCAGATACCATAGACATCAATGGCGACAGCTTGTCTTTTCGTGGTCGGGC
TGATGGACAAAACTATCAGGTTTTCTACAAATTAACCAGTCAGGAGGAGCAGACCTATTTTCAAAATCTGGCAGACCTGG
TCCAGCTAGAAGTGGAAGCAGAGGTCAGCCTGCCAGCTGGTCAGCGGAATTTCAAGGGCTTTGACTATCAGGCCTATCTG
AAAACACAGGGCATCTATCGGACTGTCAAGCTAACGGCGATTAAGAAGATTGTCCCTGTCCAGTCTTGGAATGTTTTTGA
CTGGCTGTCAACTTGGCGGAGGCAGGCCTTGGTCTATGTCAAAACCAATTTTCCGGCTCCCATGAGTCACTACATGACAG
GGCTTTTGTTTGGGGAGCTGGATAGTGATTTTGACCAGATGAGTGACCTCTATTCCAGTCTAGGAATCATTCATCTCTTT
GCCCTGTCTGGTATGCAGGTGGGCTTTTTCATCGACAAGTTTCGTTGGATTTTGTTGCGTCTGGGCTTGACCAAGGAAAC
AGTGGATAAGTTACAAATTCCCTTTTCCCTTGTCTATGCTGGTTTGACAGGCTTTTCTGTGTCGGTGGTGCGGTCCTTGG
TCCAAAAAATTCTGGGTAATCTAGGCTTGCGGAAGTTGGATAATTTTGCGGTGACGGTCTTTATCTGTCTGTTGATACTG
CCCCGATTTTTACTGACAGCAGGAGGAGTGCTGACTTTTACTTATGCTTTCTTGCTGACGGTCTTTGATTTTGAGGATTT
GGGGCAGGTCAAAAAGGCTGCGGTGGAGAGCCTCAGTATTTCACTGGGAATTTTGCCGGTGCTTATGACCTATTTCTATG
CCTTTCAGCCTCTATCTATCCTCTTGACCTTTGTCTTTTCCTTTGTCTTTGATGTCCTACTCTTGCCAGGTTTGTCAGTC
ATTTTTCTCCTGTCGCCAGTTGTCAAGATGACTTGGGTCAACGGATTTTTCGTCTTTATGGAGAAAATCATTGTCTGGGT
GGCTGATTTGGGATTGCGGCCTTGGATACTGGGCAAGCCGTCTGAGCTGGTCCTCTTGCTTTTGCTGGTCAGCCTCTTCT
TGCTCTACGATTTCCACAGGAGAAAGAAATGGCTTCTAGGACTGAGCTTGGTCCTCGCTCTGCTATTTTTCATCACCAAA
CACCCGCTGGAAAATGAGGTGACGGTGCTGGACATCGGGCAGGGGGACAGCATCTTTTTGCGGGATATGCGGGGGCGGAC
GGTTCTGATTGATGTAGGCGGACGAGTCGATTTCGCGGCCAAGGAAGCGTGGCAGGAGCGGTCTCGGCAGGCAAATGCAG
AGCGGACTTTGATTCCCTATCTGCATAGTCGGGGTGTGGATAGGATTGATAGTTTAGTCCTGACCCACACCGACACAGAC
CATGTGGGGGATGTGCTGGAAGTGGCGAAGCAGGTCCAGATTGGTCGGATTGTCGTGTCGCCGGGAAGTCTGACGGTGCC
TGACTTTGTAGCGACCTTGAAAAAAATCAATGTTCCTGTCCATGTGGTAAAAGTGGGCGACCGCCTACCAATTTTTGACT
CCTATCTGGAGGTCCTCTATCCAGACGGGACAGGCGACGGTGGCAATAATGACTCCATTGTCCTTTACGGTCGCTTGTTG
GAAACTAACTTTCTCTTTACAGGAGACCTGGAGCAAGGGGAGTTGGACTTGATAGAAACCTATCCCAACCTGCCTGTCGA
TGTCCTCAAAGCCGGTCACCATGGCTCCAAAGGCTCTTCTTATCCTGAATTTTTAGACCATATCGGTGCCAAGATTGCCT
TGGTATCTGCTGGAGAAAACAACCGCTACAAGCATCCTCACCAAGAAACCTTGGAACGATTTGACAGCCGGAATATCCAA
GTCTATCGCACAGACCAGCAAGGAGCCATTCGCTTCCGAGGTTGGAAGGAGTGGAAGATTGAGACGGTGAGGAGGTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
53.774 |
99.597 |
0.536 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
52.782 |
98.926 |
0.522 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
51.973 |
98.658 |
0.513 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
51.701 |
98.658 |
0.51 |
| comEC/celB | Streptococcus pneumoniae D39 |
51.701 |
98.658 |
0.51 |
| comEC/celB | Streptococcus pneumoniae R6 |
51.701 |
98.658 |
0.51 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
49.399 |
100 |
0.497 |