Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | SSGZ1_RS03410 | Genome accession | NC_017617 |
| Coordinates | 686471..688708 (+) | Length | 745 a.a. |
| NCBI ID | WP_012775040.1 | Uniprot ID | - |
| Organism | Streptococcus suis GZ1 |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake |
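The coordinates, nucleotide length, and protein length listed in the overview are mutually consistent, which a short check can confirm. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length(686471, 688708)
print(cds_bp)                  # 2238
print(protein_length(cds_bp))  # 745
```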
Genomic Context
Location: 681471..693708
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| SSGZ1_RS03385 (SSGZ1_0639) | - | 681543..682472 (-) | 930 | WP_012775038.1 | ABC transporter substrate-binding protein | - |
| SSGZ1_RS03390 (SSGZ1_0640) | - | 682485..683270 (-) | 786 | WP_011922762.1 | ABC transporter ATP-binding protein | - |
| SSGZ1_RS03395 (SSGZ1_0642) | - | 683422..684866 (-) | 1445 | Protein_624 | UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase | - |
| SSGZ1_RS03400 (SSGZ1_0643) | - | 685013..685762 (+) | 750 | WP_012775039.1 | lysophospholipid acyltransferase family protein | - |
| SSGZ1_RS03405 (SSGZ1_0644) | - | 685825..686487 (+) | 663 | WP_011922190.1 | helix-hairpin-helix domain-containing protein | - |
| SSGZ1_RS03410 (SSGZ1_0645) | comEC/celB | 686471..688708 (+) | 2238 | WP_012775040.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| SSGZ1_RS10510 (SSGZ1_0646) | - | 688725..688850 (+) | 126 | Protein_628 | IS982 family transposase | - |
| SSGZ1_RS03420 (SSGZ1_0647) | - | 689033..689689 (+) | 657 | WP_011922763.1 | CBS domain-containing protein | - |
| SSGZ1_RS03425 (SSGZ1_0648) | - | 689783..690610 (+) | 828 | WP_011922194.1 | hypothetical protein | - |
| SSGZ1_RS03430 (SSGZ1_0649) | - | 690616..691887 (+) | 1272 | WP_011922195.1 | toxic anion resistance protein | - |
| SSGZ1_RS03435 (SSGZ1_0650) | - | 691924..692790 (-) | 867 | WP_002935357.1 | YitT family protein | - |
| SSGZ1_RS03440 (SSGZ1_0651) | tmk | 692965..693603 (+) | 639 | WP_012775041.1 | dTMP kinase | - |
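The Location range above appears to be the comEC/celB coordinates extended by 5,000 bp on each side, and the table shows comEC/celB starting before the upstream gene SSGZ1_RS03405 ends. The sketch below checks both observations. The 5,000 bp flank is an assumption inferred from the numbers rather than a documented setting, and the helper names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from the Location range, not documented


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple:
    """Extend a 1-based inclusive range by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple, b: tuple) -> int:
    """Bases shared by two 1-based inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


comec = (686471, 688708)     # SSGZ1_RS03410
upstream = (685825, 686487)  # SSGZ1_RS03405

print(context_window(*comec))       # (681471, 693708)
print(overlap_bp(upstream, comec))  # 17
```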
Sequence
Protein
Length: 745 a.a.; Molecular weight: 83965.92 Da; Isoelectric point: 7.8195
>NTDB_id=49580 SSGZ1_RS03410 WP_012775040.1 686471..688708(+) (comEC/celB) [Streptococcus suis GZ1]
MSRLSKLPCQPVHLAVLAVAAYFAVHSFSLLTMCLLSLVLAVFGVRQGKTVFLKTLPLLTLCGLFFGCQKVQWERAAQSA
PEQVTTVQVIPDTIDINGDSLSFRGRADGQNYQVFYKLTSQEEQTYFQNLADLVQLEVEAEVSLPAGQRNFKGFDYQAYL
KTQGIYRTVKLTAIKKIVPVQSWNVFDWLSTWRRQALVYVKTNFPAPMSHYMTGLLFGELDSDFDQMSDLYSSLGIIHLF
ALSGMQVGFFIDKFRWILLRLGLTKETVDKLQIPFSLVYAGLTGFSVSVVRSLVQKILGNLGLRKLDNFAVTVFICLLIL
PRFLLTAGGVLTFTYAFLLTVFDFEDLGQVKKAAVESLSISLGILPVLMTYFYAFQPLSILLTFVFSFVFDVLLLPGLSV
IFLLSPVVKMTWVNGFFVFMEKIIVWVADLGLRPWILGKPSELVLLLLLVSLFLLYDFHRRKKWLLGLSLVLALLFFITK
HPLENEVTVLDIGQGDSIFLRDMRGRTVLIDVGGRVDFAAKEAWQERSRQANAERTLIPYLHSRGVDRIDSLVLTHTDTD
HVGDVLEVAKQVQIGRIVVSPGSLTVPDFVATLKKINVPVHVVKVGDRLPIFDSYLEVLYPDGTGDGGNNDSIVLYGRLL
ETNFLFTGDLEQGELDLIETYPNLPVDVLKAGHHGSKGSSYPEFLDHIGAKIALVSAGENNRYKHPHQETLERFDSRNIQ
VYRTDQQGAIRFRGWKEWKIETVRR
Nucleotide
Length: 2238 bp
>NTDB_id=49580 SSGZ1_RS03410 WP_012775040.1 686471..688708(+) (comEC/celB) [Streptococcus suis GZ1]
ATGTCACGGTTGAGTAAGCTCCCCTGCCAGCCCGTTCACCTGGCAGTCTTGGCGGTGGCAGCTTACTTTGCAGTCCACTC
TTTTTCCCTCTTGACAATGTGCCTGCTCAGCTTGGTGTTAGCTGTCTTTGGTGTTCGGCAAGGTAAGACAGTTTTCCTCA
AAACGCTGCCGCTTCTAACCCTGTGTGGTCTCTTTTTCGGATGCCAAAAGGTCCAATGGGAGCGGGCAGCTCAGTCAGCC
CCAGAGCAAGTGACGACTGTGCAGGTCATTCCAGATACCATAGACATCAATGGCGACAGCTTGTCTTTTCGTGGTCGGGC
TGATGGACAAAACTATCAGGTTTTCTACAAATTAACCAGTCAGGAGGAGCAGACCTATTTTCAAAATCTGGCAGACCTGG
TCCAGCTAGAAGTGGAAGCAGAGGTCAGCCTGCCAGCTGGTCAGCGGAATTTCAAGGGCTTTGACTATCAGGCCTATCTG
AAAACACAGGGCATCTATCGGACTGTCAAGCTAACGGCGATTAAGAAGATTGTCCCTGTCCAGTCTTGGAATGTTTTTGA
CTGGCTGTCAACTTGGCGGAGGCAGGCCTTGGTCTATGTCAAAACCAATTTTCCGGCTCCCATGAGTCACTACATGACAG
GGCTTTTGTTTGGGGAGCTGGATAGTGATTTTGACCAGATGAGTGACCTCTATTCCAGTCTAGGAATCATTCATCTCTTT
GCCCTGTCTGGTATGCAGGTGGGCTTTTTCATCGACAAGTTTCGTTGGATTTTGTTGCGTCTGGGCTTGACCAAGGAAAC
AGTGGATAAGTTACAAATTCCCTTTTCCCTTGTCTATGCTGGTTTGACAGGCTTTTCTGTGTCGGTGGTGCGGTCCTTGG
TCCAAAAAATTCTGGGTAATCTAGGCTTGCGGAAGTTGGATAATTTTGCGGTGACGGTCTTTATCTGTCTGTTGATACTG
CCCCGATTTTTACTGACAGCAGGAGGAGTGCTGACTTTTACTTATGCTTTCTTGCTGACGGTCTTTGATTTTGAGGATTT
GGGGCAGGTCAAAAAGGCTGCGGTGGAGAGCCTCAGTATTTCACTGGGAATTTTGCCGGTGCTTATGACCTATTTCTATG
CCTTTCAGCCTCTATCTATCCTCTTGACCTTTGTCTTTTCCTTTGTCTTTGATGTCCTACTCTTGCCAGGTTTGTCAGTC
ATTTTTCTCCTGTCGCCAGTTGTCAAGATGACTTGGGTCAACGGATTTTTCGTCTTTATGGAGAAAATCATTGTCTGGGT
GGCTGATTTGGGATTGCGGCCTTGGATACTGGGCAAGCCGTCTGAGCTGGTCCTCTTGCTTTTGCTGGTCAGCCTCTTCT
TGCTCTACGATTTCCACAGGAGAAAGAAATGGCTTCTAGGACTGAGCTTGGTCCTCGCTCTGCTATTTTTCATCACCAAA
CACCCGCTGGAAAATGAGGTGACGGTGCTGGACATCGGGCAGGGGGACAGCATCTTTTTGCGGGATATGCGGGGGCGGAC
GGTTCTGATTGATGTAGGCGGACGAGTCGATTTCGCGGCCAAGGAAGCGTGGCAGGAGCGGTCTCGGCAGGCAAATGCAG
AGCGGACTTTGATTCCCTATCTGCATAGTCGGGGTGTGGATAGGATTGATAGTTTAGTCCTGACCCACACCGACACAGAC
CATGTGGGGGATGTGCTGGAAGTGGCGAAGCAGGTCCAGATTGGTCGGATTGTCGTGTCGCCGGGAAGTCTGACGGTGCC
TGACTTTGTAGCGACCTTGAAAAAAATCAATGTTCCTGTCCATGTGGTAAAAGTGGGCGACCGCCTACCAATTTTTGACT
CCTATCTGGAGGTCCTCTATCCAGACGGGACAGGCGACGGTGGCAATAATGACTCCATTGTCCTTTACGGTCGCTTGTTG
GAAACTAACTTTCTCTTTACAGGAGACCTGGAGCAAGGGGAGTTGGACTTGATAGAAACCTATCCCAACCTGCCTGTCGA
TGTCCTCAAAGCCGGTCACCATGGCTCCAAAGGCTCTTCTTATCCTGAATTTTTAGACCATATCGGTGCCAAGATTGCCT
TGGTATCTGCTGGAGAAAACAACCGCTACAAGCATCCTCACCAAGAAACCTTGGAACGATTTGACAGCCGGAATATCCAA
GTCTATCGCACAGACCAGCAAGGAGCCATTCGCTTCCGAGGTTGGAAGGAGTGGAAGATTGAGACGGTGAGGAGGTAA
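The protein and nucleotide FASTA headers in this entry share one layout: database ID, locus tag, protein ID, coordinates with strand, gene name in parentheses, and organism in square brackets. The sketch below parses that layout with a regular expression. The pattern is derived only from the header shown here, so treating it as the general format for other entries is an assumption, and `parse_header` is a hypothetical helper.

```python
import re

# Pattern inferred from the single header in this entry (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a FASTA header of the layout above into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the expected layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=49580 SSGZ1_RS03410 WP_012775040.1 "
          "686471..688708(+) (comEC/celB) [Streptococcus suis GZ1]")
record = parse_header(header)
print(record["gene"], record["end"] - record["start"] + 1)  # comEC/celB 2238
```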
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 | 53.774 | 99.597 | 0.536 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 52.782 | 98.926 | 0.522 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 51.973 | 98.658 | 0.513 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 51.701 | 98.658 | 0.51 |
| comEC/celB | Streptococcus pneumoniae D39 | 51.701 | 98.658 | 0.51 |
| comEC/celB | Streptococcus pneumoniae R6 | 51.701 | 98.658 | 0.51 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 49.399 | 100 | 0.497 |