Detailed information
Overview
| Field | Value |
|---|---|
| Name | comEC/celB |
| Type | Machinery gene |
| Locus tag | AAF758_RS03470 |
| Genome accession | NZ_CP152119 |
| Coordinates | 704761..706998 (+) |
| Length | 745 a.a. |
| NCBI ID | WP_012775040.1 |
| UniProt ID | - |
| Organism | Streptococcus suis strain Ss2301 |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake |
Genomic Context
Location: 699761..711998
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| AAF758_RS03445 (AAF758_03445) | - | 699832..700761 (-) | 930 | WP_012775038.1 | ABC transporter substrate-binding protein | - |
| AAF758_RS03450 (AAF758_03450) | - | 700774..701559 (-) | 786 | WP_011922762.1 | ABC transporter ATP-binding protein | - |
| AAF758_RS03455 (AAF758_03455) | - | 701711..703156 (-) | 1446 | WP_011922188.1 | UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase | - |
| AAF758_RS03460 (AAF758_03460) | - | 703303..704052 (+) | 750 | WP_012775039.1 | 1-acyl-sn-glycerol-3-phosphate acyltransferase | - |
| AAF758_RS03465 (AAF758_03465) | - | 704115..704777 (+) | 663 | WP_011922190.1 | helix-hairpin-helix domain-containing protein | - |
| AAF758_RS03470 (AAF758_03470) | comEC/celB | 704761..706998 (+) | 2238 | WP_012775040.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| AAF758_RS03475 (AAF758_03475) | - | 707015..707140 (+) | 126 | Protein_637 | IS982 family transposase | - |
| AAF758_RS03480 (AAF758_03480) | - | 707323..707979 (+) | 657 | WP_011922763.1 | CBS domain-containing protein | - |
| AAF758_RS03485 (AAF758_03485) | - | 708073..708900 (+) | 828 | WP_011922194.1 | hypothetical protein | - |
| AAF758_RS03490 (AAF758_03490) | - | 708906..710177 (+) | 1272 | WP_011922195.1 | toxic anion resistance protein | - |
| AAF758_RS03495 (AAF758_03495) | - | 710214..711080 (-) | 867 | WP_002935357.1 | YitT family protein | - |
| AAF758_RS03500 (AAF758_03500) | tmk | 711255..711893 (+) | 639 | WP_012775041.1 | dTMP kinase | - |
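The coordinates in this record are mutually consistent, and that can be checked with a little arithmetic. The sketch below assumes the spans are 1-based and inclusive (the usual convention for GenBank-style locations; the record itself does not state it) and that the coding sequence ends with a single stop codon. The helper names `parse_span`, `span_length` and `overlap_length` are illustrative only and do not belong to any database API.

```python
def parse_span(text: str) -> tuple[int, int]:
    """Parse a 1-based, inclusive 'start..end' span into integers."""
    start, end = text.split("..")
    return int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive span."""
    return end - start + 1


def overlap_length(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two inclusive spans (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


gene = parse_span("704761..706998")      # comEC/celB, AAF758_RS03470
context = parse_span("699761..711998")   # "Location" line of Genomic Context
upstream = parse_span("704115..704777")  # AAF758_RS03465

gene_bp = span_length(*gene)
protein_aa = gene_bp // 3 - 1            # assumption: one terminal stop codon
left_flank = gene[0] - context[0]
right_flank = context[1] - gene[1]
overlap_bp = overlap_length(upstream, gene)

print(gene_bp, protein_aa, left_flank, right_flank, overlap_bp)
# prints: 2238 745 5000 5000 17
```

Under these assumptions the 2238 bp gene corresponds to the listed 745 a.a. product, the context window extends 5000 bp on each side of the gene, and the annotated span of comEC/celB overlaps the upstream gene AAF758_RS03465 by 17 bp.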
Sequence
Protein
Length: 745 a.a.; Molecular weight: 83965.92 Da; Isoelectric point: 7.8195
>NTDB_id=987353 AAF758_RS03470 WP_012775040.1 704761..706998(+) (comEC/celB) [Streptococcus suis strain Ss2301]
MSRLSKLPCQPVHLAVLAVAAYFAVHSFSLLTMCLLSLVLAVFGVRQGKTVFLKTLPLLTLCGLFFGCQKVQWERAAQSA
PEQVTTVQVIPDTIDINGDSLSFRGRADGQNYQVFYKLTSQEEQTYFQNLADLVQLEVEAEVSLPAGQRNFKGFDYQAYL
KTQGIYRTVKLTAIKKIVPVQSWNVFDWLSTWRRQALVYVKTNFPAPMSHYMTGLLFGELDSDFDQMSDLYSSLGIIHLF
ALSGMQVGFFIDKFRWILLRLGLTKETVDKLQIPFSLVYAGLTGFSVSVVRSLVQKILGNLGLRKLDNFAVTVFICLLIL
PRFLLTAGGVLTFTYAFLLTVFDFEDLGQVKKAAVESLSISLGILPVLMTYFYAFQPLSILLTFVFSFVFDVLLLPGLSV
IFLLSPVVKMTWVNGFFVFMEKIIVWVADLGLRPWILGKPSELVLLLLLVSLFLLYDFHRRKKWLLGLSLVLALLFFITK
HPLENEVTVLDIGQGDSIFLRDMRGRTVLIDVGGRVDFAAKEAWQERSRQANAERTLIPYLHSRGVDRIDSLVLTHTDTD
HVGDVLEVAKQVQIGRIVVSPGSLTVPDFVATLKKINVPVHVVKVGDRLPIFDSYLEVLYPDGTGDGGNNDSIVLYGRLL
ETNFLFTGDLEQGELDLIETYPNLPVDVLKAGHHGSKGSSYPEFLDHIGAKIALVSAGENNRYKHPHQETLERFDSRNIQ
VYRTDQQGAIRFRGWKEWKIETVRR
Nucleotide
Length: 2238 bp
>NTDB_id=987353 AAF758_RS03470 WP_012775040.1 704761..706998(+) (comEC/celB) [Streptococcus suis strain Ss2301]
ATGTCACGGTTGAGTAAGCTCCCCTGCCAGCCCGTTCACCTGGCAGTCTTGGCGGTGGCAGCTTACTTTGCAGTCCACTC
TTTTTCCCTCTTGACAATGTGCCTGCTCAGCTTGGTGTTAGCTGTCTTTGGTGTTCGGCAAGGTAAGACAGTTTTCCTCA
AAACGCTGCCGCTTCTAACCCTGTGTGGTCTCTTTTTCGGATGCCAAAAGGTCCAATGGGAGCGGGCAGCTCAGTCAGCC
CCAGAGCAAGTGACGACTGTGCAGGTCATTCCAGATACCATAGACATCAATGGCGACAGCTTGTCTTTTCGTGGTCGGGC
TGATGGACAAAACTATCAGGTTTTCTACAAATTAACCAGTCAGGAGGAGCAGACCTATTTTCAAAATCTGGCAGACCTGG
TCCAGCTAGAAGTGGAAGCAGAGGTCAGCCTGCCAGCTGGTCAGCGGAATTTCAAGGGCTTTGACTATCAGGCCTATCTG
AAAACACAGGGCATCTATCGGACTGTCAAGCTAACGGCGATTAAGAAGATTGTCCCTGTCCAGTCTTGGAATGTTTTTGA
CTGGCTGTCAACTTGGCGGAGGCAGGCCTTGGTCTATGTCAAAACCAATTTTCCGGCTCCCATGAGTCACTACATGACAG
GGCTTTTGTTTGGGGAGCTGGATAGTGATTTTGACCAGATGAGTGACCTCTATTCCAGTCTAGGAATCATTCATCTCTTT
GCCCTGTCTGGTATGCAGGTGGGCTTTTTCATCGACAAGTTTCGTTGGATTTTGTTGCGTCTGGGCTTGACCAAGGAAAC
AGTGGATAAGTTACAAATTCCCTTTTCCCTTGTCTATGCTGGTTTGACAGGCTTTTCTGTGTCGGTGGTGCGGTCCTTGG
TCCAAAAAATTCTGGGTAATCTAGGCTTGCGGAAGTTGGATAATTTTGCGGTGACGGTCTTTATCTGTCTGTTGATACTG
CCCCGATTTTTACTGACAGCAGGAGGAGTGCTGACTTTTACTTATGCTTTCTTGCTGACGGTCTTTGATTTTGAGGATTT
GGGGCAGGTCAAAAAGGCTGCGGTGGAGAGCCTCAGTATTTCACTGGGAATTTTGCCGGTGCTTATGACCTATTTCTATG
CCTTTCAGCCTCTATCTATCCTCTTGACCTTTGTCTTTTCCTTTGTCTTTGATGTCCTACTCTTGCCAGGTTTGTCAGTC
ATTTTTCTCCTGTCGCCAGTTGTCAAGATGACTTGGGTCAACGGATTTTTCGTCTTTATGGAGAAAATCATTGTCTGGGT
GGCTGATTTGGGATTGCGGCCTTGGATACTGGGCAAGCCGTCTGAGCTGGTCCTCTTGCTTTTGCTGGTCAGCCTCTTCT
TGCTTTACGATTTCCACAGGAGAAAGAAATGGCTTCTAGGACTGAGCTTGGTCCTCGCTCTGCTATTTTTCATCACCAAA
CACCCGCTGGAAAATGAGGTGACGGTGCTGGACATCGGGCAGGGGGACAGCATCTTTTTGCGGGATATGCGGGGGCGGAC
GGTTCTGATTGATGTAGGCGGACGAGTCGATTTCGCGGCCAAGGAAGCGTGGCAGGAGCGGTCTCGGCAGGCAAATGCAG
AGCGGACTTTGATTCCCTATCTGCATAGTCGGGGTGTGGATAGGATTGATAGTTTAGTCCTGACCCACACCGACACAGAC
CATGTGGGGGATGTGCTGGAAGTGGCGAAGCAGGTCCAGATTGGTCGGATTGTCGTGTCGCCGGGAAGTCTGACGGTGCC
TGACTTTGTAGCGACCTTGAAAAAAATCAATGTTCCTGTCCATGTGGTAAAAGTGGGCGACCGCCTACCAATTTTTGACT
CCTATCTGGAGGTCCTCTATCCAGACGGGACAGGCGACGGTGGCAATAATGACTCCATTGTCCTTTACGGTCGCTTGTTG
GAAACTAACTTTCTCTTTACAGGAGACCTGGAGCAAGGGGAGTTGGACTTGATAGAAACCTATCCCAACCTGCCTGTCGA
TGTCCTCAAAGCCGGTCACCATGGCTCCAAAGGCTCTTCTTATCCTGAATTTTTAGACCATATCGGTGCTAAGATTGCCT
TGGTATCTGCTGGAGAAAACAACCGCTACAAGCATCCTCACCAAGAAACCTTGGAACGATTTGACAGCCGGAATATCCAA
GTCTATCGCACAGACCAGCAAGGAGCCATTCGCTTCCGAGGTTGGAAGGAGTGGAAGATTGAGACGGTGAGGAGGTAA
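As a quick sanity check that the nucleotide record encodes the protein record, the sketch below translates the first and last lines of the nucleotide FASTA above. It assumes the standard codon assignments (those used for non-initiator codons by NCBI translation table 11, the bacterial table; the record does not name a table) and a reading frame starting at the first base of each line, which holds here because every full line is 80 nt and the last line begins at an offset divisible by 3. The function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code in NCBI ordering (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_line = (
    "ATGTCACGGTTGAGTAAGCTCCCCTGCCAGCCCGTTCACC"
    "TGGCAGTCTTGGCGGTGGCAGCTTACTTTGCAGTCCACTC"
)
last_line = (
    "GTCTATCGCACAGACCAGCAAGGAGCCATTCGCTTCCGAGG"
    "TTGGAAGGAGTGGAAGATTGAGACGGTGAGGAGGTAA"
)

print(translate(first_line))  # MSRLSKLPCQPVHLAVLAVAAYFAVH
print(translate(last_line))   # VYRTDQQGAIRFRGWKEWKIETVRR*
```

The first result matches the start of the protein sequence and the second matches its final line followed by the stop codon, which is consistent with the 745 a.a. length reported for the product.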
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 | 53.774 | 99.597 | 0.536 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 52.782 | 98.926 | 0.522 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 51.973 | 98.658 | 0.513 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 51.701 | 98.658 | 0.51 |
| comEC/celB | Streptococcus pneumoniae D39 | 51.701 | 98.658 | 0.51 |
| comEC/celB | Streptococcus pneumoniae R6 | 51.701 | 98.658 | 0.51 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 49.399 | 100 | 0.497 |