Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | H1W98_RS07865 | Genome accession | NZ_LR822026 |
| Coordinates | 1529648..1531888 (-) | Length | 746 a.a. |
| NCBI ID | WP_179972172.1 | Uniprot ID | - |
| Organism | Streptococcus thermophilus isolate STH_CIRM_967 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
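The coordinates and the protein length listed above are consistent with each other, which a little arithmetic confirms. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(1529648, 1531888)
print(cds_bp)                  # 2241
print(protein_length(cds_bp))  # 746
```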
Genomic Context
Location: 1524648..1536888
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| H1W98_RS07845 (STHERMO_1778) | - | 1525512..1526372 (-) | 861 | WP_002951571.1 | methionyl aminopeptidase | - |
| H1W98_RS07850 (STHERMO_1779) | spxR | 1526416..1527696 (-) | 1281 | WP_179972170.1 | CBS-HotDog domain-containing transcription factor SpxR | - |
| H1W98_RS07855 | - | 1527689..1528234 (-) | 546 | Protein_1499 | GNAT family N-acetyltransferase | - |
| H1W98_RS07860 (STHERMO_1782) | - | 1528300..1529562 (-) | 1263 | WP_179972171.1 | UDP-N-acetylglucosamine 1-carboxyvinyltransferase | - |
| H1W98_RS07865 (STHERMO_1783) | comEC/celB | 1529648..1531888 (-) | 2241 | WP_179972172.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| H1W98_RS07870 (STHERMO_1784) | comEA | 1531878..1532573 (-) | 696 | WP_180482451.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| H1W98_RS07875 (STHERMO_1785) | - | 1532680..1533435 (-) | 756 | WP_180482450.1 | lysophospholipid acyltransferase family protein | - |
| H1W98_RS11245 | - | 1533565..1534474 (+) | 910 | Protein_1504 | polysaccharide deacetylase family protein | - |
| H1W98_RS07885 (STHERMO_1788) | - | 1534524..1535288 (+) | 765 | WP_180482449.1 | tRNA1(Val) (adenine(37)-N6)-methyltransferase | - |
| H1W98_RS07890 (STHERMO_1789) | - | 1535292..1535561 (+) | 270 | WP_180482448.1 | GIY-YIG nuclease family protein | - |
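The Location range shown for the genomic context matches the comEC/celB span extended by 5000 bp on each side, and the coordinates in the table show that comEC/celB and comEA share a few bases. The sketch below reproduces both observations; the 5000 bp flank is inferred from the numbers on this page rather than stated by the source, and the helper names are hypothetical.

```python
FLANK = 5000  # assumption: inferred from the coordinates, not documented


def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Extend a 1-based inclusive range by `flank` bp on each side."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Bases shared by two 1-based inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


comEC = (1529648, 1531888)
comEA = (1531878, 1532573)

print(context_window(*comEC))     # (1524648, 1536888)
print(overlap_bp(comEC, comEA))   # 11
```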
Sequence
Protein
Length: 746 a.a.; Molecular weight: 84981.00 Da; Isoelectric point: 9.9593
>NTDB_id=1131294 H1W98_RS07865 WP_179972172.1 1529648..1531888(-) (comEC/celB) [Streptococcus thermophilus isolate STH_CIRM_967]
MWLKKAPINLFSLALLIVALYFTIFVSNFYAIGTFVFLMICFLRHHWKNRAALKLVGLIGGFFLVYFLFLYNIAIMQDKQ
APAEIHQVTLVADTLSVNGERLSAIGKSNGQTYRVFYRLKSDKEQHFFKTTSQTLVLKGKIKLSSATGQRNFQGFDYQSY
LASQGIYRIAQIERLEHVVTPKSISPIAFFHQLRRKALVHIQTHFPNPMRHYMTGLLFGYLDKEFDEQSQLYTSLGIIHL
FALSGMQVGFFLGWFRYGLLRLGLPKNYILAILLPFSIFYGLMTGWTASVLRSLIQSLLAECGIKKLNNMGITLLLFFLV
LPHFLLTVGGVLSCSYAFLLCLFNFEEMPSFKKSIYMSLVLSLGTLPFLTYYYGTFQPLSLILTAIFSLVFDSFLLPVLT
VLFTLSGMVIFSQFNPLFEWMEFFLTWIQSWGVQPLILGKPSLFQFVSMIFVLVLLFDFWKKPQFRISLLMIFSLLMVWV
KHPLINEVTVVDVGQGDSIFLRSMKGETVLIDVGGKVTFVSKEKWQEGSQTSNAEKTLIPYLQERGVSQIDYLVLTHTDT
DHIGDLEEVAKRFKIKEICVSKGALTKPSFAKRIRFLKRPVRTLKAGDKLTMMGSNLQVLYPNKIGDGGNNDSLVLYGKL
LGTSFLFTGDLEKEGEEELMASYPNLKVRVLKAGHHGSKGSSSEAFLDQLKPSLALVSAGENNRYKHPNDETLERFKERH
IKVLRTDLDGAIRFKGWFQLSSETVR
Nucleotide
Length: 2241 bp
>NTDB_id=1131294 H1W98_RS07865 WP_179972172.1 1529648..1531888(-) (comEC/celB) [Streptococcus thermophilus isolate STH_CIRM_967]
ATGTGGCTTAAAAAAGCTCCAATCAATCTTTTTTCCCTAGCGCTTTTAATAGTTGCTCTTTATTTTACTATTTTTGTATC
TAATTTCTATGCTATTGGGACTTTTGTCTTTCTGATGATATGTTTTTTGAGGCATCATTGGAAGAATAGAGCAGCTCTAA
AGTTGGTGGGACTTATAGGTGGTTTCTTTTTGGTCTACTTTTTATTTTTATACAATATAGCTATCATGCAAGATAAACAA
GCTCCTGCTGAAATTCATCAGGTGACACTGGTTGCAGATACGCTCTCAGTTAATGGTGAGCGATTATCAGCAATCGGGAA
GTCAAATGGACAAACCTATCGGGTCTTTTACCGGCTAAAATCTGATAAGGAGCAGCATTTTTTTAAGACTACGAGTCAAA
CGCTTGTTTTAAAAGGAAAAATAAAGTTGTCCTCGGCAACTGGTCAACGTAATTTTCAAGGCTTTGATTATCAGTCTTAT
CTAGCTAGTCAGGGCATTTATAGGATTGCTCAGATTGAGCGTCTGGAACATGTCGTAACCCCAAAATCGATATCTCCAAT
AGCTTTTTTTCATCAATTGAGGAGGAAGGCTCTAGTTCATATTCAGACGCATTTTCCTAATCCGATGAGACACTACATGA
CAGGACTGCTCTTTGGGTATTTGGACAAGGAGTTTGATGAGCAGAGTCAACTCTACACTAGCTTGGGGATTATTCATCTT
TTTGCGCTATCGGGTATGCAGGTTGGATTTTTTCTTGGCTGGTTTCGCTATGGACTTCTCCGTTTGGGGCTTCCCAAAAA
TTATATCCTTGCTATCTTATTACCTTTTTCGATTTTCTATGGCTTAATGACTGGTTGGACGGCTTCGGTCTTACGTTCTT
TGATTCAAAGCCTCTTGGCTGAGTGTGGTATTAAAAAACTGAACAATATGGGCATAACGCTACTTCTATTTTTTCTAGTC
TTGCCTCATTTTCTTTTAACGGTAGGAGGTGTTTTAAGTTGTTCTTATGCCTTCTTGTTGTGTTTATTTAATTTTGAGGA
GATGCCGTCCTTTAAAAAGTCAATTTATATGAGTTTAGTATTGAGTCTTGGGACTTTGCCTTTTTTGACCTACTATTATG
GAACTTTTCAACCATTGAGTTTGATTCTGACGGCAATCTTCTCTCTAGTTTTTGATAGCTTTCTCTTACCTGTCTTAACA
GTACTTTTTACACTTTCAGGAATGGTAATTTTTTCTCAATTTAATCCACTTTTTGAATGGATGGAGTTCTTTTTGACTTG
GATACAATCTTGGGGAGTCCAGCCATTGATTTTAGGAAAACCTAGCTTGTTTCAGTTTGTCTCAATGATATTTGTATTGG
TTTTGCTCTTTGATTTTTGGAAAAAGCCTCAGTTTAGAATATCCCTCTTGATGATTTTTAGTCTTTTGATGGTCTGGGTC
AAACATCCTTTAATAAATGAAGTAACGGTGGTCGACGTTGGTCAAGGAGATAGTATTTTTTTAAGGAGTATGAAAGGTGA
GACGGTCCTAATAGATGTAGGTGGCAAGGTGACTTTCGTCTCTAAGGAAAAATGGCAAGAGGGGAGCCAGACGAGTAATG
CGGAGAAAACCTTAATTCCCTATTTACAGGAAAGGGGAGTGTCTCAAATTGACTATCTGGTTCTGACTCATACGGATACA
GATCATATTGGTGATTTGGAGGAAGTGGCTAAACGGTTTAAGATTAAGGAAATTTGTGTCAGTAAGGGGGCTTTGACTAA
GCCTAGTTTTGCCAAACGTATTCGATTTCTTAAACGTCCCGTTCGCACTTTAAAGGCTGGTGATAAGCTGACTATGATGG
GAAGTAACCTACAGGTTCTTTATCCTAATAAAATTGGTGATGGTGGTAACAATGATTCACTTGTTCTCTATGGAAAGTTA
TTGGGAACCAGTTTTTTGTTTACTGGTGATTTAGAAAAAGAAGGAGAGGAGGAATTAATGGCTAGCTATCCAAATTTAAA
AGTAAGGGTCCTAAAAGCAGGACATCACGGTTCTAAAGGCTCTTCATCAGAAGCTTTTTTGGATCAGCTAAAGCCATCAC
TTGCTCTTGTCTCAGCTGGTGAAAATAATCGTTATAAGCATCCAAATGATGAAACATTAGAGCGTTTCAAAGAACGTCAC
ATTAAGGTTTTACGCACAGACCTGGATGGTGCCATTCGGTTTAAGGGATGGTTCCAACTATCAAGTGAAACTGTCCGATA
A
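The nucleotide record is given in coding orientation (it starts with ATG and ends with TAA), so it can be translated directly and compared with the protein record. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments; the `translate` helper is illustrative and is not the database's own tooling. Only the first and last few codons of the sequence above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGTGGCTTAAAAAAGCTCCAATC"))  # MWLKKAPI (start of the protein)
print(translate("GAAACTGTCCGATAA"))           # ETVR (end of the protein)
```

Passing the full 2241 bp sequence to the same function should yield the 746-residue protein, since 2241 / 3 = 747 codons including the stop.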
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis NCTC 12261 | 48.193 | 100 | 0.483 |
| comEC/celB | Streptococcus mitis SK321 | 46.791 | 100 | 0.469 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 46.123 | 100 | 0.462 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 45.856 | 100 | 0.46 |
| comEC/celB | Streptococcus pneumoniae D39 | 45.856 | 100 | 0.46 |
| comEC/celB | Streptococcus pneumoniae R6 | 45.856 | 100 | 0.46 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 44.69 | 97.185 | 0.434 |