Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | EGX72_RS07955 | Genome accession | NZ_CP033809 |
| Coordinates | 1501284..1503521 (-) | Length | 745 a.a. |
| NCBI ID | WP_000939903.1 | Uniprot ID | A0AAV3JP21 |
| Organism | Streptococcus sp. FDAARGOS_521 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1496284..1508521
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| EGX72_RS07930 (EGX72_07925) | holA | 1496348..1497385 (-) | 1038 | WP_000560292.1 | DNA polymerase III subunit delta | - |
| EGX72_RS07935 (EGX72_07930) | - | 1497439..1498167 (-) | 729 | WP_000468966.1 | methyltransferase domain-containing protein | - |
| EGX72_RS07940 (EGX72_07935) | - | 1498343..1499335 (+) | 993 | WP_000800998.1 | alpha/beta hydrolase fold domain-containing protein | - |
| EGX72_RS07945 (EGX72_07940) | - | 1499394..1500338 (-) | 945 | WP_000200830.1 | LacI family DNA-binding transcriptional regulator | - |
| EGX72_RS07950 (EGX72_07945) | - | 1500349..1501158 (-) | 810 | WP_000153219.1 | Cof-type HAD-IIB family hydrolase | - |
| EGX72_RS07955 (EGX72_07950) | comEC/celB | 1501284..1503521 (-) | 2238 | WP_000939903.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| EGX72_RS07960 (EGX72_07955) | - | 1503505..1504158 (-) | 654 | WP_000461744.1 | helix-hairpin-helix domain-containing protein | - |
| EGX72_RS07965 (EGX72_07960) | - | 1504258..1504998 (-) | 741 | WP_000500220.1 | 1-acyl-sn-glycerol-3-phosphate acyltransferase | - |
| EGX72_RS07970 (EGX72_07965) | - | 1505133..1505897 (+) | 765 | WP_000567425.1 | tRNA1(Val) (adenine(37)-N6)-methyltransferase | - |
| EGX72_RS07975 (EGX72_07970) | - | 1505890..1506156 (+) | 267 | WP_000598736.1 | GIY-YIG nuclease family protein | - |
| EGX72_RS07980 (EGX72_07975) | - | 1506341..1507927 (-) | 1587 | WP_000673090.1 | DEAD/DEAH box helicase | - |
Sequence
Protein
Download Length: 745 a.a. Molecular weight: 85440.37 Da Isoelectric Point: 9.8459
>NTDB_id=325882 EGX72_RS07955 WP_000939903.1 1501284..1503521(-) (comEC/celB) [Streptococcus sp. FDAARGOS_521]
MLQLTKYFPLKPIYLALLVFQIYLLVFSWTMLGCAFLLFSFIFLIYQYDRETIFKTIAIVIFFLFYFLWQNHNMNVQYQR
VPNHISQIKVRIDTISINGDVLSFQADASGNTYQAFYTLKNKSEKDYFQNLDNNIMIIADIKLEEAEERRHFNGFDYRQY
LKRHGIYRIAKVTKIKQIRLFQHRSFFALMSKWRRSAIVISQTFPNPMRHYMSGLLFGYLDKTFDDMSDLYSSLGIIHLF
ALSGMQVGFFLGIFRYICLRIGLRLDHVWLLQIPFSLIYAGLTGFSISVVRALIQSLLSHSGVKKDENFALCLLICLISL
PHSLLTTGGVLSFAYAFILTMTSFDHFSSIKKVAIESLTVSVGILPILTYYFSGFQPISIILTALLSFAFDIIFLPLLTV
IFVLSPIVKLSCINSLFEILEVLLKWTGQLFPRPLIFGKPSLFLLIVMIIILGLLYDYYHSKCFRYCSLLIIFTLFFITK
NPITNEVAILDVGQGDSILVRDWLGKTILIDTGGRVRFEQPEEWKQKVNQSNAKRTLIPYLKSRGISKIDDLVITHTDTD
HMGDMEVISKHFKVARLITSSGSLTNSQYVKHLSKIGVAVKSIEAGDKLAVMGSYLQVLYPWHKGDGKNNDSIVLYGHLL
GKGFLFTGDLEEEGEKQLLEAYPNLSVDILKAGHHGSKGSSSLSFLKKLSPSVVLVSAGKNNRYQHPHQETLQRFQKIKS
KIFRTDQSGTIRLTGWWKWHIQTVR
MLQLTKYFPLKPIYLALLVFQIYLLVFSWTMLGCAFLLFSFIFLIYQYDRETIFKTIAIVIFFLFYFLWQNHNMNVQYQR
VPNHISQIKVRIDTISINGDVLSFQADASGNTYQAFYTLKNKSEKDYFQNLDNNIMIIADIKLEEAEERRHFNGFDYRQY
LKRHGIYRIAKVTKIKQIRLFQHRSFFALMSKWRRSAIVISQTFPNPMRHYMSGLLFGYLDKTFDDMSDLYSSLGIIHLF
ALSGMQVGFFLGIFRYICLRIGLRLDHVWLLQIPFSLIYAGLTGFSISVVRALIQSLLSHSGVKKDENFALCLLICLISL
PHSLLTTGGVLSFAYAFILTMTSFDHFSSIKKVAIESLTVSVGILPILTYYFSGFQPISIILTALLSFAFDIIFLPLLTV
IFVLSPIVKLSCINSLFEILEVLLKWTGQLFPRPLIFGKPSLFLLIVMIIILGLLYDYYHSKCFRYCSLLIIFTLFFITK
NPITNEVAILDVGQGDSILVRDWLGKTILIDTGGRVRFEQPEEWKQKVNQSNAKRTLIPYLKSRGISKIDDLVITHTDTD
HMGDMEVISKHFKVARLITSSGSLTNSQYVKHLSKIGVAVKSIEAGDKLAVMGSYLQVLYPWHKGDGKNNDSIVLYGHLL
GKGFLFTGDLEEEGEKQLLEAYPNLSVDILKAGHHGSKGSSSLSFLKKLSPSVVLVSAGKNNRYQHPHQETLQRFQKIKS
KIFRTDQSGTIRLTGWWKWHIQTVR
Nucleotide
Download Length: 2238 bp
>NTDB_id=325882 EGX72_RS07955 WP_000939903.1 1501284..1503521(-) (comEC/celB) [Streptococcus sp. FDAARGOS_521]
ATGTTACAATTGACTAAGTATTTTCCTCTAAAACCTATTTATTTAGCATTGTTGGTCTTCCAAATTTACTTACTAGTGTT
TTCTTGGACAATGCTTGGTTGTGCCTTTCTTTTATTTTCTTTTATTTTTCTGATTTATCAATATGATCGTGAAACTATTT
TTAAAACAATAGCAATAGTAATTTTTTTCTTATTTTATTTTTTATGGCAAAATCACAATATGAATGTCCAATATCAAAGA
GTACCGAATCATATTAGCCAGATTAAAGTGCGTATTGATACTATTTCTATCAATGGTGATGTTTTATCATTCCAGGCAGA
TGCTTCAGGTAACACTTATCAAGCTTTTTACACATTAAAAAATAAAAGTGAGAAAGATTATTTTCAAAATCTTGATAATA
ATATAATGATAATTGCAGATATCAAACTTGAAGAAGCAGAGGAGAGAAGGCATTTTAATGGCTTTGATTATCGTCAGTAT
TTAAAAAGACATGGAATTTATCGTATCGCCAAAGTGACAAAGATAAAACAGATACGCTTATTTCAACATAGGTCTTTCTT
TGCTCTTATGTCTAAGTGGCGTAGAAGTGCAATTGTTATTAGTCAAACTTTTCCAAATCCTATGCGTCACTATATGTCAG
GGCTTTTGTTTGGATATCTAGATAAGACCTTTGATGACATGTCCGATTTATATAGTAGTCTAGGTATTATACATTTATTT
GCTTTGTCAGGTATGCAAGTAGGTTTTTTTCTCGGTATTTTTCGTTATATCTGTCTACGTATTGGCTTACGTCTAGACCA
TGTTTGGTTACTTCAAATACCATTCTCGCTAATTTATGCTGGTTTAACAGGCTTTAGTATCTCAGTCGTTAGGGCACTTA
TTCAATCTTTATTATCACATAGCGGTGTCAAGAAAGATGAGAACTTTGCTCTCTGCTTGTTAATTTGTCTTATCTCCCTC
CCCCACTCACTTTTGACTACGGGAGGAGTTCTTAGCTTTGCTTATGCTTTTATACTTACGATGACCTCCTTTGATCATTT
TTCGAGTATAAAAAAAGTAGCTATCGAATCTTTGACAGTCTCTGTAGGAATTCTTCCCATACTAACCTACTATTTTTCGG
GTTTTCAACCAATATCAATTATATTAACAGCACTTTTATCTTTTGCATTTGATATTATATTTTTGCCTTTATTAACTGTT
ATATTTGTCTTATCGCCTATCGTTAAATTAAGTTGTATTAATAGTTTGTTTGAAATCCTAGAAGTGTTATTAAAATGGAC
TGGGCAACTGTTTCCAAGGCCACTTATTTTTGGAAAGCCCAGCCTTTTTCTTTTAATAGTCATGATTATAATTTTGGGAT
TACTTTATGATTATTATCATTCTAAATGTTTTCGTTATTGCTCCCTTCTTATTATCTTTACCTTGTTTTTTATCACTAAG
AATCCAATTACTAACGAGGTTGCGATTTTAGATGTTGGACAGGGAGATAGTATTTTAGTGAGGGATTGGTTAGGAAAAAC
AATTTTAATTGATACTGGGGGAAGGGTGAGATTTGAACAGCCTGAAGAATGGAAACAAAAAGTAAATCAGTCTAATGCTA
AGAGAACGCTCATTCCTTACTTGAAAAGCAGAGGTATTAGCAAGATAGATGATTTAGTGATAACTCATACCGATACAGAT
CATATGGGGGATATGGAAGTTATCTCAAAGCATTTTAAAGTTGCACGTTTGATTACAAGTTCAGGTTCTTTAACGAATTC
GCAGTACGTTAAGCATTTATCAAAGATAGGTGTAGCGGTAAAATCTATAGAAGCCGGTGATAAACTTGCTGTCATGGGAA
GTTATTTACAAGTACTTTACCCATGGCACAAGGGTGATGGAAAAAATAATGATTCAATTGTTTTATATGGACATTTATTA
GGAAAAGGCTTCTTATTTACCGGTGATTTGGAGGAAGAGGGAGAAAAGCAGTTATTAGAAGCTTATCCTAATTTATCAGT
AGATATCCTTAAAGCAGGACATCATGGTTCTAAGGGCTCATCAAGTCTATCCTTTCTGAAAAAGTTGTCTCCTAGTGTGG
TTCTAGTTTCAGCTGGTAAAAATAATCGTTACCAGCATCCTCATCAAGAGACTTTACAAAGGTTCCAAAAGATTAAAAGC
AAGATTTTCCGAACGGATCAATCAGGTACAATTAGGCTAACAGGATGGTGGAAGTGGCATATTCAGACAGTTCGTTGA
ATGTTACAATTGACTAAGTATTTTCCTCTAAAACCTATTTATTTAGCATTGTTGGTCTTCCAAATTTACTTACTAGTGTT
TTCTTGGACAATGCTTGGTTGTGCCTTTCTTTTATTTTCTTTTATTTTTCTGATTTATCAATATGATCGTGAAACTATTT
TTAAAACAATAGCAATAGTAATTTTTTTCTTATTTTATTTTTTATGGCAAAATCACAATATGAATGTCCAATATCAAAGA
GTACCGAATCATATTAGCCAGATTAAAGTGCGTATTGATACTATTTCTATCAATGGTGATGTTTTATCATTCCAGGCAGA
TGCTTCAGGTAACACTTATCAAGCTTTTTACACATTAAAAAATAAAAGTGAGAAAGATTATTTTCAAAATCTTGATAATA
ATATAATGATAATTGCAGATATCAAACTTGAAGAAGCAGAGGAGAGAAGGCATTTTAATGGCTTTGATTATCGTCAGTAT
TTAAAAAGACATGGAATTTATCGTATCGCCAAAGTGACAAAGATAAAACAGATACGCTTATTTCAACATAGGTCTTTCTT
TGCTCTTATGTCTAAGTGGCGTAGAAGTGCAATTGTTATTAGTCAAACTTTTCCAAATCCTATGCGTCACTATATGTCAG
GGCTTTTGTTTGGATATCTAGATAAGACCTTTGATGACATGTCCGATTTATATAGTAGTCTAGGTATTATACATTTATTT
GCTTTGTCAGGTATGCAAGTAGGTTTTTTTCTCGGTATTTTTCGTTATATCTGTCTACGTATTGGCTTACGTCTAGACCA
TGTTTGGTTACTTCAAATACCATTCTCGCTAATTTATGCTGGTTTAACAGGCTTTAGTATCTCAGTCGTTAGGGCACTTA
TTCAATCTTTATTATCACATAGCGGTGTCAAGAAAGATGAGAACTTTGCTCTCTGCTTGTTAATTTGTCTTATCTCCCTC
CCCCACTCACTTTTGACTACGGGAGGAGTTCTTAGCTTTGCTTATGCTTTTATACTTACGATGACCTCCTTTGATCATTT
TTCGAGTATAAAAAAAGTAGCTATCGAATCTTTGACAGTCTCTGTAGGAATTCTTCCCATACTAACCTACTATTTTTCGG
GTTTTCAACCAATATCAATTATATTAACAGCACTTTTATCTTTTGCATTTGATATTATATTTTTGCCTTTATTAACTGTT
ATATTTGTCTTATCGCCTATCGTTAAATTAAGTTGTATTAATAGTTTGTTTGAAATCCTAGAAGTGTTATTAAAATGGAC
TGGGCAACTGTTTCCAAGGCCACTTATTTTTGGAAAGCCCAGCCTTTTTCTTTTAATAGTCATGATTATAATTTTGGGAT
TACTTTATGATTATTATCATTCTAAATGTTTTCGTTATTGCTCCCTTCTTATTATCTTTACCTTGTTTTTTATCACTAAG
AATCCAATTACTAACGAGGTTGCGATTTTAGATGTTGGACAGGGAGATAGTATTTTAGTGAGGGATTGGTTAGGAAAAAC
AATTTTAATTGATACTGGGGGAAGGGTGAGATTTGAACAGCCTGAAGAATGGAAACAAAAAGTAAATCAGTCTAATGCTA
AGAGAACGCTCATTCCTTACTTGAAAAGCAGAGGTATTAGCAAGATAGATGATTTAGTGATAACTCATACCGATACAGAT
CATATGGGGGATATGGAAGTTATCTCAAAGCATTTTAAAGTTGCACGTTTGATTACAAGTTCAGGTTCTTTAACGAATTC
GCAGTACGTTAAGCATTTATCAAAGATAGGTGTAGCGGTAAAATCTATAGAAGCCGGTGATAAACTTGCTGTCATGGGAA
GTTATTTACAAGTACTTTACCCATGGCACAAGGGTGATGGAAAAAATAATGATTCAATTGTTTTATATGGACATTTATTA
GGAAAAGGCTTCTTATTTACCGGTGATTTGGAGGAAGAGGGAGAAAAGCAGTTATTAGAAGCTTATCCTAATTTATCAGT
AGATATCCTTAAAGCAGGACATCATGGTTCTAAGGGCTCATCAAGTCTATCCTTTCTGAAAAAGTTGTCTCCTAGTGTGG
TTCTAGTTTCAGCTGGTAAAAATAATCGTTACCAGCATCCTCATCAAGAGACTTTACAAAGGTTCCAAAAGATTAAAAGC
AAGATTTTCCGAACGGATCAATCAGGTACAATTAGGCTAACAGGATGGTGGAAGTGGCATATTCAGACAGTTCGTTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis NCTC 12261 |
48.4 |
100 |
0.487 |
| comEC/celB | Streptococcus mitis SK321 |
48.529 |
100 |
0.487 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
47.059 |
100 |
0.472 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
46.586 |
100 |
0.467 |
| comEC/celB | Streptococcus pneumoniae D39 |
46.586 |
100 |
0.467 |
| comEC/celB | Streptococcus pneumoniae R6 |
46.586 |
100 |
0.467 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
43.338 |
99.732 |
0.432 |