Detailed information
Overview
| Name | comA | Type | Regulator |
| Locus tag | I6G43_RS01040 | Genome accession | NZ_CP065706 |
| Coordinates | 216375..218528 (+) | Length | 717 a.a. |
| NCBI ID | WP_038806542.1 | Uniprot ID | - |
| Organism | Streptococcus oralis strain FDAARGOS_886 | ||
| Function | processing and transport of ComC (predicted from homology) Competence regulation |
||
Genomic Context
Location: 211375..223528
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| I6G43_RS01000 (I6G43_01000) | recO | 211633..212403 (+) | 771 | WP_038806551.1 | DNA repair protein RecO | - |
| I6G43_RS01005 (I6G43_01005) | plsX | 212400..213392 (+) | 993 | WP_038806550.1 | phosphate acyltransferase PlsX | - |
| I6G43_RS01010 (I6G43_01010) | - | 213397..213633 (+) | 237 | WP_038806549.1 | acyl carrier protein | - |
| I6G43_RS01015 (I6G43_01015) | - | 214080..214223 (+) | 144 | WP_038806548.1 | class IIb bacteriocin, lactobin A/cerein 7B family | - |
| I6G43_RS01020 (I6G43_01020) | - | 214239..214397 (+) | 159 | WP_038806547.1 | class IIb bacteriocin, lactobin A/cerein 7B family | - |
| I6G43_RS01025 (I6G43_01025) | - | 214780..214935 (+) | 156 | WP_000732133.1 | hypothetical protein | - |
| I6G43_RS09285 | - | 214974..215186 (+) | 213 | WP_038806546.1 | hypothetical protein | - |
| I6G43_RS01030 (I6G43_01030) | - | 215267..215455 (+) | 189 | WP_038806545.1 | hypothetical protein | - |
| I6G43_RS01035 (I6G43_01035) | - | 215480..215686 (+) | 207 | WP_038806544.1 | hypothetical protein | - |
| I6G43_RS09290 | - | 216014..216208 (+) | 195 | WP_038806543.1 | hypothetical protein | - |
| I6G43_RS01040 (I6G43_01040) | comA | 216375..218528 (+) | 2154 | WP_038806542.1 | peptide cleavage/export ABC transporter ComA | Regulator |
| I6G43_RS01045 (I6G43_01045) | comB | 218541..219890 (+) | 1350 | WP_038806541.1 | competence pheromone export protein ComB | Regulator |
| I6G43_RS01050 (I6G43_01050) | purC | 220058..220765 (+) | 708 | WP_038806540.1 | phosphoribosylaminoimidazolesuccinocarboxamide synthase | - |
Sequence
Protein
Download Length: 717 a.a. Molecular weight: 80522.83 Da Isoelectric Point: 6.9424
>NTDB_id=513188 I6G43_RS01040 WP_038806542.1 216375..218528(+) (comA) [Streptococcus oralis strain FDAARGOS_886]
MKFGKRHYRPQVDQMDCGVASLAMVFGYYGSYYSLAHLRELAKTTMDGTTALGLVKVAEELGFETRAIKADMTLFDLPDL
TFPFVAHVLKEGKLLHYYVVTGQDKKTIHIADPDPGVKLTKISRERFAQEWTGVSLFMAPSPDYKPHKEKKQGLLSFLPI
LFKQRGLITNIVLATLLVTLINIVGSYYLQSIIDTYVPDQMRSTLGIISIGLVIVYILQQILSYAQEYLLLVLGQRLSID
VILSYIKHVFHLPMSFFATRRTGEIVSRFTDANSIIDALASTILSIFLDVSTILIISLVLFSQNMMLFFISLLALPIYTV
IIFVFMKPFEKMNRDTMEANAVLSSSIIEDINGIETIKSLTSESSRYQKIDKEFVAYLKKSFTYSRAESQQKALKKVAQL
LLNVAVLWMGAILVMDGKMSLGQLITYNTLLVYFTNPLENIINLQTKLQTAQVANNRLNEVYLVASEFEEKKTVEDLRMM
KGDMTFNQVHYKYGYGRDVLSDINLTIPQGSKVAFVGISGSGKTTLAKMMVNFYDPSQGEISLGGVNLNQIDKKALRQYI
NYLPQQPYVFNGTILENLLLGAKEGTTQEDILRAVELAEIREDIERMPLNYQTELTSDGAGISGGQRQRIALARALLTDA
PVLILDEATSSLDILTEKRIVDNLMALDKTLIFIAHRLTIAERTEKVLVLDQGKIVEEGNHADLLARDGFYAHLVNS
MKFGKRHYRPQVDQMDCGVASLAMVFGYYGSYYSLAHLRELAKTTMDGTTALGLVKVAEELGFETRAIKADMTLFDLPDL
TFPFVAHVLKEGKLLHYYVVTGQDKKTIHIADPDPGVKLTKISRERFAQEWTGVSLFMAPSPDYKPHKEKKQGLLSFLPI
LFKQRGLITNIVLATLLVTLINIVGSYYLQSIIDTYVPDQMRSTLGIISIGLVIVYILQQILSYAQEYLLLVLGQRLSID
VILSYIKHVFHLPMSFFATRRTGEIVSRFTDANSIIDALASTILSIFLDVSTILIISLVLFSQNMMLFFISLLALPIYTV
IIFVFMKPFEKMNRDTMEANAVLSSSIIEDINGIETIKSLTSESSRYQKIDKEFVAYLKKSFTYSRAESQQKALKKVAQL
LLNVAVLWMGAILVMDGKMSLGQLITYNTLLVYFTNPLENIINLQTKLQTAQVANNRLNEVYLVASEFEEKKTVEDLRMM
KGDMTFNQVHYKYGYGRDVLSDINLTIPQGSKVAFVGISGSGKTTLAKMMVNFYDPSQGEISLGGVNLNQIDKKALRQYI
NYLPQQPYVFNGTILENLLLGAKEGTTQEDILRAVELAEIREDIERMPLNYQTELTSDGAGISGGQRQRIALARALLTDA
PVLILDEATSSLDILTEKRIVDNLMALDKTLIFIAHRLTIAERTEKVLVLDQGKIVEEGNHADLLARDGFYAHLVNS
Nucleotide
Download Length: 2154 bp
>NTDB_id=513188 I6G43_RS01040 WP_038806542.1 216375..218528(+) (comA) [Streptococcus oralis strain FDAARGOS_886]
ATGAAATTTGGGAAAAGACACTATCGTCCCCAGGTGGATCAGATGGATTGTGGCGTGGCTTCCTTGGCTATGGTCTTTGG
CTATTATGGTAGTTATTACTCCTTGGCCCATCTACGAGAGTTGGCCAAGACGACCATGGATGGGACGACTGCTTTGGGGC
TTGTAAAGGTGGCAGAGGAGCTTGGCTTTGAGACGCGGGCTATCAAGGCGGATATGACGCTCTTTGATCTGCCTGATTTG
ACCTTTCCTTTTGTGGCCCATGTGCTCAAGGAAGGGAAATTGCTCCACTACTATGTGGTGACAGGTCAGGATAAGAAGAC
CATCCATATCGCTGATCCAGATCCTGGTGTCAAGCTAACCAAGATTTCCCGTGAGCGATTTGCGCAAGAGTGGACAGGGG
TCAGTCTCTTTATGGCGCCATCTCCAGACTATAAACCTCATAAGGAGAAAAAACAGGGGCTCCTATCCTTCTTGCCCATC
TTATTCAAACAGCGTGGCTTAATTACCAATATCGTACTAGCGACACTCTTGGTAACCCTGATTAACATTGTGGGTTCTTA
TTATCTGCAGTCTATCATTGATACCTATGTGCCAGATCAGATGCGTTCGACGCTGGGTATCATCTCTATTGGTTTGGTCA
TCGTCTATATTCTCCAGCAGATTTTGTCTTATGCTCAGGAGTATCTCTTACTTGTTTTGGGGCAACGCCTGTCAATTGAT
GTGATTTTGTCTTATATCAAGCATGTTTTTCACCTGCCAATGTCCTTTTTCGCGACACGCAGGACAGGAGAAATCGTATC
TCGTTTTACAGATGCCAATAGTATCATTGACGCGCTAGCGTCGACCATTCTGTCGATTTTCCTAGATGTGTCGACGATTT
TGATTATTTCGCTTGTCTTGTTTTCACAAAATATGATGCTCTTTTTCATTAGTCTGCTTGCCCTTCCCATCTATACAGTG
ATTATCTTTGTCTTTATGAAACCTTTTGAAAAGATGAATCGGGATACAATGGAAGCCAATGCGGTTCTGTCTTCTTCTAT
CATCGAGGATATCAACGGTATTGAGACCATTAAGTCTTTGACCAGTGAAAGTTCGCGCTATCAAAAGATTGACAAGGAAT
TTGTGGCTTATCTGAAAAAATCTTTTACCTATAGTCGGGCAGAAAGCCAGCAAAAGGCACTGAAAAAAGTTGCCCAGCTC
CTGCTCAATGTTGCTGTTCTCTGGATGGGAGCTATTCTCGTCATGGATGGGAAAATGAGTTTGGGCCAGCTGATTACCTA
TAACACCCTGCTCGTTTACTTTACCAATCCTTTGGAAAATATCATCAACCTGCAAACCAAGCTTCAGACAGCGCAGGTTG
CCAATAACCGTCTGAATGAGGTTTATCTAGTAGCTTCGGAGTTTGAGGAGAAGAAAACGGTCGAAGATTTGAGGATGATG
AAAGGAGATATGACTTTCAATCAGGTTCACTACAAGTATGGCTATGGTCGAGATGTTTTGTCGGATATCAATTTGACCAT
TCCGCAAGGTTCTAAAGTGGCTTTCGTGGGAATTTCAGGATCAGGCAAGACAACTTTGGCCAAGATGATGGTTAATTTTT
ACGACCCAAGTCAGGGGGAGATTAGTCTGGGTGGTGTCAATCTCAATCAGATTGACAAAAAGGCTTTGCGCCAGTACATT
AACTATCTGCCTCAACAGCCCTATGTCTTTAACGGAACGATTTTGGAGAATCTTCTCCTTGGAGCCAAGGAGGGGACGAC
TCAGGAAGATATCTTACGGGCGGTTGAGTTGGCAGAGATTCGGGAGGATATTGAGCGCATGCCACTGAATTATCAGACAG
AATTAACTTCGGATGGGGCTGGAATTTCAGGTGGACAACGTCAGCGAATCGCTCTGGCGCGAGCTCTTTTAACGGATGCG
CCTGTCTTGATATTGGATGAGGCGACCAGCAGTCTGGATATCTTGACAGAGAAGCGGATTGTGGATAATCTCATGGCTTT
AGATAAGACCTTGATTTTCATCGCCCACCGCTTGACCATTGCTGAGCGGACAGAGAAGGTTCTTGTCTTGGATCAGGGTA
AGATTGTCGAAGAAGGCAATCATGCTGATTTGCTGGCTCGAGATGGCTTTTACGCCCATTTGGTGAATAGCTAG
ATGAAATTTGGGAAAAGACACTATCGTCCCCAGGTGGATCAGATGGATTGTGGCGTGGCTTCCTTGGCTATGGTCTTTGG
CTATTATGGTAGTTATTACTCCTTGGCCCATCTACGAGAGTTGGCCAAGACGACCATGGATGGGACGACTGCTTTGGGGC
TTGTAAAGGTGGCAGAGGAGCTTGGCTTTGAGACGCGGGCTATCAAGGCGGATATGACGCTCTTTGATCTGCCTGATTTG
ACCTTTCCTTTTGTGGCCCATGTGCTCAAGGAAGGGAAATTGCTCCACTACTATGTGGTGACAGGTCAGGATAAGAAGAC
CATCCATATCGCTGATCCAGATCCTGGTGTCAAGCTAACCAAGATTTCCCGTGAGCGATTTGCGCAAGAGTGGACAGGGG
TCAGTCTCTTTATGGCGCCATCTCCAGACTATAAACCTCATAAGGAGAAAAAACAGGGGCTCCTATCCTTCTTGCCCATC
TTATTCAAACAGCGTGGCTTAATTACCAATATCGTACTAGCGACACTCTTGGTAACCCTGATTAACATTGTGGGTTCTTA
TTATCTGCAGTCTATCATTGATACCTATGTGCCAGATCAGATGCGTTCGACGCTGGGTATCATCTCTATTGGTTTGGTCA
TCGTCTATATTCTCCAGCAGATTTTGTCTTATGCTCAGGAGTATCTCTTACTTGTTTTGGGGCAACGCCTGTCAATTGAT
GTGATTTTGTCTTATATCAAGCATGTTTTTCACCTGCCAATGTCCTTTTTCGCGACACGCAGGACAGGAGAAATCGTATC
TCGTTTTACAGATGCCAATAGTATCATTGACGCGCTAGCGTCGACCATTCTGTCGATTTTCCTAGATGTGTCGACGATTT
TGATTATTTCGCTTGTCTTGTTTTCACAAAATATGATGCTCTTTTTCATTAGTCTGCTTGCCCTTCCCATCTATACAGTG
ATTATCTTTGTCTTTATGAAACCTTTTGAAAAGATGAATCGGGATACAATGGAAGCCAATGCGGTTCTGTCTTCTTCTAT
CATCGAGGATATCAACGGTATTGAGACCATTAAGTCTTTGACCAGTGAAAGTTCGCGCTATCAAAAGATTGACAAGGAAT
TTGTGGCTTATCTGAAAAAATCTTTTACCTATAGTCGGGCAGAAAGCCAGCAAAAGGCACTGAAAAAAGTTGCCCAGCTC
CTGCTCAATGTTGCTGTTCTCTGGATGGGAGCTATTCTCGTCATGGATGGGAAAATGAGTTTGGGCCAGCTGATTACCTA
TAACACCCTGCTCGTTTACTTTACCAATCCTTTGGAAAATATCATCAACCTGCAAACCAAGCTTCAGACAGCGCAGGTTG
CCAATAACCGTCTGAATGAGGTTTATCTAGTAGCTTCGGAGTTTGAGGAGAAGAAAACGGTCGAAGATTTGAGGATGATG
AAAGGAGATATGACTTTCAATCAGGTTCACTACAAGTATGGCTATGGTCGAGATGTTTTGTCGGATATCAATTTGACCAT
TCCGCAAGGTTCTAAAGTGGCTTTCGTGGGAATTTCAGGATCAGGCAAGACAACTTTGGCCAAGATGATGGTTAATTTTT
ACGACCCAAGTCAGGGGGAGATTAGTCTGGGTGGTGTCAATCTCAATCAGATTGACAAAAAGGCTTTGCGCCAGTACATT
AACTATCTGCCTCAACAGCCCTATGTCTTTAACGGAACGATTTTGGAGAATCTTCTCCTTGGAGCCAAGGAGGGGACGAC
TCAGGAAGATATCTTACGGGCGGTTGAGTTGGCAGAGATTCGGGAGGATATTGAGCGCATGCCACTGAATTATCAGACAG
AATTAACTTCGGATGGGGCTGGAATTTCAGGTGGACAACGTCAGCGAATCGCTCTGGCGCGAGCTCTTTTAACGGATGCG
CCTGTCTTGATATTGGATGAGGCGACCAGCAGTCTGGATATCTTGACAGAGAAGCGGATTGTGGATAATCTCATGGCTTT
AGATAAGACCTTGATTTTCATCGCCCACCGCTTGACCATTGCTGAGCGGACAGAGAAGGTTCTTGTCTTGGATCAGGGTA
AGATTGTCGAAGAAGGCAATCATGCTGATTTGCTGGCTCGAGATGGCTTTTACGCCCATTTGGTGAATAGCTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA | Streptococcus mitis NCTC 12261 |
95.258 |
100 |
0.953 |
| comA | Streptococcus pneumoniae Rx1 |
94.84 |
100 |
0.948 |
| comA | Streptococcus pneumoniae D39 |
94.84 |
100 |
0.948 |
| comA | Streptococcus pneumoniae R6 |
94.84 |
100 |
0.948 |
| comA | Streptococcus mitis SK321 |
94.561 |
100 |
0.946 |
| comA | Streptococcus pneumoniae TIGR4 |
94.421 |
100 |
0.944 |
| comA | Streptococcus gordonii str. Challis substr. CH1 |
80.893 |
100 |
0.809 |
| comA/nlmT | Streptococcus mutans UA159 |
64.714 |
100 |
0.647 |