Detailed information    

In silico: bioinformatically predicted

Overview


Name   comA
Type   Regulator
Locus tag   EQH33_RS00225
Genome accession   NZ_CP035247
Coordinates   41808..43961 (+)
Length   717 a.a.
NCBI ID   WP_000668304.1
UniProt ID   -
Organism   Streptococcus pneumoniae strain TVO_1901940
Function   processing and transport of ComC (predicted from homology)
Competence regulation
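
The coordinates and the protein length in the overview are mutually consistent, which can be checked in a few lines of Python. This is a minimal sketch and not part of the database: the helper name `span_length` is hypothetical, and it assumes 1-based inclusive coordinates and that the reported protein length excludes the stop codon.

```python
import re


def span_length(coords: str) -> int:
    """Length in bp of a 'start..end (strand)' span.

    Assumes 1-based, inclusive coordinates.
    """
    match = re.match(r"\s*(\d+)\.\.(\d+)", coords)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {coords!r}")
    start, end = (int(group) for group in match.groups())
    return end - start + 1


gene_bp = span_length("41808..43961 (+)")
# The final codon is the stop codon, so it is not counted in the protein.
protein_aa = gene_bp // 3 - 1
print(gene_bp, protein_aa)
```

Under these assumptions the span is 2154 bp, which corresponds to 717 codons plus one stop codon.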

Genomic Context


Location: 36808..48961
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQH33_RS00195 (EQH33_00215) - 37160..38329 (+) 1170 WP_000366348.1 pyridoxal phosphate-dependent aminotransferase -
  EQH33_RS00200 (EQH33_00220) recO 38326..39096 (+) 771 WP_000616164.1 DNA repair protein RecO -
  EQH33_RS00205 (EQH33_00225) plsX 39093..40085 (+) 993 WP_000717463.1 phosphate acyltransferase PlsX -
  EQH33_RS00210 (EQH33_00230) - 40091..40324 (+) 234 WP_000659555.1 acyl carrier protein -
  EQH33_RS00215 (EQH33_00235) - 40361..40660 (+) 300 Protein_36 transposase family protein -
  EQH33_RS00220 blpU 40863..41093 (+) 231 Protein_37 bacteriocin-like peptide BlpU -
  EQH33_RS10385 - 41096..41221 (+) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  EQH33_RS00225 (EQH33_00245) comA 41808..43961 (+) 2154 WP_000668304.1 peptide cleavage/export ABC transporter ComA Regulator
  EQH33_RS00230 (EQH33_00250) comB 43974..45323 (+) 1350 WP_000801631.1 competence pheromone export protein ComB Regulator
  EQH33_RS00235 (EQH33_00255) purC 45493..46200 (+) 708 WP_000043304.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  EQH33_RS10465 (EQH33_00260) - 46202..46345 (+) 144 WP_050167432.1 hypothetical protein -
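
The neighbourhood table can be cross-checked programmatically. The sketch below recomputes each gene size from its coordinates and derives the distance between adjacent genes (a negative value means the two genes overlap). It also shows that the displayed location equals the comA span extended by 5000 bp on each side; that flank size is an assumption inferred from the numbers, not a documented setting, and `gap_bp` is a hypothetical helper name.

```python
# (locus tag, start, end, reported size in bp), copied from the table above
genes = [
    ("EQH33_RS00195", 37160, 38329, 1170),
    ("EQH33_RS00200", 38326, 39096, 771),
    ("EQH33_RS00205", 39093, 40085, 993),
    ("EQH33_RS00210", 40091, 40324, 234),
    ("EQH33_RS00215", 40361, 40660, 300),
    ("EQH33_RS00220", 40863, 41093, 231),
    ("EQH33_RS10385", 41096, 41221, 126),
    ("EQH33_RS00225", 41808, 43961, 2154),  # comA
    ("EQH33_RS00230", 43974, 45323, 1350),  # comB
    ("EQH33_RS00235", 45493, 46200, 708),
    ("EQH33_RS10465", 46202, 46345, 144),
]


def gap_bp(upstream, downstream):
    """Bases between two adjacent genes; negative means they overlap."""
    return downstream[1] - upstream[2] - 1


# Reported sizes should equal end - start + 1 (1-based inclusive coordinates).
size_mismatches = [g[0] for g in genes if g[2] - g[1] + 1 != g[3]]

# Distance between each pair of neighbouring genes, keyed by locus tags.
gaps = {(a[0], b[0]): gap_bp(a, b) for a, b in zip(genes, genes[1:])}

# Assumed flank: the displayed window matches comA extended by 5000 bp.
flank = 5000
window = (41808 - flank, 43961 + flank)

print(size_mismatches)
print(gaps[("EQH33_RS00225", "EQH33_RS00230")])
print(window)
```

With the values listed, every reported size matches its coordinates, and comA and comB are separated by 12 bp.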

Sequence


Protein


Length: 717 a.a.        Molecular weight: 80436.52 Da        Isoelectric point: 6.2593

>NTDB_id=337492 EQH33_RS00225 WP_000668304.1 41808..43961(+) (comA) [Streptococcus pneumoniae strain TVO_1901940]
MKFGKRHYRPQVDQMDCGVASLAMVFGYYGSYYFLAHLRELAKTTMDGTTALGLVKVAEEIGFETRAIKADMTLFDLPDL
TFPFVAHVLKEGKLLHYYVVTGQDKDSIHIADPDPGVKLTKLPRERFEEEWTGVTLFMAPSPDYKPYKEQKNGLLSFIPI
LVKQRGLIANIVLATLLVTVINIVGSYYLQSIIDTYVPDQMRSTLGIISIGLVIVYILQQILSYAQEYLLLVLGQRLSID
VILSYIKHVFHLPMSFFATRRTGEIVSRFTDANSIIDALASTILSIFLDVSTVVIISLVLFSQNTNLFFMTLLALPIYTV
IIFAFMKPFEKMNRDTMEANAVLSSSIIEDINGIETIKSLTSESQRYQKIDKEFVDYLKKSFTYSRAESQQKALKKVAHL
LLNVGILWMGAVLVMDGKMSLGQLITYNTLLVYFTNPLENIINLQTKLQTAQVANNRLNEVYLVASEFEEKKTVEDLSLM
KGEMTFKQVHYKYGYGRDVLSDINLTVPQGSKVAFVGISGSGKTTLAKMMVNFYDPSQGEISLGGVNLNQIDKKALRQYI
NYLPQQPYVFNGTILENLLLGAKEGTTQEDILRAVELVEIREDIERMPLNYQTELTSDGAGISGGQRQRIALARALLTDA
PVLILDEATSSLDILTEKRIVDNLMALDKTLIFIAHRLTIAERTEKVVVLDQGKIVEEGKHADLLAQGGFYAHLVNS
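
The FASTA header used for this record packs several fields into one line. The sketch below splits such a header into named fields with a regular expression. The pattern is an assumption based only on the header shown here, so other records may need a looser pattern; `parse_header` and the field names are hypothetical.

```python
import re

# Pattern inferred from the single header shown in this record.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a record header into its fields; raise if the layout differs."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unexpected header layout: {line!r}")
    fields = match.groupdict()
    # Convert the numeric fields so they can be used in arithmetic.
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">NTDB_id=337492 EQH33_RS00225 WP_000668304.1 41808..43961(+) "
    "(comA) [Streptococcus pneumoniae strain TVO_1901940]"
)
record = parse_header(header)
print(record["gene"], record["start"], record["end"], record["strand"])
```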

Nucleotide


Length: 2154 bp

>NTDB_id=337492 EQH33_RS00225 WP_000668304.1 41808..43961(+) (comA) [Streptococcus pneumoniae strain TVO_1901940]
ATGAAATTTGGGAAACGTCACTATCGTCCGCAAGTGGATCAGATGGACTGCGGTGTAGCTTCATTAGCCATGGTTTTTGG
CTACTATGGTAGTTATTATTTTTTGGCTCACTTGCGAGAATTGGCTAAGACGACCATGGATGGGACGACGGCTTTGGGCT
TGGTCAAGGTGGCAGAGGAGATTGGTTTTGAGACGCGAGCCATTAAGGCGGATATGACGCTTTTTGACTTGCCGGATTTG
ACTTTTCCTTTTGTTGCCCATGTGCTTAAGGAAGGGAAATTGCTCCACTACTATGTGGTGACTGGGCAGGATAAGGATAG
CATTCATATTGCCGATCCAGATCCCGGGGTGAAGTTGACTAAACTGCCACGTGAGCGTTTTGAGGAAGAATGGACAGGAG
TGACTCTTTTTATGGCACCTAGTCCAGACTATAAGCCTTATAAGGAACAAAAAAATGGTCTGCTCTCTTTTATCCCTATA
TTAGTGAAGCAGCGTGGCTTGATTGCCAATATCGTTTTGGCAACACTCTTGGTAACCGTGATTAACATTGTGGGTTCTTA
TTATCTGCAGTCTATCATTGATACCTATGTGCCAGATCAGATGCGTTCGACACTAGGGATTATTTCTATTGGGCTAGTCA
TCGTCTACATCCTCCAGCAAATCTTGTCTTACGCTCAGGAGTATCTCTTGCTTGTTTTGGGGCAACGCTTGTCGATTGAC
GTGATTTTGTCCTATATCAAGCATGTTTTTCACCTCCCTATGTCCTTCTTTGCGACACGCAGGACAGGGGAGATCGTGTC
TCGTTTTACAGATGCTAACAGTATCATCGATGCGCTGGCTTCGACCATCCTTTCGATTTTCCTAGATGTGTCAACGGTTG
TCATTATTTCCCTTGTTCTATTTTCACAAAATACCAATCTCTTTTTCATGACTTTATTGGCGCTTCCTATCTACACAGTG
ATTATCTTTGCCTTTATGAAGCCGTTTGAAAAGATGAATCGGGATACCATGGAAGCCAATGCGGTTCTGTCTTCTTCTAT
CATTGAGGACATCAACGGTATTGAGACTATCAAGTCCTTGACCAGTGAAAGTCAGCGTTACCAAAAAATTGACAAGGAAT
TTGTGGATTATCTGAAGAAATCCTTTACCTATAGTCGAGCAGAGAGTCAGCAAAAGGCTCTGAAAAAGGTTGCCCATCTC
TTGCTTAATGTCGGCATTCTCTGGATGGGGGCTGTTCTGGTCATGGATGGCAAGATGAGTTTGGGGCAGTTGATTACCTA
TAATACCTTGCTGGTTTACTTTACCAATCCTTTGGAAAATATCATCAATCTGCAAACCAAGCTTCAGACAGCGCAGGTTG
CCAATAACCGTCTAAATGAAGTGTATCTAGTAGCTTCTGAGTTTGAGGAAAAGAAAACAGTTGAGGATTTGAGCTTGATG
AAGGGAGAGATGACTTTCAAGCAGGTTCATTACAAGTATGGCTATGGTCGAGACGTCTTGTCGGATATCAATTTAACCGT
TCCCCAAGGGTCTAAGGTGGCTTTTGTGGGGATTTCAGGGTCAGGTAAGACGACTTTGGCCAAGATGATGGTTAATTTTT
ACGACCCAAGTCAAGGGGAGATTAGTCTGGGTGGTGTCAATCTCAATCAGATTGATAAAAAAGCCCTACGCCAGTACATC
AACTATCTGCCTCAACAGCCCTATGTCTTTAACGGAACGATTTTGGAGAATCTTCTTTTGGGAGCCAAGGAGGGGACGAC
ACAGGAAGATATCTTACGGGCGGTCGAATTGGTAGAGATTCGAGAGGATATCGAGCGCATGCCACTGAATTACCAGACAG
AATTGACTTCGGATGGGGCAGGGATTTCAGGTGGTCAACGTCAGAGAATCGCTTTGGCGCGTGCTCTCTTGACAGATGCG
CCGGTCTTGATTTTGGATGAGGCGACTAGCAGTTTGGATATTTTGACAGAGAAGCGGATTGTCGATAATCTCATGGCTTT
GGACAAGACCTTGATTTTCATTGCTCACCGCTTGACTATTGCTGAGCGGACAGAGAAGGTGGTTGTCTTGGATCAGGGCA
AGATTGTCGAAGAAGGAAAGCATGCTGATTTGCTTGCACAGGGTGGCTTTTACGCCCATTTGGTCAATAGCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comA   Streptococcus pneumoniae Rx1   99.442   100   0.994
  comA   Streptococcus pneumoniae D39   99.442   100   0.994
  comA   Streptococcus pneumoniae R6   99.442   100   0.994
  comA   Streptococcus pneumoniae TIGR4   99.024   100   0.99
  comA   Streptococcus mitis SK321   98.466   100   0.985
  comA   Streptococcus mitis NCTC 12261   98.187   100   0.982
  comA   Streptococcus gordonii str. Challis substr. CH1   80.474   100   0.805
  comA/nlmT   Streptococcus mutans UA159   64.435   100   0.644
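
The Ha-values listed above are numerically consistent with the product of identity and coverage, both taken as fractions and rounded to three decimals. The sketch below checks this for every row. The formula is an assumption inferred from the listed numbers, not the database's documented definition, and `ha_value` is a hypothetical helper name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100), 3 decimals."""
    return round(identity_pct * coverage_pct / 10_000, 3)


# (organism, identities %, coverage %, reported Ha-value), from the list above
rows = [
    ("Streptococcus pneumoniae Rx1", 99.442, 100, 0.994),
    ("Streptococcus pneumoniae D39", 99.442, 100, 0.994),
    ("Streptococcus pneumoniae R6", 99.442, 100, 0.994),
    ("Streptococcus pneumoniae TIGR4", 99.024, 100, 0.99),
    ("Streptococcus mitis SK321", 98.466, 100, 0.985),
    ("Streptococcus mitis NCTC 12261", 98.187, 100, 0.982),
    ("Streptococcus gordonii str. Challis substr. CH1", 80.474, 100, 0.805),
    ("Streptococcus mutans UA159", 64.435, 100, 0.644),
]

# Collect rows where the assumed formula disagrees with the reported value.
mismatches = [
    organism
    for organism, identity, coverage, reported in rows
    if abs(ha_value(identity, coverage) - reported) > 1e-9
]
print(mismatches)
```

An empty list means every reported value is reproduced by the assumed formula.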


Multiple sequence alignment