Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Regulator
Locus tag   EQH41_RS00220 Genome accession   NZ_CP035239
Coordinates   40620..42773 (+) Length   717 a.a.
NCBI ID   WP_000668284.1    Uniprot ID   A0A4J1UYR4
Organism   Streptococcus pneumoniae strain TVO_TIGR4     
Function   processing and transport of ComC (predicted from homology)   
Competence regulation

Genomic Context


Location: 35620..47773
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQH41_RS00190 (EQH41_00210) - 35972..37141 (+) 1170 WP_000366370.1 pyridoxal phosphate-dependent aminotransferase -
  EQH41_RS00195 (EQH41_00215) recO 37138..37908 (+) 771 WP_000616164.1 DNA repair protein RecO -
  EQH41_RS00200 (EQH41_00220) plsX 37905..38897 (+) 993 WP_000717458.1 phosphate acyltransferase PlsX -
  EQH41_RS00205 (EQH41_00225) - 38903..39136 (+) 234 WP_000136447.1 acyl carrier protein -
  EQH41_RS00210 (EQH41_00230) - 39173..39472 (+) 300 Protein_35 transposase family protein -
  EQH41_RS00215 (EQH41_00235) blpU 39675..39905 (+) 231 WP_001093487.1 bacteriocin-like peptide BlpU -
  EQH41_RS10855 - 39908..40033 (+) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  EQH41_RS00220 (EQH41_00240) comA 40620..42773 (+) 2154 WP_000668284.1 peptide cleavage/export ABC transporter ComA Regulator
  EQH41_RS00225 (EQH41_00245) comB 42786..44135 (+) 1350 WP_000801611.1 competence pheromone export protein ComB Regulator
  EQH41_RS00230 (EQH41_00250) purC 44305..45012 (+) 708 WP_000043309.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  EQH41_RS10940 (EQH41_00255) - 45014..45157 (+) 144 WP_050167432.1 hypothetical protein -

Sequence


Protein


Download         Length: 717 a.a.        Molecular weight: 80404.38 Da        Isoelectric Point: 6.3368

>NTDB_id=336948 EQH41_RS00220 WP_000668284.1 40620..42773(+) (comA) [Streptococcus pneumoniae strain TVO_TIGR4]
MKFGKRHYRPQVDQMDCGVASLAMVFGYYGSYYFLAHLRELAKTTMDGTTALGLVKVAEEIGFETRAIKADMTLFDLPDL
TFPFVAHVLKEGKLLHYYVVTGQDKDSIHIADPDPGVKLTKLPRERFEEEWTGVTLFMAPSPDYKPHKEQKNGLLSFIPI
LVKQRGLIANIVLATLLVTVINIVGSYYLQSIIDTYVPDQMRSTLGIISIGLVIVYIFQQILSYAQEYLLLVLGQRLSID
VILSYIKHVFHLPMSFFATRRTGEIVSRFTDANSIIDALASTILSIFLDVSTVVIISLVLFSQNTNLFFMTLLALPIYTV
IIFAFMKPFEKMNRDTMEANAVLSSSIIEDINGIETIKSLTSESQRYQKIDKEFVDYLKKSFTYSRAESQQKALKKVAHL
LLNVGILWMGAVLVMDGKMSLGQLITYNTLLVYFTNPLENIINLQTKLQTAQVANNRLNEVYLVASEFEEKKTVEDLSLM
KGDMTFKQVHYKYGYGRDVLSDINLTVPQGSKVAFVGISGSGKTTLAKMMVNFYDPSQGEISLGSVNLNQIDKKALRQYI
NYLSQQPYVFNGTILENLLLGAKEGTTQEDILRAVELAEIREDIERMPLNYQTELTSDGAGISGGQRQRIALARALLTDA
PVLILDEATSSLDILTEKRIVDNLIALDKTLIFIAHRLTIAERTEKVVVLDQGKIVEEGKHADLLAQGGFYAHLVNS

Nucleotide


Download         Length: 2154 bp        

>NTDB_id=336948 EQH41_RS00220 WP_000668284.1 40620..42773(+) (comA) [Streptococcus pneumoniae strain TVO_TIGR4]
ATGAAATTTGGGAAACGTCATTATCGTCCGCAGGTGGATCAGATGGACTGCGGTGTAGCTTCATTAGCCATGGTTTTTGG
CTACTATGGTAGTTATTATTTTTTGGCTCACTTGCGAGAATTGGCTAAGACGACCATGGATGGGACGACGGCTTTGGGCT
TGGTCAAGGTGGCAGAGGAGATTGGTTTTGAGACGCGAGCCATTAAGGCAGATATGACGCTTTTTGACTTGCCGGATTTA
ACTTTTCCTTTTGTTGCCCATGTGCTTAAGGAAGGGAAATTGCTCCACTACTATGTGGTGACTGGGCAGGATAAGGATAG
CATTCATATTGCCGATCCAGATCCCGGGGTGAAGTTGACTAAACTGCCACGTGAGCGTTTTGAGGAAGAATGGACAGGAG
TGACTCTTTTTATGGCACCTAGTCCAGACTATAAGCCTCATAAGGAACAAAAAAATGGTCTGCTCTCTTTTATCCCTATA
TTAGTGAAGCAGCGTGGCTTGATTGCCAATATCGTTTTGGCAACACTCTTGGTAACCGTGATTAACATTGTGGGTTCTTA
TTATCTGCAGTCTATCATTGATACCTATGTGCCAGATCAGATGCGTTCGACACTAGGGATTATTTCTATTGGGCTAGTCA
TCGTCTACATCTTCCAGCAAATCTTGTCTTACGCTCAGGAGTATCTCTTGCTTGTTTTGGGGCAACGCTTGTCGATTGAC
GTGATTTTGTCCTATATCAAGCATGTTTTTCACCTCCCTATGTCCTTCTTTGCGACACGCAGGACAGGGGAGATCGTGTC
TCGTTTTACAGATGCTAACAGTATCATCGATGCGCTGGCTTCGACCATCCTTTCGATTTTCCTAGATGTGTCAACGGTTG
TCATTATTTCCCTTGTTCTATTTTCACAAAATACCAATCTCTTTTTCATGACTTTATTGGCGCTTCCTATCTACACAGTG
ATTATCTTTGCCTTTATGAAGCCGTTTGAAAAGATGAATCGGGATACCATGGAAGCCAATGCGGTTCTGTCTTCTTCTAT
CATTGAGGACATCAACGGTATTGAGACTATCAAGTCCTTGACCAGTGAAAGTCAGCGTTACCAAAAAATTGACAAGGAAT
TTGTGGATTATCTGAAGAAATCCTTTACCTATAGTCGAGCAGAGAGTCAGCAAAAGGCTCTGAAAAAGGTTGCCCATCTC
TTGCTTAATGTCGGCATTCTCTGGATGGGGGCTGTTCTGGTCATGGATGGCAAGATGAGTTTGGGGCAGTTGATTACCTA
TAATACCTTGCTGGTTTACTTTACTAATCCTTTGGAAAATATCATCAATCTGCAAACCAAGCTTCAGACAGCGCAGGTTG
CCAATAACCGTCTAAATGAAGTGTATCTAGTAGCTTCTGAGTTTGAGGAGAAGAAAACAGTTGAGGATTTGAGCTTGATG
AAGGGAGATATGACCTTCAAGCAGGTTCATTACAAGTATGGCTATGGTCGAGATGTCTTATCGGATATCAATTTAACCGT
TCCCCAAGGGTCTAAGGTGGCTTTTGTGGGGATTTCAGGGTCAGGTAAGACGACTTTGGCCAAGATGATGGTTAATTTTT
ACGACCCAAGTCAAGGGGAGATTAGTCTGGGTAGTGTCAATCTCAATCAGATTGATAAAAAAGCCCTGCGCCAGTACATC
AACTATCTGTCTCAACAGCCCTATGTCTTTAACGGAACGATTTTGGAGAATCTTCTTTTGGGAGCCAAGGAGGGGACGAC
ACAGGAAGATATCTTACGGGCGGTCGAATTGGCAGAGATTCGAGAGGATATCGAGCGCATGCCACTGAATTACCAGACAG
AATTGACTTCGGATGGGGCAGGGATTTCAGGTGGTCAACGTCAGAGAATCGCTTTGGCGCGTGCTCTCTTGACAGATGCG
CCGGTCTTGATTTTGGATGAGGCGACTAGCAGTTTGGATATTTTGACAGAGAAGCGGATTGTCGATAATCTCATTGCTTT
GGACAAGACCTTGATTTTCATTGCTCACCGCTTGACTATTGCTGAGCGGACAGAGAAGGTAGTTGTCTTGGATCAGGGCA
AGATTGTCGAAGAAGGAAAGCATGCTGATTTGCTTGCACAGGGTGGCTTTTACGCCCATTTGGTCAATAGCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A4J1UYR4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Streptococcus pneumoniae TIGR4

100

100

1

  comA Streptococcus pneumoniae Rx1

99.582

100

0.996

  comA Streptococcus pneumoniae D39

99.582

100

0.996

  comA Streptococcus pneumoniae R6

99.582

100

0.996

  comA Streptococcus mitis SK321

98.047

100

0.98

  comA Streptococcus mitis NCTC 12261

98.047

100

0.98

  comA Streptococcus gordonii str. Challis substr. CH1

80.195

100

0.802

  comA/nlmT Streptococcus mutans UA159

64.575

100

0.646


Multiple sequence alignment