Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Regulator
Locus tag   EQH42_RS00220 Genome accession   NZ_CP035238
Coordinates   41902..44055 (+) Length   717 a.a.
NCBI ID   WP_000668290.1    Uniprot ID   A0A0B7KN15
Organism   Streptococcus pneumoniae strain TVO_D39     
Function   processing and transport of ComC (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
IScluster/Tn 39018..41500 41902..44055 flank 402


Gene organization within MGE regions


Location: 39018..44055
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQH42_RS00210 (EQH42_00220) blpU 39514..39744 (+) 231 WP_001093075.1 bacteriocin-like peptide BlpU -
  EQH42_RS10395 - 39747..39872 (+) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  EQH42_RS00220 (EQH42_00235) comA 41902..44055 (+) 2154 WP_000668290.1 peptide cleavage/export ABC transporter ComA Regulator

Sequence


Protein


Download         Length: 717 a.a.        Molecular weight: 80350.38 Da        Isoelectric Point: 6.3368

>NTDB_id=336871 EQH42_RS00220 WP_000668290.1 41902..44055(+) (comA) [Streptococcus pneumoniae strain TVO_D39]
MKFGKRHYRPQVDQMDCGVASLAMVFGYYGSYYFLAHLRELAKTTMDGTTALGLVKVAEEIGFETRAIKADMTLFDLPDL
TFPFVAHVLKEGKLLHYYVVTGQDKDSIHIADPDPGVKLTKLPRERFEEEWTGVTLFMAPSPDYKPHKEQKNGLLSFIPI
LVKQRGLIANIVLATLLVTVINIVGSYYLQSIIDTYVPDQMRSTLGIISIGLVIVYILQQILSYAQEYLLLVLGQRLSID
VILSYIKHVFHLPMSFFATRRTGEIVSRFTDANSIIDALASTILSIFLDVSTVVIISLVLFSQNTNLFFMTLLALPIYTV
IIFAFMKPFEKMNRDTMEANAVLSSSIIEDINGIETIKSLTSESQRYQKIDKEFVDYLKKSFTYSRAESQQKALKKVAHL
LLNVGILWMGAVLVMDGKMSLGQLITYNTLLVYFTNPLENIINLQTKLQTAQVANNRLNEVYLVASEFEEKKTVEDLSLM
KGDMTFKQVHYKYGYGRDVLSDINLTVPQGSKVAFVGISGSGKTTLAKMMVNFYDPSQGEISLGGVNLNQIDKKALRQYI
NYLPQQPYVFNGTILENLLLGAKEGTTQEDILRAVELAEIREDIERMPLNYQTELTSDGAGISGGQRQRIALARALLTDA
PVLILDEATSSLDILTEKRIVDNLIALDKTLIFIAHRLTIAERTEKVVVLDQGKIVEEGKHADLLAQGGFYAHLVNS

Nucleotide


Download         Length: 2154 bp        

>NTDB_id=336871 EQH42_RS00220 WP_000668290.1 41902..44055(+) (comA) [Streptococcus pneumoniae strain TVO_D39]
ATGAAATTTGGGAAACGTCACTATCGTCCGCAAGTGGATCAGATGGACTGCGGTGTAGCTTCATTAGCCATGGTTTTTGG
CTACTATGGTAGTTATTATTTTTTGGCTCACTTGCGAGAATTGGCTAAGACGACCATGGATGGGACGACGGCTTTGGGCT
TGGTTAAGGTGGCAGAGGAGATTGGTTTTGAGACGCGAGCCATTAAGGCGGATATGACGCTTTTTGACTTGCCGGATTTG
ACTTTTCCTTTTGTTGCCCATGTGCTTAAGGAAGGGAAATTGCTCCACTACTATGTGGTGACTGGGCAGGATAAGGATAG
CATTCATATTGCCGATCCAGATCCCGGGGTGAAGTTGACTAAACTGCCACGTGAGCGTTTTGAGGAAGAATGGACAGGAG
TGACTCTTTTTATGGCACCTAGTCCAGACTATAAGCCTCATAAGGAACAAAAAAATGGTCTGCTCTCTTTTATCCCTATA
TTAGTGAAGCAGCGTGGCTTGATTGCCAATATCGTTTTGGCAACACTCTTGGTAACCGTGATTAACATTGTGGGTTCTTA
TTATCTGCAGTCTATCATTGATACCTATGTGCCAGATCAGATGCGTTCGACACTAGGGATTATTTCTATTGGGCTAGTCA
TCGTCTACATCCTCCAGCAAATCTTGTCTTACGCTCAGGAGTATCTCTTGCTTGTTTTGGGGCAACGCTTGTCGATTGAC
GTGATTTTGTCCTATATCAAGCATGTTTTTCACCTCCCTATGTCCTTCTTTGCGACACGCAGGACAGGGGAGATCGTGTC
TCGTTTTACAGATGCTAACAGTATCATCGATGCGCTGGCTTCGACCATCCTTTCGATTTTCCTAGATGTGTCAACGGTTG
TCATTATTTCCCTTGTTCTATTTTCACAAAATACCAATCTCTTTTTCATGACTTTATTGGCGCTTCCTATCTACACAGTG
ATTATCTTTGCCTTTATGAAGCCGTTTGAAAAGATGAATCGGGATACCATGGAAGCCAATGCGGTTCTGTCTTCTTCTAT
CATTGAGGACATCAACGGTATTGAGACTATCAAGTCCTTGACCAGTGAAAGTCAGCGTTACCAAAAAATTGACAAGGAAT
TTGTGGATTATCTGAAGAAATCCTTTACCTATAGTCGAGCAGAGAGTCAGCAAAAGGCTCTGAAAAAGGTTGCCCATCTC
TTGCTTAATGTCGGCATTCTCTGGATGGGGGCTGTTCTGGTCATGGATGGCAAGATGAGTTTGGGGCAGTTGATTACCTA
TAATACCTTGCTGGTTTACTTTACCAATCCTTTGGAAAATATCATCAATCTGCAAACCAAGCTTCAGACAGCGCAGGTTG
CCAATAACCGTCTAAATGAAGTGTATCTAGTAGCTTCTGAGTTTGAGGAGAAGAAAACAGTTGAGGATTTGAGCTTGATG
AAGGGAGATATGACCTTCAAGCAGGTTCATTACAAGTATGGCTATGGTCGAGACGTCTTGTCGGATATCAATTTAACCGT
TCCCCAAGGGTCTAAGGTGGCTTTTGTGGGGATTTCAGGGTCAGGTAAGACGACTTTGGCCAAGATGATGGTTAATTTTT
ACGACCCAAGTCAAGGGGAGATTAGTCTGGGTGGTGTCAATCTCAATCAGATTGATAAAAAAGCCCTGCGCCAGTACATC
AACTATCTGCCTCAACAGCCCTATGTCTTTAACGGAACGATTTTGGAGAATCTTCTTTTGGGAGCCAAGGAGGGGACGAC
ACAGGAAGATATCTTACGGGCGGTCGAATTGGCAGAGATTCGAGAGGATATCGAGCGCATGCCACTGAATTATCAGACAG
AATTGACTTCGGATGGGGCAGGGATTTCAGGTGGTCAACGTCAGAGAATCGCTTTGGCGCGTGCTCTCTTGACAGATGCG
CCGGTCTTGATTTTGGATGAGGCGACTAGCAGTTTGGATATTTTGACAGAGAAGCGGATTGTCGATAATCTCATTGCTTT
GGACAAGACCTTGATTTTCATTGCTCACCGCTTGACTATTGCTGAGCGGACAGAGAAGGTGGTTGTCTTGGATCAGGGCA
AGATTGTCGAAGAAGGAAAGCATGCTGATTTGCTTGCACAGGGAGGCTTTTACGCCCATTTGGTCAATAGCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0B7KN15

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Streptococcus pneumoniae Rx1

100

100

1

  comA Streptococcus pneumoniae D39

100

100

1

  comA Streptococcus pneumoniae R6

100

100

1

  comA Streptococcus pneumoniae TIGR4

99.582

100

0.996

  comA Streptococcus mitis SK321

98.466

100

0.985

  comA Streptococcus mitis NCTC 12261

98.466

100

0.985

  comA Streptococcus gordonii str. Challis substr. CH1

80.614

100

0.806

  comA/nlmT Streptococcus mutans UA159

64.854

100

0.649


Multiple sequence alignment