Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Regulator
Locus tag   E0F39_RS00355 Genome accession   NZ_LR216060
Coordinates   53225..55378 (+) Length   717 a.a.
NCBI ID   WP_000668283.1    Uniprot ID   A0A0T8MW44
Organism   Streptococcus pneumoniae strain GPSC47 substr. ST315     
Function   processing and transport of ComC (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1729..66845 53225..55378 within 0


Gene organization within MGE regions


Location: 1729..66845
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E0F39_RS00025 (SAMEA3713867_00005) - 1771..3057 (+) 1287 WP_000205044.1 adenylosuccinate synthase -
  E0F39_RS00030 (SAMEA3713867_00006) - 3299..4447 (-) 1149 WP_000876732.1 tyrosine-type recombinase/integrase -
  E0F39_RS00040 (SAMEA3713867_00007) - 4633..5556 (-) 924 WP_000122591.1 exonuclease domain-containing protein -
  E0F39_RS00045 (SAMEA3713867_00008) - 5569..5952 (-) 384 WP_001865140.1 ImmA/IrrE family metallo-endopeptidase -
  E0F39_RS00050 (SAMEA3713867_00009) - 5965..6330 (-) 366 WP_000492031.1 helix-turn-helix domain-containing protein -
  E0F39_RS00055 (SAMEA3713867_00010) - 6636..7118 (-) 483 WP_001865139.1 hypothetical protein -
  E0F39_RS00060 (SAMEA3713867_00011) - 7173..7376 (+) 204 WP_001865138.1 helix-turn-helix transcriptional regulator -
  E0F39_RS00065 (SAMEA3713867_00013) - 7519..7716 (+) 198 WP_001865136.1 hypothetical protein -
  E0F39_RS00070 (SAMEA3713867_00015) - 7832..8491 (-) 660 WP_001865135.1 hypothetical protein -
  E0F39_RS00075 (SAMEA3713867_00016) - 8545..9261 (+) 717 WP_001865134.1 ORF6C domain-containing protein -
  E0F39_RS00080 (SAMEA3713867_00017) - 9274..9531 (+) 258 WP_000370959.1 hypothetical protein -
  E0F39_RS00085 (SAMEA3713867_00018) - 9617..9937 (+) 321 WP_000462826.1 hypothetical protein -
  E0F39_RS00090 (SAMEA3713867_00019) - 9953..10249 (+) 297 WP_001865133.1 hypothetical protein -
  E0F39_RS00095 (SAMEA3713867_00020) - 10242..11030 (+) 789 WP_001865132.1 phage replisome organizer N-terminal domain-containing protein -
  E0F39_RS12235 (SAMEA3713867_00021) - 11018..11176 (+) 159 WP_000538418.1 hypothetical protein -
  E0F39_RS00100 (SAMEA3713867_00022) - 11170..11940 (+) 771 WP_001865131.1 ATP-binding protein -
  E0F39_RS13100 (SAMEA3713867_00023) - 11955..12149 (+) 195 WP_001865130.1 hypothetical protein -
  E0F39_RS00110 (SAMEA3713867_00024) - 12248..13330 (+) 1083 WP_050198917.1 DNA cytosine methyltransferase -
  E0F39_RS12485 - 13327..13404 (+) 78 Protein_20 DNA-binding protein -
  E0F39_RS00120 (SAMEA3713867_00025) - 13492..13845 (+) 354 WP_001864845.1 helix-turn-helix domain-containing protein -
  E0F39_RS00125 (SAMEA3713867_00026) - 13827..14291 (+) 465 WP_000516820.1 hypothetical protein -
  E0F39_RS00130 (SAMEA3713867_00027) - 14400..14942 (+) 543 WP_001028147.1 site-specific integrase -
  E0F39_RS12490 (SAMEA3713867_00028) - 15495..15689 (+) 195 WP_001824495.1 HNH endonuclease -
  E0F39_RS00140 (SAMEA3713867_00029) - 15826..16311 (+) 486 WP_000601030.1 hypothetical protein -
  E0F39_RS00145 (SAMEA3713867_00030) - 16304..18016 (+) 1713 WP_000230006.1 terminase TerL endonuclease subunit -
  E0F39_RS00150 (SAMEA3713867_00031) - 18025..19167 (+) 1143 WP_001812652.1 phage portal protein -
  E0F39_RS00155 (SAMEA3713867_00032) - 19214..19756 (+) 543 WP_000413203.1 HK97 family phage prohead protease -
  E0F39_RS00160 (SAMEA3713867_00033) - 19771..21024 (+) 1254 WP_000855224.1 phage major capsid protein -
  E0F39_RS00165 (SAMEA3713867_00034) - 21050..21385 (+) 336 WP_000154006.1 hypothetical protein -
  E0F39_RS00170 (SAMEA3713867_00035) - 21382..21687 (+) 306 WP_001864847.1 head-tail adaptor protein -
  E0F39_RS00175 (SAMEA3713867_00036) - 21687..22034 (+) 348 WP_001074487.1 hypothetical protein -
  E0F39_RS00180 (SAMEA3713867_00037) - 22021..22365 (+) 345 WP_000534621.1 hypothetical protein -
  E0F39_RS00185 (SAMEA3713867_00038) - 22379..23047 (+) 669 WP_000221469.1 hypothetical protein -
  E0F39_RS00190 (SAMEA3713867_00039) - 23049..23525 (+) 477 WP_000591561.1 hypothetical protein -
  E0F39_RS00200 (SAMEA3713867_00040) - 23712..26450 (+) 2739 WP_001864848.1 phage tail tape measure protein -
  E0F39_RS00205 (SAMEA3713867_00041) - 26447..27169 (+) 723 WP_001864849.1 hypothetical protein -
  E0F39_RS00210 (SAMEA3713867_00042) - 27170..33496 (+) 6327 WP_050247094.1 phage tail spike protein -
  E0F39_RS13105 - 33493..33609 (+) 117 Protein_39 dihydrodipicolinate reductase -
  E0F39_RS00220 (SAMEA3713867_00043) - 33590..33793 (+) 204 WP_001091119.1 hypothetical protein -
  E0F39_RS00225 (SAMEA3713867_00044) - 33796..34146 (+) 351 WP_000852244.1 hypothetical protein -
  E0F39_RS00230 (SAMEA3713867_00045) - 34156..34572 (+) 417 WP_001165341.1 phage holin family protein -
  E0F39_RS00235 (SAMEA3713867_00046) - 34576..34911 (+) 336 WP_001186241.1 phage holin -
  E0F39_RS00240 (SAMEA3713867_00047) - 34911..35867 (+) 957 WP_050120696.1 N-acetylmuramoyl-L-alanine amidase -
  E0F39_RS00245 (SAMEA3713867_00048) - 36005..36184 (-) 180 WP_001233269.1 hypothetical protein -
  E0F39_RS12175 - 36326..36475 (-) 150 WP_001030863.1 hypothetical protein -
  E0F39_RS00250 (SAMEA3713867_00049) tadA 36756..37223 (+) 468 WP_000291875.1 tRNA adenosine(34) deaminase TadA -
  E0F39_RS00260 (SAMEA3713867_00050) - 37410..37853 (+) 444 WP_000701974.1 dUTP diphosphatase -
  E0F39_RS00265 (SAMEA3713867_00051) - 37855..38370 (+) 516 WP_000691236.1 histidine phosphatase family protein -
  E0F39_RS00270 (SAMEA3713867_00052) radA 38384..39745 (+) 1362 WP_074017595.1 DNA repair protein RadA Machinery gene
  E0F39_RS00275 (SAMEA3713867_00053) - 39818..40315 (+) 498 WP_001809263.1 beta-class carbonic anhydrase -
  E0F39_RS00280 (SAMEA3713867_00054) - 40340..41155 (+) 816 WP_000749768.1 PrsW family intramembrane metalloprotease -
  E0F39_RS00285 (SAMEA3713867_00055) - 41300..42268 (+) 969 WP_000010163.1 ribose-phosphate diphosphokinase -
  E0F39_RS00290 - 42402..42683 (-) 282 Protein_54 ISL3 family transposase -
  E0F39_RS12495 - 42810..43717 (-) 908 Protein_55 Rpn family recombination-promoting nuclease/putative transposase -
  E0F39_RS00310 (SAMEA3713867_00059) polA 43968..46601 (+) 2634 WP_001812647.1 DNA polymerase I -
  E0F39_RS00315 (SAMEA3713867_00060) - 46686..47123 (+) 438 WP_000076479.1 CoA-binding protein -
  E0F39_RS12500 (SAMEA3713867_00061) - 47164..47457 (+) 294 WP_050213153.1 hypothetical protein -
  E0F39_RS00320 (SAMEA3713867_00062) - 47486..48496 (-) 1011 WP_000009158.1 YeiH family protein -
  E0F39_RS00325 (SAMEA3713867_00063) - 48645..49814 (+) 1170 WP_000366342.1 pyridoxal phosphate-dependent aminotransferase -
  E0F39_RS00330 (SAMEA3713867_00064) recO 49811..50581 (+) 771 WP_000616164.1 DNA repair protein RecO -
  E0F39_RS00335 (SAMEA3713867_00065) plsX 50578..51570 (+) 993 WP_061755083.1 phosphate acyltransferase PlsX -
  E0F39_RS00340 (SAMEA3713867_00066) - 51576..51809 (+) 234 WP_000136447.1 acyl carrier protein -
  E0F39_RS00345 (SAMEA3713867_00067) - 51846..52145 (+) 300 Protein_64 transposase family protein -
  E0F39_RS00350 (SAMEA3713867_00068) - 52348..52506 (+) 159 WP_001093073.1 bacteriocin class II family protein -
  E0F39_RS12965 (SAMEA3713867_00069) - 52509..52634 (+) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  E0F39_RS00355 (SAMEA3713867_00070) comA 53225..55378 (+) 2154 WP_000668283.1 peptide cleavage/export ABC transporter ComA Regulator
  E0F39_RS00360 (SAMEA3713867_00071) comB 55391..56740 (+) 1350 WP_000801627.1 competence pheromone export protein ComB Regulator
  E0F39_RS00365 (SAMEA3713867_00072) purC 56910..57617 (+) 708 WP_000043300.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  E0F39_RS13110 - 57628..57762 (+) 135 WP_000429436.1 hypothetical protein -
  E0F39_RS00375 (SAMEA3713867_00073) - 57819..61544 (+) 3726 WP_000361191.1 phosphoribosylformylglycinamidine synthase -
  E0F39_RS00380 (SAMEA3713867_00074) purF 61637..63079 (+) 1443 WP_000220637.1 amidophosphoribosyltransferase -
  E0F39_RS00385 (SAMEA3713867_00075) purM 63116..64138 (+) 1023 WP_000182575.1 phosphoribosylformylglycinamidine cyclo-ligase -
  E0F39_RS00390 (SAMEA3713867_00076) purN 64135..64680 (+) 546 WP_000717501.1 phosphoribosylglycinamide formyltransferase -
  E0F39_RS00395 (SAMEA3713867_00077) - 64764..65273 (+) 510 WP_000894018.1 VanZ family protein -
  E0F39_RS00400 (SAMEA3713867_00078) purH 65298..66845 (+) 1548 WP_000167080.1 bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase -

Sequence


Protein


Download         Length: 717 a.a.        Molecular weight: 80364.40 Da        Isoelectric Point: 6.3368

>NTDB_id=1126167 E0F39_RS00355 WP_000668283.1 53225..55378(+) (comA) [Streptococcus pneumoniae strain GPSC47 substr. ST315]
MKFGKRHYRPQVDQMDCGVASLAMVFGYYGSYYFLAHLRELAKTTMDGTTALGLVKVAEEIGFETRAIKADMTLFDLPDL
TFPFVAHVLKEGKLLHYYVVTGQDKDSIHIADPDPGVKLTKLPRERFEEEWTGVTLFMAPSPDYKPHKDQKNGLLSFIPI
LVKQRGLIANIVLATLLVTLINIVGSYYLQSIIDTYVPDQMRSTLGIISIGLVIVYILQQILSYAQEYLLLVLGQRLSID
VILSYIKHVFHLPMSFFATRRTGEIVSRFTDANSIIDALASTILSIFLDVSTVVIISLVLFSQNTNLFFMTLLALPIYTV
IIFAFMKPFEKMNRDTMEANAVLSSSIIEDINGIETIKSLTSESQRYQKIDKEFVDYLKKSFTYSRAESQQKALKKVAHL
LLNVGILWMGAVLVMDGKMSLGQLITYNTLLVYFTNPLENIINLQTKLQTAQVANNRLNEVYLVASEFEEKKTVEDLSLM
KGEMTFKQVHYKYGYGRDVLSDINLTVPQGSKVAFVGISGSGKTTLAKMMVNFYDPSQGEISLGGVNLNQIDKKALRQYI
NYLPQQPYVFNGTILENLLLGAKEGTTQEDILRAVELAEIREDIERMPLNYQTELTSDGAGISGGQRQRIALARALLTDA
PVLILDEATSSLDILTEKRIVDNLIALDKTLIFIAHRLTIAERTEKVVVLDQGKIVEEGKHADLLAQGGFYAHLVNS

Nucleotide


Download         Length: 2154 bp        

>NTDB_id=1126167 E0F39_RS00355 WP_000668283.1 53225..55378(+) (comA) [Streptococcus pneumoniae strain GPSC47 substr. ST315]
ATGAAATTTGGGAAACGTCACTATCGTCCGCAAGTGGATCAGATGGACTGCGGTGTAGCTTCATTAGCCATGGTTTTTGG
CTACTATGGTAGTTATTATTTTTTGGCTCACTTGCGAGAATTGGCTAAGACGACCATGGATGGGACGACGGCTTTGGGCT
TGGTCAAGGTGGCAGAGGAGATTGGCTTTGAGACACGAGCTATCAAGGCGGATATGACGCTTTTTGATTTGCCCGATTTG
ACTTTTCCTTTTGTTGCCCATGTGCTTAAGGAAGGGAAATTGCTCCACTACTATGTGGTGACTGGGCAGGATAAGGACAG
CATTCATATTGCCGATCCAGATCCTGGGGTGAAATTGACCAAACTGCCACGTGAGCGTTTTGAGGAAGAATGGACAGGAG
TGACTCTGTTTATGGCACCTAGTCCAGACTACAAGCCTCATAAGGATCAAAAGAATGGTCTGCTCTCTTTTATCCCTATA
TTAGTGAAGCAGCGTGGCTTGATTGCCAATATCGTTTTGGCAACACTCTTGGTAACCTTGATTAACATTGTGGGTTCTTA
TTATCTGCAGTCTATCATTGATACCTATGTGCCAGATCAGATGCGTTCGACGTTGGGGATTATTTCTATTGGGCTAGTCA
TCGTCTACATCCTCCAGCAAATCTTGTCTTACGCTCAGGAGTATCTCTTGCTTGTTTTGGGGCAACGCTTGTCGATTGAC
GTGATTTTGTCCTATATCAAGCATGTTTTTCACCTCCCTATGTCCTTCTTTGCGACACGCAGGACAGGGGAGATCGTGTC
TCGTTTTACAGATGCTAACAGTATCATCGATGCGCTGGCTTCGACCATCCTTTCGATTTTCCTAGATGTGTCAACGGTTG
TCATTATTTCCCTTGTTTTATTTTCACAAAATACCAATCTCTTTTTCATGACTTTATTGGCGCTTCCTATCTACACAGTG
ATTATCTTTGCCTTTATGAAGCCATTTGAAAAGATGAATCGGGACACCATGGAAGCCAATGCGGTTCTGTCTTCTTCTAT
CATTGAGGACATCAATGGTATTGAGACTATCAAGTCCTTGACCAGTGAAAGTCAGCGTTACCAAAAAATTGACAAGGAAT
TTGTGGATTATCTGAAGAAATCCTTTACCTATAGTCGAGCAGAGAGTCAGCAAAAGGCTCTGAAAAAGGTTGCCCATCTC
TTGCTTAATGTCGGCATTCTCTGGATGGGGGCTGTTCTGGTCATGGATGGCAAGATGAGTTTGGGGCAGTTGATTACCTA
TAATACCTTGCTGGTTTACTTTACCAATCCTTTGGAAAATATCATCAATCTGCAAACCAAGCTTCAGACAGCGCAGGTTG
CCAATAACCGTCTAAATGAAGTGTATCTAGTAGCTTCTGAGTTTGAGGAGAAGAAAACAGTTGAGGATTTGAGCTTGATG
AAGGGAGAGATGACTTTCAAGCAGGTTCATTACAAGTATGGCTATGGTCGAGACGTCTTGTCGGATATCAATTTAACCGT
TCCCCAAGGGTCTAAGGTGGCTTTTGTGGGGATTTCAGGGTCAGGTAAGACGACTTTGGCCAAGATGATGGTTAATTTTT
ACGACCCAAGTCAAGGGGAGATTAGTCTGGGTGGTGTCAATCTCAATCAGATTGATAAAAAAGCCCTGCGCCAGTACATC
AACTATCTGCCTCAACAGCCCTATGTCTTTAACGGAACGATTTTGGAGAATCTTCTTTTGGGAGCCAAGGAGGGGACGAC
ACAGGAAGATATCTTACGGGCGGTCGAATTGGCAGAGATTCGAGAGGATATCGAGCGCATGCCACTGAATTATCAGACAG
AATTGACTTCGGATGGGGCAGGGATTTCAGGTGGTCAACGTCAGAGAATCGCTTTGGCGCGTGCTCTCTTGACAGATGCG
CCGGTCTTGATTTTGGATGAGGCGACTAGCAGTTTGGATATTTTGACAGAGAAGCGGATTGTCGATAATCTCATAGCTTT
GGACAAGACCTTGATTTTCATTGCTCACCGCTTGACTATTGCTGAGCGGACAGAGAAGGTAGTTGTCTTGGATCAGGGCA
AGATTGTCGAAGAAGGAAAGCATGCTGATTTGCTTGCACAGGGTGGCTTTTACGCCCATTTGGTCAATAGCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0T8MW44

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Streptococcus pneumoniae Rx1

99.582

100

0.996

  comA Streptococcus pneumoniae D39

99.582

100

0.996

  comA Streptococcus pneumoniae R6

99.582

100

0.996

  comA Streptococcus pneumoniae TIGR4

99.163

100

0.992

  comA Streptococcus mitis SK321

98.605

100

0.986

  comA Streptococcus mitis NCTC 12261

98.605

100

0.986

  comA Streptococcus gordonii str. Challis substr. CH1

80.474

100

0.805

  comA/nlmT Streptococcus mutans UA159

64.854

100

0.649


Multiple sequence alignment