Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Regulator
Locus tag   DV119_RS03445 Genome accession   NZ_CP031245
Coordinates   607700..609853 (+) Length   717 a.a.
NCBI ID   WP_050221223.1    Uniprot ID   -
Organism   Streptococcus pneumoniae strain M16808     
Function   processing and transport of ComC (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 542761..621320 607700..609853 within 0
IScluster/Tn 604845..607266 607700..609853 flank 434


Gene organization within MGE regions


Location: 542761..621320
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DV119_RS03050 ftsH 542761..544719 (+) 1959 WP_000744557.1 ATP-dependent zinc metalloprotease FtsH -
  DV119_RS03055 - 544784..546031 (-) 1248 WP_114880800.1 ISL3 family transposase -
  DV119_RS03060 comX/comX2 546255..546734 (+) 480 WP_000588874.1 sigma-70 family RNA polymerase sigma factor Regulator
  DV119_RS03095 - 552161..552958 (-) 798 Protein_534 transposase -
  DV119_RS03100 comW 553224..553460 (+) 237 WP_000939545.1 sigma(X)-activator ComW Regulator
  DV119_RS03105 - 553691..554977 (+) 1287 WP_000205044.1 adenylosuccinate synthase -
  DV119_RS03110 - 555219..556367 (-) 1149 WP_000876732.1 tyrosine-type recombinase/integrase -
  DV119_RS03120 - 556553..557476 (-) 924 WP_000122591.1 exonuclease domain-containing protein -
  DV119_RS03125 - 557489..557872 (-) 384 WP_001865140.1 ImmA/IrrE family metallo-endopeptidase -
  DV119_RS03130 - 557885..558250 (-) 366 WP_000492031.1 helix-turn-helix domain-containing protein -
  DV119_RS03135 - 558556..559038 (-) 483 WP_000842471.1 hypothetical protein -
  DV119_RS03140 - 559093..559296 (+) 204 WP_000032096.1 hypothetical protein -
  DV119_RS03145 - 559313..559510 (+) 198 WP_001057654.1 hypothetical protein -
  DV119_RS03150 - 559626..560285 (-) 660 WP_001865135.1 hypothetical protein -
  DV119_RS03155 - 560339..561055 (+) 717 WP_001865134.1 ORF6C domain-containing protein -
  DV119_RS03160 - 561068..561325 (+) 258 WP_000370959.1 hypothetical protein -
  DV119_RS03165 - 561411..561731 (+) 321 WP_000462826.1 hypothetical protein -
  DV119_RS03170 - 561747..562043 (+) 297 WP_001865133.1 hypothetical protein -
  DV119_RS03175 - 562036..562842 (+) 807 WP_054387342.1 phage replisome organizer N-terminal domain-containing protein -
  DV119_RS03180 - 562982..563752 (+) 771 WP_000228214.1 ATP-binding protein -
  DV119_RS12075 - 563767..563961 (+) 195 WP_000470305.1 hypothetical protein -
  DV119_RS03190 - 563961..564188 (+) 228 WP_050267377.1 hypothetical protein -
  DV119_RS11460 - 564315..564482 (+) 168 WP_000233203.1 hypothetical protein -
  DV119_RS03205 - 564810..565163 (+) 354 WP_001177094.1 helix-turn-helix domain-containing protein -
  DV119_RS03210 - 565145..565609 (+) 465 WP_000516820.1 hypothetical protein -
  DV119_RS03215 - 565718..566260 (+) 543 WP_001028147.1 site-specific integrase -
  DV119_RS03220 - 566801..567007 (+) 207 WP_223842409.1 HNH endonuclease -
  DV119_RS03225 - 567144..567638 (+) 495 WP_054387340.1 hypothetical protein -
  DV119_RS03230 - 567631..569343 (+) 1713 WP_000230006.1 terminase TerL endonuclease subunit -
  DV119_RS03235 - 569352..570494 (+) 1143 WP_001812652.1 phage portal protein -
  DV119_RS03240 - 570541..571083 (+) 543 WP_000413203.1 HK97 family phage prohead protease -
  DV119_RS03245 - 571098..572351 (+) 1254 WP_000855224.1 phage major capsid protein -
  DV119_RS03250 - 572377..572712 (+) 336 WP_000154006.1 phage head-tail connector protein -
  DV119_RS03255 - 572709..573014 (+) 306 WP_000842790.1 head-tail adaptor protein -
  DV119_RS03260 - 573014..573361 (+) 348 WP_001074487.1 hypothetical protein -
  DV119_RS03265 - 573348..573692 (+) 345 WP_000534621.1 hypothetical protein -
  DV119_RS03270 - 573706..574374 (+) 669 WP_000221469.1 hypothetical protein -
  DV119_RS03275 - 574376..574852 (+) 477 WP_000591561.1 hypothetical protein -
  DV119_RS03285 - 575039..577777 (+) 2739 WP_000852167.1 phage tail tape measure protein -
  DV119_RS03290 - 577774..578496 (+) 723 WP_050199190.1 phage tail protein -
  DV119_RS03295 - 578497..586458 (+) 7962 WP_114880801.1 phage tail spike protein -
  DV119_RS12080 - 586455..586571 (+) 117 WP_001063633.1 hypothetical protein -
  DV119_RS03305 - 586552..586755 (+) 204 WP_001091109.1 hypothetical protein -
  DV119_RS03310 - 586758..587108 (+) 351 WP_000852248.1 hypothetical protein -
  DV119_RS03315 - 587118..587534 (+) 417 WP_001165341.1 phage holin family protein -
  DV119_RS03320 - 587538..587870 (+) 333 WP_001186207.1 phage holin -
  DV119_RS03325 lytA 587874..588830 (+) 957 WP_000350489.1 N-acetylmuramoyl-L-alanine amidase LytA -
  DV119_RS03330 - 589099..589278 (-) 180 WP_001233269.1 hypothetical protein -
  DV119_RS11275 - 589420..589569 (-) 150 WP_162489709.1 hypothetical protein -
  DV119_RS03335 tadA 589839..590306 (+) 468 WP_000291874.1 tRNA adenosine(34) deaminase TadA -
  DV119_RS03345 - 590493..590936 (+) 444 WP_000701992.1 dUTP diphosphatase -
  DV119_RS03350 - 590938..591453 (+) 516 WP_000691236.1 histidine phosphatase family protein -
  DV119_RS03355 radA 591467..592828 (+) 1362 WP_074017595.1 DNA repair protein RadA Machinery gene
  DV119_RS03360 - 592901..593398 (+) 498 WP_001809263.1 beta-class carbonic anhydrase -
  DV119_RS03370 - 593423..594237 (+) 815 Protein_585 PrsW family intramembrane metalloprotease -
  DV119_RS03375 - 594382..595350 (+) 969 WP_000010163.1 ribose-phosphate diphosphokinase -
  DV119_RS11730 - 595468..596413 (+) 946 Protein_587 Rpn family recombination-promoting nuclease/putative transposase -
  DV119_RS03395 polA 596916..599549 (+) 2634 WP_050279195.1 DNA polymerase I -
  DV119_RS03400 - 599634..600071 (+) 438 WP_000076479.1 CoA-binding protein -
  DV119_RS11735 - 600112..600450 (+) 339 WP_061849647.1 hypothetical protein -
  DV119_RS03405 - 600479..601489 (-) 1011 WP_000009175.1 YeiH family protein -
  DV119_RS03410 - 601638..602807 (+) 1170 WP_000366348.1 pyridoxal phosphate-dependent aminotransferase -
  DV119_RS03415 recO 602804..603574 (+) 771 WP_050220959.1 DNA repair protein RecO -
  DV119_RS03420 plsX 603571..604563 (+) 993 WP_000717456.1 phosphate acyltransferase PlsX -
  DV119_RS03425 - 604569..604802 (+) 234 WP_000136447.1 acyl carrier protein -
  DV119_RS03430 - 604839..605138 (+) 300 Protein_596 transposase family protein -
  DV119_RS03435 blpU 605341..605571 (+) 231 WP_001093487.1 bacteriocin-like peptide BlpU -
  DV119_RS12005 - 605574..605699 (+) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  DV119_RS03440 - 605949..607266 (-) 1318 Protein_599 ISL3 family transposase -
  DV119_RS03445 comA 607700..609853 (+) 2154 WP_050221223.1 peptide cleavage/export ABC transporter ComA Regulator
  DV119_RS03450 comB 609866..611215 (+) 1350 WP_000801629.1 competence pheromone export protein ComB Regulator
  DV119_RS03455 purC 611385..612092 (+) 708 WP_000043313.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  DV119_RS12085 - 612103..612237 (+) 135 WP_000429436.1 hypothetical protein -
  DV119_RS03465 - 612294..616019 (+) 3726 WP_114880802.1 phosphoribosylformylglycinamidine synthase -
  DV119_RS03470 purF 616112..617554 (+) 1443 WP_050104961.1 amidophosphoribosyltransferase -
  DV119_RS03475 purM 617591..618613 (+) 1023 WP_114880803.1 phosphoribosylformylglycinamidine cyclo-ligase -
  DV119_RS03480 purN 618610..619155 (+) 546 WP_000717506.1 phosphoribosylglycinamide formyltransferase -
  DV119_RS03485 - 619239..619748 (+) 510 WP_050104960.1 VanZ family protein -
  DV119_RS03490 purH 619773..621320 (+) 1548 WP_114880804.1 bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase -

Sequence


Protein


Download         Length: 717 a.a.        Molecular weight: 80422.41 Da        Isoelectric Point: 6.3368

>NTDB_id=304632 DV119_RS03445 WP_050221223.1 607700..609853(+) (comA) [Streptococcus pneumoniae strain M16808]
MKFGKRHYRPQVDQMDCGVASLAMVFGYYGSYYFLAHLRELAKTTMDGTTALGLVKVAEEIGFETRAIKADMTLFDLPDL
TFPFVAHVLKEGKLLHYYVVTGQDKDSIHIADPDPGVKLTKLPRERFEEEWTGVTLFMAPSPDYKPHKEQKNGLLSFIPI
LVKQRGLIANIVLATLLVTVINIVGSYYLQSIIDTYVPDQMRSTLGIISIGLVIVYIFQQILSYAQEYLLLVLGQRLSID
VILSYIKHVFHLPMSFFATRRTGEIVSRFTDANSIIDALASTILSIFLDVSTVVIISLVLFSQNTNLFFMTLLALPIYTV
IIFAFMKPFEKMNRDTMEANAVLSSSIIEDINGIETIKSLTSESQRYQKIDKEFVDYLKKSFTYSRAESQQKALKKVAHL
LLNVGILWMGAVLVMDGKMSLGQLITYNTLLVYFTNPLENIINLQTKLQTAQVANNRLNEVYLVASEFEEKKTVEDLSLM
KGDMTFKQVHYKYGYGRDVLSDINLTVPQGSKVAFVGISGSGKTTLAKMMVNFYDPSQGEISLGSVNLNQIDKKALRQYI
NYLSQQPYVFNGTILENLLLGAKEGTTQEDILRAVELAEIREDIERMPLNYQTELTSDGAGISGGQRQRIALARALLTDA
PVLILDEATSSLDILTEKRIVDNLMALDKTLIFIAHRLTIAERTEKVVVLDQGKIVEEGKHADLLAQGGFYAHLVNS

Nucleotide


Download         Length: 2154 bp        

>NTDB_id=304632 DV119_RS03445 WP_050221223.1 607700..609853(+) (comA) [Streptococcus pneumoniae strain M16808]
ATGAAATTTGGGAAACGTCATTATCGTCCGCAGGTGGATCAGATGGACTGCGGTGTAGCTTCATTAGCCATGGTTTTTGG
CTACTATGGTAGTTATTATTTTTTGGCTCATTTGCGAGAATTGGCTAAGACGACCATGGATGGGACGACGGCTTTGGGCT
TGGTCAAGGTGGCAGAGGAGATTGGTTTTGAGACGCGAGCCATTAAGGCAGATATGACGCTTTTTGACTTGCCGGATTTG
ACTTTTCCTTTTGTTGCCCATGTGCTTAAGGAAGGGAAATTGCTCCACTACTATGTGGTGACTGGGCAGGATAAGGATAG
CATTCATATTGCCGATCCAGATCCCGGGGTGAAGTTGACTAAACTGCCACGTGAGCGTTTTGAGGAAGAATGGACAGGAG
TGACTCTTTTTATGGCACCTAGTCCAGACTATAAGCCTCATAAGGAACAAAAAAATGGTCTGCTCTCTTTTATCCCTATA
TTAGTGAAGCAGCGTGGCTTGATTGCCAATATCGTTTTGGCAACACTCTTGGTAACCGTGATTAACATTGTGGGTTCTTA
TTATCTGCAGTCTATCATTGATACCTATGTGCCAGATCAGATGCGTTCGACACTAGGGATTATTTCTATTGGGCTAGTCA
TCGTCTACATCTTCCAGCAAATCTTGTCTTACGCTCAGGAGTATCTCTTGCTTGTTTTGGGGCAACGCTTGTCGATTGAC
GTGATTTTGTCCTATATCAAGCATGTTTTTCACCTCCCTATGTCCTTCTTTGCGACACGCAGGACAGGGGAGATCGTGTC
TCGTTTTACAGATGCTAACAGTATCATCGATGCGCTGGCTTCGACCATCCTTTCGATTTTCCTAGATGTGTCAACGGTTG
TCATTATTTCCCTTGTTCTATTTTCACAAAATACCAATCTCTTTTTCATGACTTTATTGGCGCTTCCTATCTACACAGTG
ATTATCTTTGCCTTTATGAAGCCGTTTGAAAAGATGAATCGGGATACCATGGAAGCCAATGCGGTTCTGTCTTCTTCTAT
CATTGAGGACATCAACGGTATTGAGACTATCAAGTCCTTGACCAGTGAAAGTCAGCGTTACCAAAAAATTGACAAGGAAT
TTGTGGATTATCTGAAGAAATCCTTTACCTATAGTCGAGCAGAGAGTCAGCAAAAGGCTCTGAAAAAGGTTGCCCATCTC
TTGCTTAATGTCGGCATTCTCTGGATGGGGGCTGTTCTGGTCATGGATGGCAAGATGAGTTTGGGGCAGTTGATTACCTA
TAATACCTTGCTGGTTTACTTTACTAATCCTTTGGAAAATATCATCAATCTGCAAACCAAGCTTCAGACAGCGCAGGTTG
CCAATAACCGTCTAAATGAAGTGTATCTAGTAGCTTCTGAGTTTGAGGAGAAGAAAACAGTTGAGGATTTGAGCTTGATG
AAGGGAGATATGACCTTCAAGCAGGTTCATTACAAGTATGGCTATGGTCGAGATGTCTTATCGGATATCAATTTAACCGT
TCCCCAAGGGTCTAAGGTGGCTTTTGTGGGGATTTCAGGGTCAGGTAAGACGACTTTGGCCAAGATGATGGTTAATTTTT
ACGACCCAAGTCAAGGGGAGATTAGTCTGGGTAGTGTCAATCTCAATCAGATTGATAAAAAAGCCCTGCGCCAGTACATC
AACTATCTGTCTCAACAGCCCTATGTCTTTAACGGAACGATTTTGGAGAATCTTCTTTTGGGAGCCAAGGAGGGGACGAC
ACAGGAAGATATCTTACGGGCGGTCGAATTGGCAGAGATTCGAGAGGATATCGAGCGCATGCCACTGAATTACCAGACAG
AATTGACTTCGGATGGGGCAGGAATTTCAGGTGGTCAACGTCAGAGAATCGCTTTGGCGCGTGCTCTCTTGACAGATGCG
CCGGTCTTGATTTTGGATGAGGCGACTAGCAGTTTGGATATTTTGACAGAGAAGCGGATTGTCGATAATCTCATGGCTTT
GGACAAGACCTTGATTTTCATTGCTCACCGCTTGACTATTGCTGAGCGGACAGAGAAGGTAGTTGTCTTGGATCAGGGCA
AGATTGTCGAAGAAGGAAAGCATGCTGATTTGCTTGCACAGGGTGGCTTTTACGCCCATTTGGTCAATAGCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Streptococcus pneumoniae TIGR4

99.861

100

0.999

  comA Streptococcus pneumoniae Rx1

99.442

100

0.994

  comA Streptococcus pneumoniae D39

99.442

100

0.994

  comA Streptococcus pneumoniae R6

99.442

100

0.994

  comA Streptococcus mitis SK321

98.187

100

0.982

  comA Streptococcus mitis NCTC 12261

98.187

100

0.982

  comA Streptococcus gordonii str. Challis substr. CH1

80.335

100

0.803

  comA/nlmT Streptococcus mutans UA159

64.575

100

0.646


Multiple sequence alignment