Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comA
Type: Regulator
Locus tag: SPN994039_RS00480
Genome accession: NC_021005
Coordinates: 74457..76610 (+)
Length: 717 a.a.
NCBI ID: WP_000668304.1
UniProt ID: -
Organism: Streptococcus pneumoniae SPN994039
Function: processing and transport of ComC (predicted from homology)
Competence regulation
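
The coordinates and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the coding sequence ends with a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length_bp(74457, 76610)
print(length_bp, protein_length_aa(length_bp))
```

Under these assumptions the range 74457..76610 spans 2154 bp, which corresponds to 718 codons including the stop codon, or 717 residues.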

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicts in the genome, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 12359..88077 74457..76610 within 0
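
The "Relative position" and "Distance (bp)" columns can be illustrated with a small interval comparison. This is a minimal sketch, not the method VRprofile2 or the database uses: only the label "within" comes from the table above, the other labels ("upstream", "downstream", "overlapping") are hypothetical, and the distance is assumed to be the gap between the nearest coordinates.

```python
from typing import Tuple

Interval = Tuple[int, int]  # (start, end), 1-based inclusive, start <= end


def relate(gene: Interval, mge: Interval) -> Tuple[str, int]:
    """Return (relative position, distance in bp) of a gene versus an MGE."""
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0  # gene lies entirely inside the MGE
    if g_end < m_start:
        return "upstream", m_start - g_end  # hypothetical label
    if g_start > m_end:
        return "downstream", g_start - m_end  # hypothetical label
    return "overlapping", 0  # hypothetical label for partial overlap


print(relate((74457, 76610), (12359, 88077)))
```

For the comA coordinates and the prophage region listed above, the gene range is fully contained in the MGE range, which matches the "within" and 0 entries in the table.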


Gene organization within MGE regions


Location: 12359..88077
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SPN994039_RS00065 (SPN994039_00130) ftsH 12359..14317 (+) 1959 WP_000744554.1 ATP-dependent zinc metalloprotease FtsH -
  SPN994039_RS00070 (SPN994039_00140) comX/comX2 14439..14918 (+) 480 WP_000588866.1 sigma-70 family RNA polymerase sigma factor Regulator
  SPN994039_RS00105 - 20468..20677 (+) 210 Protein_14 transposase -
  SPN994039_RS10970 (SPN994039_00150) - 20712..21509 (-) 798 Protein_15 transposase -
  SPN994039_RS00125 (SPN994039_00160) comW 21775..22011 (+) 237 WP_000939546.1 sigma(X)-activator ComW Regulator
  SPN994039_RS00130 (SPN994039_00170) - 22242..23528 (+) 1287 WP_000205044.1 adenylosuccinate synthase -
  SPN994039_RS00135 (SPN994039_00180) - 23770..24918 (-) 1149 WP_000876732.1 site-specific integrase -
  SPN994039_RS00140 (SPN994039_00190) - 25104..26027 (-) 924 WP_000122591.1 exonuclease domain-containing protein -
  SPN994039_RS00145 (SPN994039_00200) - 26040..26423 (-) 384 WP_000136459.1 ImmA/IrrE family metallo-endopeptidase -
  SPN994039_RS00150 (SPN994039_00210) - 26436..26801 (-) 366 WP_000492031.1 helix-turn-helix transcriptional regulator -
  SPN994039_RS00155 (SPN994039_00230) - 27178..27399 (-) 222 WP_000041097.1 hypothetical protein -
  SPN994039_RS12565 (SPN994039_00240) - 27518..27664 (+) 147 WP_000389576.1 hypothetical protein -
  SPN994039_RS00160 (SPN994039_00250) - 27676..27879 (+) 204 WP_000032097.1 helix-turn-helix transcriptional regulator -
  SPN994039_RS00165 - 27896..28093 (+) 198 WP_001057654.1 hypothetical protein -
  SPN994039_RS12570 (SPN994039_00260) - 28104..28265 (+) 162 WP_001002946.1 hypothetical protein -
  SPN994039_RS00170 (SPN994039_00270) - 28260..28685 (-) 426 WP_000386249.1 hypothetical protein -
  SPN994039_RS00175 (SPN994039_00280) - 28739..29449 (+) 711 WP_001002359.1 ORF6C domain-containing protein -
  SPN994039_RS00180 (SPN994039_00290) - 29463..29720 (+) 258 WP_000370959.1 hypothetical protein -
  SPN994039_RS00185 (SPN994039_00300) - 29806..30126 (+) 321 WP_000462824.1 hypothetical protein -
  SPN994039_RS00190 (SPN994039_00310) - 30142..30438 (+) 297 WP_000391805.1 hypothetical protein -
  SPN994039_RS00195 (SPN994039_00320) - 30431..31237 (+) 807 WP_001289771.1 phage replisome organizer N-terminal domain-containing protein -
  SPN994039_RS00200 (SPN994039_00340) - 31377..32147 (+) 771 WP_000228219.1 ATP-binding protein -
  SPN994039_RS13285 - 32162..32356 (+) 195 WP_000470307.1 hypothetical protein -
  SPN994039_RS00210 (SPN994039_00350) - 32356..32574 (+) 219 WP_000891962.1 hypothetical protein -
  SPN994039_RS10980 - 32956..33054 (+) 99 Protein_36 single-stranded DNA-binding protein -
  SPN994039_RS12575 (SPN994039_00370) - 33068..33235 (+) 168 WP_000233203.1 hypothetical protein -
  SPN994039_RS00220 - 33222..33431 (+) 210 WP_000872740.1 hypothetical protein -
  SPN994039_RS00225 (SPN994039_00380) - 33403..33720 (+) 318 WP_000969665.1 hypothetical protein -
  SPN994039_RS12805 - 33722..33796 (+) 75 Protein_40 DUF1642 domain-containing protein -
  SPN994039_RS00230 (SPN994039_00400) - 33973..34374 (+) 402 WP_000736390.1 transcriptional activator -
  SPN994039_RS00235 (SPN994039_00410) - 34562..35104 (+) 543 WP_000397549.1 site-specific integrase -
  SPN994039_RS00240 (SPN994039_00420) - 35481..35801 (+) 321 WP_000282427.1 HNH endonuclease -
  SPN994039_RS00245 (SPN994039_00430) - 35938..36330 (+) 393 WP_001118283.1 P27 family phage terminase small subunit -
  SPN994039_RS00250 (SPN994039_00440) - 36323..38053 (+) 1731 WP_000527299.1 terminase large subunit -
  SPN994039_RS00255 - 38061..38279 (+) 219 WP_001002923.1 hypothetical protein -
  SPN994039_RS00260 (SPN994039_00460) - 38297..39499 (+) 1203 WP_000510803.1 phage portal protein -
  SPN994039_RS00265 (SPN994039_00470) - 39483..40058 (+) 576 WP_001172115.1 HK97 family phage prohead protease -
  SPN994039_RS00270 (SPN994039_00480) - 40055..41221 (+) 1167 WP_001030357.1 phage major capsid protein -
  SPN994039_RS00275 - 41233..41502 (+) 270 WP_000262606.1 hypothetical protein -
  SPN994039_RS00280 (SPN994039_00500) - 41505..41786 (+) 282 WP_000370976.1 hypothetical protein -
  SPN994039_RS00285 (SPN994039_00510) - 41773..42072 (+) 300 WP_000267055.1 phage head closure protein -
  SPN994039_RS00290 (SPN994039_00520) - 42069..42416 (+) 348 WP_000063886.1 HK97 gp10 family phage protein -
  SPN994039_RS00295 (SPN994039_00530) - 42413..42736 (+) 324 WP_000777003.1 hypothetical protein -
  SPN994039_RS00300 (SPN994039_00540) - 42748..43326 (+) 579 WP_000191279.1 major tail protein -
  SPN994039_RS00305 (SPN994039_00550) - 43338..43757 (+) 420 WP_001227146.1 hypothetical protein -
  SPN994039_RS00310 (SPN994039_00560) - 44035..47130 (+) 3096 WP_000918318.1 hypothetical protein -
  SPN994039_RS00315 (SPN994039_00570) - 47127..47849 (+) 723 WP_000589856.1 hypothetical protein -
  SPN994039_RS12305 (SPN994039_00580) - 47850..54482 (+) 6633 WP_000966215.1 phage tail spike protein -
  SPN994039_RS13290 (SPN994039_00590) - 54479..54595 (+) 117 WP_001063632.1 hypothetical protein -
  SPN994039_RS00330 (SPN994039_00600) - 54576..54779 (+) 204 WP_001091113.1 hypothetical protein -
  SPN994039_RS00335 (SPN994039_00610) - 54782..55132 (+) 351 WP_000852241.1 hypothetical protein -
  SPN994039_RS00340 (SPN994039_00620) - 55141..55557 (+) 417 WP_001165344.1 phage holin family protein -
  SPN994039_RS00345 (SPN994039_00630) - 55561..55893 (+) 333 WP_001186219.1 phage holin -
  SPN994039_RS00350 (SPN994039_00640) - 55897..56853 (+) 957 WP_000350505.1 N-acetylmuramoyl-L-alanine amidase family protein -
  SPN994039_RS00360 - 57074..57262 (-) 189 WP_000109850.1 hypothetical protein -
  SPN994039_RS00365 (SPN994039_00670) tadA 57830..58297 (+) 468 WP_000291870.1 tRNA adenosine(34) deaminase TadA -
  SPN994039_RS00370 (SPN994039_00680) - 58483..58926 (+) 444 WP_000701992.1 dUTP diphosphatase -
  SPN994039_RS00375 (SPN994039_00690) - 58928..59443 (+) 516 WP_001838385.1 histidine phosphatase family protein -
  SPN994039_RS00380 (SPN994039_00700) radA 59457..60818 (+) 1362 WP_074017595.1 DNA repair protein RadA Machinery gene
  SPN994039_RS00385 (SPN994039_00710) - 60891..61388 (+) 498 WP_001809263.1 carbonic anhydrase -
  SPN994039_RS00395 (SPN994039_00720) - 61413..62243 (+) 831 Protein_72 PrsW family glutamic-type intramembrane protease -
  SPN994039_RS00400 (SPN994039_00730) - 62388..63356 (+) 969 WP_000010163.1 ribose-phosphate diphosphokinase -
  SPN994039_RS10995 (SPN994039_00740) - 63493..63771 (-) 279 Protein_74 transposase family protein -
  SPN994039_RS12810 - 63815..64740 (-) 926 Protein_75 Rpn family recombination-promoting nuclease/putative transposase -
  SPN994039_RS00425 (SPN994039_00770) polA 64996..67629 (+) 2634 WP_001809267.1 DNA polymerase I -
  SPN994039_RS00430 (SPN994039_00780) - 67714..68151 (+) 438 WP_000076479.1 CoA-binding protein -
  SPN994039_RS12815 - 68192..68620 (+) 429 WP_000693134.1 hypothetical protein -
  SPN994039_RS00440 (SPN994039_00790) - 68649..69659 (-) 1011 WP_000009171.1 YeiH family protein -
  SPN994039_RS00445 (SPN994039_00800) - 69808..70977 (+) 1170 WP_000366345.1 pyridoxal phosphate-dependent aminotransferase -
  SPN994039_RS00450 (SPN994039_00810) recO 70974..71744 (+) 771 WP_000616162.1 DNA repair protein RecO -
  SPN994039_RS00455 (SPN994039_00820) plsX 71741..72733 (+) 993 WP_000717457.1 phosphate acyltransferase PlsX -
  SPN994039_RS00460 (SPN994039_00830) - 72739..72972 (+) 234 WP_000659556.1 acyl carrier protein -
  SPN994039_RS11005 (SPN994039_00840) - 73018..73309 (+) 292 Protein_84 IS5/IS1182 family transposase -
  SPN994039_RS00465 (SPN994039_00850) blpU 73512..73742 (+) 231 Protein_85 bacteriocin-like peptide BlpU -
  SPN994039_RS13155 (SPN994039_00860) - 73745..73870 (+) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  SPN994039_RS00480 (SPN994039_00870) comA 74457..76610 (+) 2154 WP_000668304.1 peptide cleavage/export ABC transporter ComA Regulator
  SPN994039_RS00485 (SPN994039_00880) comB 76623..77972 (+) 1350 WP_000801618.1 competence pheromone export protein ComB Regulator
  SPN994039_RS00490 (SPN994039_00890) purC 78142..78849 (+) 708 WP_000043304.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  SPN994039_RS13295 - 78851..78994 (+) 144 WP_050167432.1 hypothetical protein -
  SPN994039_RS00495 (SPN994039_00900) - 79051..82776 (+) 3726 WP_000361178.1 phosphoribosylformylglycinamidine synthase -
  SPN994039_RS00500 (SPN994039_00910) purF 82869..84311 (+) 1443 WP_000220633.1 amidophosphoribosyltransferase -
  SPN994039_RS00505 (SPN994039_00920) purM 84348..85370 (+) 1023 WP_000182558.1 phosphoribosylformylglycinamidine cyclo-ligase -
  SPN994039_RS00510 (SPN994039_00930) purN 85367..85912 (+) 546 WP_000717506.1 phosphoribosylglycinamide formyltransferase -
  SPN994039_RS00515 (SPN994039_00940) - 85996..86505 (+) 510 WP_000894018.1 VanZ family protein -
  SPN994039_RS00520 (SPN994039_00950) purH 86530..88077 (+) 1548 WP_000167082.1 bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase -

Sequence


Protein


Length: 717 a.a.   Molecular weight: 80436.52 Da   Isoelectric point: 6.2593

>NTDB_id=57919 SPN994039_RS00480 WP_000668304.1 74457..76610(+) (comA) [Streptococcus pneumoniae SPN994039]
MKFGKRHYRPQVDQMDCGVASLAMVFGYYGSYYFLAHLRELAKTTMDGTTALGLVKVAEEIGFETRAIKADMTLFDLPDL
TFPFVAHVLKEGKLLHYYVVTGQDKDSIHIADPDPGVKLTKLPRERFEEEWTGVTLFMAPSPDYKPYKEQKNGLLSFIPI
LVKQRGLIANIVLATLLVTVINIVGSYYLQSIIDTYVPDQMRSTLGIISIGLVIVYILQQILSYAQEYLLLVLGQRLSID
VILSYIKHVFHLPMSFFATRRTGEIVSRFTDANSIIDALASTILSIFLDVSTVVIISLVLFSQNTNLFFMTLLALPIYTV
IIFAFMKPFEKMNRDTMEANAVLSSSIIEDINGIETIKSLTSESQRYQKIDKEFVDYLKKSFTYSRAESQQKALKKVAHL
LLNVGILWMGAVLVMDGKMSLGQLITYNTLLVYFTNPLENIINLQTKLQTAQVANNRLNEVYLVASEFEEKKTVEDLSLM
KGEMTFKQVHYKYGYGRDVLSDINLTVPQGSKVAFVGISGSGKTTLAKMMVNFYDPSQGEISLGGVNLNQIDKKALRQYI
NYLPQQPYVFNGTILENLLLGAKEGTTQEDILRAVELVEIREDIERMPLNYQTELTSDGAGISGGQRQRIALARALLTDA
PVLILDEATSSLDILTEKRIVDNLMALDKTLIFIAHRLTIAERTEKVVVLDQGKIVEEGKHADLLAQGGFYAHLVNS
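
The FASTA header above packs several fields into one line. The sketch below shows one way to split it, assuming the layout is always `>NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]` as in this record. The regular expression and the field names are illustrative and are not an official parser for the database.

```python
import re

# Pattern inferred from the single header shown in this record (assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]*)\) "
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; coordinates become integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=57919 SPN994039_RS00480 WP_000668304.1 "
          "74457..76610(+) (comA) [Streptococcus pneumoniae SPN994039]")
print(parse_header(header))
```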

Nucleotide


Length: 2154 bp

>NTDB_id=57919 SPN994039_RS00480 WP_000668304.1 74457..76610(+) (comA) [Streptococcus pneumoniae SPN994039]
ATGAAATTTGGGAAACGTCACTATCGTCCGCAAGTGGATCAGATGGACTGCGGTGTAGCTTCATTAGCCATGGTTTTTGG
CTACTATGGTAGTTATTATTTTTTGGCTCACTTGCGAGAATTGGCTAAGACGACCATGGATGGGACGACGGCTTTGGGCT
TGGTCAAGGTGGCAGAGGAGATTGGTTTTGAGACGCGAGCCATTAAGGCGGATATGACGCTTTTTGACTTGCCGGATTTG
ACTTTTCCTTTTGTTGCCCATGTGCTTAAGGAAGGGAAATTGCTCCACTACTATGTGGTGACTGGGCAGGATAAGGATAG
CATTCATATTGCCGATCCAGATCCCGGGGTGAAGTTGACTAAACTGCCACGTGAGCGTTTTGAGGAAGAATGGACAGGAG
TGACTCTTTTTATGGCACCTAGTCCAGACTATAAGCCTTATAAGGAACAAAAAAATGGTCTGCTCTCTTTTATCCCTATA
TTAGTGAAGCAGCGTGGCTTGATTGCCAATATCGTTTTGGCAACACTCTTGGTAACCGTGATTAACATTGTGGGTTCTTA
TTATCTGCAGTCTATCATTGATACCTATGTGCCAGATCAGATGCGTTCGACACTAGGGATTATTTCTATTGGGCTAGTCA
TCGTCTACATCCTCCAGCAAATCTTGTCTTACGCTCAGGAGTATCTCTTGCTTGTTTTGGGGCAACGCTTGTCGATTGAC
GTGATTTTGTCCTATATCAAGCATGTTTTTCACCTCCCTATGTCCTTTTTCGCGACACGCAGGACAGGGGAAATTGTGTC
TCGTTTCACGGATGCTAACAGTATTATCGATGCGCTGGCTTCGACCATTCTTTCGATTTTCCTAGATGTGTCAACAGTTG
TCATTATTTCCCTTGTTTTATTTTCACAAAATACCAATCTCTTTTTCATGACTTTATTGGCGCTTCCTATCTACACAGTG
ATTATCTTTGCCTTTATGAAGCCGTTTGAAAAGATGAATCGGGACACCATGGAAGCCAATGCGGTTCTGTCTTCTTCTAT
CATTGAGGACATCAACGGTATTGAGACTATCAAGTCCTTGACCAGTGAAAGTCAGCGTTACCAAAAAATTGACAAGGAAT
TTGTGGATTATCTGAAGAAATCCTTTACCTATAGTCGAGCAGAGAGTCAGCAAAAGGCTCTGAAAAAGGTTGCCCATCTC
TTACTTAATGTCGGCATTCTCTGGATGGGGGCTGTTCTGGTCATGGATGGCAAGATGAGTTTGGGGCAGTTGATTACCTA
TAATACCTTGCTGGTTTACTTTACCAATCCTTTGGAAAATATCATCAATCTGCAAACCAAGCTTCAGACAGCGCAGGTTG
CCAATAACCGTCTAAATGAAGTGTATCTAGTAGCTTCTGAGTTTGAGGAGAAGAAAACAGTTGAGGATTTGAGCTTGATG
AAGGGAGAGATGACTTTCAAGCAGGTTCATTACAAGTATGGCTATGGTCGAGACGTCTTGTCGGATATCAATTTAACCGT
TCCCCAAGGGTCTAAGGTGGCTTTTGTGGGGATTTCAGGGTCAGGTAAGACGACTTTGGCCAAGATGATGGTTAATTTTT
ACGACCCAAGTCAAGGGGAGATTAGTCTGGGTGGTGTCAATCTCAATCAGATTGATAAAAAAGCCCTACGCCAGTACATC
AACTATCTGCCTCAACAGCCCTATGTCTTTAACGGAACGATTTTGGAGAATCTTCTTTTGGGAGCCAAGGAGGGGACGAC
ACAGGAAGATATCTTACGGGCGGTCGAATTGGTAGAGATTCGAGAGGATATCGAGCGCATGCCACTGAATTACCAGACAG
AATTGACTTCGGATGGGGCAGGGATTTCAGGTGGTCAACGTCAGAGAATCGCTTTGGCGCGTGCTCTCTTGACAGATGCG
CCGGTCTTGATTTTGGATGAGGCGACTAGCAGTTTGGATATTTTGACAGAGAAGCGGATTGTCGATAATCTCATGGCTTT
GGACAAGACCTTGATTTTCATTGCTCACCGCTTGACTATTGCTGAGCGGACAGAGAAGGTGGTTGTCTTGGATCAGGGCA
AGATTGTCGAAGAAGGAAAGCATGCTGATTTGCTTGCACAGGGTGGCTTTTACGCCCATTTGGTCAATAGCTAG
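
The nucleotide record can be checked against the protein record by translating it. The sketch below is a minimal translator that assumes the standard codon table and stops at the first stop codon. It is applied only to the first 30 and last 18 nucleotides of the sequence above, and `translate` is an illustrative helper rather than a database function.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAAATTTGGGAAACGTCACTATCGTCCG"))  # first 10 codons
print(translate("CATTTGGTCAATAGCTAG"))              # last 6 codons
```

The two fragments translate to MKFGKRHYRP and HLVNS, which agree with the start and end of the protein sequence listed above. The final codon, TAG, is a stop codon.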


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comA   Streptococcus pneumoniae Rx1   99.442   100   0.994
comA   Streptococcus pneumoniae D39   99.442   100   0.994
comA   Streptococcus pneumoniae R6   99.442   100   0.994
comA   Streptococcus pneumoniae TIGR4   99.024   100   0.99
comA   Streptococcus mitis SK321   98.466   100   0.985
comA   Streptococcus mitis NCTC 12261   98.187   100   0.982
comA   Streptococcus gordonii str. Challis substr. CH1   80.474   100   0.805
comA/nlmT   Streptococcus mutans UA159   64.435   100   0.644
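
The Ha-values listed above appear consistent with the product of the identity and coverage fractions, rounded to three decimals. This relationship is inferred from the numbers in this table and is an assumption, not the database's documented definition. The sketch below reproduces the listed values under that assumption, and `ha_value` is an illustrative name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relationship: (identity / 100) * (coverage / 100), 3 decimals."""
    return round(identity_pct * coverage_pct / 10000, 3)


# (identity %, coverage %, listed Ha-value) copied from the table above.
rows = [
    (99.442, 100, 0.994),
    (99.024, 100, 0.99),
    (98.466, 100, 0.985),
    (98.187, 100, 0.982),
    (80.474, 100, 0.805),
    (64.435, 100, 0.644),
]

for identity, coverage, listed in rows:
    print(identity, coverage, listed, ha_value(identity, coverage))
```

Because every hit here has 100% coverage, this table cannot distinguish the assumed product from other formulas that reduce to the identity fraction at full coverage.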


Multiple sequence alignment