Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Regulator
Locus tag   GL187_RS00320 Genome accession   NZ_CP046379
Coordinates   49726..51879 (+) Length   717 a.a.
NCBI ID   WP_000668280.1    Uniprot ID   -
Organism   Streptococcus pneumoniae strain 563     
Function   processing and transport of ComC (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 10009..63201 49726..51879 within 0


Gene organization within MGE regions


Location: 10009..63201
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GL187_RS00055 (GL187_00055) ftsH 10009..11967 (+) 1959 WP_000744557.1 ATP-dependent zinc metalloprotease FtsH -
  GL187_RS00060 (GL187_00060) comX/comX2 12089..12568 (+) 480 WP_000588925.1 sigma-70 family RNA polymerase sigma factor Regulator
  GL187_RS00095 (GL187_00095) - 18059..18905 (+) 847 Protein_13 IS630 family transposase -
  GL187_RS00100 (GL187_00100) - 18940..19737 (-) 798 Protein_14 transposase -
  GL187_RS00105 (GL187_00105) comW 20003..20239 (+) 237 WP_000939544.1 sigma(X)-activator ComW Regulator
  GL187_RS00110 (GL187_00110) - 20470..21756 (+) 1287 WP_000205044.1 adenylosuccinate synthase -
  GL187_RS00115 (GL187_00115) tadA 21957..22424 (+) 468 WP_000291870.1 tRNA adenosine(34) deaminase TadA -
  GL187_RS10955 (GL187_00125) - 22633..23331 (-) 699 WP_001106362.1 tyrosine-type recombinase/integrase -
  GL187_RS10960 (GL187_00130) - 23421..23768 (-) 348 WP_001839379.1 hypothetical protein -
  GL187_RS00130 (GL187_00135) - 23829..24899 (-) 1071 WP_000401841.1 type I restriction endonuclease -
  GL187_RS00135 (GL187_00140) - 24916..25296 (-) 381 WP_000170931.1 ImmA/IrrE family metallo-endopeptidase -
  GL187_RS00140 (GL187_00145) - 25309..25572 (-) 264 WP_000285962.1 type II toxin-antitoxin system RelE family toxin -
  GL187_RS00145 (GL187_00150) - 25572..25805 (-) 234 WP_000156419.1 hypothetical protein -
  GL187_RS00150 (GL187_00155) - 25805..26173 (-) 369 WP_000464160.1 helix-turn-helix domain-containing protein -
  GL187_RS00155 (GL187_00160) - 26745..26936 (+) 192 WP_001112859.1 DNA-binding protein -
  GL187_RS00160 (GL187_00165) - 26959..27162 (+) 204 WP_001247549.1 hypothetical protein -
  GL187_RS00165 (GL187_00170) - 27317..27484 (-) 168 WP_000024181.1 YjzC family protein -
  GL187_RS00170 (GL187_00175) - 27489..27869 (+) 381 Protein_28 autolysin -
  GL187_RS00175 (GL187_00180) - 28089..28268 (-) 180 WP_001209433.1 hypothetical protein -
  GL187_RS00180 - 28410..28559 (-) 150 WP_001030863.1 hypothetical protein -
  GL187_RS00185 (GL187_00185) - 28864..29307 (+) 444 WP_000701992.1 dUTP diphosphatase -
  GL187_RS00190 (GL187_00190) - 29309..29824 (+) 516 WP_000691236.1 histidine phosphatase family protein -
  GL187_RS00195 (GL187_00195) radA 29838..31199 (+) 1362 WP_074017595.1 DNA repair protein RadA Machinery gene
  GL187_RS00200 (GL187_00200) - 31272..31769 (+) 498 WP_001809263.1 beta-class carbonic anhydrase -
  GL187_RS00210 (GL187_00210) - 31794..32608 (+) 815 Protein_35 PrsW family intramembrane metalloprotease -
  GL187_RS00215 (GL187_00215) - 32753..33721 (+) 969 WP_000010163.1 ribose-phosphate diphosphokinase -
  GL187_RS00220 (GL187_00220) - 33855..34136 (-) 282 Protein_37 transposase family protein -
  GL187_RS10940 - 34263..35170 (-) 908 Protein_38 Rpn family recombination-promoting nuclease/putative transposase -
  GL187_RS00240 (GL187_00240) polA 35426..38059 (+) 2634 WP_088834530.1 DNA polymerase I -
  GL187_RS00245 (GL187_00245) - 38144..38581 (+) 438 WP_061633093.1 CoA-binding protein -
  GL187_RS10965 - 38622..39062 (+) 441 WP_000693179.1 hypothetical protein -
  GL187_RS00255 (GL187_00255) - 39081..40091 (-) 1011 WP_000009171.1 YeiH family protein -
  GL187_RS00260 (GL187_00260) - 40240..41409 (+) 1170 WP_000366342.1 pyridoxal phosphate-dependent aminotransferase -
  GL187_RS00265 (GL187_00265) recO 41406..42176 (+) 771 WP_000616164.1 DNA repair protein RecO -
  GL187_RS00270 (GL187_00270) plsX 42173..43165 (+) 993 WP_000717467.1 phosphate acyltransferase PlsX -
  GL187_RS00275 (GL187_00275) - 43171..43404 (+) 234 WP_000136447.1 acyl carrier protein -
  GL187_RS00280 (GL187_00280) - 43441..43740 (+) 300 Protein_47 transposase family protein -
  GL187_RS00285 (GL187_00285) blpU 43943..44173 (+) 231 WP_001093487.1 bacteriocin-like peptide BlpU -
  GL187_RS11340 - 44176..44301 (+) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  GL187_RS10970 - 44712..44906 (+) 195 WP_000756776.1 hypothetical protein -
  GL187_RS00290 (GL187_00290) - 44927..45820 (+) 894 WP_000343888.1 helix-turn-helix domain-containing protein -
  GL187_RS00295 (GL187_00295) - 46363..46659 (+) 297 WP_000837726.1 uberolysin/carnocyclin family circular bacteriocin -
  GL187_RS00300 (GL187_00300) - 46666..47802 (+) 1137 WP_000698828.1 hypothetical protein -
  GL187_RS00305 (GL187_00305) - 47799..48281 (+) 483 WP_001232094.1 stage II sporulation protein M -
  GL187_RS00310 (GL187_00310) - 48278..48874 (+) 597 WP_000687787.1 ATP-binding cassette domain-containing protein -
  GL187_RS00315 (GL187_00315) - 48855..49346 (+) 492 WP_000671990.1 hypothetical protein -
  GL187_RS00320 (GL187_00320) comA 49726..51879 (+) 2154 WP_000668280.1 peptide cleavage/export ABC transporter ComA Regulator
  GL187_RS00325 (GL187_00325) comB 51892..53241 (+) 1350 WP_024477962.1 competence pheromone export protein ComB Regulator
  GL187_RS00330 (GL187_00330) purC 53411..54118 (+) 708 WP_000043301.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  GL187_RS00335 (GL187_00335) - 54175..57900 (+) 3726 WP_024477961.1 phosphoribosylformylglycinamidine synthase -
  GL187_RS00340 (GL187_00340) purF 57993..59435 (+) 1443 WP_024477960.1 amidophosphoribosyltransferase -
  GL187_RS00345 (GL187_00345) purM 59472..60494 (+) 1023 WP_000182575.1 phosphoribosylformylglycinamidine cyclo-ligase -
  GL187_RS00350 (GL187_00350) purN 60491..61036 (+) 546 WP_000717506.1 phosphoribosylglycinamide formyltransferase -
  GL187_RS00355 (GL187_00355) - 61120..61629 (+) 510 WP_000894018.1 VanZ family protein -
  GL187_RS00360 (GL187_00360) purH 61654..63201 (+) 1548 WP_024477959.1 bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase -

Sequence


Protein


Download         Length: 717 a.a.        Molecular weight: 80374.42 Da        Isoelectric Point: 6.4567

>NTDB_id=403658 GL187_RS00320 WP_000668280.1 49726..51879(+) (comA) [Streptococcus pneumoniae strain 563]
MKFGKRHYRPQVDQMDCGVASLAMVFGYYGSYYFLAHLRELAKTTMDGTTALGLVKVAEEIGFETRAIKADMTLFDLPDL
TFPFVAHVLKEGKLLHYYVVTGQDKDSIHIADPDPGVKLTKLPRERFAEEWTGVTLFMAPSPDYKPHKEQKNGLFSFIPI
LVKQRGLIANIVLATLLVTVINIVGSYYLQSIIDTYVPDQMRSTLGIISIGLVIVYILQQILSYAQEYLLLVLGQRLSID
VILSYIKHVFHLPMSFFATRRTGEIVSRFTDANSIIDALASTILSIFLDVSTVVIISLVLFSQNTNLFFMTLLALPIYTV
IIFAFMKPFEKMNRDTMEANAVLSSSIIEDINGIETIKSLTSESQRYQKIDKEFVDYLKKSFTYSRAESQQKALKKVAHL
LLNVGILWMGAVLVMDGKMSLGQLITYNTLLVYFTNPLENIINLQTKLQTAQVANNRLNEVYLVASEFEEKKTVEDLSLM
KGDMTFKQVHYKYGYGRDVLSDINLTVPQGSKVAFVGISGSGKTTLAKMMVNFYDPSQGEISLGSVNLNQIDKKALRQYI
NYLPQQPYVFNGTILENLLLGAKEGTTQEDILRAVELAEIREDIERMPLNYQTELTSDGAGISGGQRQRIALARALLTDA
PVLILDEATSSLDILTEKRIVDNLMALDKTLIFIAHRLTIAERTEKVVVLDQGKIVEEGKHADLLAQGGFYAHLVNS

Nucleotide


Download         Length: 2154 bp        

>NTDB_id=403658 GL187_RS00320 WP_000668280.1 49726..51879(+) (comA) [Streptococcus pneumoniae strain 563]
ATGAAATTTGGGAAACGTCACTATCGTCCGCAAGTGGATCAGATGGACTGCGGTGTAGCTTCATTAGCCATGGTTTTTGG
CTACTATGGTAGTTATTATTTTTTGGCTCACTTGCGAGAATTGGCTAAGACGACCATGGATGGGACGACGGCTTTGGGCT
TGGTCAAGGTGGCAGAGGAGATTGGTTTTGAGACGCGAGCCATTAAGGCGGATATGACGCTTTTTGACTTGCCGGATTTG
ACTTTTCCTTTTGTTGCCCATGTGCTTAAGGAAGGGAAATTGCTCCACTACTATGTGGTGACTGGGCAGGATAAGGATAG
CATTCATATTGCCGACCCAGATCCTGGGGTGAAATTGACTAAACTGCCACGTGAGCGTTTTGCGGAAGAATGGACAGGAG
TGACTCTTTTTATGGCACCTAGTCCAGACTACAAGCCTCATAAGGAACAAAAAAATGGTCTGTTCTCTTTTATCCCTATA
TTAGTGAAGCAGCGTGGCTTGATTGCCAATATCGTTTTGGCAACACTCTTGGTAACCGTGATTAACATTGTGGGTTCTTA
TTATCTGCAGTCTATCATTGATACCTATGTGCCAGATCAGATGCGTTCGACACTAGGGATTATTTCTATTGGGCTAGTCA
TCGTCTACATCCTCCAGCAAATCTTGTCTTACGCTCAGGAGTATCTCTTGCTTGTTTTGGGGCAACGCTTGTCGATTGAC
GTGATTTTGTCCTATATCAAGCATGTTTTTCACCTCCCTATGTCCTTCTTTGCGACACGCAGGACAGGGGAGATCGTGTC
TCGTTTTACAGATGCTAACAGTATCATCGATGCGCTGGCTTCGACCATCCTTTCGATTTTCCTAGATGTGTCAACGGTTG
TCATTATTTCCCTTGTTCTATTTTCACAAAATACCAATCTCTTTTTCATGACTTTATTGGCGCTTCCTATCTACACAGTG
ATTATCTTTGCCTTTATGAAGCCGTTTGAAAAGATGAATCGGGATACCATGGAAGCCAATGCGGTTCTGTCTTCTTCTAT
CATTGAGGACATCAACGGTATTGAGACTATCAAGTCCTTGACCAGTGAAAGTCAGCGTTACCAAAAAATTGACAAGGAAT
TTGTGGATTATCTGAAGAAATCCTTTACCTATAGTCGAGCAGAGAGTCAGCAAAAGGCTCTGAAAAAGGTTGCCCATCTC
TTGCTTAATGTCGGCATTCTCTGGATGGGGGCTGTTCTGGTCATGGATGGCAAGATGAGTTTGGGGCAGTTGATTACCTA
TAATACCTTGCTGGTTTACTTTACCAATCCTTTGGAAAATATCATCAATCTGCAAACCAAGCTTCAGACAGCGCAGGTTG
CCAATAACCGTCTAAATGAAGTGTATCTAGTAGCTTCTGAGTTTGAGGAGAAGAAAACAGTTGAGGATTTGAGCTTGATG
AAGGGAGATATGACCTTCAAGCAGGTTCATTACAAGTATGGCTATGGTCGAGACGTCTTGTCGGATATCAATTTAACCGT
TCCCCAAGGGTCTAAGGTGGCTTTTGTGGGGATTTCAGGGTCAGGTAAGACGACTTTGGCCAAGATGATGGTTAATTTTT
ACGACCCAAGTCAAGGGGAGATTAGTCTGGGTAGTGTCAATCTCAATCAGATTGATAAAAAAGCCCTGCGCCAGTACATC
AACTATCTGCCTCAACAGCCCTATGTCTTTAACGGAACGATTTTGGAGAATCTTCTTTTGGGAGCCAAGGAGGGGACGAC
ACAGGAAGATATCTTACGGGCGGTCGAATTGGCAGAGATTCGAGAGGATATCGAGCGCATGCCACTGAATTATCAGACAG
AATTGACTTCGGATGGGGCAGGGATTTCAGGTGGTCAACGTCAGAGAATCGCTTTGGCGCGTGCTCTCTTGACAGATGCG
CCGGTCTTGATTTTGGATGAGGCGACTAGCAGTTTGGATATTTTGACAGAGAAGCGGATTGTCGATAATCTCATGGCTTT
GGACAAGACCTTGATTTTCATTGCTCACCGCTTGACTATTGCTGAGCGGACAGAGAAGGTGGTTGTCTTGGATCAGGGCA
AGATTGTCGAAGAAGGAAAGCATGCTGATTTGCTTGCACAGGGAGGCTTTTACGCCCATTTGGTCAATAGCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Streptococcus pneumoniae Rx1

99.442

100

0.994

  comA Streptococcus pneumoniae D39

99.442

100

0.994

  comA Streptococcus pneumoniae R6

99.442

100

0.994

  comA Streptococcus pneumoniae TIGR4

99.303

100

0.993

  comA Streptococcus mitis SK321

98.466

100

0.985

  comA Streptococcus mitis NCTC 12261

98.466

100

0.985

  comA Streptococcus gordonii str. Challis substr. CH1

80.335

100

0.803

  comA/nlmT Streptococcus mutans UA159

64.714

100

0.647


Multiple sequence alignment