Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Regulator
Locus tag   Q7625_RS00290 Genome accession   NZ_CP131708
Coordinates   46746..48899 (+) Length   717 a.a.
NCBI ID   WP_000668272.1    Uniprot ID   A0A0T8H8N3
Organism   Streptococcus pneumoniae strain 2016C10-332     
Function   processing and transport of ComC (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 12174..60221 46746..48899 within 0


Gene organization within MGE regions


Location: 12174..60221
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  Q7625_RS00065 (Q7625_00065) ftsH 12174..14132 (+) 1959 WP_000744545.1 ATP-dependent zinc metalloprotease FtsH -
  Q7625_RS00070 (Q7625_00070) comX/comX2 14254..14733 (+) 480 WP_000588897.1 sigma-70 family RNA polymerase sigma factor Regulator
  Q7625_RS00105 (Q7625_00105) - 20225..21071 (+) 847 Protein_14 IS630 family transposase -
  Q7625_RS00110 (Q7625_00110) - 21106..21903 (-) 798 Protein_15 transposase -
  Q7625_RS00115 (Q7625_00115) comW 22169..22405 (+) 237 WP_000939545.1 sigma(X)-activator ComW Regulator
  Q7625_RS00120 (Q7625_00120) - 22636..23922 (+) 1287 WP_000205044.1 adenylosuccinate synthase -
  Q7625_RS00125 (Q7625_00125) tadA 24123..24590 (+) 468 WP_000291870.1 tRNA adenosine(34) deaminase TadA -
  Q7625_RS00135 (Q7625_00135) - 24799..25497 (-) 699 WP_001106362.1 site-specific integrase -
  Q7625_RS00140 (Q7625_00140) - 25587..25934 (-) 348 WP_001839379.1 hypothetical protein -
  Q7625_RS00145 (Q7625_00145) - 25995..27064 (-) 1070 Protein_21 type I restriction endonuclease -
  Q7625_RS00150 (Q7625_00150) - 27081..27461 (-) 381 WP_000170931.1 ImmA/IrrE family metallo-endopeptidase -
  Q7625_RS00155 (Q7625_00155) - 27474..27737 (-) 264 WP_000285962.1 type II toxin-antitoxin system RelE/ParE family toxin -
  Q7625_RS00160 (Q7625_00160) - 27737..27970 (-) 234 WP_000156419.1 hypothetical protein -
  Q7625_RS00165 (Q7625_00165) - 27970..28338 (-) 369 WP_000464160.1 helix-turn-helix transcriptional regulator -
  Q7625_RS00170 (Q7625_00170) - 28910..29101 (+) 192 WP_001112859.1 DNA-binding protein -
  Q7625_RS00175 (Q7625_00175) - 29124..29327 (+) 204 WP_001247549.1 hypothetical protein -
  Q7625_RS00180 (Q7625_00180) - 29482..29649 (-) 168 WP_000024181.1 YjzC family protein -
  Q7625_RS00185 (Q7625_00185) - 29669..30034 (+) 366 Protein_29 autolysin -
  Q7625_RS00190 (Q7625_00190) - 30254..30433 (-) 180 WP_001209433.1 hypothetical protein -
  Q7625_RS00195 (Q7625_00195) - 30575..30724 (-) 150 WP_001030863.1 hypothetical protein -
  Q7625_RS00200 (Q7625_00200) - 31029..31472 (+) 444 WP_000701992.1 dUTP diphosphatase -
  Q7625_RS00205 (Q7625_00205) - 31474..31989 (+) 516 WP_000691236.1 histidine phosphatase family protein -
  Q7625_RS00210 (Q7625_00210) radA 32003..33364 (+) 1362 WP_075213698.1 DNA repair protein RadA Machinery gene
  Q7625_RS00215 (Q7625_00215) - 33437..33934 (+) 498 WP_001809263.1 carbonic anhydrase -
  Q7625_RS00220 (Q7625_00220) - 33959..34742 (+) 784 Protein_36 PrsW family glutamic-type intramembrane protease -
  Q7625_RS00225 (Q7625_00225) - 34887..35855 (+) 969 WP_000010163.1 ribose-phosphate diphosphokinase -
  Q7625_RS00230 (Q7625_00230) - 35973..36900 (+) 928 Protein_38 Rpn family recombination-promoting nuclease/putative transposase -
  Q7625_RS00235 (Q7625_00235) polA 37509..40142 (+) 2634 WP_001812055.1 DNA polymerase I -
  Q7625_RS00240 (Q7625_00240) - 40227..40664 (+) 438 WP_000076479.1 CoA-binding protein -
  Q7625_RS00245 (Q7625_00245) - 40705..40920 (+) 216 WP_001814139.1 hypothetical protein -
  Q7625_RS00250 (Q7625_00250) - 40939..41949 (-) 1011 WP_000009180.1 YeiH family protein -
  Q7625_RS00255 (Q7625_00255) - 42098..43267 (+) 1170 WP_000366348.1 pyridoxal phosphate-dependent aminotransferase -
  Q7625_RS00260 (Q7625_00260) recO 43264..44034 (+) 771 WP_000616164.1 DNA repair protein RecO -
  Q7625_RS00265 (Q7625_00265) plsX 44031..45023 (+) 993 WP_000717451.1 phosphate acyltransferase PlsX -
  Q7625_RS00270 (Q7625_00270) - 45029..45262 (+) 234 WP_000136449.1 acyl carrier protein -
  Q7625_RS00275 (Q7625_00275) - 45308..45599 (+) 292 Protein_47 IS5/IS1182 family transposase -
  Q7625_RS00280 (Q7625_00280) blpU 45801..46031 (+) 231 WP_001093075.1 bacteriocin-like peptide BlpU -
  Q7625_RS00285 (Q7625_00285) - 46034..46159 (+) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  Q7625_RS00290 (Q7625_00290) comA 46746..48899 (+) 2154 WP_000668272.1 peptide cleavage/export ABC transporter ComA Regulator
  Q7625_RS00295 (Q7625_00295) comB 48912..50261 (+) 1350 WP_000801611.1 competence pheromone export protein ComB Regulator
  Q7625_RS00300 (Q7625_00300) purC 50431..51138 (+) 708 WP_000043310.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  Q7625_RS00305 (Q7625_00305) - 51195..54920 (+) 3726 WP_000361217.1 phosphoribosylformylglycinamidine synthase -
  Q7625_RS00310 (Q7625_00310) purF 55013..56455 (+) 1443 WP_000220632.1 amidophosphoribosyltransferase -
  Q7625_RS00315 (Q7625_00315) purM 56492..57514 (+) 1023 WP_000182575.1 phosphoribosylformylglycinamidine cyclo-ligase -
  Q7625_RS00320 (Q7625_00320) purN 57511..58056 (+) 546 WP_000717506.1 phosphoribosylglycinamide formyltransferase -
  Q7625_RS00325 (Q7625_00325) - 58140..58649 (+) 510 WP_000894018.1 VanZ family protein -
  Q7625_RS00330 (Q7625_00330) purH 58674..60221 (+) 1548 WP_341922723.1 bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase -

Sequence


Protein


Download         Length: 717 a.a.        Molecular weight: 80378.44 Da        Isoelectric Point: 6.2590

>NTDB_id=863937 Q7625_RS00290 WP_000668272.1 46746..48899(+) (comA) [Streptococcus pneumoniae strain 2016C10-332]
MKFGKRHYRPQVDQMDCGVASLAMIFGYYGSYYFLAHLRELAKTTMDGTTALGLVKVAEEIGFETRAIKADMTLFDLPDL
TFPFVAHVLKEGKLLHYYVVTGQDKDSIHIADPDPGVKLTKLPRERFEEEWTGVTLFMAPSPDYKPYKEQKNGLLSFIPI
LVKQRGLIANIVLATLLVIGINIVGSYYLQSIIDTYVPDQMRSTLGIISIGLVIVYILQQILSYAQEYLLLVLGQRLSID
VILSYIKHVFHLPMSFFATRRTGEIVSRFTDANSIIDALASTILSIFLDVSTVVIISLVLFSQNTNLFFMTLLALPIYTV
IIFAFMKPFEKMNRDTMEANAVLSSSIIEDINGIETIKSLTSESQRYQKIDKEFVDYLKKSFTYSRAESQQKALKKVAHL
LLNVGILWMGAVLVMDGKMSLGQLITYNTLLVYFTNPLENIINLQTKLQTAQVANNRLNEVYLVASEFEEKKTVEDLSLM
KGDMTFKQVHYKYGYGRDVLSDINLTVPQGSKVAFVGISGSGKTTLAKMMVNFYDPSQGEISLGGVNLNQIDKKALRQYI
NYLPQQPYVFNGTILENLLLGAKEGTTQEDILRAVELAEIREDIERMPLNYQTELTSDGAGISGGQRQRIALARALLTDA
PVLILDEATSSLDILTEKRIVDNLMALDKTLIFIAHRLTIAERTEKVVVLDQGKIVEEGKHADLLAQGGFYAHLVNS

Nucleotide


Download         Length: 2154 bp        

>NTDB_id=863937 Q7625_RS00290 WP_000668272.1 46746..48899(+) (comA) [Streptococcus pneumoniae strain 2016C10-332]
ATGAAATTTGGGAAACGTCACTATCGTCCGCAAGTGGATCAGATGGACTGCGGTGTAGCTTCATTAGCCATGATTTTTGG
CTACTATGGTAGTTATTATTTTTTGGCTCACTTGCGAGAATTGGCTAAGACGACCATGGATGGGACGACGGCTTTGGGCT
TGGTCAAGGTGGCAGAGGAGATTGGTTTTGAGACGCGAGCCATTAAGGCGGATATGACGCTTTTTGACTTGCCGGATTTG
ACTTTTCCTTTTGTTGCCCATGTGCTTAAGGAAGGGAAATTGCTCCACTACTATGTGGTGACTGGGCAGGATAAGGATAG
CATTCATATTGCCGATCCAGATCCCGGGGTGAAGTTGACTAAACTGCCACGTGAGCGTTTTGAGGAAGAATGGACAGGAG
TGACTCTTTTTATGGCACCTAGTCCAGACTATAAGCCTTATAAGGAACAAAAAAATGGTCTGCTCTCTTTTATCCCTATA
TTAGTGAAGCAGCGTGGCTTGATTGCTAATATCGTTTTGGCAACACTCTTGGTAATCGGGATTAACATTGTGGGTTCTTA
TTATCTGCAGTCTATCATTGATACCTATGTGCCAGATCAGATGCGTTCGACACTAGGGATTATTTCTATTGGGCTAGTCA
TCGTCTACATCCTCCAGCAAATCTTGTCTTACGCTCAGGAGTATCTCTTGCTTGTTTTGGGGCAACGCTTGTCGATTGAC
GTGATTTTGTCCTATATCAAGCATGTTTTTCACCTCCCTATGTCCTTCTTTGCGACACGCAGGACAGGGGAGATCGTGTC
TCGTTTTACAGATGCTAACAGTATCATCGATGCGCTGGCTTCGACCATCCTTTCGATTTTCCTAGATGTGTCAACGGTTG
TCATTATTTCCCTTGTTCTATTTTCACAAAATACCAATCTCTTTTTCATGACTTTATTGGCGCTTCCTATCTACACAGTG
ATTATCTTTGCCTTTATGAAGCCGTTTGAAAAGATGAATCGGGATACCATGGAAGCCAATGCGGTTCTGTCTTCTTCTAT
CATTGAGGACATCAACGGTATTGAGACTATCAAGTCCTTGACCAGTGAAAGTCAGCGTTACCAAAAAATTGACAAGGAAT
TTGTGGATTATCTGAAGAAATCCTTTACCTATAGTCGAGCAGAGAGTCAGCAAAAGGCTCTGAAAAAGGTTGCCCATCTC
TTGCTTAATGTCGGCATTCTCTGGATGGGGGCTGTTCTGGTCATGGATGGCAAGATGAGTTTGGGGCAGTTGATTACCTA
TAATACCTTGCTGGTTTACTTTACCAATCCTTTGGAAAATATCATCAATCTGCAAACCAAGCTTCAGACAGCGCAGGTTG
CCAATAACCGTCTAAATGAAGTGTATCTAGTAGCTTCTGAGTTTGAGGAGAAGAAAACAGTTGAGGATTTGAGCTTGATG
AAGGGAGATATGACCTTCAAGCAGGTTCATTACAAGTATGGCTATGGTCGAGACGTCTTGTCGGATATCAATTTAACCGT
TCCCCAAGGGTCTAAGGTGGCTTTTGTGGGGATTTCAGGGTCAGGTAAGACGACTTTGGCCAAGATGATGGTTAATTTTT
ACGACCCAAGTCAAGGGGAGATTAGTCTGGGTGGTGTCAATCTCAATCAGATTGATAAAAAAGCCCTGCGCCAGTACATC
AACTATCTGCCTCAACAGCCCTATGTCTTTAACGGAACGATTTTGGAGAATCTTCTTTTGGGAGCCAAGGAGGGGACGAC
ACAGGAAGATATCTTACGGGCGGTCGAATTGGCAGAGATTCGAGAGGATATCGAGCGCATGCCACTGAATTACCAGACAG
AATTGACTTCGGATGGGGCAGGGATTTCAGGTGGTCAACGTCAGAGAATCGCTTTGGCGCGTGCTCTCTTGACAGATGCG
CCGGTCTTGATTTTGGATGAGGCGACTAGCAGTTTGGATATTTTGACAGAGAAGCGGATTGTCGATAATCTCATGGCTTT
GGACAAGACCTTGATTTTCATTGCTCACCGCTTGACTATTGCTGAGCGGACAGAGAAGGTAGTTGTCTTGGATCAGGGCA
AGATTGTCGAAGAAGGAAAGCATGCTGATTTGCTTGCACAGGGTGGCTTTTACGCCCATTTGGTCAATAGCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0T8H8N3

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Streptococcus pneumoniae D39

99.303

100

0.993

  comA Streptococcus pneumoniae R6

99.303

100

0.993

  comA Streptococcus pneumoniae Rx1

99.303

100

0.993

  comA Streptococcus pneumoniae TIGR4

98.884

100

0.989

  comA Streptococcus mitis NCTC 12261

98.187

100

0.982

  comA Streptococcus mitis SK321

98.047

100

0.98

  comA Streptococcus gordonii str. Challis substr. CH1

80.753

100

0.808

  comA/nlmT Streptococcus mutans UA159

64.435

100

0.644