Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   GWG04_RS02155 Genome accession   NZ_CP047922
Coordinates   399973..400659 (+) Length   228 a.a.
NCBI ID   WP_001802022.1    Uniprot ID   A0A7U7EYP5
Organism   Staphylococcus aureus strain SR231     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 394973..405659
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GWG04_RS02120 (GWG04_02110) yqeH 395500..396600 (+) 1101 WP_001280137.1 ribosome biogenesis GTPase YqeH -
  GWG04_RS02125 (GWG04_02115) aroE 396614..397420 (+) 807 WP_000666757.1 shikimate dehydrogenase -
  GWG04_RS02130 (GWG04_02120) yhbY 397424..397714 (+) 291 WP_000955235.1 ribosome assembly RNA-binding protein YhbY -
  GWG04_RS02135 (GWG04_02125) nadD 397717..398286 (+) 570 WP_000725167.1 nicotinate (nicotinamide) nucleotide adenylyltransferase -
  GWG04_RS02140 (GWG04_02130) yqeK 398276..398860 (+) 585 WP_001019324.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  GWG04_RS02145 (GWG04_02135) rsfS 398861..399214 (+) 354 WP_001088020.1 ribosome silencing factor -
  GWG04_RS02150 (GWG04_02140) - 399217..399933 (+) 717 WP_000084829.1 class I SAM-dependent methyltransferase -
  GWG04_RS02155 (GWG04_02145) comEA 399973..400659 (+) 687 WP_001802022.1 ComEA family DNA-binding protein Machinery gene
  GWG04_RS02160 (GWG04_02150) - 400751..401212 (+) 462 WP_000439693.1 ComE operon protein 2 -
  GWG04_RS02165 (GWG04_02155) comEC 401217..403418 (+) 2202 WP_001557362.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  GWG04_RS02170 (GWG04_02160) holA 403475..404449 (+) 975 WP_001282563.1 DNA polymerase III subunit delta -
  GWG04_RS02175 (GWG04_02165) rpsT 404494..404745 (-) 252 WP_001274017.1 30S ribosomal protein S20 -

Sequence


Protein


Download         Length: 228 a.a.        Molecular weight: 25900.65 Da        Isoelectric Point: 9.6751

>NTDB_id=418402 GWG04_RS02155 WP_001802022.1 399973..400659(+) (comEA) [Staphylococcus aureus strain SR231]
MVLLYQFLLRYKDFLTQWKLYIISAVVLIMVLIGFIFWRQDDYTSRNFENKDTALKQSTSENNSLSKLEDVQVKDGDNSK
NKGPVYVDVKGAVKHPNVYKMTSKDRVVDLLDKAQLLDDADVSRINLSEKLTDQKMIFIPHKGQKNVEPQIEVNSVHVKN
GNTNNTKVNLNTASVSELMSVPGVGQAKANAIVEYRNQQGAFQEIDDLKKVKGFGSKTFDKLKSYFTI

Nucleotide


Download         Length: 687 bp        

>NTDB_id=418402 GWG04_RS02155 WP_001802022.1 399973..400659(+) (comEA) [Staphylococcus aureus strain SR231]
GTGGTTTTATTGTATCAATTTTTATTACGCTATAAAGATTTTTTAACTCAGTGGAAGTTATATATTATAAGTGCTGTTGT
TTTAATTATGGTATTAATTGGTTTTATATTCTGGAGACAAGATGATTATACTTCAAGAAATTTTGAAAATAAAGATACTG
CTCTGAAACAAAGCACTAGTGAAAATAATAGTTTGTCCAAATTAGAAGATGTCCAGGTCAAAGATGGAGATAATTCCAAA
AATAAAGGTCCTGTATATGTCGATGTAAAAGGTGCTGTTAAACATCCTAATGTTTATAAAATGACATCTAAGGATAGAGT
AGTTGATTTACTTGATAAAGCACAATTATTGGATGATGCAGATGTAAGTCGAATTAATTTGTCTGAAAAATTAACAGATC
AAAAAATGATTTTCATACCCCATAAAGGACAAAAGAATGTTGAACCACAAATTGAAGTAAACAGTGTGCACGTAAAAAAT
GGGAACACAAATAATACTAAAGTAAATTTAAATACCGCATCTGTATCAGAATTGATGTCTGTTCCTGGAGTAGGGCAAGC
TAAAGCTAATGCAATTGTTGAATATCGCAACCAACAAGGTGCATTTCAAGAAATTGACGATTTGAAAAAAGTAAAAGGTT
TTGGAAGTAAAACTTTTGATAAACTGAAATCTTATTTCACGATATAA

Domains


Predicted by InterproScan.

(164-227)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7U7EYP5

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Staphylococcus aureus N315

100

100

1

  comEA Staphylococcus aureus MW2

97.368

100

0.974


Multiple sequence alignment