Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   EH163_RS10025 Genome accession   NZ_CP034115
Coordinates   1974720..1975694 (+) Length   324 a.a.
NCBI ID   WP_058649760.1    Uniprot ID   -
Organism   Staphylococcus epidermidis strain CDC121     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1969720..1980694
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EH163_RS09990 rpmG 1970200..1970349 (+) 150 WP_001830957.1 50S ribosomal protein L33 -
  EH163_RS09995 - 1970532..1971083 (+) 552 WP_001831001.1 5-formyltetrahydrofolate cyclo-ligase -
  EH163_RS10000 - 1971084..1972544 (+) 1461 WP_001832757.1 rhomboid family intramembrane serine protease -
  EH163_RS10005 - 1972528..1972728 (+) 201 WP_001831138.1 YqgQ family protein -
  EH163_RS10010 - 1972728..1973714 (+) 987 WP_001831113.1 ROK family glucokinase -
  EH163_RS10015 - 1973714..1974040 (+) 327 WP_001831234.1 MTH1187 family thiamine-binding protein -
  EH163_RS10020 - 1974040..1974663 (+) 624 WP_001831016.1 MBL fold metallo-hydrolase -
  EH163_RS10025 comGA 1974720..1975694 (+) 975 WP_058649760.1 competence type IV pilus ATPase ComGA Machinery gene
  EH163_RS10030 comGB 1975666..1976733 (+) 1068 WP_001831131.1 competence type IV pilus assembly protein ComGB Machinery gene
  EH163_RS10035 comGC 1976751..1977068 (+) 318 WP_001831271.1 competence type IV pilus major pilin ComGC Machinery gene
  EH163_RS10040 comGD 1977058..1977495 (+) 438 WP_001831025.1 competence type IV pilus minor pilin ComGD -
  EH163_RS10045 - 1977482..1977775 (+) 294 WP_080035447.1 hypothetical protein -
  EH163_RS10050 - 1977699..1978190 (+) 492 WP_002440043.1 competence type IV pilus minor pilin ComGF -
  EH163_RS10060 - 1978355..1978867 (+) 513 WP_002485736.1 shikimate kinase -
  EH163_RS10065 gcvT 1979085..1980176 (+) 1092 WP_001831107.1 glycine cleavage system aminomethyltransferase GcvT -

Sequence


Protein


Download         Length: 324 a.a.        Molecular weight: 37388.48 Da        Isoelectric Point: 8.6335

>NTDB_id=329164 EH163_RS10025 WP_058649760.1 1974720..1975694(+) (comGA) [Staphylococcus epidermidis strain CDC121]
MKILFEQIINHAIKQEASDIHFIPCEEHTIIKLRIKDELTIYDRLSFPIYKKLLIYMKFQSGLDVSTQHRAQSGRYSYKM
KHLYYLRISTLPLSLGNESCVIRIVPQYFQTTRESYEFKDFKHFMKKKQGLLLFSGPTGSGKSTLMYQMVLYAHQKLNLN
VISVEDPVEQILNGITQISVNEKADINYESSFKAILRCDPDVILIGEIRDSTVAKYVIQASLSGHLVLSTMHANDCKGAL
LRLLEMGISVQELCQAINLISNQRLITTTTNYRQLVSELMFQSQINYFFEHNHSLPKNFTKLATHLNKMSKEGVICEEVV
DKYI

Nucleotide


Download         Length: 975 bp        

>NTDB_id=329164 EH163_RS10025 WP_058649760.1 1974720..1975694(+) (comGA) [Staphylococcus epidermidis strain CDC121]
TTGAAAATTTTATTTGAACAGATTATCAATCATGCAATTAAGCAAGAAGCAAGTGACATACACTTTATACCATGTGAGGA
ACACACTATAATTAAGCTAAGAATTAAAGATGAGCTCACAATATATGATAGACTTTCATTTCCAATTTATAAAAAATTAC
TTATATACATGAAGTTTCAATCTGGCTTAGATGTATCAACACAGCATAGAGCACAAAGTGGTAGATATAGTTATAAGATG
AAGCATTTATATTATTTAAGAATCTCTACATTACCTTTATCATTAGGCAATGAAAGTTGTGTAATTCGTATAGTACCTCA
ATACTTCCAAACTACACGAGAATCATATGAATTTAAAGATTTTAAACACTTTATGAAAAAGAAACAAGGTCTACTTTTGT
TTAGTGGACCAACTGGCTCTGGCAAAAGTACTTTAATGTACCAAATGGTCCTTTATGCACATCAAAAATTGAATTTAAAT
GTAATATCAGTTGAAGATCCAGTTGAGCAAATACTTAATGGAATCACACAAATATCAGTCAATGAAAAGGCAGATATAAA
TTATGAAAGCTCTTTCAAAGCCATTTTAAGGTGTGATCCAGATGTAATTTTAATTGGTGAAATCAGAGATTCTACCGTAG
CTAAATACGTGATTCAAGCCAGTTTAAGCGGGCATTTAGTTTTGTCGACAATGCACGCGAATGATTGCAAGGGAGCACTA
TTACGATTATTAGAAATGGGTATTTCTGTCCAAGAGTTGTGCCAGGCAATCAATTTAATTTCTAATCAAAGATTAATCAC
TACTACTACAAATTATCGTCAACTAGTATCGGAATTAATGTTTCAAAGCCAAATAAATTATTTTTTTGAACATAATCATT
CTTTACCTAAAAATTTCACAAAATTAGCTACACATTTAAACAAAATGTCTAAAGAAGGTGTTATATGTGAAGAAGTTGTC
GATAAATACATTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Staphylococcus aureus MW2

69.753

100

0.698

  comGA Staphylococcus aureus N315

69.753

100

0.698


Multiple sequence alignment