Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   SE_RS05820 Genome accession   NC_004461
Coordinates   1262127..1263101 (-) Length   324 a.a.
NCBI ID   WP_001831109.1    Uniprot ID   A0A9Q5JKD6
Organism   Staphylococcus epidermidis ATCC 12228     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1257127..1268101
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SE_RS05785 (SE1222) gcvT 1257645..1258736 (-) 1092 WP_001831107.1 glycine cleavage system aminomethyltransferase GcvT -
  SE_RS05790 (SE1224) - 1258954..1259466 (-) 513 WP_002440040.1 shikimate kinase -
  SE_RS05795 (SE1225) - 1259631..1260122 (-) 492 WP_002440043.1 competence type IV pilus minor pilin ComGF -
  SE_RS05800 (SE1226) - 1260046..1260339 (-) 294 WP_001831116.1 hypothetical protein -
  SE_RS05805 (SE1227) comGD 1260326..1260763 (-) 438 WP_001831025.1 competence type IV pilus minor pilin ComGD -
  SE_RS05810 (SE1228) comGC 1260753..1261070 (-) 318 WP_001831271.1 competence type IV pilus major pilin ComGC Machinery gene
  SE_RS05815 (SE1229) comGB 1261088..1262155 (-) 1068 WP_001831131.1 competence type IV pilus assembly protein ComGB Machinery gene
  SE_RS05820 (SE1230) comGA 1262127..1263101 (-) 975 WP_001831109.1 competence type IV pilus ATPase ComGA Machinery gene
  SE_RS05825 (SE1231) - 1263158..1263781 (-) 624 WP_001831016.1 MBL fold metallo-hydrolase -
  SE_RS05830 (SE1232) - 1263781..1264107 (-) 327 WP_001831234.1 MTH1187 family thiamine-binding protein -
  SE_RS05835 (SE1233) - 1264107..1265093 (-) 987 WP_001831113.1 ROK family glucokinase -
  SE_RS05840 (SE1234) - 1265093..1265293 (-) 201 WP_001831138.1 YqgQ family protein -
  SE_RS05845 (SE1235) - 1265277..1266737 (-) 1461 WP_001832757.1 rhomboid family protein -
  SE_RS05850 (SE1236) - 1266738..1267289 (-) 552 WP_001831001.1 5-formyltetrahydrofolate cyclo-ligase -
  SE_RS05855 (SE1237) rpmG 1267472..1267621 (-) 150 WP_001830957.1 50S ribosomal protein L33 -

Sequence


Protein


Download         Length: 324 a.a.        Molecular weight: 37330.44 Da        Isoelectric Point: 8.8174

>NTDB_id=22579 SE_RS05820 WP_001831109.1 1262127..1263101(-) (comGA) [Staphylococcus epidermidis ATCC 12228]
MKILFEQIINHAIKQEASDIHFIPCEEHTIIKLRIKDELTIYDRLSFPIYKKLLIYMKFQSGLDVSTQHRAQSGRYSYKM
KHLYYLRISTLPLSLGNESCVIRIVPQYFQTTRESYEFKDFKHFMKKKQGLLLFSGPTGSGKSTLMYQMVLYAHQKLNLN
VISVEDPVEQILNGITQISVNEKAGINYESSFKAILRCDPDVILIGEIRDSTVAKYVIQASLSGHLVLSTMHANDCKGAL
LRLLEMGISVQELCQAINLISNQRLITTTTNYRQLVSELMFQSQINYFFEHNHSLPKNFTKLATHLNKMSKEGVICEEVV
DKYI

Nucleotide


Download         Length: 975 bp        

>NTDB_id=22579 SE_RS05820 WP_001831109.1 1262127..1263101(-) (comGA) [Staphylococcus epidermidis ATCC 12228]
TTGAAAATTTTATTTGAACAGATTATCAATCATGCAATTAAGCAAGAAGCAAGTGACATACACTTTATACCATGTGAGGA
ACACACTATAATTAAGCTAAGAATTAAAGATGAGCTCACAATATATGATAGACTTTCATTTCCAATTTATAAAAAATTAC
TTATATACATGAAGTTTCAATCTGGCTTAGATGTATCAACACAGCATAGAGCACAAAGTGGTAGATATAGTTATAAGATG
AAGCATTTATATTATTTAAGAATCTCTACATTACCTTTATCATTAGGCAATGAAAGTTGTGTAATTCGTATAGTACCTCA
ATACTTCCAAACTACACGAGAATCATATGAATTTAAAGATTTTAAACACTTTATGAAAAAGAAACAAGGTCTACTTTTGT
TTAGTGGACCAACTGGCTCTGGCAAAAGTACTTTAATGTACCAAATGGTCCTTTATGCACATCAAAAATTGAATTTAAAT
GTAATATCAGTTGAAGATCCAGTTGAGCAAATACTTAATGGAATCACACAAATATCAGTCAATGAAAAGGCAGGTATAAA
TTATGAAAGCTCTTTCAAAGCCATTTTAAGGTGTGATCCAGATGTAATTTTAATTGGTGAAATCAGAGATTCTACCGTAG
CTAAATACGTGATTCAAGCCAGTTTAAGCGGGCATTTAGTTTTGTCGACAATGCACGCGAATGATTGCAAGGGAGCACTA
TTACGATTATTAGAAATGGGTATTTCTGTCCAAGAGTTGTGCCAGGCAATCAATTTAATTTCTAATCAAAGATTAATCAC
TACTACTACAAATTATCGTCAACTAGTATCGGAATTAATGTTTCAAAGCCAAATAAATTATTTTTTTGAACATAATCATT
CTTTACCTAAAAATTTCACAAAATTAGCTACACATTTAAACAAAATGTCTAAAGAAGGTGTTATATGTGAAGAAGTTGTC
GATAAATACATTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Staphylococcus aureus MW2

70.062

100

0.701

  comGA Staphylococcus aureus N315

70.062

100

0.701


Multiple sequence alignment