Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   CWR44_RS10685 Genome accession   NZ_CP025031
Coordinates   2172208..2173182 (-) Length   324 a.a.
NCBI ID   WP_011275668.1    Uniprot ID   -
Organism   Staphylococcus haemolyticus strain SGAir0252     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2167208..2178182
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CWR44_RS10645 (CWR44_10595) gcvT 2167758..2168849 (-) 1092 WP_011275675.1 glycine cleavage system aminomethyltransferase GcvT -
  CWR44_RS10650 (CWR44_10600) - 2169038..2169556 (-) 519 WP_011275674.1 shikimate kinase -
  CWR44_RS13680 (CWR44_10610) - 2169716..2170207 (-) 492 WP_011275673.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  CWR44_RS10665 (CWR44_10615) - 2170125..2170424 (-) 300 WP_011275672.1 hypothetical protein -
  CWR44_RS10670 (CWR44_10620) comGD 2170405..2170857 (-) 453 WP_370444504.1 competence type IV pilus minor pilin ComGD -
  CWR44_RS10675 (CWR44_10625) comGC 2170838..2171158 (-) 321 WP_011275670.1 competence type IV pilus major pilin ComGC Machinery gene
  CWR44_RS10680 (CWR44_10630) comGB 2171172..2172236 (-) 1065 WP_011275669.1 competence type IV pilus assembly protein ComGB -
  CWR44_RS10685 (CWR44_10635) comGA 2172208..2173182 (-) 975 WP_011275668.1 competence type IV pilus ATPase ComGA Machinery gene
  CWR44_RS10690 (CWR44_10640) - 2173234..2173860 (-) 627 WP_011275667.1 MBL fold metallo-hydrolase -
  CWR44_RS10695 (CWR44_10645) - 2173862..2174185 (-) 324 WP_104948337.1 MTH1187 family thiamine-binding protein -
  CWR44_RS10700 (CWR44_10650) - 2174185..2175171 (-) 987 WP_011275665.1 ROK family glucokinase -
  CWR44_RS10705 (CWR44_10655) - 2175168..2175371 (-) 204 WP_011275664.1 YqgQ family protein -
  CWR44_RS10710 (CWR44_10660) - 2175355..2176812 (-) 1458 WP_011275663.1 rhomboid family protein -
  CWR44_RS10715 (CWR44_10665) - 2176828..2177364 (-) 537 WP_011275662.1 5-formyltetrahydrofolate cyclo-ligase -
  CWR44_RS10720 (CWR44_10670) rpmG 2177601..2177750 (-) 150 WP_011275661.1 50S ribosomal protein L33 -

Sequence


Protein


Download         Length: 324 a.a.        Molecular weight: 37137.14 Da        Isoelectric Point: 8.6557

>NTDB_id=257500 CWR44_RS10685 WP_011275668.1 2172208..2173182(-) (comGA) [Staphylococcus haemolyticus strain SGAir0252]
MKLLFREIVNKAISKNASDIHFIPTVDEVHIKFRINDYLELYEIFNLDVYQKLLVFMKFKSGLDVSSHQSAQSGRYTYQA
KSTFYLRISTLPLSLGIESCVIRIIPQYFQAKKEYKEFNDFKHLVNKKQGLILLTGPTGSGKSTLMYQMVLHAYKELNLN
VITIENPVEQLLKGITQISINKKAGIDYVSSFKAILRCDPDIILIGEIRDAEVAKCVIQASLSGHLVLSTMHSTNCRGAL
LRLLEMGISIQELTQSINIISNQRLITTTQNERRLICETIDKKQIQFFFEHEQTMPHNFNNLQQQLNLLSKEGTICEDTA
SKYF

Nucleotide


Download         Length: 975 bp        

>NTDB_id=257500 CWR44_RS10685 WP_011275668.1 2172208..2173182(-) (comGA) [Staphylococcus haemolyticus strain SGAir0252]
TTGAAACTATTATTCAGAGAGATAGTTAATAAAGCTATTTCAAAAAATGCGAGTGACATACATTTTATTCCTACAGTTGA
TGAAGTTCATATTAAATTTAGAATTAATGATTACCTTGAACTTTATGAAATATTCAACTTAGATGTATATCAAAAATTAT
TAGTATTTATGAAGTTCAAATCAGGTTTGGACGTTTCATCTCATCAATCCGCTCAAAGTGGTCGTTATACTTATCAAGCT
AAATCTACTTTTTATTTACGGATTTCAACATTACCTTTGTCTTTAGGTATAGAAAGTTGTGTAATTAGGATAATTCCTCA
ATATTTTCAAGCAAAAAAAGAATATAAAGAATTTAACGATTTTAAACACTTAGTAAATAAAAAACAGGGATTAATTCTAT
TAACTGGACCCACAGGATCCGGAAAAAGCACTCTGATGTATCAAATGGTTCTACATGCATATAAAGAATTAAATCTTAAT
GTCATCACTATCGAAAACCCAGTTGAGCAATTACTAAAGGGAATTACTCAAATATCAATTAATAAAAAAGCAGGTATCGA
CTACGTAAGTTCATTTAAAGCTATATTAAGATGTGATCCTGATATTATTTTAATAGGTGAAATAAGAGACGCAGAAGTTG
CTAAATGTGTAATTCAAGCTAGTTTGAGTGGACATCTTGTATTATCAACTATGCATTCCACTAATTGTAGAGGTGCATTA
CTTCGATTACTTGAGATGGGAATTTCAATTCAAGAATTAACTCAATCGATCAATATCATTTCTAATCAAAGATTAATAAC
GACTACTCAGAATGAGCGTCGCTTAATTTGTGAAACAATAGATAAAAAGCAGATACAATTTTTCTTCGAGCATGAACAAA
CAATGCCACATAATTTTAATAATTTACAACAGCAATTAAATTTATTATCTAAAGAAGGAACAATTTGTGAAGATACTGCA
AGTAAATATTTTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Staphylococcus aureus MW2

65.325

99.691

0.651

  comGA Staphylococcus aureus N315

65.325

99.691

0.651


Multiple sequence alignment