Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGB   Type   Machinery gene
Locus tag   ACFDBD_RS06175 Genome accession   NZ_CP170249
Coordinates   1303261..1304328 (+) Length   355 a.a.
NCBI ID   WP_002446533.1    Uniprot ID   -
Organism   Staphylococcus epidermidis strain HS01     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1298261..1309328
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACFDBD_RS06145 (ACFDBD_06140) - 1298679..1300139 (+) 1461 WP_002446538.1 rhomboid family intramembrane serine protease -
  ACFDBD_RS06150 (ACFDBD_06145) - 1300123..1300323 (+) 201 WP_002446537.1 YqgQ family protein -
  ACFDBD_RS06155 (ACFDBD_06150) - 1300323..1301309 (+) 987 WP_002446536.1 ROK family glucokinase -
  ACFDBD_RS06160 (ACFDBD_06155) - 1301309..1301635 (+) 327 WP_001831234.1 MTH1187 family thiamine-binding protein -
  ACFDBD_RS06165 (ACFDBD_06160) - 1301635..1302258 (+) 624 WP_002446535.1 MBL fold metallo-hydrolase -
  ACFDBD_RS06170 (ACFDBD_06165) comGA 1302315..1303289 (+) 975 WP_002446534.1 competence type IV pilus ATPase ComGA Machinery gene
  ACFDBD_RS06175 (ACFDBD_06170) comGB 1303261..1304328 (+) 1068 WP_002446533.1 competence type IV pilus assembly protein ComGB Machinery gene
  ACFDBD_RS06180 (ACFDBD_06175) comGC 1304346..1304663 (+) 318 WP_002446532.1 competence type IV pilus major pilin ComGC Machinery gene
  ACFDBD_RS06185 (ACFDBD_06180) comGD 1304653..1305090 (+) 438 WP_002446531.1 competence type IV pilus minor pilin ComGD -
  ACFDBD_RS06190 (ACFDBD_06185) - 1305077..1305370 (+) 294 WP_002446530.1 hypothetical protein -
  ACFDBD_RS06195 (ACFDBD_06190) comGF 1305294..1305791 (+) 498 Protein_1181 competence type IV pilus minor pilin ComGF -
  ACFDBD_RS06200 (ACFDBD_06195) - 1305961..1306473 (+) 513 WP_002446528.1 shikimate kinase -
  ACFDBD_RS06205 (ACFDBD_06200) gcvT 1306691..1307782 (+) 1092 WP_002446527.1 glycine cleavage system aminomethyltransferase GcvT -
  ACFDBD_RS06210 (ACFDBD_06205) gcvPA 1307802..1309148 (+) 1347 WP_002446526.1 aminomethyl-transferring glycine dehydrogenase subunit GcvPA -

Sequence


Protein


Download         Length: 355 a.a.        Molecular weight: 42350.56 Da        Isoelectric Point: 10.0598

>NTDB_id=1056008 ACFDBD_RS06175 WP_002446533.1 1303261..1304328(+) (comGB) [Staphylococcus epidermidis strain HS01]
MKKLSINTFKYKRNKYLTEIQSINLLQRLHQLLSHGFTLYQSFKFLNSYFKYKERTINEKIIQHLQNGATCYDILKMIGY
PELVLLQIKFAENYGNIDEALVDTVQYMKRNLKAKKRLIKTLQYPVALISIFLFILTILNITVIPQFQQLYETMNVKLST
FQNLLTLIITLLPKLTFIFIFMSGIAFFITYKFYYYLPIERKLKSILKIPMINTYYKIYRTYQLSNQLSLFYRNGTSLQQ
IVQIYRNEQDNEFLKFLGNCLFKEANRGLPLPTILMNLQCFQNDLIKFIEQGEKNGKLDIELKLYSQMLLQQFEEKVLKQ
TKFIQPIIFFILGIFIVSLYLVIMLPMFELMQTIK

Nucleotide


Download         Length: 1068 bp        

>NTDB_id=1056008 ACFDBD_RS06175 WP_002446533.1 1303261..1304328(+) (comGB) [Staphylococcus epidermidis strain HS01]
GTGAAGAAGTTGTCGATAAATACATTTAAATATAAGAGGAATAAATATCTTACTGAAATACAATCAATAAACTTATTACA
GAGATTACATCAGCTTTTAAGTCACGGATTCACTTTATATCAAAGTTTTAAATTTTTAAACTCTTATTTTAAATATAAAG
AACGAACAATAAATGAAAAGATTATCCAACATCTACAAAACGGCGCTACATGTTATGATATTTTAAAAATGATAGGGTAT
CCAGAATTAGTTCTTCTTCAAATAAAATTTGCTGAAAACTACGGTAACATTGATGAGGCTCTCGTTGATACTGTTCAATA
TATGAAAAGAAATTTGAAAGCTAAAAAACGACTCATCAAAACCTTACAATATCCTGTTGCATTAATTTCTATCTTCTTAT
TCATATTAACCATTTTAAATATAACTGTCATACCTCAATTTCAACAACTTTATGAGACTATGAATGTTAAATTATCAACA
TTTCAAAATCTACTAACTCTTATTATTACCCTTCTTCCCAAGCTAACTTTCATTTTTATCTTTATGAGTGGTATAGCATT
TTTTATCACTTATAAATTTTACTATTATCTACCAATTGAGAGAAAATTAAAATCTATTTTAAAAATCCCAATGATTAATA
CGTATTATAAAATATATAGAACTTATCAACTTTCCAATCAGCTTTCTTTATTTTACAGAAATGGTACAAGTCTTCAACAA
ATTGTCCAAATATATCGTAATGAGCAAGATAACGAATTTCTTAAATTTCTGGGTAATTGTCTTTTTAAAGAAGCTAATAG
AGGGCTCCCGTTACCTACTATATTAATGAATCTACAATGTTTTCAAAATGATTTAATTAAATTCATAGAACAAGGAGAGA
AAAATGGAAAATTAGATATAGAATTAAAATTATACAGTCAAATGTTATTACAGCAATTTGAGGAAAAAGTGTTAAAACAA
ACAAAATTTATACAACCTATCATCTTCTTTATCTTGGGAATTTTTATTGTATCTTTATACTTAGTCATTATGCTTCCTAT
GTTCGAACTTATGCAAACAATAAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGB Staphylococcus aureus MW2

48.596

100

0.487

  comGB Staphylococcus aureus N315

48.596

100

0.487


Multiple sequence alignment