Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGB   Type   Machinery gene
Locus tag   EH163_RS10030 Genome accession   NZ_CP034115
Coordinates   1975666..1976733 (+) Length   355 a.a.
NCBI ID   WP_001831131.1    Uniprot ID   A0A9Q5JLH5
Organism   Staphylococcus epidermidis strain CDC121     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1970666..1981733
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EH163_RS10000 - 1971084..1972544 (+) 1461 WP_001832757.1 rhomboid family intramembrane serine protease -
  EH163_RS10005 - 1972528..1972728 (+) 201 WP_001831138.1 YqgQ family protein -
  EH163_RS10010 - 1972728..1973714 (+) 987 WP_001831113.1 ROK family glucokinase -
  EH163_RS10015 - 1973714..1974040 (+) 327 WP_001831234.1 MTH1187 family thiamine-binding protein -
  EH163_RS10020 - 1974040..1974663 (+) 624 WP_001831016.1 MBL fold metallo-hydrolase -
  EH163_RS10025 comGA 1974720..1975694 (+) 975 WP_058649760.1 competence type IV pilus ATPase ComGA Machinery gene
  EH163_RS10030 comGB 1975666..1976733 (+) 1068 WP_001831131.1 competence type IV pilus assembly protein ComGB Machinery gene
  EH163_RS10035 comGC 1976751..1977068 (+) 318 WP_001831271.1 competence type IV pilus major pilin ComGC Machinery gene
  EH163_RS10040 comGD 1977058..1977495 (+) 438 WP_001831025.1 competence type IV pilus minor pilin ComGD -
  EH163_RS10045 - 1977482..1977775 (+) 294 WP_080035447.1 hypothetical protein -
  EH163_RS10050 - 1977699..1978190 (+) 492 WP_002440043.1 competence type IV pilus minor pilin ComGF -
  EH163_RS10060 - 1978355..1978867 (+) 513 WP_002485736.1 shikimate kinase -
  EH163_RS10065 gcvT 1979085..1980176 (+) 1092 WP_001831107.1 glycine cleavage system aminomethyltransferase GcvT -
  EH163_RS10070 gcvPA 1980196..1981542 (+) 1347 WP_002456181.1 aminomethyl-transferring glycine dehydrogenase subunit GcvPA -

Sequence


Protein


Download         Length: 355 a.a.        Molecular weight: 42379.68 Da        Isoelectric Point: 10.1702

>NTDB_id=329165 EH163_RS10030 WP_001831131.1 1975666..1976733(+) (comGB) [Staphylococcus epidermidis strain CDC121]
MKKLSINTFKYKRNKYLTEIQSIDLLQRLQQLLSHGFTLYQSFKFLNSYFKYKERTINKKIIQHLQNGATCYDILKIIGY
PELVLLQIKFAENYGNIEEALVDTVQYMKRNLKAKKRLIKTLQYPVALISIFLFILTILNITVIPQFQQLYETMNVKLST
FQNLLTLIITRLPKLTFIFIFISGIAFFITYKFYYYLPIEKKLKSILKIPMINTYYKIYRTYQLSNQLSLFYRNGTSLQQ
IVRIYRNEQDNDFLKFLGDYLFKEANKGLPLPVILMNLKCFQNDLIKFIEQGEKNGKLDIELKLYSQMLLQQFEEKVLKQ
TKFIQPIIFFILGIFIVSLYLVIMLPMFELMQTIK

Nucleotide


Download         Length: 1068 bp        

>NTDB_id=329165 EH163_RS10030 WP_001831131.1 1975666..1976733(+) (comGB) [Staphylococcus epidermidis strain CDC121]
GTGAAGAAGTTGTCGATAAATACATTTAAATATAAGAGGAATAAATATCTTACTGAAATACAATCAATAGACTTACTACA
GAGATTACAACAGCTTTTAAGTCACGGATTCACTTTATATCAAAGTTTTAAATTTTTAAACTCCTATTTTAAATATAAAG
AGCGAACAATAAATAAAAAGATTATCCAACATCTACAAAACGGTGCTACATGTTATGATATTTTAAAAATAATAGGGTAT
CCAGAATTAGTTCTTCTTCAAATAAAATTTGCTGAAAACTACGGCAACATTGAGGAGGCTCTCGTTGATACTGTTCAATA
TATGAAAAGAAATCTGAAAGCTAAAAAACGACTCATCAAAACCTTACAATATCCTGTTGCATTAATTTCTATCTTCTTAT
TCATATTAACCATTTTAAATATAACTGTCATACCTCAATTTCAACAACTTTATGAGACTATGAATGTTAAATTATCAACA
TTTCAAAATCTACTAACTCTTATTATTACCCGTCTTCCCAAACTAACTTTCATTTTTATCTTTATTAGTGGTATAGCATT
TTTTATCACTTATAAATTCTACTATTATCTACCAATTGAGAAAAAGTTAAAATCTATTTTAAAAATCCCAATGATTAATA
CGTATTATAAAATATATAGAACTTATCAACTTTCCAATCAACTTTCTTTATTTTACAGAAATGGTACAAGTCTTCAACAA
ATTGTCCGTATATATCGTAATGAGCAAGATAACGATTTTCTTAAATTTCTGGGTGATTATCTTTTTAAAGAAGCTAATAA
AGGGCTCCCGTTACCTGTTATATTAATGAATTTAAAATGTTTTCAAAATGATTTGATTAAATTCATAGAACAAGGAGAGA
AAAATGGGAAATTAGATATAGAATTAAAGTTATACAGTCAAATGCTATTACAGCAATTTGAAGAAAAAGTGTTAAAACAA
ACAAAATTTATACAACCTATCATCTTCTTTATCTTGGGAATTTTTATTGTATCTTTATACTTAGTCATTATGCTTCCTAT
GTTTGAACTTATGCAAACAATAAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGB Staphylococcus aureus MW2

49.719

100

0.499

  comGB Staphylococcus aureus N315

49.719

100

0.499


Multiple sequence alignment