Detailed information    

in silico   Bioinformatically predicted

Overview


Name   comGB   Type   Machinery gene
Locus tag   EGX64_RS11025 Genome accession   NZ_CP033782
Coordinates   2208344..2209411 (-) Length   355 a.a.
NCBI ID   WP_002446533.1    Uniprot ID   -
Organism   Staphylococcus epidermidis strain FDAARGOS_529     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake
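
The coordinates and protein length listed in this overview can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the coding region ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Coordinates of comGB as listed above (assumed 1-based, inclusive).
start, end = 2208344, 2209411

gene_bp = end - start + 1      # nucleotides spanned by the gene
codons = gene_bp // 3          # total codons, including the stop codon
protein_aa = codons - 1        # assumption: one terminal stop codon, not translated

print(gene_bp, protein_aa)
```

Under these assumptions the span works out to 1068 bp, which is a multiple of three, and the derived protein length agrees with the 355 a.a. given above.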

Genomic Context


Location: 2203344..2214411
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EGX64_RS10985 (EGX64_10980) gcvPA 2203535..2204881 (-) 1347 WP_002446526.1 aminomethyl-transferring glycine dehydrogenase subunit GcvPA -
  EGX64_RS10990 (EGX64_10985) gcvT 2204901..2205992 (-) 1092 WP_002446527.1 glycine cleavage system aminomethyltransferase GcvT -
  EGX64_RS10995 (EGX64_10990) - 2206210..2206722 (-) 513 WP_002446528.1 shikimate kinase -
  EGX64_RS11005 (EGX64_11000) comGF 2206887..2207378 (-) 492 WP_002446529.1 competence type IV pilus minor pilin ComGF -
  EGX64_RS11010 (EGX64_11005) - 2207302..2207595 (-) 294 WP_002446530.1 hypothetical protein -
  EGX64_RS11015 (EGX64_11010) comGD 2207582..2208019 (-) 438 WP_002446531.1 competence type IV pilus minor pilin ComGD -
  EGX64_RS11020 (EGX64_11015) comGC 2208009..2208326 (-) 318 WP_002446532.1 competence type IV pilus major pilin ComGC Machinery gene
  EGX64_RS11025 (EGX64_11020) comGB 2208344..2209411 (-) 1068 WP_002446533.1 competence type IV pilus assembly protein ComGB Machinery gene
  EGX64_RS11030 (EGX64_11025) comGA 2209383..2210357 (-) 975 WP_002446534.1 competence type IV pilus ATPase ComGA Machinery gene
  EGX64_RS11035 (EGX64_11030) - 2210414..2211037 (-) 624 WP_002446535.1 MBL fold metallo-hydrolase -
  EGX64_RS11040 (EGX64_11035) - 2211037..2211363 (-) 327 WP_001831234.1 MTH1187 family thiamine-binding protein -
  EGX64_RS11045 (EGX64_11040) - 2211363..2212349 (-) 987 WP_002446536.1 ROK family glucokinase -
  EGX64_RS11050 (EGX64_11045) - 2212349..2212549 (-) 201 WP_002446537.1 YqgQ family protein -
  EGX64_RS11055 (EGX64_11050) - 2212533..2213993 (-) 1461 WP_002446538.1 rhomboid family intramembrane serine protease -
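
The Location window appears to be the comGB coordinates extended by 5000 bp on each side, and each Size value equals end - start + 1. The sketch below checks both for the comG rows of the table and reports how many bases consecutive genes share. The 5000 bp flank is inferred from the numbers rather than stated by the source, and the tuple layout is purely illustrative.

```python
# (gene, start, end, size_bp) copied from the comG rows of the table above.
rows = [
    ("comGF", 2206887, 2207378, 492),
    ("hypothetical", 2207302, 2207595, 294),
    ("comGD", 2207582, 2208019, 438),
    ("comGC", 2208009, 2208326, 318),
    ("comGB", 2208344, 2209411, 1068),
    ("comGA", 2209383, 2210357, 975),
]

FLANK = 5000  # assumption: inferred from the Location line, not documented
target = next(r for r in rows if r[0] == "comGB")
window = (target[1] - FLANK, target[2] + FLANK)

# Size should equal the inclusive span of the coordinates.
sizes_ok = all(end - start + 1 == size for _, start, end, size in rows)

# Bases shared by consecutive genes; zero or negative values mean no overlap.
overlaps = {
    (a[0], b[0]): a[2] - b[1] + 1
    for a, b in zip(rows, rows[1:])
}

print(window, sizes_ok)
for pair, shared in overlaps.items():
    print(pair, shared)
```

With the listed coordinates, comGB and comGA share 29 bp, while comGC and comGB are separated by a short intergenic gap.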

Sequence


Protein


Length: 355 a.a.        Molecular weight: 42350.56 Da        Isoelectric Point: 10.0598

>NTDB_id=325685 EGX64_RS11025 WP_002446533.1 2208344..2209411(-) (comGB) [Staphylococcus epidermidis strain FDAARGOS_529]
MKKLSINTFKYKRNKYLTEIQSINLLQRLHQLLSHGFTLYQSFKFLNSYFKYKERTINEKIIQHLQNGATCYDILKMIGY
PELVLLQIKFAENYGNIDEALVDTVQYMKRNLKAKKRLIKTLQYPVALISIFLFILTILNITVIPQFQQLYETMNVKLST
FQNLLTLIITLLPKLTFIFIFMSGIAFFITYKFYYYLPIERKLKSILKIPMINTYYKIYRTYQLSNQLSLFYRNGTSLQQ
IVQIYRNEQDNEFLKFLGNCLFKEANRGLPLPTILMNLQCFQNDLIKFIEQGEKNGKLDIELKLYSQMLLQQFEEKVLKQ
TKFIQPIIFFILGIFIVSLYLVIMLPMFELMQTIK

Nucleotide


Length: 1068 bp

>NTDB_id=325685 EGX64_RS11025 WP_002446533.1 2208344..2209411(-) (comGB) [Staphylococcus epidermidis strain FDAARGOS_529]
GTGAAGAAGTTGTCGATAAATACATTTAAATATAAGAGGAATAAATATCTTACTGAAATACAATCAATAAACTTATTACA
GAGATTACATCAGCTTTTAAGTCACGGATTCACTTTATATCAAAGTTTTAAATTTTTAAACTCTTATTTTAAATATAAAG
AACGAACAATAAATGAAAAGATTATCCAACATCTACAAAACGGCGCTACATGTTATGATATTTTAAAAATGATAGGGTAT
CCAGAATTAGTTCTTCTTCAAATAAAATTTGCTGAAAACTACGGTAACATTGATGAGGCTCTCGTTGATACTGTTCAATA
TATGAAAAGAAATTTGAAAGCTAAAAAACGACTCATCAAAACCTTACAATATCCTGTTGCATTAATTTCTATCTTCTTAT
TCATATTAACCATTTTAAATATAACTGTCATACCTCAATTTCAACAACTTTATGAGACTATGAATGTTAAATTATCAACA
TTTCAAAATCTACTAACTCTTATTATTACCCTTCTTCCCAAGCTAACTTTCATTTTTATCTTTATGAGTGGTATAGCATT
TTTTATCACTTATAAATTTTACTATTATCTACCAATTGAGAGAAAATTAAAATCTATTTTAAAAATCCCAATGATTAATA
CGTATTATAAAATATATAGAACTTATCAACTTTCCAATCAGCTTTCTTTATTTTACAGAAATGGTACAAGTCTTCAACAA
ATTGTCCAAATATATCGTAATGAGCAAGATAACGAATTTCTTAAATTTCTGGGTAATTGTCTTTTTAAAGAAGCTAATAG
AGGGCTCCCGTTACCTACTATATTAATGAATCTACAATGTTTTCAAAATGATTTAATTAAATTCATAGAACAAGGAGAGA
AAAATGGAAAATTAGATATAGAATTAAAATTATACAGTCAAATGTTATTACAGCAATTTGAGGAAAAAGTGTTAAAACAA
ACAAAATTTATACAACCTATCATCTTCTTTATCTTGGGAATTTTTATTGTATCTTTATACTTAGTCATTATGCTTCCTAT
GTTCGAACTTATGCAAACAATAAAATAA
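
The nucleotide record begins with GTG rather than ATG, yet the protein begins with M. Under the bacterial genetic code (NCBI translation table 11), GTG is an accepted initiation codon and is read as methionine at the first position. The sketch below illustrates this on the first 24 and the last 39 nucleotides of the sequence above. The translate helper is hypothetical and is not part of any tool named on this page.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna, initiator=True):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    When initiator is True, a permitted start codon at position 0 is read as M.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3].upper()
        aa = CODON_TABLE[codon]
        if i == 0 and initiator and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = translate("GTGAAGAAGTTGTCGATAAATACA")  # first 24 nt of the record
tail = translate("ATGCTTCCTATGTTCGAACTTATGCAAACAATAAAATAA", initiator=False)  # last 39 nt
print(head, tail)
```

The two fragments should reproduce the first eight and last twelve residues of the protein sequence listed above, with translation ending at the terminal TAA.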


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.




Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGB Staphylococcus aureus MW2 48.596 100 0.487
  comGB Staphylococcus aureus N315 48.596 100 0.487

