Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGB
Type: Machinery gene
Locus tag: SE1UMMC_RS05960
Genome accession: NZ_CP013943
Coordinates: 1194205..1195272 (-)
Length: 355 a.a.
NCBI ID: WP_001831131.1
UniProt ID: A0A9Q5JLH5
Organism: Staphylococcus epidermidis strain DAR1907
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
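
The coordinates and the protein length above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; the function name `feature_length` is illustrative and not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates of comGB as listed in the overview.
gene_bp = feature_length(1194205, 1195272)

# Assumption: the CDS ends with one stop codon that is not translated.
protein_aa = gene_bp // 3 - 1

print(gene_bp, protein_aa)  # 1068 355
```

Both values agree with the lengths reported for the nucleotide and protein sequences of this entry.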

Genomic Context


Location: 1189205..1200272
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SE1UMMC_RS05920 (SE1UMMC_05580) gcvPA 1189396..1190742 (-) 1347 WP_101750535.1 aminomethyl-transferring glycine dehydrogenase subunit GcvPA -
  SE1UMMC_RS05925 (SE1UMMC_05585) gcvT 1190762..1191853 (-) 1092 WP_001831107.1 glycine cleavage system aminomethyltransferase GcvT -
  SE1UMMC_RS05930 (SE1UMMC_05590) - 1192071..1192583 (-) 513 WP_002440040.1 shikimate kinase -
  SE1UMMC_RS05940 (SE1UMMC_05595) - 1192748..1193239 (-) 492 WP_002440043.1 competence type IV pilus minor pilin ComGF -
  SE1UMMC_RS05945 - 1193163..1193456 (-) 294 WP_001831116.1 hypothetical protein -
  SE1UMMC_RS05950 (SE1UMMC_05600) comGD 1193443..1193880 (-) 438 WP_001831025.1 competence type IV pilus minor pilin ComGD -
  SE1UMMC_RS05955 (SE1UMMC_05605) comGC 1193870..1194187 (-) 318 WP_001831271.1 competence type IV pilus major pilin ComGC Machinery gene
  SE1UMMC_RS05960 (SE1UMMC_05610) comGB 1194205..1195272 (-) 1068 WP_001831131.1 competence type IV pilus assembly protein ComGB Machinery gene
  SE1UMMC_RS05965 (SE1UMMC_05615) comGA 1195244..1196218 (-) 975 WP_001831109.1 competence type IV pilus ATPase ComGA Machinery gene
  SE1UMMC_RS05970 (SE1UMMC_05620) - 1196275..1196898 (-) 624 WP_001831016.1 MBL fold metallo-hydrolase -
  SE1UMMC_RS05975 (SE1UMMC_05625) - 1196898..1197224 (-) 327 WP_001831234.1 MTH1187 family thiamine-binding protein -
  SE1UMMC_RS05980 (SE1UMMC_05630) - 1197224..1198210 (-) 987 WP_001831113.1 ROK family glucokinase -
  SE1UMMC_RS05985 (SE1UMMC_05635) - 1198210..1198410 (-) 201 WP_001831138.1 YqgQ family protein -
  SE1UMMC_RS05990 (SE1UMMC_05640) - 1198394..1199854 (-) 1461 WP_002484879.1 rhomboid family protein -
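
The Location range is consistent with the gene coordinates extended by 5,000 bp on each side. That flank size is inferred from the numbers on this page, not stated by the source. A minimal sketch, with hypothetical helper names, that reproduces the window and checks a few of the listed neighbours against it:

```python
FLANK_BP = 5000  # assumption: inferred from the coordinates, not documented


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend a feature by `flank` bp on each side, clamped at position 1."""
    return max(1, start - flank), end + flank


def lies_within(window: tuple[int, int], feature: tuple[int, int]) -> bool:
    """True if the feature is fully contained in the window."""
    return window[0] <= feature[0] and feature[1] <= window[1]


window = context_window(1194205, 1195272)

# A few rows copied from the table above (name -> coordinates).
neighbours = {
    "gcvPA": (1189396, 1190742),
    "comGA": (1195244, 1196218),
    "rhomboid family protein": (1198394, 1199854),
}
inside = {name: lies_within(window, coords) for name, coords in neighbours.items()}

print(window)  # (1189205, 1200272)
```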

Sequence


Protein


Length: 355 a.a.        Molecular weight: 42379.68 Da        Isoelectric Point: 10.1702

>NTDB_id=166397 SE1UMMC_RS05960 WP_001831131.1 1194205..1195272(-) (comGB) [Staphylococcus epidermidis strain DAR1907]
MKKLSINTFKYKRNKYLTEIQSIDLLQRLQQLLSHGFTLYQSFKFLNSYFKYKERTINKKIIQHLQNGATCYDILKIIGY
PELVLLQIKFAENYGNIEEALVDTVQYMKRNLKAKKRLIKTLQYPVALISIFLFILTILNITVIPQFQQLYETMNVKLST
FQNLLTLIITRLPKLTFIFIFISGIAFFITYKFYYYLPIEKKLKSILKIPMINTYYKIYRTYQLSNQLSLFYRNGTSLQQ
IVRIYRNEQDNDFLKFLGDYLFKEANKGLPLPVILMNLKCFQNDLIKFIEQGEKNGKLDIELKLYSQMLLQQFEEKVLKQ
TKFIQPIIFFILGIFIVSLYLVIMLPMFELMQTIK
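
The FASTA header of this record packs several fields into one line. The sketch below splits it with a regular expression; the pattern is written against the header shown on this page only, so treating it as the general header format of the database is an assumption, and the field names are illustrative.

```python
import re

# Pattern modelled on the single header shown in this entry (assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; coordinates become ints."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=166397 SE1UMMC_RS05960 WP_001831131.1 "
    "1194205..1195272(-) (comGB) [Staphylococcus epidermidis strain DAR1907]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```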

Nucleotide


Length: 1068 bp

>NTDB_id=166397 SE1UMMC_RS05960 WP_001831131.1 1194205..1195272(-) (comGB) [Staphylococcus epidermidis strain DAR1907]
GTGAAGAAGTTGTCGATAAATACATTTAAATATAAGAGGAATAAATATCTTACTGAAATACAATCAATAGACTTACTACA
GAGATTACAACAGCTTTTAAGTCACGGATTCACTTTATATCAAAGTTTTAAATTTTTAAACTCCTATTTTAAATATAAAG
AGCGAACAATAAATAAAAAGATTATCCAACATCTACAAAACGGTGCTACATGTTATGATATTTTAAAAATAATAGGGTAT
CCAGAATTAGTTCTTCTTCAAATAAAATTTGCTGAAAACTACGGCAACATTGAGGAGGCTCTCGTTGATACTGTTCAATA
TATGAAAAGAAATCTGAAAGCTAAAAAACGACTCATCAAAACCTTACAATATCCTGTTGCATTAATTTCTATCTTCTTAT
TCATATTAACCATTTTAAATATAACTGTCATACCTCAATTTCAACAACTTTATGAGACTATGAATGTTAAATTATCAACA
TTTCAAAATCTACTAACTCTTATTATTACCCGTCTTCCCAAACTAACTTTCATTTTTATCTTTATTAGTGGTATAGCATT
TTTTATCACTTATAAATTCTACTATTATCTACCAATTGAGAAAAAGTTAAAATCTATTTTAAAAATCCCAATGATTAATA
CGTATTATAAAATATATAGAACTTATCAACTTTCCAATCAACTTTCTTTATTTTACAGAAATGGTACAAGTCTTCAACAA
ATTGTCCGTATATATCGTAATGAGCAAGATAACGATTTTCTTAAATTTCTGGGTGATTATCTTTTTAAAGAAGCTAATAA
AGGGCTCCCGTTACCTGTTATATTAATGAATTTAAAATGTTTTCAAAATGATTTGATTAAATTCATAGAACAAGGAGAGA
AAAATGGGAAATTAGATATAGAATTAAAGTTATACAGTCAAATGCTATTACAGCAATTTGAAGAAAAAGTGTTAAAACAA
ACAAAATTTATACAACCTATCATCTTCTTTATCTTGGGAATTTTTATTGTATCTTTATACTTAGTCATTATGCTTCCTAT
GTTTGAACTTATGCAAACAATAAAATAA
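
The nucleotide record is given in coding orientation: it begins with GTG and ends with TAA, while the protein begins with M. The sketch below translates a CDS with the standard codon table and reads an alternative start codon (GTG or TTG) as methionine, as is usual for bacterial genes. It is an illustration, not the pipeline the database used, and `translate_cds` is a hypothetical name.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the common bacterial alternative starts are handled.
ALT_START_CODONS = {"GTG", "TTG"}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        residue = CODON_TABLE[codon]
        if i == 0 and codon in ALT_START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 27 bases of the nucleotide record above.
print(translate_cds("GTGAAGAAGTTGTCGATAAATACATTT"))  # MKKLSINTF
```

The output matches the first nine residues of the protein record.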


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGB Staphylococcus aureus MW2 49.719 100 0.499
  comGB Staphylococcus aureus N315 49.719 100 0.499


Multiple sequence alignment