Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   D9829_RS15390 Genome accession   NZ_CP033044
Coordinates   3175118..3176191 (+) Length   357 a.a.
NCBI ID   WP_121611828.1    Uniprot ID   -
Organism   Mesobacillus foraminis strain Bac44     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3170118..3181191
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  D9829_RS15355 - 3170804..3171115 (+) 312 WP_121611813.1 MTH1187 family thiamine-binding protein -
  D9829_RS15360 - 3171158..3171331 (-) 174 WP_121611815.1 DUF2759 domain-containing protein -
  D9829_RS15365 - 3171443..3171919 (-) 477 WP_121611817.1 hypothetical protein -
  D9829_RS15370 - 3172021..3172653 (+) 633 WP_121611819.1 MBL fold metallo-hydrolase -
  D9829_RS15375 - 3172701..3173795 (+) 1095 WP_121611821.1 class I SAM-dependent methyltransferase -
  D9829_RS15380 - 3173840..3174085 (-) 246 WP_121611823.1 DUF2626 domain-containing protein -
  D9829_RS15385 - 3174167..3174868 (-) 702 WP_121611825.1 metalloregulator ArsR/SmtB family transcription factor -
  D9829_RS15390 comGA 3175118..3176191 (+) 1074 WP_121611828.1 competence type IV pilus ATPase ComGA Machinery gene
  D9829_RS15395 comGB 3176175..3177209 (+) 1035 WP_121611830.1 competence type IV pilus assembly protein ComGB -
  D9829_RS15400 comGC 3177232..3177543 (+) 312 WP_121611832.1 competence type IV pilus major pilin ComGC -
  D9829_RS15405 comGD 3177540..3177977 (+) 438 WP_121611834.1 competence type IV pilus minor pilin ComGD -
  D9829_RS15410 - 3177988..3178296 (+) 309 WP_162990227.1 hypothetical protein -
  D9829_RS15415 comGF 3178250..3178726 (+) 477 WP_121611839.1 competence type IV pilus minor pilin ComGF -
  D9829_RS15420 comGG 3178716..3179093 (+) 378 WP_162990228.1 competence type IV pilus minor pilin ComGG -
  D9829_RS15425 - 3179164..3179343 (+) 180 WP_121611843.1 YqzE family protein -
  D9829_RS15430 - 3179379..3180173 (-) 795 WP_121611846.1 YqhG family protein -

Sequence


Protein


Download         Length: 357 a.a.        Molecular weight: 39375.43 Da        Isoelectric Point: 9.2256

>NTDB_id=320693 D9829_RS15390 WP_121611828.1 3175118..3176191(+) (comGA) [Mesobacillus foraminis strain Bac44]
MDKSIQDLADSILADALKKGASDIHIIPRGKDSAIKFRLGNKLMQRLILDNSDSERLVSHFKFTASMDIGDKRRPQSGAY
AYRFKDSLVGLRISTLPANKSESMVIRILPQQNLSPHYQLSLFPSTSRKLVSLMKHAHGLIILTGPTGSGKTTTLYSLLS
EASQVINRNVITLEDPIEKLNDTVLQIQVNEKAGVSYSAGLKAILRHDPDIIMVGEIRDKETAEIAVRAALTGHLVLSTM
HTRDAKGAIYRLLEFGVNWLEIQQTLIAVTAQRLVELTCPFCSEGCSPFCYTAGRGKRGGVFEILSGRALSAVLAESRGE
AADYHYPTLKDAILKGVSLGFIKEEEIKRWIVNEQSQ

Nucleotide


Download         Length: 1074 bp        

>NTDB_id=320693 D9829_RS15390 WP_121611828.1 3175118..3176191(+) (comGA) [Mesobacillus foraminis strain Bac44]
ATTGATAAATCAATACAGGATTTAGCAGACAGTATTTTGGCTGATGCCCTTAAAAAAGGTGCTTCAGATATCCACATTAT
CCCCCGGGGAAAAGACTCGGCAATCAAATTTCGCCTGGGAAACAAACTGATGCAAAGGCTGATCCTGGATAATTCAGACA
GTGAGAGACTAGTGTCACACTTTAAATTTACTGCTTCAATGGATATTGGCGATAAGCGGAGGCCGCAGAGCGGGGCATAT
GCCTACCGATTTAAAGACAGCCTTGTAGGTTTGAGGATATCCACGCTTCCTGCCAACAAAAGCGAAAGTATGGTGATCAG
GATTCTTCCGCAGCAAAATTTAAGTCCCCATTATCAGCTAAGCTTGTTCCCGTCAACCTCGAGAAAACTTGTTTCTCTTA
TGAAACATGCCCACGGTTTAATCATCCTTACAGGTCCAACAGGAAGCGGCAAGACCACAACTCTTTATTCCCTTCTATCA
GAAGCATCCCAGGTAATCAACCGCAATGTTATTACACTCGAAGATCCGATTGAGAAACTGAATGACACTGTGCTGCAGAT
TCAAGTGAATGAGAAGGCAGGTGTTTCGTATTCCGCCGGTTTAAAAGCCATCCTCAGGCATGACCCTGATATTATTATGG
TGGGAGAAATTCGTGATAAAGAAACAGCCGAGATTGCTGTGCGGGCCGCTTTGACGGGACACCTTGTGCTGTCGACCATG
CATACCCGCGATGCCAAAGGTGCAATCTATCGGCTTCTTGAATTTGGTGTCAATTGGCTGGAAATTCAACAGACACTGAT
TGCGGTGACAGCCCAAAGGTTAGTGGAACTCACATGTCCTTTTTGCAGTGAGGGCTGTTCTCCATTTTGCTATACAGCCG
GCAGGGGGAAAAGAGGGGGTGTCTTTGAGATTCTGTCAGGAAGGGCATTGTCTGCTGTTTTGGCGGAATCAAGAGGGGAA
GCTGCTGATTATCATTATCCAACATTAAAAGATGCCATATTAAAAGGGGTTTCTCTTGGATTTATTAAGGAAGAAGAGAT
TAAAAGGTGGATAGTGAATGAACAGAGTCAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

53.933

99.72

0.538

  ctsE Campylobacter jejuni subsp. jejuni 81-176

36.997

100

0.387

  pilB Glaesserella parasuis strain SC1401

38.244

98.88

0.378

  pilB Haemophilus influenzae 86-028NP

37.535

100

0.375

  comGA Staphylococcus aureus N315

38.04

97.199

0.37

  comGA Staphylococcus aureus MW2

38.04

97.199

0.37

  pilB Vibrio cholerae strain A1552

33.676

100

0.367

  pilB Haemophilus influenzae Rd KW20

36.49

100

0.367


Multiple sequence alignment