Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGB   Type   Machinery gene
Locus tag   NL113_RS18620 Genome accession   NZ_CP101130
Coordinates   3721511..3722548 (+) Length   345 a.a.
NCBI ID   WP_006637539.1    Uniprot ID   -
Organism   Bacillus sp. KRF7     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3716511..3727548
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NL113_RS18580 - 3717030..3717344 (+) 315 WP_006637546.1 MTH1187 family thiamine-binding protein -
  NL113_RS18585 - 3717402..3717575 (-) 174 WP_006637545.1 DUF2759 domain-containing protein -
  NL113_RS18590 - 3717716..3718354 (+) 639 WP_006637544.1 MBL fold metallo-hydrolase -
  NL113_RS18595 - 3718404..3718649 (-) 246 WP_006637543.1 DUF2626 domain-containing protein -
  NL113_RS18600 - 3718862..3719242 (+) 381 WP_006637542.1 Spx/MgsR family RNA polymerase-binding regulatory protein -
  NL113_RS18610 - 3719461..3720318 (+) 858 WP_006637541.1 STAS domain-containing protein -
  NL113_RS18615 comGA 3720457..3721524 (+) 1068 WP_006637540.1 competence type IV pilus ATPase ComGA Machinery gene
  NL113_RS18620 comGB 3721511..3722548 (+) 1038 WP_006637539.1 competence type IV pilus assembly protein ComGB Machinery gene
  NL113_RS18625 comGC 3722563..3722856 (+) 294 WP_006637538.1 competence type IV pilus major pilin ComGC Machinery gene
  NL113_RS18630 comGD 3722856..3723299 (+) 444 WP_006637537.1 competence type IV pilus minor pilin ComGD -
  NL113_RS18635 comGE 3723283..3723630 (+) 348 WP_006637536.1 competence type IV pilus minor pilin ComGE -
  NL113_RS18640 comGF 3723539..3724030 (+) 492 WP_224254419.1 competence type IV pilus minor pilin ComGF -
  NL113_RS18645 comGG 3724085..3724408 (+) 324 WP_224254418.1 competence type IV pilus minor pilin ComGG -
  NL113_RS18650 - 3724488..3724673 (+) 186 WP_006637533.1 YqzE family protein -
  NL113_RS18655 - 3724767..3725087 (-) 321 WP_006637532.1 DUF3889 domain-containing protein -
  NL113_RS18660 tapA 3725350..3726096 (+) 747 WP_006637531.1 amyloid fiber anchoring/assembly protein TapA -
  NL113_RS18665 sipW 3726093..3726674 (+) 582 WP_006637530.1 signal peptidase I SipW -
  NL113_RS18670 tasA 3726744..3727538 (+) 795 WP_006637529.1 biofilm matrix protein TasA -

Sequence


Protein


Download         Length: 345 a.a.        Molecular weight: 40024.46 Da        Isoelectric Point: 9.6575

>NTDB_id=708988 NL113_RS18620 WP_006637539.1 3721511..3722548(+) (comGB) [Bacillus sp. KRF7]
MKPIKNRWPVGEQAEFLEKLGEMMMNGYTLLDALSMLELQLKRQQKTDIAFGRRKLAEGYPVFQVLNMISFHKAAVSIVY
FAERHGNVPFAFMQSGELLRRKIEQAEKIKKAAHYPAFLILTVCLIVYMMKAAIVPQFSAIYDSMNIETPFLTSFIFLFF
ESFSLLFLCILAAAAVFWAYYLYAFRQKPPEDKMALLIRIPLAGRILKLFNSYFLSLQLSNLLTSGLSIYDSLKAFESQP
FLPFFQKEAKRLIERLKQGEAIEHMLNGHPFYEKDLSKVVAHGQLNGQLHRELYSYSQFLIDRFEKKAEKWTGLLQPLIY
GFTAAMILILYLSMLLPMYQMMNQL

Nucleotide


Download         Length: 1038 bp        

>NTDB_id=708988 NL113_RS18620 WP_006637539.1 3721511..3722548(+) (comGB) [Bacillus sp. KRF7]
ATGAAGCCGATTAAGAACAGATGGCCTGTCGGGGAACAGGCAGAGTTCCTTGAAAAGCTTGGCGAGATGATGATGAACGG
TTATACGCTTCTTGATGCATTAAGCATGCTGGAACTGCAATTAAAGCGGCAGCAAAAAACGGATATTGCATTCGGGAGGA
GAAAGCTTGCGGAAGGGTATCCTGTTTTTCAAGTTTTAAATATGATTTCATTTCATAAAGCTGCCGTCAGCATCGTTTAT
TTCGCCGAACGTCACGGTAATGTGCCATTTGCTTTTATGCAGAGCGGCGAATTGCTCCGCCGTAAAATCGAACAGGCCGA
AAAAATCAAAAAAGCCGCACATTATCCGGCATTTTTGATTTTGACGGTTTGCCTCATTGTCTATATGATGAAAGCCGCCA
TTGTTCCGCAGTTTTCCGCGATCTATGACTCGATGAACATAGAAACGCCCTTTCTGACATCCTTCATCTTTTTATTTTTT
GAAAGTTTCTCCTTGTTGTTTCTGTGCATACTAGCCGCCGCTGCTGTTTTTTGGGCGTATTATTTGTACGCTTTCCGCCA
AAAGCCCCCTGAAGACAAAATGGCTCTTCTCATCAGAATTCCGCTGGCAGGCAGAATCCTCAAATTGTTTAACAGCTACT
TTTTATCACTTCAGCTGAGCAATCTTCTTACATCCGGTTTGTCTATATATGACAGTTTAAAAGCGTTTGAAAGCCAGCCC
TTTTTGCCGTTTTTCCAAAAGGAAGCTAAACGGCTGATCGAGAGGCTGAAACAGGGGGAAGCGATAGAACATATGTTAAA
CGGACACCCGTTTTATGAAAAGGACCTATCAAAGGTGGTGGCTCACGGCCAATTAAACGGCCAGCTTCACAGAGAGCTTT
ATTCATACAGCCAATTTCTGATCGACCGGTTTGAAAAGAAAGCGGAAAAGTGGACAGGCCTGCTGCAGCCGCTGATTTAC
GGTTTTACCGCGGCCATGATTTTAATACTCTATCTGTCCATGCTGTTGCCAATGTATCAAATGATGAATCAGTTATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGB Bacillus subtilis subsp. subtilis str. 168

56.347

93.623

0.528