Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGA
Type: Machinery gene
Locus tag: RE438_RS21665
Genome accession: NZ_CP133557
Coordinates: 4171839..4172882 (-)
Length: 347 a.a.
NCBI ID: WP_048562430.1
Uniprot ID: -
Organism: Bacillus wiedmannii strain EPS29
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
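
The coordinates and the protein length above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention) and that the gene span includes one stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def protein_length(gene_bp: int) -> int:
    """Residues encoded by a coding sequence whose length includes one stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return gene_bp // 3 - 1


# comGA coordinates as listed on this page: 4171839..4172882 (-)
gene_bp = span_length(4171839, 4172882)
print(gene_bp)                  # 1044
print(protein_length(gene_bp))  # 347
```

Under these assumptions the span gives 1044 bp and 347 residues, which agrees with the lengths reported in the Sequence section.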

Genomic Context


Location: 4166839..4177882
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  RE438_RS21620 (RE438_21595) - 4167269..4167469 (-) 201 WP_000106081.1 YqzE family protein -
  RE438_RS21625 (RE438_21600) - 4167508..4168005 (-) 498 WP_048562314.1 shikimate kinase -
  RE438_RS21630 (RE438_21605) - 4168124..4168774 (-) 651 WP_048562313.1 2OG-Fe(II) oxygenase -
  RE438_RS21635 (RE438_21610) comGG 4168949..4169320 (-) 372 WP_048562312.1 competence type IV pilus minor pilin ComGG -
  RE438_RS21640 (RE438_21615) comGF 4169317..4169787 (-) 471 WP_048562311.1 competence type IV pilus minor pilin ComGF -
  RE438_RS21645 (RE438_21620) - 4169757..4170059 (-) 303 WP_000829516.1 hypothetical protein -
  RE438_RS21650 (RE438_21625) comGD 4170052..4170507 (-) 456 WP_048562310.1 competence type IV pilus minor pilin ComGD -
  RE438_RS21655 (RE438_21630) comGC 4170504..4170803 (-) 300 WP_001178693.1 competence type IV pilus major pilin ComGC -
  RE438_RS21660 (RE438_21635) comGB 4170815..4171846 (-) 1032 WP_080349860.1 competence type IV pilus assembly protein ComGB -
  RE438_RS21665 (RE438_21640) comGA 4171839..4172882 (-) 1044 WP_048562430.1 competence protein ComGA Machinery gene
  RE438_RS21670 (RE438_21645) - 4173089..4173784 (+) 696 WP_048562308.1 metalloregulator ArsR/SmtB family transcription factor -
  RE438_RS21675 (RE438_21650) - 4173911..4174153 (+) 243 WP_000440714.1 DUF2626 domain-containing protein -
  RE438_RS21680 (RE438_21655) - 4174260..4175654 (+) 1395 WP_048562307.1 L-cystine transporter -
  RE438_RS21685 (RE438_21660) - 4175885..4176292 (-) 408 WP_048562306.1 hypothetical protein -
  RE438_RS21690 (RE438_21665) - 4176391..4176606 (-) 216 WP_001008334.1 DUF3912 family protein -
  RE438_RS21695 (RE438_21670) - 4176861..4177172 (+) 312 WP_048562305.1 hypothetical protein -
  RE438_RS21700 (RE438_21675) - 4177211..4177699 (-) 489 WP_000764275.1 hypothetical protein -
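
The coordinates in the table can be used to see how tightly the comG genes are packed. The sketch below is an illustration only: it assumes 1-based inclusive coordinates, copies the rows from RE438_RS21635 to RE438_RS21665 by hand, and uses hypothetical helper names. It also checks the observation that the Location window equals the comGA coordinates extended by 5,000 bp on each side.

```python
# (label, start, end) for the consecutive minus-strand rows from comGG to comGA,
# copied from the table above; "RS21645" is the unnamed hypothetical protein.
features = [
    ("comGG", 4168949, 4169320),
    ("comGF", 4169317, 4169787),
    ("RS21645", 4169757, 4170059),
    ("comGD", 4170052, 4170507),
    ("comGC", 4170504, 4170803),
    ("comGB", 4170815, 4171846),
    ("comGA", 4171839, 4172882),
]


def gap(left, right):
    """bp between two features; a negative value means they overlap."""
    return right[1] - left[2] - 1


# Gap (or overlap) between each pair of neighbouring features.
gaps = {f"{a[0]}/{b[0]}": gap(a, b) for a, b in zip(features, features[1:])}
for pair, bp in gaps.items():
    print(pair, bp)

# Window shown under "Location", rebuilt from the comGA coordinates.
FLANK = 5000  # assumed flank size, inferred from the numbers on this page
window = (4171839 - FLANK, 4172882 + FLANK)
print(window)  # (4166839, 4177882)
```

With these inputs every neighbouring pair overlaps by a few bp except comGC/comGB, which are separated by 11 bp.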

Sequence


Protein


Length: 347 a.a.; Molecular weight: 39324.94 Da; Isoelectric point: 8.7272

>NTDB_id=874856 RE438_RS21665 WP_048562430.1 4171839..4172882(-) (comGA) [Bacillus wiedmannii strain EPS29]
MNGIEMFANTILKEACRVQASDLHIVPRKKDVAVQLRIGKDLMTKHCIEKEFGEKLVSHFKFLASMDIGERRKPQNGSLY
LQMDGQEVYLRLSTLPTVYQESLVIRLHLQASIQPLSHLSLFPSTAKKLLSFLHYSHGLLVFTGPTGSGKTTTMYALLEV
IKKKKTRRIVTLEDPVEKRNDDVLQIQINEKAGITYEAGLKAILRHDPDIILVGEIRDEETAKIAIRASLTGHLVMTTLH
TSDAKGAILRFMDFGITRQEIEQSLLAIAAQRLVELKCPFCQRKCSILCKSMREVRQASIYELLYGYELKQAIKEANGEC
VTYKHETLESLIRKGYALGFLEEDVYV

Nucleotide


Length: 1044 bp

>NTDB_id=874856 RE438_RS21665 WP_048562430.1 4171839..4172882(-) (comGA) [Bacillus wiedmannii strain EPS29]
ATGAATGGAATTGAAATGTTTGCGAATACGATTTTGAAAGAGGCGTGCAGGGTACAAGCTTCGGATTTACATATTGTGCC
CCGGAAGAAGGATGTAGCGGTTCAACTGCGTATAGGAAAAGATTTAATGACGAAACACTGTATCGAAAAGGAGTTTGGAG
AAAAGCTTGTTTCACACTTTAAATTTTTAGCATCCATGGATATAGGGGAGAGACGGAAGCCACAAAATGGTTCACTGTAT
TTACAAATGGATGGACAAGAAGTGTATTTACGCCTTTCCACGCTTCCAACCGTATACCAAGAAAGTCTCGTTATTCGTCT
TCATTTACAAGCATCTATTCAGCCGTTATCTCATCTTTCGTTATTTCCAAGTACAGCGAAAAAACTACTCTCTTTTTTAC
ACTATTCACATGGGTTACTCGTATTTACTGGACCGACTGGTTCGGGGAAGACAACAACAATGTATGCATTATTAGAGGTA
ATTAAAAAAAAGAAAACACGTCGCATCGTTACACTTGAAGATCCAGTTGAAAAAAGAAATGACGATGTATTACAAATTCA
AATAAATGAAAAAGCAGGTATCACATATGAGGCGGGACTAAAGGCCATTTTACGTCATGATCCAGATATTATTTTAGTCG
GTGAAATTCGTGATGAAGAAACAGCGAAAATCGCTATAAGAGCAAGTTTGACTGGACATTTAGTAATGACGACACTGCAT
ACTAGTGATGCGAAAGGGGCGATACTCCGATTTATGGATTTCGGTATAACGAGGCAAGAGATTGAACAATCTTTATTGGC
TATAGCTGCACAGCGACTTGTCGAATTGAAGTGTCCGTTTTGCCAAAGAAAGTGCTCAATTTTATGTAAATCAATGAGGG
AGGTAAGACAAGCGAGTATTTATGAGTTGTTATATGGATATGAGTTAAAACAAGCGATTAAAGAAGCGAACGGGGAATGT
GTCACATACAAGCATGAAACATTAGAATCTTTGATACGAAAAGGATACGCTTTAGGATTTTTAGAAGAGGATGTTTATGT
TTAA
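
The nucleotide record can be checked against the protein record by translating it. The sketch below builds a codon table in pure Python, assuming the standard codon assignments, and translates only the first 30 and last 15 nucleotides of the sequence above. The helper name `translate` is illustrative; a library such as Biopython could be used instead.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 15 nucleotides of the comGA record above.
print(translate("ATGAATGGAATTGAAATGTTTGCGAATACG"))  # MNGIEMFANT
print(translate("GATGTTTATGTTTAA"))                 # DVYV*
```

Both fragments match the start (MNGIEMFANT) and end (DVYV) of the protein sequence listed above, with the final codon TAA acting as the stop.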


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168 58.501 100 0.585
  pilF Thermus thermophilus HB27 38.58 93.372 0.36
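
The page does not define the Ha-value. The two rows listed are numerically consistent with identity multiplied by coverage, both taken as fractions; the sketch below checks that assumption and should not be read as the database's documented formula. The function name is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100) * (coverage_pct / 100)


# (protein, identity %, coverage %, listed Ha-value) from the table above
rows = [
    ("comGA", 58.501, 100.0, 0.585),
    ("pilF", 38.58, 93.372, 0.36),
]

for name, identity, coverage, listed in rows:
    computed = ha_value(identity, coverage)
    # Compare after rounding to the three decimals shown on the page.
    print(name, round(computed, 3), listed)
```

If the assumption holds, a hit with high identity over only a short aligned region scores lower than one with moderate identity across the full length, which is why both columns matter when ranking homologs.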