Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   S101395_RS09255 Genome accession   NZ_CP021920
Coordinates   1749035..1750102 (+) Length   355 a.a.
NCBI ID   WP_006637540.1    Uniprot ID   -
Organism   Bacillus sonorensis strain SRCM101395     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1744035..1755102
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  S101395_RS09215 (S101395_01866) - 1744484..1745593 (+) 1110 WP_029419374.1 hypothetical protein -
  S101395_RS09220 (S101395_01867) - 1745608..1745922 (+) 315 WP_006637546.1 MTH1187 family thiamine-binding protein -
  S101395_RS09225 (S101395_01868) - 1745980..1746153 (-) 174 WP_006637545.1 DUF2759 domain-containing protein -
  S101395_RS09230 (S101395_01869) - 1746294..1746932 (+) 639 WP_006637544.1 MBL fold metallo-hydrolase -
  S101395_RS09235 (S101395_01870) - 1746982..1747227 (-) 246 WP_006637543.1 DUF2626 domain-containing protein -
  S101395_RS09240 (S101395_01871) - 1747440..1747820 (+) 381 WP_006637542.1 Spx/MgsR family RNA polymerase-binding regulatory protein -
  S101395_RS09250 (S101395_01873) - 1748039..1748896 (+) 858 WP_006637541.1 STAS domain-containing protein -
  S101395_RS09255 (S101395_01874) comGA 1749035..1750102 (+) 1068 WP_006637540.1 competence type IV pilus ATPase ComGA Machinery gene
  S101395_RS09260 (S101395_01875) comGB 1750089..1751126 (+) 1038 WP_006637539.1 competence type IV pilus assembly protein ComGB Machinery gene
  S101395_RS09265 (S101395_01876) - 1751192..1752724 (-) 1533 WP_167373791.1 recombinase family protein -
  S101395_RS09270 (S101395_01877) - 1752772..1753680 (-) 909 WP_208620356.1 hypothetical protein -
  S101395_RS09280 (S101395_01879) - 1753936..1754436 (-) 501 WP_088272793.1 ImmA/IrrE family metallo-endopeptidase -
  S101395_RS09285 (S101395_01880) - 1754627..1754974 (-) 348 WP_088272794.1 helix-turn-helix domain-containing protein -

Sequence


Protein


Download         Length: 355 a.a.        Molecular weight: 39821.31 Da        Isoelectric Point: 9.4732

>NTDB_id=234886 S101395_RS09255 WP_006637540.1 1749035..1750102(+) (comGA) [Bacillus sonorensis strain SRCM101395]
MQAIEPLSGRVIEEACRMRASDIHIVPCKKEAIIRFRIDGELIQKDRLTRLECSRLISHFKFLSSMDIGERRQPQSGALT
LQVNNKPVHLRMSTLPTVYDESLVIRVLPQASAPPLRSLSLFPDATSKLLSFLKHSHGLMIFTGPTGSGKTTTLYSLIEY
AKQHFNRNIITLEDPVESRSEHVLQVQVNEKAGMTYSAGLKAVLRHDPDMIILGEIRDAETAKTAVRAALTGHLVLSSMH
AKNAKGAIYRLLEFGVTMTEIEQTLVAVSAQRLVNLVCPFCGEQCSFYCRMAREVRRASIFELLYGKSLNLCIKEAKGAY
VNNRFDTLRKLIRKGIALGYVPAGSYERWVHHEAD

Nucleotide


Download         Length: 1068 bp        

>NTDB_id=234886 S101395_RS09255 WP_006637540.1 1749035..1750102(+) (comGA) [Bacillus sonorensis strain SRCM101395]
TTGCAAGCGATTGAACCATTAAGCGGGAGAGTAATCGAAGAGGCATGCAGAATGAGAGCATCTGACATTCATATTGTTCC
GTGTAAAAAAGAGGCGATTATCCGTTTTAGAATCGACGGTGAATTGATTCAAAAGGACAGGCTGACGAGGCTTGAGTGCT
CAAGGCTGATTTCCCACTTTAAATTTCTTTCTTCAATGGATATCGGAGAACGGAGACAGCCGCAAAGCGGTGCTTTAACC
CTTCAAGTGAACAATAAGCCTGTTCATTTAAGAATGTCGACTTTGCCCACTGTATACGATGAAAGCTTGGTGATCCGCGT
TTTGCCGCAAGCAAGTGCCCCGCCGCTCAGAAGCCTGTCATTGTTTCCGGATGCAACGTCGAAGCTGCTGTCTTTTCTGA
AGCATTCACACGGCCTGATGATCTTCACCGGCCCTACAGGTTCGGGAAAAACGACAACTCTGTATTCGCTAATCGAATAT
GCAAAACAGCATTTTAACCGCAATATTATTACCCTGGAGGATCCGGTGGAATCCAGAAGCGAGCATGTTCTTCAAGTACA
GGTAAATGAGAAGGCGGGTATGACATATTCTGCCGGCTTAAAGGCTGTTCTCCGCCATGATCCGGACATGATCATTCTGG
GGGAAATCCGCGATGCAGAAACAGCCAAAACCGCGGTCAGAGCGGCGCTGACGGGTCATCTTGTATTATCGAGCATGCAC
GCGAAAAACGCAAAAGGCGCGATATACAGACTGCTTGAATTCGGCGTCACAATGACAGAAATTGAACAGACGCTGGTTGC
TGTAAGCGCGCAACGACTCGTCAATCTTGTCTGTCCATTTTGCGGAGAGCAGTGTTCTTTTTATTGCAGAATGGCGAGGG
AAGTCAGAAGAGCAAGCATTTTTGAGCTTCTGTATGGAAAGAGCCTGAATCTTTGTATAAAAGAAGCTAAAGGCGCATAT
GTAAACAACCGCTTCGACACATTGAGAAAATTGATCCGCAAAGGGATAGCGCTCGGTTATGTGCCGGCCGGATCTTATGA
ACGCTGGGTGCATCATGAAGCCGATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

67.978

100

0.682


Multiple sequence alignment