Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGA
Type: Machinery gene
Locus tag: RE409_RS18830
Genome accession: NZ_CP133763
Coordinates: 3882228..3883313 (-)
Length: 361 a.a.
NCBI ID: WP_053346947.1
UniProt ID: -
Organism: Peribacillus sp. R9-11
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
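
The coordinates and the protein length in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the helper names (`parse_location`, `span_bp`, `protein_length`) are illustrative and not part of any database API.

```python
import re


def parse_location(text):
    """Parse a string such as '3882228..3883313 (-)' into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def span_bp(start, end):
    """Length in bp of a 1-based inclusive interval (assumed convention)."""
    return end - start + 1


def protein_length(cds_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


start, end, strand = parse_location("3882228..3883313 (-)")
print(span_bp(start, end))                  # 1086
print(protein_length(span_bp(start, end)))  # 361
```

The strand sign does not affect the length; it only indicates that the gene is read from the higher coordinate towards the lower one.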

Genomic Context


Location: 3877228..3888313
Locus tag                    Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                                    Description
RE409_RS18790 (RE409_18795)  -          3877890..3878687 (+)  798        WP_201223654.1  YqhG family protein                                        -
RE409_RS18795 (RE409_18800)  -          3878779..3878979 (-)  201        WP_309557885.1  YqzE family protein                                        -
RE409_RS18800 (RE409_18805)  comGG      3879211..3879597 (-)  387        WP_056526035.1  competence type IV pilus minor pilin ComGG                 -
RE409_RS18805 (RE409_18810)  comGF      3879590..3880069 (-)  480        WP_116820708.1  competence type IV pilus minor pilin ComGF                 -
RE409_RS18810 (RE409_18815)  -          3880017..3880355 (-)  339        WP_056526040.1  hypothetical protein                                       -
RE409_RS18815 (RE409_18820)  comGD      3880342..3880779 (-)  438        WP_309559290.1  competence type IV pilus minor pilin ComGD                 -
RE409_RS18820 (RE409_18825)  comGC      3880793..3881101 (-)  309        WP_053346945.1  competence type IV pilus major pilin ComGC                 -
RE409_RS18825 (RE409_18830)  comGB      3881210..3882268 (-)  1059       WP_201253722.1  competence type IV pilus assembly protein ComGB            -
RE409_RS18830 (RE409_18835)  comGA      3882228..3883313 (-)  1086       WP_053346947.1  competence type IV pilus ATPase ComGA                      Machinery gene
RE409_RS18835 (RE409_18840)  -          3883436..3883798 (-)  363        WP_309557887.1  Spx/MgsR family RNA polymerase-binding regulatory protein  -
RE409_RS18840 (RE409_18845)  -          3884015..3884716 (+)  702        WP_309557888.1  helix-turn-helix domain-containing protein                 -
RE409_RS18845 (RE409_18850)  -          3884802..3885044 (+)  243        WP_053346950.1  DUF2626 domain-containing protein                          -
RE409_RS18850 (RE409_18855)  -          3885101..3886198 (-)  1098       WP_309557889.1  SAM-dependent methyltransferase                            -
RE409_RS18855 (RE409_18860)  -          3886317..3887489 (-)  1173       WP_201254645.1  amidohydrolase                                             -
RE409_RS18860 (RE409_18865)  -          3887664..3888290 (-)  627        WP_201185800.1  MBL fold metallo-hydrolase                                 -
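
The Location range listed for the genomic context matches the comGA coordinates extended by 5,000 bp on each side, and several of the neighbouring comG genes overlap by a few bases. The sketch below reproduces both observations from the coordinates listed above. The 5,000 bp flank is inferred from the numbers rather than documented, and the function names are illustrative only.

```python
# (name, start, end) for the comG cluster, copied from the genomic context table.
CLUSTER = [
    ("comGG", 3879211, 3879597),
    ("comGF", 3879590, 3880069),
    ("RE409_RS18810", 3880017, 3880355),
    ("comGD", 3880342, 3880779),
    ("comGC", 3880793, 3881101),
    ("comGB", 3881210, 3882268),
    ("comGA", 3882228, 3883313),
]


def context_window(start, end, flank=5000):
    """Extend a gene interval by `flank` bp on both sides (inferred flank size)."""
    return max(1, start - flank), end + flank


def neighbour_gaps(genes):
    """Gap in bp between consecutive genes; a negative value means an overlap."""
    ordered = sorted(genes, key=lambda g: g[1])
    return {
        (left[0], right[0]): right[1] - left[2] - 1
        for left, right in zip(ordered, ordered[1:])
    }


print(context_window(3882228, 3883313))  # (3877228, 3888313)
for pair, gap in neighbour_gaps(CLUSTER).items():
    print(pair, gap)
```

With these coordinates, comGB and comGA overlap by 41 bp (gap of -41), while comGC and comGB are separated by 108 bp.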

Sequence


Protein


Length: 361 a.a.        Molecular weight: 40279.11 Da        Isoelectric point: 9.4384

>NTDB_id=876520 RE409_RS18830 WP_053346947.1 3882228..3883313(-) (comGA) [Peribacillus sp. R9-11]
MKDLISIEKKAEKILSRAIQLSASDIHVLPRREGPLIQFRIDNKLVPQETLSFFETERLISHLKFLAAMDIGEKRRPQSG
SITVNLSKKVVGLRLSTLPTAHLESLVIRLIPQQNILPLEQLSLFPNTVQKLVALLKHSHGMLIFTGPTGSGKTTTLYSL
LHHSKEMINRNIITLEDPIENVSEKVLQVQINEKAGITYSVGLKAVLRHDPDVIMVGEIRDSETAKIAVRAALTGHLILT
TMHTRDAQGAISRLLEFGISLLEIEQSLIGVTAQRLVELTCLQCKEVCTSACQMIARNKRASVYELLYGKSLAEVLKMTK
EEKGKGTVSYRQLKDEIGKAVAMGYVDSHEFERLVYDEGKK
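
The FASTA header above packs several fields into one line (database ID, locus tag, protein ID, coordinates with strand, gene name, organism). A minimal parsing sketch follows; the regular expression is derived only from the header shown in this record and may not cover other records, and `parse_record` is a hypothetical helper name. The sequence in the usage example is just the first 20 residues, split over two lines for illustration.

```python
import re

# Pattern inferred from the single header shown in this record (assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+(?P<locus_tag>\S+)\s+(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_record(fasta_text):
    """Split one FASTA record into header fields plus the joined sequence."""
    lines = [ln.strip() for ln in fasta_text.strip().splitlines() if ln.strip()]
    match = HEADER_RE.match(lines[0])
    if match is None:
        raise ValueError("header does not match the expected layout")
    record = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        record[key] = int(record[key])
    # Sequence lines are wrapped in the source; join them back together.
    record["sequence"] = "".join(lines[1:])
    return record


example = """>NTDB_id=876520 RE409_RS18830 WP_053346947.1 3882228..3883313(-) (comGA) [Peribacillus sp. R9-11]
MKDLISIEKK
AEKILSRAIQ
"""
rec = parse_record(example)
print(rec["gene"], rec["strand"], len(rec["sequence"]))
```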

Nucleotide


Length: 1086 bp

>NTDB_id=876520 RE409_RS18830 WP_053346947.1 3882228..3883313(-) (comGA) [Peribacillus sp. R9-11]
ATGAAAGATTTGATATCGATTGAAAAGAAAGCTGAAAAAATCCTTTCCCGTGCCATTCAATTATCGGCATCGGATATTCA
CGTTTTACCACGCAGAGAGGGCCCTCTCATTCAATTCAGGATTGACAACAAACTCGTTCCTCAGGAAACATTGTCATTCT
TTGAAACGGAACGACTGATCTCCCATTTAAAGTTCCTTGCCGCTATGGATATAGGTGAGAAAAGAAGACCTCAGAGTGGT
TCAATTACCGTCAATTTATCAAAAAAAGTGGTTGGACTTCGCCTTTCCACATTACCAACTGCACATCTCGAAAGTTTGGT
CATCCGCTTGATTCCCCAGCAGAATATCCTTCCTTTAGAACAATTATCCCTATTTCCAAACACCGTTCAAAAACTGGTTG
CCCTTCTGAAACATTCCCATGGCATGCTCATATTTACCGGTCCGACTGGTAGTGGGAAAACCACGACATTATACTCCTTG
CTTCATCATTCAAAAGAGATGATCAACCGGAACATCATTACCCTTGAAGATCCCATTGAAAATGTGTCAGAAAAGGTCCT
ACAAGTTCAAATTAATGAAAAGGCAGGTATTACGTACTCTGTTGGTCTAAAAGCTGTGCTTAGGCATGATCCTGACGTGA
TTATGGTCGGGGAAATCAGGGATTCTGAAACAGCAAAAATTGCAGTTCGTGCTGCATTAACGGGTCATTTAATCCTCACG
ACCATGCATACAAGGGATGCACAAGGAGCTATCTCTAGGTTGCTCGAATTTGGAATCAGCTTGCTTGAGATAGAACAGAG
CTTGATTGGTGTGACCGCTCAGCGGCTAGTTGAACTAACATGTCTGCAATGTAAAGAGGTTTGTACATCTGCTTGCCAAA
TGATTGCTCGTAATAAAAGGGCAAGTGTGTATGAACTGCTATATGGAAAAAGCTTGGCAGAAGTATTGAAGATGACGAAA
GAGGAAAAAGGGAAGGGGACGGTCTCCTACCGCCAATTAAAGGATGAGATAGGAAAAGCGGTGGCAATGGGCTATGTGGA
TTCACATGAATTTGAACGGCTGGTATACGATGAAGGCAAAAAGTAA
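
Although the gene lies on the minus strand, the nucleotide sequence above is already given in coding orientation (it starts with ATG and ends with TAA), so it can be translated directly without reverse-complementing. The sketch below is a minimal translator that uses the standard codon assignments and assumes an unambiguous A/C/G/T sequence; the usage example translates only the first seven codons of the record.

```python
from itertools import product

# Standard codon assignments, in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line wraps and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAAAGATTTGATATCGATT"))  # MKDLISI
```

Applied to the full 1086 bp sequence, this yields 361 residues followed by the TAA stop codon, in line with the protein length given above.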


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                                    Identities (%)  Coverage (%)  Ha-value
comGA    Bacillus subtilis subsp. subtilis str. 168  56.901          98.338        0.56
pilB     Haemophilus influenzae 86-028NP             39.655          96.399        0.382
pilB     Glaesserella parasuis strain SC1401         40.12           92.521        0.371
pilB     Haemophilus influenzae Rd KW20              38.218          96.399        0.368
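
For the four hits listed above, the Ha-value coincides, to the printed precision, with identity multiplied by coverage when both are expressed as fractions. This is an observation about these rows, not a documented definition of the score. The sketch below checks it and also converts each coverage into a residue count, assuming coverage is measured against the 361-residue query; the function names are illustrative.

```python
QUERY_LENGTH = 361  # residues in the comGA protein of this record

# (protein, organism, identity %, coverage %, listed Ha-value)
HITS = [
    ("comGA", "Bacillus subtilis subsp. subtilis str. 168", 56.901, 98.338, 0.56),
    ("pilB", "Haemophilus influenzae 86-028NP", 39.655, 96.399, 0.382),
    ("pilB", "Glaesserella parasuis strain SC1401", 40.12, 92.521, 0.371),
    ("pilB", "Haemophilus influenzae Rd KW20", 38.218, 96.399, 0.368),
]


def identity_times_coverage(identity_pct, coverage_pct):
    """Product of identity and coverage as fractions, rounded to 3 decimals."""
    return round(identity_pct / 100 * coverage_pct / 100, 3)


def covered_residues(coverage_pct, query_length=QUERY_LENGTH):
    """Residues covered, assuming coverage is relative to the query length."""
    return round(coverage_pct / 100 * query_length)


for protein, organism, identity, coverage, listed in HITS:
    product = identity_times_coverage(identity, coverage)
    print(protein, organism, product, listed, covered_residues(coverage))
```

Under that assumption the coverages correspond to 355, 348, 334 and 348 of the 361 query residues, respectively.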