Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGC   Type   Machinery gene
Locus tag   P5638_RS10680 Genome accession   NZ_CP120599
Coordinates   2107933..2108226 (+) Length   97 a.a.
NCBI ID   WP_024422977.1    Uniprot ID   A0A0M2EIN6
Organism   Bacillus safensis strain PRO114     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2102933..2113226
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  P5638_RS10645 (P5638_10645) - 2103027..2103665 (+) 639 WP_024422981.1 MBL fold metallo-hydrolase -
  P5638_RS10650 (P5638_10650) - 2103713..2103955 (-) 243 WP_003217440.1 DUF2626 domain-containing protein -
  P5638_RS10655 (P5638_10655) - 2104168..2104548 (+) 381 WP_007501232.1 Spx/MgsR family RNA polymerase-binding regulatory protein -
  P5638_RS10665 (P5638_10665) - 2104772..2105647 (+) 876 WP_024422979.1 STAS domain-containing protein -
  P5638_RS10670 (P5638_10670) comGA 2105825..2106895 (+) 1071 WP_025092972.1 competence type IV pilus ATPase ComGA Machinery gene
  P5638_RS10675 (P5638_10675) comGB 2106876..2107916 (+) 1041 WP_024427986.1 competence type IV pilus assembly protein ComGB -
  P5638_RS10680 (P5638_10680) comGC 2107933..2108226 (+) 294 WP_024422977.1 competence type IV pilus major pilin ComGC Machinery gene
  P5638_RS10685 (P5638_10685) comGD 2108219..2108665 (+) 447 WP_034621605.1 competence type IV pilus minor pilin ComGD -
  P5638_RS10690 (P5638_10690) comGE 2108649..2108963 (+) 315 WP_034621606.1 competence type IV pilus minor pilin ComGE -
  P5638_RS10695 (P5638_10695) comGF 2108950..2109402 (+) 453 WP_277721854.1 competence type IV pilus minor pilin ComGF -
  P5638_RS10700 (P5638_10700) comGG 2109399..2109782 (+) 384 WP_277721855.1 competence type IV pilus minor pilin ComGG -
  P5638_RS10705 (P5638_10705) - 2109842..2110036 (+) 195 WP_056767455.1 YqzE family protein -
  P5638_RS10710 (P5638_10710) - 2110071..2110349 (-) 279 WP_171464609.1 DUF3889 domain-containing protein -
  P5638_RS10715 (P5638_10715) tapA 2110649..2111176 (+) 528 WP_024422972.1 amyloid fiber anchoring/assembly protein TapA -
  P5638_RS10720 (P5638_10720) - 2111228..2111800 (+) 573 WP_024422971.1 signal peptidase I -
  P5638_RS10725 (P5638_10725) - 2111855..2112655 (+) 801 WP_098677431.1 TasA family protein -
  P5638_RS10730 (P5638_10730) sinR 2112723..2113058 (-) 336 WP_024422969.1 transcriptional regulator SinR Regulator

Sequence


Protein


Download         Length: 97 a.a.        Molecular weight: 10686.62 Da        Isoelectric Point: 7.3218

>NTDB_id=806510 P5638_RS10680 WP_024422977.1 2107933..2108226(+) (comGC) [Bacillus safensis strain PRO114]
MNDKGFTLLEMLIVMLVISILLLITIPNITRHHQAIQAKGCEGLKSMVSTQVMAYELEHDGKTPSLAELEKEGYVKKGLT
CPNGKAIVIQNGNVKHE

Nucleotide


Download         Length: 294 bp        

>NTDB_id=806510 P5638_RS10680 WP_024422977.1 2107933..2108226(+) (comGC) [Bacillus safensis strain PRO114]
ATGAATGATAAAGGATTTACCCTGTTAGAAATGTTAATAGTCATGCTCGTAATATCCATTCTTTTACTCATTACCATACC
GAATATCACACGTCATCACCAAGCGATTCAGGCAAAGGGCTGTGAAGGGCTTAAATCAATGGTCAGCACACAAGTGATGG
CATATGAATTGGAGCATGATGGGAAGACGCCATCTTTAGCTGAATTAGAAAAAGAAGGTTATGTCAAAAAGGGATTAACT
TGTCCTAATGGAAAAGCGATTGTCATTCAAAACGGCAACGTGAAGCATGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0M2EIN6

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGC Bacillus subtilis subsp. subtilis str. 168

67.01

100

0.67

  comGC Staphylococcus aureus MW2

42.222

92.784

0.392

  comGC Staphylococcus aureus N315

42.222

92.784

0.392