Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGC   Type   Machinery gene
Locus tag   NL113_RS18625 Genome accession   NZ_CP101130
Coordinates   3722563..3722856 (+) Length   97 a.a.
NCBI ID   WP_006637538.1    Uniprot ID   -
Organism   Bacillus sp. KRF7     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3717563..3727856
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NL113_RS18590 - 3717716..3718354 (+) 639 WP_006637544.1 MBL fold metallo-hydrolase -
  NL113_RS18595 - 3718404..3718649 (-) 246 WP_006637543.1 DUF2626 domain-containing protein -
  NL113_RS18600 - 3718862..3719242 (+) 381 WP_006637542.1 Spx/MgsR family RNA polymerase-binding regulatory protein -
  NL113_RS18610 - 3719461..3720318 (+) 858 WP_006637541.1 STAS domain-containing protein -
  NL113_RS18615 comGA 3720457..3721524 (+) 1068 WP_006637540.1 competence type IV pilus ATPase ComGA Machinery gene
  NL113_RS18620 comGB 3721511..3722548 (+) 1038 WP_006637539.1 competence type IV pilus assembly protein ComGB Machinery gene
  NL113_RS18625 comGC 3722563..3722856 (+) 294 WP_006637538.1 competence type IV pilus major pilin ComGC Machinery gene
  NL113_RS18630 comGD 3722856..3723299 (+) 444 WP_006637537.1 competence type IV pilus minor pilin ComGD -
  NL113_RS18635 comGE 3723283..3723630 (+) 348 WP_006637536.1 competence type IV pilus minor pilin ComGE -
  NL113_RS18640 comGF 3723539..3724030 (+) 492 WP_224254419.1 competence type IV pilus minor pilin ComGF -
  NL113_RS18645 comGG 3724085..3724408 (+) 324 WP_224254418.1 competence type IV pilus minor pilin ComGG -
  NL113_RS18650 - 3724488..3724673 (+) 186 WP_006637533.1 YqzE family protein -
  NL113_RS18655 - 3724767..3725087 (-) 321 WP_006637532.1 DUF3889 domain-containing protein -
  NL113_RS18660 tapA 3725350..3726096 (+) 747 WP_006637531.1 amyloid fiber anchoring/assembly protein TapA -
  NL113_RS18665 sipW 3726093..3726674 (+) 582 WP_006637530.1 signal peptidase I SipW -
  NL113_RS18670 tasA 3726744..3727538 (+) 795 WP_006637529.1 biofilm matrix protein TasA -

Sequence


Protein


Download         Length: 97 a.a.        Molecular weight: 10610.46 Da        Isoelectric Point: 7.8560

>NTDB_id=708989 NL113_RS18625 WP_006637538.1 3722563..3722856(+) (comGC) [Bacillus sp. KRF7]
MNEKGFTLIEMLVVMLVISILLLITIPNVTKHNQSIQKKGCEGLKSMIQAQITAYEIDHDGKTPSLGDLESEGYIKKNLA
CPNGKPIVISNGSVQTR

Nucleotide


Download         Length: 294 bp        

>NTDB_id=708989 NL113_RS18625 WP_006637538.1 3722563..3722856(+) (comGC) [Bacillus sp. KRF7]
ATGAATGAAAAGGGATTTACGCTGATTGAAATGCTAGTCGTCATGCTTGTCATATCAATTCTGCTTTTAATTACGATCCC
GAATGTCACCAAGCACAACCAAAGTATTCAAAAGAAAGGCTGTGAGGGATTAAAAAGCATGATTCAGGCCCAAATTACGG
CTTATGAGATCGATCATGACGGAAAAACGCCGAGTCTGGGCGATCTGGAATCAGAGGGATACATTAAGAAGAACTTGGCT
TGTCCTAACGGAAAACCGATTGTGATTTCAAATGGCAGTGTACAAACCCGATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGC Bacillus subtilis subsp. subtilis str. 168

71.579

97.938

0.701

  comGC Staphylococcus aureus MW2

44.444

92.784

0.412

  comGC Staphylococcus aureus N315

44.444

92.784

0.412

  comGC Latilactobacillus sakei subsp. sakei 23K

37.113

100

0.371