Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGA
Type: Machinery gene
Locus tag: NL113_RS18615
Genome accession: NZ_CP101130
Coordinates: 3720457..3721524 (+)
Length: 355 a.a.
NCBI ID: WP_006637540.1
UniProt ID: -
Organism: Bacillus sp. KRF7
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
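
The coordinates and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(3720457, 3721524)
print(cds_bp, protein_length(cds_bp))  # 1068 355
```

Under these assumptions the 1068 bp span corresponds to 356 codons, that is 355 residues plus the stop codon, which agrees with the length reported for the protein.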

Genomic Context


Location: 3715457..3726524
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
NL113_RS18575 | - | 3715906..3717015 (+) | 1110 | WP_029419374.1 | hypothetical protein | -
NL113_RS18580 | - | 3717030..3717344 (+) | 315 | WP_006637546.1 | MTH1187 family thiamine-binding protein | -
NL113_RS18585 | - | 3717402..3717575 (-) | 174 | WP_006637545.1 | DUF2759 domain-containing protein | -
NL113_RS18590 | - | 3717716..3718354 (+) | 639 | WP_006637544.1 | MBL fold metallo-hydrolase | -
NL113_RS18595 | - | 3718404..3718649 (-) | 246 | WP_006637543.1 | DUF2626 domain-containing protein | -
NL113_RS18600 | - | 3718862..3719242 (+) | 381 | WP_006637542.1 | Spx/MgsR family RNA polymerase-binding regulatory protein | -
NL113_RS18610 | - | 3719461..3720318 (+) | 858 | WP_006637541.1 | STAS domain-containing protein | -
NL113_RS18615 | comGA | 3720457..3721524 (+) | 1068 | WP_006637540.1 | competence type IV pilus ATPase ComGA | Machinery gene
NL113_RS18620 | comGB | 3721511..3722548 (+) | 1038 | WP_006637539.1 | competence type IV pilus assembly protein ComGB | Machinery gene
NL113_RS18625 | comGC | 3722563..3722856 (+) | 294 | WP_006637538.1 | competence type IV pilus major pilin ComGC | Machinery gene
NL113_RS18630 | comGD | 3722856..3723299 (+) | 444 | WP_006637537.1 | competence type IV pilus minor pilin ComGD | -
NL113_RS18635 | comGE | 3723283..3723630 (+) | 348 | WP_006637536.1 | competence type IV pilus minor pilin ComGE | -
NL113_RS18640 | comGF | 3723539..3724030 (+) | 492 | WP_224254419.1 | competence type IV pilus minor pilin ComGF | -
NL113_RS18645 | comGG | 3724085..3724408 (+) | 324 | WP_224254418.1 | competence type IV pilus minor pilin ComGG | -
NL113_RS18650 | - | 3724488..3724673 (+) | 186 | WP_006637533.1 | YqzE family protein | -
NL113_RS18655 | - | 3724767..3725087 (-) | 321 | WP_006637532.1 | DUF3889 domain-containing protein | -
NL113_RS18660 | tapA | 3725350..3726096 (+) | 747 | WP_006637531.1 | amyloid fiber anchoring/assembly protein TapA | -
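
The displayed location appears to be the comGA coordinates extended by 5000 bp on each side, and several adjacent comG genes overlap. The following sketch reproduces both observations from the coordinates listed above. The 5000 bp flank is inferred from the numbers rather than documented, and the helper names are hypothetical.

```python
FLANK = 5000  # assumed flank size, inferred from the Location line


def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Extend a 1-based inclusive range by `flank` bp on each side."""
    return max(1, start - flank), end + flank


def gap(prev_end: int, next_start: int) -> int:
    """Number of bp between two features; a negative value means they overlap."""
    return next_start - prev_end - 1


# (gene, start, end) for the comG genes, copied from the table above
comg = [
    ("comGA", 3720457, 3721524),
    ("comGB", 3721511, 3722548),
    ("comGC", 3722563, 3722856),
    ("comGD", 3722856, 3723299),
    ("comGE", 3723283, 3723630),
    ("comGF", 3723539, 3724030),
    ("comGG", 3724085, 3724408),
]

print(context_window(3720457, 3721524))  # (3715457, 3726524)
for (name_a, _, end_a), (name_b, start_b, _) in zip(comg, comg[1:]):
    print(f"{name_a} -> {name_b}: {gap(end_a, start_b)} bp")
```

A negative gap, such as the 14 bp overlap between comGA and comGB, is a common feature of genes that are closely packed on the same strand.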

Sequence


Protein


Length: 355 a.a.        Molecular weight: 39821.31 Da        Isoelectric point: 9.4732

>NTDB_id=708987 NL113_RS18615 WP_006637540.1 3720457..3721524(+) (comGA) [Bacillus sp. KRF7]
MQAIEPLSGRVIEEACRMRASDIHIVPCKKEAIIRFRIDGELIQKDRLTRLECSRLISHFKFLSSMDIGERRQPQSGALT
LQVNNKPVHLRMSTLPTVYDESLVIRVLPQASAPPLRSLSLFPDATSKLLSFLKHSHGLMIFTGPTGSGKTTTLYSLIEY
AKQHFNRNIITLEDPVESRSEHVLQVQVNEKAGMTYSAGLKAVLRHDPDMIILGEIRDAETAKTAVRAALTGHLVLSSMH
AKNAKGAIYRLLEFGVTMTEIEQTLVAVSAQRLVNLVCPFCGEQCSFYCRMAREVRRASIFELLYGKSLNLCIKEAKGAY
VNNRFDTLRKLIRKGIALGYVPAGSYERWVHHEAD
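
The FASTA header above packs several fields into one line. A minimal parsing sketch follows, assuming every header in the database uses the same layout as this record (ID, locus tag, protein accession, coordinates with strand, gene name in parentheses, organism in square brackets). The pattern and the field names are illustrative and not an official schema.

```python
import re

# Pattern written against the single header shown in this record (assumed layout).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; coordinates are returned as integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (
    ">NTDB_id=708987 NL113_RS18615 WP_006637540.1 "
    "3720457..3721524(+) (comGA) [Bacillus sp. KRF7]"
)
print(parse_header(header))
```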

Nucleotide


Length: 1068 bp

>NTDB_id=708987 NL113_RS18615 WP_006637540.1 3720457..3721524(+) (comGA) [Bacillus sp. KRF7]
TTGCAAGCGATTGAACCATTAAGCGGGAGAGTAATCGAAGAGGCATGCAGAATGAGAGCATCTGACATTCATATTGTTCC
GTGTAAAAAAGAGGCGATTATCCGTTTTAGAATCGACGGTGAATTGATTCAAAAGGACAGGCTGACGAGGCTTGAGTGCT
CAAGGCTGATTTCCCACTTTAAATTTCTTTCTTCAATGGATATCGGAGAACGGAGACAGCCGCAAAGCGGTGCTTTAACC
CTTCAAGTGAACAATAAGCCTGTTCATTTAAGAATGTCGACTTTGCCCACTGTATACGATGAAAGCTTGGTGATCCGCGT
TTTGCCGCAAGCAAGTGCCCCGCCGCTCAGAAGCCTGTCATTGTTTCCGGATGCAACGTCGAAGCTGCTGTCTTTTCTGA
AGCATTCACACGGCCTGATGATCTTCACCGGCCCTACAGGTTCGGGAAAAACGACAACTCTGTATTCGCTAATCGAATAT
GCAAAACAGCATTTTAACCGCAATATTATTACCCTGGAGGATCCGGTGGAATCCAGAAGCGAGCATGTTCTTCAAGTACA
GGTAAATGAGAAGGCGGGTATGACATATTCTGCCGGCTTAAAGGCTGTTCTCCGCCATGATCCGGACATGATCATTCTGG
GGGAAATCCGCGATGCAGAAACAGCCAAAACCGCGGTCAGAGCGGCGCTGACGGGTCATCTTGTATTATCGAGCATGCAC
GCGAAAAACGCAAAAGGCGCGATATACAGACTGCTTGAATTCGGCGTCACAATGACAGAAATTGAACAGACGCTGGTTGC
TGTAAGCGCGCAACGACTCGTCAATCTTGTCTGTCCATTTTGCGGAGAGCAGTGTTCTTTTTATTGCAGAATGGCGAGGG
AAGTCAGAAGAGCAAGCATTTTTGAGCTTCTGTATGGAAAGAGCCTGAATCTTTGTATAAAAGAAGCTAAAGGCGCATAT
GTAAACAACCGCTTCGACACATTGAGAAAATTGATCCGCAAAGGGATAGCGCTCGGTTATGTGCCGGCCGGATCTTATGA
ACGCTGGGTGCATCATGAAGCCGATTAA
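
The nucleotide sequence starts with TTG rather than ATG, yet the protein starts with M. In the bacterial genetic code (NCBI translation table 11), TTG can act as an initiation codon and is read as methionine at the first position. The sketch below illustrates this on the first 30 nucleotides and the last 9 nucleotides of the sequence above. It is a simplified translator written for this illustration: it treats only ATG, GTG and TTG as start codons and does not handle ambiguous bases.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling through TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
START_CODONS = {"ATG", "GTG", "TTG"}  # simplification: subset of table 11 starts


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for pos in range(0, len(cds) - 2, 3):
        codon = cds[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("TTGCAAGCGATTGAACCATTAAGCGGGAGA"))  # MQAIEPLSGR
print(translate("GCCGATTAA"))  # AD
```

The first result matches the opening residues of the protein sequence (MQAIEPLSGR), and the second shows the final two residues followed by the TAA stop codon.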


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comGA | Bacillus subtilis subsp. subtilis str. 168 | 67.978 | 100 | 0.682