Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   EUC39_RS11060 Genome accession   NZ_CP040334
Coordinates   2180811..2181854 (+) Length   347 a.a.
NCBI ID   WP_157417559.1    Uniprot ID   -
Organism   Bacillus cereus strain DLOU-Changhai     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2175811..2186854
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EUC39_RS11025 - 2176176..2176664 (+) 489 WP_145959752.1 hypothetical protein -
  EUC39_RS11030 - 2176699..2177010 (-) 312 WP_001093240.1 hypothetical protein -
  EUC39_RS11035 - 2177264..2177479 (+) 216 WP_001008326.1 DUF3912 family protein -
  EUC39_RS11040 - 2177567..2177959 (+) 393 WP_157417557.1 hypothetical protein -
  EUC39_RS11045 - 2178039..2179433 (-) 1395 WP_001094344.1 L-cystine transporter -
  EUC39_RS11050 - 2179541..2179783 (-) 243 WP_000375619.1 DUF2626 domain-containing protein -
  EUC39_RS11055 - 2179910..2180605 (-) 696 WP_000434107.1 helix-turn-helix transcriptional regulator -
  EUC39_RS11060 comGA 2180811..2181854 (+) 1044 WP_157417559.1 competence type IV pilus ATPase ComGA Machinery gene
  EUC39_RS11065 comGB 2181841..2182878 (+) 1038 WP_106825182.1 competence type IV pilus assembly protein ComGB -
  EUC39_RS11070 comGC 2182890..2183189 (+) 300 WP_001178687.1 competence type IV pilus major pilin ComGC -
  EUC39_RS11075 comGD 2183186..2183641 (+) 456 WP_000813141.1 competence type IV pilus minor pilin ComGD -
  EUC39_RS11080 - 2183634..2183936 (+) 303 WP_000829467.1 hypothetical protein -
  EUC39_RS11085 comGF 2183906..2184376 (+) 471 WP_000923101.1 competence type IV pilus minor pilin ComGF -
  EUC39_RS11090 comGG 2184373..2184744 (+) 372 WP_106824087.1 competence type IV pilus minor pilin ComGG -
  EUC39_RS11095 - 2184918..2185568 (+) 651 WP_000183863.1 2OG-Fe(II) oxygenase -
  EUC39_RS11100 aroK 2185684..2186181 (+) 498 WP_106824089.1 shikimate kinase AroK -
  EUC39_RS11105 - 2186220..2186420 (+) 201 WP_000106083.1 YqzE family protein -

Sequence


Protein


Download         Length: 347 a.a.        Molecular weight: 39119.52 Da        Isoelectric Point: 8.3562

>NTDB_id=363303 EUC39_RS11060 WP_157417559.1 2180811..2181854(+) (comGA) [Bacillus cereus strain DLOU-Changhai]
MNGIEIFANTILKEACRVQASDLHIVPRQKDVAVQLRIGKDLMTGHCIEKEFGEKLVLHFKFLASMDIGEKRKPQNGSLY
LQIDGQEVYLRLSTLPTVYQESLVIRLHLQASVQPLSHLSLFPSTAAKLLSFLRYSQGLLVFTGPTGSGKTTTMYALLEV
IRKKKTRRIITLEDPVEKRSDDVLQIQINEKAGLTYETGLKAILRHDPDIILVGEIRDEETAKIAIRASLTGHLVMTTLH
TNDAKGAIIRFMDYGITRQEIEQSLLAVAAQRLVELKCPFCRGKCSTLCKSMRQIRQASIYELLYGYELKQALKEADGEC
VTYKHETLESSIRKGYALGFLEDDVYV

Nucleotide


Download         Length: 1044 bp        

>NTDB_id=363303 EUC39_RS11060 WP_157417559.1 2180811..2181854(+) (comGA) [Bacillus cereus strain DLOU-Changhai]
ATGAATGGGATTGAAATTTTTGCGAATACAATTTTAAAAGAAGCGTGTAGGGTACAAGCGTCAGATTTACATATTGTGCC
CCGGCAGAAGGATGTAGCGGTGCAACTACGTATAGGAAAAGATTTAATGACGGGACATTGTATTGAAAAGGAGTTTGGAG
AAAAACTTGTCTTACATTTTAAATTTTTAGCATCTATGGACATAGGAGAGAAACGGAAGCCACAGAATGGTTCGTTGTAT
TTGCAAATTGATGGACAAGAAGTGTATTTACGCCTTTCAACACTTCCAACAGTATATCAAGAAAGTCTCGTTATTCGTCT
TCATTTACAAGCATCTGTTCAGCCCTTATCACATCTTTCGTTATTTCCAAGTACGGCAGCAAAACTACTCTCGTTTTTAC
GCTATTCACAGGGGCTACTCGTGTTTACTGGACCGACTGGTTCTGGGAAGACAACGACAATGTATGCATTATTAGAGGTA
ATTAGAAAAAAGAAAACACGTCGCATCATTACACTTGAAGATCCGGTTGAAAAAAGAAGTGACGATGTATTACAAATTCA
AATAAATGAAAAAGCGGGCCTCACATATGAAACAGGATTAAAGGCTATTTTACGTCATGATCCAGATATTATTTTAGTCG
GTGAAATCCGTGATGAAGAAACAGCGAAAATAGCTATAAGAGCAAGTTTGACAGGACATTTAGTCATGACAACGCTGCAT
ACGAATGATGCGAAAGGGGCAATCATTCGTTTCATGGATTATGGTATAACGAGGCAAGAGATTGAACAGTCTTTATTGGC
TGTAGCTGCACAGCGACTTGTCGAATTGAAGTGTCCGTTTTGCAGAGGCAAGTGCTCAACTTTATGTAAATCAATGAGGC
AAATAAGGCAAGCAAGCATTTATGAGCTGCTATATGGATATGAGTTAAAACAAGCGCTTAAAGAAGCAGATGGAGAATGT
GTTACTTACAAACATGAGACTTTAGAATCGTCAATAAGAAAAGGATATGCTTTAGGTTTTTTAGAAGATGATGTTTATGT
TTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

55.62

100

0.556

  pilB Haemophilus influenzae 86-028NP

39.441

92.795

0.366


Multiple sequence alignment