Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comGA   Type   Machinery gene
Locus tag   LZS83_RS20980 Genome accession   NZ_CP091444
Coordinates   4095297..4096340 (-) Length   347 a.a.
NCBI ID   WP_242252231.1    Uniprot ID   -
Organism   Bacillus cereus strain MO2     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake
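The coordinates and lengths in the overview are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the overview; assumed 1-based and inclusive.
start, end = 4095297, 4096340

gene_length_bp = end - start + 1      # inclusive span of the gene
codons = gene_length_bp // 3          # codon count, including the stop codon
protein_length_aa = codons - 1        # the stop codon is not translated

print(gene_length_bp, protein_length_aa)  # 1044 347
```

The result matches the 1044 bp nucleotide length and the 347 a.a. protein length reported in the record.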

Genomic Context


Location: 4090297..4101340
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LZS83_RS20935 - 4090726..4090926 (-) 201 WP_000106080.1 YqzE family protein -
  LZS83_RS20940 aroK 4090965..4091462 (-) 498 WP_242252225.1 shikimate kinase AroK -
  LZS83_RS20945 - 4091581..4092231 (-) 651 WP_063222963.1 2OG-Fe(II) oxygenase -
  LZS83_RS20950 comGG 4092406..4092777 (-) 372 WP_063222962.1 competence type IV pilus minor pilin ComGG -
  LZS83_RS20955 comGF 4092774..4093244 (-) 471 WP_309259321.1 competence type IV pilus minor pilin ComGF -
  LZS83_RS20960 comGE 4093214..4093516 (-) 303 WP_309259322.1 competence type IV pilus minor pilin ComGE -
  LZS83_RS20965 comGD 4093509..4093964 (-) 456 WP_242252228.1 comG operon protein ComGD -
  LZS83_RS20970 comGC 4093961..4094260 (-) 300 WP_063222959.1 comG operon protein ComGC -
  LZS83_RS20975 comGB 4094273..4095304 (-) 1032 WP_242252230.1 competence type IV pilus assembly protein ComGB -
  LZS83_RS20980 comGA 4095297..4096340 (-) 1044 WP_242252231.1 competence protein ComGA Machinery gene
  LZS83_RS20985 - 4096546..4097241 (+) 696 WP_000434117.1 metalloregulator ArsR/SmtB family transcription factor -
  LZS83_RS20990 - 4097367..4097609 (+) 243 WP_000440713.1 DUF2626 domain-containing protein -
  LZS83_RS20995 - 4097717..4099111 (+) 1395 WP_001094341.1 L-cystine transporter -
  LZS83_RS21000 - 4099191..4099582 (-) 392 Protein_4085 hypothetical protein -
  LZS83_RS21005 - 4099670..4099885 (-) 216 WP_001008320.1 DUF3912 family protein -
  LZS83_RS21010 - 4100138..4100449 (+) 312 WP_063222956.1 hypothetical protein -
  LZS83_RS21015 - 4100486..4100974 (-) 489 WP_242252237.1 hypothetical protein -
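The comGA through comGG genes in the table lie on the same strand, and several neighbouring pairs share a few bases. The sketch below computes the overlap or gap at each junction from the listed coordinates, assuming 1-based inclusive coordinates; the function name `junctions` is hypothetical and used only for illustration.

```python
# (gene, start, end) copied from the Genomic Context table; all on the (-) strand.
comg = [
    ("comGG", 4092406, 4092777),
    ("comGF", 4092774, 4093244),
    ("comGE", 4093214, 4093516),
    ("comGD", 4093509, 4093964),
    ("comGC", 4093961, 4094260),
    ("comGB", 4094273, 4095304),
    ("comGA", 4095297, 4096340),
]

def junctions(genes):
    """Return (left, right, bp) per adjacent pair; bp > 0 is an overlap, bp < 0 a gap."""
    out = []
    for (name_a, _, end_a), (name_b, start_b, _) in zip(genes, genes[1:]):
        out.append((name_a, name_b, end_a - start_b + 1))
    return out

for left, right, bp in junctions(comg):
    kind = "overlap" if bp > 0 else "gap"
    print(f"{left}/{right}: {abs(bp)} bp {kind}")
```

With these coordinates, comGB and comGA overlap by 8 bp, while comGC and comGB are separated by a 12 bp gap.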

Sequence


Protein


Length: 347 a.a.        Molecular weight: 39281.76 Da        Isoelectric Point: 8.7292

>NTDB_id=651208 LZS83_RS20980 WP_242252231.1 4095297..4096340(-) (comGA) [Bacillus cereus strain MO2]
MNGIESFANTILKEACRVQASDLHIVPRQKDVAIQLRIGKDLMMKHCIEKEFGEKLVSHFKFLASMDIGERRKPQNGSLY
LQMDGQEVYLRLSTLPTVYQESLVIRLHLQASIQPLSHLSLFPSTAKKLLSFLSYSHGLLVFTGPTGSGKTTTMYALLEV
MRKKKTRRIVTLEDPVEKRNDDLLQIQINEKAGITYEAGLKAILRHDPDIILVGEIRDEETAKIAIRASLTGHLVMTTLH
TNDAKGAILRFMDFGVTRQEIEQSLLAIAAQRLVELKCPFCKRKCSTLCKSMRQVRQASIYELLYGYELKQAIKEANGEC
VTYTHETLESSIRKGYALGFLEEDVYV

Nucleotide


Length: 1044 bp

>NTDB_id=651208 LZS83_RS20980 WP_242252231.1 4095297..4096340(-) (comGA) [Bacillus cereus strain MO2]
ATGAATGGAATTGAAAGTTTTGCGAATACGATTTTGAAAGAAGCGTGTAGGGTACAAGCGTCGGACTTACATATCGTGCC
GAGGCAGAAGGACGTGGCAATTCAACTACGTATAGGAAAAGATTTAATGATGAAACATTGTATTGAAAAGGAATTTGGAG
AAAAACTTGTTTCACACTTTAAATTTTTAGCATCCATGGATATAGGAGAGAGGCGGAAGCCACAAAATGGTTCACTGTAT
TTACAAATGGATGGACAAGAAGTGTACTTACGCCTTTCCACGCTTCCAACCGTATACCAAGAAAGTCTCGTTATTCGTCT
CCATTTACAAGCATCTATTCAGCCGTTATCTCATCTTTCGTTATTTCCAAGTACAGCAAAAAAATTACTCTCTTTTTTAA
GTTATTCACATGGGTTACTTGTATTTACTGGACCGACTGGTTCTGGGAAGACAACAACAATGTATGCATTATTAGAGGTA
ATGAGAAAAAAGAAAACGCGCCGCATCGTTACACTGGAAGATCCAGTTGAAAAAAGAAATGACGATTTATTACAAATTCA
AATAAATGAAAAAGCAGGTATCACATATGAGGCGGGACTAAAGGCTATTTTACGTCATGATCCAGATATTATTTTAGTTG
GTGAAATCCGTGATGAAGAAACAGCGAAAATCGCAATAAGAGCAAGTTTGACTGGACATTTAGTAATGACGACACTGCAT
ACGAATGATGCGAAAGGGGCGATACTCAGGTTCATGGATTTTGGCGTAACGAGGCAAGAGATTGAACAGTCTTTATTGGC
TATAGCTGCACAGCGACTTGTCGAATTAAAGTGTCCGTTTTGCAAAAGAAAGTGCTCAACTTTATGTAAATCAATGAGGC
AAGTAAGACAAGCGAGTATTTATGAGTTGTTATATGGATATGAGTTAAAACAAGCGATTAAAGAAGCAAACGGGGAATGT
GTCACATACACGCATGAAACATTAGAATCTTCGATACGAAAAGGATACGCTTTAGGATTTTTAGAAGAGGATGTTTATGT
TTAA
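The nucleotide record is given in coding orientation, so it can be translated directly to confirm that it matches the protein record. The following is a minimal sketch, assuming the standard codon assignments (which bacterial translation table 11 shares for elongation codons); `translate` is an illustrative helper, and only the first 78 bases and the last six bases of the sequence are used here.

```python
from itertools import product

# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a coding-strand DNA string codon by codon; '*' marks a stop."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 78 bases (26 codons) and last 6 bases of the comGA nucleotide record.
head = ("ATGAATGGAATTGAAAGTTTTGCGAATACGATTTTGAAAGAAGCGTGTAGG"
        "GTACAAGCGTCGGACTTACATATCGTG")
tail = "GTTTAA"

print(translate(head))  # MNGIESFANTILKEACRVQASDLHIV
print(translate(tail))  # V*
```

The translated prefix agrees with the start of the protein sequence, and the final codon TAA is a stop following the terminal valine.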


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168 57.925 100 0.579
  pilB Vibrio campbellii strain DS40M4 35.411 100 0.36