Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   EQK03_RS11880 Genome accession   NZ_CP040342
Coordinates   2339435..2340478 (+) Length   347 a.a.
NCBI ID   WP_001013167.1    Uniprot ID   -
Organism   Bacillus cereus strain DLOU-Weihai     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2334435..2345478
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQK03_RS11845 - 2334803..2335291 (+) 489 WP_000764267.1 hypothetical protein -
  EQK03_RS11850 - 2335326..2335637 (-) 312 WP_001093237.1 hypothetical protein -
  EQK03_RS11855 - 2335890..2336105 (+) 216 WP_001008327.1 DUF3912 family protein -
  EQK03_RS11860 - 2336193..2336584 (+) 392 Protein_2258 hypothetical protein -
  EQK03_RS11865 - 2336664..2338058 (-) 1395 WP_025709781.1 L-cystine transporter -
  EQK03_RS11870 - 2338166..2338408 (-) 243 WP_000440716.1 DUF2626 domain-containing protein -
  EQK03_RS11875 - 2338534..2339229 (-) 696 WP_000434125.1 helix-turn-helix domain-containing protein -
  EQK03_RS11880 comGA 2339435..2340478 (+) 1044 WP_001013167.1 competence type IV pilus ATPase ComGA Machinery gene
  EQK03_RS11885 comGB 2340465..2341502 (+) 1038 WP_016123380.1 competence type IV pilus assembly protein ComGB -
  EQK03_RS11890 comGC 2341514..2341813 (+) 300 WP_001178684.1 competence type IV pilus major pilin ComGC -
  EQK03_RS11895 comGD 2341810..2342265 (+) 456 WP_000813141.1 competence type IV pilus minor pilin ComGD -
  EQK03_RS11900 - 2342258..2342560 (+) 303 WP_000829464.1 hypothetical protein -
  EQK03_RS11905 comGF 2342530..2343000 (+) 471 WP_033692316.1 competence type IV pilus minor pilin ComGF -
  EQK03_RS11910 comGG 2342997..2343368 (+) 372 WP_001231318.1 competence type IV pilus minor pilin ComGG -
  EQK03_RS11915 - 2343543..2344193 (+) 651 WP_141539863.1 2OG-Fe(II) oxygenase -
  EQK03_RS11920 aroK 2344309..2344806 (+) 498 WP_049106675.1 shikimate kinase AroK -
  EQK03_RS11925 - 2344845..2345045 (+) 201 WP_000106083.1 YqzE family protein -

Sequence


Protein


Download         Length: 347 a.a.        Molecular weight: 39092.47 Da        Isoelectric Point: 8.3536

>NTDB_id=363463 EQK03_RS11880 WP_001013167.1 2339435..2340478(+) (comGA) [Bacillus cereus strain DLOU-Weihai]
MNGIEIFANAILKEACRVQASDLHIVPRQKDVAVQLRIGKDLMTIQCIEKEFGEKLVSHFKFLASMDIGEKRKPQNGSLY
LQIDGQEVYLRLSTLPTVYQESLVIRLHLQASVQPLSHLSLFPSTAAKLLSFLRYSQGLLVFTGPTGSGKTTTMYALLEV
IRKKKTRRIITLEDPVEKRSDDVLQIQINEKAGLTYETGLKAILRHDPDIILVGEIRDEETAKIAIRASLTGHLVMTTLH
TNDAKGAIIRFIDYGITRQEIEQSLLAVAAQRLVELKCPFCRGKCSTLCKSMRQVRQASIYELLYGYELKQALKEADGEC
VTYKHETLESSIRKGYALGFLEEDVYV

Nucleotide


Download         Length: 1044 bp        

>NTDB_id=363463 EQK03_RS11880 WP_001013167.1 2339435..2340478(+) (comGA) [Bacillus cereus strain DLOU-Weihai]
ATGAATGGGATTGAAATTTTTGCGAATGCAATTTTAAAAGAAGCGTGTAGGGTACAAGCGTCAGATTTACATATTGTGCC
CCGGCAGAAGGATGTAGCGGTGCAACTACGTATAGGAAAAGATTTAATGACGATACAGTGTATTGAAAAGGAGTTTGGAG
AAAAACTTGTCTCACATTTCAAATTTTTAGCATCTATGGACATAGGCGAGAAACGGAAGCCACAGAATGGTTCGTTGTAT
TTGCAAATTGATGGACAAGAAGTGTATTTACGCCTTTCCACACTTCCAACAGTATATCAAGAAAGTCTTGTTATTCGTCT
TCATTTACAAGCATCTGTTCAGCCCTTATCACATCTTTCGTTATTTCCAAGTACGGCAGCAAAACTACTCTCGTTTTTAC
GCTATTCACAGGGGCTACTTGTGTTTACTGGACCGACTGGATCTGGGAAGACGACGACAATGTATGCATTATTAGAGGTA
ATTAGAAAAAAGAAAACACGTCGCATCATTACACTTGAAGATCCGGTTGAAAAAAGAAGTGACGATGTATTACAAATTCA
AATAAATGAAAAAGCGGGTCTCACATATGAAACAGGATTAAAGGCTATTTTACGTCATGATCCAGATATTATTTTAGTCG
GTGAAATCCGTGATGAAGAAACAGCAAAAATAGCTATAAGAGCAAGTTTGACAGGACATTTAGTAATGACAACGCTGCAT
ACGAATGATGCGAAAGGAGCAATCATTCGTTTCATAGATTATGGTATAACGAGGCAAGAGATTGAACAGTCTTTATTGGC
TGTAGCTGCACAGCGACTTGTCGAATTGAAGTGTCCGTTTTGCAGAGGAAAGTGCTCAACTTTATGTAAATCAATGAGGC
AAGTAAGGCAGGCTAGCATTTATGAGCTACTATATGGATATGAGTTAAAACAAGCGCTTAAAGAAGCTGATGGAGAATGT
GTTACTTACAAACATGAGACTTTAGAATCATCAATACGAAAAGGATATGCTTTAGGTTTTTTAGAAGAAGATGTTTATGT
TTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

55.908

100

0.559

  pilB Haemophilus influenzae 86-028NP

39.752

92.795

0.369


Multiple sequence alignment