Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGA
Type: Machinery gene
Locus tag: QTA68_RS21770
Genome accession: NZ_CP129005
Coordinates: 4251091..4252134 (-)
Length: 347 a.a.
NCBI ID: WP_290101808.1
Uniprot ID: -
Organism: Bacillus cereus strain lycx
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
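
The coordinates and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon; the function name is illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1

# Coordinates of comGA as listed in the overview.
start, end = 4251091, 4252134

length_bp = gene_length_bp(start, end)
codons = length_bp // 3
protein_len = codons - 1  # assumption: the last codon is the stop codon

print(length_bp, protein_len)  # 1044 347
```

Both values agree with the 1044 bp and 347 a.a. lengths reported elsewhere in this record.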

Genomic Context


Location: 4246091..4257134
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  QTA68_RS21725 (QTA68_21725) - 4246524..4246724 (-) 201 WP_000106083.1 YqzE family protein -
  QTA68_RS21730 (QTA68_21730) aroK 4246763..4247260 (-) 498 WP_000836624.1 shikimate kinase AroK -
  QTA68_RS21735 (QTA68_21735) - 4247376..4248026 (-) 651 WP_290101801.1 2OG-Fe(II) oxygenase -
  QTA68_RS21740 (QTA68_21740) comGG 4248201..4248572 (-) 372 WP_001231318.1 competence type IV pilus minor pilin ComGG -
  QTA68_RS21745 (QTA68_21745) comGF 4248569..4249039 (-) 471 WP_033692316.1 competence type IV pilus minor pilin ComGF -
  QTA68_RS21750 (QTA68_21750) - 4249009..4249311 (-) 303 WP_000829464.1 hypothetical protein -
  QTA68_RS21755 (QTA68_21755) comGD 4249304..4249759 (-) 456 WP_086386139.1 competence type IV pilus minor pilin ComGD -
  QTA68_RS21760 (QTA68_21760) comGC 4249756..4250055 (-) 300 WP_001178684.1 competence type IV pilus major pilin ComGC -
  QTA68_RS21765 (QTA68_21765) comGB 4250067..4251104 (-) 1038 WP_290101806.1 competence type IV pilus assembly protein ComGB -
  QTA68_RS21770 (QTA68_21770) comGA 4251091..4252134 (-) 1044 WP_290101808.1 competence type IV pilus ATPase ComGA Machinery gene
  QTA68_RS21775 (QTA68_21775) - 4252340..4253035 (+) 696 WP_000434125.1 helix-turn-helix domain-containing protein -
  QTA68_RS21780 (QTA68_21780) - 4253161..4253403 (+) 243 WP_000440716.1 DUF2626 domain-containing protein -
  QTA68_RS21785 (QTA68_21785) - 4253511..4254905 (+) 1395 WP_001094345.1 L-cystine transporter -
  QTA68_RS21790 (QTA68_21790) - 4254984..4255376 (-) 393 WP_000875394.1 hypothetical protein -
  QTA68_RS21795 (QTA68_21795) - 4255464..4255679 (-) 216 WP_001008327.1 DUF3912 family protein -
  QTA68_RS21800 (QTA68_21800) - 4255932..4256243 (+) 312 WP_001093236.1 hypothetical protein -
  QTA68_RS21805 (QTA68_21805) - 4256278..4256766 (-) 489 WP_290101816.1 hypothetical protein -
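
The Location window above is numerically equal to the comGA coordinates extended by 5,000 bp on each side, and several neighbouring com genes overlap by a few bases. The sketch below checks both points. The 5,000 bp flank is inferred from the numbers rather than stated in the record, and the helper names are hypothetical.

```python
# Coordinates copied from the table above (treated as 1-based, inclusive).
genes = [
    ("comGG", 4248201, 4248572),
    ("comGF", 4248569, 4249039),
    ("QTA68_RS21750", 4249009, 4249311),
    ("comGD", 4249304, 4249759),
    ("comGC", 4249756, 4250055),
    ("comGB", 4250067, 4251104),
    ("comGA", 4251091, 4252134),
]

FLANK = 5000  # assumption: inferred from the Location line, not documented here

def window(start, end, flank=FLANK):
    """Return the (start, end) of a region extended by `flank` on both sides."""
    return start - flank, end + flank

def overlap_bp(a, b):
    """Number of bases shared by two inclusive intervals (0 if they are disjoint)."""
    return max(0, min(a[2], b[2]) - max(a[1], b[1]) + 1)

comga = genes[-1]
print(window(comga[1], comga[2]))  # (4246091, 4257134)

# Overlap between each pair of adjacent genes in the cluster.
for left, right in zip(genes, genes[1:]):
    print(left[0], right[0], overlap_bp(left, right))
```

With these coordinates, comGB and comGA share 14 bp, while comGC and comGB do not overlap.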

Sequence


Protein


Length: 347 a.a.; Molecular weight: 39155.59 Da; Isoelectric point: 8.7212

>NTDB_id=850422 QTA68_RS21770 WP_290101808.1 4251091..4252134(-) (comGA) [Bacillus cereus strain lycx]
MNGIEIFANAILKEACRVQASDLHIVPRQKDVAVQLRIGKDLMTRQCIEKEFGEKLVSHFKFLASMDIGEKRKPQNGSLY
LQIDGQEVYLRLSTLPTVYQESLVIRLHLQAFVQPLSHLSLFPSTAAKLLSFLRYSQGLLVFTGPTGSGKTTTMYALLEV
IRKKKTRRIITLEDPVEKRSDDVLQIQINEKAGLTYETGLKAILRHDPDIILVGEIRDEETAKIAIRASLTGHLVMTTLH
TNDAKGAIIRFMDYGITRQEIEQSLLAVAAQRLVELKCPFCRGKCSTLCKSMRQVRQASIYELLYGYELKQALKEAGGEC
VTYKHETLESSIRKGYALGFLEEDVYV

Nucleotide


Length: 1044 bp

>NTDB_id=850422 QTA68_RS21770 WP_290101808.1 4251091..4252134(-) (comGA) [Bacillus cereus strain lycx]
ATGAATGGGATTGAAATTTTTGCGAATGCAATTTTAAAAGAAGCGTGTAGGGTACAAGCGTCAGATTTACATATTGTGCC
CCGGCAGAAGGATGTAGCGGTTCAACTACGTATAGGAAAAGATTTAATGACGAGACAGTGTATTGAAAAGGAATTTGGAG
AAAAACTTGTCTCACATTTCAAATTTTTAGCATCTATGGACATAGGCGAGAAACGGAAGCCACAGAATGGTTCGTTGTAT
TTGCAAATTGATGGACAAGAAGTGTATTTACGCCTTTCAACACTTCCAACAGTATACCAAGAAAGTCTCGTTATTCGCCT
TCATTTACAAGCATTTGTTCAGCCCTTATCACATCTTTCGTTATTTCCAAGTACGGCAGCAAAACTACTCTCGTTTTTAC
GCTATTCACAGGGGCTACTTGTGTTTACTGGACCGACTGGATCTGGGAAGACGACGACAATGTATGCATTATTAGAGGTA
ATTAGAAAAAAGAAAACACGTCGCATCATTACACTTGAAGATCCGGTTGAAAAAAGAAGTGACGATGTATTACAAATTCA
AATAAATGAAAAAGCGGGTCTCACATATGAAACAGGATTAAAGGCTATTTTACGTCATGATCCAGATATTATTTTAGTCG
GTGAAATCCGTGATGAAGAAACAGCAAAAATAGCTATAAGAGCAAGTTTGACAGGACATTTAGTAATGACAACACTGCAT
ACGAATGATGCGAAAGGAGCAATCATTCGCTTCATGGATTATGGTATAACGAGACAAGAGATTGAACAGTCTTTATTGGC
TGTAGCTGCACAGCGACTTGTCGAATTGAAGTGTCCGTTTTGCAGAGGAAAGTGCTCAACTTTATGTAAATCAATGAGGC
AAGTAAGGCAGGCTAGCATATATGAGCTACTATATGGATATGAGTTAAAACAAGCGCTTAAAGAAGCTGGTGGAGAATGT
GTTACTTACAAACATGAGACTTTAGAATCGTCAATACGAAAAGGATATGCTTTAGGTTTTTTAGAAGAAGATGTTTATGT
TTAA
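
The nucleotide record above is given in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly without reverse-complementing. The sketch below translates only the first line of the record and is meant as an illustration, not as the database's own pipeline. It assumes the standard amino acid assignments, which the bacterial genetic code shares, and `translate` is a hypothetical helper.

```python
from itertools import product

# Compact codon table: codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; trailing partial codons are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 80 nt of the nucleotide record above.
first_line = (
    "ATGAATGGGATTGAAATTTTTGCGAATGCAATTTTAAAAGAAGCGTGTAGGGTACAAGCGTCAGATTTACATATTGTGCC"
)

print(translate(first_line))  # MNGIEIFANAILKEACRVQASDLHIV
```

The output matches the first 26 residues of the protein sequence in this record.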


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168 55.908 100 0.559
  pilB Haemophilus influenzae 86-028NP 40.062 92.795 0.372
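
The Ha-values listed above are numerically consistent with the product of fractional identity and fractional coverage. This is an observation from the two rows shown, not a definition taken from the database documentation, and `ha_value` is a hypothetical helper name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Product of fractional identity and coverage, rounded to 3 decimals.

    Assumption: this formula is inferred from the listed values only.
    """
    return round((identity_pct / 100) * (coverage_pct / 100), 3)

print(ha_value(55.908, 100))     # 0.559
print(ha_value(40.062, 92.795))  # 0.372
```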