Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGA
Type: Machinery gene
Locus tag: BGI23_RS20475
Genome accession: NZ_CP017016
Coordinates: 3938813..3939856 (-)
Length: 347 a.a.
NCBI ID: WP_044793799.1
UniProt ID: A0A5C5AMY2
Organism: Bacillus sp. ABP14
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
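
The coordinates and the protein length in the overview can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


bp = gene_length_bp(3938813, 3939856)
aa = protein_length_aa(bp)
print(bp, aa)  # 1044 347
```

Both values agree with the 1044 bp and 347 a.a. reported for comGA in this record.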

Genomic Context


Location: 3933813..3944856
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  BGI23_RS20430 (BGI23_20430) - 3934241..3934441 (-) 201 WP_000106079.1 YqzE family protein -
  BGI23_RS20435 (BGI23_20435) - 3934480..3934977 (-) 498 WP_044793804.1 shikimate kinase -
  BGI23_RS20440 (BGI23_20440) - 3935097..3935747 (-) 651 WP_070806833.1 2OG-Fe(II) oxygenase -
  BGI23_RS20445 (BGI23_20445) comGG 3935923..3936294 (-) 372 WP_000595196.1 competence type IV pilus minor pilin ComGG -
  BGI23_RS20450 (BGI23_20450) comGF 3936291..3936761 (-) 471 WP_000923058.1 competence type IV pilus minor pilin ComGF -
  BGI23_RS20455 (BGI23_20455) comGE 3936731..3937033 (-) 303 WP_070806834.1 competence type IV pilus minor pilin ComGE -
  BGI23_RS20460 (BGI23_20460) comGD 3937026..3937481 (-) 456 WP_000810387.1 comG operon protein ComGD -
  BGI23_RS20465 (BGI23_20465) comGC 3937478..3937777 (-) 300 WP_001178694.1 competence type IV pilus major pilin ComGC -
  BGI23_RS20470 (BGI23_20470) comGB 3937789..3938820 (-) 1032 WP_052587781.1 competence type IV pilus assembly protein ComGB -
  BGI23_RS20475 (BGI23_20475) comGA 3938813..3939856 (-) 1044 WP_044793799.1 competence protein ComGA Machinery gene
  BGI23_RS20480 (BGI23_20480) - 3940062..3940757 (+) 696 WP_044793798.1 metalloregulator ArsR/SmtB family transcription factor -
  BGI23_RS20485 (BGI23_20485) - 3940883..3941125 (+) 243 WP_044793797.1 DUF2626 domain-containing protein -
  BGI23_RS20490 (BGI23_20490) - 3941231..3942625 (+) 1395 WP_044793796.1 L-cystine transporter -
  BGI23_RS20495 (BGI23_20495) - 3942865..3943263 (-) 399 WP_070806835.1 hypothetical protein -
  BGI23_RS20500 (BGI23_20500) - 3943351..3943566 (-) 216 WP_001008330.1 DUF3912 family protein -
  BGI23_RS20505 (BGI23_20505) - 3943821..3944132 (+) 312 WP_001080447.1 hypothetical protein -
  BGI23_RS20510 (BGI23_20510) - 3944169..3944657 (-) 489 WP_070806836.1 hypothetical protein -
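
The Location range above is consistent with the comGA coordinates extended by 5000 bp on each side, and the table shows comGA sharing a few bases with the adjacent comGB. A minimal sketch of both calculations, assuming 1-based inclusive coordinates and a 5000 bp flank (the flank size is inferred from the numbers, not stated by the source; the helper names are hypothetical):

```python
def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a gene range by `flank` bp on each side (assumed flank size)."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of bases shared by two 1-based, inclusive ranges."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


comGA = (3938813, 3939856)
comGB = (3937789, 3938820)

print(context_window(*comGA))    # (3933813, 3944856)
print(overlap_bp(comGA, comGB))  # 8
```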

Sequence


Protein


Length: 347 a.a.        Molecular weight: 39259.79 Da        Isoelectric point: 9.0719

>NTDB_id=194500 BGI23_RS20475 WP_044793799.1 3938813..3939856(-) (comGA) [Bacillus sp. ABP14]
MNGIESFANMILKEACRVQASDLHIVPRQKDVVVQLRIGKDLMTKQCIEKEFGEKLVSHFKFLASMDIGERRKPQNGSLY
LQMDGQEVYLRLSTLPTVYQESLVIRLHLQASIQPLSHLSLFPSTAKKLLSFLHYSHGLLVFTGPTGSGKTTTMYALLEV
IRKKKTRRIVTLEDPVEKRNDGVLQIQINEKAGITYEAGLKAILRHDPDVILVGEIRDEETAKIAIRASLTGHLVMTTLH
TNDAKGAILRFMDFGITRQEIEQSLLAIAAQRLVELKCPFCKRKCSTVCKSMRQVRQASIYELLYGYELKQAIKEANGEC
VTYKHETLESSLRKGYALGFLEEDVYV
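
The FASTA header of this record packs several fields into one line. The following sketch parses it with a regular expression written against this one header; the pattern and field names are assumptions and may not cover every header the database emits.

```python
import re

# Pattern inferred from the single header shown in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; coordinates become ints."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (">NTDB_id=194500 BGI23_RS20475 WP_044793799.1 "
          "3938813..3939856(-) (comGA) [Bacillus sp. ABP14]")
record = parse_header(header)
print(record["gene"], record["strand"], record["organism"])
```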

Nucleotide


Length: 1044 bp

>NTDB_id=194500 BGI23_RS20475 WP_044793799.1 3938813..3939856(-) (comGA) [Bacillus sp. ABP14]
ATGAATGGAATTGAAAGCTTTGCGAATATGATTTTAAAAGAAGCGTGCAGGGTACAAGCGTCGGACTTACATATTGTGCC
CCGGCAGAAGGATGTAGTGGTTCAGTTGCGTATAGGAAAAGATTTAATGACGAAACAATGCATTGAAAAGGAGTTTGGAG
AAAAACTTGTTTCACACTTTAAATTTTTAGCATCCATGGATATAGGGGAGAGGCGGAAGCCACAAAATGGCTCACTGTAT
TTACAAATGGACGGACAAGAAGTGTATTTACGTCTTTCCACGCTTCCAACTGTATATCAAGAAAGTCTCGTTATTCGTCT
TCATTTACAAGCATCTATTCAGCCGTTATCTCATCTTTCGTTATTTCCAAGTACAGCGAAGAAACTACTTTCTTTTTTAC
ATTATTCCCATGGGTTACTCGTATTTACTGGACCGACTGGGTCGGGGAAGACAACAACAATGTATGCATTATTAGAGGTA
ATTAGAAAAAAGAAAACACGCCGCATCGTTACACTGGAAGATCCAGTTGAAAAAAGAAATGACGGTGTATTACAAATTCA
AATAAATGAAAAAGCAGGTATCACATATGAGGCGGGACTAAAGGCTATTTTGCGTCATGATCCAGATGTTATTTTAGTGG
GTGAAATTCGTGATGAAGAAACAGCAAAAATCGCTATAAGAGCAAGTTTGACTGGCCATTTAGTAATGACGACATTGCAT
ACGAACGATGCGAAAGGAGCGATACTTCGGTTTATGGACTTTGGCATAACGAGGCAAGAGATTGAACAATCTTTATTGGC
TATAGCTGCACAGCGACTTGTTGAATTGAAGTGTCCGTTTTGTAAAAGAAAGTGCTCAACTGTATGTAAATCAATGAGGC
AAGTAAGGCAAGCGAGTATTTATGAGTTGTTATATGGATATGAGTTAAAACAAGCGATTAAAGAAGCAAACGGGGAATGT
GTCACGTACAAGCATGAGACATTAGAGTCTTCGTTGCGAAAAGGATACGCTTTAGGATTTTTAGAAGAGGATGTTTATGT
TTAA
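
Although the gene lies on the (-) strand, the nucleotide record above starts with ATG and ends with TAA, so it appears to be given in coding orientation. A minimal translation sketch, using the standard codon assignments and no external libraries, shows how the first and last codons map onto the protein sequence; `translate` is an illustrative helper, not a database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAATGGAATTGAAAGC"))  # MNGIES (start of the protein)
print(translate("GATGTTTATGTTTAA"))     # DVYV (end of the protein)
```

Passing the full 1044 bp sequence to the same function should yield the protein record, which is a quick way to confirm that the two records match.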


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A5C5AMY2

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168 57.925 100 0.579
  pilB Vibrio campbellii strain DS40M4 36.544 100 0.372


Multiple sequence alignment