Detailed information    

In silico: bioinformatically predicted

Overview


Name: pilB
Type: Machinery gene
Locus tag: CAP2UW1_RS20725
Genome accession: NC_013194
Coordinates: 4739737..4741452 (-)
Length: 571 a.a.
NCBI ID: WP_015768572.1
UniProt ID: A0A9D8KUA7
Organism: Candidatus Accumulibacter phosphatis
Function: powers the assembly of the type IV pilus (predicted from homology); DNA binding and uptake
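The coordinates and the two reported lengths can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the coding sequence ends with a single stop codon; the function name `span_length` is illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a closed, 1-based interval start..end."""
    return end - start + 1

# Coordinates of pilB as listed in the overview.
gene_bp = span_length(4739737, 4741452)

# Assumption: the CDS consists of whole codons, the last one being a stop.
codons = gene_bp // 3
protein_aa = codons - 1

print(gene_bp, protein_aa)
```

Under these assumptions the gene spans 1716 bp, which corresponds to 572 codons, one of which is the stop codon, leaving the 571 a.a. reported for the protein.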

Genomic Context


Location: 4734737..4746452
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
CAP2UW1_RS20705 (CAP2UW1_4165) | - | 4736544..4737191 (-) | 648 | WP_015768568.1 | DUF502 domain-containing protein | -
CAP2UW1_RS20710 (CAP2UW1_4166) | - | 4737176..4737508 (-) | 333 | WP_015768569.1 | FmdB family zinc ribbon protein | -
CAP2UW1_RS20715 (CAP2UW1_4167) | pilD | 4737616..4738476 (-) | 861 | WP_015768570.1 | prepilin peptidase | Machinery gene
CAP2UW1_RS20720 (CAP2UW1_4168) | pilC | 4738476..4739714 (-) | 1239 | WP_015768571.1 | type II secretion system F family protein | Machinery gene
CAP2UW1_RS20725 (CAP2UW1_4169) | pilB | 4739737..4741452 (-) | 1716 | WP_015768572.1 | type IV-A pilus assembly ATPase PilB | Machinery gene
CAP2UW1_RS26265 (CAP2UW1_4170) | - | 4741638..4741766 (-) | 129 | WP_291911647.1 | hypothetical protein | -
CAP2UW1_RS20730 (CAP2UW1_4171) | xerC | 4741877..4742791 (-) | 915 | WP_015768574.1 | tyrosine recombinase XerC | -
CAP2UW1_RS20735 (CAP2UW1_4172) | - | 4742802..4743455 (-) | 654 | WP_015768575.1 | DUF484 family protein | -
CAP2UW1_RS20740 (CAP2UW1_4173) | dapF | 4743452..4744282 (-) | 831 | WP_015768576.1 | diaminopimelate epimerase | -
CAP2UW1_RS20745 (CAP2UW1_4174) | - | 4744279..4744806 (-) | 528 | WP_015768577.1 | GNAT family N-acetyltransferase | -
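The spacing between the three adjacent machinery genes, and the size of the displayed window around pilB, follow directly from the listed coordinates. This is a minimal sketch that assumes 1-based inclusive coordinates; the helper `gap` and the tuple layout are illustrative choices, not something the database provides.

```python
# (gene name, start, end) for the three machinery genes listed above.
genes = [
    ("pilD", 4737616, 4738476),
    ("pilC", 4738476, 4739714),
    ("pilB", 4739737, 4741452),
]

def gap(left, right):
    """Bases strictly between two neighbouring genes; negative means overlap."""
    return right[1] - left[2] - 1

gaps = {f"{a[0]}-{b[0]}": gap(a, b) for a, b in zip(genes, genes[1:])}

# Flanks of the displayed window (Location: 4734737..4746452) around pilB.
left_flank = 4739737 - 4734737
right_flank = 4746452 - 4741452

print(gaps, left_flank, right_flank)
```

With these numbers, pilD and pilC share one base, pilC and pilB are separated by 22 bp, and the window extends 5000 bp on either side of pilB. Close spacing alone does not establish co-transcription.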

Sequence


Protein


Length: 571 a.a.; Molecular weight: 62339.89 Da; Isoelectric point: 5.4716

>NTDB_id=35164 CAP2UW1_RS20725 WP_015768572.1 4739737..4741452(-) (pilB) [Candidatus Accumulibacter phosphatis]
MAANPQQSPLSGLARALVQAGRLKEAEAESLLVQSIASKTSLIEQMVAANKTTFLDVARFAADKFGYPLLDLAAIDDGNI
QKNAVDRKLIATHRVVPLHKRGNRLAIAIADPTNLRALDEIRFQTGLAVDPVIVEENKLGPLVAKLSETVEESLKTLASD
DINLEFTDEQAQDKADEASSLEVDDAPVVRFIQKMLLDAINEGASDIHFEPYEKSYRIRFRTDGILREIASPPLVIKDKI
ASRIKVISRLNIAEKRVPQDGRMRLVLSKSRSIDFRVSTLPTMYGEKIVLRILDPSSATLGIDALGYEPEQKKILLDAIH
RPYGMILVTGPTGSGKTVSLYTCLNILNRPGVNIATAEDPAEIPLPGINQVNVDDKAGLTFPIALKAFLRQDPDIIMVGE
IRDIETAEIAIKAAQTGHMVLSTLHTNDAPATLTRLMNMGIPTFNLASSILLITAQRLVRRLCTCKRPLETPVETLLNAG
FEESDLDGTWTLFGPGECERCKGSGYKGRVGLYEVMPVTEAIQRIIMANGTELDIAIQARTEGVNDLRRSGLLKVKQGLT
SLDEVLGSTNA

Nucleotide


Length: 1716 bp

>NTDB_id=35164 CAP2UW1_RS20725 WP_015768572.1 4739737..4741452(-) (pilB) [Candidatus Accumulibacter phosphatis]
ATGGCTGCCAATCCGCAACAATCGCCGCTCAGCGGCCTCGCTCGTGCGCTGGTACAGGCTGGTCGGCTCAAAGAAGCCGA
GGCGGAGTCCTTGCTTGTCCAGTCGATCGCCAGCAAGACGTCGCTGATCGAGCAAATGGTCGCCGCCAACAAGACGACCT
TCCTCGACGTGGCGCGCTTCGCGGCAGACAAGTTCGGTTACCCGTTGCTCGATCTCGCCGCCATCGACGACGGGAATATC
CAGAAGAATGCGGTGGACCGCAAGCTGATCGCGACTCATCGCGTCGTGCCTCTGCACAAACGTGGCAACCGCCTGGCCAT
CGCGATTGCCGATCCAACCAATCTGCGGGCGCTCGACGAAATCCGCTTTCAGACCGGCCTGGCGGTCGACCCGGTGATCG
TCGAGGAAAACAAGCTTGGGCCGCTGGTTGCCAAACTGTCCGAGACGGTCGAAGAGAGCCTGAAAACTCTCGCCAGCGAC
GACATCAACCTCGAGTTTACCGATGAGCAGGCGCAGGACAAGGCCGATGAGGCCTCGAGTCTCGAGGTCGACGACGCGCC
GGTGGTCAGGTTCATTCAGAAGATGTTGCTCGATGCCATCAATGAAGGGGCTTCCGACATCCATTTCGAGCCCTACGAAA
AGAGCTATCGCATTCGCTTTCGAACCGATGGAATCCTGCGGGAAATCGCCTCGCCGCCGCTGGTCATCAAAGACAAGATC
GCTTCGCGCATCAAGGTCATTTCTCGCCTCAACATCGCCGAGAAGCGTGTGCCACAGGATGGCAGGATGCGCCTGGTGCT
GTCGAAAAGCCGGTCGATAGATTTCCGCGTCAGCACCTTGCCGACGATGTACGGTGAAAAGATCGTCTTGCGCATTCTCG
ATCCGAGCAGCGCCACCCTGGGTATCGACGCATTGGGTTATGAGCCCGAGCAGAAGAAGATTTTGCTCGACGCCATCCAC
CGGCCTTACGGAATGATTCTGGTGACCGGACCCACCGGTTCGGGGAAGACCGTTTCACTGTATACCTGTCTCAACATACT
CAACCGGCCCGGGGTCAACATCGCCACGGCCGAAGACCCAGCCGAAATTCCCTTGCCGGGTATCAATCAGGTCAATGTCG
ACGACAAGGCGGGACTGACCTTTCCGATTGCCCTCAAGGCCTTTCTTCGCCAGGATCCGGACATCATCATGGTGGGTGAG
ATTCGCGACATCGAAACCGCGGAGATTGCGATCAAGGCGGCACAGACTGGCCACATGGTCCTGTCGACGCTGCACACCAA
CGATGCGCCGGCGACCCTGACCCGCCTGATGAATATGGGCATACCGACTTTCAATCTCGCTTCGAGCATCCTGTTGATCA
CGGCGCAACGGCTGGTTCGTCGGCTCTGCACCTGCAAGAGACCGCTCGAAACGCCGGTCGAGACACTCCTCAACGCCGGC
TTCGAGGAGAGCGACCTCGATGGTACCTGGACCCTGTTCGGTCCGGGCGAATGCGAACGCTGCAAGGGGAGCGGCTACAA
GGGACGGGTCGGCCTCTACGAGGTGATGCCGGTCACTGAAGCCATTCAGCGCATCATCATGGCCAACGGAACGGAACTCG
ACATTGCCATCCAGGCCAGGACGGAAGGCGTCAATGATCTGCGCCGCTCGGGCCTGCTGAAGGTCAAGCAGGGGCTCACC
TCTCTCGACGAGGTTCTCGGCAGCACCAACGCGTGA
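Although the gene lies on the (-) strand, the nucleotide record above is printed in the coding orientation: it starts with ATG and ends with TGA. The sketch below translates the first 30 and last 15 bases of that record with the standard genetic code to confirm they match the ends of the protein sequence. The use of the standard code (NCBI translation table 1) is an assumption, and the names `CODON` and `translate` are illustrative.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(nt: str) -> str:
    """Translate whole codons of an uppercase DNA string; '*' marks a stop."""
    usable = len(nt) - len(nt) % 3
    return "".join(CODON[nt[i:i + 3]] for i in range(0, usable, 3))

head = "ATGGCTGCCAATCCGCAACAATCGCCGCTC"  # first 30 nt of the record
tail = "AGCACCAACGCGTGA"                 # last 15 nt of the record

print(translate(head))  # N-terminal residues
print(translate(tail))  # C-terminal residues followed by the stop
```

The first fragment gives MAANPQQSPL and the second gives STNA followed by a stop, which agrees with the beginning and end of the protein sequence listed above.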


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
pilB | Acinetobacter baumannii D1279779 | 53.616 | 99.299 | 0.532
pilB | Legionella pneumophila strain ERS1305867 | 52.778 | 100 | 0.532
pilB | Acinetobacter baylyi ADP1 | 53.086 | 99.299 | 0.527
pilF | Neisseria gonorrhoeae MS11 | 52.297 | 99.124 | 0.518
pilB | Vibrio parahaemolyticus RIMD 2210633 | 48.043 | 98.424 | 0.473
pilB | Vibrio cholerae strain A1552 | 47.527 | 99.124 | 0.471
pilB | Vibrio campbellii strain DS40M4 | 47.247 | 98.599 | 0.466
pilB | Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539 | 39.859 | 99.299 | 0.396
pilF | Thermus thermophilus HB27 | 39.054 | 100 | 0.391
pilB/pilB1 | Synechocystis sp. PCC 6803 | 42.137 | 86.865 | 0.366

