Detailed information    

in silico   Bioinformatically predicted

Overview


Name   pilB
Type   Machinery gene
Locus tag   G9Q37_RS21050
Genome accession   NZ_CP049989
Coordinates   4413885..4415618 (-)
Length   577 a.a.
NCBI ID   WP_166230498.1
UniProt ID   A0A6G8IN67
Organism   Hydrogenophaga crocea strain BA0156
Function   powers assembly of the type IV pilus (predicted from homology)
DNA binding and uptake
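
The coordinates and the protein length in this overview can be cross-checked with a little arithmetic. The sketch below assumes that coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted as a residue. The helper name `parse_coordinates` is hypothetical and not part of any database API.

```python
def parse_coordinates(text):
    """Split a 'start..end (strand)' string into (start, end, strand)."""
    span, strand = text.split()
    start, end = (int(part) for part in span.split(".."))
    return start, end, strand.strip("()")


start, end, strand = parse_coordinates("4413885..4415618 (-)")

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end - start + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa, strand)  # 1734 577 -
```

Under these assumptions the 1734 bp gene corresponds to the listed 577 a.a. product.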

Genomic Context


Location: 4408885..4420618
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  G9Q37_RS21020 (G9Q37_21020) - 4409712..4410167 (+) 456 WP_166230488.1 NUDIX domain-containing protein -
  G9Q37_RS21025 (G9Q37_21025) - 4410178..4410372 (-) 195 WP_166230490.1 DNA gyrase inhibitor YacG -
  G9Q37_RS21030 (G9Q37_21030) zapD 4410377..4411132 (-) 756 WP_166230492.1 cell division protein ZapD -
  G9Q37_RS21035 (G9Q37_21035) coaE 4411176..4411781 (-) 606 WP_166230494.1 dephospho-CoA kinase -
  G9Q37_RS21040 (G9Q37_21040) pilD 4411778..4412623 (-) 846 WP_166231555.1 prepilin peptidase Machinery gene
  G9Q37_RS21045 (G9Q37_21045) pilC 4412632..4413849 (-) 1218 WP_166230496.1 type II secretion system F family protein Machinery gene
  G9Q37_RS21050 (G9Q37_21050) pilB 4413885..4415618 (-) 1734 WP_166230498.1 type IV-A pilus assembly ATPase PilB Machinery gene
  G9Q37_RS21060 (G9Q37_21060) - 4415892..4416821 (-) 930 WP_166230500.1 polyprenyl synthetase family protein -
  G9Q37_RS21065 (G9Q37_21065) rplU 4417083..4417394 (+) 312 WP_166230502.1 50S ribosomal protein L21 -
  G9Q37_RS21070 (G9Q37_21070) rpmA 4417407..4417664 (+) 258 WP_166230504.1 50S ribosomal protein L27 -
  G9Q37_RS21075 (G9Q37_21075) cgtA 4417745..4418815 (+) 1071 WP_166230506.1 Obg family GTPase CgtA -
  G9Q37_RS21080 (G9Q37_21080) proB 4418819..4419988 (+) 1170 WP_240936449.1 glutamate 5-kinase -
  G9Q37_RS21085 (G9Q37_21085) - 4419998..4420507 (-) 510 WP_166230508.1 CNP1-like family protein -
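
As a rough consistency check on the table above, the sketch below recomputes the listed sizes from the coordinates and measures the spacing between the four adjacent minus-strand genes ending in pilB. The values are copied from the table. The observation that the displayed window extends 5000 bp on each side of pilB is read off the numbers here and is not a documented rule.

```python
# (gene, start, end, listed size in bp), copied from the genomic context table
genes = [
    ("coaE", 4411176, 4411781, 606),
    ("pilD", 4411778, 4412623, 846),
    ("pilC", 4412632, 4413849, 1218),
    ("pilB", 4413885, 4415618, 1734),
]

# Size should equal end - start + 1 for 1-based inclusive coordinates.
sizes_ok = all(end - start + 1 == size for _, start, end, size in genes)

# Bases between neighbouring genes; a negative value means the genes overlap.
gaps = {
    f"{left[0]}-{right[0]}": right[1] - left[2] - 1
    for left, right in zip(genes, genes[1:])
}

# Distance from each edge of the displayed window to the pilB gene.
window_start, window_end = 4408885, 4420618
flanks = (4413885 - window_start, window_end - 4415618)

print(sizes_ok, gaps, flanks)
```

With these inputs coaE and pilD overlap by 4 bp, pilD and pilC are 8 bp apart, and pilC and pilB are 35 bp apart.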

Sequence


Protein


Length: 577 a.a.   Molecular weight: 62729.24 Da   Isoelectric point: 5.6391

>NTDB_id=429190 G9Q37_RS21050 WP_166230498.1 4413885..4415618(-) (pilB) [Hydrogenophaga crocea strain BA0156]
MASADTVNQESPSMALPGLGRALVSAGKLGQKAAEELYRKAQSSRTSFIAELTGSGAVSPADLAHTMSTAFAAPLLDLDA
VDVQRLPSGLLDPKICADFRIIALSKRNNRLIVATADPSDQQAAERIKFATQMGVDWVIAEFDKLSKLVESQTKSVTEAM
DNIVGDVEFDDLAAEAAVTDTASEKAAEVDDAPVVKFLHKMLIDAFNMRASDLHFEPYEHTYRVRFRIDGELREIASPPI
AIKEKLASRIKVISRMDISEKRVPQDGRMKLKVGADRVIDFRVSTLPTLFGEKIVIRILDPSQAKLGIDALGYEPEEKER
LLEAIQRPYGMVLVTGPTGSGKTVSLYTCLNILNKPGVNISTAEDPSEINLPGVNQVNMNEKAGLTFAVALKAFLRQDPD
VIMVGEIRDLETADIAIKAAQTGHMVMSTLHTNDAPTTLTRMMNMGIPTFNIASSVILITAQRLARRLCPNCKAPLDVPR
KALLEAGYKPEEIDGSWTPYKPVGCSACNNGYKGRVGIYQVMPISEEIQRIILRGGTALDIAQQASKEGVRSLRESGLLK
VKLGLTSLEEVLSVTNE
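
The FASTA header of this record packs several fields into one line. A minimal sketch of pulling them apart with a regular expression follows. The pattern is inferred from this single header only, so other records may need a looser pattern, and the function name `parse_header` is hypothetical.

```python
import re

# Pattern inferred from the one header in this record (an assumption).
HEADER = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>[^\]]+)\]$"
)


def parse_header(line):
    """Return the header fields as a dict, with coordinates as integers."""
    match = HEADER.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=429190 G9Q37_RS21050 WP_166230498.1 4413885..4415618(-) "
    "(pilB) [Hydrogenophaga crocea strain BA0156]"
)
record = parse_header(header)
print(record["gene"], record["locus_tag"], record["strand"])
```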

Nucleotide


Length: 1734 bp

>NTDB_id=429190 G9Q37_RS21050 WP_166230498.1 4413885..4415618(-) (pilB) [Hydrogenophaga crocea strain BA0156]
ATGGCGTCAGCAGACACGGTCAATCAGGAAAGTCCGTCCATGGCCCTGCCGGGCTTGGGGCGCGCGCTCGTTTCCGCCGG
CAAATTGGGGCAGAAGGCGGCTGAAGAGCTCTACCGCAAAGCCCAAAGCAGCCGAACCAGCTTCATCGCCGAACTCACGG
GTTCGGGCGCGGTGTCACCCGCCGACCTCGCCCACACCATGTCGACGGCCTTCGCGGCACCCTTGCTCGACCTCGACGCG
GTGGACGTGCAACGCCTGCCCTCCGGCTTGCTCGACCCCAAGATCTGTGCCGACTTCCGCATCATCGCGCTGAGCAAGCG
CAACAACCGGCTGATCGTCGCCACGGCCGATCCCTCTGACCAACAGGCCGCCGAGCGCATCAAGTTCGCCACGCAAATGG
GCGTGGACTGGGTCATTGCGGAGTTCGACAAGCTGAGCAAGCTGGTCGAGTCACAGACCAAGTCGGTGACCGAGGCGATG
GACAACATCGTCGGCGACGTCGAGTTCGACGACCTCGCCGCCGAAGCCGCCGTCACCGACACGGCATCCGAAAAGGCCGC
CGAGGTCGACGACGCCCCCGTCGTCAAGTTCCTGCACAAGATGCTGATCGACGCGTTCAACATGCGCGCGTCCGATCTGC
ACTTCGAGCCCTACGAGCACACCTACCGCGTGCGCTTCCGCATCGACGGCGAGTTGCGCGAGATCGCCTCACCGCCCATT
GCCATCAAGGAAAAGCTCGCCTCGCGCATCAAGGTGATCTCCCGCATGGACATCTCCGAGAAGCGGGTGCCGCAGGACGG
GCGCATGAAGCTCAAGGTGGGCGCCGACCGCGTGATCGACTTCCGCGTCAGCACGCTGCCCACGCTGTTCGGCGAAAAGA
TCGTGATCCGTATCCTGGACCCGAGCCAGGCCAAGCTGGGCATCGACGCCCTCGGATACGAACCCGAGGAAAAAGAGCGG
CTGCTGGAGGCCATTCAGCGGCCCTACGGCATGGTGCTGGTGACCGGCCCGACGGGTTCGGGCAAGACCGTGTCGCTCTA
CACCTGCCTGAACATCCTCAACAAGCCCGGCGTCAACATCTCGACGGCGGAGGACCCCTCGGAAATCAACCTGCCCGGGG
TCAACCAGGTCAACATGAACGAGAAGGCGGGGCTGACCTTCGCGGTCGCGCTCAAGGCCTTCCTGCGCCAGGATCCCGAC
GTGATCATGGTGGGCGAGATCCGCGACCTGGAAACCGCCGACATCGCGATCAAGGCCGCACAGACCGGCCACATGGTGAT
GTCGACCCTGCACACCAACGACGCGCCGACGACCCTCACCCGCATGATGAACATGGGCATCCCCACGTTCAACATCGCCT
CCAGCGTCATTCTGATCACGGCCCAGCGCCTGGCACGCCGCCTGTGTCCCAATTGCAAGGCGCCGCTCGACGTGCCGCGC
AAGGCGCTGCTTGAAGCCGGATACAAGCCCGAAGAAATCGATGGCAGCTGGACACCCTACAAGCCCGTGGGTTGCTCGGC
TTGCAACAACGGCTACAAGGGACGCGTAGGCATCTACCAGGTGATGCCGATTTCGGAGGAGATCCAACGCATCATCCTGC
GAGGCGGCACCGCACTGGACATCGCGCAGCAAGCCAGCAAGGAAGGGGTTCGCTCGCTGCGCGAATCCGGCTTGCTCAAG
GTCAAGCTCGGCCTGACTTCCCTCGAGGAAGTGCTGAGCGTGACCAACGAATGA
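
Although the gene lies on the minus strand, the nucleotide record above is written in coding orientation: it starts with ATG and ends with TGA. A minimal sketch of translating it is shown below, using the standard codon assignments in TCAG order. The function name `translate` is hypothetical, and only the first 30 bases and the last 15 bases of the record are used here to keep the example short.

```python
BASES = "TCAG"
# Standard codon assignments, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_codons = translate("ATGGCGTCAGCAGACACGGTCAATCAGGAA")
last_codons = translate("GTGACCAACGAATGA")
print(first_codons, last_codons)  # MASADTVNQE VTNE
```

Both fragments agree with the start (MASADTVNQE) and end (VTNE) of the protein sequence listed in this record.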


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6G8IN67

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilB Acinetobacter baumannii D1279779 54.982 97.4 0.536
  pilB Acinetobacter baylyi ADP1 54.707 97.574 0.534
  pilF Neisseria gonorrhoeae MS11 52.575 97.574 0.513
  pilB Legionella pneumophila strain ERS1305867 50 98.44 0.492
  pilB Vibrio campbellii strain DS40M4 44.563 97.227 0.433
  pilB Vibrio parahaemolyticus RIMD 2210633 44.207 97.227 0.43
  pilB Vibrio cholerae strain A1552 47.195 89.601 0.423
  pilF Thermus thermophilus HB27 38.011 97.574 0.371
  pilB/pilB1 Synechocystis sp. PCC 6803 36.271 100 0.371
  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539 39.439 92.721 0.366
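
This record does not define the Ha-value column. For most rows the listed value is numerically close to identity (%) × coverage (%) / 10,000. That is an observation about the numbers here, not a documented formula, and the sketch below tests it as an assumption. The tolerance of 0.001 is also an arbitrary choice.

```python
# (organism, identities %, coverage %, listed Ha-value), copied from the table
hits = [
    ("Acinetobacter baumannii D1279779", 54.982, 97.4, 0.536),
    ("Acinetobacter baylyi ADP1", 54.707, 97.574, 0.534),
    ("Neisseria gonorrhoeae MS11", 52.575, 97.574, 0.513),
    ("Legionella pneumophila strain ERS1305867", 50.0, 98.44, 0.492),
    ("Vibrio campbellii strain DS40M4", 44.563, 97.227, 0.433),
    ("Vibrio parahaemolyticus RIMD 2210633", 44.207, 97.227, 0.43),
    ("Vibrio cholerae strain A1552", 47.195, 89.601, 0.423),
    ("Thermus thermophilus HB27", 38.011, 97.574, 0.371),
    ("Synechocystis sp. PCC 6803", 36.271, 100.0, 0.371),
    ("Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539", 39.439, 92.721, 0.366),
]

# Assumed relationship: Ha = identity fraction * coverage fraction.
mismatches = [
    organism
    for organism, identity, coverage, listed in hits
    if abs(identity * coverage / 10000 - listed) > 0.001
]

print(mismatches)
```

Under this assumption only the Synechocystis row fails to reproduce (about 0.363 computed against 0.371 listed), so the relationship should be treated as approximate unless the database documentation confirms it.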