Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilB   Type   Machinery gene
Locus tag   E5CHR_RS06840 Genome accession   NZ_LR594675
Coordinates   1452882..1454615 (-) Length   577 a.a.
NCBI ID   WP_162578991.1    Uniprot ID   A0A6P2EHH6
Organism   Variovorax sp. PBS-H4     
Function   power the assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1447882..1459615
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E5CHR_RS06820 (H4CHR_01362) - 1448501..1449994 (-) 1494 WP_162578987.1 MBOAT family protein -
  E5CHR_RS06825 (H4CHR_01363) coaE 1450079..1450720 (-) 642 WP_162578988.1 dephospho-CoA kinase -
  E5CHR_RS06830 (H4CHR_01364) - 1450678..1451643 (-) 966 WP_162578989.1 A24 family peptidase -
  E5CHR_RS06835 (H4CHR_01365) pilC 1451643..1452866 (-) 1224 WP_162578990.1 type II secretion system F family protein Machinery gene
  E5CHR_RS06840 (H4CHR_01366) pilB 1452882..1454615 (-) 1734 WP_162578991.1 type IV-A pilus assembly ATPase PilB Machinery gene

Sequence


Protein


Download         Length: 577 a.a.        Molecular weight: 62493.80 Da        Isoelectric Point: 5.6405

>NTDB_id=1128245 E5CHR_RS06840 WP_162578991.1 1452882..1454615(-) (pilB) [Variovorax sp. PBS-H4]
MAAADPLAKESASIALPGLARALISAGKLPVRTAEEIYQKSLSGRSSFIAELTGTGAVSAADLAHTLSSAFGAPLLDLDA
VDSQRLPKDLLDSKLCQAYRVVVLSKRNNRLIVATADPSDQQAAEKIKFASQMGVDWVVAEYDKLSRMIEAAAVSAAETI
DSIVGAEFEFDDSSIAESAEANEAAIAEVEDAPVVRFLHKMLLDAFSMRASDIHFEPYEHQYRVRFRIDGELREIATPPT
VIKDKLASRIKVISRLDISEKRVPQDGRMKLKIGADRVIDFRVSTLPTLFGEKIVVRILDPSSARLGIDALGYDADEKER
LMQAIGRPYGMVLVTGPTGSGKTVSLYTCLNLLNKPGVNIATAEDPSEINLPGVNQVNVNDKAGLTFAAALRAFLRQDPD
IIMVGEIRDLETADIAIKAAQTGHLVLSTLHTNDAPTTLTRMRNMGIAPFNIASSVILITAQRLARRLCTVCKAPIEVPR
EALLEAGFKDRDIDGTWKPYKPVGCPSCNGGYKGRVGIYQVMPISEEIQKIILRDGSALEIARQSEAEGVRSLRSSGLRK
VMQGLTSLEEVLAVTNE

Nucleotide


Download         Length: 1734 bp        

>NTDB_id=1128245 E5CHR_RS06840 WP_162578991.1 1452882..1454615(-) (pilB) [Variovorax sp. PBS-H4]
ATGGCCGCCGCTGACCCTCTCGCCAAAGAAAGCGCATCGATCGCCCTCCCCGGGCTGGCGCGCGCCTTGATCTCCGCCGG
CAAGCTGCCGGTGAGGACCGCGGAGGAGATCTATCAGAAGTCGCTCAGCGGGCGAAGCAGCTTCATCGCCGAGCTGACCG
GCACCGGCGCGGTGTCTGCGGCGGATCTCGCCCACACCCTGTCGAGCGCGTTCGGCGCCCCCCTGCTCGACCTTGACGCC
GTCGATTCCCAGCGCCTGCCCAAGGACCTGCTGGATTCCAAGCTGTGTCAGGCCTACCGGGTGGTGGTGCTCAGCAAGCG
CAACAACCGTCTCATCGTCGCAACAGCCGACCCGTCCGACCAGCAGGCGGCGGAAAAAATAAAGTTCGCGTCCCAGATGG
GCGTGGACTGGGTCGTTGCCGAGTACGACAAGCTCTCGCGCATGATCGAGGCGGCGGCGGTGAGCGCCGCCGAAACGATC
GACAGCATCGTCGGCGCCGAGTTCGAGTTCGACGACTCCTCGATCGCCGAGAGCGCCGAGGCGAACGAAGCCGCGATCGC
CGAGGTCGAGGATGCGCCGGTCGTCCGCTTCCTGCACAAGATGCTGCTCGACGCCTTCAGCATGCGGGCGTCGGACATCC
ACTTCGAGCCTTACGAGCATCAGTACCGCGTGCGCTTCCGGATCGACGGCGAGCTGCGCGAGATCGCCACGCCGCCCACG
GTCATCAAGGACAAGCTGGCTTCGCGCATCAAGGTGATTTCCCGGCTGGACATCTCCGAGAAGCGGGTGCCGCAGGACGG
CCGCATGAAGCTCAAGATCGGCGCCGACCGGGTGATCGACTTCCGCGTCAGCACCCTGCCCACGCTGTTCGGCGAGAAGA
TCGTGGTGCGTATCCTCGACCCGAGCAGCGCGCGCCTGGGCATCGATGCCCTGGGCTATGACGCCGACGAGAAAGAGCGG
CTGATGCAAGCCATCGGCCGCCCCTACGGCATGGTGCTGGTCACCGGGCCGACCGGCTCGGGCAAGACGGTTTCGCTCTA
CACCTGCCTGAACCTGCTGAACAAGCCGGGCGTCAACATCGCCACGGCCGAGGACCCATCCGAAATCAACCTTCCCGGGG
TCAACCAGGTCAACGTCAACGACAAGGCGGGTCTGACCTTCGCGGCGGCGCTGCGCGCTTTCCTGCGCCAGGACCCGGAC
ATCATCATGGTCGGCGAAATCCGCGACCTGGAAACCGCCGACATCGCGATCAAGGCCGCGCAAACCGGCCACCTGGTGCT
GTCGACCCTGCACACCAACGACGCACCGACCACGCTGACGCGGATGCGCAACATGGGCATCGCGCCCTTCAACATCGCCT
CCAGCGTGATCCTGATCACCGCGCAGCGGCTCGCGCGCCGGCTGTGCACCGTATGCAAGGCGCCAATCGAGGTGCCCAGG
GAGGCGCTGCTCGAAGCCGGCTTCAAGGACCGCGACATCGACGGCACCTGGAAGCCCTACAAGCCCGTCGGCTGCCCGTC
CTGCAACGGCGGCTACAAGGGCCGCGTGGGCATCTACCAGGTGATGCCGATCTCCGAGGAAATCCAGAAGATCATCCTGC
GCGATGGAAGCGCGCTCGAGATCGCCCGGCAATCCGAGGCCGAGGGCGTGCGCTCGCTGCGCAGTTCCGGGCTGCGGAAG
GTCATGCAGGGCCTGACCTCTCTCGAAGAAGTGCTCGCGGTCACCAACGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6P2EHH6

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilB Acinetobacter baylyi ADP1

54.902

97.227

0.534

  pilB Acinetobacter baumannii D1279779

54.545

97.227

0.53

  pilF Neisseria gonorrhoeae MS11

52.931

97.574

0.516

  pilB Legionella pneumophila strain ERS1305867

49.033

98.614

0.484

  pilB Vibrio parahaemolyticus RIMD 2210633

45.088

98.787

0.445

  pilB Vibrio campbellii strain DS40M4

45.648

97.574

0.445

  pilB Vibrio cholerae strain A1552

48.462

90.121

0.437

  pilF Thermus thermophilus HB27

40.526

98.787

0.4

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

41.468

94.454

0.392


Multiple sequence alignment