Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilF   Type   Machinery gene
Locus tag   DPH57_RS14615 Genome accession   NZ_CP030092
Coordinates   3459482..3461170 (+) Length   562 a.a.
NCBI ID   WP_227470197.1    Uniprot ID   -
Organism   Massilia sp. YMA4     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3454482..3466170
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DPH57_RS14590 (DPH57_14590) uraH 3454812..3455165 (+) 354 WP_112938399.1 hydroxyisourate hydrolase -
  DPH57_RS14595 (DPH57_14595) - 3455150..3456376 (-) 1227 WP_112938400.1 glutamate carboxypeptidase -
  DPH57_RS14600 (DPH57_14600) - 3456436..3456930 (-) 495 WP_112938401.1 NADH-quinone oxidoreductase subunit B family protein -
  DPH57_RS14605 (DPH57_14605) - 3457003..3457860 (-) 858 WP_112938402.1 helix-turn-helix domain-containing protein -
  DPH57_RS14610 (DPH57_14610) - 3458085..3459356 (+) 1272 WP_112938403.1 HlyC/CorC family transporter -
  DPH57_RS14615 (DPH57_14615) pilF 3459482..3461170 (+) 1689 WP_227470197.1 type IV-A pilus assembly ATPase PilB Machinery gene
  DPH57_RS14620 (DPH57_14620) pilC 3461181..3462395 (+) 1215 WP_112938405.1 type II secretion system F family protein Machinery gene
  DPH57_RS14625 (DPH57_14625) - 3462402..3463751 (-) 1350 WP_112938406.1 MFS transporter -
  DPH57_RS14630 (DPH57_14630) - 3463800..3465638 (-) 1839 WP_112938407.1 alpha-amylase family glycosyl hydrolase -

Sequence


Protein


Download         Length: 562 a.a.        Molecular weight: 61039.74 Da        Isoelectric Point: 5.9346

>NTDB_id=299058 DPH57_RS14615 WP_227470197.1 3459482..3461170(+) (pilF) [Massilia sp. YMA4]
MSGLARALMQAGRLTPPQADALQKKSHTDKLPFIDVLLASGVVNSRDLALFCAETFAYPMLDLHAFAVAALPQKLIDPKL
MQSQRVVALAKRGNKMSVAISDPTNTQALDQIKFQTESSVEPVIVPHDALVRLLQELGKSSEQLMGDLAGDEGEIQFAEE
QESTTVAEAPATDVEDAPIVRFLNKMLMDAVNMGASDLHFEPFEKFYRIRFRVDGVLIEHAQPPIAIKDKLVSRIKVLAR
LDISEKRVPQDGRMRLIVSPTKTIDLRISTLPTLFGEKTVMRILDATQAQMGIDALGYDPDQKALLLDAIERPYGMVLVT
GPTGSGKTVSLYTCLNILNKPGINISTAEDPAEINLPGVNQVNVNDKAGLTFPVALKSFLRQDPDIIMVGEIRDLETADI
AIKAAQTGHMVFSTLHTNDAPSTLTRLMNMGVAPFNIASSVILITAQRLARRLCTCKQPVDISADLLLRAGFKPEQLDGT
WKPYGPVGCERCNGTGYKGRVGIYQIMPISPAIEALILAHGNAMQIAAQSESEGVKSLRQSGLVKVKVGLTSLEEVLGCT
NE

Nucleotide


Download         Length: 1689 bp        

>NTDB_id=299058 DPH57_RS14615 WP_227470197.1 3459482..3461170(+) (pilF) [Massilia sp. YMA4]
ATGTCGGGCCTGGCTCGTGCCTTGATGCAGGCGGGGCGGCTCACGCCCCCGCAGGCGGACGCGCTGCAAAAAAAATCCCA
CACCGACAAGCTGCCTTTCATCGACGTGCTGCTGGCCAGCGGCGTCGTCAATTCACGCGACCTGGCGCTGTTCTGTGCCG
AGACGTTCGCCTACCCGATGCTGGACCTGCACGCGTTTGCCGTCGCCGCGCTGCCCCAGAAGCTGATCGACCCGAAGCTG
ATGCAGAGCCAGCGCGTCGTGGCGCTGGCCAAGCGCGGCAACAAGATGTCGGTGGCGATCTCCGACCCCACCAACACGCA
GGCGCTGGACCAGATCAAGTTCCAGACCGAGTCGTCGGTGGAACCGGTGATCGTGCCGCACGACGCGCTGGTGCGCCTGC
TGCAGGAACTGGGCAAGAGCAGCGAGCAGCTGATGGGGGACCTGGCCGGCGACGAGGGCGAGATCCAGTTCGCCGAGGAG
CAGGAATCGACCACCGTGGCGGAAGCCCCGGCCACCGACGTCGAGGATGCACCGATCGTGCGCTTCCTGAACAAGATGCT
GATGGACGCCGTCAACATGGGCGCCTCCGACCTGCACTTCGAGCCGTTCGAGAAGTTCTACCGCATCCGCTTCCGCGTCG
ACGGCGTGCTGATCGAACACGCCCAGCCGCCGATCGCCATCAAGGACAAGCTGGTCTCGCGCATCAAGGTGCTGGCGCGC
CTGGACATCTCGGAAAAGCGCGTGCCGCAGGACGGCCGCATGCGCCTGATCGTCTCGCCGACCAAGACCATCGACCTGCG
TATCTCGACCCTGCCGACACTGTTCGGCGAGAAGACCGTGATGCGTATCCTGGACGCGACCCAGGCCCAGATGGGCATCG
ACGCGCTGGGCTACGACCCGGACCAGAAGGCCCTGCTGCTGGATGCGATCGAACGGCCCTACGGCATGGTGCTGGTGACG
GGACCTACCGGCTCCGGCAAGACCGTTTCGCTGTACACCTGCCTGAACATCCTGAACAAGCCGGGCATCAACATCTCGAC
GGCGGAAGACCCGGCCGAGATCAACCTGCCCGGCGTGAACCAGGTCAACGTCAACGACAAGGCGGGGCTGACCTTCCCCG
TCGCGCTGAAGTCCTTCCTGCGCCAGGACCCGGACATCATCATGGTCGGCGAGATCCGCGACCTGGAAACGGCGGACATC
GCCATCAAGGCGGCGCAGACGGGCCACATGGTGTTCTCCACGCTGCACACCAACGACGCGCCATCGACCCTGACGCGCCT
GATGAACATGGGCGTGGCGCCGTTCAACATCGCCTCCTCCGTCATCCTGATCACGGCGCAGCGCCTGGCGCGCCGGCTGT
GCACCTGCAAGCAGCCGGTGGACATCTCGGCCGACCTGCTGCTGCGGGCAGGATTCAAACCGGAACAGCTGGACGGCACC
TGGAAACCGTACGGCCCGGTGGGCTGCGAGCGCTGCAATGGCACCGGCTACAAGGGCCGCGTGGGTATCTACCAGATCAT
GCCGATCAGCCCCGCCATCGAGGCGCTGATCCTGGCGCACGGCAACGCGATGCAGATCGCGGCGCAGTCGGAAAGCGAAG
GCGTGAAGTCCTTGCGCCAGTCCGGCCTCGTCAAGGTCAAGGTGGGCCTGACCAGCCTGGAAGAAGTGCTGGGCTGCACC
AACGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilF Neisseria gonorrhoeae MS11

53.996

100

0.541

  pilB Acinetobacter baylyi ADP1

53.014

100

0.532

  pilB Acinetobacter baumannii D1279779

55.682

93.95

0.523

  pilB Legionella pneumophila strain ERS1305867

47.687

100

0.477

  pilB Vibrio cholerae strain A1552

46.277

100

0.464

  pilB Vibrio campbellii strain DS40M4

44.7

100

0.45

  pilB Vibrio parahaemolyticus RIMD 2210633

44.425

100

0.447

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

39.192

100

0.397

  pilF Thermus thermophilus HB27

37.168

100

0.374


Multiple sequence alignment