Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilF   Type   Machinery gene
Locus tag   MOTHA_RS08015 Genome accession   NZ_CP012370
Coordinates   1591188..1592849 (-) Length   553 a.a.
NCBI ID   WP_011393063.1    Uniprot ID   -
Organism   Moorella thermoacetica strain DSM 2955     
Function   power the assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1586188..1597849
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MOTHA_RS07990 (MOTHA_c16260) - 1586526..1587047 (-) 522 WP_071522541.1 type II secretion system protein -
  MOTHA_RS07995 (MOTHA_c16270) - 1587079..1587822 (-) 744 WP_011393059.1 A24 family peptidase -
  MOTHA_RS08000 (MOTHA_c16280) - 1587991..1588506 (-) 516 WP_011393060.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  MOTHA_RS08005 (MOTHA_c16290) - 1588751..1589959 (-) 1209 WP_011393061.1 type II secretion system F family protein -
  MOTHA_RS08010 (MOTHA_c16300) pilT 1590155..1591174 (-) 1020 WP_011393062.1 type IV pilus twitching motility protein PilT Machinery gene
  MOTHA_RS08015 (MOTHA_c16310) pilF 1591188..1592849 (-) 1662 WP_011393063.1 GspE/PulE family protein Machinery gene
  MOTHA_RS08020 (MOTHA_c16320) aroB 1592855..1593949 (-) 1095 WP_011393064.1 3-dehydroquinate synthase -
  MOTHA_RS08025 (MOTHA_c16330) - 1593928..1594449 (-) 522 WP_011393065.1 shikimate kinase -
  MOTHA_RS08030 (MOTHA_c16340) aroC 1594502..1595653 (-) 1152 WP_011393066.1 chorismate synthase -
  MOTHA_RS08035 (MOTHA_c16350) - 1595678..1596565 (-) 888 WP_011393067.1 shikimate dehydrogenase -
  MOTHA_RS08040 (MOTHA_c16360) - 1596685..1597206 (-) 522 WP_011393068.1 YqeG family HAD IIIA-type phosphatase -

Sequence


Protein


Download         Length: 553 a.a.        Molecular weight: 60458.16 Da        Isoelectric Point: 6.4190

>NTDB_id=153876 MOTHA_RS08015 WP_011393063.1 1591188..1592849(-) (pilF) [Moorella thermoacetica strain DSM 2955]
MDSRRRLGDLLIEAGMLTPAQLEQALQEQKRSGERLGKVLIRLGFITEASMLEVLEFQLGIPKVVLADYHLDPEVVRLVP
EGLARRYQAIPIRLDGNRLLVAMADPLNLVALDDLRLVTGKEIMPAIAAEKEIEAALSRFWQREPVTSMSEVAAAVAAAE
SGGRAGGTEGAPAVRLVNSFIQQAIQTRASDIHIEPQEGEVRVRLRVDGLLRELTRLPLGVLSSLISRIKIMAGMDIAEK
RLPQDGRFQFTLGKRSVDLRVSSLPTVYGEKIVLRLLDQEAMLLPLDDLGFLPAIKERFESLIHSSYGMLLITGPTGSGK
TTTLYATLNILSSPEKNIITIEDPVEYLLPGINQVRVNPKAGLTFASGLRSILRQDPDIIMVGEIRDRETADIAVRAATT
GHLVLTTLHTNDAAGAVTRLLDMGVEGYLVNSSLIGVVAQRLVRRICPHCREMYEPEPGSPERAWLPGAERLWRGRGCEN
CHYTGYTNRTAIQEVLVMNEELRRLVAAKAPATALKEAAVAGGMVPLIDDGLEKARQGITTVSEVLRVSLGGL

Nucleotide


Download         Length: 1662 bp        

>NTDB_id=153876 MOTHA_RS08015 WP_011393063.1 1591188..1592849(-) (pilF) [Moorella thermoacetica strain DSM 2955]
ATGGATAGTCGACGACGACTGGGGGACCTGTTGATCGAAGCCGGGATGCTTACCCCGGCCCAGCTGGAACAGGCCCTGCA
GGAACAGAAACGCAGCGGGGAGCGCCTGGGTAAGGTTTTAATCCGCCTGGGATTTATCACCGAGGCCAGCATGCTGGAGG
TCCTGGAGTTCCAGCTGGGGATCCCCAAGGTGGTCCTGGCTGACTACCACCTGGATCCGGAGGTGGTCCGCCTGGTGCCG
GAAGGCCTGGCCCGGCGCTACCAGGCCATCCCCATCCGCCTGGACGGCAACCGCCTCCTGGTGGCCATGGCCGATCCCCT
GAACCTCGTGGCCCTGGACGACCTGCGCCTGGTCACCGGCAAGGAGATTATGCCGGCTATAGCCGCCGAGAAGGAAATCG
AGGCAGCTTTAAGCCGGTTCTGGCAACGGGAACCCGTTACGAGCATGAGCGAAGTAGCGGCAGCCGTCGCCGCCGCGGAA
TCTGGCGGGCGCGCCGGCGGCACGGAAGGCGCGCCGGCTGTGCGCCTGGTCAACAGTTTTATCCAGCAGGCCATCCAGAC
CCGGGCCAGCGACATCCATATAGAGCCCCAGGAGGGGGAGGTCCGGGTGCGCCTGCGGGTAGACGGCCTGCTGCGGGAGT
TGACCCGCCTGCCCCTGGGGGTTTTAAGTAGCCTGATCTCCAGGATCAAGATCATGGCCGGCATGGACATCGCCGAAAAA
CGCTTGCCCCAGGACGGCCGTTTTCAGTTTACCCTGGGTAAACGCAGTGTCGACCTCAGGGTTTCCAGCCTGCCTACTGT
TTACGGCGAAAAGATCGTCCTGCGCCTCCTGGACCAGGAGGCCATGCTCCTGCCCCTGGACGACCTGGGATTTTTGCCGG
CCATAAAAGAACGCTTTGAGAGTCTCATCCACAGTTCCTACGGCATGCTCCTCATTACCGGTCCCACGGGCAGCGGTAAG
ACGACGACCCTTTATGCTACTCTTAACATTTTAAGCTCGCCGGAAAAAAATATCATTACCATTGAGGATCCGGTAGAATA
CCTGCTGCCCGGCATCAATCAGGTGCGGGTTAACCCCAAGGCCGGCCTGACCTTTGCTTCAGGGCTGCGTTCCATCCTGC
GTCAGGACCCGGATATCATTATGGTCGGGGAGATTCGCGACCGGGAGACGGCCGATATCGCCGTCCGGGCGGCGACTACC
GGTCACCTGGTCTTAACGACCCTGCACACCAATGACGCCGCCGGCGCCGTAACCCGCCTCCTGGATATGGGAGTGGAAGG
CTACCTGGTCAATTCCTCCCTTATTGGCGTGGTGGCCCAGCGCCTGGTGCGCCGCATCTGTCCCCATTGCCGGGAGATGT
ACGAGCCGGAGCCGGGCTCTCCGGAAAGGGCCTGGTTGCCGGGCGCGGAACGGCTCTGGCGCGGCCGGGGTTGCGAAAAC
TGCCATTATACCGGTTACACCAACCGGACGGCCATCCAGGAGGTCCTGGTCATGAATGAAGAACTCCGGCGCCTGGTAGC
CGCCAAGGCGCCGGCTACGGCCCTGAAGGAGGCAGCGGTGGCCGGCGGTATGGTTCCTTTGATTGACGACGGTTTGGAAA
AAGCCCGCCAGGGGATCACTACGGTGAGCGAGGTCCTACGCGTTTCCCTGGGAGGTTTGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilF Thermus thermophilus HB27

49.362

99.277

0.49

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

45.833

99.819

0.458

  pilB Vibrio campbellii strain DS40M4

41.831

100

0.421

  pilB Vibrio parahaemolyticus RIMD 2210633

41.281

100

0.42

  pilB Vibrio cholerae strain A1552

41.877

100

0.42

  pilB/pilB1 Synechocystis sp. PCC 6803

37.767

100

0.416

  pilB Acinetobacter baylyi ADP1

40.36

100

0.405

  pilB Acinetobacter baumannii D1279779

39.421

100

0.394

  pilF Neisseria gonorrhoeae MS11

38.321

99.096

0.38

  pilB Legionella pneumophila strain ERS1305867

41.616

89.512

0.373


Multiple sequence alignment