Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilF   Type   Machinery gene
Locus tag   MothHH_RS08335 Genome accession   NZ_CP031054
Coordinates   1613193..1614854 (-) Length   553 a.a.
NCBI ID   WP_011393063.1    Uniprot ID   -
Organism   Moorella thermoacetica strain 39073-HH     
Function   power the assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1608193..1619854
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MothHH_RS08310 (MothHH_01663) - 1608531..1609052 (-) 522 WP_071522541.1 type II secretion system protein -
  MothHH_RS08315 (MothHH_01664) - 1609084..1609827 (-) 744 WP_011393059.1 A24 family peptidase -
  MothHH_RS08320 (MothHH_01665) - 1609996..1610511 (-) 516 WP_011393060.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  MothHH_RS08325 (MothHH_01666) - 1610756..1611964 (-) 1209 WP_011393061.1 type II secretion system F family protein -
  MothHH_RS08330 (MothHH_01667) pilT 1612160..1613179 (-) 1020 WP_011393062.1 type IV pilus twitching motility protein PilT Machinery gene
  MothHH_RS08335 (MothHH_01668) pilF 1613193..1614854 (-) 1662 WP_011393063.1 GspE/PulE family protein Machinery gene
  MothHH_RS08340 (MothHH_01669) aroB 1614860..1615954 (-) 1095 WP_011393064.1 3-dehydroquinate synthase -
  MothHH_RS08345 (MothHH_01670) - 1615933..1616454 (-) 522 WP_011393065.1 shikimate kinase -
  MothHH_RS08350 (MothHH_01671) aroC 1616507..1617658 (-) 1152 WP_011393066.1 chorismate synthase -
  MothHH_RS08355 (MothHH_01672) - 1617683..1618570 (-) 888 WP_011393067.1 shikimate dehydrogenase -
  MothHH_RS08360 (MothHH_01673) - 1618690..1619211 (-) 522 WP_011393068.1 YqeG family HAD IIIA-type phosphatase -

Sequence


Protein


Download         Length: 553 a.a.        Molecular weight: 60458.16 Da        Isoelectric Point: 6.4190

>NTDB_id=302937 MothHH_RS08335 WP_011393063.1 1613193..1614854(-) (pilF) [Moorella thermoacetica strain 39073-HH]
MDSRRRLGDLLIEAGMLTPAQLEQALQEQKRSGERLGKVLIRLGFITEASMLEVLEFQLGIPKVVLADYHLDPEVVRLVP
EGLARRYQAIPIRLDGNRLLVAMADPLNLVALDDLRLVTGKEIMPAIAAEKEIEAALSRFWQREPVTSMSEVAAAVAAAE
SGGRAGGTEGAPAVRLVNSFIQQAIQTRASDIHIEPQEGEVRVRLRVDGLLRELTRLPLGVLSSLISRIKIMAGMDIAEK
RLPQDGRFQFTLGKRSVDLRVSSLPTVYGEKIVLRLLDQEAMLLPLDDLGFLPAIKERFESLIHSSYGMLLITGPTGSGK
TTTLYATLNILSSPEKNIITIEDPVEYLLPGINQVRVNPKAGLTFASGLRSILRQDPDIIMVGEIRDRETADIAVRAATT
GHLVLTTLHTNDAAGAVTRLLDMGVEGYLVNSSLIGVVAQRLVRRICPHCREMYEPEPGSPERAWLPGAERLWRGRGCEN
CHYTGYTNRTAIQEVLVMNEELRRLVAAKAPATALKEAAVAGGMVPLIDDGLEKARQGITTVSEVLRVSLGGL

Nucleotide


Download         Length: 1662 bp        

>NTDB_id=302937 MothHH_RS08335 WP_011393063.1 1613193..1614854(-) (pilF) [Moorella thermoacetica strain 39073-HH]
ATGGATAGTCGACGACGACTGGGGGACCTGTTGATCGAAGCCGGGATGCTTACCCCGGCCCAGCTGGAACAGGCCCTGCA
GGAACAGAAACGCAGCGGGGAGCGCCTGGGTAAGGTTTTAATCCGCCTGGGATTTATCACCGAGGCCAGCATGCTGGAGG
TCCTGGAGTTCCAGCTGGGGATCCCCAAGGTGGTCCTGGCTGACTACCACCTGGATCCGGAGGTGGTCCGCCTGGTGCCG
GAAGGCCTGGCCCGGCGCTACCAGGCCATCCCCATCCGCCTGGACGGCAACCGCCTCCTGGTGGCCATGGCCGATCCCCT
GAACCTCGTGGCCCTGGACGACCTGCGCCTGGTCACCGGCAAGGAGATTATGCCGGCTATAGCCGCCGAGAAGGAAATCG
AGGCAGCTTTAAGCCGGTTCTGGCAACGGGAACCCGTTACGAGCATGAGCGAAGTAGCGGCAGCCGTCGCCGCCGCGGAA
TCTGGCGGGCGCGCCGGCGGCACGGAAGGCGCGCCGGCTGTGCGCCTGGTCAACAGTTTTATCCAGCAGGCCATCCAGAC
CCGGGCCAGCGACATCCATATAGAGCCCCAGGAGGGGGAGGTCCGGGTGCGCCTGCGGGTAGACGGCCTGCTGCGGGAGT
TGACCCGCCTGCCCCTGGGGGTTTTAAGTAGCCTGATCTCCAGGATCAAGATCATGGCCGGCATGGACATCGCCGAAAAA
CGCTTGCCCCAGGACGGCCGTTTTCAGTTTACCCTGGGTAAACGCAGTGTCGACCTCAGGGTTTCCAGCCTGCCTACTGT
TTACGGCGAAAAGATCGTCCTGCGCCTCCTGGACCAGGAGGCCATGCTCCTGCCCCTGGACGACCTGGGATTTTTGCCGG
CCATAAAAGAACGCTTTGAGAGTCTCATCCACAGTTCCTACGGCATGCTCCTCATTACCGGTCCCACGGGCAGCGGTAAG
ACGACGACCCTTTATGCTACTCTTAACATTTTAAGCTCGCCGGAAAAAAATATCATTACCATTGAGGATCCGGTAGAATA
CCTGCTGCCCGGCATCAATCAGGTGCGGGTTAACCCCAAGGCCGGCCTGACCTTTGCTTCAGGGCTGCGTTCCATCCTGC
GTCAGGACCCGGATATCATTATGGTCGGGGAGATTCGCGACCGGGAGACGGCCGATATCGCCGTCCGGGCGGCGACTACC
GGTCACCTGGTCTTAACGACCCTGCACACCAATGACGCCGCCGGCGCCGTAACCCGCCTCCTGGATATGGGAGTGGAAGG
CTACCTGGTCAATTCCTCCCTTATTGGCGTGGTGGCCCAGCGCCTGGTGCGCCGCATCTGTCCCCATTGCCGGGAGATGT
ACGAGCCGGAGCCGGGCTCTCCGGAAAGGGCCTGGTTGCCGGGCGCGGAACGGCTCTGGCGCGGCCGGGGTTGCGAAAAC
TGCCATTATACCGGTTACACCAACCGGACGGCCATCCAGGAGGTCCTGGTCATGAATGAAGAACTCCGGCGCCTGGTAGC
CGCCAAGGCGCCGGCTACGGCCCTGAAGGAGGCAGCGGTGGCCGGCGGTATGGTTCCTTTGATTGACGACGGTTTGGAAA
AAGCCCGCCAGGGGATCACTACGGTGAGCGAGGTCCTACGCGTTTCCCTGGGAGGTTTGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilF Thermus thermophilus HB27

49.362

99.277

0.49

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

45.833

99.819

0.458

  pilB Vibrio campbellii strain DS40M4

41.831

100

0.421

  pilB Vibrio parahaemolyticus RIMD 2210633

41.281

100

0.42

  pilB Vibrio cholerae strain A1552

41.877

100

0.42

  pilB/pilB1 Synechocystis sp. PCC 6803

37.767

100

0.416

  pilB Acinetobacter baylyi ADP1

40.36

100

0.405

  pilB Acinetobacter baumannii D1279779

39.421

100

0.394

  pilF Neisseria gonorrhoeae MS11

38.321

99.096

0.38

  pilB Legionella pneumophila strain ERS1305867

41.616

89.512

0.373


Multiple sequence alignment