Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilF   Type   Machinery gene
Locus tag   MTCOM_RS07735 Genome accession   NZ_CP136551
Coordinates   1515598..1517259 (-) Length   553 a.a.
NCBI ID   WP_338817898.1    Uniprot ID   -
Organism   Moorella thermoacetica strain COM     
Function   power the assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1510598..1522259
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MTCOM_RS07710 (MTCOM_15360) - 1510851..1511372 (-) 522 WP_338817893.1 type II secretion system protein -
  MTCOM_RS07715 (MTCOM_15370) - 1511404..1512129 (-) 726 WP_338817894.1 prepilin peptidase -
  MTCOM_RS07720 (MTCOM_15380) - 1512377..1512907 (-) 531 WP_338817895.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  MTCOM_RS07725 (MTCOM_15390) - 1513157..1514365 (-) 1209 WP_338817896.1 type II secretion system F family protein -
  MTCOM_RS07730 (MTCOM_15400) pilT 1514565..1515584 (-) 1020 WP_338817897.1 type IV pilus twitching motility protein PilT Machinery gene
  MTCOM_RS07735 (MTCOM_15410) pilF 1515598..1517259 (-) 1662 WP_338817898.1 GspE/PulE family protein Machinery gene
  MTCOM_RS07740 (MTCOM_15420) aroB 1517265..1518359 (-) 1095 WP_338817899.1 3-dehydroquinate synthase -
  MTCOM_RS07745 (MTCOM_15430) - 1518338..1518859 (-) 522 WP_338817900.1 shikimate kinase -
  MTCOM_RS07750 (MTCOM_15440) aroC 1518913..1520064 (-) 1152 WP_338817901.1 chorismate synthase -
  MTCOM_RS07755 (MTCOM_15450) - 1520089..1520976 (-) 888 WP_071522534.1 shikimate dehydrogenase -
  MTCOM_RS07760 (MTCOM_15460) - 1521096..1521617 (-) 522 WP_011393068.1 YqeG family HAD IIIA-type phosphatase -

Sequence


Protein


Download         Length: 553 a.a.        Molecular weight: 60540.26 Da        Isoelectric Point: 6.0632

>NTDB_id=891567 MTCOM_RS07735 WP_338817898.1 1515598..1517259(-) (pilF) [Moorella thermoacetica strain COM]
MDSRRRLGDLLIEAGMLTPAQLEQALQEQKRSGERLGKVLIRLGFITEASMLEVLEFQLGIPKVVLADYYLDPEVVRLVP
EGLARRYQAIPIRLDGNRLLVAMADPLNLVALDDLRLVTGKEIMPAIAAEKEIEAALSRFWQREPVTSMSEVAAAVAAAE
SGGRAGGTEGAPAVRLVNSFIQQAIQTRASDIHIEPQEGEVRVRLRVDGLLRELTRLPLGVLSSLISRIKIMAGMDIAEK
RLPQDGRFQFTLGKRSVDLRVSSLPTVYGEKIVLRLLDQEAMLLPLDDLGFLPAIKERFESLIHSSYGMLLITGPTGSGK
TTTLYATLNILSSPEKNIITIEDPVEYLLPGINQVRVNPKAGLTFASGLRSILRQDPDIIMVGEIRDRETADIAVRAATT
GHLVLTTLHTNDAAGAVTRLLDMGVEGYLVNSSLIGVVAQRLVRRICPHCREMYEPEPGSPERAWLPGEERLWRGRGCEN
CHYTGYANRTAIQEVLVMNEELRRLVAAKAPVTALKEAAVAGGMVPLIDDGLEKARQGITTVSEVLRVSLGGL

Nucleotide


Download         Length: 1662 bp        

>NTDB_id=891567 MTCOM_RS07735 WP_338817898.1 1515598..1517259(-) (pilF) [Moorella thermoacetica strain COM]
ATGGATAGTCGACGACGACTGGGGGACCTGTTGATCGAAGCCGGGATGCTTACCCCGGCCCAGCTGGAACAGGCCCTGCA
GGAACAGAAACGCAGCGGGGAGCGCCTGGGTAAGGTTTTAATCCGCCTGGGATTTATCACCGAGGCCAGCATGCTGGAGG
TCCTGGAGTTCCAGCTGGGGATCCCCAAGGTGGTCCTGGCTGACTACTACCTGGATCCGGAGGTGGTCCGCCTGGTGCCG
GAAGGCCTGGCCCGGCGCTACCAGGCCATCCCCATCCGCCTGGACGGCAATCGCCTCCTGGTGGCCATGGCCGATCCCCT
GAACCTCGTGGCCCTGGACGACCTGCGCCTGGTCACCGGCAAGGAGATTATGCCGGCTATAGCCGCCGAGAAGGAAATCG
AGGCAGCTTTAAGCCGGTTCTGGCAACGGGAACCCGTTACGAGCATGAGCGAAGTAGCGGCAGCCGTCGCCGCCGCGGAA
TCTGGCGGGCGCGCCGGCGGCACGGAAGGCGCGCCGGCTGTGCGCCTGGTCAACAGTTTTATCCAGCAGGCCATCCAGAC
CCGGGCCAGCGACATCCATATAGAGCCCCAGGAGGGGGAGGTCCGGGTGCGCCTGCGGGTAGACGGCCTGCTGCGGGAGT
TGACCCGCCTGCCCCTGGGGGTTTTAAGTAGCCTGATCTCCAGGATCAAGATCATGGCCGGCATGGACATCGCCGAAAAA
CGCTTGCCCCAGGACGGCCGTTTTCAATTTACCCTGGGTAAACGCAGTGTCGACCTCAGGGTTTCCAGCCTGCCTACTGT
TTACGGCGAAAAGATCGTCCTGCGCCTCCTGGACCAGGAGGCCATGCTCCTGCCCCTGGACGACCTGGGATTTTTGCCGG
CCATAAAAGAACGCTTTGAGAGCCTCATCCACAGTTCCTACGGTATGCTCCTCATTACCGGTCCCACGGGCAGCGGTAAG
ACGACGACCCTCTATGCCACCCTTAACATTTTAAGCTCGCCGGAAAAAAATATCATTACCATTGAGGACCCGGTAGAATA
CCTTCTGCCCGGCATCAATCAGGTGCGGGTCAACCCCAAGGCCGGCCTGACCTTTGCTTCAGGGCTGCGTTCCATCCTGC
GTCAGGATCCGGATATCATCATGGTGGGGGAAATCCGCGACCGGGAGACGGCCGATATCGCCGTCCGGGCGGCGACTACC
GGTCACCTGGTCTTAACGACCCTGCACACCAATGACGCCGCCGGCGCCGTCACCCGCCTCCTGGATATGGGAGTGGAAGG
CTACCTGGTCAATTCCTCCCTTATTGGCGTGGTGGCCCAGCGCCTGGTGCGCCGCATCTGTCCCCATTGCCGGGAGATGT
ACGAGCCGGAGCCTGGCTCTCCGGAAAGGGCCTGGTTGCCGGGCGAGGAACGACTCTGGCGCGGCCGGGGTTGCGAAAAC
TGCCATTATACCGGTTACGCCAACCGGACGGCCATCCAGGAGGTCCTGGTCATGAATGAAGAACTCCGGCGCCTGGTAGC
CGCCAAGGCGCCGGTTACGGCCCTGAAGGAGGCGGCGGTGGCCGGCGGTATGGTTCCTTTAATCGATGACGGTTTGGAAA
AAGCCCGCCAGGGGATCACTACGGTGAGCGAGGTCCTACGCGTTTCCCTGGGAGGTTTGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilF Thermus thermophilus HB27

48.998

99.277

0.486

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

45.091

99.458

0.448

  pilB Vibrio campbellii strain DS40M4

42.011

100

0.423

  pilB Vibrio parahaemolyticus RIMD 2210633

41.459

100

0.421

  pilB Vibrio cholerae strain A1552

41.877

100

0.42

  pilB/pilB1 Synechocystis sp. PCC 6803

37.931

100

0.418

  pilB Acinetobacter baylyi ADP1

40.36

100

0.405

  pilB Acinetobacter baumannii D1279779

39.35

100

0.394

  pilF Neisseria gonorrhoeae MS11

38.321

99.096

0.38

  pilB Legionella pneumophila strain ERS1305867

41.616

89.512

0.373