Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilF   Type   Machinery gene
Locus tag   L103DPR2_RS03830 Genome accession   NZ_CP011834
Coordinates   788800..790014 (-) Length   404 a.a.
NCBI ID   WP_055359828.1    Uniprot ID   A0A0P0MDE8
Organism   Limnohabitans sp. 103DPR2     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 783800..795014
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  L103DPR2_RS03800 (L103DPR2_00761) - 784672..785133 (+) 462 WP_055361828.1 NUDIX domain-containing protein -
  L103DPR2_RS03805 (L103DPR2_00762) - 785135..785341 (-) 207 WP_055359824.1 DNA gyrase inhibitor YacG -
  L103DPR2_RS03810 (L103DPR2_00763) zapD 785380..786135 (-) 756 WP_055359825.1 cell division protein ZapD -
  L103DPR2_RS03815 (L103DPR2_00764) coaE 786158..786769 (-) 612 WP_055359826.1 dephospho-CoA kinase -
  L103DPR2_RS03820 (L103DPR2_00765) - 786777..787685 (-) 909 WP_231717677.1 A24 family peptidase -
  L103DPR2_RS03825 (L103DPR2_00766) - 787685..788803 (-) 1119 WP_055359827.1 type II secretion system F family protein -
  L103DPR2_RS03830 (L103DPR2_00767) pilF 788800..790014 (-) 1215 WP_055359828.1 ATPase, T2SS/T4P/T4SS family Machinery gene
  L103DPR2_RS03835 (L103DPR2_00768) - 790075..790674 (-) 600 WP_055359829.1 class I SAM-dependent methyltransferase -
  L103DPR2_RS03840 (L103DPR2_00769) - 790684..791154 (-) 471 WP_055359830.1 hypothetical protein -
  L103DPR2_RS03845 (L103DPR2_00770) - 791338..791610 (+) 273 WP_156339851.1 hypothetical protein -
  L103DPR2_RS03850 (L103DPR2_00771) - 791649..792362 (-) 714 WP_055359832.1 YebC/PmpR family DNA-binding transcriptional regulator -
  L103DPR2_RS03855 (L103DPR2_00772) - 792455..792640 (-) 186 WP_055359833.1 hypothetical protein -
  L103DPR2_RS03860 (L103DPR2_00773) - 792750..793019 (+) 270 WP_146179244.1 hypothetical protein -
  L103DPR2_RS03865 (L103DPR2_00774) - 793146..793628 (+) 483 WP_156339852.1 hypothetical protein -
  L103DPR2_RS03870 (L103DPR2_00775) - 793781..794179 (+) 399 WP_156339853.1 hypothetical protein -
  L103DPR2_RS03875 (L103DPR2_00776) - 794223..794564 (+) 342 WP_055359837.1 hypothetical protein -

Sequence


Protein


Download         Length: 404 a.a.        Molecular weight: 44903.84 Da        Isoelectric Point: 7.5444

>NTDB_id=148898 L103DPR2_RS03830 WP_055359828.1 788800..790014(-) (pilF) [Limnohabitans sp. 103DPR2]
MDNTLSHPQPAAVEAPVVHYLQQTWETAAQLRASDLHFEPFENFYRVRLRIDGVLQEIPPPPYEFKDQIASRIKVLAKLD
IAEKRLPQDGRMTIGLRNLQRLNLRVSTLPTLFGEKLVVRVLATDMAQLHLDHLGYGPQEKQRMLEAIHKTQGLILVTGP
TGSGKSQSLYACLHLLNRPEINIATVEDPSEIQLNGVNQVNVKEQIGLNFASSLKAFLRQDPDVIMLGEIRDPETADIAI
KASQTGHLVMSTLHTNDAAGALVRLRNMGVASYNLAASISLISAQRLIRRLCLHCRKPMHVSLQALRELGLQTSFQDLPF
EPVFFSAQGCSQCHKGYWGRIGLFEVMPMSPSLRQCVENDSSHVQLATQAYKEGVHSLRHAGLIQAAMGITSMAEVLTQT
ECMA

Nucleotide


Download         Length: 1215 bp        

>NTDB_id=148898 L103DPR2_RS03830 WP_055359828.1 788800..790014(-) (pilF) [Limnohabitans sp. 103DPR2]
ATGGACAACACTCTTTCCCATCCCCAACCGGCTGCAGTTGAAGCCCCTGTGGTTCATTATTTGCAGCAGACATGGGAAAC
AGCAGCCCAACTGCGCGCCTCCGATCTTCACTTCGAACCTTTCGAGAATTTTTACCGAGTCCGTTTGCGCATCGATGGCG
TGTTGCAAGAAATTCCGCCACCCCCCTACGAATTCAAAGACCAAATTGCCTCCCGCATCAAGGTGCTGGCCAAGTTAGAC
ATTGCAGAAAAAAGATTGCCACAAGATGGGCGCATGACAATCGGTTTGCGCAATTTGCAACGCTTGAATTTAAGGGTCAG
CACACTGCCTACTTTGTTTGGCGAGAAGTTGGTGGTTCGGGTCTTAGCCACCGACATGGCGCAACTGCATCTTGACCATT
TGGGCTATGGTCCCCAGGAAAAGCAGCGCATGCTAGAGGCCATCCACAAAACACAAGGCCTCATTTTGGTCACGGGCCCC
ACAGGCTCGGGCAAGTCGCAATCACTTTATGCCTGCCTGCACCTTTTGAACCGGCCCGAAATCAACATCGCCACTGTCGA
AGACCCCTCCGAAATTCAGCTGAATGGCGTCAACCAAGTCAATGTCAAAGAACAAATTGGGCTGAACTTTGCCAGTTCAC
TCAAAGCCTTCCTCCGACAAGATCCCGACGTCATCATGCTTGGCGAAATCAGAGACCCTGAAACAGCCGACATCGCCATC
AAGGCTTCACAAACAGGTCATTTGGTCATGTCCACGTTGCACACCAATGATGCCGCAGGCGCACTGGTTCGTCTTCGCAA
CATGGGCGTGGCGTCCTACAACTTGGCCGCCAGCATCAGCCTCATTTCTGCGCAAAGGCTGATTCGGCGTTTGTGTTTGC
ATTGCCGAAAACCCATGCATGTCTCGCTTCAAGCCCTGCGCGAATTGGGCCTGCAAACGTCCTTTCAAGACTTGCCGTTT
GAACCTGTCTTTTTCAGTGCGCAGGGTTGTTCGCAATGTCACAAAGGCTATTGGGGACGCATTGGCTTGTTTGAAGTCAT
GCCCATGAGCCCCAGTTTGCGGCAATGCGTTGAAAATGACAGCAGCCATGTTCAATTGGCAACGCAAGCGTACAAAGAAG
GCGTCCACTCGCTTCGACATGCAGGTTTAATTCAAGCCGCCATGGGCATCACATCGATGGCCGAAGTGCTCACGCAAACT
GAGTGCATGGCATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0P0MDE8

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilF Neisseria gonorrhoeae MS11

51.372

99.257

0.51

  pilB Vibrio parahaemolyticus RIMD 2210633

52.48

94.802

0.498

  pilB Acinetobacter baylyi ADP1

51.289

96.04

0.493

  pilB Acinetobacter baumannii D1279779

51.157

96.287

0.493

  pilB Vibrio campbellii strain DS40M4

51.958

94.802

0.493

  pilB Vibrio cholerae strain A1552

51.562

95.05

0.49

  pilB Legionella pneumophila strain ERS1305867

47.396

95.05

0.45

  pilB/pilB1 Synechocystis sp. PCC 6803

39.952

100

0.408

  pilB Haemophilus influenzae Rd KW20

43.081

94.802

0.408

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

42.199

96.782

0.408

  pilF Thermus thermophilus HB27

41.944

96.782

0.406

  pilB Haemophilus influenzae 86-028NP

42.559

94.802

0.403


Multiple sequence alignment