Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilF   Type   Machinery gene
Locus tag   B4X03_RS04730 Genome accession   NZ_CP020085
Coordinates   939930..940511 (-) Length   193 a.a.
NCBI ID   WP_021110392.1    Uniprot ID   A0A084ELH5
Organism   Glaesserella parasuis strain CL120103     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 899345..950959 939930..940511 within 0


Gene organization within MGE regions


Location: 899345..950959
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  B4X03_RS12065 - 899345..899557 (+) 213 WP_021119235.1 hypothetical protein -
  B4X03_RS04510 rpoD 899877..901733 (-) 1857 WP_082259131.1 RNA polymerase sigma factor RpoD -
  B4X03_RS04515 dnaG 901832..903580 (-) 1749 WP_021112720.1 DNA primase -
  B4X03_RS04520 rpsU 903724..903939 (-) 216 WP_005712190.1 30S ribosomal protein S21 -
  B4X03_RS04525 - 904109..904513 (-) 405 WP_052317703.1 hypothetical protein -
  B4X03_RS04530 - 904833..905930 (+) 1098 Protein_879 ISAs1 family transposase -
  B4X03_RS04535 - 905964..906539 (-) 576 Protein_880 ATP-binding cassette domain-containing protein -
  B4X03_RS04540 - 906660..907073 (-) 414 WP_257788245.1 ISAs1 family transposase -
  B4X03_RS11230 - 907087..907392 (-) 306 WP_106379859.1 integrase core domain-containing protein -
  B4X03_RS11235 - 907447..907872 (-) 426 WP_071610673.1 IS3 family transposase -
  B4X03_RS04550 - 907911..908234 (-) 324 WP_021118421.1 transposase -
  B4X03_RS04555 tilS 908287..909552 (-) 1266 WP_082259132.1 tRNA lysidine(34) synthetase TilS -
  B4X03_RS04560 pdxY 909559..910419 (-) 861 WP_021114409.1 pyridoxal kinase PdxY -
  B4X03_RS04565 accA 910505..911458 (-) 954 WP_021114408.1 acetyl-CoA carboxylase carboxyl transferase subunit alpha -
  B4X03_RS04570 yhbY 911664..911960 (-) 297 Protein_888 ribosome assembly RNA-binding protein YhbY -
  B4X03_RS11510 - 911973..913031 (-) 1059 Protein_889 ISAs1 family transposase -
  B4X03_RS04585 - 913452..914348 (+) 897 WP_021113240.1 transcriptional regulator GcvA -
  B4X03_RS04590 yjjG 914370..915044 (+) 675 WP_010786074.1 pyrimidine 5'-nucleotidase -
  B4X03_RS04595 cysB 915062..916045 (+) 984 WP_021112000.1 HTH-type transcriptional regulator CysB -
  B4X03_RS04600 - 916045..916686 (+) 642 WP_043896638.1 hypothetical protein -
  B4X03_RS04605 cpdA 916889..917716 (+) 828 WP_021117792.1 3',5'-cyclic-AMP phosphodiesterase -
  B4X03_RS04610 - 917815..918800 (-) 986 Protein_895 2-hydroxyacid dehydrogenase -
  B4X03_RS04615 ispG 918823..919932 (-) 1110 WP_021112005.1 flavodoxin-dependent (E)-4-hydroxy-3-methylbut-2-enyl-diphosphate synthase -
  B4X03_RS04620 - 919934..920974 (-) 1041 WP_021114535.1 RodZ domain-containing protein -
  B4X03_RS04625 sppA 921109..922974 (-) 1866 WP_021114536.1 signal peptide peptidase SppA -
  B4X03_RS04630 msbA 923192..924973 (+) 1782 WP_021113241.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  B4X03_RS04635 pnuC 925113..925796 (+) 684 WP_021113230.1 nicotinamide riboside transporter PnuC -
  B4X03_RS04640 hda 925888..926598 (+) 711 WP_021113223.1 DnaA regulatory inactivator Hda -
  B4X03_RS04645 ccmA 926599..927225 (+) 627 WP_021114537.1 cytochrome c biogenesis heme-transporting ATPase CcmA -
  B4X03_RS04650 ccmB 927235..927897 (+) 663 WP_021113229.1 heme exporter protein CcmB -
  B4X03_RS04655 - 927944..928684 (+) 741 WP_021114538.1 heme ABC transporter permease -
  B4X03_RS04660 ccmD 928693..928887 (+) 195 WP_010786065.1 heme exporter protein CcmD -
  B4X03_RS04665 - 929052..930440 (+) 1389 WP_082259135.1 DASS family sodium-coupled anion symporter -
  B4X03_RS04670 - 930679..931854 (+) 1176 WP_082259136.1 phosphoglycerate kinase -
  B4X03_RS04675 fbaA 931996..933075 (+) 1080 WP_082259137.1 class II fructose-bisphosphate aldolase -
  B4X03_RS04680 - 933247..934047 (+) 801 WP_005710396.1 ATP-binding cassette domain-containing protein -
  B4X03_RS04685 - 934060..934698 (+) 639 WP_021114541.1 nitroreductase family protein -
  B4X03_RS12070 - 934787..934876 (+) 90 Protein_911 TIGR00645 family protein -
  B4X03_RS04690 - 934911..936008 (-) 1098 WP_082259138.1 ISAs1 family transposase -
  B4X03_RS04695 - 936083..936535 (+) 453 Protein_913 TIGR00645 family protein -
  B4X03_RS04700 - 936622..937920 (-) 1299 WP_082259140.1 NAD(P)/FAD-dependent oxidoreductase -
  B4X03_RS04720 greB 938446..938934 (-) 489 WP_005710400.1 transcription elongation factor GreB -
  B4X03_RS04725 - 938934..939869 (-) 936 WP_005710401.1 KpsF/GutQ family sugar-phosphate isomerase -
  B4X03_RS04730 pilF 939930..940511 (-) 582 WP_021110392.1 type IV pilus biogenesis/stability protein PilW Machinery gene
  B4X03_RS04735 - 940611..941771 (-) 1161 WP_005710403.1 bifunctional tRNA (adenosine(37)-C2)-methyltransferase TrmG/ribosomal RNA large subunit methyltransferase RlmN -
  B4X03_RS04740 - 941972..942280 (-) 309 WP_005710405.1 helix-turn-helix domain-containing protein -
  B4X03_RS11515 - 942421..942621 (-) 201 WP_021119412.1 hypothetical protein -
  B4X03_RS04770 gltX 943456..944895 (+) 1440 WP_082259141.1 glutamate--tRNA ligase -
  B4X03_RS04780 - 945362..945622 (-) 261 Protein_922 cation-transporting ATPase PacS -
  B4X03_RS04785 - 945583..946517 (-) 935 Protein_923 transposase -
  B4X03_RS04790 - 946657..947031 (-) 375 WP_082259142.1 cytochrome b562 -
  B4X03_RS04795 fabB 947108..948325 (-) 1218 WP_021114544.1 beta-ketoacyl-ACP synthase I -
  B4X03_RS04800 purA 948490..949788 (-) 1299 WP_082259143.1 adenylosuccinate synthase -
  B4X03_RS04805 hflC 950072..950959 (-) 888 WP_021111697.1 protease modulator HflC -

Sequence


Protein


Download         Length: 193 a.a.        Molecular weight: 22339.24 Da        Isoelectric Point: 8.0872

>NTDB_id=220961 B4X03_RS04730 WP_021110392.1 939930..940511(-) (pilF) [Glaesserella parasuis strain CL120103]
MTFAKFFRNLTACSVALWLVACANQTTQVDFNRSEAVKARINLALAYLEQSDFPKAKENIDKALEHDVKDYLPHSVLAYY
YQQTGDNQKAEEAYQQALKLSKTQSKNNQVRPDVLNNYGTFLCKQKQFDKAYQQFETALTSQEAYYNQADTLENIALCAN
MKKDIEKQKTALSQLEKIDKSRAERLYNFMEIR

Nucleotide


Download         Length: 582 bp        

>NTDB_id=220961 B4X03_RS04730 WP_021110392.1 939930..940511(-) (pilF) [Glaesserella parasuis strain CL120103]
ATGACCTTTGCAAAATTTTTCCGTAATTTGACCGCTTGTAGCGTGGCATTATGGCTTGTTGCTTGTGCGAACCAAACTAC
TCAGGTGGATTTTAACCGCTCGGAAGCCGTCAAAGCTCGCATTAATTTGGCCCTTGCTTACCTTGAACAATCTGATTTTC
CAAAAGCGAAAGAGAACATTGACAAAGCGCTTGAACACGATGTAAAAGACTACCTACCGCACTCTGTGCTTGCCTATTAT
TACCAACAAACTGGCGATAACCAAAAAGCAGAAGAAGCCTACCAGCAAGCCTTGAAGTTAAGTAAAACACAGAGCAAAAA
CAATCAAGTTCGCCCTGATGTATTGAATAACTACGGTACCTTTCTGTGCAAACAAAAACAGTTCGACAAAGCCTATCAGC
AATTTGAAACGGCATTAACCAGTCAAGAAGCCTATTACAATCAAGCAGATACACTAGAAAATATCGCCTTATGTGCCAAT
ATGAAGAAGGATATTGAGAAGCAAAAAACAGCACTATCTCAACTTGAGAAAATAGATAAATCAAGAGCAGAACGACTATA
TAATTTCATGGAGATTAGATAG

Domains


Predicted by InterproScan.

(74-102)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A084ELH5

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilF Glaesserella parasuis strain SC1401

99.482

100

0.995

  pilF/pilF2 Haemophilus influenzae Rd KW20

44.693

92.746

0.415


Multiple sequence alignment