Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilF   Type   Machinery gene
Locus tag   LSQ66_RS01755 Genome accession   NZ_CP088952
Coordinates   355206..356930 (-) Length   574 a.a.
NCBI ID   WP_231768102.1    Uniprot ID   -
Organism   Massilia endophytica strain DM-R-R2A-13     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 303383..391510 355206..356930 within 0


Gene organization within MGE regions


Location: 303383..391510
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LSQ66_RS01490 (LSQ66_01490) - 303643..304086 (-) 444 WP_231768054.1 hypothetical protein -
  LSQ66_RS01495 (LSQ66_01495) rpoH 304211..305104 (-) 894 WP_407659618.1 RNA polymerase sigma factor RpoH -
  LSQ66_RS01500 (LSQ66_01500) ftsX 305405..306319 (-) 915 WP_231768056.1 permease-like cell division protein FtsX -
  LSQ66_RS01505 (LSQ66_01505) - 306316..306981 (-) 666 WP_231768057.1 cell division ATP-binding protein FtsE -
  LSQ66_RS01510 (LSQ66_01510) pilA 306987..308021 (-) 1035 WP_231768058.1 signal recognition particle-docking protein FtsY Machinery gene
  LSQ66_RS01515 (LSQ66_01515) - 308125..309099 (-) 975 WP_231768059.1 MerR family transcriptional regulator -
  LSQ66_RS01520 (LSQ66_01520) - 309722..310801 (+) 1080 WP_231768060.1 porin -
  LSQ66_RS01525 (LSQ66_01525) - 310875..311456 (+) 582 WP_231768061.1 hypothetical protein -
  LSQ66_RS01530 (LSQ66_01530) - 311453..313150 (-) 1698 WP_231768062.1 thiamine pyrophosphate-binding protein -
  LSQ66_RS01540 (LSQ66_01540) glpK 313644..315137 (+) 1494 WP_231768063.1 glycerol kinase GlpK -
  LSQ66_RS01545 (LSQ66_01545) mraZ 315678..316106 (+) 429 WP_231768064.1 division/cell wall cluster transcriptional repressor MraZ -
  LSQ66_RS01550 (LSQ66_01550) rsmH 316116..317081 (+) 966 WP_231768065.1 16S rRNA (cytosine(1402)-N(4))-methyltransferase RsmH -
  LSQ66_RS01555 (LSQ66_01555) ftsL 317078..317350 (+) 273 WP_231768066.1 cell division protein FtsL -
  LSQ66_RS01560 (LSQ66_01560) - 317350..319131 (+) 1782 WP_231768067.1 penicillin-binding protein 2 -
  LSQ66_RS01565 (LSQ66_01565) - 319131..320657 (+) 1527 WP_231768068.1 UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2, 6-diaminopimelate ligase -
  LSQ66_RS01570 (LSQ66_01570) murF 320657..322033 (+) 1377 WP_231768069.1 UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D- alanine ligase -
  LSQ66_RS01575 (LSQ66_01575) mraY 322033..323202 (+) 1170 WP_231768070.1 phospho-N-acetylmuramoyl-pentapeptide- transferase -
  LSQ66_RS01580 (LSQ66_01580) murD 323210..324712 (+) 1503 WP_231768071.1 UDP-N-acetylmuramoyl-L-alanine--D-glutamate ligase -
  LSQ66_RS01585 (LSQ66_01585) ftsW 324712..325917 (+) 1206 WP_231768072.1 putative lipid II flippase FtsW -
  LSQ66_RS01590 (LSQ66_01590) murG 325914..326981 (+) 1068 WP_231768073.1 undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase -
  LSQ66_RS01595 (LSQ66_01595) murC 326978..328369 (+) 1392 WP_231768074.1 UDP-N-acetylmuramate--L-alanine ligase -
  LSQ66_RS01600 (LSQ66_01600) - 328373..329332 (+) 960 WP_231768075.1 D-alanine--D-alanine ligase -
  LSQ66_RS01605 (LSQ66_01605) - 329342..330106 (+) 765 WP_231768076.1 cell division protein FtsQ/DivIB -
  LSQ66_RS01610 (LSQ66_01610) ftsA 330121..331353 (+) 1233 WP_231768077.1 cell division protein FtsA -
  LSQ66_RS01615 (LSQ66_01615) ftsZ 331480..332685 (+) 1206 WP_231768078.1 cell division protein FtsZ -
  LSQ66_RS01620 (LSQ66_01620) - 332810..333313 (+) 504 WP_231768079.1 peroxiredoxin -
  LSQ66_RS01630 (LSQ66_01630) - 335060..335668 (+) 609 WP_231768080.1 DUF2971 domain-containing protein -
  LSQ66_RS01635 (LSQ66_01635) - 335950..336441 (+) 492 WP_231768081.1 type VI secretion system tube protein Hcp -
  LSQ66_RS01640 (LSQ66_01640) - 336451..336939 (+) 489 WP_231768082.1 penicillin-insensitive murein endopeptidase -
  LSQ66_RS01645 (LSQ66_01645) - 337368..337778 (+) 411 WP_231768083.1 STY0301 family protein -
  LSQ66_RS01650 (LSQ66_01650) - 337796..338152 (-) 357 WP_231768084.1 DUF3088 domain-containing protein -
  LSQ66_RS01655 (LSQ66_01655) - 338168..339862 (-) 1695 WP_231768085.1 tannase/feruloyl esterase family alpha/beta hydrolase -
  LSQ66_RS01660 (LSQ66_01660) - 339995..340480 (-) 486 WP_231768086.1 SET domain-containing protein -
  LSQ66_RS01665 (LSQ66_01665) - 340519..340917 (-) 399 WP_231768087.1 hypothetical protein -
  LSQ66_RS01670 (LSQ66_01670) - 341059..341901 (+) 843 WP_231770025.1 DMT family transporter -
  LSQ66_RS01675 (LSQ66_01675) lpxC 341944..342879 (+) 936 WP_231768088.1 UDP-3-O-acyl-N-acetylglucosamine deacetylase -
  LSQ66_RS01680 (LSQ66_01680) - 342883..343350 (-) 468 WP_231768089.1 DciA family protein -
  LSQ66_RS01685 (LSQ66_01685) secA 343578..346340 (+) 2763 WP_231768090.1 preprotein translocase subunit SecA -
  LSQ66_RS01690 (LSQ66_01690) - 346476..346775 (-) 300 WP_231768091.1 helix-turn-helix domain containing protein -
  LSQ66_RS01695 (LSQ66_01695) - 346958..347563 (+) 606 WP_231768092.1 response regulator -
  LSQ66_RS01700 (LSQ66_01700) - 347621..347986 (+) 366 WP_231768093.1 Hpt domain-containing protein -
  LSQ66_RS01705 (LSQ66_01705) - 348005..348406 (+) 402 WP_231768094.1 response regulator -
  LSQ66_RS01710 (LSQ66_01710) argJ 348485..349723 (+) 1239 WP_231768095.1 bifunctional glutamate N-acetyltransferase/amino-acid acetyltransferase ArgJ -
  LSQ66_RS01715 (LSQ66_01715) - 349723..350586 (+) 864 WP_231768096.1 ATP-binding protein -
  LSQ66_RS01720 (LSQ66_01720) - 350583..350972 (+) 390 WP_269449128.1 NUDIX domain-containing protein -
  LSQ66_RS01725 (LSQ66_01725) - 351037..351453 (-) 417 WP_231768097.1 nuclear transport factor 2 family protein -
  LSQ66_RS01730 (LSQ66_01730) yacG 351453..351638 (-) 186 WP_231768098.1 DNA gyrase inhibitor YacG -
  LSQ66_RS01735 (LSQ66_01735) zapD 351641..352396 (-) 756 WP_231768099.1 cell division protein ZapD -
  LSQ66_RS01740 (LSQ66_01740) coaE 352461..353093 (-) 633 WP_231768100.1 dephospho-CoA kinase -
  LSQ66_RS01745 (LSQ66_01745) pilD 353096..353965 (-) 870 WP_269449129.1 A24 family peptidase Machinery gene
  LSQ66_RS01750 (LSQ66_01750) pilC 353981..355195 (-) 1215 WP_231768101.1 type II secretion system F family protein Machinery gene
  LSQ66_RS01755 (LSQ66_01755) pilF 355206..356930 (-) 1725 WP_231768102.1 type IV-A pilus assembly ATPase PilB Machinery gene
  LSQ66_RS01760 (LSQ66_01760) - 356990..358282 (-) 1293 WP_231768103.1 HlyC/CorC family transporter -
  LSQ66_RS01765 (LSQ66_01765) - 358608..359555 (+) 948 WP_231768104.1 IS110 family transposase -
  LSQ66_RS01770 (LSQ66_01770) - 359601..360449 (+) 849 WP_231768105.1 AraC family transcriptional regulator -
  LSQ66_RS01775 (LSQ66_01775) - 360524..361015 (+) 492 WP_269449130.1 NADH-quinone oxidoreductase subunit B family protein -
  LSQ66_RS01780 (LSQ66_01780) - 361055..362287 (+) 1233 WP_231768106.1 glutamate carboxypeptidase -
  LSQ66_RS01785 (LSQ66_01785) uraH 362272..362625 (-) 354 WP_231768107.1 hydroxyisourate hydrolase -
  LSQ66_RS01790 (LSQ66_01790) puuE 362720..363679 (+) 960 WP_231770029.1 allantoinase PuuE -
  LSQ66_RS01795 (LSQ66_01795) xdhA 363759..365228 (+) 1470 WP_231768108.1 xanthine dehydrogenase small subunit -
  LSQ66_RS01800 (LSQ66_01800) xdhB 365240..367567 (+) 2328 WP_231768109.1 xanthine dehydrogenase molybdopterin binding subunit -
  LSQ66_RS01805 (LSQ66_01805) xdhC 367577..368596 (+) 1020 WP_231768110.1 xanthine dehydrogenase accessory protein XdhC -
  LSQ66_RS01810 (LSQ66_01810) guaD 368616..369932 (+) 1317 WP_231768111.1 guanine deaminase -
  LSQ66_RS01815 (LSQ66_01815) - 369963..371411 (+) 1449 WP_231768112.1 adenosine deaminase -
  LSQ66_RS01820 (LSQ66_01820) - 371854..374244 (+) 2391 WP_231768113.1 TonB-dependent receptor -
  LSQ66_RS01825 (LSQ66_01825) - 374379..376073 (+) 1695 WP_231768114.1 bifunctional UDP-sugar hydrolase/5'-nucleotidase -
  LSQ66_RS01830 (LSQ66_01830) - 376077..377291 (+) 1215 WP_231768115.1 phospholipase D-like domain-containing protein -
  LSQ66_RS01835 (LSQ66_01835) - 377292..378350 (+) 1059 WP_231768116.1 S1/P1 nuclease -
  LSQ66_RS01840 (LSQ66_01840) - 378369..379124 (-) 756 WP_231768117.1 DeoR/GlpR family DNA-binding transcription regulator -
  LSQ66_RS01845 (LSQ66_01845) - 379199..379780 (+) 582 WP_231768118.1 NUDIX domain-containing protein -
  LSQ66_RS01850 (LSQ66_01850) - 379871..380131 (+) 261 WP_231770030.1 c-type cytochrome -
  LSQ66_RS01855 (LSQ66_01855) - 380353..381300 (+) 948 WP_231765801.1 IS110 family transposase -
  LSQ66_RS01860 (LSQ66_01860) - 381332..382300 (-) 969 WP_231768119.1 ABC transporter permease -
  LSQ66_RS01865 (LSQ66_01865) - 382305..383387 (-) 1083 WP_231768120.1 ABC transporter permease -
  LSQ66_RS01870 (LSQ66_01870) - 383384..384913 (-) 1530 WP_231768121.1 ABC transporter ATP-binding protein -
  LSQ66_RS01875 (LSQ66_01875) - 384994..385989 (-) 996 WP_231768122.1 BMP family protein -
  LSQ66_RS01880 (LSQ66_01880) - 386486..388738 (+) 2253 WP_231768123.1 TonB-dependent receptor -
  LSQ66_RS01885 (LSQ66_01885) - 388804..389961 (-) 1158 WP_231768124.1 sensor histidine kinase KdpD -
  LSQ66_RS01890 (LSQ66_01890) - 390045..390950 (-) 906 WP_231768125.1 LysR family transcriptional regulator -

Sequence


Protein


Download         Length: 574 a.a.        Molecular weight: 62172.64 Da        Isoelectric Point: 5.0270

>NTDB_id=634101 LSQ66_RS01755 WP_231768102.1 355206..356930(-) (pilF) [Massilia endophytica strain DM-R-R2A-13]
MASVQTNSSAGAPVSGLARALMQAGKLTLPQADALNRRAQAEKLPFIDVLVSSGTVNARELAVFCSETFAYPLMDLAAFS
IDALPPKIIEPKLMQSQRVVALAKRGNKMSVAISDPTNTQALDQIKFQTESSVEPVIVPHDALIRLLNELGKSADQTIAD
LAGEEGDIQFAEEEEANAAPDPSTDVEDAPVVRFLNKMLMDAVNMGASDLHFEPFEKFYRIRFRVDGVLVEHAQPPVSIK
EKLVSRIKVLARLDISEKRIPQDGRMRLIVSPTKTIDLRISTLPTLFGEKVVMRILDATQAQMGIDSLGYDPDQKELLLD
AIQRPYGMVLVTGPTGSGKTVSLYTCLNILNKPGINISTAEDPAEINLPGVNQVNVNDKAGLTFPVALKSFLRQDPDIIM
VGEIRDLETADIAIKAAQTGHMVFSTLHTNDAPSTLTRLMNMGVAPFNIASSVILITAQRLARRLCTCKQPVDISEDLLR
RAGFKDEELDGNWKPYGPVGCERCNGSGYKGRVGIYQIMPITPAIEALILASGNAMQIAAQSESEGVKSLRQSGLVKVKA
GLTSLEEVLGCTNE

Nucleotide


Download         Length: 1725 bp        

>NTDB_id=634101 LSQ66_RS01755 WP_231768102.1 355206..356930(-) (pilF) [Massilia endophytica strain DM-R-R2A-13]
ATGGCATCAGTGCAAACCAACTCGTCGGCTGGCGCTCCCGTGTCAGGGCTGGCGCGGGCGCTGATGCAGGCCGGGAAGCT
GACCCTGCCGCAGGCGGACGCGCTCAACCGCCGGGCCCAGGCCGAAAAGCTGCCCTTCATCGACGTGCTGGTTTCCAGCG
GCACGGTGAACGCACGCGAGCTGGCCGTGTTCTGCTCCGAAACTTTCGCCTATCCCCTGATGGACCTGGCGGCCTTCAGC
ATCGACGCCCTGCCGCCCAAGATCATCGAGCCCAAGCTGATGCAGAGCCAGCGGGTGGTGGCCCTGGCCAAGCGCGGGAA
CAAGATGTCCGTTGCCATCTCGGACCCCACGAATACCCAGGCGCTGGACCAGATCAAGTTCCAGACCGAGAGCTCGGTGG
AACCCGTGATCGTGCCGCACGATGCGCTGATCCGCCTGCTGAATGAACTGGGCAAGAGCGCCGACCAGACCATCGCCGAC
CTGGCGGGCGAGGAAGGCGACATCCAGTTCGCCGAGGAAGAGGAGGCGAACGCCGCCCCCGATCCTTCCACGGACGTGGA
AGACGCGCCGGTGGTGCGCTTCCTGAACAAGATGCTGATGGACGCCGTGAACATGGGCGCGTCCGACCTGCACTTCGAAC
CTTTCGAGAAGTTCTACCGCATCCGCTTCCGCGTGGACGGGGTGCTGGTGGAGCATGCGCAGCCCCCTGTCTCCATCAAG
GAAAAGCTGGTCTCGCGCATCAAGGTGCTGGCGCGGCTGGACATCTCGGAGAAGCGCATTCCGCAGGACGGGCGCATGCG
CCTGATCGTGTCTCCTACGAAGACCATCGACCTGCGCATCTCCACCCTGCCCACCCTCTTCGGCGAGAAGGTGGTGATGC
GTATTCTGGATGCGACCCAGGCCCAGATGGGCATCGACTCCCTGGGCTATGACCCGGACCAGAAGGAGCTGCTGCTGGAC
GCCATCCAGCGCCCCTACGGCATGGTGCTGGTGACGGGGCCCACCGGCTCGGGCAAGACGGTGTCCCTGTACACCTGCCT
GAACATCCTGAACAAGCCCGGCATCAATATCTCGACGGCGGAAGACCCGGCCGAGATCAACCTGCCCGGCGTGAACCAGG
TGAACGTGAACGACAAGGCGGGGCTTACCTTCCCCGTCGCGCTGAAATCCTTCCTGCGCCAGGACCCGGACATCATCATG
GTGGGTGAGATCCGCGACCTGGAAACGGCGGACATTGCGATCAAGGCCGCACAGACCGGCCACATGGTGTTTTCCACCCT
GCACACGAACGACGCACCGTCCACACTCACACGCCTGATGAACATGGGCGTGGCTCCCTTCAATATCGCCTCCTCCGTCA
TCCTGATCACGGCGCAGCGCCTGGCGCGCCGCCTGTGCACCTGCAAGCAGCCGGTCGACATCTCGGAAGACCTGCTGCGC
CGCGCAGGCTTCAAGGACGAAGAACTGGACGGCAACTGGAAGCCATACGGCCCCGTGGGCTGCGAGCGCTGCAACGGCTC
GGGCTACAAGGGCCGCGTGGGCATCTACCAGATCATGCCGATCACGCCCGCCATCGAGGCGCTGATCCTTGCCAGCGGCA
ACGCGATGCAGATCGCCGCGCAGTCGGAGTCGGAAGGCGTGAAGTCGCTGCGGCAGTCCGGACTGGTGAAGGTCAAGGCA
GGCCTCACCAGCCTGGAAGAAGTGCTGGGCTGCACGAACGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilF Neisseria gonorrhoeae MS11

53.534

98.606

0.528

  pilB Acinetobacter baylyi ADP1

52.364

99.477

0.521

  pilB Acinetobacter baumannii D1279779

52.364

99.477

0.521

  pilB Legionella pneumophila strain ERS1305867

48.485

97.735

0.474

  pilB Vibrio cholerae strain A1552

46.346

97.735

0.453

  pilB Vibrio parahaemolyticus RIMD 2210633

45.31

98.432

0.446

  pilB Vibrio campbellii strain DS40M4

45.018

97.909

0.441

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

39.509

92.16

0.364

  pilF Thermus thermophilus HB27

39.024

92.857

0.362