Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   NLY38_RS02705 Genome accession   NZ_CP100553
Coordinates   566689..567753 (+) Length   354 a.a.
NCBI ID   WP_003246678.1    Uniprot ID   -
Organism   Pseudomonas hydrolytica strain KHPS2     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 552763..582869 566689..567753 within 0


Gene organization within MGE regions


Location: 552763..582869
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NLY38_RS02650 (NLY38_02650) - 552763..553134 (-) 372 WP_011920751.1 gamma-butyrobetaine hydroxylase-like domain-containing protein -
  NLY38_RS02655 (NLY38_02655) hslU 553216..554556 (-) 1341 WP_195881195.1 HslU--HslV peptidase ATPase subunit -
  NLY38_RS02660 (NLY38_02660) hslV 554596..555126 (-) 531 WP_004372920.1 ATP-dependent protease subunit HslV -
  NLY38_RS02665 (NLY38_02665) - 555223..555897 (-) 675 WP_195881194.1 SPOR domain-containing protein -
  NLY38_RS02670 (NLY38_02670) argS 555898..557637 (-) 1740 WP_195881193.1 arginine--tRNA ligase -
  NLY38_RS02675 (NLY38_02675) - 557809..560028 (-) 2220 WP_195881192.1 primosomal protein N' -
  NLY38_RS02680 (NLY38_02680) rpmE 560185..560397 (+) 213 WP_004372913.1 50S ribosomal protein L31 -
  NLY38_RS02685 (NLY38_02685) - 560424..561224 (+) 801 WP_195881191.1 thermonuclease family protein -
  NLY38_RS02690 (NLY38_02690) - 561221..562657 (+) 1437 WP_195881190.1 M48 family metalloprotease -
  NLY38_RS02695 (NLY38_02695) - 562762..564030 (+) 1269 WP_011920758.1 malic enzyme-like NAD(P)-binding protein -
  NLY38_RS02700 (NLY38_02700) - 564096..566507 (-) 2412 WP_195881189.1 penicillin-binding protein 1A -
  NLY38_RS02705 (NLY38_02705) comM 566689..567753 (+) 1065 WP_003246678.1 pilus assembly protein PilM Machinery gene
  NLY38_RS02710 (NLY38_02710) pilN 567753..568325 (+) 573 WP_195881188.1 type 4a pilus biogenesis protein PilN -
  NLY38_RS02715 (NLY38_02715) pilO 568322..568942 (+) 621 WP_195881187.1 type 4a pilus biogenesis protein PilO -
  NLY38_RS02720 (NLY38_02720) pilP 568942..569466 (+) 525 WP_080518515.1 type 4a pilus biogenesis lipoprotein PilP -
  NLY38_RS02725 (NLY38_02725) pilQ 569522..571639 (+) 2118 WP_195881186.1 type IV pilus secretin PilQ Machinery gene
  NLY38_RS02730 (NLY38_02730) aroK 571645..572163 (+) 519 WP_003246673.1 shikimate kinase AroK -
  NLY38_RS02735 (NLY38_02735) aroB 572339..573442 (+) 1104 WP_195881185.1 3-dehydroquinate synthase -
  NLY38_RS02740 (NLY38_02740) - 573453..575078 (+) 1626 WP_254348304.1 SPOR domain-containing protein -
  NLY38_RS02745 (NLY38_02745) gltB 575316..579764 (+) 4449 WP_011920767.1 glutamate synthase large subunit -
  NLY38_RS02750 (NLY38_02750) - 579799..581217 (+) 1419 WP_195881183.1 FAD-dependent oxidoreductase -
  NLY38_RS02755 (NLY38_02755) hemE 581336..582403 (+) 1068 WP_195881182.1 uroporphyrinogen decarboxylase -
  NLY38_RS02760 (NLY38_02760) - 582438..582869 (-) 432 WP_195881181.1 hypothetical protein -

Sequence


Protein


Download         Length: 354 a.a.        Molecular weight: 37831.21 Da        Isoelectric Point: 4.4688

>NTDB_id=705816 NLY38_RS02705 WP_003246678.1 566689..567753(+) (comM) [Pseudomonas hydrolytica strain KHPS2]
MLGLFTKKANTLLGIDISSTSVKLLELSRSGSRYKVEAYAVEPLPPNAVVEKNIAELEGVGQALSRVLAKAKTGVKTAAV
AVAGSAVITKTIEMEAGLSEDELENQLKIEADQYIPYPLEEVAIDFEVQGPAARNPERVEVLLAACRKENVEVREAALAL
AGLTAKVVDVEAYALERAYSLLEAQLGGGHDELTVAVVDIGATMTTLSVLHNGRTIYTREQLFGGKQLTEEVQRRYGLSV
EEAGLAKKQGGLPDDYDSEVLQPFKEAVVQQVSRSLQFFFAAGQFNDVDYILLAGGTASIPDLDRLIQQKIGTQTLVANP
FADMALSSKVNAGALASDAPALMIACGLAMRSFD

Nucleotide


Download         Length: 1065 bp        

>NTDB_id=705816 NLY38_RS02705 WP_003246678.1 566689..567753(+) (comM) [Pseudomonas hydrolytica strain KHPS2]
GTGCTAGGGCTCTTCACTAAGAAAGCGAATACGCTGCTGGGGATCGATATCAGCTCGACTTCGGTCAAGCTCCTCGAACT
GAGTCGCTCGGGAAGCCGCTACAAGGTAGAGGCTTACGCAGTCGAGCCGCTCCCGCCAAACGCGGTGGTTGAGAAGAATA
TCGCCGAGCTGGAGGGGGTCGGTCAGGCGCTGTCACGCGTCTTGGCCAAGGCCAAGACCGGCGTCAAGACCGCGGCCGTA
GCGGTCGCCGGTTCGGCGGTCATCACCAAGACCATCGAGATGGAGGCCGGCCTTTCCGAGGATGAACTGGAGAACCAGCT
GAAGATCGAGGCTGACCAGTACATTCCTTATCCGCTCGAAGAAGTCGCGATCGATTTCGAGGTTCAGGGGCCTGCTGCGC
GTAATCCTGAGCGCGTCGAAGTGCTCCTCGCCGCGTGCCGCAAGGAGAACGTCGAAGTTCGCGAGGCTGCGCTGGCGCTT
GCCGGCCTGACGGCCAAGGTGGTCGACGTCGAGGCCTATGCGCTGGAGCGTGCTTACAGCCTGCTGGAGGCACAGCTGGG
TGGCGGCCACGACGAGCTCACCGTGGCGGTCGTCGATATCGGCGCCACCATGACCACGCTGAGCGTGCTGCACAACGGTC
GCACCATCTATACCCGCGAGCAGCTCTTCGGTGGCAAGCAGCTGACCGAGGAGGTGCAGCGTCGTTATGGTCTTTCCGTC
GAAGAAGCCGGCCTTGCCAAGAAGCAAGGTGGTCTTCCGGATGACTACGACAGCGAGGTTTTGCAACCGTTCAAGGAAGC
CGTCGTTCAGCAGGTTTCGCGTTCCTTGCAGTTCTTCTTTGCTGCTGGCCAGTTCAATGATGTCGACTACATCCTTCTGG
CTGGTGGAACCGCGTCCATTCCCGATCTTGATCGACTGATCCAGCAAAAGATCGGCACGCAGACGCTAGTGGCCAATCCC
TTTGCCGATATGGCGCTTAGCAGCAAGGTCAACGCCGGTGCGCTGGCCAGTGATGCGCCGGCACTGATGATTGCCTGTGG
TCTGGCGATGAGGAGTTTCGACTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Acinetobacter nosocomialis M2

56.78

100

0.568

  pilM Acinetobacter baumannii D1279779

56.78

100

0.568

  comM Acinetobacter baylyi ADP1

54.237

100

0.542

  pilM Legionella pneumophila strain ERS1305867

45.198

100

0.452