Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilQ   Type   Machinery gene
Locus tag   MON37_RS01100 Genome accession   NZ_CP094242
Coordinates   244528..246663 (+) Length   711 a.a.
NCBI ID   WP_039408916.1    Uniprot ID   -
Organism   Morococcus cerebrosus strain CIP 81.93     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 242636..261816 244528..246663 within 0


Gene organization within MGE regions


Location: 242636..261816
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MON37_RS01085 (MON37_01085) pilN 242636..243250 (+) 615 WP_039408907.1 PilN domain-containing protein Machinery gene
  MON37_RS01090 (MON37_01090) pilO 243251..243940 (+) 690 WP_039408910.1 type 4a pilus biogenesis protein PilO -
  MON37_RS01095 (MON37_01095) pilP 243955..244509 (+) 555 WP_039408913.1 pilus assembly protein PilP Machinery gene
  MON37_RS01100 (MON37_01100) pilQ 244528..246663 (+) 2136 WP_039408916.1 type IV pilus secretin PilQ Machinery gene
  MON37_RS01105 (MON37_01105) - 247023..247815 (+) 793 WP_234403912.1 IS5 family transposase -
  MON37_RS01110 (MON37_01110) - 247938..248474 (+) 537 WP_234396315.1 shikimate kinase -
  MON37_RS01115 (MON37_01115) aroB 248676..249755 (+) 1080 WP_039410819.1 3-dehydroquinate synthase -
  MON37_RS01120 (MON37_01120) - 249852..250091 (-) 240 Protein_207 N-acetyl-gamma-glutamyl-phosphate reductase -
  MON37_RS01125 (MON37_01125) - 250116..251081 (-) 966 WP_242883583.1 IS30 family transposase -
  MON37_RS01130 (MON37_01130) argC 251172..251978 (-) 807 Protein_209 N-acetyl-gamma-glutamyl-phosphate reductase -
  MON37_RS01135 (MON37_01135) - 252263..253261 (-) 999 WP_039404571.1 L,D-transpeptidase -
  MON37_RS01140 (MON37_01140) rlmD 253359..254735 (+) 1377 WP_039404576.1 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD -
  MON37_RS01145 (MON37_01145) - 254881..256332 (+) 1452 WP_039404568.1 hypothetical protein -
  MON37_RS01150 (MON37_01150) - 256357..257448 (+) 1092 WP_039404565.1 alpha-glucan phosphorylase -
  MON37_RS01155 (MON37_01155) - 257461..257997 (+) 537 WP_003743737.1 ATP/GTP-binding protein -
  MON37_RS01160 (MON37_01160) - 258003..258644 (+) 642 WP_039404562.1 hypothetical protein -
  MON37_RS01165 (MON37_01165) - 258684..259043 (+) 360 WP_003743735.1 roadblock/LC7 domain-containing protein -
  MON37_RS01175 (MON37_01175) - 259556..260563 (+) 1008 WP_242883729.1 IS5 family transposase -
  MON37_RS01180 (MON37_01180) prmC 260989..261816 (+) 828 WP_039409321.1 peptide chain release factor N(5)-glutamine methyltransferase -

Sequence


Protein


Download         Length: 711 a.a.        Molecular weight: 77653.83 Da        Isoelectric Point: 9.6327

>NTDB_id=668334 MON37_RS01100 WP_039408916.1 244528..246663(+) (pilQ) [Morococcus cerebrosus strain CIP 81.93]
MKTKNMTKLFAGISVIAAAQIALAGNITDINVSTLPDSQKIIKIRFDRDVISPSGFVTSTPSRIALDFAKTNIQLAQPVL
EYADPLLNQITAAQNNDRSRIVLGLNKAAQYNTEVHGNEVWVFVTESSDRTITSAAISNNGNAHQKVSSSETVKQTVNSA
NIDFRRGAHNSGIVELNAPDFTGKPEVKQQRDRVIITLKNYPLPTQAQRSLDVADFSTPVKNITLKRIGNDAQLIIRNNG
NWDFNTKAANGQFAFEVMPKSASTESSGLNANPNKSFRGRKISLDFQDVEVRTILQILAKESGVNIVASDSVNGKMTLSL
KDVPWDQALDLVMQARNLDMRRQGNIINIAPRDELLAKDKAFLQAEKEIAELGPLYSQTFQLKYKNVEEFRKILRLDDNN
SGNSNGRNTLLSGRGSALIDPATNTLIITDNRNVVEKFRKLIEELDVPTRQVMVEARIVEAKDGFSRDLGVKFGVAGSKG
RNAWGNSWSNAQDNHNTNSAVLRGDANYRTWSLQPNISLPAAAATNNIALVRALSSGALGLEISASEATGKSKTISNPRV
LTQDRREATIESGTEIPYQEASSSGATSVSFKKAVLGLTVTPNVTPDGQVIMTVKITKDTPQDCTVDSLITKCIQTKNLN
TQAMVEDGGTLIVGGIYEEDNSDTINKVPLLGDLPVVGNLFKSRAKKEERRELLVFITPRIMDGVGSNLRY

Nucleotide


Download         Length: 2136 bp        

>NTDB_id=668334 MON37_RS01100 WP_039408916.1 244528..246663(+) (pilQ) [Morococcus cerebrosus strain CIP 81.93]
ATGAAAACTAAAAATATGACGAAATTATTTGCCGGCATCAGTGTGATTGCGGCAGCACAAATTGCGTTGGCAGGTAATAT
TACTGATATTAATGTATCAACTTTGCCGGATAGTCAAAAAATTATCAAAATTCGTTTTGACCGGGATGTGATTAGTCCGA
GCGGCTTTGTTACATCTACTCCGTCCCGAATTGCCTTAGATTTTGCTAAAACAAATATTCAATTGGCTCAACCTGTATTG
GAATATGCGGATCCTTTGTTAAATCAAATTACTGCGGCTCAGAATAATGACCGATCTCGTATTGTTTTGGGATTGAATAA
AGCAGCGCAATATAATACTGAAGTCCATGGTAATGAAGTATGGGTATTTGTTACAGAATCATCTGATCGAACCATAACTT
CTGCAGCTATTTCAAATAATGGAAATGCACATCAGAAAGTAAGCTCGTCTGAAACAGTCAAACAAACGGTAAATTCAGCT
AATATTGACTTCAGAAGAGGTGCACATAACTCAGGTATTGTGGAATTGAATGCGCCTGATTTCACAGGCAAACCTGAAGT
GAAGCAACAACGTGATCGGGTTATTATTACTCTGAAAAATTATCCGTTGCCGACACAAGCGCAACGCAGTTTGGATGTTG
CCGATTTCAGTACACCGGTTAAAAATATTACGTTAAAACGTATTGGAAATGATGCTCAACTCATTATTCGCAATAATGGC
AATTGGGATTTCAATACCAAAGCTGCAAATGGTCAATTTGCTTTTGAAGTAATGCCTAAGTCCGCAAGTACGGAATCAAG
TGGGCTGAATGCAAATCCAAACAAATCTTTCCGCGGCCGTAAAATTTCCCTGGACTTCCAAGATGTTGAAGTACGTACCA
TTTTGCAAATCCTTGCGAAGGAGTCAGGGGTCAATATCGTGGCCAGTGATTCCGTGAATGGCAAAATGACACTTTCTTTA
AAAGATGTGCCTTGGGATCAGGCTTTGGATTTGGTTATGCAGGCGCGTAATTTGGATATGCGCCGACAAGGCAATATTAT
CAATATCGCACCTCGTGATGAATTGTTAGCTAAAGATAAGGCTTTCTTGCAGGCGGAGAAAGAGATTGCCGAACTGGGGC
CTTTATATTCCCAAACTTTCCAATTGAAGTATAAGAATGTAGAAGAGTTCCGGAAAATTTTGCGTTTGGACGACAACAAT
AGTGGAAACTCCAATGGTAGAAATACCTTGTTGAGCGGCCGTGGCAGTGCGCTGATTGATCCTGCAACGAATACTTTGAT
TATTACGGATAATCGAAATGTTGTAGAAAAATTCCGTAAGTTGATTGAGGAATTGGATGTACCGACTCGTCAAGTTATGG
TTGAAGCCCGTATCGTTGAAGCAAAAGACGGTTTTTCGCGTGATTTGGGCGTGAAATTCGGTGTCGCAGGTTCCAAAGGA
CGCAACGCTTGGGGTAACTCTTGGAGTAATGCGCAAGATAACCATAATACTAATTCGGCTGTTTTGCGAGGTGATGCAAA
TTACCGTACATGGTCTTTGCAACCGAATATTAGCCTGCCTGCGGCAGCGGCGACTAATAATATTGCTTTGGTGCGAGCTT
TGTCATCGGGTGCATTAGGTTTGGAAATTTCTGCTTCGGAAGCTACCGGTAAAAGTAAGACCATTTCTAATCCGCGTGTC
CTGACACAAGACCGTAGAGAAGCTACTATTGAGTCTGGTACTGAAATTCCGTATCAGGAGGCATCTTCCAGTGGTGCAAC
CTCGGTATCGTTCAAAAAAGCCGTGCTGGGCTTAACGGTAACGCCGAATGTAACGCCTGATGGTCAAGTTATCATGACTG
TAAAAATTACCAAGGATACACCGCAAGACTGTACAGTGGACTCTTTGATTACGAAATGTATTCAGACTAAAAACTTGAAT
ACTCAGGCAATGGTGGAAGATGGTGGTACGTTGATCGTTGGTGGCATCTACGAAGAAGATAATAGCGATACGATTAACAA
AGTACCATTATTGGGCGATTTGCCAGTGGTGGGTAATTTGTTCAAGTCCCGTGCGAAAAAAGAAGAGCGTCGCGAATTGC
TGGTTTTCATCACTCCGCGAATTATGGATGGTGTAGGCAGCAATCTCCGCTACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilQ Neisseria gonorrhoeae strain FA1090

61.943

100

0.646

  pilQ Neisseria meningitidis MC58

66.37

79.044

0.525

  pilQ Neisseria meningitidis M1080

66.726

78.622

0.525


Multiple sequence alignment