Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilR   Type   Regulator
Locus tag   G7079_RS12085 Genome accession   NZ_CP049872
Coordinates   2546572..2547975 (+) Length   467 a.a.
NCBI ID   WP_166057543.1    Uniprot ID   A0A6G7ZXE9
Organism   Thermomonas sp. HDW16     
Function   regulate pilin expression (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 2548809..2566460 2546572..2547975 flank 834


Gene organization within MGE regions


Location: 2546572..2566460
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  G7079_RS12085 (G7079_12090) pilR 2546572..2547975 (+) 1404 WP_166057543.1 sigma-54 dependent transcriptional regulator Regulator
  G7079_RS13500 (G7079_12095) - 2548265..2548597 (+) 333 WP_166057544.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  G7079_RS13480 (G7079_12100) - 2548809..2549138 (+) 330 WP_166057545.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  G7079_RS12100 (G7079_12105) - 2549273..2551144 (+) 1872 WP_166057546.1 hypothetical protein -
  G7079_RS12105 (G7079_12110) gmd 2551205..2552314 (+) 1110 WP_166057547.1 GDP-mannose 4,6-dehydratase -
  G7079_RS12110 (G7079_12115) - 2552320..2553258 (+) 939 WP_166057548.1 GDP-L-fucose synthase -
  G7079_RS12115 (G7079_12120) - 2553285..2554724 (+) 1440 WP_166057549.1 lipopolysaccharide biosynthesis protein -
  G7079_RS12120 (G7079_12125) - 2554714..2555616 (+) 903 WP_166057550.1 class I SAM-dependent methyltransferase -
  G7079_RS12125 (G7079_12130) - 2555702..2556214 (+) 513 WP_166057551.1 class I SAM-dependent methyltransferase -
  G7079_RS12130 (G7079_12135) - 2556211..2557200 (+) 990 WP_166057552.1 glycosyltransferase family 2 protein -
  G7079_RS12135 (G7079_12140) - 2557200..2558120 (+) 921 WP_166057553.1 class I SAM-dependent methyltransferase -
  G7079_RS12140 (G7079_12145) - 2558117..2558557 (+) 441 WP_166057555.1 hypothetical protein -
  G7079_RS12145 (G7079_12150) - 2558554..2559675 (+) 1122 WP_166057556.1 DegT/DnrJ/EryC1/StrS family aminotransferase -
  G7079_RS12150 (G7079_12155) - 2559672..2560430 (+) 759 WP_166057557.1 class I SAM-dependent methyltransferase -
  G7079_RS12155 (G7079_12160) - 2560427..2561632 (+) 1206 WP_166057558.1 glycosyltransferase family 4 protein -
  G7079_RS12160 (G7079_12165) - 2561649..2562413 (+) 765 WP_166057559.1 glycosyltransferase family 2 protein -
  G7079_RS12165 (G7079_12170) - 2562567..2563604 (-) 1038 WP_166057560.1 glycosyltransferase family 4 protein -
  G7079_RS12170 (G7079_12175) - 2563743..2564456 (-) 714 WP_166057561.1 class I SAM-dependent methyltransferase -
  G7079_RS12175 (G7079_12180) - 2564456..2565538 (-) 1083 WP_166057562.1 glycosyltransferase family 4 protein -
  G7079_RS12180 (G7079_12185) - 2565591..2566460 (-) 870 WP_166057563.1 glycosyltransferase family 2 protein -

Sequence


Protein


Download         Length: 467 a.a.        Molecular weight: 50552.03 Da        Isoelectric Point: 7.1656

>NTDB_id=428309 G7079_RS12085 WP_166057543.1 2546572..2547975(+) (pilR) [Thermomonas sp. HDW16]
MTAPARSTQASALIVDDERDIRELLVLTLGRMGLRCDTASSLTDARAMLRQNRYALCLTDMRLADGSGVDLVREIAQLYA
ETPVAMITAFGNQDAAVGALKAGAFDFVSKPVDLSVLRGLVQHALKLGSDSRKSPTTAPDPAAPRLLGASPAMDDLRTTL
SKVARSQAPVYIAGESGVGKELVARTIHAEGARTAGPFVPVNCGAIPAELMESEFFGHKKGSFTGAHADKPGLFQAAEGG
TLFLDEVAELPLPMQVKLLRAIQEKSVRPVGANAEIAVDVRILSATHKNLSALVDEGRFRHDLYYRINVIELRVPPLRER
REDLPLLADAVLARLATAQHRVKPQLHADAVEALARYNFPGNVRELENILERAFALADDDGIRAADLRLPEMRPSIPAQA
AAAQLDPSLIDPRESATSALPSFIEEIERKAIEQALTDNRYNKTKAAAQLGITFRALRYKLKKLGIE

Nucleotide


Download         Length: 1404 bp        

>NTDB_id=428309 G7079_RS12085 WP_166057543.1 2546572..2547975(+) (pilR) [Thermomonas sp. HDW16]
ATGACAGCCCCTGCCCGCAGCACCCAGGCCAGCGCCCTGATCGTCGATGACGAGCGCGACATTCGCGAACTGCTGGTGCT
GACCCTGGGGCGAATGGGACTGCGCTGCGACACCGCCTCCTCACTCACCGATGCCCGTGCCATGCTGCGGCAGAACCGCT
ATGCGCTATGCCTGACCGACATGCGCCTGGCCGATGGTTCCGGCGTGGATCTGGTGCGTGAAATCGCCCAGCTGTATGCG
GAAACCCCGGTGGCGATGATCACCGCGTTCGGCAACCAGGATGCCGCGGTCGGCGCGCTCAAGGCTGGCGCCTTCGATTT
CGTCAGCAAACCGGTGGATCTCAGCGTGCTGCGGGGTCTGGTACAGCATGCTTTGAAGCTGGGCAGCGACTCGCGCAAAT
CGCCGACAACGGCCCCGGATCCCGCCGCACCCCGCCTGCTCGGGGCTTCGCCGGCCATGGATGATCTGCGCACCACCCTC
TCCAAGGTCGCGCGCAGCCAGGCGCCGGTCTACATCGCCGGCGAATCCGGCGTGGGCAAGGAGCTGGTCGCGCGCACCAT
CCATGCAGAGGGCGCGCGCACCGCAGGCCCGTTCGTACCGGTCAATTGCGGCGCAATTCCGGCCGAATTGATGGAAAGCG
AGTTCTTCGGCCACAAGAAGGGCAGCTTCACCGGCGCCCACGCGGACAAGCCAGGCCTGTTCCAGGCGGCGGAGGGCGGC
ACCCTGTTCCTGGACGAAGTGGCCGAGCTGCCACTGCCGATGCAGGTCAAGCTGCTGCGCGCGATCCAGGAAAAATCCGT
GCGCCCAGTGGGCGCCAATGCCGAAATCGCGGTGGACGTACGCATCCTGTCCGCCACCCACAAGAACCTGAGCGCGCTGG
TCGATGAGGGCCGCTTTCGCCATGACCTGTACTACCGCATCAACGTGATCGAGCTGCGGGTGCCGCCGCTGCGCGAACGC
CGCGAAGACCTGCCGTTGCTCGCCGATGCTGTGCTGGCCCGCCTTGCCACCGCCCAGCATCGGGTCAAGCCGCAACTGCA
TGCGGATGCCGTGGAAGCGCTGGCCCGCTACAACTTCCCCGGCAACGTGCGTGAGCTGGAGAACATCCTGGAGCGCGCCT
TCGCCCTGGCCGATGACGACGGCATCCGTGCCGCCGACCTACGCCTGCCGGAGATGCGGCCCAGCATTCCCGCCCAGGCC
GCGGCCGCGCAACTGGACCCAAGCCTGATCGACCCACGCGAATCCGCCACCAGCGCCCTGCCCAGTTTCATCGAGGAAAT
CGAACGCAAGGCGATCGAGCAGGCCCTGACCGACAACCGCTACAACAAGACCAAGGCCGCCGCGCAGCTGGGCATCACCT
TCCGCGCGCTGCGCTACAAGCTGAAGAAGCTGGGGATCGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6G7ZXE9

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilR Pseudomonas aeruginosa PAK

61.304

98.501

0.604

  pilR Acinetobacter baumannii strain A118

48.283

99.786

0.482