Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: CD58_RS01980
Genome accession: NZ_CP007410
Coordinates: 461854..462918 (+)
Length: 354 a.a.
NCBI ID: WP_025211413.1
UniProt ID: -
Organism: Pseudomonas brassicacearum strain DF41
Function: assembly of type IV pilus (predicted from homology)
DNA binding and uptake
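
The coordinates and the protein length given in the overview can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that counts toward the gene length but not the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# comM (CD58_RS01980), coordinates 461854..462918 on the (+) strand
length_bp = gene_length_bp(461854, 462918)
print(length_bp, protein_length_aa(length_bp))  # 1065 354
```

The same check applies to any row of the genomic context table, for example pilQ at 464643..466718, which gives 2076 bp.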

Genomic Context


Location: 456854..467918
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CD58_RS01970 (CD58_02085) - 456924..458192 (+) 1269 WP_025211411.1 malic enzyme-like NAD(P)-binding protein -
  CD58_RS29815 - 458607..458918 (-) 312 WP_144238500.1 hypothetical protein -
  CD58_RS01975 (CD58_02090) - 459201..461645 (-) 2445 WP_419178852.1 penicillin-binding protein 1A -
  CD58_RS01980 (CD58_02095) comM 461854..462918 (+) 1065 WP_025211413.1 pilus assembly protein PilM Machinery gene
  CD58_RS01985 (CD58_02100) - 462918..463484 (+) 567 WP_025211414.1 PilN domain-containing protein -
  CD58_RS01990 (CD58_02105) pilO 463481..464104 (+) 624 WP_025211415.1 type 4a pilus biogenesis protein PilO -
  CD58_RS01995 (CD58_02110) - 464101..464646 (+) 546 WP_025211416.1 pilus assembly protein PilP -
  CD58_RS02000 (CD58_02115) pilQ 464643..466718 (+) 2076 WP_025211417.1 type IV pilus secretin PilQ Machinery gene
  CD58_RS02005 (CD58_02120) aroK 466723..467241 (+) 519 WP_024779904.1 shikimate kinase AroK -

Sequence


Protein


Length: 354 a.a.        Molecular weight: 38146.67 Da        Isoelectric point: 4.9070

>NTDB_id=119257 CD58_RS01980 WP_025211413.1 461854..462918(+) (comM) [Pseudomonas brassicacearum strain DF41]
MLRLFNKKAHTLLGIDISSTSVKLLELSRQGDRYRVESYAVEPLPANAVIEKNIAELEGVGQALSRVLAKAKTASRSVAV
AVAGSAVITKIIEMDAGMSDDDMENQLKIEADQYIPYPLDEVAIDFEVLGVSPRSAERVEVLLAACRKENVEVREAALAL
AGLTARVVDVEAYALERAFGLLATQLAASQERLTVAVIDIGATMTTLSVLHNGRIIYTREQLFGGRQLTEEIQRRYGLTP
EQAGQAKRQGGLPDDYLSEVLQPFREALVQQVSRSLQFFFASGQYSAVDHILLAGGTASVAGLDRLIEQRLGTPTQVANP
FTNMALSSKVNAGALASDAPALMIACGLALRSFD

Nucleotide


Length: 1065 bp

>NTDB_id=119257 CD58_RS01980 WP_025211413.1 461854..462918(+) (comM) [Pseudomonas brassicacearum strain DF41]
GTGCTACGACTCTTCAATAAAAAAGCCCATACGCTTCTGGGGATAGACATCAGCTCCACCTCGGTGAAGCTGCTTGAGTT
GAGCCGCCAGGGTGACCGATACCGCGTCGAGTCCTACGCGGTCGAACCGTTGCCGGCCAACGCCGTGATCGAAAAGAACA
TCGCCGAGCTCGAAGGGGTGGGCCAGGCATTGTCTCGGGTGCTCGCCAAGGCCAAGACCGCCTCGCGTAGCGTGGCAGTG
GCGGTGGCGGGGTCGGCGGTGATCACCAAGATCATCGAGATGGACGCCGGGATGTCCGATGACGACATGGAAAACCAGCT
CAAGATCGAGGCCGATCAGTACATTCCTTATCCGCTGGATGAGGTGGCCATCGATTTTGAAGTGCTGGGCGTGTCACCGC
GCAGCGCCGAGCGGGTCGAGGTGCTGTTGGCGGCCTGTCGCAAGGAAAACGTCGAGGTTCGCGAGGCTGCGCTGGCGCTG
GCCGGGCTGACAGCCCGGGTGGTCGACGTGGAAGCCTACGCGCTGGAGCGCGCCTTTGGTCTGCTCGCCACGCAACTGGC
GGCGTCCCAGGAACGGCTGACCGTGGCGGTCATCGACATCGGCGCCACCATGACCACCCTCAGCGTGCTGCACAACGGGC
GGATCATCTATACCCGCGAGCAATTGTTCGGCGGCCGCCAGCTCACCGAGGAAATCCAGCGCCGCTATGGCCTGACGCCC
GAGCAGGCCGGCCAGGCAAAAAGGCAGGGTGGCCTGCCGGACGATTATCTCAGTGAGGTGCTGCAACCCTTTCGCGAGGC
CCTGGTGCAGCAAGTTTCGCGGTCCTTGCAGTTTTTCTTCGCTTCGGGCCAGTACAGCGCGGTGGACCACATTTTGTTGG
CCGGAGGCACGGCGTCGGTCGCCGGCCTGGATCGGCTGATCGAGCAACGCCTGGGCACACCGACCCAGGTCGCCAACCCG
TTTACCAACATGGCCCTGAGCAGCAAGGTCAATGCCGGTGCCCTGGCCAGTGACGCGCCAGCGCTGATGATTGCCTGCGG
GCTGGCCCTCAGGAGTTTCGACTGA
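
The nucleotide record starts with GTG rather than ATG, yet the protein record starts with M. That is expected for a bacterial alternative start codon, which is read as methionine at initiation. The following is a minimal sketch of translating such a CDS with the standard codon table, assuming the sequence is a complete in-frame CDS. `translate_cds` is a hypothetical helper, not a database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Assumption: the first codon is a genuine initiator, so it is
    written as M even when it is GTG or TTG.
    """
    dna = "".join(dna.split()).upper()  # drop FASTA line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 11 codons of the nucleotide record above
print(translate_cds("GTGCTACGACTCTTCAATAAAAAAGCCCATACG"))  # MLRLFNKKAHT
```

Passing the full 1065 bp sequence should reproduce the 354-residue protein record if both records describe the same CDS.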


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Acinetobacter nosocomialis M2 55.65 100 0.557
  pilM Acinetobacter baumannii D1279779 55.65 100 0.557
  comM Acinetobacter baylyi ADP1 54.52 100 0.545
  pilM Legionella pneumophila strain ERS1305867 47.458 100 0.475
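
The Ha-values listed are numerically consistent with the product of fractional identity and fractional coverage (for example, 0.5565 × 1.00 ≈ 0.557). That relationship is inferred from the four rows of this table, not from a definition stated on the page, so the sketch below is a consistency check under that assumption. `ha_value` is a hypothetical name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: fractional identity times fractional coverage."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# (protein, organism, identities %, coverage %, listed Ha-value)
rows = [
    ("comM", "Acinetobacter nosocomialis M2", 55.65, 100, 0.557),
    ("pilM", "Acinetobacter baumannii D1279779", 55.65, 100, 0.557),
    ("comM", "Acinetobacter baylyi ADP1", 54.52, 100, 0.545),
    ("pilM", "Legionella pneumophila strain ERS1305867", 47.458, 100, 0.475),
]

for protein, organism, identity, coverage, listed in rows:
    computed = ha_value(identity, coverage)
    # Listed values carry three decimals, so compare within 0.001.
    agrees = abs(computed - listed) < 1e-3
    print(f"{protein}\t{organism}\t{computed:.4f}\t{listed}\t{agrees}")
```

Because coverage is 100% for every hit here, the table cannot distinguish this formula from others that reduce to the identity at full coverage.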


Multiple sequence alignment