Detailed information    

in silico: Bioinformatically predicted

Overview


Name: comE
Type: Machinery gene
Locus tag: A6037_RS05450
Genome accession: NZ_CP015425
Coordinates: 1081816..1083039 (+)
Length: 407 a.a.
NCBI ID: WP_231087463.1
UniProt ID: -
Organism: [Haemophilus] ducreyi strain VAN2
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
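The coordinates and the protein length in the overview can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the span includes the stop codon; both assumptions are consistent with the 1224 bp and 407 a.a. figures in this record, but neither is stated by it. The function names are illustrative only.

```python
def parse_coordinates(text):
    """Split a 'start..end (strand)' string into (start, end, strand)."""
    span, strand = text.split()
    start, end = (int(part) for part in span.split(".."))
    return start, end, strand.strip("()")


def expected_lengths(start, end):
    """Return (gene length in bp, protein length in a.a.).

    Assumes 1-based inclusive coordinates and a span that ends
    with a stop codon (assumptions, not stated in the record).
    """
    gene_bp = end - start + 1
    if gene_bp % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    # One codon is the stop codon, so it does not add a residue.
    return gene_bp, gene_bp // 3 - 1


start, end, strand = parse_coordinates("1081816..1083039 (+)")
gene_bp, protein_aa = expected_lengths(start, end)
print(strand, gene_bp, protein_aa)
```

With the values from this entry, the computed lengths agree with the 1224 bp nucleotide sequence and the 407 a.a. protein reported for comE.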

Genomic Context


Location: 1076816..1088039
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
A6037_RS05425 (A6037_05445) | - | 1077003..1079561 (-) | 2559 | WP_064082619.1 | penicillin-binding protein 1A | -
A6037_RS05430 (A6037_05450) | - | 1079697..1080362 (+) | 666 | WP_064082620.1 | hypothetical protein | -
A6037_RS05435 (A6037_05455) | - | 1080344..1080865 (+) | 522 | WP_064082621.1 | hypothetical protein | -
A6037_RS05440 (A6037_05460) | - | 1080870..1081250 (+) | 381 | WP_064082622.1 | hypothetical protein | -
A6037_RS07950 | - | 1081263..1081388 (+) | 126 | WP_257722001.1 | hypothetical protein | -
A6037_RS05445 (A6037_05465) | - | 1081391..1081786 (+) | 396 | WP_064082623.1 | hypothetical protein | -
A6037_RS05450 (A6037_05470) | comE | 1081816..1083039 (+) | 1224 | WP_231087463.1 | type IV pilus secretin PilQ | Machinery gene
A6037_RS05455 (A6037_05475) | nusB | 1083209..1083622 (+) | 414 | WP_064082624.1 | transcription antitermination factor NusB | -
A6037_RS05460 (A6037_05480) | thiL | 1083679..1084644 (+) | 966 | WP_064082625.1 | thiamine-phosphate kinase | -
A6037_RS05465 (A6037_05485) | - | 1084646..1085119 (+) | 474 | WP_064083034.1 | phosphatidylglycerophosphatase A | -
A6037_RS05470 (A6037_05490) | - | 1085129..1085749 (+) | 621 | WP_064082626.1 | LysE family transporter | -
A6037_RS05475 (A6037_05495) | - | 1085810..1086832 (-) | 1023 | WP_064082627.1 | TrmH family RNA methyltransferase | -
A6037_RS05480 (A6037_05500) | suhB | 1086999..1087811 (+) | 813 | WP_064082628.1 | inositol-1-monophosphatase | -
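The Location range above appears to be the comE span extended by 5,000 bp on each side. That flank size is inferred from the numbers in this record and is not documented here, so treat it as an assumption. The sketch below reproduces the window and computes the spacing between neighbouring genes from their inclusive coordinates; `FLANK_BP`, `context_window`, and `intergenic_gap` are hypothetical names.

```python
FLANK_BP = 5000  # assumption: inferred from this record, not documented


def context_window(start, end, flank=FLANK_BP):
    """Extend a gene span by `flank` bp on both sides (never below position 1)."""
    return max(1, start - flank), end + flank


def intergenic_gap(upstream_end, downstream_start):
    """Bases strictly between two features; a negative value means they overlap."""
    return downstream_start - upstream_end - 1


window = context_window(1081816, 1083039)
gap_before_comE = intergenic_gap(1081786, 1081816)   # A6037_RS05445 -> comE
gap_after_comE = intergenic_gap(1083039, 1083209)    # comE -> nusB
overlap_example = intergenic_gap(1080362, 1080344)   # A6037_RS05430 -> A6037_RS05435
print(window, gap_before_comE, gap_after_comE, overlap_example)
```

Under these assumptions comE sits 29 bp downstream of A6037_RS05445 and 169 bp upstream of nusB, while A6037_RS05430 and A6037_RS05435 overlap by 19 bp.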

Sequence


Protein


Length: 407 a.a.        Molecular weight: 45571.35 Da        Isoelectric point: 6.6003

>NTDB_id=179452 A6037_RS05450 WP_231087463.1 1081816..1083039(+) (comE) [[Haemophilus] ducreyi strain VAN2]
MLFISLSLLAQPLSLSLKNAPTAEILSYLAEENMKNIVLSDQIDKNTTLRIENSHFDEIIESIVRANQLSKRKEKQIYFI
GHLKEDKAIDNDDKPKLITQTIKLDYAKASEVIESLTKGNGNVLSEQGYLYFDERSNSLIIKDSPDSMKHILALIKNLDK
PTEQIAIEARIVTISSQDLQELGVRWGMFSPSDDYHKVAISLETKGLTTANHLNVNFPVNNATSLALQVARINSRVLDLE
LTALEQENNVEIIASPRLLTTNKKPASIKQGAEIPYVLYNRKDEVKNIEFKEAVLGLQVTPHISADNQILLDLVVTQNSP
NLNNTAVNGLITIDKQELNTQVFAKHGETIVLGGIFQHLTAKGEDRVPILGSIPVLKKLFSHSTDKITKRELVIFVTPYL
VQDKYKK
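The FASTA header used in this record packs several fields into one line. A minimal parsing sketch follows; the field layout is assumed from the single header shown here and is not a documented format, and `HEADER_RE` and `parse_header` are illustrative names.

```python
import re

# Layout assumed from the header in this record:
# >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>.+)\]$"
)


def parse_header(line):
    """Return the header fields as a dict, with coordinates as integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (">NTDB_id=179452 A6037_RS05450 WP_231087463.1 "
          "1081816..1083039(+) (comE) [[Haemophilus] ducreyi strain VAN2]")
record = parse_header(header)
print(record["gene"], record["organism"])
```

The outer square brackets are stripped by the pattern, so the organism name keeps its own inner brackets, as in the overview.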

Nucleotide


Length: 1224 bp

>NTDB_id=179452 A6037_RS05450 WP_231087463.1 1081816..1083039(+) (comE) [[Haemophilus] ducreyi strain VAN2]
ATTCTTTTTATTTCATTATCATTATTAGCACAACCGTTATCGCTGTCTTTAAAAAATGCACCTACTGCAGAAATATTAAG
TTATTTGGCTGAAGAAAATATGAAAAATATCGTATTGAGTGATCAAATAGACAAAAATACGACTCTTAGAATTGAAAATA
GTCATTTTGATGAAATTATTGAAAGTATTGTGCGTGCTAATCAATTATCAAAACGAAAAGAAAAACAAATTTATTTTATC
GGGCATTTAAAAGAAGATAAGGCAATAGATAATGATGATAAGCCTAAGTTAATCACTCAAACTATTAAGTTGGATTATGC
TAAAGCTTCAGAAGTGATTGAATCTCTTACCAAAGGTAATGGTAATGTATTGTCAGAACAAGGGTATTTATATTTTGATG
AGCGAAGTAATAGCTTAATTATAAAAGATAGTCCTGATTCAATGAAGCATATTTTGGCATTAATAAAAAATTTAGATAAG
CCAACAGAACAAATTGCAATTGAGGCTAGAATTGTGACGATTAGTAGTCAAGATTTACAAGAACTTGGTGTCCGTTGGGG
AATGTTTTCTCCTTCTGACGATTATCATAAGGTGGCCATTAGTTTAGAAACTAAGGGTTTAACAACAGCAAATCATTTAA
ATGTTAACTTCCCGGTAAATAATGCAACTTCACTTGCTTTGCAAGTTGCAAGAATTAATAGTCGCGTATTAGATTTAGAA
TTAACGGCATTAGAGCAAGAGAATAATGTAGAAATTATTGCTAGCCCTCGTTTACTGACCACTAATAAAAAGCCTGCAAG
TATTAAACAAGGCGCTGAAATTCCTTATGTATTGTATAATCGTAAAGATGAAGTAAAAAATATTGAATTTAAAGAAGCTG
TATTAGGATTACAAGTTACACCGCATATTTCGGCGGATAATCAAATTTTATTAGATCTAGTGGTTACACAGAATTCACCA
AATTTAAATAATACGGCAGTGAATGGTTTAATTACGATTGATAAACAAGAATTGAATACTCAAGTATTTGCTAAACATGG
TGAAACCATTGTGCTTGGCGGTATTTTTCAACATTTGACTGCAAAAGGTGAAGATCGTGTGCCTATTTTAGGTTCAATTC
CTGTTCTTAAGAAGCTATTTAGCCATTCCACAGATAAAATCACTAAGCGTGAGCTTGTCATTTTTGTCACGCCTTATCTA
GTGCAAGATAAATATAAAAAATAG
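The nucleotide sequence begins with ATT rather than ATG, while the protein begins with M. This is consistent with ATT acting as an alternative initiation codon, which the bacterial genetic code (NCBI translation table 11) permits; the record itself does not comment on it. The sketch below translates the first and last stretches of the sequence with the standard codon table. The `initiator` flag is a simplification that forces the first residue to M without checking that the codon is a valid start.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, initiator=False):
    """Translate an in-frame DNA string; '*' marks a stop codon.

    If `initiator` is True the first residue is reported as M,
    a simplification of how start codons are read.
    """
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if initiator and residues:
        residues[0] = "M"
    return "".join(residues)


first_codons = translate("ATTCTTTTTATTTCATTATCATTATTAGCACAACCG", initiator=True)
last_codons = translate("GTGCAAGATAAATATAAAAAATAG")
print(first_codons, last_codons)
```

The two fragments match the start (MLFISLSLLAQP) and end (VQDKYKK, followed by the stop codon) of the protein sequence listed above.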


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comE | Glaesserella parasuis strain SC1401 | 66.015 | 100 | 0.663
comE | Haemophilus influenzae Rd KW20 | 51.515 | 97.297 | 0.501
comE | Haemophilus influenzae 86-028NP | 50.758 | 97.297 | 0.494
pilQ | Vibrio cholerae O1 biovar El Tor strain E7946 | 41.304 | 100 | 0.42
pilQ | Vibrio cholerae strain A1552 | 41.304 | 100 | 0.42
pilQ | Vibrio campbellii strain DS40M4 | 40.38 | 100 | 0.418


Multiple sequence alignment