Detailed information    

In silico: bioinformatically predicted

Overview


Name: comE
Type: Machinery gene
Locus tag: A6041_RS00350
Genome accession: NZ_CP015429
Coordinates: 58521..59744 (+)
Length: 407 a.a.
NCBI ID: WP_231087463.1
UniProt ID: -
Organism: [Haemophilus] ducreyi strain GHA1
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake

Genomic Context


Location: 53521..64744
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
A6041_RS00325 (A6041_00325) | - | 53708..56266 (-) | 2559 | WP_064082619.1 | penicillin-binding protein 1A | -
A6041_RS00330 (A6041_00330) | - | 56402..57067 (+) | 666 | WP_064082620.1 | hypothetical protein | -
A6041_RS00335 (A6041_00335) | - | 57049..57570 (+) | 522 | WP_064082621.1 | hypothetical protein | -
A6041_RS00340 (A6041_00340) | - | 57575..57955 (+) | 381 | WP_064082622.1 | hypothetical protein | -
A6041_RS08050 | - | 57968..58093 (+) | 126 | WP_257722001.1 | hypothetical protein | -
A6041_RS00345 (A6041_00345) | - | 58096..58491 (+) | 396 | WP_064083238.1 | hypothetical protein | -
A6041_RS00350 (A6041_00350) | comE | 58521..59744 (+) | 1224 | WP_231087463.1 | type IV pilus secretin PilQ | Machinery gene
A6041_RS00355 (A6041_00355) | nusB | 59914..60327 (+) | 414 | WP_064082624.1 | transcription antitermination factor NusB | -
A6041_RS00360 (A6041_00360) | thiL | 60384..61349 (+) | 966 | WP_064082625.1 | thiamine-phosphate kinase | -
A6041_RS00365 (A6041_00365) | - | 61351..61824 (+) | 474 | WP_064083425.1 | phosphatidylglycerophosphatase A | -
A6041_RS00370 (A6041_00370) | - | 61834..62454 (+) | 621 | WP_064082626.1 | LysE family transporter | -
A6041_RS00375 (A6041_00375) | - | 62515..63537 (-) | 1023 | WP_064082627.1 | TrmH family RNA methyltransferase | -
A6041_RS00380 (A6041_00380) | suhB | 63704..64516 (+) | 813 | WP_064082628.1 | inositol-1-monophosphatase | -
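
The Location window above is consistent with padding the comE coordinates by 5000 bp on each side (58521 - 5000 = 53521 and 59744 + 5000 = 64744). The Python sketch below checks that arithmetic. The 5000 bp flank is an assumption inferred from this one record, and `context_window` is a hypothetical helper, not something the database provides.

```python
def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Return (start, end) of a window padded by `flank` bp on each side.

    The 5000 bp default is an assumption inferred from this record.
    The lower bound is clamped to 1 because coordinates are 1-based.
    """
    return max(1, start - flank), end + flank


# comE coordinates taken from the Overview section
window = context_window(58521, 59744)
print(window)  # (53521, 64744)
```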

Sequence


Protein


Length: 407 a.a.
Molecular weight: 45571.35 Da
Isoelectric point: 6.6003

>NTDB_id=179524 A6041_RS00350 WP_231087463.1 58521..59744(+) (comE) [[Haemophilus] ducreyi strain GHA1]
MLFISLSLLAQPLSLSLKNAPTAEILSYLAEENMKNIVLSDQIDKNTTLRIENSHFDEIIESIVRANQLSKRKEKQIYFI
GHLKEDKAIDNDDKPKLITQTIKLDYAKASEVIESLTKGNGNVLSEQGYLYFDERSNSLIIKDSPDSMKHILALIKNLDK
PTEQIAIEARIVTISSQDLQELGVRWGMFSPSDDYHKVAISLETKGLTTANHLNVNFPVNNATSLALQVARINSRVLDLE
LTALEQENNVEIIASPRLLTTNKKPASIKQGAEIPYVLYNRKDEVKNIEFKEAVLGLQVTPHISADNQILLDLVVTQNSP
NLNNTAVNGLITIDKQELNTQVFAKHGETIVLGGIFQHLTAKGEDRVPILGSIPVLKKLFSHSTDKITKRELVIFVTPYL
VQDKYKK
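
The FASTA header above packs several fields into one line. As a minimal sketch, the Python snippet below splits it into named fields with a regular expression. The pattern is inferred from the single header shown in this record, so it is an assumption about the layout rather than a documented format, and `parse_header` is a hypothetical helper name.

```python
import re

# Pattern inferred from the one header in this record (an assumption):
# >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>.*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into a dict of fields; raise if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=179524 A6041_RS00350 WP_231087463.1 58521..59744(+) "
    "(comE) [[Haemophilus] ducreyi strain GHA1]"
)
record = parse_header(header)
print(record["gene"], record["start"], record["end"], record["organism"])
```

The organism group is greedy so that the nested brackets in "[Haemophilus] ducreyi" stay inside one field.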

Nucleotide


Length: 1224 bp

>NTDB_id=179524 A6041_RS00350 WP_231087463.1 58521..59744(+) (comE) [[Haemophilus] ducreyi strain GHA1]
ATTCTTTTTATTTCATTATCATTATTAGCACAACCGTTATCGCTGTCTTTAAAAAATGCACCTACTGCAGAAATATTAAG
TTATTTGGCTGAAGAAAATATGAAAAATATCGTATTGAGTGATCAAATAGACAAAAATACGACTCTTAGAATTGAAAATA
GTCATTTTGATGAAATTATTGAAAGTATTGTGCGTGCTAATCAATTATCAAAACGAAAAGAAAAACAAATTTATTTTATC
GGGCATTTAAAAGAAGATAAGGCAATAGATAATGATGATAAGCCTAAGTTAATCACTCAAACTATTAAGTTGGATTATGC
TAAAGCTTCAGAAGTGATTGAATCTCTTACCAAAGGTAATGGTAATGTATTGTCAGAACAAGGGTATTTATATTTTGATG
AGCGAAGTAATAGCTTAATTATAAAAGATAGTCCTGATTCAATGAAGCATATTTTGGCATTAATAAAAAATTTAGATAAG
CCAACAGAACAAATTGCAATTGAGGCTAGAATTGTGACGATTAGTAGTCAAGATTTACAAGAACTTGGTGTCCGTTGGGG
AATGTTTTCTCCTTCTGACGATTATCATAAGGTGGCCATTAGTTTAGAAACTAAGGGTTTAACAACAGCAAATCATTTAA
ATGTTAACTTCCCGGTAAATAATGCAACTTCACTTGCTTTGCAAGTTGCAAGAATTAATAGTCGCGTATTAGATTTAGAA
TTAACGGCATTAGAGCAAGAGAATAATGTAGAAATTATTGCTAGCCCTCGTTTACTGACCACTAATAAAAAGCCTGCAAG
TATTAAACAAGGCGCTGAAATTCCTTATGTATTGTATAATCGTAAAGATGAAGTAAAAAATATTGAATTTAAAGAAGCTG
TATTAGGATTACAAGTTACACCGCATATTTCGGCGGATAATCAAATTTTATTAGATCTAGTGGTTACACAGAATTCACCA
AATTTAAATAATACGGCAGTGAATGGTTTAATTACGATTGATAAACAAGAATTGAATACTCAAGTATTTGCTAAACATGG
TGAAACCATTGTGCTTGGCGGTATTTTTCAACATTTGACTGCAAAAGGTGAAGATCGTGTGCCTATTTTAGGTTCAATTC
CTGTTCTTAAGAAGCTATTTAGCCATTCCACAGATAAAATCACTAAGCGTGAGCTTGTCATTTTTGTCACGCCTTATCTA
GTGCAAGATAAATATAAAAAATAG
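
The lengths reported in this record agree with each other: the inclusive coordinate span 58521..59744 is 1224 bp, and 1224 bp is 408 codons, which is 407 residues plus the terminal stop codon (the sequence above ends in TAG). The Python sketch below reproduces that check. The function names are hypothetical, and the calculation assumes the coding sequence includes exactly one stop codon.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS that includes one terminal stop codon (assumed)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = gene_length(58521, 59744)
residues = protein_length(cds_bp)
print(cds_bp, residues)  # 1224 407
```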


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comE | Glaesserella parasuis strain SC1401 | 66.015 | 100 | 0.663
comE | Haemophilus influenzae Rd KW20 | 51.515 | 97.297 | 0.501
comE | Haemophilus influenzae 86-028NP | 50.758 | 97.297 | 0.494
pilQ | Vibrio cholerae O1 biovar El Tor strain E7946 | 41.304 | 100 | 0.42
pilQ | Vibrio cholerae strain A1552 | 41.304 | 100 | 0.42
pilQ | Vibrio campbellii strain DS40M4 | 40.38 | 100 | 0.418


Multiple sequence alignment