Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   A6042_RS02990 Genome accession   NZ_CP015430
Coordinates   676359..677582 (-) Length   407 a.a.
NCBI ID   WP_231087463.1    Uniprot ID   -
Organism   [Haemophilus] ducreyi strain GHA2     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 671359..682582
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  A6042_RS02960 (A6042_02965) suhB 671587..672399 (-) 813 WP_064082628.1 inositol-1-monophosphatase -
  A6042_RS02965 (A6042_02970) - 672566..673588 (+) 1023 WP_064082627.1 TrmH family RNA methyltransferase -
  A6042_RS02970 (A6042_02975) - 673649..674269 (-) 621 WP_064082626.1 LysE family transporter -
  A6042_RS02975 (A6042_02980) - 674279..674752 (-) 474 WP_064083425.1 phosphatidylglycerophosphatase A -
  A6042_RS02980 (A6042_02985) thiL 674754..675719 (-) 966 WP_064082625.1 thiamine-phosphate kinase -
  A6042_RS02985 (A6042_02990) nusB 675776..676189 (-) 414 WP_064082624.1 transcription antitermination factor NusB -
  A6042_RS02990 (A6042_02995) comE 676359..677582 (-) 1224 WP_231087463.1 type IV pilus secretin PilQ Machinery gene
  A6042_RS02995 (A6042_03000) - 677612..678007 (-) 396 WP_064083238.1 hypothetical protein -
  A6042_RS08125 - 678010..678135 (-) 126 WP_257722001.1 hypothetical protein -
  A6042_RS03000 (A6042_03005) - 678148..678528 (-) 381 WP_064082622.1 hypothetical protein -
  A6042_RS03005 (A6042_03010) - 678533..679054 (-) 522 WP_064082621.1 hypothetical protein -
  A6042_RS03010 (A6042_03015) - 679036..679701 (-) 666 WP_064082620.1 hypothetical protein -
  A6042_RS03015 (A6042_03020) - 679837..682395 (+) 2559 WP_064082619.1 penicillin-binding protein 1A -

Sequence


Protein


Download         Length: 407 a.a.        Molecular weight: 45571.35 Da        Isoelectric Point: 6.6003

>NTDB_id=179549 A6042_RS02990 WP_231087463.1 676359..677582(-) (comE) [[Haemophilus] ducreyi strain GHA2]
MLFISLSLLAQPLSLSLKNAPTAEILSYLAEENMKNIVLSDQIDKNTTLRIENSHFDEIIESIVRANQLSKRKEKQIYFI
GHLKEDKAIDNDDKPKLITQTIKLDYAKASEVIESLTKGNGNVLSEQGYLYFDERSNSLIIKDSPDSMKHILALIKNLDK
PTEQIAIEARIVTISSQDLQELGVRWGMFSPSDDYHKVAISLETKGLTTANHLNVNFPVNNATSLALQVARINSRVLDLE
LTALEQENNVEIIASPRLLTTNKKPASIKQGAEIPYVLYNRKDEVKNIEFKEAVLGLQVTPHISADNQILLDLVVTQNSP
NLNNTAVNGLITIDKQELNTQVFAKHGETIVLGGIFQHLTAKGEDRVPILGSIPVLKKLFSHSTDKITKRELVIFVTPYL
VQDKYKK

Nucleotide


Download         Length: 1224 bp        

>NTDB_id=179549 A6042_RS02990 WP_231087463.1 676359..677582(-) (comE) [[Haemophilus] ducreyi strain GHA2]
ATTCTTTTTATTTCATTATCATTATTAGCACAACCGTTATCGCTGTCTTTAAAAAATGCACCTACTGCAGAAATATTAAG
TTATTTGGCTGAAGAAAATATGAAAAATATCGTATTGAGTGATCAAATAGACAAAAATACGACTCTTAGAATTGAAAATA
GTCATTTTGATGAAATTATTGAAAGTATTGTGCGTGCTAATCAATTATCAAAACGAAAAGAAAAACAAATTTATTTTATC
GGGCATTTAAAAGAAGATAAGGCAATAGATAATGATGATAAGCCTAAGTTAATCACTCAAACTATTAAGTTGGATTATGC
TAAAGCTTCAGAAGTGATTGAATCTCTTACCAAAGGTAATGGTAATGTATTGTCAGAACAAGGGTATTTATATTTTGATG
AGCGAAGTAATAGCTTAATTATAAAAGATAGTCCTGATTCAATGAAGCATATTTTGGCATTAATAAAAAATTTAGATAAG
CCAACAGAACAAATTGCAATTGAGGCTAGAATTGTGACGATTAGTAGTCAAGATTTACAAGAACTTGGTGTCCGTTGGGG
AATGTTTTCTCCTTCTGACGATTATCATAAGGTGGCCATTAGTTTAGAAACTAAGGGTTTAACAACAGCAAATCATTTAA
ATGTTAACTTCCCGGTAAATAATGCAACTTCACTTGCTTTGCAAGTTGCAAGAATTAATAGTCGCGTATTAGATTTAGAA
TTAACGGCATTAGAGCAAGAGAATAATGTAGAAATTATTGCTAGCCCTCGTTTACTGACCACTAATAAAAAGCCTGCAAG
TATTAAACAAGGCGCTGAAATTCCTTATGTATTGTATAATCGTAAAGATGAAGTAAAAAATATTGAATTTAAAGAAGCTG
TATTAGGATTACAAGTTACACCGCATATTTCGGCGGATAATCAAATTTTATTAGATCTAGTGGTTACACAGAATTCACCA
AATTTAAATAATACGGCAGTGAATGGTTTAATTACGATTGATAAACAAGAATTGAATACTCAAGTATTTGCTAAACATGG
TGAAACCATTGTGCTTGGCGGTATTTTTCAACATTTGACTGCAAAAGGTGAAGATCGTGTGCCTATTTTAGGTTCAATTC
CTGTTCTTAAGAAGCTATTTAGCCATTCCACAGATAAAATCACTAAGCGTGAGCTTGTCATTTTTGTCACGCCTTATCTA
GTGCAAGATAAATATAAAAAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Glaesserella parasuis strain SC1401

66.015

100

0.663

  comE Haemophilus influenzae Rd KW20

51.515

97.297

0.501

  comE Haemophilus influenzae 86-028NP

50.758

97.297

0.494

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

41.304

100

0.42

  pilQ Vibrio cholerae strain A1552

41.304

100

0.42

  pilQ Vibrio campbellii strain DS40M4

40.38

100

0.418


Multiple sequence alignment