Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   A6039_RS06580 Genome accession   NZ_CP015427
Coordinates   1368382..1369635 (-) Length   417 a.a.
NCBI ID   WP_064085149.1    Uniprot ID   -
Organism   [Haemophilus] ducreyi strain VAN4     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1363382..1374635
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  A6039_RS06555 (A6039_06575) suhB 1364171..1364983 (-) 813 WP_010944457.1 inositol-1-monophosphatase -
  A6039_RS06560 (A6039_06580) - 1365150..1366172 (+) 1023 WP_041603368.1 TrmH family RNA methyltransferase -
  A6039_RS06565 (A6039_06585) - 1366302..1366775 (-) 474 WP_041603618.1 phosphatidylglycerophosphatase A -
  A6039_RS06570 (A6039_06590) thiL 1366777..1367742 (-) 966 WP_010944454.1 thiamine-phosphate kinase -
  A6039_RS06575 (A6039_06595) nusB 1367799..1368212 (-) 414 WP_010944453.1 transcription antitermination factor NusB -
  A6039_RS06580 (A6039_06600) comE 1368382..1369635 (-) 1254 WP_064085149.1 type IV pilus secretin PilQ Machinery gene
  A6039_RS06585 (A6039_06605) - 1369645..1370040 (-) 396 WP_010944451.1 hypothetical protein -
  A6039_RS06590 (A6039_06610) - 1370043..1370561 (-) 519 WP_041603365.1 hypothetical protein -
  A6039_RS06595 (A6039_06615) - 1370566..1370751 (-) 186 WP_010944449.1 hypothetical protein -
  A6039_RS08305 (A6039_06625) - 1371065..1371256 (-) 192 WP_050711621.1 hypothetical protein -
  A6039_RS06605 (A6039_06630) - 1371272..1371733 (-) 462 WP_010944445.1 hypothetical protein -
  A6039_RS06610 (A6039_06635) - 1371869..1374427 (+) 2559 WP_041603361.1 penicillin-binding protein 1A -

Sequence


Protein


Download         Length: 417 a.a.        Molecular weight: 46768.84 Da        Isoelectric Point: 7.2514

>NTDB_id=179493 A6039_RS06580 WP_064085149.1 1368382..1369635(-) (comE) [[Haemophilus] ducreyi strain VAN4]
MRILFSLFFFISLSLLAQPLSLSLKNAPTAEILSYLAEENMKNIVLSDQIDKNTTLRIENSHFDEIVESIVRANQLSKRK
EKQIYFIGHLKEDKAIEKIDNGDKPKLITQTIKLDYAKASEVIESLTKGKGNVLSEQGYLYFDERSNSLIIKDSPDSMKH
ILALIKNLDKPTEQIAIEARIVTISSQDLQELGVRWGMFSPSDDYHKVASSLETKGLTTANHLNVNFPVNNATSLALQVA
RINSRVLDLELTALEQENNVEIIASPRLLTTNKKPASIKQGAEIPYVLYNRKDEVKNIEFKEAVLGLQVTPHISADNQIL
LDLVVTQNSPNLNNTAVNGLITIDKQELNTQVFAKHGETIVLGGIFQHLTAKGEDRVPLLGSIPVLKKLFSHSTDKITKR
ELVIFVTPYLVQDKYKK

Nucleotide


Download         Length: 1254 bp        

>NTDB_id=179493 A6039_RS06580 WP_064085149.1 1368382..1369635(-) (comE) [[Haemophilus] ducreyi strain VAN4]
ATGAGAATACTTTTTTCTTTATTCTTTTTTATTTCATTATCATTATTAGCACAACCGTTATCGCTGTCTTTAAAAAATGC
ACCTACTGCAGAAATATTAAGTTATTTAGCTGAAGAAAATATGAAAAATATCGTATTGAGTGATCAAATAGACAAAAATA
CGACTCTTAGAATTGAAAATAGTCATTTTGATGAAATTGTTGAAAGTATTGTGCGTGCTAATCAATTATCAAAACGAAAA
GAAAAACAGATTTATTTTATCGGGCATTTAAAAGAAGATAAGGCAATAGAGAAAATAGATAATGGTGATAAGCCTAAGTT
AATCACTCAAACTATTAAGTTGGATTATGCTAAAGCTTCAGAAGTAATTGAATCTCTTACTAAAGGTAAAGGTAATGTAT
TGTCAGAACAAGGGTATTTATATTTTGATGAGCGAAGTAATAGCTTAATTATAAAAGATAGTCCTGATTCAATGAAGCAT
ATTTTGGCATTAATAAAAAATTTAGATAAGCCAACAGAACAAATTGCAATTGAGGCTAGAATTGTGACGATTAGTAGTCA
AGATTTACAAGAGCTTGGTGTCCGTTGGGGAATGTTTTCGCCTTCTGACGATTATCATAAGGTGGCCAGTAGTTTAGAAA
CCAAGGGTTTAACAACAGCAAATCATTTAAATGTTAACTTCCCAGTAAATAATGCAACTTCACTTGCTTTGCAAGTTGCA
AGAATTAATAGTCGCGTATTAGATTTAGAATTAACGGCATTAGAGCAAGAGAATAATGTAGAAATTATTGCTAGTCCTCG
TTTACTGACCACTAATAAAAAGCCTGCAAGTATTAAACAAGGCGCTGAAATTCCTTATGTATTGTATAACCGTAAAGATG
AAGTAAAAAATATTGAATTTAAAGAAGCTGTATTAGGATTACAAGTTACACCGCATATTTCGGCGGATAATCAAATTTTA
TTAGATCTAGTGGTTACACAGAACTCACCAAATTTAAATAATACGGCAGTGAATGGTTTAATTACGATTGATAAACAAGA
ATTGAATACTCAAGTATTTGCTAAGCATGGTGAAACCATTGTGCTTGGCGGTATTTTTCAACATTTAACTGCAAAAGGTG
AAGATCGCGTGCCTCTTTTAGGTTCAATTCCTGTTCTTAAGAAGCTATTTAGCCATTCCACAGATAAAATCACTAAGCGT
GAACTTGTCATTTTTGTCACGCCTTATCTAGTGCAAGATAAATATAAAAAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Glaesserella parasuis strain SC1401

66.587

99.76

0.664

  comE Haemophilus influenzae Rd KW20

49.881

100

0.501

  comE Haemophilus influenzae 86-028NP

49.403

100

0.496

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

42.029

99.281

0.417

  pilQ Vibrio cholerae strain A1552

42.029

99.281

0.417

  pilQ Vibrio campbellii strain DS40M4

40.995

100

0.415


Multiple sequence alignment