Detailed information    

In silico: bioinformatically predicted

Overview


Name: comE
Type: Machinery gene
Locus tag: A6046_RS04900
Genome accession: NZ_CP015434
Coordinates: 938953..940212 (+)
Length: 419 a.a.
NCBI ID: WP_010944452.1
UniProt ID: Q7VNQ3
Organism: [Haemophilus] ducreyi strain GHA9
Function: type IV pilus biogenesis and function (predicted from homology)
DNA binding and uptake
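The coordinates and protein length listed above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are illustrative and not part of the database.

```python
def feature_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates of comE (A6046_RS04900) as listed in this record.
cds_bp = feature_length_bp(938953, 940212)
print(cds_bp, protein_length_aa(cds_bp))  # 1260 419
```

The result matches the 1260 bp nucleotide length and the 419 a.a. protein length reported in the record.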

Genomic Context


Location: 933953..945212
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  A6046_RS04875 (A6046_04885) - 934161..936719 (-) 2559 WP_041603361.1 penicillin-binding protein 1A -
  A6046_RS04880 (A6046_04890) - 936855..937316 (+) 462 WP_010944445.1 hypothetical protein -
  A6046_RS08820 (A6046_04895) - 937320..937523 (+) 204 WP_010944446.1 hypothetical protein -
  A6046_RS08955 - 937873..938022 (+) 150 WP_155727656.1 hypothetical protein -
  A6046_RS04890 (A6046_04905) - 938027..938545 (+) 519 WP_064083109.1 hypothetical protein -
  A6046_RS04895 (A6046_04910) - 938548..938943 (+) 396 WP_010944451.1 hypothetical protein -
  A6046_RS04900 (A6046_04915) comE 938953..940212 (+) 1260 WP_010944452.1 type IV pilus secretin PilQ Machinery gene
  A6046_RS04905 (A6046_04920) nusB 940382..940795 (+) 414 WP_010944453.1 transcription antitermination factor NusB -
  A6046_RS04910 (A6046_04925) thiL 940852..941817 (+) 966 WP_010944454.1 thiamine-phosphate kinase -
  A6046_RS04915 (A6046_04930) - 941819..942292 (+) 474 WP_041603618.1 phosphatidylglycerophosphatase A -
  A6046_RS04920 (A6046_04935) - 942422..943444 (-) 1023 WP_041603368.1 TrmH family RNA methyltransferase -
  A6046_RS04925 (A6046_04940) suhB 943611..944423 (+) 813 WP_064083110.1 inositol-1-monophosphatase -
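The location window and the spacing between neighbouring genes can be derived from the coordinates in the table. The sketch below assumes the window is the comE coordinates extended by 5000 bp on each side; that flank size is inferred from the numbers on this page, not stated by the database. The `Gene` type and helper names are hypothetical.

```python
from typing import NamedTuple, Tuple


class Gene(NamedTuple):
    locus_tag: str
    start: int   # 1-based, inclusive (assumed convention)
    end: int
    strand: str


FLANK_BP = 5000  # assumption: inferred from the Location line, not documented


def context_window(start: int, end: int, flank: int = FLANK_BP) -> Tuple[int, int]:
    """Window around a feature, clamped so it never starts before position 1."""
    return max(1, start - flank), end + flank


def gap_bp(upstream: Gene, downstream: Gene) -> int:
    """Bases strictly between two features; a negative value means they overlap."""
    return downstream.start - upstream.end - 1


# Values copied from the table above.
upstream = Gene("A6046_RS04895", 938548, 938943, "+")
come = Gene("A6046_RS04900", 938953, 940212, "+")
nusb = Gene("A6046_RS04905", 940382, 940795, "+")

print(context_window(come.start, come.end))  # (933953, 945212)
print(gap_bp(upstream, come))                # 9
print(gap_bp(come, nusb))                    # 169
```

Under these assumptions the computed window reproduces the listed location, and comE sits 9 bp downstream of the preceding hypothetical protein gene on the same strand.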

Sequence


Protein


Length: 419 a.a.        Molecular weight: 46939.99 Da        Isoelectric point: 7.2514

>NTDB_id=179654 A6046_RS04900 WP_010944452.1 938953..940212(+) (comE) [[Haemophilus] ducreyi strain GHA9]
MRILFSLFFFISLSLLAQPLSLSLKNAPTAEILSYLAEENMKNIVLSDQIDKNTTLRIENSHFDEIVESIVRANQLSKRK
EKQIYFIGHLKEDKAIEKIDNGDKPKLITQTIKLDYAKASEVIESLTKGKGNGNVLSEQGYLYFDERSNSLIIKDSPDSM
KHILALIKNLDKPTEQIAIEARIVTISSQDLQELGVRWGMFSPSDDYHKVASSLETKGLTTANHLNVNFPVNNATSLALQ
VARINSRVLDLELTALEQENNVEIIASPRLLTTNKKPASIKQGAEIPYVLYNRKDEVKNIEFKEAVLGLQVTPHISADNQ
ILLDLVVTQNSPNLNNTAVNGLITIDKQELNTQVFAKHGETIVLGGIFQHLTAKGEDRVPLLGSIPVLKKLFSHSTDKIT
KRELVIFVTPYLVQDKYKK

Nucleotide


Length: 1260 bp

>NTDB_id=179654 A6046_RS04900 WP_010944452.1 938953..940212(+) (comE) [[Haemophilus] ducreyi strain GHA9]
ATGAGAATACTTTTTTCTTTATTCTTTTTTATTTCATTATCATTATTAGCACAACCGTTATCGCTGTCTTTAAAAAATGC
ACCTACTGCAGAAATATTAAGTTATTTAGCTGAAGAAAATATGAAAAATATCGTATTGAGTGATCAAATAGACAAAAATA
CGACTCTTAGAATTGAAAATAGTCATTTTGATGAAATTGTTGAAAGTATTGTGCGTGCTAATCAATTATCAAAACGAAAA
GAAAAACAGATTTATTTTATCGGGCATTTAAAAGAAGATAAGGCAATAGAGAAAATAGATAATGGTGATAAGCCTAAGTT
AATCACTCAAACTATTAAGTTGGATTATGCTAAAGCTTCAGAAGTAATTGAATCTCTTACTAAAGGTAAAGGTAATGGTA
ATGTATTGTCAGAACAAGGGTATTTATATTTTGATGAGCGAAGTAATAGCTTAATTATAAAAGATAGTCCTGATTCAATG
AAGCATATTTTGGCATTAATAAAAAATTTAGATAAGCCAACAGAACAAATTGCAATTGAGGCTAGAATTGTGACGATTAG
TAGTCAAGATTTACAAGAGCTTGGTGTCCGTTGGGGAATGTTTTCGCCTTCTGACGATTATCATAAGGTGGCCAGTAGTT
TAGAAACCAAGGGTTTAACAACAGCAAATCATTTAAATGTTAACTTCCCAGTAAATAATGCAACTTCACTTGCTTTGCAA
GTTGCAAGAATTAATAGTCGCGTATTAGATTTAGAATTAACGGCATTAGAGCAAGAGAATAATGTAGAAATTATTGCTAG
TCCTCGTTTACTGACCACTAATAAAAAGCCTGCAAGTATTAAACAAGGCGCTGAAATTCCTTATGTATTGTATAACCGTA
AAGATGAAGTAAAAAATATTGAATTTAAAGAAGCTGTATTAGGATTACAAGTTACACCGCATATTTCGGCGGATAATCAA
ATTTTATTAGATCTAGTGGTTACACAGAACTCACCAAATTTAAATAATACGGCAGTGAATGGTTTAATTACGATTGATAA
ACAAGAATTGAATACTCAAGTATTTGCTAAGCATGGTGAAACCATTGTGCTTGGCGGTATTTTTCAACATTTAACTGCAA
AAGGTGAAGATCGCGTGCCTCTTTTAGGTTCAATTCCTGTTCTTAAGAAGCTATTTAGCCATTCCACAGATAAAATCACT
AAGCGTGAACTTGTCATTTTTGTCACGCCTTATCTAGTGCAAGATAAATATAAAAAATAG
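A FASTA record like the one above can be sanity-checked as a coding sequence before further use. The following is a minimal sketch using only the standard library; the helper names and the toy record are hypothetical, and the toy sequence only borrows the first and last bases of the real record for illustration.

```python
from typing import Dict, Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard bacterial stop codons


def parse_fasta(text: str) -> Tuple[str, str]:
    """Parse a single-record FASTA string into (header, sequence)."""
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                raise ValueError("expected exactly one FASTA record")
            header = line[1:]
        else:
            chunks.append(line.upper())
    if header is None:
        raise ValueError("no FASTA header found")
    return header, "".join(chunks)


def cds_report(seq: str) -> Dict[str, object]:
    """Basic structural checks for a complete CDS.

    Note: some bacterial genes start with GTG or TTG, so a False value for
    'starts_with_atg' is not necessarily an error.
    """
    return {
        "length_bp": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
    }


# Toy record (hypothetical): not the full comE sequence.
TOY_FASTA = ">toy_cds hypothetical example\nATGAGAATACTT\nAAATAG\n"

header, seq = parse_fasta(TOY_FASTA)
print(header)
print(cds_report(seq))
```

Pasting the full nucleotide record in place of `TOY_FASTA` should report a length of 1260 bp if the sequence was copied intact.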


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q7VNQ3

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Glaesserella parasuis strain SC1401 66.268 99.761 0.661
  comE Haemophilus influenzae Rd KW20 49.644 100 0.499
  comE Haemophilus influenzae 86-028NP 49.169 100 0.494
  pilQ Vibrio cholerae O1 biovar El Tor strain E7946 42.029 98.807 0.415
  pilQ Vibrio cholerae strain A1552 42.029 98.807 0.415
  pilQ Vibrio campbellii strain DS40M4 40.995 100 0.413


Multiple sequence alignment