Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comE
Type   Machinery gene
Locus tag   A6044_RS07245
Genome accession   NZ_CP015432
Coordinates   1470624..1471883 (-)
Length   419 a.a.
NCBI ID   WP_010944452.1
UniProt ID   Q7VNQ3
Organism   [Haemophilus] ducreyi strain GHA5
Function   type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
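
The coordinates and the protein length listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive interval."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS that ends in one stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(1470624, 1471883)
print(cds_bp, protein_length(cds_bp))  # 1260 419
```

The 1260 bp result matches the size given for comE in the genomic context table, and 419 matches the listed protein length.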

Genomic Context


Location: 1465624..1476883
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  A6044_RS07220 (A6044_07240) suhB 1466413..1467225 (-) 813 WP_064088426.1 inositol-1-monophosphatase -
  A6044_RS07225 (A6044_07245) - 1467392..1468414 (+) 1023 WP_064088425.1 TrmH family RNA methyltransferase -
  A6044_RS07230 (A6044_07250) - 1468544..1469017 (-) 474 WP_041603618.1 phosphatidylglycerophosphatase A -
  A6044_RS07235 (A6044_07255) thiL 1469019..1469984 (-) 966 WP_010944454.1 thiamine-phosphate kinase -
  A6044_RS07240 (A6044_07260) nusB 1470041..1470454 (-) 414 WP_010944453.1 transcription antitermination factor NusB -
  A6044_RS07245 (A6044_07265) comE 1470624..1471883 (-) 1260 WP_010944452.1 type IV pilus secretin PilQ Machinery gene
  A6044_RS07250 (A6044_07270) - 1471893..1472288 (-) 396 WP_010944451.1 hypothetical protein -
  A6044_RS07255 (A6044_07275) - 1472291..1472809 (-) 519 WP_041603365.1 hypothetical protein -
  A6044_RS07260 (A6044_07280) - 1472814..1472999 (-) 186 WP_010944449.1 hypothetical protein -
  A6044_RS08685 (A6044_07290) - 1473313..1473504 (-) 192 WP_050711621.1 hypothetical protein -
  A6044_RS07270 (A6044_07295) - 1473520..1473981 (-) 462 WP_010944445.1 hypothetical protein -
  A6044_RS07275 (A6044_07300) - 1474117..1476675 (+) 2559 WP_064088424.1 penicillin-binding protein 1A -
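
The Location range above equals the comE coordinates extended by 5000 bp on each side. That flank size is inferred from the numbers and is not stated on the page. The sketch below reproduces the window and also computes the intergenic gaps between comE and its two immediate neighbours in the table; `FLANK`, `context_window`, and `gap` are hypothetical names used only for illustration.

```python
FLANK = 5000  # assumed flank size, inferred from the listed Location range


def context_window(start: int, end: int, flank: int = FLANK) -> tuple:
    """Extend a 1-based inclusive interval by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def gap(left_end: int, right_start: int) -> int:
    """Number of bases strictly between two features (negative if they overlap)."""
    return right_start - left_end - 1


print(context_window(1470624, 1471883))  # (1465624, 1476883)
print(gap(1470454, 1470624))  # nusB end to comE start: 169
print(gap(1471883, 1471893))  # comE end to A6044_RS07250 start: 9
```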

Sequence


Protein


Length: 419 a.a.   Molecular weight: 46939.99 Da   Isoelectric point: 7.2514

>NTDB_id=179606 A6044_RS07245 WP_010944452.1 1470624..1471883(-) (comE) [[Haemophilus] ducreyi strain GHA5]
MRILFSLFFFISLSLLAQPLSLSLKNAPTAEILSYLAEENMKNIVLSDQIDKNTTLRIENSHFDEIVESIVRANQLSKRK
EKQIYFIGHLKEDKAIEKIDNGDKPKLITQTIKLDYAKASEVIESLTKGKGNGNVLSEQGYLYFDERSNSLIIKDSPDSM
KHILALIKNLDKPTEQIAIEARIVTISSQDLQELGVRWGMFSPSDDYHKVASSLETKGLTTANHLNVNFPVNNATSLALQ
VARINSRVLDLELTALEQENNVEIIASPRLLTTNKKPASIKQGAEIPYVLYNRKDEVKNIEFKEAVLGLQVTPHISADNQ
ILLDLVVTQNSPNLNNTAVNGLITIDKQELNTQVFAKHGETIVLGGIFQHLTAKGEDRVPLLGSIPVLKKLFSHSTDKIT
KRELVIFVTPYLVQDKYKK
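
The FASTA header above packs several fields into one line. The following is a minimal sketch of splitting it into named fields with a regular expression. The field layout is assumed from this single record, and the pattern and field names are illustrative rather than an official specification of the database format.

```python
import re

# Pattern inferred from the one header shown in this record (assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>.+)\]$"
)


def parse_header(line: str) -> dict:
    """Return the header fields as a dict; coordinates are converted to int."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=179606 A6044_RS07245 WP_010944452.1 "
          "1470624..1471883(-) (comE) [[Haemophilus] ducreyi strain GHA5]")
fields = parse_header(header)
print(fields["gene"], fields["strand"], fields["organism"])
```

The organism group is greedy so that the inner brackets of "[Haemophilus]" are kept as part of the name.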

Nucleotide


Length: 1260 bp

>NTDB_id=179606 A6044_RS07245 WP_010944452.1 1470624..1471883(-) (comE) [[Haemophilus] ducreyi strain GHA5]
ATGAGAATACTTTTTTCTTTATTCTTTTTTATTTCATTATCATTATTAGCACAACCGTTATCGCTGTCTTTAAAAAATGC
ACCTACTGCAGAAATATTAAGTTATTTAGCTGAAGAAAATATGAAAAATATCGTATTGAGTGATCAAATAGACAAAAATA
CGACTCTTAGAATTGAAAATAGTCATTTTGATGAAATTGTTGAAAGTATTGTGCGTGCTAATCAATTATCAAAACGAAAA
GAAAAACAGATTTATTTTATCGGGCATTTAAAAGAAGATAAGGCAATAGAGAAAATAGATAATGGTGATAAGCCTAAGTT
AATCACTCAAACTATTAAGTTGGATTATGCTAAAGCTTCAGAAGTAATTGAATCTCTTACTAAAGGTAAAGGTAATGGTA
ATGTATTGTCAGAACAAGGGTATTTATATTTTGATGAGCGAAGTAATAGCTTAATTATAAAAGATAGTCCTGATTCAATG
AAGCATATTTTGGCATTAATAAAAAATTTAGATAAGCCAACAGAACAAATTGCAATTGAGGCTAGAATTGTGACGATTAG
TAGTCAAGATTTACAAGAGCTTGGTGTCCGTTGGGGAATGTTTTCGCCTTCTGACGATTATCATAAGGTGGCCAGTAGTT
TAGAAACCAAGGGTTTAACAACAGCAAATCATTTAAATGTTAACTTCCCAGTAAATAATGCAACTTCACTTGCTTTGCAA
GTTGCAAGAATTAATAGTCGCGTATTAGATTTAGAATTAACGGCATTAGAGCAAGAGAATAATGTAGAAATTATTGCTAG
TCCTCGTTTACTGACCACTAATAAAAAGCCTGCAAGTATTAAACAAGGCGCTGAAATTCCTTATGTATTGTATAACCGTA
AAGATGAAGTAAAAAATATTGAATTTAAAGAAGCTGTATTAGGATTACAAGTTACACCGCATATTTCGGCGGATAATCAA
ATTTTATTAGATCTAGTGGTTACACAGAACTCACCAAATTTAAATAATACGGCAGTGAATGGTTTAATTACGATTGATAA
ACAAGAATTGAATACTCAAGTATTTGCTAAGCATGGTGAAACCATTGTGCTTGGCGGTATTTTTCAACATTTAACTGCAA
AAGGTGAAGATCGCGTGCCTCTTTTAGGTTCAATTCCTGTTCTTAAGAAGCTATTTAGCCATTCCACAGATAAAATCACT
AAGCGTGAACTTGTCATTTTTGTCACGCCTTATCTAGTGCAAGATAAATATAAAAAATAG
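
Although the gene lies on the minus strand, the nucleotide sequence above is given in coding orientation (it starts with ATG and ends with the stop codon TAG), so it can be translated directly. The sketch below uses the standard codon assignments, which are the same as those of NCBI translation table 11 used for bacteria. The `translate` helper is illustrative, and only the first and last few codons of the record are used as input.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence and drop trailing stop codons."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


print(translate("ATGAGAATACTTTTTTCTTTATTCTTTTTT"))  # MRILFSLFFF
print(translate("GATAAATATAAAAAATAG"))              # DKYKK
```

Both outputs agree with the start (MRILFSLFFF) and end (DKYKK) of the protein sequence listed in the Protein section.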


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source   ID
  AlphaFold DB   Q7VNQ3

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comE   Glaesserella parasuis strain SC1401   66.268   99.761   0.661
  comE   Haemophilus influenzae Rd KW20   49.644   100   0.499
  comE   Haemophilus influenzae 86-028NP   49.169   100   0.494
  pilQ   Vibrio cholerae O1 biovar El Tor strain E7946   42.029   98.807   0.415
  pilQ   Vibrio cholerae strain A1552   42.029   98.807   0.415
  pilQ   Vibrio campbellii strain DS40M4   40.995   100   0.413


Multiple sequence alignment