Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   DV384_RS06010 Genome accession   NZ_CP031241
Coordinates   1227195..1228532 (-) Length   445 a.a.
NCBI ID   WP_114934598.1    Uniprot ID   -
Organism   Haemophilus influenzae strain M17648     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1222195..1233532
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DV384_RS05985 - 1223925..1224197 (-) 273 WP_114890071.1 HU family DNA-binding protein -
  DV384_RS05990 - 1224337..1224927 (-) 591 WP_005652133.1 YjaG family protein -
  DV384_RS05995 nudC 1224963..1225757 (-) 795 WP_114890070.1 NAD(+) diphosphatase -
  DV384_RS06000 nfuA 1225824..1226420 (-) 597 WP_005649300.1 Fe-S biogenesis protein NfuA -
  DV384_RS06005 - 1226496..1227182 (-) 687 WP_042594791.1 ComF family protein -
  DV384_RS06010 comE 1227195..1228532 (-) 1338 WP_114934598.1 type IV pilus secretin PilQ family protein Machinery gene
  DV384_RS06015 - 1228546..1228956 (-) 411 Protein_1141 pilus assembly protein PilP -
  DV384_RS06020 comC 1228953..1229474 (-) 522 WP_005662592.1 competence protein ComC Machinery gene
  DV384_RS06025 comB 1229471..1229977 (-) 507 WP_005656467.1 competence protein ComB Machinery gene
  DV384_RS06030 comA 1229978..1230775 (-) 798 WP_105886343.1 competence protein ComA Machinery gene
  DV384_RS06035 - 1230874..1233468 (+) 2595 WP_114934599.1 penicillin-binding protein 1A -

Sequence


Protein


Download         Length: 445 a.a.        Molecular weight: 49181.52 Da        Isoelectric Point: 7.8654

>NTDB_id=304500 DV384_RS06010 WP_114934598.1 1227195..1228532(-) (comE) [Haemophilus influenzae strain M17648]
MKKYLLKCGYFLVCFCLPLIVFANPKTDNERFFIRLSQAPLAQTVEQLAFQQDVNLVMGERLEGNISLKLNNIDMPRLLK
IIAKSKHLTLNKDDGIYYLNGSQSGKGEVAGNLTTNEPHLVSHTIKLHFAKASELMKSLTTGSGSLLSPAGSITFDDRSN
LLVIQDEPRFVQNIKKLIAEMDKPIEQIAIEARIVTITDESLKELGVRWGIFNPTENARRVAGSLTGNSFENIADNLNVN
FATTTTPAGSIALQVAKINGRLLDLELSALERENNVEIIASPRLLTTNKKSASIKQGTEIPYVVSNTRNDTQSVEFREAV
LGLEVTPHISKDNNILLDLLVSQNSPGSRVAYGQNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKSEDKVPLLGDI
PIIKRLFSKESERHQKRELVIFVTPHILKAGETLEALKQKSAGKK

Nucleotide


Download         Length: 1338 bp        

>NTDB_id=304500 DV384_RS06010 WP_114934598.1 1227195..1228532(-) (comE) [Haemophilus influenzae strain M17648]
ATGAAGAAATATCTTTTAAAGTGCGGTTATTTTTTAGTATGTTTTTGTTTGCCATTAATCGTTTTTGCTAATCCTAAAAC
AGATAACGAACGTTTTTTTATTCGTTTATCGCAAGCACCTTTAGCTCAAACAGTGGAGCAATTAGCTTTTCAACAAGATG
TGAATTTAGTGATGGGTGAGAGGTTAGAAGGCAATATTTCTTTGAAATTAAACAATATTGATATGCCACGTTTGCTAAAA
ATAATCGCAAAAAGTAAGCATCTTACTTTGAATAAAGATGATGGGATTTATTATTTAAACGGCAGTCAATCTGGCAAAGG
TGAAGTTGCAGGAAATCTTACGACAAATGAACCGCACTTAGTGAGTCATACGATAAAACTTCATTTTGCTAAAGCCTCTG
AATTAATGAAATCCTTAACAACAGGAAGTGGATCTTTGCTTTCTCCTGCGGGGAGCATTACCTTTGATGATCGCAGTAAT
TTGCTGGTTATTCAGGATGAACCTCGTTTTGTGCAAAATATCAAAAAACTGATTGCTGAAATGGATAAGCCTATTGAACA
GATCGCTATTGAAGCGCGAATTGTGACAATTACGGATGAGAGTTTGAAAGAACTTGGCGTTCGGTGGGGGATTTTTAATC
CAACTGAAAATGCAAGACGAGTTGCGGGCAGCCTTACAGGCAATAGCTTTGAAAATATTGCGGATAATCTTAATGTAAAT
TTTGCGACAACGACGACACCTGCTGGCTCTATAGCATTACAAGTCGCGAAAATTAATGGGCGATTGCTTGATTTAGAATT
GAGTGCGTTGGAGCGTGAAAATAATGTAGAAATTATTGCAAGTCCTCGCTTACTCACTACCAATAAGAAAAGTGCAAGCA
TTAAACAGGGGACAGAAATTCCTTACGTCGTGAGCAATACTCGTAACGATACGCAATCTGTGGAATTTCGTGAGGCAGTG
CTTGGTTTGGAAGTGACGCCACATATTTCTAAAGATAACAATATCTTACTTGATTTATTGGTAAGTCAAAATTCCCCTGG
TTCTCGTGTCGCTTATGGACAAAATGAGGTGGTTTCTATTGATAAGCAAGAAATTAATACTCAGGTTTTTGCCAAAGATG
GGGAAACCATTGTGCTTGGCGGCGTATTTCACGACACAATCACGAAAAGCGAAGATAAAGTGCCATTGCTTGGCGATATA
CCCATTATTAAACGATTATTTAGCAAAGAAAGTGAACGACATCAAAAACGTGAGCTCGTGATTTTCGTCACGCCACATAT
TTTAAAAGCAGGAGAAACGTTAGAGGCGTTGAAACAAAAAAGTGCGGGCAAAAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae 86-028NP

97.978

100

0.98

  comE Haemophilus influenzae Rd KW20

96.854

100

0.969

  comE Glaesserella parasuis strain SC1401

52.235

95.506

0.499

  pilQ Vibrio campbellii strain DS40M4

42.317

95.056

0.402

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

41.748

92.584

0.387

  pilQ Vibrio cholerae strain A1552

41.748

92.584

0.387


Multiple sequence alignment