Detailed information    

In silico: bioinformatically predicted

Overview


Name   comE
Type   Machinery gene
Locus tag   JUD77_RS03050
Genome accession   NZ_AP024414
Coordinates   610774..612135 (+)
Length   453 a.a.
NCBI ID   WP_203620542.1
UniProt ID   -
Organism   Haemophilus influenzae strain TA8730
Function   type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
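
As a quick consistency check on the Overview fields, the gene coordinates and the protein length can be related by simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in a.a.) for a coding sequence.

    Assumptions (not stated by the record): coordinates are 1-based and
    inclusive, and the CDS ends with exactly one stop codon.
    """
    gene_length_bp = end - start + 1      # inclusive interval
    codons = gene_length_bp // 3          # includes the stop codon
    protein_length_aa = codons - 1        # drop the stop codon
    return gene_length_bp, protein_length_aa


# Coordinates listed for comE (JUD77_RS03050)
print(cds_lengths(610774, 612135))
```

Under these assumptions the result agrees with the 1362 bp and 453 a.a. reported in this record.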

Genomic Context


Location: 605774..617135
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  JUD77_RS03025 (TA8730_05780) - 605840..608434 (-) 2595 WP_071162875.1 penicillin-binding protein 1A -
  JUD77_RS03030 (TA8730_05790) comA 608532..609329 (+) 798 WP_071162874.1 competence protein ComA Machinery gene
  JUD77_RS03035 (TA8730_05800) comB 609330..609836 (+) 507 WP_071162873.1 competence protein ComB Machinery gene
  JUD77_RS03040 (TA8730_05810) comC 609833..610354 (+) 522 WP_048951462.1 competence protein ComC Machinery gene
  JUD77_RS03045 (TA8730_05820) comD 610351..610764 (+) 414 WP_005693727.1 pilus assembly protein PilP Machinery gene
  JUD77_RS03050 (TA8730_05830) comE 610774..612135 (+) 1362 WP_203620542.1 type IV pilus secretin PilQ family protein Machinery gene
  JUD77_RS03055 (TA8730_05840) - 612148..612834 (+) 687 WP_071162871.1 ComF family protein -
  JUD77_RS03060 (TA8730_05850) nfuA 612910..613506 (+) 597 WP_005649300.1 Fe-S biogenesis protein NfuA -
  JUD77_RS03065 (TA8730_05860) nudC 613574..614368 (+) 795 WP_071162870.1 NAD(+) diphosphatase -
  JUD77_RS03070 (TA8730_05870) - 614404..614994 (+) 591 WP_071162869.1 YjaG family protein -
  JUD77_RS03075 (TA8730_05880) - 615134..615406 (+) 273 WP_005630954.1 HU family DNA-binding protein -
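
The listed Location appears to be the comE coordinates extended by 5000 bp on each side, and each Size (bp) value matches an inclusive coordinate span. The sketch below checks both against a few rows from the table; the 5000 bp flank is inferred from the numbers shown here rather than stated by the source, and the helper names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from the listed Location, not documented


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend a gene interval by `flank` bp on both sides (1-based, inclusive)."""
    return max(1, start - flank), end + flank


def size_bp(start: int, end: int) -> int:
    """Length of a 1-based inclusive interval."""
    return end - start + 1


# (locus tag, start, end, listed size) copied from the table above
rows = [
    ("JUD77_RS03025", 605840, 608434, 2595),
    ("JUD77_RS03045", 610351, 610764, 414),
    ("JUD77_RS03050", 610774, 612135, 1362),
    ("JUD77_RS03075", 615134, 615406, 273),
]

print(context_window(610774, 612135))
for tag, start, end, listed in rows:
    print(tag, size_bp(start, end), size_bp(start, end) == listed)
```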

Sequence


Protein


Length: 453 a.a.        Molecular weight: 50009.32 Da        Isoelectric point: 7.1645

>NTDB_id=85083 JUD77_RS03050 WP_203620542.1 610774..612135(+) (comE) [Haemophilus influenzae strain TA8730]
MKKYFLKCGYFLVCFCLPLIVFANPKTDNERFFIRLSQAPLAQTLEQLAFQQDVNLVMGERLEGNISLKLNNIDMPRLLK
IIAKSKHLTLNKDDGIYYLNGSQSGKGQVAGNLTTNEPHLVSHTVKLHFAKASELMKSLTTGSGSLLSPAGSITFDDRSN
LLVIQDEPRSVQNIKKLIAEMDKPIEQIAIEARIVTITDESLKELGVRWGIFNPTENARRVAGSLTGNSFENIADNLNVN
IADNLNVNFATTTTPAGSIALQVAKINGRLLDLELSALERENNVEIIASPRLLTTNKKSASIKQGTEIPYIVSNTRNDTQ
SVEFREAVLGLEVTPHISKDNNILLDLLVSQNSPGSRVAYGQNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKSED
KVPLLGDIPVIKRLFSKESERHQKRELVIFVTPHILKAGETLEALKQKSAGEK

Nucleotide


Length: 1362 bp

>NTDB_id=85083 JUD77_RS03050 WP_203620542.1 610774..612135(+) (comE) [Haemophilus influenzae strain TA8730]
ATGAAGAAATATTTTTTAAAGTGCGGTTATTTTTTAGTATGTTTTTGTTTGCCATTAATCGTTTTTGCTAATCCTAAAAC
AGATAACGAACGTTTTTTTATTCGTTTATCGCAAGCACCTTTAGCTCAAACACTGGAGCAATTAGCTTTTCAACAAGATG
TGAATTTAGTGATGGGTGAGAGGTTAGAAGGCAATATTTCTTTGAAATTAAACAATATTGATATGCCACGTTTGCTAAAA
ATAATCGCAAAAAGTAAGCATCTTACTTTGAATAAAGATGATGGGATTTATTATTTAAACGGCAGTCAATCTGGCAAAGG
TCAAGTTGCAGGAAATCTTACGACAAATGAACCGCACTTAGTCAGCCACACGGTAAAACTTCATTTTGCTAAAGCCTCTG
AATTAATGAAATCTTTAACAACAGGAAGTGGATCTTTGCTTTCTCCTGCGGGGAGCATTACCTTTGATGATCGCAGTAAT
TTGCTGGTTATTCAGGATGAACCTCGTTCTGTGCAAAATATCAAAAAACTGATTGCTGAAATGGATAAGCCTATTGAACA
GATCGCTATTGAAGCGCGAATTGTGACAATTACGGATGAGAGTTTGAAAGAACTTGGCGTTCGGTGGGGGATTTTTAATC
CAACTGAAAATGCAAGACGAGTTGCGGGCAGCCTTACAGGCAATAGCTTTGAAAATATTGCGGATAATCTTAATGTAAAT
ATTGCGGATAATCTTAATGTAAATTTTGCGACAACGACGACACCTGCTGGCTCTATAGCATTACAAGTTGCCAAAATTAA
TGGGCGATTGCTTGATTTAGAATTGAGTGCGTTGGAGCGTGAAAATAATGTAGAAATTATTGCAAGTCCTCGCTTACTCA
CTACCAATAAGAAAAGTGCGAGCATTAAACAGGGGACAGAAATTCCTTACATCGTGAGTAATACTCGTAACGATACGCAA
TCTGTGGAATTTCGTGAGGCGGTGCTTGGTTTGGAAGTGACGCCACATATTTCTAAAGATAACAATATCTTACTTGATTT
ATTGGTAAGTCAAAATTCCCCTGGTTCTCGTGTCGCTTATGGACAAAATGAGGTGGTTTCTATTGATAAGCAAGAAATTA
ATACTCAGGTTTTTGCCAAAGATGGGGAAACCATTGTGCTTGGCGGCGTATTTCACGACACAATCACGAAAAGCGAAGAT
AAAGTGCCATTGCTTGGCGATATACCCGTTATTAAACGATTATTTAGCAAAGAAAGTGAACGACATCAAAAACGTGAGCT
CGTGATTTTCGTGACGCCACATATTTTAAAAGCAGGAGAAACGTTAGAGGCGTTGAAACAAAAAAGTGCGGGGGAAAAAT
AA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae 86-028NP 97.13 100 0.971
  comE Haemophilus influenzae Rd KW20 96.468 100 0.965
  comE Glaesserella parasuis strain SC1401 51.501 95.585 0.492
  pilQ Vibrio campbellii strain DS40M4 42.959 92.494 0.397
  pilQ Vibrio cholerae O1 biovar El Tor strain E7946 41.889 91.17 0.382
  pilQ Vibrio cholerae strain A1552 41.889 91.17 0.382
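
The record does not define the Ha-value. The numbers listed here are, however, consistent with the product of identity and coverage expressed as fractions and rounded to three decimals. The sketch below reproduces the column under that assumption; the function name `ha_value` is hypothetical, and the formula is an inference from this table, not a documented definition.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed formula: (identity / 100) * (coverage / 100), rounded to 3 decimals."""
    return round(identity_pct * coverage_pct / 10_000, 3)


# (protein, organism, identity %, coverage %, listed Ha-value) from the table above
hits = [
    ("comE", "Haemophilus influenzae 86-028NP", 97.13, 100, 0.971),
    ("comE", "Haemophilus influenzae Rd KW20", 96.468, 100, 0.965),
    ("comE", "Glaesserella parasuis strain SC1401", 51.501, 95.585, 0.492),
    ("pilQ", "Vibrio campbellii strain DS40M4", 42.959, 92.494, 0.397),
    ("pilQ", "Vibrio cholerae O1 biovar El Tor strain E7946", 41.889, 91.17, 0.382),
    ("pilQ", "Vibrio cholerae strain A1552", 41.889, 91.17, 0.382),
]

for protein, organism, identity, coverage, listed in hits:
    print(protein, organism, ha_value(identity, coverage), listed)
```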


Multiple sequence alignment