Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comE
Type: Machinery gene
Locus tag: ADJ80_RS03665
Genome accession: NZ_CP012067
Coordinates: 745158..746582 (-)
Length: 474 a.a.
NCBI ID: WP_050692985.1
UniProt ID: -
Organism: Aggregatibacter aphrophilus strain W10433
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake

Genomic Context


Location: 740158..751582
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
ADJ80_RS03640 (ADJ80_03655)   ygiD   740689..741474 (+)   786   WP_080988233.1   4,5-DOPA dioxygenase extradiol   -
ADJ80_RS03645 (ADJ80_03660)   -   741754..742122 (-)   369   WP_005701820.1   rhodanese-like domain-containing protein   -
ADJ80_RS03650 (ADJ80_03665)   -   742446..743300 (-)   855   WP_050692982.1   Dam family site-specific DNA-(adenine-N6)-methyltransferase   -
ADJ80_RS03655 (ADJ80_03670)   aroB   743306..744394 (-)   1089   WP_050692983.1   3-dehydroquinate synthase   -
ADJ80_RS03660 (ADJ80_03675)   aroK   744416..744943 (-)   528   WP_050692984.1   shikimate kinase AroK   -
ADJ80_RS03665 (ADJ80_03680)   comE   745158..746582 (-)   1425   WP_050692985.1   type IV pilus secretin PilQ   Machinery gene
ADJ80_RS03670 (ADJ80_03685)   -   746592..746987 (-)   396   WP_005701815.1   hypothetical protein   -
ADJ80_RS03675 (ADJ80_03690)   -   746987..747520 (-)   534   WP_005701814.1   competence protein   -
ADJ80_RS03680 (ADJ80_03695)   -   747517..748035 (-)   519   WP_005701813.1   PilN domain-containing protein   -
ADJ80_RS03685 (ADJ80_03700)   -   748049..748852 (-)   804   WP_050692986.1   hypothetical protein   -
ADJ80_RS03690 (ADJ80_03705)   -   748979..751540 (+)   2562   WP_050692987.1   penicillin-binding protein 1A   -
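
The Size column and the Location window can be checked directly from the coordinates. The following is a minimal Python sketch, not part of the database: the helper names `parse_coords`, `span_length`, and `flank_window` are hypothetical, and the 5000 bp flank is inferred from the values shown (745158 - 5000 = 740158 and 746582 + 5000 = 751582) rather than stated by the source.

```python
import re

# Matches strings such as "745158..746582 (-)" or "745158..746582(-)".
COORD_RE = re.compile(r"(\d+)\.\.(\d+)\s*\(([+-])\)")


def parse_coords(text):
    """Return (start, end, strand) from a 'start..end (strand)' string."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError(f"no coordinates found in {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def span_length(start, end):
    """Length in bp of an inclusive, 1-based interval."""
    return end - start + 1


def flank_window(start, end, flank=5000):
    """Window around a gene; the 5000 bp default is an assumption."""
    return max(1, start - flank), end + flank


start, end, strand = parse_coords("745158..746582 (-)")
print(span_length(start, end))   # size of comE in bp
print(flank_window(start, end))  # compare with the Location line
```

The same `span_length` call can be applied to any other row of the table to compare against its Size (bp) entry.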

Sequence


Protein


Length: 474 a.a.   Molecular weight: 52237.14 Da   Isoelectric point: 7.0239

>NTDB_id=151024 ADJ80_RS03665 WP_050692985.1 745158..746582(-) (comE) [Aggregatibacter aphrophilus strain W10433]
MKLPLPYCVKCGLFFSCFLSFAISLSVHAEQNQVFSIRLKQAPLVPTLQQLALAQNVNLIIDDELQGTVSLQLENVDLDQ
LFRSVAKIKQLDLWQENGIYYLSKLDTSAKFATKMDEPFILPAVMTEEPTRLTTATIKLHFAKASEVMKSLTGGSGSLLS
PNGSLTFDDRSNLLLIQDEPKSIANLKKLIKELDKPIEQIAIEARIVTMNGESLKELGVRWGIFNPTENAHRIGGSLAGN
GFDNITNQLNVNFPTATTPAGSVALQVAKINGRLLDLELTALERENNVEIIASPRLLTTNKKAASIKQGTELPYILVNEK
SGSQSVEFREAVLGLEVTPHISQDKQILLDLVVSQNAPGSRVAHGLGEVVSIDKQEINTQVFAKDGETIVLGGVFHDTIT
KNVDKVPLLGDIPGIKHLFSKESERHQKRELVIFVTPHILRQGETLESFKQKQGITQQTKMLPTQNTHKNPTKS

Nucleotide


Length: 1425 bp

>NTDB_id=151024 ADJ80_RS03665 WP_050692985.1 745158..746582(-) (comE) [Aggregatibacter aphrophilus strain W10433]
ATGAAATTACCTCTGCCCTACTGCGTAAAGTGCGGTCTGTTTTTCAGTTGTTTTTTGAGTTTTGCCATAAGCCTTTCCGT
TCATGCGGAACAAAATCAAGTGTTTTCCATTCGGCTGAAACAAGCGCCCTTGGTGCCCACTTTGCAACAATTGGCGTTAG
CGCAAAACGTCAATTTGATTATTGATGACGAATTGCAAGGCACGGTTTCGCTCCAACTGGAAAACGTGGATTTAGATCAG
TTGTTTCGCTCCGTTGCTAAAATTAAGCAGTTGGATTTGTGGCAGGAAAATGGGATTTATTATTTAAGCAAACTCGATAC
TTCCGCTAAATTTGCCACCAAAATGGACGAACCCTTTATCCTTCCTGCCGTGATGACGGAAGAACCGACGCGACTCACCA
CCGCCACCATTAAATTGCATTTTGCTAAAGCCTCGGAAGTGATGAAATCACTCACCGGCGGTAGCGGTTCGTTACTTTCC
CCAAACGGTAGCCTCACTTTTGACGATCGCAGTAACTTGTTGTTAATTCAAGACGAACCGAAATCCATCGCTAACCTGAA
AAAGCTGATTAAAGAATTAGATAAGCCCATTGAACAAATCGCCATTGAAGCCAGAATTGTCACCATGAACGGTGAAAGTC
TAAAAGAATTGGGCGTACGCTGGGGCATTTTTAACCCCACCGAAAATGCACATCGTATCGGCGGAAGTTTGGCGGGTAAC
GGCTTTGACAATATCACCAATCAATTAAACGTCAACTTCCCTACAGCCACCACCCCCGCCGGTTCCGTCGCACTGCAGGT
GGCGAAAATCAACGGACGACTTTTAGACTTAGAATTGACCGCACTTGAACGGGAAAATAATGTAGAAATCATTGCCAGCC
CGCGATTGCTCACAACCAATAAAAAAGCCGCCAGCATCAAGCAAGGAACGGAATTGCCGTATATTTTGGTGAATGAAAAA
AGCGGCTCACAAAGCGTGGAATTTCGTGAGGCCGTGTTAGGCTTGGAAGTAACACCGCATATTTCACAAGATAAACAAAT
TCTGTTGGATTTAGTGGTCAGTCAAAATGCGCCCGGTAGCCGAGTAGCACATGGTTTGGGCGAAGTCGTCTCCATTGATA
AACAGGAAATCAACACCCAAGTCTTTGCTAAAGACGGAGAAACCATCGTGTTGGGTGGCGTATTTCACGATACCATCACT
AAAAACGTGGACAAAGTGCCCTTGTTGGGAGACATTCCGGGCATAAAACACTTGTTTAGCAAAGAAAGCGAACGTCATCA
AAAACGGGAATTGGTGATTTTCGTCACACCGCATATTTTACGCCAAGGAGAAACCTTAGAATCCTTTAAACAAAAACAGG
GCATAACACAGCAAACTAAAATGCTCCCCACACAAAACACGCATAAAAATCCAACAAAATCGTAG
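
The nucleotide record starts with ATG and ends with TAG, so it appears to be listed in coding orientation even though the gene lies on the minus strand. Its length also matches the protein: 3 x (474 + 1 stop codon) = 1425 bp. The following is a minimal sketch, assuming the standard codon table, of how the first and last codons map onto the protein sequence; `translate` is a hypothetical helper, not a tool used by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, to_stop=True):
    """Translate a coding-strand DNA string, codon by codon."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not fill a whole codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 39 and last 18 nucleotides of the record above.
first = translate("ATGAAATTACCTCTGCCCTACTGCGTAAAGTGCGGTCTG")
last = translate("AATCCAACAAAATCGTAG")
print(first, last)
```

The two fragments can be compared with the start (MKLPLPYCVKCGL) and end (NPTKS) of the protein sequence listed in the Protein section.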


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comE   Haemophilus influenzae Rd KW20   70.179   94.093   0.66
comE   Haemophilus influenzae 86-028NP   68.973   94.515   0.652
comE   Glaesserella parasuis strain SC1401   52.045   92.827   0.483
pilQ   Vibrio cholerae O1 biovar El Tor strain E7946   41.667   91.139   0.38
pilQ   Vibrio cholerae strain A1552   41.667   91.139   0.38
pilQ   Vibrio campbellii strain DS40M4   40.724   93.249   0.38
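
The record does not define the Ha-value, but the listed values are numerically consistent with the product of the identity and coverage fractions. The following is a minimal sketch of that check; the interpretation is an observation from the numbers above, not the database's stated definition, and `identity_coverage_product` is a hypothetical helper name.

```python
def identity_coverage_product(identity_pct, coverage_pct):
    """Product of identity and coverage, both converted to fractions.

    Assumption: this matches the Ha-value column only empirically.
    """
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# (identity %, coverage %, reported Ha-value) copied from the table above.
ROWS = [
    (70.179, 94.093, 0.66),
    (68.973, 94.515, 0.652),
    (52.045, 92.827, 0.483),
    (41.667, 91.139, 0.38),
    (41.667, 91.139, 0.38),
    (40.724, 93.249, 0.38),
]

for identity, coverage, reported in ROWS:
    computed = identity_coverage_product(identity, coverage)
    # Print both values side by side for a visual comparison.
    print(f"{identity:7.3f} {coverage:7.3f} {computed:.4f} {reported}")
```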


Multiple sequence alignment