Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   NJ8700_RS10365 Genome accession   NZ_CP009230
Coordinates   2181150..2182574 (-) Length   474 a.a.
NCBI ID   WP_012771965.1    Uniprot ID   -
Organism   Aggregatibacter aphrophilus NJ8700     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2176150..2187574
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NJ8700_RS10340 (NJ8700_10560) ygiD 2176684..2177469 (+) 786 WP_005701822.1 4,5-DOPA dioxygenase extradiol -
  NJ8700_RS10345 (NJ8700_10565) - 2177748..2178116 (-) 369 WP_005701820.1 rhodanese-like domain-containing protein -
  NJ8700_RS10350 (NJ8700_10570) - 2178440..2179294 (-) 855 WP_005701819.1 Dam family site-specific DNA-(adenine-N6)-methyltransferase -
  NJ8700_RS10355 (NJ8700_10575) aroB 2179300..2180388 (-) 1089 WP_005701818.1 3-dehydroquinate synthase -
  NJ8700_RS10360 (NJ8700_10580) aroK 2180409..2180936 (-) 528 WP_005701817.1 shikimate kinase AroK -
  NJ8700_RS10365 (NJ8700_10585) comE 2181150..2182574 (-) 1425 WP_012771965.1 type IV pilus secretin PilQ Machinery gene
  NJ8700_RS10370 (NJ8700_10590) - 2182584..2182979 (-) 396 WP_005701815.1 hypothetical protein -
  NJ8700_RS10375 (NJ8700_10595) - 2182979..2183512 (-) 534 WP_005701814.1 competence protein -
  NJ8700_RS10380 (NJ8700_10600) - 2183509..2184027 (-) 519 WP_005701813.1 hypothetical protein -
  NJ8700_RS10385 (NJ8700_10605) - 2184041..2184844 (-) 804 WP_005701812.1 hypothetical protein -
  NJ8700_RS10390 (NJ8700_10610) - 2184971..2187532 (+) 2562 WP_005701810.1 penicillin-binding protein 1A -

Sequence


Protein


Download         Length: 474 a.a.        Molecular weight: 52241.15 Da        Isoelectric Point: 7.0239

>NTDB_id=127418 NJ8700_RS10365 WP_012771965.1 2181150..2182574(-) (comE) [Aggregatibacter aphrophilus NJ8700]
MKLPLPYCVKCGLFFSCFLSFAISLSVHAEQNQVFSIRLKQAPLVPTLQQLALAQNVNLIIDDELQGTVSLQLENVDLDQ
LFRSVAKIKQLDLWQENGIYYLSKLDTSAKFATKMDEPFILPAVMTEEPTRLTTATIKLHFAKASEVMKSLTGGSGSLLS
PNGSLTFDDRSNLLLIQDEPKSIANLKKLIKELDKPIEQIAIEARIVTMNGESLKELGVRWGMFNPTENAHRIGGSLAGN
GFDNITNQLNVNFPTATTPAGSVALQVAKINGRLLDLELTALERENNVEIIASPRLLTTNKKAASIKQGTELPYILVNEK
SGSQSVEFREAVLGLEVTPHISQDKQILLDLVVSQNAPGSRVAHGLGEVVSIDKQEINTQVFAKDGETIVLGGVFHDTIT
KNVDKVPLLGDIPGVKHLFSKESERHQKRELVIFVTPHILRQGETLESFKQKQGITQQTKMLPTQNTHKNPTKS

Nucleotide


Download         Length: 1425 bp        

>NTDB_id=127418 NJ8700_RS10365 WP_012771965.1 2181150..2182574(-) (comE) [Aggregatibacter aphrophilus NJ8700]
ATGAAATTACCTCTGCCCTACTGCGTAAAGTGCGGTCTGTTTTTCAGTTGTTTTTTGAGTTTTGCCATAAGCCTTTCCGT
TCATGCGGAACAAAATCAAGTGTTTTCCATTCGGCTGAAACAAGCGCCCTTGGTGCCCACTTTGCAACAATTGGCGTTAG
CGCAAAACGTCAATTTGATTATTGATGACGAATTGCAAGGCACGGTTTCGCTCCAACTGGAAAACGTGGATTTAGATCAG
TTGTTTCGCTCCGTTGCTAAAATTAAGCAGTTGGATTTGTGGCAGGAAAATGGGATTTATTATTTAAGCAAACTCGATAC
TTCCGCTAAATTTGCCACCAAAATGGACGAACCCTTTATCCTTCCTGCCGTGATGACGGAAGAACCGACGCGACTCACCA
CCGCCACCATTAAATTGCATTTTGCCAAAGCCTCGGAAGTGATGAAATCGCTCACCGGCGGTAGCGGTTCGTTACTTTCC
CCAAACGGTAGCCTCACTTTTGACGATCGCAGTAACTTGTTGTTAATTCAAGACGAACCGAAATCCATCGCTAACCTAAA
AAAGCTGATTAAAGAATTAGATAAGCCCATTGAACAAATTGCCATTGAAGCCAGAATTGTCACCATGAACGGCGAAAGTC
TAAAAGAATTGGGCGTACGCTGGGGCATGTTTAACCCCACCGAAAATGCGCATCGTATCGGCGGAAGTTTGGCGGGTAAC
GGCTTTGACAATATCACCAATCAATTAAACGTCAACTTCCCTACAGCCACCACACCCGCCGGCTCCGTCGCACTGCAGGT
GGCGAAAATAAACGGACGACTTTTGGACTTAGAATTGACCGCACTTGAACGGGAAAACAATGTAGAAATTATTGCCAGCC
CGCGATTGCTCACAACCAATAAAAAAGCCGCCAGCATCAAGCAAGGGACGGAATTGCCGTATATTTTGGTGAATGAAAAA
AGCGGCTCACAAAGCGTGGAATTTCGTGAGGCTGTATTAGGCTTGGAAGTAACACCGCATATTTCACAAGATAAACAAAT
TCTGTTGGATTTAGTGGTCAGTCAAAATGCGCCCGGTAGCCGAGTAGCACATGGTTTGGGCGAAGTCGTCTCCATTGATA
AACAGGAAATCAACACCCAAGTCTTTGCTAAAGACGGAGAAACCATCGTGTTAGGTGGCGTATTTCACGATACCATCACT
AAAAACGTGGACAAAGTGCCCTTGTTGGGAGACATTCCGGGCGTAAAACACTTGTTTAGCAAAGAAAGCGAACGTCATCA
AAAACGGGAATTGGTGATTTTCGTCACACCGCATATTTTACGCCAAGGAGAAACCTTAGAATCCTTTAAACAAAAACAGG
GCATAACACAGCAAACTAAAATGCTCCCCACACAAAACACGCATAAAAATCCAACAAAATCGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae Rd KW20

69.731

94.093

0.656

  comE Haemophilus influenzae 86-028NP

68.527

94.515

0.648

  comE Glaesserella parasuis strain SC1401

51.818

92.827

0.481

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

41.667

91.139

0.38

  pilQ Vibrio cholerae strain A1552

41.667

91.139

0.38

  pilQ Vibrio campbellii strain DS40M4

40.724

93.249

0.38


Multiple sequence alignment