Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   DQN58_RS03540 Genome accession   NZ_LS483443
Coordinates   684271..685695 (-) Length   474 a.a.
NCBI ID   WP_005715894.1    Uniprot ID   E6KVL9
Organism   Aggregatibacter segnis ATCC 33393 strain NCTC 10977     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 679271..690695
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQN58_RS03510 (NCTC10977_00705) - 680180..680551 (-) 372 WP_005715885.1 hypothetical protein -
  DQN58_RS03520 (NCTC10977_00706) - 680859..681437 (+) 579 WP_005715886.1 NAD(P)H-dependent oxidoreductase -
  DQN58_RS03525 (NCTC10977_00707) - 681536..682402 (-) 867 WP_005715888.1 Dam family site-specific DNA-(adenine-N6)-methyltransferase -
  DQN58_RS03530 (NCTC10977_00708) aroB 682407..683495 (-) 1089 WP_005715889.1 3-dehydroquinate synthase -
  DQN58_RS03535 (NCTC10977_00709) aroK 683529..684056 (-) 528 WP_005715891.1 shikimate kinase AroK -
  DQN58_RS03540 (NCTC10977_00710) comE 684271..685695 (-) 1425 WP_005715894.1 type IV pilus secretin PilQ Machinery gene
  DQN58_RS03545 (NCTC10977_00711) - 685715..686101 (-) 387 WP_005715896.1 pilus assembly protein PilP -
  DQN58_RS03550 (NCTC10977_00712) - 686112..686639 (-) 528 WP_005715897.1 hypothetical protein -
  DQN58_RS03555 (NCTC10977_00713) - 686636..687154 (-) 519 WP_005715900.1 PilN domain-containing protein -
  DQN58_RS03560 (NCTC10977_00714) - 687151..687972 (-) 822 WP_005715903.1 hypothetical protein -
  DQN58_RS03565 (NCTC10977_00715) - 688083..690650 (+) 2568 WP_005715905.1 penicillin-binding protein 1A -

Sequence


Protein


Download         Length: 474 a.a.        Molecular weight: 52034.84 Da        Isoelectric Point: 7.2242

>NTDB_id=1141691 DQN58_RS03540 WP_005715894.1 684271..685695(-) (comE) [Aggregatibacter segnis ATCC 33393 strain NCTC 10977]
MQSFFLKILKCGLFFGCFFTFAVHAEQHQTFSIQLKQAPLVPTLQQLALAQNVNLLIDDELQGTLTLKLDNVDLDQLFRS
VAKIKQLDLWQEEGIYYLSKRDTVAKFTGNMTEPQAIAALPSEEETSLATATIKLHFAKASEVMKSLTGGSGSMLSPNGS
ITFDDRSNLLLIQDEPKSIANLKKLIKELDKPIEQIAIEARIVTINAESLKDLGVRWGIFNPTENAHKIAGSLAANGFDN
IANQLNVNLPTATTPAGSIALQVAKINGRLLDLELTALERENNVEIIASPRLLTTNKKSASIKQGTELPYIMVNEKSGTQ
SVEFREAVLGLEVTPHISLDKQILLDLIVSQNAPGSRVAYGLGEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKSLD
KVPLLGDIPGIKHLFSKESDRHQKRELVIFVTPHILRQGETLENLKNKQSTAKKNNSLGKSKAAQNTPKKTITQ

Nucleotide


Download         Length: 1425 bp        

>NTDB_id=1141691 DQN58_RS03540 WP_005715894.1 684271..685695(-) (comE) [Aggregatibacter segnis ATCC 33393 strain NCTC 10977]
ATGCAATCGTTTTTTCTCAAGATCTTAAAGTGCGGTCTGTTTTTCGGATGTTTTTTCACGTTCGCCGTGCATGCCGAACA
ACATCAAACCTTTTCCATTCAACTCAAACAGGCGCCTTTGGTGCCGACCTTACAACAACTTGCGCTGGCGCAAAATGTTA
ACTTGCTAATTGACGACGAATTGCAAGGCACACTGACATTAAAGTTAGATAACGTAGATTTAGATCAACTCTTTCGTTCC
GTTGCCAAAATCAAACAATTAGATTTATGGCAGGAAGAGGGGATTTATTATTTAAGTAAACGCGATACCGTCGCCAAATT
TACCGGCAATATGACAGAACCTCAAGCCATTGCGGCACTGCCTTCGGAGGAAGAAACCTCGCTCGCGACCGCCACCATTA
AACTGCACTTTGCCAAAGCCTCTGAAGTGATGAAATCACTCACCGGTGGCAGTGGGTCGATGTTATCACCGAATGGCAGT
ATTACCTTTGATGACCGAAGCAACCTCTTGCTGATTCAAGACGAACCGAAATCCATCGCCAATTTGAAAAAATTGATTAA
GGAATTAGATAAGCCCATTGAACAAATCGCCATCGAAGCACGTATTGTCACCATCAACGCGGAAAGTTTGAAAGATTTGG
GCGTGCGTTGGGGGATTTTTAACCCGACAGAAAATGCCCACAAAATTGCTGGCAGTCTCGCCGCTAACGGCTTTGACAAT
ATTGCGAATCAATTAAACGTAAACCTGCCCACCGCCACAACACCGGCGGGGTCTATCGCGTTACAGGTAGCCAAAATCAA
CGGACGACTTTTAGACTTAGAATTGACCGCACTTGAACGGGAAAACAATGTGGAAATTATCGCTAGCCCACGTTTGCTTA
CCACCAACAAGAAAAGCGCCAGCATTAAGCAAGGGACGGAATTGCCTTATATCATGGTGAATGAAAAAAGTGGTACGCAA
AGCGTAGAGTTTCGCGAAGCCGTATTAGGGTTGGAAGTCACACCGCACATTTCCTTGGATAAGCAAATTCTGTTGGATTT
AATTGTCAGTCAAAACGCACCGGGCAGCCGCGTGGCATACGGTTTAGGCGAGGTGGTTTCGATTGATAAACAAGAAATTA
ATACGCAAGTTTTCGCCAAGGATGGGGAAACCATCGTGCTTGGTGGCGTCTTTCACGACACCATTACCAAAAGCTTAGAC
AAAGTGCCATTACTGGGAGACATTCCCGGCATTAAGCATTTGTTTAGCAAAGAAAGTGATCGCCACCAAAAGCGGGAATT
GGTGATCTTCGTCACGCCGCATATTTTGCGCCAAGGTGAGACATTGGAAAACTTGAAGAACAAACAGTCAACAGCGAAAA
AGAACAATAGTCTAGGAAAATCTAAGGCGGCACAAAACACACCTAAAAAGACGATAACGCAGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB E6KVL9

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae Rd KW20

69.584

96.414

0.671

  comE Haemophilus influenzae 86-028NP

68.709

96.414

0.662

  comE Glaesserella parasuis strain SC1401

53.647

89.662

0.481

  pilQ Vibrio campbellii strain DS40M4

42.459

90.928

0.386

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

42.093

90.717

0.382

  pilQ Vibrio cholerae strain A1552

42.093

90.717

0.382


Multiple sequence alignment