Detailed information    

In silico: bioinformatically predicted

Overview


Name: comE
Type: Machinery gene
Locus tag: DQL22_RS00935
Genome accession: NZ_LS483485
Coordinates: 185503..186927 (+)
Length: 474 a.a.
NCBI ID: WP_032995306.1
UniProt ID: A0A3S4PYH6
Organism: Aggregatibacter aphrophilus strain NCTC11096
Function: type IV pilus biogenesis and function (predicted from homology)
          DNA binding and uptake
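
The coordinates and the protein length in this record can be cross-checked with a few lines of arithmetic. The sketch below assumes that coordinates are 1-based and inclusive and that the coding sequence ends with a stop codon; the helper names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length(185503, 186927)
print(cds_bp, protein_length(cds_bp))  # 1425 474
```

Both values agree with the lengths reported in this record (1425 bp, 474 a.a.).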

Genomic Context


Location: 180503..191927
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQL22_RS00910 (NCTC11096_00182) - 180545..183106 (-) 2562 WP_050332893.1 penicillin-binding protein 1A -
  DQL22_RS00915 (NCTC11096_00183) - 183233..184036 (+) 804 WP_172452394.1 competence protein ComA -
  DQL22_RS00920 (NCTC11096_00184) - 184050..184568 (+) 519 WP_032995314.1 PilN domain-containing protein -
  DQL22_RS00925 (NCTC11096_00185) - 184565..185098 (+) 534 WP_109843408.1 competence protein -
  DQL22_RS00930 (NCTC11096_00186) - 185098..185493 (+) 396 WP_032995307.1 hypothetical protein -
  DQL22_RS00935 (NCTC11096_00187) comE 185503..186927 (+) 1425 WP_032995306.1 type IV pilus secretin PilQ Machinery gene
  DQL22_RS00940 (NCTC11096_00188) aroK 187142..187669 (+) 528 WP_005704481.1 shikimate kinase AroK -
  DQL22_RS00945 (NCTC11096_00189) aroB 187691..188779 (+) 1089 WP_111300527.1 3-dehydroquinate synthase -
  DQL22_RS00950 (NCTC11096_00190) - 188785..189639 (+) 855 WP_111300525.1 Dam family site-specific DNA-(adenine-N6)-methyltransferase -
  DQL22_RS00955 (NCTC11096_00191) - 189945..190313 (+) 369 WP_111300523.1 rhodanese-like domain-containing protein -
  DQL22_RS00960 (NCTC11096_00192) ygiD 190592..191377 (-) 786 WP_172452416.1 4,5-DOPA dioxygenase extradiol -
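
The Location range appears to be the comE coordinates extended by 5000 bp on each side, and each Size (bp) value equals end - start + 1. The sketch below checks both for a few rows copied from the table and reports the spacing between neighbouring genes. The 5000 bp flank is inferred from the numbers, not stated by the database, and the names `FLANK` and `gap` are illustrative.

```python
FLANK = 5000  # assumption: inferred from the coordinates, not documented in the record

target = (185503, 186927)  # comE, DQL22_RS00935
window = (target[0] - FLANK, target[1] + FLANK)

# (locus tag, start, end, size in bp), copied from the table above
genes = [
    ("DQL22_RS00920", 184050, 184568, 519),
    ("DQL22_RS00925", 184565, 185098, 534),
    ("DQL22_RS00930", 185098, 185493, 396),
    ("DQL22_RS00935", 185503, 186927, 1425),
]


def gap(prev_end: int, next_start: int) -> int:
    """Intergenic distance in bp; a negative value means the genes overlap."""
    return next_start - prev_end - 1


sizes_ok = all(end - start + 1 == size for _, start, end, size in genes)
gaps = [gap(a[2], b[1]) for a, b in zip(genes, genes[1:])]

print(window)    # (180503, 191927)
print(sizes_ok)  # True
print(gaps)      # [-4, -1, 9]
```

The short overlaps and the 9 bp gap upstream of comE show that these same-strand genes are tightly packed.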

Sequence


Protein


Length: 474 a.a.        Molecular weight: 52278.15 Da        Isoelectric point: 7.2656

>NTDB_id=1142583 DQL22_RS00935 WP_032995306.1 185503..186927(+) (comE) [Aggregatibacter aphrophilus strain NCTC11096]
MKLPLPYRVKCGLFFSCFLSFAMSLSVHAEQNQVFSIRLKQAPLVPTLQQLALAQNVNLIIDDELQGTVSLQLENVDLDQ
LFRSVAKIKQLDLWQENGIYYLSKPDTSAKFATKMDEPFILPAVMTEEPTRLTTATIKLHFAKASEVMKSLTGGSGSLLS
PNGSLTFDDRSNLLLIQDEPKSIANLKKLIKELDKPIEQIAIEARIVTMNGESLKELGVRWGIFNPTENAHRIGGSLAGN
GFDNITNQLNVNFPTATTPAGSVALQVAKINGRLLDLELTALERENNVEIIASPRLLTTNKKAASIKQGTELPYILVNEK
SGSQSVEFREAVLGLEVTPHISQDKQILLDLVVSQNAPGSRVAHGLGEVVSIDKQEINTQVFAKDGETIVLGGVFHDTIT
KNVDKVPLLGDIPGVKHLFSKESERHQKRELVIFVTPHILRQGETLESFKQKQGITQQTKMLPTQNTHKNPTKS

Nucleotide


Length: 1425 bp

>NTDB_id=1142583 DQL22_RS00935 WP_032995306.1 185503..186927(+) (comE) [Aggregatibacter aphrophilus strain NCTC11096]
ATGAAATTACCTCTGCCCTACCGCGTAAAGTGCGGTCTGTTTTTCAGTTGTTTTTTGAGTTTTGCCATGAGCCTTTCCGT
TCATGCAGAACAAAATCAGGTATTTTCCATTCGGCTGAAACAAGCGCCCTTGGTACCCACTTTGCAACAATTGGCGTTGG
CGCAAAATGTCAATTTGATTATTGATGATGAATTGCAAGGCACGGTTTCGCTCCAACTGGAAAACGTCGATTTGGATCAG
CTGTTTCGCTCCGTTGCTAAAATTAAGCAGTTGGATTTGTGGCAGGAAAATGGGATTTATTATTTAAGCAAACCCGATAC
TTCCGCTAAATTTGCCACCAAAATGGACGAACCCTTTATCCTTCCTGCCGTGATGACGGAAGAACCGACGCGACTCACCA
CCGCCACCATTAAATTGCATTTTGCCAAAGCCTCGGAAGTGATGAAATCGCTCACCGGCGGTAGCGGTTCGTTACTTTCC
CCAAACGGTAGCCTCACTTTTGACGATCGCAGTAACTTGTTATTAATTCAAGACGAACCGAAATCCATCGCTAACCTGAA
AAAACTGATTAAAGAATTAGATAAGCCCATTGAACAAATCGCCATTGAAGCCAGAATTGTCACCATGAACGGCGAAAGTC
TAAAAGAATTAGGCGTACGCTGGGGCATTTTTAACCCCACCGAAAATGCGCATCGTATCGGCGGAAGTTTGGCGGGTAAC
GGCTTTGACAATATCACCAATCAATTAAACGTCAACTTCCCTACAGCCACCACCCCCGCCGGCTCCGTCGCACTACAGGT
GGCGAAAATAAACGGACGACTTTTGGACTTAGAATTGACCGCACTTGAACGGGAAAACAATGTAGAAATTATTGCCAGCC
CGCGATTGCTCACAACCAATAAAAAAGCCGCCAGCATCAAGCAAGGGACGGAATTGCCGTATATTTTGGTGAATGAAAAA
AGCGGCTCACAAAGCGTGGAATTTCGTGAGGCTGTATTAGGCTTGGAAGTAACACCGCATATTTCACAAGATAAACAAAT
TCTGTTGGATTTAGTGGTCAGTCAAAATGCGCCCGGTAGCCGAGTAGCACATGGTTTGGGCGAAGTCGTCTCCATTGATA
AACAGGAAATCAACACCCAAGTCTTTGCTAAAGACGGAGAAACCATCGTGTTAGGTGGCGTATTTCACGATACCATCACT
AAAAACGTGGACAAAGTGCCCTTGTTGGGAGACATTCCGGGCGTAAAACACTTGTTTAGCAAAGAAAGCGAACGTCATCA
AAAACGGGAATTGGTGATTTTTGTCACACCGCATATTTTACGTCAAGGAGAAACCTTAGAATCCTTTAAACAAAAGCAAG
GCATAACACAGCAAACTAAAATGCTCCCCACACAAAACACGCATAAAAATCCAACAAAATCGTAG
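
The nucleotide record can be checked against the protein record by translating it. The sketch below is a minimal pure-Python translator, not the database's own tooling. It builds the standard genetic code, whose amino-acid assignments are the same as in the bacterial table 11, and handles only unambiguous A/C/G/T codons. The function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, dropping one trailing stop codon if present."""
    cds = "".join(cds.split()).upper()  # tolerate FASTA line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First ten and last three codons of the comE sequence above
print(translate("ATGAAATTACCTCTGCCCTACCGCGTAAAG"))  # MKLPLPYRVK
print(translate("AAATCGTAG"))                       # KS
```

Applied to the full 1425 bp sequence, the result should have 474 residues, beginning with MKLPLPYRVK and ending with KS as in the protein record.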


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source: AlphaFold DB
ID: A0A3S4PYH6

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                                           Identities (%)  Coverage (%)  Ha-value
  comE   Haemophilus influenzae Rd KW20                     69.731          94.093        0.656
  comE   Haemophilus influenzae 86-028NP                    68.527          94.515        0.648
  comE   Glaesserella parasuis strain SC1401                51.818          92.827        0.481
  pilQ   Vibrio campbellii strain DS40M4                    41.23           92.616        0.382
  pilQ   Vibrio cholerae O1 biovar El Tor strain E7946      41.958          90.506        0.38
  pilQ   Vibrio cholerae strain A1552                       41.958          90.506        0.38
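
The record does not define the Ha-value. Numerically, every value listed here matches the product of identity and coverage expressed as fractions, to three decimal places. The sketch below checks that observation; it is an inference from these six rows, not a documented formula, and `identity_times_coverage` is an illustrative name.

```python
# (protein, organism, identity %, coverage %, Ha-value), copied from the table above
rows = [
    ("comE", "Haemophilus influenzae Rd KW20", 69.731, 94.093, 0.656),
    ("comE", "Haemophilus influenzae 86-028NP", 68.527, 94.515, 0.648),
    ("comE", "Glaesserella parasuis strain SC1401", 51.818, 92.827, 0.481),
    ("pilQ", "Vibrio campbellii strain DS40M4", 41.23, 92.616, 0.382),
    ("pilQ", "Vibrio cholerae O1 biovar El Tor strain E7946", 41.958, 90.506, 0.38),
    ("pilQ", "Vibrio cholerae strain A1552", 41.958, 90.506, 0.38),
]


def identity_times_coverage(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage as fractions (assumed, not documented)."""
    return (identity_pct / 100) * (coverage_pct / 100)


for protein, organism, identity, coverage, ha in rows:
    estimate = identity_times_coverage(identity, coverage)
    # Listed values carry three decimals, so allow half a unit in the last place.
    matches = abs(estimate - ha) < 5e-4
    print(f"{protein} {organism}: listed {ha}, product {estimate:.4f}, match={matches}")
```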


Multiple sequence alignment