Detailed information    

In silico: bioinformatically predicted

Overview


Name: comE
Type: Machinery gene
Locus tag: K7G91_RS10605
Genome accession: NZ_CP085791
Coordinates: 2237015..2238325 (+)
Length: 436 a.a.
NCBI ID: WP_223667849.1
Uniprot ID: -
Organism: Pasteurella canis strain HL_D1250
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake

Genomic Context


Location: 2232015..2243325
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  K7G91_RS10580 (K7G91_002111) - 2232072..2234639 (-) 2568 WP_227797938.1 penicillin-binding protein 1A -
  K7G91_RS10585 (K7G91_002112) - 2234775..2235563 (+) 789 WP_227797939.1 competence protein ComA -
  K7G91_RS10590 (K7G91_002113) - 2235560..2236075 (+) 516 WP_227797940.1 hypothetical protein -
  K7G91_RS10595 (K7G91_002114) - 2236206..2236610 (+) 405 WP_227797941.1 hypothetical protein -
  K7G91_RS10600 (K7G91_002115) - 2236616..2237005 (+) 390 WP_265332965.1 pilus assembly protein PilP -
  K7G91_RS10605 (K7G91_002116) comE 2237015..2238325 (+) 1311 WP_223667849.1 type IV pilus secretin PilQ Machinery gene
  K7G91_RS10610 (K7G91_002117) aroK 2238542..2239069 (+) 528 WP_046339606.1 shikimate kinase AroK -
  K7G91_RS10615 (K7G91_002118) aroB 2239085..2240173 (+) 1089 WP_160531112.1 3-dehydroquinate synthase -
  K7G91_RS10620 (K7G91_002119) - 2240177..2241070 (+) 894 WP_227797942.1 Dam family site-specific DNA-(adenine-N6)-methyltransferase -
  K7G91_RS10625 (K7G91_002120) - 2241133..2241438 (-) 306 WP_115323520.1 DciA family protein -
  K7G91_RS10630 (K7G91_002121) secM 2241531..2241842 (+) 312 WP_227797943.1 secA translation cis-regulator SecM -
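
The Size (bp) column and the Location window can be reproduced from the coordinates by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a flank of 5000 bp on each side of the gene. The flank size is inferred from the numbers on this page, not stated by the database, and the function names are hypothetical.

```python
from typing import Tuple


def span_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def context_window(start: int, end: int, flank: int = 5000) -> Tuple[int, int]:
    """Window extending `flank` bp on each side of the feature.

    The default flank of 5000 bp is an assumption inferred from this record.
    """
    return max(1, start - flank), end + flank


# comE (K7G91_RS10605), coordinates copied from the table above
come_start, come_end = 2237015, 2238325

print(span_length(come_start, come_end))     # 1311
print(context_window(come_start, come_end))  # (2232015, 2243325)
```

The same length calculation applied to the first row (2232072..2234639) gives 2568 bp, matching its Size (bp) entry.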

Sequence


Protein


Length: 436 a.a.        Molecular weight: 48435.80 Da        Isoelectric Point: 6.3206

>NTDB_id=619981 K7G91_RS10605 WP_223667849.1 2237015..2238325(+) (comE) [Pasteurella canis strain HL_D1250]
MQKGVNYCIKCGLLWLSFVCCVLAQSSETFSIRLKQAPLVEILQQLALQQNKSLVIDNELDGTLSLQLEQTSFEKLLYAV
AKIKQLELHKEGQLYYLSKGFKADKIKSESSQLSPNLVTDSIKLQFAKAEDVMKSLTTGNGSLLSVDGRISVDIRSNLLL
IQDQAESVRNIKKLVSEMDKPVEQIVIEARIVTMTDESLKELGVRWGMFDPTSHKHTLSGSLESNGFLNIQDHLNVNFAT
NVTPAGSVALQLAKINGRLLDLELTALERENNVEIIASPRLLTTNKKSASIKQGTEIPYIVTNGKNDTQSIEFREAVLGL
EVTPQISKDNTILLDMVVSQNSLGPRVTYDKGESISIDKQEINTQVFAKDGETIVLGGVFHDTIMKGVDKVPLLGDLPVL
KHLFSKESERHQKRELVIFVTPHILKQNEIVAKSEQ
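
The FASTA header of this record packs several fields into one line. The sketch below shows one way to split it into named fields with a regular expression. The field layout is inferred only from the headers shown on this page, not from a documented format, and `parse_header` and the field names are hypothetical.

```python
import re

# Layout assumed from this record's headers:
# >NTDB_id=<id> <locus tag> <protein id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise ValueError if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    # Coordinates are converted to integers for later arithmetic.
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=619981 K7G91_RS10605 WP_223667849.1 "
    "2237015..2238325(+) (comE) [Pasteurella canis strain HL_D1250]"
)
record = parse_header(header)
print(record["locus_tag"], record["gene"], record["strand"])  # K7G91_RS10605 comE +
```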

Nucleotide


Length: 1311 bp

>NTDB_id=619981 K7G91_RS10605 WP_223667849.1 2237015..2238325(+) (comE) [Pasteurella canis strain HL_D1250]
ATGCAAAAAGGGGTTAACTATTGTATTAAGTGCGGTCTATTATGGCTTAGTTTTGTTTGTTGTGTGCTAGCGCAATCTAG
TGAGACGTTTTCTATTCGCTTAAAGCAAGCGCCACTAGTTGAAATTTTGCAACAATTAGCGTTACAACAAAACAAAAGTT
TAGTCATTGATAATGAATTAGATGGTACGTTATCACTGCAATTAGAGCAAACTAGTTTTGAGAAATTGTTATATGCCGTA
GCTAAGATTAAACAATTAGAGCTACACAAAGAAGGTCAATTGTATTATTTGAGTAAAGGTTTTAAAGCGGACAAAATAAA
GAGTGAAAGTAGTCAGCTAAGTCCTAATTTAGTAACAGACTCAATAAAATTACAATTTGCTAAAGCAGAAGATGTGATGA
AATCATTGACTACTGGTAATGGCTCTTTATTATCGGTAGATGGGCGTATTAGTGTAGATATACGCAGTAATTTATTATTA
ATTCAAGATCAAGCTGAATCAGTCCGAAATATTAAAAAATTAGTATCAGAAATGGATAAACCTGTTGAACAAATCGTGAT
TGAAGCGCGAATTGTCACTATGACCGATGAAAGCCTAAAAGAGCTGGGCGTTAGATGGGGAATGTTTGATCCTACTTCTC
ATAAACATACTTTATCAGGTAGTTTAGAAAGTAATGGCTTTTTGAATATTCAAGATCATCTCAATGTTAATTTTGCAACT
AATGTAACACCTGCAGGAAGCGTTGCATTACAACTAGCAAAAATTAATGGTCGCTTATTAGACTTGGAATTAACTGCACT
TGAACGAGAGAATAATGTTGAAATTATTGCTAGCCCACGTTTGTTAACGACGAATAAGAAAAGTGCCAGTATTAAGCAGG
GAACAGAAATACCTTATATTGTGACGAATGGTAAAAATGATACACAATCTATTGAATTTCGTGAAGCAGTACTAGGTTTA
GAGGTTACACCACAAATCTCAAAAGATAATACTATTTTATTAGATATGGTGGTTAGTCAAAACTCATTAGGACCGAGAGT
GACTTATGATAAAGGCGAAAGTATTTCAATAGATAAGCAAGAAATTAATACCCAAGTTTTTGCAAAGGATGGTGAAACTA
TTGTTTTAGGCGGAGTATTTCATGATACGATAATGAAGGGCGTAGACAAAGTGCCTCTGCTTGGTGATTTACCTGTATTA
AAACATCTATTTAGTAAAGAAAGTGAACGTCATCAAAAGCGGGAGCTCGTTATTTTTGTGACACCACATATTTTAAAGCA
GAATGAAATAGTCGCAAAATCAGAGCAGTAG
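
The nucleotide and protein records can be cross-checked against each other: 1311 bp is 437 codons, and dropping the terminal stop codon leaves 436 residues. The sketch below translates the first 18 and last 15 nucleotides of the sequence above. It assumes the standard codon-to-amino-acid assignments (which the bacterial translation table 11 shares); the `translate` helper is hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


first_18_nt = "ATGCAAAAAGGGGTTAAC"  # start of the nucleotide record
last_15_nt = "AAATCAGAGCAGTAG"      # end of the nucleotide record, in frame

print(translate(first_18_nt))  # MQKGVN
print(translate(last_15_nt))   # KSEQ
print(1311 // 3 - 1)           # 436
```

Both fragments agree with the protein record, which begins MQKGVN and ends KSEQ.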


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae Rd KW20 65.446 100 0.656
  comE Haemophilus influenzae 86-028NP 65.217 100 0.654
  comE Glaesserella parasuis strain SC1401 50 97.706 0.489
  pilQ Vibrio cholerae O1 biovar El Tor strain E7946 41.274 97.248 0.401
  pilQ Vibrio cholerae strain A1552 41.274 97.248 0.401
  pilQ Vibrio campbellii strain DS40M4 39.679 100 0.397