Detailed information    

In silico: bioinformatically predicted

Overview


Name: comE
Type: Machinery gene
Locus tag: PMCN01_RS11125
Genome accession: NZ_CP006976
Coordinates: 2349500..2350834 (+)
Length: 444 a.a.
NCBI ID: WP_015702713.1
UniProt ID: A0AAW8VA23
Organism: Pasteurella multocida subsp. multocida HB01
Function: type IV pilus biogenesis and function (predicted from homology)
DNA binding and uptake
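The coordinates and the protein length listed above are consistent with each other, which a little arithmetic confirms. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the span is a stop codon; the function name `span_lengths` is illustrative and not part of the database.

```python
def span_lengths(start, end):
    """Return (nucleotide length, amino-acid length) for a coding span.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    aa_len = nt_len // 3 - 1  # assumption: the final codon is a stop codon
    return nt_len, aa_len


print(span_lengths(2349500, 2350834))  # prints (1335, 444)
```

The result matches the 1335 bp nucleotide record and the 444 a.a. protein given for this entry.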

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicted in the genome, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 2294113..2350834 2349500..2350834 within 0
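The "within" label and the distance of 0 follow from comparing the two intervals; note that the gene's end coordinate coincides with the end of the predicted prophage region. The sketch below is one way to express that comparison, assuming 1-based inclusive coordinates. Only "within" appears in this record, so the other labels ("upstream", "downstream", "overlapping") and the function name are hypothetical.

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both arguments are (start, end) tuples with 1-based inclusive coordinates.
    Returns (label, distance in bp). Labels other than "within" are assumptions.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    return "overlapping", 0


print(relative_position((2349500, 2350834), (2294113, 2350834)))
```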


Gene organization within MGE regions


Location: 2294113..2350834
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  PMCN01_RS10785 (PMCN01_2110) - 2294113..2294958 (+) 846 WP_005717653.1 23S rRNA (adenine(2030)-N(6))-methyltransferase RlmJ -
  PMCN01_RS10790 (PMCN01_2111) - 2295273..2296271 (+) 999 WP_064775740.1 tyrosine-type recombinase/integrase -
  PMCN01_RS10795 (PMCN01_2112) - 2296416..2296721 (-) 306 WP_005723677.1 heavy-metal-associated domain-containing protein -
  PMCN01_RS10800 (PMCN01_2113) - 2296731..2297084 (-) 354 WP_005723675.1 mercuric transporter MerT family protein -
  PMCN01_RS10805 (PMCN01_2114) - 2297056..2298087 (-) 1032 Protein_2118 transglutaminase-like domain-containing protein -
  PMCN01_RS10810 (PMCN01_2115) - 2298293..2298472 (-) 180 WP_016570130.1 hypothetical protein -
  PMCN01_RS11855 (PMCN01_2116) - 2298533..2300989 (-) 2457 WP_064775697.1 tail fiber protein -
  PMCN01_RS10820 (PMCN01_2117) - 2301004..2301570 (-) 567 WP_064775698.1 phage tail protein -
  PMCN01_RS10825 (PMCN01_2118) - 2301563..2302675 (-) 1113 WP_064775699.1 baseplate assembly protein -
  PMCN01_RS10830 (PMCN01_2119) - 2302665..2303030 (-) 366 WP_064775741.1 GPW/gp25 family protein -
  PMCN01_RS10835 (PMCN01_2120) - 2303083..2303664 (-) 582 WP_064775700.1 phage baseplate assembly protein V -
  PMCN01_RS10840 (PMCN01_2121) - 2303679..2304731 (-) 1053 WP_064775701.1 phage late control D family protein -
  PMCN01_RS10845 (PMCN01_2122) - 2304731..2304952 (-) 222 WP_064775702.1 tail protein X -
  PMCN01_RS10850 (PMCN01_2123) - 2304940..2305890 (-) 951 WP_064775703.1 phage tail protein -
  PMCN01_RS10855 (PMCN01_2124) - 2305899..2308175 (-) 2277 WP_235605392.1 phage tail tape measure protein -
  PMCN01_RS10860 (PMCN01_2125) - 2308242..2308433 (+) 192 WP_064775705.1 hypothetical protein -
  PMCN01_RS11625 (PMCN01_2126) - 2308430..2308564 (-) 135 WP_227718009.1 GpE family phage tail protein -
  PMCN01_RS10865 (PMCN01_2127) - 2308579..2308857 (-) 279 WP_064775706.1 phage tail assembly protein -
  PMCN01_RS10870 (PMCN01_2128) - 2308940..2309455 (-) 516 WP_064775707.1 phage major tail tube protein -
  PMCN01_RS10875 (PMCN01_2129) - 2309455..2310846 (-) 1392 WP_064775708.1 phage tail sheath family protein -
  PMCN01_RS10880 (PMCN01_2130) - 2310856..2311323 (-) 468 WP_064775709.1 Gp37 family protein -
  PMCN01_RS10885 (PMCN01_2131) - 2311323..2311772 (-) 450 WP_064775710.1 gp436 family protein -
  PMCN01_RS10890 (PMCN01_2132) - 2311772..2312230 (-) 459 WP_064775711.1 hypothetical protein -
  PMCN01_RS10895 (PMCN01_2133) - 2312283..2313209 (-) 927 WP_064775712.1 hypothetical protein -
  PMCN01_RS10900 (PMCN01_2134) - 2313219..2314322 (-) 1104 WP_064775713.1 hypothetical protein -
  PMCN01_RS10905 (PMCN01_2135) - 2314559..2315014 (-) 456 WP_064775714.1 phage virion morphogenesis protein -
  PMCN01_RS10910 (PMCN01_2137) - 2315216..2315404 (-) 189 WP_064775715.1 hypothetical protein -
  PMCN01_RS10915 (PMCN01_2138) - 2315541..2316815 (-) 1275 WP_170381806.1 phage minor head protein -
  PMCN01_RS10920 (PMCN01_2139) - 2316808..2318253 (-) 1446 WP_064775716.1 DUF935 domain-containing protein -
  PMCN01_RS10925 (PMCN01_2140) - 2318246..2319802 (-) 1557 WP_196768127.1 terminase large subunit domain-containing protein -
  PMCN01_RS10930 (PMCN01_2141) - 2319805..2320386 (-) 582 WP_064775717.1 DUF3486 family protein -
  PMCN01_RS10935 (PMCN01_2142) - 2320409..2320711 (-) 303 WP_064775718.1 winged-helix domain-containing protein -
  PMCN01_RS10940 (PMCN01_2143) - 2320708..2321037 (-) 330 WP_064775719.1 DUF2730 family protein -
  PMCN01_RS10950 (PMCN01_2145) - 2321155..2321415 (-) 261 WP_064775721.1 DUF2681 domain-containing protein -
  PMCN01_RS10955 (PMCN01_2146) - 2321412..2321639 (-) 228 WP_064775722.1 DUF2644 domain-containing protein -
  PMCN01_RS10960 (PMCN01_2147) - 2321650..2322186 (-) 537 WP_064775723.1 N-acetylmuramoyl-L-alanine amidase -
  PMCN01_RS10965 (PMCN01_2148) - 2322264..2322626 (-) 363 WP_064775724.1 hypothetical protein -
  PMCN01_RS10970 (PMCN01_2149) - 2322701..2323075 (-) 375 WP_064775725.1 Mor transcription activator family protein -
  PMCN01_RS10975 (PMCN01_2150) - 2323075..2323620 (-) 546 WP_064775726.1 hypothetical protein -
  PMCN01_RS10980 (PMCN01_2151) - 2323617..2324123 (-) 507 WP_064775727.1 gp16 family protein -
  PMCN01_RS10985 (PMCN01_2152) - 2324197..2324898 (-) 702 WP_064775728.1 DUF2786 domain-containing protein -
  PMCN01_RS10990 (PMCN01_2153) - 2324907..2325122 (-) 216 WP_061406041.1 hypothetical protein -
  PMCN01_RS10995 (PMCN01_2154) - 2325140..2325331 (-) 192 WP_061406042.1 ANR family transcriptional regulator -
  PMCN01_RS11000 (PMCN01_2155) - 2325409..2325927 (-) 519 WP_061406043.1 host-nuclease inhibitor Gam family protein -
  PMCN01_RS11005 (PMCN01_2156) - 2325920..2326159 (-) 240 WP_046339044.1 hypothetical protein -
  PMCN01_RS11010 (PMCN01_2157) - 2326369..2326614 (-) 246 WP_064775729.1 hypothetical protein -
  PMCN01_RS11015 (PMCN01_2158) - 2326635..2326838 (-) 204 WP_064775730.1 hypothetical protein -
  PMCN01_RS11020 (PMCN01_2159) - 2326838..2327755 (-) 918 WP_016533375.1 AAA family ATPase -
  PMCN01_RS11025 (PMCN01_2160) - 2327782..2329785 (-) 2004 WP_064775731.1 transposase domain-containing protein -
  PMCN01_RS11030 - 2329769..2330002 (-) 234 WP_235605393.1 helix-turn-helix domain-containing protein -
  PMCN01_RS11035 (PMCN01_2161) - 2330245..2330718 (+) 474 WP_081273992.1 DNA-binding protein -
  PMCN01_RS11040 (PMCN01_2162) tkt 2331071..2332798 (+) 1728 Protein_2165 transketolase -
  PMCN01_RS11045 (PMCN01_2163) rpoD 2332894..2334762 (-) 1869 WP_064775694.1 RNA polymerase sigma factor RpoD -
  PMCN01_RS11050 (PMCN01_2164) dnaG 2334838..2336586 (-) 1749 WP_005751910.1 DNA primase -
  PMCN01_RS11055 (PMCN01_2165) rpsU 2336702..2336917 (-) 216 WP_005717672.1 30S ribosomal protein S21 -
  PMCN01_RS11060 (PMCN01_2166) tsaD 2337136..2338167 (+) 1032 WP_005723723.1 tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex transferase subunit TsaD -
  PMCN01_RS11065 (PMCN01_2167) - 2338188..2338424 (+) 237 WP_005717662.1 hypothetical protein -
  PMCN01_RS11070 (PMCN01_2168) - 2338426..2339004 (+) 579 WP_005717659.1 thymidine kinase -
  PMCN01_RS11075 (PMCN01_2169) gorA 2339092..2340447 (-) 1356 WP_015702704.1 glutathione-disulfide reductase -
  PMCN01_RS11080 (PMCN01_2170) - 2340544..2341389 (-) 846 WP_015702705.1 23S rRNA (adenine(2030)-N(6))-methyltransferase RlmJ -
  PMCN01_RS11085 (PMCN01_2171) pdxT 2341550..2342131 (-) 582 WP_015702706.1 pyridoxal 5'-phosphate synthase glutaminase subunit PdxT -
  PMCN01_RS11090 (PMCN01_2172) pdxS 2342134..2343021 (-) 888 WP_015702707.1 pyridoxal 5'-phosphate synthase lyase subunit PdxS -
  PMCN01_RS11095 (PMCN01_2173) - 2343132..2344544 (+) 1413 WP_016504289.1 PLP-dependent aminotransferase family protein -
  PMCN01_RS11100 (PMCN01_2174) - 2344576..2347137 (-) 2562 WP_015702709.1 penicillin-binding protein 1A -
  PMCN01_RS11105 (PMCN01_2175) - 2347273..2348046 (+) 774 WP_015702710.1 pilus assembly protein PilM -
  PMCN01_RS11110 (PMCN01_2176) - 2348086..2348601 (+) 516 WP_015702711.1 hypothetical protein -
  PMCN01_RS11115 (PMCN01_2177) - 2348601..2349125 (+) 525 WP_005751903.1 hypothetical protein -
  PMCN01_RS11120 (PMCN01_2178) - 2349128..2349490 (+) 363 WP_015702712.1 pilus assembly protein PilP -
  PMCN01_RS11125 (PMCN01_2179) comE 2349500..2350834 (+) 1335 WP_015702713.1 type IV pilus secretin PilQ Machinery gene

Sequence


Protein


Length: 444 a.a.        Molecular weight: 49246.60 Da        Isoelectric Point: 8.0315

>NTDB_id=115404 PMCN01_RS11125 WP_015702713.1 2349500..2350834(+) (comE) [Pasteurella multocida subsp. multocida HB01]
MWRAFRKISFVYFLCGVAYVGSSQAQDAEHFYLRLKQAPLVEMLQYLALQQHQDLLIDDHLEGTLSLQMKKTTFEKCLQS
IARMKQLELHQEGKSYYLTSPSGVAANDTHHPTSLMTSSIKLHFAKAAEVVKSLTSGQGSLLSVGGSLSFDERTNLLLIQ
DEPQSIQRIKALVAEMDKPIEQIAIEARIVTMTDESLQELGVRWGLFQATEQAHTIAGSLAANGFSNIENQLNVNFSTNS
TPVGSIALQLAKINGRLLDLELTALEREKHIEIIASPRLLTTNKKSASIKQGTEIPYVMKRGKDKSESVEFREAVLGLDV
TPHISKDNSILLDLLITQNTLGAPVVYDKGEIVSIDKQEINTQVVAQDGETIVLGGVFHDTMTKGVNKVPLLGDLPLLKH
VFSQKTERHQKRELVIFVTPHIIKSSQGSPEQKTTRVKKSAKSR
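The FASTA header of this record packs several fields into one line. A minimal sketch of splitting it into named fields follows; the field layout is inferred from this single header and is an assumption, not a documented format, and the field names are illustrative.

```python
import re

# Layout inferred from one record:
# >NTDB_id=<id> <locus tag> <protein id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+(?P<locus_tag>\S+)\s+(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line):
    """Split a header line into a dict; coordinates are converted to int."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=115404 PMCN01_RS11125 WP_015702713.1 2349500..2350834(+) "
    "(comE) [Pasteurella multocida subsp. multocida HB01]"
)
print(parse_header(header))
```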

Nucleotide


Length: 1335 bp

>NTDB_id=115404 PMCN01_RS11125 WP_015702713.1 2349500..2350834(+) (comE) [Pasteurella multocida subsp. multocida HB01]
ATGTGGCGAGCATTCAGAAAAATATCTTTTGTGTACTTTTTATGTGGGGTTGCTTATGTTGGAAGTAGTCAAGCACAAGA
CGCAGAACATTTTTATTTACGTTTAAAACAAGCGCCTTTAGTCGAAATGTTACAGTATTTAGCATTGCAACAACATCAGG
ATTTGTTAATCGATGATCATTTAGAGGGCACATTATCATTACAGATGAAAAAGACAACCTTTGAGAAATGTTTACAGTCG
ATTGCAAGAATGAAACAACTTGAGTTACATCAAGAAGGAAAATCCTATTATTTAACTTCCCCTTCAGGTGTTGCAGCAAA
CGATACTCATCATCCTACGTCATTGATGACATCTTCAATAAAATTGCATTTTGCCAAAGCTGCAGAGGTGGTGAAATCTT
TAACTTCAGGGCAGGGAAGTTTACTTTCTGTCGGGGGGAGTTTGAGTTTTGATGAGCGGACTAATTTACTGCTGATTCAG
GATGAACCGCAATCAATACAGCGTATTAAAGCATTAGTAGCAGAAATGGATAAACCCATTGAACAAATTGCGATCGAAGC
TAGGATTGTGACGATGACAGACGAAAGTTTGCAGGAACTTGGTGTAAGATGGGGGCTATTTCAAGCAACAGAACAGGCAC
ATACTATTGCGGGGAGTTTAGCCGCGAACGGCTTTTCGAATATAGAAAACCAATTAAATGTGAATTTCTCGACCAATAGT
ACACCTGTTGGTTCCATCGCCTTACAGTTGGCGAAAATAAATGGTCGATTATTAGACTTGGAATTAACTGCCTTGGAGCG
AGAAAAGCATATTGAGATTATTGCGAGTCCTCGTTTATTAACAACGAATAAAAAAAGTGCCAGTATCAAACAAGGGACGG
AAATTCCTTATGTGATGAAACGGGGAAAAGATAAAAGCGAATCGGTGGAATTTCGAGAAGCTGTATTGGGTTTAGATGTG
ACACCGCATATCTCAAAAGACAACTCGATTTTATTAGATTTATTGATTACACAAAATACATTAGGTGCACCGGTAGTGTA
TGATAAAGGCGAAATTGTTTCGATCGATAAACAGGAAATCAATACTCAAGTCGTCGCTCAAGATGGTGAAACCATCGTTT
TAGGTGGGGTGTTTCATGATACGATGACAAAGGGAGTCAATAAAGTACCACTACTAGGGGATTTGCCTTTGCTTAAACAT
GTGTTTAGCCAGAAAACTGAACGTCATCAAAAGCGGGAATTAGTGATTTTTGTCACTCCTCATATTATCAAATCTAGCCA
AGGTTCGCCTGAACAAAAAACAACAAGAGTTAAAAAATCTGCAAAATCAAGGTGA
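The nucleotide and protein records can be cross-checked by translation. The sketch below builds the standard genetic code table and translates the first line of the nucleotide record; the page does not state which translation table the database used, so the standard code is an assumption here. The function name `translate` is illustrative.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1; trailing bases that do not fill a codon are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the nucleotide record above (80 nt: 26 full codons plus 2 bases).
first_line = (
    "ATGTGGCGAGCATTCAGAAAAATATCTTTTGTGTACTTTTTATGTGGGGTT"
    "GCTTATGTTGGAAGTAGTCAAGCACAAGA"
)
print(translate(first_line))
```

The translated prefix can be compared with the start of the protein record; a stop codon is rendered as `*`.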


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism                                          Identities (%)   Coverage (%)   Ha-value
comE      Haemophilus influenzae Rd KW20                    62.269           97.297         0.606
comE      Haemophilus influenzae 86-028NP                   61.574           97.297         0.599
comE      Glaesserella parasuis strain SC1401               49.192           97.523         0.48
pilQ      Vibrio campbellii strain DS40M4                   39.524           94.595         0.374
pilQ      Vibrio cholerae O1 biovar El Tor strain E7946     39.563           92.793         0.367
pilQ      Vibrio cholerae strain A1552                      39.563           92.793         0.367
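The page does not define the Ha-value. As an observation rather than a documented definition, each listed value agrees with the product of identity and coverage, both taken as fractions and rounded to three decimals. The sketch below checks this against the rows above; the function name is hypothetical.

```python
def identity_coverage_product(identity_pct, coverage_pct):
    """Product of identity and coverage as fractions, rounded to 3 decimals."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# (protein, organism, identity %, coverage %, listed Ha-value), copied from the table above
rows = [
    ("comE", "Haemophilus influenzae Rd KW20", 62.269, 97.297, 0.606),
    ("comE", "Haemophilus influenzae 86-028NP", 61.574, 97.297, 0.599),
    ("comE", "Glaesserella parasuis strain SC1401", 49.192, 97.523, 0.48),
    ("pilQ", "Vibrio campbellii strain DS40M4", 39.524, 94.595, 0.374),
    ("pilQ", "Vibrio cholerae O1 biovar El Tor strain E7946", 39.563, 92.793, 0.367),
    ("pilQ", "Vibrio cholerae strain A1552", 39.563, 92.793, 0.367),
]

for protein, organism, identity, coverage, listed in rows:
    computed = identity_coverage_product(identity, coverage)
    print(protein, organism, computed, listed)
```

If the database computes the Ha-value differently, the agreement here is coincidental for these six rows and should not be relied on.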


Multiple sequence alignment