Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   M8849_RS10285 Genome accession   NZ_CP097609
Coordinates   2219523..2220857 (+) Length   444 a.a.
NCBI ID   WP_169029542.1    Uniprot ID   -
Organism   Pasteurella multocida strain 11245     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 2166948..2220857 2219523..2220857 within 0


Gene organization within MGE regions


Location: 2166948..2220857
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  M8849_RS10005 (M8849_10005) - 2166948..2167628 (-) 681 WP_005755258.1 YtjB family periplasmic protein -
  M8849_RS10010 (M8849_10010) serB 2167708..2168670 (+) 963 WP_169029040.1 phosphoserine phosphatase -
  M8849_RS10015 (M8849_10015) - 2168677..2169168 (+) 492 WP_005718544.1 YajQ family cyclic di-GMP-binding protein -
  M8849_RS10020 (M8849_10020) - 2169252..2169515 (-) 264 WP_005718542.1 GlsB/YeaQ/YmgE family stress response membrane protein -
  M8849_RS10025 (M8849_10025) gpU 2170439..2170846 (+) 408 WP_005752731.1 phage tail terminator protein -
  M8849_RS10030 (M8849_10030) - 2170856..2172055 (+) 1200 WP_064969152.1 phage major capsid protein -
  M8849_RS10035 (M8849_10035) - 2172107..2172667 (+) 561 WP_064969151.1 HK97 family phage prohead protease -
  M8849_RS10040 (M8849_10040) - 2172669..2173880 (+) 1212 WP_064965240.1 phage portal protein -
  M8849_RS10045 (M8849_10045) - 2173864..2174220 (+) 357 WP_064969150.1 phage head closure protein -
  M8849_RS10050 (M8849_10050) - 2174207..2174533 (+) 327 WP_064969149.1 head-tail connector protein -
  M8849_RS10055 (M8849_10055) - 2174542..2174898 (+) 357 WP_064969148.1 HNH endonuclease signature motif containing protein -
  M8849_RS10060 (M8849_10060) - 2174956..2177685 (+) 2730 WP_064969147.1 tape measure protein -
  M8849_RS10065 (M8849_10065) - 2177860..2178228 (+) 369 WP_046333234.1 phage terminase small subunit P27 family -
  M8849_RS10070 (M8849_10070) - 2178228..2179895 (+) 1668 WP_064969146.1 terminase large subunit -
  M8849_RS10075 (M8849_10075) - 2179904..2180368 (+) 465 WP_064969145.1 HK97-gp10 family putative phage morphogenesis protein -
  M8849_RS10080 (M8849_10080) - 2180539..2181189 (+) 651 WP_064969144.1 hypothetical protein -
  M8849_RS10085 (M8849_10085) - 2181325..2181528 (+) 204 WP_169029039.1 AlpA family phage regulatory protein -
  M8849_RS10090 (M8849_10090) - 2181620..2182231 (+) 612 WP_231104079.1 host cell division inhibitor Icd-like protein -
  M8849_RS10095 (M8849_10095) - 2182224..2182439 (+) 216 WP_016534278.1 hypothetical protein -
  M8849_RS10100 (M8849_10100) - 2182432..2182752 (+) 321 WP_064969141.1 hypothetical protein -
  M8849_RS10105 (M8849_10105) - 2182757..2183182 (+) 426 WP_064969140.1 hypothetical protein -
  M8849_RS10110 (M8849_10110) - 2183166..2184947 (+) 1782 WP_064969139.1 phage/plasmid primase, P4 family -
  M8849_RS10115 (M8849_10115) - 2185122..2186321 (-) 1200 WP_064969138.1 integrase arm-type DNA-binding domain-containing protein -
  M8849_RS10125 (M8849_10125) - 2186895..2188175 (-) 1281 WP_005755251.1 TRAP transporter large permease -
  M8849_RS10130 (M8849_10130) - 2188172..2188732 (-) 561 WP_005724517.1 TRAP transporter small permease -
  M8849_RS10135 (M8849_10135) - 2188756..2189745 (-) 990 WP_005718535.1 TRAP transporter substrate-binding protein -
  M8849_RS10140 (M8849_10140) - 2190098..2191111 (+) 1014 WP_005718533.1 4-hydroxythreonine-4-phosphate dehydrogenase PdxA -
  M8849_RS10145 (M8849_10145) - 2191132..2192136 (+) 1005 WP_005718532.1 dihydroxyacetone kinase subunit DhaK -
  M8849_RS10150 (M8849_10150) dhaL 2192138..2192764 (+) 627 WP_005757839.1 dihydroxyacetone kinase subunit DhaL -
  M8849_RS10155 (M8849_10155) - 2192943..2193803 (+) 861 WP_059246483.1 TIM barrel protein -
  M8849_RS10160 (M8849_10160) - 2193796..2194749 (+) 954 WP_005718523.1 sugar-binding transcriptional regulator -
  M8849_RS10165 (M8849_10165) rpiB 2194771..2195226 (+) 456 WP_005718521.1 ribose 5-phosphate isomerase B -
  M8849_RS10170 (M8849_10170) - 2195275..2196252 (+) 978 WP_005718520.1 TRAP transporter substrate-binding protein -
  M8849_RS10175 (M8849_10175) - 2196306..2196782 (+) 477 WP_005718519.1 TRAP transporter small permease -
  M8849_RS10180 (M8849_10180) - 2196782..2198074 (+) 1293 WP_005752182.1 TRAP transporter large permease -
  M8849_RS10185 (M8849_10185) - 2198085..2198756 (+) 672 WP_005718518.1 cyclase family protein -
  M8849_RS10190 (M8849_10190) tpiA 2198769..2199818 (+) 1050 WP_005724499.1 triose-phosphate isomerase -
  M8849_RS10195 (M8849_10195) tal 2199836..2200786 (+) 951 WP_005752180.1 transaldolase -
  M8849_RS10200 (M8849_10200) tkt 2200814..2202820 (+) 2007 WP_250023508.1 transketolase -
  M8849_RS10205 (M8849_10205) rpoD 2202916..2204784 (-) 1869 WP_005751911.1 RNA polymerase sigma factor RpoD -
  M8849_RS10210 (M8849_10210) dnaG 2204860..2206608 (-) 1749 WP_005751910.1 DNA primase -
  M8849_RS10215 (M8849_10215) rpsU 2206724..2206939 (-) 216 WP_005717672.1 30S ribosomal protein S21 -
  M8849_RS10220 (M8849_10220) tsaD 2207158..2208189 (+) 1032 WP_005723723.1 tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex transferase subunit TsaD -
  M8849_RS10225 (M8849_10225) - 2208210..2208446 (+) 237 WP_005717662.1 hypothetical protein -
  M8849_RS10230 (M8849_10230) - 2208448..2209026 (+) 579 WP_005717659.1 thymidine kinase -
  M8849_RS10235 (M8849_10235) gorA 2209114..2210469 (-) 1356 WP_169029539.1 glutathione-disulfide reductase -
  M8849_RS10240 (M8849_10240) - 2210566..2211411 (-) 846 WP_169029540.1 23S rRNA (adenine(2030)-N(6))-methyltransferase RlmJ -
  M8849_RS10245 (M8849_10245) pdxT 2211572..2212153 (-) 582 WP_102955867.1 pyridoxal 5'-phosphate synthase glutaminase subunit PdxT -
  M8849_RS10250 (M8849_10250) pdxS 2212156..2213043 (-) 888 WP_005751907.1 pyridoxal 5'-phosphate synthase lyase subunit PdxS -
  M8849_RS10255 (M8849_10255) - 2213155..2214567 (+) 1413 WP_169029541.1 PLP-dependent aminotransferase family protein -
  M8849_RS10260 (M8849_10260) - 2214599..2217160 (-) 2562 WP_059246302.1 penicillin-binding protein 1A -
  M8849_RS10265 (M8849_10265) - 2217296..2218069 (+) 774 WP_046333757.1 pilus assembly protein PilM -
  M8849_RS10270 (M8849_10270) - 2218109..2218624 (+) 516 WP_046333758.1 competence protein ComB -
  M8849_RS10275 (M8849_10275) - 2218624..2219148 (+) 525 WP_046333759.1 hypothetical protein -
  M8849_RS10280 (M8849_10280) - 2219151..2219513 (+) 363 WP_005748555.1 pilus assembly protein PilP -
  M8849_RS10285 (M8849_10285) comE 2219523..2220857 (+) 1335 WP_169029542.1 type IV pilus secretin PilQ Machinery gene

Sequence


Protein


Download         Length: 444 a.a.        Molecular weight: 49284.71 Da        Isoelectric Point: 8.0163

>NTDB_id=691184 M8849_RS10285 WP_169029542.1 2219523..2220857(+) (comE) [Pasteurella multocida strain 11245]
MWRAFRKISFVYFLCGVAYVGSSQAQDAEHFYLRLKQAPLVEMLQYLALQQHQDLLIDDHLEGTLSLQMKKTTFEKCLQS
IARMKQLELHQEGKSYYLTSPSGVAANDTHHPTSLMTSSIKLHFAKAAEVMKSLTSGQGSLLSVGGSLSFDERTNLLLIQ
DEPQSIQRIKALVAEMDKPIEQIAIEARIVTMTDESLQELGVRWGLFQATEQAHTIAGSLAANGFSNIENQLNVNFSTNS
APVGSIALQLAKINGRLLDLELTALEREKHIEIIASPRLLTTNKKSASIKQGTEIPYVMKRGKDKSESVEFREAVLGLDV
TPHISKDNSILLDLLITQNTLGAPVVYDKGEIVSIDKQEINTQVVAQDGETIVLGGVFHDTMTKGVNKVPLLGDLPLLKY
VFSQKTERHQKRELVIFVTPHIIKPSQGSPEQKTTRVKKSAKSR

Nucleotide


Download         Length: 1335 bp        

>NTDB_id=691184 M8849_RS10285 WP_169029542.1 2219523..2220857(+) (comE) [Pasteurella multocida strain 11245]
ATGTGGCGAGCATTCAGAAAAATATCTTTTGTGTACTTTTTATGTGGGGTTGCTTATGTTGGAAGTAGTCAAGCACAAGA
CGCAGAACATTTTTATTTACGTTTAAAACAAGCGCCTTTAGTCGAAATGTTACAGTATTTAGCATTACAACAACATCAGG
ATTTGTTAATCGATGATCATTTAGAGGGCACATTATCATTACAGATGAAAAAGACAACCTTTGAGAAATGTTTACAGTCG
ATTGCAAGAATGAAACAACTTGAGTTACATCAAGAAGGAAAATCCTATTATTTAACTTCCCCTTCAGGTGTTGCAGCAAA
CGATACTCATCATCCTACGTCATTGATGACATCTTCAATAAAATTGCATTTTGCCAAAGCCGCAGAGGTGATGAAATCTT
TAACTTCAGGGCAGGGAAGTTTACTTTCTGTCGGGGGGAGTTTGAGTTTTGATGAGCGGACTAATTTACTGCTGATTCAG
GATGAACCGCAATCAATACAGCGTATTAAAGCATTAGTAGCAGAAATGGATAAACCCATTGAACAAATTGCGATCGAAGC
TAGGATTGTGACGATGACAGACGAAAGTTTGCAGGAACTTGGTGTAAGATGGGGGCTATTTCAAGCAACAGAACAGGCAC
ATACTATTGCAGGGAGTTTAGCCGCGAACGGCTTTTCGAATATAGAAAACCAATTAAATGTGAATTTCTCGACCAATAGT
GCACCTGTTGGTTCCATCGCCTTACAGTTGGCGAAAATAAATGGTCGATTATTAGACTTGGAATTAACTGCCTTGGAGCG
AGAAAAGCATATTGAGATTATTGCGAGTCCTCGTTTATTAACAACGAATAAAAAAAGTGCCAGTATCAAACAAGGGACGG
AAATTCCTTATGTGATGAAACGGGGAAAAGATAAAAGCGAATCGGTGGAATTTCGAGAAGCTGTATTGGGTTTAGATGTG
ACACCGCATATCTCAAAAGACAACTCGATTTTATTAGATTTATTGATTACACAAAATACATTAGGTGCACCGGTAGTGTA
TGATAAAGGCGAAATTGTTTCGATCGATAAACAGGAAATCAATACTCAAGTCGTCGCTCAAGATGGTGAAACCATCGTTT
TAGGTGGGGTGTTTCATGATACGATGACAAAGGGAGTCAATAAAGTACCACTACTAGGGGATTTGCCTTTGCTTAAATAT
GTGTTTAGCCAGAAAACTGAACGTCATCAAAAGCGGGAATTAGTGATTTTTGTCACTCCTCATATTATCAAACCTAGCCA
AGGTTCGCCTGAACAAAAAACAACAAGAGTTAAAAAATCTGCAAAATCAAGGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae Rd KW20

62.269

97.297

0.606

  comE Haemophilus influenzae 86-028NP

61.574

97.297

0.599

  comE Glaesserella parasuis strain SC1401

48.961

97.523

0.477

  pilQ Vibrio campbellii strain DS40M4

39.524

94.595

0.374

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

39.806

92.793

0.369

  pilQ Vibrio cholerae strain A1552

39.806

92.793

0.369