Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   KMZ33_RS11530 Genome accession   NZ_CP076113
Coordinates   2376915..2378249 (+) Length   444 a.a.
NCBI ID   WP_241951591.1    Uniprot ID   -
Organism   Pasteurella multocida strain GS2020-X2     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2371915..2383249
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  KMZ33_RS11505 - 2371991..2374552 (-) 2562 WP_241951586.1 penicillin-binding protein 1A -
  KMZ33_RS11510 - 2374688..2375461 (+) 774 WP_241951587.1 competence protein ComA -
  KMZ33_RS11515 - 2375501..2376016 (+) 516 WP_241951588.1 competence protein ComB -
  KMZ33_RS11520 - 2376016..2376540 (+) 525 WP_310549681.1 ATPase -
  KMZ33_RS11525 - 2376543..2376905 (+) 363 WP_241951590.1 pilus assembly protein PilP -
  KMZ33_RS11530 comE 2376915..2378249 (+) 1335 WP_241951591.1 type IV pilus secretin PilQ Machinery gene
  KMZ33_RS11535 aroK 2378432..2378959 (+) 528 WP_005717623.1 shikimate kinase AroK -
  KMZ33_RS11540 aroB 2378976..2380064 (+) 1089 WP_115173789.1 3-dehydroquinate synthase -
  KMZ33_RS11545 - 2380068..2380973 (+) 906 WP_241951592.1 Dam family site-specific DNA-(adenine-N6)-methyltransferase -
  KMZ33_RS11550 - 2381020..2381325 (-) 306 WP_241951593.1 DciA family protein -
  KMZ33_RS11555 secM 2381415..2381729 (+) 315 WP_241951594.1 secA translation cis-regulator SecM -

Sequence


Protein


Download         Length: 444 a.a.        Molecular weight: 49285.64 Da        Isoelectric Point: 7.3925

>NTDB_id=571965 KMZ33_RS11530 WP_241951591.1 2376915..2378249(+) (comE) [Pasteurella multocida strain GS2020-X2]
MWRAFRKISFVYFLCGVAYVGSSQAQDAEHFYLRLKQAPLVEMLQYLALQQHQDLLIDDHLEGTLSLQMKKTTFEKCLQS
IARMKQLELHQEGKSYYLTSPSGVAANDTHHPTSLMTSSIKLHFAKAAEVVKSLTSGQGSLLSVGGSLSFDERTNLLLIQ
DEPQSIQRIKALVAEMDKPIEQIAIEARIVTMTDESLQELGVRWGLFQATEQAHTIAGSLAANGFSNIENQLNVNFSTNS
APVGSIALQLAKINGRLLDLELTALEREKHIEIIASPRLLTTNKKSASIKQGTEIPYVMKRGKDKSESVEFREAVLGLDV
TPHISKDNTILLDLLITQNTLGAPVVYDKGEIVSIDKQEINTQVVAQDGETIVLGGVFHDTMTKGVNKVPLLGDLPLLKH
VFSQETERHQKRELVIFVTPHIIKPTQSSPEQKTTRVKKSAKSR

Nucleotide


Download         Length: 1335 bp        

>NTDB_id=571965 KMZ33_RS11530 WP_241951591.1 2376915..2378249(+) (comE) [Pasteurella multocida strain GS2020-X2]
ATGTGGCGAGCATTCAGAAAAATATCTTTTGTGTACTTTTTATGTGGAGTTGCTTATGTTGGAAGTAGTCAAGCACAAGA
CGCAGAACATTTTTATTTACGTTTAAAACAAGCGCCTTTAGTCGAAATGTTACAGTATTTAGCATTACAACAACATCAGG
ATTTGTTAATCGATGATCATTTAGAGGGCACATTATCATTACAGATGAAAAAGACAACCTTTGAGAAATGTTTACAGTCG
ATTGCAAGAATGAAACAACTTGAGTTACATCAAGAAGGGAAATCCTATTATTTAACTTCCCCTTCAGGTGTTGCAGCAAA
CGATACTCATCATCCTACGTCATTGATGACATCTTCAATAAAATTGCATTTTGCCAAAGCTGCAGAGGTAGTGAAATCTT
TAACTTCAGGGCAGGGAAGTTTACTTTCTGTCGGGGGGAGTTTGAGTTTTGATGAGCGGACTAATTTACTGCTGATTCAG
GATGAACCGCAATCGATACAGCGTATTAAAGCATTAGTAGCAGAAATGGATAAACCCATTGAACAAATTGCGATCGAAGC
CAGAATTGTGACGATGACAGACGAAAGTTTGCAGGAACTTGGCGTAAGATGGGGGCTATTTCAAGCAACAGAACAGGCAC
ATACTATTGCTGGGAGTTTAGCTGCAAACGGCTTTTCAAATATAGAAAACCAATTAAATGTGAATTTCTCGACCAATAGT
GCACCAGTTGGTTCCATCGCCCTACAGTTGGCGAAAATAAATGGTCGATTATTAGACTTGGAATTAACTGCCTTGGAACG
AGAAAAGCATATTGAGATTATTGCGAGTCCTCGTTTATTAACAACGAATAAAAAAAGTGCCAGTATTAAACAAGGGACGG
AAATTCCTTATGTGATGAAACGGGGAAAAGATAAAAGCGAATCGGTGGAATTTCGAGAAGCTGTATTAGGTTTAGATGTG
ACACCGCATATCTCAAAAGACAACACGATTTTATTAGATTTATTGATTACACAAAATACATTAGGTGCACCGGTAGTGTA
TGATAAAGGCGAAATTGTTTCGATCGATAAACAGGAAATCAATACTCAAGTCGTCGCGCAAGATGGTGAAACCATCGTTT
TAGGCGGGGTGTTTCATGATACGATGACAAAGGGAGTCAATAAAGTACCACTACTAGGGGATTTGCCTTTGCTTAAACAT
GTGTTTAGCCAGGAAACTGAACGTCATCAAAAGCGGGAATTAGTGATTTTTGTCACGCCTCATATTATCAAACCGACCCA
AAGTTCGCCTGAACAAAAAACAACGAGAGTTAAAAAATCTGCAAAATCAAGGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae Rd KW20

62.125

97.523

0.606

  comE Haemophilus influenzae 86-028NP

61.432

97.523

0.599

  comE Glaesserella parasuis strain SC1401

48.611

97.297

0.473

  pilQ Vibrio campbellii strain DS40M4

39.524

94.595

0.374

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

39.806

92.793

0.369

  pilQ Vibrio cholerae strain A1552

39.806

92.793

0.369