Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comE
Type: Machinery gene
Locus tag: A2U20_RS03485
Genome accession: NZ_CP018032
Coordinates: 671788..673050 (+)
Length: 420 a.a.
NCBI ID: WP_021112071.1
UniProt ID: -
Organism: Glaesserella parasuis D74
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
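
The coordinates and the protein length in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention, not stated on this page) and that the coding sequence ends in a single stop codon.

```python
# Values copied from the Overview of this record.
start, end = 671788, 673050   # assumed 1-based, inclusive, (+) strand
protein_length = 420          # residues (a.a.)

gene_length = end - start + 1      # inclusive span in bp
assert gene_length % 3 == 0        # a complete CDS is a whole number of codons

codons = gene_length // 3          # includes the terminal stop codon
coding_codons = codons - 1         # residues expected in the protein

print(gene_length, coding_codons)  # 1263 420
```

Under those assumptions the 1263 bp span corresponds to 420 residues plus one stop codon, which agrees with the lengths listed in the Sequence section.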

Genomic Context


Location: 666788..678050
Locus tag                    Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                    Description
A2U20_RS03460 (A2U20_03420)  -          666982..669537 (-)    2556       WP_043896007.1  penicillin-binding protein 1A              -
A2U20_RS03465 (A2U20_03425)  comA       669691..670368 (+)    678        WP_021112067.1  hypothetical protein                       Machinery gene
A2U20_RS03470 (A2U20_03430)  comB       670344..670865 (+)    522        WP_021112068.1  hypothetical protein                       Machinery gene
A2U20_RS03475 (A2U20_03435)  comC       670862..671392 (+)    531        WP_043894777.1  hypothetical protein                       Machinery gene
A2U20_RS03480 (A2U20_03440)  comD       671437..671778 (+)    342        WP_021114623.1  pilus assembly, PilP family protein        Machinery gene
A2U20_RS03485 (A2U20_03445)  comE       671788..673050 (+)    1263       WP_021112071.1  type IV pilus secretin PilQ                Machinery gene
A2U20_RS03490 (A2U20_03450)  nusB       673129..673542 (+)    414        WP_021112072.1  transcription antitermination factor NusB  -
A2U20_RS03495 (A2U20_03455)  thiL       673551..674519 (+)    969        WP_021117941.1  thiamine-phosphate kinase                  -
A2U20_RS03500 (A2U20_03460)  -          674522..674992 (+)    471        WP_021117942.1  phosphatidylglycerophosphatase A           -
A2U20_RS03505 (A2U20_03465)  -          674994..675623 (+)    630        WP_015940164.1  LysE family transporter                    -
A2U20_RS03510 (A2U20_03470)  trxA       675920..676234 (+)    315        WP_021117943.1  thioredoxin                                -
A2U20_RS03515 (A2U20_03475)  -          676719..677069 (+)    351        WP_043894779.1  hypothetical protein                       -
A2U20_RS03520 (A2U20_03480)  -          677222..677641 (-)    420        WP_021117944.1  hotdog fold thioesterase                   -
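
The numbers in this table can be used to see how the displayed window relates to comE and how tightly the com genes are packed. The following is a rough sketch that assumes 1-based inclusive coordinates; the `genes` list and the variable names are illustrative and simply copy values from the rows above.

```python
# Window and comE coordinates as listed in this record.
window_start, window_end = 666788, 678050
come_start, come_end = 671788, 673050

# Flank shown on each side of comE.
flank_left = come_start - window_start
flank_right = window_end - come_end
print(flank_left, flank_right)  # 5000 5000

# (gene, start, end) for the consecutive (+) strand genes comA..nusB.
genes = [
    ("comA", 669691, 670368),
    ("comB", 670344, 670865),
    ("comC", 670862, 671392),
    ("comD", 671437, 671778),
    ("comE", 671788, 673050),
    ("nusB", 673129, 673542),
]

# Intergenic gap in bp between neighbours; a negative value is an overlap.
gaps = {}
for (name_a, _, end_a), (name_b, start_b, _) in zip(genes, genes[1:]):
    gaps[f"{name_a}-{name_b}"] = start_b - end_a - 1

for pair, gap in gaps.items():
    print(pair, gap)
```

On these values the window appears to extend 5,000 bp to either side of comE, and comA/comB and comB/comC overlap by 25 bp and 4 bp respectively, while comD and comE are separated by 9 bp.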

Sequence


Protein


Length: 420 a.a.   Molecular weight: 46967.82 Da   Isoelectric point: 7.3676

>NTDB_id=205179 A2U20_RS03485 WP_021112071.1 671788..673050(+) (comE) [Glaesserella parasuis D74]
MRYLFLLFFATFPVLANQQISLSIRNAPTAEIISYLAEETGKNITISDEIKDTKNFRVEKSHFDEILNSLIKTHQLNLKK
ENGIYYIHQAQEHKQHTTAQLVNALPKLITKTIKLHYSKASEVIESLTKGQGNLLSESGYLHFDDRSNSIIVKDSAASVK
NFTQLIESLDKPTEQIAIEARIVTISSEHLQQLGVRWGLFSPNENHYKLAGNLEGNGLTTNNLNVNFPVNPSASVALQIA
AINSRVLDLELTALESENNIEIIASPRLLTTDKKPASIKQGTEIPYAMYSKKEEITDIEFREAVLGLEVTPHISKQNQIL
LDLAISQNSPNNQINNTMVTIDKQEIKTQVLAKHGETIVLGGIFQHLIAKGEDKVPLLGSIPVIKRLFSQSQDKISKREL
VIFVTPYIVKSEKIGAEKQK
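
The FASTA header above packs several fields into one line. A minimal sketch of pulling them apart with a regular expression follows; the field layout is inferred from this single header, and the group names (`ntdb_id`, `locus_tag`, and so on) are illustrative rather than part of any documented format.

```python
import re

header = (">NTDB_id=205179 A2U20_RS03485 WP_021112071.1 "
          "671788..673050(+) (comE) [Glaesserella parasuis D74]")

# Layout assumed from the header above:
# >NTDB_id=<id> <locus tag> <protein id> <start>..<end>(<strand>) (<gene>) [<organism>]
pattern = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)

match = pattern.match(header)
fields = match.groupdict() if match else {}

for key, value in fields.items():
    print(f"{key}: {value}")
```

Headers that deviate from this layout (for example, a missing gene name) would simply fail to match, leaving `fields` empty.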

Nucleotide


Length: 1263 bp

>NTDB_id=205179 A2U20_RS03485 WP_021112071.1 671788..673050(+) (comE) [Glaesserella parasuis D74]
ATGCGTTATTTATTTCTGCTCTTCTTTGCTACTTTTCCTGTATTAGCAAACCAACAAATTTCACTTTCGATAAGAAATGC
CCCAACGGCAGAGATTATTAGTTATTTGGCGGAGGAAACGGGAAAGAATATTACAATTTCGGATGAAATTAAAGATACAA
AAAATTTCAGAGTAGAAAAAAGCCATTTTGATGAGATATTAAATAGTTTAATCAAAACACATCAATTAAATTTAAAAAAA
GAAAATGGCATTTACTATATTCATCAAGCCCAAGAACATAAACAACATACTACGGCACAATTAGTTAATGCTCTGCCTAA
ATTAATCACCAAAACAATCAAGTTACACTATTCCAAAGCCTCTGAAGTTATAGAGTCTTTAACAAAAGGGCAAGGCAATT
TACTATCAGAAAGTGGTTATCTTCACTTTGATGATCGCAGTAATAGCATTATCGTCAAAGACAGTGCCGCCTCAGTTAAA
AACTTTACTCAACTTATCGAAAGTCTAGATAAACCAACGGAACAAATTGCAATTGAAGCCCGAATTGTCACTATCAGCAG
TGAACATTTACAACAACTTGGCGTGCGTTGGGGCTTATTTTCCCCTAACGAAAATCACTACAAATTGGCAGGCAATTTAG
AAGGCAACGGATTAACTACCAACAACTTAAACGTAAATTTTCCAGTAAATCCGTCCGCTTCTGTTGCCTTACAAATTGCT
GCAATCAACAGCCGTGTACTTGATTTAGAACTTACTGCATTAGAAAGCGAAAATAACATAGAGATCATTGCAAGCCCTCG
CTTACTCACAACAGATAAAAAACCAGCGAGTATCAAACAAGGTACAGAAATTCCTTATGCAATGTACAGTAAGAAAGAGG
AAATCACCGATATTGAATTTCGTGAAGCCGTTTTGGGGTTAGAGGTCACGCCACATATTTCTAAACAAAATCAGATTTTG
TTAGACCTCGCTATCAGCCAAAATTCGCCCAATAACCAAATAAATAATACAATGGTGACAATTGATAAACAGGAAATCAA
AACGCAAGTTCTAGCTAAACACGGTGAAACCATCGTATTAGGCGGTATTTTTCAACATCTGATCGCCAAAGGCGAAGATA
AAGTACCGCTGTTAGGCAGTATCCCCGTGATTAAACGCTTATTTAGCCAAAGCCAAGATAAAATCTCTAAACGAGAATTA
GTGATTTTTGTCACGCCTTATATTGTGAAATCTGAAAAAATAGGAGCGGAAAAGCAGAAATAA
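
The nucleotide and protein records can be checked against each other by translating the DNA. The sketch below builds the standard genetic code (the `translate` helper is illustrative, not a tool used by the database) and translates only the first 30 nt and the final line of the sequence above. The final line is taken to be in frame on the assumption that each preceding line holds 80 nt, as the 1263 bp total suggests.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon. Trailing partial codons are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and the final line of the nucleotide record above.
head_dna = "ATGCGTTATTTATTTCTGCTCTTCTTTGCT"
tail_dna = "GTGATTTTTGTCACGCCTTATATTGTGAAATCTGAAAAAATAGGAGCGGAAAAGCAGAAATAA"

head = translate(head_dna)
tail = translate(tail_dna)
print(head)  # MRYLFLLFFA
print(tail)  # VIFVTPYIVKSEKIGAEKQK*
```

Both fragments match the start and end of the protein record, with the trailing `*` corresponding to the TAA stop codon.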


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                                       Identities (%)  Coverage (%)  Ha-value
comE     Glaesserella parasuis strain SC1401            99.762          100           0.998
comE     Haemophilus influenzae Rd KW20                 52.941          100           0.536
comE     Haemophilus influenzae 86-028NP                52              100           0.526
pilQ     Vibrio cholerae O1 biovar El Tor strain E7946  38.164          98.571        0.376
pilQ     Vibrio cholerae strain A1552                   38.164          98.571        0.376
pilQ     Vibrio campbellii strain DS40M4                36.967          100           0.371


Multiple sequence alignment