Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   B4U42_RS08755 Genome accession   NZ_CP021644
Coordinates   1674075..1675337 (-) Length   420 a.a.
NCBI ID   WP_005712502.1    Uniprot ID   -
Organism   Glaesserella parasuis 29755     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1669075..1680337
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  B4U42_RS08720 (B4U42_08575) - 1669089..1669508 (+) 420 WP_005714137.1 hotdog fold thioesterase -
  B4U42_RS08725 (B4U42_08580) - 1669699..1670300 (-) 602 Protein_1650 transposase -
  B4U42_RS08730 (B4U42_08585) trxA 1670891..1671205 (-) 315 WP_005712493.1 thioredoxin -
  B4U42_RS08735 (B4U42_08590) - 1671502..1672131 (-) 630 WP_005712495.1 LysE family transporter -
  B4U42_RS08740 (B4U42_08595) - 1672133..1672603 (-) 471 WP_005712497.1 phosphatidylglycerophosphatase A -
  B4U42_RS08745 (B4U42_08600) thiL 1672606..1673574 (-) 969 WP_005712499.1 thiamine-phosphate kinase -
  B4U42_RS08750 (B4U42_08605) nusB 1673583..1673996 (-) 414 WP_005712500.1 transcription antitermination factor NusB -
  B4U42_RS08755 (B4U42_08610) comE 1674075..1675337 (-) 1263 WP_005712502.1 type IV pilus secretin PilQ Machinery gene
  B4U42_RS08760 (B4U42_08615) comD 1675347..1675688 (-) 342 WP_235685072.1 hypothetical protein Machinery gene
  B4U42_RS08765 (B4U42_08620) comC 1675733..1676263 (-) 531 WP_005712505.1 hypothetical protein Machinery gene
  B4U42_RS08770 (B4U42_08625) comB 1676260..1676781 (-) 522 WP_005712507.1 hypothetical protein Machinery gene
  B4U42_RS08775 (B4U42_08630) comA 1676757..1677434 (-) 678 WP_005712510.1 hypothetical protein Machinery gene
  B4U42_RS08780 (B4U42_08635) - 1677588..1680143 (+) 2556 WP_005712512.1 penicillin-binding protein 1A -

Sequence


Protein


Download         Length: 420 a.a.        Molecular weight: 47051.05 Da        Isoelectric Point: 8.9182

>NTDB_id=232166 B4U42_RS08755 WP_005712502.1 1674075..1675337(-) (comE) [Glaesserella parasuis 29755]
MRYLFLLFFATFPVLANQQISLSIKNAPTAEIIGYLAEETGKNITISDEIKDTKNFRVEKSHFDEILNSLIKTHQLNLKK
ENGIYYIHQAQEHKQHTTAQLVNALPKLITKTIKLHYSKASEVIESLTKGQGNLLSENGYLHFDDRSNSIIVKDSAASVK
NFTQLIETLDKPTEQIAIEARIVTISSEHLQQLGVRWGLFSPNENHYKLAGNLEGNGLTTNNLNVNFPVNPSASVALQIA
AINSRVLHLELTALESENNIEIIASPRLLTTDKKPASIKQGTEIPYAMYSKKKEITDIEFREAVLGLEVTPHISKQNQIL
LDLAISQNSPNNQMNNMMATIDKQEINTQVLAKHGETIVLGGIFQHLIAKGEDKVPLLGSIPVIKRLFSQNRDKISKREL
VIFVTPYIVKSEKMGAEKQK

Nucleotide


Download         Length: 1263 bp        

>NTDB_id=232166 B4U42_RS08755 WP_005712502.1 1674075..1675337(-) (comE) [Glaesserella parasuis 29755]
ATGCGTTATTTATTCCTGCTATTCTTTGCTACTTTTCCTGTATTAGCAAACCAACAAATTTCGCTTTCTATAAAAAATGC
CCCTACGGCAGAAATTATTGGTTATTTAGCTGAAGAAACGGGAAAAAATATTACGATTTCAGATGAGATTAAAGATACAA
AAAATTTCAGAGTAGAAAAAAGCCATTTTGATGAGATATTAAATAGTTTAATCAAAACACATCAATTAAATTTAAAAAAA
GAAAATGGCATTTACTATATTCATCAAGCCCAAGAACATAAACAACATACTACGGCACAATTAGTTAATGCTCTGCCTAA
ATTAATCACCAAAACAATCAAGTTACACTATTCCAAAGCCTCTGAAGTAATAGAGTCTTTGACAAAAGGGCAAGGCAACT
TGCTATCAGAGAATGGTTATCTTCATTTTGATGATCGCAGTAATAGTATTATCGTCAAAGACAGTGCCGCCTCAGTTAAA
AACTTTACTCAACTTATTGAAACCCTAGATAAACCAACGGAACAAATTGCAATTGAAGCCCGAATTGTCACGATCAGCAG
TGAACATTTACAACAACTTGGCGTGCGTTGGGGCTTATTTTCCCCTAACGAAAATCACTACAAATTGGCAGGAAACTTAG
AGGGCAACGGATTAACTACCAACAACTTAAACGTAAATTTTCCAGTAAATCCGTCCGCTTCCGTTGCCTTACAAATTGCC
GCTATCAACAGCCGTGTACTTCATTTAGAACTCACCGCATTAGAGAGCGAAAATAACATTGAGATCATTGCAAGCCCTCG
CTTACTCACAACAGATAAAAAACCAGCGAGTATCAAACAAGGTACAGAGATTCCTTATGCAATGTACAGTAAGAAAAAGG
AAATCACCGATATTGAATTTCGTGAAGCGGTTTTGGGGCTAGAAGTCACGCCACATATTTCTAAACAAAATCAGATTTTG
TTAGATCTTGCCATTAGCCAAAATTCGCCAAATAACCAGATGAACAATATGATGGCAACGATTGATAAACAAGAAATTAA
TACACAAGTCCTTGCTAAACACGGCGAAACCATCGTATTAGGCGGTATTTTTCAACATCTGATCGCCAAAGGCGAAGATA
AAGTCCCGCTGTTAGGCAGTATCCCCGTGATTAAACGCTTATTTAGCCAAAACCGAGATAAAATCTCTAAACGGGAGCTG
GTTATTTTTGTTACGCCTTATATTGTAAAATCTGAAAAAATGGGAGCGGAAAAGCAGAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Glaesserella parasuis strain SC1401

96.667

100

0.967

  comE Haemophilus influenzae Rd KW20

52.706

100

0.533

  comE Haemophilus influenzae 86-028NP

51.765

100

0.524

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

37.44

98.571

0.369

  pilQ Vibrio cholerae strain A1552

37.44

98.571

0.369

  pilQ Vibrio campbellii strain DS40M4

36.019

100

0.362


Multiple sequence alignment