Detailed information    

In silico: bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   HAPS_RS11115   Genome accession   NC_011852
Coordinates   2257259..2258521 (+)   Length   420 a.a.
NCBI ID   WP_015940161.1    Uniprot ID   B8F8R6
Organism   Glaesserella parasuis SH0165     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake
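
The coordinates and the protein length listed above are related by simple arithmetic. The sketch below checks this, assuming the coordinates are 1-based and inclusive and that the gene ends with a single stop codon; the variable names are illustrative only.

```python
# Values copied from the Overview above.
start, end = 2257259, 2258521      # assumed 1-based, inclusive, (+) strand

gene_length_bp = end - start + 1   # inclusive interval length
codons = gene_length_bp // 3       # codon count, including the stop codon
protein_length_aa = codons - 1     # the stop codon is not translated

print(gene_length_bp, protein_length_aa)  # prints: 1263 420
```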

Genomic Context


Location: 2252259..2263521
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  HAPS_RS11090 (HAPS_2284)   -   2252453..2255008 (-)   2556   WP_015940158.1   penicillin-binding protein 1A   -
  HAPS_RS11095 (HAPS_2285)   comA   2255162..2255839 (+)   678   WP_005712510.1   hypothetical protein   Machinery gene
  HAPS_RS11100 (HAPS_2286)   comB   2255815..2256336 (+)   522   WP_010785956.1   hypothetical protein   Machinery gene
  HAPS_RS11105 (HAPS_2287)   comC   2256333..2256863 (+)   531   WP_015940159.1   hypothetical protein   Machinery gene
  HAPS_RS11110 (HAPS_2288)   comD   2256908..2257249 (+)   342   WP_230205898.1   hypothetical protein   Machinery gene
  HAPS_RS11115 (HAPS_2289)   comE   2257259..2258521 (+)   1263   WP_015940161.1   type IV pilus secretin PilQ   Machinery gene
  HAPS_RS11120 (HAPS_2290)   nusB   2258600..2259013 (+)   414   WP_010785952.1   transcription antitermination factor NusB   -
  HAPS_RS11125 (HAPS_2291)   thiL   2259022..2259990 (+)   969   WP_015940162.1   thiamine-phosphate kinase   -
  HAPS_RS11130 (HAPS_2292)   -   2259993..2260463 (+)   471   WP_021117942.1   phosphatidylglycerophosphatase A   -
  HAPS_RS11135 (HAPS_2293)   -   2260465..2261094 (+)   630   WP_015940164.1   LysE family transporter   -
  HAPS_RS11140 (HAPS_2294)   trxA   2261391..2261705 (+)   315   WP_015940165.1   thioredoxin   -
  HAPS_RS12800   -   2262022..2262651 (+)   630   WP_416352721.1   IS256 family transposase, variant Zn-binding type   -
  HAPS_RS11150 (HAPS_2296)   -   2263089..2263508 (-)   420   WP_005714137.1   hotdog fold thioesterase   -
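
As a consistency check on the table above, the following sketch recomputes sizes from coordinates for three rows, measures the comA/comB overlap, and derives the flanks of the displayed window around comE. It assumes 1-based inclusive coordinates; the tuple layout and variable names are illustrative only.

```python
# (locus tag, start, end, listed size in bp), copied from the table above
rows = [
    ("HAPS_RS11095", 2255162, 2255839, 678),   # comA
    ("HAPS_RS11100", 2255815, 2256336, 522),   # comB
    ("HAPS_RS11115", 2257259, 2258521, 1263),  # comE
]

# Size (bp) should equal the inclusive span of the coordinates.
sizes_ok = all(end - start + 1 == size for _, start, end, size in rows)

# comA ends at 2255839 and comB starts at 2255815, so the two genes overlap.
overlap_bp = rows[0][2] - rows[1][1] + 1

# Flanks of the displayed window (Location: 2252259..2263521) around comE.
window_start, window_end = 2252259, 2263521
upstream_flank = rows[2][1] - window_start
downstream_flank = window_end - rows[2][2]

print(sizes_ok, overlap_bp, upstream_flank, downstream_flank)
# prints: True 25 5000 5000
```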

Sequence


Protein


Length: 420 a.a.   Molecular weight: 47064.04 Da   Isoelectric point: 8.9232

>NTDB_id=32592 HAPS_RS11115 WP_015940161.1 2257259..2258521(+) (comE) [Glaesserella parasuis SH0165]
MRYLFLLFFATFPVLANQQISLSIRNAPTAEIISYLAEETGKNITISDEIKDTKNFRVEKSHFDEILNSLIKTHQLNLKK
ENGIYYIHQAQEHKQHTTAQLVNALPKLITKTIKLHYSKASEVIESLTKGQGNLLSESGYLHFDDRSNSIIVKDSAASVK
NFTQLIETLDKPTEQIAIEARIVTISSEHLQQLGVRWGLFSPNENHYKLAGNLEGNGLTTNNLNVNFPVNPSASVALQIA
AINSRVLHLELTALESENNIEIIASPRLLTTDKKPASIKQGTEIPYAMYSKKKEITDIEFREAVLGLEVTPHISKQNQIL
LDLAISQNSPNNQMNNMMATIDKQEINTQVLAKHGETIVLGGIFQHLIAKGEDKVPLLGSIPVIKRLFSQNRDKISKREL
VIFVTPYIVKSEKIGAEKQK
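
The FASTA header of this record packs several fields into one line. A minimal parsing sketch follows, assuming the header layout seen in this one entry holds in general; the regular expression and the group names (`ntdb_id`, `locus_tag`, and so on) are illustrative choices, not a documented schema.

```python
import re

header = (">NTDB_id=32592 HAPS_RS11115 WP_015940161.1 "
          "2257259..2258521(+) (comE) [Glaesserella parasuis SH0165]")

# Hypothetical pattern inferred from this single header.
HEADER_RE = re.compile(
    r">NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>[^\]]+)\]"
)

match = HEADER_RE.fullmatch(header)
fields = match.groupdict() if match else {}
print(fields["gene"], fields["start"], fields["end"], fields["strand"])
# prints: comE 2257259 2258521 +
```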

Nucleotide


Length: 1263 bp

>NTDB_id=32592 HAPS_RS11115 WP_015940161.1 2257259..2258521(+) (comE) [Glaesserella parasuis SH0165]
ATGCGTTATTTATTCCTGCTATTCTTTGCTACTTTTCCTGTATTAGCAAACCAACAAATTTCACTTTCGATAAGAAATGC
CCCAACGGCAGAGATTATTAGTTATTTGGCGGAGGAAACTGGAAAGAATATTACAATTTCGGATGAAATTAAAGATACAA
AAAATTTCAGAGTAGAAAAAAGCCATTTTGATGAGATATTAAATAGTTTAATCAAAACACATCAATTAAATTTAAAAAAA
GAAAATGGCATTTACTATATTCATCAAGCCCAAGAACATAAACAACATACTACGGCACAATTAGTTAATGCTCTGCCTAA
ATTAATCACCAAAACAATCAAGTTACACTATTCCAAAGCCTCTGAAGTTATAGAGTCTTTAACAAAAGGGCAAGGCAATT
TACTATCAGAAAGTGGTTATCTTCACTTTGATGATCGCAGTAATAGCATTATCGTCAAAGACAGTGCCGCATCGGTTAAA
AACTTTACTCAACTTATTGAAACCCTAGATAAACCAACGGAACAAATTGCAATTGAAGCCCGAATTGTCACGATCAGCAG
TGAACATTTACAACAACTTGGCGTGCGTTGGGGCTTATTTTCCCCTAACGAAAATCACTACAAATTGGCAGGAAACTTAG
AGGGCAACGGATTAACTACCAACAACTTAAACGTAAATTTTCCAGTAAATCCGTCCGCTTCCGTTGCCTTACAAATTGCC
GCTATCAACAGCCGTGTACTTCATTTAGAACTCACCGCATTAGAGAGCGAAAATAACATTGAGATCATTGCAAGCCCTCG
CTTACTCACAACAGATAAAAAACCAGCGAGTATCAAACAAGGTACAGAGATTCCTTATGCAATGTACAGTAAGAAAAAGG
AAATCACCGATATTGAATTTCGTGAAGCGGTTTTGGGGCTAGAAGTCACGCCACATATTTCTAAACAAAATCAGATTTTG
TTAGATCTTGCCATTAGCCAAAATTCGCCAAATAACCAGATGAACAATATGATGGCAACGATTGATAAACAAGAAATTAA
TACACAAGTCCTTGCTAAACACGGCGAAACCATCGTATTAGGCGGTATTTTTCAACATCTGATCGCCAAAGGCGAAGATA
AAGTCCCGCTGTTAGGCAGTATCCCCGTGATTAAACGCTTATTTAGCCAAAACCGAGATAAAATCTCTAAACGGGAGCTG
GTTATTTTTGTTACACCTTATATTGTAAAATCTGAAAAAATAGGAGCGGAAAAGCAGAAATAA
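
The nucleotide record above can be checked against the protein record by translation. The sketch below translates the first 30 bases and the final 63-base line, assuming the standard codon assignments; `translate` is a hypothetical helper written for this illustration, not part of any library.

```python
from itertools import product

# Standard codon table, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(dna):
    """Translate DNA in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3   # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

first_30 = "ATGCGTTATTTATTCCTGCTATTCTTTGCT"   # start of the nucleotide record
last_63 = ("GTTATTTTTGTTACACCTTATATTGTAAAATCTGAAAAAA"
           "TAGGAGCGGAAAAGCAGAAATAA")          # final line of the record

print(translate(first_30))  # prints: MRYLFLLFFA
print(translate(last_63))   # prints: VIFVTPYIVKSEKIGAEKQK*
```

Both outputs agree with the start and end of the protein sequence listed under Protein, with the trailing `*` corresponding to the TAA stop codon.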


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source   ID   Structure
  AlphaFold DB   B8F8R6

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comE   Glaesserella parasuis strain SC1401   97.619   100   0.976
  comE   Haemophilus influenzae Rd KW20   52.706   100   0.533
  comE   Haemophilus influenzae 86-028NP   51.765   100   0.524
  pilQ   Vibrio cholerae O1 biovar El Tor strain E7946   37.44   98.571   0.369
  pilQ   Vibrio cholerae strain A1552   37.44   98.571   0.369
  pilQ   Vibrio campbellii strain DS40M4   36.256   100   0.364


Multiple sequence alignment