Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   B4X03_RS05485 Genome accession   NZ_CP020085
Coordinates   1095645..1096907 (+) Length   420 a.a.
NCBI ID   WP_026917118.1    Uniprot ID   -
Organism   Glaesserella parasuis strain CL120103     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1090645..1101907
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  B4X03_RS05460 - 1090839..1093394 (-) 2556 WP_035490200.1 penicillin-binding protein 1A -
  B4X03_RS05465 comA 1093548..1094225 (+) 678 WP_005712510.1 hypothetical protein Machinery gene
  B4X03_RS05470 comB 1094201..1094722 (+) 522 WP_005712507.1 hypothetical protein Machinery gene
  B4X03_RS05475 comC 1094719..1095249 (+) 531 WP_015940159.1 hypothetical protein Machinery gene
  B4X03_RS05480 comD 1095249..1095635 (+) 387 WP_026917117.1 pilus assembly protein PilP Machinery gene
  B4X03_RS05485 comE 1095645..1096907 (+) 1263 WP_026917118.1 type IV pilus secretin PilQ Machinery gene
  B4X03_RS05490 nusB 1096986..1097399 (+) 414 WP_005712500.1 transcription antitermination factor NusB -
  B4X03_RS05495 thiL 1097408..1098376 (+) 969 WP_082259161.1 thiamine-phosphate kinase -
  B4X03_RS05500 - 1098379..1098849 (+) 471 WP_021113432.1 phosphatidylglycerophosphatase A -
  B4X03_RS05505 - 1098851..1099492 (+) 642 WP_157888943.1 LysE family transporter -
  B4X03_RS05510 trxA 1099776..1100090 (+) 315 WP_015940165.1 thioredoxin -
  B4X03_RS12080 - 1100820..1101035 (+) 216 WP_418251379.1 hypothetical protein -

Sequence


Protein


Download         Length: 420 a.a.        Molecular weight: 47095.07 Da        Isoelectric Point: 8.9232

>NTDB_id=220967 B4X03_RS05485 WP_026917118.1 1095645..1096907(+) (comE) [Glaesserella parasuis strain CL120103]
MRYLFLLFFATFPVLANQQISLSIRNAPTAEIISYLAEETGKNITISDEIKDTKNFRVEKSHFDEILNSLIKTHQLNLKK
ENGIYYIHQAQEHKQHTTAQLVNALPKLITKTIKLHYSKASEVIESLTKGQGNLLSENGYLHFDDRSNSIIVKDSAASVK
NFTQLIESLDKPTEQIAIEARIVTISSEHLQQLGVRWGLFSPNENHYKLAGNLEGNGLTTNNLNVNFPVNPSASVALQIA
AINSRVLHLELTALESENNIEIIASPRLLTTDKKPASIKQGTEIPYAMYSKKKEITDIEFREAVLGLEVTPHISKQNQIL
LDLAISQNSPNNQMNNMMATIDKQEINTQVLAKHGETIVLGGIFQHLIAKGEDKVPLLGSIPVIKRLFSQNRDKISKREL
VIFVTPYIVKSEKMGAEKQK

Nucleotide


Download         Length: 1263 bp        

>NTDB_id=220967 B4X03_RS05485 WP_026917118.1 1095645..1096907(+) (comE) [Glaesserella parasuis strain CL120103]
ATGCGTTATTTATTCCTGCTATTCTTTGCTACTTTTCCTGTATTAGCAAACCAACAAATTTCACTTTCGATAAGAAATGC
CCCAACGGCAGAGATTATTAGTTATTTGGCGGAGGAAACTGGAAAGAATATTACAATTTCGGATGAAATTAAAGATACAA
AAAATTTCAGAGTAGAAAAAAGCCATTTTGATGAGATATTAAATAGTTTAATCAAAACACATCAATTAAATTTAAAAAAA
GAAAATGGCATTTACTATATTCATCAAGCCCAAGAACATAAACAACATACTACGGCACAATTAGTTAATGCTCTGCCTAA
ATTAATCACCAAAACAATCAAGTTACACTATTCCAAAGCCTCTGAAGTAATAGAGTCTTTGACAAAAGGGCAAGGCAACT
TGCTATCAGAGAATGGTTATCTTCATTTTGATGATCGCAGTAATAGTATTATCGTCAAAGACAGTGCCGCCTCAGTTAAA
AACTTTACTCAACTTATCGAAAGCCTAGATAAACCAACGGAACAAATTGCAATTGAAGCCCGAATTGTCACGATCAGCAG
TGAACATTTACAACAACTTGGCGTGCGTTGGGGCTTATTTTCCCCTAACGAAAATCACTACAAATTGGCAGGAAACTTAG
AGGGCAACGGATTAACTACCAACAACTTAAACGTAAATTTTCCAGTAAATCCGTCCGCTTCCGTTGCCTTACAAATTGCC
GCTATCAACAGCCGTGTACTTCATTTAGAACTCACCGCATTAGAGAGCGAAAATAACATTGAGATCATTGCAAGCCCTCG
CTTACTCACAACAGATAAAAAACCAGCGAGTATCAAACAAGGTACAGAGATTCCTTATGCAATGTACAGTAAGAAAAAGG
AAATCACCGATATTGAATTTCGTGAAGCGGTTTTGGGGCTAGAAGTCACGCCACATATTTCTAAACAAAATCAGATTTTG
TTAGATCTTGCCATTAGCCAAAATTCGCCAAATAACCAGATGAACAATATGATGGCAACGATTGATAAACAAGAAATTAA
TACACAAGTCCTTGCTAAACACGGCGAAACCATCGTATTAGGCGGTATTTTTCAACATCTGATCGCCAAAGGCGAAGATA
AAGTCCCGCTGTTAGGCAGTATCCCCGTGATTAAACGCTTATTTAGCCAAAACCGAGATAAAATCTCTAAACGGGAGCTG
GTTATTTTTGTTACGCCTTATATTGTAAAATCTGAAAAAATGGGAGCGGAAAAGCAGAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Glaesserella parasuis strain SC1401

97.381

100

0.974

  comE Haemophilus influenzae Rd KW20

52.706

100

0.533

  comE Haemophilus influenzae 86-028NP

51.765

100

0.524

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

37.681

98.571

0.371

  pilQ Vibrio cholerae strain A1552

37.681

98.571

0.371

  pilQ Vibrio campbellii strain DS40M4

36.493

100

0.367


Multiple sequence alignment