Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   I5Z97_RS03275 Genome accession   NZ_CP065386
Coordinates   642639..643901 (+) Length   420 a.a.
NCBI ID   WP_160434722.1    Uniprot ID   -
Organism   Glaesserella parasuis strain LHDR_HPS_1_2     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 637639..648901
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I5Z97_RS03250 (I5Z97_03250) - 637833..640388 (-) 2556 WP_021119495.1 penicillin-binding protein 1A -
  I5Z97_RS03255 (I5Z97_03255) comA 640542..641219 (+) 678 WP_021113429.1 hypothetical protein Machinery gene
  I5Z97_RS03260 (I5Z97_03260) comB 641195..641716 (+) 522 WP_062923926.1 hypothetical protein Machinery gene
  I5Z97_RS03265 (I5Z97_03265) comC 641713..642243 (+) 531 WP_196980303.1 ATPase Machinery gene
  I5Z97_RS03270 (I5Z97_03270) comD 642288..642629 (+) 342 WP_231401392.1 hypothetical protein Machinery gene
  I5Z97_RS03275 (I5Z97_03275) comE 642639..643901 (+) 1263 WP_160434722.1 type IV pilus secretin PilQ Machinery gene
  I5Z97_RS03280 (I5Z97_03280) nusB 643980..644393 (+) 414 WP_010785952.1 transcription antitermination factor NusB -
  I5Z97_RS03285 (I5Z97_03285) thiL 644402..645370 (+) 969 WP_196980305.1 thiamine-phosphate kinase -
  I5Z97_RS03290 (I5Z97_03290) - 645373..645843 (+) 471 WP_021117942.1 phosphatidylglycerophosphatase A -
  I5Z97_RS03295 (I5Z97_03295) - 645845..646474 (+) 630 WP_015940164.1 LysE family transporter -
  I5Z97_RS03300 (I5Z97_03300) trxA 646771..647085 (+) 315 WP_015940165.1 thioredoxin -
  I5Z97_RS11530 - 647816..648094 (+) 279 WP_414626320.1 hypothetical protein -

Sequence


Protein


Download         Length: 420 a.a.        Molecular weight: 47033.02 Da        Isoelectric Point: 8.9182

>NTDB_id=509127 I5Z97_RS03275 WP_160434722.1 642639..643901(+) (comE) [Glaesserella parasuis strain LHDR_HPS_1_2]
MRYLFLLFFATFPVLANQQISLSIKNAPTAEIIGYLAEETGKNITISDEIKDTKNFRVEKSHFDEILNSLIKTHQLNLKK
ENGIYYIHQAQEHKQHTTAQLVNALPKLITKTIKLHYSKASEVIESLTKGQGNLLSENGYLHFDDRSNSIIVKDSAASVK
NFTQLIETLDKPTEQIAIEARIVTISSEHLQQLGVRWGLFSPNENHYKLAGNLEGNGLTTNNLNVNFPVNPSASVALQIA
AINSRVLHLELTALESENNIEIIASPRLLTTDKKPASIKQGTEIPYAMYSKKKEITDIEFREAVLGLEVTPHISKQNQIL
LDLAISQNSPNNQMNNMMATIDKQEINTQVLAKHGETIVLGGIFQHLIAKGEDKVPLLGSIPVIKRLFSQNRDKISKREL
VIFVTPYIVKSEKIGAEKQK

Nucleotide


Download         Length: 1263 bp        

>NTDB_id=509127 I5Z97_RS03275 WP_160434722.1 642639..643901(+) (comE) [Glaesserella parasuis strain LHDR_HPS_1_2]
ATGCGTTATTTATTCCTGCTATTCTTTGCTACTTTTCCTGTATTAGCAAACCAACAAATTTCGCTTTCTATAAAAAATGC
CCCTACGGCAGAAATTATTGGTTATTTAGCTGAAGAAACGGGAAAAAATATTACGATTTCAGATGAGATTAAAGATACAA
AAAATTTCAGAGTAGAAAAAAGCCATTTTGATGAGATATTAAATAGTTTAATCAAAACACATCAATTAAATTTAAAAAAA
GAAAATGGCATTTACTATATTCATCAAGCCCAAGAACATAAACAACATACTACGGCACAATTAGTTAATGCTCTGCCTAA
ATTAATCACCAAAACAATCAAGTTACACTATTCCAAAGCCTCTGAAGTAATAGAGTCTTTGACAAAAGGGCAAGGCAACT
TGCTATCAGAGAATGGTTATCTTCATTTTGATGATCGCAGTAATAGTATTATCGTCAAAGACAGTGCCGCATCGGTTAAA
AACTTTACTCAACTTATTGAAACCCTAGATAAACCAACGGAACAAATTGCAATTGAAGCCCGAATTGTCACGATCAGCAG
TGAACATTTACAACAACTTGGCGTGCGTTGGGGTTTATTTTCCCCTAACGAAAATCACTACAAATTGGCAGGAAACTTAG
AGGGCAACGGATTAACTACCAACAACTTAAACGTAAATTTTCCAGTAAATCCGTCCGCTTCCGTTGCCTTACAAATTGCC
GCTATCAACAGCCGTGTACTTCATTTAGAACTCACCGCATTAGAGAGCGAAAATAACATTGAGATCATTGCAAGCCCTCG
CTTACTCACAACAGATAAAAAACCAGCGAGTATCAAACAAGGTACAGAGATTCCTTATGCAATGTACAGTAAGAAAAAGG
AAATCACCGATATTGAATTTCGTGAAGCGGTTTTGGGGCTAGAAGTCACGCCACATATTTCTAAACAAAATCAGATTTTG
TTAGATCTTGCCATTAGCCAAAATTCGCCAAATAACCAGATGAACAATATGATGGCAACGATTGATAAACAAGAAATTAA
TACACAAGTCCTTGCTAAACACGGCGAAACCATCGTATTAGGCGGTATTTTTCAACATCTGATCGCCAAAGGCGAAGATA
AAGTCCCGCTGTTAGGCAGTATCCCCGTGATTAAACGCTTATTTAGCCAAAACCGAGATAAAATCTCTAAACGGGAGCTG
GTTATTTTTGTTACGCCTTATATTGTAAAATCTGAAAAAATAGGAGCGGAAAAGCAGAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Glaesserella parasuis strain SC1401

96.905

100

0.969

  comE Haemophilus influenzae Rd KW20

52.706

100

0.533

  comE Haemophilus influenzae 86-028NP

51.765

100

0.524

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

37.44

98.571

0.369

  pilQ Vibrio cholerae strain A1552

37.44

98.571

0.369

  pilQ Vibrio campbellii strain DS40M4

36.019

100

0.362