Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   CRH16_RS07475 Genome accession   NZ_CP024412
Coordinates   1439625..1440887 (-) Length   420 a.a.
NCBI ID   WP_010785953.1    Uniprot ID   -
Organism   Glaesserella parasuis strain SH0104     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1434625..1445887
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CRH16_RS07440 - 1434640..1435059 (+) 420 WP_005714137.1 hotdog fold thioesterase -
  CRH16_RS12345 - 1435747..1436145 (-) 399 WP_420889732.1 hypothetical protein -
  CRH16_RS07450 trxA 1436442..1436756 (-) 315 WP_005712493.1 thioredoxin -
  CRH16_RS07455 - 1437053..1437682 (-) 630 WP_005712495.1 LysE family transporter -
  CRH16_RS07460 - 1437684..1438153 (-) 470 Protein_1416 phosphatidylglycerophosphatase A -
  CRH16_RS07465 thiL 1438156..1439124 (-) 969 WP_112063047.1 thiamine-phosphate kinase -
  CRH16_RS07470 nusB 1439133..1439546 (-) 414 WP_010785952.1 transcription antitermination factor NusB -
  CRH16_RS07475 comE 1439625..1440887 (-) 1263 WP_010785953.1 type IV pilus secretin PilQ Machinery gene
  CRH16_RS07480 comD 1440897..1441283 (-) 387 WP_112063048.1 hypothetical protein Machinery gene
  CRH16_RS07485 comC 1441283..1441813 (-) 531 WP_112063049.1 ATPase Machinery gene
  CRH16_RS07490 comB 1441810..1442331 (-) 522 WP_005712507.1 hypothetical protein Machinery gene
  CRH16_RS07495 comA 1442307..1442984 (-) 678 WP_112063050.1 competence protein ComA Machinery gene
  CRH16_RS07500 - 1443138..1445692 (+) 2555 Protein_1424 penicillin-binding protein 1A -

Sequence


Protein


Download         Length: 420 a.a.        Molecular weight: 47079.07 Da        Isoelectric Point: 8.9232

>NTDB_id=253412 CRH16_RS07475 WP_010785953.1 1439625..1440887(-) (comE) [Glaesserella parasuis strain SH0104]
MRYLFLLFFATFPVLANQQISLSIRNAPTAEIIGYLAEETGKNITISDEIKDTKNFRVEKSHFDEILNSLIKTHQLNLKK
ENGIYYIHQAQEHKQHTTAQLVNALPKLITKTIKLHYSKASEVIESLTKGQGNLLSENGYLHFDDRSNSIIVKDSAASVK
NFTQLIETLDKPTEQIAIEARIVTISSEHLQQLGVRWGLFSPNENHYKLAGNLEGNGLTTNNLNVNFPVNPSASVALQIA
AINSRVLHLELTALESENNIEIIASPRLLTTDKKPASIKQGTEIPYAMYSKKKEITDIEFREAVLGLEVTPHISKQNQIL
LDLAISQNSPNNQMNNMMATIDKQEINTQVLAKHGETIVLGGIFQHLIAKGEDKVPLLGSIPVIKRLFSQNRDKISKREL
VIFVTPYIVKSEKMGAEKQK

Nucleotide


Download         Length: 1263 bp        

>NTDB_id=253412 CRH16_RS07475 WP_010785953.1 1439625..1440887(-) (comE) [Glaesserella parasuis strain SH0104]
ATGCGTTATTTATTCCTGCTATTCTTTGCTACTTTTCCTGTATTAGCAAACCAACAGATTTCACTTTCGATAAGAAATGC
CCCTACGGCAGAAATTATTGGTTATTTAGCTGAAGAAACGGGAAAAAATATTACGATTTCAGATGAGATTAAAGATACAA
AAAATTTCAGAGTAGAAAAAAGCCATTTTGATGAGATATTAAATAGTTTAATCAAAACACATCAATTAAATTTAAAAAAA
GAAAATGGCATTTACTATATTCATCAAGCCCAAGAACATAAACAACATACTACGGCACAATTAGTTAATGCTCTGCCTAA
ATTAATCACCAAAACAATCAAGTTACACTATTCCAAAGCCTCTGAAGTAATAGAGTCTTTGACAAAAGGGCAAGGCAACT
TGCTATCAGAGAATGGTTATCTTCATTTTGATGATCGCAGTAATAGTATTATCGTCAAAGACAGTGCCGCATCGGTTAAA
AACTTTACTCAACTTATTGAAACCCTAGATAAACCAACGGAACAAATTGCAATTGAAGCCCGAATTGTCACGATCAGCAG
TGAACATTTACAACAACTTGGCGTGCGTTGGGGCTTATTTTCCCCTAACGAAAATCACTACAAATTGGCAGGAAACTTAG
AGGGCAACGGATTAACTACCAACAACTTAAACGTAAATTTTCCAGTAAATCCGTCCGCTTCCGTTGCCTTACAAATTGCC
GCTATCAACAGCCGTGTACTTCATTTAGAACTCACCGCATTAGAGAGCGAAAATAACATTGAGATCATTGCAAGCCCTCG
CTTACTCACAACAGATAAAAAACCAGCGAGTATCAAACAAGGTACAGAGATTCCTTATGCAATGTACAGTAAGAAAAAGG
AAATCACCGATATTGAATTTCGTGAAGCGGTTTTGGGGCTAGAAGTCACGCCACATATTTCTAAACAAAATCAGATTTTG
TTAGATCTTGCCATTAGCCAAAATTCGCCAAATAACCAGATGAACAATATGATGGCAACGATTGATAAACAAGAAATTAA
TACACAAGTCCTTGCTAAACACGGCGAAACCATCGTATTAGGCGGTATTTTTCAACATCTGATCGCCAAAGGCGAAGATA
AAGTCCCGCTGTTAGGCAGTATCCCCGTGATTAAACGCTTATTTAGCCAAAACCGAGATAAAATCTCTAAACGGGAGCTG
GTTATTTTTGTTACGCCTTATATTGTAAAATCTGAAAAAATGGGAGCGGAAAAGCAGAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Glaesserella parasuis strain SC1401

96.905

100

0.969

  comE Haemophilus influenzae Rd KW20

52.706

100

0.533

  comE Haemophilus influenzae 86-028NP

51.765

100

0.524

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

37.44

98.571

0.369

  pilQ Vibrio cholerae strain A1552

37.44

98.571

0.369

  pilQ Vibrio campbellii strain DS40M4

36.019

100

0.362


Multiple sequence alignment