Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   SOPEG_RS30490 Genome accession   NZ_CP006568
Coordinates   251275..251844 (-) Length   189 a.a.
NCBI ID   WP_335334084.1    Uniprot ID   -
Organism   Candidatus Sodalis pierantonius str. SOPE     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 246275..256844
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SOPEG_RS01385 (SOPEG_0270) rpe 246527..247207 (-) 681 WP_025244035.1 ribulose-phosphate 3-epimerase -
  SOPEG_RS01390 (SOPEG_0271) dam 247255..248067 (-) 813 WP_025244036.1 adenine-specific DNA-methyltransferase -
  SOPEG_RS01395 (SOPEG_0272) - 248151..249188 (-) 1038 WP_025244037.1 SPOR domain-containing protein -
  SOPEG_RS01400 (SOPEG_0273) aroB 249293..250378 (-) 1086 WP_025244038.1 3-dehydroquinate synthase -
  SOPEG_RS01405 (SOPEG_0274) aroK 250419..250940 (-) 522 WP_025244039.1 shikimate kinase AroK -
  SOPEG_RS30490 comE 251275..251844 (-) 570 WP_335334084.1 hypothetical protein Machinery gene
  SOPEG_RS30495 - 251897..252115 (-) 219 WP_335334066.1 secretin N-terminal domain-containing protein -
  SOPEG_RS30500 - 252157..252399 (-) 243 WP_335334067.1 secretin and TonB N-terminal domain-containing protein -
  SOPEG_RS01420 (SOPEG_ps0277) - 253171..253707 (-) 537 WP_025244041.1 PilN domain-containing protein -
  SOPEG_RS01425 (SOPEG_ps0278) - 253795..254124 (-) 330 WP_025244042.1 hypothetical protein -
  SOPEG_RS01430 (SOPEG_ps0280) mrcA 254244..256746 (+) 2503 Protein_294 peptidoglycan glycosyltransferase/peptidoglycan DD-transpeptidase MrcA -

Sequence


Protein


Download         Length: 189 a.a.        Molecular weight: 20971.28 Da        Isoelectric Point: 10.4016

>NTDB_id=113133 SOPEG_RS30490 WP_335334084.1 251275..251844(-) (comE) [Candidatus Sodalis pierantonius str. SOPE]
MPVGGHFARIGLPLARLNGRMLELELTALEQENKVDILASPRLYTAHQQTASIKQGTQIPYPMTSGHGKHPSIQFKEAVL
CMEVTPRILRNGRITLDLRLSQNVPGSIIKQGESQSVTIDTEEIKTQVTIADGETIVLGGIFQHQKQRSNDRVPLLADIP
LVGALFKRQRANHKQRELVIFITPTRISG

Nucleotide


Download         Length: 570 bp        

>NTDB_id=113133 SOPEG_RS30490 WP_335334084.1 251275..251844(-) (comE) [Candidatus Sodalis pierantonius str. SOPE]
ATGCCGGTTGGCGGGCATTTTGCCCGCATCGGGCTGCCGCTGGCGCGTCTCAATGGCCGGATGCTGGAGCTTGAGTTGAC
CGCGCTGGAGCAGGAAAACAAGGTGGATATTCTGGCCAGCCCGCGTCTTTATACCGCCCATCAGCAGACCGCGAGCATCA
AGCAAGGCACGCAAATCCCGTATCCGATGACCAGCGGCCACGGCAAACATCCCTCAATTCAGTTCAAAGAGGCGGTGTTG
TGCATGGAGGTGACGCCGCGTATTCTGCGCAACGGGCGCATCACGCTGGATTTGCGGCTGAGCCAGAATGTTCCGGGCAG
TATCATCAAGCAGGGAGAAAGCCAGAGCGTGACCATCGATACGGAAGAGATTAAAACTCAGGTAACGATTGCAGACGGTG
AGACCATCGTGCTCGGCGGCATATTCCAGCATCAAAAGCAGCGGAGCAACGATCGGGTGCCGCTATTGGCCGATATCCCG
CTAGTAGGGGCGTTATTTAAGCGCCAACGAGCTAATCATAAACAACGTGAACTAGTTATTTTTATTACACCCACCCGCAT
ATCAGGATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Glaesserella parasuis strain SC1401

53.107

93.651

0.497

  comE Haemophilus influenzae 86-028NP

53.143

92.593

0.492

  comE Haemophilus influenzae Rd KW20

53.143

92.593

0.492

  pilQ Vibrio campbellii strain DS40M4

49.432

93.122

0.46

  pilQ Vibrio cholerae strain A1552

46.409

95.767

0.444

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

46.409

95.767

0.444

  pilQ Legionella pneumophila strain ERS1305867

40.642

98.942

0.402

  pilQ Legionella pneumophila strain Lp02

40.642

98.942

0.402


Multiple sequence alignment