Detailed information    

In silico: bioinformatically predicted

Overview


Name: comE
Type: Machinery gene
Locus tag: F542_RS02145
Genome accession: NZ_CP006954
Coordinates: 440594..441853 (-)
Length: 419 a.a.
NCBI ID: WP_025266805.1
UniProt ID: -
Organism: Bibersteinia trehalosi USDA-ARS-USMARC-188
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
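
The coordinates, gene size, and protein length reported for this entry agree with one another, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the coding sequence is a stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # subtract the stop codon


gene_bp = span_length(440594, 441853)
print(gene_bp, protein_length(gene_bp))  # 1260 419
```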

Genomic Context


Location: 435594..446853
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
F542_RS02120 (F542_4410) | modF | 436535..437998 (-) | 1464 | WP_025266801.1 | molybdate ABC transporter ATP-binding protein ModF | -
F542_RS02125 (F542_4420) | - | 437999..438625 (-) | 627 | WP_025266802.1 | LysE family transporter | -
F542_RS02130 (F542_4430) | - | 438647..439120 (-) | 474 | WP_025266803.1 | phosphatidylglycerophosphatase A | -
F542_RS02135 (F542_4440) | thiL | 439123..440091 (-) | 969 | WP_025266804.1 | thiamine-phosphate kinase | -
F542_RS02140 (F542_4450) | nusB | 440107..440532 (-) | 426 | WP_015433225.1 | transcription antitermination factor NusB | -
F542_RS02145 (F542_4460) | comE | 440594..441853 (-) | 1260 | WP_025266805.1 | type IV pilus secretin PilQ | Machinery gene
F542_RS02150 (F542_4470) | - | 441862..442281 (-) | 420 | WP_025266806.1 | hypothetical protein | -
F542_RS02155 (F542_4480) | - | 442290..442805 (-) | 516 | WP_015433222.1 | hypothetical protein | -
F542_RS10735 (F542_4490) | - | 442805..443332 (-) | 528 | WP_015433221.1 | hypothetical protein | -
F542_RS02165 (F542_4500) | - | 443314..443949 (-) | 636 | WP_025266807.1 | hypothetical protein | -
F542_RS02170 (F542_4510) | - | 444115..446727 (+) | 2613 | WP_025266808.1 | penicillin-binding protein 1A | -
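
The Location window shown for this context matches the comE coordinates extended by 5,000 bp on each side, and the listed coordinates are enough to work out the spacing between neighbouring genes. The sketch below illustrates both calculations. The 5,000 bp flank is inferred from the numbers in this record, not from any documented setting, and the helper names are hypothetical.

```python
FLANK_BP = 5000  # inferred from this record, not a documented parameter


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend a 1-based inclusive range by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def gap_bp(left_end: int, right_start: int) -> int:
    """Bases between two features; a negative value means they overlap."""
    return right_start - left_end - 1


# (name, start, end) taken from the table above, sorted by start coordinate
genes = [
    ("nusB", 440107, 440532),
    ("comE", 440594, 441853),
    ("F542_RS02150", 441862, 442281),
]

print(context_window(440594, 441853))  # (435594, 446853)
for (name_a, _, end_a), (name_b, start_b, _) in zip(genes, genes[1:]):
    print(name_a, name_b, gap_bp(end_a, start_b))
```

With these inputs, comE sits 61 bp from nusB and 8 bp from F542_RS02150.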

Sequence


Protein


Length: 419 a.a.
Molecular weight: 46771.81 Da
Isoelectric point: 8.3598

>NTDB_id=115194 F542_RS02145 WP_025266805.1 440594..441853(-) (comE) [Bibersteinia trehalosi USDA-ARS-USMARC-188]
MRYILLLFSTMISFTMAQTISLNLKNAPTAEIISYLAEESGKNVVLSDNLSDTKSLRLENFTFEQILQSLAKLHQFSLKQ
EKGIYYISEKPLEPAKANPITSITPKMPKLITKTIKLHYAKASEVIHSLTQGSGTMLAENGYIHFDERSNSVIIKDTATS
IKNLRALIAQLDQPTEQIAIEARIVTINSENLQELGVRWGMFSPNTHHHRFSGNLEGSGLPMNNLNVNFPVNNAASMVLQ
VASINSRVLDLELTALERENSVEIIASPRLLTTNKKSANIKQGTEIPYAVYSQKDELRNIEFREAVLGLEVTPHLSKNNQ
ILLDLVVSQNSPNSQTIGSDMVTIDKQEINTQVFAKHGETIVLGGIFQHLIAKGEDRVPILGSIPVIKRLFSQSRDKIAK
RELVIFVTPYIVKSERIEN

Nucleotide


Length: 1260 bp

>NTDB_id=115194 F542_RS02145 WP_025266805.1 440594..441853(-) (comE) [Bibersteinia trehalosi USDA-ARS-USMARC-188]
ATGCGTTATATTCTTTTGTTATTTAGCACCATGATTTCCTTTACTATGGCACAAACAATCAGTTTAAATTTAAAAAATGC
GCCTACCGCAGAAATCATTAGTTATCTAGCGGAAGAAAGTGGCAAAAATGTAGTGCTATCGGATAATTTAAGTGACACAA
AATCACTTAGATTAGAAAATTTCACATTCGAGCAAATTCTGCAAAGTTTAGCCAAATTACATCAATTTTCACTAAAACAG
GAAAAGGGCATTTATTATATTAGTGAAAAACCATTAGAACCTGCGAAAGCAAATCCTATTACTAGCATCACTCCTAAAAT
GCCCAAACTCATCACTAAAACGATTAAATTGCATTATGCTAAAGCTTCAGAAGTCATTCATTCGCTCACACAAGGTAGTG
GCACAATGCTTGCTGAAAATGGCTATATTCACTTTGATGAACGCAGTAACAGTGTGATTATCAAAGACACCGCCACATCA
ATTAAAAATCTTCGAGCTTTAATTGCACAACTTGACCAACCAACAGAACAAATTGCAATTGAAGCTCGTATCGTTACCAT
CAATAGTGAAAATTTACAAGAACTGGGCGTTCGTTGGGGAATGTTTTCGCCTAATACTCATCATCATAGATTCTCAGGAA
ATCTTGAAGGTAGTGGTTTGCCAATGAATAACTTAAATGTCAATTTTCCTGTTAATAATGCCGCTTCGATGGTGCTTCAA
GTTGCAAGTATCAATAGCCGTGTTTTAGATTTAGAGCTCACCGCGCTTGAACGAGAAAATAGCGTTGAAATTATTGCCAG
CCCAAGATTACTTACAACGAACAAAAAAAGTGCCAACATTAAACAAGGCACAGAGATCCCTTATGCAGTCTATAGTCAAA
AAGACGAACTGAGAAATATCGAATTTCGGGAAGCGGTTTTAGGCTTGGAAGTGACACCTCATCTTTCTAAGAATAACCAA
ATTCTGCTGGATTTAGTCGTGAGCCAAAATTCACCCAATAGCCAGACAATAGGCAGTGATATGGTAACAATTGATAAACA
AGAAATTAATACTCAGGTATTTGCAAAACATGGAGAAACCATTGTTTTAGGTGGAATTTTCCAGCATCTCATCGCAAAAG
GTGAAGATCGCGTACCGATCTTAGGTTCAATTCCAGTGATCAAGCGCCTATTTAGCCAAAGTCGCGATAAAATCGCTAAA
CGTGAATTAGTCATTTTTGTCACCCCTTATATTGTGAAATCGGAGAGAATTGAGAATTGA
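
The nucleotide record above is given in coding orientation (it begins with ATG and ends with TGA) even though the gene lies on the minus strand, so it can be translated directly to compare against the protein record. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments; `translate` is an illustrative helper, not a database tool.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 24 nucleotides of the comE record above
print(translate("ATGCGTTATATTCTTTTGTTATTTAGCACC"))  # MRYILLLFST
print(translate("AAATCGGAGAGAATTGAGAATTGA"))        # KSERIEN
```

Both fragments match the start and end of the protein sequence listed for this entry.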


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comE | Glaesserella parasuis strain SC1401 | 70.264 | 99.523 | 0.699
comE | Haemophilus influenzae Rd KW20 | 53.428 | 100 | 0.539
comE | Haemophilus influenzae 86-028NP | 52.482 | 100 | 0.53
pilQ | Vibrio cholerae O1 biovar El Tor strain E7946 | 39.183 | 99.284 | 0.389
pilQ | Vibrio cholerae strain A1552 | 39.183 | 99.284 | 0.389
pilQ | Vibrio campbellii strain DS40M4 | 37.589 | 100 | 0.379

