Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   A4G17_RS09945 Genome accession   NZ_CP015029
Coordinates   2079654..2080931 (+) Length   425 a.a.
NCBI ID   WP_123955744.1    Uniprot ID   A0AAE7C332
Organism   Frederiksenia canicola strain HPA 21     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2074654..2085931
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  A4G17_RS09920 (A4G17_09820) - 2074767..2077343 (-) 2577 WP_123955749.1 penicillin-binding protein 1A -
  A4G17_RS09925 (A4G17_09825) - 2077486..2078223 (+) 738 WP_123955748.1 hypothetical protein -
  A4G17_RS09930 (A4G17_09830) - 2078205..2078732 (+) 528 WP_123955747.1 hypothetical protein -
  A4G17_RS09935 (A4G17_09835) gspM 2078729..2079247 (+) 519 WP_123955746.1 type II secretion system protein GspM -
  A4G17_RS09940 (A4G17_09840) - 2079259..2079645 (+) 387 WP_123955745.1 pilus assembly protein PilP -
  A4G17_RS09945 (A4G17_09845) comE 2079654..2080931 (+) 1278 WP_123955744.1 type IV pilus secretin PilQ Machinery gene
  A4G17_RS09950 (A4G17_09850) nusB 2081000..2081443 (+) 444 WP_123955743.1 transcription antitermination factor NusB -
  A4G17_RS09955 (A4G17_09855) thiL 2081452..2082417 (+) 966 WP_123955742.1 thiamine-phosphate kinase -
  A4G17_RS09960 (A4G17_09860) - 2082414..2082887 (+) 474 WP_123955741.1 phosphatidylglycerophosphatase A -
  A4G17_RS09965 (A4G17_09865) - 2082889..2083509 (+) 621 WP_123955740.1 LysE family transporter -
  A4G17_RS10325 (A4G17_09870) - 2083596..2084549 (-) 954 WP_236941010.1 pyruvate formate lyase-activating protein -
  A4G17_RS09975 (A4G17_09875) - 2084717..2084932 (+) 216 WP_123955739.1 YdcH family protein -
  A4G17_RS09980 (A4G17_09880) cysZ 2085006..2085833 (-) 828 WP_123955738.1 sulfate transporter CysZ -

Sequence


Protein


Download         Length: 425 a.a.        Molecular weight: 47571.24 Da        Isoelectric Point: 8.9515

>NTDB_id=176290 A4G17_RS09945 WP_123955744.1 2079654..2080931(+) (comE) [Frederiksenia canicola strain HPA 21]
MRYLLLSLLFINQSVFANTLSISLKEAPTKSILAYLAEENNKNIVLVEDIQNRSTLRIESRSFDEIIKSIAEINQLSVKI
ENNIYYIHKKEQNKGEEVNKNPTLLTQNIKLHYAKASEIIDSITKGNGNLLSENGYLHFDERSNSIIIKDTASSIKNITT
LIKHLDLPTEQIAIEARIVTIGSENLKELGVRWGMFNRNEHSHRFAGRLEGNGFETNNLNVNFPVLNNSASAVLQIASIN
GRVLDLELSALEQENNVEIIASPRLLTTNKKSASIQQGTEIPYTIYNKKSETFDFEFKDAVLGLDVTPQISADNQILLDL
IVTQNSPNGQNGVSGLTTIDKQELRTQVFAKHGETIVLGGVFQHLKSKGEDKVPILGSIPVVKQLFSQSRNKISKRELVI
FVTPYIVKSSPMSTTQEKVRKSKQN

Nucleotide


Download         Length: 1278 bp        

>NTDB_id=176290 A4G17_RS09945 WP_123955744.1 2079654..2080931(+) (comE) [Frederiksenia canicola strain HPA 21]
ATGCGTTATCTACTTTTAAGTCTTCTTTTTATCAATCAATCTGTTTTTGCTAATACCTTATCTATTTCATTAAAAGAGGC
ACCGACAAAATCAATTTTAGCTTATTTAGCTGAAGAAAATAATAAAAATATTGTATTAGTGGAAGATATTCAAAACCGTT
CAACACTAAGAATAGAAAGTCGCTCCTTTGATGAAATCATCAAAAGCATTGCAGAAATCAATCAGCTTTCTGTTAAAATA
GAAAATAATATCTATTACATTCATAAGAAAGAACAAAATAAAGGAGAAGAAGTTAATAAAAACCCAACGCTCCTTACTCA
AAATATAAAGCTGCATTATGCTAAAGCCTCCGAAATCATTGACTCCATTACAAAAGGAAATGGAAATTTATTATCTGAAA
ATGGATATTTACATTTTGATGAGCGTAGCAATAGCATTATTATTAAAGACACGGCTTCCTCGATAAAAAATATTACGACG
CTAATCAAACATCTGGATCTGCCAACAGAACAAATTGCGATTGAAGCTCGAATCGTCACGATTGGCAGTGAGAATCTCAA
AGAACTTGGGGTTCGTTGGGGGATGTTTAATCGAAATGAACATAGCCATCGTTTTGCGGGGCGTTTAGAAGGAAATGGCT
TTGAGACCAACAACTTGAATGTTAATTTCCCTGTATTAAATAATTCTGCATCTGCCGTATTACAAATTGCGAGTATTAAT
GGGCGTGTTTTAGATTTAGAACTGAGTGCTTTAGAGCAGGAAAATAATGTGGAAATCATCGCCAGCCCACGATTACTCAC
AACAAATAAAAAAAGTGCAAGTATTCAGCAAGGCACAGAAATTCCTTACACCATTTACAATAAGAAATCTGAGACATTTG
ATTTTGAGTTTAAAGATGCTGTACTTGGATTAGATGTTACCCCCCAAATCTCTGCCGATAATCAAATTTTGCTTGACTTA
ATCGTCACTCAAAACTCGCCGAATGGACAAAATGGCGTGAGCGGTTTAACTACCATTGACAAACAAGAGCTACGCACACA
AGTCTTTGCTAAACATGGTGAAACCATTGTCCTTGGCGGCGTTTTTCAGCACTTAAAATCAAAAGGTGAAGATAAAGTGC
CTATTTTAGGCAGCATTCCTGTCGTGAAGCAGCTATTTAGTCAAAGCAGAAATAAAATCTCAAAACGAGAGCTCGTCATT
TTTGTCACGCCTTATATTGTGAAGTCATCACCGATGTCAACCACACAAGAGAAAGTGAGAAAATCCAAACAAAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Glaesserella parasuis strain SC1401

64.303

99.529

0.64

  comE Haemophilus influenzae Rd KW20

51.79

98.588

0.511

  comE Haemophilus influenzae 86-028NP

51.551

98.588

0.508

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

39.952

97.176

0.388

  pilQ Vibrio cholerae strain A1552

39.952

97.176

0.388

  pilQ Vibrio campbellii strain DS40M4

39.007

99.529

0.388


Multiple sequence alignment