Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comE
Type: Machinery gene
Locus tag: LQ983_RS04215
Genome accession: NZ_OV100759
Coordinates: 908592..910016 (+)
Length: 474 a.a.
NCBI ID: WP_230621762.1
UniProt ID: -
Organism: Aggregatibacter sp. Marseille-P9115
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
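The coordinates and protein length listed above can be cross-checked with a short calculation. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the coding sequence is a stop codon; the helper name `cds_lengths` is illustrative and not part of the database.

```python
def cds_lengths(start, end):
    """Return (nucleotide length, protein length) for an inclusive CDS span.

    Assumes the final codon is a stop codon and is not counted as a residue.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_len, nt_len // 3 - 1


nt_len, aa_len = cds_lengths(908592, 910016)
print(nt_len, aa_len)  # 1425 474
```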

Genomic Context


Location: 903592..915016
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
LQ983_RS04190 | - | 903637..906204 (-) | 2568 | WP_230621757.1 | penicillin-binding protein 1A | -
LQ983_RS04195 | - | 906315..907136 (+) | 822 | WP_230621758.1 | competence protein ComA | -
LQ983_RS04200 | - | 907133..907651 (+) | 519 | WP_230621759.1 | PilN domain-containing protein | -
LQ983_RS04205 | - | 907648..908175 (+) | 528 | WP_230621760.1 | competence protein | -
LQ983_RS04210 | - | 908186..908572 (+) | 387 | WP_230621761.1 | pilus assembly protein PilP | -
LQ983_RS04215 | comE | 908592..910016 (+) | 1425 | WP_230621762.1 | type IV pilus secretin PilQ | Machinery gene
LQ983_RS04220 | aroK | 910231..910758 (+) | 528 | WP_048750075.1 | shikimate kinase AroK | -
LQ983_RS04225 | aroB | 910792..911880 (+) | 1089 | WP_230621763.1 | 3-dehydroquinate synthase | -
LQ983_RS04230 | - | 911886..912752 (+) | 867 | WP_230621764.1 | Dam family site-specific DNA-(adenine-N6)-methyltransferase | -
LQ983_RS04235 | ssb | 912852..913328 (-) | 477 | WP_048750111.1 | single-stranded DNA-binding protein | Machinery gene
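The context window and the sizes in the table above can be checked against the coordinates. The sketch below is illustrative only: the numbers are copied from this record, the variable names are hypothetical, and the symmetric flank it reports is inferred from the numbers rather than stated by the source.

```python
# Coordinates copied from this record (1-based, inclusive).
GENE = (908592, 910016)     # comE
WINDOW = (903592, 915016)   # "Location" of the genomic context

# (locus tag, start, end, listed size in bp)
ROWS = [
    ("LQ983_RS04190", 903637, 906204, 2568),
    ("LQ983_RS04195", 906315, 907136, 822),
    ("LQ983_RS04200", 907133, 907651, 519),
    ("LQ983_RS04205", 907648, 908175, 528),
    ("LQ983_RS04210", 908186, 908572, 387),
    ("LQ983_RS04215", 908592, 910016, 1425),
    ("LQ983_RS04220", 910231, 910758, 528),
    ("LQ983_RS04225", 910792, 911880, 1089),
    ("LQ983_RS04230", 911886, 912752, 867),
    ("LQ983_RS04235", 912852, 913328, 477),
]

# Flanking distance on each side of the gene.
flank_left = GENE[0] - WINDOW[0]
flank_right = WINDOW[1] - GENE[1]

# Rows whose listed size disagrees with end - start + 1.
mismatches = [tag for tag, start, end, size in ROWS if end - start + 1 != size]

print(flank_left, flank_right, mismatches)
```

With these values both flanks come out to 5000 bp and no row is flagged, which suggests the window is the gene extended by 5 kb on each side.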

Sequence


Protein


Length: 474 a.a.; Molecular weight: 52013.78 Da; Isoelectric point: 7.2242

>NTDB_id=1151814 LQ983_RS04215 WP_230621762.1 908592..910016(+) (comE) [Aggregatibacter sp. Marseille-P9115]
MQSLFLKILKCGLFFGCFFTFTVHAEQHQTFSIQLKQAPLVPTLQQLALAENVNLLIDDELQGTLTLKLDNVNLDQLFRS
VAKIKQLDLWQEEGIYYLSKQDTAAKFAGNMTEPQAIAALPSEEETSLATATIKLHFAKASEVMKSLTGGNGSMLSPNGS
ITFDDRSNLLLIQDEPKSIANLKKLIKELDKPIEQIAIEARIVTINAESLKDLGVRWGIFNPTENAHKIAGSLAANGFDN
IANQLNVNLPTATTPAGSLALQVAKINGRLLDLELTALERENNVEIIASPRLLTTNKKSASIKQGTELPYIMVNEKSGTQ
SVEFREAVLGLEVTPHISLDKQILLDLIVSQNAPGSRVAYGLGEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKSLD
KVPLLGDIPGIKHLFSKESDRHQKRELVIFVTPHILRQGETLENLKSNQSTAKKNKNPGKSKTAQNTPKKTITR
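The FASTA header used for this record packs several fields into one line. The following is a minimal parsing sketch, assuming every header follows exactly the layout seen here (ID, locus tag, protein accession, coordinates with strand, gene name in parentheses, organism in brackets); the regular expression and the function name `parse_header` are illustrative and not an official format specification.

```python
import re

# Pattern inferred from the single header shown in this record.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) "
    r"\[(?P<organism>.+)\]$"
)


def parse_header(line):
    """Split a header line into a dict of fields; raise if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognized header layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=1151814 LQ983_RS04215 WP_230621762.1 "
    "908592..910016(+) (comE) [Aggregatibacter sp. Marseille-P9115]"
)
record = parse_header(header)
print(record["gene"], record["start"], record["end"], record["strand"])
```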

Nucleotide


Length: 1425 bp

>NTDB_id=1151814 LQ983_RS04215 WP_230621762.1 908592..910016(+) (comE) [Aggregatibacter sp. Marseille-P9115]
ATGCAATCGCTTTTTCTCAAGATCTTAAAGTGCGGTCTGTTTTTCGGATGTTTTTTCACATTCACCGTGCATGCCGAACA
ACATCAAACCTTTTCCATTCAACTCAAACAGGCGCCTTTGGTGCCGACCTTACAACAACTCGCGCTGGCAGAAAATGTTA
ACTTACTGATCGACGATGAATTGCAAGGCACACTGACATTAAAGTTAGATAACGTGAATTTAGATCAGCTCTTTCGTTCC
GTTGCCAAAATTAAACAATTGGATTTATGGCAGGAAGAGGGCATTTATTATTTAAGCAAGCAGGATACCGCTGCCAAATT
TGCCGGCAATATGACAGAACCTCAAGCCATTGCGGCACTGCCTTCGGAGGAAGAAACCTCGCTCGCGACAGCCACCATTA
AACTGCACTTTGCCAAAGCTTCTGAAGTAATGAAATCGCTTACCGGCGGTAATGGGTCGATGTTATCACCGAACGGTAGC
ATTACCTTTGATGACCGCAGCAATCTCTTACTGATTCAAGACGAACCGAAATCTATCGCCAACTTGAAGAAATTGATTAA
AGAATTGGATAAGCCCATTGAACAAATCGCCATCGAAGCACGTATTGTCACCATCAACGCCGAAAGTTTGAAAGATTTGG
GCGTGCGTTGGGGGATTTTTAACCCGACAGAAAATGCCCACAAAATCGCTGGCAGTCTCGCCGCTAACGGCTTTGACAAT
ATTGCGAATCAATTAAACGTCAACTTGCCCACCGCCACGACACCGGCCGGGTCTCTCGCGTTACAAGTGGCTAAAATCAA
CGGACGACTTTTAGACTTAGAATTGACCGCACTTGAACGGGAAAACAATGTGGAAATTATCGCTAGCCCACGTTTGCTTA
CCACCAACAAGAAAAGTGCCAGCATTAAGCAAGGGACGGAATTGCCTTATATCATGGTGAATGAAAAAAGTGGTACGCAA
AGCGTAGAGTTTCGCGAAGCCGTATTAGGGTTGGAAGTCACACCACATATTTCCTTAGATAAGCAAATTCTGTTGGATTT
AATTGTAAGCCAAAACGCACCGGGCAGCCGCGTGGCATACGGTTTAGGCGAGGTTGTTTCGATTGATAAACAAGAAATAA
ATACGCAAGTTTTCGCCAAGGACGGCGAAACCATCGTGCTTGGTGGCGTCTTTCATGACACCATTACTAAAAGCTTAGAC
AAAGTGCCATTACTGGGAGACATTCCCGGCATTAAGCATTTATTCAGCAAAGAAAGTGATCGCCACCAAAAGCGGGAATT
GGTGATCTTCGTCACGCCACATATTTTGCGCCAGGGTGAGACATTGGAAAACTTGAAGAGCAACCAGTCAACAGCGAAAA
AGAACAAGAATCCTGGGAAATCTAAGACGGCACAAAACACACCTAAAAAGACGATAACGCGGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comE | Haemophilus influenzae Rd KW20 | 69.845 | 95.148 | 0.665
comE | Haemophilus influenzae 86-028NP | 68.805 | 95.359 | 0.656
comE | Glaesserella parasuis strain SC1401 | 53.647 | 89.662 | 0.481
pilQ | Vibrio campbellii strain DS40M4 | 42.459 | 90.928 | 0.386
pilQ | Vibrio cholerae O1 biovar El Tor strain E7946 | 42.424 | 90.506 | 0.384
pilQ | Vibrio cholerae strain A1552 | 42.424 | 90.506 | 0.384
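The Ha-values listed for the similar proteins are numerically consistent with the product of the identity and coverage fractions. The page itself does not define the Ha-value, so the formula below is an inference from the listed numbers, and the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed definition: (identity / 100) * (coverage / 100), to 3 decimals."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# (protein, organism, identities %, coverage %, listed Ha-value)
HITS = [
    ("comE", "Haemophilus influenzae Rd KW20", 69.845, 95.148, 0.665),
    ("comE", "Haemophilus influenzae 86-028NP", 68.805, 95.359, 0.656),
    ("comE", "Glaesserella parasuis strain SC1401", 53.647, 89.662, 0.481),
    ("pilQ", "Vibrio campbellii strain DS40M4", 42.459, 90.928, 0.386),
    ("pilQ", "Vibrio cholerae O1 biovar El Tor strain E7946", 42.424, 90.506, 0.384),
    ("pilQ", "Vibrio cholerae strain A1552", 42.424, 90.506, 0.384),
]

for protein, organism, identity, coverage, listed in HITS:
    print(protein, organism, ha_value(identity, coverage), listed)
```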


Multiple sequence alignment