Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   NYR87_RS05810 Genome accession   NZ_CP103831
Coordinates   1291415..1292710 (-) Length   431 a.a.
NCBI ID   WP_279480071.1    Uniprot ID   -
Organism   Actinobacillus equuli subsp. haemolyticus strain 1812     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1293118..1307285 1291415..1292710 flank 408


Gene organization within MGE regions


Location: 1291415..1307285
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NYR87_RS05810 (NYR87_05805) comE 1291415..1292710 (-) 1296 WP_279480071.1 type IV pilus secretin PilQ Machinery gene
  NYR87_RS05815 (NYR87_05810) - 1292721..1293125 (-) 405 WP_279436619.1 hypothetical protein -
  NYR87_RS05820 (NYR87_05815) - 1293118..1293645 (-) 528 WP_279480074.1 chromosome segregation protein -
  NYR87_RS05825 (NYR87_05820) - 1293648..1294169 (-) 522 WP_279480076.1 hypothetical protein -
  NYR87_RS05830 (NYR87_05825) - 1294151..1294837 (-) 687 WP_279480077.1 pilus assembly protein PilM -
  NYR87_RS05835 (NYR87_05830) - 1294976..1297561 (+) 2586 WP_279480078.1 penicillin-binding protein 1A -
  NYR87_RS05840 (NYR87_05835) aroK 1297825..1298346 (+) 522 WP_279480082.1 shikimate kinase AroK -
  NYR87_RS05845 (NYR87_05840) aroB 1298363..1299451 (+) 1089 WP_279480084.1 3-dehydroquinate synthase -
  NYR87_RS05850 (NYR87_05845) - 1299455..1300291 (+) 837 WP_279480085.1 Dam family site-specific DNA-(adenine-N6)-methyltransferase -
  NYR87_RS05855 (NYR87_05850) - 1300361..1301719 (+) 1359 WP_279480087.1 sodium-dependent transporter -
  NYR87_RS05860 (NYR87_05855) pgi 1301847..1303487 (-) 1641 WP_279480089.1 glucose-6-phosphate isomerase -
  NYR87_RS05865 (NYR87_05860) - 1303526..1303777 (-) 252 WP_039197458.1 accessory factor UbiK family protein -
  NYR87_RS05895 (NYR87_05890) aroE 1304622..1305446 (-) 825 WP_279480091.1 shikimate dehydrogenase -
  NYR87_RS05900 (NYR87_05895) - 1305452..1306006 (-) 555 WP_279480093.1 Sua5/YciO/YrdC/YwlC family protein -
  NYR87_RS05905 (NYR87_05900) - 1305999..1306532 (-) 534 WP_279480095.1 topoisomerase DNA-binding C4 zinc finger domain-containing protein -
  NYR87_RS05910 (NYR87_05905) yqfB 1306529..1306837 (-) 309 WP_279480099.1 N(4)-acetylcytidine aminohydrolase -
  NYR87_RS05915 (NYR87_05910) recX 1306830..1307285 (-) 456 WP_279438030.1 recombination regulator RecX -

Sequence


Protein


Download         Length: 431 a.a.        Molecular weight: 47801.78 Da        Isoelectric Point: 9.2286

>NTDB_id=725778 NYR87_RS05810 WP_279480071.1 1291415..1292710(-) (comE) [Actinobacillus equuli subsp. haemolyticus strain 1812]
MKKAILLLFFIFSSVTVYAFSISLKNAPTAEILRYLAEEHGKNIVLSDNIETNTTLRIENSDFDSVLKSITRANKLTTAY
ENQIYFIGHKKDEKAATIGVNSELLKPKLITKTIKLDYAKAAEVIESLTKGSGNFLSENGYLHFDDRSNSLIIKDSPESM
KNIVKLIRNLDKPTEQIAIEARIVTISSENLQELGVRWGMFSPTNGHHKVAGSLEANGLPNTNHLNVNFPVNNAASIALQ
VAKINGRVLDLELTALEQENDVEIIASPRLLTTNKKPASIKQGTEIPYVLYNRKDEVKNIEFKEAVLGLQVTPHISNDNQ
ILLDLVVTQNSPNSTSSTVHGLVTIDKQELNTQVFAKHGETIVLGGIFQHLTAKGEDRMPILGSIPVIKKLFSHSSDRSS
KRELVIFVTPYIVKSEKQQISSHSSQKLPPK

Nucleotide


Download         Length: 1296 bp        

>NTDB_id=725778 NYR87_RS05810 WP_279480071.1 1291415..1292710(-) (comE) [Actinobacillus equuli subsp. haemolyticus strain 1812]
ATGAAGAAAGCGATTTTGTTACTCTTTTTTATATTTTCATCCGTTACTGTGTATGCATTTTCTATCTCGTTAAAAAATGC
ACCGACCGCAGAAATTCTGCGTTATTTAGCGGAAGAGCACGGAAAAAATATTGTGCTAAGCGACAATATTGAAACGAATA
CCACATTAAGAATTGAAAATAGTGATTTTGATAGTGTTTTAAAAAGTATCACTCGAGCGAATAAATTGACGACAGCATAT
GAGAACCAAATCTATTTTATTGGCCATAAAAAAGATGAAAAGGCTGCTACTATAGGCGTAAATTCTGAGTTGTTAAAGCC
CAAGCTCATTACTAAAACCATCAAATTAGATTATGCCAAAGCGGCGGAAGTGATTGAATCTTTAACCAAAGGCAGCGGAA
ACTTTTTATCGGAAAACGGCTATCTACATTTTGATGATCGTAGTAATAGTTTGATAATTAAAGATAGCCCAGAATCGATG
AAAAATATCGTGAAATTAATTAGAAATCTGGATAAACCGACTGAGCAGATTGCGATTGAAGCAAGGATAGTCACAATAAG
TAGTGAAAATTTGCAAGAGCTTGGGGTACGCTGGGGAATGTTTTCTCCGACAAACGGACATCATAAAGTCGCTGGTTCGC
TTGAAGCAAATGGACTACCGAATACTAACCATTTAAACGTAAATTTTCCGGTAAATAATGCTGCATCTATTGCGCTACAA
GTGGCAAAAATTAATGGACGAGTGCTTGATTTGGAATTAACCGCTTTAGAGCAAGAAAATGATGTGGAAATTATTGCCAG
CCCTCGTTTACTGACCACTAATAAGAAACCGGCGAGTATTAAGCAAGGGACTGAAATTCCGTATGTGCTTTATAACCGTA
AAGATGAAGTGAAAAATATCGAATTTAAAGAAGCCGTTTTAGGGCTACAGGTCACGCCACATATTTCAAATGATAATCAA
ATTTTGCTTGATTTGGTGGTGACACAAAATTCGCCGAATTCAACCAGTTCGACAGTTCATGGTTTAGTAACGATTGATAA
ACAGGAATTAAATACGCAAGTATTCGCTAAGCATGGTGAAACTATTGTGCTAGGCGGTATTTTTCAGCATTTAACCGCAA
AAGGTGAGGACAGAATGCCGATTTTAGGTTCAATTCCGGTCATTAAAAAGTTATTTAGCCATTCCAGTGATAGGAGCAGT
AAGCGCGAATTAGTTATTTTCGTTACACCTTATATTGTTAAAAGTGAAAAACAGCAAATTTCCTCGCATTCTTCACAGAA
ATTACCGCCAAAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Glaesserella parasuis strain SC1401

66.427

96.752

0.643

  comE Haemophilus influenzae Rd KW20

56.25

92.807

0.522

  comE Haemophilus influenzae 86-028NP

55.86

93.039

0.52

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

43.237

96.056

0.415

  pilQ Vibrio cholerae strain A1552

43.237

96.056

0.415

  pilQ Vibrio campbellii strain DS40M4

41.943

97.912

0.411