Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   ASUC_RS03570 Genome accession   NC_009655
Coordinates   747544..748992 (-) Length   482 a.a.
NCBI ID   WP_012072441.1    Uniprot ID   A6VM64
Organism   Actinobacillus succinogenes 130Z     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 742544..753992
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ASUC_RS03545 (Asuc_0683) - 743202..743801 (+) 600 WP_012072436.1 beta-phosphoglucomutase family hydrolase -
  ASUC_RS03550 (Asuc_0684) - 743801..744406 (+) 606 WP_012072437.1 sugar O-acetyltransferase -
  ASUC_RS03555 (Asuc_0685) luxS 744648..745154 (+) 507 WP_012072438.1 S-ribosylhomocysteine lyase -
  ASUC_RS03560 (Asuc_0686) purT 745166..746347 (+) 1182 WP_012072439.1 formate-dependent phosphoribosylglycinamide formyltransferase -
  ASUC_RS03565 (Asuc_0687) pfkA 746413..747378 (-) 966 WP_012072440.1 6-phosphofructokinase -
  ASUC_RS03570 (Asuc_0688) comE 747544..748992 (-) 1449 WP_012072441.1 type IV pilus secretin PilQ Machinery gene
  ASUC_RS03575 (Asuc_0689) - 749002..749382 (-) 381 WP_012072442.1 pilus assembly protein PilP -
  ASUC_RS10975 (Asuc_0690) - 749385..749978 (-) 594 WP_012072443.1 hypothetical protein -
  ASUC_RS03585 (Asuc_0691) - 749975..750481 (-) 507 WP_012072444.1 fimbrial assembly family protein -
  ASUC_RS03590 (Asuc_0692) pilM 750487..751323 (-) 837 WP_012072445.1 pilus assembly protein PilM -

Sequence


Protein


Download         Length: 482 a.a.        Molecular weight: 53129.75 Da        Isoelectric Point: 7.5873

>NTDB_id=28520 ASUC_RS03570 WP_012072441.1 747544..748992(-) (comE) [Actinobacillus succinogenes 130Z]
MKHLNNFTTKCGLFLILFLVSTQAQSADNHVFSIRLQQAPLVATLQQLALELDTNLIIDDELEGTLSLKLDNTDLDQLLR
SVAKIKRLDFWQENGIYYINRKNSSSVPDEAFNIAEVADIPVAPAEPKPETATVKLHYAKASEIMKSLTAGNGALVSDTG
RLTFDDRSNRLIIQDNRQSIRNIKKLIAELDKPIEQIAIEARIVTMNDESLKELGVRWGMFEPTAGAHKVSGSLAANGFT
DISNNLNVNFATATMPAGSVALQVAKINGRLLDLELTALEQEKNVEIIASPRLLTTNKKAASIKQGTEIPYVVTNGKNDT
QSVEFREAVLGLEVTPHISQNNAILMDLTVSQNSPGNRVSYGENSEVVSIDKQEIKTQVFAKDGETIVLGGVFHDTVTKG
SNKVPVLGDIPVIKHLFGNHSERRQKRELVIFVTPRILKNGETLEQLRHKDGLQNRKLLPVRDKIPADSQLRKNIAKSAV
GK

Nucleotide


Download         Length: 1449 bp        

>NTDB_id=28520 ASUC_RS03570 WP_012072441.1 747544..748992(-) (comE) [Actinobacillus succinogenes 130Z]
ATGAAACACTTGAACAATTTTACGACAAAGTGCGGTCTGTTTTTAATCCTTTTTTTGGTTTCGACTCAGGCTCAATCCGC
GGACAACCACGTATTTTCCATCCGTTTGCAGCAAGCCCCTTTAGTGGCGACGTTACAACAACTGGCACTGGAACTGGATA
CCAATTTAATCATTGATGACGAATTGGAAGGTACGCTATCGCTGAAATTGGATAATACGGATTTAGATCAGTTATTGCGT
TCGGTAGCGAAAATCAAACGATTGGATTTTTGGCAGGAAAACGGCATTTACTATATCAACCGCAAAAATTCATCTTCTGT
GCCGGACGAGGCCTTCAATATAGCGGAAGTTGCCGATATTCCCGTGGCGCCGGCAGAACCTAAACCGGAAACCGCCACCG
TGAAGCTGCATTATGCCAAAGCGTCGGAAATAATGAAATCGCTCACGGCCGGCAACGGTGCGCTAGTCAGCGACACGGGA
CGGCTGACGTTTGATGATCGCAGTAACCGACTGATCATTCAGGACAACAGACAGTCCATTCGTAATATTAAAAAACTTAT
CGCCGAATTAGACAAACCTATCGAGCAAATCGCCATTGAAGCGCGCATCGTCACTATGAATGATGAAAGCCTGAAAGAAC
TCGGGGTACGCTGGGGCATGTTCGAACCGACGGCTGGCGCACATAAAGTCAGCGGCAGTTTGGCGGCCAACGGATTCACC
GACATCTCGAATAATCTCAATGTTAATTTTGCTACCGCGACAATGCCGGCCGGTTCCGTCGCATTACAAGTGGCTAAAAT
CAACGGCCGTCTGTTAGATCTTGAATTAACCGCCCTGGAACAGGAAAAAAATGTGGAAATCATTGCCAGTCCGCGCTTGC
TCACCACCAACAAAAAAGCGGCCAGTATCAAACAAGGAACGGAGATTCCTTATGTAGTCACCAACGGCAAAAACGACACA
CAATCGGTAGAATTCCGTGAAGCGGTATTAGGGCTGGAAGTCACGCCGCATATTTCACAAAACAACGCAATTTTAATGGA
TTTAACCGTCAGCCAAAATTCTCCCGGAAACCGCGTTTCATACGGAGAAAACAGCGAAGTAGTCTCCATTGATAAACAGG
AAATTAAAACTCAAGTCTTCGCTAAAGACGGCGAGACCATTGTACTTGGCGGCGTATTTCACGACACCGTAACCAAAGGC
AGTAATAAAGTACCCGTTCTCGGCGATATTCCCGTCATTAAACATTTATTCGGCAATCACAGCGAACGCCGCCAGAAACG
GGAATTGGTTATTTTCGTCACACCGCGTATTTTGAAAAACGGCGAAACCTTAGAACAATTAAGGCATAAAGACGGCTTGC
AAAACCGAAAATTGTTACCCGTCCGCGACAAAATACCAGCGGATTCGCAACTAAGAAAAAACATTGCCAAAAGTGCGGTC
GGAAAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A6VM64

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae Rd KW20

66.444

93.361

0.62

  comE Haemophilus influenzae 86-028NP

65.487

93.776

0.614

  comE Glaesserella parasuis strain SC1401

51.991

88.589

0.461


Multiple sequence alignment