Detailed information    

in silico   Bioinformatically predicted

Overview


Name   pilC
Type   Machinery gene
Locus tag   A6039_RS03105
Genome accession   NZ_CP015427
Coordinates   599902..601095 (+)
Length   397 a.a.
NCBI ID   WP_010945037.1
UniProt ID   Q7VM72
Organism   [Haemophilus] ducreyi strain VAN4
Function   type IV pilus biogenesis and function (predicted from homology)
DNA binding and uptake
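
The coordinates and the two lengths reported for this record are consistent with each other, which a minimal Python sketch can confirm. The helper names are hypothetical (they are not part of the database), and the sketch assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate span."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Values taken from the Overview fields of this record.
cds_bp = span_length(599902, 601095)
print(cds_bp)                  # 1194
print(protein_length(cds_bp))  # 397
```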

Genomic Context


Location: 594902..606095
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  A6039_RS03080 (A6039_03090)   -   595601..596086 (+)   486   WP_010945031.1   dihydrofolate reductase   -
  A6039_RS03085 (A6039_03095)   -   596208..596423 (+)   216   WP_010945033.1   YgjV family protein   -
  A6039_RS03090 (A6039_03100)   radA   596490..597869 (-)   1380   WP_064085182.1   DNA repair protein RadA   -
  A6039_RS03095 (A6039_03105)   pilA   598063..598512 (+)   450   WP_010945035.1   prepilin peptidase-dependent pilin   Machinery gene
  A6039_RS03100 (A6039_03110)   pilB   598524..599918 (+)   1395   WP_010945036.1   GspE/PulE family protein   Machinery gene
  A6039_RS03105 (A6039_03115)   pilC   599902..601095 (+)   1194   WP_010945037.1   type II secretion system F family protein   Machinery gene
  A6039_RS03110 (A6039_03120)   -   601095..601787 (+)   693   WP_041603480.1   prepilin peptidase   -
  A6039_RS03115 (A6039_03125)   coaE   601805..602422 (+)   618   WP_010945039.1   dephospho-CoA kinase   -
  A6039_RS03120 (A6039_03130)   yacG   602425..602613 (+)   189   WP_010945040.1   DNA gyrase inhibitor YacG   -
  A6039_RS03125 (A6039_03135)   holA   602647..603672 (-)   1026   WP_010945041.1   DNA polymerase III subunit delta   -
  A6039_RS03130 (A6039_03140)   lptE   603678..604175 (-)   498   WP_010945042.1   LPS assembly lipoprotein LptE   -
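
The coordinates in this table show how tightly the pil genes are packed and how the displayed window relates to pilC. The following is a minimal Python sketch using only values from the table; the function names are hypothetical, coordinates are assumed to be inclusive and 1-based, and the 5000 bp flank is inferred from the Location line rather than documented by the database.

```python
def gap(prev_end: int, next_start: int) -> int:
    """Intergenic distance in bp; a negative value means the genes overlap."""
    return next_start - prev_end - 1


def flanking_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Window extending `flank` bp on each side of a gene (flank is an assumption)."""
    return start - flank, end + flank


# (name, start, end) for consecutive plus-strand genes around pilC.
genes = [
    ("pilA", 598063, 598512),
    ("pilB", 598524, 599918),
    ("pilC", 599902, 601095),
    ("A6039_RS03110", 601095, 601787),
]

for (name_a, _, end_a), (name_b, start_b, _) in zip(genes, genes[1:]):
    print(f"{name_a} -> {name_b}: {gap(end_a, start_b)} bp")
# pilA -> pilB: 11 bp
# pilB -> pilC: -17 bp
# pilC -> A6039_RS03110: -1 bp

print(flanking_window(599902, 601095))  # (594902, 606095)
```

Under these assumptions, pilB and pilC overlap by 17 bp and pilC shares its last base with the downstream prepilin peptidase gene.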

Sequence


Protein


Length: 397 a.a.        Molecular weight: 45721.33 Da        Isoelectric Point: 10.1310

>NTDB_id=179489 A6039_RS03105 WP_010945037.1 599902..601095(+) (pilC) [[Haemophilus] ducreyi strain VAN4]
MLKIYEFHWRAFNRFNQRQKGKSLAKSHDELEQRLLAKGYSNIHIQRNFILVRKPKSEQITQLINQLAMLLNASVPLKQS
LVMVLENVHNIKLYLWLSEIIMLLEAGHAFSASLTKMNVYLTKQEIQLIQMGEQSGRLAIMLDNIAQVRNQSQKLANKVK
KIMFYPAIILAVSISVLIGLLLFIVPQFEDLYRNKDQSLPFITQLLFQLSNFLQQYAFILFIGGLFSGIVSYFLAKKWLF
WRTLRSRLLNHMPGFQQIIKDARIIFFSQNLALMLNAHIHLDATLKAFLSEHRQDPVLHQEILSILSLLKQGYKFSEGLN
PTVFTSQVVQMMAIGEQSGNLAKMCMYISQLYQQKLDYQIDLFAQLLEPVLMVVIGIIVGTILIGIYLPIFDMGALV

Nucleotide


Length: 1194 bp

>NTDB_id=179489 A6039_RS03105 WP_010945037.1 599902..601095(+) (pilC) [[Haemophilus] ducreyi strain VAN4]
ATGCTGAAAATATATGAATTTCATTGGCGGGCTTTTAATCGTTTTAATCAACGGCAAAAAGGTAAATCTCTGGCCAAAAG
TCACGATGAATTAGAACAACGGCTATTAGCTAAAGGCTATTCAAATATCCATATTCAACGTAATTTTATTTTAGTTAGGA
AACCAAAAAGCGAACAAATCACTCAATTAATTAATCAGTTAGCTATGTTATTGAATGCCTCGGTTCCATTAAAACAATCG
CTGGTGATGGTATTAGAAAACGTACATAATATTAAATTATATTTATGGCTATCTGAAATTATTATGCTGTTAGAAGCGGG
GCATGCTTTTTCAGCCAGTCTCACTAAAATGAACGTTTATTTGACTAAGCAAGAAATTCAGTTAATTCAGATGGGAGAAC
AGAGTGGGCGTTTAGCCATTATGCTAGATAACATTGCACAAGTACGTAATCAATCACAGAAATTAGCTAATAAAGTAAAG
AAAATTATGTTTTATCCGGCGATTATTTTAGCCGTATCAATTAGTGTATTAATTGGATTGTTATTATTTATTGTGCCTCA
ATTTGAAGATCTTTATCGTAACAAAGATCAGTCACTGCCTTTTATTACCCAATTATTATTTCAGTTATCGAATTTTCTTC
AACAATATGCATTTATTTTATTCATCGGCGGCCTATTTAGCGGTATTGTTAGCTATTTTTTAGCGAAAAAGTGGCTTTTT
TGGCGTACCTTAAGATCGCGTTTATTAAATCATATGCCAGGGTTTCAGCAAATTATTAAAGATGCTCGAATTATTTTCTT
TAGTCAAAATCTTGCGTTAATGCTTAATGCACATATTCATTTAGATGCAACGTTAAAAGCATTTTTATCAGAACATCGCC
AAGATCCGGTATTACATCAGGAGATTTTGAGTATATTAAGTTTATTAAAGCAAGGTTATAAATTCTCTGAGGGGCTTAAT
CCAACCGTATTTACTAGCCAAGTAGTACAAATGATGGCAATTGGCGAACAAAGTGGTAATTTGGCTAAAATGTGTATGTA
TATTAGTCAACTGTATCAGCAAAAACTGGATTACCAAATTGATTTGTTTGCTCAATTACTTGAACCGGTTTTAATGGTAG
TGATTGGCATTATTGTGGGGACGATTTTGATTGGGATTTATTTGCCTATTTTTGATATGGGAGCATTGGTGTAA
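
The nucleotide record above encodes the protein record, and the relationship can be spot-checked with a short Python sketch. It builds the standard codon table from the conventional TCAG ordering and translates until the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch assumes an in-frame, unambiguous (A/C/G/T only) plus-strand sequence.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons and last five codons of the pilC record above.
print(translate("ATGCTGAAAATATATGAATTTCATTGGCGG"))  # MLKIYEFHWR
print(translate("GGAGCATTGGTGTAA"))                 # GALV
```

Both fragments agree with the start (MLKIYEFHWR) and end (GALV) of the protein sequence listed in this record.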


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source   ID
  AlphaFold DB   Q7VM72

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  pilC   Glaesserella parasuis strain SC1401   51.637   100   0.516
  pilC   Haemophilus influenzae Rd KW20   40.75   100   0.411
  pilC   Haemophilus influenzae 86-028NP   40.25   100   0.406


Multiple sequence alignment