Detailed information    

In silico: bioinformatically predicted

Overview


Name: pilC
Type: Machinery gene
Locus tag: A6044_RS03765
Genome accession: NZ_CP015432
Coordinates: 704591..705784 (+)
Length: 397 a.a.
NCBI ID: WP_010945037.1
UniProt ID: Q7VM72
Organism: [Haemophilus] ducreyi strain GHA5
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
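
The coordinates and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of the database.

```python
def cds_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


bp = cds_length_bp(704591, 705784)
aa = protein_length_aa(bp)
print(bp, aa)  # 1194 397
```

Both values agree with the lengths reported for the nucleotide and protein records of this entry.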

Genomic Context


Location: 699591..710784
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  A6044_RS03740 (A6044_03750) - 700290..700775 (+) 486 WP_010945031.1 dihydrofolate reductase -
  A6044_RS03745 (A6044_03755) - 700897..701112 (+) 216 WP_010945033.1 YgjV family protein -
  A6044_RS03750 (A6044_03760) radA 701179..702558 (-) 1380 WP_010945034.1 DNA repair protein RadA -
  A6044_RS03755 (A6044_03765) pilA 702752..703201 (+) 450 WP_010945035.1 prepilin peptidase-dependent pilin Machinery gene
  A6044_RS03760 (A6044_03770) pilB 703213..704607 (+) 1395 WP_064088541.1 GspE/PulE family protein Machinery gene
  A6044_RS03765 (A6044_03775) pilC 704591..705784 (+) 1194 WP_010945037.1 type II secretion system F family protein Machinery gene
  A6044_RS03770 (A6044_03780) - 705784..706476 (+) 693 WP_041603480.1 prepilin peptidase -
  A6044_RS03775 (A6044_03785) coaE 706494..707111 (+) 618 WP_010945039.1 dephospho-CoA kinase -
  A6044_RS03780 (A6044_03790) yacG 707114..707302 (+) 189 WP_010945040.1 DNA gyrase inhibitor YacG -
  A6044_RS03785 (A6044_03795) holA 707336..708361 (-) 1026 WP_010945041.1 DNA polymerase III subunit delta -
  A6044_RS03790 (A6044_03800) lptE 708367..708864 (-) 498 WP_010945042.1 LPS assembly lipoprotein LptE -
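
The plus-strand genes pilA, pilB, pilC and the downstream prepilin peptidase gene lie close together in the table above. As a rough sketch, assuming 1-based inclusive coordinates, the spacing between consecutive genes can be computed as follows; the tuple layout and the `gap_bp` helper are illustrative only.

```python
# (label, start, end, strand) copied from the Genomic Context table.
genes = [
    ("pilA", 702752, 703201, "+"),
    ("pilB", 703213, 704607, "+"),
    ("pilC", 704591, 705784, "+"),
    ("prepilin peptidase", 705784, 706476, "+"),
]


def gap_bp(upstream, downstream):
    """Bases separating two features; a negative value means they overlap."""
    return downstream[1] - upstream[2] - 1


gaps = {f"{a[0]}->{b[0]}": gap_bp(a, b) for a, b in zip(genes, genes[1:])}
for pair, gap in gaps.items():
    print(pair, gap)
```

With the listed coordinates, pilB and pilC overlap by 17 bp, and pilC shares its last base with the first base of the prepilin peptidase gene.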

Sequence


Protein


Length: 397 a.a.        Molecular weight: 45721.33 Da        Isoelectric point: 10.1310

>NTDB_id=179602 A6044_RS03765 WP_010945037.1 704591..705784(+) (pilC) [[Haemophilus] ducreyi strain GHA5]
MLKIYEFHWRAFNRFNQRQKGKSLAKSHDELEQRLLAKGYSNIHIQRNFILVRKPKSEQITQLINQLAMLLNASVPLKQS
LVMVLENVHNIKLYLWLSEIIMLLEAGHAFSASLTKMNVYLTKQEIQLIQMGEQSGRLAIMLDNIAQVRNQSQKLANKVK
KIMFYPAIILAVSISVLIGLLLFIVPQFEDLYRNKDQSLPFITQLLFQLSNFLQQYAFILFIGGLFSGIVSYFLAKKWLF
WRTLRSRLLNHMPGFQQIIKDARIIFFSQNLALMLNAHIHLDATLKAFLSEHRQDPVLHQEILSILSLLKQGYKFSEGLN
PTVFTSQVVQMMAIGEQSGNLAKMCMYISQLYQQKLDYQIDLFAQLLEPVLMVVIGIIVGTILIGIYLPIFDMGALV
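
The FASTA header of this record packs several fields into one line. The sketch below splits it with a regular expression; the field layout is inferred from this single header and may not hold for every record, and the group names are hypothetical.

```python
import re

# Pattern inferred from the header shown above; treat it as an assumption.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>.+)\]$"
)


def parse_header(line: str) -> dict:
    """Return the header fields as a dict, or raise if the layout differs."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=179602 A6044_RS03765 WP_010945037.1 704591..705784(+) "
    "(pilC) [[Haemophilus] ducreyi strain GHA5]"
)
record = parse_header(header)
print(record["gene"], record["organism"])
```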

Nucleotide


Length: 1194 bp

>NTDB_id=179602 A6044_RS03765 WP_010945037.1 704591..705784(+) (pilC) [[Haemophilus] ducreyi strain GHA5]
ATGCTGAAAATATATGAATTTCATTGGCGGGCTTTTAATCGTTTTAATCAACGGCAAAAAGGTAAATCTCTGGCCAAAAG
TCACGATGAATTAGAACAACGGCTATTAGCTAAAGGCTATTCAAATATCCATATTCAACGTAATTTTATTTTAGTTAGGA
AACCAAAAAGCGAACAAATCACTCAATTAATTAATCAGTTAGCTATGTTATTGAATGCCTCGGTTCCATTAAAACAATCG
CTGGTGATGGTATTAGAAAACGTACATAATATTAAATTATATTTATGGCTATCTGAAATTATTATGCTGTTAGAAGCGGG
GCATGCTTTTTCAGCCAGTCTCACTAAAATGAACGTTTATTTGACTAAGCAAGAAATTCAGTTAATTCAGATGGGAGAAC
AGAGTGGGCGTTTAGCCATTATGCTAGATAACATTGCACAAGTACGTAATCAATCACAGAAATTAGCTAATAAAGTAAAG
AAAATTATGTTTTATCCGGCGATTATTTTAGCCGTATCAATTAGTGTATTAATTGGATTGTTATTATTTATTGTGCCTCA
ATTTGAAGATCTTTATCGTAACAAAGATCAGTCACTGCCTTTTATTACCCAATTATTATTTCAGTTATCGAATTTTCTTC
AACAATATGCATTTATTTTATTCATCGGCGGCCTATTTAGCGGTATTGTTAGCTATTTTTTAGCGAAAAAGTGGCTTTTT
TGGCGTACCTTAAGATCGCGTTTATTAAATCATATGCCAGGGTTTCAGCAAATTATTAAAGATGCTCGAATTATTTTCTT
TAGTCAAAATCTTGCGTTAATGCTTAATGCACATATTCATTTAGATGCAACGTTAAAAGCATTTTTATCAGAACATCGCC
AAGATCCGGTATTACATCAGGAGATTTTGAGTATATTAAGTTTATTAAAGCAAGGTTATAAATTCTCTGAGGGGCTTAAT
CCAACCGTATTTACTAGCCAAGTAGTACAAATGATGGCAATTGGCGAACAAAGTGGTAATTTGGCTAAAATGTGTATGTA
TATTAGTCAACTGTATCAGCAAAAACTGGATTACCAAATTGATTTGTTTGCTCAATTACTTGAACCGGTTTTAATGGTAG
TGATTGGCATTATTGTGGGGACGATTTTGATTGGGATTTATTTGCCTATTTTTGATATGGGAGCATTGGTGTAA
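
As a quick consistency check between the nucleotide and protein records, the first 30 and last 12 bases of the sequence above can be translated by hand or with a short script. This is a minimal sketch using the standard codon assignments (the bacterial translation table uses the same codon-to-amino-acid mapping); the `translate` helper is illustrative and does no start-codon handling.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons are generated in TCAG order so they line up with AMINO.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


first = translate("ATGCTGAAAATATATGAATTTCATTGGCGG")  # first 30 bases
last = translate("GCATTGGTGTAA")                     # last 12 bases
print(first, last)  # MLKIYEFHWR ALV*
```

The translated ends match the start (MLKIYEFHWR) and the end (ALV) of the protein record, followed by the stop codon.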


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q7VM72

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Glaesserella parasuis strain SC1401 51.637 100 0.516
  pilC Haemophilus influenzae Rd KW20 40.75 100 0.411
  pilC Haemophilus influenzae 86-028NP 40.25 100 0.406


Multiple sequence alignment