Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   A6043_RS05330 Genome accession   NZ_CP015431
Coordinates   1129295..1130488 (-) Length   397 a.a.
NCBI ID   WP_010945037.1    Uniprot ID   Q7VM72
Organism   [Haemophilus] ducreyi strain GHA3     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1124295..1135488
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  A6043_RS05305 (A6043_05315) lptE 1126215..1126712 (+) 498 WP_010945042.1 LPS assembly lipoprotein LptE -
  A6043_RS05310 (A6043_05320) holA 1126718..1127743 (+) 1026 WP_010945041.1 DNA polymerase III subunit delta -
  A6043_RS05315 (A6043_05325) yacG 1127777..1127965 (-) 189 WP_010945040.1 DNA gyrase inhibitor YacG -
  A6043_RS05320 (A6043_05330) coaE 1127968..1128585 (-) 618 WP_010945039.1 dephospho-CoA kinase -
  A6043_RS05325 (A6043_05335) - 1128603..1129295 (-) 693 WP_041603480.1 prepilin peptidase -
  A6043_RS05330 (A6043_05340) pilC 1129295..1130488 (-) 1194 WP_010945037.1 type II secretion system F family protein Machinery gene
  A6043_RS05335 (A6043_05345) pilB 1130472..1131866 (-) 1395 WP_064088541.1 GspE/PulE family protein Machinery gene
  A6043_RS05340 (A6043_05350) pilA 1131878..1132327 (-) 450 WP_010945035.1 prepilin peptidase-dependent pilin Machinery gene
  A6043_RS05345 (A6043_05355) radA 1132521..1133900 (+) 1380 WP_010945034.1 DNA repair protein RadA -
  A6043_RS05350 (A6043_05360) - 1133967..1134182 (-) 216 WP_010945033.1 YgjV family protein -
  A6043_RS05355 (A6043_05365) - 1134304..1134789 (-) 486 WP_010945031.1 dihydrofolate reductase -

Sequence


Protein


Download         Length: 397 a.a.        Molecular weight: 45721.33 Da        Isoelectric Point: 10.1310

>NTDB_id=179581 A6043_RS05330 WP_010945037.1 1129295..1130488(-) (pilC) [[Haemophilus] ducreyi strain GHA3]
MLKIYEFHWRAFNRFNQRQKGKSLAKSHDELEQRLLAKGYSNIHIQRNFILVRKPKSEQITQLINQLAMLLNASVPLKQS
LVMVLENVHNIKLYLWLSEIIMLLEAGHAFSASLTKMNVYLTKQEIQLIQMGEQSGRLAIMLDNIAQVRNQSQKLANKVK
KIMFYPAIILAVSISVLIGLLLFIVPQFEDLYRNKDQSLPFITQLLFQLSNFLQQYAFILFIGGLFSGIVSYFLAKKWLF
WRTLRSRLLNHMPGFQQIIKDARIIFFSQNLALMLNAHIHLDATLKAFLSEHRQDPVLHQEILSILSLLKQGYKFSEGLN
PTVFTSQVVQMMAIGEQSGNLAKMCMYISQLYQQKLDYQIDLFAQLLEPVLMVVIGIIVGTILIGIYLPIFDMGALV

Nucleotide


Download         Length: 1194 bp        

>NTDB_id=179581 A6043_RS05330 WP_010945037.1 1129295..1130488(-) (pilC) [[Haemophilus] ducreyi strain GHA3]
ATGCTGAAAATATATGAATTTCATTGGCGGGCTTTTAATCGTTTTAATCAACGGCAAAAAGGTAAATCTCTGGCCAAAAG
TCACGATGAATTAGAACAACGGCTATTAGCTAAAGGCTATTCAAATATCCATATTCAACGTAATTTTATTTTAGTTAGGA
AACCAAAAAGCGAACAAATCACTCAATTAATTAATCAGTTAGCTATGTTATTGAATGCCTCGGTTCCATTAAAACAATCG
CTGGTGATGGTATTAGAAAACGTACATAATATTAAATTATATTTATGGCTATCTGAAATTATTATGCTGTTAGAAGCGGG
GCATGCTTTTTCAGCCAGTCTCACTAAAATGAACGTTTATTTGACTAAGCAAGAAATTCAGTTAATTCAGATGGGAGAAC
AGAGTGGGCGTTTAGCCATTATGCTAGATAACATTGCACAAGTACGTAATCAATCACAGAAATTAGCTAATAAAGTAAAG
AAAATTATGTTTTATCCGGCGATTATTTTAGCCGTATCAATTAGTGTATTAATTGGATTGTTATTATTTATTGTGCCTCA
ATTTGAAGATCTTTATCGTAACAAAGATCAGTCACTGCCTTTTATTACCCAATTATTATTTCAGTTATCGAATTTTCTTC
AACAATATGCATTTATTTTATTCATCGGCGGCCTATTTAGCGGTATTGTTAGCTATTTTTTAGCGAAAAAGTGGCTTTTT
TGGCGTACCTTAAGATCGCGTTTATTAAATCATATGCCAGGGTTTCAGCAAATTATTAAAGATGCTCGAATTATTTTCTT
TAGTCAAAATCTTGCGTTAATGCTTAATGCACATATTCATTTAGATGCAACGTTAAAAGCATTTTTATCAGAACATCGCC
AAGATCCGGTATTACATCAGGAGATTTTGAGTATATTAAGTTTATTAAAGCAAGGTTATAAATTCTCTGAGGGGCTTAAT
CCAACCGTATTTACTAGCCAAGTAGTACAAATGATGGCAATTGGCGAACAAAGTGGTAATTTGGCTAAAATGTGTATGTA
TATTAGTCAACTGTATCAGCAAAAACTGGATTACCAAATTGATTTGTTTGCTCAATTACTTGAACCGGTTTTAATGGTAG
TGATTGGCATTATTGTGGGGACGATTTTGATTGGGATTTATTTGCCTATTTTTGATATGGGAGCATTGGTGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q7VM72

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Glaesserella parasuis strain SC1401

51.637

100

0.516

  pilC Haemophilus influenzae Rd KW20

40.75

100

0.411

  pilC Haemophilus influenzae 86-028NP

40.25

100

0.406


Multiple sequence alignment