Detailed information    

In silico (bioinformatically predicted)

Overview


Name   pilC
Type   Machinery gene
Locus tag   DQL17_RS03415
Genome accession   NZ_LS483496
Coordinates   669402..670622 (-)
Length   406 a.a.
NCBI ID   WP_005691827.1
UniProt ID   A0A2S9S191
Organism   Haemophilus influenzae strain NCTC8455
Function   type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
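
The coordinates and the protein length listed above are mutually consistent. A minimal sketch of the arithmetic, assuming the coordinates are 1-based and inclusive and that the stop codon is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Coordinates of pilC as listed in the overview (assumed 1-based, inclusive).
start, end = 669402, 670622

# Inclusive span on the genome, in base pairs.
gene_len_bp = end - start + 1

# Each codon is 3 bp; the final codon is assumed to be the stop codon,
# which does not contribute an amino acid.
protein_len_aa = gene_len_bp // 3 - 1

print(gene_len_bp, protein_len_aa)
```

Under these assumptions the result agrees with the 1221 bp and 406 a.a. given elsewhere in the record.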

Genomic Context


Location: 664402..675622
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQL17_RS03380 (NCTC8455_00665) - 665588..665794 (-) 207 WP_005631184.1 heavy-metal-associated domain-containing protein -
  DQL17_RS03385 (NCTC8455_00666) - 665868..666074 (-) 207 WP_005666693.1 heavy-metal-associated domain-containing protein -
  DQL17_RS03390 (NCTC8455_00667) - 666149..666355 (-) 207 WP_005648955.1 heavy-metal-associated domain-containing protein -
  DQL17_RS03395 (NCTC8455_00668) cueR 666432..666818 (+) 387 WP_005648963.1 Cu(I)-responsive transcriptional regulator -
  DQL17_RS03400 (NCTC8455_00669) metJ 666832..667149 (-) 318 WP_005631186.1 met regulon transcriptional regulator MetJ -
  DQL17_RS03405 (NCTC8455_00670) rho 667397..668659 (+) 1263 WP_111688882.1 transcription termination factor Rho -
  DQL17_RS03410 (NCTC8455_00671) pilD 668713..669405 (-) 693 WP_111688883.1 prepilin peptidase Machinery gene
  DQL17_RS03415 (NCTC8455_00672) pilC 669402..670622 (-) 1221 WP_005691827.1 type II secretion system F family protein Machinery gene
  DQL17_RS03420 (NCTC8455_00673) pilB 670619..672013 (-) 1395 WP_005691826.1 GspE/PulE family protein Machinery gene
  DQL17_RS03425 (NCTC8455_00674) pilA 672010..672459 (-) 450 WP_005691824.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  DQL17_RS03430 (NCTC8455_00675) ampD 672574..673128 (+) 555 WP_005691822.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  DQL17_RS03435 (NCTC8455_00676) corC 673760..674659 (+) 900 WP_005654105.1 CNNM family magnesium/cobalt transport protein CorC -
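
The table suggests two things that can be checked directly from the coordinates: the displayed window looks like the pilC gene plus a fixed flank on each side, and the adjacent pil genes on the minus strand overlap by a few bases. The sketch below assumes 1-based inclusive coordinates; the 5000 bp flank is inferred from the numbers rather than stated in the record, and the helper name `overlap_bp` is hypothetical.

```python
# (gene, start, end) for the four consecutive minus-strand pil genes,
# copied from the genomic context table.
genes = [
    ("pilD", 668713, 669405),
    ("pilC", 669402, 670622),
    ("pilB", 670619, 672013),
    ("pilA", 672010, 672459),
]

def overlap_bp(a, b):
    """Number of shared positions between two inclusive intervals (0 if none)."""
    return max(0, min(a[2], b[2]) - max(a[1], b[1]) + 1)

# Overlap between each pair of neighbouring genes.
overlaps = {(a[0], b[0]): overlap_bp(a, b) for a, b in zip(genes, genes[1:])}

# Window around pilC, assuming a symmetric flank (inferred, not documented).
flank = 5000
window = (669402 - flank, 670622 + flank)

print(overlaps)
print(window)
```

With these inputs the computed window matches the listed location 664402..675622, and each neighbouring pair of pil genes shares 4 bp.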

Sequence


Protein


Length: 406 a.a.        Molecular weight: 46222.47 Da        Isoelectric Point: 9.9010

>NTDB_id=1142726 DQL17_RS03415 WP_005691827.1 669402..670622(-) (pilC) [Haemophilus influenzae strain NCTC8455]
MTKKLFYYQGSNALNQKQKGSIIADTKQQAHFQLISRGITHIKLQQNWQFGAKPKNSEISELLNQLATLLQSAIPLKNSL
QILQQNCTQIVLNEWLERLLQSIEAGLAFSQAIEQQGKYLTQQEIQLIQVGEMTGKLAVVCKKIATHRSQSLALQRKLQK
IMLYPSMVLGISLLLTLALLLFIVPQFAEMYSGNNAELPTITAILLSISNFLKQNIGILLFFVLSFFLFYYFYLKRQTWF
HQKKNQLISITPIFGTIQKLSRLVNFSQSLQIMLQAGVPLNQALDSFLPRTQTWQTKKTLVNDIILDKEVRSILQWVSQG
YAFSNSVSSDLFPMEAQQMLQIGEQSGKLALMLEHIAENYQEKLNHQIDLLSQMLEPLMMVIIGSLIGIIMMGMYLPIFN
MGSVIQ

Nucleotide


Length: 1221 bp

>NTDB_id=1142726 DQL17_RS03415 WP_005691827.1 669402..670622(-) (pilC) [Haemophilus influenzae strain NCTC8455]
ATGACTAAAAAACTCTTTTATTATCAAGGTAGTAACGCATTAAATCAGAAACAAAAAGGCTCAATTATTGCGGATACAAA
ACAACAAGCACACTTTCAATTAATAAGCCGCGGGATTACTCACATCAAATTACAACAAAACTGGCAATTTGGGGCAAAGC
CCAAAAATTCAGAAATTAGCGAATTACTCAATCAATTAGCCACGTTGCTACAGTCCGCTATTCCGTTAAAAAACAGTCTG
CAAATTTTGCAACAAAATTGTACTCAAATTGTACTCAATGAATGGCTTGAACGACTACTTCAATCTATTGAAGCTGGTTT
AGCATTTTCACAAGCCATTGAACAACAAGGGAAATATCTCACACAACAAGAAATTCAACTGATTCAAGTGGGAGAAATGA
CAGGCAAACTAGCCGTAGTTTGTAAAAAAATAGCCACACATCGCAGCCAATCTTTAGCATTACAACGTAAATTACAGAAA
ATCATGTTGTACCCCTCAATGGTGTTAGGAATTTCTCTATTATTGACACTCGCATTACTGCTTTTTATCGTGCCTCAATT
TGCTGAAATGTACAGTGGCAATAATGCTGAGTTACCAACAATAACTGCAATATTGCTCTCTATATCCAATTTTCTTAAGC
AAAATATTGGCATTTTGCTATTTTTCGTTTTGAGTTTTTTTCTATTTTATTATTTCTATTTAAAACGCCAGACTTGGTTT
CATCAAAAGAAAAATCAACTTATTTCTATCACGCCTATTTTTGGCACAATTCAAAAGCTTTCACGTTTAGTGAACTTTAG
TCAAAGTTTACAAATTATGTTGCAGGCCGGCGTACCGCTTAATCAGGCACTAGACAGTTTTCTTCCTCGCACACAAACTT
GGCAAACCAAGAAAACGCTTGTAAATGACATCATATTAGATAAAGAAGTGCGGTCAATTTTGCAATGGGTTTCTCAAGGC
TATGCGTTTTCTAATAGCGTAAGTAGCGATCTTTTCCCGATGGAAGCACAACAAATGCTACAAATTGGCGAGCAAAGCGG
AAAACTCGCTTTGATGCTAGAGCATATTGCGGAAAATTACCAAGAAAAACTTAATCATCAAATTGACTTACTCTCACAAA
TGCTAGAACCATTAATGATGGTGATCATCGGCAGTCTGATTGGAATTATTATGATGGGAATGTATTTACCTATCTTTAAT
ATGGGTTCTGTTATTCAATGA
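
Although the gene lies on the minus strand, the nucleotide record above is already in coding orientation: it begins with ATG and ends with the stop codon TGA. As a rough consistency check against the protein record, the sketch below translates the first and last lines of the nucleotide sequence with the standard genetic code. The `translate` helper and the table construction are illustrative and are not how the database produced the protein sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate complete codons in frame 0; trailing 1-2 bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 80 nt and last 21 nt of the record. The last line starts at
# position 1201, so it is in the same reading frame as the first.
first_line = (
    "ATGACTAAAAAACTCTTTTATTATCAAGGTAGTAACGCATTAAATCAG"
    "AAACAAAAAGGCTCAATTATTGCGGATACAAA"
)
last_line = "ATGGGTTCTGTTATTCAATGA"

n_term = translate(first_line)
c_term = translate(last_line)
print(n_term)
print(c_term)
```

The two translated fragments can be compared with the start (MTKKLFYYQG...) and end (...MGSVIQ) of the protein sequence; `*` marks the stop codon.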


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source   AlphaFold DB
ID   A0A2S9S191

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  pilC   Haemophilus influenzae Rd KW20   98.522   100   0.985
  pilC   Haemophilus influenzae 86-028NP   97.783   100   0.978
  pilC   Glaesserella parasuis strain SC1401   38.653   98.768   0.382
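
In the three rows above, the Ha-value is numerically consistent with the product of identity and coverage, each expressed as a fraction. This is an observation from the listed numbers, not a definition taken from the record. A minimal sketch of that check, with the hypothetical helper `identity_times_coverage`:

```python
# (organism, identity %, coverage %, listed Ha-value) from the table above.
rows = [
    ("Haemophilus influenzae Rd KW20", 98.522, 100.0, 0.985),
    ("Haemophilus influenzae 86-028NP", 97.783, 100.0, 0.978),
    ("Glaesserella parasuis strain SC1401", 38.653, 98.768, 0.382),
]

def identity_times_coverage(identity_pct, coverage_pct):
    """Product of fractional identity and fractional coverage (assumed formula)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)

# Compare the computed product, rounded to 3 decimals, with the listed value.
computed = [round(identity_times_coverage(i, c), 3) for _, i, c, _ in rows]
listed = [ha for _, _, _, ha in rows]
print(computed, listed)
```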


Multiple sequence alignment