Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   EL230_RS03855 Genome accession   NZ_LR134490
Coordinates   758548..759768 (+) Length   406 a.a.
NCBI ID   WP_126513489.1    Uniprot ID   -
Organism   Haemophilus influenzae strain NCTC11873     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 753548..764768
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EL230_RS03835 (NCTC11873_00762) corC 754512..755411 (-) 900 WP_005662987.1 CNNM family magnesium/cobalt transport protein CorC -
  EL230_RS03840 (NCTC11873_00763) ampD 756043..756597 (-) 555 WP_041175038.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  EL230_RS03845 (NCTC11873_00764) pilA 756711..757160 (+) 450 WP_015701561.1 prepilin peptidase-dependent pilin Machinery gene
  EL230_RS03850 (NCTC11873_00765) pilB 757157..758551 (+) 1395 WP_126513488.1 GspE/PulE family protein Machinery gene
  EL230_RS03855 (NCTC11873_00766) pilC 758548..759768 (+) 1221 WP_126513489.1 type II secretion system F family protein Machinery gene
  EL230_RS03860 (NCTC11873_00767) pilD 759765..760457 (+) 693 WP_126513490.1 prepilin peptidase Machinery gene
  EL230_RS03865 (NCTC11873_00768) rho 760511..761773 (-) 1263 WP_005648966.1 transcription termination factor Rho -
  EL230_RS03870 (NCTC11873_00769) metJ 762018..762335 (+) 318 WP_005634287.1 met regulon transcriptional regulator MetJ -
  EL230_RS03875 (NCTC11873_00770) cueR 762349..762735 (-) 387 WP_005648963.1 Cu(I)-responsive transcriptional regulator -
  EL230_RS03880 (NCTC11873_00771) - 762812..763018 (+) 207 WP_012054944.1 heavy-metal-associated domain-containing protein -
  EL230_RS03885 (NCTC11873_00772) - 763092..763298 (+) 207 WP_012054944.1 heavy-metal-associated domain-containing protein -
  EL230_RS03890 (NCTC11873_00773) - 763384..763590 (+) 207 WP_012054944.1 heavy-metal-associated domain-containing protein -
  EL230_RS03895 (NCTC11873_00774) - 763676..763882 (+) 207 WP_005666693.1 heavy-metal-associated domain-containing protein -

Sequence


Protein


Download         Length: 406 a.a.        Molecular weight: 46258.52 Da        Isoelectric Point: 10.0444

>NTDB_id=1123200 EL230_RS03855 WP_126513489.1 758548..759768(+) (pilC) [Haemophilus influenzae strain NCTC11873]
MTKKLFYYQGSNALNQKQKGSIIADTKQQAHFQLISRGLTHIKLQQNWQFGAKPKNSEISELLNQLATLLQSAIPLKNSL
QILQQNCTQIVLNEWLERLLQSIEAGLAFSQAIEQQGKYLTQQEIQLIQVGEMTGKLAVVCKKIATHRSQSLALQRKLQK
IMLYPSMVLGISLLLTLALLLFIVPQFAKMYSGNNAELPTITAILLSISNFLKQNIGILLFFAFNFFLFYYFYLKRQTWF
HQKKNQLISITPIFGTIQKLSRLVNFSQSLQIMLQAGVPLNQALDSFLPRTQTWQTKKTLVNDMVLDKEVRSILQWVSQG
YAFSNSVSSDLFPMEAQQMLQIGEQSGKLALMLEHIAENYQEKLNHQIDLLSQMLEPLMMVIIGSLIGIIMMGMYLPIFN
MGSVIQ

Nucleotide


Download         Length: 1221 bp        

>NTDB_id=1123200 EL230_RS03855 WP_126513489.1 758548..759768(+) (pilC) [Haemophilus influenzae strain NCTC11873]
ATGACTAAAAAACTCTTTTATTATCAAGGTAGTAACGCATTAAATCAGAAACAAAAAGGCTCAATTATTGCGGATACAAA
ACAACAAGCACACTTTCAATTAATAAGCCGCGGGCTTACTCACATCAAATTACAACAAAACTGGCAATTTGGGGCAAAGC
CCAAAAATTCAGAAATTAGTGAATTACTCAATCAATTAGCCACGTTGCTACAATCCGCAATTCCATTAAAAAACAGTCTG
CAAATTTTGCAACAAAATTGTACTCAAATTGTACTCAATGAATGGCTTGAACGACTGCTTCAATCTATTGAAGCTGGTTT
AGCATTTTCACAAGCCATTGAACAACAAGGGAAATATCTCACTCAACAAGAAATTCAACTGATTCAAGTGGGAGAAATGA
CAGGTAAACTAGCCGTAGTTTGTAAAAAAATAGCCACACATCGCAGCCAATCTTTAGCATTACAACGTAAATTACAGAAA
ATTATGTTATATCCCTCAATGGTATTGGGAATTTCTCTATTATTGACACTCGCATTACTGCTTTTTATCGTGCCTCAATT
TGCTAAAATGTACAGTGGCAATAATGCTGAGTTACCAACAATAACTGCAATATTGCTCTCTATATCCAATTTTCTTAAGC
AAAATATTGGCATTTTGCTATTTTTCGCTTTTAATTTTTTTCTATTTTATTACTTCTATCTAAAACGCCAGACTTGGTTT
CATCAAAAGAAAAATCAACTTATTTCTATCACGCCTATTTTTGGCACAATTCAAAAGCTTTCACGTTTAGTGAACTTTAG
TCAAAGTTTACAAATTATGTTGCAGGCCGGCGTACCGCTTAATCAGGCACTAGACAGTTTTCTTCCTCGCACACAAACTT
GGCAAACCAAGAAAACGCTTGTAAACGATATGGTATTAGATAAAGAAGTGCGGTCAATTTTGCAATGGGTTTCTCAAGGC
TATGCGTTTTCTAATAGTGTAAGTAGCGATCTTTTCCCGATGGAAGCACAACAAATGCTACAAATTGGCGAGCAAAGCGG
AAAACTCGCTTTGATGCTAGAACATATTGCGGAAAATTACCAAGAAAAACTTAATCATCAAATTGACTTACTCTCACAAA
TGCTAGAACCATTAATGATGGTAATCATCGGAAGCCTGATTGGAATTATTATGATGGGAATGTATTTACCTATCTTTAAT
ATGGGATCAGTTATTCAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Haemophilus influenzae Rd KW20

97.783

100

0.978

  pilC Haemophilus influenzae 86-028NP

97.537

100

0.975

  pilC Glaesserella parasuis strain SC1401

38.155

98.768

0.377


Multiple sequence alignment