Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   H733_RS01025 Genome accession   NZ_CP007805
Coordinates   216159..217379 (-) Length   406 a.a.
NCBI ID   WP_038439228.1    Uniprot ID   -
Organism   Haemophilus influenzae CGSHiCZ412602     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 211159..222379
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  H733_RS00995 (H733_0187) - 212625..212831 (-) 207 WP_005686717.1 heavy-metal-associated domain-containing protein -
  H733_RS01000 (H733_0188) - 212905..213111 (-) 207 WP_005666693.1 heavy-metal-associated domain-containing protein -
  H733_RS01005 (H733_0189) cueR 213188..213574 (+) 387 WP_038439225.1 Cu(I)-responsive transcriptional regulator -
  H733_RS01010 (H733_0190) metJ 213588..213905 (-) 318 WP_005631186.1 met regulon transcriptional regulator MetJ -
  H733_RS01015 (H733_0191) rho 214153..215415 (+) 1263 WP_005666690.1 transcription termination factor Rho -
  H733_RS01020 (H733_0192) pilD 215470..216162 (-) 693 WP_038439227.1 prepilin peptidase Machinery gene
  H733_RS01025 (H733_0193) pilC 216159..217379 (-) 1221 WP_038439228.1 type II secretion system F family protein Machinery gene
  H733_RS01030 (H733_0194) pilB 217376..218773 (-) 1398 WP_038439229.1 GspE/PulE family protein Machinery gene
  H733_RS01035 (H733_0195) pilA 218770..219219 (-) 450 WP_038439231.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  H733_RS01040 (H733_0196) ampD 219334..219885 (+) 552 WP_038439232.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  H733_RS01045 (H733_0198) corC 220508..221407 (+) 900 WP_038439233.1 CNNM family magnesium/cobalt transport protein CorC -

Sequence


Protein


Download         Length: 406 a.a.        Molecular weight: 46182.43 Da        Isoelectric Point: 9.6187

>NTDB_id=123473 H733_RS01025 WP_038439228.1 216159..217379(-) (pilC) [Haemophilus influenzae CGSHiCZ412602]
MTKKLFYYQGSNALNQKQKGSIIADTKQQAHFQLISCGLTHIKLQQNWQFVAKPKNSEISELLNQLATLLQSAIPLKNSL
QILQQNCTQIMLNEWLERLLQSIESGLAFSQAIEQQGKYLTQQEIQLIQVGEMTGKLAVVCKKIATHRSQSLALQRKLQK
IMLYPSMVLGISLLLTLALLLFIVPQFAEMYSGNNAELPTITAILLSISNFLKQNIGILLFFALSFVLFYYFYLKRQIWF
HQQKNQLISVTPIFGTIQKLSRLVNFSRSLQIMLQAGVPLNQALDSFLPRTQTWQTKKTLVNDIVLDKEVRSILQWVSQG
YAFSDSVSSDLFPMEAQQMLQIGEQSGKLALMLEHIADNYQEKLNHQIDLLSQMLEPLMMVIIGSLIGIIMMGMYLPIFN
MGSVIQ

Nucleotide


Download         Length: 1221 bp        

>NTDB_id=123473 H733_RS01025 WP_038439228.1 216159..217379(-) (pilC) [Haemophilus influenzae CGSHiCZ412602]
ATGACTAAAAAACTCTTTTATTATCAAGGTAGTAACGCATTAAATCAGAAACAAAAAGGCTCAATTATTGCGGATACGAA
ACAACAAGCGCACTTTCAGTTAATAAGCTGCGGGCTTACTCACATCAAATTACAACAAAACTGGCAATTTGTGGCAAAGC
CCAAAAATTCAGAAATCAGTGAATTACTCAATCAATTAGCGACACTACTACAATCTGCCATTCCGTTAAAAAACAGCCTG
CAAATTTTGCAACAAAATTGTACTCAAATTATGCTCAACGAATGGCTTGAACGACTGCTTCAATCCATTGAATCTGGCTT
AGCATTCTCACAAGCAATTGAACAACAAGGAAAATATCTCACACAACAAGAAATTCAACTGATTCAAGTGGGAGAAATGA
CAGGAAAACTTGCCGTAGTTTGTAAAAAAATAGCCACGCACCGTAGCCAATCTTTAGCATTACAACGCAAATTACAGAAA
ATTATGTTATATCCCTCAATGGTATTGGGAATTTCTCTATTATTGACACTCGCATTACTGCTTTTTATCGTGCCTCAATT
TGCTGAAATGTACAGTGGCAATAATGCCGAATTACCAACAATAACCGCAATATTGCTCTCTATATCCAATTTTCTTAAGC
AAAATATTGGCATTTTGCTATTTTTCGCTTTGAGTTTTGTTCTATTTTATTACTTCTATCTAAAACGCCAGATCTGGTTT
CATCAACAGAAAAACCAACTTATCTCTGTCACGCCTATTTTTGGCACAATTCAAAAACTTTCACGTTTAGTGAACTTTAG
TCGCAGTTTACAAATTATGTTGCAGGCTGGCGTACCGCTTAATCAGGCACTAGACAGTTTTCTTCCTCGCACACAAACGT
GGCAAACCAAGAAAACGCTTGTAAATGACATCGTATTAGATAAAGAAGTGCGGTCAATTTTACAATGGGTTTCTCAAGGC
TATGCGTTTTCTGATAGTGTAAGTAGCGATCTTTTCCCGATGGAAGCACAACAAATGCTCCAAATTGGCGAACAAAGCGG
AAAACTCGCTTTGATGCTAGAGCATATCGCGGATAATTACCAAGAAAAACTTAATCATCAAATTGACTTACTCTCACAAA
TGCTAGAACCATTAATGATGGTAATTATCGGCAGTCTGATTGGAATTATTATGATGGGAATGTATTTACCTATCTTTAAT
ATGGGCTCTGTTATTCAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Haemophilus influenzae 86-028NP

96.798

100

0.968

  pilC Haemophilus influenzae Rd KW20

96.552

100

0.966

  pilC Glaesserella parasuis strain SC1401

38.404

98.768

0.379


Multiple sequence alignment