Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   HQ939_RS09020 Genome accession   NZ_CP054198
Coordinates   1791678..1792874 (-) Length   398 a.a.
NCBI ID   WP_075606470.1    Uniprot ID   -
Organism   Glaesserella parasuis strain YHP170504     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1786678..1797874
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  HQ939_RS08995 (HQ939_09005) - 1787394..1787783 (-) 390 WP_075606473.1 SufE family protein -
  HQ939_RS09000 (HQ939_09010) clpP 1787884..1788465 (+) 582 WP_075606472.1 ATP-dependent Clp endopeptidase proteolytic subunit ClpP -
  HQ939_RS09005 (HQ939_09015) clpX 1788472..1789722 (+) 1251 WP_042906659.1 ATP-dependent protease ATP-binding subunit ClpX -
  HQ939_RS09010 (HQ939_09020) dapE 1789796..1790929 (-) 1134 WP_021117768.1 succinyl-diaminopimelate desuccinylase -
  HQ939_RS09015 (HQ939_09025) - 1791013..1791681 (-) 669 WP_075606471.1 prepilin peptidase -
  HQ939_RS09020 (HQ939_09030) pilC 1791678..1792874 (-) 1197 WP_075606470.1 type II secretion system F family protein Machinery gene
  HQ939_RS09025 (HQ939_09035) pilB 1792867..1794252 (-) 1386 WP_075606469.1 GspE/PulE family protein Machinery gene
  HQ939_RS09030 (HQ939_09040) pilA 1794493..1794948 (-) 456 WP_021117772.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  HQ939_RS09035 (HQ939_09045) radA 1795136..1796512 (+) 1377 WP_075606468.1 DNA repair protein RadA -
  HQ939_RS09040 (HQ939_09050) gmk 1796564..1797190 (-) 627 WP_075606467.1 guanylate kinase -

Sequence


Protein


Download         Length: 398 a.a.        Molecular weight: 45737.52 Da        Isoelectric Point: 10.1227

>NTDB_id=451408 HQ939_RS09020 WP_075606470.1 1791678..1792874(-) (pilC) [Glaesserella parasuis strain YHP170504]
MNNIYEFQWKGRNRFGQKQTGRQLAESREVLEKRLQQKGYSQLRISRHFALPTAPKKEEINQFMQQLALLIQAKIPLKQS
LVMLLETCQNRTFYRWQQETIRLIEAGFALSVAFEKQGKYINPQEIQLIKMAETSGNLGLILTNIAQRREKSEKLTKKIK
KILFYPVFILAVSITLSILLLLFIVPQFAELYGSKGKSLPLITEILFSLSNFLQHSILTLIILCVLCFLLIHILNKKTDL
IKRLKFIILSHIPILKGIIQYSRIIFFCQNSSLMLASHIRLDTVLHSFLANKSDDILLQRSLATTLTYLKQGYRLADSLD
PKLFPTDMLQMIAIGEQSGKLSPMLQQISDGYQQRLDYQIDILSQLLEPMLMVLMGIIVGTILVGLYLPIFDMGAMIE

Nucleotide


Download         Length: 1197 bp        

>NTDB_id=451408 HQ939_RS09020 WP_075606470.1 1791678..1792874(-) (pilC) [Glaesserella parasuis strain YHP170504]
ATGAATAACATTTATGAATTTCAATGGAAAGGACGCAACCGCTTTGGACAAAAGCAAACAGGGCGACAGCTTGCCGAAAG
CAGAGAAGTGCTAGAAAAACGCTTGCAACAAAAAGGCTATAGCCAGTTACGCATTAGCCGACATTTTGCCTTGCCGACGG
CACCGAAAAAGGAAGAAATTAATCAATTTATGCAACAGTTGGCATTATTAATTCAAGCCAAAATCCCCCTAAAACAGAGT
TTGGTAATGTTGCTGGAAACTTGCCAAAATCGCACCTTTTACCGTTGGCAACAAGAGACAATCCGCTTGATTGAAGCTGG
CTTTGCCCTATCCGTTGCCTTTGAAAAACAGGGGAAATATATCAATCCACAAGAGATCCAATTAATTAAAATGGCAGAAA
CAAGCGGTAATTTAGGGCTGATTTTAACCAATATTGCACAACGCCGTGAAAAATCGGAAAAACTAACCAAGAAAATTAAA
AAAATTCTCTTTTATCCCGTCTTTATTTTAGCTGTTTCAATCACACTCTCTATTTTATTATTACTTTTTATTGTGCCACA
ATTTGCAGAACTTTATGGATCAAAAGGAAAATCATTACCCCTAATTACCGAAATACTCTTTAGCCTATCCAATTTTCTAC
AACACTCTATTTTAACGCTAATTATCTTATGTGTGTTATGTTTTCTTTTAATTCATATTTTAAATAAAAAAACAGATCTT
ATTAAACGCCTAAAGTTTATTATATTAAGCCATATTCCTATTTTAAAAGGCATTATCCAGTATTCACGCATTATCTTTTT
TTGTCAAAATAGCAGTTTAATGCTGGCTTCACATATTCGTTTAGATACTGTACTTCATTCCTTTTTAGCCAATAAAAGTG
ATGATATTTTATTACAACGTTCTCTAGCAACTACACTAACTTATTTAAAACAAGGTTATCGTTTAGCCGACAGTTTAGAT
CCAAAACTCTTTCCTACGGATATGTTACAGATGATTGCCATTGGCGAGCAGAGCGGAAAATTATCGCCAATGTTGCAACA
AATCAGCGACGGTTATCAACAGCGGTTAGATTATCAAATTGATATACTTTCCCAATTATTAGAGCCGATGTTAATGGTGT
TAATGGGAATTATTGTAGGAACTATTTTAGTCGGGTTATATTTGCCGATTTTTGATATGGGAGCAATGATAGAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Glaesserella parasuis strain SC1401

98.492

100

0.985

  pilC Haemophilus influenzae 86-028NP

38.404

100

0.387

  pilC Haemophilus influenzae Rd KW20

38.404

100

0.387