Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC/pilC1   Type   Machinery gene
Locus tag   HBA47_RS05855 Genome accession   NZ_CP050136
Coordinates   1088602..1092684 (+) Length   1360 a.a.
NCBI ID   WP_154700189.1    Uniprot ID   -
Organism   Kingella kingae strain ATCC 23332     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1083602..1097684
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  HBA47_RS05850 (HBA47_05865) - 1086355..1088265 (+) 1911 WP_019389113.1 ATP-binding cassette domain-containing protein -
  HBA47_RS05855 (HBA47_05870) pilC/pilC1 1088602..1092684 (+) 4083 WP_154700189.1 PilC/PilY family type IV pilus protein Machinery gene
  HBA47_RS05860 (HBA47_05875) - 1092900..1093097 (+) 198 WP_154700188.1 transposase -
  HBA47_RS05865 (HBA47_05880) - 1093094..1093294 (+) 201 WP_154700187.1 hypothetical protein -
  HBA47_RS05870 (HBA47_05885) - 1093318..1093863 (+) 546 WP_154700186.1 hypothetical protein -
  HBA47_RS05875 (HBA47_05890) - 1094089..1094445 (-) 357 WP_230311300.1 3-isopropylmalate dehydratase -
  HBA47_RS05880 (HBA47_05895) - 1094551..1096017 (+) 1467 WP_019389116.1 DegQ family serine endoprotease -
  HBA47_RS05885 (HBA47_05900) - 1096114..1097280 (+) 1167 WP_019389117.1 NnrS family protein -

Sequence


Protein


Download         Length: 1360 a.a.        Molecular weight: 148015.75 Da        Isoelectric Point: 8.2028

>NTDB_id=430291 HBA47_RS05855 WP_154700189.1 1088602..1092684(+) (pilC/pilC1) [Kingella kingae strain ATCC 23332]
MSLKYKKYNCNLKPLSLGVSLALSSFATMAADSPFAIAPQHLSSTATVKKTTVTTPYADIFKQQQSITNTTVRGMQGAKP
NIMLLLDDSGSMGAQVPGSGRTRQQILQSSLSQVVAKYGNQINWGLIAFNDQSSAFNLPLGTNYSTVVNSIKRFPANGLT
PTITSYIKAVDTLNKGIQYRCQKSYLVMLSDGDSNYPMYFEREAGVNARPGQMLTSYLRYPHWVYINREINKYDFWRELL
TKDGKLIDRNSSKYRGMCSTAYQKSNSNVKNWCSALPQQFVYYPYSGISDPDDTTTSAYRINYMNTWSGIFAKKYTEYPL
RDLTNDLDFLDSGSAYYTNTNLPSSGFGRYKKFFFNNQPILAPQTQYAGAMLSFFSEPINKIDLRIGGNDATGKSWDDSA
FPTQNITTFTIGFGNGLSESGLALLRGGASNIKLKDGTVQKAYYSASDQTALQKAFDAILAQVEAENPPPSNSNPIDLPP
TTNTLPISAPTAGVPNRQQSVMAETSYATSPPSATGASNSIPNMAASVFVPAGLKSSELRFYALNSKAQADTSSWLVADF
SQRKALIRTSSGVRWANDANLDNTFFLLSADTQTPQRTDEWNKALLPWTTRTVEGSDDSKIKALNYPTPYRIRPIPPTDS
PRSTGVDEYNMGDVLDTPIVATGKNGIAQGNRQEFLITAANDGMVYLFQSNPDVNAKNPYSLKLNYLPATMPRQTLTDTH
ATHYKDLTHEGYAKDSARPHLFMINGGITVRTTSAKSNRQNTFMVGALGQGGRGMYALNIGGKALDSGNDIGLNAESSTW
NTSVPLYESNPNDNNTKFSSLGYTVGYPQIGRIAKYNTGGSLSDETLQTGVYYASFLSSGFSYPNGNQETALYIQDALGV
DVGGGASTAAKTNNNPKGKLLAKVVVKDGVGGLASPTLLDVNRDGVYDFAFAGDYGGNMYRFDLRKIDLSNGTVTDGAVK
KIYSGSTTQPITSAPAISMQKNGRYVVAFGTGSDMYASDYNSTAEQAVYGIYQRFNERLDPVNINDPAAVEAEKPLTAEN
LLEQTLTTATAKMAEGTPDEKEVTVRKGSTNAIESTSDKSKLAYDGWKVRLGNTTSVQDGERVVVTPTVIFGTLILSTRI
YKEKPVATANNNPWSESNWQADGWVKVREDLDQNRTESPWSGWKEISTNSTSTTSSGDICTGTVNTSTKMQERERKVNYE
TITTYEKVQPSENQSSGWILVLNATNGGAVKVSSGTGVDFLGKYDASNFNPNDTAHSNTYFTVGLYSSSGLPNHTLLTND
SNVHSLGSAMTRDGEARTNGEDGDLTPGGEKAKDDCFNNTDGNYVLTGNTATGIGATFSVVGRRCGKKIQIRRLSWREIF

Nucleotide


Download         Length: 4083 bp        

>NTDB_id=430291 HBA47_RS05855 WP_154700189.1 1088602..1092684(+) (pilC/pilC1) [Kingella kingae strain ATCC 23332]
ATGAGTCTTAAATATAAAAAATATAATTGCAATCTTAAACCATTAAGCTTAGGGGTATCACTTGCCTTGTCCAGCTTTGC
GACAATGGCGGCTGATTCACCATTTGCAATTGCACCACAACATTTATCCAGTACGGCAACTGTAAAAAAAACCACAGTAA
CGACACCGTATGCCGATATTTTCAAACAGCAACAATCAATTACAAATACAACAGTGCGTGGTATGCAGGGTGCTAAACCC
AATATTATGCTACTATTGGATGACTCAGGAAGTATGGGTGCTCAAGTTCCAGGCTCTGGTCGAACACGACAACAAATATT
GCAAAGTTCCTTATCACAAGTAGTTGCTAAGTATGGTAATCAAATTAACTGGGGGCTTATTGCATTTAATGATCAAAGTT
CCGCATTTAACCTTCCATTAGGGACAAATTATTCAACCGTTGTTAATTCAATTAAAAGATTTCCTGCTAATGGTCTGACA
CCAACTATTACCAGCTACATAAAGGCTGTAGATACACTAAATAAAGGTATTCAATACCGTTGCCAAAAAAGTTATTTAGT
TATGTTATCTGATGGCGATTCTAACTATCCTATGTATTTTGAGCGAGAAGCAGGAGTTAATGCTAGACCTGGTCAGATGC
TTACTAGCTATTTAAGATATCCACACTGGGTTTATATTAACAGAGAAATTAATAAATACGATTTTTGGCGTGAACTCTTA
ACAAAAGATGGTAAATTAATTGATAGAAATTCATCCAAATATCGTGGAATGTGCAGTACTGCTTATCAAAAATCTAATTC
GAATGTTAAAAACTGGTGCAGTGCCTTACCTCAACAATTTGTTTATTATCCATATAGTGGTATTTCTGACCCTGATGATA
CTACCACCTCTGCTTATCGAATAAATTATATGAATACTTGGTCAGGTATTTTCGCAAAAAAATATACCGAATACCCATTG
CGGGATTTAACTAATGATTTAGATTTTTTGGATAGTGGTTCTGCATACTATACAAATACAAACCTACCTTCATCAGGATT
TGGAAGATATAAAAAATTTTTCTTCAACAATCAACCTATTTTGGCTCCCCAGACCCAATATGCGGGCGCGATGTTGTCAT
TTTTTAGCGAGCCAATCAACAAAATTGACCTACGAATAGGTGGTAATGATGCTACAGGTAAATCATGGGATGACTCAGCT
TTTCCAACACAAAACATTACAACCTTTACTATTGGTTTTGGTAATGGTCTGAGTGAGAGTGGTTTGGCATTATTGCGCGG
TGGTGCCAGCAATATTAAATTAAAAGATGGAACAGTACAAAAAGCGTACTATTCCGCTAGCGATCAAACAGCATTGCAAA
AGGCATTTGATGCTATTTTGGCGCAAGTTGAAGCGGAAAATCCACCACCTAGCAATAGTAATCCTATTGATTTGCCTCCC
ACTACTAATACCTTGCCAATTTCTGCACCTACGGCAGGTGTACCTAATCGCCAGCAAAGTGTGATGGCGGAGACGTCTTA
TGCCACTTCTCCTCCTAGTGCAACAGGTGCTAGCAATAGTATTCCTAATATGGCGGCATCGGTGTTTGTCCCTGCAGGGC
TAAAAAGTAGCGAATTGCGTTTTTATGCTTTAAATAGCAAAGCGCAAGCAGATACTTCGTCTTGGCTTGTGGCTGACTTT
TCGCAACGTAAAGCGTTAATCAGGACAAGTTCGGGTGTACGATGGGCAAATGATGCAAACTTGGATAATACGTTCTTTTT
ATTGTCTGCTGATACGCAAACGCCTCAACGGACTGATGAGTGGAATAAGGCATTATTGCCTTGGACAACTCGTACTGTTG
AAGGCAGCGATGACAGCAAAATTAAAGCGTTGAATTACCCAACGCCTTATCGTATTCGTCCAATTCCACCTACTGATAGT
CCACGTAGTACTGGCGTTGATGAATATAATATGGGCGATGTGTTGGATACACCGATTGTTGCGACTGGTAAAAATGGCAT
TGCGCAAGGTAATCGACAAGAGTTTTTGATTACAGCGGCCAATGATGGCATGGTGTACTTGTTCCAAAGTAATCCTGATG
TCAATGCAAAAAATCCGTATTCATTAAAATTGAATTACTTACCTGCGACAATGCCGCGCCAAACATTGACTGATACTCAT
GCTACTCATTACAAAGATTTGACACACGAGGGTTATGCAAAAGATTCTGCGCGTCCACATTTATTTATGATTAATGGTGG
TATTACGGTTCGTACGACTTCAGCTAAATCCAATCGTCAAAACACGTTTATGGTGGGGGCTTTGGGACAAGGCGGACGTG
GTATGTACGCACTCAATATTGGCGGTAAGGCTTTGGATTCTGGTAACGACATCGGCTTAAATGCCGAAAGTAGTACATGG
AATACGTCCGTGCCTTTATATGAGTCTAATCCAAATGATAATAATACGAAGTTTTCTTCATTAGGCTATACCGTTGGTTA
TCCACAAATTGGTCGTATTGCTAAATATAACACTGGTGGCTCTTTATCTGATGAAACCTTACAAACAGGCGTTTATTATG
CTTCTTTCTTATCAAGTGGCTTTTCTTATCCTAATGGTAATCAGGAAACCGCTCTGTATATTCAAGATGCGTTGGGTGTA
GATGTTGGTGGTGGTGCTAGTACTGCTGCCAAAACAAACAACAATCCAAAAGGTAAATTATTGGCAAAAGTAGTGGTTAA
AGACGGCGTGGGTGGTTTGGCATCGCCAACCTTACTTGATGTGAATCGAGATGGTGTGTATGACTTTGCCTTTGCGGGAG
ATTATGGTGGTAATATGTATCGCTTTGATTTGCGTAAAATTGATTTGTCTAATGGTACTGTTACCGACGGCGCTGTGAAG
AAAATTTATTCTGGTTCTACAACACAACCAATTACAAGTGCTCCTGCTATTTCCATGCAAAAAAATGGACGGTATGTTGT
GGCATTTGGTACGGGTAGTGATATGTATGCTAGCGATTATAATTCAACCGCTGAGCAAGCTGTATATGGTATTTATCAAC
GATTCAACGAGCGTTTGGATCCTGTTAATATTAATGATCCTGCTGCTGTTGAAGCTGAAAAACCTTTAACAGCTGAGAAT
TTATTGGAGCAAACATTAACCACAGCGACTGCAAAAATGGCAGAGGGTACTCCTGACGAAAAAGAAGTAACTGTTCGTAA
AGGCTCAACCAATGCAATTGAAAGTACTAGTGATAAATCAAAATTAGCCTATGACGGTTGGAAAGTGCGATTAGGTAACA
CGACATCTGTACAAGATGGTGAACGTGTTGTGGTTACCCCAACTGTGATTTTTGGCACTTTGATTCTTTCAACTCGTATC
TATAAAGAAAAACCAGTTGCTACGGCCAATAATAATCCTTGGTCAGAAAGTAATTGGCAAGCTGATGGTTGGGTTAAAGT
AAGGGAAGATTTAGACCAAAATCGCACAGAATCTCCATGGTCTGGTTGGAAAGAAATCAGCACGAACTCAACATCAACCA
CTAGTTCAGGTGATATTTGTACGGGTACTGTGAATACCAGCACTAAAATGCAAGAGCGCGAAAGAAAGGTTAATTATGAA
ACCATTACCACTTATGAGAAAGTACAGCCAAGTGAGAATCAATCATCTGGATGGATTTTAGTATTGAATGCGACTAATGG
CGGTGCGGTAAAAGTATCATCTGGTACTGGAGTTGACTTTTTAGGTAAGTATGATGCTAGTAACTTTAATCCTAATGATA
CAGCCCATAGTAATACCTATTTTACCGTAGGTTTGTACTCTTCCAGCGGTTTACCAAATCATACATTATTAACCAATGAT
TCTAATGTTCATAGTTTAGGTTCTGCGATGACTCGTGATGGGGAAGCAAGAACCAATGGTGAAGATGGTGATTTAACTCC
TGGTGGTGAAAAGGCAAAGGATGATTGCTTTAATAATACTGACGGTAACTATGTATTGACAGGTAATACGGCAACAGGAA
TTGGCGCAACATTTAGCGTTGTTGGTCGTCGTTGTGGTAAAAAAATACAAATCCGTCGTTTATCTTGGCGTGAGATTTTC
TAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC/pilC1 Kingella kingae strain KK03

96.985

100

0.97