Detailed information    

In silico: bioinformatically predicted

Overview


Name: pilC/pilC2
Type: Machinery gene
Locus tag: KKKWG1_RS03000
Genome accession: NZ_LN869922
Coordinates: 575657..580177 (+)
Length: 1506 a.a.
NCBI ID: WP_060536962.1
UniProt ID: -
Organism: Kingella kingae strain KWG1
Function: assembly of type IV pilus (predicted from homology)
DNA binding and uptake
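
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon (the nucleotide record ends in TAA); the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length(575657, 580177)
print(cds_bp)                  # 4521
print(protein_length(cds_bp))  # 1506
```

Both values agree with the sizes reported for KKKWG1_RS03000: 4521 bp and 1506 a.a.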

Genomic Context


Location: 570657..585177
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
KKKWG1_RS02980 (KKKWG1_0608) | - | 571212..571799 (-) | 588 | WP_019389762.1 | FixH family protein | -
KKKWG1_RS02985 (KKKWG1_0609) | ccoG | 571803..573230 (-) | 1428 | WP_019389761.1 | cytochrome c oxidase accessory protein CcoG | -
KKKWG1_RS02990 (KKKWG1_0610) | - | 573327..574607 (-) | 1281 | WP_038305996.1 | valine--pyruvate transaminase | -
KKKWG1_RS02995 (KKKWG1_0611) | - | 574737..575444 (+) | 708 | WP_060536961.1 | SAM-dependent methyltransferase | -
KKKWG1_RS03000 (KKKWG1_0612) | pilC/pilC2 | 575657..580177 (+) | 4521 | WP_060536962.1 | PilC/PilY family type IV pilus protein | Machinery gene
KKKWG1_RS03005 (KKKWG1_0613) | mnmG | 580482..582368 (+) | 1887 | WP_019389758.1 | tRNA uridine-5-carboxymethylaminomethyl(34) synthesis enzyme MnmG | -
KKKWG1_RS03010 (KKKWG1_0614) | - | 582411..584528 (-) | 2118 | WP_019389757.1 | M3 family metallopeptidase | -
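
The displayed location 570657..585177 equals the gene coordinates extended by 5000 bp on each side. The sketch below reproduces that window and checks which neighbouring genes fall entirely inside it. The 5000 bp flank is an assumption inferred from the numbers on this page, not a documented setting, and the helper names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from 575657 - 570657 and 585177 - 580177


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple:
    """Extend a 1-based inclusive range by `flank` bp per side, clamped at 1."""
    return max(1, start - flank), end + flank


def within(window: tuple, feature: tuple) -> bool:
    """True if the feature lies entirely inside the window."""
    return window[0] <= feature[0] and feature[1] <= window[1]


window = context_window(575657, 580177)
print(window)  # (570657, 585177)

# Coordinates copied from the genomic context table.
neighbours = {
    "KKKWG1_RS02980": (571212, 571799),
    "KKKWG1_RS02995": (574737, 575444),
    "KKKWG1_RS03005": (580482, 582368),
    "KKKWG1_RS03010": (582411, 584528),
}
for locus_tag, coords in neighbours.items():
    print(locus_tag, within(window, coords))
```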

Sequence


Protein


Length: 1506 a.a.; Molecular weight: 165018.76 Da; Isoelectric point: 8.0479

>NTDB_id=1114872 KKKWG1_RS03000 WP_060536962.1 575657..580177(+) (pilC/pilC2) [Kingella kingae strain KWG1]
MKPASKKRISTQLRQLNKMLWVAFPLLSLPFYVQAENKTSFSDSSLTGIQKTYGPNVTLALSVEFPTAGAAYSTATEFTN
AMMSQTFLGYFDNTKCYEYVAQNDPTGMLAFTANGYRQDSGSDNVSKDKSGKPLLAPDGTQALAGKRRPWIVYYEGHRLS
VPDSIRSNNMLYESIRQKEFFLYDRINTNSYDDDDSMGTSGAVNHKYETNEREYFRPTRTASTSNGMVGVCDGPNEFSGN
FMNWATMSAIDIFRQAMTGGNRALGVAKDTTAYEAGDTPTRTFLRRANVVREQNAHYMQRTVALSEENIKKVLPHDYAVD
RPNPILADRGQESSSNKFFLYKGYIPHPYAYRIWLNSRGLWEDETKQTTRKAMMGQVTLATSRPLLVRNSGFGVDFRRAV
HDTSNSGWYNNAFRRDDWSPRTNWWSGDVTRIGNINDGQNGRYWAPVKSRVMPYQVTVEACVPGKLEANCVLQPSGSYKP
EGLMQQNASTMRFAAFGYANIAGNTVSGGVLRSRMRYIANPNKDGATATGPAVKYQEEIDPRTGQFYLNPDRNAEGSAQQ
AFTVNTDKKSPSPNFDNSGTINYLNKFGDYNNYKTNDPGAELYYTALRYLRHAGFPALYKTKLAEAAGNEGITKNVRDNF
PLITDNWDNPLDVFSNSTTLNVADNVCRPNYIIYIGDTNTHADASLPGGGNTPYDSNVDDDRDIHVRNLINEKIRSNEPR
LSWWNGNWGADASPGGAMAALALWGRTNDLQQNFGKRLSSFMIDTVEDGKFKPENNNYWLAAKYGGFNDLNNNGVPDKGE
WEATSASDKVSAFEAPAVSNGTPQNFAVANNPTSMVAALNRAFEATATAEAPSATSLGTTSTKPLSERGKTMLLQSVFRD
ATTVVNGARVKVASGDVLGLEAILKGHKLEYEKKWSAGEKLQAAYHNANGWTNRKVFTRANANSGAVRLVGNNSAVTAAV
GGSDAADLVNYALGAPTYEDGRKFRIRPNHLMGTVINSPVVTIPSRTGSVSTVAGSCTYPAYANIGSRPNRHVVAANDGM
FYVLDNAGNEIASYMPSTALSQLARYASPNYSHFFMNDGIAATSEVCFDTGNDKGNGKAHTVVVGTAGRGGNSVYALDLT
NPTDLTESDMLWEFTHDGLGKSLFAPIITHDSTGTPVAIVSSGYNAKEDNGYVYVLKLNKPVGSPWVEDTHPTKPWGNNG
NWYRIKLGAGGVGELFAYQNEARTVAAVYGGDLEGKLWKVSQNAEGRFVAGYVDSANNTLPIFKTADKTSIVGAPFAQIV
GGKTYVVFITGRYFNKSDLPSTTQTVQNYAYGIIESKISDNRNPTASGALIEDGENLLQQHVEEEIVPDNVRQTVFYKVT
NHQITNKHQGWKLKLQKNWLSIDKSAIRGKRVAEFSAVNPLATDVASADNMCTENGSTYSLSVNVFNGGVYNKPIYDTNG
DGKFTEADTLVSVAGQAGILTRLTEVSTEFGRIVGGINSMGNMIQMPKDNITSDPVVKRVSWREIF
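
The FASTA header of this record packs several fields into one line. A minimal sketch of how it could be split with a regular expression follows; the field layout is inferred from this single header, so the pattern is an assumption and may not hold for every record in the database. The names `HEADER_RE` and `parse_header` are illustrative.

```python
import re

# Layout assumed from this record's header:
# >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; numeric fields become ints."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not follow the assumed layout")
    fields = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">NTDB_id=1114872 KKKWG1_RS03000 WP_060536962.1 "
    "575657..580177(+) (pilC/pilC2) [Kingella kingae strain KWG1]"
)
record = parse_header(header)
print(record["locus_tag"], record["start"], record["end"], record["strand"])
```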

Nucleotide


Length: 4521 bp

>NTDB_id=1114872 KKKWG1_RS03000 WP_060536962.1 575657..580177(+) (pilC/pilC2) [Kingella kingae strain KWG1]
ATGAAGCCAGCAAGCAAAAAGCGCATTTCAACGCAACTGCGCCAGCTCAATAAAATGCTGTGGGTAGCCTTTCCTTTGCT
ATCCTTGCCTTTTTATGTACAAGCAGAAAACAAAACCTCTTTTTCTGATTCATCATTAACAGGTATTCAAAAAACCTACG
GCCCAAACGTAACATTGGCTTTGTCGGTGGAGTTTCCAACCGCAGGTGCGGCGTATTCTACAGCAACCGAATTTACCAAT
GCCATGATGAGCCAAACCTTTTTGGGCTATTTCGACAACACCAAATGCTATGAATATGTTGCGCAAAATGACCCAACTGG
CATGCTAGCGTTTACTGCTAATGGTTATAGGCAAGACAGTGGCTCGGATAATGTATCCAAGGATAAGTCTGGTAAACCAT
TACTTGCGCCCGATGGTACGCAAGCGTTGGCGGGTAAACGGCGCCCATGGATAGTCTATTATGAGGGACACAGGCTCAGT
GTTCCGGACTCAATACGCAGCAACAATATGCTTTATGAATCAATACGACAGAAAGAATTCTTCTTGTATGATAGGATTAA
TACCAATTCGTATGATGATGATGATAGTATGGGTACCAGTGGTGCAGTTAATCATAAATATGAAACGAACGAGCGAGAAT
ATTTCCGTCCAACACGAACCGCCAGTACAAGCAATGGCATGGTCGGTGTTTGTGATGGTCCAAATGAATTTAGTGGTAAC
TTCATGAACTGGGCAACCATGTCGGCGATTGATATTTTCCGTCAAGCGATGACAGGTGGTAACCGTGCGCTGGGTGTTGC
AAAAGACACAACCGCTTATGAGGCAGGCGATACGCCAACGCGAACGTTCTTGCGCCGTGCTAATGTGGTGCGTGAACAAA
ATGCGCACTATATGCAGCGTACAGTGGCTTTAAGTGAAGAGAATATCAAAAAAGTCTTGCCACACGATTATGCAGTAGAT
AGACCCAATCCTATTTTGGCAGACCGTGGTCAAGAATCGTCAAGTAATAAGTTTTTCTTATACAAGGGTTATATTCCACA
CCCATATGCATATCGTATTTGGTTGAATTCAAGAGGGCTGTGGGAAGATGAAACTAAACAAACCACACGTAAGGCAATGA
TGGGCCAAGTTACATTGGCGACTAGTCGTCCATTGCTTGTACGTAACTCTGGTTTTGGCGTGGATTTCCGCCGTGCTGTT
CATGACACATCAAATAGTGGTTGGTATAATAATGCGTTTAGGCGGGATGATTGGAGTCCACGCACCAATTGGTGGAGTGG
TGATGTTACTCGGATTGGTAACATCAACGATGGTCAAAATGGTCGCTATTGGGCGCCTGTTAAATCGCGTGTAATGCCTT
ATCAAGTGACCGTAGAAGCGTGTGTGCCAGGCAAATTGGAAGCCAACTGCGTGCTTCAGCCATCTGGTTCGTATAAGCCA
GAAGGTTTGATGCAACAAAACGCGTCAACCATGCGTTTTGCAGCATTTGGTTACGCCAATATTGCAGGTAATACCGTGAG
TGGTGGTGTGTTACGTAGCCGTATGCGTTATATCGCCAATCCAAATAAAGATGGAGCAACGGCAACAGGACCAGCTGTAA
AATACCAAGAAGAGATTGACCCTCGTACAGGTCAATTTTATTTGAATCCAGACCGCAACGCCGAAGGTTCAGCGCAGCAA
GCGTTTACTGTTAATACAGATAAAAAATCGCCATCTCCTAATTTTGACAATTCGGGCACGATTAATTATCTGAACAAGTT
TGGTGACTACAACAACTATAAAACCAATGACCCTGGCGCAGAGTTGTATTACACCGCATTGCGCTATTTGCGTCATGCAG
GTTTCCCAGCGTTGTACAAAACCAAGTTAGCAGAAGCGGCTGGTAACGAAGGGATTACAAAAAATGTTCGCGATAACTTC
CCATTGATTACCGACAATTGGGATAACCCATTAGACGTGTTCAGCAACAGCACTACTTTGAATGTAGCGGATAACGTGTG
TCGTCCAAACTATATTATCTATATTGGTGATACCAATACCCACGCCGATGCTTCATTGCCTGGCGGTGGGAATACGCCAT
ACGATTCCAATGTGGATGACGACCGAGACATTCATGTGCGTAACTTGATTAACGAGAAAATCAGATCAAACGAACCTAGG
TTGAGTTGGTGGAATGGCAACTGGGGTGCAGATGCATCACCAGGTGGTGCAATGGCGGCATTGGCGTTGTGGGGTCGAAC
CAACGATTTGCAGCAGAATTTTGGTAAGCGTTTAAGTAGCTTTATGATTGATACGGTTGAGGATGGTAAATTTAAACCTG
AAAATAATAACTATTGGTTGGCAGCCAAATACGGTGGCTTCAACGATTTAAATAATAATGGTGTACCTGATAAAGGCGAA
TGGGAAGCAACTTCCGCTTCGGATAAAGTGTCGGCGTTTGAAGCACCAGCGGTGTCTAATGGTACGCCACAGAACTTTGC
GGTAGCCAACAACCCAACTTCAATGGTAGCAGCATTGAACCGTGCGTTTGAAGCCACAGCCACAGCCGAAGCACCATCTG
CAACCAGCTTGGGCACAACTAGCACCAAACCTTTGTCAGAACGCGGTAAAACTATGTTGTTGCAATCCGTGTTCCGCGAC
GCAACCACTGTGGTAAATGGTGCGCGTGTTAAAGTGGCATCGGGCGATGTGTTGGGATTGGAAGCAATCTTAAAAGGTCA
TAAGTTGGAGTACGAGAAAAAATGGTCGGCTGGCGAAAAATTGCAGGCTGCTTATCACAATGCAAATGGTTGGACAAATC
GTAAGGTGTTTACTCGCGCTAACGCCAATAGCGGTGCAGTTCGTTTGGTAGGCAACAATTCTGCGGTTACAGCAGCCGTG
GGTGGTTCAGACGCAGCAGATTTGGTTAATTACGCTTTGGGTGCACCAACGTATGAAGATGGCAGAAAATTCCGTATTCG
TCCAAACCATTTGATGGGTACGGTGATTAACTCGCCAGTAGTAACCATTCCATCACGCACAGGCAGCGTTAGCACCGTGG
CAGGTAGCTGTACTTATCCTGCATATGCCAATATTGGCTCGCGTCCTAATCGTCATGTGGTAGCAGCCAATGATGGTATG
TTCTATGTGTTGGATAACGCAGGTAATGAGATTGCATCGTATATGCCATCAACTGCGTTGAGTCAGTTGGCACGTTATGC
GTCGCCAAATTACAGCCATTTCTTTATGAATGACGGTATTGCCGCTACTTCTGAAGTATGTTTTGACACGGGTAATGACA
AGGGTAATGGCAAGGCGCATACGGTAGTAGTTGGCACAGCAGGTCGTGGTGGTAACAGCGTGTATGCGTTGGATTTGACC
AACCCAACAGACTTGACTGAAAGTGATATGCTGTGGGAGTTTACGCACGACGGCTTGGGTAAGAGCTTGTTTGCGCCAAT
CATCACGCACGATAGCACTGGTACACCTGTGGCGATTGTGAGCAGTGGCTATAATGCAAAAGAAGACAATGGCTATGTGT
ATGTGCTGAAATTGAACAAACCTGTTGGCAGTCCTTGGGTTGAAGATACACACCCAACCAAACCATGGGGCAATAACGGT
AACTGGTATCGCATTAAACTGGGTGCTGGTGGTGTAGGCGAATTGTTTGCTTACCAAAATGAAGCACGCACCGTTGCAGC
TGTGTATGGTGGCGATTTAGAAGGCAAGCTGTGGAAGGTATCGCAAAATGCTGAAGGTCGCTTTGTAGCGGGTTATGTGG
ATAGCGCAAATAACACGTTGCCTATCTTCAAAACAGCAGACAAGACGTCTATTGTGGGTGCGCCTTTTGCACAAATCGTA
GGTGGTAAAACGTATGTTGTGTTTATTACTGGTCGCTACTTCAACAAGAGCGATTTGCCTAGCACAACCCAAACAGTACA
AAACTACGCATACGGCATTATCGAGAGCAAAATCAGCGATAACCGCAATCCAACTGCTAGCGGTGCATTGATTGAAGACG
GTGAGAACTTGTTGCAACAACATGTGGAAGAGGAAATTGTTCCAGACAACGTGCGACAAACCGTGTTCTATAAGGTTACG
AATCACCAAATTACCAACAAGCATCAAGGTTGGAAGTTGAAACTGCAAAAAAACTGGCTGAGTATTGATAAGAGTGCGAT
TCGCGGCAAACGTGTTGCAGAGTTTAGCGCAGTGAATCCGTTGGCGACCGATGTGGCTAGCGCCGATAATATGTGTACCG
AAAACGGTTCGACTTATTCGTTGTCAGTTAATGTGTTCAATGGTGGTGTGTACAACAAACCTATCTATGACACAAATGGC
GATGGTAAGTTTACCGAAGCGGATACTTTGGTATCTGTGGCTGGTCAGGCTGGTATTTTGACTCGCTTAACCGAAGTTTC
TACTGAGTTTGGTCGTATTGTTGGCGGTATCAACAGTATGGGTAACATGATTCAAATGCCAAAAGACAATATCACCAGCG
ACCCTGTGGTGAAACGTGTATCTTGGCGCGAGATTTTCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
pilC/pilC2 | Kingella kingae strain KK03 | 90.975 | 100 | 0.91
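
The Ha-value listed here is numerically consistent with the product of fractional identity and fractional coverage, rounded to two decimals. The sketch below shows that arithmetic. The formula is an assumption inferred from this one row, not a definition taken from the page, and `ha_value` is a hypothetical helper.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed score: (identity / 100) * (coverage / 100), rounded to 2 decimals."""
    return round((identity_pct / 100.0) * (coverage_pct / 100.0), 2)


# Values from the pilC/pilC2 hit in Kingella kingae strain KK03.
print(ha_value(90.975, 100))  # 0.91
```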

