Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC/pilC2   Type   Machinery gene
Locus tag   DQN52_RS00450 Genome accession   NZ_LS483426
Coordinates   85812..90311 (-) Length   1499 a.a.
NCBI ID   WP_003787260.1    Uniprot ID   A0AAX2J107
Organism   Kingella kingae strain NCTC10529     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 80812..95311
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQN52_RS00440 (NCTC10529_00084) - 81460..83577 (+) 2118 WP_080561234.1 M3 family metallopeptidase -
  DQN52_RS00445 (NCTC10529_00085) mnmG 83620..85506 (-) 1887 WP_019389758.1 tRNA uridine-5-carboxymethylaminomethyl(34) synthesis enzyme MnmG -
  DQN52_RS00450 (NCTC10529_00086) pilC/pilC2 85812..90311 (-) 4500 WP_003787260.1 pilus assembly protein Machinery gene
  DQN52_RS11055 - 90552..90815 (+) 264 WP_071461654.1 transposase -
  DQN52_RS00460 (NCTC10529_00087) - 90712..91143 (+) 432 WP_111694396.1 transposase -
  DQN52_RS00465 (NCTC10529_00088) - 91175..91882 (-) 708 WP_003787259.1 SAM-dependent methyltransferase -
  DQN52_RS00470 (NCTC10529_00089) - 92011..93291 (+) 1281 WP_003787257.1 valine--pyruvate transaminase -
  DQN52_RS00475 (NCTC10529_00090) ccoG 93388..94815 (+) 1428 WP_003787256.1 cytochrome c oxidase accessory protein CcoG -

Sequence


Protein


Download         Length: 1499 a.a.        Molecular weight: 164203.13 Da        Isoelectric Point: 8.6238

>NTDB_id=1141172 DQN52_RS00450 WP_003787260.1 85812..90311(-) (pilC/pilC2) [Kingella kingae strain NCTC10529]
MKPASKKRISTQLRQLNKMLWVAFPLLSLPFYAQAENNTQFSNSSLTGIQKTYGPNVTLALSVEFPTAGAAYSTATEFTD
AMMSQTFLGYFDNTKCYKYVAPNDPDGMKAFADNGYSQYSGSDNVSKGKNGPLLAPDGTRALAGKRRPWIVYYDGRRLSI
PDSVRSNNMLYESIRQKEFFLHDSMDAGSDDSMGTSGAVNHRYETKEREYFQPTRTASTSNGMVGVCDGPNEFSGNFMNW
ATMSAIDIFRQAMTGGNRALGVAKDTTAYEAGDTQTQTFLRRANVVREQNAHYMQRTVALSEANIKKVLPHDYAVDNPLL
ADRGKELPSNKFFLYKGYIPHPYAYRIGLNSIGRWEEETQQTSIKAMTGRVILATRRPLLVRNSGFGVDFRRAVYDESHG
GWYHKVFSRDKWSPHTDWWSRVTWIGNINDGQNGRYWAPVKSRVMPYQVTVEACVPGKLEANCVRQPSGSYKPEGLMQQN
ASTMRFAAFGYANIAGNTVSGGVLRSRMRYIANPNKDGATATGPAVKYQEEINPKTGQFYVNPDRNAEETAAQAFVVNTG
DKNLPSSNFDNSGTINYLNKFGDYNNYKTNDPGAELYYTALRYLRNKGFPALYKTKLAEATRSNITREVRDNFPLIADNW
DNPLNVFSDNTTLNVADNVCRPNYIIYIGDTNTHADASLPGGWNTSYDSNVDDDQDIHVRNLINEKIRPNEPRLSWWNDN
WGADASPGGAMAALALWGRTNDLQQNFGKRLSSFMIDTVEDGKFKSENNNYWLAAKYGGFNDLNNNGVPDKGEWEATSAS
DKVSAFEAPAVSNGTPQNFAVANNPTSMVAALNRAFEATATAEAPSATSLGTTSTKPLSERGKTMLLQSVFRDATTVVDG
ARVKVASGDVLGLEATLKGHKLEYEKKWSAGEKLQAAFHSSDNWKKRKVFTRASANSSAVRLADNNSEVAAVVNDSNAVD
LINYALGAKTYENGRKFRIRPNHLMGTVINSPVVTIPSRTGSVSTVAGSCTYPASANISSRANRHVVAANDGMFYVLNSS
GQEIASYMPSTALSQLARYASPNYSHFFMNDGIAATSEVCFDTGNGKGDGKAHTVVVGTAGRGGNSVYALDLTDPTTLEA
KDMLWEFTDAGLGKSLFAPIITHDNTGTPVAIVSGGYNASEDTGYIYILKLNKGANEPWSEDTNNTAPWQNGKNWYRIKL
GKGGVGELFAYQTEARTVAAVYAGDLEGKLWKVSQNAEGRFVAGYVDSANNALPIFKTADNTSIVGAPFAQIVGGKTYVV
FITGRYFNKSDLPSTTKTVQNYAYGIIESKIGDNRTPTGSGALIDDSTNLLQQSVNEKIVPDSVPQTVFYTVTNNQITNQ
HQGWKLKLQKNWLSIDKSAIRGKRVAEFSAVNPLATDVASAGNMCTENGSTYSLSVNVFNGGVYNKPIYDTNGDGKFTEA
DTLVSVAGQAGILTRLTEVSTEFGRIVGGINSMGNMIQMPKDNITSDPVVKRVSWREIF

Nucleotide


Download         Length: 4500 bp        

>NTDB_id=1141172 DQN52_RS00450 WP_003787260.1 85812..90311(-) (pilC/pilC2) [Kingella kingae strain NCTC10529]
ATGAAGCCAGCAAGCAAAAAGCGCATTTCAACGCAACTGCGCCAGCTCAATAAAATGCTGTGGGTAGCCTTTCCTTTGCT
ATCCTTGCCTTTTTATGCCCAAGCAGAAAACAATACCCAGTTTTCTAATTCATCATTAACAGGTATTCAAAAAACCTACG
GACCAAACGTAACATTGGCTTTGTCGGTGGAGTTTCCAACTGCAGGTGCGGCGTATTCTACAGCAACCGAATTTACTGAT
GCCATGATGAGCCAAACCTTTTTGGGCTATTTCGACAACACCAAATGCTATAAATACGTTGCGCCAAATGACCCAGACGG
TATGAAGGCGTTTGCTGATAATGGTTATAGTCAATACAGTGGCTCGGATAATGTATCCAAGGGTAAGAATGGACCATTAC
TTGCGCCCGATGGTACGCGAGCGTTGGCGGGTAAACGGCGCCCATGGATAGTCTATTATGATGGACGCAGACTCAGTATT
CCGGATTCAGTACGCAGCAACAATATGCTTTATGAATCAATACGACAGAAAGAATTCTTCTTGCATGATAGTATGGATGC
TGGTTCGGATGATAGTATGGGTACCAGTGGTGCAGTTAATCATAGATATGAAACGAAAGAGCGAGAATATTTCCAACCAA
CACGAACCGCCAGTACAAGCAATGGCATGGTGGGTGTTTGTGATGGTCCAAATGAATTTAGTGGTAACTTCATGAACTGG
GCAACCATGTCGGCGATTGATATTTTCCGTCAAGCGATGACAGGTGGTAACCGTGCGCTGGGTGTTGCAAAAGACACAAC
CGCTTATGAGGCAGGCGATACGCAAACGCAAACGTTCTTGCGCCGTGCTAATGTGGTGCGTGAACAAAATGCGCACTATA
TGCAGCGTACAGTGGCTTTAAGTGAAGCCAATATTAAAAAAGTCTTGCCACATGATTATGCAGTAGATAATCCTCTTTTG
GCGGATCGTGGTAAGGAATTGCCAAGTAATAAGTTTTTCTTATACAAGGGTTATATTCCACACCCATACGCATATCGTAT
TGGGTTGAATTCAATAGGGCGGTGGGAAGAGGAAACTCAACAAACCAGCATTAAGGCAATGACAGGTAGAGTTATATTGG
CGACTCGTCGCCCATTGCTTGTACGTAACTCTGGTTTTGGCGTGGATTTCCGCCGTGCTGTTTATGACGAATCACATGGG
GGTTGGTATCATAAAGTGTTTAGTCGGGATAAGTGGAGTCCACACACCGATTGGTGGAGTCGTGTTACTTGGATTGGTAA
CATCAACGATGGTCAAAATGGTCGCTATTGGGCGCCTGTTAAATCGCGTGTAATGCCTTATCAAGTAACCGTAGAAGCGT
GTGTGCCAGGCAAATTGGAAGCCAACTGCGTGCGTCAGCCATCTGGTTCGTATAAGCCAGAAGGTTTGATGCAACAAAAC
GCGTCAACCATGCGTTTTGCAGCATTTGGTTACGCCAATATTGCAGGTAATACCGTGAGTGGTGGTGTGTTACGTAGCCG
TATGCGTTATATCGCCAATCCAAATAAAGATGGGGCAACGGCAACAGGACCAGCTGTAAAATACCAAGAAGAGATTAACC
CAAAAACAGGTCAGTTCTATGTGAACCCAGACCGCAATGCAGAAGAAACAGCAGCGCAGGCATTTGTGGTTAATACAGGC
GATAAAAACTTGCCATCTTCTAATTTTGACAATTCGGGCACGATTAATTATCTGAACAAGTTTGGTGATTATAACAACTA
CAAAACCAATGACCCTGGTGCAGAGTTGTATTACACCGCATTGCGCTATTTGCGTAATAAAGGTTTCCCAGCGTTGTACA
AAACCAAGTTAGCAGAAGCGACTCGTAGCAATATTACAAGAGAGGTTCGCGATAACTTCCCATTGATTGCCGACAATTGG
GATAACCCATTAAACGTGTTCAGCGACAACACTACTTTGAATGTAGCGGATAACGTGTGTCGTCCAAACTATATTATCTA
TATTGGTGATACCAATACCCACGCTGATGCTTCATTGCCTGGCGGTTGGAATACGTCATACGATTCCAATGTGGATGACG
ACCAAGACATTCATGTGCGTAACTTGATTAACGAGAAAATCAGACCAAACGAACCTAGGTTGAGTTGGTGGAATGACAAC
TGGGGTGCAGATGCATCACCAGGTGGTGCAATGGCGGCATTGGCGTTGTGGGGTCGAACCAACGATTTGCAGCAGAATTT
TGGTAAGCGTTTAAGTAGCTTTATGATTGATACGGTTGAGGATGGTAAATTTAAATCTGAAAATAATAATTATTGGTTGG
CAGCCAAATACGGTGGCTTCAACGATTTAAATAATAATGGTGTACCTGATAAAGGCGAATGGGAAGCAACTTCTGCTTCG
GATAAAGTGTCGGCGTTTGAAGCACCAGCGGTGTCTAATGGTACGCCACAGAACTTTGCGGTAGCCAACAACCCAACTTC
AATGGTAGCAGCATTGAACCGTGCGTTTGAAGCCACAGCCACAGCCGAAGCACCTTCTGCAACCAGCTTGGGCACAACTA
GCACCAAACCTTTGTCAGAACGCGGTAAAACTATGTTGTTGCAATCCGTGTTCCGCGACGCAACCACTGTGGTAGATGGT
GCGCGTGTTAAAGTAGCATCGGGCGATGTGTTGGGATTGGAAGCAACCTTAAAAGGTCATAAGTTGGAGTACGAGAAAAA
ATGGTCGGCTGGCGAAAAATTGCAGGCTGCATTCCATAGTTCAGATAACTGGAAAAAACGCAAAGTGTTTACTCGTGCCA
GTGCCAATAGCAGTGCAGTTCGTTTGGCAGACAACAATTCTGAGGTTGCAGCAGTCGTGAATGATTCAAATGCAGTAGAT
TTGATTAACTACGCTTTGGGTGCGAAAACGTATGAAAATGGCAGAAAATTCCGTATTCGTCCAAATCATTTGATGGGTAC
AGTGATTAACTCGCCAGTAGTAACCATTCCATCACGCACAGGCAGCGTTAGCACCGTGGCAGGTAGCTGTACTTATCCTG
CATCTGCCAATATTAGCTCGCGTGCAAATCGCCATGTGGTAGCAGCCAATGATGGTATGTTCTATGTGTTGAACTCGTCA
GGTCAGGAAATTGCATCGTATATGCCATCAACTGCGTTGAGTCAGTTGGCACGTTATGCGTCGCCAAATTACAGCCATTT
CTTTATGAATGACGGTATTGCCGCTACTTCTGAAGTATGTTTTGACACGGGTAATGGCAAGGGTGATGGCAAGGCGCATA
CGGTAGTAGTTGGCACAGCAGGTCGCGGTGGTAACAGCGTGTATGCGTTGGATTTGACTGACCCAACTACTTTGGAAGCC
AAAGATATGCTGTGGGAATTCACAGATGCTGGTTTGGGTAAGAGCTTGTTTGCACCAATTATCACGCATGACAATACAGG
CACACCTGTGGCGATTGTGAGTGGTGGTTATAACGCAAGCGAAGACACTGGTTACATCTATATTCTGAAATTGAACAAGG
GTGCAAATGAACCATGGAGTGAAGATACCAACAATACTGCCCCATGGCAAAATGGTAAAAACTGGTATCGCATCAAGTTG
GGTAAAGGTGGTGTAGGTGAATTGTTTGCTTACCAAACCGAAGCACGCACCGTTGCAGCTGTGTATGCAGGCGATTTAGA
AGGCAAGCTGTGGAAGGTATCGCAAAATGCTGAAGGTCGCTTTGTAGCAGGTTATGTGGATAGCGCAAATAATGCGTTGC
CTATCTTCAAAACAGCAGACAACACGTCTATTGTGGGTGCGCCTTTTGCACAAATCGTAGGTGGTAAAACGTATGTTGTG
TTTATTACTGGTCGCTACTTCAACAAGAGCGATTTGCCTAGTACAACCAAAACAGTACAAAACTACGCATACGGCATTAT
CGAGAGCAAAATCGGCGATAACCGCACGCCAACTGGTAGCGGTGCATTGATTGACGATAGCACTAACTTGTTGCAACAAA
GCGTGAATGAGAAAATTGTTCCAGATAGCGTGCCACAAACTGTGTTCTATACGGTTACGAATAACCAAATTACCAACCAG
CATCAAGGTTGGAAGTTGAAACTGCAAAAAAACTGGCTGAGTATCGATAAGAGTGCGATTCGCGGCAAACGTGTTGCAGA
GTTTAGCGCAGTGAATCCGTTGGCGACCGATGTGGCTAGTGCTGGTAATATGTGTACCGAAAACGGTTCGACTTATTCGT
TGTCAGTTAATGTGTTCAATGGTGGTGTGTACAACAAACCTATCTATGACACAAACGGCGATGGTAAGTTTACCGAAGCA
GATACTTTGGTATCTGTGGCTGGTCAGGCTGGTATTTTGACTCGCTTAACCGAAGTTTCTACCGAGTTCGGTCGTATTGT
TGGCGGTATCAACAGTATGGGTAACATGATTCAAATGCCAAAAGACAATATTACCAGCGACCCTGTGGTGAAACGTGTAT
CTTGGCGCGAGATTTTCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC/pilC2 Kingella kingae strain KK03

89.214

100

0.894


Multiple sequence alignment