Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC/pilC1   Type   Machinery gene
Locus tag   DQN52_RS04775 Genome accession   NZ_LS483426
Coordinates   884492..888562 (+) Length   1356 a.a.
NCBI ID   WP_032827588.1    Uniprot ID   A0AAX2J411
Organism   Kingella kingae strain NCTC10529     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 873421..884050 884492..888562 flank 442


Gene organization within MGE regions


Location: 873421..888562
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQN52_RS04700 (NCTC10529_00924) - 873421..873582 (-) 162 WP_050792387.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  DQN52_RS04705 (NCTC10529_00925) - 873691..874146 (-) 456 WP_003785447.1 hypothetical protein -
  DQN52_RS04710 (NCTC10529_00926) - 874180..875308 (-) 1129 Protein_886 ATP phosphoribosyltransferase regulatory subunit -
  DQN52_RS04715 (NCTC10529_00927) metF 875426..876259 (-) 834 WP_003785451.1 methylenetetrahydrofolate reductase [NAD(P)H] -
  DQN52_RS04720 (NCTC10529_00928) - 876283..876951 (-) 669 WP_003785452.1 HAD family phosphatase -
  DQN52_RS04725 (NCTC10529_00929) hda 876948..877616 (-) 669 WP_003789459.1 DnaA regulatory inactivator Hda -
  DQN52_RS04730 - 877856..878736 (-) 881 Protein_890 LysR family transcriptional regulator -
  DQN52_RS04735 (NCTC10529_00932) - 879202..880065 (+) 864 WP_003785460.1 DegV family protein -
  DQN52_RS04740 (NCTC10529_00933) - 880175..881938 (+) 1764 WP_003785462.1 ABC transporter ATP-binding protein/permease -
  DQN52_RS04745 (NCTC10529_00934) - 881988..882251 (+) 264 WP_003789867.1 transposase -
  DQN52_RS04750 (NCTC10529_00935) - 882269..882598 (+) 330 WP_111694420.1 transposase -
  DQN52_RS11155 (NCTC10529_00936) - 882581..883102 (-) 522 WP_331250425.1 IS5 family transposase -
  DQN52_RS11160 (NCTC10529_00937) - 883081..883344 (-) 264 WP_032133164.1 transposase -
  DQN52_RS04765 (NCTC10529_00938) - 883369..883638 (+) 270 WP_071461657.1 transposase -
  DQN52_RS04770 (NCTC10529_00939) - 883613..884186 (+) 574 Protein_898 IS5 family transposase -
  DQN52_RS04775 (NCTC10529_00940) pilC/pilC1 884492..888562 (+) 4071 WP_032827588.1 PilC/PilY family type IV pilus protein Machinery gene

Sequence


Protein


Download         Length: 1356 a.a.        Molecular weight: 147357.34 Da        Isoelectric Point: 8.6220

>NTDB_id=1141184 DQN52_RS04775 WP_032827588.1 884492..888562(+) (pilC/pilC1) [Kingella kingae strain NCTC10529]
MSLKCKKYNCNLKPLSLGVSLALSSFATMAADSPFAIAPQHLSSTATVKKTTVTTPYANILKQQQSVTNTTVRGMQGAKP
NIMLLLDDSGSMRAEVPGSYGQTRQGILRNSLSKIVDKYGSRINWGLVSFNDSSSRYNLSLGTSYLTVANAIRNFPASGT
TPTITSYLKAVNMLNEGIKYRCQKSYVVLLSDGDSNWPMGFWVDAGVRRNTVGDFLYYPHWVYINRELNKYEFWRNLLTK
DGKLINKSSPKYRSMCNAAYRQSNSNAKNWCSELPQNFIYYPYSGVSDPSVNFGPSVRKRELGYWYTESPLRNLTSDLDF
LDSDSAFYTPSKRTAYGLYGMYPIFNYQPILAPQTQYAGEMLSFFSEPINKIDLRTVGNDDTGKSWNDPAFPTQNITTFT
IGFGSGLTSSGLALLKGGASDIKLKDGTVQKAYYPANDQAGLQKAFDSIFDQIERENPPPSNSNPIDLPPTTNTLPILAP
IAGTPNRQQSVMAETSYATSPPSATGASNSIPNMAASVFVPAGLKSSELRFYALNSKAQADTSSWHVADFSQRKALIMTS
SGVRWADNVGNSLNNTFFNLPADTQTPPRTDEWNKALLPWTTRAVTGSDDSKIKDMKYPTPYRIRPNPPTDSPRSTGVDE
YNMGDVLDTPIVATGKNGTAQGNRQEFLITAANDGMVYLFQSNPDANAKNPYSLKLNYLPATMPRQTLFDTHATRYKDLA
DEDYAKDSARPHLFMINGGITVRTTSAKSNSQNTFMVGALGQGGRGMYALNIGGKALDTGKPIGLNAESNTWNTSVPLYE
SNPKDNSADFSSLGYTVGYPQIGRVAIRHNADGSLSDKNLTTGIYYASFLSSGFSYPTNGSQETALYIQDALGVDVGGGA
ATAATANRNPKGKLLAKVVVPDGVGGLASPTLLDVNRDGVYDFAFAGDYGGNMYRFDLRKIDLSNGNVAYGAGAVKKIYS
GSANQPITSAPAISMQKNGRYVVAFGTGSDMYASDYNSTVEQAVYGIYQRFNTSNLDPVNVINDSAAEAEKPLTASDLLE
QTLTTTTATMAEGTPDAKEVIVRKGSTNAIESSSDKSKLAYGGWKVRLGNTTSVQDGERVVVTPTVIFGTLILSTRIYKE
KPVATANNNDWSKDNWQADGWVKVSEKLDQNHQELPWSVWKEISTNSTSTTSSGDVCTGTVNTSTKMQEREKRVTYETIT
TYEKVQPSENQSSGWILVLNATNGGAVKVSSGTGVDFLGKYASNFNPNDTAYSNTYFTVGLYSSSGLPNHTLLTNDSNVH
SLGSAMTRDGEARTNGEDGDLTPGGEKAKDDCFNNTDGNYVLTGNTATGIGATFSVVGRRCGKKIQIRRLSWREIF

Nucleotide


Download         Length: 4071 bp        

>NTDB_id=1141184 DQN52_RS04775 WP_032827588.1 884492..888562(+) (pilC/pilC1) [Kingella kingae strain NCTC10529]
ATGAGTCTCAAATGTAAAAAATATAATTGCAATCTTAAACCATTAAGCTTGGGGGTATCACTTGCTTTGTCCAGCTTTGC
GACAATGGCAGCTGATTCGCCATTTGCCATTGCGCCACAGCATTTATCCAGTACGGCAACTGTAAAAAAAACCACAGTAA
CGACACCGTATGCCAATATTTTAAAACAGCAACAATCGGTTACCAATACAACAGTACGTGGTATGCAGGGTGCTAAACCT
AATATTATGTTGTTGTTGGATGATTCTGGGAGTATGAGGGCAGAGGTTCCTGGTTCGTATGGTCAAACTCGGCAAGGAAT
ATTGCGAAATTCCTTGTCAAAAATAGTTGATAAATATGGTAGCCGAATTAACTGGGGTCTTGTTTCATTTAATGACTCAA
GTTCTAGATACAACCTTTCGTTAGGAACCAGTTATTTAACTGTTGCTAATGCAATTAGAAATTTCCCTGCTTCTGGTACT
ACGCCAACCATTACTAGCTATCTCAAGGCTGTTAATATGCTAAATGAAGGTATCAAATACCGTTGCCAAAAAAGTTATGT
AGTTCTATTATCTGATGGTGATTCGAACTGGCCTATGGGTTTTTGGGTAGATGCTGGTGTTAGAAGGAATACGGTAGGTG
ACTTTTTATATTATCCACATTGGGTTTATATTAACAGAGAACTTAATAAATATGAATTTTGGCGTAATCTCTTAACGAAA
GATGGTAAATTGATTAATAAAAGTTCACCCAAATATCGTAGCATGTGCAATGCAGCTTATCGACAATCCAATTCTAATGC
CAAAAACTGGTGTAGTGAATTACCACAAAATTTTATTTATTATCCATATAGTGGTGTTTCTGACCCTAGTGTGAATTTTG
GTCCCTCAGTAAGGAAGAGAGAGCTGGGGTACTGGTATACTGAATCTCCATTACGGAATTTAACTAGCGATTTGGATTTC
TTGGATAGTGATTCCGCATTCTATACACCTAGCAAACGCACTGCATATGGTTTATACGGAATGTATCCTATCTTTAATTA
TCAACCAATTTTAGCTCCTCAAACCCAATATGCAGGCGAGATGTTGTCATTTTTTAGCGAGCCAATCAACAAAATTGATT
TGCGCACAGTTGGCAATGATGATACAGGTAAATCATGGAATGACCCAGCTTTTCCAACACAAAACATTACAACGTTTACC
ATTGGGTTTGGTAGTGGTTTGACTAGTAGTGGTTTGGCCTTATTAAAAGGTGGTGCTAGCGACATTAAATTAAAAGATGG
AACAGTACAAAAAGCGTACTATCCAGCCAATGACCAAGCAGGATTACAAAAGGCATTTGATTCGATTTTTGACCAAATTG
AACGTGAAAATCCACCACCTAGCAATAGTAATCCTATTGATTTGCCTCCCACTACCAATACCTTGCCGATTCTTGCACCT
ATTGCAGGTACGCCTAATCGCCAGCAAAGTGTAATGGCAGAGACATCTTATGCCACCTCTCCTCCTAGTGCGACAGGTGC
TAGCAATAGTATTCCTAATATGGCGGCATCAGTGTTTGTCCCCGCAGGGCTAAAAAGTAGCGAATTGCGTTTTTATGCTT
TAAATAGCAAAGCACAAGCAGATACTTCGTCTTGGCATGTGGCTGACTTTTCGCAACGTAAAGCGTTGATTATGACAAGT
TCGGGTGTACGATGGGCAGATAATGTAGGCAATAGCTTGAATAATACATTCTTTAATTTGCCTGCTGATACGCAAACGCC
TCCACGTACTGATGAGTGGAATAAGGCATTATTGCCTTGGACAACTCGTGCTGTTACAGGCAGCGATGACAGCAAAATTA
AAGATATGAAGTATCCAACGCCTTATCGTATTCGTCCAAATCCACCTACTGATAGTCCACGCAGTACTGGTGTTGATGAA
TATAATATGGGCGATGTGTTGGATACGCCGATTGTTGCGACTGGTAAAAATGGCACTGCGCAAGGTAATCGACAAGAGTT
TTTGATTACGGCAGCCAATGATGGCATGGTGTACTTGTTCCAAAGTAATCCTGATGCCAATGCAAAAAATCCGTATTCAT
TAAAATTGAATTACTTACCTGCGACAATGCCGCGCCAAACATTGTTTGATACTCATGCTACTCGTTACAAAGATTTAGCA
GACGAGGATTATGCAAAAGATTCTGCGCGTCCACATTTATTTATGATTAATGGTGGTATTACGGTTCGTACGACTTCAGC
TAAATCTAATAGTCAAAACACGTTTATGGTGGGGGCTTTGGGACAAGGCGGACGTGGTATGTACGCACTCAATATTGGCG
GTAAGGCTTTAGATACTGGTAAACCTATCGGCTTAAATGCCGAAAGTAATACATGGAATACGTCCGTGCCTTTATATGAA
TCTAATCCAAAGGATAATAGTGCAGATTTTTCTTCATTAGGCTATACCGTTGGTTATCCACAAATTGGTCGCGTTGCTAT
TAGACATAACGCCGATGGCTCTTTATCAGACAAAAACTTAACCACAGGTATTTATTATGCTTCTTTCTTATCAAGTGGCT
TCTCTTATCCTACTAATGGTAGTCAGGAAACCGCTCTGTATATTCAAGATGCGTTGGGTGTAGATGTTGGTGGTGGTGCT
GCTACTGCTGCTACAGCAAACCGTAATCCAAAAGGTAAATTATTGGCAAAAGTGGTGGTTCCAGACGGCGTGGGTGGTTT
GGCATCGCCAACCTTACTTGATGTGAATCGAGATGGTGTGTATGACTTTGCCTTTGCAGGAGATTATGGTGGTAATATGT
ATCGCTTTGATTTGCGCAAAATTGATTTGTCTAATGGTAATGTTGCCTATGGTGCTGGTGCTGTGAAGAAAATTTATTCA
GGTTCTGCAAACCAACCAATTACAAGTGCTCCTGCTATTTCCATGCAAAAAAATGGACGATATGTTGTGGCATTTGGTAC
GGGTAGTGATATGTATGCTAGCGATTATAATTCGACCGTTGAGCAGGCTGTATATGGTATTTATCAACGATTCAATACAA
GTAATTTGGATCCTGTTAATGTTATTAATGATTCTGCTGCTGAAGCTGAAAAACCTTTAACAGCTAGTGATTTGTTGGAG
CAAACATTAACCACAACGACTGCAACAATGGCAGAGGGTACTCCTGACGCAAAAGAAGTAATTGTTCGTAAAGGCTCAAC
CAATGCAATTGAAAGTTCTAGTGATAAATCAAAATTAGCCTATGGTGGTTGGAAAGTGCGATTAGGTAACACGACATCTG
TACAAGATGGTGAACGTGTTGTGGTTACCCCAACTGTGATTTTTGGTACTTTGATTCTTTCAACTCGTATCTATAAAGAA
AAACCAGTTGCTACGGCCAATAATAATGATTGGTCAAAAGATAATTGGCAAGCTGATGGTTGGGTTAAAGTAAGCGAAAA
GTTAGACCAAAATCACCAAGAACTTCCATGGTCTGTTTGGAAAGAAATCAGCACGAACTCAACATCAACCACTAGTTCAG
GTGATGTTTGTACGGGTACTGTAAATACCAGCACTAAAATGCAAGAGCGCGAAAAAAGGGTTACTTATGAAACCATTACC
ACTTATGAGAAAGTACAGCCAAGTGAGAATCAATCATCTGGATGGATTTTGGTATTGAATGCGACCAATGGTGGTGCGGT
AAAAGTATCATCAGGTACGGGTGTTGACTTTTTAGGTAAATATGCCAGCAACTTTAATCCTAATGATACTGCATATAGTA
ATACCTATTTTACCGTAGGTTTGTACTCTTCCAGCGGTTTACCAAATCATACATTATTAACCAATGATTCTAATGTTCAT
AGTTTAGGTTCTGCGATGACTCGTGATGGGGAAGCGAGAACCAATGGTGAAGATGGTGATTTAACTCCTGGTGGTGAAAA
GGCAAAGGATGATTGCTTTAATAATACTGACGGTAACTATGTATTGACAGGTAATACGGCAACAGGAATTGGCGCAACAT
TTAGCGTTGTTGGTCGTCGTTGTGGTAAAAAAATACAAATCCGTCGTTTATCTTGGCGTGAGATTTTCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC/pilC1 Kingella kingae strain KK03

83.053

100

0.838


Multiple sequence alignment