Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   GCU53_RS07600 Genome accession   NZ_CP045302
Coordinates   1612868..1614085 (+) Length   405 a.a.
NCBI ID   WP_152387086.1    Uniprot ID   -
Organism   Azotobacter salinestris strain KACC 13899     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1607868..1619085
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GCU53_RS07580 nadC 1608254..1609102 (+) 849 WP_152387082.1 carboxylating nicotinate-nucleotide diphosphorylase -
  GCU53_RS07585 - 1609230..1609931 (-) 702 WP_152387083.1 hypothetical protein -
  GCU53_RS07590 - 1610076..1610615 (-) 540 WP_152387084.1 pilin -
  GCU53_RS07595 pilB 1611165..1612865 (+) 1701 WP_152387085.1 type IV-A pilus assembly ATPase PilB Machinery gene
  GCU53_RS07600 pilC 1612868..1614085 (+) 1218 WP_152387086.1 type II secretion system F family protein Machinery gene
  GCU53_RS07605 pilD 1614088..1614957 (+) 870 WP_152387087.1 A24 family peptidase Machinery gene
  GCU53_RS07610 coaE 1615105..1615713 (+) 609 WP_152387088.1 dephospho-CoA kinase -
  GCU53_RS07615 yacG 1615710..1615910 (+) 201 WP_152387089.1 DNA gyrase inhibitor YacG -
  GCU53_RS07620 - 1616005..1616691 (-) 687 WP_152387090.1 energy-coupling factor ABC transporter permease -
  GCU53_RS07625 - 1616688..1617155 (-) 468 WP_152387091.1 FAD/FMN-containing dehydrogenase -
  GCU53_RS07630 - 1617152..1617577 (-) 426 WP_152387092.1 GNAT family N-acetyltransferase -
  GCU53_RS07635 - 1617892..1618065 (+) 174 WP_152387093.1 DUF3094 family protein -

Sequence


Protein


Download         Length: 405 a.a.        Molecular weight: 44357.16 Da        Isoelectric Point: 9.7209

>NTDB_id=393268 GCU53_RS07600 WP_152387086.1 1612868..1614085(+) (pilC) [Azotobacter salinestris strain KACC 13899]
MAEKALKTSLFIWEGTDRRGTKVKGELSGQNPALVKAQLRKQGINPTKVRKKAASLFGAGKKIKPMDIALFTRQMATMMK
AGVPLLQAFDIISEGFDNPNMRKLVDEVKQEVAAGNSFATALRKKPLYFDDLYCNLVESGEQSGALENLLDRVATYKEKT
EALKAKIKKAMTYPIAVIVVAVIVSAILLIKVVPQFESVFANFGAELPAFTRMVISLSEIMQEYWFYALLGIFVVAFTLK
QAHQRSEKFRNWTDRTLLKLPIVGEILYKSAVARFARTLSTTFAAGVPLVDALDSVAGATGNVVFRSATEKVKADVTTGM
QLNFSMRTTGTFPTMAIQMTAIGEESGALDEMLDKVASFYEAEVDNMVDSLTSLMEPMIMAVLGVLVGGLIIAMYLPIFQ
LGAVV

Nucleotide


Download         Length: 1218 bp        

>NTDB_id=393268 GCU53_RS07600 WP_152387086.1 1612868..1614085(+) (pilC) [Azotobacter salinestris strain KACC 13899]
ATGGCGGAAAAAGCGTTGAAAACCAGTCTCTTCATCTGGGAAGGCACCGACCGGCGCGGCACCAAGGTCAAGGGCGAGTT
GAGCGGGCAGAATCCAGCGCTGGTCAAGGCACAACTGCGCAAGCAGGGCATCAACCCGACCAAGGTACGCAAGAAAGCCG
CTTCGCTGTTCGGCGCCGGCAAGAAGATCAAGCCGATGGACATCGCCCTGTTCACTAGGCAGATGGCCACCATGATGAAG
GCAGGCGTGCCGCTGCTGCAGGCCTTCGACATCATCTCCGAGGGCTTCGACAACCCGAACATGCGCAAGCTGGTGGACGA
GGTGAAGCAGGAGGTGGCGGCTGGCAACAGCTTTGCCACTGCATTGCGCAAAAAGCCACTCTATTTCGATGACCTCTACT
GCAACCTGGTGGAGTCCGGCGAACAGTCCGGTGCCCTGGAGAACCTGCTGGACCGGGTCGCCACCTACAAGGAGAAGACC
GAGGCGCTGAAGGCCAAGATCAAGAAGGCGATGACCTATCCCATCGCGGTGATCGTGGTGGCGGTCATCGTCTCGGCAAT
TTTGCTGATCAAGGTAGTGCCGCAATTCGAATCGGTATTCGCCAATTTCGGCGCCGAACTGCCAGCTTTTACTCGAATGG
TCATCAGCCTGTCCGAGATCATGCAGGAATACTGGTTCTATGCCCTGCTGGGAATATTCGTCGTTGCCTTCACTCTGAAG
CAGGCCCATCAGCGCTCGGAAAAATTTCGCAACTGGACGGACCGCACCTTGCTGAAGCTGCCGATCGTCGGCGAGATCCT
CTACAAGTCGGCGGTGGCCCGCTTTGCCCGCACCCTGTCCACCACCTTCGCCGCCGGCGTGCCGCTGGTCGATGCGCTCG
ATTCAGTGGCTGGCGCTACCGGAAACGTGGTATTTCGCAGCGCTACCGAAAAGGTCAAGGCCGACGTCACCACCGGCATG
CAGCTGAACTTTTCCATGCGCACCACCGGTACCTTCCCCACCATGGCGATCCAGATGACCGCCATCGGCGAGGAATCCGG
TGCGCTGGACGAGATGCTCGACAAGGTGGCGAGCTTCTATGAGGCCGAGGTGGACAACATGGTGGACAGCCTGACCAGCC
TGATGGAACCGATGATCATGGCGGTGCTCGGCGTGCTGGTCGGTGGCCTGATCATCGCCATGTACCTGCCGATCTTCCAG
CTGGGTGCCGTGGTCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Pseudomonas stutzeri DSM 10701

82.716

100

0.827

  pilC Acinetobacter baylyi ADP1

60.837

100

0.61

  pilC Acinetobacter baumannii D1279779

60

100

0.6

  pilC Legionella pneumophila strain ERS1305867

54.798

97.778

0.536

  pilG Neisseria gonorrhoeae MS11

44.776

99.259

0.444

  pilG Neisseria meningitidis 44/76-A

44.776

99.259

0.444

  pilC Vibrio cholerae strain A1552

42.716

100

0.427

  pilC Vibrio campbellii strain DS40M4

41.919

97.778

0.41

  pilC Thermus thermophilus HB27

37.717

99.506

0.375


Multiple sequence alignment