Detailed information    

in silico   Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   R5H22_RS04680 Genome accession   NZ_CP137764
Coordinates   983461..984678 (-) Length   405 a.a.
NCBI ID   WP_012699853.1    Uniprot ID   M9YCM4
Organism   Azotobacter sp. NL3     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 978461..989678
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  R5H22_RS04645 (R5H22_04645) - 979466..979639 (-) 174 WP_012699847.1 DUF3094 domain-containing protein -
  R5H22_RS04650 (R5H22_04650) - 979944..980369 (+) 426 WP_012699848.1 GNAT family N-acetyltransferase -
  R5H22_RS04655 (R5H22_04655) - 980366..980833 (+) 468 WP_012699849.1 hypothetical protein -
  R5H22_RS04660 (R5H22_04660) - 980830..981516 (+) 687 WP_012699850.1 energy-coupling factor ABC transporter permease -
  R5H22_RS04665 (R5H22_04665) yacG 981622..981822 (-) 201 WP_041806982.1 DNA gyrase inhibitor YacG -
  R5H22_RS04670 (R5H22_04670) coaE 981819..982427 (-) 609 WP_012699851.1 dephospho-CoA kinase -
  R5H22_RS04675 (R5H22_04675) pilD 982589..983458 (-) 870 WP_012699852.1 A24 family peptidase Machinery gene
  R5H22_RS04680 (R5H22_04680) pilC 983461..984678 (-) 1218 WP_012699853.1 type II secretion system F family protein Machinery gene
  R5H22_RS04685 (R5H22_04685) pilB 984903..986603 (-) 1701 WP_012699854.1 type IV-A pilus assembly ATPase PilB Machinery gene
  R5H22_RS04690 (R5H22_04690) - 987132..987629 (+) 498 WP_012699856.1 pilin -
  R5H22_RS04695 (R5H22_04695) nadC 988287..989135 (-) 849 WP_012699858.1 carboxylating nicotinate-nucleotide diphosphorylase -
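
The sizes in the table follow from the coordinates: a feature written as start..end spans end - start + 1 bases, and a protein-coding gene of that size encodes size / 3 - 1 residues once the stop codon is excluded. Below is a minimal Python sketch of that arithmetic. The helper names (`parse_coordinates`, `gene_size_bp`, `protein_length_aa`) are illustrative and not part of any database tooling, and the 5000 bp flank is only inferred from the listed Location, not a documented setting.

```python
import re


def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def gene_size_bp(start, end):
    """Coordinates are 1-based and inclusive, so both ends count."""
    return end - start + 1


def protein_length_aa(size_bp):
    """Number of residues encoded by a CDS, excluding the stop codon."""
    if size_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return size_bp // 3 - 1


start, end, strand = parse_coordinates("983461..984678 (-)")  # pilC
size = gene_size_bp(start, end)
print(size, protein_length_aa(size), strand)

# Assumption: the context window looks like the gene plus 5000 bp on each side.
FLANK = 5000
print(f"{start - FLANK}..{end + FLANK}")
```

For pilC this gives 1218 bp and 405 residues, matching the Overview, and the flanked window matches the Location line above the table.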

Sequence


Protein


Length: 405 a.a.        Molecular weight: 44210.02 Da        Isoelectric Point: 10.0225

>NTDB_id=900598 R5H22_RS04680 WP_012699853.1 983461..984678(-) (pilC) [Azotobacter sp. NL3]
MAEKALKISLFTWEGIDRRGARIKGELNGRNPALVKAQLRKQGINPTRVRKKAGSLFGGDQKIKPLDIALFTRQMATMMK
AGVPLLQAFDIISEGFDKPAMRKLVDEVKQEVAAGNGFAASLRKKPRYFDNLYCNLVESGEQSGALESLLERIATYKEKT
EALKARIKKAMTYPIAVILVAIAVSAILLIKVVPQFQSVFANFGAELPALTLLVVNLSEVLQEYWFYALLGIPIAVLILK
QAHRRSEAFRNWTDRSLLKLPIVGQILYKSAVARFARTLSTTFAAGVPLVDALDSVAGATGNVVFRNATEKVKADVTTGM
QLNFSMRTTGTFPSMAIQMTAIGEESGTLDEMLDKVAGFYEAEVDNMVDSLTGLLEPMIMAVLGVLVGGLIVAMYLPIFQ
LGSVV

Nucleotide


Length: 1218 bp

>NTDB_id=900598 R5H22_RS04680 WP_012699853.1 983461..984678(-) (pilC) [Azotobacter sp. NL3]
ATGGCGGAAAAAGCGTTGAAAATCAGTCTGTTCACCTGGGAGGGCATCGACCGGCGCGGTGCCAGGATCAAGGGAGAACT
GAACGGGAGAAATCCGGCGCTGGTCAAGGCGCAACTGCGCAAGCAGGGCATCAATCCGACCAGGGTACGCAAGAAGGCCG
GCTCGCTGTTCGGCGGCGACCAGAAGATCAAACCGCTGGATATCGCCCTGTTCACCCGGCAGATGGCCACCATGATGAAG
GCAGGCGTGCCGCTGCTGCAGGCCTTCGACATCATCTCCGAGGGTTTCGACAAACCGGCCATGCGCAAGCTGGTGGACGA
AGTGAAACAGGAGGTGGCGGCGGGCAACGGTTTCGCCGCTTCACTGCGCAAGAAGCCCCGCTATTTCGACAACCTCTACT
GCAACCTGGTGGAGTCCGGCGAGCAGTCCGGCGCCCTGGAAAGCCTGCTGGAGCGGATCGCCACCTACAAGGAGAAGACC
GAGGCGCTGAAGGCCAGGATCAAGAAGGCGATGACCTACCCCATCGCGGTGATCCTGGTGGCGATCGCCGTCTCGGCCAT
TCTGCTGATAAAGGTGGTACCGCAATTCCAGTCGGTGTTCGCCAACTTCGGTGCCGAACTGCCGGCGCTCACGCTGCTGG
TCGTCAACCTGTCGGAAGTTCTGCAGGAATACTGGTTCTACGCGCTGCTCGGAATACCCATCGCAGTTCTCATCCTGAAG
CAGGCCCACCGGCGCTCCGAGGCGTTTCGCAACTGGACGGATCGCAGCCTGCTGAAGCTGCCGATCGTCGGCCAAATCCT
CTACAAGTCGGCGGTGGCCCGCTTCGCCCGCACCCTGTCCACCACTTTCGCCGCCGGTGTGCCGCTGGTCGACGCACTCG
ACTCGGTGGCCGGCGCCACCGGCAACGTGGTGTTCCGCAACGCCACCGAGAAGGTCAAGGCCGACGTCACCACCGGCATG
CAACTGAACTTCTCCATGCGCACCACCGGCACCTTCCCCAGCATGGCGATCCAGATGACCGCCATCGGCGAGGAGTCCGG
CACGCTGGACGAGATGCTCGACAAGGTGGCGGGCTTCTACGAGGCCGAAGTGGACAACATGGTGGACAGCCTGACCGGTT
TGCTGGAGCCGATGATCATGGCGGTGCTCGGCGTGCTGGTCGGCGGCCTGATCGTCGCCATGTACCTGCCGATCTTCCAG
TTGGGCTCCGTGGTCTGA
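
The nucleotide record is given in coding orientation (it starts with ATG and ends with the TGA stop codon), so it can be translated directly and compared with the protein sequence. Below is a minimal Python sketch, assuming the standard codon assignments and checking only the first and last codons of the record. The `translate` helper and the codon table construction are illustrative, not the method used to produce this entry.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight codons and last six codons of the pilC record above.
n_terminus = translate("ATGGCGGAAAAAGCGTTGAAAATC")
c_terminus = translate("TTGGGCTCCGTGGTCTGA")
print(n_terminus, c_terminus)
```

The two fragments translate to MAEKALKI and LGSVV followed by a stop, which agree with the start and end of the protein sequence listed earlier.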


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB M9YCM4

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  pilC   Pseudomonas stutzeri DSM 10701   79.259   100   0.793
  pilC   Acinetobacter baylyi ADP1   59.852   100   0.6
  pilC   Acinetobacter baumannii D1279779   58.519   100   0.585
  pilC   Legionella pneumophila strain ERS1305867   52.525   97.778   0.514
  pilG   Neisseria meningitidis 44/76-A   43.719   98.272   0.43
  pilG   Neisseria gonorrhoeae MS11   43.719   98.272   0.43
  pilC   Vibrio cholerae strain A1552   42.716   100   0.427
  pilC   Vibrio campbellii strain DS40M4   41.96   98.272   0.412