Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   UNDKW_RS04140 Genome accession   NZ_AP018439
Coordinates   918292..919527 (-) Length   411 a.a.
NCBI ID   WP_162057697.1    Uniprot ID   A0A6N4SZJ1
Organism   Undibacterium sp. KW1     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 913292..924527
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  UNDKW_RS04110 (UNDKW_0827) - 913491..914054 (+) 564 WP_162057693.1 hypothetical protein -
  UNDKW_RS04115 (UNDKW_0828) - 914098..915654 (+) 1557 WP_162057694.1 sigma-54-dependent transcriptional regulator -
  UNDKW_RS04120 (UNDKW_0829) - 915685..915864 (-) 180 WP_162039882.1 DNA gyrase inhibitor YacG -
  UNDKW_RS04125 (UNDKW_0830) zapD 915867..916622 (-) 756 WP_162039883.1 cell division protein ZapD -
  UNDKW_RS04130 coaE 916797..917423 (-) 627 WP_162057695.1 dephospho-CoA kinase -
  UNDKW_RS04135 (UNDKW_0831) pilD 917432..918256 (-) 825 WP_370529082.1 A24 family peptidase Machinery gene
  UNDKW_RS04140 (UNDKW_0832) pilC 918292..919527 (-) 1236 WP_162057697.1 type II secretion system F family protein Machinery gene
  UNDKW_RS04145 (UNDKW_0833) pilF 919543..921270 (-) 1728 WP_162057698.1 type IV-A pilus assembly ATPase PilB Machinery gene
  UNDKW_RS04150 (UNDKW_0834) ispB 921595..922524 (-) 930 WP_162061753.1 octaprenyl diphosphate synthase -
  UNDKW_RS04155 (UNDKW_0835) rplU 922839..923150 (+) 312 WP_162020347.1 50S ribosomal protein L21 -
  UNDKW_RS04160 (UNDKW_0836) rpmA 923200..923469 (+) 270 WP_162039887.1 50S ribosomal protein L27 -

Sequence


Protein


Download         Length: 411 a.a.        Molecular weight: 44535.30 Da        Isoelectric Point: 9.7747

>NTDB_id=69603 UNDKW_RS04140 WP_162057697.1 918292..919527(-) (pilC) [Undibacterium sp. KW1]
MATNLAKSAKSGQPKEQLYAWEGKDKFGKVVRGETRAGGEAIVNATLRRQGILVTKLKKKNYTSGKAITDKDITLFTRQL
ATMMKAGVPLLQSFDIVSKGHSNPSVSKLLQDIRGDVETGTSLNAAFRKFPLYFDPLFCNLVGAGEQAGILEDLLTRLAI
YKEKTMAIKAKIKSALTYPIAILGIAFIVTAVIMIWVVPAFKQAFSSFGAELPAPTLIVMNISDNFVKYWYIIFGGLFGS
IYFFFQAWRRSLKVQQFMDRALLQAPIFGDVIKKATIARWTRTLATMFAAGVPLVESLDSVGGAAGNAVYLDATIKIQTE
VSTGTSLTVAMQNANVFPSMVTQMVAIGEESGALDQMLGKVADFYEDEVDEAVAALSSLMEPIIMVILGVVIGGLVVAMY
LPIFKMGSVAG

Nucleotide


Download         Length: 1236 bp        

>NTDB_id=69603 UNDKW_RS04140 WP_162057697.1 918292..919527(-) (pilC) [Undibacterium sp. KW1]
ATGGCAACCAATCTGGCAAAATCAGCAAAATCAGGGCAACCCAAAGAGCAGCTATATGCCTGGGAAGGCAAGGACAAGTT
TGGCAAGGTCGTCAGGGGCGAAACGCGCGCTGGTGGTGAAGCCATCGTCAATGCCACCCTGCGCAGACAAGGTATCCTGG
TTACCAAGTTAAAAAAGAAGAATTACACCTCCGGTAAGGCGATTACTGATAAAGACATCACTCTGTTTACCCGTCAACTG
GCAACGATGATGAAGGCAGGCGTACCGCTACTGCAGTCTTTTGATATTGTATCCAAAGGCCACAGCAACCCTTCGGTCTC
GAAATTACTGCAGGACATCCGTGGTGACGTTGAAACCGGCACCAGCCTGAACGCAGCTTTCCGCAAGTTCCCTTTATATT
TTGACCCGCTGTTCTGCAATCTCGTGGGTGCAGGTGAGCAAGCCGGTATTCTGGAAGATTTGCTGACCCGTCTGGCCATC
TACAAAGAAAAAACCATGGCCATCAAGGCCAAGATCAAGTCAGCGCTGACTTACCCTATCGCCATTCTGGGCATCGCTTT
CATCGTAACCGCGGTTATCATGATCTGGGTGGTGCCAGCGTTTAAGCAGGCTTTCAGCAGCTTTGGTGCAGAATTGCCAG
CACCTACCCTGATCGTCATGAACATCTCGGACAATTTCGTCAAATACTGGTACATCATCTTTGGCGGACTGTTTGGCAGT
ATCTACTTCTTTTTCCAGGCATGGCGTCGCTCCCTGAAGGTGCAACAGTTCATGGACAGAGCGCTGTTACAGGCACCGAT
TTTTGGTGACGTGATAAAAAAAGCAACCATAGCACGCTGGACGCGCACGCTGGCCACCATGTTTGCCGCTGGTGTGCCTT
TGGTTGAATCCCTGGATTCCGTTGGTGGCGCTGCCGGTAATGCCGTCTATCTTGACGCAACCATCAAAATTCAAACCGAA
GTCAGCACTGGTACCAGCCTGACAGTTGCCATGCAGAACGCCAATGTATTTCCTTCCATGGTCACGCAAATGGTGGCAAT
TGGTGAAGAATCCGGTGCGCTTGATCAAATGCTGGGCAAAGTCGCAGACTTTTATGAAGATGAAGTCGATGAAGCCGTTG
CTGCCCTGTCGAGCCTGATGGAACCTATCATCATGGTGATTCTGGGTGTGGTCATTGGTGGCCTGGTGGTTGCCATGTAC
TTGCCTATCTTCAAAATGGGTTCGGTCGCAGGATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6N4SZJ1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Pseudomonas stutzeri DSM 10701

54.271

96.837

0.526

  pilC Legionella pneumophila strain ERS1305867

54.315

95.864

0.521

  pilG Neisseria gonorrhoeae MS11

52.778

96.35

0.509

  pilG Neisseria meningitidis 44/76-A

52.525

96.35

0.506

  pilC Acinetobacter baumannii D1279779

52.273

96.35

0.504

  pilC Acinetobacter baylyi ADP1

50.882

96.594

0.491

  pilC Vibrio campbellii strain DS40M4

40.295

99.027

0.399

  pilC Vibrio cholerae strain A1552

40.955

96.837

0.397


Multiple sequence alignment