Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   THITH_RS13940 Genome accession   NZ_CP007029
Coordinates   3113295..3114503 (+) Length   402 a.a.
NCBI ID   WP_006747206.1    Uniprot ID   W0DLE6
Organism   Thioalkalivibrio paradoxus ARh     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3108295..3119503
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  THITH_RS13900 (THITH_14350) nusG 3108693..3109226 (-) 534 WP_006747210.1 transcription termination/antitermination protein NusG -
  THITH_RS13905 (THITH_14355) secE 3109242..3109610 (-) 369 WP_006747209.1 preprotein translocase subunit SecE -
  THITH_RS13915 (THITH_14365) tuf 3109801..3110991 (-) 1191 WP_006747208.1 elongation factor Tu -
  THITH_RS13935 (THITH_14385) pilB 3111575..3113287 (+) 1713 WP_006747207.1 type IV-A pilus assembly ATPase PilB Machinery gene
  THITH_RS13940 (THITH_14390) pilC 3113295..3114503 (+) 1209 WP_006747206.1 type II secretion system F family protein Machinery gene
  THITH_RS13945 (THITH_14395) pilD 3114565..3115416 (+) 852 WP_025367580.1 A24 family peptidase Machinery gene
  THITH_RS13950 (THITH_14400) coaE 3115413..3116024 (+) 612 WP_006747204.1 dephospho-CoA kinase -
  THITH_RS13955 (THITH_14405) zapD 3116103..3116915 (+) 813 WP_006747203.1 cell division protein ZapD -
  THITH_RS13960 (THITH_14410) - 3116960..3117895 (-) 936 WP_006747202.1 Nudix family hydrolase -
  THITH_RS13965 (THITH_14415) argJ 3117909..3119120 (-) 1212 WP_006747201.1 bifunctional glutamate N-acetyltransferase/amino-acid acetyltransferase ArgJ -

Sequence


Protein


Download         Length: 402 a.a.        Molecular weight: 44089.91 Da        Isoelectric Point: 9.6529

>NTDB_id=116065 THITH_RS13940 WP_006747206.1 3113295..3114503(+) (pilC) [Thioalkalivibrio paradoxus ARh]
MAETETIFIWEGLNKKGTRVKGETPADSEMLARAELRRNGINVLKIRKKPKSLFSTKKRIKPVDIAYFLRQMTTMLSSGV
PLVQAFDIVGRGHENPTMSALIMDLKSSVEGGETFAAALAKHPRHFDDLVTNLVEAGEQSGTLETLLDKVATYKEKTESL
KAKIKKAMFYPAAVIVVAIVVTAILLIFVVPQFEALFVGFGADLPAFTRMVVNLSEFVQAWWWAILAAMVAIGFVFVQLR
RRSPRFSRFVDLAVLRIPAIGPILRKAAVARFARTLSTMFAAGVPLVEALRSVAGATGNALYAEATERMREETAAGAQLQ
WSMRNTNVFPNMVVQMVAIGEESGSLDSMLAKVADFYEEEVDNAVDSLSSLLEPLIMVVLGVLIGGLVIAMYLPIFMLGQ
VI

Nucleotide


Download         Length: 1209 bp        

>NTDB_id=116065 THITH_RS13940 WP_006747206.1 3113295..3114503(+) (pilC) [Thioalkalivibrio paradoxus ARh]
ATGGCAGAGACCGAGACAATCTTTATCTGGGAGGGCCTGAACAAGAAAGGGACTCGCGTGAAAGGCGAGACCCCGGCCGA
CAGCGAAATGCTCGCGCGAGCGGAGCTGCGCCGTAACGGCATCAACGTTCTGAAGATCCGCAAGAAGCCGAAGTCGCTGT
TTTCGACCAAGAAGCGGATCAAGCCGGTCGACATCGCCTATTTCCTGCGTCAGATGACGACGATGCTCAGCTCCGGCGTA
CCGCTGGTGCAGGCGTTCGACATCGTCGGGCGCGGGCACGAGAACCCGACGATGTCGGCGCTGATCATGGACCTGAAGTC
CTCGGTCGAGGGGGGCGAGACCTTCGCAGCGGCGCTGGCCAAGCATCCGCGACACTTCGACGATCTGGTGACCAACCTGG
TCGAGGCCGGTGAACAGTCGGGCACGCTGGAAACCCTGCTCGACAAGGTCGCGACCTACAAGGAGAAGACCGAAAGCCTC
AAGGCCAAGATCAAGAAGGCGATGTTCTATCCCGCCGCGGTGATCGTGGTTGCGATCGTCGTGACCGCAATCCTGCTGAT
CTTCGTGGTGCCTCAGTTCGAGGCGCTGTTCGTCGGCTTCGGCGCGGACCTGCCGGCGTTCACCCGCATGGTCGTGAACC
TCTCGGAATTCGTGCAGGCATGGTGGTGGGCCATACTCGCGGCAATGGTCGCGATTGGCTTTGTCTTCGTTCAGCTGCGG
CGGCGATCGCCCCGGTTCAGCCGCTTCGTCGACCTCGCGGTGCTGCGGATACCCGCGATCGGGCCGATCCTGCGCAAGGC
GGCGGTGGCGCGGTTCGCCCGGACGCTGTCGACGATGTTCGCGGCGGGCGTGCCGCTGGTCGAGGCGCTGCGCTCGGTGG
CCGGAGCGACCGGCAACGCCCTCTACGCCGAGGCCACCGAGCGGATGCGCGAGGAGACCGCGGCCGGGGCCCAGCTGCAA
TGGTCGATGCGCAACACCAACGTCTTCCCGAACATGGTCGTGCAGATGGTCGCGATCGGCGAAGAATCCGGCTCGCTCGA
CTCGATGCTCGCGAAGGTCGCCGATTTCTACGAGGAGGAGGTCGACAACGCAGTGGACAGCCTGTCCAGCCTGCTCGAGC
CGCTGATCATGGTTGTGCTCGGGGTGCTGATCGGCGGCCTGGTCATCGCGATGTACCTGCCGATCTTCATGCTGGGCCAG
GTGATCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB W0DLE6

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Pseudomonas stutzeri DSM 10701

60.05

99.005

0.595

  pilC Legionella pneumophila strain ERS1305867

58.586

98.507

0.577

  pilC Acinetobacter baylyi ADP1

56.03

99.005

0.555

  pilC Acinetobacter baumannii D1279779

56.171

98.756

0.555

  pilG Neisseria gonorrhoeae MS11

46.42

100

0.468

  pilG Neisseria meningitidis 44/76-A

46.269

100

0.463

  pilC Vibrio campbellii strain DS40M4

45.592

98.756

0.45

  pilC Vibrio cholerae strain A1552

44.01

100

0.448

  pilC Thermus thermophilus HB27

38.945

99.005

0.386


Multiple sequence alignment