Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   CCR98_RS16500 Genome accession   NZ_CP021768
Coordinates   3517984..3519243 (+) Length   419 a.a.
NCBI ID   WP_087923449.1    Uniprot ID   -
Organism   Stenotrophomonas sp. WZN-1     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 3516688..3532659 3517984..3519243 within 0


Gene organization within MGE regions


Location: 3516688..3532659
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CCR98_RS16490 (CCR98_16495) - 3516688..3517095 (-) 408 WP_087923447.1 pilin -
  CCR98_RS16495 (CCR98_16500) pilA/pilAI 3517212..3517628 (-) 417 WP_087923448.1 pilin Machinery gene
  CCR98_RS16500 (CCR98_16505) pilC 3517984..3519243 (+) 1260 WP_087923449.1 type II secretion system F family protein Machinery gene
  CCR98_RS16505 (CCR98_16510) - 3519251..3520114 (+) 864 WP_005418626.1 A24 family peptidase -
  CCR98_RS16510 (CCR98_16515) coaE 3520126..3520737 (+) 612 WP_087923450.1 dephospho-CoA kinase -
  CCR98_RS21120 - 3520811..3521089 (-) 279 WP_157721541.1 hypothetical protein -
  CCR98_RS21125 - 3521554..3522201 (-) 648 WP_157721542.1 hypothetical protein -
  CCR98_RS16525 (CCR98_16530) - 3523087..3523473 (-) 387 WP_087923453.1 hypothetical protein -
  CCR98_RS16530 (CCR98_16535) - 3523964..3525334 (-) 1371 WP_032957212.1 HAMP domain-containing sensor histidine kinase -
  CCR98_RS16535 (CCR98_16540) - 3525297..3525974 (-) 678 WP_005410804.1 response regulator transcription factor -
  CCR98_RS16540 (CCR98_16545) - 3526136..3526597 (-) 462 WP_087923454.1 hypothetical protein -
  CCR98_RS16545 (CCR98_16550) - 3526713..3527183 (-) 471 WP_087923455.1 hypothetical protein -
  CCR98_RS16550 (CCR98_16555) - 3527279..3527758 (-) 480 WP_087923456.1 hypothetical protein -
  CCR98_RS16555 (CCR98_16560) rimK 3528018..3528896 (-) 879 WP_005410807.1 30S ribosomal protein S6--L-glutamate ligase -
  CCR98_RS16560 (CCR98_16565) - 3529064..3529429 (+) 366 WP_232463040.1 hypothetical protein -
  CCR98_RS16565 (CCR98_16570) - 3529508..3529930 (+) 423 WP_087923458.1 thioesterase family protein -
  CCR98_RS16570 (CCR98_16575) - 3530099..3531133 (-) 1035 WP_232463041.1 hypothetical protein -
  CCR98_RS16575 (CCR98_16580) - 3531375..3532253 (+) 879 WP_232463042.1 PoNe immunity protein domain-containing protein -
  CCR98_RS16580 (CCR98_16585) - 3532255..3532659 (-) 405 WP_087923461.1 DUF805 domain-containing protein -

Sequence


Protein


Download         Length: 419 a.a.        Molecular weight: 45662.01 Da        Isoelectric Point: 10.2308

>NTDB_id=233121 CCR98_RS16500 WP_087923449.1 3517984..3519243(+) (pilC) [Stenotrophomonas sp. WZN-1]
MSVSRSAIKKEPVARNTTDLQPFVWVGTDKRGVKMKGEQAAKNANLLRAELRRQGITPGTVKLKPKPLFGGSGSRISPKD
IAFFSRQMATMMKSGVPIVSSLEIIASGHKNPRMKKMVDGLRTDIEGGSSLYEAVSKHPVQFDELYRNLVKAGEGAGVLE
TVLDTVANYKENIEALKGKIKKAMFYPAMVLAVALLVSGILLVWVVPQFEDVFKGFGADLPAFTQMIVNLSRFMVSWWWL
ILLVIIGSIVGFIAAYKRSPKMQHSMDRLVLKVPVIGQIMHNSSIARFARTTAVTFKAGVPLVEALGIVAGATGNTVYEH
AVLRMRDDVSVGYPVNMAMKQTALFPHMVIQMTGIGEEAGALDAMLFKVAEYYEQEVNNAVDALSSLLEPIIMVIIGTIV
GGMVIGMYLPIFKLASVVG

Nucleotide


Download         Length: 1260 bp        

>NTDB_id=233121 CCR98_RS16500 WP_087923449.1 3517984..3519243(+) (pilC) [Stenotrophomonas sp. WZN-1]
ATGTCTGTCAGTCGCAGTGCGATCAAGAAGGAGCCCGTGGCTCGCAACACCACGGACTTGCAGCCGTTTGTCTGGGTGGG
GACCGACAAGCGGGGTGTGAAGATGAAAGGCGAGCAGGCCGCGAAAAATGCGAACCTGCTGCGGGCTGAGTTGCGCCGCC
AAGGCATTACGCCAGGCACGGTCAAACTCAAGCCCAAGCCGTTGTTCGGCGGCTCCGGGAGCCGAATCTCGCCGAAGGAC
ATCGCGTTCTTCAGCCGGCAGATGGCCACCATGATGAAGTCCGGTGTGCCTATCGTCAGTTCGCTGGAAATCATCGCCAG
CGGGCACAAGAACCCGCGCATGAAGAAGATGGTGGATGGCCTGCGCACTGATATCGAAGGCGGATCATCGCTCTACGAAG
CGGTCAGCAAGCATCCGGTTCAATTCGACGAGCTCTACCGCAATCTGGTCAAGGCCGGCGAAGGCGCAGGTGTTCTGGAG
ACGGTGCTGGACACCGTCGCCAACTACAAAGAGAACATTGAAGCGCTGAAGGGCAAGATCAAGAAGGCGATGTTCTATCC
CGCCATGGTACTGGCGGTTGCCCTGTTGGTCAGCGGAATCCTGCTGGTCTGGGTGGTGCCGCAGTTCGAGGACGTATTCA
AAGGCTTCGGCGCGGACCTACCCGCTTTCACCCAGATGATCGTGAATCTGTCCCGCTTCATGGTCTCGTGGTGGTGGCTG
ATCCTGCTGGTCATCATCGGATCGATCGTCGGCTTCATCGCGGCCTACAAACGCTCGCCCAAGATGCAGCACAGCATGGA
CCGACTGGTGCTCAAGGTGCCTGTCATCGGGCAGATCATGCACAACAGCTCCATCGCCCGCTTTGCCCGTACCACCGCCG
TCACCTTCAAGGCCGGCGTGCCTTTGGTGGAGGCTTTGGGGATCGTGGCCGGCGCCACCGGCAATACGGTTTACGAACAT
GCCGTATTGCGCATGCGCGATGACGTGTCGGTCGGTTACCCAGTCAACATGGCCATGAAGCAGACCGCCCTGTTCCCGCA
CATGGTCATCCAGATGACCGGCATCGGTGAAGAGGCCGGTGCCCTGGACGCCATGCTGTTCAAGGTGGCCGAGTACTATG
AGCAGGAGGTCAACAATGCCGTTGACGCCCTCAGCAGCCTGCTGGAACCGATCATCATGGTGATCATTGGTACCATCGTC
GGCGGCATGGTCATCGGCATGTACCTGCCGATCTTCAAGCTCGCTTCCGTCGTCGGATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Acinetobacter baylyi ADP1

52.463

96.897

0.508

  pilC Pseudomonas stutzeri DSM 10701

53.283

94.511

0.504

  pilC Legionella pneumophila strain ERS1305867

51.98

96.42

0.501

  pilC Acinetobacter baumannii D1279779

51.385

94.749

0.487

  pilG Neisseria gonorrhoeae MS11

43.216

94.988

0.411

  pilG Neisseria meningitidis 44/76-A

42.714

94.988

0.406

  pilC Vibrio cholerae strain A1552

40.887

96.897

0.396

  pilC Vibrio campbellii strain DS40M4

39.9

95.704

0.382


Multiple sequence alignment