Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   ACMGRF_RS14110 Genome accession   NZ_CP179923
Coordinates   3059074..3060330 (+) Length   418 a.a.
NCBI ID   WP_262224781.1    Uniprot ID   -
Organism   Stenotrophomonas sp. EMP41     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 3050221..3070706 3059074..3060330 within 0


Gene organization within MGE regions


Location: 3050221..3070706
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACMGRF_RS14070 (ACMGRF_14070) - 3050221..3051435 (+) 1215 WP_414485327.1 glycosyltransferase -
  ACMGRF_RS14075 (ACMGRF_14075) - 3051479..3052675 (+) 1197 WP_262224786.1 polysaccharide biosynthesis protein -
  ACMGRF_RS14080 (ACMGRF_14080) - 3052638..3053753 (-) 1116 WP_032955899.1 glycosyltransferase -
  ACMGRF_RS14085 (ACMGRF_14085) - 3053750..3054562 (-) 813 WP_262224784.1 FkbM family methyltransferase -
  ACMGRF_RS14090 (ACMGRF_14090) - 3054732..3055823 (+) 1092 WP_414485328.1 glycosyltransferase family 4 protein -
  ACMGRF_RS14095 (ACMGRF_14095) - 3055857..3056273 (-) 417 WP_032957223.1 pilin -
  ACMGRF_RS14100 (ACMGRF_14100) - 3056277..3058166 (-) 1890 WP_262224782.1 hypothetical protein -
  ACMGRF_RS14105 (ACMGRF_14105) pilE 3058281..3058718 (-) 438 WP_032957219.1 pilin Machinery gene
  ACMGRF_RS14110 (ACMGRF_14110) pilC 3059074..3060330 (+) 1257 WP_262224781.1 type II secretion system F family protein Machinery gene
  ACMGRF_RS14115 (ACMGRF_14115) - 3060338..3061201 (+) 864 WP_049416956.1 prepilin peptidase -
  ACMGRF_RS14120 (ACMGRF_14120) coaE 3061213..3061824 (+) 612 WP_099487579.1 dephospho-CoA kinase -
  ACMGRF_RS14125 (ACMGRF_14125) - 3061998..3062384 (-) 387 WP_111177536.1 hypothetical protein -
  ACMGRF_RS14130 (ACMGRF_14130) - 3062926..3064296 (-) 1371 WP_032954439.1 sensor histidine kinase -
  ACMGRF_RS14135 (ACMGRF_14135) - 3064259..3064936 (-) 678 WP_005410804.1 response regulator transcription factor -
  ACMGRF_RS14140 (ACMGRF_14140) - 3065099..3065560 (-) 462 WP_049412681.1 hypothetical protein -
  ACMGRF_RS14145 (ACMGRF_14145) - 3065676..3066146 (-) 471 WP_049412683.1 hypothetical protein -
  ACMGRF_RS14150 (ACMGRF_14150) - 3066245..3066724 (-) 480 WP_049412685.1 hypothetical protein -
  ACMGRF_RS14155 (ACMGRF_14155) rimK 3066983..3067861 (-) 879 WP_005418651.1 30S ribosomal protein S6--L-glutamate ligase -
  ACMGRF_RS14160 (ACMGRF_14160) - 3067959..3068201 (-) 243 WP_049412687.1 hypothetical protein -
  ACMGRF_RS14165 (ACMGRF_14165) - 3068525..3069748 (-) 1224 WP_049412688.1 FAD-dependent oxidoreductase -
  ACMGRF_RS14170 (ACMGRF_14170) - 3069879..3070706 (+) 828 WP_049412690.1 AraC family transcriptional regulator -

Sequence


Protein


Download         Length: 418 a.a.        Molecular weight: 45594.92 Da        Isoelectric Point: 10.1822

>NTDB_id=1093867 ACMGRF_RS14110 WP_262224781.1 3059074..3060330(+) (pilC) [Stenotrophomonas sp. EMP41]
MSVSRSAIKKEPVARSTTDLQPFVWEGTDKRGVKMKGEQAAKNANLLRAELRRQGITPGTVKLKPKPLFGAAGSKISPKD
IAFFSRQMATMMKSGVPIVSALDIIASGHKNPRMKKMVDGIRTDIEGGSSLHEAISKYPVQFDELYRNLVRAGEGAGVLE
TVLDTVANYKENIEALKGKIKKALFYPAMVIAVALLVSSILLIWVVPQFEDVFRGFGADLPAFTQMIVAASRFMVSWWWL
IAIIVIGSIVGFTMAYKRSPKMQHGMDRLILKVPVIGQIMHNSSIARFARTTAVTFKAGVPLVEALGIVAGATGNTVYEH
AVLRMRDDVSVGYPVNMAMKQTALFPHMVIQMTGIGEEAGALDAMLFKVAEYYEQEVNNAVDALSSLLEPMIMVFIGIIV
GGLVIGMYLPIFKLASVV

Nucleotide


Download         Length: 1257 bp        

>NTDB_id=1093867 ACMGRF_RS14110 WP_262224781.1 3059074..3060330(+) (pilC) [Stenotrophomonas sp. EMP41]
ATGTCTGTCAGTCGCAGTGCGATCAAGAAGGAGCCCGTGGCGCGCAGCACCACGGACCTGCAGCCGTTTGTCTGGGAGGG
GACGGACAAGCGGGGTGTGAAGATGAAAGGGGAGCAGGCCGCGAAAAACGCGAATCTGCTGCGGGCCGAGCTGCGCCGCC
AAGGCATCACCCCGGGCACGGTCAAGCTAAAGCCCAAGCCACTGTTCGGCGCAGCCGGCAGCAAGATTTCGCCGAAGGAC
ATTGCATTCTTCAGCCGCCAGATGGCGACGATGATGAAGTCCGGTGTGCCGATCGTCTCAGCGCTGGACATCATCGCCAG
CGGGCACAAGAACCCGCGCATGAAGAAGATGGTGGACGGCATCCGCACCGATATCGAGGGCGGATCGTCCCTTCATGAGG
CAATCAGCAAGTATCCGGTGCAGTTCGACGAGCTCTATCGCAACCTGGTGCGTGCAGGTGAAGGCGCCGGTGTATTGGAG
ACGGTGCTTGATACCGTGGCCAACTACAAAGAGAACATCGAAGCCCTGAAGGGCAAGATAAAAAAGGCTCTGTTTTACCC
TGCAATGGTCATTGCGGTAGCGCTCTTGGTCAGCTCGATCCTGCTGATCTGGGTGGTTCCGCAATTTGAGGACGTCTTCA
GGGGATTTGGCGCTGATCTGCCAGCATTTACTCAGATGATCGTAGCCGCATCCCGGTTCATGGTCTCTTGGTGGTGGCTG
ATCGCGATCATCGTCATCGGCAGCATCGTAGGTTTCACCATGGCCTACAAACGCTCGCCGAAGATGCAGCACGGGATGGA
TCGCTTGATCCTGAAAGTGCCAGTCATTGGCCAGATCATGCATAACAGCTCGATTGCCCGCTTTGCCCGCACGACCGCTG
TGACTTTCAAGGCTGGCGTACCACTGGTAGAAGCCTTGGGCATTGTTGCAGGCGCTACCGGCAATACGGTTTACGAGCAT
GCCGTATTGCGCATGCGCGATGACGTCTCGGTCGGTTACCCGGTCAACATGGCCATGAAGCAGACCGCCCTGTTCCCACA
CATGGTCATCCAGATGACGGGTATCGGTGAAGAGGCGGGTGCCCTGGACGCCATGCTGTTCAAGGTGGCCGAGTACTATG
AGCAGGAGGTCAACAATGCCGTTGATGCCTTGAGCAGCCTGCTCGAGCCGATGATCATGGTGTTCATTGGTATCATCGTC
GGCGGCCTGGTCATTGGCATGTACCTCCCCATCTTCAAATTGGCATCGGTGGTCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Legionella pneumophila strain ERS1305867

52.723

96.651

0.51

  pilC Pseudomonas stutzeri DSM 10701

53.788

94.737

0.51

  pilC Acinetobacter baylyi ADP1

51.724

97.129

0.502

  pilC Acinetobacter baumannii D1279779

51.889

94.976

0.493

  pilG Neisseria gonorrhoeae MS11

44.081

94.976

0.419

  pilG Neisseria meningitidis 44/76-A

43.577

94.976

0.414

  pilC Vibrio cholerae strain A1552

40.741

96.89

0.395

  pilC Vibrio campbellii strain DS40M4

39.75

95.694

0.38


Multiple sequence alignment