Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   PN837_RS16895 Genome accession   NZ_CP171822
Coordinates   3997020..3998246 (+) Length   408 a.a.
NCBI ID   WP_395374934.1    Uniprot ID   -
Organism   Marinicella sp. W31     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3992020..4003246
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  PN837_RS16875 (PN837_016875) - 3992235..3992672 (-) 438 WP_395374930.1 DUF4426 domain-containing protein -
  PN837_RS16880 (PN837_016880) sucD 3992835..3993707 (-) 873 WP_395374931.1 succinate--CoA ligase subunit alpha -
  PN837_RS16885 (PN837_016885) sucC 3993718..3994878 (-) 1161 WP_395374932.1 ADP-forming succinate--CoA ligase subunit beta -
  PN837_RS16890 (PN837_016890) pilB 3995305..3997017 (+) 1713 WP_395374933.1 type IV-A pilus assembly ATPase PilB Machinery gene
  PN837_RS16895 (PN837_016895) pilC 3997020..3998246 (+) 1227 WP_395374934.1 type II secretion system F family protein Machinery gene
  PN837_RS16900 (PN837_016900) - 3998288..3999109 (+) 822 WP_395374935.1 prepilin peptidase -
  PN837_RS16905 (PN837_016905) coaE 3999140..3999748 (+) 609 WP_395374936.1 dephospho-CoA kinase -
  PN837_RS16915 (PN837_016915) - 4000114..4001172 (+) 1059 WP_395374937.1 pilus assembly protein PilM -
  PN837_RS16920 (PN837_016920) - 4001172..4001768 (+) 597 WP_395374938.1 PilN domain-containing protein -
  PN837_RS16925 (PN837_016925) - 4001765..4002418 (+) 654 WP_395374939.1 type 4a pilus biogenesis protein PilO -
  PN837_RS16930 (PN837_016930) - 4002415..4002945 (+) 531 WP_395374940.1 pilus assembly protein PilP -

Sequence


Protein


Download         Length: 408 a.a.        Molecular weight: 44479.52 Da        Isoelectric Point: 9.8140

>NTDB_id=1064432 PN837_RS16895 WP_395374934.1 3997020..3998246(+) (pilC) [Marinicella sp. W31]
MATKTQSLTEVTTFQWVGVDRVGKRMKGEQQAKSITLAKNELRRQGIQVKKISKKRKSAFGKGKKIKAQDIALFSRQLAT
MMEAGVPMVQAFEIIEGGQTNQNMAKLINAVKTDIQSGTALADSLGKHPLYFDELYCNLVAAGEKAGVLDELLDTIATYK
EKTEEIKGKIKKAMYYPAAVLFVAIGVTILLLVKVVPQFQDLFKGFGADLPGLTLMVVAASEYMQDHWLKVILVMGGLIY
GFIYFKKRSLKFAHALDRMVLKMPIIGGILRNAAIARFSRTLSTTFAAGVPLVDGLDTVSGAVGNVVFRDAVLKVKDDVS
TGHQLQLAMGQTGVFPHMVVQMAAIGEESGNLDEMLAKVADYYEQEVNNAVDALSSLLEPLIMVLIGGLVGVMVVAMYLP
IFKMASVF

Nucleotide


Download         Length: 1227 bp        

>NTDB_id=1064432 PN837_RS16895 WP_395374934.1 3997020..3998246(+) (pilC) [Marinicella sp. W31]
ATGGCTACAAAAACACAAAGCCTCACAGAGGTAACCACATTTCAATGGGTAGGCGTTGACCGAGTCGGTAAACGCATGAA
AGGCGAACAACAAGCCAAAAGCATCACCCTGGCCAAAAATGAATTGCGTAGGCAAGGCATCCAGGTCAAGAAGATTTCCA
AAAAGCGAAAATCTGCTTTTGGCAAGGGTAAGAAAATCAAAGCGCAAGATATTGCGCTGTTTTCAAGGCAGTTAGCAACC
ATGATGGAAGCTGGTGTTCCTATGGTGCAGGCATTTGAGATTATTGAAGGTGGTCAAACAAATCAGAACATGGCCAAATT
AATCAATGCTGTGAAAACGGATATTCAATCAGGTACAGCTTTGGCAGACTCCTTGGGCAAGCACCCGCTCTATTTTGATG
AATTGTATTGTAATCTGGTGGCTGCCGGTGAGAAAGCAGGTGTTTTGGATGAGTTGCTGGATACCATTGCCACCTACAAA
GAGAAAACCGAAGAAATTAAAGGCAAAATTAAAAAAGCCATGTATTACCCAGCGGCAGTATTGTTTGTGGCCATTGGTGT
AACAATATTGTTGTTGGTGAAAGTAGTGCCTCAATTCCAGGACCTGTTCAAGGGGTTCGGTGCTGATTTGCCGGGTTTGA
CCTTAATGGTCGTAGCTGCTTCAGAATATATGCAGGATCACTGGTTGAAAGTGATATTAGTTATGGGGGGGCTCATTTAT
GGATTTATTTATTTCAAGAAAAGGTCGCTGAAATTTGCACATGCTCTGGATAGGATGGTGCTTAAAATGCCGATCATTGG
TGGTATTTTACGAAACGCTGCTATTGCAAGATTCTCCAGAACGCTTTCAACTACTTTTGCAGCAGGTGTGCCATTAGTAG
ATGGTTTGGATACAGTTTCAGGAGCAGTTGGTAATGTAGTCTTTCGTGATGCGGTCCTTAAGGTAAAAGACGATGTTTCT
ACTGGGCATCAGTTGCAACTGGCTATGGGGCAAACAGGTGTTTTTCCGCACATGGTTGTGCAAATGGCAGCCATTGGTGA
GGAATCCGGTAACTTGGATGAGATGTTGGCAAAAGTAGCTGATTATTATGAGCAAGAGGTCAACAATGCCGTTGATGCGC
TGAGTAGTTTGTTGGAGCCGTTAATTATGGTGCTTATTGGTGGTCTTGTTGGTGTCATGGTTGTCGCCATGTATCTGCCG
ATCTTTAAAATGGCGTCCGTATTCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Legionella pneumophila strain ERS1305867

58.228

96.814

0.564

  pilC Pseudomonas stutzeri DSM 10701

56.716

98.529

0.559

  pilC Acinetobacter baylyi ADP1

54.902

100

0.549

  pilC Acinetobacter baumannii D1279779

54.408

97.304

0.529

  pilC Vibrio cholerae strain A1552

41.481

99.265

0.412

  pilG Neisseria gonorrhoeae MS11

42.969

94.118

0.404

  pilG Neisseria meningitidis 44/76-A

42.448

94.118

0.4

  pilC Vibrio campbellii strain DS40M4

40

98.039

0.392

  pilC Thermus thermophilus HB27

38.25

98.039

0.375


Multiple sequence alignment