Detailed information    

in silico (bioinformatically predicted)

Overview


Name: pilC
Type: Machinery gene
Locus tag: E5P2_RS05945
Genome accession: NZ_LR594666
Coordinates: 1233934..1235157 (-)
Length: 407 a.a.
NCBI ID: WP_162566227.1
UniProt ID: A0A8B6LHH1
Organism: Variovorax sp. SRS16
Function: assembly of type IV pilus (predicted from homology); DNA binding and uptake
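
The coordinates and the protein length listed in this overview agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function name `cds_lengths` is illustrative and not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a 1-based inclusive CDS span."""
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    aa_len = nt_len // 3 - 1  # the final codon is assumed to be a stop codon
    return nt_len, aa_len


print(cds_lengths(1233934, 1235157))  # (1224, 407)
```

The span of 1224 bp corresponds to 408 codons, of which 407 encode amino acids.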

Genomic Context


Location: 1228934..1240157
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E5P2_RS05920 (G3W92_RS05920) zapD 1229072..1229827 (-) 756 WP_162566222.1 cell division protein ZapD -
  E5P2_RS05925 (G3W92_RS05925) - 1229864..1230880 (-) 1017 WP_162566223.1 hypothetical protein -
  E5P2_RS05930 (G3W92_RS05930) - 1230889..1232385 (-) 1497 WP_162566224.1 MBOAT family protein -
  E5P2_RS05935 (G3W92_RS05935) coaE 1232467..1233066 (-) 600 WP_162566225.1 dephospho-CoA kinase -
  E5P2_RS05940 (G3W92_RS05940) pilD 1233080..1233934 (-) 855 WP_162566226.1 A24 family peptidase Machinery gene
  E5P2_RS05945 (G3W92_RS05945) pilC 1233934..1235157 (-) 1224 WP_162566227.1 type II secretion system F family protein Machinery gene
  E5P2_RS05950 (G3W92_RS05950) pilB 1235178..1236911 (-) 1734 WP_162566228.1 type IV-A pilus assembly ATPase PilB Machinery gene
  E5P2_RS05960 (G3W92_RS05960) - 1237145..1238137 (-) 993 WP_162566229.1 polyprenyl synthetase family protein -
  E5P2_RS05965 (G3W92_RS05965) rplU 1238324..1238635 (+) 312 WP_162566230.1 50S ribosomal protein L21 -
  E5P2_RS05970 (G3W92_RS05970) rpmA 1238648..1238905 (+) 258 WP_162566231.1 50S ribosomal protein L27 -
  E5P2_RS05975 (G3W92_RS05975) obgE 1238974..1240029 (+) 1056 WP_162566232.1 GTPase ObgE -
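
The Location range above appears to be the pilC coordinates extended by 5000 bp on each side, and the table coordinates also show how tightly pilD, pilC and pilB are packed. A short Python sketch of both checks follows; the 5000 bp flank is inferred from the numbers on this page rather than stated by the source, and the helper name `gap` is illustrative.

```python
FLANK = 5000  # assumed flank size, inferred from the Location range

pilc = (1233934, 1235157)  # pilC start and end, 1-based inclusive
window = (pilc[0] - FLANK, pilc[1] + FLANK)


def gap(left_end: int, right_start: int) -> int:
    """Bases strictly between two features; a negative value means overlap."""
    return right_start - left_end - 1


pild_pilc = gap(1233934, 1233934)  # pilD end, pilC start
pilc_pilb = gap(1235157, 1235178)  # pilC end, pilB start

print(window)     # (1228934, 1240157)
print(pild_pilc)  # -1, i.e. pilD and pilC share one base
print(pilc_pilb)  # 20
```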

Sequence


Protein


Length: 407 a.a.
Molecular weight: 44471.09 Da
Isoelectric point: 9.6151

>NTDB_id=1128204 E5P2_RS05945 WP_162566227.1 1233934..1235157(-) (pilC) [Variovorax sp. SRS16]
MATAAASSRTLKEYVFEWEGKDRNGKAVRGEVRAAGENQVQASLRRQGVLTTKIKKRRMRSGKSIKPKDIAIFTRQLATM
MKAGVPLLQAFDIVGRGNANPSVAKLLNDVRSDVETGTSLSAAFRKFPRYFDNLYCNLVEAGEAAGILEELLDRLATYME
KTEAIKSKIKSALMYPTSVIVVAFVVVAVIMIFVIPAFKQVFTSFGADLPAPTLIVMAISEYFVTYWWLIFGVIGGGIYF
FLQAWKRNERVQKFMDRLLLRVPIFGTLIDKSCVARWTRTLATMFAAGVPLVEALDSVGGASGNSVYADATTKIQQEVST
GTSLTAAMTNANLFPSMVLQMTAIGEESGSIDHMLGKAADFYEAEVDDMVAGLSSLMEPIIIVFLGVIIGGIVVSMYLPI
FKLGQVV

Nucleotide


Length: 1224 bp

>NTDB_id=1128204 E5P2_RS05945 WP_162566227.1 1233934..1235157(-) (pilC) [Variovorax sp. SRS16]
ATGGCCACGGCAGCAGCATCGTCCCGAACGCTCAAGGAGTATGTCTTCGAGTGGGAGGGCAAGGACCGCAACGGCAAGGC
CGTGCGCGGCGAAGTGCGGGCAGCGGGCGAGAACCAGGTGCAGGCGTCGTTGCGGCGCCAGGGCGTGCTGACGACGAAGA
TCAAGAAGCGGCGCATGCGCTCCGGCAAGTCCATCAAGCCGAAGGACATCGCCATCTTCACGCGCCAGCTGGCGACCATG
ATGAAAGCCGGCGTGCCATTGCTGCAGGCATTCGACATCGTCGGCCGCGGCAATGCCAACCCGAGTGTCGCCAAGCTGCT
CAACGACGTCCGCAGCGATGTCGAGACCGGCACCTCGCTCTCCGCCGCGTTCCGCAAGTTCCCGAGGTACTTCGACAACC
TCTACTGCAACCTGGTCGAGGCCGGCGAAGCCGCAGGTATCCTGGAAGAGTTGCTCGATCGCCTGGCAACCTACATGGAA
AAGACCGAGGCGATCAAGTCGAAGATCAAGTCGGCGCTGATGTATCCGACCTCGGTCATCGTGGTCGCCTTCGTGGTGGT
GGCGGTCATCATGATCTTCGTGATTCCGGCCTTCAAGCAGGTGTTCACCTCCTTCGGCGCCGACCTGCCCGCGCCGACGC
TGATCGTCATGGCGATCAGCGAATACTTCGTCACCTACTGGTGGCTGATCTTCGGCGTGATCGGCGGCGGCATCTACTTC
TTTCTCCAGGCCTGGAAGCGCAACGAGCGCGTGCAGAAGTTCATGGACCGTCTGTTGCTGCGCGTGCCGATCTTCGGCAC
GCTGATCGACAAGTCCTGCGTCGCGCGCTGGACCCGCACGCTGGCCACGATGTTCGCCGCAGGCGTCCCGCTGGTCGAAG
CGCTGGACTCCGTCGGCGGTGCATCGGGCAATTCGGTGTACGCGGACGCCACGACCAAGATCCAGCAGGAAGTCTCCACC
GGCACCAGCCTCACGGCGGCCATGACCAATGCCAACCTCTTTCCGTCGATGGTCCTGCAGATGACCGCGATCGGCGAGGA
GTCCGGCTCGATCGACCACATGCTCGGAAAGGCCGCCGACTTCTACGAGGCGGAAGTCGACGACATGGTCGCCGGCCTCT
CGAGCCTGATGGAGCCCATCATCATCGTTTTCCTCGGGGTCATCATCGGTGGCATCGTGGTCTCGATGTACCTGCCCATC
TTCAAGCTCGGCCAGGTCGTCTGA
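
The nucleotide record is given in coding orientation, so translating it should reproduce the protein record. Below is a minimal Python sketch that translates the first 30 and the last 24 nucleotides copied from the sequence above. It assumes the standard codon assignments, which bacterial translation table 11 shares for elongation codons; `CODON_TABLE` and `translate` are illustrative names, not part of any database tooling.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCCACGGCAGCAGCATCGTCCCGAACG"))  # MATAAASSRT
print(translate("TTCAAGCTCGGCCAGGTCGTCTGA"))        # FKLGQVV
```

Both fragments match the start and the end of the protein sequence listed above.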


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A8B6LHH1

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism                                   Identities (%)   Coverage (%)   Ha-value
  pilC    Pseudomonas stutzeri DSM 10701             52.723           99.263         0.523
  pilG    Neisseria gonorrhoeae MS11                 52.645           97.543         0.514
  pilG    Neisseria meningitidis 44/76-A             52.393           97.543         0.511
  pilC    Acinetobacter baylyi ADP1                  50.86            100            0.509
  pilC    Acinetobacter baumannii D1279779           49.37            97.543         0.482
  pilC    Legionella pneumophila strain ERS1305867   49.242           97.297         0.479
  pilC    Vibrio campbellii strain DS40M4            41.235           99.509         0.41
  pilC    Vibrio cholerae strain A1552               40.887           99.754         0.408
  pilC    Thermus thermophilus HB27                  38.861           99.263         0.386
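
The page does not define the Ha-value. The listed values are, however, consistent with the product of identity and coverage expressed as fractions, rounded to three decimals. The Python sketch below checks that observation against the rows above; it is an inference from this table, not a documented formula, and `identity_times_coverage` is an illustrative name.

```python
# (protein, organism, identities %, coverage %, listed Ha-value)
ROWS = [
    ("pilC", "Pseudomonas stutzeri DSM 10701", 52.723, 99.263, 0.523),
    ("pilG", "Neisseria gonorrhoeae MS11", 52.645, 97.543, 0.514),
    ("pilG", "Neisseria meningitidis 44/76-A", 52.393, 97.543, 0.511),
    ("pilC", "Acinetobacter baylyi ADP1", 50.86, 100.0, 0.509),
    ("pilC", "Acinetobacter baumannii D1279779", 49.37, 97.543, 0.482),
    ("pilC", "Legionella pneumophila strain ERS1305867", 49.242, 97.297, 0.479),
    ("pilC", "Vibrio campbellii strain DS40M4", 41.235, 99.509, 0.41),
    ("pilC", "Vibrio cholerae strain A1552", 40.887, 99.754, 0.408),
    ("pilC", "Thermus thermophilus HB27", 38.861, 99.263, 0.386),
]


def identity_times_coverage(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage, both converted from percent to fractions."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Collect rows whose listed value differs from the product by more than rounding error.
mismatches = [
    (protein, organism)
    for protein, organism, ident, cov, listed in ROWS
    if abs(identity_times_coverage(ident, cov) - listed) > 0.0006
]
print(mismatches)  # []
```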


Multiple sequence alignment