Detailed information    

In silico (bioinformatically predicted)

Overview


Name   pilC   Type   Machinery gene
Locus tag   ACZ75_RS13860 Genome accession   NZ_CP012201
Coordinates   3420232..3421449 (+) Length   405 a.a.
NCBI ID   WP_050409290.1    Uniprot ID   A0A0K1JZI9
Organism   Massilia sp. NR 4-1     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake
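The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is an illustration only: the helper names (`parse_coordinates`, `gene_length_bp`, `protein_length_aa`) are hypothetical, and it assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon.

```python
import re


def parse_coordinates(text):
    """Parse a string such as '3420232..3421449 (+)' into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def gene_length_bp(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp):
    """Protein length, assuming the CDS includes exactly one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


start, end, strand = parse_coordinates("3420232..3421449 (+)")
length_bp = gene_length_bp(start, end)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the result agrees with the 1218 bp nucleotide length and the 405 a.a. protein length reported in this record.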

Genomic Context


Location: 3415232..3426449
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACZ75_RS13840 (ACZ75_13885) - 3415453..3415950 (-) 498 WP_050409286.1 NADH-quinone oxidoreductase subunit B family protein -
  ACZ75_RS13845 - 3416038..3416871 (-) 834 WP_050409287.1 AraC family transcriptional regulator -
  ACZ75_RS13850 (ACZ75_13895) - 3417061..3418350 (+) 1290 WP_050409288.1 HlyC/CorC family transporter -
  ACZ75_RS13855 (ACZ75_13900) pilB 3418445..3420175 (+) 1731 WP_050409289.1 type IV-A pilus assembly ATPase PilB Machinery gene
  ACZ75_RS13860 (ACZ75_13905) pilC 3420232..3421449 (+) 1218 WP_050409290.1 type II secretion system F family protein Machinery gene
  ACZ75_RS13865 (ACZ75_13910) pilD 3421477..3422346 (+) 870 WP_050409291.1 A24 family peptidase Machinery gene
  ACZ75_RS13870 (ACZ75_13915) coaE 3422377..3423027 (+) 651 WP_050409292.1 dephospho-CoA kinase -
  ACZ75_RS13875 (ACZ75_13920) zapD 3423092..3423847 (+) 756 WP_050409293.1 cell division protein ZapD -
  ACZ75_RS13880 (ACZ75_13925) yacG 3423860..3424045 (+) 186 WP_050409294.1 DNA gyrase inhibitor YacG -
  ACZ75_RS13885 (ACZ75_13930) - 3424151..3424555 (-) 405 WP_050409295.1 NUDIX domain-containing protein -
  ACZ75_RS13890 (ACZ75_13935) - 3424552..3425433 (-) 882 WP_050409296.1 ATP-binding protein -
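
The table above places pilB, pilC and pilD next to each other on the (+) strand. As a minimal sketch, the spacing between adjacent genes can be computed from the listed coordinates. The function name `intergenic_gaps` is hypothetical, and coordinates are assumed to be 1-based and inclusive.

```python
# (gene name, start, end) copied from the genomic context table
genes = [
    ("pilB", 3418445, 3420175),
    ("pilC", 3420232, 3421449),
    ("pilD", 3421477, 3422346),
]


def intergenic_gaps(genes):
    """Return (upstream, downstream, gap_bp) for each pair of adjacent genes.

    A negative gap would indicate overlapping genes.
    """
    ordered = sorted(genes, key=lambda g: g[1])
    gaps = []
    for (name_a, _, end_a), (name_b, start_b, _) in zip(ordered, ordered[1:]):
        gaps.append((name_a, name_b, start_b - end_a - 1))
    return gaps


gaps = intergenic_gaps(genes)
for upstream, downstream, gap in gaps:
    print(f"{upstream} -> {downstream}: {gap} bp")
```

The same subtraction (end - start + 1) reproduces the Size (bp) column for each row, which is a quick way to spot transcription errors in the table.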

Sequence


Protein


Length: 405 a.a.        Molecular weight: 44401.27 Da        Isoelectric Point: 9.8773

>NTDB_id=152166 ACZ75_RS13860 WP_050409290.1 3420232..3421449(+) (pilC) [Massilia sp. NR 4-1]
MARNAGNQIKESVFAWEGKDKSGKTVRGELRAGGEAVVNVTLRRQGIVVTKVKKKVYRSGKKVSDKDITLFTRQLATMMK
AGVPLLQSFDIVGKGHANPSVSKLVMDLRADIETGTSLNQAFRKFPLYFDPLFCNLVGAGEQAGILEDLLTRLAIYKEKT
LAMKAKIKSALMYPVSILAVAFIVTAVIMIWVVPAFKEVFSSFGADLPAPTVFVMTVSEYFVKYWYVIFGTLFGGLYFFF
QSWRRSVKMQQAMDRFLLKIPVFGDVVRKATIARWTRTLATMFAAGVPLVEALDSVGGASGNHVYLEATRRIQNEVSTGT
SLTVAMQNADVFPNMVTQMVAIGEESGALDAMLGKVADFYEEEVDEAVASLSSLMEPMIMVILGVLIGGLVVAMYLPIFK
LGSVV

Nucleotide


Length: 1218 bp

>NTDB_id=152166 ACZ75_RS13860 WP_050409290.1 3420232..3421449(+) (pilC) [Massilia sp. NR 4-1]
ATGGCAAGGAATGCGGGCAACCAGATCAAGGAATCGGTCTTCGCCTGGGAGGGCAAGGACAAGAGCGGCAAGACCGTGCG
CGGCGAGCTGCGCGCCGGCGGCGAGGCGGTGGTCAACGTCACCCTGCGGCGCCAGGGCATCGTCGTCACCAAGGTCAAGA
AAAAAGTCTACCGCTCCGGCAAGAAGGTCAGCGACAAGGACATCACCCTCTTCACGCGCCAGCTGGCAACCATGATGAAG
GCCGGCGTGCCGCTGCTGCAATCGTTCGACATCGTGGGCAAGGGCCACGCCAATCCTTCCGTGTCCAAGCTGGTGATGGA
TCTGCGCGCCGATATCGAGACCGGCACCAGCCTGAATCAGGCCTTCCGTAAATTCCCCCTGTACTTCGATCCGCTGTTCT
GCAACCTGGTGGGCGCCGGCGAACAGGCCGGGATCCTGGAGGACTTGCTGACCCGGCTCGCCATCTACAAGGAAAAGACC
CTGGCCATGAAGGCCAAGATCAAATCGGCCCTGATGTACCCGGTGTCGATCCTGGCGGTGGCCTTCATCGTCACCGCCGT
CATCATGATCTGGGTGGTGCCGGCCTTCAAGGAAGTGTTCAGCAGCTTCGGCGCCGACCTGCCCGCACCCACCGTATTCG
TGATGACGGTGTCCGAATACTTCGTCAAATACTGGTACGTCATTTTCGGCACCTTGTTCGGCGGCCTGTATTTCTTCTTC
CAGTCCTGGCGCCGATCGGTCAAAATGCAGCAGGCGATGGACCGCTTCCTGCTCAAGATCCCCGTTTTCGGCGACGTGGT
GCGCAAGGCGACCATCGCGCGCTGGACGCGCACCCTGGCCACCATGTTCGCCGCCGGCGTGCCGCTGGTGGAGGCGCTGG
ACTCGGTGGGCGGGGCTTCGGGCAACCATGTGTACCTGGAAGCGACGCGCCGCATCCAGAATGAAGTCAGCACCGGCACC
AGCCTGACGGTGGCGATGCAGAATGCCGACGTCTTTCCGAACATGGTCACGCAGATGGTGGCCATCGGCGAGGAGTCCGG
CGCGCTGGACGCCATGCTGGGCAAGGTGGCCGACTTCTACGAGGAGGAAGTGGACGAGGCGGTGGCCTCGCTGTCCAGCC
TGATGGAGCCGATGATCATGGTGATCCTCGGCGTATTGATCGGCGGCCTGGTGGTCGCCATGTATCTTCCGATCTTCAAG
CTCGGCTCAGTAGTGTAA
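
The FASTA headers in this record pack several of the Overview fields into one line. The sketch below shows one way to split such a header into named fields. The field names (`ntdb_id`, `locus_tag`, `protein_id` and so on) are assumptions inferred from the Overview table, not a documented header specification, and `parse_header` is a hypothetical helper.

```python
import re

# Pattern inferred from the header lines in this record; the layout is an
# assumption, not an official format definition.
HEADER_PATTERN = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]*)\) "
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line):
    """Split a header line into a dict of named fields."""
    match = HEADER_PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=152166 ACZ75_RS13860 WP_050409290.1 "
    "3420232..3421449(+) (pilC) [Massilia sp. NR 4-1]"
)
record = parse_header(header)
print(record["gene"], record["organism"])
```

Parsing the coordinates as integers makes it straightforward to compare the header against the sequence length given just above the FASTA block.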


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0K1JZI9

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Pseudomonas stutzeri DSM 10701 55.446 99.753 0.553
  pilC Legionella pneumophila strain ERS1305867 54.545 97.778 0.533
  pilG Neisseria gonorrhoeae MS11 52.078 100 0.526
  pilG Neisseria meningitidis 44/76-A 51.834 100 0.523
  pilC Acinetobacter baumannii D1279779 51.889 98.025 0.509
  pilC Acinetobacter baylyi ADP1 51.256 98.272 0.504
  pilC Vibrio campbellii strain DS40M4 40.302 98.025 0.395
  pilC Vibrio cholerae strain A1552 39.646 97.778 0.388
  pilC Thermus thermophilus HB27 36.145 100 0.37


Multiple sequence alignment