Detailed information    

In silico (bioinformatically predicted)

Overview


Name: pilC
Type: Machinery gene
Locus tag: ACA027_RS04190
Genome accession: NZ_CP166730
Coordinates: 961918..963159 (-)
Length: 413 a.a.
NCBI ID: WP_370681149.1
UniProt ID: -
Organism: Comamonas sp. GB3 AK4-5
Function: assembly of type IV pilus (predicted from homology)
          DNA binding and uptake
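
The coordinates and the protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function name `span_length` is illustrative and is not part of the database.

```python
def span_length(coords: str) -> int:
    """Length in bp of a 'start..end' range, assuming 1-based inclusive coordinates."""
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1

gene_bp = span_length("961918..963159")
# One codon per residue, minus the stop codon (assumed to be present).
protein_aa = gene_bp // 3 - 1

print(gene_bp, protein_aa)
```

Under those assumptions the result agrees with the 1242 bp and 413 a.a. reported in this record.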

Genomic Context


Location: 956918..968159
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACA027_RS04160 (ACA027_04160) - 957704..958657 (+) 954 WP_370681143.1 ATP-binding protein -
  ACA027_RS04165 (ACA027_04165) - 958709..959209 (+) 501 WP_370681144.1 NUDIX domain-containing protein -
  ACA027_RS04170 (ACA027_04170) - 959320..959535 (-) 216 WP_370681145.1 DNA gyrase inhibitor YacG -
  ACA027_RS04175 (ACA027_04175) zapD 959575..960330 (-) 756 WP_370681146.1 cell division protein ZapD -
  ACA027_RS04180 (ACA027_04180) coaE 960432..961043 (-) 612 WP_370681147.1 dephospho-CoA kinase -
  ACA027_RS04185 (ACA027_04185) pilD 961043..961918 (-) 876 WP_370681148.1 A24 family peptidase Machinery gene
  ACA027_RS04190 (ACA027_04190) pilC 961918..963159 (-) 1242 WP_370681149.1 type II secretion system F family protein Machinery gene
  ACA027_RS04195 (ACA027_04195) pilB 963209..964948 (-) 1740 WP_370681150.1 type IV-A pilus assembly ATPase PilB Machinery gene
  ACA027_RS04205 (ACA027_04205) - 965200..966129 (-) 930 WP_370682507.1 polyprenyl synthetase family protein -
  ACA027_RS04210 (ACA027_04210) rplU 966346..966657 (+) 312 WP_370682508.1 50S ribosomal protein L21 -
  ACA027_RS04215 (ACA027_04215) rpmA 966676..966933 (+) 258 WP_066540535.1 50S ribosomal protein L27 -
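
The Location range appears to be the pilC coordinates extended by 5000 bp on each side, and the three machinery genes sit almost back to back on the minus strand. The sketch below checks both observations from the table rows. The 5000 bp flank is an inference from the numbers, not something the record states, and the helper names are hypothetical.

```python
# Rows copied from the table above: (gene, start, end, strand).
GENES = [
    ("pilD", 961043, 961918, "-"),
    ("pilC", 961918, 963159, "-"),
    ("pilB", 963209, 964948, "-"),
]
FLANK = 5000  # assumption inferred from the Location line

def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Window around a gene, extended by `flank` bp on both sides."""
    return start - flank, end + flank

def spacing(left: tuple, right: tuple) -> int:
    """Bases between two features in coordinate order; negative means overlap."""
    return right[1] - left[2] - 1

window = context_window(961918, 963159)
gaps = {f"{a[0]}/{b[0]}": spacing(a, b) for a, b in zip(GENES, GENES[1:])}

print(window)
print(gaps)
```

With 1-based inclusive coordinates, pilD and pilC share one base (spacing of -1), while pilC and pilB are separated by 49 bp.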

Sequence


Protein


Length: 413 a.a.        Molecular weight: 45256.09 Da        Isoelectric point: 10.2635

>NTDB_id=1035051 ACA027_RS04190 WP_370681149.1 961918..963159(-) (pilC) [Comamonas sp. GB3 AK4-5]
MATAARPHSSSGKRTASKERLFAWEGKDRSGKLVRGEMRAASALLVHSSLRRQGIGGLRIQQRRMPAGKRIRPRDIALFT
RQMASMLKAGVPLLQAFDIVGSGHHNPRVSQLLLEIRSDVETGTSLSAAFRKHPLYFNALYCNLIEAGETAGILEALLDR
LASYMEKTERIKLQIRSALMYPCTVLAVALVVVSVIMVWVIPAFKEVFSSFGADLPAPTLLVMAVSDTVVRWWWQLCAGL
VATVYGLRLAWKRSERLQQHMDRLLLKLPLLGPLLQQSCVARWTRTLSTMFAAGVPLVEALGSVGGASGNHVYHSATQRI
QQEVATGTSLSTAMGHTQVFPHMVLQMAAIGEESGTLDHMLGKAADYFEQEVDERVAGLSSLMEPLIIVFLGTLIGGIVV
SMYLPIFKLGQVV

Nucleotide


Length: 1242 bp

>NTDB_id=1035051 ACA027_RS04190 WP_370681149.1 961918..963159(-) (pilC) [Comamonas sp. GB3 AK4-5]
ATGGCGACTGCAGCCCGTCCCCACAGCAGCAGTGGCAAGCGCACCGCCAGCAAAGAGCGGCTGTTTGCCTGGGAGGGCAA
GGACCGCAGCGGCAAGCTGGTGCGCGGCGAGATGCGTGCCGCAAGCGCCCTCCTGGTGCACTCCAGCCTGCGCCGCCAGG
GCATTGGTGGCCTGCGCATCCAGCAGCGGCGCATGCCTGCAGGCAAGCGCATCCGCCCCAGGGACATCGCCCTGTTCACC
CGCCAGATGGCCAGCATGCTGAAGGCCGGTGTTCCGCTGCTGCAGGCCTTCGACATCGTGGGGAGCGGCCACCACAACCC
CAGAGTCAGCCAGCTGCTCCTGGAGATCCGCAGCGATGTGGAAACAGGCACCTCGCTGAGTGCGGCTTTTCGCAAGCACC
CGCTTTACTTCAACGCGCTGTATTGCAACCTGATCGAGGCCGGCGAGACCGCAGGCATTCTGGAGGCGCTGCTGGACCGC
CTGGCCAGCTATATGGAGAAGACCGAGCGCATCAAGCTCCAGATCCGGTCGGCGCTGATGTACCCCTGCACCGTGCTCGC
CGTGGCCCTGGTGGTGGTGAGCGTGATCATGGTCTGGGTGATCCCGGCGTTCAAGGAAGTCTTCTCCTCCTTCGGCGCCG
ATCTGCCCGCGCCCACGCTGCTGGTGATGGCGGTCAGCGACACCGTGGTGCGCTGGTGGTGGCAGCTCTGCGCTGGCCTG
GTTGCCACCGTCTACGGCCTGCGGCTGGCCTGGAAGCGCAGCGAACGCCTGCAGCAGCACATGGACCGGCTGCTGCTGAA
GCTGCCCCTCTTGGGCCCGCTGCTCCAACAGTCCTGCGTGGCCCGCTGGACGCGCACGCTCTCCACCATGTTTGCCGCCG
GCGTTCCGCTGGTGGAGGCCCTGGGCTCGGTGGGCGGCGCCTCGGGCAACCATGTCTACCACAGCGCGACACAGCGCATT
CAGCAGGAGGTGGCCACCGGTACCAGCCTGAGCACCGCCATGGGCCATACCCAGGTCTTTCCCCATATGGTGCTGCAGAT
GGCTGCCATCGGCGAGGAGTCAGGCACCCTCGACCATATGCTGGGCAAGGCCGCCGACTACTTCGAGCAAGAGGTCGATG
AGAGGGTGGCCGGCCTGTCCAGCCTGATGGAGCCCCTCATCATCGTGTTCCTGGGCACGCTGATTGGCGGTATCGTCGTG
TCCATGTATTTGCCCATTTTCAAACTCGGCCAGGTGGTGTGA
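
The nucleotide record is given in coding orientation, so translating it should reproduce the protein sequence. The sketch below is a minimal pure-Python translation that builds the standard codon table in the usual TCAG order. It is run here only on the first 30 and last 42 bases of the sequence above; `translate` is an illustrative helper, not a database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, codons enumerated in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

five_prime = "ATGGCGACTGCAGCCCGTCCCCACAGCAGC"               # first 30 nt
three_prime = "TCCATGTATTTGCCCATTTTCAAACTCGGCCAGGTGGTGTGA"  # last 42 nt, in frame

print(translate(five_prime))
print(translate(three_prime))
```

The two fragments translate to the start (MATAARPHSS) and end (SMYLPIFKLGQVV) of the protein sequence listed in this record.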


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism  Identities (%)  Coverage (%)  Ha-value
  pilC  Pseudomonas stutzeri DSM 10701  49.628  97.579  0.484
  pilC  Acinetobacter baylyi ADP1  48.744  96.368  0.47
  pilG  Neisseria gonorrhoeae MS11  48.371  96.61  0.467
  pilG  Neisseria meningitidis 44/76-A  47.87  96.61  0.462
  pilC  Legionella pneumophila strain ERS1305867  47.222  95.884  0.453
  pilC  Acinetobacter baumannii D1279779  46.348  96.126  0.446
  pilC  Vibrio cholerae strain A1552  39.9  97.094  0.387
  pilC  Vibrio campbellii strain DS40M4  38.519  98.063  0.378
  pilC  Thermus thermophilus HB27  38  96.852  0.368
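
The record does not define the Ha-value, but the listed numbers are consistent with identity multiplied by coverage, with both percentages expressed as fractions. The sketch below checks that relationship against every row. Treat the formula as an observation from this table rather than the database's stated definition; `ha_estimate` is a hypothetical helper.

```python
# (protein, organism, identity %, coverage %, listed Ha-value), copied from the table.
HITS = [
    ("pilC", "Pseudomonas stutzeri DSM 10701", 49.628, 97.579, 0.484),
    ("pilC", "Acinetobacter baylyi ADP1", 48.744, 96.368, 0.47),
    ("pilG", "Neisseria gonorrhoeae MS11", 48.371, 96.61, 0.467),
    ("pilG", "Neisseria meningitidis 44/76-A", 47.87, 96.61, 0.462),
    ("pilC", "Legionella pneumophila strain ERS1305867", 47.222, 95.884, 0.453),
    ("pilC", "Acinetobacter baumannii D1279779", 46.348, 96.126, 0.446),
    ("pilC", "Vibrio cholerae strain A1552", 39.9, 97.094, 0.387),
    ("pilC", "Vibrio campbellii strain DS40M4", 38.519, 98.063, 0.378),
    ("pilC", "Thermus thermophilus HB27", 38.0, 96.852, 0.368),
]

def ha_estimate(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relationship: (identity / 100) * (coverage / 100)."""
    return identity_pct * coverage_pct / 10_000

# Largest difference between the estimate and the listed value across all rows.
max_error = max(abs(ha_estimate(i, c) - ha) for _, _, i, c, ha in HITS)
print(f"largest deviation from listed Ha-values: {max_error:.4f}")
```

Every row agrees with the listed value to within rounding at the third decimal place.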


Multiple sequence alignment