Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   ACEN9H_RS24815 Genome accession   NZ_CP168562
Coordinates   5689236..5690474 (+) Length   412 a.a.
NCBI ID   WP_056122892.1    Uniprot ID   -
Organism   Massilia cellulosiltytica strain CT11-92     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 5684236..5695474
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACEN9H_RS24795 (ACEN9H_24800) - 5684348..5685031 (+) 684 WP_156403646.1 hypothetical protein -
  ACEN9H_RS24800 (ACEN9H_24805) - 5685036..5685851 (+) 816 WP_413672448.1 slipin family protein -
  ACEN9H_RS24805 (ACEN9H_24810) - 5685938..5687245 (+) 1308 WP_413340974.1 HlyC/CorC family transporter -
  ACEN9H_RS24810 (ACEN9H_24815) pilB 5687493..5689226 (+) 1734 WP_056122895.1 type IV-A pilus assembly ATPase PilB Machinery gene
  ACEN9H_RS24815 (ACEN9H_24820) pilC 5689236..5690474 (+) 1239 WP_056122892.1 type II secretion system F family protein Machinery gene
  ACEN9H_RS24820 (ACEN9H_24825) - 5691005..5691913 (-) 909 WP_413672449.1 LysR substrate-binding domain-containing protein -
  ACEN9H_RS24825 (ACEN9H_24830) - 5692008..5693186 (+) 1179 WP_413672450.1 SGNH/GDSL hydrolase family protein -
  ACEN9H_RS24830 (ACEN9H_24835) - 5693191..5694027 (+) 837 WP_413672451.1 AraC family transcriptional regulator -
  ACEN9H_RS24835 (ACEN9H_24840) - 5694062..5694958 (+) 897 WP_413672452.1 DMT family transporter -
  ACEN9H_RS24840 (ACEN9H_24845) - 5694928..5695233 (-) 306 WP_371768675.1 hypothetical protein -

Sequence


Protein


Download         Length: 412 a.a.        Molecular weight: 45203.48 Da        Isoelectric Point: 10.1731

>NTDB_id=1044563 ACEN9H_RS24815 WP_056122892.1 5689236..5690474(+) (pilC) [Massilia cellulosiltytica strain CT11-92]
MATTSAPARRAPSAQVKEHIFAWEGKDKTGKTVRGEMRAGGETIVNVTLRRQGIMVTKVKKKVYRSGKKIQDKDLTLFTR
QLATMMKAGVPLLQSFDIVGKGHSNPSMSKLIMDLRADIETGTSLNNAFRKYPLYFDPLFCNLVGAGEQAGILEDLLTRL
AIYKEKTLALKSKIKGALMYPCAIIAIAFIVTAVIMIWVVPAFKSVFSSFGANLPAPTLIVMGISDFVVKWWYIIFGSIF
GALYFFFQSWRRSLKMQQFMDRVLLRLPVFGEVIRKATIARWTRTLATMFAAGVPLVEALDSVGGASGNNVYLEATKKIQ
TEVSTGTSLTAAMQNANVFPNMVTQMVAIGEESGALDGMLGKVADFYEEEVDEAVKTLSSLMEPMIMVVLGVLIGGLVIA
MYLPIFKLGSVV

Nucleotide


Download         Length: 1239 bp        

>NTDB_id=1044563 ACEN9H_RS24815 WP_056122892.1 5689236..5690474(+) (pilC) [Massilia cellulosiltytica strain CT11-92]
ATGGCAACGACATCCGCCCCGGCACGCCGCGCGCCCAGCGCGCAAGTCAAGGAACACATCTTCGCCTGGGAAGGCAAGGA
CAAGACCGGCAAGACCGTGCGCGGCGAAATGCGCGCCGGCGGCGAGACCATCGTGAACGTGACCCTGCGCCGCCAGGGCA
TCATGGTGACCAAGGTCAAGAAGAAGGTGTACCGCAGCGGCAAGAAGATCCAGGACAAGGACCTCACGCTGTTCACGCGC
CAGCTGGCCACGATGATGAAAGCCGGCGTGCCGCTGCTGCAATCGTTCGACATCGTCGGCAAGGGCCATTCGAATCCGTC
GATGTCCAAGCTGATCATGGACTTGCGCGCCGACATCGAGACGGGCACCAGCCTGAACAACGCGTTCCGCAAGTATCCGC
TGTACTTCGACCCGCTGTTCTGCAACCTGGTCGGCGCCGGCGAGCAGGCCGGTATCCTCGAGGACCTGCTCACGCGCCTG
GCGATCTACAAGGAAAAGACGCTTGCGCTGAAGAGCAAGATCAAGGGTGCGCTGATGTACCCCTGCGCGATCATCGCCAT
CGCCTTCATCGTCACGGCCGTGATCATGATCTGGGTGGTGCCCGCCTTCAAATCCGTGTTTTCCAGCTTCGGCGCGAACC
TGCCGGCGCCGACCCTGATCGTGATGGGCATCTCGGATTTCGTCGTGAAATGGTGGTACATCATCTTCGGCTCGATCTTC
GGCGCGCTGTACTTCTTCTTCCAGTCCTGGCGCCGGTCGCTGAAAATGCAGCAGTTCATGGACCGCGTGCTGCTGCGCCT
CCCCGTGTTCGGCGAAGTGATCCGCAAGGCGACGATCGCCCGCTGGACCCGCACGCTGGCGACGATGTTCGCGGCCGGCG
TGCCGCTCGTGGAAGCGCTGGACTCCGTGGGCGGCGCGTCCGGCAACAACGTCTACCTGGAAGCCACGAAAAAGATCCAG
ACGGAAGTCAGCACGGGCACGAGCCTGACGGCCGCGATGCAGAACGCCAATGTATTCCCCAACATGGTCACGCAAATGGT
CGCCATCGGCGAGGAATCCGGCGCGCTGGACGGCATGCTGGGCAAGGTGGCCGACTTCTACGAGGAAGAAGTGGACGAGG
CCGTCAAGACGCTGTCGTCGCTGATGGAGCCGATGATCATGGTCGTGCTCGGGGTGCTGATCGGCGGTCTCGTGATTGCC
ATGTATCTGCCGATCTTCAAGCTGGGGTCGGTGGTCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Pseudomonas stutzeri DSM 10701

55.25

97.087

0.536

  pilC Legionella pneumophila strain ERS1305867

54.545

96.117

0.524

  pilC Acinetobacter baylyi ADP1

51.256

96.602

0.495

  pilC Acinetobacter baumannii D1279779

51.385

96.359

0.495

  pilG Neisseria gonorrhoeae MS11

51.134

96.359

0.493

  pilG Neisseria meningitidis 44/76-A

50.882

96.359

0.49

  pilC Vibrio campbellii strain DS40M4

39.646

96.117

0.381

  pilC Vibrio cholerae strain A1552

39.646

96.117

0.381


Multiple sequence alignment