Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   AAGF34_RS00650 Genome accession   NZ_CP152376
Coordinates   143061..144284 (+) Length   407 a.a.
NCBI ID   WP_342618710.1    Uniprot ID   -
Organism   Rhodoferax sp. GW822-FHT02A01     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 138061..149284
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AAGF34_RS00630 (AAGF34_00630) rfbD 138824..139732 (+) 909 WP_342618706.1 dTDP-4-dehydrorhamnose reductase -
  AAGF34_RS00635 (AAGF34_00635) rfbA 139743..140654 (+) 912 WP_342618707.1 glucose-1-phosphate thymidylyltransferase RfbA -
  AAGF34_RS00640 (AAGF34_00640) rfbC 140654..141196 (+) 543 WP_342618708.1 dTDP-4-dehydrorhamnose 3,5-epimerase -
  AAGF34_RS00645 (AAGF34_00645) pilB 141308..143041 (+) 1734 WP_342618709.1 type IV-A pilus assembly ATPase PilB Machinery gene
  AAGF34_RS00650 (AAGF34_00650) pilC 143061..144284 (+) 1224 WP_342618710.1 type II secretion system F family protein Machinery gene
  AAGF34_RS00655 (AAGF34_00655) - 144284..145129 (+) 846 WP_342618711.1 A24 family peptidase -
  AAGF34_RS00660 (AAGF34_00660) coaE 145132..145740 (+) 609 WP_342618712.1 dephospho-CoA kinase -
  AAGF34_RS00665 (AAGF34_00665) zapD 145763..146518 (+) 756 WP_342618713.1 cell division protein ZapD -
  AAGF34_RS00670 (AAGF34_00670) - 146535..146738 (+) 204 WP_342618714.1 DNA gyrase inhibitor YacG -
  AAGF34_RS00675 (AAGF34_00675) - 146714..147199 (-) 486 WP_342618715.1 NUDIX domain-containing protein -
  AAGF34_RS00680 (AAGF34_00680) - 147196..148074 (-) 879 WP_342618716.1 ATP-binding protein -

Sequence


Protein


Download         Length: 407 a.a.        Molecular weight: 44586.27 Da        Isoelectric Point: 9.1258

>NTDB_id=988735 AAGF34_RS00650 WP_342618710.1 143061..144284(+) (pilC) [Rhodoferax sp. GW822-FHT02A01]
MATNPAKTANNNEAVYEWEGKDKNGKQVRGEIRAGGENQVKSALRRQGVTPTKIKMRRTRSGKSIKPKDLAIFTRQLATM
MKAGVPLLQAFDIVGRGNPNPSVTKLLNDIRNDVETGTSLSVAFRKYPIYFDNLYCNLVEAGEAAGILDQLLDRLAVYME
KTEAIKSKIKSALMYPISVLIVAFVVVAVIMIFVVPSFKQVFTSFGGELPGPTLVVIAMSEYFVKYWYLIFGGLGGGIYF
FLESWKRNEKVQEFMDRLMRKLPIFGVLVDKSCIARWTRTLSTMFAAGVPLVEALDSVGGAAGNIVYQHATQKIQQEVST
GTALTAAMTNANLFPSMVLQMCAIGEESGSIDHMLGKCADFYEAEVDDMVAGISSLMEPIIIVVLGTIIGGIVVAMYLPI
FKLGQVV

Nucleotide


Download         Length: 1224 bp        

>NTDB_id=988735 AAGF34_RS00650 WP_342618710.1 143061..144284(+) (pilC) [Rhodoferax sp. GW822-FHT02A01]
ATGGCTACGAATCCAGCTAAAACTGCCAACAACAATGAAGCAGTCTACGAATGGGAAGGCAAGGACAAGAATGGCAAGCA
GGTACGCGGTGAGATTCGTGCCGGCGGCGAAAACCAGGTCAAGTCCGCCTTGCGTCGCCAGGGTGTCACTCCCACCAAGA
TAAAGATGCGGCGCACCCGCTCCGGCAAGTCCATCAAGCCCAAGGATCTGGCCATCTTCACGCGTCAGCTGGCCACCATG
ATGAAGGCGGGCGTTCCCTTGCTACAGGCTTTCGATATCGTGGGTCGCGGCAACCCCAACCCCAGTGTCACCAAGCTGCT
CAACGACATCCGCAACGACGTGGAGACTGGCACATCGCTAAGCGTTGCATTCCGCAAATACCCGATCTACTTCGACAACC
TGTACTGCAATCTGGTCGAGGCCGGTGAAGCGGCCGGTATCCTGGACCAGTTGCTGGATCGTCTGGCGGTCTACATGGAA
AAGACCGAAGCCATCAAGTCCAAGATCAAGTCGGCCCTGATGTATCCCATCTCGGTTCTGATCGTAGCGTTTGTGGTGGT
CGCTGTGATCATGATTTTCGTGGTGCCCTCGTTCAAGCAGGTGTTTACCTCCTTCGGTGGCGAGCTGCCTGGACCAACCC
TGGTCGTGATTGCCATGAGTGAGTATTTCGTCAAGTACTGGTACCTGATCTTCGGTGGCCTGGGGGGCGGAATCTACTTT
TTCCTCGAGTCCTGGAAGCGCAATGAAAAGGTCCAGGAATTCATGGACAGGCTGATGCGCAAGCTTCCCATCTTTGGTGT
TCTGGTCGACAAGTCCTGTATCGCGCGCTGGACACGCACGCTCTCCACCATGTTTGCCGCCGGTGTCCCGCTGGTGGAAG
CACTGGACTCGGTGGGAGGAGCTGCCGGCAACATCGTATACCAGCATGCAACCCAGAAGATCCAGCAGGAAGTCTCCACG
GGTACTGCGTTGACTGCGGCCATGACCAACGCCAACCTGTTTCCGTCCATGGTGCTGCAGATGTGCGCCATCGGAGAGGA
ATCCGGCTCGATCGACCACATGCTGGGCAAGTGCGCCGACTTCTATGAAGCCGAAGTGGACGACATGGTGGCCGGCATTT
CCAGCCTGATGGAACCCATCATCATCGTGGTGCTGGGTACCATCATCGGTGGCATTGTGGTGGCCATGTACCTGCCCATC
TTCAAGCTGGGTCAAGTGGTTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Pseudomonas stutzeri DSM 10701

52.645

97.543

0.514

  pilG Neisseria gonorrhoeae MS11

51.256

97.789

0.501

  pilG Neisseria meningitidis 44/76-A

50.754

97.789

0.496

  pilC Acinetobacter baylyi ADP1

50.251

97.789

0.491

  pilC Legionella pneumophila strain ERS1305867

50.253

97.297

0.489

  pilC Acinetobacter baumannii D1279779

48.866

97.543

0.477

  pilC Vibrio cholerae strain A1552

40.05

97.543

0.391

  pilC Thermus thermophilus HB27

37.709

100

0.388

  pilC Vibrio campbellii strain DS40M4

37.75

98.28

0.371


Multiple sequence alignment