Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   L6418_RS12085 Genome accession   NZ_AP023423
Coordinates   2491447..2492667 (+) Length   406 a.a.
NCBI ID   WP_237247174.1    Uniprot ID   A0AAN1XBV8
Organism   Sideroxyarcus emersonii strain MIZ01     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2486447..2497667
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  L6418_RS12065 (MIZ01_2463) - 2486557..2487177 (-) 621 WP_237247170.1 ParA family protein -
  L6418_RS12070 (MIZ01_2464) ilvD 2487231..2489081 (-) 1851 WP_237247171.1 dihydroxy-acid dehydratase -
  L6418_RS12075 (MIZ01_2465) lgt 2489147..2489950 (+) 804 WP_237247172.1 prolipoprotein diacylglyceryl transferase -
  L6418_RS12080 (MIZ01_2466) - 2490050..2491339 (+) 1290 WP_237247173.1 HlyC/CorC family transporter -
  L6418_RS12085 (MIZ01_2467) pilC 2491447..2492667 (+) 1221 WP_237247174.1 type II secretion system F family protein Machinery gene
  L6418_RS12090 (MIZ01_2468) pilD 2492850..2493719 (+) 870 WP_237247175.1 A24 family peptidase Machinery gene
  L6418_RS12095 (MIZ01_2469) coaE 2493716..2494327 (+) 612 WP_237247176.1 dephospho-CoA kinase -
  L6418_RS12100 (MIZ01_2470) zapD 2494389..2495147 (+) 759 WP_237247177.1 cell division protein ZapD -
  L6418_RS12105 (MIZ01_2471) - 2495144..2495329 (+) 186 WP_237247178.1 DNA gyrase inhibitor YacG -
  L6418_RS12110 (MIZ01_2472) - 2495311..2496255 (-) 945 WP_237247179.1 Nudix family hydrolase -
  L6418_RS12115 (MIZ01_2473) - 2496242..2497105 (-) 864 WP_237247180.1 ATP-binding protein -

Sequence


Protein


Download         Length: 406 a.a.        Molecular weight: 43983.57 Da        Isoelectric Point: 9.6229

>NTDB_id=83173 L6418_RS12085 WP_237247174.1 2491447..2492667(+) (pilC) [Sideroxyarcus emersonii strain MIZ01]
MATARSDKPKEQQYAWEGKDKAGKIVKGEMRGAGEASVSAHLRRQGITVTKIKKSSKGGGKVTEKDITLFTRQLATMLKS
GVPLLQAFDIVGKGHDNPAVARLLFDIKTDVETGSSLEQSFRKFPLYFDDLYCNLLGAGEAAGILDSLLDRLATYKEKIL
AIKSKIKSALFYPVSIIVVAFVITAVIMIFVIPAFKTLFSNFGADLPGPTLVVMSISDFFVAWWWAIFGIVGGSVYGFFY
AWKRNKTMQRRMDQLMLKIPVFGPLVRKASIARWARTLSTMFAAGVPLVEAFDSVAGAAGNAVYSDATKAIQREVTSGTS
LTVAMQNTDVFPSMVLQMVAIGEESGALDAMLSKVADFFEAEVDDAVEALSSLMEPIIMVVLGTLIGGMVVAMYLPIFKM
GQAVSG

Nucleotide


Download         Length: 1221 bp        

>NTDB_id=83173 L6418_RS12085 WP_237247174.1 2491447..2492667(+) (pilC) [Sideroxyarcus emersonii strain MIZ01]
ATGGCTACAGCAAGATCTGACAAACCGAAAGAGCAGCAATACGCCTGGGAAGGCAAGGACAAGGCCGGCAAGATCGTCAA
GGGCGAGATGCGCGGCGCCGGCGAAGCCAGCGTTTCGGCGCATTTGCGCCGCCAGGGCATCACCGTCACCAAGATCAAGA
AGAGCTCAAAGGGCGGCGGCAAGGTTACCGAAAAGGACATCACGCTGTTCACGCGCCAGCTCGCCACCATGCTGAAGTCT
GGCGTACCGCTGCTGCAAGCTTTCGACATCGTCGGCAAGGGTCACGACAATCCTGCCGTCGCACGCTTGCTGTTCGACAT
CAAGACCGATGTCGAGACCGGCAGCAGCCTGGAACAGTCATTCCGCAAGTTCCCGCTGTATTTCGACGACCTGTACTGCA
ACCTGCTCGGTGCGGGCGAAGCGGCGGGTATCCTGGACAGTCTGCTGGATCGTCTGGCGACCTACAAGGAAAAGATCCTG
GCCATCAAGAGCAAGATCAAATCCGCCTTGTTCTACCCTGTTTCCATCATCGTGGTCGCATTCGTCATCACCGCAGTGAT
CATGATCTTCGTGATCCCCGCCTTCAAGACGCTGTTTTCCAACTTCGGCGCCGATTTGCCCGGCCCGACCCTGGTCGTCA
TGTCCATCTCCGACTTTTTCGTGGCATGGTGGTGGGCGATTTTCGGTATCGTCGGCGGCAGCGTATACGGGTTCTTCTAT
GCCTGGAAGCGCAACAAGACCATGCAGCGCCGCATGGACCAGCTGATGCTGAAGATCCCGGTGTTCGGGCCGCTGGTGCG
CAAGGCTTCCATCGCGCGCTGGGCGCGCACCCTCTCCACCATGTTTGCAGCCGGCGTGCCGCTGGTCGAGGCTTTCGACT
CCGTCGCGGGCGCTGCCGGCAATGCAGTCTATTCCGATGCCACCAAGGCCATCCAGCGCGAAGTGACGTCGGGCACCAGC
CTGACCGTGGCCATGCAAAATACCGATGTGTTCCCCAGCATGGTGCTGCAGATGGTCGCCATCGGCGAAGAGTCCGGCGC
CCTGGACGCCATGCTGAGCAAGGTCGCAGACTTCTTCGAGGCCGAGGTCGACGACGCTGTGGAAGCGTTGTCGAGCCTGA
TGGAACCCATCATCATGGTGGTCCTGGGTACGCTGATCGGCGGCATGGTGGTGGCAATGTACCTGCCGATCTTCAAAATG
GGCCAGGCCGTCAGCGGTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Legionella pneumophila strain ERS1305867

54.342

99.261

0.539

  pilC Pseudomonas stutzeri DSM 10701

53.133

98.276

0.522

  pilG Neisseria gonorrhoeae MS11

51.75

98.522

0.51

  pilG Neisseria meningitidis 44/76-A

51

98.522

0.502

  pilC Acinetobacter baumannii D1279779

49.118

97.783

0.48

  pilC Acinetobacter baylyi ADP1

48.615

97.783

0.475

  pilC Vibrio cholerae strain A1552

40.355

97.044

0.392

  pilC Vibrio campbellii strain DS40M4

38.329

100

0.384

  pilC Thermus thermophilus HB27

37.594

98.276

0.369


Multiple sequence alignment