Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilF   Type   Machinery gene
Locus tag   E6W36_RS12590 Genome accession   NZ_CP039704
Coordinates   2139622..2140473 (-) Length   283 a.a.
NCBI ID   WP_222872810.1    Uniprot ID   -
Organism   Hankyongella ginsenosidimutans strain W1-2-3     
Function   power the assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2134622..2145473
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E6W36_RS12545 (E6W36_11875) - 2134922..2135179 (-) 258 WP_222872801.1 hypothetical protein -
  E6W36_RS12550 (E6W36_11880) - 2135253..2135396 (-) 144 WP_222872802.1 hypothetical protein -
  E6W36_RS12555 (E6W36_11885) - 2135375..2135926 (-) 552 WP_222872803.1 hypothetical protein -
  E6W36_RS12560 - 2136515..2136982 (-) 468 WP_222872804.1 type II secretion system protein -
  E6W36_RS12565 - 2136979..2137146 (-) 168 WP_222872805.1 hypothetical protein -
  E6W36_RS12570 - 2137352..2137498 (-) 147 WP_222872806.1 hypothetical protein -
  E6W36_RS12575 (E6W36_11895) - 2137639..2137788 (-) 150 WP_222872807.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  E6W36_RS12580 (E6W36_11900) gspG 2137790..2138200 (-) 411 WP_246047692.1 type II secretion system major pseudopilin GspG -
  E6W36_RS12585 (E6W36_11905) - 2138399..2139619 (-) 1221 WP_222872809.1 type II secretion system F family protein -
  E6W36_RS12590 pilF 2139622..2140473 (-) 852 WP_222872810.1 ATPase, T2SS/T4P/T4SS family Machinery gene
  E6W36_RS22235 - 2140391..2140630 (-) 240 WP_222872811.1 ATPase, T2SS/T4P/T4SS family -
  E6W36_RS12600 - 2140627..2141322 (-) 696 WP_222872812.1 hypothetical protein -
  E6W36_RS12605 (E6W36_11915) - 2141898..2142557 (+) 660 WP_222872813.1 hypothetical protein -
  E6W36_RS12610 (E6W36_11920) - 2142520..2143857 (+) 1338 WP_222872814.1 substrate-binding domain-containing protein -
  E6W36_RS12615 (E6W36_11925) - 2144076..2144630 (-) 555 WP_246047693.1 cell wall hydrolase -
  E6W36_RS12620 (E6W36_11930) - 2144882..2145340 (+) 459 WP_222872816.1 hypothetical protein -

Sequence


Protein


Download         Length: 283 a.a.        Molecular weight: 30482.32 Da        Isoelectric Point: 8.1845

>NTDB_id=360205 E6W36_RS12590 WP_222872810.1 2139622..2140473(-) (pilF) [Hankyongella ginsenosidimutans strain W1-2-3]
MRLLDRSNLTLDLATLGFDPPTVKALTDLVHQPHGIVLVTGPTGSGKTTTLYATLSLLNAATRKILTVEDPVEYRLAGIN
QTQVNPQIGLTFAAALRSFLRQDPDVMMVGEIRDLETAQVAVQASLTGHMILSTLHTNTAAGAVTRLIDMGVEPFLIAST
VSAVLAQRLVRRLCPHCRQPQQADHAVLRSLGFNVKEGARITLYQPVGCAACGGSGFRGRIAIHELLRMDEKLSQMVVAQ
AEARDIQRQAVGAGMHTMLQDGLIKAHAGLTTIEEVVRVTREG

Nucleotide


Download         Length: 852 bp        

>NTDB_id=360205 E6W36_RS12590 WP_222872810.1 2139622..2140473(-) (pilF) [Hankyongella ginsenosidimutans strain W1-2-3]
ATGCGCCTGCTCGACCGCTCCAACCTGACCCTCGATCTGGCGACGCTCGGCTTCGACCCGCCGACCGTCAAGGCGCTGAC
TGATCTGGTGCACCAGCCGCACGGCATCGTGCTGGTGACTGGGCCGACCGGCAGCGGTAAGACCACGACGCTCTATGCGA
CGCTCTCGCTGCTCAACGCAGCGACCCGGAAGATCCTCACCGTCGAAGACCCGGTCGAATACCGGCTCGCCGGCATCAAC
CAGACCCAGGTCAATCCGCAGATCGGCCTGACCTTCGCTGCCGCCCTGCGCTCGTTCCTGCGCCAGGACCCGGACGTGAT
GATGGTCGGCGAAATCCGCGACCTCGAAACCGCACAGGTCGCGGTCCAGGCTTCGCTCACCGGCCACATGATCCTCTCGA
CCCTGCACACCAACACTGCGGCCGGCGCCGTCACCCGCCTCATCGACATGGGGGTCGAGCCGTTCCTGATCGCCTCGACG
GTCTCCGCCGTTCTCGCCCAGCGCCTCGTGCGCCGGCTCTGCCCGCACTGCCGCCAGCCGCAGCAGGCGGACCATGCGGT
GCTGCGCTCGCTCGGCTTCAACGTGAAGGAAGGCGCGCGGATCACCCTCTATCAGCCGGTCGGCTGCGCCGCCTGCGGCG
GCAGCGGCTTTCGCGGCCGAATCGCCATCCACGAACTGCTGCGCATGGACGAGAAGCTGTCGCAGATGGTCGTCGCCCAG
GCGGAAGCGCGGGATATCCAGCGACAGGCGGTCGGCGCCGGCATGCACACCATGTTGCAGGACGGGCTGATCAAGGCCCA
TGCGGGTCTCACCACCATCGAAGAGGTGGTCCGCGTGACGCGCGAGGGGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilF Thermus thermophilus HB27

51.408

100

0.516

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

50.89

99.293

0.505

  pilB Vibrio campbellii strain DS40M4

49.104

98.587

0.484

  pilB Legionella pneumophila strain ERS1305867

48.754

99.293

0.484

  pilB Vibrio parahaemolyticus RIMD 2210633

48.571

98.94

0.481

  pilB Vibrio cholerae strain A1552

48.043

99.293

0.477

  pilF Neisseria gonorrhoeae MS11

46.479

100

0.466

  pilB Acinetobacter baumannii D1279779

45.775

100

0.459

  pilB Acinetobacter baylyi ADP1

45.775

100

0.459

  pilB/pilB1 Synechocystis sp. PCC 6803

42.712

100

0.445

  ctsE Campylobacter jejuni subsp. jejuni 81-176

42.705

99.293

0.424

  pilB Haemophilus influenzae 86-028NP

38.662

95.053

0.367

  pilB Haemophilus influenzae Rd KW20

38.29

95.053

0.364

  pilB Glaesserella parasuis strain SC1401

44.934

80.212

0.36


Multiple sequence alignment