Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilB   Type   Machinery gene
Locus tag   SCE1572_RS28255 Genome accession   NC_021658
Coordinates   7801159..7802859 (+) Length   566 a.a.
NCBI ID   WP_020737369.1    Uniprot ID   S4XHF8
Organism   Sorangium cellulosum So0157-2     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 7796159..7807859
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SCE1572_RS28240 (SCE1572_27255) - 7798170..7798466 (+) 297 WP_020737366.1 hypothetical protein -
  SCE1572_RS28245 (SCE1572_27260) - 7798481..7799830 (-) 1350 WP_020737367.1 serine/threonine-protein kinase -
  SCE1572_RS28250 (SCE1572_27265) - 7799827..7801053 (-) 1227 WP_020737368.1 FHA domain-containing protein -
  SCE1572_RS28255 (SCE1572_27270) pilB 7801159..7802859 (+) 1701 WP_020737369.1 type IV-A pilus assembly ATPase PilB Machinery gene
  SCE1572_RS28260 (SCE1572_27275) pilT 7802864..7804003 (+) 1140 WP_020737370.1 type IV pilus twitching motility protein PilT Machinery gene
  SCE1572_RS28265 (SCE1572_27280) lexA 7804207..7804899 (+) 693 WP_020737371.1 transcriptional repressor LexA -
  SCE1572_RS28270 (SCE1572_27285) - 7805078..7805353 (+) 276 WP_236643994.1 DUF4398 domain-containing protein -
  SCE1572_RS28275 (SCE1572_27290) - 7805350..7806513 (+) 1164 WP_020737373.1 OmpA family protein -

Sequence


Protein


Download         Length: 566 a.a.        Molecular weight: 62245.89 Da        Isoelectric Point: 5.6082

>NTDB_id=59400 SCE1572_RS28255 WP_020737369.1 7801159..7802859(+) (pilB) [Sorangium cellulosum So0157-2]
MSTNHRLGELLVREKLISLQQLRQAQEEQRKTGQNLGYALAKLGYISDGEITSFLSTQYRVPAVALDEYEIDAEVSRLVS
REVCEKHKIIPISRSGTALVVAMADPTNLHAIDDIKFLTGFNVEPVVASETGITEAIERAYNVGPSYDEVLSEFGEEEVG
FQVEADDVNVLELEKAAEGAPVVRLVNAILLNAIKKGASDIHVEPYEKKLRVRYRIDGVLMEEMQPPIKLKNAIASRLKI
MSSLDIAERRLPQDGRIKLKMGRGREMDFRVSVLPTIWGEKIVLRLLDKSNLQLDMAKLGFDPKPLADFKWAIGQPWGMV
LVTGPTGSGKTTTLYSALSDLNQIGSNISTAEDPVEYNLHGINQVQMHDEIGLNFAMSLRSFLRQDPDIIMVGEIRDFET
AEIAVKAALTGHLVLSTLHTNDAPSTISRLLNMGVEPFLITASVNLVLAQRLARKICPDCRVPLRVDPKVLLDFGFTEQQ
VARADLVRGAGCKTCNGSGYKGRVALYEVMRFTDALKEMVLQGASTAELKAAAIKGGMLTLRMSGIEKVLAGVTTTEEVG
RVTMGD

Nucleotide


Download         Length: 1701 bp        

>NTDB_id=59400 SCE1572_RS28255 WP_020737369.1 7801159..7802859(+) (pilB) [Sorangium cellulosum So0157-2]
ATGTCGACCAACCACCGCCTCGGCGAACTGCTCGTCCGCGAAAAGCTCATCAGCCTCCAGCAGCTCCGGCAAGCGCAGGA
GGAGCAGCGCAAGACGGGGCAGAACCTCGGCTATGCCCTCGCCAAGCTCGGGTACATCTCCGACGGCGAGATCACGAGCT
TCCTCTCGACGCAGTACCGCGTCCCGGCCGTCGCCCTCGACGAGTACGAGATCGACGCGGAGGTCTCGCGCCTCGTGTCA
CGGGAGGTCTGCGAAAAGCACAAGATCATCCCGATCTCCCGGTCGGGGACGGCGCTCGTCGTCGCGATGGCCGACCCGAC
GAACCTGCACGCGATCGACGACATCAAGTTCCTCACCGGCTTCAACGTCGAGCCCGTCGTCGCGTCCGAGACGGGCATCA
CCGAGGCGATCGAGCGCGCGTACAACGTCGGGCCGTCCTACGACGAGGTGCTGAGCGAGTTCGGGGAGGAGGAGGTCGGC
TTCCAGGTCGAGGCCGACGACGTGAACGTCCTCGAGCTGGAGAAGGCCGCCGAGGGCGCGCCCGTGGTCCGGCTCGTCAA
CGCGATCCTCCTGAACGCCATCAAGAAGGGCGCGAGCGACATCCACGTCGAGCCGTACGAGAAGAAGCTCCGCGTGCGCT
ACCGCATCGACGGCGTGCTGATGGAGGAGATGCAGCCGCCGATCAAGCTGAAGAACGCGATCGCGAGCCGCCTCAAGATC
ATGAGCTCGCTCGACATCGCCGAGCGGCGGCTCCCGCAGGACGGCCGCATCAAGCTGAAGATGGGCAGGGGCCGGGAGAT
GGACTTCCGCGTCTCCGTGCTCCCGACGATCTGGGGCGAGAAGATCGTCCTCCGCCTCCTCGACAAGTCGAACCTGCAGC
TCGACATGGCGAAGCTCGGCTTCGACCCGAAGCCGCTCGCGGACTTCAAGTGGGCGATCGGCCAGCCGTGGGGCATGGTC
CTCGTCACCGGCCCGACCGGCTCCGGCAAGACGACGACGCTCTACTCGGCGCTCTCGGACCTCAACCAGATCGGCTCGAA
CATCAGCACCGCCGAGGATCCGGTCGAGTACAACCTGCACGGCATCAACCAGGTGCAGATGCACGACGAGATCGGGCTGA
ACTTCGCGATGTCGCTGCGCTCGTTCCTCCGGCAGGACCCGGACATCATCATGGTCGGCGAGATCCGCGACTTCGAGACC
GCCGAGATCGCCGTCAAGGCGGCGCTCACCGGCCACCTCGTGCTCTCGACGCTGCACACGAACGACGCGCCGTCGACGAT
CTCGCGCCTCCTCAACATGGGCGTCGAGCCGTTCCTCATCACCGCCAGCGTGAACCTCGTGCTCGCCCAGCGCCTCGCGC
GCAAGATCTGCCCCGACTGCCGCGTCCCGCTGCGCGTGGACCCGAAGGTGCTGCTCGACTTCGGCTTCACCGAGCAGCAG
GTCGCGCGCGCGGACCTCGTCCGCGGCGCGGGCTGCAAGACGTGCAACGGCTCCGGCTACAAGGGCCGCGTCGCGCTCTA
CGAGGTCATGCGCTTCACCGACGCGCTGAAGGAGATGGTCCTCCAGGGCGCGTCCACGGCCGAGCTCAAGGCCGCGGCCA
TCAAGGGCGGGATGCTGACGCTCCGGATGAGCGGGATCGAGAAGGTGCTGGCGGGCGTCACGACGACCGAGGAGGTCGGC
CGCGTGACGATGGGGGACTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB S4XHF8

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilB Vibrio parahaemolyticus RIMD 2210633

48.057

100

0.481

  pilB Vibrio campbellii strain DS40M4

47.257

99.823

0.472

  pilB Vibrio cholerae strain A1552

46.996

100

0.47

  pilB Acinetobacter baumannii D1279779

46.774

98.587

0.461

  pilB Acinetobacter baylyi ADP1

45.341

98.587

0.447

  pilB Legionella pneumophila strain ERS1305867

44.779

99.823

0.447

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

42.132

100

0.44

  pilB/pilB1 Synechocystis sp. PCC 6803

39.806

100

0.435

  pilF Thermus thermophilus HB27

42.832

98.587

0.422

  pilF Neisseria gonorrhoeae MS11

41.772

97.703

0.408


Multiple sequence alignment