Detailed information    

In silico: bioinformatically predicted

Overview


Name: pilB/pilB1
Type: Machinery gene
Locus tag: CLI64_RS20920
Genome accession: NZ_CP023278
Coordinates: 5007326..5009338 (-)
Length: 670 a.a.
NCBI ID: WP_103139014.1
UniProt ID: -
Organism: Nostoc sp. CENA543
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
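
The coordinates and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS assumed to end in exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length(5007326, 5009338)
print(cds_bp, protein_length(cds_bp))  # 2013 670
```

Under those assumptions the 2013 bp span corresponds to 671 codons, that is 670 residues plus the stop codon, which agrees with the stated length.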

Genomic Context


Location: 5002326..5014338
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
CLI64_RS20905 (CLI64_20765)   -   5004153..5004710 (-)   558   WP_103139011.1   hypothetical protein   -
CLI64_RS20910 (CLI64_20770)   -   5004749..5005963 (-)   1215   WP_103139012.1   type II secretion system F family protein   -
CLI64_RS20915 (CLI64_20775)   pilT   5006099..5007223 (-)   1125   WP_103139013.1   type IV pilus twitching motility protein PilT   Machinery gene
CLI64_RS20920 (CLI64_20780)   pilB/pilB1   5007326..5009338 (-)   2013   WP_103139014.1   GspE/PulE family protein   Machinery gene
CLI64_RS31665   -   5009598..5009756 (+)   159   WP_225977404.1   hypothetical protein   -
CLI64_RS20925 (CLI64_20785)   grpE   5009786..5010532 (+)   747   WP_103139015.1   nucleotide exchange factor GrpE   -
CLI64_RS20930 (CLI64_20790)   dnaK   5010784..5012745 (+)   1962   WP_103139016.1   molecular chaperone DnaK   -
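
The Location range appears to be the pilB/pilB1 coordinates extended by 5000 bp on each side (5007326 - 5000 and 5009338 + 5000). That flank size is inferred from the numbers and is not stated by the source. A minimal sketch of the window calculation, with hypothetical names `Feature`, `context_window` and `within`, and the assumption that only fully contained features are listed:

```python
from typing import NamedTuple, Tuple


class Feature(NamedTuple):
    locus_tag: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


def context_window(start: int, end: int, flank: int = 5000) -> Tuple[int, int]:
    """Extend a gene's range by `flank` bp on each side (flank is an inferred value)."""
    return max(1, start - flank), end + flank


def within(feature: Feature, window: Tuple[int, int]) -> bool:
    """True if the feature lies entirely inside the window."""
    lo, hi = window
    return lo <= feature.start and feature.end <= hi


window = context_window(5007326, 5009338)
neighbours = [
    Feature("CLI64_RS20905", 5004153, 5004710),
    Feature("CLI64_RS20930", 5010784, 5012745),
]
for f in neighbours:
    # Size (bp) is the inclusive span of the coordinates.
    print(f.locus_tag, f.end - f.start + 1, within(f, window))
```

The first and last genes in the table both fall inside the computed window, and their inclusive spans match the Size (bp) column.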

Sequence


Protein


Length: 670 a.a.        Molecular weight: 74967.77 Da        Isoelectric Point: 5.1789

>NTDB_id=245712 CLI64_RS20920 WP_103139014.1 5007326..5009338(-) (pilB/pilB1) [Nostoc sp. CENA543]
MTQSSPQRRSTALTTRTEFSPFGSKLVQSGYVNTDQMRQALIESRKSGRPLTDVLESITGRQLSPEFLRQYKKQQLFELK
ILYGVEFLDPEANNVGDTMVGQLIESLIPVDICRRHRLVPLSKNDQQNPPYILVAMVDPDNLEASDDLNRILRPQGLALK
RMVITQEDYQQLINQYLDDLAVRQKHIEQEKFTDINQDLENLSNLNLEDAPEEMEADLGAAMKGAEDAPVINLVNRILAK
ALHEKVSDIHIEPQEENLRIRFRKDGVLREAFDPLPKKIIPAVTARFKIISNLDIAERRLPQDGRIRRMFEGRKVDFRVN
TLPSRYGEKVCLRILDNSSTQLGLDKLITDPETLNIVKDMVSKPFGLILVTGPTGSGKTTSLYSALSEKNSPGINISTVE
DPIEYSLPGITQVQVIREKGLDFATALRAFLRQDPDVLLVGETRDKETAKTAIEAALTGHLVLTTLHTNDAPGAIARLGE
MGIEPFMVSSSLIGVLAQRLVRRVCSDCKIAYTPTPEELARYGMSASQEVNVTFYKANTVPSEAIAEAKNKNQLCPSCNG
VGYKGRCGVYEVMRVTEQLQTLINEEAPTERIKEVAVEEGMKTLLAYSLDLVRQGSTTLEEVERVTFTDTGLEAELKAKR
KSGLTCRSCNAVLQPEWLDCPYCMTPRFED
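
The FASTA header of this record packs several fields into one line. The sketch below splits it with a regular expression; the field layout (ID, locus tag, protein ID, coordinates with strand, gene name, organism) is inferred from this single record, so the pattern and the name `parse_header` are assumptions rather than a documented format.

```python
import re

# Layout inferred from one record; other entries may differ.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognised header layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=245712 CLI64_RS20920 WP_103139014.1 "
    "5007326..5009338(-) (pilB/pilB1) [Nostoc sp. CENA543]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```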

Nucleotide


Length: 2013 bp

>NTDB_id=245712 CLI64_RS20920 WP_103139014.1 5007326..5009338(-) (pilB/pilB1) [Nostoc sp. CENA543]
ATGACTCAATCCTCTCCGCAACGGCGCAGTACCGCTCTCACGACAAGAACGGAGTTTTCGCCTTTTGGTAGTAAATTAGT
ACAGTCTGGTTATGTCAATACTGACCAGATGAGGCAAGCGTTGATTGAAAGTCGCAAGTCTGGTAGACCATTGACAGACG
TGCTAGAGTCAATCACCGGGCGACAATTATCACCTGAGTTTCTTAGACAATACAAGAAACAACAATTATTTGAACTGAAG
ATACTATACGGTGTTGAATTCCTTGATCCGGAAGCCAACAATGTCGGCGATACAATGGTGGGTCAACTGATAGAATCCCT
GATTCCAGTTGATATCTGCCGTCGTCATCGTTTAGTACCCTTATCAAAAAATGATCAACAAAATCCACCTTACATTCTGG
TGGCAATGGTTGATCCAGATAACTTAGAAGCGTCTGATGACCTGAATCGTATCTTGCGCCCCCAAGGTTTAGCACTAAAG
CGCATGGTAATCACCCAGGAGGATTATCAGCAACTGATCAATCAATATCTGGATGATTTGGCTGTCCGACAAAAACACAT
AGAACAAGAAAAATTTACAGATATTAATCAGGATTTAGAAAACCTCAGCAATCTCAACCTGGAAGATGCCCCAGAAGAAA
TGGAGGCGGATTTAGGGGCGGCGATGAAGGGTGCAGAGGATGCGCCAGTAATTAATCTGGTGAACCGCATTCTGGCGAAA
GCTTTGCATGAGAAGGTTTCTGATATCCACATTGAACCCCAAGAAGAGAATTTGCGGATTCGTTTCCGTAAGGACGGGGT
GTTGCGGGAGGCTTTTGATCCTCTACCGAAGAAAATCATTCCGGCAGTCACAGCCCGCTTTAAAATCATCTCCAACTTAG
ATATTGCAGAAAGACGTTTACCTCAAGATGGACGGATTCGCCGGATGTTTGAGGGACGCAAGGTAGACTTTCGGGTAAAT
ACCCTACCCAGTCGCTATGGGGAAAAGGTGTGTCTGCGAATTTTGGATAACTCTTCCACCCAGTTGGGATTGGATAAGTT
AATTACTGATCCAGAAACTCTGAATATTGTCAAAGACATGGTCAGTAAGCCTTTCGGATTAATCTTGGTAACGGGGCCGA
CTGGTTCTGGTAAAACTACTTCGCTGTATTCTGCACTTTCGGAAAAGAACTCTCCTGGTATTAACATCAGTACGGTAGAA
GATCCGATTGAGTACAGTTTGCCAGGGATTACTCAAGTCCAGGTAATTCGGGAGAAAGGACTAGATTTCGCCACTGCATT
ACGGGCGTTTTTGCGACAAGACCCCGATGTGCTGCTGGTGGGGGAAACAAGAGATAAAGAAACAGCCAAAACAGCGATTG
AAGCTGCTTTGACTGGTCACTTAGTATTAACTACTTTACACACAAATGATGCTCCTGGCGCGATCGCTCGTTTGGGAGAA
ATGGGTATTGAACCATTCATGGTGTCAAGTTCTCTAATTGGGGTATTAGCACAACGTCTAGTGCGGCGTGTTTGTTCTGA
TTGTAAGATTGCCTATACTCCCACACCAGAAGAATTAGCCCGTTATGGGATGTCTGCTTCCCAAGAAGTTAACGTCACCT
TCTATAAGGCTAATACTGTACCATCAGAGGCGATCGCGGAAGCTAAAAACAAAAATCAGCTTTGCCCCAGCTGTAACGGT
GTCGGCTACAAAGGCCGTTGTGGTGTTTATGAAGTCATGCGGGTGACTGAACAGCTACAAACTTTAATCAACGAAGAAGC
ACCCACAGAACGCATCAAAGAGGTAGCTGTGGAAGAAGGTATGAAAACCTTGCTGGCTTACAGCTTAGACCTAGTACGCC
AAGGTTCTACCACCTTAGAAGAAGTAGAACGGGTGACATTCACAGACACAGGTTTAGAAGCTGAATTAAAAGCTAAACGC
AAGAGTGGACTGACCTGTCGCAGTTGTAACGCCGTTTTACAACCAGAATGGCTCGATTGTCCCTACTGTATGACACCTCG
GTTTGAAGACTAG
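
Although the gene lies on the minus strand, the nucleotide record is given in coding orientation: it begins with ATG and ends with TAG. A short sketch using the standard codon assignments (which translation table 11 for bacteria shares) shows that the first ten codons and the last five reproduce the two ends of the protein record. The helper name `translate` is illustrative, and the sketch does not handle ambiguous bases.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGACTCAATCCTCTCCGCAACGGCGCAGT"))  # MTQSSPQRRS
print(translate("CGGTTTGAAGACTAG"))                 # RFED
```

The first call uses the opening 30 bases of the record, and the second uses its final 15 bases, which span the last two sequence lines.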


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
pilB/pilB1   Synechocystis sp. PCC 6803   64.243   100   0.646


Multiple sequence alignment