Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilB/pilB1   Type   Machinery gene
Locus tag   COO91_RS21360 Genome accession   NZ_CP024785
Coordinates   4608349..4610355 (-) Length   668 a.a.
NCBI ID   WP_100900134.1    Uniprot ID   A0A2K8SU69
Organism   Nostoc flagelliforme CCNUN1     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 4603349..4615355
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  COO91_RS21340 (COO91_04968) - 4603868..4604890 (+) 1023 WP_100900130.1 saccharopine dehydrogenase-like oxidoreductase -
  COO91_RS21345 (COO91_04969) - 4605015..4605569 (-) 555 WP_404824134.1 hypothetical protein -
  COO91_RS21350 (COO91_04970) - 4605705..4606928 (-) 1224 WP_100900132.1 type II secretion system F family protein -
  COO91_RS21355 (COO91_04972) pilT 4607111..4608238 (-) 1128 WP_100900133.1 type IV pilus twitching motility protein PilT Machinery gene
  COO91_RS21360 (COO91_04973) pilB/pilB1 4608349..4610355 (-) 2007 WP_100900134.1 GspE/PulE family protein Machinery gene
  COO91_RS54880 (COO91_04975) - 4610884..4611018 (-) 135 WP_263982783.1 hypothetical protein -
  COO91_RS21365 (COO91_04976) grpE 4611254..4612003 (+) 750 WP_100900135.1 nucleotide exchange factor GrpE -
  COO91_RS21370 (COO91_04977) dnaK 4612228..4614186 (+) 1959 WP_100900136.1 molecular chaperone DnaK -

Sequence


Protein


Download         Length: 668 a.a.        Molecular weight: 74563.31 Da        Isoelectric Point: 5.1796

>NTDB_id=255421 COO91_RS21360 WP_100900134.1 4608349..4610355(-) (pilB/pilB1) [Nostoc flagelliforme CCNUN1]
MTYSSPQRRSTALTTRTEFSPFGNKLVQSGYVNTEQMRQALIESRKSGRPLTEVLESITGQQLSPELLRQYKKQQLFELK
ILYGVEFLDPEVNSFGNTEMANLIETLIPVDICRRHRLVPLSKHEDQTPPSVLVAMVAPDNLEASDDLNRILRPQGLALQ
RMVITQEDYQQLINQYLDEMAVKQKHLEQEKFTDINQDLENLGNLDISDAPEEMEADLGAAMKGAEDAPVINLVNRILAK
ALHEGVSDIHIEPQEENLRIRFRKDGVLREAFDPLPKKIIPAVTARFKIISNLDIAERRLPQDGRIRRLFEGRKVDFRVN
TLPSRYGEKVVLRILDNSSTQLGLDKLITDPETLNIVKDMVSRPFGLILVTGPTGSGKTTSLYSALSEKNDPGINISTVE
DPIEYSLPGITQVQVIREKGLDFATALRAFLRQDPDVLLVGETRDKETAKTAIEAALTGHLVLTTLHTNDAPGAIARLGE
MGIEPFMVSSSLIGVLAQRLVRRVCSECRIAYTPTTEELGRYGLSASSDVGVTFYKANSLTLDAIAEAKGKNQFCSKCNG
VGYKGRCGVYEVMRVTENLQTLINEDAPTERIKEVAIEEGMKTLLAYSLDLVRQGSTTLEEVERVTFTDTGLEAELKAKR
KTGLTCRTCEATLKPEWLDCPYCMTSRF

Nucleotide


Download         Length: 2007 bp        

>NTDB_id=255421 COO91_RS21360 WP_100900134.1 4608349..4610355(-) (pilB/pilB1) [Nostoc flagelliforme CCNUN1]
ATGACTTACTCGTCACCACAACGGCGCAGTACCGCTTTAACTACCAGAACAGAGTTTTCGCCCTTCGGCAACAAGCTAGT
GCAATCTGGCTATGTCAATACCGAACAGATGAGGCAGGCACTAATTGAAAGCCGCAAATCTGGCAGACCCTTAACGGAAG
TACTAGAGTCAATCACTGGGCAACAACTATCACCTGAGTTGCTCAGGCAATACAAAAAACAGCAGCTATTTGAACTTAAA
ATACTATACGGTGTTGAATTTCTTGATCCGGAAGTCAATTCCTTTGGCAACACGGAGATGGCGAACCTGATTGAAACCCT
CATCCCAGTGGATATTTGCCGTCGCCACCGTTTAGTACCACTATCGAAACACGAAGACCAAACCCCGCCCTCAGTTTTAG
TGGCGATGGTTGCTCCAGATAATCTAGAGGCTTCTGATGACCTAAATCGCATCTTGCGCCCCCAAGGCTTGGCGTTGCAG
CGCATGGTGATTACCCAGGAAGACTACCAACAGCTAATCAACCAATATCTGGATGAAATGGCTGTTAAGCAAAAGCACCT
GGAACAAGAAAAGTTTACAGATATTAATCAGGATTTAGAAAACCTCGGAAATCTCGATATTTCGGATGCTCCTGAAGAAA
TGGAGGCTGATCTAGGGGCAGCGATGAAGGGTGCAGAGGATGCCCCAGTGATTAACCTAGTTAATAGAATCTTGGCTAAA
GCCTTGCATGAGGGCGTTTCTGATATTCATATCGAACCGCAAGAAGAAAACTTACGCATTCGCTTTCGGAAAGATGGCGT
ACTGCGCGAAGCTTTCGATCCCCTACCGAAAAAAATCATCCCGGCGGTGACAGCCCGATTTAAAATCATCTCCAATCTAG
ACATTGCTGAACGCCGTCTACCCCAAGATGGACGCATCCGGCGGCTGTTTGAGGGACGTAAGGTGGACTTCCGTGTGAAT
ACCTTGCCCAGTCGCTATGGGGAAAAGGTGGTGCTGCGAATTTTGGATAACTCCTCCACCCAATTGGGATTAGATAAGTT
AATTACTGATCCAGAGACTTTGAATATTGTCAAGGATATGGTCAGCCGTCCCTTTGGTCTAATTTTGGTAACTGGGCCAA
CTGGTTCTGGGAAAACAACTTCGCTGTATTCTGCACTCTCAGAAAAAAATGATCCCGGAATTAATATCAGTACTGTAGAA
GACCCAATTGAGTACAGTCTTCCAGGGATTACTCAAGTACAGGTGATTCGGGAAAAAGGGCTGGATTTTGCAACGGCTCT
GCGCGCTTTTCTGCGGCAAGATCCAGATGTGCTGCTGGTGGGTGAAACGCGGGACAAGGAAACGGCAAAAACAGCAATTG
AAGCTGCGTTAACCGGTCACTTGGTATTAACTACCTTACATACCAATGATGCCCCAGGTGCGATCGCTCGTTTGGGAGAA
ATGGGGATTGAGCCTTTCATGGTTTCTAGTTCCCTAATTGGCGTTTTAGCTCAACGTTTGGTGCGGCGTGTATGTTCTGA
ATGTCGTATTGCCTACACTCCCACAACCGAAGAATTGGGTCGTTATGGTCTATCAGCTTCCTCAGATGTCGGAGTTACTT
TCTATAAGGCTAACAGTTTGACATTAGATGCGATCGCAGAAGCTAAAGGCAAAAATCAGTTTTGCTCAAAATGTAATGGG
GTCGGCTACAAAGGGCGTTGTGGTGTTTATGAAGTCATGCGAGTCACCGAAAACCTGCAAACTCTCATCAACGAAGATGC
ACCCACGGAACGCATCAAAGAAGTGGCAATAGAAGAGGGCATGAAAACCTTGCTGGCTTACAGTTTGGACTTAGTGCGTC
AAGGTTCTACCACTCTAGAAGAAGTAGAACGGGTGACGTTTACTGATACTGGTTTAGAAGCCGAGTTAAAGGCCAAACGC
AAGACTGGTCTTACCTGCCGGACTTGCGAGGCCACATTAAAACCAGAATGGCTCGATTGTCCCTACTGTATGACATCTCG
GTTTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A2K8SU69

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilB/pilB1 Synechocystis sp. PCC 6803

66.518

100

0.669

  pilF Thermus thermophilus HB27

39.935

91.467

0.365


Multiple sequence alignment