Detailed information    

In silico: bioinformatically predicted

Overview


Name: pilC
Type: Machinery gene
Locus tag: THEOS_RS02815
Genome accession: NC_019386
Coordinates: 518848..520068 (+)
Length: 406 a.a.
NCBI ID: WP_016328828.1
UniProt ID: K7R3Z1
Organism: Thermus oshimai JL-2
Function: assembly of type IV pilus (predicted from homology); DNA binding and uptake
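The coordinates and the protein length listed in this overview can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are hypothetical and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(518848, 520068)   # pilC coordinates from this record
print(cds_bp, protein_length(cds_bp))  # 1221 406
```

Both values agree with the lengths reported in the Sequence section (1221 bp, 406 a.a.).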

Genomic Context


Location: 513848..525068
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
THEOS_RS02785 (Theos_0565) | - | 514923..515546 (-) | 624 | WP_016328824.1 | GNAT family N-acetyltransferase | -
THEOS_RS02790 (Theos_0566) | - | 515568..516638 (+) | 1071 | WP_016328825.1 | PIG-L deacetylase family protein | -
THEOS_RS02795 (Theos_0567) | scpB | 516585..517112 (-) | 528 | WP_016328826.1 | SMC-Scp complex subunit ScpB | -
THEOS_RS02800 (Theos_0568) | - | 517100..517672 (-) | 573 | WP_016328827.1 | L-threonylcarbamoyladenylate synthase | -
THEOS_RS14265 (Theos_0569) | - | 517662..518752 (-) | 1091 | Protein_555 | ISAs1 family transposase | -
THEOS_RS02815 (Theos_0570) | pilC | 518848..520068 (+) | 1221 | WP_016328828.1 | type II secretion system F family protein | Machinery gene
THEOS_RS02820 (Theos_0571) | - | 520043..520495 (-) | 453 | WP_016328829.1 | EVE domain-containing protein | -
THEOS_RS02825 (Theos_0572) | - | 520566..521018 (+) | 453 | WP_016328830.1 | hypothetical protein | -
THEOS_RS02830 (Theos_0573) | - | 521028..522368 (+) | 1341 | WP_016328831.1 | FAD-binding oxidoreductase | -
THEOS_RS02835 (Theos_0574) | - | 522421..523362 (+) | 942 | WP_016328832.1 | GGDEF domain-containing protein | -
THEOS_RS02840 (Theos_0575) | - | 523359..524438 (-) | 1080 | WP_016328833.1 | prephenate dehydrogenase/arogenate dehydrogenase family protein | -
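The Location range appears to be the pilC coordinates extended by 5000 bp on each side (518848 - 5000 = 513848 and 520068 + 5000 = 525068). The record does not state this; it is inferred from the numbers. A minimal sketch under that assumption, with hypothetical function names:

```python
FLANK = 5000  # assumption: inferred from the coordinates, not stated in the record


def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Extend a gene's coordinates by `flank` bp on both sides (clamped at 1)."""
    return max(1, start - flank), end + flank


def within(window: tuple[int, int], gene: tuple[int, int]) -> bool:
    """True if the gene lies entirely inside the window."""
    lo, hi = window
    return lo <= gene[0] and gene[1] <= hi


window = context_window(518848, 520068)  # pilC
# First and last neighbours listed in the Genomic Context table
neighbours = {
    "THEOS_RS02785": (514923, 515546),
    "THEOS_RS02840": (523359, 524438),
}
for tag, coords in neighbours.items():
    print(tag, within(window, coords))
```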

Sequence


Protein


Length: 406 a.a.; Molecular weight: 44916.04 Da; Isoelectric point: 10.0899

>NTDB_id=54354 THEOS_RS02815 WP_016328828.1 518848..520068(+) (pilC) [Thermus oshimai JL-2]
MPVYQYKARDRQGRLVEATIEAEDLRTAARLLRDRGLFVAEIKEPGRGLRAEVRIPALERGPGLKDLAIFSRQLATMLSA
GLTLLQSLSILERQTENKKFREIIKKVRTDVEGGSALSEALSKHKLFSRLYVNLVRAGETSGGMDVILDRLATFLEKELE
LRGKIRSAMTYPTIVFVFAVGVAYFLLTGIVPQFAQILTDLGSELPLLTRFLIALSNLLRVATLPLLLLLVVLYFVYRSY
YRTPQGRRVIDRIKLRMPVFGNLNRKTAIARFARTLALLLQSGVNILESLDITKGTAGNAIVEDLVETAKNKVQQGEPLN
LTLAQNPLVFPPMVSSMVAIGEETGALDTLLSKIADFYEREVDEAVASLTAAIEPLMIIFLGVIVGMIVAGMFLPLFKII
GTLSVQ
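The FASTA header above packs several fields into one line. The sketch below pulls them apart with a regular expression; the pattern is an assumption based only on the headers shown in this record, not a documented format of the database, and the function name is hypothetical.

```python
import re

# Assumed layout: >NTDB_id=<id> <locus_tag> <protein_id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; coordinates become integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not follow the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=54354 THEOS_RS02815 WP_016328828.1 "
          "518848..520068(+) (pilC) [Thermus oshimai JL-2]")
print(parse_header(header))
```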

Nucleotide


Length: 1221 bp

>NTDB_id=54354 THEOS_RS02815 WP_016328828.1 518848..520068(+) (pilC) [Thermus oshimai JL-2]
ATGCCGGTCTACCAGTATAAGGCCCGCGACCGTCAGGGCCGCTTGGTGGAGGCCACCATCGAGGCCGAGGACCTGCGCAC
CGCGGCCCGCCTCCTCCGGGACCGGGGGCTTTTCGTGGCGGAGATCAAGGAGCCGGGGCGGGGCCTCCGGGCGGAGGTGC
GGATCCCCGCCCTGGAGCGGGGGCCTGGGCTCAAGGACCTCGCCATCTTCTCCCGCCAGCTCGCCACCATGCTCTCCGCG
GGGCTCACCCTCCTCCAGTCCCTTTCCATCCTGGAGCGGCAGACGGAGAACAAGAAGTTCCGGGAGATCATCAAAAAGGT
CCGCACGGACGTGGAAGGGGGTAGCGCCCTCTCCGAGGCCCTTTCCAAGCACAAGCTCTTCTCCCGGCTTTACGTGAACC
TGGTGCGGGCGGGGGAGACCTCGGGGGGGATGGACGTCATCCTGGACCGCCTGGCCACCTTCTTGGAGAAGGAGCTGGAG
CTTCGGGGGAAGATCCGGAGCGCCATGACCTACCCCACCATCGTCTTCGTCTTCGCGGTGGGCGTGGCCTACTTCCTCCT
CACGGGGATCGTGCCCCAGTTCGCCCAGATCCTCACCGACCTGGGCTCGGAGCTCCCCCTCCTCACCCGCTTCCTCATCG
CCCTCTCTAACCTCCTCCGGGTGGCCACCCTGCCCCTCCTCCTCCTCCTGGTGGTCCTCTACTTCGTCTACCGCTCCTAC
TACCGCACCCCCCAGGGAAGGCGGGTCATCGACCGCATCAAGCTCCGGATGCCCGTCTTCGGCAACCTGAACCGCAAGAC
GGCCATCGCCCGCTTCGCCCGCACCCTGGCCCTCCTCCTCCAGAGCGGGGTGAACATCCTGGAGTCTTTGGACATCACCA
AGGGCACCGCGGGGAACGCCATCGTGGAGGACCTGGTGGAGACCGCCAAAAACAAGGTCCAGCAGGGGGAGCCCCTGAAC
CTCACCCTGGCCCAGAACCCCCTGGTCTTCCCCCCCATGGTGAGCTCCATGGTGGCCATCGGCGAGGAGACGGGGGCTTT
GGACACCCTCCTTTCCAAGATCGCCGACTTCTACGAGCGGGAGGTGGACGAGGCGGTGGCCAGCCTCACCGCGGCCATCG
AGCCCCTCATGATCATCTTCCTGGGCGTCATCGTGGGCATGATCGTGGCGGGGATGTTCCTGCCCCTCTTCAAGATCATC
GGGACCCTCTCCGTGCAATAA
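The nucleotide record can be checked against the protein record by translating it. The sketch below builds the standard codon table (the bacterial table 11 uses the same codon-to-amino-acid assignments) and translates the first 60 and last 9 nucleotides of the sequence above; the function name is hypothetical, and only unambiguous upper-case A/C/G/T input is assumed.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGCCGGTCTACCAGTATAAGGCCCGCGACCGTCAGGGCCGCTTGGTGGAGGCCACCATC"
cds_end = "GTGCAATAA"
print(translate(cds_start))  # MPVYQYKARDRQGRLVEATI
print(translate(cds_end))    # VQ
```

Both fragments match the start (MPVYQYKARDRQGRLVEATI) and the end (VQ) of the protein sequence listed above.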


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source | ID
AlphaFold DB | K7R3Z1

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
pilC | Thermus thermophilus HB27 | 89.409 | 100 | 0.894
pilC | Legionella pneumophila strain ERS1305867 | 36.658 | 98.768 | 0.362
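The record does not define the Ha-value, but both listed values are consistent with the product of identity and coverage expressed as fractions. The sketch below reproduces them under that assumption; the formula is inferred from these two rows only, and the function name is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100), rounded to 3 places."""
    return round(identity_pct * coverage_pct / 10000, 3)


hits = [
    ("Thermus thermophilus HB27", 89.409, 100.0),
    ("Legionella pneumophila strain ERS1305867", 36.658, 98.768),
]
for organism, identity, coverage in hits:
    print(organism, ha_value(identity, coverage))
```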


Multiple sequence alignment