Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYF   Type   Machinery gene
Locus tag   SEZ_RS00750 Genome accession   NC_011134
Coordinates   131233..131667 (+) Length   144 a.a.
NCBI ID   WP_037580631.1    Uniprot ID   -
Organism   Streptococcus equi subsp. zooepidemicus MGCS10565     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
IScluster/Tn 132053..133395 131233..131667 flank 386


Gene organization within MGE regions


Location: 131233..133395
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SEZ_RS00750 (Sez_0119) comYF 131233..131667 (+) 435 WP_037580631.1 competence type IV pilus minor pilin ComGF Machinery gene
  SEZ_RS00755 (Sez_0120) comGG 131645..132007 (+) 363 WP_012514785.1 competence type IV pilus minor pilin ComGG -

Sequence


Protein


Download         Length: 144 a.a.        Molecular weight: 16185.72 Da        Isoelectric Point: 10.1988

>NTDB_id=31353 SEZ_RS00750 WP_037580631.1 131233..131667(+) (comYF) [Streptococcus equi subsp. zooepidemicus MGCS10565]
MKDSRLKAFTLIECLIALLVISGSLLVYQALTKSLMVSERYLAANDQDNWLLFSQQLRAELSGTTLQGVSNNRLYVEKDK
KTLSFGQVKSHDFRKAAGNGRGYQPMLFSLSSSQITAVGQQVIIKLKWQSGLERTFIYAFQEKG

Nucleotide


Download         Length: 435 bp        

>NTDB_id=31353 SEZ_RS00750 WP_037580631.1 131233..131667(+) (comYF) [Streptococcus equi subsp. zooepidemicus MGCS10565]
TTGAAAGACAGTAGGTTAAAGGCTTTCACCTTGATAGAGTGCCTTATTGCCTTGCTTGTCATCTCAGGCTCTTTATTAGT
TTATCAGGCCTTAACCAAGAGCCTTATGGTGAGTGAGAGGTATCTAGCAGCAAATGATCAGGACAACTGGCTTTTGTTTT
CCCAGCAATTGCGAGCAGAGCTTTCAGGTACTACCTTACAGGGTGTCTCCAATAATAGGCTATATGTTGAGAAAGACAAG
AAAACTCTGTCCTTTGGACAGGTCAAAAGTCATGATTTTAGAAAAGCAGCTGGCAATGGTCGAGGCTATCAGCCCATGCT
GTTTAGCTTGTCAAGTAGCCAAATAACAGCAGTAGGTCAGCAGGTTATCATCAAGCTGAAATGGCAAAGCGGGTTAGAAA
GGACCTTTATTTATGCATTTCAAGAGAAGGGTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYF Streptococcus mutans UA140

52.083

100

0.521

  comYF Streptococcus mutans UA159

51.389

100

0.514

  comGF/cglF Streptococcus mitis SK321

49.635

95.139

0.472

  comGF/cglF Streptococcus mitis NCTC 12261

48.905

95.139

0.465

  comGF Lactococcus lactis subsp. cremoris KW2

47.143

97.222

0.458

  comGF/cglF Streptococcus pneumoniae D39

47.445

95.139

0.451

  comGF/cglF Streptococcus pneumoniae R6

47.445

95.139

0.451

  comGF/cglF Streptococcus pneumoniae TIGR4

47.445

95.139

0.451

  comGF/cglF Streptococcus pneumoniae Rx1

47.445

95.139

0.451


Multiple sequence alignment