Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYF   Type   Machinery gene
Locus tag   I2434_RS00775 Genome accession   NZ_CP065058
Coordinates   134071..134505 (+) Length   144 a.a.
NCBI ID   WP_111677651.1    Uniprot ID   -
Organism   Streptococcus equi subsp. zooepidemicus strain SEZ18     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
IScluster/Tn 134912..136253 134071..134505 flank 407


Gene organization within MGE regions


Location: 134071..136253
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I2434_RS00775 (MCPGFBBE_00153) comYF 134071..134505 (+) 435 WP_111677651.1 competence type IV pilus minor pilin ComGF Machinery gene
  I2434_RS00780 (MCPGFBBE_00154) comGG 134483..134845 (+) 363 WP_111677652.1 competence type IV pilus minor pilin ComGG -
  I2434_RS00785 - 134912..136253 (+) 1342 Protein_120 IS3 family transposase -

Sequence


Protein


Download         Length: 144 a.a.        Molecular weight: 16178.78 Da        Isoelectric Point: 10.3341

>NTDB_id=506403 I2434_RS00775 WP_111677651.1 134071..134505(+) (comYF) [Streptococcus equi subsp. zooepidemicus strain SEZ18]
MKDSRLKAFTLIECLIALLVISGSLLVYQALTKSLMVSERYLAANDQDNWLLFSQQLRAELSGTTLQGIRNNRLYVEKDK
KALSFGQVKGHDFRKAAGNGRGYQPMLFGLSSSQITAVGQQVIIKLKWQSGLERTFIYAFQEKG

Nucleotide


Download         Length: 435 bp        

>NTDB_id=506403 I2434_RS00775 WP_111677651.1 134071..134505(+) (comYF) [Streptococcus equi subsp. zooepidemicus strain SEZ18]
TTGAAAGACAGTAGGTTAAAGGCTTTCACCTTGATAGAGTGCCTTATTGCCTTGCTTGTCATCTCAGGCTCTTTATTAGT
TTATCAGGCCTTAACTAAGAGCCTTATGGTGAGTGAGAGGTATCTAGCAGCAAATGATCAGGACAACTGGCTTTTGTTTT
CCCAGCAATTGCGAGCAGAGCTTTCAGGTACTACCCTACAGGGTATCCGCAATAATAGGCTATATGTTGAAAAGGACAAG
AAGGCTTTGTCCTTTGGACAGGTCAAGGGTCATGATTTTAGAAAAGCAGCTGGCAATGGTCGAGGCTATCAGCCCATGCT
GTTTGGCTTGTCAAGTAGCCAAATAACAGCAGTAGGTCAGCAGGTTATCATCAAGCTGAAATGGCAAAGCGGCTTAGAAA
GGACCTTTATTTATGCATTTCAAGAGAAGGGTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYF Streptococcus mutans UA140

52.083

100

0.521

  comYF Streptococcus mutans UA159

51.389

100

0.514

  comGF/cglF Streptococcus mitis SK321

50.365

95.139

0.479

  comGF Lactococcus lactis subsp. cremoris KW2

47.143

97.222

0.458

  comGF/cglF Streptococcus mitis NCTC 12261

48.175

95.139

0.458

  comGF/cglF Streptococcus pneumoniae Rx1

46.715

95.139

0.444

  comGF/cglF Streptococcus pneumoniae D39

46.715

95.139

0.444

  comGF/cglF Streptococcus pneumoniae R6

46.715

95.139

0.444

  comGF/cglF Streptococcus pneumoniae TIGR4

46.715

95.139

0.444