Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYF   Type   Machinery gene
Locus tag   FGK94_RS05625 Genome accession   NZ_LR590471
Coordinates   1227695..1228129 (-) Length   144 a.a.
NCBI ID   WP_037580631.1    Uniprot ID   -
Organism   Streptococcus equi subsp. zooepidemicus strain NCTC11854     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
IScluster/Tn 1225946..1227288 1227695..1228129 flank 407


Gene organization within MGE regions


Location: 1225946..1228129
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FGK94_RS05620 (NCTC11854_01155) comGG 1227355..1227717 (-) 363 WP_012677238.1 competence type IV pilus minor pilin ComGG -
  FGK94_RS05625 (NCTC11854_01156) comYF 1227695..1228129 (-) 435 WP_037580631.1 competence type IV pilus minor pilin ComGF Machinery gene

Sequence


Protein


Download         Length: 144 a.a.        Molecular weight: 16185.72 Da        Isoelectric Point: 10.1988

>NTDB_id=1126877 FGK94_RS05625 WP_037580631.1 1227695..1228129(-) (comYF) [Streptococcus equi subsp. zooepidemicus strain NCTC11854]
MKDSRLKAFTLIECLIALLVISGSLLVYQALTKSLMVSERYLAANDQDNWLLFSQQLRAELSGTTLQGVSNNRLYVEKDK
KTLSFGQVKSHDFRKAAGNGRGYQPMLFSLSSSQITAVGQQVIIKLKWQSGLERTFIYAFQEKG

Nucleotide


Download         Length: 435 bp        

>NTDB_id=1126877 FGK94_RS05625 WP_037580631.1 1227695..1228129(-) (comYF) [Streptococcus equi subsp. zooepidemicus strain NCTC11854]
TTGAAAGACAGTAGGTTAAAAGCTTTCACCTTGATAGAGTGCCTTATTGCCTTGCTTGTCATCTCAGGCTCTTTATTAGT
TTATCAGGCCTTAACCAAGAGCCTTATGGTGAGTGAGAGGTATCTAGCAGCAAATGATCAGGACAACTGGCTTTTGTTTT
CCCAGCAATTGCGAGCAGAGCTTTCAGGTACTACCTTACAGGGTGTCTCCAATAATAGGCTATATGTTGAGAAAGACAAG
AAAACTCTGTCCTTTGGACAGGTCAAAAGTCATGATTTTAGAAAAGCAGCTGGCAATGGTCGAGGCTATCAGCCCATGCT
GTTTAGCTTGTCAAGTAGCCAAATAACAGCAGTAGGTCAGCAGGTTATCATCAAGCTGAAATGGCAAAGCGGGTTAGAAA
GGACCTTTATTTATGCATTTCAAGAGAAGGGTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYF Streptococcus mutans UA140

52.083

100

0.521

  comYF Streptococcus mutans UA159

51.389

100

0.514

  comGF/cglF Streptococcus mitis SK321

49.635

95.139

0.472

  comGF/cglF Streptococcus mitis NCTC 12261

48.905

95.139

0.465

  comGF Lactococcus lactis subsp. cremoris KW2

47.143

97.222

0.458

  comGF/cglF Streptococcus pneumoniae D39

47.445

95.139

0.451

  comGF/cglF Streptococcus pneumoniae R6

47.445

95.139

0.451

  comGF/cglF Streptococcus pneumoniae TIGR4

47.445

95.139

0.451

  comGF/cglF Streptococcus pneumoniae Rx1

47.445

95.139

0.451


Multiple sequence alignment