Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA/cflA   Type   Machinery gene
Locus tag   YB51_RS02395 Genome accession   NC_022516
Coordinates   452502..453794 (+) Length   430 a.a.
NCBI ID   WP_004194121.1    Uniprot ID   A0A140EWW0
Organism   Streptococcus suis YB51     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 433399..453794 452502..453794 within 0


Gene organization within MGE regions


Location: 433399..453794
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  YB51_RS02245 (YB51_2120) - 433399..434556 (-) 1158 WP_013730163.1 tyrosine-type recombinase/integrase -
  YB51_RS02250 (YB51_2125) - 434753..435484 (-) 732 WP_013730164.1 hypothetical protein -
  YB51_RS02255 (YB51_2130) - 435532..435915 (-) 384 WP_013730165.1 ImmA/IrrE family metallo-endopeptidase -
  YB51_RS02260 (YB51_2135) - 435922..436470 (-) 549 WP_022540560.1 helix-turn-helix domain-containing protein -
  YB51_RS02265 (YB51_2140) - 436487..436687 (+) 201 WP_002937001.1 helix-turn-helix domain-containing protein -
  YB51_RS02270 (YB51_2145) - 436757..437083 (+) 327 WP_013730166.1 hypothetical protein -
  YB51_RS02275 - 437157..437411 (+) 255 WP_013730167.1 hypothetical protein -
  YB51_RS02280 (YB51_2150) - 437452..437700 (+) 249 WP_013730168.1 hypothetical protein -
  YB51_RS10650 (YB51_2155) - 437690..437842 (+) 153 WP_022540561.1 hypothetical protein -
  YB51_RS02285 (YB51_2160) - 437847..438188 (+) 342 WP_013730169.1 hypothetical protein -
  YB51_RS02290 (YB51_2165) - 438188..438667 (+) 480 WP_013730170.1 siphovirus Gp157 family protein -
  YB51_RS02295 (YB51_2170) - 438664..438909 (+) 246 WP_013730171.1 hypothetical protein -
  YB51_RS02300 (YB51_2175) - 438890..439354 (+) 465 WP_013730172.1 hypothetical protein -
  YB51_RS02305 (YB51_2180) - 439323..440495 (+) 1173 WP_013730173.1 DEAD/DEAH box helicase family protein -
  YB51_RS02310 (YB51_2185) - 440570..440836 (+) 267 WP_226960629.1 hypothetical protein -
  YB51_RS02315 (YB51_2190) - 440846..441544 (+) 699 WP_013730175.1 ERF family protein -
  YB51_RS02320 (YB51_2195) - 441544..441840 (+) 297 WP_022540564.1 hypothetical protein -
  YB51_RS02325 (YB51_2200) ssbA 441837..442235 (+) 399 WP_022540565.1 single-stranded DNA-binding protein Machinery gene
  YB51_RS02330 (YB51_2205) - 442246..443073 (+) 828 WP_013730177.1 bifunctional DNA primase/polymerase -
  YB51_RS02335 (YB51_2210) - 443057..444436 (+) 1380 WP_079253058.1 virulence-associated E family protein -
  YB51_RS10655 (YB51_2215) - 444814..444969 (+) 156 WP_013730179.1 hypothetical protein -
  YB51_RS02340 (YB51_2220) - 444966..445274 (+) 309 WP_013730180.1 DUF1372 family protein -
  YB51_RS02345 (YB51_2225) - 445276..445491 (+) 216 WP_013730181.1 hypothetical protein -
  YB51_RS02350 (YB51_2230) - 445680..445994 (+) 315 WP_013730182.1 hypothetical protein -
  YB51_RS02355 (YB51_2235) - 446067..446501 (+) 435 WP_002937915.1 DUF1492 domain-containing protein -
  YB51_RS02360 (YB51_2240) - 446598..446945 (+) 348 WP_013730183.1 HNH endonuclease -
  YB51_RS02365 (YB51_2245) - 447150..447395 (+) 246 WP_022540566.1 hypothetical protein -
  YB51_RS02370 (YB51_2250) - 447392..448657 (+) 1266 WP_013730185.1 phage portal protein -
  YB51_RS10660 (YB51_2255) - 448650..449870 (+) 1221 WP_013730186.1 hypothetical protein -
  YB51_RS02380 (YB51_2260) - 449875..450108 (+) 234 WP_022540567.1 hypothetical protein -
  YB51_RS02385 (YB51_2270) - 450809..451033 (-) 225 Protein_430 DUF1492 domain-containing protein -
  YB51_RS02390 (YB51_2275) - 451813..452445 (-) 633 WP_004194123.1 YigZ family protein -
  YB51_RS02395 (YB51_2280) comFA/cflA 452502..453794 (+) 1293 WP_004194121.1 DEAD/DEAH box helicase Machinery gene

Sequence


Protein


Download         Length: 430 a.a.        Molecular weight: 48868.45 Da        Isoelectric Point: 9.2762

>NTDB_id=62441 YB51_RS02395 WP_004194121.1 452502..453794(+) (comFA/cflA) [Streptococcus suis YB51]
MKELENYYGRLFTKYQLTAKEREIAEKVPSITKKNNCFRCGTTFKEENKLPNDAYYCRACLLLGRVRSDEKLYHFPQKDF
PITKCLKWKGQLTDWQQRISDGLVANVENNRATLVHAVTGAGKTEMIYHTVASVIDKGGAVCLASPRIDVCIELYKRLQN
DFSVPISLLHGESEPYFRTPLVVATTHQLLKFYQAFDLVLIDEVDAFPYADNPMLYQAADNAVKEAGVQVFLTATSTDEL
DKKVRTGKLSRLSLPRRFHGNPLVVPQKVWFSKFDDTLKKNRLVPKLKKAIEEQRKSGFPLLIFVPEISKGQEFTKIMKK
TFPEETIGFVSSQTENRLEIVEGFRKREITVLISTTILERGVTFPCVDVFVVQANHYLYTASSLVQIAGRVGRSIERPTG
LLQFYHEGSTGAIEKAIAEIKQMNKEAGYV

Nucleotide


Download         Length: 1293 bp        

>NTDB_id=62441 YB51_RS02395 WP_004194121.1 452502..453794(+) (comFA/cflA) [Streptococcus suis YB51]
ATGAAAGAATTAGAAAATTATTATGGAAGATTATTTACCAAATACCAATTGACAGCAAAAGAAAGAGAAATAGCAGAAAA
AGTGCCAAGTATTACAAAAAAGAATAACTGCTTTCGCTGTGGAACAACTTTTAAAGAAGAAAACAAATTGCCAAACGATG
CTTATTACTGTCGAGCCTGCTTGCTTCTAGGCAGAGTACGGTCAGACGAAAAACTCTATCATTTTCCTCAGAAAGATTTT
CCAATCACTAAGTGTTTAAAGTGGAAAGGTCAACTAACTGATTGGCAACAAAGAATTTCAGATGGACTAGTTGCAAACGT
GGAAAATAATCGTGCGACATTGGTTCATGCAGTAACAGGAGCAGGTAAGACAGAAATGATCTACCACACCGTTGCCTCAG
TGATTGATAAAGGCGGAGCGGTTTGCCTAGCCAGTCCTCGAATTGATGTTTGTATCGAACTCTATAAACGTCTGCAAAAT
GACTTTTCAGTTCCAATTAGTTTACTACATGGAGAGTCTGAACCCTATTTCCGAACCCCATTAGTTGTAGCAACCACACA
TCAGTTATTAAAATTTTATCAGGCCTTTGATTTGGTTTTGATTGATGAAGTAGACGCCTTTCCCTATGCAGATAATCCCA
TGCTCTATCAAGCAGCAGACAATGCGGTCAAGGAAGCCGGTGTTCAAGTTTTTCTGACAGCGACTTCAACAGATGAATTG
GATAAAAAAGTCAGAACAGGTAAATTAAGTCGTCTTAGTTTGCCAAGGCGCTTTCATGGCAACCCACTTGTTGTCCCGCA
AAAAGTCTGGTTTAGTAAATTCGATGATACCCTAAAGAAAAATAGACTAGTCCCAAAGTTGAAAAAAGCGATTGAAGAAC
AGAGAAAGTCGGGCTTTCCCTTACTCATTTTTGTCCCAGAAATCTCCAAAGGTCAAGAATTTACCAAGATAATGAAAAAA
ACATTCCCAGAAGAAACAATTGGCTTTGTATCCAGTCAAACAGAAAATCGCCTTGAAATAGTTGAAGGGTTTCGCAAGAG
AGAAATCACAGTCTTAATCTCGACTACTATTCTTGAACGTGGGGTGACCTTCCCATGTGTAGACGTCTTTGTTGTTCAAG
CTAATCATTACCTCTACACAGCGTCAAGTCTTGTTCAGATTGCAGGCCGGGTCGGAAGGAGTATAGAACGTCCGACTGGT
TTACTTCAGTTTTATCATGAGGGAAGTACAGGAGCCATTGAAAAGGCAATCGCTGAAATTAAACAGATGAACAAGGAGGC
TGGTTATGTCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A140EWW0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus mitis NCTC 12261

67.053

100

0.672

  comFA/cflA Streptococcus pneumoniae Rx1

66.357

100

0.665

  comFA/cflA Streptococcus pneumoniae D39

66.357

100

0.665

  comFA/cflA Streptococcus pneumoniae R6

66.357

100

0.665

  comFA/cflA Streptococcus pneumoniae TIGR4

66.357

100

0.665

  comFA/cflA Streptococcus mitis SK321

65.893

100

0.66

  comFA Lactococcus lactis subsp. cremoris KW2

54.156

92.326

0.5

  comFA Latilactobacillus sakei subsp. sakei 23K

38.051

100

0.381

  comFA Bacillus subtilis subsp. subtilis str. 168

37.59

96.512

0.363


Multiple sequence alignment