Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA/cflA   Type   Machinery gene
Locus tag   SSUST3_RS02355 Genome accession   NC_015433
Coordinates   445958..447250 (+) Length   430 a.a.
NCBI ID   WP_004194121.1    Uniprot ID   A0A140EWW0
Organism   Streptococcus suis ST3     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 426855..447250 445958..447250 within 0


Gene organization within MGE regions


Location: 426855..447250
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SSUST3_RS02205 (SSUST3_0431) - 426855..428012 (-) 1158 WP_013730163.1 tyrosine-type recombinase/integrase -
  SSUST3_RS02210 (SSUST3_0432) - 428209..428940 (-) 732 WP_013730164.1 hypothetical protein -
  SSUST3_RS02215 (SSUST3_0433) - 428988..429371 (-) 384 WP_013730165.1 ImmA/IrrE family metallo-endopeptidase -
  SSUST3_RS02220 (SSUST3_0434) - 429378..429926 (-) 549 WP_022540560.1 helix-turn-helix domain-containing protein -
  SSUST3_RS02225 (SSUST3_0435) - 429943..430143 (+) 201 WP_002937001.1 helix-turn-helix domain-containing protein -
  SSUST3_RS02230 (SSUST3_0436) - 430213..430539 (+) 327 WP_013730166.1 hypothetical protein -
  SSUST3_RS02235 (SSUST3_0437) - 430613..430867 (+) 255 WP_013730167.1 hypothetical protein -
  SSUST3_RS02240 (SSUST3_0438) - 430908..431156 (+) 249 WP_013730168.1 hypothetical protein -
  SSUST3_RS10610 - 431146..431298 (+) 153 WP_022540561.1 hypothetical protein -
  SSUST3_RS02245 (SSUST3_0439) - 431303..431644 (+) 342 WP_013730169.1 hypothetical protein -
  SSUST3_RS02250 (SSUST3_0440) - 431644..432123 (+) 480 WP_013730170.1 siphovirus Gp157 family protein -
  SSUST3_RS02255 (SSUST3_0441) - 432120..432365 (+) 246 WP_013730171.1 hypothetical protein -
  SSUST3_RS02260 (SSUST3_0442) - 432346..432810 (+) 465 WP_013730172.1 hypothetical protein -
  SSUST3_RS02265 (SSUST3_0443) - 432779..433951 (+) 1173 WP_013730173.1 DEAD/DEAH box helicase family protein -
  SSUST3_RS02270 (SSUST3_0444) - 433972..434292 (+) 321 WP_013730174.1 hypothetical protein -
  SSUST3_RS02275 (SSUST3_0445) - 434302..435000 (+) 699 WP_013730175.1 ERF family protein -
  SSUST3_RS02280 - 435000..435296 (+) 297 WP_022540564.1 hypothetical protein -
  SSUST3_RS02285 (SSUST3_0446) ssbA 435293..435691 (+) 399 WP_013730176.1 single-stranded DNA-binding protein Machinery gene
  SSUST3_RS02290 (SSUST3_0447) - 435702..436529 (+) 828 WP_013730177.1 bifunctional DNA primase/polymerase -
  SSUST3_RS02295 (SSUST3_0448) - 436513..437892 (+) 1380 WP_079253058.1 virulence-associated E family protein -
  SSUST3_RS10680 (SSUST3_0449) - 438270..438425 (+) 156 WP_013730179.1 DUF7204 family protein -
  SSUST3_RS02300 (SSUST3_0450) - 438422..438730 (+) 309 WP_013730180.1 DUF1372 family protein -
  SSUST3_RS02305 (SSUST3_0451) - 438732..438947 (+) 216 WP_013730181.1 hypothetical protein -
  SSUST3_RS02310 (SSUST3_0452) - 439136..439450 (+) 315 WP_013730182.1 hypothetical protein -
  SSUST3_RS02315 (SSUST3_0453) - 439523..439957 (+) 435 WP_002937915.1 DUF1492 domain-containing protein -
  SSUST3_RS02320 (SSUST3_0454) - 440054..440401 (+) 348 WP_013730183.1 HNH endonuclease -
  SSUST3_RS02325 (SSUST3_0455) - 440606..440851 (+) 246 WP_022540566.1 hypothetical protein -
  SSUST3_RS02330 (SSUST3_0456) - 440848..442113 (+) 1266 WP_013730185.1 phage portal protein -
  SSUST3_RS10685 (SSUST3_0457) - 442106..443326 (+) 1221 WP_013730186.1 hypothetical protein -
  SSUST3_RS02340 - 443331..443564 (+) 234 WP_022540567.1 hypothetical protein -
  SSUST3_RS02345 (SSUST3_0459) - 444265..444471 (-) 207 WP_041179173.1 DUF1492 domain-containing protein -
  SSUST3_RS02350 (SSUST3_0460) - 445269..445901 (-) 633 WP_004194123.1 YigZ family protein -
  SSUST3_RS02355 (SSUST3_0461) comFA/cflA 445958..447250 (+) 1293 WP_004194121.1 DEAD/DEAH box helicase Machinery gene

Sequence


Protein


Download         Length: 430 a.a.        Molecular weight: 48868.45 Da        Isoelectric Point: 9.2762

>NTDB_id=40537 SSUST3_RS02355 WP_004194121.1 445958..447250(+) (comFA/cflA) [Streptococcus suis ST3]
MKELENYYGRLFTKYQLTAKEREIAEKVPSITKKNNCFRCGTTFKEENKLPNDAYYCRACLLLGRVRSDEKLYHFPQKDF
PITKCLKWKGQLTDWQQRISDGLVANVENNRATLVHAVTGAGKTEMIYHTVASVIDKGGAVCLASPRIDVCIELYKRLQN
DFSVPISLLHGESEPYFRTPLVVATTHQLLKFYQAFDLVLIDEVDAFPYADNPMLYQAADNAVKEAGVQVFLTATSTDEL
DKKVRTGKLSRLSLPRRFHGNPLVVPQKVWFSKFDDTLKKNRLVPKLKKAIEEQRKSGFPLLIFVPEISKGQEFTKIMKK
TFPEETIGFVSSQTENRLEIVEGFRKREITVLISTTILERGVTFPCVDVFVVQANHYLYTASSLVQIAGRVGRSIERPTG
LLQFYHEGSTGAIEKAIAEIKQMNKEAGYV

Nucleotide


Download         Length: 1293 bp        

>NTDB_id=40537 SSUST3_RS02355 WP_004194121.1 445958..447250(+) (comFA/cflA) [Streptococcus suis ST3]
ATGAAAGAATTAGAAAATTATTATGGAAGATTATTTACCAAATACCAATTGACAGCAAAAGAAAGAGAAATAGCAGAAAA
AGTGCCAAGTATTACAAAAAAGAATAACTGCTTTCGCTGTGGAACAACTTTTAAAGAAGAAAACAAATTGCCAAACGATG
CTTATTACTGTCGAGCCTGCTTGCTTCTAGGCAGAGTACGGTCAGACGAAAAACTCTATCATTTTCCTCAGAAAGATTTT
CCAATCACTAAGTGTTTAAAGTGGAAAGGTCAACTAACTGATTGGCAACAAAGAATTTCAGATGGACTAGTTGCAAACGT
GGAAAATAATCGTGCGACATTGGTTCATGCAGTAACAGGAGCAGGTAAGACAGAAATGATCTACCACACCGTTGCCTCAG
TGATTGATAAAGGCGGAGCGGTTTGCCTAGCCAGTCCTCGAATTGATGTTTGTATCGAACTCTATAAACGTCTGCAAAAT
GACTTTTCAGTTCCAATTAGTTTACTACATGGAGAGTCTGAACCCTATTTCCGAACCCCATTAGTTGTAGCAACCACACA
TCAGTTATTAAAATTTTATCAGGCCTTTGATTTGGTTTTGATTGATGAAGTAGACGCCTTTCCCTATGCAGATAATCCCA
TGCTCTATCAAGCAGCAGACAATGCGGTCAAGGAAGCCGGTGTTCAAGTTTTTCTGACAGCGACTTCAACAGATGAATTG
GATAAAAAAGTCAGAACAGGTAAATTAAGTCGTCTTAGTTTGCCAAGGCGCTTTCATGGCAACCCACTTGTTGTCCCGCA
AAAAGTCTGGTTTAGTAAATTCGATGATACCCTAAAGAAAAATAGACTAGTCCCAAAGTTGAAAAAAGCGATTGAAGAAC
AGAGAAAGTCGGGCTTTCCCTTACTCATTTTTGTCCCAGAAATCTCCAAAGGTCAAGAATTTACCAAGATAATGAAAAAA
ACATTCCCAGAAGAAACAATTGGCTTTGTATCCAGTCAAACAGAAAATCGCCTTGAAATAGTTGAAGGGTTTCGCAAGAG
AGAAATCACAGTCTTAATCTCGACTACTATTCTTGAACGTGGGGTGACCTTCCCATGTGTAGACGTCTTTGTTGTTCAAG
CTAATCATTACCTCTACACAGCGTCAAGTCTTGTTCAGATTGCAGGCCGGGTCGGAAGGAGTATAGAACGTCCGACTGGT
TTACTTCAGTTTTATCATGAGGGAAGTACAGGAGCCATTGAAAAGGCAATCGCTGAAATTAAACAGATGAACAAGGAGGC
TGGTTATGTCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A140EWW0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus mitis NCTC 12261

67.053

100

0.672

  comFA/cflA Streptococcus pneumoniae Rx1

66.357

100

0.665

  comFA/cflA Streptococcus pneumoniae D39

66.357

100

0.665

  comFA/cflA Streptococcus pneumoniae R6

66.357

100

0.665

  comFA/cflA Streptococcus pneumoniae TIGR4

66.357

100

0.665

  comFA/cflA Streptococcus mitis SK321

65.893

100

0.66

  comFA Lactococcus lactis subsp. cremoris KW2

54.156

92.326

0.5

  comFA Latilactobacillus sakei subsp. sakei 23K

38.051

100

0.381

  comFA Bacillus subtilis subsp. subtilis str. 168

37.59

96.512

0.363


Multiple sequence alignment