Detailed information    

in silico: bioinformatically predicted

Overview


Name: comFA/cflA
Type: Machinery gene
Locus tag: SNAG_RS09560
Genome accession: NZ_AP017652
Coordinates: 1892249..1893547 (-)
Length: 432 a.a.
NCBI ID: WP_096408718.1
Uniprot ID: A0A1E1GD74
Organism: Streptococcus sp. NPS 308
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake
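
The coordinates and the two reported lengths (1299 bp, 432 a.a.) can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS that includes one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length_bp(1892249, 1893547)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 1299 432
```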

Genomic Context


Location: 1887249..1898547
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SNAG_RS09535 (SNAG_1866) - 1887382..1888560 (-) 1179 WP_096408710.1 acetyl-CoA C-acetyltransferase -
  SNAG_RS09540 (SNAG_1867) - 1888579..1889427 (-) 849 WP_096408713.1 3-hydroxybutyryl-CoA dehydrogenase -
  SNAG_RS09545 (SNAG_1868) - 1889508..1890716 (-) 1209 WP_000201480.1 acyl-CoA dehydrogenase family protein -
  SNAG_RS09550 (SNAG_1869) raiA 1890962..1891510 (-) 549 WP_000599113.1 ribosome-associated translation inhibitor RaiA -
  SNAG_RS09555 (SNAG_1870) comFC/cflB 1891590..1892252 (-) 663 WP_096408716.1 ComF family protein Machinery gene
  SNAG_RS09560 (SNAG_1871) comFA/cflA 1892249..1893547 (-) 1299 WP_096408718.1 DEAD/DEAH box helicase Machinery gene
  SNAG_RS09565 (SNAG_1872) - 1893604..1894239 (+) 636 WP_096408721.1 YigZ family protein -
  SNAG_RS09570 (SNAG_1873) - 1894255..1894698 (+) 444 WP_096408724.1 PH domain-containing protein -
  SNAG_RS09575 (SNAG_1874) cysK 1894795..1895721 (+) 927 WP_096408726.1 cysteine synthase A -
  SNAG_RS09580 (SNAG_1875) tsf 1895811..1896851 (-) 1041 WP_096408729.1 translation elongation factor Ts -
  SNAG_RS09585 (SNAG_1876) rpsB 1896931..1897710 (-) 780 WP_000268475.1 30S ribosomal protein S2 -
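
The numbers in this table suggest two relationships that a short calculation makes explicit: the displayed location appears to be the gene extended by 5000 bp on each side, and comFC/cflB and comFA/cflA share a few bases at their boundary. The sketch below assumes 1-based inclusive coordinates; the 5000 bp flank is inferred from the listed values rather than documented, and the helper names are hypothetical.

```python
def flank_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a feature by `flank` bp on each side (assumed flank size)."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two inclusive intervals."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)


comFC = (1891590, 1892252)
comFA = (1892249, 1893547)

print(flank_window(*comFA))      # (1887249, 1898547)
print(overlap_bp(comFC, comFA))  # 4
```

A 4 bp overlap between adjacent same-strand genes is consistent with the two genes being transcribed together, although the table alone does not establish that.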

Sequence


Protein


Length: 432 a.a.
Molecular weight: 49685.73 Da
Isoelectric point: 8.5121

>NTDB_id=68380 SNAG_RS09560 WP_096408718.1 1892249..1893547(-) (comFA/cflA) [Streptococcus sp. NPS 308]
MKVNPNYLGRLFTEKELTKEERQMAEKLPAIRKEKGKLFCQRCNSSILEEWHLPIGAYYCRECLLMKRVRSDQALYYFPQ
EDFPKQDVLKWRGQLTPFQEKVSEGLLQAVDKQEPILVHAVTGAGKTEMIYQVVAKVIDEGGAVCLASPRIDVCLELYKR
LQNDFACEIALLHGESEPYFRTPLVVATTHQLLKFYHAFDLLIVDEVDAFPYVDNSVLYYAVNQCVKEEGLKIFLTATST
DELDKKVRTGELKRLSLPRRFHGNPLIIPKLVWLSDFNRYIEKSQLSPKLKSYIKKQRRTSYPLLIFASEIKKGEKLKEL
LQEQFPNENIGFVSSITENRLEQVQAFRDGELTILISTTILERGVTFPCVDVFVVEANHRLFTKSSLIQIGGRVGRSMDR
PTGELLFFYDGLNVSIKKAIKEIKQMNKEAGL

Nucleotide


Length: 1299 bp

>NTDB_id=68380 SNAG_RS09560 WP_096408718.1 1892249..1893547(-) (comFA/cflA) [Streptococcus sp. NPS 308]
ATGAAAGTAAATCCAAACTATCTCGGTCGCTTGTTTACTGAGAAAGAATTAACGAAAGAAGAACGTCAGATGGCTGAGAA
ACTGCCGGCAATTAGAAAAGAGAAAGGGAAACTGTTTTGTCAACGTTGTAATAGTAGTATTCTAGAAGAATGGCATTTAC
CTATAGGTGCTTACTATTGTAGGGAGTGTTTATTGATGAAGAGGGTCAGGAGTGATCAAGCTTTATACTATTTTCCGCAG
GAGGATTTTCCTAAGCAAGACGTCCTCAAATGGCGTGGTCAGTTAACACCTTTTCAAGAAAAAGTGTCAGAGGGACTGCT
TCAAGCGGTGGACAAGCAAGAGCCAATCTTGGTTCACGCTGTAACAGGAGCTGGAAAGACAGAGATGATTTACCAGGTTG
TGGCTAAGGTGATTGATGAAGGTGGTGCAGTTTGTTTAGCCAGCCCTCGAATTGATGTGTGTTTGGAGCTGTATAAACGA
CTGCAGAATGACTTTGCTTGTGAGATAGCACTACTTCATGGTGAGTCAGAACCTTATTTTCGAACACCACTAGTTGTTGC
AACGACTCATCAGCTGTTAAAATTTTATCATGCTTTTGACTTGCTGATAGTGGATGAAGTAGATGCCTTTCCTTATGTTG
ACAACTCTGTTCTTTACTATGCTGTAAACCAATGTGTAAAGGAGGAGGGGCTAAAGATATTTCTTACAGCGACCTCTACA
GATGAGTTAGATAAGAAGGTTCGCACTGGAGAATTAAAACGATTGAGCTTGCCAAGACGATTTCATGGAAATCCATTGAT
TATTCCAAAGCTAGTTTGGTTATCAGATTTTAATCGCTATATAGAAAAGAGTCAGTTGTCTCCAAAGTTAAAGTCCTACA
TTAAGAAGCAGAGAAGAACAAGTTATCCGTTGTTAATCTTTGCATCTGAGATTAAGAAAGGCGAGAAACTAAAAGAACTC
TTGCAGGAACAGTTTCCAAATGAAAACATCGGCTTTGTGTCCTCTATCACAGAAAATCGATTAGAGCAGGTACAAGCTTT
TCGAGATGGAGAGTTGACAATCCTTATTAGTACAACAATTTTGGAGCGTGGGGTCACCTTTCCTTGTGTGGATGTTTTTG
TTGTAGAAGCTAATCATCGTCTCTTTACCAAGTCTAGCTTGATTCAGATTGGAGGGCGAGTTGGGCGCAGTATGGATAGA
CCGACTGGTGAACTGCTCTTCTTTTATGATGGATTAAATGTTTCCATTAAAAAAGCAATCAAGGAAATTAAGCAGATGAA
CAAGGAGGCAGGTTTATGA
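
The nucleotide record is given in the coding orientation (it starts with ATG and ends with TGA) even though the gene lies on the minus strand, so it can be translated directly and compared with the protein record. The following is a minimal sketch using the standard codon assignments, which the bacterial table shares for sense and stop codons; the `translate` helper is illustrative and does not handle ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop FASTA line breaks
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the record above.
print(translate("ATGAAAGTAAATCCAAACTATCTCGGTCGC"))  # MKVNPNYLGR
```

Pasting the full 1299 bp sequence into `translate` should reproduce the 432-residue protein listed in the Protein section if the two records agree.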


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source: AlphaFold DB
ID: A0A1E1GD74

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus mitis NCTC 12261 90.278 100 0.903
  comFA/cflA Streptococcus mitis SK321 89.12 100 0.891
  comFA/cflA Streptococcus pneumoniae Rx1 88.889 100 0.889
  comFA/cflA Streptococcus pneumoniae D39 88.889 100 0.889
  comFA/cflA Streptococcus pneumoniae R6 88.889 100 0.889
  comFA/cflA Streptococcus pneumoniae TIGR4 88.657 100 0.887
  comFA Lactococcus lactis subsp. cremoris KW2 51.378 92.361 0.475
  comFA Latilactobacillus sakei subsp. sakei 23K 38.584 100 0.391


Multiple sequence alignment