Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comFA/cflA
Type   Machinery gene
Locus tag   H1W81_RS08670
Genome accession   NZ_LR824002
Coordinates   1670532..1671851 (-)
Length   439 a.a.
NCBI ID   WP_011225516.1
UniProt ID   Q5M5T4
Organism   Streptococcus thermophilus isolate STH_CIRM_67
Function   ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
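
The coordinates and the protein length in the overview can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def span_length_bp(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate span."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residue count for a CDS assumed to end in one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length_bp(1670532, 1671851)
print(cds_bp, protein_length_aa(cds_bp))  # 1320 439
```

Under these assumptions the span gives 1320 bp and 439 residues, matching the lengths reported for the nucleotide and protein sequences.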

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
Genomic island   1661142..1670551   1670532..1671851   flank   -19
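
The negative distance in the summary row is consistent with the gene start lying inside the end of the genomic island. The sketch below reproduces the value under one assumed convention (signed separation between the two intervals, negative when they overlap). This convention is inferred from the single row shown and may not be how the database or VRprofile2 defines the column; the function names are hypothetical.

```python
def signed_distance(mge, gene):
    """Signed separation of two inclusive intervals (assumed convention).

    Positive: gap between the intervals. Negative: the intervals overlap.
    """
    mge_start, mge_end = mge
    gene_start, gene_end = gene
    return max(gene_start - mge_end, mge_start - gene_end)


def overlap_bp(mge, gene):
    """Number of positions shared by two inclusive intervals."""
    return max(0, min(mge[1], gene[1]) - max(mge[0], gene[0]) + 1)


island = (1661142, 1670551)
comfa = (1670532, 1671851)
print(signed_distance(island, comfa))  # -19
print(overlap_bp(island, comfa))       # 20
```

With inclusive coordinates, the two intervals share 20 positions (1670532..1670551), which corresponds to the reported distance of -19 under this convention.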


Gene organization within MGE regions


Location: 1661142..1671851
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  H1W81_RS08595 (STHERMO_1935)   -   1661142..1661675 (+)   534   WP_002949762.1   DUF402 domain-containing protein   -
  H1W81_RS08600 (STHERMO_1936)   -   1661705..1662364 (-)   660   WP_011225520.1   DUF1803 domain-containing protein   -
  H1W81_RS08605 (STHERMO_1937)   -   1662418..1663350 (-)   933   WP_011225519.1   manganese-dependent inorganic pyrophosphatase   -
  H1W81_RS08660 (STHERMO_1940)   hpf   1669262..1669810 (-)   549   WP_011225518.1   ribosome hibernation-promoting factor, HPF/YfiA family   -
  H1W81_RS08665 (STHERMO_1941)   -   1669889..1670551 (-)   663   WP_002949747.1   ComF family protein   -
  H1W81_RS08670 (STHERMO_1942)   comFA/cflA   1670532..1671851 (-)   1320   WP_011225516.1   DEAD/DEAH box helicase   Machinery gene

Sequence


Protein


Length: 439 a.a.        Molecular weight: 50628.72 Da        Isoelectric Point: 9.5663

>NTDB_id=1132356 H1W81_RS08670 WP_011225516.1 1670532..1671851(-) (comFA/cflA) [Streptococcus thermophilus isolate STH_CIRM_67]
MIPKEYYGRLFTKEQLPVDYLSEAVKLESMIKVDKKLRCKRCYSRIEEDWQLPNGQYYCRACIVFGRNQEGKELYYFPSE
KSEVDFPVLKWSGKLTPYQNEVSEKLLKTYKNQKHSLVHAVTGAGKTEMIYNIVAYVLENKNRVVIASPRVDVCRELFLR
MQKDFTCSISLLHADSEPYDGSPLVIATTHQLLKFYHSFDLIIVDEVDAFPFVGNVMLNHAVKQAKTETGRYIYLTATST
LALEEQVRLGAIEKHHLASRFHGNPLVLPRFFWQGRLQKSLTSEKLPRPLIHQIKKQRKSNFPLLIFFPNIALGEKFSIT
LKKYLPTENIAFVSSKSEERSTIVEKFRKKELSILVTTTILERGVTFPQVDVFVCMANHYLYTSSSLIQIGGRVGRSPER
PTGKLYFFHEGLSKSMLQCREEINAMNKKGGFENEVSTM

Nucleotide


Length: 1320 bp

>NTDB_id=1132356 H1W81_RS08670 WP_011225516.1 1670532..1671851(-) (comFA/cflA) [Streptococcus thermophilus isolate STH_CIRM_67]
ATGATACCTAAAGAATATTATGGACGACTATTTACGAAAGAACAGTTACCAGTGGATTATCTCTCAGAGGCTGTAAAATT
AGAAAGTATGATAAAGGTTGATAAAAAACTTAGATGTAAAAGATGTTATAGTCGAATAGAGGAAGATTGGCAATTACCGA
ATGGTCAGTATTATTGTAGAGCGTGTATTGTCTTTGGTCGAAACCAAGAAGGAAAAGAACTCTATTACTTTCCCTCAGAA
AAATCAGAAGTAGATTTTCCTGTCTTGAAATGGTCAGGAAAACTGACTCCTTATCAAAATGAGGTCTCGGAAAAGCTTTT
AAAGACTTATAAAAATCAAAAACACAGTCTTGTTCATGCAGTGACTGGTGCTGGCAAGACAGAGATGATTTATAATATTG
TTGCCTATGTTCTTGAAAATAAAAATCGTGTCGTCATCGCAAGTCCCCGAGTTGATGTTTGTCGAGAATTGTTTCTACGC
ATGCAGAAAGATTTTACTTGTAGTATTTCTCTGCTTCATGCTGATAGTGAACCATATGATGGTAGTCCGCTCGTTATAGC
TACCACTCATCAATTACTAAAATTTTATCATAGCTTTGACTTGATTATTGTTGACGAAGTTGATGCCTTTCCATTTGTAG
GGAATGTCATGTTAAATCATGCTGTTAAACAGGCAAAGACGGAAACAGGCCGGTATATTTACTTAACAGCAACTTCTACA
TTAGCTTTAGAAGAGCAAGTGCGCCTTGGAGCTATAGAAAAGCATCACCTTGCTAGTCGTTTCCACGGAAATCCTTTAGT
CCTTCCTCGTTTCTTTTGGCAAGGAAGGTTACAAAAGTCGTTGACGAGCGAGAAGCTTCCAAGGCCTCTAATTCACCAGA
TTAAGAAGCAGCGTAAATCAAATTTTCCTCTATTAATCTTTTTCCCCAATATAGCATTAGGTGAAAAGTTTAGTATTACC
CTAAAAAAATATCTCCCTACTGAAAACATAGCCTTTGTTTCATCAAAAAGCGAGGAGCGTTCAACCATCGTAGAGAAATT
CCGAAAAAAAGAATTGTCAATCTTAGTGACGACAACTATTCTCGAACGTGGTGTTACCTTTCCACAAGTAGATGTTTTTG
TTTGTATGGCAAATCATTACTTATATACTAGTTCGAGTCTTATTCAGATTGGTGGTAGGGTGGGGCGTTCGCCCGAGAGA
CCTACAGGGAAACTCTATTTCTTTCATGAAGGATTATCTAAATCAATGTTGCAATGTCGGGAAGAAATAAATGCAATGAA
TAAAAAAGGAGGGTTTGAAAATGAAGTGTCTACTATGTAA
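
Although the gene lies on the minus strand, the nucleotide record above is listed in coding orientation (it begins with ATG and ends with TAA), so it can be translated directly. The sketch below translates the first and last few codons with the standard genetic code, whose amino acid assignments are the same as in the bacterial code. The `translate` helper is written for illustration and is not a tool used by the database.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = dna.upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGATACCTAAAGAATATTATGGACGA"))  # MIPKEYYGR
print(translate("GTGTCTACTATGTAA"))              # VSTM
```

The two fragments are the first 27 and last 15 nucleotides of the record, and their translations match the start (MIPKEYYGR) and end (VSTM) of the protein sequence listed above.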


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q5M5T4

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comFA/cflA   Streptococcus pneumoniae Rx1   54.988   98.178   0.54
  comFA/cflA   Streptococcus pneumoniae D39   54.988   98.178   0.54
  comFA/cflA   Streptococcus pneumoniae R6   54.988   98.178   0.54
  comFA/cflA   Streptococcus pneumoniae TIGR4   54.988   98.178   0.54
  comFA/cflA   Streptococcus mitis NCTC 12261   54.651   97.95   0.535
  comFA/cflA   Streptococcus mitis SK321   53.721   97.95   0.526
  comFA   Lactococcus lactis subsp. cremoris KW2   45.714   95.672   0.437
  comFA   Latilactobacillus sakei subsp. sakei 23K   37.011   99.089   0.367
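
The Ha-values listed for the similar proteins are numerically consistent with the product of the identity and coverage fractions, rounded to three decimals. The sketch below checks this for each distinct row. The formula is inferred from the listed numbers rather than taken from the database documentation, so treat it as an assumption; `ha_value` is a hypothetical helper name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction times coverage fraction."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# (identities %, coverage %, Ha-value as listed)
rows = [
    (54.988, 98.178, 0.54),
    (54.651, 97.95, 0.535),
    (53.721, 97.95, 0.526),
    (45.714, 95.672, 0.437),
    (37.011, 99.089, 0.367),
]

for identity, coverage, listed in rows:
    computed = ha_value(identity, coverage)
    print(identity, coverage, computed, abs(computed - listed) < 1e-9)
```

Each computed value agrees with the listed one, which suggests the score rewards hits that are both similar and cover most of the query.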


Multiple sequence alignment