Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA/cflA   Type   Machinery gene
Locus tag   H0504_RS07155 Genome accession   NZ_LR822011
Coordinates   1390869..1392188 (-) Length   439 a.a.
NCBI ID   WP_179971743.1    Uniprot ID   -
Organism   Streptococcus thermophilus isolate STH_CIRM_23     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1380630..1390888 1390869..1392188 flank -19


Gene organization within MGE regions


Location: 1380630..1392188
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  H0504_RS07075 (STHERMO_1591) recX 1380630..1381406 (+) 777 WP_011680776.1 recombination regulator RecX -
  H0504_RS07080 (STHERMO_1592) - 1381477..1382010 (+) 534 WP_002949762.1 DUF402 domain-containing protein -
  H0504_RS07085 (STHERMO_1593) - 1382040..1382699 (-) 660 WP_011225520.1 DUF1803 domain-containing protein -
  H0504_RS07090 (STHERMO_1594) - 1382753..1383685 (-) 933 WP_011225519.1 manganese-dependent inorganic pyrophosphatase -
  H0504_RS07145 (STHERMO_1595) hpf 1389599..1390147 (-) 549 WP_022096635.1 ribosome hibernation-promoting factor, HPF/YfiA family -
  H0504_RS07150 (STHERMO_1596) - 1390226..1390888 (-) 663 WP_002949747.1 ComF family protein -
  H0504_RS07155 (STHERMO_1597) comFA/cflA 1390869..1392188 (-) 1320 WP_179971743.1 DEAD/DEAH box helicase Machinery gene

Sequence


Protein


Download         Length: 439 a.a.        Molecular weight: 50588.65 Da        Isoelectric Point: 9.5827

>NTDB_id=1130680 H0504_RS07155 WP_179971743.1 1390869..1392188(-) (comFA/cflA) [Streptococcus thermophilus isolate STH_CIRM_23]
MIPKEYYGRLFTKEQLPVDYLSEAVKLESMIKVDKKLRCKRCYSRIEEDWQLPNGQYYCRACIVFGRNQEGKELYYFPSE
KSEVDFPVLKWSGKLTPYQNEVSEKLLKTYKNQKHSLVHAVTGAGKTEMIYNIVAYVLENKNRVVIASPRVDVCRELFLR
MQKDFTCSISLLHADSEPYDGSPLVIATTHQLLKFYHSFDLIIVDEVDAFPFVGNVMLNHAVKQAKTETGRYIYLTATST
LALEEQVRLGAIEKHHLASRFHGNPLVLPRFFWQGRLQKSLTSDKLPRPLIHQIKKQRKSNFPLLIFFPNIALGEKFSIT
LKKYLPTENIAFVSSKSEERSTIVEKFRKKELSILVTTTILERGVTFPQVDVFVCMANHHLYTSSSLIQIGGRVGRSPER
PTGKLYFFHEGLSKSMLQCREEINAMNKKGGFENEVSTM

Nucleotide


Download         Length: 1320 bp        

>NTDB_id=1130680 H0504_RS07155 WP_179971743.1 1390869..1392188(-) (comFA/cflA) [Streptococcus thermophilus isolate STH_CIRM_23]
ATGATACCTAAAGAATATTATGGACGACTATTTACGAAAGAACAGTTACCAGTGGATTATCTCTCAGAGGCTGTAAAATT
AGAAAGTATGATAAAGGTTGATAAAAAACTTAGATGTAAAAGATGTTATAGTCGAATAGAGGAAGATTGGCAATTACCGA
ATGGTCAGTATTATTGTAGAGCGTGTATTGTCTTTGGTCGAAACCAAGAAGGAAAAGAACTCTATTACTTTCCCTCAGAA
AAATCAGAAGTAGATTTTCCTGTCTTGAAATGGTCAGGAAAACTGACTCCTTATCAAAATGAGGTCTCGGAAAAGCTTTT
AAAGACTTATAAAAATCAAAAACACAGTCTTGTTCATGCAGTGACTGGTGCTGGCAAGACAGAGATGATTTATAATATTG
TTGCCTATGTTCTTGAAAATAAAAATCGTGTCGTCATCGCAAGTCCCCGAGTTGATGTTTGTCGAGAATTGTTTCTACGC
ATGCAGAAAGATTTTACTTGTAGTATTTCTCTGCTTCATGCTGATAGTGAACCATATGATGGTAGTCCGCTCGTTATAGC
TACCACTCATCAATTACTAAAATTTTATCATAGCTTTGACTTGATTATTGTTGACGAAGTTGATGCCTTTCCATTTGTAG
GGAATGTCATGTTAAATCATGCTGTTAAACAGGCAAAGACGGAAACAGGCCGGTATATTTACTTAACAGCAACTTCTACA
TTAGCTTTAGAAGAGCAAGTGCGCCTTGGAGCTATAGAAAAGCATCACCTTGCTAGTCGTTTCCACGGAAATCCTTTAGT
CCTTCCTCGTTTCTTTTGGCAAGGAAGGTTACAAAAGTCGTTGACGAGCGATAAGCTTCCAAGGCCTCTAATTCACCAGA
TTAAGAAGCAGCGTAAATCAAATTTTCCTCTATTAATCTTTTTCCCCAATATAGCATTAGGTGAAAAGTTTAGTATTACC
CTAAAAAAATATCTCCCTACTGAAAACATAGCCTTTGTTTCATCAAAAAGCGAGGAGCGTTCAACCATCGTAGAGAAATT
CCGAAAAAAAGAATTGTCAATCTTAGTGACGACAACTATTCTCGAACGTGGTGTTACCTTTCCACAAGTAGATGTTTTTG
TTTGTATGGCAAATCATCACTTATATACTAGTTCGAGTCTTATTCAGATTGGTGGTAGGGTGGGGCGTTCGCCCGAGAGA
CCTACAGGGAAACTCTATTTCTTTCATGAAGGATTATCTAAATCAATGTTGCAATGTCGGGAAGAAATAAATGCAATGAA
TAAAAAAGGAGGGTTTGAAAATGAAGTGTCTACTATGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus pneumoniae Rx1

54.988

98.178

0.54

  comFA/cflA Streptococcus pneumoniae D39

54.988

98.178

0.54

  comFA/cflA Streptococcus pneumoniae R6

54.988

98.178

0.54

  comFA/cflA Streptococcus pneumoniae TIGR4

54.988

98.178

0.54

  comFA/cflA Streptococcus mitis NCTC 12261

54.651

97.95

0.535

  comFA/cflA Streptococcus mitis SK321

53.721

97.95

0.526

  comFA Lactococcus lactis subsp. cremoris KW2

45.714

95.672

0.437

  comFA Latilactobacillus sakei subsp. sakei 23K

37.011

99.089

0.367


Multiple sequence alignment