Detailed information    

In silico: bioinformatically predicted

Overview


Name: comFA/cflA
Type: Machinery gene
Locus tag: DR994_RS05745
Genome accession: NZ_CP030927
Coordinates: 1113557..1114876 (-)
Length: 439 a.a.
NCBI ID: WP_173940557.1
UniProt ID: -
Organism: Streptococcus thermophilus strain CS9
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
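
The coordinates and the protein length reported above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(1113557, 1114876)
print(cds_bp, protein_length(cds_bp))  # 1320 439
```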

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1104164..1113576 1113557..1114876 flank -19
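
The reported distance of -19 equals the gene start minus the island end, so a negative value here means the gene begins inside the island. This convention is inferred from the single row above and is not documented on this page; the sketch below works under that assumption, and `signed_distance` and `overlap_bp` are hypothetical helper names.

```python
from typing import Tuple

Range = Tuple[int, int]  # (start, end), 1-based and inclusive


def signed_distance(mge: Range, gene: Range) -> int:
    """Gene start minus MGE end (assumed convention); negative means overlap."""
    return gene[0] - mge[1]


def overlap_bp(a: Range, b: Range) -> int:
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


island = (1104164, 1113576)
gene = (1113557, 1114876)
print(signed_distance(island, gene))  # -19
print(overlap_bp(island, gene))       # 20
```

Under this reading, the "flank" label and the 20 bp overlap describe the same thing: the gene sits at the island's right edge and shares its first 20 positions with it.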


Gene organization within MGE regions


Location: 1104164..1114876
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DR994_RS05670 (DR994_05825) - 1104164..1104697 (+) 534 WP_002949762.1 DUF402 domain-containing protein -
  DR994_RS05675 (DR994_05830) - 1104727..1105386 (-) 660 WP_173940556.1 DUF1803 domain-containing protein -
  DR994_RS05680 (DR994_05835) - 1105440..1106372 (-) 933 WP_022096881.1 manganese-dependent inorganic pyrophosphatase -
  DR994_RS05735 (DR994_05890) raiA 1112287..1112835 (-) 549 WP_022096635.1 ribosome-associated translation inhibitor RaiA -
  DR994_RS05740 (DR994_05895) - 1112914..1113576 (-) 663 WP_002949747.1 ComF family protein -
  DR994_RS05745 (DR994_05900) comFA/cflA 1113557..1114876 (-) 1320 WP_173940557.1 DEAD/DEAH box helicase Machinery gene

Sequence


Protein


Length: 439 a.a.        Molecular weight: 50627.78 Da        Isoelectric point: 9.6931

>NTDB_id=302359 DR994_RS05745 WP_173940557.1 1113557..1114876(-) (comFA/cflA) [Streptococcus thermophilus strain CS9]
MIPKEYYGRLFTKEQLPVDYLSEAVKLESMIKVDKKLRCKRCYSRIEEDWQLPNGQYYCRACIVFGRNQEGKELYYFPSE
KSEVDFPVLKWSGKLTPYQNEVSEKLLKTYKNQKHSLVHAVTGAGKTEMIYNIVAYVLENKNRVVIASPRVDVCRELFLR
MQKDFTCSISLLHADSEPYDGSPLVIATTHQLLKFYHSFDLIIVDEVDAFPFVGNVMLNHAVKQAKTETGRYIYLTATST
LALEEQVRLGAIEKHHLASRFHGNPLVLPRFFWQGRLQKSLTSEKLPRPLIHQIKKQRKSNFPLLIFFPNIALGKKFSIT
LKKYLPTENIAFVSSKSEERSTIVEKFRKKELSILVTTTILERGVTFPQVDVFVCMANHYLYTSSSLIQIGGRVGRSPER
PTGKLYFFHEGLSKSMLQCREEINAMNKKGGFENEVSTM

Nucleotide


Length: 1320 bp

>NTDB_id=302359 DR994_RS05745 WP_173940557.1 1113557..1114876(-) (comFA/cflA) [Streptococcus thermophilus strain CS9]
ATGATACCTAAAGAATATTATGGACGACTATTTACGAAAGAACAGTTACCAGTGGATTATCTCTCAGAGGCTGTAAAATT
AGAAAGTATGATAAAGGTTGATAAAAAACTTAGATGTAAAAGATGTTATAGTCGAATAGAGGAAGATTGGCAATTACCAA
ATGGTCAGTATTATTGTAGAGCGTGTATTGTCTTTGGTCGAAACCAAGAAGGAAAAGAACTCTATTACTTTCCCTCAGAA
AAATCAGAAGTAGATTTTCCTGTCTTGAAATGGTCAGGAAAACTGACTCCTTATCAAAATGAGGTCTCGGAAAAGCTTTT
AAAGACTTATAAAAATCAAAAACACAGTCTTGTTCATGCAGTGACTGGTGCTGGCAAGACAGAGATGATTTATAATATTG
TTGCCTATGTTCTTGAAAATAAAAATCGTGTCGTCATCGCAAGTCCCCGAGTTGATGTTTGTCGAGAATTGTTTCTACGC
ATGCAGAAAGATTTTACTTGTAGTATTTCTCTGCTTCATGCTGATAGTGAACCATATGATGGTAGTCCGCTCGTTATAGC
TACCACTCATCAATTACTAAAATTTTATCATAGCTTTGACTTGATTATTGTTGACGAAGTTGATGCCTTTCCATTTGTAG
GGAATGTCATGTTAAATCATGCTGTTAAACAGGCAAAGACGGAAACAGGCCGGTATATTTACTTAACAGCAACTTCTACA
TTAGCTTTAGAAGAGCAAGTGCGCCTTGGAGCTATAGAAAAGCATCACCTTGCTAGTCGTTTCCACGGAAATCCTTTAGT
CCTTCCTCGTTTCTTTTGGCAAGGAAGGTTACAAAAGTCGTTGACGAGCGAGAAGCTTCCAAGGCCTCTAATTCACCAGA
TTAAGAAGCAGCGTAAATCAAATTTTCCTCTATTAATCTTTTTCCCCAATATAGCATTAGGTAAAAAGTTTAGTATTACC
CTAAAAAAATATCTCCCTACTGAAAACATAGCCTTTGTTTCATCAAAAAGCGAGGAGCGTTCAACCATTGTTGAGAAATT
CCGAAAAAAAGAATTGTCAATCTTAGTGACGACAACTATTCTCGAACGTGGTGTTACCTTTCCACAAGTAGATGTTTTTG
TTTGTATGGCAAATCATTACTTATATACTAGTTCGAGTCTTATTCAGATTGGTGGTAGGGTGGGGCGTTCGCCCGAGAGA
CCTACGGGGAAACTCTATTTCTTTCATGAAGGATTATCTAAATCAATGTTGCAATGTCGGGAAGAAATAAATGCAATGAA
TAAAAAAGGAGGGTTTGAAAATGAAGTGTCTACTATGTAA
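
The nucleotide record is given in coding orientation (it starts with ATG and ends with TAA), so translating it directly should reproduce the protein sequence above. The following is a minimal sketch using the standard genetic code, with only short excerpts from the record as input; `translate` is an illustrative helper, not a tool used by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First nine codons and last six codons of the record above.
print(translate("ATGATACCTAAAGAATATTATGGACGA"))  # MIPKEYYGR
print(translate("GAAGTGTCTACTATGTAA"))           # EVSTM
```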


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus pneumoniae Rx1 54.756 98.178 0.538
  comFA/cflA Streptococcus pneumoniae D39 54.756 98.178 0.538
  comFA/cflA Streptococcus pneumoniae R6 54.756 98.178 0.538
  comFA/cflA Streptococcus pneumoniae TIGR4 54.756 98.178 0.538
  comFA/cflA Streptococcus mitis NCTC 12261 54.419 97.95 0.533
  comFA/cflA Streptococcus mitis SK321 53.488 97.95 0.524
  comFA Lactococcus lactis subsp. cremoris KW2 45.714 95.672 0.437
  comFA Latilactobacillus sakei subsp. sakei 23K 37.011 99.089 0.367
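
The listed Ha-values are consistent with the product of identity and coverage, each expressed as a fraction. That relationship is inferred from the numbers on this page rather than stated here, so the sketch below is only a sanity check under that assumption; `ha_value` is a hypothetical name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction times coverage fraction, 3 decimals."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# (identity %, coverage %, listed Ha-value) for the distinct rows in the table.
rows = [
    (54.756, 98.178, 0.538),
    (54.419, 97.95, 0.533),
    (53.488, 97.95, 0.524),
    (45.714, 95.672, 0.437),
    (37.011, 99.089, 0.367),
]

for identity, coverage, listed in rows:
    print(identity, coverage, ha_value(identity, coverage), listed)
```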


Multiple sequence alignment