Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA/cflA   Type   Machinery gene
Locus tag   GE023_RS08115 Genome accession   NZ_CP053789
Coordinates   1629908..1631233 (-) Length   441 a.a.
NCBI ID   WP_093999505.1    Uniprot ID   A0A3P5Y1B1
Organism   Streptococcus canis strain HL_98_2     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1624908..1636233
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GE023_RS08100 (GE023_008095) - 1627637..1628591 (+) 955 Protein_1553 IS30 family transposase -
  GE023_RS08105 (GE023_008100) raiA 1628643..1629191 (-) 549 WP_003044623.1 ribosome-associated translation inhibitor RaiA -
  GE023_RS11010 - 1629271..1629693 (-) 423 WP_003044691.1 ComF family protein -
  GE023_RS08115 (GE023_008110) comFA/cflA 1629908..1631233 (-) 1326 WP_093999505.1 DEAD/DEAH box helicase Machinery gene
  GE023_RS08120 (GE023_008115) - 1631289..1631921 (+) 633 WP_159305988.1 YigZ family protein -
  GE023_RS08125 (GE023_008120) cysK 1632039..1632974 (+) 936 WP_003044700.1 cysteine synthase A -
  GE023_RS08130 (GE023_008125) - 1633141..1633500 (-) 360 WP_003044703.1 S1 RNA-binding domain-containing protein -
  GE023_RS08135 (GE023_008130) - 1633500..1634900 (-) 1401 WP_125073665.1 bifunctional Cof-type HAD-IIB family hydrolase/peptidylprolyl isomerase -
  GE023_RS08140 (GE023_008135) - 1634937..1635578 (-) 642 WP_003044709.1 response regulator transcription factor -

Sequence


Protein


Download         Length: 441 a.a.        Molecular weight: 49753.89 Da        Isoelectric Point: 9.7993

>NTDB_id=447173 GE023_RS08115 WP_093999505.1 1629908..1631233(-) (comFA/cflA) [Streptococcus canis strain HL_98_2]
MEGIENYYGRLFLENQLPEEGKHLAKPLESIVITKGKVTCQRCRYHISEEERLPSGAYYCRFCLVFGRNQSDKSLYYMSP
KPFPQGSCLKWKGQLTPHQKKISQQLVRNVQAKKPTLVHAVTGAGKTEMIYAAIARVIEAGGWVCLASPRVDVCIEVAKR
LSQAFSCQVCLMHAGSSPYQRSPIIVATTHQLLTFYKAFDLLIIDEVDAFPFVTNVQLNHAANQASKTDAARILLTATST
TTLEKQVKRGEVEKLTLARRFHNHPLVIPQFIRSFAILNNIHCHKIPEIVIKYLREQRQTGYPLLIFLPVITTAEIVTNL
LKKAFPKEKIACVSSQAEEREKDITAFRQGEKTILVTTTILERGVTFPGVDVFVLAAHHRVFTPQSLVQISGRVGRSLER
PTGNLYFFHDGISRAMLKARKEIKEMNQKGYPDDMPTVPTT

Nucleotide


Download         Length: 1326 bp        

>NTDB_id=447173 GE023_RS08115 WP_093999505.1 1629908..1631233(-) (comFA/cflA) [Streptococcus canis strain HL_98_2]
ATGGAAGGTATTGAAAATTATTACGGTCGTTTATTTTTAGAAAATCAATTGCCAGAAGAAGGAAAGCATCTTGCAAAACC
TTTAGAAAGTATAGTAATTACAAAAGGTAAAGTTACTTGCCAACGATGCCGTTACCACATTAGTGAGGAAGAAAGATTGC
CTAGTGGCGCTTATTATTGCCGCTTCTGTCTTGTCTTTGGCCGCAATCAATCTGATAAATCGCTTTACTATATGTCCCCT
AAGCCATTCCCTCAAGGAAGCTGTCTTAAATGGAAAGGACAATTAACACCTCATCAAAAAAAGATTTCCCAACAGTTGGT
AAGAAATGTACAGGCCAAAAAGCCTACCTTGGTTCATGCTGTAACAGGAGCTGGTAAAACAGAAATGATTTATGCAGCTA
TTGCAAGAGTTATTGAAGCAGGTGGCTGGGTATGTTTAGCTAGCCCGCGAGTAGATGTTTGTATCGAAGTGGCCAAACGG
TTATCTCAAGCCTTTTCGTGTCAAGTCTGTTTAATGCATGCTGGATCCTCTCCTTACCAGAGAAGTCCTATTATTGTAGC
AACAACACATCAGTTACTCACTTTTTACAAGGCTTTTGACCTTTTGATTATTGATGAAGTTGATGCTTTTCCATTTGTGA
CAAACGTTCAACTGAATCACGCTGCAAATCAAGCCTCTAAAACAGATGCTGCTAGGATTTTATTAACAGCAACATCAACG
ACAACCTTAGAAAAACAAGTCAAAAGAGGAGAAGTAGAAAAGCTAACATTAGCTAGGCGCTTTCATAATCATCCCTTAGT
TATTCCACAGTTTATCAGAAGTTTTGCTATCTTGAACAATATACACTGCCATAAGATTCCTGAAATAGTCATCAAATACC
TTCGAGAGCAGCGGCAGACAGGTTATCCCCTGTTAATATTTCTTCCTGTTATTACAACAGCAGAAATAGTCACCAATTTG
TTAAAAAAAGCTTTTCCAAAAGAAAAGATAGCTTGTGTTTCAAGTCAAGCAGAAGAGAGGGAAAAAGATATTACTGCCTT
TCGCCAGGGTGAAAAAACTATTTTAGTCACGACAACAATCTTGGAAAGGGGGGTTACCTTTCCTGGTGTTGATGTCTTTG
TACTAGCAGCCCACCATCGAGTTTTCACCCCACAAAGTCTTGTTCAAATCTCGGGCCGTGTGGGAAGATCTCTGGAGAGA
CCGACAGGGAATTTATACTTTTTTCATGATGGCATCAGTAGAGCAATGTTGAAGGCTCGAAAAGAAATCAAAGAAATGAA
TCAGAAAGGTTACCCAGATGACATGCCTACTGTGCCAACAACCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A3P5Y1B1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus pneumoniae Rx1

55.294

96.372

0.533

  comFA/cflA Streptococcus pneumoniae D39

55.294

96.372

0.533

  comFA/cflA Streptococcus pneumoniae R6

55.294

96.372

0.533

  comFA/cflA Streptococcus pneumoniae TIGR4

55.294

96.372

0.533

  comFA/cflA Streptococcus mitis NCTC 12261

54.118

96.372

0.522

  comFA/cflA Streptococcus mitis SK321

53.63

96.825

0.519

  comFA Lactococcus lactis subsp. cremoris KW2

45.476

97.732

0.444

  comFA Latilactobacillus sakei subsp. sakei 23K

39.558

92.29

0.365