Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA/cflA   Type   Machinery gene
Locus tag   SAIN_RS02165 Genome accession   NC_022244
Coordinates   434564..435865 (+) Length   433 a.a.
NCBI ID   WP_021001373.1    Uniprot ID   A0A0P0N797
Organism   Streptococcus anginosus C1051     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 429564..440865
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SAIN_RS02150 - 432039..432560 (+) 522 WP_037590270.1 hypothetical protein -
  SAIN_RS02155 (SAIN_0421) cysK 432844..433773 (-) 930 WP_021001371.1 cysteine synthase A -
  SAIN_RS02160 (SAIN_0422) - 433872..434507 (-) 636 WP_021001372.1 YigZ family protein -
  SAIN_RS02165 (SAIN_0423) comFA/cflA 434564..435865 (+) 1302 WP_021001373.1 DEAD/DEAH box helicase Machinery gene
  SAIN_RS02170 (SAIN_0424) - 435862..436527 (+) 666 WP_003025757.1 ComF family protein -
  SAIN_RS02175 (SAIN_0425) hpf 436606..437148 (+) 543 WP_003025754.1 ribosome hibernation-promoting factor, HPF/YfiA family -
  SAIN_RS02180 (SAIN_0426) rsmD 437336..437884 (+) 549 WP_025271932.1 16S rRNA (guanine(966)-N(2))-methyltransferase RsmD -
  SAIN_RS02185 (SAIN_0427) coaD 437874..438371 (+) 498 WP_021001374.1 pantetheine-phosphate adenylyltransferase -
  SAIN_RS02190 (SAIN_0428) sepM 438352..439407 (+) 1056 WP_021001375.1 SepM family pheromone-processing serine protease Regulator
  SAIN_RS02195 (SAIN_0429) - 439545..440111 (+) 567 WP_021001376.1 YutD family protein -

Sequence


Protein


Download         Length: 433 a.a.        Molecular weight: 48894.84 Da        Isoelectric Point: 8.8375

>NTDB_id=61749 SAIN_RS02165 WP_021001373.1 434564..435865(+) (comFA/cflA) [Streptococcus anginosus C1051]
MTELQDCLGRIFTKSQLSPELQLQAQTLAGMVEEKGKLSCNRCGQVIDKEKHQLPIGAYYCRSCLILGRVRSDEDLYYFP
QEEFPKANVLKWQGKLTAFQAKVSQGLVEAVAKRKNSLVHAVTGAGKTEMIYQVVAQVINQGGAVCLASPRIDVCLELYR
RLKVDFTCDISLLHGESEAYSRSPLVIATTHQLLKFYRAFDLLIVDEVDAFPYVDNPMLYHAVHQAVKVEGTKIFLTATS
TDELDKKVAKGELTRLSLPRRFHGNPLIVPQKIWLADFQKYLGQKKLVPKLRQFIQKQRKTGFPLLIFASEIRKGQELAE
ILQSTFPNEKVGFVASTTENRLEIVEKFRQKEITILVTTTILERGVTFPCVDVFVVEANHRLFSRSALVQIAGRVGRSME
RPTGELIFFHDGTTMAIEKAIKEIQEMNQEAGL

Nucleotide


Download         Length: 1302 bp        

>NTDB_id=61749 SAIN_RS02165 WP_021001373.1 434564..435865(+) (comFA/cflA) [Streptococcus anginosus C1051]
ATGACAGAATTACAAGATTGTTTAGGTCGCATTTTTACAAAAAGTCAACTGTCACCAGAATTACAATTGCAAGCACAAAC
CTTAGCTGGAATGGTAGAAGAAAAAGGGAAGTTAAGCTGCAATCGTTGTGGGCAAGTCATTGACAAAGAAAAACACCAAT
TGCCAATAGGTGCTTATTATTGCAGGTCTTGCTTGATCTTAGGAAGGGTCAGAAGTGATGAAGATCTTTACTATTTTCCA
CAGGAAGAATTTCCTAAAGCGAATGTCTTGAAATGGCAAGGGAAGTTGACAGCGTTTCAAGCCAAGGTTTCTCAAGGACT
TGTAGAGGCGGTTGCTAAACGAAAAAATAGCTTGGTTCACGCAGTCACGGGAGCCGGAAAGACGGAAATGATCTATCAGG
TGGTGGCACAAGTCATCAATCAAGGTGGAGCAGTCTGCTTGGCTAGTCCCAGAATTGATGTCTGCTTAGAACTTTATCGC
AGATTGAAAGTAGATTTTACTTGTGATATTTCACTCCTACACGGTGAATCAGAAGCATATTCCCGCAGTCCTCTCGTGAT
TGCCACCACACATCAGCTTCTCAAATTTTACCGAGCATTTGATCTTCTTATTGTTGATGAGGTAGATGCCTTTCCTTATG
TGGACAATCCGATGCTTTATCATGCAGTTCATCAGGCAGTCAAAGTAGAGGGGACGAAGATTTTCTTAACGGCAACTTCC
ACAGATGAGCTGGATAAAAAAGTGGCTAAAGGAGAATTAACTCGTTTGAGTCTACCCAGACGTTTTCATGGCAATCCTTT
GATTGTTCCGCAAAAAATTTGGTTGGCGGATTTTCAAAAATATCTTGGTCAAAAGAAGCTAGTTCCCAAGTTAAGGCAGT
TTATTCAAAAGCAGAGAAAAACAGGATTTCCACTTCTTATTTTTGCTTCCGAGATCAGAAAAGGGCAGGAGCTGGCAGAG
ATTCTTCAAAGCACTTTTCCTAATGAAAAAGTTGGTTTTGTAGCCTCGACGACTGAAAATCGACTAGAAATTGTAGAGAA
ATTTCGTCAAAAAGAAATCACGATTTTGGTAACGACGACGATTTTGGAGCGTGGGGTGACTTTTCCTTGTGTAGATGTTT
TCGTGGTGGAAGCCAACCACCGTCTGTTTAGTCGCAGCGCTCTGGTACAAATTGCTGGACGCGTTGGTCGCAGTATGGAG
CGACCGACAGGCGAGTTAATCTTTTTTCATGACGGTACGACTATGGCGATAGAAAAAGCGATTAAAGAAATTCAGGAGAT
GAATCAGGAGGCCGGTTTATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0P0N797

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus pneumoniae Rx1

71.729

98.845

0.709

  comFA/cflA Streptococcus pneumoniae D39

71.729

98.845

0.709

  comFA/cflA Streptococcus pneumoniae R6

71.729

98.845

0.709

  comFA/cflA Streptococcus pneumoniae TIGR4

71.495

98.845

0.707

  comFA/cflA Streptococcus mitis NCTC 12261

71.596

98.383

0.704

  comFA/cflA Streptococcus mitis SK321

70.892

98.383

0.697

  comFA Lactococcus lactis subsp. cremoris KW2

55.528

91.917

0.51

  comFA Latilactobacillus sakei subsp. sakei 23K

40.698

99.307

0.404

  comFA Bacillus subtilis subsp. subtilis str. 168

40.247

93.533

0.376


Multiple sequence alignment