Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA/cflA   Type   Machinery gene
Locus tag   LK450_RS04835 Genome accession   NZ_CP085939
Coordinates   997037..998338 (-) Length   433 a.a.
NCBI ID   WP_022524394.1    Uniprot ID   A0A3S4MUE3
Organism   Streptococcus anginosus subsp. anginosus strain FDAARGOS_1569     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 992037..1003338
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LK450_RS04805 (LK450_04805) - 992793..993356 (-) 564 WP_026248195.1 YutD family protein -
  LK450_RS04810 (LK450_04810) sepM 993496..994551 (-) 1056 WP_003032358.1 SepM family pheromone-processing serine protease Regulator
  LK450_RS04815 (LK450_04815) coaD 994532..995029 (-) 498 WP_003032432.1 pantetheine-phosphate adenylyltransferase -
  LK450_RS04820 (LK450_04820) rsmD 995019..995567 (-) 549 WP_025271932.1 16S rRNA (guanine(966)-N(2))-methyltransferase RsmD -
  LK450_RS04825 (LK450_04825) hpf 995755..996297 (-) 543 WP_003032381.1 ribosome hibernation-promoting factor, HPF/YfiA family -
  LK450_RS04830 (LK450_04830) comFC/cflB 996375..997040 (-) 666 WP_003032902.1 ComF family protein Machinery gene
  LK450_RS04835 (LK450_04835) comFA/cflA 997037..998338 (-) 1302 WP_022524394.1 DEAD/DEAH box helicase Machinery gene
  LK450_RS04840 (LK450_04840) - 998395..999030 (+) 636 WP_003032900.1 YigZ family protein -
  LK450_RS04845 (LK450_04845) cysK 999129..1000052 (+) 924 WP_003032904.1 cysteine synthase A -
  LK450_RS04850 (LK450_04850) - 1000352..1000825 (-) 474 Protein_965 DNA cytosine methyltransferase -
  LK450_RS04855 (LK450_04855) - 1001004..1001831 (-) 828 WP_003030992.1 hypothetical protein -

Sequence


Protein


Download         Length: 433 a.a.        Molecular weight: 49110.09 Da        Isoelectric Point: 8.5998

>NTDB_id=621406 LK450_RS04835 WP_022524394.1 997037..998338(-) (comFA/cflA) [Streptococcus anginosus subsp. anginosus strain FDAARGOS_1569]
MIELQDCLGRMFTKAQLPAELQLYAQTLPAMKEEKGKLCCNRCGQAVDKERHQLPIGAYYCRSCLILGRVRSDEDLYYFP
QEEFPKANVLKWQGKLTEFQAKVSQGLVEAVTKRKDSLVHAVTGAGKTEMIYQVVAQVINQGGAVCLASPRIDVCLELYR
RLKVDFTCDISLLRGESEAYSRSPLVIATTHQLLKFYQAFDLLIVDEVDAFPYVDNPMLYHAVHQAVKVEGTKIFLTATS
TDELDKKVAKGELTRLSLPRRFHGNPLIVPQKIWLADFQKYLGQKKLVTKLRQFIQKQRKTGFPLLIFASEIRRGQELAE
ILQSNFPNEKVGFVASTTENRLEIVEKFRQKEITILVTTTILERGVTFPCVDVFVVEANHRLFSRSALVQIAGRVGRSMD
RPTGELIFFHDGTTMAIEKAIKEIQEMNQEAGL

Nucleotide


Download         Length: 1302 bp        

>NTDB_id=621406 LK450_RS04835 WP_022524394.1 997037..998338(-) (comFA/cflA) [Streptococcus anginosus subsp. anginosus strain FDAARGOS_1569]
ATGATAGAATTACAAGATTGCTTAGGCCGTATGTTTACAAAAGCACAACTACCAGCAGAATTGCAATTATATGCGCAAAC
TTTACCAGCAATGAAAGAAGAAAAAGGAAAATTATGTTGCAATCGTTGTGGGCAAGCAGTTGATAAAGAAAGACACCAAC
TGCCAATAGGTGCTTACTACTGTAGATCCTGCTTGATCTTAGGAAGGGTTAGAAGTGATGAAGATCTCTACTATTTTCCA
CAGGAAGAGTTTCCTAAAGCGAATGTCTTGAAATGGCAAGGAAAGTTGACAGAATTTCAAGCTAAGGTTTCTCAAGGACT
TGTAGAGGCGGTTACCAAACGCAAAGATAGCTTGGTTCACGCAGTCACAGGAGCCGGAAAGACGGAAATGATCTATCAGG
TGGTGGCACAAGTCATCAATCAAGGCGGAGCCGTCTGCTTAGCTAGCCCCAGAATTGATGTCTGCTTAGAACTTTATCGC
AGACTGAAAGTAGATTTTACCTGTGATATTTCACTCCTGCGCGGCGAATCAGAAGCATATTCCCGCAGTCCTCTCGTGAT
TGCCACCACACATCAGCTTCTCAAATTTTATCAAGCATTTGATCTTCTTATCGTTGATGAAGTAGATGCCTTTCCTTATG
TGGACAATCCGATGCTTTATCATGCAGTTCATCAGGCAGTCAAAGTAGAGGGGACGAAGATTTTCTTAACAGCAACTTCC
ACAGATGAGCTGGATAAAAAAGTGGCTAAAGGAGAATTAACTCGTTTGAGTCTACCCAGACGTTTTCATGGCAATCCTTT
GATTGTTCCGCAAAAAATTTGGTTGGCGGATTTTCAAAAATATCTTGGTCAAAAGAAGCTAGTTACCAAGCTAAGGCAGT
TTATTCAAAAGCAGAGAAAAACAGGATTTCCACTTCTTATTTTTGCTTCCGAGATCAGAAGAGGGCAGGAGCTGGCAGAG
ATTCTCCAAAGCAATTTTCCAAATGAAAAAGTTGGTTTTGTAGCTTCGACGACTGAAAATCGACTAGAAATTGTAGAGAA
GTTTCGTCAAAAAGAAATCACGATTTTGGTAACGACGACGATTTTGGAGCGTGGGGTGACCTTTCCTTGTGTAGATGTTT
TCGTGGTGGAAGCCAACCACCGTCTGTTTAGTCGCAGCGCTCTGGTACAAATCGCTGGACGCGTCGGTCGCAGTATGGAT
CGACCGACAGGCGAGTTAATCTTTTTTCATGATGGTACAACTATGGCGATAGAAAAAGCGATTAAAGAAATTCAGGAGAT
GAATCAGGAGGCCGGTTTATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A3S4MUE3

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus pneumoniae Rx1

71.729

98.845

0.709

  comFA/cflA Streptococcus pneumoniae D39

71.729

98.845

0.709

  comFA/cflA Streptococcus pneumoniae R6

71.729

98.845

0.709

  comFA/cflA Streptococcus pneumoniae TIGR4

71.495

98.845

0.707

  comFA/cflA Streptococcus mitis NCTC 12261

71.596

98.383

0.704

  comFA/cflA Streptococcus mitis SK321

71.362

98.383

0.702

  comFA Lactococcus lactis subsp. cremoris KW2

56.03

91.917

0.515

  comFA Latilactobacillus sakei subsp. sakei 23K

41.763

99.538

0.416

  comFA Bacillus subtilis subsp. subtilis str. 168

40

92.379

0.37