Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA/cflA   Type   Machinery gene
Locus tag   DYB47_RS02305 Genome accession   NZ_UHFV01000002
Coordinates   417870..419165 (+) Length   431 a.a.
NCBI ID   WP_003080792.1    Uniprot ID   G5JY37
Organism   Streptococcus macacae NCTC 11558     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 412870..424165
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DYB47_RS02280 (NCTC11558_00463) - 413522..414163 (+) 642 WP_003082065.1 response regulator transcription factor -
  DYB47_RS02285 (NCTC11558_00464) - 414202..415614 (+) 1413 WP_003080313.1 Cof-type HAD-IIB family hydrolase -
  DYB47_RS02290 (NCTC11558_00465) - 415601..415957 (+) 357 WP_003079785.1 S1 RNA-binding domain-containing protein -
  DYB47_RS09860 - 415962..416096 (+) 135 WP_003081823.1 hypothetical protein -
  DYB47_RS02295 (NCTC11558_00466) cysK 416154..417080 (-) 927 WP_003081389.1 cysteine synthase A -
  DYB47_RS02300 (NCTC11558_00467) - 417171..417800 (-) 630 WP_003079022.1 YigZ family protein -
  DYB47_RS02305 (NCTC11558_00468) comFA/cflA 417870..419165 (+) 1296 WP_003080792.1 DEAD/DEAH box helicase Machinery gene
  DYB47_RS02310 (NCTC11558_00469) - 419165..419830 (+) 666 WP_003078542.1 ComF family protein -
  DYB47_RS02315 (NCTC11558_00470) hpf 419908..420456 (+) 549 WP_003081889.1 ribosome hibernation-promoting factor, HPF/YfiA family -
  DYB47_RS02320 (NCTC11558_00471) - 420671..421825 (+) 1155 Protein_405 zinc ribbon domain-containing protein -

Sequence


Protein


Download         Length: 431 a.a.        Molecular weight: 49091.19 Da        Isoelectric Point: 9.6810

>NTDB_id=1171306 DYB47_RS02305 WP_003080792.1 417870..419165(+) (comFA/cflA) [Streptococcus macacae NCTC 11558]
MERLDDYYGRIFTEDQLKEELREFAKILPAVNHFTNSRICNRCGSQIAAENKLPDGSFYCRECIILGRNSSGALYYFEQK
PFPEGKVLKWQGSLTDLQKEVSDSLKIGVEQKENLLIHAVTGAGKTEIIYETVASVLDKGGAVALASPRIDVCLELYKRF
KNDFSCSISLLHGESEPYSRSPLVILTTHQLLKFYKAFDLLIIDEVDAFPFVDNPSLYYAVEQSVKADGVNVFLTATSTE
QLDKQVKKGKLKKLHLARRFHASPLVVPKTVWLYLALEKLQRGKLPFKLLRIIKEQRQSQFPLLIFFPVIKLGELFSRCL
KNYFLDEKIAFVSSMTENRLEIVEKFRKGQITVLISTTILERGVTFPFVDVFVMMANHYLFTKSSLIQIAGRVGRSAERP
TGTLLFFHSGKTKSIRKAISEIRRMNQIGGF

Nucleotide


Download         Length: 1296 bp        

>NTDB_id=1171306 DYB47_RS02305 WP_003080792.1 417870..419165(+) (comFA/cflA) [Streptococcus macacae NCTC 11558]
ATGGAAAGACTAGATGATTATTATGGACGTATTTTTACAGAAGATCAGCTAAAAGAGGAGCTGAGGGAATTTGCTAAAAT
TTTGCCGGCAGTAAACCATTTTACAAATAGTAGAATTTGCAATCGCTGCGGCAGCCAAATAGCAGCTGAAAATAAATTAC
CTGATGGCTCTTTTTATTGTCGGGAGTGCATTATTTTAGGACGAAACAGCAGCGGAGCTTTATACTATTTTGAGCAAAAA
CCTTTTCCTGAAGGAAAGGTTTTAAAATGGCAAGGATCATTAACAGATCTTCAAAAAGAAGTGTCGGATAGTTTGAAAAT
AGGAGTTGAACAAAAAGAAAACCTCCTTATACATGCAGTAACAGGTGCTGGTAAAACAGAGATAATATATGAGACAGTAG
CAAGTGTACTGGATAAAGGCGGAGCCGTAGCCTTAGCGAGTCCTCGAATTGATGTTTGTCTCGAATTATATAAACGCTTT
AAAAATGATTTTTCCTGTTCAATAAGCCTGCTGCATGGAGAGTCAGAACCATACAGTCGCAGTCCTTTAGTTATTTTGAC
AACCCATCAGCTATTAAAATTTTATAAAGCTTTTGACCTTTTAATCATTGATGAAGTTGATGCTTTCCCTTTTGTTGATA
ATCCAAGCCTTTACTATGCCGTTGAACAGAGTGTAAAAGCAGATGGAGTCAACGTTTTTTTAACAGCTACTTCAACTGAA
CAATTGGATAAACAGGTGAAAAAAGGAAAGCTTAAAAAGCTGCATTTGGCGAGACGATTTCATGCTTCTCCTCTAGTTGT
TCCAAAAACGGTTTGGTTATATCTTGCTTTAGAAAAATTACAAAGGGGCAAGCTTCCTTTTAAATTATTGAGGATTATAA
AAGAACAGCGGCAAAGTCAGTTTCCTTTACTCATATTTTTTCCGGTAATTAAGTTAGGTGAGTTATTTTCAAGGTGTTTG
AAGAATTATTTTCTAGATGAAAAAATAGCTTTTGTCTCCAGTATGACAGAAAATAGATTAGAAATAGTGGAGAAGTTTAG
AAAAGGGCAAATAACGGTATTAATATCAACGACTATTTTAGAAAGAGGAGTGACCTTTCCCTTTGTAGATGTTTTTGTTA
TGATGGCTAATCATTATTTGTTTACAAAAAGTTCCTTAATTCAGATAGCGGGGCGCGTTGGCAGGTCTGCTGAGAGGCCG
ACAGGAACGCTGCTCTTTTTCCATAGCGGAAAAACAAAGAGTATCAGAAAAGCTATTTCAGAAATTAGAAGGATGAACCA
GATTGGAGGATTTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB G5JY37

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus pneumoniae Rx1

59.953

99.072

0.594

  comFA/cflA Streptococcus pneumoniae D39

59.953

99.072

0.594

  comFA/cflA Streptococcus pneumoniae R6

59.953

99.072

0.594

  comFA/cflA Streptococcus pneumoniae TIGR4

59.953

99.072

0.594

  comFA/cflA Streptococcus mitis NCTC 12261

59.485

99.072

0.589

  comFA/cflA Streptococcus mitis SK321

59.016

99.072

0.585

  comFA Lactococcus lactis subsp. cremoris KW2

49.881

97.216

0.485

  comFA Latilactobacillus sakei subsp. sakei 23K

37.123

100

0.371

  comFA Bacillus subtilis subsp. subtilis str. 168

39.012

93.968

0.367


Multiple sequence alignment