Detailed information    

In silico: bioinformatically predicted

Overview


Name: comFA
Type: Machinery gene
Locus tag: S101395_RS03040
Genome accession: NZ_CP021920
Coordinates: 589297..590643 (+)
Length: 448 a.a.
NCBI ID: WP_006639265.1
UniProt ID: M5NZS0
Organism: Bacillus sonorensis strain SRCM101395
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake
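
The coordinates and protein length listed above are mutually consistent: the closed interval 589297..590643 spans 1347 bp, which is 449 codons, and dropping the terminal stop codon leaves 448 residues. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(589297, 590643)
print(cds_bp, protein_length(cds_bp))  # prints: 1347 448
```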

Genomic Context


Location: 584297..595643
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
S101395_RS03020 (S101395_00610) | - | 585354..585992 (-) | 639 | WP_006639268.1 | YigZ family protein | -
S101395_RS03025 (S101395_00611) | degS | 586214..587371 (+) | 1158 | WP_006639267.1 | sensor histidine kinase | Regulator
S101395_RS03030 (S101395_00612) | degU | 587453..588142 (+) | 690 | WP_003185730.1 | two-component system response regulator DegU | Regulator
S101395_RS03035 (S101395_00613) | - | 588262..589104 (+) | 843 | WP_006639266.1 | DegV family protein | -
S101395_RS03040 (S101395_00614) | comFA | 589297..590643 (+) | 1347 | WP_006639265.1 | DEAD/DEAH box helicase | Machinery gene
S101395_RS03045 (S101395_00615) | - | 590700..590984 (+) | 285 | WP_006639264.1 | late competence development ComFB family protein | -
S101395_RS26895 (S101395_00616) | comFC | 590941..591675 (+) | 735 | WP_006639263.1 | ComF family protein | Machinery gene
S101395_RS03055 (S101395_00617) | - | 591733..592152 (+) | 420 | WP_006639262.1 | TIGR03826 family flagellar region protein | -
S101395_RS03060 (S101395_00618) | flgM | 592232..592495 (+) | 264 | WP_006639261.1 | flagellar biosynthesis anti-sigma factor FlgM | -
S101395_RS03065 (S101395_00619) | - | 592510..592992 (+) | 483 | WP_029419581.1 | flagellar protein FlgN | -
S101395_RS03070 (S101395_00620) | flgK | 593010..594533 (+) | 1524 | WP_006639259.1 | flagellar hook-associated protein FlgK | -
S101395_RS03075 (S101395_00621) | flgL | 594544..595455 (+) | 912 | WP_006639258.1 | flagellar hook-associated protein FlgL | -
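
The Location range matches the comFA coordinates extended by 5000 bp on each side, and the listed coordinates show that S101395_RS03045 and comFC overlap. The sketch below reproduces both observations from three rows of the table. The 5000 bp flank is inferred from the numbers rather than stated in the record, and the helper names are hypothetical.

```python
# (locus tag, start, end) copied from the table; 1-based, inclusive coordinates
genes = [
    ("S101395_RS03040", 589297, 590643),  # comFA
    ("S101395_RS03045", 590700, 590984),  # ComFB family protein
    ("S101395_RS26895", 590941, 591675),  # comFC
]

FLANK = 5000  # assumed flank size, inferred from the Location line


def window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Context window around a gene, clamped at position 1."""
    return max(1, start - flank), end + flank


def gap(upstream, downstream) -> int:
    """bp between two consecutive genes; a negative value is an overlap."""
    return downstream[1] - upstream[2] - 1


print(window(589297, 590643))   # prints: (584297, 595643)
print(gap(genes[0], genes[1]))  # prints: 56
print(gap(genes[1], genes[2]))  # prints: -44
```

A negative gap of 44 bp between adjacent same-strand genes is what the table shows for S101395_RS03045 and comFC; the record itself makes no claim about how those genes are transcribed.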

Sequence


Protein


Length: 448 a.a.; Molecular weight: 50591.30 Da; Isoelectric point: 10.0116

>NTDB_id=234871 S101395_RS03040 WP_006639265.1 589297..590643(+) (comFA) [Bacillus sonorensis strain SRCM101395]
MLESRHLLKSELPFPDHVIEWHIQKGLIKTEKPIAKTAKGFICKRCGQDQQTFFAKYPCFICDKTCVYCRSCVMMGRVSE
CTPLLTWKDADLPKWPAVRMEWRGVLSEGQEKAASSIVEAIRKKEELLIWAVCGAGKTEILFQGIEFALTKGLRVCIATP
RTDVVLELAPRLKNAIKGVEIAALYGGSPDRGMLSPLMISTTHQLLRYKEAFDVIIVDEVDAFPYCLDKKLQYAVKKAGK
QQCTRIYLTATPSREMKRHVGSGRLKAVQIPARYHRSPLPEPEFAWCGNWKKRLERKNIPSAVKNWLFKHKELDQPVFLF
VPSIQTLQSAVRLLKKEHFNTAGVHADDPDRNEKVKQFRSGAFDILVTTTILERGVTVKKAQVGVLGAESAVFTESALVQ
ISGRAGRHPQFPTGAVCFFHFGKTVNMIAARRHIQQMNKMAKLENLID

Nucleotide


Length: 1347 bp

>NTDB_id=234871 S101395_RS03040 WP_006639265.1 589297..590643(+) (comFA) [Bacillus sonorensis strain SRCM101395]
ATGCTTGAGTCCCGCCACCTTCTCAAAAGCGAGTTGCCTTTTCCCGATCATGTGATTGAATGGCATATCCAAAAAGGTTT
GATAAAAACTGAAAAACCGATTGCAAAAACGGCAAAAGGCTTTATTTGCAAACGATGCGGACAGGATCAGCAGACGTTTT
TTGCTAAATACCCTTGTTTTATTTGTGATAAAACCTGCGTTTACTGCCGTTCATGCGTGATGATGGGGAGAGTAAGCGAA
TGTACGCCGCTCTTAACTTGGAAAGACGCTGACCTGCCCAAATGGCCGGCTGTCCGGATGGAGTGGAGAGGCGTTCTTTC
CGAGGGGCAGGAAAAAGCGGCAAGCTCGATTGTTGAAGCCATCCGCAAAAAAGAAGAGCTGTTAATTTGGGCGGTTTGCG
GTGCCGGTAAAACAGAGATTCTTTTTCAGGGAATAGAATTTGCTTTGACTAAAGGCTTGAGAGTATGTATAGCCACTCCG
AGAACAGATGTTGTTCTTGAGCTTGCCCCGAGATTAAAGAACGCTATTAAAGGAGTAGAAATTGCCGCTTTATACGGAGG
GAGTCCAGACAGGGGGATGCTCTCGCCTCTCATGATTTCTACCACCCACCAGCTTCTGCGCTACAAAGAAGCGTTCGATG
TGATCATTGTTGATGAGGTTGATGCATTTCCATATTGTTTGGATAAAAAGCTGCAATACGCCGTGAAAAAAGCGGGAAAG
CAGCAATGCACCCGAATATATTTAACGGCTACACCTTCACGGGAAATGAAACGGCATGTTGGATCCGGGAGGCTTAAGGC
CGTTCAGATTCCGGCAAGGTATCACAGAAGTCCATTGCCAGAACCGGAATTTGCATGGTGCGGCAACTGGAAAAAGAGAC
TTGAACGAAAAAACATTCCCTCCGCCGTGAAAAATTGGCTTTTCAAGCATAAAGAACTGGATCAGCCTGTCTTTTTATTT
GTGCCCTCGATTCAGACGCTTCAATCTGCGGTCAGGCTGTTGAAAAAGGAACATTTCAATACAGCGGGCGTACATGCGGA
TGACCCGGATAGAAATGAAAAGGTAAAGCAGTTTAGAAGCGGAGCGTTTGATATTCTCGTCACAACAACCATATTGGAAC
GGGGGGTAACGGTCAAAAAAGCCCAGGTCGGAGTGCTCGGTGCGGAGTCAGCCGTTTTTACAGAAAGCGCTCTTGTTCAA
ATATCGGGAAGAGCGGGGAGGCATCCGCAGTTTCCGACCGGAGCCGTTTGTTTTTTTCATTTCGGCAAGACGGTCAATAT
GATAGCCGCCCGCCGTCATATTCAACAAATGAATAAAATGGCTAAACTGGAAAATTTGATTGACTAG
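
The nucleotide record can be checked against the protein record by translating it codon by codon. The sketch below is a minimal, dependency-free translation that uses the standard codon-to-amino-acid assignments (which the bacterial genetic code shares) and stops at the first stop codon. Only the first 30 and last 15 nucleotides of the sequence above are used, to keep the example short; `translate` is an illustrative helper, not the tool used to build this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame 0 until the first stop codon or the end of input."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


head = "ATGCTTGAGTCCCGCCACCTTCTCAAAAGC"  # first 30 nt of the CDS
tail = "AATTTGATTGACTAG"                 # last 15 nt, ending in the stop codon
print(translate(head))  # prints: MLESRHLLKS
print(translate(tail))  # prints: NLID
```

Both fragments agree with the start (MLESRHLLKS) and end (NLID) of the protein sequence listed in the Protein section.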


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source: AlphaFold DB, ID: M5NZS0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comFA | Bacillus subtilis subsp. subtilis str. 168 | 61.136 | 98.214 | 0.6
comFA/cflA | Streptococcus mitis SK321 | 37.9 | 97.768 | 0.371
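
The Ha-values listed here are numerically consistent with the product of the identity and coverage fractions. That relationship is inferred from the two rows above and is not a definition given in this record, so the sketch below should be read as a consistency check under that assumption; `ha_value` is a hypothetical helper name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed score: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100) * (coverage_pct / 100)


# (protein, identities %, coverage %) copied from the table
hits = [
    ("comFA", 61.136, 98.214),
    ("comFA/cflA", 37.9, 97.768),
]
for name, identity, coverage in hits:
    print(name, round(ha_value(identity, coverage), 3))
# prints: comFA 0.6
# prints: comFA/cflA 0.371
```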


Multiple sequence alignment