Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comFA
Type: Machinery gene
Locus tag: R5H20_RS20570
Genome accession: NZ_CP137345
Coordinates: 4007737..4009083 (-)
Length: 448 a.a.
NCBI ID: WP_006639265.1
UniProt ID: M5NZS0
Organism: Bacillus sp. KICET-3
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake
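
The coordinates and lengths in this entry can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the annotated span includes the stop codon (the nucleotide sequence in this record ends in TAG). The helper name `parse_coordinates` is hypothetical and not part of any database tooling.

```python
def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    span, strand = text.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")


# Values copied from the Overview of this record.
start, end, strand = parse_coordinates("4007737..4009083 (-)")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
gene_length_bp = end - start + 1

# Assumption: the span includes the stop codon, which is not translated.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 1347 448
```

Under those assumptions, the result agrees with the 1347 bp nucleotide length and the 448 a.a. protein length reported in this entry.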

Genomic Context


Location: 4002737..4014083
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  R5H20_RS20535 (R5H20_20535) flgL 4002925..4003836 (-) 912 WP_006639258.1 flagellar hook-associated protein FlgL -
  R5H20_RS20540 (R5H20_20540) flgK 4003847..4005370 (-) 1524 WP_006639259.1 flagellar hook-associated protein FlgK -
  R5H20_RS20545 (R5H20_20545) - 4005388..4005870 (-) 483 WP_029419581.1 flagellar protein FlgN -
  R5H20_RS20550 (R5H20_20550) flgM 4005885..4006148 (-) 264 WP_006639261.1 flagellar biosynthesis anti-sigma factor FlgM -
  R5H20_RS20555 (R5H20_20555) - 4006228..4006647 (-) 420 WP_006639262.1 TIGR03826 family flagellar region protein -
  R5H20_RS20560 (R5H20_20560) comFC 4006705..4007439 (-) 735 WP_006639263.1 ComF family protein Machinery gene
  R5H20_RS20565 (R5H20_20565) - 4007396..4007680 (-) 285 WP_006639264.1 late competence development ComFB family protein -
  R5H20_RS20570 (R5H20_20570) comFA 4007737..4009083 (-) 1347 WP_006639265.1 DEAD/DEAH box helicase Machinery gene
  R5H20_RS20575 (R5H20_20575) - 4009276..4010118 (-) 843 WP_006639266.1 DegV family protein -
  R5H20_RS20580 (R5H20_20580) degU 4010238..4010927 (-) 690 WP_003185730.1 two-component system response regulator DegU Regulator
  R5H20_RS20585 (R5H20_20585) degS 4011009..4012166 (-) 1158 WP_006639267.1 sensor histidine kinase Regulator
  R5H20_RS20590 (R5H20_20590) - 4012388..4013026 (+) 639 WP_006639268.1 YigZ family protein -

Sequence


Protein


Length: 448 a.a.; Molecular weight: 50591.30 Da; Isoelectric point: 10.0116

>NTDB_id=897615 R5H20_RS20570 WP_006639265.1 4007737..4009083(-) (comFA) [Bacillus sp. KICET-3]
MLESRHLLKSELPFPDHVIEWHIQKGLIKTEKPIAKTAKGFICKRCGQDQQTFFAKYPCFICDKTCVYCRSCVMMGRVSE
CTPLLTWKDADLPKWPAVRMEWRGVLSEGQEKAASSIVEAIRKKEELLIWAVCGAGKTEILFQGIEFALTKGLRVCIATP
RTDVVLELAPRLKNAIKGVEIAALYGGSPDRGMLSPLMISTTHQLLRYKEAFDVIIVDEVDAFPYCLDKKLQYAVKKAGK
QQCTRIYLTATPSREMKRHVGSGRLKAVQIPARYHRSPLPEPEFAWCGNWKKRLERKNIPSAVKNWLFKHKELDQPVFLF
VPSIQTLQSAVRLLKKEHFNTAGVHADDPDRNEKVKQFRSGAFDILVTTTILERGVTVKKAQVGVLGAESAVFTESALVQ
ISGRAGRHPQFPTGAVCFFHFGKTVNMIAARRHIQQMNKMAKLENLID

Nucleotide


Length: 1347 bp

>NTDB_id=897615 R5H20_RS20570 WP_006639265.1 4007737..4009083(-) (comFA) [Bacillus sp. KICET-3]
ATGCTTGAGTCCCGCCACCTTCTCAAAAGCGAGTTGCCTTTTCCCGATCATGTGATTGAATGGCATATCCAAAAAGGTTT
GATAAAAACTGAAAAACCGATTGCAAAAACGGCAAAAGGCTTTATTTGCAAACGATGCGGACAGGATCAGCAGACGTTTT
TTGCTAAATACCCTTGTTTTATTTGTGATAAAACCTGCGTTTACTGCCGTTCATGCGTGATGATGGGGAGAGTAAGCGAA
TGTACGCCGCTCTTAACTTGGAAAGACGCTGACCTGCCCAAATGGCCGGCTGTCCGGATGGAGTGGAGAGGCGTTCTTTC
CGAGGGGCAGGAAAAAGCGGCAAGCTCGATTGTTGAAGCCATCCGCAAAAAAGAAGAGCTGTTAATTTGGGCGGTTTGCG
GTGCCGGTAAAACAGAGATTCTTTTTCAGGGAATAGAATTTGCTTTGACTAAAGGCTTGAGAGTATGTATAGCCACTCCG
AGAACAGATGTTGTTCTTGAGCTTGCCCCGAGATTAAAGAACGCTATTAAAGGAGTAGAAATTGCCGCTTTATACGGAGG
GAGTCCAGACAGGGGGATGCTCTCGCCTCTCATGATTTCTACCACCCACCAGCTTCTGCGCTACAAAGAAGCGTTCGATG
TGATCATTGTTGATGAGGTTGATGCATTTCCATATTGTTTGGATAAAAAGCTGCAATACGCCGTGAAAAAAGCGGGAAAG
CAGCAATGCACCCGAATATATTTAACGGCTACACCTTCACGGGAAATGAAACGGCATGTTGGATCCGGGAGGCTTAAGGC
CGTTCAGATTCCGGCAAGGTATCACAGAAGTCCATTGCCAGAACCGGAATTTGCATGGTGCGGCAACTGGAAAAAGAGAC
TTGAACGAAAAAACATTCCCTCCGCCGTGAAAAATTGGCTTTTCAAGCATAAAGAACTGGATCAGCCTGTCTTTTTATTT
GTGCCCTCGATTCAGACGCTTCAATCTGCGGTCAGGCTGTTGAAAAAGGAACATTTCAATACAGCGGGCGTACATGCGGA
TGACCCGGATAGAAATGAAAAGGTAAAGCAGTTTAGAAGCGGAGCGTTTGATATTCTCGTCACAACAACCATATTGGAAC
GGGGGGTAACGGTCAAAAAAGCCCAGGTCGGAGTGCTCGGTGCGGAGTCAGCCGTTTTTACAGAAAGCGCTCTTGTTCAA
ATATCGGGAAGAGCGGGGAGGCATCCGCAGTTTCCGACCGGAGCCGTTTGTTTTTTTCATTTCGGCAAGACGGTCAATAT
GATAGCCGCCCGCCGTCATATTCAACAAATGAATAAAATGGCTAAACTGGAAAATTTGATTGACTAG


Secondary structure


Protein secondary structure was predicted by S4PRED and visualized with seqviz.



3D structure


Source ID
  AlphaFold DB M5NZS0

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA Bacillus subtilis subsp. subtilis str. 168 61.136 98.214 0.6
  comFA/cflA Streptococcus mitis SK321 37.9 97.768 0.371