Detailed information    

In silico: bioinformatically predicted

Overview


Name: comFA
Type: Machinery gene
Locus tag: S101444_RS18120
Genome accession: NZ_CP021498
Coordinates: 3410703..3412094 (-)
Length: 463 a.a.
NCBI ID: WP_014481052.1
Uniprot ID: -
Organism: Bacillus subtilis subsp. subtilis strain SRCM101444
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
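
The coordinates and the protein length in this overview are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes 1-based, inclusive coordinates (the usual GenBank convention) and a coding sequence whose length includes the stop codon; the helper names are illustrative and are not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose length includes the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # subtract the stop codon


# Coordinates of comFA as listed in the overview (minus strand).
cds_bp = span_length(3410703, 3412094)
print(cds_bp, protein_length(cds_bp))  # 1392 463
```

The strand does not affect the length; it only determines which strand is read when the sequence is extracted.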

Genomic Context


Location: 3405703..3417094
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  S101444_RS18085 (S101444_03600) flgL 3405875..3406771 (-) 897 WP_087614741.1 flagellar hook-associated protein FlgL -
  S101444_RS18090 (S101444_03601) flgK 3406782..3408305 (-) 1524 WP_046160905.1 flagellar hook-associated protein FlgK -
  S101444_RS18095 (S101444_03602) flgN 3408324..3408806 (-) 483 WP_014481048.1 flagellar protein FlgN -
  S101444_RS18100 (S101444_03603) flgM 3408822..3409088 (-) 267 WP_014481049.1 flagellar biosynthesis anti-sigma factor FlgM -
  S101444_RS18105 (S101444_03604) yvyF 3409169..3409588 (-) 420 WP_024572694.1 TIGR03826 family flagellar region protein -
  S101444_RS18110 (S101444_03605) comFC 3409661..3410383 (-) 723 WP_014481051.1 comF operon protein ComFC Machinery gene
  S101444_RS18115 (S101444_03606) comFB 3410347..3410643 (-) 297 WP_003227989.1 late competence protein ComFB -
  S101444_RS18120 (S101444_03607) comFA 3410703..3412094 (-) 1392 WP_014481052.1 ATP-dependent helicase ComFA Machinery gene
  S101444_RS18125 (S101444_03608) fakBA 3412201..3413046 (-) 846 WP_014481053.1 DegV family protein -
  S101444_RS18130 (S101444_03609) degU 3413144..3413833 (-) 690 WP_014481054.1 two-component system response regulator DegU Regulator
  S101444_RS18135 (S101444_03610) degS 3413916..3415073 (-) 1158 WP_003227983.1 two-component sensor histidine kinase DegS Regulator
  S101444_RS18140 (S101444_03611) - 3415290..3415943 (+) 654 WP_014481055.1 YigZ family protein -
  S101444_RS18145 (S101444_03612) - 3415943..3416935 (+) 993 WP_014481056.1 LCP family protein -
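
The Location window above matches the comFA coordinates extended by 5,000 bp on each side, and the comFC and comFB coordinates overlap. The sketch below reproduces both observations. The 5,000 bp flank is inferred from the numbers rather than stated by the source, coordinates are assumed to be 1-based and inclusive, and the function names are hypothetical.

```python
def flanking_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a 1-based inclusive range by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two 1-based inclusive ranges."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)


com_fa = (3410703, 3412094)
com_fb = (3410347, 3410643)
com_fc = (3409661, 3410383)

print(flanking_window(*com_fa))    # (3405703, 3417094)
print(overlap_bp(com_fc, com_fb))  # 37
print(overlap_bp(com_fb, com_fa))  # 0
```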

Sequence


Protein


Length: 463 a.a.        Molecular weight: 52543.49 Da        Isoelectric Point: 9.7976

>NTDB_id=231162 S101444_RS18120 WP_014481052.1 3410703..3412094(-) (comFA) [Bacillus subtilis subsp. subtilis strain SRCM101444]
MNVPVEKNSSFSRELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRGYRCNRCGQTDQRYYSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWEEENEPNWQSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHVKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIFFHFGKTKSMLDARKHIKEMNELAAKVECTD
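
The FASTA header of this record packs several fields into one line. The sketch below shows one way to split it into named fields; the regular expression is written against the header shown in this entry only, and the field names are assumptions rather than an official schema.

```python
import re

# Pattern fitted to the header format seen in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+(?P<locus_tag>\S+)\s+(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into a dict; raises ValueError if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognized header format")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=231162 S101444_RS18120 WP_014481052.1 3410703..3412094(-) "
    "(comFA) [Bacillus subtilis subsp. subtilis strain SRCM101444]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```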

Nucleotide


Length: 1392 bp

>NTDB_id=231162 S101444_RS18120 WP_014481052.1 3410703..3412094(-) (comFA) [Bacillus subtilis subsp. subtilis strain SRCM101444]
GTGAATGTGCCAGTTGAAAAAAACAGTTCCTTTTCTAGAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGGGATATAGATGTAATAGGTGCGGCCAAACTGATCAGCGGTATTATTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGGAAGAGGAAAATGA
ACCAAACTGGCAGTCAATTAAACTGACATGGGACGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGGCT
CAAGGCTGCCTTTCAGGGGGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCCGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATGTAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGTGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATCTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTTCTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG
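
The nucleotide record is given in coding orientation: it begins with GTG and ends with the stop codon TAG, while the protein begins with M. In bacteria a GTG initiator codon is read as methionine, which the minimal sketch below reproduces with the standard codon table. Treating only ATG, GTG and TTG as start codons is a simplification, and the function name is illustrative.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(cds: str) -> str:
    """Translate a coding-strand sequence up to the first stop codon."""
    cds = cds.upper()
    residues = []
    for pos in range(0, len(cds) - 2, 3):
        amino = CODON_TABLE[cds[pos:pos + 3]]
        if amino == "*":
            break
        residues.append(amino)
    # Simplification: an initiator GTG or TTG is read as Met.
    if residues and cds[:3] in {"ATG", "GTG", "TTG"}:
        residues[0] = "M"
    return "".join(residues)


# First 30 nt and last 12 nt of the comFA record, used as separate excerpts.
print(translate_cds("GTGAATGTGCCAGTTGAAAAAAACAGTTCC"))  # MNVPVEKNSS
print(translate_cds("TGTACAGACTAG"))                    # CTD
```

Both excerpts agree with the start (MNVPVEKNSS) and the end (CTD) of the protein sequence listed for this entry.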


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA Bacillus subtilis subsp. subtilis str. 168 98.056 100 0.981
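
The Ha-value listed for this hit equals the identity fraction multiplied by the coverage fraction, rounded to three decimals. The sketch below assumes that definition; it matches the numbers in this row but is not stated in this record, and the function name is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values for the comFA hit from Bacillus subtilis subsp. subtilis str. 168.
print(round(ha_value(98.056, 100), 3))  # 0.981
```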


Multiple sequence alignment