Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA   Type   Machinery gene
Locus tag   S100757_RS18115 Genome accession   NZ_CP021499
Coordinates   3410741..3412132 (-) Length   463 a.a.
NCBI ID   WP_014481052.1    Uniprot ID   -
Organism   Bacillus subtilis subsp. subtilis strain SRCM100757     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3405741..3417132
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  S100757_RS18080 (S100757_03587) flgL 3405913..3406809 (-) 897 WP_087614741.1 flagellar hook-associated protein FlgL -
  S100757_RS18085 (S100757_03588) flgK 3406820..3408343 (-) 1524 WP_046160905.1 flagellar hook-associated protein FlgK -
  S100757_RS18090 (S100757_03589) flgN 3408362..3408844 (-) 483 WP_014481048.1 flagellar protein FlgN -
  S100757_RS18095 (S100757_03590) flgM 3408860..3409126 (-) 267 WP_014481049.1 flagellar biosynthesis anti-sigma factor FlgM -
  S100757_RS18100 (S100757_03591) yvyF 3409207..3409626 (-) 420 WP_024572694.1 TIGR03826 family flagellar region protein -
  S100757_RS18105 (S100757_03592) comFC 3409699..3410421 (-) 723 WP_014481051.1 comF operon protein ComFC Machinery gene
  S100757_RS18110 (S100757_03593) comFB 3410385..3410681 (-) 297 WP_003227989.1 late competence protein ComFB -
  S100757_RS18115 (S100757_03594) comFA 3410741..3412132 (-) 1392 WP_014481052.1 ATP-dependent helicase ComFA Machinery gene
  S100757_RS18120 (S100757_03595) fakBA 3412239..3413084 (-) 846 WP_014481053.1 DegV family protein -
  S100757_RS18125 (S100757_03596) degU 3413182..3413871 (-) 690 WP_014481054.1 two-component system response regulator DegU Regulator
  S100757_RS18130 (S100757_03597) degS 3413954..3415111 (-) 1158 WP_003227983.1 two-component sensor histidine kinase DegS Regulator
  S100757_RS18135 (S100757_03598) - 3415328..3415981 (+) 654 WP_014481055.1 YigZ family protein -
  S100757_RS18140 (S100757_03599) - 3415981..3416973 (+) 993 WP_014481056.1 LCP family protein -

Sequence


Protein


Download         Length: 463 a.a.        Molecular weight: 52543.49 Da        Isoelectric Point: 9.7976

>NTDB_id=231242 S100757_RS18115 WP_014481052.1 3410741..3412132(-) (comFA) [Bacillus subtilis subsp. subtilis strain SRCM100757]
MNVPVEKNSSFSRELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRGYRCNRCGQTDQRYYSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWEEENEPNWQSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHVKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIFFHFGKTKSMLDARKHIKEMNELAAKVECTD

Nucleotide


Download         Length: 1392 bp        

>NTDB_id=231242 S100757_RS18115 WP_014481052.1 3410741..3412132(-) (comFA) [Bacillus subtilis subsp. subtilis strain SRCM100757]
GTGAATGTGCCAGTTGAAAAAAACAGTTCCTTTTCTAGAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGGGATATAGATGTAATAGGTGCGGCCAAACTGATCAGCGGTATTATTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGGAAGAGGAAAATGA
ACCAAACTGGCAGTCAATTAAACTGACATGGGACGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGGCT
CAAGGCTGCCTTTCAGGGGGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCCGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATGTAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGTGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATCTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTTCTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA Bacillus subtilis subsp. subtilis str. 168

98.056

100

0.981


Multiple sequence alignment