Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA   Type   Machinery gene
Locus tag   S100333_RS18725 Genome accession   NZ_CP021892
Coordinates   3517968..3519359 (-) Length   463 a.a.
NCBI ID   WP_014481052.1    Uniprot ID   -
Organism   Bacillus subtilis subsp. subtilis strain SRCM100333     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3512968..3524359
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  S100333_RS18690 (S100333_03816) flgL 3513140..3514036 (-) 897 WP_014481046.1 flagellar hook-associated protein FlgL -
  S100333_RS18695 (S100333_03817) flgK 3514047..3515570 (-) 1524 WP_014481047.1 flagellar hook-associated protein FlgK -
  S100333_RS18700 (S100333_03818) flgN 3515589..3516071 (-) 483 WP_014481048.1 flagellar protein FlgN -
  S100333_RS18705 (S100333_03819) flgM 3516087..3516353 (-) 267 WP_014481049.1 flagellar biosynthesis anti-sigma factor FlgM -
  S100333_RS18710 (S100333_03820) yvyF 3516434..3516853 (-) 420 WP_038429728.1 TIGR03826 family flagellar region protein -
  S100333_RS18715 (S100333_03821) comFC 3516926..3517648 (-) 723 WP_014481051.1 comF operon protein ComFC Machinery gene
  S100333_RS18720 (S100333_03822) comFB 3517612..3517908 (-) 297 WP_003227989.1 late competence protein ComFB -
  S100333_RS18725 (S100333_03823) comFA 3517968..3519359 (-) 1392 WP_014481052.1 ATP-dependent helicase ComFA Machinery gene
  S100333_RS18730 (S100333_03824) fakBA 3519466..3520311 (-) 846 WP_014481053.1 DegV family protein -
  S100333_RS18735 (S100333_03825) degU 3520409..3521098 (-) 690 WP_014481054.1 two-component system response regulator DegU Regulator
  S100333_RS18740 (S100333_03826) degS 3521181..3522338 (-) 1158 WP_003227983.1 two-component sensor histidine kinase DegS Regulator
  S100333_RS18745 (S100333_03827) - 3522555..3523208 (+) 654 WP_014481055.1 YigZ family protein -
  S100333_RS18750 (S100333_03828) - 3523208..3524172 (+) 965 Protein_3647 LCP family protein -

Sequence


Protein


Download         Length: 463 a.a.        Molecular weight: 52543.49 Da        Isoelectric Point: 9.7976

>NTDB_id=234562 S100333_RS18725 WP_014481052.1 3517968..3519359(-) (comFA) [Bacillus subtilis subsp. subtilis strain SRCM100333]
MNVPVEKNSSFSRELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRGYRCNRCGQTDQRYYSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWEEENEPNWQSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHVKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIFFHFGKTKSMLDARKHIKEMNELAAKVECTD

Nucleotide


Download         Length: 1392 bp        

>NTDB_id=234562 S100333_RS18725 WP_014481052.1 3517968..3519359(-) (comFA) [Bacillus subtilis subsp. subtilis strain SRCM100333]
GTGAATGTGCCAGTTGAAAAAAACAGTTCCTTTTCTAGAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGGGATATAGATGTAATAGGTGCGGCCAAACTGATCAGCGGTATTATTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGGAAGAGGAAAATGA
ACCAAACTGGCAGTCAATTAAACTGACATGGGACGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGGCT
CAAGGCTGCCTTTCAGGGGGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCCGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATGTAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGTGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATCTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTTCTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA Bacillus subtilis subsp. subtilis str. 168

98.056

100

0.981


Multiple sequence alignment