Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA   Type   Machinery gene
Locus tag   S100761_RS18115 Genome accession   NZ_CP021889
Coordinates   3410728..3412119 (-) Length   463 a.a.
NCBI ID   WP_014481052.1    Uniprot ID   -
Organism   Bacillus subtilis subsp. subtilis strain SRCM100761     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3405728..3417119
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  S100761_RS18080 (S100761_03599) flgL 3405900..3406796 (-) 897 WP_087614741.1 flagellar hook-associated protein FlgL -
  S100761_RS18085 (S100761_03600) flgK 3406807..3408330 (-) 1524 WP_046160905.1 flagellar hook-associated protein FlgK -
  S100761_RS18090 (S100761_03601) flgN 3408349..3408831 (-) 483 WP_014481048.1 flagellar protein FlgN -
  S100761_RS18095 (S100761_03602) flgM 3408847..3409113 (-) 267 WP_014481049.1 flagellar biosynthesis anti-sigma factor FlgM -
  S100761_RS18100 (S100761_03603) yvyF 3409194..3409613 (-) 420 WP_024572694.1 TIGR03826 family flagellar region protein -
  S100761_RS18105 (S100761_03604) comFC 3409686..3410408 (-) 723 WP_014481051.1 comF operon protein ComFC Machinery gene
  S100761_RS18110 (S100761_03605) comFB 3410372..3410668 (-) 297 WP_003227989.1 late competence protein ComFB -
  S100761_RS18115 (S100761_03606) comFA 3410728..3412119 (-) 1392 WP_014481052.1 ATP-dependent helicase ComFA Machinery gene
  S100761_RS18120 (S100761_03607) fakBA 3412226..3413071 (-) 846 WP_014481053.1 DegV family protein -
  S100761_RS18125 (S100761_03608) degU 3413169..3413858 (-) 690 WP_014481054.1 two-component system response regulator DegU Regulator
  S100761_RS18130 (S100761_03609) degS 3413941..3415098 (-) 1158 WP_003227983.1 two-component sensor histidine kinase DegS Regulator
  S100761_RS18135 (S100761_03610) - 3415315..3415968 (+) 654 WP_014481055.1 YigZ family protein -
  S100761_RS18140 (S100761_03611) - 3415968..3416960 (+) 993 WP_014481056.1 LCP family protein -

Sequence


Protein


Download         Length: 463 a.a.        Molecular weight: 52543.49 Da        Isoelectric Point: 9.7976

>NTDB_id=234485 S100761_RS18115 WP_014481052.1 3410728..3412119(-) (comFA) [Bacillus subtilis subsp. subtilis strain SRCM100761]
MNVPVEKNSSFSRELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRGYRCNRCGQTDQRYYSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWEEENEPNWQSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHVKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIFFHFGKTKSMLDARKHIKEMNELAAKVECTD

Nucleotide


Download         Length: 1392 bp        

>NTDB_id=234485 S100761_RS18115 WP_014481052.1 3410728..3412119(-) (comFA) [Bacillus subtilis subsp. subtilis strain SRCM100761]
GTGAATGTGCCAGTTGAAAAAAACAGTTCCTTTTCTAGAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGGGATATAGATGTAATAGGTGCGGCCAAACTGATCAGCGGTATTATTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGGAAGAGGAAAATGA
ACCAAACTGGCAGTCAATTAAACTGACATGGGACGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGGCT
CAAGGCTGCCTTTCAGGGGGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCCGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATGTAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGTGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATCTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTTCTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA Bacillus subtilis subsp. subtilis str. 168

98.056

100

0.981


Multiple sequence alignment