Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA   Type   Machinery gene
Locus tag   D9C22_RS19020 Genome accession   NZ_CP033064
Coordinates   3617507..3618898 (-) Length   463 a.a.
NCBI ID   WP_041850475.1    Uniprot ID   -
Organism   Bacillus sp. WR11     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3612507..3623898
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  D9C22_RS18985 (D9C22_18990) flgL 3612678..3613574 (-) 897 WP_015714797.1 flagellar hook-associated protein FlgL -
  D9C22_RS18990 (D9C22_18995) flgK 3613585..3615108 (-) 1524 WP_003228001.1 flagellar hook-associated protein FlgK -
  D9C22_RS18995 (D9C22_19000) flgN 3615127..3615609 (-) 483 WP_014481048.1 flagellar protein FlgN -
  D9C22_RS19000 (D9C22_19005) flgM 3615625..3615891 (-) 267 WP_014481049.1 flagellar biosynthesis anti-sigma factor FlgM -
  D9C22_RS19005 (D9C22_19010) - 3615972..3616391 (-) 420 WP_003227995.1 TIGR03826 family flagellar region protein -
  D9C22_RS19010 (D9C22_19015) comFC 3616465..3617187 (-) 723 WP_014481051.1 comF operon protein ComFC Machinery gene
  D9C22_RS19015 (D9C22_19020) comFB 3617151..3617447 (-) 297 WP_003227989.1 late competence protein ComFB -
  D9C22_RS19020 (D9C22_19025) comFA 3617507..3618898 (-) 1392 WP_041850475.1 ATP-dependent helicase ComFA Machinery gene
  D9C22_RS19025 (D9C22_19030) - 3619004..3619849 (-) 846 WP_003227986.1 DegV family protein -
  D9C22_RS19030 (D9C22_19035) degU 3619947..3620636 (-) 690 WP_003219701.1 two-component system response regulator DegU Regulator
  D9C22_RS19035 (D9C22_19040) degS 3620719..3621876 (-) 1158 WP_003227983.1 two-component sensor histidine kinase DegS Regulator
  D9C22_RS19040 (D9C22_19045) - 3622093..3622746 (+) 654 WP_003227979.1 YigZ family protein -

Sequence


Protein


Download         Length: 463 a.a.        Molecular weight: 52485.50 Da        Isoelectric Point: 9.9185

>NTDB_id=321098 D9C22_RS19020 WP_041850475.1 3617507..3618898(-) (comFA) [Bacillus sp. WR11]
MNVPVEKNSSFSRELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRGYRCNRCGQTDQRYYSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWEEENAPNWKSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTIVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHVKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIFFHFGKTKSMLDARKHIKEMNELAAKVECTD

Nucleotide


Download         Length: 1392 bp        

>NTDB_id=321098 D9C22_RS19020 WP_041850475.1 3617507..3618898(-) (comFA) [Bacillus sp. WR11]
GTGAATGTGCCAGTTGAAAAAAACAGTTCCTTTTCTAGAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGAAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGGGATATAGATGTAATAGGTGCGGCCAAACTGATCAGCGGTATTATTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGGAAGAGGAAAATGC
ACCAAACTGGAAGTCAATTAAACTGACATGGGACGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGGCT
CAAGGCTGCCTTTCAGGGTGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCTGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCATCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATGTAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGTGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATCTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGTGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTTCTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA Bacillus subtilis subsp. subtilis str. 168

97.84

100

0.978


Multiple sequence alignment