Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   GFX43_RS08950 Genome accession   NZ_CP046448
Coordinates   1731600..1733930 (+) Length   776 a.a.
NCBI ID   WP_041517945.1    Uniprot ID   -
Organism   Bacillus subtilis strain ZD01     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1726600..1738930
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GFX43_RS08910 (GFX43_008910) yhbY 1726837..1727127 (+) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -
  GFX43_RS08915 (GFX43_008915) nadD 1727139..1727708 (+) 570 WP_004398676.1 nicotinate-nucleotide adenylyltransferase -
  GFX43_RS08920 (GFX43_008920) yqeK 1727698..1728258 (+) 561 WP_015483448.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  GFX43_RS08925 (GFX43_008925) rsfS 1728276..1728632 (+) 357 WP_014480315.1 ribosome silencing factor -
  GFX43_RS08930 (GFX43_008930) yqeM 1728629..1729372 (+) 744 WP_015384134.1 class I SAM-dependent methyltransferase -
  GFX43_RS08935 (GFX43_008935) comER 1729438..1730259 (-) 822 WP_004398597.1 late competence protein ComER -
  GFX43_RS08940 (GFX43_008940) comEA 1730343..1730960 (+) 618 WP_015384132.1 competence protein ComEA Machinery gene
  GFX43_RS08945 (GFX43_008945) comEB 1731027..1731596 (+) 570 WP_003229978.1 ComE operon protein 2 -
  GFX43_RS08950 (GFX43_008950) comEC 1731600..1733930 (+) 2331 WP_041517945.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  GFX43_RS08955 (GFX43_008955) yqzM 1733970..1734104 (-) 135 WP_003229983.1 YqzM family protein -
  GFX43_RS08960 (GFX43_008960) - 1734145..1734294 (+) 150 WP_003229985.1 hypothetical protein -
  GFX43_RS08965 (GFX43_008965) holA 1734334..1735377 (+) 1044 WP_015384130.1 DNA polymerase III subunit delta -
  GFX43_RS08970 (GFX43_008970) rpsT 1735392..1735658 (-) 267 WP_003229989.1 30S ribosomal protein S20 -
  GFX43_RS08975 (GFX43_008975) gpr 1735862..1736968 (+) 1107 WP_014480309.1 GPR endopeptidase -
  GFX43_RS08980 (GFX43_008980) spoIIP 1737031..1738236 (+) 1206 WP_015384129.1 spore autolysin SpoIIP -
  GFX43_RS08985 (GFX43_008985) yqxA 1738253..1738591 (+) 339 WP_014477382.1 YqxA family protein -

Sequence


Protein


Download         Length: 776 a.a.        Molecular weight: 86647.97 Da        Isoelectric Point: 6.8607

>NTDB_id=404392 GFX43_RS08950 WP_041517945.1 1731600..1733930(+) (comEC) [Bacillus subtilis strain ZD01]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDNIPKIDGDRMSMMVETPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAASSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFQQVKTSLGQLTIVSLIAQLGSLPILLYHFHKFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSASFGRLFFSWFDLLISWTNRLITNIADVDVFTIMIAHPAPVLLFLFTVTILLLLMAIEKRSLSQLMVTGG
ICCTVLFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAETLLKYHKVKRLVIPKGFVSEPKDEKVLQTAREEGVTIEEVKRGDVLQIKDLQFHVLSPE
APDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMDVFPNIKADVLKVGHHGSKGSTGEEFTQQLQPETAIISAGKN
NRYHHPHQEVLQLLQRHSIRVLRTDQNGTIQYRYKNRGGTFSVYPPYDTSDITETN

Nucleotide


Download         Length: 2331 bp        

>NTDB_id=404392 GFX43_RS08950 WP_041517945.1 1731600..1733930(+) (comEC) [Bacillus subtilis strain ZD01]
ATGCGTAATTCGCGATTGTTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTTATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACTTATCAATTC
AAGGCAGTGATTGACAATATTCCTAAAATTGACGGCGACCGTATGTCTATGATGGTTGAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGTGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATCCAAAACTGCAGCGAACCTGAAAATTTCAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTCTGCCTCCTGATTCGGCAGGGATTGTACAGGCACTTACAGTTGGTGACAGAT
TTTATGTGGAAGATGAAGTGCTTACTGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCAATATCAGGGCTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATACGCCTTGGTATAACAAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTTCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGCCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCAGCA
GGTTAAAACCTCCTTGGGGCAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTATATCATT
TTCATAAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGCTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCTGATGTTGATGTATTCACGATTATGATCGCACATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTCACGATCCTCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACTGGAGGC
ATTTGCTGCACGGTGCTGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGTCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAACAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGACTCTGCTGAAGTATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGACAGCCAGAGAAG
AGGGAGTGACAATTGAAGAGGTGAAGCGAGGCGATGTATTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCTGAA
GCACCTGATCCGGCAAGCAAAAATAATTCCTCTCTCGTTCTGTGGATGGAGACGGGCGGTATGAGCTGGATCTTGACGGG
TGATCTGGAGAAAGAAGGGGAACAGGAGGTGATGGACGTGTTTCCAAATATTAAAGCAGATGTCTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCACCCAACAGCTTCAGCCGGAAACGGCCATTATCTCAGCTGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGATCAAAA
CGGAACGATCCAATATAGATACAAAAACAGAGGTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

97.809

100

0.978


Multiple sequence alignment