Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comEC
Type: Machinery gene
Locus tag: AAVB74_RS09995
Genome accession: NZ_CP154920
Coordinates: 1867469..1869799 (+)
Length: 776 a.a.
NCBI ID: WP_345806062.1
UniProt ID: -
Organism: Bacillus subtilis strain FUA2232
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
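The coordinates and the protein length listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the gene is a stop codon; the helper name `parse_coordinates` is hypothetical and not part of any database API.

```python
def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand).

    Hypothetical helper for illustration; assumes 1-based inclusive coordinates.
    """
    span, strand = text.split()
    start, end = (int(part) for part in span.split(".."))
    return start, end, strand.strip("()")


start, end, strand = parse_coordinates("1867469..1869799 (+)")

# Inclusive coordinates: both endpoints count toward the length.
gene_length_bp = end - start + 1

# Every three bases form one codon; the last codon is assumed to be a stop.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under these assumptions the gene length matches the 2331 bp reported in the Nucleotide section, and the protein length matches the 776 a.a. listed here.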

Genomic Context


Location: 1862469..1874799
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
AAVB74_RS09955 (AAVB74_09955) | yhbY | 1862719..1863009 (+) | 291 | WP_003226133.1 | ribosome assembly RNA-binding protein YhbY | -
AAVB74_RS09960 (AAVB74_09960) | nadD | 1863021..1863590 (+) | 570 | WP_004398676.1 | nicotinate-nucleotide adenylyltransferase | -
AAVB74_RS09965 (AAVB74_09965) | yqeK | 1863580..1864140 (+) | 561 | WP_029317884.1 | bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK | -
AAVB74_RS09970 (AAVB74_09970) | rsfS | 1864158..1864508 (+) | 351 | WP_345806061.1 | ribosome silencing factor | -
AAVB74_RS09975 (AAVB74_09975) | yqeM | 1864505..1865248 (+) | 744 | WP_106073582.1 | class I SAM-dependent methyltransferase | -
AAVB74_RS09980 (AAVB74_09980) | comER | 1865314..1866135 (-) | 822 | WP_004398597.1 | late competence protein ComER | -
AAVB74_RS09985 (AAVB74_09985) | comEA | 1866219..1866836 (+) | 618 | WP_004398514.1 | competence protein ComEA | Machinery gene
AAVB74_RS09990 (AAVB74_09990) | comEB | 1866903..1867472 (+) | 570 | WP_015714288.1 | ComE operon protein 2 | -
AAVB74_RS09995 (AAVB74_09995) | comEC | 1867469..1869799 (+) | 2331 | WP_345806062.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene
AAVB74_RS10000 (AAVB74_10000) | yqzM | 1869839..1869973 (-) | 135 | WP_003229983.1 | YqzM family protein | -
AAVB74_RS10005 (AAVB74_10005) | - | 1870014..1870163 (+) | 150 | WP_003229985.1 | hypothetical protein | -
AAVB74_RS10010 (AAVB74_10010) | holA | 1870203..1871246 (+) | 1044 | WP_345806063.1 | DNA polymerase III subunit delta | -
AAVB74_RS10015 (AAVB74_10015) | rpsT | 1871261..1871527 (-) | 267 | WP_003229989.1 | 30S ribosomal protein S20 | -
AAVB74_RS10020 (AAVB74_10020) | gpr | 1871731..1872837 (+) | 1107 | WP_014480309.1 | GPR endopeptidase | -
AAVB74_RS10025 (AAVB74_10025) | spoIIP | 1872900..1874105 (+) | 1206 | WP_014477383.1 | spore autolysin SpoIIP | -
AAVB74_RS10030 (AAVB74_10030) | yqxA | 1874122..1874460 (+) | 339 | WP_024572265.1 | YqxA family protein | -
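The numbers in the genomic context can be checked against each other. The sketch below assumes 1-based inclusive coordinates and uses only values copied from the listing; the helper names `span_length` and `overlap_bp` are hypothetical. It suggests that the displayed location extends 5000 bp on either side of comEC (an inference from the numbers, not a documented rule) and that comEB and comEC overlap by a few bases.

```python
# Coordinates copied from the listing above (1-based, inclusive).
WINDOW = (1862469, 1874799)
GENES = {
    # name: (start, end, reported size in bp)
    "comEA": (1866219, 1866836, 618),
    "comEB": (1866903, 1867472, 570),
    "comEC": (1867469, 1869799, 2331),
}


def span_length(start, end):
    """Length of an inclusive interval."""
    return end - start + 1


def overlap_bp(a, b):
    """Number of positions shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Each reported size should equal end - start + 1.
sizes_ok = all(span_length(s, e) == size for s, e, size in GENES.values())

# Distance from the window edges to the comEC gene boundaries.
upstream_flank = GENES["comEC"][0] - WINDOW[0]
downstream_flank = WINDOW[1] - GENES["comEC"][1]

# comEB ends after comEC starts, so the two reading frames overlap.
comEB_comEC_overlap = overlap_bp(GENES["comEB"], GENES["comEC"])

print(sizes_ok, upstream_flank, downstream_flank, comEB_comEC_overlap)
```

Short overlaps of this kind between adjacent genes on the same strand are common in bacterial operons, but the sketch only confirms the arithmetic, not any biological interpretation.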

Sequence


Protein


Length: 776 a.a.        Molecular weight: 86625.24 Da        Isoelectric Point: 7.0493

>NTDB_id=995452 AAVB74_RS09995 WP_345806062.1 1867469..1869799(+) (comEC) [Bacillus subtilis strain FUA2232]
MMNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMVVKTPDKEKWAAAYRILSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSDPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMAGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFQQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSASFGRLFFSWFDLLISWTNRLITNIADVDVFTIMIAHPAPVLLFLFTVAIILLLMAIEKRSLSQLMITGG
ICCTVMFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAETLLKHHKVKRLVIPKGFVSEPKDEKVLQTAREEGVTIEEVKRGDVLQIKDLQFHVLSPE
TPDPASKNNSSLVLWMETGVLSWILTGDLEKEGEQEVMDVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQEVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN

Nucleotide


Length: 2331 bp

>NTDB_id=995452 AAVB74_RS09995 WP_345806062.1 1867469..1869799(+) (comEC) [Bacillus subtilis strain FUA2232]
GTGATGAATTCGCGTTTGTTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTTATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGGTGGTTAAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCTGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGAGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATTCAAAACTGCAGCGACCCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTTTGCCTCCTGATTCGGCAGGGATTGTACAGGCACTTACAGTTGGTGACAGAT
TTTATGTGGAGGACGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTTTTGGCAATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATTCGCCTTGGTATAACTAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGGCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGCCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCAGCA
GGTTAAAACCTCCTTGGGACAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTTCTATATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGCTTCGTTTGGGAGATTGTTTTTCAGTTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGATGTTGATGTGTTCACGATTATGATCGCCCATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTTGCGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGATAACCGGCGGC
ATTTGCTGCACGGTGATGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGTCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAGCAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGACTCTGCTGAAGCATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGACAGCCAGAGAAG
AGGGAGTGACAATTGAAGAGGTGAAGCGAGGCGATGTATTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCTGAA
ACACCTGATCCGGCAAGCAAAAATAATTCCTCTCTCGTTCTGTGGATGGAGACGGGCGTTCTGAGCTGGATTTTGACGGG
TGATCTGGAGAAAGAAGGGGAACAGGAGGTGATGGACGTGTTTCCAAATATTAAAGCAGATGTCTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGATCAAAA
CGGAACGATCCAATATAGATACAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA
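The nucleotide record begins with GTG rather than ATG, while the protein record begins with M. The sketch below illustrates how the two can be reconciled, assuming the bacterial convention that an initiator codon (ATG, GTG or TTG) is read as methionine; the function name `translate` and its `starts_at_initiator` flag are hypothetical. Only the first 30 and the last 12 bases of the sequence above are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: these codons are read as Met when they initiate translation.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, starts_at_initiator=False):
    """Translate DNA codon by codon, dropping trailing stop codons."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if starts_at_initiator and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues).rstrip("*")


# First 30 bases and last 12 bases of the nucleotide record above.
head = translate("GTGATGAATTCGCGTTTGTTATTGCCTATG", starts_at_initiator=True)
tail = translate("GAGACGAACTAA")

print(head, tail)
```

Under this assumption the translated head and tail agree with the start (MMNSRLLLPM) and end (ETN) of the protein record.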


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comEC | Bacillus subtilis subsp. subtilis str. 168 | 97.552 | 100 | 0.976

