Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comEC
Type: Machinery gene
Locus tag: LK685_RS08140
Genome accession: NZ_CP086061
Coordinates: 1565104..1567434 (+)
Length: 776 a.a.
NCBI ID: WP_229762702.1
UniProt ID: -
Organism: Bacillus sp. BC1-43
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
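
The coordinates and lengths reported for this gene are mutually consistent, and the arithmetic can be checked with a short sketch. The Python below is illustrative only: the function names `span_length` and `protein_length` are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
import re

def span_length(coords: str) -> int:
    """Length in bp of a 'start..end' span, assuming 1-based inclusive coordinates."""
    m = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*(?:\([+-]\))?\s*", coords)
    if m is None:
        raise ValueError(f"unrecognized coordinate string: {coords!r}")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1

def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS, assuming one stop codon that is not translated."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1

bp = span_length("1565104..1567434 (+)")
print(bp, protein_length(bp))  # 2331 776
```

Both values agree with the record: 2331 bp for the gene and 776 a.a. for the protein.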

Genomic Context


Location: 1560104..1572434
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LK685_RS08100 (LK685_08105) yhbY 1560341..1560631 (+) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -
  LK685_RS08105 (LK685_08110) - 1560643..1561212 (+) 570 WP_004398676.1 nicotinate-nucleotide adenylyltransferase -
  LK685_RS08110 (LK685_08115) yqeK 1561202..1561762 (+) 561 WP_229762701.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  LK685_RS08115 (LK685_08120) rsfS 1561780..1562136 (+) 357 WP_041333759.1 ribosome silencing factor -
  LK685_RS08120 (LK685_08125) - 1562133..1562876 (+) 744 WP_003229973.1 class I SAM-dependent methyltransferase -
  LK685_RS08125 (LK685_08130) comER 1562942..1563763 (-) 822 WP_014480313.1 late competence protein ComER -
  LK685_RS08130 (LK685_08135) comEA 1563847..1564464 (+) 618 WP_004398514.1 competence protein ComEA Machinery gene
  LK685_RS08135 (LK685_08140) - 1564531..1565100 (+) 570 WP_003229978.1 ComE operon protein 2 -
  LK685_RS08140 (LK685_08145) comEC 1565104..1567434 (+) 2331 WP_229762702.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  LK685_RS08145 (LK685_08150) - 1567474..1567608 (-) 135 WP_003229983.1 YqzM family protein -
  LK685_RS08150 (LK685_08155) - 1567649..1567798 (+) 150 WP_003229985.1 hypothetical protein -
  LK685_RS08155 (LK685_08160) holA 1567838..1568881 (+) 1044 WP_029317888.1 DNA polymerase III subunit delta -
  LK685_RS08160 (LK685_08165) rpsT 1568896..1569162 (-) 267 WP_131227637.1 30S ribosomal protein S20 -
  LK685_RS08165 (LK685_08170) gpr 1569366..1570472 (+) 1107 WP_014480309.1 GPR endopeptidase -
  LK685_RS08170 (LK685_08175) spoIIP 1570535..1571740 (+) 1206 WP_014477383.1 spore autolysin SpoIIP -
  LK685_RS08175 (LK685_08180) - 1571757..1572095 (+) 339 WP_014477382.1 YqxA family protein -
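
The Location range above equals the comEC coordinates extended by 5000 bp on each side. The sketch below reproduces that window and computes the spacing between neighboring genes from the table. It is illustrative only: the 5000 bp flank is inferred from the numbers on this page rather than stated by it, and the function names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from Location vs. gene coordinates

def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Window around a gene, clamped so it never starts before position 1."""
    return max(1, start - flank), end + flank

def intergenic_gap(upstream_end: int, downstream_start: int) -> int:
    """Bases strictly between two features; a negative value means they overlap."""
    return downstream_start - upstream_end - 1

# comEC window
print(context_window(1565104, 1567434))  # (1560104, 1572434)

# ComE operon protein 2 (ends 1565100) and comEC (starts 1565104)
print(intergenic_gap(1565100, 1565104))  # 3

# rsfS (ends 1562136) and the methyltransferase gene (starts 1562133)
print(intergenic_gap(1562136, 1562133))  # -4, i.e. a 4 bp overlap
```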

Sequence


Protein


Length: 776 a.a.        Molecular weight: 86771.27 Da        Isoelectric point: 6.9842

>NTDB_id=622170 LK685_RS08140 WP_229762702.1 1565104..1567434(+) (comEC) [Bacillus sp. BC1-43]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVETPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFHQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSVSFGRLFFSWFDLLISWTNRLITNIADVEVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVMFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAETLLKHHKVKRLVIPKGFVSEPKDEKVLQTAREEGVTIEEVKRGDVLQIKDLQFHVLSPE
TPDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMDVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQEVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN
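
The FASTA header of this record packs several fields into one line. A minimal parsing sketch follows; the field layout is inferred from this single header and is an assumption, not a documented format, and the names `HEADER_RE` and `parse_header` are hypothetical.

```python
import re

# Layout assumed from the header on this page:
# >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)

def parse_header(line: str) -> dict:
    """Split a header line into named fields; start and end are returned as int."""
    m = HEADER_RE.match(line)
    if m is None:
        raise ValueError("header does not match the assumed layout")
    fields = m.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields

header = (">NTDB_id=622170 LK685_RS08140 WP_229762702.1 "
          "1565104..1567434(+) (comEC) [Bacillus sp. BC1-43]")
record = parse_header(header)
print(record["gene"], record["organism"])
```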

Nucleotide


Length: 2331 bp

>NTDB_id=622170 LK685_RS08140 WP_229762702.1 1565104..1567434(+) (comEC) [Bacillus sp. BC1-43]
ATGCGTAATTCGCGTTTGTTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTCATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTGAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGTGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATTCAAAACTGCAGCGAACCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTTTGCCTCCTGATTCGGCAGGGATTGTACAGGCACTTACAGTTGGTGACAGAT
TTTATGTGGAGGATGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCAATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATTCGCCTTGGTATAACTAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGTGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGTCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCATCA
GGTTAAAACCTCCTTGGGGCAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTATATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGTTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGATGTTGAAGTGTTCACGATTATGATCGCACATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTCACGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACCGGAGGC
ATTTGCTGCACGGTGATGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGTCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAACAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGACTCTACTGAAGCATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAGGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGACAGCCAGAGAAG
AGGGAGTGACAATTGAAGAGGTGAAGCGAGGCGATGTATTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCTGAA
ACACCTGATCCGGCAAGCAAAAATAATTCCTCTCTCGTTCTGTGGATGGAGACGGGCGGTATGAGCTGGATCTTGACGGG
TGACCTGGAGAAAGAAGGGGAACAGGAGGTGATGGACGTGTTTCCAAATATTAAAGCAGATGTCTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGATCAAAA
CGGAACGATCCAATATAGATACAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA
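
The nucleotide record above encodes the protein record, which can be spot-checked by translating its first and last codons. The sketch below is illustrative only: it uses the standard codon assignments (the bacterial table assigns the same amino acids to every codon and differs only in permitted start codons), and the `translate` helper is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated as TTT, TTC, TTA, TTG, TCT, ... (standard code)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame DNA string; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First ten codons and last four codons of the nucleotide record
print(translate("ATGCGTAATTCGCGTTTGTTATTGCCTATG"))  # MRNSRLLLPM
print(translate("GAGACGAACTAA"))                    # ETN*
```

The outputs match the start (MRNSRLLLPM) and the end (ETN, followed by the stop codon) of the protein sequence.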


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168 98.454 100 0.985
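
The page does not define the Ha-value. One reading that reproduces the figures in this row is the product of fractional identity and fractional coverage. The sketch below encodes that reading as an explicit assumption; it is not a definition taken from the source, and `ha_value` is a hypothetical name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: fractional identity multiplied by fractional coverage."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)

# Values from the comEC / Bacillus subtilis 168 row
print(round(ha_value(98.454, 100), 3))  # 0.985
```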