Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   AB2M34_RS08755 Genome accession   NZ_CP161903
Coordinates   1649545..1651875 (+) Length   776 a.a.
NCBI ID   WP_046160607.1    Uniprot ID   -
Organism   Bacillus subtilis strain DYJ24     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1613746..1653323 1649545..1651875 within 0


Gene organization within MGE regions


Location: 1613746..1653323
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AB2M34_RS08510 (AB2M34_08510) - 1613746..1613926 (+) 181 Protein_1667 hypothetical protein -
  AB2M34_RS08515 (AB2M34_08515) - 1614235..1614465 (+) 231 WP_224588644.1 hypothetical protein -
  AB2M34_RS08520 (AB2M34_08520) - 1614610..1614858 (+) 249 Protein_1669 hypothetical protein -
  AB2M34_RS08525 (AB2M34_08525) - 1614821..1614943 (+) 123 Protein_1670 RusA family crossover junction endodeoxyribonuclease -
  AB2M34_RS08530 (AB2M34_08530) - 1615026..1615178 (+) 153 WP_049832653.1 XtrA/YqaO family protein -
  AB2M34_RS08535 (AB2M34_08535) - 1615317..1615583 (-) 267 WP_033881358.1 hypothetical protein -
  AB2M34_RS08540 (AB2M34_08540) - 1616200..1616679 (+) 480 WP_014480344.1 hypothetical protein -
  AB2M34_RS08545 (AB2M34_08545) istA 1616994..1618541 (+) 1548 WP_014480339.1 IS21 family transposase -
  AB2M34_RS08550 (AB2M34_08550) istB 1618538..1619296 (+) 759 WP_014479891.1 IS21-like element helper ATPase IstB -
  AB2M34_RS08555 (AB2M34_08555) - 1619558..1619863 (-) 306 WP_123772463.1 hypothetical protein -
  AB2M34_RS08560 (AB2M34_08560) terS 1619990..1620555 (+) 566 Protein_1677 phage terminase small subunit -
  AB2M34_RS08565 (AB2M34_08565) - 1620553..1620989 (+) 437 Protein_1678 phage tail tube protein -
  AB2M34_RS08570 (AB2M34_08570) - 1621276..1621362 (-) 87 WP_072592549.1 putative holin-like toxin -
  AB2M34_RS08575 (AB2M34_08575) - 1621540..1621637 (+) 98 Protein_1680 N-acetylmuramoyl-L-alanine amidase -
  AB2M34_RS08580 (AB2M34_08580) - 1622612..1622902 (-) 291 WP_129134327.1 contact-dependent growth inhibition system immunity protein -
  AB2M34_RS08585 (AB2M34_08585) atxG 1623012..1623589 (-) 578 Protein_1682 suppressor of fused domain protein -
  AB2M34_RS08590 (AB2M34_08590) - 1623857..1624090 (-) 234 WP_224588641.1 hypothetical protein -
  AB2M34_RS08595 (AB2M34_08595) - 1624179..1624382 (+) 204 WP_123772462.1 hypothetical protein -
  AB2M34_RS08600 (AB2M34_08600) - 1624690..1625169 (-) 480 WP_250622332.1 hypothetical protein -
  AB2M34_RS08605 (AB2M34_08605) cdiI 1625274..1625633 (-) 360 WP_014480334.1 ribonuclease toxin immunity protein CdiI -
  AB2M34_RS08610 (AB2M34_08610) - 1625730..1626182 (-) 453 WP_014480333.1 SMI1/KNR4 family protein -
  AB2M34_RS08615 (AB2M34_08615) - 1626281..1626721 (-) 441 WP_014480332.1 SMI1/KNR4 family protein -
  AB2M34_RS08620 (AB2M34_08620) - 1627124..1627411 (-) 288 WP_014480331.1 hypothetical protein -
  AB2M34_RS08625 (AB2M34_08625) - 1627425..1629351 (-) 1927 Protein_1690 ribonuclease YeeF family protein -
  AB2M34_RS08630 (AB2M34_08630) - 1629533..1630660 (+) 1128 WP_014480328.1 Rap family tetratricopeptide repeat protein -
  AB2M34_RS08635 (AB2M34_08635) - 1631400..1631795 (-) 396 WP_046160622.1 VOC family protein -
  AB2M34_RS08640 (AB2M34_08640) - 1632737..1633627 (-) 891 WP_014480326.1 LysR family transcriptional regulator -
  AB2M34_RS08645 (AB2M34_08645) fumC 1633794..1635182 (+) 1389 WP_014480325.1 class II fumarate hydratase -
  AB2M34_RS08650 (AB2M34_08650) - 1635401..1635613 (+) 213 Protein_1695 recombinase family protein -
  AB2M34_RS08655 (AB2M34_08655) - 1635610..1635708 (-) 99 WP_031600702.1 hypothetical protein -
  AB2M34_RS08660 (AB2M34_08660) sigK 1635708..1636436 (-) 729 WP_013308023.1 RNA polymerase sporulation sigma factor SigK -
  AB2M34_RS08665 (AB2M34_08665) nucA/comI 1636632..1637042 (+) 411 WP_009967785.1 sporulation-specific Dnase NucB Machinery gene
  AB2M34_RS08670 (AB2M34_08670) yqeB 1637075..1637797 (-) 723 WP_014480321.1 hypothetical protein -
  AB2M34_RS08675 (AB2M34_08675) gnd 1638038..1638931 (+) 894 WP_103330412.1 phosphogluconate dehydrogenase (NAD(+)-dependent, decarboxylating) -
  AB2M34_RS08680 (AB2M34_08680) yqeD 1638950..1639576 (-) 627 WP_014480319.1 TVP38/TMEM64 family protein -
  AB2M34_RS08685 (AB2M34_08685) cwlH 1639763..1640515 (+) 753 WP_014480318.1 N-acetylmuramoyl-L-alanine amidase CwlH -
  AB2M34_RS08690 (AB2M34_08690) yqeF 1640767..1641498 (+) 732 WP_029317882.1 SGNH/GDSL hydrolase family protein -
  AB2M34_RS08695 (AB2M34_08695) - 1641804..1641944 (-) 141 WP_003226124.1 sporulation histidine kinase inhibitor Sda -
  AB2M34_RS08700 (AB2M34_08700) yqeG 1642306..1642824 (+) 519 WP_003226126.1 YqeG family HAD IIIA-type phosphatase -
  AB2M34_RS08705 (AB2M34_08705) yqeH 1642828..1643928 (+) 1101 WP_003229966.1 ribosome biogenesis GTPase YqeH -
  AB2M34_RS08710 (AB2M34_08710) aroE 1643946..1644788 (+) 843 WP_029317883.1 shikimate dehydrogenase -
  AB2M34_RS08715 (AB2M34_08715) yhbY 1644782..1645072 (+) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -
  AB2M34_RS08720 (AB2M34_08720) nadD 1645084..1645653 (+) 570 WP_041333756.1 nicotinate-nucleotide adenylyltransferase -
  AB2M34_RS08725 (AB2M34_08725) yqeK 1645643..1646203 (+) 561 WP_014480316.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  AB2M34_RS08730 (AB2M34_08730) rsfS 1646221..1646577 (+) 357 WP_014480315.1 ribosome silencing factor -
  AB2M34_RS08735 (AB2M34_08735) yqeM 1646574..1647317 (+) 744 WP_046160609.1 class I SAM-dependent methyltransferase -
  AB2M34_RS08740 (AB2M34_08740) comER 1647383..1648204 (-) 822 WP_032726195.1 late competence protein ComER -
  AB2M34_RS08745 (AB2M34_08745) comEA 1648288..1648905 (+) 618 WP_046160608.1 competence protein ComEA Machinery gene
  AB2M34_RS08750 (AB2M34_08750) comEB 1648972..1649541 (+) 570 WP_003229978.1 ComE operon protein 2 -
  AB2M34_RS08755 (AB2M34_08755) comEC 1649545..1651875 (+) 2331 WP_046160607.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  AB2M34_RS08760 (AB2M34_08760) yqzM 1651916..1652050 (-) 135 WP_003229983.1 YqzM family protein -
  AB2M34_RS08765 (AB2M34_08765) - 1652127..1652240 (+) 114 WP_122060479.1 hypothetical protein -
  AB2M34_RS08770 (AB2M34_08770) holA 1652280..1653323 (+) 1044 WP_029317888.1 DNA polymerase III subunit delta -

Sequence


Protein


Download         Length: 776 a.a.        Molecular weight: 86846.48 Da        Isoelectric Point: 7.2477

>NTDB_id=1023062 AB2M34_RS08755 WP_046160607.1 1649545..1651875(+) (comEC) [Bacillus subtilis strain DYJ24]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVKTPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFHQVKTSLRQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSASFGRLFFSWFDLLISWTNRLITNIADVEVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVMFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAEILLKYHKVKRLVIPKGFVSEPKDEKVLQAAREEGVEIEEVKRGDVLQIKDLQFHVLSPE
APDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMNVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQEVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN

Nucleotide


Download         Length: 2331 bp        

>NTDB_id=1023062 AB2M34_RS08755 WP_046160607.1 1649545..1651875(+) (comEC) [Bacillus subtilis strain DYJ24]
ATGCGTAATTCGCGTTTGTTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTTATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTAAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGTGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAATTACTCTGTCACGTCTATCCAAAACTGCAGCGAACCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTCTGCCTCCTGATTCGGCAGGAATTGTACAGGCACTTACAGTCGGTGACAGAT
TTTATGTGGAGGACGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCGATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATCCGTCTTGGTATAACAAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGTCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCATCA
GGTTAAAACCTCTTTGAGGCAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTATATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGCTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGATGTTGAAGTGTTCACGATTATGATCGCACATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTCACGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACCGGAGGC
ATTTGCTGCACGGTGATGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGTCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAGCAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGATTCTGCTGAAGTATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGGCAGCCAGGGAAG
AGGGAGTGGAAATTGAAGAGGTGAAGCGAGGCGATGTGTTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCGGAA
GCACCTGATCCGGCAAGCAAAAATAATTCATCTCTCGTTCTTTGGATGGAGACGGGCGGTATGAGCTGGATCTTGACGGG
TGATCTGGAGAAAGAAGGGGAACAAGAGGTGATGAACGTGTTTCCGAATATAAAAGCAGATGTGTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGACCAAAA
CGGAACGATCCAATATAGATATAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

98.711

100

0.987


Multiple sequence alignment