Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   O6U12_RS04205 Genome accession   NZ_CP114899
Coordinates   779322..781652 (+) Length   776 a.a.
NCBI ID   WP_017696281.1    Uniprot ID   -
Organism   Bacillus subtilis strain     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 741234..783099 779322..781652 within 0


Gene organization within MGE regions


Location: 741234..783099
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  O6U12_RS03990 (O6U12_03990) yqcE 741234..741398 (+) 165 WP_024122146.1 XkdX family protein -
  O6U12_RS03995 (O6U12_03995) xepA 741487..742380 (+) 894 WP_129093405.1 phage-like element PBSX protein XepA -
  O6U12_RS04000 (O6U12_04000) skhD 742426..742848 (+) 423 WP_003246208.1 phage holin family protein -
  O6U12_RS04005 (O6U12_04005) - 742893..743861 (+) 969 WP_129093404.1 N-acetylmuramoyl-L-alanine amidase -
  O6U12_RS04010 (O6U12_04010) - 743995..744987 (+) 993 WP_224912955.1 HNH endonuclease -
  O6U12_RS04015 (O6U12_04015) - 745007..745552 (+) 546 WP_069322772.1 SMI1/KNR4 family protein -
  O6U12_RS04020 (O6U12_04020) - 745746..746198 (-) 453 WP_014480333.1 SMI1/KNR4 family protein -
  O6U12_RS04025 (O6U12_04025) - 746297..746737 (-) 441 WP_129093403.1 SMI1/KNR4 family protein -
  O6U12_RS04030 (O6U12_04030) - 746747..747139 (-) 393 WP_019715065.1 immunity 50 family protein -
  O6U12_RS04035 (O6U12_04035) yqcF 747247..747825 (-) 579 WP_129093402.1 type VII secretion system immunity protein YqcF -
  O6U12_RS04040 (O6U12_04040) - 747917..748348 (-) 432 WP_129093401.1 SMI1/KNR4 family protein -
  O6U12_RS04045 (O6U12_04045) - 748345..748716 (-) 372 WP_224912977.1 hypothetical protein -
  O6U12_RS04050 (O6U12_04050) - 748708..749997 (-) 1290 Protein_783 ribonuclease YeeF family protein -
  O6U12_RS04055 (O6U12_04055) - 750184..751311 (+) 1128 WP_129093400.1 Rap family tetratricopeptide repeat protein -
  O6U12_RS04060 (O6U12_04060) - 751800..752711 (+) 912 WP_069487169.1 hypothetical protein -
  O6U12_RS04065 (O6U12_04065) - 752638..756258 (+) 3621 WP_128747171.1 DEAD/DEAH box helicase -
  O6U12_RS04070 (O6U12_04070) - 756930..758045 (-) 1116 WP_017694944.1 IS4 family transposase -
  O6U12_RS04075 (O6U12_04075) - 758227..759951 (+) 1725 WP_269767453.1 AAA family ATPase -
  O6U12_RS04080 (O6U12_04080) - 760213..760391 (+) 179 Protein_789 hypothetical protein -
  O6U12_RS04085 (O6U12_04085) spoIVCA 760349..761809 (+) 1461 WP_224912975.1 site-specific DNA recombinase SpoIVCA -
  O6U12_RS04090 (O6U12_04090) - 761996..763408 (-) 1413 WP_129093397.1 MDR family MFS transporter -
  O6U12_RS04095 (O6U12_04095) - 763558..764028 (-) 471 WP_015251668.1 MarR family transcriptional regulator -
  O6U12_RS04100 (O6U12_04100) - 764168..764518 (-) 351 Protein_793 sigma-70 family RNA polymerase sigma factor -
  O6U12_RS04105 (O6U12_04105) nucB 764714..765124 (+) 411 Protein_794 sporulation-specific Dnase NucB -
  O6U12_RS04110 (O6U12_04110) yqeB 765158..765880 (-) 723 WP_026113675.1 hypothetical protein -
  O6U12_RS04115 (O6U12_04115) - 766023..767051 (+) 1029 WP_017696289.1 IS1595 family transposase -
  O6U12_RS04120 (O6U12_04120) - 767101..767523 (+) 423 WP_017696288.1 hypothetical protein -
  O6U12_RS04125 (O6U12_04125) gnd 767863..768755 (+) 893 Protein_798 phosphogluconate dehydrogenase (NAD(+)-dependent, decarboxylating) -
  O6U12_RS04130 (O6U12_04130) yqeD 768774..769400 (-) 627 WP_014480319.1 TVP38/TMEM64 family protein -
  O6U12_RS04135 (O6U12_04135) cwlH 769587..770339 (+) 753 WP_017696287.1 N-acetylmuramoyl-L-alanine amidase CwlH -
  O6U12_RS04140 (O6U12_04140) yqeF 770591..771322 (+) 732 WP_003229964.1 SGNH/GDSL hydrolase family protein -
  O6U12_RS04145 (O6U12_04145) - 771581..771721 (-) 141 WP_003226124.1 sporulation histidine kinase inhibitor Sda -
  O6U12_RS04150 (O6U12_04150) yqeG 772083..772601 (+) 519 WP_003226126.1 YqeG family HAD IIIA-type phosphatase -
  O6U12_RS04155 (O6U12_04155) yqeH 772605..773705 (+) 1101 WP_003229966.1 ribosome biogenesis GTPase YqeH -
  O6U12_RS04160 (O6U12_04160) aroE 773723..774565 (+) 843 WP_017696285.1 shikimate dehydrogenase -
  O6U12_RS04165 (O6U12_04165) yhbY 774559..774849 (+) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -
  O6U12_RS04170 (O6U12_04170) nadD 774861..775430 (+) 570 WP_004398676.1 nicotinate-nucleotide adenylyltransferase -
  O6U12_RS04175 (O6U12_04175) yqeK 775420..775980 (+) 561 WP_017696284.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  O6U12_RS04180 (O6U12_04180) rsfS 775998..776354 (+) 357 WP_014480315.1 ribosome silencing factor -
  O6U12_RS04185 (O6U12_04185) yqeM 776351..777094 (+) 744 WP_017696283.1 class I SAM-dependent DNA methyltransferase -
  O6U12_RS04190 (O6U12_04190) comER 777160..777981 (-) 822 WP_004398597.1 late competence protein ComER -
  O6U12_RS04195 (O6U12_04195) comEA 778065..778682 (+) 618 WP_017696282.1 competence protein ComEA Machinery gene
  O6U12_RS04200 (O6U12_04200) comEB 778749..779318 (+) 570 WP_003229978.1 ComE operon protein 2 -
  O6U12_RS04205 (O6U12_04205) comEC 779322..781652 (+) 2331 WP_017696281.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  O6U12_RS04210 (O6U12_04210) yqzM 781692..781826 (-) 135 WP_003229983.1 YqzM family protein -
  O6U12_RS04215 (O6U12_04215) - 781867..782016 (+) 150 WP_003229985.1 hypothetical protein -
  O6U12_RS04220 (O6U12_04220) holA 782056..783099 (+) 1044 WP_017696280.1 DNA polymerase III subunit delta -

Sequence


Protein


Download         Length: 776 a.a.        Molecular weight: 86633.13 Da        Isoelectric Point: 7.2477

>NTDB_id=767569 O6U12_RS04205 WP_017696281.1 779322..781652(+) (comEC) [Bacillus subtilis strain]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFIFFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVKTPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFQQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSASFGRLFFSWFDLLISWTNRLITNIADVDVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVLFLLFLYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAETLLKHHKVKRLVIPKGFVSEPKDEKVLQTAREEGVTIEEVKRGDVLQIKDLQFHVLSPG
APDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMDVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQEVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN

Nucleotide


Download         Length: 2331 bp        

>NTDB_id=767569 O6U12_RS04205 WP_017696281.1 779322..781652(+) (comEC) [Bacillus subtilis strain]
ATGCGTAATTCGCGTTTGTTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCTTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTTATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGGACCTACCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTAAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCTGGTGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATCCAAAACTGCAGCGAACCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTCTGCCTCCTGATTCGGCAGGAATTGTACAGGCACTTACAGTCGGTGACAGAT
TTTATGTGGAGGATGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCAATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATTCGCCTTGGTATAACAAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGCCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCATTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCAGCA
GGTTAAAACCTCCTTGGGACAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTATATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGCTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGATGTTGATGTGTTCACGATTATGATCGCACATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTCACGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACCGGAGGC
ATTTGCTGCACGGTGCTGTTTCTGCTCTTTTTATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGGCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAACAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGACTCTGCTGAAGCATCA
TAAAGTAAAGCGTCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCCAAAGATGAGAAAGTGCTGCAGACAGCCAGAGAAG
AGGGAGTGACAATTGAAGAGGTGAAGCGAGGCGATGTATTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCTGGA
GCACCTGATCCGGCAAGCAAAAATAATTCCTCTCTCGTTCTGTGGATGGAGACGGGCGGTATGAGCTGGATCTTGACGGG
TGACCTGGAGAAAGAAGGGGAACAGGAGGTGATGGACGTGTTTCCAAATATTAAAGCAGATGTCTTAAAGGTGGGGCACC
ATGGGAGTAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGATCAAAA
CGGAACGATCCAATATAGATACAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

98.325

100

0.983