Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   BKP58_RS21090 Genome accession   NZ_CP017763
Coordinates   3947992..3948609 (+) Length   205 a.a.
NCBI ID   WP_014480312.1    Uniprot ID   -
Organism   Bacillus subtilis strain 29R7-12     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 3913440..3953026 3947992..3948609 within 0


Gene organization within MGE regions


Location: 3913440..3953026
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  BKP58_RS20860 (BKP58_20815) - 3913440..3913620 (+) 181 Protein_4042 hypothetical protein -
  BKP58_RS23405 - 3913929..3914159 (+) 231 WP_224588644.1 hypothetical protein -
  BKP58_RS20870 - 3914304..3914552 (+) 249 Protein_4044 hypothetical protein -
  BKP58_RS20875 - 3914515..3914637 (+) 123 Protein_4045 RusA family crossover junction endodeoxyribonuclease -
  BKP58_RS20880 (BKP58_20825) - 3914720..3914872 (+) 153 WP_049832653.1 XtrA/YqaO family protein -
  BKP58_RS20885 (BKP58_20830) - 3915011..3915277 (-) 267 WP_033881358.1 hypothetical protein -
  BKP58_RS20890 (BKP58_20835) - 3915894..3916373 (+) 480 WP_014480344.1 hypothetical protein -
  BKP58_RS22855 - 3916766..3917071 (-) 306 WP_123772463.1 hypothetical protein -
  BKP58_RS23040 (BKP58_20840) terS 3917198..3917762 (+) 565 Protein_4050 phage terminase small subunit -
  BKP58_RS20900 (BKP58_20845) - 3917722..3918197 (+) 476 Protein_4051 phage tail tube protein -
  BKP58_RS20905 - 3918484..3918570 (-) 87 WP_072592549.1 putative holin-like toxin -
  BKP58_RS23655 (BKP58_20850) - 3918745..3918911 (+) 167 Protein_4053 peptidoglycan-binding domain-containing protein -
  BKP58_RS20915 (BKP58_20855) istB 3919163..3919921 (-) 759 WP_014479891.1 IS21-like element helper ATPase IstB -
  BKP58_RS20920 (BKP58_20860) istA 3919918..3921465 (-) 1548 WP_014480339.1 IS21 family transposase -
  BKP58_RS20930 (BKP58_20870) - 3922306..3922596 (-) 291 WP_014480337.1 contact-dependent growth inhibition system immunity protein -
  BKP58_RS20935 (BKP58_20875) atxG 3922706..3923283 (-) 578 Protein_4057 suppressor of fused domain protein -
  BKP58_RS20940 (BKP58_20880) - 3923551..3923784 (-) 234 WP_224588641.1 hypothetical protein -
  BKP58_RS22865 - 3923873..3924076 (+) 204 WP_123772462.1 hypothetical protein -
  BKP58_RS20945 (BKP58_20885) - 3924384..3924896 (-) 513 WP_014477426.1 hypothetical protein -
  BKP58_RS20950 (BKP58_20890) cdiI 3924968..3925327 (-) 360 WP_014480334.1 ribonuclease toxin immunity protein CdiI -
  BKP58_RS20955 (BKP58_20895) - 3925424..3925876 (-) 453 WP_014480333.1 SMI1/KNR4 family protein -
  BKP58_RS20960 (BKP58_20900) - 3925975..3926415 (-) 441 WP_014480332.1 SMI1/KNR4 family protein -
  BKP58_RS20965 (BKP58_20905) - 3926818..3927105 (-) 288 WP_014480331.1 hypothetical protein -
  BKP58_RS23660 - 3927119..3927826 (-) 708 WP_014480330.1 hypothetical protein -
  BKP58_RS23665 (BKP58_20915) - 3928212..3929045 (-) 834 Protein_4066 ribonuclease YeeF family protein -
  BKP58_RS20975 (BKP58_20920) - 3929227..3930354 (+) 1128 WP_014480328.1 Rap family tetratricopeptide repeat protein -
  BKP58_RS20985 (BKP58_20925) - 3931094..3931489 (-) 396 WP_014480327.1 VOC family protein -
  BKP58_RS20990 (BKP58_20930) - 3932431..3933321 (-) 891 WP_014480326.1 LysR family transcriptional regulator -
  BKP58_RS20995 (BKP58_20935) fumC 3933488..3934876 (+) 1389 WP_014480325.1 class II fumarate hydratase -
  BKP58_RS23435 - 3935095..3935307 (+) 213 Protein_4071 recombinase family protein -
  BKP58_RS22875 (BKP58_20945) - 3935304..3935402 (-) 99 WP_031600702.1 hypothetical protein -
  BKP58_RS21005 (BKP58_20950) sigK 3935402..3936130 (-) 729 WP_013308023.1 RNA polymerase sporulation sigma factor SigK -
  BKP58_RS21010 (BKP58_20955) nucA/comI 3936326..3936736 (+) 411 WP_009967785.1 sporulation-specific Dnase NucB Machinery gene
  BKP58_RS21015 (BKP58_20960) yqeB 3936769..3937491 (-) 723 WP_014480321.1 hypothetical protein -
  BKP58_RS21020 (BKP58_20965) gnd 3937742..3938635 (+) 894 WP_014480320.1 phosphogluconate dehydrogenase (NAD(+)-dependent, decarboxylating) -
  BKP58_RS21025 (BKP58_20970) yqeD 3938654..3939280 (-) 627 WP_014480319.1 TVP38/TMEM64 family protein -
  BKP58_RS21030 (BKP58_20975) cwlH 3939467..3940219 (+) 753 WP_014480318.1 N-acetylmuramoyl-L-alanine amidase CwlH -
  BKP58_RS21035 (BKP58_20980) yqeF 3940471..3941202 (+) 732 WP_003229964.1 SGNH/GDSL hydrolase family protein -
  BKP58_RS21040 (BKP58_20990) - 3941508..3941648 (-) 141 WP_003226124.1 sporulation histidine kinase inhibitor Sda -
  BKP58_RS21045 (BKP58_20995) yqeG 3942010..3942528 (+) 519 WP_003226126.1 YqeG family HAD IIIA-type phosphatase -
  BKP58_RS21050 (BKP58_21000) yqeH 3942532..3943632 (+) 1101 WP_003229966.1 ribosome biogenesis GTPase YqeH -
  BKP58_RS21055 (BKP58_21005) aroE 3943650..3944492 (+) 843 WP_014480317.1 shikimate dehydrogenase -
  BKP58_RS21060 (BKP58_21010) yhbY 3944486..3944776 (+) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -
  BKP58_RS21065 (BKP58_21015) nadD 3944788..3945357 (+) 570 WP_004398676.1 nicotinate-nucleotide adenylyltransferase -
  BKP58_RS21070 (BKP58_21020) yqeK 3945347..3945907 (+) 561 WP_014480316.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  BKP58_RS21075 (BKP58_21025) rsfS 3945925..3946281 (+) 357 WP_014480315.1 ribosome silencing factor -
  BKP58_RS21080 (BKP58_21030) yqeM 3946278..3947021 (+) 744 WP_014480314.1 class I SAM-dependent DNA methyltransferase -
  BKP58_RS21085 (BKP58_21035) comER 3947087..3947908 (-) 822 WP_014480313.1 late competence protein ComER -
  BKP58_RS21090 (BKP58_21040) comEA 3947992..3948609 (+) 618 WP_014480312.1 competence protein ComEA Machinery gene
  BKP58_RS21095 (BKP58_21045) comEB 3948676..3949245 (+) 570 WP_003229978.1 ComE operon protein 2 -
  BKP58_RS21100 (BKP58_21050) comEC 3949249..3951579 (+) 2331 WP_033881047.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  BKP58_RS21105 (BKP58_21055) yqzM 3951619..3951753 (-) 135 WP_003229983.1 YqzM family protein -
  BKP58_RS22880 - 3951794..3951943 (+) 150 WP_003229985.1 hypothetical protein -
  BKP58_RS21110 (BKP58_21060) holA 3951983..3953026 (+) 1044 WP_014480310.1 DNA polymerase III subunit delta -

Sequence


Protein


Download         Length: 205 a.a.        Molecular weight: 21781.51 Da        Isoelectric Point: 4.7220

>NTDB_id=202700 BKP58_RS21090 WP_014480312.1 3947992..3948609(+) (comEA) [Bacillus subtilis strain 29R7-12]
MNWLNQHKKAIILAASAAVFTAIMIFLATGKNKEPVKQAVPTETENTVVKQEANNDESNETIVIDIKGAVQHPGVYEMRT
GDRVSQAIEKAGGTSEQADEAQVNLAEILQDGTVVYIPKKGEEIAVQQGGGGSVQSDGGKGALVNINTATLEELQGISGV
GPSKAEAIIAYREENGRFQTIEDITKVSGIGEKSFEKIKSSITVK

Nucleotide


Download         Length: 618 bp        

>NTDB_id=202700 BKP58_RS21090 WP_014480312.1 3947992..3948609(+) (comEA) [Bacillus subtilis strain 29R7-12]
ATGAATTGGTTGAATCAGCATAAGAAAGCAATTATTTTAGCGGCTTCTGCGGCTGTTTTCACAGCGATTATGATCTTTCT
GGCCACAGGGAAAAATAAAGAGCCGGTGAAGCAAGCTGTACCAACAGAGACAGAAAATACAGTGGTAAAGCAGGAAGCAA
ACAACGACGAGTCAAACGAAACAATTGTGATAGACATCAAAGGTGCTGTTCAGCATCCTGGCGTTTATGAAATGCGAACA
GGGGACAGAGTATCTCAGGCAATTGAGAAAGCGGGCGGGACCAGTGAACAAGCAGACGAAGCGCAAGTAAATTTGGCGGA
GATTCTGCAGGACGGGACAGTGGTGTACATCCCGAAAAAGGGAGAGGAAATAGCAGTGCAGCAAGGTGGCGGAGGGTCAG
TCCAAAGCGATGGAGGGAAGGGAGCGCTGGTGAATATCAATACAGCAACCTTAGAGGAGTTACAAGGCATCTCAGGGGTG
GGGCCATCCAAAGCTGAAGCTATTATTGCATACCGGGAAGAAAACGGGCGTTTCCAAACAATTGAAGATATCACTAAGGT
TTCAGGAATAGGTGAAAAGTCATTTGAGAAAATAAAGTCTTCCATTACAGTAAAGTGA

Domains


Predicted by InterproScan.

(142-203)

(63-118)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Bacillus subtilis subsp. subtilis str. 168

99.512

100

0.995

  comEA Staphylococcus aureus MW2

37.273

100

0.4

  comEA Staphylococcus aureus N315

36.364

100

0.39


Multiple sequence alignment