Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   IMZ18_RS05305 Genome accession   NZ_CP063151
Coordinates   996698..997315 (+) Length   205 a.a.
NCBI ID   WP_014480312.1    Uniprot ID   -
Organism   Bacillus subtilis subsp. subtilis strain CMIN-4     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 962148..1001732 996698..997315 within 0


Gene organization within MGE regions


Location: 962148..1001732
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  IMZ18_RS05070 (IMZ18_05070) - 962148..962328 (+) 181 Protein_991 hypothetical protein -
  IMZ18_RS22415 - 962462..962692 (+) 231 WP_014480347.1 hypothetical protein -
  IMZ18_RS05080 (IMZ18_05080) - 963011..963259 (+) 249 Protein_993 hypothetical protein -
  IMZ18_RS05085 (IMZ18_05085) - 963222..963344 (+) 123 Protein_994 RusA family crossover junction endodeoxyribonuclease -
  IMZ18_RS05090 (IMZ18_05090) - 963427..963579 (+) 153 WP_049832653.1 XtrA/YqaO family protein -
  IMZ18_RS05095 (IMZ18_05095) - 963718..963984 (-) 267 WP_033881358.1 hypothetical protein -
  IMZ18_RS05100 (IMZ18_05100) - 964601..965080 (+) 480 WP_014480344.1 hypothetical protein -
  IMZ18_RS22420 - 965235..965300 (+) 66 Protein_998 hypothetical protein -
  IMZ18_RS05105 (IMZ18_05105) - 965473..965778 (-) 306 WP_123772463.1 hypothetical protein -
  IMZ18_RS05110 (IMZ18_05110) terS 965905..966470 (+) 566 Protein_1000 phage terminase small subunit -
  IMZ18_RS05115 (IMZ18_05115) - 966468..966904 (+) 437 Protein_1001 phage tail tube protein -
  IMZ18_RS05120 (IMZ18_05120) - 967191..967277 (-) 87 WP_072592549.1 putative holin-like toxin -
  IMZ18_RS22425 - 967455..967552 (+) 98 Protein_1003 N-acetylmuramoyl-L-alanine amidase -
  IMZ18_RS05130 (IMZ18_05130) istB 967870..968628 (-) 759 WP_014479891.1 IS21-like element helper ATPase IstB -
  IMZ18_RS05135 (IMZ18_05135) istA 968625..970172 (-) 1548 WP_014480339.1 IS21 family transposase -
  IMZ18_RS05140 (IMZ18_05140) - 971012..971302 (-) 291 WP_014480337.1 contact-dependent growth inhibition system immunity protein -
  IMZ18_RS05145 (IMZ18_05145) atxG 971412..971989 (-) 578 Protein_1007 suppressor of fused domain protein -
  IMZ18_RS05150 (IMZ18_05150) - 972257..972490 (-) 234 WP_224588641.1 hypothetical protein -
  IMZ18_RS05155 (IMZ18_05155) - 972579..972782 (+) 204 WP_123772462.1 hypothetical protein -
  IMZ18_RS05160 (IMZ18_05160) - 973090..973602 (-) 513 WP_014477426.1 hypothetical protein -
  IMZ18_RS05165 (IMZ18_05165) cdiI 973674..974033 (-) 360 WP_014480334.1 ribonuclease toxin immunity protein CdiI -
  IMZ18_RS05170 (IMZ18_05170) - 974130..974582 (-) 453 WP_014480333.1 SMI1/KNR4 family protein -
  IMZ18_RS05175 (IMZ18_05175) - 974681..975121 (-) 441 WP_014480332.1 SMI1/KNR4 family protein -
  IMZ18_RS05180 (IMZ18_05180) - 975524..975811 (-) 288 WP_014480331.1 hypothetical protein -
  IMZ18_RS22755 (IMZ18_05185) - 975825..977751 (-) 1927 Protein_1015 T7SS effector LXG polymorphic toxin -
  IMZ18_RS05190 (IMZ18_05190) - 977933..979060 (+) 1128 WP_014480328.1 Rap family tetratricopeptide repeat protein -
  IMZ18_RS05195 (IMZ18_05195) - 979800..980195 (-) 396 WP_014480327.1 VOC family protein -
  IMZ18_RS05200 (IMZ18_05200) - 981137..982027 (-) 891 WP_014480326.1 LysR family transcriptional regulator -
  IMZ18_RS05205 (IMZ18_05205) fumC 982194..983582 (+) 1389 WP_014480325.1 class II fumarate hydratase -
  IMZ18_RS22445 - 983801..984013 (+) 213 Protein_1020 recombinase family protein -
  IMZ18_RS05215 (IMZ18_05215) - 984010..984108 (-) 99 WP_031600702.1 hypothetical protein -
  IMZ18_RS05220 (IMZ18_05220) sigK 984108..984836 (-) 729 WP_013308023.1 RNA polymerase sporulation sigma factor SigK -
  IMZ18_RS05225 (IMZ18_05225) nucA/comI 985032..985442 (+) 411 WP_009967785.1 sporulation-specific Dnase NucB Machinery gene
  IMZ18_RS05230 (IMZ18_05230) yqeB 985475..986197 (-) 723 WP_014480321.1 hypothetical protein -
  IMZ18_RS05235 (IMZ18_05235) gnd 986448..987341 (+) 894 WP_014480320.1 phosphogluconate dehydrogenase (NAD(+)-dependent, decarboxylating) -
  IMZ18_RS05240 (IMZ18_05240) yqeD 987360..987986 (-) 627 WP_014480319.1 TVP38/TMEM64 family protein -
  IMZ18_RS05245 (IMZ18_05245) cwlH 988173..988925 (+) 753 WP_014480318.1 N-acetylmuramoyl-L-alanine amidase CwlH -
  IMZ18_RS05250 (IMZ18_05250) yqeF 989177..989908 (+) 732 WP_003229964.1 SGNH/GDSL hydrolase family protein -
  IMZ18_RS05255 (IMZ18_05255) - 990214..990354 (-) 141 WP_003226124.1 sporulation histidine kinase inhibitor Sda -
  IMZ18_RS05260 (IMZ18_05260) yqeG 990716..991234 (+) 519 WP_003226126.1 YqeG family HAD IIIA-type phosphatase -
  IMZ18_RS05265 (IMZ18_05265) yqeH 991238..992338 (+) 1101 WP_003229966.1 ribosome biogenesis GTPase YqeH -
  IMZ18_RS05270 (IMZ18_05270) aroE 992356..993198 (+) 843 WP_014480317.1 shikimate dehydrogenase -
  IMZ18_RS05275 (IMZ18_05275) yhbY 993192..993482 (+) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -
  IMZ18_RS05280 (IMZ18_05280) nadD 993494..994063 (+) 570 WP_004398676.1 nicotinate-nucleotide adenylyltransferase -
  IMZ18_RS05285 (IMZ18_05285) yqeK 994053..994613 (+) 561 WP_014480316.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  IMZ18_RS05290 (IMZ18_05290) rsfS 994631..994987 (+) 357 WP_014480315.1 ribosome silencing factor -
  IMZ18_RS05295 (IMZ18_05295) yqeM 994984..995727 (+) 744 WP_014480314.1 class I SAM-dependent methyltransferase -
  IMZ18_RS05300 (IMZ18_05300) comER 995793..996614 (-) 822 WP_014480313.1 late competence protein ComER -
  IMZ18_RS05305 (IMZ18_05305) comEA 996698..997315 (+) 618 WP_014480312.1 competence protein ComEA Machinery gene
  IMZ18_RS05310 (IMZ18_05310) comEB 997382..997951 (+) 570 WP_003229978.1 ComE operon protein 2 -
  IMZ18_RS05315 (IMZ18_05315) comEC 997955..1000285 (+) 2331 WP_033881047.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  IMZ18_RS05320 (IMZ18_05320) yqzM 1000325..1000459 (-) 135 WP_003229983.1 YqzM family protein -
  IMZ18_RS05325 (IMZ18_05325) - 1000500..1000649 (+) 150 WP_003229985.1 hypothetical protein -
  IMZ18_RS05330 (IMZ18_05330) holA 1000689..1001732 (+) 1044 WP_014480310.1 DNA polymerase III subunit delta -

Sequence


Protein


Download         Length: 205 a.a.        Molecular weight: 21781.51 Da        Isoelectric Point: 4.7220

>NTDB_id=493452 IMZ18_RS05305 WP_014480312.1 996698..997315(+) (comEA) [Bacillus subtilis subsp. subtilis strain CMIN-4]
MNWLNQHKKAIILAASAAVFTAIMIFLATGKNKEPVKQAVPTETENTVVKQEANNDESNETIVIDIKGAVQHPGVYEMRT
GDRVSQAIEKAGGTSEQADEAQVNLAEILQDGTVVYIPKKGEEIAVQQGGGGSVQSDGGKGALVNINTATLEELQGISGV
GPSKAEAIIAYREENGRFQTIEDITKVSGIGEKSFEKIKSSITVK

Nucleotide


Download         Length: 618 bp        

>NTDB_id=493452 IMZ18_RS05305 WP_014480312.1 996698..997315(+) (comEA) [Bacillus subtilis subsp. subtilis strain CMIN-4]
ATGAATTGGTTGAATCAGCATAAGAAAGCAATTATTTTAGCGGCTTCTGCGGCTGTTTTCACAGCGATTATGATCTTTCT
GGCCACAGGGAAAAATAAAGAGCCGGTGAAGCAAGCTGTACCAACAGAGACAGAAAATACAGTGGTAAAGCAGGAAGCAA
ACAACGACGAGTCAAACGAAACAATTGTGATAGACATCAAAGGTGCTGTTCAGCATCCTGGCGTTTATGAAATGCGAACA
GGGGACAGAGTATCTCAGGCAATTGAGAAAGCGGGCGGGACCAGTGAACAAGCAGACGAAGCGCAAGTAAATTTGGCGGA
GATTCTGCAGGACGGGACAGTGGTGTACATCCCGAAAAAGGGAGAGGAAATAGCAGTGCAGCAAGGTGGCGGAGGGTCAG
TCCAAAGCGATGGAGGGAAGGGAGCGCTGGTGAATATCAATACAGCAACCTTAGAGGAGTTACAAGGCATCTCAGGGGTG
GGGCCATCCAAAGCTGAAGCTATTATTGCATACCGGGAAGAAAACGGGCGTTTCCAAACAATTGAAGATATCACTAAGGT
TTCAGGAATAGGTGAAAAGTCATTTGAGAAAATAAAGTCTTCCATTACAGTAAAGTGA

Domains


Predicted by InterproScan.

(142-203)

(63-118)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Bacillus subtilis subsp. subtilis str. 168

99.512

100

0.995

  comEA Staphylococcus aureus MW2

37.273

100

0.4

  comEA Staphylococcus aureus N315

36.364

100

0.39