Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   AS891_RS10295 Genome accession   NZ_CP015375
Coordinates   1971008..1971625 (+) Length   205 a.a.
NCBI ID   WP_004398514.1    Uniprot ID   A0AAE2SLR2
Organism   Bacillus subtilis subsp. subtilis strain KCTC 3135     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1912895..1974595 1971008..1971625 within 0


Gene organization within MGE regions


Location: 1912895..1974595
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AS891_RS09890 (AS891_09885) sknR 1912895..1913245 (-) 351 WP_004398704.1 transcriptional regulator SknR -
  AS891_RS09895 (AS891_09890) yqaF 1913422..1913652 (+) 231 WP_004398958.1 helix-turn-helix transcriptional regulator -
  AS891_RS09900 (AS891_09895) - 1913682..1913822 (+) 141 WP_003229902.1 hypothetical protein -
  AS891_RS09905 (AS891_09900) yqaG 1913896..1914465 (+) 570 WP_004398626.1 helix-turn-helix domain-containing protein -
  AS891_RS09910 (AS891_09905) sknH 1914462..1914719 (+) 258 WP_003245994.1 YqaH family protein -
  AS891_RS22875 - 1914716..1914889 (+) 174 WP_119123069.1 hypothetical protein -
  AS891_RS09915 (AS891_09910) yqaI 1914849..1915043 (+) 195 WP_003229905.1 YqaI family protein -
  AS891_RS09920 (AS891_09915) yqaJ 1915149..1916108 (+) 960 WP_004398673.1 YqaJ viral recombinase family protein -
  AS891_RS09925 (AS891_09920) recT 1916111..1916965 (+) 855 WP_003229907.1 recombinase RecT -
  AS891_RS09930 (AS891_09925) yqaL 1917041..1917718 (+) 678 WP_010886575.1 DnaD domain-containing protein -
  AS891_RS09935 (AS891_09930) sknM 1917600..1918541 (+) 942 WP_075058863.1 ATP-binding protein -
  AS891_RS09940 (AS891_09935) - 1918532..1918681 (+) 150 WP_003229910.1 hypothetical protein -
  AS891_RS09950 (AS891_09945) yqaN 1918777..1919205 (+) 429 WP_009967809.1 RusA family crossover junction endodeoxyribonuclease -
  AS891_RS09955 (AS891_09950) yqaO 1919287..1919493 (+) 207 WP_003229912.1 XtrA/YqaO family protein -
  AS891_RS09960 (AS891_09955) - 1919567..1920496 (-) 930 WP_003229913.1 hypothetical protein -
  AS891_RS09965 (AS891_09960) yqaQ 1920694..1921149 (+) 456 WP_004398775.1 hypothetical protein -
  AS891_RS09970 (AS891_09965) - 1921293..1921757 (+) 465 WP_004398685.1 hypothetical protein -
  AS891_RS09975 (AS891_09970) terS 1921825..1922544 (+) 720 WP_003229916.1 phage terminase small subunit -
  AS891_RS09980 (AS891_09975) stmB 1922537..1923832 (+) 1296 WP_003229917.1 PBSX family phage terminase large subunit -
  AS891_RS09985 (AS891_09980) yqbA 1923836..1925368 (+) 1533 WP_004398894.1 phage portal protein -
  AS891_RS09990 (AS891_09985) yqbB 1925365..1926282 (+) 918 WP_004398748.1 phage head morphogenesis protein -
  AS891_RS09995 (AS891_09990) - 1926323..1926976 (+) 654 WP_003229920.1 hypothetical protein -
  AS891_RS10000 (AS891_09995) yqbD 1927009..1927977 (+) 969 WP_003229921.1 XkdF-like putative serine protease domain-containing protein -
  AS891_RS10005 (AS891_10000) skdG 1927996..1928931 (+) 936 WP_003229922.1 phage major capsid protein -
  AS891_RS10010 (AS891_10005) yqbF 1928942..1929253 (+) 312 WP_003229923.1 YqbF domain-containing protein -
  AS891_RS10015 (AS891_10010) gkpG 1929257..1929652 (+) 396 WP_004398566.1 DUF3199 family protein -
  AS891_RS10020 (AS891_10015) yqbH 1929649..1930011 (+) 363 WP_003229925.1 YqbH/XkdH family protein -
  AS891_RS10025 (AS891_10020) yqbI 1930008..1930511 (+) 504 WP_003246050.1 HK97 gp10 family phage protein -
  AS891_RS10030 (AS891_10025) yqbJ 1930524..1930961 (+) 438 WP_003229927.1 phage tail terminator family protein -
  AS891_RS10035 (AS891_10030) - 1930958..1931149 (+) 192 WP_010886574.1 hypothetical protein -
  AS891_RS10040 (AS891_10035) yqbK 1931150..1932550 (+) 1401 WP_003229929.1 phage tail sheath family protein -
  AS891_RS10045 (AS891_10040) yqbM 1932553..1932996 (+) 444 WP_003229930.1 phage tail tube protein -
  AS891_RS22880 bsrH 1933250..1933339 (-) 90 WP_075058862.1 type I toxin-antitoxin system toxin BsrH -
  AS891_RS10050 (AS891_10045) txpA 1933719..1933898 (-) 180 WP_004398662.1 type I toxin-antitoxin system toxin TxpA -
  AS891_RS10055 (AS891_10050) - 1934044..1934493 (+) 450 WP_003229933.1 phage tail assembly chaperone -
  AS891_RS10060 (AS891_10055) - 1934535..1934672 (+) 138 WP_003229934.1 hypothetical protein -
  AS891_RS10065 (AS891_10060) yqbO 1934675..1939432 (+) 4758 WP_003246092.1 phage tail tape measure protein -
  AS891_RS10070 (AS891_10065) yqbP 1939425..1940084 (+) 660 WP_004398548.1 LysM peptidoglycan-binding domain-containing protein -
  AS891_RS10075 (AS891_10070) yqbQ 1940097..1941077 (+) 981 WP_004398524.1 XkdQ/YqbQ family protein -
  AS891_RS10080 (AS891_10075) yqbR 1941074..1941337 (+) 264 WP_003229938.1 DUF2577 family protein -
  AS891_RS10085 (AS891_10080) yqbS 1941350..1941775 (+) 426 WP_004398572.1 DUF2634 domain-containing protein -
  AS891_RS10090 (AS891_10085) yqbT 1941768..1942814 (+) 1047 WP_003229940.1 baseplate J/gp47 family protein -
  AS891_RS10095 (AS891_10090) yqcA 1942798..1943376 (+) 579 WP_003229941.1 YmfQ family protein -
  AS891_RS10100 (AS891_10095) - 1943373..1943645 (+) 273 WP_003229942.1 hypothetical protein -
  AS891_RS10105 (AS891_10100) yqcC 1943648..1944748 (+) 1101 WP_003229943.1 pyocin knob domain-containing protein -
  AS891_RS10110 (AS891_10105) yqcD 1944758..1945093 (+) 336 WP_009967793.1 XkdW family protein -
  AS891_RS10115 (AS891_10110) yqcE 1945090..1945254 (+) 165 WP_003229944.1 XkdX family protein -
  AS891_RS10120 (AS891_10115) xepA 1945342..1946235 (+) 894 WP_003246010.1 phage-like element PBSX protein XepA -
  AS891_RS10125 (AS891_10120) skhD 1946280..1946702 (+) 423 WP_003246208.1 phage holin family protein -
  AS891_RS10130 (AS891_10125) cwlA 1946747..1947565 (+) 819 WP_003229946.1 N-acetylmuramoyl-L-alanine amidase CwlA -
  AS891_RS10135 (AS891_10130) - 1947730..1948209 (+) 480 WP_004399085.1 hypothetical protein -
  AS891_RS10140 (AS891_10135) - 1948225..1948587 (+) 363 WP_003229947.1 hypothetical protein -
  AS891_RS22885 (AS891_10140) - 1948584..1948730 (-) 147 WP_009967791.1 hypothetical protein -
  AS891_RS10150 (AS891_10145) yqcF 1948848..1949426 (-) 579 WP_009967790.1 type VII secretion system immunity protein YqcF -
  AS891_RS10155 (AS891_10150) yqcG 1949441..1951036 (-) 1596 WP_004399034.1 LXG family T7SS effector endonuclease toxin YqcG -
  AS891_RS23325 - 1951152..1951442 (-) 291 WP_418910793.1 hypothetical protein -
  AS891_RS10160 (AS891_10155) - 1951406..1951564 (-) 159 WP_003245945.1 hypothetical protein -
  AS891_RS10165 (AS891_10160) phrE 1951674..1951808 (-) 135 WP_004398770.1 phosphatase RapE inhibitor PhrE -
  AS891_RS10170 (AS891_10165) rapE 1951798..1952925 (-) 1128 WP_004398842.1 response regulator aspartate phosphatase RapE -
  AS891_RS10175 (AS891_10170) yqcI 1953368..1954132 (+) 765 WP_004398670.1 YqcI/YcgG family protein -
  AS891_RS10180 (AS891_10175) arsR 1954504..1954821 (+) 318 WP_004399122.1 arsenical resistance operon transcriptional regulator ArsR -
  AS891_RS10185 (AS891_10180) arsK 1954882..1955322 (+) 441 WP_003229954.1 ArsI/CadI family heavy metal resistance metalloenzyme -
  AS891_RS10190 (AS891_10185) acr3 1955345..1956385 (+) 1041 WP_004398718.1 arsenite efflux transporter Acr3 -
  AS891_RS10195 (AS891_10190) arsC 1956397..1956816 (+) 420 WP_004398596.1 thioredoxin-dependent arsenate reductase -
  AS891_RS23165 - 1957171..1957349 (+) 179 Protein_1963 hypothetical protein -
  AS891_RS10200 (AS891_10195) spoIVCA 1957307..1958767 (+) 1461 WP_223257626.1 site-specific DNA recombinase SpoIVCA -
  AS891_RS10205 (AS891_10200) - 1958795..1959145 (-) 351 Protein_1965 sigma-70 family RNA polymerase sigma factor -
  AS891_RS10210 (AS891_10205) nucA/comI 1959341..1959751 (+) 411 WP_009967785.1 sporulation-specific Dnase NucB Machinery gene
  AS891_RS10215 (AS891_10210) yqeB 1959784..1960506 (-) 723 WP_010886572.1 hypothetical protein -
  AS891_RS10220 (AS891_10215) gnd 1960758..1961651 (+) 894 WP_003229961.1 phosphogluconate dehydrogenase (NAD(+)-dependent, decarboxylating) -
  AS891_RS10225 (AS891_10220) yqeD 1961670..1962296 (-) 627 WP_003229962.1 TVP38/TMEM64 family protein -
  AS891_RS10230 (AS891_10225) cwlH 1962483..1963235 (+) 753 WP_003229963.1 N-acetylmuramoyl-L-alanine amidase CwlH -
  AS891_RS10235 (AS891_10230) yqeF 1963487..1964218 (+) 732 WP_003229964.1 SGNH/GDSL hydrolase family protein -
  AS891_RS10245 (AS891_10240) - 1964524..1964664 (-) 141 WP_003226124.1 sporulation histidine kinase inhibitor Sda -
  AS891_RS10250 (AS891_10245) yqeG 1965026..1965544 (+) 519 WP_003226126.1 YqeG family HAD IIIA-type phosphatase -
  AS891_RS10255 (AS891_10250) yqeH 1965548..1966648 (+) 1101 WP_003229966.1 ribosome biogenesis GTPase YqeH -
  AS891_RS10260 (AS891_10255) aroE 1966666..1967508 (+) 843 WP_003229967.1 shikimate dehydrogenase -
  AS891_RS10265 (AS891_10260) yhbY 1967502..1967792 (+) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -
  AS891_RS10270 (AS891_10265) nadD 1967804..1968373 (+) 570 WP_004398676.1 nicotinate-nucleotide adenylyltransferase -
  AS891_RS10275 (AS891_10270) yqeK 1968363..1968923 (+) 561 WP_004399059.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  AS891_RS10280 (AS891_10275) rsfS 1968941..1969297 (+) 357 WP_003229971.1 ribosome silencing factor -
  AS891_RS10285 (AS891_10280) yqeM 1969294..1970037 (+) 744 WP_003229973.1 class I SAM-dependent DNA methyltransferase -
  AS891_RS10290 (AS891_10285) comER 1970103..1970924 (-) 822 WP_004398597.1 late competence protein ComER -
  AS891_RS10295 (AS891_10290) comEA 1971008..1971625 (+) 618 WP_004398514.1 competence protein ComEA Machinery gene
  AS891_RS10300 (AS891_10295) comEB 1971692..1972261 (+) 570 WP_003229978.1 ComE operon protein 2 -
  AS891_RS10305 (AS891_10300) comEC 1972265..1974595 (+) 2331 WP_009967776.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene

Sequence


Protein


Download         Length: 205 a.a.        Molecular weight: 21769.45 Da        Isoelectric Point: 4.7220

>NTDB_id=178909 AS891_RS10295 WP_004398514.1 1971008..1971625(+) (comEA) [Bacillus subtilis subsp. subtilis strain KCTC 3135]
MNWLNQHKKAIILAASAAVFTAIMIFLATGKNKEPVKQAVPTETENTVVKQEANNDESNETIVIDIKGAVQHPGVYEMRT
GDRVSQAIEKAGGTSEQADEAQVNLAEILQDGTVVYIPKKGEETAVQQGGGGSVQSDGGKGALVNINTATLEELQGISGV
GPSKAEAIIAYREENGRFQTIEDITKVSGIGEKSFEKIKSSITVK

Nucleotide


Download         Length: 618 bp        

>NTDB_id=178909 AS891_RS10295 WP_004398514.1 1971008..1971625(+) (comEA) [Bacillus subtilis subsp. subtilis strain KCTC 3135]
ATGAATTGGTTGAATCAGCATAAGAAAGCAATTATTTTAGCGGCTTCTGCGGCTGTTTTCACAGCGATTATGATCTTTCT
GGCCACAGGGAAAAATAAAGAGCCGGTGAAGCAAGCTGTACCAACAGAGACAGAAAATACAGTGGTAAAGCAGGAAGCAA
ACAACGACGAGTCAAACGAAACAATTGTGATAGACATCAAAGGTGCTGTTCAGCATCCTGGCGTTTATGAAATGCGAACA
GGGGACAGAGTATCTCAGGCAATTGAGAAAGCGGGCGGGACCAGTGAACAAGCAGACGAAGCGCAAGTAAATTTGGCGGA
GATTCTGCAGGACGGGACAGTGGTGTACATCCCGAAAAAGGGAGAGGAAACAGCAGTGCAGCAAGGTGGCGGAGGGTCTG
TCCAAAGCGATGGAGGGAAGGGAGCGCTGGTGAATATCAATACAGCAACCTTAGAGGAGTTACAAGGCATCTCAGGGGTG
GGGCCATCCAAAGCTGAAGCTATTATTGCATACCGGGAGGAAAACGGTCGTTTCCAAACAATTGAAGATATCACTAAGGT
TTCAGGAATAGGTGAAAAGTCATTTGAGAAAATAAAGTCTTCCATTACAGTAAAGTGA

Domains


Predicted by InterproScan.

(142-203)

(63-118)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Bacillus subtilis subsp. subtilis str. 168

100

100

1

  comEA Staphylococcus aureus MW2

37.273

100

0.4

  comEA Staphylococcus aureus N315

36.364

100

0.39


Multiple sequence alignment