Detailed information    

insolico Bioinformatically predicted

Overview


Name   nucA/comI   Type   Machinery gene
Locus tag   DA787_RS12975 Genome accession   NZ_CP033205
Coordinates   2511271..2511681 (-) Length   136 a.a.
NCBI ID   WP_009967785.1    Uniprot ID   A0A6I4D881
Organism   Bacillus subtilis strain MBI 600     
Function   cleavage of dsDNA into ssDNA (predicted from homology)   
DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 2504374..2558127 2511271..2511681 within 0


Gene organization within MGE regions


Location: 2504374..2558127
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DA787_RS12935 (DA787_12920) yqeH 2504374..2505474 (-) 1101 WP_003229966.1 ribosome biogenesis GTPase YqeH -
  DA787_RS12940 (DA787_12925) yqeG 2505478..2505996 (-) 519 WP_003226126.1 YqeG family HAD IIIA-type phosphatase -
  DA787_RS12945 (DA787_12930) - 2506358..2506498 (+) 141 WP_003226124.1 sporulation histidine kinase inhibitor Sda -
  DA787_RS12950 (DA787_12935) yqeF 2506804..2507535 (-) 732 WP_003229964.1 SGNH/GDSL hydrolase family protein -
  DA787_RS12955 (DA787_12940) cwlH 2507787..2508539 (-) 753 WP_003229963.1 N-acetylmuramoyl-L-alanine amidase CwlH -
  DA787_RS12960 (DA787_12945) yqeD 2508726..2509352 (+) 627 WP_003229962.1 TVP38/TMEM64 family protein -
  DA787_RS12965 (DA787_12950) gnd 2509371..2510264 (-) 894 WP_003229961.1 phosphogluconate dehydrogenase (NAD(+)-dependent, decarboxylating) -
  DA787_RS12970 (DA787_12955) yqeB 2510516..2511238 (+) 723 WP_010886572.1 hypothetical protein -
  DA787_RS12975 (DA787_12960) nucA/comI 2511271..2511681 (-) 411 WP_009967785.1 sporulation-specific Dnase NucB Machinery gene
  DA787_RS12980 (DA787_12965) - 2511877..2512224 (+) 348 Protein_2486 sigma-70 family RNA polymerase sigma factor -
  DA787_RS12985 (DA787_12970) spoIVCA 2512255..2513715 (-) 1461 WP_223257626.1 site-specific DNA recombinase SpoIVCA -
  DA787_RS21440 (DA787_12975) - 2513673..2513851 (-) 179 Protein_2488 hypothetical protein -
  DA787_RS12995 (DA787_12980) arsC 2514206..2514625 (-) 420 WP_004398596.1 thioredoxin-dependent arsenate reductase -
  DA787_RS13000 (DA787_12985) acr3 2514637..2515677 (-) 1041 WP_004398718.1 arsenite efflux transporter Acr3 -
  DA787_RS13005 (DA787_12990) arsK 2515700..2516140 (-) 441 WP_003229954.1 ArsI/CadI family heavy metal resistance metalloenzyme -
  DA787_RS13010 (DA787_12995) arsR 2516201..2516518 (-) 318 WP_004399122.1 arsenical resistance operon transcriptional regulator ArsR -
  DA787_RS13015 (DA787_13000) yqcI 2516890..2517654 (-) 765 WP_004398670.1 YqcI/YcgG family protein -
  DA787_RS13020 (DA787_13005) rapE 2518097..2519224 (+) 1128 WP_004398842.1 response regulator aspartate phosphatase RapE -
  DA787_RS13025 (DA787_13010) phrE 2519214..2519348 (+) 135 WP_004398770.1 phosphatase RapE inhibitor PhrE -
  DA787_RS13030 (DA787_13015) - 2519458..2519616 (+) 159 WP_003245945.1 hypothetical protein -
  DA787_RS13035 (DA787_13020) yqcG 2519986..2521581 (+) 1596 WP_004399034.1 LXG family T7SS effector endonuclease toxin YqcG -
  DA787_RS13040 (DA787_13025) yqcF 2521596..2522174 (+) 579 WP_009967790.1 type VII secretion system immunity protein YqcF -
  DA787_RS13045 (DA787_13030) - 2522292..2522438 (+) 147 WP_009967791.1 hypothetical protein -
  DA787_RS13050 (DA787_13035) - 2522435..2522797 (-) 363 WP_003229947.1 hypothetical protein -
  DA787_RS13055 (DA787_13040) - 2522813..2523292 (-) 480 WP_004399085.1 hypothetical protein -
  DA787_RS13060 (DA787_13045) cwlA 2523457..2524275 (-) 819 WP_003229946.1 N-acetylmuramoyl-L-alanine amidase CwlA -
  DA787_RS13065 (DA787_13050) skhD 2524320..2524742 (-) 423 WP_003246208.1 holin family protein -
  DA787_RS13070 (DA787_13055) xepAK 2524787..2525680 (-) 894 WP_003246010.1 hypothetical protein -
  DA787_RS13075 (DA787_13060) yqcE 2525768..2525932 (-) 165 WP_003229944.1 XkdX family protein -
  DA787_RS13080 (DA787_13065) yqcD 2525929..2526264 (-) 336 WP_009967793.1 XkdW family protein -
  DA787_RS13085 (DA787_13070) yqcC 2526274..2527374 (-) 1101 WP_003229943.1 pyocin knob domain-containing protein -
  DA787_RS13090 (DA787_13075) - 2527377..2527649 (-) 273 WP_003229942.1 hypothetical protein -
  DA787_RS13095 (DA787_13080) yqcA 2527646..2528224 (-) 579 WP_003229941.1 YmfQ family protein -
  DA787_RS13100 (DA787_13085) yqbT 2528208..2529254 (-) 1047 WP_003229940.1 baseplate J/gp47 family protein -
  DA787_RS13105 (DA787_13090) yqbS 2529247..2529672 (-) 426 WP_004398572.1 DUF2634 domain-containing protein -
  DA787_RS13110 (DA787_13095) yqbR 2529685..2529948 (-) 264 WP_003229938.1 DUF2577 family protein -
  DA787_RS13115 (DA787_13100) yqbQ 2529945..2530925 (-) 981 WP_004398524.1 hypothetical protein -
  DA787_RS13120 (DA787_13105) yqbP 2530938..2531597 (-) 660 WP_004398548.1 LysM peptidoglycan-binding domain-containing protein -
  DA787_RS13125 (DA787_13110) yqbO 2531590..2536347 (-) 4758 WP_003246092.1 phage tail tape measure protein -
  DA787_RS13130 (DA787_13115) - 2536350..2536487 (-) 138 WP_003229934.1 hypothetical protein -
  DA787_RS13135 (DA787_13120) - 2536529..2536978 (-) 450 WP_003229933.1 phage portal protein -
  DA787_RS13140 (DA787_13125) txpA 2537124..2537303 (+) 180 WP_004398662.1 type I toxin-antitoxin system toxin TxpA -
  DA787_RS13145 (DA787_13130) bsrH 2537683..2537772 (+) 90 WP_075058862.1 type I toxin-antitoxin system toxin BsrH -
  DA787_RS13150 (DA787_13135) yqbM 2538026..2538469 (-) 444 WP_003229930.1 phage tail tube protein -
  DA787_RS13155 (DA787_13140) yqbK 2538472..2539872 (-) 1401 WP_003229929.1 phage tail sheath family protein -
  DA787_RS13160 (DA787_13145) - 2539873..2540064 (-) 192 WP_010886574.1 hypothetical protein -
  DA787_RS13165 (DA787_13150) yqbJ 2540061..2540498 (-) 438 WP_003229927.1 DUF6838 family protein -
  DA787_RS13170 (DA787_13155) yqbI 2540511..2541014 (-) 504 WP_003246050.1 HK97 gp10 family phage protein -
  DA787_RS13175 (DA787_13160) yqbH 2541011..2541373 (-) 363 WP_003229925.1 YqbH/XkdH family protein -
  DA787_RS13180 (DA787_13165) gkpG 2541370..2541765 (-) 396 WP_004398566.1 DUF3199 family protein -
  DA787_RS13185 (DA787_13170) yqbF 2541769..2542080 (-) 312 WP_003229923.1 YqbF domain-containing protein -
  DA787_RS13190 (DA787_13175) skdG 2542091..2543026 (-) 936 WP_003229922.1 phage major capsid protein -
  DA787_RS13195 (DA787_13180) yqbD 2543045..2544013 (-) 969 WP_003229921.1 XkdF-like putative serine protease domain-containing protein -
  DA787_RS13200 (DA787_13185) - 2544046..2544699 (-) 654 WP_003229920.1 hypothetical protein -
  DA787_RS13205 (DA787_13190) yqbB 2544740..2545657 (-) 918 WP_004398748.1 phage head morphogenesis protein -
  DA787_RS13210 (DA787_13195) yqbA 2545654..2547186 (-) 1533 WP_004398894.1 phage portal protein -
  DA787_RS13215 (DA787_13200) stmB 2547190..2548485 (-) 1296 WP_003229917.1 PBSX family phage terminase large subunit -
  DA787_RS13220 (DA787_13205) terS 2548478..2549197 (-) 720 WP_003229916.1 phage terminase small subunit -
  DA787_RS13225 (DA787_13210) - 2549265..2549729 (-) 465 WP_004398685.1 hypothetical protein -
  DA787_RS13230 (DA787_13215) yqaQ 2549873..2550328 (-) 456 WP_004398775.1 hypothetical protein -
  DA787_RS13235 (DA787_13220) - 2550526..2551455 (+) 930 WP_003229913.1 hypothetical protein -
  DA787_RS13240 (DA787_13225) yqaO 2551529..2551735 (-) 207 WP_003229912.1 XtrA/YqaO family protein -
  DA787_RS13245 (DA787_13230) yqaN 2551817..2552245 (-) 429 WP_009967809.1 RusA family crossover junction endodeoxyribonuclease -
  DA787_RS13250 (DA787_13235) - 2552341..2552490 (-) 150 WP_003229910.1 hypothetical protein -
  DA787_RS13255 (DA787_13240) sknM 2552481..2553422 (-) 942 WP_075058863.1 ATP-binding protein -
  DA787_RS13260 (DA787_13245) yqaL 2553304..2553981 (-) 678 WP_010886575.1 DnaD domain protein -
  DA787_RS13265 (DA787_13250) recT 2554057..2554911 (-) 855 WP_003229907.1 recombinase RecT -
  DA787_RS13270 (DA787_13255) yqaJ 2554914..2555873 (-) 960 WP_004398673.1 YqaJ viral recombinase family protein -
  DA787_RS13275 (DA787_13260) - 2555979..2556173 (-) 195 WP_003229905.1 hypothetical protein -
  DA787_RS13280 (DA787_13265) - 2556133..2556306 (-) 174 WP_119123069.1 hypothetical protein -
  DA787_RS13285 (DA787_13270) sknH 2556303..2556560 (-) 258 WP_003245994.1 YqaH family protein -
  DA787_RS13290 (DA787_13275) yqaG 2556557..2557126 (-) 570 WP_004398626.1 helix-turn-helix transcriptional regulator -
  DA787_RS13295 (DA787_13280) - 2557200..2557340 (-) 141 WP_003229902.1 hypothetical protein -
  DA787_RS13300 (DA787_13285) yqaF 2557370..2557600 (-) 231 WP_004398958.1 helix-turn-helix transcriptional regulator -
  DA787_RS13305 (DA787_13290) sknR 2557777..2558127 (+) 351 WP_004398704.1 transcriptional regulator SknR -

Sequence


Protein


Download         Length: 136 a.a.        Molecular weight: 14967.97 Da        Isoelectric Point: 5.1853

>NTDB_id=322571 DA787_RS12975 WP_009967785.1 2511271..2511681(-) (nucA/comI) [Bacillus subtilis strain MBI 600]
MKKWMAGLFLAAAVLLCLMVPQQIQGASSYDKVLYFPLSRYPETGSHIRDAIAEGHPDICTIDRDGADKRREESLKGIPT
KPGYDRDEWPMAVCEEGGAGADVRYVTPSDNRGAGSWVGNQMSSYPDGTRVLFIVQ

Nucleotide


Download         Length: 411 bp        

>NTDB_id=322571 DA787_RS12975 WP_009967785.1 2511271..2511681(-) (nucA/comI) [Bacillus subtilis strain MBI 600]
ATGAAAAAATGGATGGCAGGCCTGTTTCTTGCTGCAGCAGTTCTTCTTTGTTTAATGGTTCCGCAACAGATCCAAGGCGC
ATCTTCGTATGACAAAGTGTTATATTTTCCGCTGTCTCGTTATCCGGAAACCGGCAGTCATATTAGGGATGCGATTGCAG
AGGGACATCCAGATATTTGTACCATTGACAGAGATGGAGCAGACAAAAGGCGGGAGGAATCTTTAAAGGGAATCCCGACC
AAGCCGGGCTATGACCGGGATGAGTGGCCGATGGCGGTCTGCGAGGAAGGCGGTGCAGGGGCTGATGTCCGATATGTGAC
GCCTTCTGATAATCGCGGCGCCGGCTCGTGGGTAGGGAATCAAATGAGCAGCTATCCTGACGGTACCAGAGTGCTGTTTA
TTGTGCAGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6I4D881

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  nucA/comI Bacillus subtilis subsp. subtilis str. 168

62.609

84.559

0.529


Multiple sequence alignment