Detailed information    

In silico (bioinformatically predicted)

Overview


Name: nucA/comI
Type: Machinery gene
Locus tag: C2H94_RS02270
Genome accession: NZ_CP026034
Coordinates: 385871..386281 (-)
Length: 136 a.a.
NCBI ID: WP_009967785.1
UniProt ID: A0A6I4D881
Organism: Bacillus subtilis strain PK5_52
Function: cleavage of dsDNA into ssDNA (predicted from homology); DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type  MGE coordinates  Gene coordinates  Relative position  Distance (bp)
Prophage  386849..417345   385871..386281    flank              568
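
The reported distance is consistent with the gap between the end of the gene and the start of the prophage region. A minimal sketch of that arithmetic follows, assuming the distance is simply the difference between the nearest boundaries of the two intervals. The helper name `flank_distance` is hypothetical and is not part of VRprofile2.

```python
def flank_distance(gene, mge):
    """Return the gap in bp between two (start, end) intervals, or 0 if they overlap.

    Assumption: distance is the difference between the nearest boundaries.
    """
    gene_start, gene_end = gene
    mge_start, mge_end = mge
    if gene_end < mge_start:      # gene lies entirely before the MGE
        return mge_start - gene_end
    if mge_end < gene_start:      # gene lies entirely after the MGE
        return gene_start - mge_end
    return 0                      # intervals overlap: gene is inside the MGE


gene = (385871, 386281)       # nucA/comI coordinates from the table
prophage = (386849, 417345)   # predicted prophage coordinates from the table
print(flank_distance(gene, prophage))  # 568
```

A result of 0 would correspond to a gene located within the MGE rather than in its flank.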


Gene organization within MGE regions


Location: 385871..417345
Locus tag                    Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product  Description
C2H94_RS02270 (C2H94_02255)  nucA/comI  385871..386281 (-)    411        WP_009967785.1  sporulation-specific Dnase NucB  Machinery gene
C2H94_RS02275 (C2H94_02260)  -          386477..386827 (+)    351        Protein_452     sigma-70 family RNA polymerase sigma factor  -
C2H94_RS02280 (C2H94_02265)  -          386849..387268 (-)    420        WP_249849480.1  NUDIX domain-containing protein  -
C2H94_RS02285 (C2H94_02270)  spoIVCA    387336..388796 (-)    1461       WP_249849668.1  site-specific DNA recombinase SpoIVCA  -
C2H94_RS02290 (C2H94_02275)  -          388754..388932 (-)    179        Protein_455     hypothetical protein  -
C2H94_RS02295 (C2H94_02280)  -          389178..390992 (-)    1815       WP_249849481.1  AAA family ATPase  -
C2H94_RS02300 (C2H94_02285)  -          391994..396304 (-)    4311       WP_249849482.1  ATP-binding protein  -
C2H94_RS02305 (C2H94_02290)  rapE       396575..397702 (+)    1128       WP_249849483.1  response regulator aspartate phosphatase RapE  -
C2H94_RS02310 (C2H94_02295)  phrE       397692..397826 (+)    135        WP_014114495.1  phosphatase RapE inhibitor PhrE  -
C2H94_RS02315 (C2H94_02300)  -          397936..398095 (+)    160        Protein_460     hypothetical protein  -
C2H94_RS21080 (C2H94_02305)  -          398464..400422 (+)    1959       WP_283939287.1  T7SS effector LXG polymorphic toxin  -
C2H94_RS02330 (C2H94_02310)  -          400441..400983 (+)    543        WP_043857857.1  T6SS immunity protein Tdi1 domain-containing protein  -
C2H94_RS02335 (C2H94_02315)  -          401317..401592 (-)    276        WP_249849484.1  barstar family protein  -
C2H94_RS02340 (C2H94_02320)  -          401887..402738 (+)    852        WP_249853176.1  hypothetical protein  -
C2H94_RS02345 (C2H94_02325)  -          402755..403132 (+)    378        WP_032725168.1  DUF3139 domain-containing protein  -
C2H94_RS02350 (C2H94_02330)  cwlA       403159..403977 (-)    819        WP_249849485.1  N-acetylmuramoyl-L-alanine amidase  -
C2H94_RS02355 (C2H94_02335)  -          404022..404162 (-)    141        Protein_467     phage holin family protein  -
C2H94_RS02360 (C2H94_02340)  -          404189..404374 (-)    186        Protein_468     phage terminase large subunit  -
C2H94_RS02365 (C2H94_02345)  terS       404371..405096 (-)    726        WP_249849486.1  phage terminase small subunit  -
C2H94_RS02370 (C2H94_02350)  -          405439..405876 (-)    438        WP_119996313.1  ArpU family phage packaging/lysis transcriptional regulator  -
C2H94_RS02375 (C2H94_02355)  -          406097..406957 (-)    861        WP_119996311.1  hypothetical protein  -
C2H94_RS02380 (C2H94_02360)  -          407313..407579 (+)    267        WP_249849487.1  transcriptional regulator  -
C2H94_RS02385 (C2H94_02365)  yqaO       407718..407924 (-)    207        WP_024122227.1  XtrA/YqaO family protein  -
C2H94_RS02390 (C2H94_02370)  yqaN       408006..408434 (-)    429        WP_249849488.1  RusA family crossover junction endodeoxyribonuclease  -
C2H94_RS02395 (C2H94_02375)  -          408530..408679 (-)    150        WP_003229910.1  hypothetical protein  -
C2H94_RS02400 (C2H94_02380)  sknM       408670..409611 (-)    942        WP_249849489.1  ATP-binding protein  -
C2H94_RS02405 (C2H94_02385)  yqaL       409493..410167 (-)    675        WP_249849669.1  DnaD domain protein  -
C2H94_RS02410 (C2H94_02390)  yqaK       410244..411098 (-)    855        WP_249849490.1  recombinase RecT  -
C2H94_RS02415 (C2H94_02395)  yqaJ       411101..412060 (-)    960        WP_249849491.1  YqaJ viral recombinase family protein  -
C2H94_RS02420 (C2H94_02400)  -          412166..412360 (-)    195        WP_249849492.1  hypothetical protein  -
C2H94_RS02425                -          412320..412493 (-)    174        WP_125825444.1  hypothetical protein  -
C2H94_RS02430 (C2H94_02405)  sknH       412490..412747 (-)    258        WP_032722183.1  YqaH family protein  -
C2H94_RS02435 (C2H94_02410)  yqaG       412744..413313 (-)    570        WP_015714341.1  helix-turn-helix transcriptional regulator  -
C2H94_RS02440 (C2H94_02415)  -          413385..413525 (-)    141        WP_032679254.1  hypothetical protein  -
C2H94_RS02445 (C2H94_02420)  yqaF       413555..413785 (-)    231        WP_080332276.1  helix-turn-helix transcriptional regulator  -
C2H94_RS02450 (C2H94_02425)  sknR       413962..414312 (+)    351        WP_004398704.1  transcriptional regulator SknR  -
C2H94_RS02455 (C2H94_02430)  -          414553..414909 (+)    357        WP_249849493.1  hypothetical protein  -
C2H94_RS02460 (C2H94_02435)  aadK       415003..415857 (-)    855        WP_249849494.1  aminoglycoside 6-adenylyltransferase AadK  -
C2H94_RS02465 (C2H94_02440)  yqaB       416108..416626 (+)    519        WP_249849495.1  ImmA/IrrE family metallo-endopeptidase  -
C2H94_RS02470 (C2H94_02445)  -          416632..417024 (+)    393        Protein_490     sigma-70 family RNA polymerase sigma factor  -
C2H94_RS02475                -          417024..417140 (+)    117        WP_042976632.1  hypothetical protein  -
C2H94_RS02480 (C2H94_02450)  -          417106..417303 (-)    198        Protein_492     recombinase family protein  -

Sequence


Protein


Length: 136 a.a.; Molecular weight: 14967.97 Da; Isoelectric point: 5.1853

>NTDB_id=265441 C2H94_RS02270 WP_009967785.1 385871..386281(-) (nucA/comI) [Bacillus subtilis strain PK5_52]
MKKWMAGLFLAAAVLLCLMVPQQIQGASSYDKVLYFPLSRYPETGSHIRDAIAEGHPDICTIDRDGADKRREESLKGIPT
KPGYDRDEWPMAVCEEGGAGADVRYVTPSDNRGAGSWVGNQMSSYPDGTRVLFIVQ

Nucleotide


Length: 411 bp

>NTDB_id=265441 C2H94_RS02270 WP_009967785.1 385871..386281(-) (nucA/comI) [Bacillus subtilis strain PK5_52]
ATGAAAAAATGGATGGCAGGCCTGTTTCTTGCTGCAGCAGTTCTTCTTTGTTTAATGGTTCCGCAGCAGATCCAAGGCGC
ATCTTCGTATGACAAAGTGTTATATTTTCCGCTGTCTCGTTATCCGGAAACCGGTAGTCATATTAGAGATGCGATTGCAG
AGGGACATCCAGATATTTGTACCATTGACAGAGATGGAGCAGACAAAAGGCGGGAGGAATCTTTAAAGGGAATCCCGACC
AAGCCGGGCTATGACCGGGATGAGTGGCCGATGGCGGTCTGCGAGGAAGGCGGCGCAGGGGCTGATGTCCGATATGTGAC
GCCTTCTGATAATCGCGGCGCCGGCTCGTGGGTAGGGAATCAAATGAGCAGCTATCCTGACGGTACCAGAGTGCTGTTTA
TTGTGCAGTAG
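
The nucleotide record is given in coding orientation (it begins with ATG and ends with the stop codon TAG), so it can be checked against the protein record by direct translation. The sketch below is a minimal illustration, assuming the standard codon assignments, which bacterial translation table 11 shares for elongation. The names `CODON_TABLE` and `translate` are illustrative and do not come from the database. Only the first 30 nucleotides are translated here; the full sequence can be passed in the same way.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[dna[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt of the nucleotide record above.
prefix = "ATGAAAAAATGGATGGCAGGCCTGTTTCTT"
print(translate(prefix))  # MKKWMAGLFL

# Length check from the coordinates: 411 bp = 137 codons = 136 residues + stop.
gene_length_bp = 386281 - 385871 + 1
print(gene_length_bp, gene_length_bp // 3 - 1)  # 411 136
```

The translated prefix matches the first ten residues of the protein record, and the coordinate arithmetic agrees with the stated lengths of 411 bp and 136 a.a.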


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source        ID
AlphaFold DB  A0A6I4D881

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.


Similar proteins


Only experimentally validated proteins are listed.

Protein    Organism                                    Identities (%)  Coverage (%)  Ha-value
nucA/comI  Bacillus subtilis subsp. subtilis str. 168  62.609          84.559        0.529

