Detailed information    

insolico Bioinformatically predicted

Overview


Name   nucA/comI   Type   Machinery gene
Locus tag   C2H95_RS16115 Genome accession   NZ_CP026037
Coordinates   3132452..3132862 (-) Length   136 a.a.
NCBI ID   WP_009967785.1    Uniprot ID   A0A6I4D881
Organism   Bacillus subtilis strain PK5_17     
Function   cleavage of dsDNA into ssDNA (predicted from homology)   
DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 3127818..3164981 3132452..3132862 within 0


Gene organization within MGE regions


Location: 3127818..3164981
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C2H95_RS16090 (C2H95_16025) yqeF 3127985..3128716 (-) 732 WP_003229964.1 SGNH/GDSL hydrolase family protein -
  C2H95_RS16095 (C2H95_16030) cwlH 3128968..3129720 (-) 753 WP_021480067.1 N-acetylmuramoyl-L-alanine amidase CwlH -
  C2H95_RS16100 (C2H95_16035) yqeD 3129907..3130533 (+) 627 WP_046160610.1 TVP38/TMEM64 family protein -
  C2H95_RS16105 (C2H95_16040) gnd 3130553..3131446 (-) 894 WP_072175805.1 phosphogluconate dehydrogenase (NAD(+)-dependent, decarboxylating) -
  C2H95_RS16110 (C2H95_16045) yqeB 3131698..3132420 (+) 723 WP_072175806.1 hypothetical protein -
  C2H95_RS16115 (C2H95_16050) nucA/comI 3132452..3132862 (-) 411 WP_009967785.1 sporulation-specific Dnase NucB Machinery gene
  C2H95_RS16120 (C2H95_16055) - 3133058..3133408 (+) 351 Protein_3132 sigma-70 family RNA polymerase sigma factor -
  C2H95_RS16125 (C2H95_16060) - 3133430..3133849 (-) 420 WP_249849480.1 NUDIX domain-containing protein -
  C2H95_RS16130 (C2H95_16065) spoIVCA 3133917..3135377 (-) 1461 WP_249849668.1 site-specific DNA recombinase SpoIVCA -
  C2H95_RS16135 (C2H95_16070) - 3135335..3135513 (-) 179 Protein_3135 hypothetical protein -
  C2H95_RS16140 (C2H95_16075) - 3135759..3137573 (-) 1815 WP_249849481.1 AAA family ATPase -
  C2H95_RS16145 (C2H95_16080) - 3138575..3142885 (-) 4311 WP_249849482.1 ATP-binding protein -
  C2H95_RS16150 (C2H95_16085) rapE 3143156..3144283 (+) 1128 WP_249849483.1 response regulator aspartate phosphatase RapE -
  C2H95_RS16155 (C2H95_16090) phrE 3144273..3144407 (+) 135 WP_014114495.1 phosphatase RapE inhibitor PhrE -
  C2H95_RS16160 (C2H95_16095) - 3144517..3144676 (+) 160 Protein_3140 hypothetical protein -
  C2H95_RS21055 (C2H95_16100) - 3145045..3147003 (+) 1959 WP_283939287.1 T7SS effector LXG polymorphic toxin -
  C2H95_RS16175 (C2H95_16105) - 3147022..3147564 (+) 543 WP_043857857.1 T6SS immunity protein Tdi1 domain-containing protein -
  C2H95_RS16180 (C2H95_16110) - 3147898..3148173 (-) 276 WP_249849484.1 barstar family protein -
  C2H95_RS16185 (C2H95_16115) - 3148438..3149319 (+) 882 WP_032725167.1 hypothetical protein -
  C2H95_RS16190 (C2H95_16120) - 3149336..3149713 (+) 378 WP_032725168.1 DUF3139 domain-containing protein -
  C2H95_RS16195 (C2H95_16125) cwlA 3149741..3150559 (-) 819 WP_249849485.1 N-acetylmuramoyl-L-alanine amidase -
  C2H95_RS16200 (C2H95_16130) - 3150604..3150744 (-) 141 Protein_3147 phage holin family protein -
  C2H95_RS16205 (C2H95_16135) - 3150771..3150956 (-) 186 Protein_3148 phage terminase large subunit -
  C2H95_RS16210 (C2H95_16140) terS 3150953..3151678 (-) 726 WP_249849486.1 phage terminase small subunit -
  C2H95_RS16215 (C2H95_16145) - 3152021..3152458 (-) 438 WP_119996313.1 ArpU family phage packaging/lysis transcriptional regulator -
  C2H95_RS16220 (C2H95_16150) - 3152679..3153539 (-) 861 WP_119996311.1 hypothetical protein -
  C2H95_RS16225 (C2H95_16155) - 3153895..3154161 (+) 267 WP_249849487.1 transcriptional regulator -
  C2H95_RS16230 (C2H95_16160) yqaO 3154300..3154506 (-) 207 WP_024122227.1 XtrA/YqaO family protein -
  C2H95_RS16235 (C2H95_16165) yqaN 3154588..3155016 (-) 429 WP_249849488.1 RusA family crossover junction endodeoxyribonuclease -
  C2H95_RS16240 (C2H95_16170) - 3155112..3155261 (-) 150 WP_003229910.1 hypothetical protein -
  C2H95_RS16245 (C2H95_16175) sknM 3155252..3156193 (-) 942 WP_249849489.1 ATP-binding protein -
  C2H95_RS16250 (C2H95_16180) yqaL 3156075..3156749 (-) 675 WP_249849669.1 DnaD domain protein -
  C2H95_RS16255 (C2H95_16185) yqaK 3156826..3157680 (-) 855 WP_249849490.1 recombinase RecT -
  C2H95_RS16260 (C2H95_16190) yqaJ 3157683..3158642 (-) 960 WP_249849491.1 YqaJ viral recombinase family protein -
  C2H95_RS16265 (C2H95_16195) - 3158748..3158942 (-) 195 WP_249849492.1 hypothetical protein -
  C2H95_RS16270 - 3158902..3159075 (-) 174 WP_125825444.1 hypothetical protein -
  C2H95_RS16275 (C2H95_16200) sknH 3159072..3159329 (-) 258 WP_032722183.1 YqaH family protein -
  C2H95_RS16280 (C2H95_16205) yqaG 3159326..3159895 (-) 570 WP_015714341.1 helix-turn-helix transcriptional regulator -
  C2H95_RS16285 (C2H95_16210) - 3159967..3160107 (-) 141 WP_032679254.1 hypothetical protein -
  C2H95_RS16290 (C2H95_16215) yqaF 3160137..3160367 (-) 231 WP_080332276.1 helix-turn-helix transcriptional regulator -
  C2H95_RS16295 (C2H95_16220) sknR 3160544..3160894 (+) 351 WP_004398704.1 transcriptional regulator SknR -
  C2H95_RS16300 (C2H95_16225) - 3161135..3161491 (+) 357 WP_249849493.1 hypothetical protein -
  C2H95_RS16305 (C2H95_16230) aadK 3161585..3162439 (-) 855 WP_249849494.1 aminoglycoside 6-adenylyltransferase AadK -
  C2H95_RS16310 (C2H95_16235) yqaB 3162690..3163208 (+) 519 WP_249849495.1 ImmA/IrrE family metallo-endopeptidase -
  C2H95_RS16315 (C2H95_16240) - 3163214..3163606 (+) 393 Protein_3170 sigma-70 family RNA polymerase sigma factor -
  C2H95_RS16320 - 3163606..3163722 (+) 117 WP_042976632.1 hypothetical protein -
  C2H95_RS16325 (C2H95_16245) - 3163688..3163885 (-) 198 Protein_3172 recombinase family protein -
  C2H95_RS16330 (C2H95_16250) - 3163843..3164021 (-) 179 Protein_3173 hypothetical protein -
  C2H95_RS16335 (C2H95_16255) psiE 3164565..3164981 (-) 417 WP_003246154.1 phosphate-starvation-inducible protein PsiE -

Sequence


Protein


Download         Length: 136 a.a.        Molecular weight: 14967.97 Da        Isoelectric Point: 5.1853

>NTDB_id=265726 C2H95_RS16115 WP_009967785.1 3132452..3132862(-) (nucA/comI) [Bacillus subtilis strain PK5_17]
MKKWMAGLFLAAAVLLCLMVPQQIQGASSYDKVLYFPLSRYPETGSHIRDAIAEGHPDICTIDRDGADKRREESLKGIPT
KPGYDRDEWPMAVCEEGGAGADVRYVTPSDNRGAGSWVGNQMSSYPDGTRVLFIVQ

Nucleotide


Download         Length: 411 bp        

>NTDB_id=265726 C2H95_RS16115 WP_009967785.1 3132452..3132862(-) (nucA/comI) [Bacillus subtilis strain PK5_17]
ATGAAAAAATGGATGGCAGGCCTGTTTCTTGCTGCAGCAGTTCTTCTTTGTTTAATGGTTCCGCAGCAGATCCAAGGCGC
ATCTTCGTATGACAAAGTGTTATATTTTCCGCTGTCTCGTTATCCGGAAACCGGTAGTCATATTAGAGATGCGATTGCAG
AGGGACATCCAGATATTTGTACCATTGACAGAGATGGAGCAGACAAAAGGCGGGAGGAATCTTTAAAGGGAATCCCGACC
AAGCCGGGCTATGACCGGGATGAGTGGCCGATGGCGGTCTGCGAGGAAGGCGGCGCAGGGGCTGATGTCCGATATGTGAC
GCCTTCTGATAATCGCGGCGCCGGCTCGTGGGTAGGGAATCAAATGAGCAGCTATCCTGACGGTACCAGAGTGCTGTTTA
TTGTGCAGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6I4D881

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  nucA/comI Bacillus subtilis subsp. subtilis str. 168

62.609

84.559

0.529


Multiple sequence alignment