Detailed information    

insolico Bioinformatically predicted

Overview


Name   nucA/comI   Type   Machinery gene
Locus tag   H70737_RS22245 Genome accession   NZ_CP009279
Coordinates   5078027..5078458 (+) Length   143 a.a.
NCBI ID   WP_042194351.1    Uniprot ID   A0A089IM67
Organism   Paenibacillus sp. FSL H7-0737     
Function   cleavage of dsDNA into ssDNA (predicted from homology)   
DNA processing

Genomic Context


Location: 5073027..5083458
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  H70737_RS31105 - 5073725..5073889 (+) 165 WP_197071245.1 hypothetical protein -
  H70737_RS22220 (H70737_22850) - 5073886..5074221 (-) 336 WP_042130171.1 YolD-like family protein -
  H70737_RS22225 (H70737_22855) - 5074604..5075164 (-) 561 WP_042190778.1 hypothetical protein -
  H70737_RS22230 - 5075282..5075770 (+) 489 WP_042190780.1 PH domain-containing protein -
  H70737_RS22235 (H70737_22865) - 5075853..5077328 (-) 1476 WP_042190782.1 glycosyl hydrolase -
  H70737_RS22240 (H70737_22870) - 5077503..5077919 (-) 417 WP_042194350.1 cytidine deaminase -
  H70737_RS22245 (H70737_22875) nucA/comI 5078027..5078458 (+) 432 WP_042194351.1 NucA/NucB deoxyribonuclease domain-containing protein Machinery gene
  H70737_RS30845 - 5078537..5078683 (-) 147 WP_156113170.1 hypothetical protein -
  H70737_RS30455 - 5078833..5079009 (+) 177 WP_076098563.1 YjfB family protein -
  H70737_RS22250 (H70737_22890) - 5079150..5079692 (+) 543 WP_231573332.1 ImmA/IrrE family metallo-endopeptidase -
  H70737_RS30850 - 5079769..5079930 (+) 162 WP_179085777.1 hypothetical protein -
  H70737_RS30855 - 5080055..5080228 (-) 174 WP_156113172.1 hypothetical protein -
  H70737_RS22255 (H70737_22905) - 5080337..5080669 (+) 333 WP_042190784.1 hypothetical protein -
  H70737_RS22260 (H70737_22910) - 5080847..5081341 (+) 495 WP_042190786.1 sigma-70 family RNA polymerase sigma factor -
  H70737_RS22265 (H70737_22915) - 5081338..5082327 (+) 990 WP_042190788.1 hypothetical protein -
  H70737_RS22270 (H70737_22920) - 5082664..5083167 (-) 504 WP_042190791.1 hypothetical protein -

Sequence


Protein


Download         Length: 143 a.a.        Molecular weight: 15851.71 Da        Isoelectric Point: 4.4729

>NTDB_id=127992 H70737_RS22245 WP_042194351.1 5078027..5078458(+) (nucA/comI) [Paenibacillus sp. FSL H7-0737]
MQKLLTSLIIVVLLALGGYWYEQNGDPAAPSSTSDSEVVQLIFPSDRYPETAKHIQDAIAKGESATCTINREQAEDNRKE
SLKGIPTKKGYDRDEWPMAMCNEGGKGADIEYITPKDNRGAGSWVGNQLEDYADGTRVEFMFK

Nucleotide


Download         Length: 432 bp        

>NTDB_id=127992 H70737_RS22245 WP_042194351.1 5078027..5078458(+) (nucA/comI) [Paenibacillus sp. FSL H7-0737]
ATACAGAAATTACTCACCAGTCTCATCATTGTTGTGTTACTTGCTTTGGGCGGTTACTGGTATGAACAGAACGGAGATCC
GGCAGCCCCATCGTCCACATCCGATTCAGAAGTTGTTCAGCTGATCTTCCCATCAGATCGTTATCCTGAGACCGCTAAGC
ACATTCAGGACGCAATTGCTAAAGGGGAGTCCGCGACGTGCACGATTAACCGTGAGCAGGCTGAAGACAACCGCAAAGAA
TCCTTAAAAGGGATCCCCACCAAAAAAGGCTACGACCGCGATGAATGGCCCATGGCCATGTGTAATGAAGGCGGCAAAGG
GGCCGACATTGAATACATAACCCCAAAAGATAACCGCGGTGCAGGAAGCTGGGTTGGCAATCAGTTAGAAGATTATGCGG
ATGGAACCCGCGTAGAATTTATGTTCAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A089IM67

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  nucA/comI Bacillus subtilis subsp. subtilis str. 168

67.961

72.028

0.49


Multiple sequence alignment