Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   EY643_RS16565 Genome accession   NZ_CP036422
Coordinates   3622431..3623921 (+) Length   496 a.a.
NCBI ID   WP_153240272.1    Uniprot ID   A0A5P9NNW3
Organism   Halioglobus maricola strain IMCC14385     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 3617431..3628921
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EY643_RS16535 (EY643_16515) - 3617893..3618231 (-) 339 WP_153240267.1 P-II family nitrogen regulator -
  EY643_RS16540 (EY643_16520) - 3618385..3619104 (-) 720 WP_170287434.1 TorF family putative porin -
  EY643_RS16545 (EY643_16525) - 3619343..3620611 (-) 1269 WP_153240269.1 ammonium transporter -
  EY643_RS16550 (EY643_16530) glnK 3620642..3620980 (-) 339 WP_133209572.1 P-II family nitrogen regulator -
  EY643_RS16555 (EY643_16535) - 3621051..3621770 (-) 720 WP_170287435.1 TorF family putative porin -
  EY643_RS16560 (EY643_16540) - 3622125..3622388 (+) 264 WP_153240271.1 accessory factor UbiK family protein -
  EY643_RS16565 (EY643_16545) comM 3622431..3623921 (+) 1491 WP_153240272.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  EY643_RS16570 (EY643_16550) - 3624056..3625306 (+) 1251 WP_153240273.1 cytochrome P450 -
  EY643_RS16575 (EY643_16555) - 3625349..3626833 (+) 1485 WP_153240274.1 carboxypeptidase M32 -

Sequence


Protein


Download         Length: 496 a.a.        Molecular weight: 52731.44 Da        Isoelectric Point: 7.7359

>NTDB_id=348830 EY643_RS16565 WP_153240272.1 3622431..3623921(+) (comM) [Halioglobus maricola strain IMCC14385]
MELSIIHSRALAGLSAPPVQVETHLSNGLPAFHIVGMPETAVRESKDRVRSAILNSHFDFPDRRITVNLAPADLPKGGGR
FDLPIALGILVASGQVPRDRLENHEFLGELALNGGLRAVSGVICAALAASASGKQLVAPEQCAGTAAAVPEVRLIGPPDL
LTLCAHLNGSVPLHPVKIPAVTASGETGSDLAEVVGQDAARRALEVAASGGHNLLLAGPPGTGKTLLASRLPGILPPPDH
DESLVILALRDFGGVARPGEALVRPFRAPHHSASAGALIGGGSPPLPGEASLAHGGVLFLDELPEFSRHCLETLREPMES
GHVTLSRARHKASYPASFQLIAAMNPCPCGYLGDPERACRCSADQIQRYTARVSGPLLDRIDLHVQVSRVAPAQLLQREH
DGEDSATVRRRVSRCRARQLERQHCVNARLHSDRLLDICTLGRGERKILEQAAQRLKLSGRAIHRTLRVALTLADMAEVD
HIHELHLAEALGYRAS

Nucleotide


Download         Length: 1491 bp        

>NTDB_id=348830 EY643_RS16565 WP_153240272.1 3622431..3623921(+) (comM) [Halioglobus maricola strain IMCC14385]
ATGGAACTTTCGATTATTCACAGCCGCGCGCTGGCGGGCCTGTCTGCACCGCCAGTACAAGTAGAAACCCACCTCTCCAA
CGGGCTGCCCGCTTTTCACATCGTGGGCATGCCAGAGACAGCAGTGCGCGAGAGCAAGGACAGGGTCCGCTCCGCCATCC
TCAATTCACATTTCGACTTTCCCGACCGTCGCATCACGGTCAACCTCGCTCCGGCTGACCTGCCCAAAGGCGGCGGTCGC
TTCGATCTGCCTATCGCCCTGGGCATCCTGGTGGCGTCTGGGCAGGTCCCCAGGGATCGCCTGGAAAATCATGAGTTTCT
GGGAGAACTCGCGCTCAACGGCGGATTGCGTGCGGTATCCGGTGTTATATGTGCAGCGCTTGCCGCCAGCGCCAGCGGTA
AACAATTGGTTGCGCCCGAGCAGTGTGCCGGAACGGCCGCCGCAGTACCCGAAGTGCGCCTCATCGGCCCACCCGACCTG
CTCACACTCTGCGCCCACCTCAACGGCAGCGTGCCACTTCACCCGGTGAAAATCCCAGCTGTAACTGCCTCCGGCGAGAC
CGGCAGTGATCTCGCGGAAGTCGTCGGCCAGGATGCGGCCCGACGCGCGCTGGAAGTGGCTGCCAGCGGCGGACACAACC
TGCTGCTCGCTGGACCTCCCGGGACAGGCAAGACACTCCTTGCCAGCCGGCTTCCGGGCATCCTGCCACCACCCGACCAC
GATGAGTCACTGGTCATTCTTGCTCTGCGTGACTTTGGTGGCGTTGCCAGGCCCGGCGAGGCACTTGTGCGCCCATTCCG
CGCACCCCACCACAGTGCCAGCGCCGGCGCCCTGATCGGCGGCGGCAGCCCTCCACTGCCAGGCGAGGCTTCCCTCGCTC
ATGGTGGAGTACTGTTTCTGGATGAGTTGCCCGAGTTTTCCCGCCACTGCCTTGAAACCCTGCGAGAGCCCATGGAATCA
GGCCATGTGACGCTGTCACGCGCCCGGCACAAAGCGAGCTACCCGGCGAGCTTCCAGCTGATCGCGGCCATGAACCCCTG
CCCCTGCGGCTATCTGGGAGACCCTGAACGCGCTTGCCGTTGCTCGGCTGACCAGATTCAGCGCTATACCGCCAGGGTGT
CGGGCCCCCTGCTGGATCGGATTGATCTCCACGTCCAGGTCTCGAGAGTGGCCCCGGCCCAGCTATTGCAGCGCGAACAC
GATGGCGAAGATTCCGCCACGGTCAGGCGCCGGGTCAGTCGTTGCCGGGCCCGCCAACTGGAGCGCCAGCACTGCGTCAA
TGCCAGGCTCCACAGTGACCGTCTACTGGACATCTGCACCCTGGGCAGGGGCGAGCGCAAGATACTGGAGCAGGCAGCAC
AGCGTCTGAAGCTCTCAGGCCGCGCCATCCATCGGACGCTCAGAGTCGCTCTTACGTTGGCCGACATGGCGGAGGTAGAC
CATATCCATGAACTTCATCTGGCAGAGGCCCTGGGCTACCGCGCGAGCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A5P9NNW3

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

50.905

100

0.51

  comM Vibrio campbellii strain DS40M4

50.605

100

0.506

  comM Haemophilus influenzae Rd KW20

49.507

100

0.506

  comM Glaesserella parasuis strain SC1401

48.031

100

0.492

  comM Legionella pneumophila str. Paris

46.787

100

0.47

  comM Legionella pneumophila strain ERS1305867

46.787

100

0.47

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

41.434

100

0.419


Multiple sequence alignment