Detailed information    

In silico: bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   AAY24_RS11985 Genome accession   NZ_CP011412
Coordinates   2601292..2602803 (+) Length   503 a.a.
NCBI ID   WP_046859878.1    Uniprot ID   A0A0F7K028
Organism   Sedimenticola thiotaurini strain SIP-G1     
Function   required for natural transformation (predicted from homology)
Unclear

Genomic Context


Location: 2596292..2607803
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AAY24_RS11950 (AAY24_11950) - 2597441..2597920 (-) 480 WP_046859871.1 DUF4124 domain-containing protein -
  AAY24_RS11955 (AAY24_11955) - 2597948..2598631 (-) 684 WP_046859872.1 GGDEF domain-containing protein -
  AAY24_RS11965 (AAY24_11965) - 2599022..2600299 (-) 1278 WP_046859874.1 ammonium transporter -
  AAY24_RS11970 (AAY24_11970) glnK 2600340..2600678 (-) 339 WP_046859875.1 P-II family nitrogen regulator -
  AAY24_RS11980 (AAY24_11980) ubiK 2600970..2601227 (+) 258 WP_046859877.1 ubiquinone biosynthesis accessory factor UbiK -
  AAY24_RS11985 (AAY24_11985) comM 2601292..2602803 (+) 1512 WP_046859878.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  AAY24_RS11990 (AAY24_11990) - 2602891..2604648 (+) 1758 WP_082117256.1 sigma-54 interaction domain-containing protein -
  AAY24_RS11995 (AAY24_11995) ccoG 2604801..2606213 (+) 1413 WP_046859879.1 cytochrome c oxidase accessory protein CcoG -
  AAY24_RS12000 (AAY24_12000) - 2606401..2606856 (+) 456 WP_046859880.1 hypothetical protein -
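
The genomic context window listed above appears to be the comM coordinates extended by 5,000 bp on each side. A minimal sketch that checks this arithmetic, assuming 1-based inclusive coordinates (the helper names `parse_range` and `span_length` are hypothetical, introduced only for illustration):

```python
def parse_range(text):
    """Parse a 'start..end' coordinate string into two integers."""
    start, end = text.split("..")
    return int(start), int(end)


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


# Values copied from the record; the inclusive convention is an assumption.
gene_start, gene_end = parse_range("2601292..2602803")
win_start, win_end = parse_range("2596292..2607803")

upstream = gene_start - win_start   # flank shown before the gene
downstream = win_end - gene_end     # flank shown after the gene
gene_bp = span_length(gene_start, gene_end)

print(upstream, downstream, gene_bp)
```

Under the inclusive convention, the gene span also reproduces the 1512 bp size given in the table.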

Sequence


Protein


Length: 503 a.a.        Molecular weight: 53980.87 Da        Isoelectric Point: 8.0760

>NTDB_id=145714 AAY24_RS11985 WP_046859878.1 2601292..2602803(+) (comM) [Sedimenticola thiotaurini strain SIP-G1]
MSLATLHSRARSGIQAPLVTIEVHLANGLPALSIVGLPEMAVKESKDRVRGALLNSGFEFPPRRITINLAPADLPKEGGR
FDLAIALGILAASGQIPSQSLHQYEFIGELALSGALRPVKGVLPVALAARDAGRGLILPQECAGEAALVSQIPLFPANHL
LSVCDLLIKGDEVAPHPTAPEPPAPAQQPDLADVRGQQHAKRALEIAAAGAHSLLMIGPPGTGKSMLASRLPGILPGMTE
QEALETAAIRSISNLGFSAADWRQRPFRAPHHTASGVALVGGGSNPRPGEISLAHNGVMFLDELPEFDRRVLEVLREPLE
SGQVTISRAARQEQFPARFQLVAAMNPCPCGYLGDNSGQLCRCSSDQVARYRNRISGPLLDRIDMTIEVPRLPAQDLNSQ
PASPAEPSRQVQARVEQCRQQQMTRNGCPNSQLAGRQLEAVCQLSDESRQLITRAMDQLGLSARAYHRILRLARTIADLS
RSDTITPAHLGEAIGYRRLDRQA

Nucleotide


Length: 1512 bp

>NTDB_id=145714 AAY24_RS11985 WP_046859878.1 2601292..2602803(+) (comM) [Sedimenticola thiotaurini strain SIP-G1]
ATGTCTCTCGCCACACTGCACAGCCGTGCCCGCAGCGGTATACAGGCACCCCTGGTCACCATTGAGGTACACCTGGCCAA
TGGCCTGCCGGCGCTCTCAATCGTTGGCCTGCCGGAAATGGCGGTCAAGGAGAGCAAGGATCGGGTCCGGGGCGCCCTGC
TCAACTCCGGTTTCGAATTTCCACCCCGCCGCATCACCATCAACCTGGCCCCGGCCGATCTGCCTAAAGAGGGCGGGCGG
TTTGACCTGGCTATCGCCCTGGGCATCCTGGCCGCCTCGGGACAGATCCCGAGCCAGTCTCTCCATCAGTATGAATTCAT
CGGCGAACTGGCCCTGTCCGGGGCGCTCCGGCCGGTCAAGGGAGTACTGCCGGTGGCCCTGGCGGCCCGGGATGCCGGTC
GCGGTCTGATCCTGCCCCAGGAGTGCGCCGGTGAAGCGGCACTGGTCAGCCAGATCCCGCTGTTTCCGGCCAACCACCTG
CTGAGTGTCTGCGACCTGCTGATCAAGGGTGATGAAGTTGCTCCCCACCCAACAGCTCCCGAGCCGCCCGCTCCGGCACA
GCAGCCGGATCTGGCCGATGTGCGGGGTCAACAGCACGCCAAGCGTGCGCTGGAGATCGCCGCCGCCGGTGCCCACAGTC
TGCTGATGATCGGTCCGCCCGGCACCGGCAAGTCGATGCTCGCCTCCCGCCTGCCCGGTATCCTGCCGGGCATGACCGAA
CAGGAGGCACTGGAGACCGCCGCTATCCGCTCCATCAGCAACCTGGGCTTCTCCGCCGCAGACTGGCGCCAACGGCCGTT
TCGCGCCCCACACCACACCGCCTCCGGGGTCGCTTTGGTGGGCGGCGGCAGTAACCCGCGCCCCGGCGAGATATCCCTGG
CCCATAACGGGGTGATGTTCCTCGATGAACTGCCGGAGTTCGACCGGCGCGTGCTGGAGGTACTGCGCGAACCCCTGGAG
AGTGGTCAGGTGACCATCTCCCGGGCCGCCCGGCAGGAGCAGTTCCCGGCCCGCTTCCAGTTGGTGGCCGCCATGAACCC
CTGCCCCTGCGGTTATCTGGGGGATAACAGCGGCCAGTTGTGTCGCTGCAGCAGCGACCAGGTAGCCCGTTACCGCAATC
GAATCTCCGGTCCCTTGCTGGATCGCATCGACATGACCATCGAAGTGCCCCGTCTGCCGGCACAGGATCTGAACAGTCAA
CCCGCGTCACCGGCCGAACCCAGCCGCCAGGTACAGGCCCGGGTGGAACAGTGCCGCCAACAGCAGATGACGCGCAACGG
CTGCCCCAACAGCCAGCTGGCCGGGCGCCAGCTGGAGGCGGTCTGTCAACTCAGCGACGAATCACGCCAACTGATCACCC
GTGCCATGGATCAGCTGGGCCTGTCGGCGCGGGCCTATCACCGCATTCTGCGTCTGGCCCGCACCATTGCCGATCTGTCC
CGGTCCGATACCATCACCCCGGCTCACCTGGGTGAGGCGATTGGCTACCGTCGGCTGGACAGACAGGCGTAA
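
The two lengths reported for this entry are consistent with each other: 1512 bp corresponds to 504 codons, and the final codon of the nucleotide sequence (TAA) is a stop codon, leaving 503 residues. A small sketch of that check, assuming a complete coding sequence with a single terminal stop codon (the function name `expected_protein_length` is hypothetical):

```python
def expected_protein_length(cds_bp, has_stop=True):
    """Return the protein length implied by a CDS length in base pairs.

    Assumes the CDS is in frame and, if has_stop is True, that it ends
    with exactly one stop codon that is not translated.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop else codons


protein_length = expected_protein_length(1512)
print(protein_length)
```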


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0F7K028

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20 55.138 100 0.555
  comM Vibrio campbellii strain DS40M4 55.289 99.602 0.551
  comM Glaesserella parasuis strain SC1401 54.563 100 0.547
  comM Vibrio cholerae strain A1552 54.691 99.602 0.545
  comM Legionella pneumophila str. Paris 51.406 99.006 0.509
  comM Legionella pneumophila strain ERS1305867 51.406 99.006 0.509
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 47.461 100 0.483


Multiple sequence alignment