Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   UNDYM_RS00245 Genome accession   NZ_AP018441
Coordinates   51746..53269 (-) Length   507 a.a.
NCBI ID   WP_162039213.1    Uniprot ID   A0A809QVC3
Organism   Undibacterium sp. YM2     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 46746..58269
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  UNDYM_RS00230 (UNDYM_0048) - 47786..48988 (-) 1203 WP_162039210.1 DUF1501 domain-containing protein -
  UNDYM_RS00235 (UNDYM_0049) - 49008..50717 (-) 1710 WP_162039211.1 DUF1800 domain-containing protein -
  UNDYM_RS00240 (UNDYM_0050) - 50922..51665 (-) 744 WP_162039212.1 DUF2076 domain-containing protein -
  UNDYM_RS00245 (UNDYM_0051) comM 51746..53269 (-) 1524 WP_162039213.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  UNDYM_RS00250 (UNDYM_0052) - 53282..53524 (-) 243 WP_162039214.1 accessory factor UbiK family protein -
  UNDYM_RS00255 (UNDYM_0053) - 53972..54730 (+) 759 WP_162039215.1 TorF family putative porin -
  UNDYM_RS00260 (UNDYM_0054) - 54745..55083 (+) 339 WP_110254759.1 P-II family nitrogen regulator -
  UNDYM_RS00265 (UNDYM_0056) - 55104..56699 (+) 1596 WP_162039216.1 ammonium transporter -

Sequence


Protein


Download         Length: 507 a.a.        Molecular weight: 54763.54 Da        Isoelectric Point: 8.3943

>NTDB_id=69621 UNDYM_RS00245 WP_162039213.1 51746..53269(-) (comM) [Undibacterium sp. YM2]
MSLAVLKSRALNGMDAPQVTVEVHLANGLPAFTIVGLPETEVKESKDRVRAALQNARFEFPNKRITVNLAPADLPKESGR
FDLPIALGILAASGQIPGDALEQYEFAGELSLSGELRPIRGALAMTFAICKSQQHSAQEPAFILPRTNADEAALVSDAAI
YPADSLLQVCAHFAGVDGEQRLARHRPASRPSRPKYADFAEVKGQLQAKRALEVAAAGSHSVLLMGPPGTGKSMLAARFP
GILPTMTDQEALESAAVQSLTSGFSIEKWKARPYRAPHHTASAVALVGGGGTPRPGEISLAHRGVLFLDELPEFDRKVLE
VLREPLESGHITISRAARQADFPARFQLIAAMNPCPCGYFGHSNGKCRCTPDIIARYQDRISGPLLDRIDMQIQVGALPH
ADLLKQADGEASASINARVETAFERQQQRQGKANNLLSTTEIDLHCQPDAQAGQLLRNAMTKLNWSARAYHRVLKVARTI
ADLAGSAHIATPHVAEAIQYRRALRDQ

Nucleotide


Download         Length: 1524 bp        

>NTDB_id=69621 UNDYM_RS00245 WP_162039213.1 51746..53269(-) (comM) [Undibacterium sp. YM2]
ATGTCCTTAGCCGTACTCAAAAGCCGCGCCCTGAATGGCATGGATGCGCCACAGGTGACTGTGGAAGTGCATCTGGCCAA
TGGCCTGCCCGCCTTCACTATCGTTGGCCTGCCCGAAACCGAAGTCAAGGAATCCAAGGACAGGGTCAGGGCCGCCCTGC
AAAATGCCCGCTTTGAGTTCCCAAACAAGCGCATCACAGTGAATTTGGCACCTGCTGATTTGCCGAAGGAATCTGGCCGC
TTTGATTTGCCTATCGCGCTGGGTATTCTGGCCGCATCCGGACAAATCCCCGGAGATGCGCTGGAACAATATGAATTTGC
CGGTGAGCTATCCTTATCAGGCGAACTGCGGCCGATACGCGGCGCACTGGCAATGACGTTTGCGATATGCAAAAGCCAGC
AACACTCGGCACAAGAACCCGCCTTCATATTGCCACGCACGAACGCCGATGAAGCGGCGCTGGTCAGCGATGCTGCGATT
TATCCCGCTGATTCGCTGCTACAGGTATGCGCTCACTTCGCTGGCGTCGATGGTGAACAAAGACTGGCACGTCACCGCCC
TGCCTCCAGACCCAGTCGCCCCAAATATGCTGACTTTGCGGAGGTCAAAGGCCAGTTACAGGCCAAACGGGCACTGGAAG
TAGCAGCAGCAGGCTCACATTCGGTACTTCTGATGGGGCCGCCAGGCACGGGCAAATCGATGCTGGCAGCCCGCTTCCCC
GGCATCTTGCCGACCATGACAGATCAGGAAGCCCTGGAATCAGCCGCAGTACAGTCCCTGACTTCCGGTTTTTCCATAGA
GAAATGGAAGGCGAGACCATACAGGGCACCCCATCACACAGCATCTGCCGTAGCATTAGTTGGTGGCGGCGGAACACCAA
GGCCAGGAGAGATTTCTCTGGCTCACAGGGGGGTGCTCTTCCTCGACGAACTTCCGGAATTCGATAGAAAAGTGCTGGAA
GTTTTACGTGAACCTCTGGAATCTGGCCACATTACCATCTCACGCGCGGCGCGGCAGGCAGATTTCCCGGCACGCTTTCA
ATTAATAGCGGCCATGAACCCATGTCCGTGCGGCTACTTCGGGCACAGCAATGGCAAATGCCGTTGTACACCGGACATCA
TCGCCCGCTATCAGGACAGGATATCCGGCCCTTTGCTGGACCGCATAGACATGCAGATACAAGTCGGCGCACTGCCGCAT
GCGGACTTGCTCAAGCAGGCCGATGGCGAGGCCAGTGCCAGTATCAATGCGCGCGTAGAAACTGCTTTTGAGCGACAACA
ACAGCGCCAGGGCAAGGCGAATAATCTGTTATCGACCACAGAAATTGACCTGCATTGCCAACCGGATGCGCAAGCCGGGC
AACTCTTGCGCAATGCCATGACCAAGCTCAACTGGTCAGCCCGTGCCTATCACCGCGTCTTGAAAGTCGCCCGCACCATA
GCTGACCTGGCCGGTTCAGCCCATATCGCCACGCCACATGTAGCTGAGGCGATACAGTACCGGCGTGCACTTCGGGATCA
GTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A809QVC3

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

55.336

99.803

0.552

  comM Vibrio campbellii strain DS40M4

54.365

99.408

0.54

  comM Glaesserella parasuis strain SC1401

52.663

100

0.527

  comM Haemophilus influenzae Rd KW20

52.148

100

0.527

  comM Legionella pneumophila str. Paris

50.292

100

0.509

  comM Legionella pneumophila strain ERS1305867

50.292

100

0.509

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.109

100

0.467


Multiple sequence alignment