Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   G7079_RS02065 Genome accession   NZ_CP049872
Coordinates   478791..480305 (-) Length   504 a.a.
NCBI ID   WP_166055088.1    Uniprot ID   A0A6G7ZS72
Organism   Thermomonas sp. HDW16     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 473791..485305
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  G7079_RS02050 (G7079_02055) - 474592..477087 (+) 2496 WP_240906301.1 putative peptide modification system cyclase -
  G7079_RS02055 (G7079_02060) - 477154..477462 (-) 309 WP_166055084.1 NHLP-related RiPP peptide -
  G7079_RS02060 (G7079_02065) - 477549..478769 (+) 1221 WP_166055086.1 putative peptide maturation dehydrogenase -
  G7079_RS02065 (G7079_02070) comM 478791..480305 (-) 1515 WP_166055088.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  G7079_RS02070 (G7079_02075) - 480317..480580 (-) 264 WP_166057806.1 accessory factor UbiK family protein -
  G7079_RS02075 (G7079_02080) - 480779..481117 (+) 339 WP_166055090.1 P-II family nitrogen regulator -
  G7079_RS02080 (G7079_02085) - 481118..483031 (-) 1914 WP_166055092.1 tetratricopeptide repeat-containing diguanylate cyclase -
  G7079_RS02085 (G7079_02090) speE 483126..483980 (-) 855 WP_166055094.1 polyamine aminopropyltransferase -

Sequence


Protein


Download         Length: 504 a.a.        Molecular weight: 53645.47 Da        Isoelectric Point: 8.5889

>NTDB_id=428296 G7079_RS02065 WP_166055088.1 478791..480305(-) (comM) [Thermomonas sp. HDW16]
MGLALVHSRARAGVHAPAVRVEVHLAGGLPAMNIVGLPEAAVREAKDRVRAAIQCAQFEFPARRITVNLAPADLPKDGGR
YDLAIALGILAASGQLSTDALDGWEFLGELALTGELRPVDGVLAAAIATGQANRKLLVPPGNGHEAALASNVEVRTARTL
LEVCAALDARKALPHAQPLPCEEARQPDLGDVRGQAQARRALEIAAAGAHHLLFVGPPGSGKTLLASRLCGILPAPSEAE
ALEAAMIASASGRGLDPARWRQRPFRAPHHTASAVALVGGGADPRPGEISLAHHGVLFLDELPEWGRHALEVLREPLESG
HVTISRAARQCEFPARFQLVAAMNPCPCGWAGDPSGRCLCNNEQIRRYRARISGPLMERIDLHVEVPRLPAVALRHDAAA
GEASARVRERVANARDVQLARCGRTNARLGQAQTDAHCRLAAHDSALLERAVESLQLSARSLHRILRVARTIADLAGSAE
IQTPHLSEAIGYRKLERGYERKVA

Nucleotide


Download         Length: 1515 bp        

>NTDB_id=428296 G7079_RS02065 WP_166055088.1 478791..480305(-) (comM) [Thermomonas sp. HDW16]
ATGGGTCTTGCACTCGTGCACAGCCGTGCACGCGCGGGCGTGCATGCGCCTGCCGTGCGCGTGGAAGTCCACTTGGCTGG
CGGTTTGCCGGCGATGAACATCGTCGGCTTGCCGGAAGCCGCGGTGCGCGAAGCCAAGGATCGCGTGCGCGCCGCGATCC
AGTGCGCGCAGTTCGAATTCCCGGCGCGGCGGATCACCGTCAACCTGGCCCCGGCGGACCTGCCGAAGGACGGCGGCCGC
TACGACCTGGCCATCGCCCTGGGCATCCTTGCCGCCAGCGGGCAACTCTCGACCGACGCGCTGGATGGCTGGGAATTCCT
CGGCGAACTGGCCCTGACCGGTGAATTGCGCCCGGTCGATGGCGTGCTGGCGGCGGCGATCGCCACCGGGCAAGCGAACC
GCAAACTGTTGGTACCGCCGGGCAACGGACACGAAGCGGCATTGGCATCGAATGTCGAAGTCCGCACCGCGCGCACCCTG
CTCGAAGTTTGCGCAGCGCTGGATGCACGCAAGGCGCTGCCGCACGCACAACCGCTGCCTTGCGAAGAAGCGCGCCAGCC
GGATCTCGGCGATGTGCGCGGACAAGCACAGGCGCGCCGCGCGCTGGAGATCGCCGCCGCTGGCGCGCACCACCTGTTGT
TCGTCGGGCCGCCGGGTAGTGGCAAGACCCTGCTCGCCTCGCGGCTGTGCGGCATCCTGCCCGCGCCTTCGGAAGCCGAA
GCGCTGGAAGCGGCGATGATCGCCTCCGCCAGTGGCCGCGGCCTGGATCCCGCACGCTGGCGGCAACGGCCATTCCGTGC
ACCACACCATACGGCCAGTGCGGTTGCGCTCGTCGGCGGCGGTGCAGACCCGCGACCCGGCGAAATTTCCCTAGCCCACC
ACGGGGTGCTGTTTCTCGATGAACTACCCGAATGGGGCCGGCACGCGCTGGAAGTGCTACGCGAGCCGCTGGAATCCGGC
CACGTCACTATCTCGCGTGCGGCACGGCAATGCGAATTCCCGGCGCGTTTCCAGTTGGTCGCGGCGATGAACCCCTGCCC
CTGCGGGTGGGCGGGTGATCCTTCCGGCCGCTGCCTGTGCAACAACGAGCAGATCCGTCGCTATCGCGCACGCATCTCCG
GGCCGCTGATGGAACGCATCGATCTGCATGTCGAAGTACCGCGACTGCCTGCAGTAGCGTTGCGGCACGATGCCGCGGCC
GGCGAAGCGAGTGCGCGAGTGCGCGAACGTGTCGCCAATGCGCGCGATGTGCAGCTGGCGCGCTGCGGCAGGACGAACGC
GCGCCTGGGCCAGGCGCAAACCGATGCCCACTGCCGACTCGCAGCGCACGACAGCGCCCTGCTGGAGCGTGCGGTGGAAA
GCCTGCAGCTGTCGGCGCGTTCGCTGCACCGTATCCTGCGGGTGGCGCGCACCATCGCCGACCTGGCGGGATCGGCGGAG
ATCCAGACACCGCACCTCAGCGAAGCGATCGGTTACCGGAAGCTGGAGCGCGGGTATGAGCGCAAGGTAGCTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6G7ZS72

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

55.11

99.008

0.546

  comM Vibrio cholerae strain A1552

54.709

99.008

0.542

  comM Haemophilus influenzae Rd KW20

52.465

100

0.528

  comM Glaesserella parasuis strain SC1401

52.465

100

0.528

  comM Legionella pneumophila str. Paris

49.095

98.611

0.484

  comM Legionella pneumophila strain ERS1305867

49.095

98.611

0.484

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.418

100

0.438