Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   B1781_RS21380 Genome accession   NZ_CP019936
Coordinates   4521493..4523010 (+) Length   505 a.a.
NCBI ID   WP_078121627.1    Uniprot ID   -
Organism   Thiosocius teredinicola strain PMS-2146H.STBD.0c.01a     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4516493..4528010
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  B1781_RS21355 - 4517326..4518375 (+) 1050 WP_078121622.1 succinylglutamate desuccinylase/aspartoacylase family protein -
  B1781_RS21360 - 4518475..4519761 (-) 1287 WP_078121623.1 ammonium transporter -
  B1781_RS21365 glnK 4519783..4520121 (-) 339 WP_078121624.1 P-II family nitrogen regulator -
  B1781_RS21370 - 4520200..4520913 (-) 714 WP_078121625.1 TorF family putative porin -
  B1781_RS21375 - 4521198..4521446 (+) 249 WP_078121626.1 accessory factor UbiK family protein -
  B1781_RS21380 comM 4521493..4523010 (+) 1518 WP_078121627.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  B1781_RS21385 - 4523086..4524987 (+) 1902 WP_078121628.1 ATP-binding cassette domain-containing protein -
  B1781_RS21390 - 4525006..4525599 (+) 594 WP_078121629.1 hypothetical protein -
  B1781_RS21395 - 4525630..4526748 (+) 1119 WP_078121630.1 PQQ-dependent sugar dehydrogenase -
  B1781_RS21400 arfB 4526745..4527161 (+) 417 WP_408646378.1 alternative ribosome rescue aminoacyl-tRNA hydrolase ArfB -
  B1781_RS21405 - 4527174..4527923 (-) 750 WP_334223815.1 sulfite exporter TauE/SafE family protein -

Sequence


Protein


Download         Length: 505 a.a.        Molecular weight: 53705.46 Da        Isoelectric Point: 8.1416

>NTDB_id=218982 B1781_RS21380 WP_078121627.1 4521493..4523010(+) (comM) [Thiosocius teredinicola strain PMS-2146H.STBD.0c.01a]
MAFAVTHCRAAIGVDAPPVAVETHLANGLPSFNIVGLPEKAVQESRDRVRSALVNSGFDFPARRITVNLAPADIPKHGSR
FDLAIAIGILLASGQLPDKSADGYEFVGELSLAGALRSISGVLPMALATAAADTKMILPAANADEAALVSSLASYPAEHL
LGVTAHLLGATLLKEHCKPQQQSSNSPSSDLADVWGQSQAKRALEIVAAGRHNLLMVGPPGSGKSMLASRLPGILPPMTE
REALESAAVRSIAGLPFSPGTWMQRPFRAPHHTASGVALVGGGGGSHPRPGEVSLAHFGTLFLDELPEFNGKVLDVLREP
LETGKILISRAARQAEFPADFQLIAAMNPCQCGYANDPERICAGCSPERVARYQRRISGPLRDRIDIQIEVSALPRQALL
GGLKARSEDSATVRQRVCSAWEKQLERQGTANARLGHEDLQRHCSLSPAGQALLANAIDKLGLSARAFHRILRVARTIAD
LAGKESIEDQHLTEAIGYRRLDRIG

Nucleotide


Download         Length: 1518 bp        

>NTDB_id=218982 B1781_RS21380 WP_078121627.1 4521493..4523010(+) (comM) [Thiosocius teredinicola strain PMS-2146H.STBD.0c.01a]
ATGGCATTTGCAGTAACACACTGTCGCGCCGCTATTGGCGTCGACGCCCCACCGGTAGCCGTCGAGACTCATCTCGCCAA
CGGTCTGCCCAGCTTCAACATCGTCGGACTGCCCGAGAAGGCGGTACAGGAAAGCCGCGACCGCGTGCGCAGCGCGCTGG
TCAACAGCGGATTTGATTTCCCCGCCAGAAGGATCACGGTCAACCTTGCCCCTGCAGATATACCCAAGCACGGCAGCCGG
TTTGATCTGGCAATCGCGATCGGCATCCTCCTGGCGAGTGGGCAACTGCCGGATAAGTCTGCCGACGGCTATGAGTTTGT
CGGCGAGCTGAGTCTCGCCGGCGCATTGCGCAGCATCAGCGGTGTTCTGCCCATGGCGTTGGCTACTGCCGCGGCCGACA
CCAAGATGATCCTGCCCGCGGCCAATGCCGACGAAGCCGCGCTTGTCTCGTCGCTGGCCTCATACCCCGCTGAGCATCTC
CTGGGAGTTACCGCCCATCTGCTGGGCGCGACCCTTCTGAAAGAACATTGCAAACCCCAGCAACAATCCTCGAACTCACC
GTCATCGGATCTCGCCGACGTATGGGGCCAGTCGCAGGCCAAGCGTGCCCTCGAAATCGTGGCTGCCGGTCGCCACAACC
TGCTAATGGTGGGGCCACCAGGCAGCGGCAAATCAATGCTCGCCAGCCGGCTGCCCGGTATTCTGCCGCCCATGACAGAG
CGAGAAGCACTCGAGAGTGCGGCCGTCAGATCGATCGCCGGTCTGCCCTTCTCGCCGGGCACATGGATGCAACGGCCCTT
TAGGGCACCGCACCATACGGCGTCCGGAGTCGCATTGGTCGGAGGTGGCGGCGGGAGCCACCCGAGACCAGGAGAGGTCT
CGCTCGCCCACTTCGGTACGCTGTTCCTGGACGAACTGCCAGAGTTCAACGGCAAGGTGCTCGACGTACTGCGCGAGCCG
CTCGAAACCGGAAAGATCCTGATCTCGCGCGCCGCTCGCCAAGCCGAGTTTCCGGCCGATTTCCAATTGATCGCGGCGAT
GAATCCCTGCCAGTGCGGCTACGCCAACGACCCTGAACGCATTTGTGCCGGCTGCAGCCCGGAACGCGTGGCGCGCTATC
AGCGGCGCATCTCCGGTCCGTTGCGCGACCGCATCGATATCCAGATCGAAGTATCCGCCTTACCACGTCAAGCGCTGCTG
GGCGGGTTGAAAGCGCGCTCGGAAGACAGCGCAACGGTGCGCCAGCGGGTCTGCTCAGCGTGGGAAAAGCAGCTTGAGCG
CCAGGGAACCGCCAACGCTCGGCTCGGCCACGAAGACCTGCAACGGCACTGCTCATTGTCGCCCGCCGGGCAAGCCCTGC
TCGCCAACGCCATCGATAAACTCGGTCTCTCAGCGCGTGCCTTTCACCGCATCCTGCGCGTGGCGCGTACGATCGCCGAC
CTGGCGGGCAAGGAAAGTATCGAAGATCAACACCTGACCGAGGCCATCGGTTATCGCCGTTTGGATCGCATCGGCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

52.063

100

0.525

  comM Vibrio campbellii strain DS40M4

52.579

99.802

0.525

  comM Vibrio cholerae strain A1552

52.579

99.802

0.525

  comM Glaesserella parasuis strain SC1401

51.772

100

0.521

  comM Legionella pneumophila str. Paris

48

99.01

0.475

  comM Legionella pneumophila strain ERS1305867

48

99.01

0.475

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.488

100

0.448


Multiple sequence alignment