Detailed information    

in silico   Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   QUE09_RS17080 Genome accession   NZ_AP027361
Coordinates   3793354..3794874 (-) Length   506 a.a.
NCBI ID   WP_286234082.1    Uniprot ID   -
Organism   Thalassotalea sediminis strain KCTC 42588     
Function   required for natural transformation (predicted from homology)
Unclear

Genomic Context


Location: 3788354..3799874
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  QUE09_RS17060 - 3789115..3790371 (+) 1257 WP_286234078.1 glutamate-5-semialdehyde dehydrogenase -
  QUE09_RS17065 - 3790472..3790627 (-) 156 WP_286234079.1 hypothetical protein -
  QUE09_RS17070 ilvC 3790836..3792320 (-) 1485 WP_286234080.1 ketol-acid reductoisomerase -
  QUE09_RS17075 ilvY 3792471..3793349 (+) 879 WP_286234081.1 HTH-type transcriptional activator IlvY -
  QUE09_RS17080 comM 3793354..3794874 (-) 1521 WP_286234082.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  QUE09_RS17085 ilvG 3795322..3796986 (+) 1665 WP_286234083.1 acetolactate synthase 2 catalytic subunit -
  QUE09_RS17090 ilvM 3796988..3797275 (+) 288 WP_286234084.1 acetolactate synthase 2 small subunit -
  QUE09_RS17095 - 3797322..3798248 (+) 927 WP_286234085.1 branched-chain amino acid transaminase -
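
The coordinates in this record can be cross-checked with simple arithmetic. The Python sketch below assumes 1-based, inclusive coordinates (consistent with the sizes listed in the table) and a context window of 5000 bp on either side of the gene. The flank size is inferred from the numbers shown on this page rather than from any documentation, and the helper names are hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive interval start..end."""
    return end - start + 1


def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Window around a gene; flank=5000 is an assumption inferred from this record."""
    # Clamp the left edge so the window never starts before position 1.
    return (max(1, start - flank), end + flank)


comM = (3793354, 3794874)            # QUE09_RS17080, minus strand
print(span_length(*comM))            # 1521
print(context_window(*comM))         # (3788354, 3799874)
```

The same length check reproduces the other sizes in the table, for example 3789115..3790371 gives 1257 bp.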

Sequence


Protein


Length: 506 a.a.        Molecular weight: 55284.64 Da        Isoelectric Point: 6.8612

>NTDB_id=100802 QUE09_RS17080 WP_286234082.1 3793354..3794874(-) (comM) [Thalassotalea sediminis strain KCTC 42588]
MSLACVYSRARVGLTSPLVTVEVHLANGLPAFNIVGLPETSVRESKDRVRSAIINCGYEFPAKRITINLAPADLPKEGGR
FDLPIAVGILVASQQLPSDDIENYEFAGELALSGELRPIVGEIPMAMASSESKRALILSAENGEQASWVESATILPLKHL
ADLYPHLMRQQRLAIAEPNRQAVNTAIMEDDISDVIGQPLAKRALELSASGGHNLLLVGPPGTGKTMLASRLPGILPAMT
EQEGINVAAIKSISNQAVDANAWLTRPFRSPHHTASSAALIGGGSIPQPGEVTLAHHGVLFLDELPEFDRKVLDVLREPM
ESGEVTISRALQKQTYPAQFQLVAAMNPSPTGFYNDNRSTPEQVLRYLNKLSGPFLDRIDIQIEVARLPKGMWSDCVPVM
ETSAQVRFRVQKCRTIQIKRQGKANAHLTSTELRQYCVLTPDDNEFLALAVEKLGLSTRAHHKILKIARTLADMENCDAI
EHKHLIEALSYRAMDRLLKHLTTAVA

Nucleotide


Length: 1521 bp

>NTDB_id=100802 QUE09_RS17080 WP_286234082.1 3793354..3794874(-) (comM) [Thalassotalea sediminis strain KCTC 42588]
ATGTCTTTAGCGTGTGTTTATAGTCGTGCTCGTGTTGGTTTAACTTCACCATTGGTCACTGTTGAAGTACATCTAGCAAA
TGGTTTACCTGCTTTTAATATTGTAGGGTTACCTGAAACAAGTGTTCGAGAGTCTAAAGACCGAGTGCGCAGTGCTATCA
TCAATTGTGGTTATGAATTCCCCGCTAAACGTATCACGATCAATTTAGCCCCAGCTGACTTACCCAAAGAAGGGGGACGA
TTCGATCTGCCTATTGCAGTTGGAATTTTAGTTGCGTCGCAGCAATTGCCGTCAGATGATATTGAAAATTATGAGTTTGC
AGGAGAGCTTGCTTTATCAGGAGAACTTCGGCCGATTGTTGGTGAGATCCCAATGGCAATGGCGAGCAGTGAATCTAAAA
GAGCGTTAATTTTATCGGCAGAAAATGGCGAACAAGCCAGTTGGGTTGAATCAGCAACTATACTCCCGCTCAAACATCTT
GCAGATTTATACCCGCACCTGATGCGTCAACAAAGATTAGCTATTGCAGAGCCCAATCGACAAGCCGTTAATACTGCGAT
AATGGAAGACGACATTAGTGATGTTATTGGTCAACCATTAGCCAAGCGCGCTTTAGAACTGTCGGCGAGCGGTGGCCACA
ATTTATTGTTAGTAGGTCCTCCTGGCACAGGAAAAACCATGCTAGCAAGCCGATTACCGGGTATTTTACCCGCGATGACT
GAACAAGAAGGTATCAATGTTGCTGCAATCAAATCGATTAGCAACCAAGCTGTTGATGCAAATGCATGGCTTACACGCCC
GTTTCGAAGTCCACACCATACGGCTTCTTCCGCAGCACTCATTGGTGGAGGTAGTATACCTCAGCCAGGAGAGGTAACGC
TGGCGCATCATGGTGTATTATTTCTTGATGAATTGCCTGAGTTTGATCGTAAAGTGCTTGATGTGTTGCGTGAACCTATG
GAGTCTGGTGAGGTCACGATATCGCGCGCATTACAAAAACAAACGTATCCAGCGCAGTTTCAATTGGTGGCAGCGATGAA
CCCATCACCGACGGGTTTTTACAATGATAACCGCAGTACACCAGAGCAAGTGTTACGATACTTAAACAAACTATCAGGAC
CGTTTTTAGATCGTATTGATATTCAAATTGAAGTGGCGAGGTTACCAAAAGGTATGTGGTCAGACTGTGTTCCGGTAATG
GAAACCAGTGCACAAGTAAGGTTTAGGGTTCAGAAATGCCGTACCATTCAAATTAAGCGCCAAGGAAAAGCAAATGCGCA
TCTTACCAGCACAGAACTTCGACAATACTGTGTATTAACACCCGATGATAATGAGTTCTTAGCACTTGCGGTAGAAAAAC
TCGGGTTATCGACACGTGCCCATCATAAGATTTTAAAGATCGCGCGAACATTAGCCGATATGGAAAATTGTGATGCTATT
GAACATAAACATTTGATCGAAGCATTGTCTTATCGTGCAATGGATCGATTATTAAAGCATTTAACGACAGCCGTCGCCTA
G
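
The nucleotide record begins with ATG and ends with TAG, which suggests it is shown in coding orientation even though the gene lies on the minus strand; 1521 bp then corresponds to 506 codons plus one stop codon. The Python sketch below illustrates that relationship by translating only the first and last codons of the record with the standard genetic code. The `translate` helper is hypothetical and is not how the database itself derives the protein.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = "ATGTCTTTAGCGTGTGTTTATAGTCGT"   # first 9 codons of the record
tail = "ACGACAGCCGTCGCCTAG"            # last 5 codons plus the stop codon
print(translate(head))                 # MSLACVYSR
print(translate(tail))                 # TTAVA
print((1521 - 3) // 3)                 # 506
```

Both fragments agree with the start (MSLACVYSR...) and end (...TTAVA) of the protein sequence listed above.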


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20 55.512 100 0.557
  comM Vibrio cholerae strain A1552 54.743 100 0.547
  comM Glaesserella parasuis strain SC1401 54.348 100 0.543
  comM Vibrio campbellii strain DS40M4 54.257 99.802 0.541
  comM Legionella pneumophila str. Paris 46.278 98.221 0.455
  comM Legionella pneumophila strain ERS1305867 46.278 98.221 0.455
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 44.882 100 0.451


Multiple sequence alignment