Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   SGN16_RS28395 Genome accession   NZ_CP146985
Coordinates   6372419..6373912 (+) Length   497 a.a.
NCBI ID   WP_210642000.1    Uniprot ID   -
Organism   Pseudomonas sp. G166     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 6367419..6378912
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SGN16_RS28370 - 6367817..6368242 (-) 426 WP_210641994.1 secondary thiamine-phosphate synthase enzyme YjbQ -
  SGN16_RS28375 - 6368456..6369790 (-) 1335 WP_210641995.1 ammonium transporter -
  SGN16_RS28380 glnK 6369824..6370162 (-) 339 WP_002555808.1 P-II family nitrogen regulator -
  SGN16_RS28385 - 6370576..6370836 (+) 261 WP_063323829.1 accessory factor UbiK family protein -
  SGN16_RS28390 - 6370855..6372132 (-) 1278 WP_335944659.1 HAMP domain-containing sensor histidine kinase -
  SGN16_RS28395 comM 6372419..6373912 (+) 1494 WP_210642000.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  SGN16_RS28400 - 6373919..6375133 (-) 1215 WP_335944660.1 aldose 1-epimerase family protein -
  SGN16_RS28405 - 6375507..6377483 (-) 1977 WP_335944661.1 methyl-accepting chemotaxis protein -
  SGN16_RS28410 - 6377671..6378591 (-) 921 WP_210642003.1 LysR substrate-binding domain-containing protein -

Sequence


Protein


Download         Length: 497 a.a.        Molecular weight: 53067.24 Da        Isoelectric Point: 8.2115

>NTDB_id=945359 SGN16_RS28395 WP_210642000.1 6372419..6373912(+) (comM) [Pseudomonas sp. G166]
MSLAIVHSRAQVGVEAPAVTVEVHLANGLPSLTMVGLPEAAVKESKDRVRSAIINSGLSFPARRITLNLAPADLPKDGGR
FDLAIALGILAASVQVPTLMLDDVECLGELALSGAVRPVRGVLPAALAARKAGRTLVVPWANAEEACLASGLKVIAVNHL
LEAVAHFNGHTPVKTFVSNGLLSASKPYPDLNEVQGQAGAKRALLIAAAGAHNLLLSGPPGTGKTLLASRLPGLLPPLAE
SEALEVAAIQSVASCVPLSHWPQRPFRQPHHSASGPALVGGGSKPQPGEITLAHHGVLFLDELPEFDRKVLEVLREPLES
GHIVVSRARDRVRFPARFQLVAAMNPCPCGYLGEPSGRCSCTPDMVQRYRNKLSGPLLDRIDLHLTVAREATALNPRHEP
GADTATVAEQVAQARERQQKRQDCANAFLDLPGLRQHCKLSAIDETWLETACERLTLSLRAAHRLLKVARTLADLERVDQ
ISREHLAEALQYRPATP

Nucleotide


Download         Length: 1494 bp        

>NTDB_id=945359 SGN16_RS28395 WP_210642000.1 6372419..6373912(+) (comM) [Pseudomonas sp. G166]
ATGTCGCTTGCCATTGTCCATAGCCGCGCTCAGGTGGGGGTGGAGGCTCCTGCCGTTACCGTTGAAGTTCACCTGGCCAA
TGGCTTGCCGTCGTTGACGATGGTCGGCTTGCCGGAGGCGGCGGTGAAGGAAAGCAAGGACCGGGTGCGCAGCGCGATCA
TCAACTCGGGCCTGAGTTTTCCGGCGCGGCGCATCACCTTGAACCTGGCGCCGGCGGATCTACCAAAGGACGGCGGTCGG
TTCGACCTGGCCATTGCCTTGGGGATCCTCGCAGCCAGCGTGCAAGTACCGACCCTGATGCTCGACGACGTTGAATGCCT
TGGCGAGTTGGCGTTATCCGGCGCCGTGCGACCGGTACGCGGGGTATTGCCGGCAGCGCTGGCGGCGCGCAAGGCCGGGC
GTACGCTGGTGGTGCCATGGGCGAATGCCGAGGAAGCCTGCTTGGCGTCGGGGCTGAAGGTGATTGCGGTGAATCATCTG
CTCGAAGCCGTGGCTCACTTTAACGGCCACACCCCGGTCAAGACGTTTGTTTCCAACGGGCTGCTTTCAGCCAGCAAGCC
CTACCCCGACCTGAATGAAGTACAAGGCCAGGCCGGGGCCAAGCGGGCGTTGCTGATCGCCGCCGCAGGGGCTCACAACC
TGTTGCTCAGCGGACCACCGGGCACCGGCAAGACCCTGCTGGCGAGCCGTTTGCCGGGGCTGCTACCGCCGTTGGCGGAA
AGTGAGGCCCTGGAGGTCGCAGCCATTCAGTCGGTCGCCAGTTGCGTGCCGTTGAGCCATTGGCCGCAGCGCCCGTTTCG
CCAGCCCCACCACTCGGCTTCCGGTCCGGCGCTGGTGGGCGGTGGATCGAAACCGCAACCAGGGGAAATCACCCTCGCCC
ACCATGGCGTACTGTTCCTGGATGAACTGCCGGAGTTCGACCGTAAGGTGCTGGAAGTCCTGAGGGAACCACTGGAGTCT
GGCCACATCGTGGTTTCTCGCGCCCGAGACCGTGTGCGCTTTCCGGCACGCTTTCAATTGGTAGCGGCGATGAATCCCTG
TCCCTGTGGATATCTTGGCGAGCCCAGCGGTCGTTGCTCATGTACACCGGACATGGTGCAGCGCTACCGCAATAAACTCT
CTGGCCCTCTGCTGGACCGCATCGACCTGCACCTGACCGTCGCCCGGGAGGCCACGGCGTTGAACCCAAGGCACGAACCC
GGTGCCGATACCGCCACCGTCGCCGAGCAAGTGGCCCAGGCCCGGGAGCGCCAGCAAAAGCGTCAGGACTGCGCCAACGC
TTTCCTGGATCTGCCGGGCTTGCGTCAGCATTGCAAATTATCCGCAATCGATGAAACGTGGCTGGAAACCGCTTGCGAGC
GCCTGACCTTATCGCTGCGAGCCGCCCACCGCCTGCTCAAGGTCGCCCGTACGTTGGCGGACCTTGAGCGGGTGGATCAA
ATCAGTCGCGAGCACCTGGCTGAAGCATTGCAGTATCGGCCCGCAACGCCCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

55.556

99.598

0.553

  comM Vibrio cholerae strain A1552

55.354

99.598

0.551

  comM Haemophilus influenzae Rd KW20

53.908

100

0.541

  comM Glaesserella parasuis strain SC1401

53

100

0.533

  comM Legionella pneumophila str. Paris

50.806

99.799

0.507

  comM Legionella pneumophila strain ERS1305867

50.806

99.799

0.507

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.23

100

0.469