Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comM   Type   Machinery gene
Locus tag   D6Z43_RS20890 Genome accession   NZ_CP032616
Coordinates   4425145..4426641 (-) Length   498 a.a.
NCBI ID   WP_120653963.1    UniProt ID   A0AAD0TFC0
Organism   Pseudomonas sp. DY-1     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake
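
The coordinates and the protein length listed above agree with each other, which can be confirmed with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function name `cds_lengths` is illustrative and not part of the database.

```python
def cds_lengths(start, end):
    """Return (nucleotide length, protein length) for a CDS span.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    # One codon is the stop codon, so it does not contribute a residue.
    return nt_len, nt_len // 3 - 1


nt_len, aa_len = cds_lengths(4425145, 4426641)
print(nt_len, aa_len)  # 1497 498
```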

Genomic Context


Location: 4420145..4431641
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  D6Z43_RS20875 (D6Z43_20875) - 4421162..4421872 (-) 711 WP_371924426.1 molecular chaperone -
  D6Z43_RS20880 (D6Z43_20880) - 4421940..4422536 (-) 597 WP_120653961.1 fimbrial protein -
  D6Z43_RS20885 (D6Z43_20885) - 4423056..4425032 (-) 1977 WP_120653962.1 choline BCCT transporter BetT -
  D6Z43_RS20890 (D6Z43_20890) comM 4425145..4426641 (-) 1497 WP_120653963.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  D6Z43_RS20895 (D6Z43_20895) - 4426680..4426943 (-) 264 WP_120653964.1 accessory factor UbiK family protein -
  D6Z43_RS20900 (D6Z43_20900) glnK 4427358..4427696 (+) 339 WP_003457590.1 P-II family nitrogen regulator -
  D6Z43_RS20905 (D6Z43_20905) - 4427731..4429050 (+) 1320 WP_120653965.1 ammonium transporter -
  D6Z43_RS20910 (D6Z43_20910) - 4429189..4429614 (+) 426 WP_120653966.1 secondary thiamine-phosphate synthase enzyme YjbQ -
  D6Z43_RS20915 (D6Z43_20915) sutA 4429691..4430005 (+) 315 WP_120653967.1 transcriptional regulator SutA -
  D6Z43_RS20920 (D6Z43_20920) - 4430185..4430889 (-) 705 WP_120653968.1 HAD family hydrolase -
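
The Location range shown for this table matches the comM coordinates extended by 5,000 bp on each side. That flank size is inferred from the numbers on this page rather than stated by the database, so the sketch below treats it as an assumption; `context_window` is a hypothetical helper name.

```python
def context_window(start, end, flank=5000):
    """Extend a gene span by `flank` bp on both sides.

    The 5000 bp default is an assumption inferred from this record.
    The lower bound is clamped to 1 so the window never leaves the replicon.
    """
    return max(1, start - flank), end + flank


print(context_window(4425145, 4426641))  # (4420145, 4431641)
```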

Sequence


Protein


Length: 498 a.a.        Molecular weight: 52951.81 Da        Isoelectric point: 7.2359

>NTDB_id=317058 D6Z43_RS20890 WP_120653963.1 4425145..4426641(-) (comM) [Pseudomonas sp. DY-1]
MSLAIVHSRAQVGVEAPAVTVEAHLANGLPSLALVGLPETAVKESKDRVRSAIQNSGFDFPPRRITLNLAPADLPKDGGR
FDLAIALGVLAASGQLPAGALDEVECLGELALSGSLRSVQGVLPAALAARAAGRTLVVPRDNAEEASLAGGLKVLAASHL
LELTAHFAGHTPLPFYQAQGLMDERPPYPDLAEVQGQVAAKRALLVAAAGSHNLLLCGPPGTGKTLLASRLPGLLPPLEE
HEALEVAAIHSVASHAPLTSWPQRPFRQPHHSASGAALVGGGSRPKPGEITLAHQGVLFLDELPEFERKVLEVLREPLES
GEIVIARAHDKLRFPARFQLVAAMNPCPCGYLGDPAGRCRCTPEQIQRYRAKLSGPLLDRIDLHLTVAREATALEPPTST
GPGSAEVAATVAQSRRVQLRRQGCANAFLDLKGLRQHCALESEDRAWLEAACERLNLSLRAAHRILKVARTLADLEQAET
IARHHLGEALQYRPETMT
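
The FASTA header used for both sequences packs several fields into one line. A minimal parsing sketch follows, assuming the field order seen in this record (database ID, locus tag, protein ID, coordinates with strand, gene name, organism); the pattern and the name `parse_header` are illustrative and may not cover every header the database produces.

```python
import re

# Pattern written against the single header shown in this record.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line):
    """Split a header line into a dict of named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=317058 D6Z43_RS20890 WP_120653963.1 "
          "4425145..4426641(-) (comM) [Pseudomonas sp. DY-1]")
info = parse_header(header)
print(info["gene"], info["strand"], info["organism"])
```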

Nucleotide


Length: 1497 bp

>NTDB_id=317058 D6Z43_RS20890 WP_120653963.1 4425145..4426641(-) (comM) [Pseudomonas sp. DY-1]
ATGTCCCTGGCCATCGTCCACAGTCGCGCGCAGGTTGGCGTCGAAGCCCCTGCCGTGACCGTCGAAGCGCACCTCGCCAA
CGGCCTGCCGTCCCTGGCCCTGGTTGGCCTTCCGGAAACCGCCGTGAAGGAAAGCAAGGACCGCGTCCGCAGCGCGATCC
AGAACAGCGGGTTCGATTTCCCTCCCCGACGCATCACCCTCAACCTCGCTCCGGCAGATCTGCCGAAAGATGGCGGCCGC
TTCGACCTCGCCATTGCCCTTGGCGTGCTCGCGGCCAGCGGCCAATTGCCGGCCGGTGCCCTGGATGAGGTGGAATGCCT
GGGTGAACTGGCGCTTTCCGGCTCGCTGCGATCTGTGCAGGGTGTACTCCCCGCCGCGCTCGCCGCTCGCGCCGCCGGGC
GGACCCTGGTGGTGCCACGGGATAATGCCGAAGAAGCCAGCCTGGCCGGCGGTCTCAAGGTGCTGGCCGCCTCGCACTTG
CTGGAGCTGACCGCGCACTTTGCCGGCCATACCCCACTGCCCTTCTACCAGGCCCAGGGCCTGATGGACGAGCGCCCGCC
CTACCCCGACCTCGCCGAAGTCCAGGGCCAGGTCGCAGCAAAGCGCGCCCTCCTGGTCGCTGCGGCTGGGTCGCACAACC
TGTTGCTTTGCGGGCCGCCCGGCACCGGCAAGACCTTGCTGGCCAGTCGCCTGCCTGGTCTGCTGCCACCGCTGGAAGAG
CATGAGGCCCTGGAAGTGGCGGCCATCCACTCGGTTGCCAGCCATGCACCGCTGACGAGCTGGCCGCAACGCCCCTTCCG
CCAGCCCCATCACAGCGCCTCCGGTGCCGCGCTGGTAGGTGGTGGTAGTCGGCCGAAGCCGGGCGAAATCACCCTGGCGC
ATCAGGGCGTGCTGTTCCTCGATGAACTGCCGGAGTTCGAGCGCAAGGTGCTGGAGGTCCTGCGAGAGCCCCTGGAAAGC
GGTGAGATCGTCATCGCCCGCGCCCATGACAAGCTGCGTTTTCCAGCGCGCTTCCAACTGGTCGCGGCAATGAACCCCTG
CCCCTGTGGATATCTCGGCGACCCGGCCGGGCGATGCCGCTGTACCCCGGAGCAGATCCAGCGCTACCGCGCCAAGTTGT
CCGGCCCGCTACTTGACCGGATCGACCTGCACCTGACAGTTGCCCGCGAAGCAACCGCCCTGGAGCCACCGACAAGCACG
GGGCCCGGCAGCGCGGAAGTTGCCGCGACGGTGGCGCAATCACGCCGCGTGCAACTCCGACGCCAGGGGTGCGCCAACGC
CTTCCTCGACCTCAAGGGCCTGCGCCAGCATTGCGCCCTGGAAAGCGAAGACCGCGCCTGGCTGGAAGCAGCCTGCGAAC
GCCTCAATCTCTCCCTGCGTGCAGCCCATCGCATCCTCAAGGTGGCTCGCACCCTAGCCGATCTGGAGCAGGCCGAAACC
ATAGCCCGCCATCACCTCGGCGAAGCCCTGCAGTACCGCCCGGAAACCATGACATGA
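
Although the gene lies on the minus strand, the nucleotide sequence above is given in coding orientation: it begins with ATG and ends with the stop codon TGA. The sketch below translates a coding sequence with the standard codon assignments (which the bacterial translation table also uses) and stops at the first stop codon. It relies only on the Python standard library, and `translate` is an illustrative helper rather than the method the database uses.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the sequence above.
print(translate("ATGTCCCTGGCCATCGTCCACAGT"))  # MSLAIVHS
```

Applied to the last four codons of the sequence (`ACCATGACATGA`), the same function returns `TMT`, matching the C-terminus of the protein record.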


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comM   Vibrio campbellii strain DS40M4   55.96   99.398   0.556
  comM   Vibrio cholerae strain A1552   55.556   99.398   0.552
  comM   Haemophilus influenzae Rd KW20   54.183   100   0.546
  comM   Glaesserella parasuis strain SC1401   54.108   100   0.542
  comM   Legionella pneumophila str. Paris   50.495   100   0.512
  comM   Legionella pneumophila strain ERS1305867   50.495   100   0.512
  RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868   45.365   100   0.462


Multiple sequence alignment