Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   B195_RS00365 Genome accession   NZ_CP017432
Coordinates   80595..82088 (+) Length   497 a.a.
NCBI ID   WP_003437864.1    Uniprot ID   -
Organism   Pseudomonas sp. Lz4W     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 75595..87088
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  B195_RS00345 (B195_000335) - 76353..77900 (+) 1548 WP_003437873.1 SUMF1/EgtB/PvdO family nonheme iron enzyme -
  B195_RS00350 (B195_000340) - 77933..78646 (+) 714 WP_003437870.1 hypothetical protein -
  B195_RS00355 (B195_000345) - 78651..79154 (+) 504 WP_003437868.1 hypothetical protein -
  B195_RS00360 (B195_000350) - 79216..80463 (+) 1248 WP_003437866.1 VWA domain-containing protein -
  B195_RS00365 (B195_000355) comM 80595..82088 (+) 1494 WP_003437864.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  B195_RS00370 (B195_000360) - 82099..83019 (-) 921 WP_003437862.1 LysR substrate-binding domain-containing protein -
  B195_RS00375 (B195_000365) - 83172..84548 (+) 1377 WP_003437859.1 NorM family multidrug efflux MATE transporter -
  B195_RS00380 (B195_000370) - 84613..86172 (-) 1560 WP_010656748.1 EAL domain-containing protein -

Sequence


Protein


Download         Length: 497 a.a.        Molecular weight: 52763.05 Da        Isoelectric Point: 8.4333

>NTDB_id=198080 B195_RS00365 WP_003437864.1 80595..82088(+) (comM) [Pseudomonas sp. Lz4W]
MSLAIVYSRARVGVEAPAVTVETHLANGLPALTLVGLPETAVKESKDRVRSAILNSGFEFPARRITLNLAPADLPKDGGR
FDLAIALGILAASGQIPAVSLAQVECLGELALSGAIRSVQGVLPAALAARAAGRALLVPLANAEEACLASGLTVIAVEHL
LQAVAHFAGRTVLEPYGASGLLRESMPYPDLSDVQGQLSAKRALVIAAAGSHNLLFTGPPGTGKTLLASRLPGLLPPLNE
QEALEVAAIQSVASHVPLKCWPQRPFRQPHHSASGPALVGGGSKPQPGEITLAHHGVLFLDELPEFDRKVLEVLREPMES
GFIVIARARDRMRFPARFQLVAAMNPCPCGYLGEPTGRCRCTPEHIQRYRNKLSGPLLDRIDLHLTVARETTSLNPALAC
GLTTASAAASVAGARERQLQRQGCANAFLDLPGVRSQCRLSAVDNTWLETACERMGLSLRAAHRLLKVARTLADLEQVDA
IERKHLAEALQYRPVSG

Nucleotide


Download         Length: 1494 bp        

>NTDB_id=198080 B195_RS00365 WP_003437864.1 80595..82088(+) (comM) [Pseudomonas sp. Lz4W]
ATGTCGCTGGCCATCGTTTACAGCCGAGCCAGAGTGGGCGTCGAGGCCCCGGCCGTTACGGTCGAAACGCATCTGGCCAA
CGGGTTGCCGGCACTGACCCTGGTCGGGCTGCCGGAAACCGCAGTCAAGGAGAGCAAGGACCGGGTACGCAGCGCGATTC
TAAACTCCGGGTTTGAGTTTCCGGCCCGGCGTATCACCCTCAATCTGGCGCCTGCCGACCTGCCCAAGGACGGCGGGCGT
TTTGACCTGGCCATTGCTCTCGGCATCCTGGCTGCCAGCGGGCAAATCCCCGCTGTATCCCTGGCCCAGGTCGAGTGTCT
CGGTGAACTGGCCCTGTCGGGCGCCATCAGGTCGGTGCAGGGCGTACTGCCTGCCGCCCTGGCAGCCCGTGCTGCGGGCC
GGGCTTTGCTGGTGCCGCTGGCCAATGCCGAGGAAGCCTGTCTGGCCTCCGGCCTGACGGTCATCGCGGTCGAACACTTG
TTGCAGGCCGTGGCGCACTTCGCCGGCCGTACAGTGCTTGAACCTTACGGCGCCAGTGGCTTGCTGCGCGAGAGCATGCC
TTATCCGGACTTGAGCGATGTACAGGGCCAGCTCTCGGCCAAGCGGGCGTTAGTGATTGCCGCTGCTGGCAGCCACAATT
TGTTGTTCACCGGGCCGCCCGGCACGGGCAAGACCCTGCTGGCCAGCCGCCTTCCGGGGTTATTGCCGCCCCTCAACGAG
CAAGAGGCGCTGGAGGTGGCGGCCATTCAATCGGTGGCCAGTCATGTGCCCCTCAAGTGCTGGCCGCAGCGTCCGTTTCG
CCAGCCGCACCATTCGGCTTCCGGCCCGGCGCTGGTTGGCGGCGGCTCAAAACCCCAGCCGGGCGAGATCACTTTGGCGC
ACCACGGGGTGTTGTTTCTGGATGAACTGCCCGAATTCGATCGCAAAGTGCTGGAGGTGCTCAGAGAACCGATGGAATCG
GGATTTATCGTGATCGCCCGGGCCCGGGATCGAATGCGGTTTCCGGCGCGCTTCCAGCTGGTGGCGGCAATGAACCCCTG
CCCCTGTGGCTATCTGGGAGAGCCCACGGGCCGTTGTCGCTGTACCCCGGAGCACATCCAGCGCTACCGCAACAAGCTGT
CCGGGCCCTTGCTCGATCGCATCGACCTGCACCTGACCGTGGCCCGCGAAACCACCTCGCTCAACCCCGCTCTTGCCTGC
GGCTTGACCACGGCCAGCGCCGCTGCGTCAGTGGCTGGTGCCCGTGAACGGCAGCTGCAGCGCCAGGGCTGTGCCAATGC
TTTTCTGGACCTGCCCGGGGTGCGGAGCCAGTGCAGGCTCAGTGCTGTGGATAACACCTGGCTGGAGACGGCCTGCGAGC
GCATGGGCCTGTCGCTGCGGGCCGCTCACCGGTTGTTGAAAGTTGCACGAACCCTGGCCGATCTGGAGCAAGTGGACGCT
ATAGAGCGCAAGCATCTGGCGGAGGCATTGCAGTACCGGCCGGTGAGCGGGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

55.758

99.598

0.555

  comM Vibrio cholerae strain A1552

55.556

99.598

0.553

  comM Haemophilus influenzae Rd KW20

54.781

100

0.553

  comM Glaesserella parasuis strain SC1401

54.291

100

0.547

  comM Legionella pneumophila str. Paris

49.597

99.799

0.495

  comM Legionella pneumophila strain ERS1305867

49.597

99.799

0.495

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.95

100

0.457


Multiple sequence alignment