Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   C4Q26_RS10775 Genome accession   NZ_CP026674
Coordinates   2281986..2283473 (+) Length   495 a.a.
NCBI ID   WP_023378099.1    Uniprot ID   A0A964GGS9
Organism   Pseudomonas sp. SWI44     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 2276986..2288473
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C4Q26_RS10755 (C4Q26_10755) glnK 2278074..2278412 (-) 339 WP_002555808.1 P-II family nitrogen regulator -
  C4Q26_RS10760 (C4Q26_10760) - 2278881..2279144 (+) 264 WP_023378102.1 accessory factor UbiK family protein -
  C4Q26_RS10765 (C4Q26_10765) - 2279172..2281196 (-) 2025 WP_104883072.1 DUF4034 domain-containing protein -
  C4Q26_RS10770 (C4Q26_10770) - 2281525..2281737 (-) 213 WP_023378100.1 PLDc N-terminal domain-containing protein -
  C4Q26_RS10775 (C4Q26_10775) comM 2281986..2283473 (+) 1488 WP_023378099.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  C4Q26_RS10780 (C4Q26_10780) - 2283619..2283873 (+) 255 WP_023378097.1 DUF2790 domain-containing protein -
  C4Q26_RS10785 (C4Q26_10785) - 2283921..2284292 (-) 372 WP_023378096.1 response regulator transcription factor -
  C4Q26_RS10790 (C4Q26_10790) - 2284289..2285281 (-) 993 WP_104883073.1 response regulator -

Sequence


Protein


Download         Length: 495 a.a.        Molecular weight: 53031.83 Da        Isoelectric Point: 7.9195

>NTDB_id=271573 C4Q26_RS10775 WP_023378099.1 2281986..2283473(+) (comM) [Pseudomonas sp. SWI44]
MSLALVHSRAQVGVQAPAVSVETHLANGLPNLTLVGLPETTVKESKDRVRSAIVNSGLDYPARRITQNLAPADLPKDGGR
YDLAIALGILAANGQVPVHALNDIECLGELALSGKLRPVQGILPAALAAREAGRALVVPQENAEEASLAGGLVVYAVGHL
LELVAHLNGQVVLPPYAANGLLLQSRPYPDLSEVQGQVAAKRALLLAAAGAHNLLFTGPPGTGKTLLASRLPGLLPPLDE
HEALEVAAIQSISGHTPLKSWPQRPFRHPHHSASGPALVGGGSRPQPGEITLAHHGVLFLDELPEFERRVLEVLREPLES
GEIVIARAKDKVRFPARFQLVAAMNPCPCGYLGDPSGRCRCSTEQIQRYRNKLSGPLLDRIDLHLTVAREATTLDNHPCG
DTSAKVARVVAEARERQQRRQGCANAFLDLDGLRRHCELTAADQAWLESACERLTLSLRAAHRLLKVARTLADLDRTDSI
ARAHLAEALQYRPGT

Nucleotide


Download         Length: 1488 bp        

>NTDB_id=271573 C4Q26_RS10775 WP_023378099.1 2281986..2283473(+) (comM) [Pseudomonas sp. SWI44]
ATGTCCCTAGCCCTCGTCCACAGTCGCGCCCAGGTGGGCGTGCAGGCACCTGCAGTCAGTGTCGAAACGCATCTGGCCAA
CGGCCTGCCCAATCTCACTCTGGTCGGCTTACCCGAAACCACGGTCAAGGAGAGCAAGGATCGAGTACGCAGCGCCATCG
TCAATTCCGGCCTGGACTATCCCGCCAGGCGTATCACCCAGAACCTTGCCCCAGCCGATCTGCCCAAGGATGGCGGCCGT
TATGACCTGGCCATCGCCTTGGGGATACTCGCCGCCAACGGCCAGGTACCGGTGCACGCCTTGAACGACATCGAGTGCCT
GGGTGAGCTGGCGTTGTCCGGCAAATTACGGCCTGTGCAGGGGATACTGCCGGCCGCCCTGGCAGCGCGCGAAGCAGGTC
GGGCCTTGGTGGTCCCGCAGGAAAACGCTGAAGAAGCCAGTCTGGCGGGTGGCCTGGTGGTCTACGCAGTGGGGCACCTT
CTAGAGTTGGTCGCCCACCTCAACGGCCAGGTGGTGTTGCCGCCATATGCCGCCAATGGCCTGCTGCTGCAAAGTCGGCC
CTACCCCGACTTGAGCGAAGTACAAGGCCAGGTCGCGGCCAAACGGGCACTGCTGCTGGCAGCCGCTGGCGCGCACAACC
TGCTGTTCACCGGGCCACCCGGTACCGGCAAGACCTTGCTGGCCAGTCGCTTGCCAGGGCTGCTGCCGCCGCTGGACGAG
CACGAAGCCCTGGAGGTCGCGGCTATCCAGTCGATCAGTGGGCACACGCCCCTGAAAAGCTGGCCGCAGCGTCCGTTCCG
CCACCCCCATCACTCAGCATCCGGGCCTGCGCTGGTAGGCGGCGGTAGCCGGCCGCAACCCGGCGAAATCACGCTGGCGC
ACCATGGTGTGTTGTTTCTCGATGAGTTGCCAGAGTTCGAGCGACGGGTGCTTGAGGTTCTGCGTGAACCCTTGGAGTCC
GGCGAGATCGTCATCGCCCGTGCCAAGGACAAAGTGCGCTTCCCTGCCCGTTTCCAACTGGTAGCGGCGATGAACCCCTG
CCCTTGTGGCTACCTCGGCGATCCCTCAGGCCGCTGCCGCTGCAGCACAGAGCAGATCCAGCGCTACCGCAACAAACTGT
CAGGCCCGTTGCTCGACCGCATCGACCTGCACCTGACTGTGGCGCGCGAAGCCACCACACTCGACAACCACCCCTGTGGC
GACACCAGCGCCAAGGTCGCCCGTGTCGTGGCTGAGGCCCGCGAACGGCAGCAGCGCCGCCAGGGCTGCGCGAATGCCTT
CCTCGACCTCGATGGCTTGCGCCGTCACTGCGAGCTCACTGCAGCTGACCAGGCCTGGCTGGAAAGCGCCTGCGAACGGC
TGACCCTGTCGCTACGGGCTGCGCACCGCCTACTCAAGGTCGCGCGGACGCTGGCTGACCTGGACAGAACCGACTCGATA
GCCAGGGCTCATTTGGCCGAAGCCCTCCAGTATCGCCCTGGCACCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

55.354

100

0.554

  comM Haemophilus influenzae Rd KW20

54.6

100

0.552

  comM Vibrio campbellii strain DS40M4

54.949

100

0.549

  comM Glaesserella parasuis strain SC1401

54.2

100

0.547

  comM Legionella pneumophila str. Paris

49.798

100

0.499

  comM Legionella pneumophila strain ERS1305867

49.798

100

0.499

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.614

100

0.473


Multiple sequence alignment