Detailed information    

in silico (bioinformatically predicted)

Overview


Name: comM
Type: Machinery gene
Locus tag: U6037_RS28225
Genome accession: NZ_CP140706
Coordinates: 6344317..6345810 (+)
Length: 497 a.a.
NCBI ID: WP_322845234.1
UniProt ID: -
Organism: Pseudomonas sp. B33.4
Function: ssDNA binding (predicted from homology)
DNA processing

Genomic Context


Location: 6339317..6350810
Locus tag  Gene name  Coordinates (strand)  Size (bp)  Protein ID  Product  Description
  U6037_RS28200  sutA  6340066..6340392 (-)  327  WP_007920420.1  transcriptional regulator SutA  -
  U6037_RS28205  -  6340495..6340920 (-)  426  WP_102901435.1  secondary thiamine-phosphate synthase enzyme YjbQ  -
  U6037_RS28210  -  6341136..6342473 (-)  1338  WP_016985874.1  ammonium transporter  -
  U6037_RS28215  glnK  6342512..6342850 (-)  339  WP_002555808.1  P-II family nitrogen regulator  -
  U6037_RS28220  -  6343264..6343524 (+)  261  WP_007920416.1  accessory factor UbiK family protein  -
  U6037_RS28225  comM  6344317..6345810 (+)  1494  WP_322845234.1  YifB family Mg chelatase-like AAA ATPase  Machinery gene
  U6037_RS28230  -  6346112..6347284 (+)  1173  WP_322845235.1  hypothetical protein  -
  U6037_RS28235  -  6347358..6347900 (+)  543  WP_322845236.1  adenylyl-sulfate kinase  -
  U6037_RS28240  -  6347903..6348652 (+)  750  WP_064116408.1  class I SAM-dependent methyltransferase  -
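
The Size (bp) values in this table are consistent with 1-based, inclusive coordinates (size = end - start + 1), and the Location window matches the comM coordinates extended by 5,000 bp on each side. The following is a minimal Python sketch of that arithmetic. The function names and the 5,000 bp flank are assumptions inferred from the numbers on this page, not a documented rule of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def flanking_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a range by `flank` bp on each side (flank size is an assumption)."""
    return max(1, start - flank), end + flank


# comM coordinates as listed in this record
comm_start, comm_end = 6344317, 6345810

gene_bp = span_length(comm_start, comm_end)
# One codon is the stop codon, so it does not contribute a residue.
protein_aa = gene_bp // 3 - 1
window = flanking_window(comm_start, comm_end)

print(gene_bp, protein_aa, window)
```

With the values from this record, the sketch yields 1494 bp, 497 a.a., and the window 6339317..6350810, matching the figures shown on the page.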

Sequence


Protein


Length: 497 a.a.        Molecular weight: 52965.14 Da        Isoelectric point: 8.2051

>NTDB_id=915054 U6037_RS28225 WP_322845234.1 6344317..6345810(+) (comM) [Pseudomonas sp. B33.4]
MSLSIVHSRAQIGVEAPAVTVEVHLANGLPSLTMVGLPEAAVKESKDRVRSAIINSGLQFPARRITLNLAPADLPKDGGR
FDLAIALGILSASVQVPCLTLDDVECLGELALSGAVRAVRGVLPAALAARKAGRALVVPRANAEEACLASGLKVFAVDHL
LEAVAHFNGHTPVEPYVSDGLMHASKPYPDLNEVQGQLAAKRALLIAAAGAHNLLFSGPPGTGKTLLASRLPGLLPPLVE
SEALEVAAIQSVASGAPLTHWPQRPFRQPHHSASGPALVGGSSKPQPGEITLAHHGVLFLDELPEFDRKVLEVLREPLES
GFIVIARAKDRVRFPARFQLVAAMNPCPCGYLGEPSGKCSCTPDMVQRYRNKLSGPLLDRIDLHLTVAREATALNPAVKP
GEDSASAAALVAEARERQQKRQGCANAFLDLPGLRRHCKLSTADEAWLESACERLTLSLRSAHRLLKVARTLADLSQEKD
IKREHLAEALQYRPATQ
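
The FASTA header above packs several fields into one line. As a rough illustration, it can be split with a regular expression. This is a sketch that assumes every header follows the layout seen in this record; the field names (`ntdb_id`, `locus_tag`, and so on) are illustrative labels chosen here, not names defined by the database.

```python
import re

# Pattern modeled on the single header shown in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a FASTA header of the form used in this record into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header layout: {line!r}")
    record = match.groupdict()
    # Coordinates are more useful as integers than as strings.
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (
    ">NTDB_id=915054 U6037_RS28225 WP_322845234.1 "
    "6344317..6345810(+) (comM) [Pseudomonas sp. B33.4]"
)
record = parse_header(header)
print(record)
```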

Nucleotide


Length: 1494 bp

>NTDB_id=915054 U6037_RS28225 WP_322845234.1 6344317..6345810(+) (comM) [Pseudomonas sp. B33.4]
ATGTCCCTCTCCATCGTCCACAGTCGCGCCCAGATTGGCGTGGAAGCCCCCGCTGTCACCGTCGAAGTCCATCTGGCCAA
TGGTCTGCCATCGCTGACCATGGTCGGCCTGCCCGAGGCGGCGGTGAAGGAGAGCAAGGATCGGGTGCGCAGCGCGATCA
TCAATTCCGGGCTGCAGTTTCCGGCGCGGCGGATCACCTTGAATCTGGCGCCGGCCGATCTGCCCAAGGATGGCGGGCGG
TTTGATCTGGCGATTGCCTTGGGGATTCTCTCGGCGAGTGTGCAGGTGCCGTGTCTGACGCTGGATGATGTGGAATGTCT
GGGTGAATTGGCGTTGTCCGGCGCAGTGCGAGCGGTGCGTGGCGTGTTGCCGGCGGCACTGGCGGCGCGCAAGGCTGGAC
GGGCACTGGTGGTGCCGCGGGCGAATGCCGAGGAGGCGTGCCTGGCTTCGGGGTTGAAGGTGTTTGCGGTGGATCATTTG
CTGGAGGCGGTGGCGCACTTCAATGGGCATACGCCGGTTGAGCCCTATGTGTCCGACGGGTTGATGCACGCCAGCAAGCC
TTATCCCGACTTGAATGAAGTGCAGGGGCAACTGGCGGCAAAACGGGCGCTGCTGATTGCTGCTGCGGGTGCGCATAACT
TGTTGTTCAGCGGGCCGCCGGGAACCGGCAAAACCTTGTTGGCCAGCCGGTTACCGGGACTGTTGCCACCGTTGGTTGAG
AGTGAAGCGCTGGAAGTCGCGGCGATTCAGTCAGTCGCCAGCGGTGCGCCGCTGACCCATTGGCCGCAGCGTCCGTTCCG
CCAACCGCACCACTCGGCATCCGGGCCGGCACTGGTCGGTGGCAGTTCAAAACCGCAACCCGGCGAAATCACCCTGGCCC
ACCACGGCGTGCTGTTCCTCGATGAGCTGCCGGAGTTTGATCGCAAGGTGCTGGAGGTTTTACGCGAGCCTTTGGAGTCC
GGGTTCATCGTGATCGCCAGGGCCAAGGACCGCGTGCGTTTCCCCGCGCGCTTTCAGTTGGTGGCGGCGATGAACCCTTG
CCCCTGTGGATATCTTGGTGAACCCAGTGGCAAGTGCTCGTGCACGCCGGATATGGTTCAGCGTTATCGCAACAAGTTGT
CGGGCCCCCTGTTGGACCGGATCGACTTACACCTGACGGTAGCGCGGGAGGCGACAGCGCTGAACCCTGCGGTAAAGCCA
GGAGAAGACAGCGCCAGTGCGGCCGCGTTGGTAGCCGAGGCCCGTGAACGACAACAGAAACGCCAGGGATGTGCCAATGC
GTTTCTTGATCTGCCAGGCCTGCGTCGCCACTGCAAGTTATCCACAGCCGATGAGGCCTGGCTGGAATCAGCCTGTGAAC
GGCTGACCCTGTCACTGCGCTCGGCGCACCGGTTGCTCAAGGTCGCCAGAACGTTGGCCGATCTGAGCCAGGAAAAAGAC
ATTAAACGCGAACACCTGGCGGAGGCTTTGCAGTATCGGCCGGCAACGCAATGA
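
The nucleotide record can be checked against the protein record by translating it codon by codon. The sketch below uses the standard genetic code, whose amino acid assignments are the same as in the bacterial table (NCBI table 11), and translates only the first 30 and last 18 nucleotides copied from the sequence above. The helper name `translate` is illustrative, and the sketch is not the pipeline the database itself uses.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


first_30_nt = "ATGTCCCTCTCCATCGTCCACAGTCGCGCC"  # start of the record above
last_18_nt = "CGGCCGGCAACGCAATGA"               # end of the record above

print(translate(first_30_nt))
print(translate(last_18_nt))
```

The two fragments translate to MSLSIVHSRA and RPATQ, which agree with the first ten and last five residues of the protein sequence listed in this record.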


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism  Identities (%)  Coverage (%)  Ha-value
  comM  Vibrio cholerae strain A1552  54.747  99.598  0.545
  comM  Vibrio campbellii strain DS40M4  53.939  99.598  0.537
  comM  Haemophilus influenzae Rd KW20  53.307  100  0.535
  comM  Glaesserella parasuis strain SC1401  52.8  100  0.531
  comM  Legionella pneumophila str. Paris  50.4  100  0.507
  comM  Legionella pneumophila strain ERS1305867  50.4  100  0.507
  RA0C_RS07335  Riemerella anatipestifer ATCC 11845 = DSM 15868  45.6  100  0.459