Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comM
Type: Machinery gene
Locus tag: LOY31_RS28310
Genome accession: NZ_CP087192
Coordinates: 6255082..6256575 (+)
Length: 497 a.a.
NCBI ID: WP_085612066.1
UniProt ID: -
Organism: Pseudomonas sp. B21-021
Function: ssDNA binding (predicted from homology); DNA processing
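
The coordinates and lengths reported for this gene can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in exactly one stop codon; the function names `span_length` and `protein_length` are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino-acid count for a CDS assumed to end in one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_bp // 3 - 1


cds_bp = span_length(6255082, 6256575)
print(cds_bp, protein_length(cds_bp))  # prints "1494 497"
```

Both values agree with the 1494 bp nucleotide length and the 497 a.a. protein length listed in this record.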

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) predicted in the genome by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type        MGE coordinates   Gene coordinates  Relative position  Distance (bp)
Genomic island  6253319..6262294  6255082..6256575  within             0
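
The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate ranges. The sketch below is one plausible way to do so, not the method used by VRprofile2 or this database: it assumes 1-based inclusive coordinates, and the function name `relate` and the labels "upstream", "downstream", and "overlapping" are hypothetical (only "within" appears in the record).

```python
def relate(gene: tuple[int, int], mge: tuple[int, int]) -> tuple[str, int]:
    """Classify a gene's position relative to an MGE and give the gap in bp.

    The labels other than "within" are illustrative assumptions.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        # Gene lies entirely before the MGE on the reference coordinates.
        return "upstream", m_start - g_end
    if g_start > m_end:
        # Gene lies entirely after the MGE on the reference coordinates.
        return "downstream", g_start - m_end
    # Partial overlap of the two intervals.
    return "overlapping", 0


print(relate((6255082, 6256575), (6253319, 6262294)))  # prints "('within', 0)"
```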


Gene organization within MGE regions


Location: 6253319..6262294
Locus tag                    Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                           Description
LOY31_RS28300 (LOY31_28280)  -          6253319..6253579 (+)  261        WP_007920416.1  accessory factor UbiK family protein              -
LOY31_RS28305 (LOY31_28285)  -          6254339..6254593 (-)  255        WP_085606077.1  HigA family addiction module antitoxin            -
LOY31_RS28310 (LOY31_28290)  comM       6255082..6256575 (+)  1494       WP_085612066.1  YifB family Mg chelatase-like AAA ATPase          Machinery gene
LOY31_RS28315 (LOY31_28295)  -          6256681..6258657 (-)  1977       WP_258712607.1  methyl-accepting chemotaxis protein               -
LOY31_RS28320 (LOY31_28300)  -          6258841..6259761 (-)  921        WP_127800610.1  LysR substrate-binding domain-containing protein  -
LOY31_RS28325 (LOY31_28305)  -          6259919..6261313 (+)  1395       WP_258712608.1  NorM family multidrug efflux MATE transporter     -
LOY31_RS28330 (LOY31_28310)  -          6261332..6262294 (-)  963        WP_064378582.1  IS110 family transposase                          -

Sequence


Protein


Length: 497 a.a.; Molecular weight: 52854.02 Da; Isoelectric point: 8.2051

>NTDB_id=627775 LOY31_RS28310 WP_085612066.1 6255082..6256575(+) (comM) [Pseudomonas sp. B21-021]
MSLAIVHSRAQIGVDAPAVTVEVHLANGLPSLTMVGLPEAAVKESKDRVRSAIINSGLQFPARRITLNLAPADLPKDGGR
FDLAIALGILSASVQVPTLTLDDVECLGELALSGAVRAVRGVLPAALAARKAGRTLVVPRANAEEACLASGLRVIAVDHL
LEAVAHFNGHTPVEPYVSDGLIHAAKPYPDLNEVQGQLAAKRALLIAAAGAHNLLFSGPPGTGKTLLASRLPGLLPPLAE
SEALEVAAIQSVASGAPLTHWPQRPFRQPHHSASGPALVGGSSKPQPGEITLAHHGVLFLDELPEFDRKVLEVLREPLES
GCIVIARAKERVRFPARFQLVAAMNPCPCGYLGEPSGKCSCTPDMVQRYRNKLSGPLLDRIDLHLTVAREATALNPAVKP
GEDSASAATLVAEARERQQKRQGCANAFLDLPGLKKHCKLSTADETWLETACERLTLSLRSAHRLLKVARTLADLEKRAD
ISREHLAEALQYRPATQ

Nucleotide


Length: 1494 bp

>NTDB_id=627775 LOY31_RS28310 WP_085612066.1 6255082..6256575(+) (comM) [Pseudomonas sp. B21-021]
ATGTCCCTCGCCATCGTCCACAGCCGCGCCCAGATAGGCGTGGATGCTCCCGCCGTCACCGTCGAAGTTCACCTGGCCAA
CGGTTTGCCCTCGCTGACCATGGTCGGCTTGCCCGAAGCGGCGGTGAAGGAGAGCAAGGACCGCGTGCGCAGTGCGATCA
TCAATTCCGGGCTGCAGTTTCCGGCGCGGCGGATCACGTTGAATCTGGCGCCGGCGGATCTGCCCAAGGATGGCGGGCGG
TTCGATCTGGCGATTGCGCTGGGGATTCTGTCGGCGAGTGTGCAGGTGCCGACGTTGACGCTGGATGACGTGGAGTGCCT
TGGCGAACTGGCGTTGTCCGGCGCCGTGCGGGCGGTGCGCGGGGTGTTGCCGGCGGCACTGGCAGCGCGCAAGGCCGGGC
GGACGCTGGTGGTGCCGCGGGCGAATGCCGAGGAGGCTTGTCTGGCTTCAGGACTCAGGGTGATCGCGGTGGATCATCTG
CTGGAGGCCGTGGCGCATTTCAATGGGCATACACCGGTCGAGCCTTATGTCTCGGACGGGCTGATCCATGCCGCGAAACC
CTATCCCGACCTGAATGAAGTCCAAGGCCAACTGGCGGCCAAACGGGCGCTGCTGATTGCCGCCGCCGGTGCCCACAACC
TACTGTTCAGCGGGCCACCGGGAACGGGCAAGACGTTGCTGGCCAGTCGATTACCGGGGTTGCTGCCGCCGCTGGCCGAG
AGTGAAGCGCTGGAAGTGGCGGCGATTCAATCCGTCGCCAGCGGCGCGCCGTTGACCCATTGGCCGCAGCGCCCGTTTCG
CCAGCCGCATCACTCGGCCTCTGGCCCGGCACTGGTCGGTGGCAGCTCAAAACCTCAACCCGGCGAAATCACCCTCGCCC
ACCACGGTGTGCTGTTTCTCGATGAGCTGCCCGAGTTTGATCGCAAGGTGCTCGAGGTGCTGCGCGAACCGCTGGAATCC
GGCTGCATCGTGATCGCCCGTGCCAAGGAACGGGTGCGATTTCCCGCAAGATTCCAGTTGGTGGCGGCGATGAATCCCTG
TCCCTGTGGATATCTTGGCGAACCGAGCGGCAAGTGCTCATGCACGCCGGACATGGTGCAGCGCTATCGCAACAAACTGT
CGGGGCCGTTGCTGGACCGGATCGATTTGCACCTGACGGTGGCGCGGGAGGCGACAGCGCTGAACCCTGCGGTAAAACCG
GGAGAGGACAGCGCCAGCGCGGCCACATTGGTCGCAGAGGCCCGCGAGCGTCAGCAAAAACGCCAGGGCTGCGCCAATGC
GTTTCTCGATTTGCCGGGGCTGAAGAAGCACTGCAAGTTATCCACAGCCGATGAAACCTGGCTGGAGACGGCGTGCGAAA
GACTGACCTTGTCGCTGCGCTCGGCGCATCGGCTGCTCAAGGTCGCGCGGACGTTGGCGGATCTGGAGAAGCGCGCGGAT
ATCAGTCGCGAACATCTGGCCGAAGCTTTGCAGTATCGGCCGGCGACTCAGTAA
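
The FASTA headers in this record pack several fields into one line: a database ID, locus tag, protein accession, coordinates with strand, gene name, and organism. The following is a minimal sketch of pulling those fields apart, assuming every header follows the exact layout seen here; the pattern `HEADER`, the function `parse_header`, and the field names are illustrative and not an official specification of the format.

```python
import re

# Pattern inferred from the single header layout shown in this record.
HEADER = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise if the layout differs."""
    match = HEADER.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    # Convert the numeric fields so they can be used in arithmetic.
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">NTDB_id=627775 LOY31_RS28310 WP_085612066.1 "
    "6255082..6256575(+) (comM) [Pseudomonas sp. B21-021]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```

Headers from other entries may deviate from this layout, so the function raises an error rather than guessing.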


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein       Organism                                         Identities (%)  Coverage (%)  Ha-value
comM          Vibrio cholerae strain A1552                     54.949          99.598        0.547
comM          Vibrio campbellii strain DS40M4                  54.545          99.598        0.543
comM          Glaesserella parasuis strain SC1401              53              100           0.533
comM          Haemophilus influenzae Rd KW20                   53.106          100           0.533
comM          Legionella pneumophila str. Paris                50              100           0.503
comM          Legionella pneumophila strain ERS1305867         50              100           0.503
RA0C_RS07335  Riemerella anatipestifer ATCC 11845 = DSM 15868  45.418          100           0.459