Detailed information    

In silico: bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   OCT39_RS16300 Genome accession   NZ_CP107007
Coordinates   3543040..3544551 (+) Length   503 a.a.
NCBI ID   WP_263585485.1    Uniprot ID   -
Organism   Halomonas sp. GD1P12     
Function   required for natural transformation (predicted from homology)
Unclear
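
The coordinates and the two lengths reported above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state the convention) and that the coding sequence ends with a single stop codon; the function names are illustrative only.

```python
def span_length_bp(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates of comM (OCT39_RS16300) as listed in the overview.
cds_bp = span_length_bp(3543040, 3544551)
protein_aa = protein_length_aa(cds_bp)
print(cds_bp, protein_aa)
```

Under these assumptions the span works out to 1512 bp and 503 residues, matching the lengths given in the record.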

Genomic Context


Location: 3538040..3549551
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  OCT39_RS16280 (OCT39_16280) - 3538781..3539713 (+) 933 WP_263585481.1 AraC family transcriptional regulator -
  OCT39_RS16285 (OCT39_16285) - 3539864..3541966 (+) 2103 WP_263585482.1 TonB-dependent siderophore receptor -
  OCT39_RS16290 (OCT39_16290) - 3541954..3542082 (-) 129 WP_263585483.1 hypothetical protein -
  OCT39_RS16295 (OCT39_16295) - 3542081..3542947 (+) 867 WP_263585484.1 ABC transporter substrate-binding protein -
  OCT39_RS16300 (OCT39_16300) comM 3543040..3544551 (+) 1512 WP_263585485.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  OCT39_RS16305 (OCT39_16305) - 3545005..3546300 (-) 1296 WP_263585486.1 hypothetical protein -
  OCT39_RS16310 (OCT39_16310) - 3546798..3547468 (+) 671 Protein_3175 ATP-binding protein -
  OCT39_RS16315 (OCT39_16315) - 3547477..3548307 (-) 831 WP_263585487.1 sulfurtransferase -
  OCT39_RS16320 (OCT39_16320) - 3548384..3549139 (-) 756 WP_263585488.1 glutaredoxin family protein -
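
The Location range above looks like the comM coordinates extended by 5,000 bp on each side (3543040 - 5000 = 3538040 and 3544551 + 5000 = 3549551). The record does not state how the window is chosen, so the flank size in this sketch is an inference from the numbers, and the helper name is hypothetical. The gene coordinates are copied from the table.

```python
def flanking_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a span by `flank` bp on both sides (flank size is an assumption)."""
    return max(1, start - flank), end + flank


# (locus tag, start, end) for the rows of the Genomic Context table.
genes = [
    ("OCT39_RS16280", 3538781, 3539713),
    ("OCT39_RS16285", 3539864, 3541966),
    ("OCT39_RS16290", 3541954, 3542082),
    ("OCT39_RS16295", 3542081, 3542947),
    ("OCT39_RS16300", 3543040, 3544551),
    ("OCT39_RS16305", 3545005, 3546300),
    ("OCT39_RS16310", 3546798, 3547468),
    ("OCT39_RS16315", 3547477, 3548307),
    ("OCT39_RS16320", 3548384, 3549139),
]

window = flanking_window(3543040, 3544551)
# True when every listed gene lies entirely inside the window.
all_inside = all(window[0] <= s and e <= window[1] for _, s, e in genes)
print(window, all_inside)
```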

Sequence


Protein


Length: 503 a.a.        Molecular weight: 53620.70 Da        Isoelectric Point: 7.1099

>NTDB_id=737307 OCT39_RS16300 WP_263585485.1 3543040..3544551(+) (comM) [Halomonas sp. GD1P12]
MTLAIVNTRAGVGLDAPAVQVEVHLANGLPGLTLVGLPEAAVKESRERVRSALVNAGFDFPNTRRITLNLAPADLPKEGG
RFDLPIALGILAASGQLPVEALEGIECAGELALDGKLRPVPGMLPFALATKSAGRRLMVPRACADEAALAGNGLEVLPAE
SLWDVVAHLLDQTRIAPHVLPAPPKAQTPAPDLADVRGQHQARRALEVAAAGGHNLLLAGPPGTGKTMLASRLPGILPPL
NEDDALQVAAVRSVCGLALEADWGQRPFRQPHHSASAAALVGGGSKPKPGEISLAHMGVLFLDELPEFSRSVLEVLRQPL
ETGEIHLSRASHERRYPARFQLVAAMNPCPCGHLGDPRSRCQCTASQIQRYQARLSGPLLDRIDLQVEVPALPPEQLTAQ
TQGESSDAVRERVMAARARQMERGALNSQLSGKALEAACDLNDEERTWLAEVLEKLKLSARAYHRVLRVALTLSDLQGEP
KPGQPHLIEAIGYRQLDRLLGKG

Nucleotide


Length: 1512 bp

>NTDB_id=737307 OCT39_RS16300 WP_263585485.1 3543040..3544551(+) (comM) [Halomonas sp. GD1P12]
ATGACACTTGCCATCGTGAACACGCGTGCCGGCGTAGGGCTCGACGCTCCGGCAGTGCAGGTCGAGGTGCATCTGGCCAA
CGGCCTGCCGGGATTGACGCTGGTCGGCCTGCCGGAAGCCGCCGTGAAGGAGAGCCGCGAGCGGGTGCGCAGTGCGCTGG
TCAACGCAGGCTTTGACTTTCCCAACACCCGGCGCATCACCCTGAACCTGGCCCCGGCGGATCTGCCCAAGGAGGGTGGG
CGTTTCGATCTGCCCATTGCGCTGGGCATTCTCGCCGCCTCCGGCCAGCTGCCGGTGGAGGCGCTTGAAGGCATCGAGTG
CGCGGGCGAGCTGGCCCTCGACGGCAAGCTGCGTCCGGTGCCCGGCATGCTGCCGTTTGCGCTGGCCACCAAAAGCGCCG
GGCGCAGGCTGATGGTGCCCAGGGCTTGCGCGGACGAAGCCGCGCTTGCGGGTAACGGTCTGGAAGTCCTGCCGGCCGAA
TCACTCTGGGACGTCGTGGCCCACCTTCTGGATCAGACCCGCATCGCGCCCCACGTGCTGCCCGCCCCACCGAAAGCGCA
GACCCCCGCGCCAGACCTTGCCGATGTACGCGGCCAGCACCAGGCGCGCCGGGCGCTGGAAGTGGCCGCCGCCGGTGGGC
ATAACCTCTTGCTGGCCGGCCCGCCGGGCACCGGTAAGACCATGCTGGCCAGCCGACTGCCCGGCATTCTGCCGCCGCTT
AACGAGGACGACGCGCTGCAGGTCGCCGCGGTGCGCTCGGTGTGCGGGCTGGCGCTCGAGGCGGACTGGGGCCAGCGCCC
GTTTCGACAACCGCACCACAGCGCCAGCGCCGCCGCGCTCGTTGGCGGCGGCTCGAAACCCAAGCCTGGCGAAATTTCCT
TGGCGCACATGGGCGTACTGTTTCTCGACGAGCTGCCGGAGTTTTCCAGAAGCGTTTTGGAAGTCTTACGCCAGCCTTTG
GAAACCGGCGAAATCCACCTGTCGCGCGCCAGCCACGAGCGCCGCTACCCTGCCCGTTTTCAGCTCGTCGCCGCCATGAA
CCCCTGCCCCTGCGGTCACCTGGGCGACCCGCGAAGCCGCTGCCAGTGTACGGCCAGCCAGATCCAGCGCTATCAGGCGC
GGCTCTCCGGCCCGCTTTTGGATCGTATCGACCTTCAGGTGGAAGTCCCGGCGCTGCCGCCGGAGCAGCTCACCGCCCAG
ACCCAGGGCGAATCGTCCGACGCGGTACGCGAGCGCGTGATGGCCGCCCGAGCGCGCCAGATGGAGCGCGGCGCGCTCAA
CAGCCAGCTAAGCGGCAAGGCGCTGGAAGCCGCCTGCGACTTGAACGACGAGGAGCGCACCTGGCTTGCCGAGGTGCTGG
AAAAGCTCAAGCTCTCGGCGCGGGCGTATCACCGCGTGTTGCGGGTGGCGCTGACGCTTTCTGATCTACAGGGCGAACCC
AAGCCGGGGCAGCCGCACCTGATCGAGGCCATTGGTTACCGGCAGCTGGATCGCCTGCTGGGAAAAGGCTGA
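
Both FASTA headers in this record share one layout: database ID, locus tag, protein ID, coordinates with strand, gene name in parentheses, and organism in square brackets. A minimal sketch for splitting such a header into fields follows. The pattern is inferred from these two headers only, not from any documented format, and the function and field names are hypothetical.

```python
import re

# Pattern inferred from the headers in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    # Convert the numeric fields from text to integers.
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">NTDB_id=737307 OCT39_RS16300 WP_263585485.1 "
    "3543040..3544551(+) (comM) [Halomonas sp. GD1P12]"
)
fields = parse_header(header)
print(fields["gene"], fields["start"], fields["end"], fields["strand"])
```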


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20 54.813 100 0.555
  comM Glaesserella parasuis strain SC1401 54.224 100 0.549
  comM Vibrio campbellii strain DS40M4 53.877 100 0.539
  comM Vibrio cholerae strain A1552 53.663 100 0.539
  comM Legionella pneumophila str. Paris 50.699 99.602 0.505
  comM Legionella pneumophila strain ERS1305867 50.699 99.602 0.505
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 46.215 99.801 0.461