Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   AACL23_RS03145 Genome accession   NZ_OZ026471
Coordinates   661235..662755 (-) Length   506 a.a.
NCBI ID   WP_100102925.1    Uniprot ID   -
Organism   Candidatus Hamiltonella endosymbiont of Tuberolachnus salignus isolate eee46e84-88ba-4997-9627-ba9a3c08c7b3     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 656235..667755
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AACL23_RS03130 - 656737..656811 (+) 75 Protein_606 transposase -
  AACL23_RS03135 - 656964..659882 (-) 2919 WP_174888361.1 hypothetical protein -
  AACL23_RS03140 - 660015..660818 (-) 804 WP_339050072.1 metallophosphoesterase -
  AACL23_RS03145 comM 661235..662755 (-) 1521 WP_100102925.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  AACL23_RS03150 kdsB 663102..663848 (-) 747 WP_100096033.1 3-deoxy-manno-octulosonate cytidylyltransferase -
  AACL23_RS03155 - 663848..664027 (-) 180 WP_012737967.1 Trm112 family protein -
  AACL23_RS03160 - 664301..664510 (-) 210 WP_012737966.1 cold shock domain-containing protein -
  AACL23_RS03165 lpxK 664992..665978 (-) 987 WP_012737964.1 tetraacyldisaccharide 4'-kinase -
  AACL23_RS03170 - 666371..667648 (+) 1278 WP_012737963.1 hemolysin family protein -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 55335.18 Da        Isoelectric Point: 9.6166

>NTDB_id=1163195 AACL23_RS03145 WP_100102925.1 661235..662755(-) (comM) [Candidatus Hamiltonella endosymbiont of Tuberolachnus salignus isolate eee46e84-88ba-4997-9627-ba9a3c08c7b3]
MALAVINTRASLGVQSPAVAVEAHISNGLPCFTLVGLPETAVKEARDRVRSAILNSGFMFPAKRITVSLGPADLPKEGGR
YDLPIALAILIASEQVSGHKSVHYEFLGELALSGALRPINSTIPAALACSKANRKLILPSANSLEMTLIPEGEVMIAKHL
LEVCGFLSGEENLLSVVNNETDLTDTYADLQDVIGQQQSKRALEVAAAGGHNLLLIGPPGTGKTMLASRLIGILPALTNE
EALETAAVNSLLNIEKYPTQWRKRPFRSPHHSASIAALVGGGSLPRPGEISLAHNGVLFLDELPEFERRVLDSLREPLES
GEIIISRAMAKIRFPARVQLIAAMNPSPSGHYKGIHHRTTPQQILRYLSKLSGPFLDRFDLSIEVPLLPPGLLQMQENQG
ESSSTIRTRVLKARERQLNRTKKINAHLNNKEVAKFCHISTKDAQFLEQVLLKLGLSVRAWHRILKVARTLADLAEQENI
KKDHLAEALSYRCIDRLFLKLNKTLN

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=1163195 AACL23_RS03145 WP_100102925.1 661235..662755(-) (comM) [Candidatus Hamiltonella endosymbiont of Tuberolachnus salignus isolate eee46e84-88ba-4997-9627-ba9a3c08c7b3]
ATGGCACTGGCAGTAATCAACACTCGAGCGAGTCTGGGGGTACAGTCTCCTGCAGTTGCGGTTGAAGCGCATATCAGTAA
TGGATTGCCTTGTTTTACCCTGGTGGGCTTACCTGAAACCGCGGTAAAAGAAGCGAGAGATCGCGTACGGAGTGCGATTT
TGAACAGTGGTTTTATGTTTCCAGCCAAGCGTATTACGGTGAGTCTAGGACCTGCAGATCTACCAAAAGAAGGTGGGCGT
TACGATTTACCTATAGCTTTAGCCATATTAATAGCCTCAGAACAAGTGAGTGGTCACAAATCAGTTCATTATGAATTTTT
AGGTGAATTAGCTTTATCTGGTGCATTACGACCTATTAATAGCACAATTCCAGCGGCTCTTGCATGCTCTAAAGCAAATC
GGAAGCTGATATTACCTTCAGCTAATTCTCTAGAAATGACGTTAATCCCTGAAGGGGAGGTTATGATTGCCAAACACTTG
CTAGAGGTGTGTGGGTTTTTAAGTGGAGAAGAAAATTTACTTTCTGTTGTAAATAATGAAACTGATCTAACAGATACGTA
TGCTGATCTTCAAGATGTGATTGGTCAACAGCAGTCAAAACGAGCTCTCGAAGTGGCTGCAGCTGGGGGGCATAATTTAT
TACTAATTGGCCCACCTGGTACTGGCAAAACGATGCTGGCTAGCAGATTGATTGGCATACTACCTGCATTAACAAACGAG
GAAGCATTGGAAACCGCTGCTGTAAACAGTTTATTAAATATTGAAAAATATCCGACTCAATGGCGTAAGCGTCCTTTTCG
ATCCCCTCACCATAGTGCCTCGATTGCTGCTTTAGTGGGCGGGGGATCTTTACCTCGTCCAGGCGAAATATCTTTAGCTC
ATAATGGCGTATTATTTTTAGATGAATTACCAGAATTTGAACGCAGGGTACTAGATTCATTAAGAGAACCACTAGAATCT
GGGGAGATTATCATTTCTCGCGCCATGGCCAAGATTCGTTTTCCTGCAAGGGTACAGTTGATTGCGGCCATGAACCCAAG
CCCAAGTGGGCATTATAAAGGTATTCATCATCGAACAACCCCACAACAAATTTTACGATATCTTTCAAAACTTTCTGGTC
CTTTCCTTGATAGGTTTGATCTATCTATTGAAGTTCCTTTATTACCTCCTGGTTTATTACAAATGCAAGAAAATCAGGGT
GAAAGCAGCAGTACTATTAGAACGCGTGTTCTGAAAGCTCGTGAACGCCAATTAAATAGAACAAAAAAAATCAACGCTCA
TCTCAATAATAAAGAAGTGGCTAAATTTTGTCATATAAGCACTAAGGATGCTCAATTTTTAGAACAAGTGTTGTTAAAAT
TAGGTCTTTCCGTACGTGCTTGGCATCGCATTTTGAAAGTAGCTCGAACACTTGCGGATTTAGCTGAACAAGAAAATATT
AAGAAGGATCACCTTGCTGAAGCGCTAAGTTACCGTTGCATAGATAGATTATTTTTAAAATTAAATAAAACTTTGAATTA
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

61.176

100

0.617

  comM Vibrio campbellii strain DS40M4

60.835

99.407

0.605

  comM Glaesserella parasuis strain SC1401

60.277

100

0.603

  comM Vibrio cholerae strain A1552

60.636

99.407

0.603

  comM Legionella pneumophila str. Paris

50.202

98.024

0.492

  comM Legionella pneumophila strain ERS1305867

50.202

98.024

0.492

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

45.726

99.407

0.455


Multiple sequence alignment