Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   DAI21_RS05145 Genome accession   NZ_CP028520
Coordinates   1095040..1096560 (-) Length   506 a.a.
NCBI ID   WP_107702629.1    Uniprot ID   -
Organism   Lelliottia sp. WB101     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 1090040..1101560
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DAI21_RS05135 (DAI21_05135) hdfR 1093730..1094551 (-) 822 WP_100778407.1 HTH-type transcriptional regulator HdfR -
  DAI21_RS05140 (DAI21_05140) - 1094670..1095008 (+) 339 WP_100778408.1 DUF413 domain-containing protein -
  DAI21_RS05145 (DAI21_05145) comM 1095040..1096560 (-) 1521 WP_107702629.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  DAI21_RS05150 (DAI21_05150) ilvL 1096904..1097002 (+) 99 WP_023620831.1 ilv operon leader peptide -
  DAI21_RS23050 ilvX 1097089..1097145 (+) 57 WP_194514210.1 peptide IlvX -
  DAI21_RS05155 (DAI21_05155) ilvG 1097142..1098788 (+) 1647 WP_100778410.1 acetolactate synthase 2 catalytic subunit -
  DAI21_RS05160 (DAI21_05160) ilvM 1098785..1099048 (+) 264 WP_095283916.1 acetolactate synthase 2 small subunit -
  DAI21_RS05165 (DAI21_05165) ilvE 1099067..1099996 (+) 930 WP_100778411.1 branched-chain-amino-acid transaminase -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 55342.34 Da        Isoelectric Point: 7.2759

>NTDB_id=285388 DAI21_RS05145 WP_107702629.1 1095040..1096560(-) (comM) [Lelliottia sp. WB101]
MSLSVVFTRAALGVQAPLISVEVHLSNGLPGLTLVGLPETTVKEARDRVRSAIINSGYTFPAKKITINLAPADLPKEGGR
YDLPIAIALLSASEQLNAPHLNRYEFVGELALTGALRGVPGAISGVLAALQAGRHIIVATENSSEVSLIEEKGCLIAGHL
QEVCAFLEGRHELAEPEENRIGVPEHLEDLSDIIGQEQGKRALEITAAGGHNLLLIGPPGTGKTMLASRLNGLLPPLSNH
EALESAAIISLTNSTSLQKHWRRRPFRSPHHSASLTAMVGGGSIPIPGEISLAHHGILFLDELPEFERRVLDALREPIES
GQIHISRTRAKISYPAQFQLIAAMNPSPSGHYQGNHNRSTPEQTLRYLGRLSGPFLDRFDLSLEIPLPPPGVLSQMSNTG
ENSSTVRERVIRAQERQYLRQNKLNARLDNAEIRQFCCLSEVDSRWLEETLTRFGLSVRAWQRLLKVARTIADLEGSGDI
ARQHLQEALSYRAIDRLLLHLQKMLA

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=285388 DAI21_RS05145 WP_107702629.1 1095040..1096560(-) (comM) [Lelliottia sp. WB101]
ATGTCACTGTCAGTCGTTTTTACTCGCGCAGCGTTGGGCGTACAGGCGCCGCTTATTTCTGTGGAAGTTCATCTGAGCAA
TGGACTGCCCGGCTTAACCCTCGTCGGACTCCCGGAAACCACCGTGAAAGAGGCCCGGGATCGCGTTCGCAGCGCTATCA
TCAATAGCGGTTATACCTTTCCCGCCAAGAAGATAACCATCAACCTGGCCCCTGCCGACCTCCCAAAAGAAGGCGGGCGA
TATGATTTACCTATCGCCATAGCGCTTCTCTCTGCCTCTGAGCAGCTTAACGCTCCCCATTTGAATCGGTATGAGTTTGT
GGGCGAACTGGCGCTTACAGGCGCGTTAAGAGGCGTTCCAGGCGCAATCTCGGGAGTGTTAGCCGCGTTACAGGCCGGAA
GACATATCATCGTGGCTACTGAGAACTCATCAGAAGTCAGCCTCATTGAGGAAAAGGGGTGCCTGATTGCCGGACATTTA
CAGGAGGTGTGTGCATTTCTGGAGGGGCGTCATGAGTTGGCTGAACCTGAAGAGAATCGTATCGGTGTGCCGGAACATCT
GGAGGATCTTAGCGACATCATTGGTCAGGAGCAGGGCAAGCGCGCGCTGGAAATCACTGCGGCTGGCGGCCACAACCTGC
TGCTGATTGGGCCGCCAGGCACAGGGAAAACTATGCTTGCCAGTAGGCTTAACGGTTTGTTGCCCCCACTGAGCAACCAT
GAAGCTCTGGAGAGCGCAGCAATCATTAGTCTGACGAATTCCACATCCCTACAGAAGCACTGGCGCAGAAGGCCGTTTCG
CTCCCCACATCACAGCGCCTCACTCACGGCGATGGTTGGCGGCGGTTCGATCCCAATACCTGGTGAAATTTCCCTCGCAC
ACCACGGGATTCTGTTTCTGGATGAATTACCTGAATTCGAAAGACGTGTTCTGGATGCCCTGCGCGAACCGATCGAGTCC
GGGCAGATTCATATCTCCCGTACCCGGGCCAAGATCAGCTATCCGGCGCAGTTCCAACTCATTGCTGCCATGAATCCCAG
CCCATCAGGCCATTACCAGGGCAACCACAACCGAAGTACACCTGAGCAGACACTTCGTTATCTGGGACGACTGTCAGGCC
CGTTCCTCGACCGCTTCGATCTCTCGCTCGAAATCCCACTGCCGCCACCGGGAGTGCTCAGCCAGATGAGTAACACAGGG
GAGAACAGCAGCACCGTACGGGAACGCGTGATTCGAGCCCAGGAACGGCAGTATTTACGTCAGAACAAGCTCAATGCGCG
TCTCGATAACGCAGAGATCCGTCAGTTTTGTTGTCTGTCCGAAGTTGATTCACGCTGGCTGGAAGAGACGCTGACGCGAT
TTGGCCTCTCAGTACGAGCATGGCAACGCCTGTTGAAGGTTGCGCGTACTATCGCTGATTTGGAGGGGAGTGGTGATATC
GCGCGGCAACATCTGCAGGAAGCGCTAAGTTATCGCGCAATAGACCGTTTGCTGCTGCACCTGCAAAAGATGCTGGCGTA
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

58.708

100

0.593

  comM Glaesserella parasuis strain SC1401

58.777

100

0.589

  comM Vibrio campbellii strain DS40M4

58.73

99.605

0.585

  comM Vibrio cholerae strain A1552

58.02

99.802

0.579

  comM Legionella pneumophila str. Paris

49.497

98.221

0.486

  comM Legionella pneumophila strain ERS1305867

49.497

98.221

0.486

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.615

100

0.439


Multiple sequence alignment