Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   ATE40_RS20070 Genome accession   NZ_CP016948
Coordinates   4286645..4288168 (+) Length   507 a.a.
NCBI ID   WP_063918336.1    Uniprot ID   -
Organism   Serratia surfactantfaciens strain YD25     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4281645..4293168
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ATE40_RS20050 (ATE40_020050) - 4281846..4282772 (-) 927 WP_019455268.1 branched-chain amino acid transaminase -
  ATE40_RS20055 (ATE40_020055) ilvM 4282796..4283053 (-) 258 WP_063918337.1 acetolactate synthase 2 small subunit -
  ATE40_RS20060 (ATE40_020060) ilvG 4283050..4284696 (-) 1647 WP_025159877.1 acetolactate synthase 2 catalytic subunit -
  ATE40_RS24380 ilvL 4284839..4284937 (-) 99 WP_013814970.1 ilv operon leader peptide -
  ATE40_RS20065 (ATE40_020065) - 4285071..4286303 (-) 1233 WP_019455270.1 MFS transporter -
  ATE40_RS20070 (ATE40_020070) comM 4286645..4288168 (+) 1524 WP_063918336.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  ATE40_RS20075 (ATE40_020075) - 4288197..4288535 (-) 339 WP_048232148.1 DUF413 domain-containing protein -
  ATE40_RS20080 (ATE40_020080) hdfR 4288655..4289482 (+) 828 WP_025159876.1 HTH-type transcriptional regulator HdfR -
  ATE40_RS20095 (ATE40_020095) murI 4291940..4292803 (-) 864 WP_025921004.1 glutamate racemase -

Sequence


Protein


Download         Length: 507 a.a.        Molecular weight: 54456.73 Da        Isoelectric Point: 7.6569

>NTDB_id=193937 ATE40_RS20070 WP_063918336.1 4286645..4288168(+) (comM) [Serratia surfactantfaciens strain YD25]
MSLAVIYSRAIIGVQAPSVTVEVHISNGLPGLTLVGLPETTVKEARDRVRSALINNGFTFPARRITVNLAPADLPKEGGR
YDLPIALAILAASEQLPLAPLARYEFLGELALSGSLRAVKGAIPAALAAAEAGRQLILSTDNAAEVGLIAQSHSHTARHL
LEVCAFLLGQGELPVATTPPAADNVCESADLRDIIGQEQAKRALEIAAAGGHNLLLIGPPGTGKTMLASRLTGLLPPLAE
HEALESLAVASLQHHVPAALLSRQRPFRAPHHSASMAALVGGGSLPRPGEISMAHNGVLFLDELPEFERKVLDALREPLE
SGEIVISRANAKVCFPARVQLIAAMNPSPTGHYHGLHNRASPQQVLRYLARLSGPFLDRFDLSIEVPLLPPGTLSQRQRH
GESSEQVRERVLLARARQLERAGKINALLSNREVERDCVLQAADAEFLETTLNALGLSVRAWQRILKVARTLADLAGDAE
LNRRHLSEALGYRSMDRLLLQLHRSLE

Nucleotide


Download         Length: 1524 bp        

>NTDB_id=193937 ATE40_RS20070 WP_063918336.1 4286645..4288168(+) (comM) [Serratia surfactantfaciens strain YD25]
ATGTCACTGGCGGTAATCTATAGCCGCGCCATCATCGGCGTTCAGGCCCCTTCCGTCACGGTAGAGGTGCATATCAGCAA
TGGCCTGCCCGGCCTGACGTTGGTCGGTCTGCCGGAAACCACGGTGAAAGAAGCGCGCGATCGGGTGCGCAGCGCCCTGA
TAAACAACGGTTTCACCTTCCCCGCACGGCGCATCACCGTCAATTTGGCGCCCGCCGATCTACCGAAAGAAGGCGGGCGT
TACGATCTGCCGATAGCGCTGGCGATCCTCGCCGCCTCCGAGCAACTGCCCCTCGCCCCCCTGGCGCGCTATGAGTTTCT
CGGCGAGCTGGCGCTGTCCGGCTCACTGCGCGCGGTCAAGGGCGCTATCCCGGCGGCGCTGGCGGCAGCCGAGGCCGGGC
GACAACTGATCCTCTCAACGGATAACGCCGCCGAGGTCGGCCTGATCGCACAGTCGCACTCCCATACCGCCCGACACCTG
TTGGAAGTCTGCGCCTTTCTGCTCGGCCAGGGCGAACTGCCGGTGGCGACCACACCTCCCGCTGCGGACAACGTCTGCGA
AAGCGCCGACCTACGCGACATCATCGGCCAGGAACAGGCCAAACGGGCACTGGAGATCGCCGCCGCCGGCGGGCATAACC
TGCTGTTGATCGGGCCGCCTGGCACCGGTAAAACCATGCTGGCCAGCCGTCTCACGGGCTTGCTGCCGCCGCTCGCGGAG
CATGAAGCGCTGGAAAGCCTGGCGGTCGCCAGCTTGCAGCATCATGTTCCCGCCGCTCTGCTATCGCGCCAGCGGCCGTT
TCGCGCGCCGCATCACAGCGCCTCGATGGCGGCATTGGTCGGCGGCGGCTCACTGCCGCGCCCGGGCGAGATCTCGATGG
CGCATAACGGCGTGCTGTTTCTGGATGAGCTGCCGGAATTCGAGCGTAAAGTACTGGATGCGCTGCGTGAACCGCTGGAA
TCCGGCGAGATCGTGATTTCACGCGCCAACGCCAAGGTCTGCTTCCCTGCAAGGGTACAATTGATCGCAGCGATGAACCC
CAGCCCGACCGGGCATTACCACGGGTTGCACAATCGCGCCTCACCGCAGCAGGTGCTGCGTTATCTGGCTCGGCTATCGG
GGCCTTTTCTCGACCGTTTCGATCTGTCTATCGAGGTGCCGCTGCTGCCGCCGGGTACGCTCAGTCAGCGGCAAAGGCAT
GGAGAAAGCAGCGAGCAGGTGAGAGAAAGAGTGTTACTGGCGCGCGCCCGGCAGCTCGAACGCGCCGGCAAAATCAACGC
GCTGTTGAGCAACCGCGAAGTGGAACGGGATTGCGTTTTGCAGGCGGCGGACGCCGAGTTTTTGGAGACGACATTGAACG
CGCTGGGGCTATCGGTCCGCGCCTGGCAGCGCATCTTGAAGGTTGCACGCACGCTGGCGGATTTGGCGGGAGATGCTGAG
CTCAACAGGCGCCACCTCAGCGAAGCGCTGGGCTATCGCAGTATGGATCGTCTGCTGTTACAGCTGCATCGGAGTCTGGA
ATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

66.206

99.803

0.661

  comM Glaesserella parasuis strain SC1401

65.422

100

0.657

  comM Vibrio cholerae strain A1552

65.339

99.014

0.647

  comM Vibrio campbellii strain DS40M4

64.143

99.014

0.635

  comM Legionella pneumophila str. Paris

50.602

98.225

0.497

  comM Legionella pneumophila strain ERS1305867

50.602

98.225

0.497

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

45.349

100

0.462


Multiple sequence alignment