Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comM   Type   Machinery gene
Locus tag   GUC32_RS23550 Genome accession   NZ_CP047605
Coordinates   4972204..4973727 (+) Length   507 a.a.
NCBI ID   WP_369435466.1    Uniprot ID   A0AAX1GE54
Organism   Serratia sp. NGAS9     
Function   Required for natural transformation (predicted from homology)
Unclear

Genomic Context


Location: 4967204..4978727
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
GUC32_RS23525 (GUC32_23435)   -   4967405..4968331 (-)   927   WP_016930475.1   branched-chain amino acid transaminase   -
GUC32_RS23530 (GUC32_23440)   ilvM   4968355..4968612 (-)   258   WP_049294986.1   acetolactate synthase 2 small subunit   -
GUC32_RS23535 (GUC32_23445)   ilvG   4968609..4970255 (-)   1647   WP_369435465.1   acetolactate synthase 2 catalytic subunit   -
GUC32_RS23540 (GUC32_23450)   ilvL   4970398..4970496 (-)   99   WP_013814970.1   ilv operon leader peptide   -
GUC32_RS23545 (GUC32_23455)   -   4970630..4971862 (-)   1233   WP_025304644.1   MFS transporter   -
GUC32_RS23550 (GUC32_23460)   comM   4972204..4973727 (+)   1524   WP_369435466.1   YifB family Mg chelatase-like AAA ATPase   Machinery gene
GUC32_RS23555 (GUC32_23465)   -   4973755..4974093 (-)   339   WP_004929946.1   DUF413 domain-containing protein   -
GUC32_RS23560 (GUC32_23470)   hdfR   4974213..4975040 (+)   828   WP_369435467.1   HTH-type transcriptional regulator HdfR   -
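
The coordinates in this record are internally consistent, which a little arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates and assumes that the Location window is the comM gene extended by 5,000 bp on each side. The flank size is inferred from the numbers rather than stated by the record, and the function names are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive interval."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, not counting the terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


def flank_window(start: int, end: int, flank: int = 5000) -> tuple:
    """Gene interval extended by `flank` bp on each side (assumed window rule)."""
    return max(1, start - flank), end + flank


gene_start, gene_end = 4972204, 4973727  # comM coordinates from the record

cds_bp = span_length(gene_start, gene_end)
print(cds_bp)                              # 1524
print(protein_length(cds_bp))              # 507
print(flank_window(gene_start, gene_end))  # (4967204, 4978727)
```

The three printed values agree with the listed gene size (1524 bp), protein length (507 a.a.), and Location window (4967204..4978727).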

Sequence


Protein


Length: 507 a.a.        Molecular weight: 54421.69 Da        Isoelectric Point: 7.1088

>NTDB_id=414549 GUC32_RS23550 WP_369435466.1 4972204..4973727(+) (comM) [Serratia sp. NGAS9]
MSLAVIYSRAIIGVQAPSVTVEVHISNGLPGLTLVGLPETTVKEARDRVRSALINNGFTFPARRITVNLAPADLPKEGGR
YDLPIALAILAASEQLPLAPLARYEFLGELALSGALRAVRGAIPAALAAADAGRQLILSTDNAAEVGLIAQSQSHTAQHL
LEVCAFLLGQGELPVAVTPPAAGNAYENADLRDIIGQEQAKRALEIAAAGGHNLLLIGPPGTGKTMLASRLTGLLPPLTE
PEALESLAIASLQHPLLSALPWRQRPFRAPHHSASMAALVGGGSLPRPGEISMAHNGVLFLDELPEFERKVLDALREPLE
SGEIVISRANAKVCFPARVQLIAAMNPSPTGHYQGLHNRASPQQVLRYLARLSGPFLDRFDLSIEVPLLPPGTLSKRQTQ
GESSELVQERVQLARARQLERAGKVNALLSNREVERDCVLQPADAEFLEATLNALGLSVRAWQRILKVARTLADLAGDTE
LDRRHLSEALGYRSMDRLLLQLHRSLE
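
The FASTA header above packs several fields into one line: record ID, locus tag, protein accession, coordinates with strand, gene name, and organism. A minimal parsing sketch follows. The regular expression and the field names are assumptions based on this single header, not a documented format, so other records may need adjustments.

```python
import re

# Pattern inferred from the one header shown in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line: str) -> dict:
    """Split a header of the form used in this record into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=414549 GUC32_RS23550 WP_369435466.1 "
          "4972204..4973727(+) (comM) [Serratia sp. NGAS9]")
info = parse_header(header)
print(info["gene"], info["strand"], info["end"] - info["start"] + 1)
# comM + 1524
```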

Nucleotide


Length: 1524 bp

>NTDB_id=414549 GUC32_RS23550 WP_369435466.1 4972204..4973727(+) (comM) [Serratia sp. NGAS9]
ATGTCACTGGCGGTAATCTATAGCCGCGCCATCATCGGCGTTCAGGCCCCTTCCGTGACGGTGGAGGTGCATATCAGCAA
TGGCCTGCCCGGCCTGACGCTGGTCGGTCTACCGGAAACCACGGTAAAAGAGGCGCGCGATCGGGTGCGCAGCGCCCTGA
TCAACAACGGTTTCACCTTTCCCGCCCGGCGCATCACCGTCAATTTGGCACCGGCCGATCTGCCGAAAGAAGGCGGGCGT
TACGATCTGCCGATAGCGCTGGCGATCCTCGCCGCCTCCGAGCAACTGCCCCTCGCACCGTTGGCACGCTACGAGTTTCT
TGGCGAGCTCGCGCTGTCCGGCGCACTGCGTGCGGTCAGAGGCGCCATCCCGGCGGCGCTGGCGGCGGCTGACGCCGGGC
GACAATTGATCCTGTCGACGGACAACGCCGCCGAGGTCGGCCTGATCGCACAGTCACAATCCCATACCGCCCAACACCTG
TTGGAGGTCTGTGCTTTTTTACTCGGCCAGGGCGAACTGCCGGTGGCCGTCACACCTCCCGCAGCCGGCAATGCGTACGA
AAACGCCGATCTGCGCGACATCATCGGCCAGGAGCAGGCCAAGCGGGCGCTGGAGATCGCCGCCGCCGGCGGGCATAACC
TGCTGCTGATTGGGCCGCCGGGCACAGGCAAAACCATGCTGGCCAGCCGACTGACGGGCTTACTGCCGCCGCTGACGGAG
CCTGAGGCGCTGGAAAGCCTGGCGATCGCCAGCTTGCAACACCCTCTTCTGAGCGCTCTGCCATGGCGCCAGAGGCCGTT
TCGCGCGCCGCATCACAGCGCATCGATGGCGGCATTGGTCGGCGGCGGCTCACTGCCGCGTCCGGGCGAGATCTCGATGG
CGCATAACGGCGTGCTGTTTCTGGATGAGCTACCGGAATTCGAGCGTAAGGTCCTGGATGCGCTGCGCGAGCCGTTGGAA
TCCGGCGAGATCGTGATTTCACGCGCCAACGCCAAGGTCTGTTTCCCTGCCAGAGTACAGTTGATCGCGGCAATGAACCC
CAGCCCGACAGGGCATTATCAGGGGTTGCACAACCGCGCCTCGCCGCAGCAGGTGTTGCGCTATCTGGCCCGGCTGTCAG
GGCCTTTTCTCGACCGTTTCGATCTGTCTATCGAAGTGCCGCTGTTGCCGCCGGGTACGCTCAGTAAGCGGCAAACGCAG
GGAGAAAGCAGTGAGCTGGTTCAGGAAAGAGTGCAACTGGCACGCGCCCGCCAGCTCGAACGCGCCGGTAAAGTCAATGC
GCTGTTGAGCAACCGCGAAGTAGAACGGGATTGCGTGTTGCAGCCGGCAGACGCCGAGTTTCTGGAAGCGACATTAAACG
CGTTAGGGCTATCGGTACGCGCCTGGCAGCGCATCCTGAAAGTGGCTCGCACGCTGGCGGATTTGGCGGGAGATACTGAA
CTCGACAGGCGCCACCTCAGCGAAGCGCTGGGCTATCGCAGTATGGATCGTCTGTTGTTACAGCTGCATCGCAGTCTGGA
ATGA
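
The nucleotide record encodes the protein listed earlier, and the correspondence can be spot-checked by translating the coding sequence. The sketch below builds the standard codon table (NCBI translation table 11, used for bacteria, has the same codon-to-amino-acid assignments) and translates the first 18 and last 9 nucleotides of the sequence above. The `translate` helper and its `to_stop` parameter are illustrative names, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids.

    Whitespace and line breaks are ignored. Codons containing anything other
    than A, C, G, or T raise KeyError in this minimal version.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon reached
        residues.append(aa)
    return "".join(residues)


print(translate("ATGTCACTGGCGGTAATC"))  # MSLAVI (start of the protein)
print(translate("CTGGAATGA"))           # LE (end of the protein, then TGA stop)
```

Passing the full 1524 bp sequence to the same function should give a 507-residue protein to compare against the protein record.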


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein        Organism                                            Identities (%)   Coverage (%)   Ha-value
comM           Haemophilus influenzae Rd KW20                      66.008           99.803         0.659
comM           Glaesserella parasuis strain SC1401                 65.226           100            0.655
comM           Vibrio cholerae strain A1552                        65.339           99.014         0.647
comM           Vibrio campbellii strain DS40M4                     64.542           99.014         0.639
comM           Legionella pneumophila str. Paris                   51.107           98.028         0.501
comM           Legionella pneumophila strain ERS1305867            51.107           98.028         0.501
RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868     45.455           99.803         0.454


Multiple sequence alignment