Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   D9980_RS21035 Genome accession   NZ_CP033055
Coordinates   4662050..4663573 (-) Length   507 a.a.
NCBI ID   WP_121608342.1    Uniprot ID   -
Organism   Serratia sp. 3ACOL1     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4657050..4668573
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  D9980_RS21010 (D9980_21010) ytfR 4657209..4658723 (+) 1515 WP_121608340.1 galactofuranose ABC transporter, ATP-binding protein YtfR -
  D9980_RS21015 (D9980_21015) ytfT 4658741..4659742 (+) 1002 WP_037376499.1 galactofuranose ABC transporter, ATP-binding protein YtfT -
  D9980_RS21020 (D9980_21020) yjfF 4659739..4660731 (+) 993 WP_059199402.1 galactofuranose ABC transporter, permease protein YjfF -
  D9980_RS21025 (D9980_21025) hdfR 4660736..4661563 (-) 828 WP_121608341.1 HTH-type transcriptional regulator HdfR -
  D9980_RS21030 (D9980_21030) - 4661683..4662021 (+) 339 WP_021181829.1 DUF413 domain-containing protein -
  D9980_RS21035 (D9980_21035) comM 4662050..4663573 (-) 1524 WP_121608342.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  D9980_RS21040 (D9980_21040) - 4663916..4665148 (+) 1233 WP_121608343.1 MFS transporter -
  D9980_RS21045 (D9980_21045) ilvL 4665282..4665380 (+) 99 WP_071681478.1 ilv operon leader peptide -
  D9980_RS21050 (D9980_21050) ilvG 4665522..4667168 (+) 1647 WP_121608344.1 acetolactate synthase 2 catalytic subunit -
  D9980_RS21055 (D9980_21055) ilvM 4667165..4667422 (+) 258 WP_121608345.1 acetolactate synthase 2 small subunit -
  D9980_RS21060 (D9980_21060) ilvE 4667444..4668370 (+) 927 WP_024482841.1 branched-chain-amino-acid transaminase -

Sequence


Protein


Download         Length: 507 a.a.        Molecular weight: 54704.05 Da        Isoelectric Point: 8.5199

>NTDB_id=320988 D9980_RS21035 WP_121608342.1 4662050..4663573(-) (comM) [Serratia sp. 3ACOL1]
MSLAVIYTRATIGVQAPSVMVEVHISNGLPGLTLVGLPETTVKEARDRVRSALINNGFTFPAKRITVNLAPADLPKEGGR
YDLPIALAILAASEQLPADKLAHYEFLGELALSGALRAVHGAIPAAIAAAEAKRELILATANGKEAGLLPNSTTRVAEHL
LEVCAFLQGKGDLPIAVAPPAAATSTETADLQEVIGQEQAKRALEIAAAGGHNLLLLGPPGTGKTMLASRLNGLLPPLSD
REALESVAVASLLHHPDETLPWRQRPFRAPHHSASMAALVGGGSLPRPGEISMAHNGVLFLDELPEFERRVLDALREPLE
SGEIVISRASAKVRFPARVQLIAAMNPSPTGHYQGIHNRTPPQQILRYLGRLSGPFLDRFDLSIEVPLLPPGVLAKQHHA
GESSQQVRQRVLVARERQLARAGKVNALLSNREVDRDCHLPTAEAEFLEKTLSQLGLSVRAWHRILKVARTLADLTGEQQ
IDKRHLSEALSYRSMDRLLLQLHRSLE

Nucleotide


Download         Length: 1524 bp        

>NTDB_id=320988 D9980_RS21035 WP_121608342.1 4662050..4663573(-) (comM) [Serratia sp. 3ACOL1]
ATGTCACTGGCGGTAATCTATACCCGTGCCACGATCGGCGTGCAGGCGCCTTCGGTCATGGTTGAAGTCCATATCAGCAA
CGGATTACCCGGCCTTACACTGGTCGGCCTGCCGGAAACCACGGTGAAAGAAGCCCGTGACCGGGTGCGCAGCGCACTGA
TCAACAACGGCTTCACCTTTCCGGCCAAACGCATCACCGTCAATCTGGCACCCGCCGATCTGCCTAAGGAAGGGGGGCGT
TATGATTTACCCATTGCGCTGGCCATTCTGGCAGCCTCAGAGCAGTTGCCAGCAGATAAGTTGGCGCACTATGAGTTCCT
GGGTGAATTGGCGCTTTCTGGCGCATTGAGGGCGGTACACGGTGCAATTCCGGCGGCAATAGCGGCAGCCGAGGCCAAGC
GTGAGCTGATCCTCGCAACGGCCAACGGCAAAGAGGCGGGGTTGCTTCCCAACAGCACGACCCGAGTTGCGGAACATTTG
CTGGAGGTTTGCGCTTTTTTGCAGGGAAAGGGGGATCTGCCAATCGCCGTAGCGCCTCCTGCGGCTGCCACGTCAACGGA
AACTGCCGACCTGCAGGAAGTCATTGGCCAAGAGCAGGCAAAGCGAGCGTTGGAAATTGCCGCGGCCGGGGGCCACAACC
TATTGCTGCTTGGCCCCCCGGGCACCGGAAAAACCATGCTGGCCAGCCGCCTCAACGGTTTGCTGCCACCGTTAAGCGAT
CGGGAGGCGCTGGAAAGCGTCGCCGTTGCCAGCTTATTACATCACCCTGATGAAACCCTGCCATGGCGACAGCGGCCGTT
TCGTGCCCCGCATCACAGCGCTTCAATGGCTGCCCTGGTTGGGGGCGGTTCCTTGCCGCGTCCAGGGGAAATCTCCATGG
CGCACAACGGCGTGCTTTTTCTGGATGAGTTACCCGAATTTGAACGGAGAGTGTTAGATGCGTTGCGTGAGCCACTGGAA
TCCGGTGAAATCGTCATCTCACGCGCCAGCGCCAAGGTACGCTTCCCTGCCAGGGTGCAGCTGATTGCGGCGATGAACCC
CAGCCCAACCGGGCACTATCAGGGCATACACAACCGTACGCCACCGCAGCAGATTTTGCGCTACCTCGGCAGACTTTCCG
GCCCCTTCCTCGATCGATTTGATTTATCGATTGAGGTGCCACTGCTGCCACCGGGCGTACTGGCCAAGCAGCACCATGCG
GGGGAAAGCAGCCAACAGGTTCGTCAGCGGGTGCTGGTGGCACGCGAGCGGCAGCTGGCACGGGCTGGTAAGGTGAATGC
GCTGTTGAGTAACCGCGAGGTGGATCGGGATTGCCATTTGCCTACGGCGGAAGCGGAGTTTTTGGAGAAAACGCTGAGCC
AACTGGGATTATCAGTTCGAGCCTGGCATCGCATCCTGAAAGTGGCACGCACGCTGGCCGATTTGACGGGGGAACAACAG
ATTGATAAACGCCATCTCAGCGAAGCGCTCAGCTATCGCAGTATGGATCGCCTGCTGCTGCAGCTACACCGCAGCCTGGA
ATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

66.403

99.803

0.663

  comM Vibrio cholerae strain A1552

66.534

99.014

0.659

  comM Glaesserella parasuis strain SC1401

64.694

100

0.647

  comM Vibrio campbellii strain DS40M4

64.94

99.014

0.643

  comM Legionella pneumophila str. Paris

50.403

97.83

0.493

  comM Legionella pneumophila strain ERS1305867

50.403

97.83

0.493

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

45.882

100

0.462


Multiple sequence alignment