Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   GQR89_RS20935 Genome accession   NZ_CP047024
Coordinates   4899556..4901070 (+) Length   504 a.a.
NCBI ID   WP_158772020.1    Uniprot ID   -
Organism   Paraglaciecola sp. L1A13     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4894556..4906070
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GQR89_RS20920 ilvA 4894925..4896481 (-) 1557 WP_158772017.1 threonine ammonia-lyase, biosynthetic -
  GQR89_RS20925 ilvD 4896482..4898329 (-) 1848 WP_158772018.1 dihydroxy-acid dehydratase -
  GQR89_RS20930 - 4898833..4899453 (+) 621 WP_158772019.1 trimeric intracellular cation channel family protein -
  GQR89_RS20935 comM 4899556..4901070 (+) 1515 WP_158772020.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  GQR89_RS20940 ilvY 4901082..4901936 (-) 855 WP_158772021.1 HTH-type transcriptional activator IlvY -
  GQR89_RS20945 ilvC 4902080..4903564 (+) 1485 WP_158772022.1 ketol-acid reductoisomerase -
  GQR89_RS20950 - 4904306..4904617 (+) 312 WP_158772351.1 hypothetical protein -
  GQR89_RS20955 - 4904738..4905385 (-) 648 WP_158772023.1 OmpA family protein -

Sequence


Protein


Download         Length: 504 a.a.        Molecular weight: 54413.41 Da        Isoelectric Point: 8.4860

>NTDB_id=409952 GQR89_RS20935 WP_158772020.1 4899556..4901070(+) (comM) [Paraglaciecola sp. L1A13]
MSLAVVFSRASVGIDAPLITVEVHLANGLPCFNLVGLPEASVREAKDRVRSALINSGFEFPARRITVNLAPADLPKEGGR
FDLAIAIGIIAASNQLAGASLNGIELVGELALSGEIRSIKGALPFTYACYKAGRIAILPLDNANEAALISGAKIVPASQL
LDVFHHLGKQKSLALFTSDNIAKEAEYEVDLQDVVGQISAKRALEIAAAGGHNILFTGPPGTGKTMLASRLITILPPMTD
EEALASAAIHSIVGKPVNPETWKQRAFRHPHHTSSAVALVGGGSVPRPGEISLAHHGVLFLDELPEFDRKVLDVLREPLE
SGFVSISRAARQAQFPAQFQLVAAMNPSPTGSLNDGRCTADQILRYLNRISGPFLDRIDLQVDVPKLNSNEFSEQVKERG
QSSNEIRQRVIQARNVALARSNKPNTLLGSKEVQKHCTLSPEDQHFLQGAVEKLGLSLRTYHRVLKVSRTIADLAGEPNI
KRQHLAEALNYRAFDRMLAQLAYN

Nucleotide


Download         Length: 1515 bp        

>NTDB_id=409952 GQR89_RS20935 WP_158772020.1 4899556..4901070(+) (comM) [Paraglaciecola sp. L1A13]
ATGTCGTTGGCAGTGGTGTTTTCAAGAGCCAGTGTTGGTATAGATGCGCCACTTATAACGGTAGAAGTGCATCTGGCCAA
TGGTTTACCGTGTTTTAACTTAGTTGGCTTACCCGAAGCGTCGGTGCGAGAAGCGAAAGACAGAGTCCGCAGCGCATTAA
TTAATTCTGGATTTGAATTCCCCGCTCGGCGTATTACGGTCAACCTCGCACCTGCCGATTTACCCAAAGAAGGTGGACGT
TTCGACTTAGCCATCGCCATCGGTATTATCGCTGCAAGCAATCAGCTCGCTGGCGCAAGCCTAAACGGTATAGAGCTGGT
GGGAGAGCTTGCCTTATCTGGCGAAATACGCTCCATCAAAGGTGCTCTGCCTTTCACTTATGCCTGTTACAAAGCCGGGC
GCATTGCAATTTTGCCCCTTGATAACGCGAATGAAGCAGCCCTAATCAGTGGCGCAAAGATCGTTCCTGCTAGTCAGTTA
CTTGATGTATTTCATCATCTTGGCAAACAAAAAAGCCTTGCCTTATTCACCTCGGACAATATTGCCAAAGAAGCCGAGTA
TGAAGTTGATTTACAAGATGTTGTGGGCCAAATATCGGCAAAACGTGCGCTCGAAATTGCGGCAGCTGGGGGCCACAATA
TACTTTTCACCGGACCTCCAGGTACAGGCAAAACAATGCTTGCTAGTCGCTTAATCACCATATTACCGCCCATGACCGAT
GAAGAAGCACTCGCAAGTGCCGCTATTCACTCGATTGTTGGAAAACCTGTTAATCCGGAAACATGGAAGCAGCGAGCGTT
TCGCCATCCACACCACACCAGCTCAGCAGTGGCACTCGTCGGAGGTGGGAGTGTGCCAAGACCCGGAGAAATATCATTAG
CTCATCACGGAGTTTTATTTTTAGATGAGCTACCCGAATTCGACCGCAAAGTACTCGATGTTCTACGAGAACCTTTAGAA
TCTGGCTTCGTATCTATATCTAGAGCCGCGCGACAGGCGCAGTTTCCTGCACAATTTCAATTAGTCGCAGCAATGAACCC
CAGTCCCACTGGCAGCCTCAATGATGGGCGTTGTACAGCAGATCAGATACTGCGTTATTTGAATCGCATTTCGGGACCTT
TTCTGGACCGTATTGATTTACAAGTAGATGTGCCTAAACTAAATAGCAACGAATTTTCTGAGCAAGTGAAAGAACGAGGA
CAAAGCAGTAATGAGATACGCCAACGAGTTATCCAAGCGCGTAATGTCGCCCTCGCCCGCAGTAATAAACCCAATACTCT
ATTGGGCAGCAAAGAAGTGCAAAAACACTGTACTCTCTCGCCCGAGGATCAACATTTTTTACAAGGTGCGGTTGAAAAAC
TAGGATTGTCTTTACGTACATATCATCGAGTGCTGAAAGTGTCACGCACCATAGCTGATTTAGCAGGCGAGCCTAATATA
AAGCGTCAACACTTAGCCGAAGCTCTTAATTACCGTGCTTTTGATCGAATGCTCGCACAGCTTGCCTATAATTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

57.255

100

0.579

  comM Glaesserella parasuis strain SC1401

57.115

100

0.573

  comM Vibrio campbellii strain DS40M4

57.455

99.802

0.573

  comM Vibrio cholerae strain A1552

57.341

100

0.573

  comM Legionella pneumophila str. Paris

50

98.413

0.492

  comM Legionella pneumophila strain ERS1305867

50

98.413

0.492

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

45.669

100

0.46


Multiple sequence alignment