Detailed information    

In silico: bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   PATL_RS21810 Genome accession   NC_008228
Coordinates   5118183..5119697 (+) Length   504 a.a.
NCBI ID   WP_041714661.1    Uniprot ID   -
Organism   Paraglaciecola sp. T6c     
Function   required for natural transformation (predicted from homology)
Unclear

Genomic Context


Location: 5113183..5124697
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  PATL_RS21795 (Patl_4249) ilvA 5113618..5115168 (-) 1551 WP_011576934.1 threonine ammonia-lyase, biosynthetic -
  PATL_RS21800 (Patl_4250) ilvD 5115168..5117015 (-) 1848 WP_011576935.1 dihydroxy-acid dehydratase -
  PATL_RS21805 (Patl_4251) - 5117465..5118085 (+) 621 WP_011576936.1 trimeric intracellular cation channel family protein -
  PATL_RS21810 (Patl_4252) comM 5118183..5119697 (+) 1515 WP_041714661.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  PATL_RS21815 (Patl_4253) - 5119915..5121051 (+) 1137 WP_301547046.1 PepSY-associated TM helix domain-containing protein -
  PATL_RS21820 (Patl_4254) ilvY 5121048..5121902 (-) 855 WP_006994967.1 HTH-type transcriptional activator IlvY -
  PATL_RS21825 (Patl_4255) ilvC 5122047..5123531 (+) 1485 WP_011576939.1 ketol-acid reductoisomerase -
  PATL_RS21830 (Patl_4256) - 5124108..5124419 (+) 312 WP_041714151.1 hypothetical protein -
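
The sizes in the table follow from the coordinates, and the protein length follows from the CDS size. A minimal Python sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of the database:

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# comM (PATL_RS21810), coordinates copied from the record.
comm_bp = span_length(5118183, 5119697)
print(comm_bp, protein_length(comm_bp))

# Distance from each edge of the displayed context window to the gene.
print(5118183 - 5113183, 5124697 - 5119697)
```

For comM this gives 1515 bp and 504 residues, matching the record, and shows that the context window extends 5000 bp on either side of the gene.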

Sequence


Protein


Length: 504 a.a.        Molecular weight: 54662.71 Da        Isoelectric Point: 9.0790

>NTDB_id=26057 PATL_RS21810 WP_041714661.1 5118183..5119697(+) (comM) [Paraglaciecola sp. T6c]
MSLAVVFSRASVGIDAPLITVEVHLANGLPCFNLVGLPEASVREAKDRVRSALINSGFEFPARRITVNLAPADLPKEGGR
FDLAIAIGIIAASNQLKGASLEGIELVGELALSGEIRSIKGALPFTYACFKEGRTAILPAKNANEAALISGAKIVPAYQL
LDVFHHLGKQKTLPLFTSDNILQEAEYNVDLQDVVGQSSAKRALEIAAVGGHNLLFTGPPGTGKTMLASRLITILPPMTD
EEALASAAIHSIVGKPVNPQTWKQRAFRHPHHTSSAVALVGGGSVPRPGEISLAHHGVLFLDELPEFDRKVLDVLREPLE
SGSVSISRAARQAQFPAQFQLVAAMNPSPTGSLNDGRCTADQILRYLNRISGPFLDRIDLQVDVPKLNGNEFSEQVKTRG
CSSKETRERVVFARNIALSRSNKPNTMLGSKEVQEYCQLSSNDQRFLQVAVEKLGLSLRTYHRVLKVSRTIADLANQPNI
TRQHLAEALNYRAFDRMLAQLAYN

Nucleotide


Length: 1515 bp

>NTDB_id=26057 PATL_RS21810 WP_041714661.1 5118183..5119697(+) (comM) [Paraglaciecola sp. T6c]
ATGTCATTAGCGGTAGTATTTTCAAGAGCGAGTGTCGGCATTGATGCGCCGCTTATCACTGTAGAAGTGCACTTAGCCAA
CGGGCTGCCTTGTTTTAACTTAGTTGGGTTGCCTGAAGCATCGGTGCGCGAAGCCAAAGATCGCGTGCGTAGCGCCCTGA
TAAATTCAGGCTTTGAATTTCCTGCTCGTCGTATAACCGTTAACTTGGCCCCCGCAGACTTACCGAAAGAAGGCGGTCGA
TTCGATCTTGCCATCGCTATCGGTATTATCGCTGCTAGTAATCAACTAAAAGGCGCAAGCCTTGAGGGCATTGAGTTAGT
AGGGGAACTTGCACTTTCCGGTGAAATACGCTCCATCAAAGGCGCGCTGCCTTTTACTTATGCATGTTTTAAGGAGGGTC
GTACCGCTATATTACCCGCTAAAAATGCGAATGAAGCTGCTCTTATCAGCGGAGCGAAGATAGTCCCTGCATATCAATTA
CTTGATGTGTTTCACCATTTAGGCAAGCAAAAAACTCTCCCCTTATTTACATCTGACAACATCTTGCAAGAAGCCGAATA
TAACGTGGATTTACAAGATGTGGTCGGGCAAAGTTCAGCAAAACGTGCCCTAGAAATAGCGGCAGTCGGCGGACACAACT
TGTTATTCACTGGCCCACCAGGCACAGGTAAAACGATGTTGGCCAGTCGTTTGATCACTATTTTACCGCCCATGACCGAT
GAAGAAGCCCTCGCCAGCGCCGCCATTCATTCGATCGTAGGTAAACCGGTTAACCCACAAACGTGGAAACAGCGCGCTTT
TCGCCATCCCCACCACACGAGTTCGGCCGTTGCGTTAGTCGGTGGTGGCAGTGTGCCAAGGCCCGGGGAAATATCACTCG
CTCATCACGGCGTGTTGTTTTTAGATGAATTACCAGAATTTGATCGCAAAGTGCTTGATGTACTTCGAGAGCCGCTGGAG
TCAGGCTCGGTGTCTATTTCACGTGCCGCTAGACAAGCGCAATTTCCGGCTCAGTTTCAGTTAGTCGCAGCGATGAACCC
GAGCCCAACGGGCAGCCTTAACGACGGGCGCTGTACTGCTGACCAAATATTGCGTTATTTGAATCGTATTTCGGGACCAT
TTCTAGATCGCATCGATTTACAAGTTGATGTGCCTAAGCTCAACGGCAATGAGTTCTCTGAGCAAGTTAAGACGCGAGGA
TGTAGCAGTAAAGAAACCAGAGAGCGAGTCGTATTTGCGCGTAATATTGCCCTTTCGCGCAGCAACAAACCCAACACTAT
GCTTGGTAGCAAAGAGGTGCAAGAGTACTGTCAGCTTTCAAGTAACGATCAACGTTTTTTGCAAGTTGCCGTTGAAAAAT
TAGGCTTGTCACTGCGTACCTACCATAGAGTATTAAAGGTATCTCGCACTATCGCAGACCTTGCTAATCAGCCTAACATA
ACCCGCCAACATCTGGCCGAAGCCCTTAATTACCGAGCCTTTGATCGTATGTTGGCGCAGCTTGCTTACAATTAA
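
The nucleotide and protein records can be cross-checked by translating one into the other. The sketch below is a minimal example, assuming the standard genetic code applies to every codon in this CDS; the names `CODON_TABLE` and `translate` are illustrative, and only the first line of each sequence is pasted in to keep the example short:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; a trailing partial codon is ignored.

    Stop codons are rendered as '*'. Ambiguous bases (e.g. N) raise KeyError.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the nucleotide record and the start of the protein record.
first_line = (
    "ATGTCATTAGCGGTAGTATTTTCAAGAGCGAGTGTCGGCATTGATGCGCCGCTTATCACTGTAGAAGTGCACTTAGCCAA"
)
protein_start = "MSLAVVFSRASVGIDAPLITVEVHLANGLPCF"

translated = translate(first_line)
print(translated)
print(protein_start.startswith(translated))
```

Running the same function over the full 1515 bp sequence should yield the 504-residue protein followed by a single `*` for the terminal TAA codon.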


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comM   Haemophilus influenzae Rd KW20   57.451   100   0.581
  comM   Glaesserella parasuis strain SC1401   56.719   100   0.569
  comM   Vibrio cholerae strain A1552   56.944   100   0.569
  comM   Vibrio campbellii strain DS40M4   56.66   99.802   0.565
  comM   Legionella pneumophila str. Paris   49.597   98.413   0.488
  comM   Legionella pneumophila strain ERS1305867   49.597   98.413   0.488
  RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868   45.276   100   0.456


Multiple sequence alignment