Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   GQS55_RS00965 Genome accession   NZ_CP047130
Coordinates   238206..239729 (+) Length   507 a.a.
NCBI ID   WP_159817099.1    Uniprot ID   -
Organism   Colwellia sp. 20A7     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 233206..244729
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GQS55_RS00950 - 234748..235674 (-) 927 WP_159817093.1 branched-chain amino acid transaminase -
  GQS55_RS00955 ilvM 235789..236046 (-) 258 WP_159817095.1 acetolactate synthase 2 small subunit -
  GQS55_RS00960 ilvG 236048..237712 (-) 1665 WP_159817097.1 acetolactate synthase 2 catalytic subunit -
  GQS55_RS00965 comM 238206..239729 (+) 1524 WP_159817099.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  GQS55_RS00970 ilvY 239816..240709 (-) 894 WP_159817101.1 HTH-type transcriptional activator IlvY -
  GQS55_RS00975 ilvC 240899..242380 (+) 1482 WP_159817103.1 ketol-acid reductoisomerase -
  GQS55_RS00980 - 242535..242852 (+) 318 WP_159817105.1 pyrimidine/purine nucleoside phosphorylase -
  GQS55_RS00985 - 242865..243626 (+) 762 WP_159817107.1 sulfite exporter TauE/SafE family protein -

Sequence


Protein


Download         Length: 507 a.a.        Molecular weight: 55921.11 Da        Isoelectric Point: 6.7473

>NTDB_id=410828 GQS55_RS00965 WP_159817099.1 238206..239729(+) (comM) [Colwellia sp. 20A7]
MSLACVYSRARVGLESPLVTVEVHLANGLPAFHIVGLPEASVKESKDRVRSAIINCGYEFPAKRITINLAPADLPKEGGR
FDLPIAVGILAASEQIPQVDLAQYEFAGELALSGELRAIIGEIPMAMACCQSKRTLIVPRQNSEQASWVKEAKIHAVDHL
SQLYAHFSRQMILPFVEEKELEQFTDINELDISDVIGQPLAKRALEIAASGNHNLLFIGPPGTGKTMLASRLAGILPRMT
EQEALEVAAIQSITNQRINAKSWLTRPFRAPHHTASSAALVGGGGQPQPGEISLAHNGVLFLDELPEFERKVLDVLREPM
ESGEVTISRAMHKQSFPARFQLIAAMNPSPTGFYNDNRSTPEQVLRYLNRLSGPFLDRIDIQIEVARLPRGTWAGDTDMN
ETNDIVQARVQACRMKQLARQNKANAHLGTSELKSYCHLSVENNEFLELAVEKLGLSTRAHHKILKIARTLADMADENDI
THAHITEALSYRAMDRLLRHLTSSVSL

Nucleotide


Download         Length: 1524 bp        

>NTDB_id=410828 GQS55_RS00965 WP_159817099.1 238206..239729(+) (comM) [Colwellia sp. 20A7]
ATGTCACTTGCCTGTGTTTATAGTCGTGCTCGTGTTGGTTTAGAATCACCGCTTGTTACCGTTGAAGTTCATCTTGCGAA
TGGTTTACCTGCTTTTCATATTGTAGGTTTGCCTGAAGCTTCCGTTAAAGAATCAAAAGACCGTGTTCGTAGCGCCATTA
TTAATTGTGGTTATGAATTTCCTGCAAAAAGAATAACCATTAATCTTGCTCCTGCAGATTTGCCAAAAGAAGGGGGGCGT
TTTGATCTACCCATTGCCGTTGGAATATTAGCTGCTTCTGAGCAAATCCCTCAAGTTGACTTAGCACAATATGAATTTGC
AGGTGAATTAGCTTTGTCAGGTGAGCTAAGAGCCATTATCGGTGAAATTCCTATGGCGATGGCTTGTTGTCAAAGCAAAA
GAACCTTAATTGTTCCAAGGCAGAATAGTGAACAAGCGAGCTGGGTGAAAGAAGCCAAAATTCATGCTGTCGATCATTTA
AGTCAACTGTACGCTCACTTTTCAAGACAAATGATATTACCTTTTGTGGAAGAAAAGGAACTCGAACAGTTTACGGATAT
TAATGAACTAGATATTAGTGACGTTATTGGGCAGCCTCTTGCTAAACGTGCACTAGAGATCGCCGCAAGCGGTAATCACA
ATTTGCTTTTTATTGGACCACCCGGCACAGGGAAAACTATGTTAGCGAGTAGACTGGCAGGTATTTTACCTCGCATGACT
GAGCAAGAAGCGTTAGAAGTAGCGGCTATTCAATCAATAACGAATCAAAGGATTAATGCAAAATCATGGTTAACTCGGCC
ATTTCGTGCGCCTCACCATACTGCTTCATCGGCAGCTTTAGTGGGTGGTGGTGGTCAGCCTCAACCAGGAGAGATCTCGT
TAGCACACAATGGCGTTTTGTTTCTTGATGAGCTCCCAGAATTTGAACGTAAGGTACTTGATGTACTTCGTGAACCGATG
GAATCCGGAGAAGTGACTATATCTAGAGCGATGCACAAACAAAGTTTTCCTGCCAGATTCCAGCTTATTGCGGCCATGAA
CCCAAGCCCAACTGGCTTTTACAATGATAATCGTAGCACTCCCGAACAAGTTTTGCGTTATTTGAATCGTTTGTCTGGAC
CTTTTTTAGATCGAATTGATATTCAAATAGAAGTTGCGCGTTTACCTCGAGGCACCTGGGCCGGAGATACTGATATGAAT
GAAACCAATGACATTGTTCAGGCGCGTGTGCAGGCTTGTAGAATGAAACAATTAGCACGGCAGAACAAGGCCAACGCGCA
TTTAGGCACCAGTGAGTTAAAAAGTTATTGTCACTTATCCGTTGAAAATAATGAATTTCTTGAATTAGCAGTCGAAAAGT
TGGGACTTTCTACTAGGGCTCATCATAAGATTTTAAAAATAGCTCGAACCTTGGCAGATATGGCCGATGAGAATGATATT
ACACATGCACATATTACTGAAGCCTTGTCATATCGAGCTATGGATCGTTTATTACGTCATTTAACTAGTTCGGTATCGTT
ATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

57.905

99.803

0.578

  comM Haemophilus influenzae Rd KW20

57.283

100

0.574

  comM Vibrio campbellii strain DS40M4

57.087

100

0.572

  comM Glaesserella parasuis strain SC1401

56.522

99.803

0.564

  comM Legionella pneumophila str. Paris

47.887

98.028

0.469

  comM Legionella pneumophila strain ERS1305867

47.887

98.028

0.469

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.047

99.803

0.46


Multiple sequence alignment