Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   IL_RS01085 Genome accession   NC_006512
Coordinates   235592..237139 (+) Length   515 a.a.
NCBI ID   WP_011233478.1    Uniprot ID   Q5QZ88
Organism   Idiomarina loihiensis L2TR     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 230592..242139
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  IL_RS01060 (IL0210) - 230705..231271 (+) 567 WP_011233473.1 hypothetical protein -
  IL_RS01065 (IL0211) - 231622..232353 (-) 732 WP_011233474.1 DsbA family protein -
  IL_RS01070 (IL0212) - 232383..232907 (-) 525 WP_011233475.1 protein disulfide oxidoreductase -
  IL_RS01075 (IL0213) - 232897..234981 (-) 2085 WP_011233476.1 protein-disulfide reductase DsbD family protein -
  IL_RS01080 (IL0214) - 235031..235360 (-) 330 WP_237699333.1 hypothetical protein -
  IL_RS01085 (IL0215) comM 235592..237139 (+) 1548 WP_011233478.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  IL_RS01090 - 237225..237506 (+) 282 WP_016341255.1 hypothetical protein -
  IL_RS01095 (IL0216) - 237536..237874 (+) 339 WP_011233479.1 hypothetical protein -
  IL_RS01100 (IL0217) ltaE 237991..239013 (-) 1023 WP_011233480.1 low-specificity L-threonine aldolase -
  IL_RS01105 (IL0218) - 239142..240107 (+) 966 WP_011233481.1 PLP-dependent cysteine synthase family protein -
  IL_RS01110 (IL0219) - 240136..241308 (+) 1173 WP_011233482.1 cystathionine gamma-synthase -
  IL_RS01115 - 241523..241753 (+) 231 WP_050731324.1 hypothetical protein -

Sequence


Protein


Download         Length: 515 a.a.        Molecular weight: 56442.08 Da        Isoelectric Point: 8.8070

>NTDB_id=24087 IL_RS01085 WP_011233478.1 235592..237139(+) (comM) [Idiomarina loihiensis L2TR]
MALARVYTRALIGIHAPEIRVEVDLARGMPGFTMVGMPATTVKEARDRVKTALMNAGFEYPPATRITVNLAPADIPKTGA
RYDLAIAVGILAASGQIPDSILEQAEFYGELALTGELRAVNGLIPALLACRNKKRQAFVPIANEQEAGLVREQKSWLSAD
LVSVYEHLHDHQPLRESKPINWAKESSNHDGDLADVVGQEQAKRALLLSAAGKHHLLFVGPPGTGKTMLAQRLNGILPPL
TEQEAMEIAAVKSVSFDYHHAEALTRRELRNPHHTCTAAALVGGGAGNSVRPGEISLANNGLLFLDEMPEFSRHVLDCLR
EPLGSGEITISRAGYNVKFPANFQLVCALNPSPCGQFDGSLANCRSTPDQILKYLSKLSGPLLDRIDIQVMVPRESESMS
LKGATKHKTQPMTSLQAKSLVEKARHRQLMRQGCLNSELKPKVLKQICFLSEQSEAFLAKAAEKMKLSHRAYHRTLRLAR
TIADLADSKKIEQQHLAEALNYRALDRLIEQVQSL

Nucleotide


Download         Length: 1548 bp        

>NTDB_id=24087 IL_RS01085 WP_011233478.1 235592..237139(+) (comM) [Idiomarina loihiensis L2TR]
ATGGCGTTAGCGCGAGTTTATACCCGGGCATTGATAGGTATTCATGCGCCGGAAATTCGGGTTGAAGTGGATCTGGCAAG
GGGAATGCCAGGCTTTACTATGGTCGGCATGCCGGCAACCACGGTTAAGGAGGCTCGTGATCGGGTCAAAACCGCCTTGA
TGAACGCGGGATTTGAATACCCGCCAGCCACGCGTATTACAGTGAATTTGGCGCCAGCGGATATCCCCAAAACCGGGGCG
CGTTATGATTTGGCCATAGCGGTGGGGATATTGGCGGCTTCGGGACAAATACCGGACAGTATTTTAGAGCAAGCAGAATT
CTATGGTGAGCTGGCGTTAACCGGCGAGTTGCGGGCTGTGAATGGTTTAATCCCAGCCTTATTGGCGTGTCGTAATAAAA
AGCGGCAGGCGTTTGTGCCTATTGCTAACGAGCAGGAAGCCGGGTTAGTACGTGAACAAAAGAGCTGGTTGTCGGCCGAT
TTGGTCAGTGTTTATGAACATTTGCACGATCACCAGCCACTGAGGGAATCAAAGCCGATAAATTGGGCAAAGGAATCATC
GAACCATGATGGCGACTTAGCCGACGTGGTTGGGCAGGAGCAGGCAAAACGAGCTTTGTTGTTGTCGGCGGCCGGTAAGC
ACCACTTGTTGTTCGTCGGGCCGCCAGGTACGGGAAAAACCATGCTTGCGCAACGGCTTAATGGCATATTACCTCCACTG
ACCGAGCAGGAAGCCATGGAAATTGCTGCGGTGAAGTCGGTGTCTTTTGATTACCATCATGCGGAGGCGTTAACACGGCG
CGAGCTGCGAAATCCGCACCATACCTGTACAGCCGCTGCGCTTGTGGGAGGTGGAGCGGGTAATTCTGTTCGTCCGGGAG
AAATATCCTTAGCAAACAACGGGTTATTGTTTTTAGATGAGATGCCCGAGTTCAGTCGACATGTATTGGACTGCTTACGT
GAACCACTCGGAAGCGGTGAAATTACCATTAGTCGTGCAGGATACAACGTCAAATTCCCGGCTAACTTTCAGCTGGTTTG
CGCGCTGAACCCTTCGCCCTGCGGTCAATTCGATGGCTCCTTAGCTAACTGCCGTTCTACCCCGGATCAAATCTTAAAAT
ACTTAAGTAAACTCTCAGGACCGCTGTTAGATCGTATTGATATACAGGTGATGGTGCCCCGAGAGAGCGAGTCTATGAGT
TTAAAGGGGGCAACTAAGCACAAGACTCAGCCCATGACCTCATTACAAGCAAAAAGCTTAGTCGAGAAAGCTCGGCATCG
CCAACTAATGCGACAAGGTTGCTTAAACAGCGAGTTAAAACCAAAAGTGTTAAAGCAGATATGTTTTTTGTCTGAACAAA
GCGAAGCTTTTTTGGCAAAAGCGGCCGAGAAAATGAAATTGTCGCATCGGGCCTATCACCGAACACTTCGTTTAGCCCGA
ACCATTGCCGATTTAGCCGACAGCAAAAAAATAGAACAACAGCATCTGGCAGAGGCATTGAATTATCGTGCGTTGGATCG
GCTTATCGAGCAGGTGCAGAGCTTATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q5QZ88

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

50.577

100

0.511

  comM Vibrio cholerae strain A1552

50.194

100

0.503

  comM Vibrio campbellii strain DS40M4

50.292

99.612

0.501

  comM Glaesserella parasuis strain SC1401

49.515

100

0.495

  comM Legionella pneumophila str. Paris

46.918

97.67

0.458

  comM Legionella pneumophila strain ERS1305867

46.918

97.67

0.458

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

42.248

100

0.423


Multiple sequence alignment