Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   ACGTJS_RS04165 Genome accession   NZ_CP171363
Coordinates   816939..818429 (+) Length   496 a.a.
NCBI ID   WP_410472902.1    Uniprot ID   -
Organism   Faucicola mancuniensis strain GVCNT2     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 811939..823429
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACGTJS_RS04145 (ACGTJS_04145) - 812737..813135 (+) 399 WP_315042407.1 ComEA family DNA-binding protein -
  ACGTJS_RS04150 (ACGTJS_04150) pcnB 813254..815131 (+) 1878 WP_410472899.1 polynucleotide adenylyltransferase PcnB -
  ACGTJS_RS04155 (ACGTJS_04155) folK 815124..815627 (+) 504 WP_410472900.1 2-amino-4-hydroxy-6- hydroxymethyldihydropteridine diphosphokinase -
  ACGTJS_RS04160 (ACGTJS_04160) - 815653..816663 (-) 1011 WP_410472901.1 adenosine kinase -
  ACGTJS_RS04165 (ACGTJS_04165) comM 816939..818429 (+) 1491 WP_410472902.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  ACGTJS_RS04170 (ACGTJS_04170) hemN 818524..819999 (+) 1476 WP_410472903.1 oxygen-independent coproporphyrinogen III oxidase -
  ACGTJS_RS04175 (ACGTJS_04175) - 820078..820278 (+) 201 WP_410472904.1 hypothetical protein -
  ACGTJS_RS04180 (ACGTJS_04180) - 820294..820899 (+) 606 WP_410472905.1 class I SAM-dependent methyltransferase -

Sequence


Protein


Download         Length: 496 a.a.        Molecular weight: 54076.64 Da        Isoelectric Point: 8.0234

>NTDB_id=1060912 ACGTJS_RS04165 WP_410472902.1 816939..818429(+) (comM) [Faucicola mancuniensis strain GVCNT2]
MSFAQIFTRSVVGLNAPSVMVEVHLSQGLPAITIVGLPEASVRESKDRVRSAIINSGFQFPNRRLTINLAPADLPKDGAR
LDLPIAIGILVASGQIDDSKIANFEFIGELALNGDLRPISGVLAVARAIKTSNHTLIAPKDNANEAVKVQGIEVLQAENL
KQVCEHLNNEQSLSHAEFKASYQVSSHKLDLADVKGQHQARRALEIAAAGGHSLLFCGSPGTGKTLMASRLPTILPPLND
NEALEVASIYSIANVPYDFGTRPFRQVHHTTSAVALVGGGSNPKPGEISLANFGVLFLDEMPEFDRKVLEVLRQPIENKE
IIISRANHQTKFPANFQLVGAMNPCPCGYYGDKSGRCSCRPEQIKRYQEKLSGPLLDRIDLHITVPSLPMSDLQSAQSGE
SSASVRERVIKAYERQQNRQAKLNSDLTPNELEQFAKLGEAQAKLLQMAGQRLNLSARSYHRIVRVARTIADLAQSENIE
TAHITEALSYRGNLQN

Nucleotide


Download         Length: 1491 bp        

>NTDB_id=1060912 ACGTJS_RS04165 WP_410472902.1 816939..818429(+) (comM) [Faucicola mancuniensis strain GVCNT2]
ATGTCATTTGCCCAAATTTTTACTCGCTCGGTGGTCGGTTTAAATGCCCCAAGCGTCATGGTAGAAGTCCATCTATCACA
AGGCTTACCAGCCATTACCATTGTCGGCTTGCCAGAGGCGAGCGTGCGAGAAAGCAAAGACAGAGTGCGTTCCGCCATTA
TCAACTCGGGTTTTCAATTTCCCAATCGCCGACTGACGATTAATCTTGCCCCTGCCGACTTGCCAAAAGACGGAGCAAGG
CTGGATTTACCGATTGCGATTGGTATTTTGGTGGCAAGCGGTCAAATTGACGATAGCAAAATTGCCAATTTTGAATTTAT
TGGTGAGCTGGCGTTAAATGGTGATTTGCGTCCGATTTCTGGCGTGTTGGCTGTCGCAAGGGCAATCAAAACCAGCAATC
ATACCCTTATCGCCCCAAAAGACAACGCCAATGAAGCAGTGAAAGTACAAGGCATTGAGGTATTACAAGCGGAAAATTTA
AAACAAGTTTGTGAACATTTAAACAATGAGCAAAGCTTGAGCCATGCCGAATTTAAAGCCAGCTATCAGGTCAGCTCGCA
CAAACTGGACTTAGCCGATGTCAAAGGGCAACACCAAGCCCGCAGAGCGTTGGAGATTGCGGCAGCAGGCGGACATTCGC
TGTTGTTTTGTGGTTCGCCAGGGACTGGCAAAACCTTGATGGCATCACGCCTGCCGACTATTTTACCCCCATTAAATGAC
AACGAAGCCTTAGAAGTGGCGAGCATTTATTCGATTGCCAATGTGCCGTATGACTTTGGCACTCGCCCCTTTCGCCAAGT
TCATCATACCACATCAGCGGTGGCGTTGGTCGGTGGTGGCTCAAATCCCAAACCGGGGGAAATTTCGTTGGCAAATTTTG
GTGTGTTATTTTTAGACGAAATGCCAGAATTTGACCGAAAAGTGCTTGAAGTATTGCGGCAACCGATTGAAAACAAAGAA
ATCATTATCAGCCGTGCCAATCACCAAACAAAATTTCCAGCCAATTTTCAACTGGTCGGTGCGATGAACCCATGTCCATG
CGGTTATTATGGTGATAAGTCTGGGCGGTGCAGTTGCCGACCCGAACAAATCAAACGCTATCAAGAAAAACTCTCAGGAC
CATTGCTTGACCGCATTGACCTACACATTACCGTGCCAAGTTTGCCCATGTCCGATTTGCAATCAGCTCAAAGTGGCGAA
AGCTCGGCAAGCGTGCGTGAGCGTGTTATCAAAGCCTATGAGCGACAGCAAAACCGTCAAGCCAAACTTAATAGTGATTT
AACCCCAAATGAACTTGAACAATTTGCCAAACTTGGCGAAGCCCAAGCTAAACTGTTACAAATGGCAGGGCAACGCCTAA
ATCTATCGGCTCGTTCGTATCATCGCATTGTGCGTGTGGCTCGCACCATTGCTGATTTAGCACAAAGCGAAAATATTGAA
ACCGCTCACATTACCGAAGCGTTAAGCTATCGTGGCAATTTACAAAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

52.6

100

0.53

  comM Vibrio cholerae strain A1552

53.024

100

0.53

  comM Glaesserella parasuis strain SC1401

51.896

100

0.524

  comM Vibrio campbellii strain DS40M4

52.525

99.798

0.524

  comM Legionella pneumophila str. Paris

50.2

100

0.506

  comM Legionella pneumophila strain ERS1305867

50.2

100

0.506

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.351

100

0.474


Multiple sequence alignment