Detailed information    

in silico: Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   EL144_RS02405 Genome accession   NZ_LR134327
Coordinates   451700..453235 (+) Length   511 a.a.
NCBI ID   WP_005703474.1    Uniprot ID   A0A336N8Q6
Organism   Aggregatibacter aphrophilus ATCC 33389 strain NCTC 5906     
Function   required for natural transformation (predicted from homology)
Unclear
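The coordinates and lengths in the overview can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the overview: coordinates 451700..453235 (+)
length_bp = gene_length_bp(451700, 453235)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the result agrees with the 1536 bp and 511 a.a. reported for comM in this record.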

Genomic Context


Location: 446700..458235
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EL144_RS02380 (NCTC5906_00483) - 447331..447645 (+) 315 WP_005703479.1 hypothetical protein -
  EL144_RS02385 (NCTC5906_00484) - 447952..448353 (-) 402 WP_005703478.1 pyrimidine dimer DNA glycosylase/endonuclease V -
  EL144_RS02390 (NCTC5906_00485) - 448545..449912 (-) 1368 WP_050332724.1 patatin-like phospholipase family protein -
  EL144_RS02395 (NCTC5906_00486) deoC 450146..450817 (+) 672 WP_005703476.1 deoxyribose-phosphate aldolase -
  EL144_RS02400 (NCTC5906_00487) deoR 450842..451591 (+) 750 WP_005703475.1 DNA-binding transcriptional repressor DeoR -
  EL144_RS02405 (NCTC5906_00488) comM 451700..453235 (+) 1536 WP_005703474.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  EL144_RS02410 (NCTC5906_00489) - 453765..454133 (+) 369 WP_032995007.1 hypothetical protein -
  EL144_RS02415 (NCTC5906_00490) - 454271..454648 (+) 378 WP_225791922.1 hypothetical protein -
  EL144_RS02420 (NCTC5906_00491) - 454711..455091 (+) 381 WP_005701924.1 hypothetical protein -
  EL144_RS02425 (NCTC5906_00492) - 455147..456733 (+) 1587 WP_005703472.1 DUF4384 domain-containing protein -
  EL144_RS02430 (NCTC5906_00493) - 456751..457836 (+) 1086 WP_032995005.1 tetratricopeptide repeat protein -
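
The displayed location (446700..458235) matches the comM coordinates extended by 5000 bp on each side. The sketch below reproduces that window and checks a feature size from the table. The 5000 bp flank is inferred from the numbers above, not stated by the source, and the function names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from the coordinates, not documented


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend a 1-based inclusive interval by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def feature_size(start: int, end: int) -> int:
    """Size in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


# comM coordinates from the table above
window = context_window(451700, 453235)
print(window)

# deoR (EL144_RS02400), the gene immediately upstream of comM
print(feature_size(450842, 451591))
```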

Sequence


Protein


Length: 511 a.a.        Molecular weight: 56063.62 Da        Isoelectric Point: 9.4972

>NTDB_id=1121595 EL144_RS02405 WP_005703474.1 451700..453235(+) (comM) [Aggregatibacter aphrophilus ATCC 33389 strain NCTC 5906]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPNFTLVGLPEKTVKEAQDRVRSALLNAEFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGMLAASGYIDAEKLKQFEFIGELALTGQLRAVHGVIPAILAAKKAKRKCIIAYGNANEASLISDQETYFAHSL
LEVVQFLNNQGELPLAKDIMAQSAVDFGGENQKDLTEIIGQQHAKRALIIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDQEAIETASVASLVQNELNFHNWKQRPFRAPHHSASTPALVGGGTIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLIAAMNPSPTGHYQGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
NTERGEPSAIVREKVLKTRNIQLERAGKINAHLTGKEIERDCKLEGKDALFLESALTKLGLSVRAYHRILKVSRTIADLE
GEKCITQKHLAEALGYRAMDRFLQRLSKESS

Nucleotide


Length: 1536 bp

>NTDB_id=1121595 EL144_RS02405 WP_005703474.1 451700..453235(+) (comM) [Aggregatibacter aphrophilus ATCC 33389 strain NCTC 5906]
ATGTCTTTAGCCATCGTTTACAGTCGTGCTTCAATGGGCGTGCAGGCGCCTTTAGTGACGATTGAAGTGCATTTAAGTAA
CGGCAAGCCTAACTTTACGTTGGTTGGATTGCCGGAAAAAACTGTTAAAGAAGCACAAGATCGAGTTCGTAGTGCATTGC
TGAACGCCGAATTCAAATATCCCGCCAAACGCATTACCGTCAATCTCGCGCCTGCTGATTTACCCAAAGAAGGCGGACGT
TTTGACTTGCCTATCGCTATCGGTATGCTTGCGGCTTCAGGCTATATTGACGCGGAAAAATTAAAACAATTTGAATTTAT
CGGTGAATTGGCATTAACCGGTCAACTCCGCGCGGTACATGGCGTAATTCCTGCTATTTTGGCTGCCAAAAAAGCAAAAC
GAAAATGTATTATCGCTTATGGCAATGCTAATGAAGCCTCATTGATCTCCGATCAAGAAACGTATTTTGCCCATTCATTG
CTTGAAGTTGTGCAATTTCTCAATAATCAAGGGGAACTGCCTTTGGCAAAGGACATAATGGCTCAAAGTGCGGTGGATTT
TGGCGGCGAAAATCAAAAAGATCTGACGGAGATTATCGGTCAACAACACGCCAAGCGGGCGCTGATTATTGCCGCAGCCG
GGCAACATAACTTGTTATTTCTGGGCCCACCCGGCACTGGTAAAACTATGCTTGCCAGCCGTTTAACGGGGCTATTACCG
GAAATGACCGACCAAGAAGCCATCGAAACCGCCTCCGTCGCCAGTCTCGTGCAAAATGAATTGAATTTTCATAACTGGAA
ACAACGCCCTTTCCGTGCCCCACATCACAGCGCGTCCACTCCGGCTTTAGTGGGAGGTGGCACAATTCCAAAACCCGGCG
AAATTTCTCTCGCGCATAATGGTGTGCTTTTTTTAGACGAATTGCCTGAATTTGAACGTAAGGTGCTGGATGCTTTACGC
CAACCGTTAGAAAGTGGAGAAATCATTATTTCGCGTGCCAATGCCAAAATTCAGTTTCCGGCAAGGTTTCAGCTTATCGC
GGCGATGAACCCAAGCCCAACGGGGCATTATCAAGGGACACACAATCGTACGTCGCCGCAGCAAATCATGCGGTATTTAA
ATCGTCTCTCCGGCCCATTTCTGGATCGTTTTGATTTATCCATTGAAGTGCCTTTGTTGCCACAAGGCAGTTTGCAAAAT
AATACGGAGCGTGGCGAACCTAGCGCAATCGTCCGTGAAAAAGTGTTAAAAACCCGCAACATCCAATTAGAACGTGCAGG
CAAAATAAACGCCCACTTAACCGGTAAAGAAATTGAACGCGATTGTAAACTGGAAGGCAAAGACGCGTTATTTCTTGAAA
GTGCTCTGACCAAACTAGGGCTTTCCGTACGGGCTTACCACCGAATCTTAAAAGTGTCACGCACCATTGCTGATTTAGAA
GGTGAAAAATGCATTACCCAAAAACATTTGGCTGAAGCATTAGGCTACCGAGCAATGGATCGGTTTTTACAGAGGCTATC
GAAAGAATCAAGCTAA
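
The FASTA headers above pack several fields into one line. A minimal sketch of how such a header could be split into named fields follows. The regular expression is written against the header shown in this record only; it is an assumption that other records use the same layout.

```python
import re

# Pattern derived from the header in this record; not an official format spec.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>.+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a FASTA header of the form used above into a dict of fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the expected layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=1121595 EL144_RS02405 WP_005703474.1 451700..453235(+) (comM) "
    "[Aggregatibacter aphrophilus ATCC 33389 strain NCTC 5906]"
)
record = parse_header(header)
print(record["locus_tag"], record["gene"], record["start"], record["end"])
```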


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A336N8Q6

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comM   Haemophilus influenzae Rd KW20   86.588   99.217   0.859
  comM   Glaesserella parasuis strain SC1401   78.082   100   0.781
  comM   Vibrio cholerae strain A1552   64.902   99.804   0.648
  comM   Vibrio campbellii strain DS40M4   64.51   99.804   0.644
  comM   Legionella pneumophila str. Paris   50.992   98.63   0.503
  comM   Legionella pneumophila strain ERS1305867   50.992   98.63   0.503
  RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868   46.923   100   0.477


Multiple sequence alignment