Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   J5X96_RS00670 Genome accession   NZ_CP072548
Coordinates   138029..139564 (-) Length   511 a.a.
NCBI ID   WP_033002516.1    Uniprot ID   -
Organism   Aggregatibacter sp. 2125159857     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 133029..144564
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  J5X96_RS00650 (J5X96_00650) - 133727..135193 (-) 1467 WP_209363640.1 aminoacyl-histidine dipeptidase -
  J5X96_RS00655 (J5X96_00655) gpt 135308..135772 (+) 465 WP_021616013.1 xanthine phosphoribosyltransferase -
  J5X96_RS00660 (J5X96_00660) envC 136025..137191 (+) 1167 WP_245193496.1 murein hydrolase activator EnvC -
  J5X96_RS00665 (J5X96_00665) - 137188..138015 (+) 828 WP_209363642.1 divergent polysaccharide deacetylase family protein -
  J5X96_RS00670 (J5X96_00670) comM 138029..139564 (-) 1536 WP_033002516.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  J5X96_RS00675 (J5X96_00675) yihA 139678..140292 (-) 615 WP_209363644.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  J5X96_RS00680 (J5X96_00680) - 140432..141307 (+) 876 WP_209363645.1 DUF535 family protein -
  J5X96_RS00685 (J5X96_00685) - 141371..142258 (+) 888 WP_209363647.1 RsiV family protein -
  J5X96_RS00690 (J5X96_00690) - 142347..143063 (-) 717 WP_209363648.1 MgtC/SapB family protein -
  J5X96_RS00695 (J5X96_00695) - 143068..143682 (-) 615 WP_245193497.1 pseudouridine synthase -

Sequence


Protein


Download         Length: 511 a.a.        Molecular weight: 56003.50 Da        Isoelectric Point: 9.3332

>NTDB_id=554082 J5X96_RS00670 WP_033002516.1 138029..139564(-) (comM) [Aggregatibacter sp. 2125159857]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPSFTLVGLPEKTVKEAQDRVRSALLNAEFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGMIAAFGYIDPEKLKQFEFIGELALTGQLRAVHGVIPAILAAKQAKRKCIIAQGNANEASLVSEQETYYANSL
LDVVQFLNEQGELPLAGDIKTQSAVDFFPENPKDLTDIIGQQHAKRALMIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDQEAIETAAVASLVQNELNFHNWKQRPFRAPHHSASTPALVGGGSIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLIAAMNPSPTGHYQGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
SGDRGESSAIVREKVLAARAIQLQRAGKINAHLTSKEIERDCKLEEKDALFLENALTKLGLSVRAYHRILKVSRTIADLE
GEKHIHQRHLAEALGYRAMDRLLQKLSKASV

Nucleotide


Download         Length: 1536 bp        

>NTDB_id=554082 J5X96_RS00670 WP_033002516.1 138029..139564(-) (comM) [Aggregatibacter sp. 2125159857]
ATGTCTTTAGCCATCGTTTACAGCCGCGCCTCCATGGGCGTTCAAGCGCCACTGGTTACCATTGAGGTGCATTTGAGCAA
CGGTAAGCCAAGTTTTACGCTGGTTGGGTTACCGGAAAAAACGGTGAAAGAAGCCCAAGATCGTGTCCGCAGTGCGTTGC
TCAATGCTGAATTCAAATATCCTGCCAAGCGCATTACCGTCAACCTCGCCCCGGCAGACTTACCCAAAGAAGGCGGACGT
TTTGATTTGCCTATTGCGATTGGTATGATTGCCGCCTTTGGTTATATCGATCCGGAAAAATTAAAACAGTTTGAATTTAT
CGGCGAACTTGCCCTGACTGGTCAACTGCGTGCCGTGCATGGCGTCATCCCTGCGATTCTGGCGGCGAAACAAGCCAAGC
GAAAATGCATTATTGCACAAGGCAATGCCAACGAAGCGTCACTGGTTTCTGAGCAAGAAACCTATTACGCCAATTCCTTA
TTAGATGTGGTGCAATTTCTTAATGAACAAGGGGAGCTACCTCTTGCCGGCGACATCAAAACTCAAAGTGCGGTGGATTT
TTTCCCTGAAAATCCCAAAGATCTGACAGACATCATCGGACAACAACATGCTAAGCGCGCTTTAATGATTGCCGCAGCAG
GACAGCACAATCTGTTATTTTTAGGTCCGCCCGGTACAGGAAAGACCATGCTTGCCAGTCGCTTAACGGGATTACTGCCG
GAAATGACCGATCAAGAAGCCATTGAAACTGCCGCGGTTGCAAGCCTTGTACAAAATGAACTGAATTTTCACAACTGGAA
ACAGCGTCCTTTTCGTGCGCCTCATCACAGCGCTTCCACACCGGCATTGGTAGGTGGCGGTTCGATTCCAAAACCCGGGG
AAATTTCCCTCGCACACAATGGCGTGTTGTTTTTAGATGAACTGCCAGAATTTGAACGCAAGGTGTTAGACGCCCTACGC
CAGCCGTTAGAAAGTGGTGAAATCATCATTTCTCGCGCCAATGCCAAGATTCAATTTCCGGCAAGATTTCAGCTGATTGC
CGCCATGAATCCCAGCCCGACAGGGCATTATCAAGGTACCCATAACCGCACATCGCCACAACAAATCATGCGCTATTTAA
ATCGCCTATCAGGCCCATTTTTAGATCGTTTCGATTTATCTATTGAGGTGCCATTGCTGCCGCAAGGGAGTCTGCAAAAT
AGTGGTGATCGTGGCGAATCCAGTGCCATAGTGAGAGAGAAAGTCTTAGCCGCCCGTGCCATTCAACTGCAACGTGCCGG
CAAAATTAACGCCCATTTAACCAGCAAAGAAATTGAACGGGATTGCAAATTGGAAGAGAAAGACGCGTTATTTTTAGAAA
ATGCGTTGACCAAACTCGGGCTGTCTGTGCGGGCATACCACCGCATACTGAAAGTTTCCCGCACCATTGCGGATTTAGAA
GGGGAAAAGCACATTCACCAGCGACACTTAGCAGAAGCTCTTGGCTATCGTGCGATGGATAGGCTGTTACAAAAGCTTTC
TAAAGCATCTGTGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

85.996

99.217

0.853

  comM Glaesserella parasuis strain SC1401

77.886

100

0.779

  comM Vibrio cholerae strain A1552

64.355

100

0.654

  comM Vibrio campbellii strain DS40M4

65.166

100

0.652

  comM Legionella pneumophila str. Paris

50

99.413

0.497

  comM Legionella pneumophila strain ERS1305867

50

99.413

0.497

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

47.093

100

0.476