Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   IMCC21906_RS00970 Genome accession   NZ_CP011477
Coordinates   220622..222112 (+) Length   496 a.a.
NCBI ID   WP_047010601.1    Uniprot ID   A0A0F7M0N2
Organism   Spongiibacter sp. IMCC21906     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 215622..227112
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  IMCC21906_RS00945 (IMCC21906_00193) ureC 216325..218028 (-) 1704 WP_047010597.1 urease subunit alpha -
  IMCC21906_RS00950 (IMCC21906_00194) - 218025..218354 (-) 330 WP_047010598.1 urease subunit beta -
  IMCC21906_RS00955 (IMCC21906_00195) - 218370..218672 (-) 303 WP_047010599.1 urease subunit gamma -
  IMCC21906_RS00960 (IMCC21906_00196) - 219207..220103 (-) 897 WP_052763290.1 urease accessory protein UreD -
  IMCC21906_RS00965 (IMCC21906_00197) - 220298..220567 (+) 270 WP_047010600.1 accessory factor UbiK family protein -
  IMCC21906_RS00970 (IMCC21906_00198) comM 220622..222112 (+) 1491 WP_047010601.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  IMCC21906_RS00975 (IMCC21906_00199) - 222281..222733 (+) 453 WP_047010602.1 transposase -
  IMCC21906_RS00980 (IMCC21906_00200) rep 223149..225158 (+) 2010 WP_047010603.1 DNA helicase Rep -
  IMCC21906_RS00985 (IMCC21906_00201) - 225188..226084 (-) 897 WP_047010604.1 trypsin-like peptidase domain-containing protein -
  IMCC21906_RS00990 (IMCC21906_00202) - 226213..226605 (-) 393 WP_047013019.1 cytochrome c5 family protein -

Sequence


Protein


Download         Length: 496 a.a.        Molecular weight: 53023.13 Da        Isoelectric Point: 7.9415

>NTDB_id=146155 IMCC21906_RS00970 WP_047010601.1 220622..222112(+) (comM) [Spongiibacter sp. IMCC21906]
MPLATVFSRAKLGVKAPLVTVEVHLSSGLPAFHIVGLPEATVRESRDRVRSALINSRFDFPVSRITVNLAPADLPKEGGR
FDLPIAIGILLASAQIPVKAVDGYEFIGELGLSGELRAVDAALPAAVQTHHVNKVLILPASNAGLAAIVDSQHLREAQSL
LDVVAHLQGQSSLPFGSDAPPGTGTTKTIDYPDLSDVLGQQTAKRALEIAAAGGHSLLLSGPPGTGKTLLANRLAGILPP
LNEQEERDIAIVQSIAGKPLNLHRPFRAPHHTASAVALVGGGSQPRPGEISLAHGGILFLDELPEYPRKVLEVLREPLES
GEVMISRAAQQLTFPARFQLVAAMNPCPCGYDGDPTQDCRCTPDQIQRYRQKISGPLLDRIDLRITVPRLPPGVLQSLSA
GESSAMVRERVCAARALQLIRSGDCNANLSATEVRQHCVLGQQCKELMVKAEQRLGLSARAHHRIIKIARTIADLASEPK
ITLPHLQEAIAYRGDF

Nucleotide


Download         Length: 1491 bp        

>NTDB_id=146155 IMCC21906_RS00970 WP_047010601.1 220622..222112(+) (comM) [Spongiibacter sp. IMCC21906]
ATGCCCCTCGCCACTGTTTTTAGCCGAGCCAAGCTCGGCGTTAAAGCGCCATTGGTTACGGTAGAAGTCCACCTCTCATC
GGGCTTACCGGCCTTCCATATTGTCGGCTTGCCAGAGGCCACGGTCAGAGAAAGCCGCGACCGGGTGCGCTCCGCACTCA
TTAACTCCCGTTTTGACTTTCCCGTCAGCCGCATCACCGTCAACCTGGCCCCAGCAGACCTGCCCAAAGAAGGTGGCCGC
TTTGACCTGCCCATTGCCATTGGCATTTTGCTGGCGTCAGCTCAAATTCCCGTCAAGGCTGTTGATGGTTACGAATTTAT
TGGTGAACTGGGACTCAGTGGCGAGTTGCGCGCCGTAGATGCCGCGCTACCCGCCGCCGTGCAGACCCACCACGTTAACA
AGGTGCTCATCCTCCCCGCCAGCAACGCAGGCTTGGCCGCCATTGTCGATAGCCAGCACCTCCGAGAAGCTCAATCCCTA
CTTGACGTTGTCGCACACCTACAAGGACAGAGCAGTCTGCCCTTTGGCTCCGATGCGCCTCCCGGCACAGGCACAACCAA
AACCATCGACTACCCCGATCTTAGCGATGTATTGGGCCAGCAAACCGCCAAACGCGCACTGGAAATTGCCGCCGCTGGAG
GCCACAGCCTGTTGCTAAGCGGCCCTCCCGGAACCGGCAAAACGCTGCTGGCCAATCGACTGGCCGGTATTTTGCCGCCC
TTAAATGAGCAGGAAGAACGGGATATCGCTATTGTGCAGTCCATTGCAGGCAAGCCATTAAACCTACACCGCCCCTTTCG
GGCGCCCCACCACACCGCCTCTGCGGTAGCGTTGGTTGGCGGCGGCAGTCAGCCGCGACCCGGCGAAATTAGCTTGGCGC
ACGGCGGCATTTTATTTTTAGATGAGCTGCCCGAATACCCACGCAAAGTACTAGAAGTGCTACGAGAACCTTTGGAGTCT
GGCGAGGTGATGATTTCCCGCGCTGCGCAACAGCTGACGTTTCCTGCAAGATTCCAACTCGTCGCAGCGATGAACCCCTG
CCCCTGTGGCTACGACGGTGACCCCACTCAAGACTGCCGCTGTACCCCGGACCAAATACAACGGTATCGGCAAAAAATCT
CCGGGCCTTTACTAGACCGTATCGATCTGCGCATCACCGTGCCTCGACTACCACCAGGCGTATTGCAGTCCCTCTCAGCT
GGCGAAAGCAGCGCCATGGTGCGAGAGCGGGTTTGCGCCGCAAGAGCACTACAGCTAATACGCAGCGGCGACTGCAATGC
CAATTTAAGCGCAACCGAAGTGCGTCAGCACTGCGTACTTGGCCAGCAATGCAAAGAGCTAATGGTAAAAGCTGAGCAGC
GTTTGGGCTTAAGTGCCCGGGCCCACCATCGCATTATCAAAATTGCCCGAACCATTGCCGACCTCGCCAGTGAACCGAAA
ATCACGCTGCCCCACCTACAAGAGGCCATTGCTTATAGGGGCGACTTCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0F7M0N2

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

56.914

100

0.573

  comM Vibrio campbellii strain DS40M4

54.91

100

0.552

  comM Haemophilus influenzae Rd KW20

54.8

100

0.552

  comM Glaesserella parasuis strain SC1401

52.6

100

0.53

  comM Legionella pneumophila str. Paris

50.199

100

0.508

  comM Legionella pneumophila strain ERS1305867

50.199

100

0.508

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.576

100

0.456


Multiple sequence alignment