Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   V6667_RS02335 Genome accession   NZ_CP145811
Coordinates   506118..507608 (+) Length   496 a.a.
NCBI ID   WP_338809480.1    Uniprot ID   -
Organism   Neisseria leonii strain CCUG 45853     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 501118..512608
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  V6667_RS02315 (V6667_02315) waaF 501565..502578 (-) 1014 WP_274584556.1 lipopolysaccharide heptosyltransferase II -
  V6667_RS02320 (V6667_02320) - 502695..504671 (-) 1977 WP_274584555.1 choline BCCT transporter BetT -
  V6667_RS02325 (V6667_02325) ung 505023..505694 (+) 672 WP_338809479.1 uracil-DNA glycosylase -
  V6667_RS02330 (V6667_02330) - 505794..506072 (+) 279 WP_274584553.1 accessory factor UbiK family protein -
  V6667_RS02335 (V6667_02335) comM 506118..507608 (+) 1491 WP_338809480.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  V6667_RS02340 (V6667_02340) - 507734..508513 (+) 780 WP_338809481.1 SPOR domain-containing protein -
  V6667_RS02345 (V6667_02345) - 508530..509222 (+) 693 WP_338809482.1 thiol:disulfide interchange protein DsbA/DsbL -
  V6667_RS02355 (V6667_02355) - 510282..510785 (+) 504 WP_338809483.1 DUF1841 family protein -
  V6667_RS02360 (V6667_02360) hisD 510942..512246 (-) 1305 WP_338809485.1 histidinol dehydrogenase -

Sequence


Protein


Download         Length: 496 a.a.        Molecular weight: 52793.89 Da        Isoelectric Point: 8.6179

>NTDB_id=940011 V6667_RS02335 WP_338809480.1 506118..507608(+) (comM) [Neisseria leonii strain CCUG 45853]
MTWAVVYSRALSGMSAPLVEVEVHLANGLPQFNIVGLPDTEVKESRDRVRAAIVQSGFDMPAKKITVNLAPADLPKESGR
FDLPIAVGILAASGQVLSDKLADYELAGELALSGALRPVRGALAMAWQGAKDKRAFILPAENAVQTALLKDLTAYGAACL
GEVAAHLNGIAPLQAVSGGIRPSESRPLPDLAEVKGQHTARLALEIAAAGGHSLLMVGPPGTGKSMLAQRLPGILPPLSD
EETISVWALRSLLPLHLQDSSTERPFRAPHHSASAVALVGGGSDPRPGEISLAHHGVLFLDELPEFDRKVLEVLREPLEN
GEIHISRAARQATFPARFQLVAAMNPCPCGYFGHPAKPCRCTPERIAAYRSKISGPLLDRIDLTIEVPALPAAELAQARP
GEASAAVKIRVEAARGRQYARQGKTNAQLSVAELDTVARIEPEAAQVLSGLLEKLSLSARSYHRILRVARTLADLAGDER
VGRAHVLRAVSFRRAL

Nucleotide


Download         Length: 1491 bp        

>NTDB_id=940011 V6667_RS02335 WP_338809480.1 506118..507608(+) (comM) [Neisseria leonii strain CCUG 45853]
ATGACATGGGCGGTGGTGTACAGCCGCGCATTGAGCGGCATGAGTGCGCCTTTGGTCGAAGTGGAAGTCCATTTGGCCAA
CGGGCTGCCGCAGTTTAATATTGTGGGCCTGCCGGATACCGAAGTGAAGGAAAGCCGCGACCGCGTGCGCGCGGCGATTG
TCCAGAGCGGCTTTGACATGCCGGCGAAAAAAATTACCGTCAATCTGGCACCGGCCGACCTGCCCAAGGAGTCAGGCCGC
TTCGACCTGCCGATTGCCGTCGGTATTTTGGCGGCATCGGGGCAGGTATTGTCCGATAAACTGGCCGATTACGAGTTGGC
GGGCGAGCTGGCACTGTCCGGCGCGCTGCGGCCGGTACGCGGTGCGTTGGCTATGGCCTGGCAGGGTGCGAAAGACAAAC
GCGCCTTTATCCTGCCGGCGGAAAATGCCGTACAGACGGCATTGCTGAAAGATTTAACGGCATACGGCGCGGCCTGCCTG
GGCGAAGTGGCCGCCCATCTGAACGGTATCGCGCCGTTGCAGGCGGTATCGGGCGGCATCAGGCCGTCTGAATCCCGCCC
TTTGCCCGATTTGGCCGAAGTCAAAGGCCAGCATACTGCGCGGCTGGCATTGGAAATTGCCGCAGCCGGCGGACACAGTC
TGCTGATGGTCGGCCCGCCCGGTACGGGCAAATCCATGCTGGCGCAACGATTGCCGGGCATTCTGCCGCCGTTGAGCGAC
GAAGAAACGATTTCTGTGTGGGCACTGCGCTCGCTGCTGCCGCTTCATTTGCAGGACAGCAGTACCGAGCGGCCGTTCCG
TGCCCCGCACCACAGTGCCAGCGCGGTGGCTTTGGTCGGCGGCGGTTCCGATCCGCGCCCCGGCGAAATTTCCCTGGCGC
ACCACGGTGTGCTGTTTCTCGACGAGCTGCCCGAATTCGACCGCAAAGTGCTGGAAGTATTGCGCGAGCCGCTGGAAAAC
GGCGAAATCCATATTTCGCGTGCTGCCCGTCAGGCCACATTTCCGGCGCGTTTCCAACTGGTTGCCGCCATGAACCCCTG
TCCGTGCGGCTATTTCGGCCATCCGGCCAAGCCCTGCCGCTGTACTCCCGAACGCATCGCGGCTTACCGCAGCAAAATTT
CCGGCCCGCTGTTGGACCGTATCGATTTAACGATTGAAGTGCCTGCGTTACCGGCGGCCGAGCTGGCTCAGGCACGTCCG
GGCGAGGCGAGTGCCGCAGTCAAAATCCGGGTGGAAGCCGCGCGCGGCCGACAATATGCCCGTCAGGGCAAAACCAACGC
ACAATTGAGCGTGGCCGAATTGGACACCGTTGCCCGGATTGAGCCCGAAGCTGCGCAGGTATTGAGCGGGCTGCTGGAAA
AGCTGTCGCTGTCCGCGCGCAGTTACCACCGCATTTTGAGGGTGGCGCGCACATTGGCCGATTTGGCGGGCGATGAACGG
GTCGGCCGCGCCCATGTGTTACGTGCCGTAAGTTTCCGCAGAGCTTTGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

52.525

99.798

0.524

  comM Vibrio campbellii strain DS40M4

52.323

99.798

0.522

  comM Glaesserella parasuis strain SC1401

51.098

100

0.516

  comM Haemophilus influenzae Rd KW20

50.501

100

0.508

  comM Legionella pneumophila str. Paris

47.348

100

0.486

  comM Legionella pneumophila strain ERS1305867

47.348

100

0.486

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44

100

0.444