Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: ANFP_RS12485
Genome accession: NZ_AP025160
Coordinates: 2454390..2455091 (+)
Length: 233 a.a.
NCBI ID: WP_074874161.1
UniProt ID: -
Organism: Acidithiobacillus ferrooxidans strain NFP31
Function: DNA uptake (predicted from homology)
DNA binding and uptake
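
The coordinates and the protein length listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence includes its stop codon; the variable names are illustrative only.

```python
# Coordinates of comM (ANFP_RS12485) as listed in the overview.
# Assumption: 1-based, inclusive coordinates.
start, end = 2454390, 2455091

gene_length_bp = end - start + 1      # inclusive span of the gene
codons = gene_length_bp // 3          # assumed to include the stop codon
protein_length_aa = codons - 1        # the stop codon encodes no residue

print(gene_length_bp, protein_length_aa)  # 702 233
```

The result agrees with the 702 bp and 233 a.a. lengths reported in the Sequence section.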

Genomic Context


Location: 2449390..2460091
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ANFP_RS12465 - 2449775..2450851 (-) 1077 WP_111122573.1 TniQ family protein -
  ANFP_RS12470 (ANFP_25120) - 2450866..2451753 (-) 888 WP_041646474.1 TniB family NTP-binding protein -
  ANFP_RS12475 (ANFP_25130) - 2451750..2453630 (-) 1881 WP_012607455.1 Mu transposase C-terminal domain-containing protein -
  ANFP_RS12480 (ANFP_25140) - 2453623..2454312 (-) 690 WP_012537231.1 heteromeric transposase endonuclease subunit TnsA -
  ANFP_RS12485 (ANFP_25150) comM 2454390..2455091 (+) 702 WP_074874161.1 ATP-binding protein Machinery gene
  ANFP_RS12490 (ANFP_25160) - 2455238..2455615 (-) 378 WP_012537232.1 DsrE family protein -
  ANFP_RS12495 (ANFP_25170) - 2455857..2456339 (-) 483 WP_009561598.1 MerR family transcriptional regulator -
  ANFP_RS12500 (ANFP_25180) - 2456385..2456618 (-) 234 WP_009561599.1 hypothetical protein -
  ANFP_RS12505 (ANFP_25190) - 2457153..2457677 (-) 525 WP_009561601.1 tetratricopeptide repeat protein -
  ANFP_RS12510 (ANFP_25200) - 2457705..2459324 (-) 1620 WP_012537233.1 peptide chain release factor 3 -
  ANFP_RS12515 (ANFP_25210) rimI 2459331..2459807 (-) 477 WP_012537234.1 ribosomal protein S18-alanine N-acetyltransferase -
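
The listed Location spans the gene plus 5000 bp on either side, and each Size (bp) value equals the inclusive length of its coordinate range. The following sketch checks both for a few rows copied from the table. The 5000 bp flank is inferred from the numbers rather than stated by the source, and the helper name `size_bp` is hypothetical.

```python
# comM coordinates from the table; assumed 1-based and inclusive.
gene_start, gene_end = 2454390, 2455091
flank = 5000  # inferred from the listed Location, not documented by the source

window = (gene_start - flank, gene_end + flank)

def size_bp(start, end):
    """Inclusive length of a coordinate range (hypothetical helper)."""
    return end - start + 1

# (locus tag, start, end, listed size) for three rows of the table
rows = [
    ("ANFP_RS12465", 2449775, 2450851, 1077),
    ("ANFP_RS12485", 2454390, 2455091, 702),
    ("ANFP_RS12515", 2459331, 2459807, 477),
]
sizes_ok = all(size_bp(s, e) == listed for _, s, e, listed in rows)

print(window, sizes_ok)
```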

Sequence


Protein


Length: 233 a.a.; Molecular weight: 25852.47 Da; Isoelectric point: 7.7885

>NTDB_id=90194 ANFP_RS12485 WP_074874161.1 2454390..2455091(+) (comM) [Acidithiobacillus ferrooxidans strain NFP31]
MVHSVALIGRSHSGSHPRPGEISLAHHGVLFLDEMPEFPRAVLEVLREPLESGEIHIARAARRATFPARFQLVAAMNPCP
CGHLGDPQQICRCTPAQVSQYRSRLSGPLLDRIDIQMEVPALPVSALQEAGPGESSAYWRERIAQAVDRQWQRQQVRNAQ
LQGELLDQFCALDTVGTRLLSRATETLHLSARGYHRVLRVARSIADLEGSDPIGTQHLAEAIQYRRLAQTLAQ
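
For scripted use, the FASTA header above can be split into its fields. This is a minimal sketch: the field layout is inferred from this single record and may not hold for other entries, and the function name `parse_header` and the dictionary keys are hypothetical.

```python
import re

# Pattern inferred from the one header shown in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>.+)\]$"
)

def parse_header(line):
    """Return the header fields as a dict, or None if the line does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields

header = (">NTDB_id=90194 ANFP_RS12485 WP_074874161.1 "
          "2454390..2455091(+) (comM) "
          "[Acidithiobacillus ferrooxidans strain NFP31]")
record = parse_header(header)
print(record["gene"], record["start"], record["end"], record["strand"])
```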

Nucleotide


Length: 702 bp

>NTDB_id=90194 ANFP_RS12485 WP_074874161.1 2454390..2455091(+) (comM) [Acidithiobacillus ferrooxidans strain NFP31]
ATGGTTCATAGTGTAGCATTAATTGGTAGAAGTCACAGTGGTTCGCACCCTCGCCCCGGTGAAATCAGTCTGGCACATCA
TGGCGTTCTTTTCCTGGACGAAATGCCCGAGTTCCCTCGCGCGGTGCTGGAAGTCCTGCGGGAGCCGTTGGAATCGGGAG
AAATCCATATCGCCAGGGCGGCGCGGCGGGCCACTTTTCCAGCCCGGTTTCAGTTGGTCGCCGCGATGAATCCCTGTCCC
TGCGGCCATCTCGGTGATCCGCAGCAGATATGCCGCTGTACCCCGGCACAGGTCAGCCAATACCGCAGTCGGCTCTCCGG
CCCGCTGCTGGACCGCATCGACATCCAGATGGAAGTGCCCGCCCTGCCGGTAAGCGCCTTACAAGAGGCGGGACCAGGCG
AAAGTTCTGCCTACTGGCGCGAACGCATAGCTCAGGCCGTTGATCGCCAGTGGCAGCGGCAGCAGGTGCGCAATGCACAA
CTTCAGGGCGAACTGCTGGATCAGTTTTGCGCTCTGGATACGGTGGGCACCCGCCTGCTCAGCCGCGCCACCGAGACCCT
GCACCTCTCCGCGCGGGGCTACCATCGGGTGCTGCGCGTCGCCCGCAGTATCGCGGATCTGGAAGGGAGTGATCCGATCG
GTACCCAGCATCTGGCAGAAGCCATCCAGTATCGCCGCCTGGCGCAGACGCTGGCCCAATGA
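
The nucleotide record can be checked against the protein record by translating it. The sketch below builds the standard codon table by hand and translates only the first 60 nucleotides of the sequence above. It assumes the standard codon assignments apply to this gene, and the function name `translate` is illustrative rather than part of any library.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 60 nt of the comM nucleotide sequence listed above.
prefix = "ATGGTTCATAGTGTAGCATTAATTGGTAGAAGTCACAGTGGTTCGCACCCTCGCCCCGGT"
print(translate(prefix))  # MVHSVALIGRSHSGSHPRPG
```

The output matches the first 20 residues of the protein sequence, and the final codon of the full record, TGA, is a stop codon in this table.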


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4 50.862 99.571 0.506
  comM Vibrio cholerae strain A1552 50.431 99.571 0.502
  comM Haemophilus influenzae Rd KW20 48.936 100 0.494
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 49.351 99.142 0.489
  comM Glaesserella parasuis strain SC1401 48.918 99.142 0.485
  comM Legionella pneumophila strain ERS1305867 49.107 96.137 0.472
  comM Legionella pneumophila str. Paris 49.107 96.137 0.472


Multiple sequence alignment