Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: ANFP_RS11950
Genome accession: NZ_AP025160
Coordinates: 2357903..2358724 (+)
Length: 273 a.a.
NCBI ID: WP_009565783.1
UniProt ID: -
Organism: Acidithiobacillus ferrooxidans strain NFP31
Function: DNA uptake (predicted from homology)
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 2343835..2386856 2357903..2358724 within 0
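
The "within" / 0 bp entry in the summary follows from comparing the two coordinate ranges. A minimal sketch of that comparison is shown here, assuming 1-based inclusive coordinates. The helper name `relate` and the labels other than "within" are hypothetical and are not taken from VRprofile2 output.

```python
def relate(gene, mge):
    """Classify a gene interval relative to an MGE interval.

    Both arguments are (start, end) tuples, assumed 1-based and inclusive.
    Returns a (relative_position, distance_bp) tuple.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0                      # gene fully inside the MGE
    if g_end < m_start:
        return "upstream", m_start - g_end      # illustrative label
    if g_start > m_end:
        return "downstream", g_start - m_end    # illustrative label
    return "overlapping", 0                     # partial overlap


# comM (2357903..2358724) against the ICE (2343835..2386856)
print(relate((2357903, 2358724), (2343835, 2386856)))  # ('within', 0)
```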


Gene organization within MGE regions


Location: 2343835..2386856
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ANFP_RS11885 (ANFP_23900) - 2344235..2345023 (-) 789 WP_009567106.1 hypothetical protein -
  ANFP_RS11890 (ANFP_23910) - 2345020..2346291 (-) 1272 WP_012537159.1 restriction endonuclease subunit S -
  ANFP_RS11895 (ANFP_23920) - 2346288..2346644 (-) 357 WP_012537160.1 hypothetical protein -
  ANFP_RS11900 (ANFP_23930) - 2346641..2347255 (-) 615 WP_009567105.1 helix-turn-helix domain-containing protein -
  ANFP_RS11905 (ANFP_23940) - 2347248..2350346 (-) 3099 WP_012537161.1 type I restriction endonuclease subunit R -
  ANFP_RS11910 (ANFP_23950) ubiD 2350639..2352111 (-) 1473 WP_012537162.1 4-hydroxy-3-polyprenylbenzoate decarboxylase -
  ANFP_RS11915 (ANFP_23960) - 2352136..2353008 (-) 873 WP_009562226.1 phosphate/phosphite/phosphonate ABC transporter substrate-binding protein -
  ANFP_RS11920 (ANFP_23970) - 2353021..2353413 (-) 393 WP_041647435.1 SCP2 sterol-binding domain-containing protein -
  ANFP_RS11925 (ANFP_23980) - 2353524..2354270 (-) 747 WP_009562222.1 tRNA threonylcarbamoyladenosine dehydratase -
  ANFP_RS11930 (ANFP_23990) - 2354270..2355085 (-) 816 WP_009562220.1 TatD family hydrolase -
  ANFP_RS11935 (ANFP_24000) - 2355215..2356447 (+) 1233 WP_225487177.1 FAD-dependent monooxygenase -
  ANFP_RS11940 (ANFP_24010) - 2356437..2357624 (+) 1188 WP_012537165.1 FAD-dependent monooxygenase -
  ANFP_RS11945 (ANFP_24020) - 2357636..2357899 (+) 264 WP_009565782.1 accessory factor UbiK family protein -
  ANFP_RS11950 (ANFP_24030) comM 2357903..2358724 (+) 822 WP_009565783.1 magnesium chelatase domain-containing protein Machinery gene
  ANFP_RS11955 - 2358660..2358794 (+) 135 WP_229129733.1 ATP-binding protein -
  ANFP_RS11960 (ANFP_24040) - 2358866..2360710 (+) 1845 WP_012537166.1 hypothetical protein -
  ANFP_RS11965 - 2360873..2361819 (+) 947 Protein_2368 transposase -
  ANFP_RS11970 (ANFP_24080) - 2362482..2362763 (+) 282 WP_009565786.1 RNA-directed DNA polymerase -
  ANFP_RS11975 (ANFP_24090) ltrA 2362919..2363941 (+) 1023 Protein_2370 group II intron reverse transcriptase/maturase -
  ANFP_RS11980 - 2364046..2364216 (-) 171 Protein_2371 IS481 family transposase -
  ANFP_RS11985 (ANFP_24110) - 2364382..2365626 (+) 1245 WP_012535982.1 tyrosine-type recombinase/integrase -
  ANFP_RS11990 (ANFP_24120) - 2365623..2366564 (+) 942 WP_012535983.1 tyrosine-type recombinase/integrase -
  ANFP_RS11995 (ANFP_24130) - 2366557..2367555 (+) 999 WP_009561339.1 tyrosine-type recombinase/integrase -
  ANFP_RS12000 (ANFP_24140) - 2367648..2368490 (-) 843 Protein_2375 DDE-type integrase/transposase/recombinase -
  ANFP_RS12005 (ANFP_24150) - 2368624..2369010 (-) 387 WP_225487178.1 hypothetical protein -
  ANFP_RS12010 (ANFP_24160) - 2369295..2370302 (-) 1008 WP_012537170.1 hypothetical protein -
  ANFP_RS12015 (ANFP_24170) - 2371023..2371523 (+) 501 WP_012607405.1 hypothetical protein -
  ANFP_RS16805 - 2371816..2372190 (-) 375 WP_012607406.1 helix-turn-helix domain-containing protein -
  ANFP_RS12020 (ANFP_24190) - 2372407..2372598 (+) 192 WP_012537173.1 hypothetical protein -
  ANFP_RS12025 (ANFP_24200) - 2373105..2374487 (+) 1383 WP_041647442.1 DUF6880 family protein -
  ANFP_RS12030 (ANFP_24210) - 2374497..2376926 (+) 2430 WP_012537175.1 TOTE conflict system archaeo-eukaryotic primase domain-containing protein -
  ANFP_RS12040 (ANFP_24230) - 2377463..2377924 (-) 462 WP_196491152.1 GNAT family N-acetyltransferase -
  ANFP_RS12045 (ANFP_24240) - 2378182..2378658 (-) 477 WP_009561179.1 NuoB/complex I 20 kDa subunit family protein -
  ANFP_RS12050 (ANFP_24250) - 2378735..2380912 (-) 2178 WP_012537178.1 PEP/pyruvate-binding domain-containing protein -
  ANFP_RS12055 (ANFP_24260) istB 2381001..2381750 (-) 750 WP_049756721.1 IS21-like element helper ATPase IstB -
  ANFP_RS12060 (ANFP_24280) - 2381942..2382223 (+) 282 WP_012537180.1 BrnT family toxin -
  ANFP_RS12065 (ANFP_24290) - 2382220..2382462 (+) 243 WP_009567395.1 BrnA antitoxin family protein -
  ANFP_RS12070 (ANFP_24300) - 2382670..2383428 (+) 759 WP_012537181.1 helix-turn-helix domain-containing protein -
  ANFP_RS12075 (ANFP_24310) - 2383519..2384316 (+) 798 WP_009567396.1 DUF6615 family protein -
  ANFP_RS12080 (ANFP_24320) - 2384988..2386463 (-) 1476 WP_012607413.1 plasmid recombination protein -

Sequence


Protein


Length: 273 a.a.        Molecular weight: 28247.53 Da        Isoelectric point: 7.1072

>NTDB_id=90193 ANFP_RS11950 WP_009565783.1 2357903..2358724(+) (comM) [Acidithiobacillus ferrooxidans strain NFP31]
MPLAIVHSRALTGVQAAGVAVECDLGPGLPTFAVVGLAETAVKEARDRVRSAIQNSGFEFPARRMVVNLAPADLPKEGGR
FDLPMAIGILAASGQLPAAALEKLEMIGELALDGSLRPVTGTLSSALAAGQAGHAILVPQGNAREAAFAQSTPVFACANL
AQAAAHLRGTDRLPEVVCDAESPESAFPYLDLRDVRGQESTKRALIVAAVGGHHLLLSGPPGTGKSMLAARLPGLLPPLH
RSEALEVAAIHSLARERFRYPSVGQTSVPQPPP
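
The FASTA header of this record packs several fields into one line. The sketch below pulls them apart with a regular expression and checks that the coordinates agree with the stated protein length. The field layout is inferred from this single record, so the pattern and the name `parse_header` are assumptions, not a documented format.

```python
import re

# Pattern inferred from the header of this record only (assumption).
HEADER_RE = re.compile(
    r">NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>\S+)\) \[(?P<organism>.+)\]"
)


def parse_header(line):
    """Return the header fields as a dict, with coordinates as integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not follow the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=90193 ANFP_RS11950 WP_009565783.1 2357903..2358724(+) "
          "(comM) [Acidithiobacillus ferrooxidans strain NFP31]")
rec = parse_header(header)
gene_bp = rec["end"] - rec["start"] + 1   # inclusive coordinates
protein_aa = gene_bp // 3 - 1             # drop the stop codon
print(rec["gene"], gene_bp, protein_aa)   # comM 822 273
```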

Nucleotide


Length: 822 bp

>NTDB_id=90193 ANFP_RS11950 WP_009565783.1 2357903..2358724(+) (comM) [Acidithiobacillus ferrooxidans strain NFP31]
TTGCCGCTGGCGATAGTCCACAGCCGCGCCCTGACCGGGGTACAGGCAGCAGGCGTCGCCGTGGAATGCGATCTGGGCCC
CGGCCTACCGACCTTTGCCGTTGTCGGTCTCGCTGAAACGGCGGTCAAGGAAGCCCGGGACCGGGTGCGTTCAGCCATTC
AAAACAGCGGCTTCGAGTTTCCGGCACGACGGATGGTCGTCAATCTGGCCCCTGCCGACCTGCCCAAGGAAGGTGGCCGC
TTCGATCTGCCCATGGCCATCGGCATTCTCGCCGCCAGCGGGCAACTTCCTGCGGCCGCACTGGAAAAGCTGGAGATGAT
CGGTGAATTGGCACTGGATGGGAGCCTACGTCCGGTGACAGGCACGCTCTCCAGCGCACTGGCCGCTGGACAGGCGGGAC
ACGCCATTCTGGTGCCGCAAGGCAATGCCCGGGAAGCCGCTTTTGCCCAAAGTACGCCGGTCTTTGCCTGCGCGAACCTG
GCACAGGCCGCCGCCCACCTGCGTGGTACGGATCGCCTACCTGAAGTCGTCTGCGATGCGGAAAGCCCCGAATCGGCCTT
TCCGTACCTCGATCTGCGCGATGTGCGTGGTCAGGAATCGACCAAGCGGGCACTGATTGTTGCGGCGGTGGGTGGTCATC
ACCTACTCCTGTCGGGCCCGCCCGGCACCGGGAAAAGCATGCTGGCGGCACGACTACCCGGCCTGTTACCGCCCCTGCAC
CGCAGCGAGGCGCTGGAAGTAGCCGCCATCCACAGCCTTGCAAGGGAACGGTTTCGATATCCGTCAGTGGGGCAGACGTC
CGTTCCGCAGCCCCCACCATAG
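
The nucleotide sequence starts with TTG, not ATG, while the protein sequence starts with M. A minimal sketch of the translation that reconciles the two is given below. It assumes the standard codon assignments and treats the first codon as an initiator read as methionine, as is common for bacterial coding sequences. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(cds):
    """Translate an uppercase coding sequence that begins at its start codon.

    Assumption: the first codon is an initiator and is read as M even when
    it is an alternative start such as TTG. A trailing stop is removed.
    """
    usable = len(cds) - len(cds) % 3          # ignore an incomplete last codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    protein = "".join(CODON_TABLE[c] for c in codons)
    if protein:
        protein = "M" + protein[1:]           # initiator codon read as Met
    return protein.rstrip("*")


# First 27 nt of the comM sequence above
print(translate_cds("TTGCCGCTGGCGATAGTCCACAGCCGC"))  # MPLAIVHSR
```

The forced M is only appropriate when the input really begins at the annotated start codon; for an internal fragment the first residue should be taken from the table instead.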


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4 48.097 100 0.509
  comM Vibrio cholerae strain A1552 52.326 94.505 0.495
  comM Haemophilus influenzae Rd KW20 48.881 98.168 0.48
  comM Glaesserella parasuis strain SC1401 48.485 96.703 0.469
  comM Legionella pneumophila str. Paris 44.964 100 0.458
  comM Legionella pneumophila strain ERS1305867 44.964 100 0.458
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 41.634 94.139 0.392


Multiple sequence alignment