Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   FH041_RS18080 Genome accession   NZ_CP040930
Coordinates   3960695..3962185 (-) Length   496 a.a.
NCBI ID   WP_125921885.1    Uniprot ID   -
Organism   Pseudomonas sp. SWI7     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 3955695..3967185
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FH041_RS18055 (FH041_18055) - 3956650..3957561 (-) 912 WP_027913398.1 LysR substrate-binding domain-containing protein -
  FH041_RS18060 (FH041_18060) - 3957690..3958487 (-) 798 WP_028686766.1 SDR family NAD(P)-dependent oxidoreductase -
  FH041_RS18065 (FH041_18065) - 3958487..3959263 (-) 777 WP_027913400.1 alpha/beta fold hydrolase -
  FH041_RS18070 (FH041_18070) - 3959443..3960018 (-) 576 WP_140174923.1 PIN domain-containing protein -
  FH041_RS18075 (FH041_18075) - 3960015..3960482 (-) 468 WP_027913402.1 helix-turn-helix domain-containing protein -
  FH041_RS18080 (FH041_18080) comM 3960695..3962185 (-) 1491 WP_125921885.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  FH041_RS18085 (FH041_18085) - 3962418..3964427 (+) 2010 WP_111475156.1 DUF4034 domain-containing protein -
  FH041_RS18090 (FH041_18090) - 3964463..3964720 (-) 258 WP_027913405.1 accessory factor UbiK family protein -
  FH041_RS18095 (FH041_18095) glnK 3965084..3965422 (+) 339 WP_002555808.1 P-II family nitrogen regulator -
  FH041_RS18100 (FH041_18100) - 3965470..3966801 (+) 1332 WP_023532646.1 ammonium transporter -

Sequence


Protein


Download         Length: 496 a.a.        Molecular weight: 52685.42 Da        Isoelectric Point: 7.8847

>NTDB_id=368380 FH041_RS18080 WP_125921885.1 3960695..3962185(-) (comM) [Pseudomonas sp. SWI7]
MSLALVHSRAQVGVQAPAVSVETHLANGLPHLTLVGLPETTVKESKDRVRSAIVNSGLNYPARRITQNLAPADLPKDGGR
YDLAIALGILAADAQVPVATLAELECLGELALSGALRPVQGVLPAALAARQAGRALVVPRENAEEASLASGLVVYAVGHL
LELVAHLNGQVPLAAYAANGLLLNPRPYPDLSEVQGQLAAKRALLLAAAGAHNLLFTGPPGTGKTLLASRLPGLLPPLDE
HEALEVAAIQSVSGHTPLNSWPQRPFRHPHHSASGPALVGGGSRPQPGEITLAHHGVLFLDELPEFERRVLEVLREPLES
GEIVIARARDKVRFPARFQLVAAMNPCPCGYLGDPTGRCRCNTEQISRYRNKLSGPLLDRIDLHLTVARESTTLNNQPCG
QSSADVALQVAHAREVQQRRQGCANAFLNLDGLRQHCSLAAPDQAWLEGACERLTLSLRAAHRLLKVARTLADLEGCTAI
GRAHLAEALQYRPGSG

Nucleotide


Download         Length: 1491 bp        

>NTDB_id=368380 FH041_RS18080 WP_125921885.1 3960695..3962185(-) (comM) [Pseudomonas sp. SWI7]
ATGTCTTTAGCCCTTGTTCACAGCCGCGCCCAGGTAGGCGTGCAGGCCCCGGCCGTCAGCGTTGAAACCCACCTGGCCAA
TGGCTTGCCTCATCTCACACTGGTCGGCTTGCCCGAAACCACGGTCAAGGAAAGCAAGGACCGCGTGCGCAGTGCCATCG
TCAACAGTGGCCTGAACTACCCAGCCCGGCGCATCACGCAAAACCTCGCTCCGGCCGACCTGCCCAAGGACGGTGGCCGC
TACGACCTGGCCATCGCCCTGGGCATCCTGGCGGCCGATGCCCAGGTGCCCGTGGCCACCCTGGCCGAGCTCGAATGCCT
GGGCGAGCTGGCACTGTCGGGCGCGTTGCGGCCGGTACAGGGCGTCTTGCCCGCAGCGCTTGCCGCACGCCAGGCGGGGC
GGGCACTGGTAGTGCCCCGTGAAAATGCCGAGGAAGCCAGCCTTGCCAGTGGCCTGGTGGTGTATGCCGTAGGCCATCTA
TTGGAGCTGGTGGCCCACCTGAACGGACAGGTACCGCTAGCTGCCTATGCCGCCAATGGCTTGCTCCTGAACCCTCGCCC
CTACCCCGACCTGAGCGAGGTGCAGGGGCAACTGGCGGCCAAACGCGCGCTGCTACTGGCCGCTGCCGGGGCGCACAACC
TGCTTTTCACCGGGCCGCCAGGTACTGGCAAAACCTTGTTGGCCAGCCGCCTGCCGGGCCTGCTGCCACCGCTGGATGAG
CATGAAGCGCTGGAGGTCGCCGCCATCCAGTCGGTCAGCGGGCACACCCCGCTGAACAGCTGGCCACAACGGCCGTTTCG
CCACCCTCATCATTCCGCCTCCGGACCTGCGCTGGTGGGTGGCGGAAGCCGGCCGCAGCCCGGGGAAATCACCTTGGCCC
ACCACGGGGTGCTGTTTCTGGACGAGTTGCCCGAATTCGAGCGCCGGGTACTGGAAGTGCTGCGTGAACCCCTGGAATCT
GGCGAGATCGTAATCGCCAGGGCGCGGGATAAAGTGCGCTTTCCCGCCCGCTTTCAGCTCGTCGCTGCCATGAACCCCTG
CCCTTGCGGCTACCTGGGCGACCCTACAGGGCGATGCCGCTGCAATACCGAGCAGATATCCCGTTATCGCAACAAACTGT
CAGGCCCGCTACTGGACCGGATCGACTTGCACCTGACCGTGGCGCGCGAGAGCACCACGCTCAACAACCAGCCTTGCGGG
CAAAGCAGCGCCGACGTCGCCCTTCAGGTCGCGCACGCCCGGGAGGTGCAACAACGTCGGCAAGGGTGCGCCAATGCATT
CCTCAACCTCGATGGCCTGCGCCAGCACTGTAGCTTGGCAGCACCTGATCAAGCTTGGTTGGAAGGCGCCTGTGAACGAT
TGACGCTGTCGCTGCGAGCAGCACACCGATTGCTCAAGGTCGCCAGAACACTGGCAGACCTCGAGGGATGCACGGCAATT
GGCCGGGCGCACCTGGCTGAGGCCCTGCAGTACCGTCCCGGGAGTGGTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

55.4

100

0.558

  comM Vibrio campbellii strain DS40M4

55.645

100

0.556

  comM Vibrio cholerae strain A1552

55.556

99.798

0.554

  comM Glaesserella parasuis strain SC1401

53.6

100

0.54

  comM Legionella pneumophila str. Paris

49.899

100

0.5

  comM Legionella pneumophila strain ERS1305867

49.899

100

0.5

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.322

100

0.47


Multiple sequence alignment