Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   INQ43_RS12060 Genome accession   NZ_CP063658
Coordinates   2670470..2671984 (+) Length   504 a.a.
NCBI ID   WP_194036602.1    Uniprot ID   A0A7S6UQ94
Organism   Lysobacter sp. H23M47     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2665470..2676984
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  INQ43_RS12040 (INQ43_12040) speE 2666861..2667712 (+) 852 WP_194036600.1 polyamine aminopropyltransferase -
  INQ43_RS12045 (INQ43_12045) - 2667719..2669623 (-) 1905 WP_194036601.1 DUF4153 domain-containing protein -
  INQ43_RS12050 (INQ43_12050) - 2669646..2669984 (-) 339 WP_043958974.1 P-II family nitrogen regulator -
  INQ43_RS12055 (INQ43_12055) - 2670161..2670460 (+) 300 WP_194034487.1 accessory factor UbiK family protein -
  INQ43_RS12060 (INQ43_12060) comM 2670470..2671984 (+) 1515 WP_194036602.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  INQ43_RS12065 (INQ43_12065) - 2672039..2672251 (-) 213 WP_194036603.1 DUF2945 domain-containing protein -
  INQ43_RS12070 (INQ43_12070) aceA 2672380..2673675 (-) 1296 WP_194036604.1 isocitrate lyase -
  INQ43_RS12075 (INQ43_12075) aceB 2673730..2675412 (-) 1683 WP_194036605.1 malate synthase A -
  INQ43_RS12080 (INQ43_12080) - 2675557..2676528 (+) 972 WP_194036606.1 LysR family transcriptional regulator -

Sequence


Protein


Download         Length: 504 a.a.        Molecular weight: 53257.20 Da        Isoelectric Point: 8.7143

>NTDB_id=496176 INQ43_RS12060 WP_194036602.1 2670470..2671984(+) (comM) [Lysobacter sp. H23M47]
MNLALVHSRARSGIRAAPVRVEVHLGGGLPSMSIVGLPETAVRESRERVRAAIQCAQFEFPARRITVNLAPADLPKGGGR
FDLPIALGILAASGQIPLEALGEYEFLGELGLTGELRAVDAVLPAALAAAQAGRKLVVPPANGAEAALVSGVETRTARTL
LEVCAMLSGHKSLPRAEAPAPSRRLGPDLADVRGQAHARRALEVAAAGGHHLLLVGPPGCGKTLLASRLPGLLPVASDAE
ALDSAAVASLGGRGVDASLWRERPFRSPHHTASAVALVGGGAEPRPGEISMAHNGVLFLDELPEWSRRTLEVLREPLESG
TVTISRAARSVEFPARFQLVAAMNPCPCGWAGDPSGRCRCSPDMVVNYRARISGPLMDRIDLHVEVPRLPPSELRPDAAP
AENSDTVRERVVAARKLQLERAGKANAHLSQSETGATCRLAEADFALLERAIDTLHLSARSMHRIMRVARTVADLAGSPQ
IQTIHLSEAIGYRRVDRGFPVATA

Nucleotide


Download         Length: 1515 bp        

>NTDB_id=496176 INQ43_RS12060 WP_194036602.1 2670470..2671984(+) (comM) [Lysobacter sp. H23M47]
ATGAACCTGGCACTCGTGCACAGCCGTGCGCGATCGGGCATCCGTGCCGCCCCGGTCCGCGTGGAGGTGCATCTGGGTGG
CGGGCTGCCGTCGATGTCGATCGTCGGCCTGCCGGAAACCGCGGTGCGCGAATCGCGTGAGCGCGTGCGTGCCGCGATCC
AGTGCGCCCAGTTCGAGTTCCCGGCCCGGCGGATCACCGTCAACCTCGCCCCCGCCGACCTGCCCAAGGGCGGCGGCCGC
TTCGACCTGCCGATCGCGCTGGGAATCCTCGCCGCCAGTGGACAGATCCCGCTGGAGGCCTTGGGTGAATACGAGTTCCT
CGGCGAGCTGGGCCTGACCGGCGAGTTGCGCGCGGTGGACGCGGTGCTGCCGGCCGCGCTGGCCGCTGCCCAGGCCGGTC
GCAAACTGGTCGTGCCACCCGCCAACGGAGCCGAAGCCGCACTGGTCAGCGGGGTGGAGACGCGGACCGCGCGCACGCTG
CTGGAAGTGTGCGCGATGTTGTCAGGGCATAAATCGCTGCCACGGGCCGAAGCTCCAGCCCCGAGTCGGCGCCTCGGACC
GGACCTGGCCGACGTGCGCGGCCAGGCGCATGCGCGCCGCGCGCTTGAGGTGGCGGCGGCCGGTGGTCACCACCTGCTAC
TCGTGGGGCCGCCAGGCTGTGGAAAGACCCTCCTCGCATCCCGCTTGCCGGGTTTGCTGCCGGTGGCCAGCGATGCCGAA
GCGCTGGATTCGGCGGCCGTCGCTTCGCTGGGCGGACGTGGCGTGGACGCGTCGCTGTGGCGGGAACGCCCGTTCCGCTC
CCCGCACCACACGGCCAGTGCGGTCGCGCTGGTCGGCGGTGGCGCCGAGCCGCGACCGGGCGAAATTTCCATGGCGCACA
ACGGCGTACTGTTTCTGGACGAGCTGCCCGAATGGAGCCGGCGCACGCTGGAGGTGCTGCGCGAGCCGCTGGAGTCGGGC
ACCGTCACCATCTCGCGCGCCGCCCGCAGCGTCGAGTTCCCGGCGCGCTTCCAGCTGGTGGCGGCCATGAACCCCTGCCC
CTGTGGCTGGGCCGGCGACCCCAGCGGGCGCTGCCGCTGCAGCCCCGACATGGTCGTGAATTACCGCGCGCGCATTTCCG
GGCCGCTGATGGACCGGATCGACCTGCACGTCGAGGTGCCGCGGCTGCCGCCATCGGAGCTGCGACCGGATGCCGCGCCT
GCCGAGAACAGCGATACCGTCCGCGAGCGGGTGGTCGCCGCCCGCAAGCTGCAGCTGGAGCGTGCGGGCAAGGCCAACGC
CCACCTCAGTCAGTCAGAGACCGGTGCGACCTGCCGCTTGGCCGAGGCCGACTTCGCCCTGCTGGAACGCGCGATCGACA
CCCTCCACCTGTCGGCACGCTCGATGCACCGGATCATGCGGGTGGCGCGGACGGTCGCCGACCTGGCCGGCAGTCCGCAG
ATCCAGACCATCCACCTGAGCGAGGCGATCGGTTATCGGCGAGTGGATCGCGGGTTCCCGGTGGCAACGGCCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7S6UQ94

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

54.545

100

0.548

  comM Vibrio cholerae strain A1552

54.709

99.008

0.542

  comM Glaesserella parasuis strain SC1401

52.953

100

0.534

  comM Haemophilus influenzae Rd KW20

52.778

100

0.528

  comM Legionella pneumophila str. Paris

50.602

98.81

0.5

  comM Legionella pneumophila strain ERS1305867

50.602

98.81

0.5

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.597

100

0.45