Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   G7069_RS04130 Genome accession   NZ_CP049864
Coordinates   855724..857220 (-) Length   498 a.a.
NCBI ID   WP_166294563.1    Uniprot ID   A0A6G7YVU1
Organism   Lysobacter sp. HDW10     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 850724..862220
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  G7069_RS04110 (G7069_04110) - 851028..853322 (-) 2295 WP_166294555.1 TonB-dependent receptor -
  G7069_RS04115 (G7069_04115) - 853463..854866 (+) 1404 WP_166294557.1 GntP family permease -
  G7069_RS04120 (G7069_04120) - 854945..855286 (+) 342 WP_205758751.1 PadR family transcriptional regulator -
  G7069_RS04125 (G7069_04125) - 855283..855720 (+) 438 WP_166294561.1 hypothetical protein -
  G7069_RS04130 (G7069_04130) comM 855724..857220 (-) 1497 WP_166294563.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  G7069_RS04135 (G7069_04135) - 857235..857507 (-) 273 WP_166294565.1 accessory factor UbiK family protein -
  G7069_RS04140 (G7069_04140) - 857635..857973 (+) 339 WP_166294567.1 P-II family nitrogen regulator -
  G7069_RS04145 (G7069_04145) - 857970..859829 (-) 1860 WP_166294569.1 diguanylate cyclase -
  G7069_RS04150 (G7069_04150) speE 859971..860819 (-) 849 WP_166294571.1 polyamine aminopropyltransferase -

Sequence


Protein


Download         Length: 498 a.a.        Molecular weight: 53674.47 Da        Isoelectric Point: 8.2852

>NTDB_id=428240 G7069_RS04130 WP_166294563.1 855724..857220(-) (comM) [Lysobacter sp. HDW10]
MALAVVHSRARLGVRAPEVAVEVHLGGGLPRMSIVGLPEAAVRESKDRVRAAFASTQFDFPARAITVNLAPADLPKQGGA
FDLPIALGILVASGQLPARCLQGREFMGELGLTGDLKRIRGVLPAALASAMVKRQLVIPHANAEEAALCETSDIRTAHDL
RALVEQLRANVPAEPLRRPINDSDKSHWPDMADVRGQQVARRALEIAAAGEHHVLFAGPPGCGKSMLATRLPGIMPMASD
EEALETAAIASLATTDVSTRAWRQRPFRSPHHTASAVALIGGGRDPRPGEVSLAHNGVLFLDEFTEWSRHALQCLREPLE
SGVVHIARASKQLSYPARFQLIAAMNPCPCGYAGDPSQRCECMPDAIARYRAKVSGPLLDRIDLHVSLARIHASELHGAL
SQGEGSEAIRERVADARGRQVLRQSVPNARLEGATLTAHCTEGMEDADGFHTASERLRLSARASHRVLRVARTIADLAHA
AHIERAHWLEALTFRPQI

Nucleotide


Download         Length: 1497 bp        

>NTDB_id=428240 G7069_RS04130 WP_166294563.1 855724..857220(-) (comM) [Lysobacter sp. HDW10]
ATGGCTTTGGCCGTCGTGCACAGTCGTGCACGCTTGGGCGTGCGTGCGCCTGAAGTTGCCGTCGAGGTGCACTTGGGTGG
CGGATTGCCGCGCATGTCGATCGTCGGCTTACCCGAAGCAGCGGTACGCGAATCAAAAGACCGTGTACGCGCTGCATTTG
CCAGCACGCAGTTCGACTTTCCCGCGCGCGCGATCACAGTCAATCTCGCACCCGCCGATTTGCCGAAACAAGGCGGTGCG
TTTGATCTTCCGATTGCACTCGGCATCCTGGTCGCCTCTGGGCAACTGCCGGCTCGCTGTTTGCAGGGACGCGAGTTCAT
GGGCGAGCTCGGCCTCACGGGTGATTTGAAGCGCATTCGCGGGGTCCTACCGGCCGCGCTCGCCAGTGCAATGGTCAAAC
GTCAACTCGTCATTCCGCATGCGAATGCCGAAGAAGCCGCTTTGTGTGAGACCTCAGACATTCGTACGGCACATGATTTG
CGTGCACTGGTTGAACAGCTTCGTGCAAACGTACCGGCTGAGCCTTTGCGTCGGCCAATCAACGACTCGGACAAGTCGCA
TTGGCCTGACATGGCCGATGTGCGTGGCCAACAAGTGGCAAGACGCGCGCTCGAGATAGCAGCTGCGGGCGAACACCATG
TGCTGTTCGCAGGGCCGCCAGGCTGTGGCAAGAGCATGTTGGCAACACGGTTGCCCGGCATCATGCCGATGGCGAGCGAT
GAAGAGGCACTTGAGACCGCCGCGATTGCTTCGTTGGCGACCACGGATGTGTCCACGCGTGCATGGCGACAGCGACCTTT
TCGATCGCCGCATCACACGGCGAGTGCCGTGGCGTTGATCGGCGGTGGGCGCGACCCGCGACCGGGTGAAGTGTCGCTCG
CGCACAACGGCGTGCTCTTTTTGGATGAATTTACAGAGTGGAGTCGCCATGCATTGCAGTGCTTGCGCGAACCGCTAGAG
TCCGGCGTGGTGCATATTGCGCGCGCGTCAAAACAACTGAGCTATCCGGCGCGCTTTCAGTTGATTGCTGCAATGAACCC
TTGTCCGTGCGGCTATGCCGGGGATCCATCGCAGCGGTGTGAATGCATGCCCGATGCCATTGCGCGGTATCGCGCCAAGG
TTTCGGGTCCGCTGCTGGATCGGATCGATCTGCATGTGTCGCTGGCGCGTATTCACGCGTCCGAATTGCACGGCGCGTTG
TCTCAGGGCGAGGGCAGTGAAGCGATTCGCGAACGGGTGGCAGATGCGCGGGGGCGCCAAGTGCTTCGGCAATCCGTGCC
CAATGCGCGATTAGAAGGTGCAACGCTCACCGCACATTGCACGGAAGGCATGGAAGATGCCGACGGCTTTCATACGGCCA
GTGAGCGATTACGTTTGTCTGCTCGGGCCAGCCATCGCGTGTTGCGTGTGGCACGCACCATTGCCGATCTGGCGCATGCA
GCGCACATCGAACGCGCACATTGGCTGGAGGCCTTGACCTTCAGGCCACAAATCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6G7YVU1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

49.697

99.398

0.494

  comM Vibrio cholerae strain A1552

49.293

99.398

0.49

  comM Haemophilus influenzae Rd KW20

47.714

100

0.482

  comM Glaesserella parasuis strain SC1401

47.896

100

0.48

  comM Legionella pneumophila str. Paris

46.894

100

0.47

  comM Legionella pneumophila strain ERS1305867

46.894

100

0.47

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

41.897

100

0.426