Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   VUG52_RS19660 Genome accession   NZ_CP144367
Coordinates   4315017..4317233 (-) Length   738 a.a.
NCBI ID   WP_331831960.1    Uniprot ID   -
Organism   Pseudomonas sp. LH21     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 4310017..4322233
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  VUG52_RS19625 (VUG52_19625) murB 4310303..4311322 (-) 1020 WP_050704694.1 UDP-N-acetylmuramate dehydrogenase -
  VUG52_RS19630 (VUG52_19630) - 4311319..4311783 (-) 465 WP_050704693.1 low molecular weight protein-tyrosine-phosphatase -
  VUG52_RS19635 (VUG52_19635) kdsB 4311783..4312547 (-) 765 WP_050704692.1 3-deoxy-manno-octulosonate cytidylyltransferase -
  VUG52_RS19640 (VUG52_19640) - 4312544..4312729 (-) 186 WP_011532874.1 Trm112 family protein -
  VUG52_RS19645 (VUG52_19645) lpxK 4312764..4313768 (-) 1005 WP_331831958.1 tetraacyldisaccharide 4'-kinase -
  VUG52_RS19650 (VUG52_19650) - 4313768..4314202 (-) 435 WP_110739307.1 biopolymer transporter ExbD -
  VUG52_RS19655 (VUG52_19655) exbB 4314199..4314834 (-) 636 WP_028690976.1 MotA/TolQ/ExbB proton channel family protein Machinery gene
  VUG52_RS19660 (VUG52_19660) comA 4315017..4317233 (-) 2217 WP_331831960.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  VUG52_RS19665 (VUG52_19665) - 4317378..4318019 (+) 642 WP_052896362.1 DUF2062 domain-containing protein -
  VUG52_RS19670 (VUG52_19670) - 4317903..4318682 (-) 780 WP_028690974.1 ABC transporter permease -
  VUG52_RS19675 (VUG52_19675) - 4318679..4319611 (-) 933 WP_028690973.1 ABC transporter ATP-binding protein -
  VUG52_RS19680 (VUG52_19680) - 4319744..4320367 (-) 624 WP_331831961.1 glutathione S-transferase -
  VUG52_RS19685 (VUG52_19685) - 4321112..4321351 (+) 240 WP_186716206.1 hypothetical protein -
  VUG52_RS19690 (VUG52_19690) - 4321348..4321587 (+) 240 WP_331831967.1 DUF3077 domain-containing protein -

Sequence


Protein


Download         Length: 738 a.a.        Molecular weight: 79944.64 Da        Isoelectric Point: 11.1941

>NTDB_id=933820 VUG52_RS19660 WP_331831960.1 4315017..4317233(-) (comA) [Pseudomonas sp. LH21]
MRTGMLALVLGLLSLRFLPALPPVGWLIPLLLLALASLRTRAWPLGWWMLGVCWACWSAQQALDDRLPAALDGRTLWLEG
RVVGLPTRTERGVRFELEQPQSRRARLPQRVQLSWFDGPPLRAGEHWRLAVNLRRPQGLLNPYGPDQEATLLARRVGATG
TVKAGTRQGEAASSWRDGLRQRLLTTDAHGRETALAALVLGDGAGLAREDWQVLQATGTVHLLVISGQHIGLLAGLVYGL
VAGLARLGAWPRRLPWLPWACGLAMTAALGYGWLAGFGVPVQRACLMLAVVLLWRLRFRHLGAWVPLLMALSGVLLVEPL
ASLLPGFWLSFAAVTVLVLCFAARLGAWRPWQAWTRAQWVIAVGLLPVLLALGLPVSLSAPLANLLAVPWLSLGVLPLAL
LGTALLPVPGLGEGMLWLAGLSLDGLFAVLTRLAALQPAWIPEPLPLWAWLLVCLGAVLVLLPRGVPLRAPGAVMLLALW
APREQVPPGQVEVWQLDVGQGLAVLLRTRNHALLYDAGPAQGGRDLGETVVLPTLRKLGVKALDVMLISHGHADHAGGAA
AVRRGVPVGRVLAGEVAGLEAASLCRSGERWQWDGVDFELWHWPQGASSNERSCVLRVQANGERLLLAGDMEAGAERAWL
AATDDPRVDWLQAPHHGSRTSSSEPFVQATAPRGVLISRGRHNGFGHPHGQVIERYRRHGVAVRDTAVEGALRLVLGSRG
GVEGVRRERRFWREVDEG

Nucleotide


Download         Length: 2217 bp        

>NTDB_id=933820 VUG52_RS19660 WP_331831960.1 4315017..4317233(-) (comA) [Pseudomonas sp. LH21]
ATGCGCACAGGGATGCTGGCGCTGGTCCTCGGGCTGTTGAGCCTGCGCTTTTTACCTGCCTTGCCACCGGTCGGATGGCT
GATTCCATTGCTGTTGCTGGCTCTGGCCAGCCTGCGTACCCGGGCCTGGCCGCTGGGTTGGTGGATGCTGGGTGTGTGCT
GGGCCTGCTGGTCGGCGCAACAGGCCCTGGACGACCGCCTGCCGGCCGCGCTGGACGGTCGCACGCTGTGGCTGGAGGGG
CGCGTGGTCGGCCTGCCGACCCGTACCGAGCGCGGCGTGCGTTTCGAGCTGGAGCAGCCGCAGTCGCGCCGGGCCCGGTT
GCCCCAACGCGTGCAACTGAGCTGGTTCGACGGGCCGCCGCTGCGGGCGGGGGAGCACTGGCGGCTGGCGGTCAACCTGC
GTCGGCCGCAGGGTCTGCTCAACCCCTATGGCCCCGATCAGGAGGCCACGCTGTTGGCGCGGCGGGTCGGTGCCACCGGC
ACGGTCAAGGCCGGTACGCGCCAGGGCGAGGCGGCCAGCAGTTGGCGCGACGGCCTGCGCCAGCGCTTGCTGACCACCGA
TGCCCATGGCCGGGAGACGGCGTTGGCCGCCTTGGTGCTCGGGGATGGCGCAGGCCTGGCGCGGGAGGACTGGCAGGTGC
TGCAGGCCACCGGCACGGTGCACCTGCTGGTGATTTCCGGGCAGCACATCGGCTTGCTGGCCGGATTGGTCTACGGGCTG
GTGGCGGGGCTGGCGCGCCTCGGGGCCTGGCCTCGGCGCCTGCCGTGGCTGCCCTGGGCCTGCGGCCTGGCGATGACAGC
GGCACTGGGTTACGGCTGGCTGGCGGGGTTCGGCGTGCCGGTGCAGCGGGCGTGCCTGATGCTGGCGGTGGTGCTGCTCT
GGCGCTTGCGCTTTCGCCACTTGGGCGCCTGGGTGCCGTTGCTGATGGCGCTGTCGGGCGTGCTGCTGGTCGAGCCGCTG
GCCAGCCTGTTGCCGGGGTTCTGGCTGTCGTTCGCTGCCGTGACGGTGCTGGTGTTGTGCTTCGCCGCCCGGCTGGGCGC
CTGGCGGCCCTGGCAGGCCTGGACCCGCGCCCAGTGGGTGATTGCCGTGGGGTTGTTGCCGGTGCTGCTGGCCTTGGGGC
TGCCCGTCAGCCTCAGTGCGCCGCTGGCCAATCTGCTGGCCGTGCCGTGGCTCAGCCTGGGCGTGCTGCCGCTGGCGTTG
CTGGGCACGGCCCTGTTGCCGGTGCCTGGGCTGGGGGAGGGGATGCTGTGGCTGGCGGGCCTGTCGTTGGACGGCTTGTT
TGCCGTGCTGACGCGATTGGCAGCGCTGCAGCCGGCCTGGATACCCGAGCCGTTGCCGTTGTGGGCCTGGCTGCTGGTGT
GCCTGGGCGCCGTGCTGGTGTTGCTGCCCCGGGGCGTGCCGTTGCGGGCGCCGGGCGCGGTCATGCTGCTGGCGCTGTGG
GCACCACGCGAGCAGGTGCCTCCGGGGCAAGTGGAGGTCTGGCAGCTGGATGTGGGGCAGGGCCTGGCGGTGCTGCTGCG
TACTCGCAATCACGCGCTGCTGTATGACGCAGGTCCGGCTCAGGGTGGCCGTGACCTGGGCGAGACGGTGGTGCTGCCGA
CCTTGCGTAAGCTGGGGGTGAAGGCGCTGGATGTGATGCTGATCAGCCATGGCCATGCCGACCATGCTGGTGGGGCGGCG
GCGGTGCGGCGAGGGGTGCCGGTGGGACGGGTGCTGGCCGGCGAGGTTGCCGGGCTGGAAGCGGCGTCGCTTTGCCGCAG
TGGCGAGCGCTGGCAGTGGGATGGCGTCGATTTCGAGCTTTGGCATTGGCCGCAGGGGGCCTCCAGCAACGAGCGCTCCT
GTGTGTTGCGGGTGCAGGCCAATGGGGAGCGCCTGCTGCTGGCCGGGGACATGGAGGCCGGCGCGGAGCGTGCGTGGCTG
GCCGCGACCGATGATCCTCGGGTGGACTGGTTGCAGGCGCCGCACCATGGCAGCCGCACGTCGTCCAGCGAGCCGTTCGT
CCAGGCCACGGCGCCGCGCGGGGTACTGATTTCCCGAGGGCGGCACAATGGTTTCGGGCATCCCCATGGGCAGGTGATCG
AGCGCTATCGGCGGCATGGGGTGGCGGTGCGTGACACGGCGGTGGAGGGGGCGTTGCGGTTGGTGCTGGGGAGTCGGGGT
GGGGTGGAGGGTGTGCGCAGGGAGCGGCGGTTCTGGCGTGAGGTTGACGAGGGCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Pseudomonas stutzeri DSM 10701

59.944

97.425

0.584

  comA Ralstonia pseudosolanacearum GMI1000

35.28

100

0.409