Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comA
Type: Machinery gene
Locus tag: WIF35_RS00540
Genome accession: NZ_CP150631
Coordinates: 91420..93027 (+)
Length: 535 a.a.
NCBI ID: WP_412479613.1
UniProt ID: -
Organism: Azonexus sp. IMCC34839
Function: DNA uptake (predicted from homology)
          DNA binding and uptake
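
The coordinates and the protein length listed above agree with each other. A minimal Python sketch of the arithmetic follows. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def cds_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


bp = cds_length_bp(91420, 93027)
aa = protein_length_aa(bp)
print(bp, aa)  # 1608 535
```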

Genomic Context


Location: 86420..98027
Locus tag      Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                                    Description
WIF35_RS00525  -          87157..89358 (+)      2202       WP_412479610.1  hypothetical protein                                       -
WIF35_RS00530  -          89369..89650 (+)      282        WP_412479611.1  hypothetical protein                                       -
WIF35_RS00535  -          89895..90431 (-)      537        WP_412479612.1  DUF2442 domain-containing protein                          -
WIF35_RS00540  comA       91420..93027 (+)      1608       WP_412479613.1  DNA internalization-related competence protein ComEC/Rec2  Machinery gene
WIF35_RS00545  -          93053..93907 (+)      855        WP_412479614.1  alpha/beta fold hydrolase                                  -
WIF35_RS00550  -          93904..94671 (+)      768        WP_412479615.1  alpha/beta fold hydrolase                                  -
WIF35_RS00555  -          94611..95393 (-)      783        WP_412479616.1  zinc-dependent peptidase                                   -
WIF35_RS00560  -          95393..96280 (-)      888        WP_412479617.1  DMT family transporter                                     -
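
The location window and the spacing between neighbouring genes can be recomputed from the coordinates in the table. The sketch below is one way to do that in Python. The 5000 bp flank is an assumption inferred from this record (91420 - 5000 and 93027 + 5000), not a documented database setting, and the helper names are hypothetical.

```python
# (locus_tag, start, end, strand), copied from the Genomic Context table
genes = [
    ("WIF35_RS00525", 87157, 89358, "+"),
    ("WIF35_RS00530", 89369, 89650, "+"),
    ("WIF35_RS00535", 89895, 90431, "-"),
    ("WIF35_RS00540", 91420, 93027, "+"),  # comA
    ("WIF35_RS00545", 93053, 93907, "+"),
    ("WIF35_RS00550", 93904, 94671, "+"),
    ("WIF35_RS00555", 94611, 95393, "-"),
    ("WIF35_RS00560", 95393, 96280, "-"),
]

FLANK_BP = 5000  # assumption: inferred from this record only


def context_window(start, end, flank=FLANK_BP):
    """Window around a gene, clipped at position 1 on the left."""
    return max(1, start - flank), end + flank


def gap_bp(left, right):
    """Bases between two adjacent features; negative means overlap."""
    return right[1] - left[2] - 1


window = context_window(91420, 93027)
gaps = [gap_bp(a, b) for a, b in zip(genes, genes[1:])]
print(window)  # (86420, 98027)
print(gaps)    # [10, 244, 988, 25, -4, -61, -1]
```

Negative values mark overlapping annotations: for example, WIF35_RS00545 and WIF35_RS00550 share 4 bp (93904..93907).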

Sequence


Protein


Length: 535 a.a.   Molecular weight: 57648.42 Da   Isoelectric point: 9.3356

>NTDB_id=970710 WIF35_RS00540 WP_412479613.1 91420..93027(+) (comA) [Azonexus sp. IMCC34839]
MFNRTGTTHLMSISGLHVTMVAALFGGLIGALWRRVPALAIRLPAQQAAIAAAGLAALGYALLAGFAVPAQRTLYMLLVA
GLASWSGRIVAPSRVLALALLVVLLIDPWAVLAAGFWLSFGAVAALLYVGSAQLGRPVGWRERLRGWGMVQWAATLASLP
VLLLVFQQFSLVSPLANALAIPVVSFVVTPLALLGAMIPWWPILALAHELLVWLLRFLEWCAVWPVWQAPAPPLWAALTA
GLGVAVCLLPRGVPGRLLGVFLLLPALFWPVAKPGEGEAWIDVLDVGQGLATVVRTREHTLIYDPGPLYSAESDAGQRVV
VPYLRWLGVNRIDTLVVTHRDTDHSGGTASVRAALPVGEILSSLPELGGQLCQAGQDWRWDGVDFAVLHPLEVTAAVHEA
KRAKTNHASCVLKVSAGDHSMLLTSDIEAPDEAALLAREGVKLRSDILLVPHHGSRTSSTEIFLDAVGANEAIVPVGYRN
RFGHPKADVMERYAARGVRVWRTDRDGALQVRLAADTTPALRGWRSEHARYWHGR
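
The FASTA header above packs several fields into one line. A minimal Python sketch for pulling them apart follows. The field layout is inferred from this single record and may not hold for every entry, and `parse_header` is a hypothetical helper, not a database API.

```python
import re

# Layout assumed from this record:
# >NTDB_id=<id> <locus_tag> <protein_id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    record = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        record[key] = int(record[key])
    return record


header = (">NTDB_id=970710 WIF35_RS00540 WP_412479613.1 "
          "91420..93027(+) (comA) [Azonexus sp. IMCC34839]")
rec = parse_header(header)
print(rec["locus_tag"], rec["gene"], rec["start"], rec["end"], rec["strand"])
```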

Nucleotide


Length: 1608 bp

>NTDB_id=970710 WIF35_RS00540 WP_412479613.1 91420..93027(+) (comA) [Azonexus sp. IMCC34839]
GTGTTCAATCGCACCGGCACAACGCATCTGATGTCGATTTCCGGTTTGCATGTGACCATGGTGGCAGCGCTGTTCGGCGG
TCTGATCGGGGCGCTGTGGCGACGCGTGCCGGCCTTGGCCATTCGCCTGCCGGCGCAGCAGGCGGCCATCGCGGCGGCCG
GTCTGGCGGCGCTCGGCTATGCCTTGCTGGCGGGTTTTGCCGTACCGGCGCAGCGCACGCTTTACATGTTGCTGGTGGCT
GGTTTGGCTTCGTGGTCTGGGCGCATCGTGGCGCCGAGCCGGGTGCTGGCGCTGGCCTTGCTGGTGGTTCTACTGATCGA
TCCGTGGGCCGTTTTGGCGGCGGGTTTCTGGCTGTCGTTCGGCGCGGTGGCAGCCTTGCTTTATGTCGGCTCGGCGCAGC
TTGGCCGGCCGGTCGGCTGGCGTGAGCGTCTGCGCGGCTGGGGCATGGTGCAATGGGCGGCGACGCTGGCTTCCCTGCCG
GTGTTGCTGCTCGTCTTCCAGCAGTTCTCGCTGGTGTCGCCGCTGGCCAATGCCTTGGCGATTCCTGTGGTCAGCTTCGT
CGTCACGCCGTTGGCGCTGCTCGGGGCCATGATTCCGTGGTGGCCGATTCTCGCGCTGGCGCATGAACTACTCGTGTGGC
TGCTGCGTTTTCTCGAATGGTGCGCCGTCTGGCCAGTCTGGCAGGCGCCGGCGCCGCCCTTGTGGGCCGCACTGACAGCC
GGGTTGGGCGTGGCGGTGTGCCTGCTGCCGCGCGGCGTGCCGGGGCGTCTGCTGGGCGTGTTTTTGTTGTTGCCGGCCTT
GTTCTGGCCGGTAGCAAAACCCGGCGAAGGCGAAGCATGGATCGACGTGCTCGATGTCGGACAGGGCCTGGCGACGGTGG
TGCGTACGCGTGAGCACACGTTGATCTACGATCCGGGGCCCCTTTACAGCGCCGAGTCGGACGCCGGGCAACGCGTGGTC
GTGCCTTATTTGCGCTGGCTGGGCGTGAATCGGATCGATACGCTGGTGGTCACCCATCGCGATACCGATCATTCGGGCGG
CACGGCTTCGGTGCGGGCGGCGCTGCCGGTTGGAGAAATTCTCTCCTCCTTGCCGGAACTGGGCGGCCAACTCTGCCAGG
CTGGGCAAGACTGGCGTTGGGATGGCGTCGATTTCGCCGTTTTGCATCCGCTGGAGGTGACTGCTGCGGTGCACGAGGCA
AAAAGGGCAAAAACCAATCATGCGTCGTGCGTGCTCAAGGTAAGCGCGGGCGATCACAGCATGCTGCTGACCTCCGACAT
CGAGGCACCGGACGAGGCCGCGCTGCTGGCGCGAGAAGGCGTCAAGCTGCGTTCGGATATCCTGCTTGTGCCGCACCATG
GCTCGCGCACCTCGTCTACCGAGATCTTCCTCGATGCCGTGGGCGCCAATGAAGCCATCGTGCCGGTCGGCTATCGCAAC
CGCTTCGGCCACCCCAAGGCCGATGTGATGGAACGCTACGCGGCGCGCGGCGTCAGAGTCTGGCGGACGGATCGCGATGG
GGCGCTCCAGGTGCGTCTGGCGGCGGATACAACGCCCGCCTTGCGCGGCTGGCGCAGCGAGCATGCACGCTACTGGCATG
GCCGGTGA
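
The nucleotide sequence begins with GTG rather than ATG, yet the protein begins with M. This is the usual bacterial convention that an initiating GTG is read as methionine. The Python sketch below illustrates it with the standard codon table. To stay short it translates only an excerpt (the first four codons joined directly to the last five), not the full gene, and `translate_cds` is an illustrative helper.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard codon table, codons enumerated in TCAG order
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Common bacterial start codons (not an exhaustive list)
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; the start codon is always read as M."""
    seq = seq.upper()
    if len(seq) % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"unexpected start codon {codons[0]}")
    if CODON_TABLE[codons[-1]] != "*":
        raise ValueError("sequence does not end with a stop codon")
    body = "".join(CODON_TABLE[c] for c in codons[1:-1])
    if "*" in body:
        raise ValueError("internal stop codon")
    return "M" + body


# Excerpt only: first 12 bases + last 15 bases of the sequence above
excerpt = "GTGTTCAATCGC" + "TGGCATGGCCGG" + "TGA"
print(translate_cds(excerpt))  # MFNRWHGR
```

The result matches the first four (MFNR) and last four (WHGR) residues of the protein sequence listed earlier.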


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                              Identities (%)  Coverage (%)  Ha-value
comA     Ralstonia pseudosolanacearum GMI1000  45.979          100           0.492
comA     Neisseria gonorrhoeae MS11            42.992          98.692        0.424
comA     Pseudomonas stutzeri DSM 10701        42.776          98.318        0.421


Multiple sequence alignment