Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   IV454_RS22615 Genome accession   NZ_CP065053
Coordinates   5083649..5086006 (+) Length   785 a.a.
NCBI ID   WP_206087944.1    Uniprot ID   -
Organism   Massilia antarctica strain P8398     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 5078649..5091006
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  IV454_RS22590 (IV454_22570) - 5078870..5080135 (+) 1266 WP_206087939.1 lipoprotein-releasing ABC transporter permease subunit -
  IV454_RS22595 (IV454_22575) lolD 5080128..5080835 (+) 708 WP_206087940.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  IV454_RS22600 (IV454_22580) - 5080838..5081632 (+) 795 WP_206087941.1 TatD family hydrolase -
  IV454_RS22605 (IV454_22585) - 5081650..5082405 (-) 756 WP_206087942.1 SDR family NAD(P)-dependent oxidoreductase -
  IV454_RS22610 (IV454_22590) - 5082539..5083426 (+) 888 WP_206087943.1 AraC family transcriptional regulator -
  IV454_RS22615 (IV454_22595) comA 5083649..5086006 (+) 2358 WP_206087944.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  IV454_RS22620 (IV454_22600) - 5086055..5086567 (+) 513 WP_206087945.1 hypothetical protein -
  IV454_RS22625 (IV454_22605) - 5086545..5087975 (-) 1431 WP_206087946.1 ATP-binding protein -
  IV454_RS22630 (IV454_22610) - 5087979..5088665 (-) 687 WP_206087947.1 response regulator transcription factor -
  IV454_RS22635 (IV454_22615) - 5088804..5089274 (+) 471 WP_206087948.1 Spy/CpxP family protein refolding chaperone -
  IV454_RS22640 (IV454_22620) - 5089292..5089885 (+) 594 WP_206087949.1 hypothetical protein -
  IV454_RS22645 (IV454_22625) - 5089908..5090762 (+) 855 WP_206087950.1 intradiol ring-cleavage dioxygenase -

Sequence


Protein


Download         Length: 785 a.a.        Molecular weight: 84551.69 Da        Isoelectric Point: 9.5275

>NTDB_id=506270 IV454_RS22615 WP_206087944.1 5083649..5086006(+) (comA) [Massilia antarctica strain P8398]
MRAAILGFACGAGLLQIQPVLPSPSTMAACAAIAVLLCCLRGVARLAMAGALAGFCWAALAAHLALAPALAKAVEGRDIT
VVGIVDSLPFRFDDGVRFNFKVERVLGEPVVVPPRVSLAWYAGYRDAAQAIGDVQPGERWQLVVRLQRPHGNMNPYGFDY
EAWLLEQGVRATGYVRPEGGTTRRLDGFVFSLSNLVEHCRATLRERILRALPGKEYAGVIVALVVGDQRAIGQADWDVFN
RTGIGHLISISGLHITMVAGLFASLASLLWRRSFFTDAQLPLLMPAPKVAALTGAAVALLYVLLAGFGVPAQRTLYMLTV
VAAALWFGRLTQVSHVLCVALGVVVVLDPWAIASPGFWLSFGAVAAILFATTGRTVVRQPRWRGVLLVAAHTQYVVTLAL
VPLTMLLFSQVSIVSPLANAVAIPVVSFVVTPLALAGSMLPAPLSTLLLNAAHYAVQGLAWALAWCSGLRFAVWSAPAPE
PWLFVFAVVGTLWMLAPRGWPHRWTGLAAWLPLLTAQPVSPPQGEVFVTAFDVGQGMALLIETGTHRLLYDTGPAYTRES
NGANRVILPYLKARGIGFLDGVVVSHSDIDHAGGARTLLGALKVGWVSSSLWFDHPIVKAAPRHARCSGGQQWTWDGVRF
EMLHPSVESYADASLKPNARGCVLRITAGVHSILLAADIEAAQEAKLVAGSAQLLRAEVLLAPHHGSGTSSTPVFLAAVQ
PRLVLFQVGYRNRYHHPKTEVVERYEKLGIERLRSDESGAIMLDSASGFAPVEYRREHARYWYGK

Nucleotide


Download         Length: 2358 bp        

>NTDB_id=506270 IV454_RS22615 WP_206087944.1 5083649..5086006(+) (comA) [Massilia antarctica strain P8398]
ATGCGCGCCGCCATCCTGGGATTTGCCTGTGGCGCCGGCCTGCTGCAAATCCAGCCGGTGCTGCCATCACCATCGACCAT
GGCCGCATGCGCCGCCATCGCGGTCCTGCTTTGCTGTTTGCGCGGTGTTGCGAGGCTGGCCATGGCCGGTGCGCTCGCCG
GTTTCTGCTGGGCCGCGCTGGCCGCCCATCTGGCCTTGGCGCCGGCGCTGGCCAAAGCCGTCGAGGGCCGCGACATCACG
GTCGTCGGCATCGTCGACAGCCTGCCGTTCCGCTTCGATGACGGCGTGCGCTTCAATTTCAAGGTCGAAAGAGTGCTGGG
CGAGCCGGTGGTGGTGCCGCCGCGGGTGTCGCTGGCGTGGTATGCCGGCTACCGCGACGCCGCGCAAGCGATCGGCGACG
TGCAGCCCGGCGAACGCTGGCAGCTGGTCGTGCGCCTGCAACGCCCGCATGGAAACATGAACCCGTACGGCTTCGATTAC
GAGGCCTGGCTGCTGGAGCAGGGCGTGCGCGCCACCGGCTACGTGCGGCCGGAAGGCGGCACGACCCGGCGCCTGGACGG
TTTCGTCTTCAGCTTGTCGAACCTGGTCGAACATTGCCGGGCCACCTTGCGCGAGCGTATCCTGCGCGCGCTGCCGGGCA
AGGAATACGCCGGCGTGATCGTCGCCCTGGTCGTCGGCGACCAGCGCGCCATCGGGCAGGCCGACTGGGACGTGTTCAAC
CGCACCGGAATCGGCCACCTCATTTCCATCTCCGGCCTGCATATCACCATGGTGGCCGGACTGTTCGCGTCGCTCGCCTC
CTTGCTGTGGCGGCGCTCGTTTTTCACCGATGCGCAACTGCCTTTGCTGATGCCGGCGCCGAAGGTGGCCGCCTTGACGG
GCGCCGCGGTCGCGCTGTTGTACGTGCTGCTGGCCGGCTTCGGCGTGCCGGCGCAGCGCACTCTGTATATGTTGACCGTG
GTCGCCGCCGCGCTCTGGTTCGGCCGGCTCACGCAGGTGTCGCACGTGCTGTGCGTGGCGCTCGGCGTGGTCGTCGTGCT
CGATCCGTGGGCGATCGCTTCGCCCGGTTTCTGGCTGTCGTTCGGGGCCGTGGCGGCGATCCTGTTCGCGACCACCGGCC
GCACCGTTGTCAGGCAGCCGCGCTGGCGCGGCGTGCTGCTGGTGGCCGCGCATACGCAGTATGTGGTCACGCTCGCGCTG
GTTCCCCTGACGATGCTGCTGTTCTCGCAGGTGTCCATCGTCAGTCCGCTGGCCAACGCGGTGGCGATCCCGGTGGTCAG
CTTCGTCGTCACGCCGCTCGCGCTGGCCGGCAGCATGCTGCCGGCACCCTTGTCCACCTTGCTGCTCAATGCCGCGCACT
ATGCCGTGCAGGGACTGGCCTGGGCGCTCGCATGGTGCTCGGGATTGCGTTTTGCGGTGTGGAGCGCACCGGCTCCCGAG
CCGTGGCTGTTCGTGTTCGCGGTCGTCGGCACCTTGTGGATGCTCGCCCCGCGCGGCTGGCCGCACCGCTGGACCGGGCT
GGCCGCGTGGCTGCCGCTGCTCACCGCCCAGCCTGTGTCTCCGCCGCAGGGCGAGGTGTTCGTGACCGCATTCGATGTCG
GGCAGGGCATGGCGCTGTTGATCGAAACCGGCACGCACCGGCTGCTGTACGACACCGGACCGGCCTACACCCGCGAGTCG
AACGGTGCCAACCGGGTGATCCTGCCGTACCTCAAGGCGCGCGGCATCGGCTTTCTCGACGGCGTGGTCGTCAGCCACAG
CGACATTGACCATGCCGGCGGCGCGCGCACCTTGCTCGGCGCGCTCAAGGTCGGCTGGGTGTCGTCGTCGCTGTGGTTCG
ACCATCCGATCGTCAAGGCCGCGCCACGCCATGCCCGCTGCAGCGGCGGGCAGCAGTGGACCTGGGATGGCGTGCGCTTC
GAGATGCTGCATCCGAGCGTGGAAAGCTATGCGGATGCCAGCCTCAAACCGAACGCGCGCGGCTGCGTGCTGCGGATCAC
GGCGGGCGTCCATTCCATCCTGCTGGCGGCCGATATCGAGGCCGCGCAGGAGGCGAAGCTGGTCGCCGGATCGGCGCAGC
TGCTGCGCGCCGAGGTTTTGCTGGCGCCGCATCATGGCAGCGGTACCTCGTCGACACCGGTCTTCCTGGCCGCTGTGCAG
CCACGCCTGGTGCTGTTCCAAGTCGGTTATCGCAACCGCTACCATCACCCCAAGACGGAGGTGGTGGAGCGCTACGAAAA
ACTAGGCATCGAACGCTTGCGCTCGGACGAATCGGGAGCGATCATGCTCGATTCGGCCAGCGGCTTCGCGCCGGTCGAAT
ACCGGCGAGAACATGCGCGTTACTGGTACGGAAAGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Ralstonia pseudosolanacearum GMI1000

45.258

100

0.48

  comA Pseudomonas stutzeri DSM 10701

38.021

97.834

0.372