Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   BKK81_RS08700 Genome accession   NZ_CP017751
Coordinates   1977342..1979846 (+) Length   834 a.a.
NCBI ID   WP_071012123.1    Uniprot ID   -
Organism   Cupriavidus sp. USMAHM13 isolate pure     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1968184..1979846 1977342..1979846 within 0


Gene organization within MGE regions


Location: 1968184..1979846
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  BKK81_RS08660 (BKK81_08660) - 1968184..1968942 (-) 759 WP_071012114.1 SDR family oxidoreductase -
  BKK81_RS08670 (BKK81_08670) - 1970444..1971217 (+) 774 WP_083384012.1 peptidoglycan DD-metalloendopeptidase family protein -
  BKK81_RS08675 (BKK81_08675) recJ 1971302..1972999 (-) 1698 WP_071012116.1 single-stranded-DNA-specific exonuclease RecJ -
  BKK81_RS08680 (BKK81_08680) - 1973166..1974257 (-) 1092 WP_071012118.1 hypothetical protein -
  BKK81_RS08685 (BKK81_08685) - 1974399..1975649 (+) 1251 WP_071012119.1 lipoprotein-releasing ABC transporter permease subunit -
  BKK81_RS08690 (BKK81_08690) lolD 1975642..1976403 (+) 762 WP_083384013.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  BKK81_RS08695 (BKK81_08695) - 1976427..1977269 (+) 843 WP_071012121.1 TatD family hydrolase -
  BKK81_RS08700 (BKK81_08700) comA 1977342..1979846 (+) 2505 WP_071012123.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene

Sequence


Protein


Download         Length: 834 a.a.        Molecular weight: 87223.36 Da        Isoelectric Point: 11.5470

>NTDB_id=202515 BKK81_RS08700 WP_071012123.1 1977342..1979846(+) (comA) [Cupriavidus sp. USMAHM13 isolate pure]
MRLFLLAFVTGCWCLQQQPVLPGRAAAAVLLLAAALLAASALYWRRRARPAARAAVCLLGLLAGFGWAGGRAVQALDAWL
PPALDGSDVRVRGVVAGLPAEAERGLRFRFAIEQEVAGAGGGAAAGAEAGGARLPPEILLAWNEAPPGLRPGERREFVVR
LRRPRGLANPHGFDYAYWLLGQGVGATGYVRSAGEGHAMAAGERLAWRIASWRHALRTHLRANLPPDARYGPVLVALVVG
DQRGISAADREIFNRTGIGHLISISGLHITMISGMAGALAAWAWRHAFGLGRRWRRPPPLWLPARQVGLLVALPAGIGYG
LVAGLEVPALRTVAMLVVAVLASWSGRTVPGSLVLGWAALAALVVDPWGVMAPGFWLSFSAVAVIFLAAGRPPAAPAADG
ADGARADRWPARLRVALAQAAHTQWVVTLGLVPATLLLFQQTSLVSPLANAVAIPLVSFVVTPLSLLAAILPSPLAAPLL
ALAHACLALLGSLLAWLSAPSWAVWRAAAAGPVAFALALPGVACLLAPRGFGWRLRWRGLVLLLPLLVAGRTPVAAGAFR
ATAFDVGQGGAVLVETRRHALLFDTGPAYGESSSAGERVIVPHLRAAGLGRLDMLMISHEHADHAGGVGAVLDAVPVAAL
RTAAPPAHALLSPPLRSPRAWEPCAAGQQWEWDGVRFAVLHPPAAQSQSPAYGSNARSCVLHVSAQAGDGAAPSLLLTGD
IERAEEAALLSSLPAQALRASVLMVPHHGSGTSSSPAFLAAVAPQAAVFQLGHANRFRHPRADVWARYGAQGVARYRSDE
NGAVVIEAGADGVAITPYRQQVRRYWREAPPAPR

Nucleotide


Download         Length: 2505 bp        

>NTDB_id=202515 BKK81_RS08700 WP_071012123.1 1977342..1979846(+) (comA) [Cupriavidus sp. USMAHM13 isolate pure]
ATGCGCCTGTTCCTGCTGGCTTTCGTCACCGGCTGCTGGTGCCTGCAGCAGCAGCCGGTGCTGCCCGGCCGGGCAGCGGC
CGCCGTGCTGCTGCTGGCCGCCGCGCTGCTGGCGGCGTCCGCGTTGTACTGGCGGCGCCGGGCGCGGCCGGCGGCGCGTG
CGGCGGTCTGCCTGCTGGGCTTGCTGGCCGGCTTCGGCTGGGCCGGCGGGCGCGCCGTGCAGGCGCTGGATGCCTGGCTG
CCCCCCGCGCTCGACGGCAGCGATGTGCGCGTGCGCGGCGTGGTGGCCGGCCTGCCGGCCGAAGCCGAGCGCGGCCTGCG
TTTCCGCTTTGCCATCGAGCAGGAGGTTGCCGGCGCAGGCGGTGGAGCTGCTGCCGGCGCGGAAGCAGGTGGCGCGCGCC
TGCCGCCCGAGATCCTGCTGGCCTGGAACGAGGCGCCGCCCGGACTGCGGCCGGGCGAGCGGCGCGAGTTCGTGGTGCGC
CTGCGGCGGCCGCGCGGGCTAGCGAATCCACACGGGTTCGACTATGCCTACTGGCTGCTCGGCCAAGGCGTGGGGGCGAC
CGGCTACGTGCGCAGCGCGGGCGAGGGCCACGCCATGGCGGCGGGGGAGCGGCTGGCCTGGCGCATCGCCTCGTGGCGCC
ACGCGCTGCGCACGCACCTGCGCGCCAACCTGCCGCCGGATGCGCGCTACGGCCCGGTGCTGGTGGCGCTGGTTGTGGGC
GACCAGCGCGGCATCTCAGCTGCCGACAGGGAGATATTCAACCGCACCGGCATCGGCCACCTGATCAGCATCTCCGGGCT
CCATATCACCATGATCTCCGGCATGGCCGGTGCGCTGGCCGCCTGGGCCTGGCGCCATGCCTTCGGGCTGGGCCGGCGCT
GGCGCCGGCCGCCGCCGCTATGGCTGCCGGCGCGGCAGGTCGGCCTGCTGGTGGCGCTGCCGGCAGGCATTGGCTACGGC
CTGGTGGCGGGACTCGAGGTGCCTGCCCTGCGCACGGTGGCAATGCTGGTGGTGGCGGTGCTGGCATCCTGGAGCGGACG
TACGGTGCCGGGCTCGCTGGTGCTCGGCTGGGCCGCGCTGGCGGCGCTGGTTGTCGATCCATGGGGCGTGATGGCGCCCG
GATTCTGGCTCTCCTTCAGTGCCGTGGCGGTGATCTTCCTGGCCGCGGGGCGTCCGCCCGCGGCCCCCGCGGCGGACGGC
GCGGACGGCGCCCGCGCGGACCGCTGGCCGGCCCGCCTGCGGGTGGCGCTGGCACAAGCCGCCCATACCCAATGGGTGGT
CACGCTGGGCCTGGTGCCGGCCACGCTGCTGTTGTTCCAGCAGACCTCGCTGGTCTCGCCGCTGGCCAACGCGGTGGCCA
TCCCGCTGGTGAGTTTCGTGGTGACGCCGCTGTCGCTGCTGGCCGCCATCCTGCCGTCGCCGCTGGCCGCACCGCTGCTG
GCGCTGGCCCACGCCTGCCTGGCCCTGCTGGGCAGCCTGCTGGCCTGGCTGTCGGCGCCGTCCTGGGCGGTCTGGCGTGC
GGCGGCGGCCGGTCCGGTGGCCTTCGCGCTGGCCCTGCCGGGCGTCGCCTGCCTGCTGGCACCGCGTGGCTTCGGCTGGC
GCCTGCGCTGGCGCGGGCTGGTGCTGCTGCTGCCGCTGCTGGTGGCGGGGCGGACCCCGGTGGCGGCCGGCGCCTTCCGT
GCGACGGCCTTCGATGTCGGCCAGGGCGGTGCGGTGCTGGTCGAGACGCGGCGGCATGCGCTGCTGTTCGACACCGGACC
GGCCTACGGCGAAAGCTCCAGCGCGGGCGAGCGGGTCATCGTGCCGCACCTGCGCGCGGCCGGCCTGGGGCGCCTCGATA
TGCTGATGATCAGCCACGAGCACGCCGACCATGCCGGCGGAGTCGGCGCGGTGCTGGATGCCGTGCCGGTGGCGGCGCTG
CGCACCGCGGCGCCGCCCGCGCATGCGCTGCTGTCGCCGCCGCTGCGGTCACCACGCGCGTGGGAGCCCTGCGCCGCCGG
GCAGCAATGGGAGTGGGACGGGGTGCGTTTCGCCGTGCTGCATCCGCCGGCGGCGCAGTCGCAGTCGCCGGCCTATGGCA
GCAACGCGCGCAGCTGCGTGCTCCATGTGTCGGCGCAGGCCGGTGACGGGGCGGCCCCGAGCCTGCTGCTGACCGGCGAT
ATCGAGCGTGCCGAGGAGGCGGCGCTGCTGTCGTCGTTGCCGGCGCAAGCGCTGCGCGCCAGCGTGCTGATGGTGCCGCA
CCATGGCAGCGGCACGTCTTCCAGCCCGGCCTTCCTTGCCGCGGTGGCACCGCAGGCCGCTGTGTTCCAGCTGGGCCATG
CCAATCGCTTCCGTCATCCGCGTGCCGATGTCTGGGCCCGCTATGGTGCCCAAGGCGTCGCGCGCTACCGCTCGGATGAG
AACGGCGCGGTGGTGATCGAGGCCGGGGCGGACGGCGTGGCGATCACGCCGTACCGTCAGCAGGTCCGGCGCTACTGGCG
CGAGGCGCCGCCGGCACCGCGCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Ralstonia pseudosolanacearum GMI1000

46.767

100

0.486


Multiple sequence alignment