Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   KLP38_RS05005 Genome accession   NZ_CP076060
Coordinates   1069772..1071475 (+) Length   567 a.a.
NCBI ID   WP_370649100.1    Uniprot ID   -
Organism   Cupriavidus sp. EM10     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1061471..1072065 1069772..1071475 within 0


Gene organization within MGE regions


Location: 1061471..1072065
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  KLP38_RS04970 (KLP38_04970) prfB 1061471..1062574 (-) 1104 WP_215529668.1 peptide chain release factor 2 -
  KLP38_RS04975 (KLP38_04975) - 1062816..1063550 (+) 735 WP_370649113.1 peptidoglycan DD-metalloendopeptidase family protein -
  KLP38_RS04980 (KLP38_04980) recJ 1063576..1065274 (-) 1699 Protein_997 single-stranded-DNA-specific exonuclease RecJ -
  KLP38_RS04985 (KLP38_04985) - 1065305..1066426 (-) 1122 WP_215529670.1 hypothetical protein -
  KLP38_RS04990 (KLP38_04990) - 1066528..1067777 (+) 1250 Protein_999 lipoprotein-releasing ABC transporter permease subunit -
  KLP38_RS04995 (KLP38_04995) lolD 1067770..1068510 (+) 741 WP_215529671.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  KLP38_RS05000 (KLP38_05000) - 1068527..1069357 (+) 831 WP_215529672.1 TatD family hydrolase -
  KLP38_RS05005 (KLP38_05005) comA 1069772..1071475 (+) 1704 WP_370649100.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  KLP38_RS32890 - 1071463..1072065 (+) 603 WP_370649101.1 ComEC/Rec2 family competence protein -

Sequence


Protein


Download         Length: 567 a.a.        Molecular weight: 60611.88 Da        Isoelectric Point: 11.2229

>NTDB_id=571743 KLP38_RS05005 WP_370649100.1 1069772..1071475(+) (comA) [Cupriavidus sp. EM10]
MAGRNLVVEGMVAGLPDTAPHGTRFLFRVERWEDGGTGQSLPDRLLLTWRDEVSGTGEAAESQQAAALQPGQRYRLTVRL
RRPRGLANPHGFDYAYWLLAEGVHATGYVRSGTALASGGPLPWPSWRSWPVRIAMWRAGLRDHLRAALPADARYGPVLIA
LVIGDQRGIAQGDWDVFRRTGISHLVSISGLHITMIAGAAGALARLGWRHSFGLGRRLRRPFPLIWPAQQAALVVTVLAA
LGYGCIAGMQIPALRTVAMLVVAALALWSGRAPPLSVVLAWAAGIAVAIDPWAVMSPGFWLSFGAVAVIFLHARDDPEGQ
RKKQGGQSGQGERGRMARLRAVVVDAARTQWAVTIGLVPLTLLLFGQVSVVSCLANAVAIPVVSLFVTPLALAGAALPVQ
LATPVLAVAHGAMGWLVDGLAWLAAPAWAVWEAAHPGWWRTALAVVGVWLLLVPRRPILPPDIWPRIKYPIPRWLGAGLM
VPMLMTARTAVAEGEMRVTALDVGQGAAVLVETRRHALLYDAGPAYVSGTSAGGQVVVPFLRASGIRRLDMLMVSHEDAD
HAGACAM

Nucleotide


Download         Length: 1704 bp        

>NTDB_id=571743 KLP38_RS05005 WP_370649100.1 1069772..1071475(+) (comA) [Cupriavidus sp. EM10]
ATGGCCGGACGCAATCTGGTGGTGGAGGGCATGGTGGCGGGGCTGCCAGACACCGCGCCGCATGGCACCCGGTTCCTGTT
TCGTGTCGAGCGCTGGGAGGACGGTGGGACAGGGCAATCGTTGCCGGACCGGCTGCTGCTGACATGGCGCGACGAAGTAT
CAGGTACCGGCGAGGCCGCTGAATCCCAACAAGCCGCCGCGCTGCAGCCGGGCCAGCGCTATCGCCTGACCGTCCGGCTG
CGCCGTCCGCGCGGGCTGGCGAATCCGCATGGCTTCGACTATGCCTACTGGCTGCTGGCGGAAGGGGTGCATGCAACGGG
CTATGTGCGCAGTGGCACCGCGCTGGCCTCCGGCGGTCCACTTCCGTGGCCCTCGTGGCGCTCGTGGCCCGTACGCATCG
CGATGTGGCGCGCCGGCCTGCGCGACCATCTGCGCGCCGCCTTGCCGGCCGACGCCCGCTATGGCCCCGTGCTGATCGCG
CTGGTGATCGGCGACCAGCGCGGCATCGCGCAGGGGGACTGGGACGTTTTCCGGCGTACCGGCATCTCGCATCTGGTCAG
CATCTCGGGGCTGCATATCACGATGATCGCGGGCGCGGCCGGGGCGTTGGCGAGGCTGGGATGGCGGCACTCGTTCGGGC
TCGGGCGGCGGCTGCGCCGGCCGTTCCCGCTGATATGGCCGGCCCAGCAGGCCGCGCTGGTGGTCACCGTGCTGGCCGCG
CTGGGCTATGGCTGCATCGCCGGCATGCAGATTCCGGCGTTGCGCACGGTTGCAATGCTGGTCGTGGCCGCGCTGGCGCT
GTGGAGCGGCCGGGCGCCGCCGCTTTCCGTGGTGCTGGCGTGGGCGGCCGGCATTGCCGTGGCCATCGATCCCTGGGCGG
TCATGTCACCGGGGTTCTGGTTGTCCTTCGGCGCGGTGGCGGTGATCTTCCTTCACGCGCGCGACGATCCGGAGGGGCAG
CGAAAGAAGCAGGGCGGGCAGAGCGGGCAGGGTGAACGGGGCCGGATGGCGCGGCTGCGCGCCGTCGTCGTCGACGCCGC
ACGCACGCAGTGGGCGGTGACCATCGGGCTTGTGCCGCTGACGCTGCTGCTGTTTGGCCAGGTGTCGGTGGTGTCGTGTT
TGGCCAATGCGGTGGCGATCCCGGTGGTGAGCCTGTTCGTGACGCCGCTGGCACTGGCCGGGGCTGCATTGCCGGTGCAG
CTTGCCACGCCGGTGCTGGCCGTGGCGCACGGCGCGATGGGCTGGCTGGTGGACGGGCTGGCGTGGTTGGCCGCGCCGGC
GTGGGCCGTGTGGGAGGCCGCCCATCCGGGATGGTGGCGGACGGCCCTGGCCGTCGTCGGGGTGTGGCTTCTGCTTGTGC
CGCGACGGCCGATATTGCCGCCCGACATCTGGCCGCGCATCAAGTATCCGATTCCACGCTGGCTAGGCGCCGGCCTGATG
GTGCCAATGTTGATGACCGCCCGCACTGCAGTGGCTGAAGGCGAAATGCGCGTGACGGCGCTCGATGTCGGGCAGGGGGC
GGCGGTGCTGGTGGAAACGCGGCGGCATGCGCTGCTCTACGACGCCGGGCCGGCCTACGTATCCGGGACGAGTGCCGGCG
GGCAGGTCGTCGTGCCATTCCTGCGGGCCTCGGGCATCCGGAGGCTCGATATGCTGATGGTCAGCCACGAGGACGCCGAC
CATGCCGGGGCGTGCGCGATGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Ralstonia pseudosolanacearum GMI1000

48.7

100

0.496