Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   K6V71_RS23485 Genome accession   NZ_CP083438
Coordinates   3099729..3102242 (+) Length   837 a.a.
NCBI ID   WP_252252802.1    Uniprot ID   -
Organism   Cupriavidus gilardii strain NOV2-1     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 3091327..3102242 3099729..3102242 within 0


Gene organization within MGE regions


Location: 3091327..3102242
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  K6V71_RS23455 (K6V71_23345) - 3092756..3093502 (+) 747 WP_151023243.1 M23 family metallopeptidase -
  K6V71_RS23460 (K6V71_23350) recJ 3093614..3095311 (-) 1698 WP_053821508.1 single-stranded-DNA-specific exonuclease RecJ -
  K6V71_RS23465 (K6V71_23355) - 3095403..3096554 (-) 1152 WP_053823045.1 hypothetical protein -
  K6V71_RS23470 (K6V71_23360) - 3096683..3097933 (+) 1251 WP_053821509.1 lipoprotein-releasing ABC transporter permease subunit -
  K6V71_RS23475 (K6V71_23365) lolD 3097926..3098678 (+) 753 WP_260730458.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  K6V71_RS23480 (K6V71_23370) - 3098706..3099548 (+) 843 WP_260730457.1 TatD family hydrolase -
  K6V71_RS23485 (K6V71_23375) comA 3099729..3102242 (+) 2514 WP_252252802.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene

Sequence


Protein


Download         Length: 837 a.a.        Molecular weight: 88991.76 Da        Isoelectric Point: 11.6881

>NTDB_id=605838 K6V71_RS23485 WP_252252802.1 3099729..3102242(+) (comA) [Cupriavidus gilardii strain NOV2-1]
MRLTLCAFVAGCWLVQQLPALPPASIGVTAGLAALVVVIALSPLAGRPGPSSIGLHVRLLATVLAALIVGAAWSVWRAER
RLAERLAAQFEGVTLSLTGVVSGLPTGNGEGLRFAFTVDGDRPAGLPPLLSLGWQQAPPDLLPGTRLAVTARLKRPHGLA
NPHGFDYEYWLLQRGIGATGYVLAARPAAPVALRPAWRIARWRAQIRSRMLAAMPAEAPLAGVLVALAIGDQSGIAAEQW
RWFTRTGIGHLVSISGLHITMISGLAGALARALWRRSFGVGRWLRRPLPLRCPAQRAAALVAVAMALGYGLLSGMEVPAQ
RTVAMVSVAALALWWQRRPPASLVLAWAAAVALALDPWAVRAAGFWLSFGAVAAIFVAASQRTVAVTGGEPSQRSVWQAR
WQGVRRHLAESVRTQWAVTIGLLPPTLLLFQQVSVVSPLANALAIPLVSLLVTPLALCAALLAAWPVDWPAGGAIAMAAA
SLLALAHQAMAWLAAALVRLAAPDWAVWQAARPDTVELAFALLGTAVLLAPRAFGWGWRVHGLVLLIPMLSSGRAPLPAA
AFRATALDVGQGAAVLIETRGHRLLYDAGPAYGPADAPRPSAGERVVVPFLRAVGIHSLDALVISHEDADHAGGARDVMA
ALPVRRLWGSLPRGHPLWRWPPGGGPDVGQVCAAGIAWQWDGVRFELLHPLPGQGDDAAIGSNARSCVLRVSSPYAAMLL
TGDIDRAAELAMVARLPPDALRADLLLVPHHGSRTSSSDEWLDAVRPAAAVFQLGYRNRYRHPHAQVWARYGRRGIARYR
SDRTGAVEVVADRDGIAIASFRQQARRYWRPAPDAPE

Nucleotide


Download         Length: 2514 bp        

>NTDB_id=605838 K6V71_RS23485 WP_252252802.1 3099729..3102242(+) (comA) [Cupriavidus gilardii strain NOV2-1]
ATGCGCCTGACGCTGTGCGCCTTCGTCGCGGGCTGCTGGCTGGTGCAGCAATTGCCGGCATTGCCGCCAGCGTCGATCGG
CGTGACCGCCGGTCTGGCCGCATTGGTCGTGGTGATCGCGCTGTCGCCGCTTGCCGGGCGTCCGGGGCCTTCCTCGATCG
GCCTCCATGTGCGCTTGCTGGCCACCGTGCTGGCCGCGCTGATCGTCGGCGCGGCATGGTCCGTCTGGCGTGCCGAGCGT
CGCCTGGCCGAGCGGCTTGCGGCGCAGTTCGAAGGCGTCACGCTGTCGCTGACGGGCGTGGTGTCGGGACTGCCGACGGG
CAACGGCGAGGGCCTGCGTTTCGCGTTCACGGTCGACGGCGACAGGCCGGCCGGCTTGCCGCCGCTGCTGTCGCTCGGCT
GGCAGCAGGCGCCGCCCGATCTGCTGCCCGGCACGCGGCTGGCGGTGACCGCACGGCTGAAGCGGCCACATGGTCTCGCC
AATCCGCATGGCTTCGACTACGAATACTGGCTGCTGCAGCGGGGCATTGGCGCAACCGGCTACGTGCTGGCGGCCCGCCC
CGCAGCGCCGGTGGCCCTGCGGCCGGCCTGGCGCATTGCCCGCTGGCGTGCGCAGATCCGGTCCCGCATGCTGGCGGCGA
TGCCGGCCGAGGCGCCGTTGGCGGGCGTGCTGGTGGCGCTGGCGATCGGCGACCAGAGCGGCATCGCCGCCGAGCAGTGG
CGCTGGTTCACGCGCACCGGCATCGGCCATCTGGTCAGCATCTCGGGCCTGCATATCACGATGATCTCGGGGCTGGCGGG
CGCGCTGGCACGGGCCCTGTGGCGGCGCTCGTTCGGCGTGGGCAGATGGCTGCGGCGGCCGCTGCCGTTGCGTTGTCCGG
CGCAGCGGGCGGCGGCGCTGGTCGCCGTGGCGATGGCGCTCGGCTATGGGCTGCTGTCGGGTATGGAGGTGCCGGCGCAA
CGCACGGTGGCGATGGTATCCGTGGCCGCGCTGGCGCTGTGGTGGCAGCGCCGGCCGCCGGCCTCGCTGGTGCTGGCCTG
GGCCGCCGCGGTCGCGCTGGCGCTCGACCCGTGGGCGGTGCGGGCGGCGGGCTTCTGGCTGTCGTTCGGCGCGGTCGCCG
CGATCTTCGTGGCGGCGTCGCAGCGAACCGTGGCCGTGACCGGCGGCGAGCCGTCGCAGCGTTCGGTCTGGCAGGCGCGC
TGGCAAGGCGTGCGGCGGCACCTGGCCGAGTCGGTGCGCACGCAATGGGCCGTCACGATCGGACTGCTGCCGCCGACGCT
GCTGTTGTTCCAGCAGGTGTCGGTGGTGTCGCCGCTGGCCAATGCGCTGGCGATTCCGCTCGTCAGCCTGCTGGTGACGC
CGCTGGCGTTGTGCGCGGCGCTGCTGGCCGCCTGGCCCGTGGACTGGCCGGCAGGCGGCGCCATCGCGATGGCGGCCGCA
TCGCTGCTGGCGCTGGCGCACCAGGCGATGGCATGGCTGGCGGCGGCGCTGGTGCGGCTGGCGGCGCCGGACTGGGCCGT
CTGGCAGGCGGCGCGGCCGGACACGGTCGAGCTCGCCTTCGCGCTGCTCGGCACGGCGGTGCTGCTGGCCCCGCGCGCCT
TCGGCTGGGGCTGGCGCGTGCACGGCCTGGTGCTGCTGATCCCGATGCTGTCGAGCGGCCGCGCGCCGCTGCCGGCCGCG
GCCTTCCGCGCGACGGCGCTCGATGTCGGACAGGGCGCGGCGGTGCTGATCGAGACCCGCGGCCATCGCCTGCTGTACGA
CGCCGGGCCAGCCTATGGCCCCGCCGACGCGCCACGGCCCAGCGCCGGCGAACGCGTGGTGGTGCCGTTCCTGCGCGCGG
TCGGCATCCATTCGCTCGATGCGCTGGTGATCAGCCACGAGGATGCCGATCATGCCGGCGGTGCGCGCGACGTGATGGCC
GCGCTGCCGGTCCGGCGCCTGTGGGGCTCGCTGCCGCGTGGCCATCCGCTATGGCGGTGGCCGCCCGGGGGCGGACCCGA
TGTCGGTCAGGTGTGCGCGGCCGGTATCGCGTGGCAATGGGACGGCGTGCGTTTCGAACTGCTGCATCCGTTGCCGGGGC
AGGGCGACGACGCGGCGATCGGCAGCAATGCGCGCAGCTGCGTGCTGCGGGTCTCGAGTCCTTACGCCGCGATGCTGCTG
ACCGGCGATATCGACCGCGCCGCCGAACTGGCGATGGTGGCGCGCCTGCCGCCCGATGCGCTGCGCGCCGACCTGCTGCT
GGTGCCGCACCATGGCAGCCGCACCTCGTCCAGCGACGAATGGCTCGACGCGGTAAGGCCCGCCGCGGCGGTGTTCCAGC
TCGGCTATCGCAACCGCTACCGGCATCCGCATGCGCAGGTATGGGCGCGCTATGGCCGCCGCGGGATCGCGCGCTATCGC
AGCGATCGCACCGGCGCGGTCGAAGTCGTCGCGGATCGGGACGGCATCGCGATCGCGTCGTTCCGGCAGCAGGCGCGGCG
CTACTGGCGGCCGGCGCCGGATGCGCCGGAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Ralstonia pseudosolanacearum GMI1000

48.492

100

0.499