Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   GPY55_RS20070 Genome accession   NZ_CP046831
Coordinates   2086216..2088474 (-) Length   752 a.a.
NCBI ID   WP_031846967.1    Uniprot ID   -
Organism   Vibrio parahaemolyticus strain 2012AW-0224     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2081216..2093474
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GPY55_RS20050 kdsB 2082508..2083263 (-) 756 WP_031846964.1 3-deoxy-manno-octulosonate cytidylyltransferase -
  GPY55_RS20055 - 2083263..2083442 (-) 180 WP_005378451.1 Trm112 family protein -
  GPY55_RS20060 lpxK 2083423..2084430 (-) 1008 WP_031846965.1 tetraacyldisaccharide 4'-kinase -
  GPY55_RS20065 msbA 2084436..2086184 (-) 1749 WP_031846966.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  GPY55_RS20070 comEC 2086216..2088474 (-) 2259 WP_031846967.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  GPY55_RS20075 - 2088483..2088992 (+) 510 WP_005456245.1 DUF2062 domain-containing protein -
  GPY55_RS20080 lolE 2089197..2090441 (-) 1245 WP_025500585.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  GPY55_RS20085 lolD 2090444..2091151 (-) 708 WP_005456257.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  GPY55_RS20090 lolC 2091144..2092352 (-) 1209 WP_029785989.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  GPY55_RS20095 - 2092632..2093201 (+) 570 WP_083135841.1 PilZ domain-containing protein -

Sequence


Protein


Download         Length: 752 a.a.        Molecular weight: 84708.11 Da        Isoelectric Point: 9.5002

>NTDB_id=408555 GPY55_RS20070 WP_031846967.1 2086216..2088474(-) (comEC) [Vibrio parahaemolyticus strain 2012AW-0224]
MTLLEKSWTLALFVASVISSAWWPTMPDWRWLLLGIITTGSIIKLRRGLISIGVIVGFIVVIVHGNIMEYQRQALFQAGE
NSTIIGRVDSSFTQISHGYEGVVAIKQVNTHTLLPFLKPKVRLITPFPLAVNSEFTTNVLIKPIIGLRNEAGFDAEKQSM
GSGVVARAVVTKDSHWVIRTSSSWRESIIQTVERDISRLEHFALIKALAFADRTGLTKEDWQSLRDSGLLHLVSISGLHI
GIALTFGLVLGGLIRLSMPRYWFISSVSGLAFAIVYAWLADFSLPTTRAVSVCIIYIALKYWLVHWSPWRVLLLAVALQL
FFQPFASFSLSFWLSYLSVSAVLFAVNTVQDSKEGRLGKLRILLLTQLILSLLIVPISGYFFSGFSWSSLVYNLVFIPWF
GFVVVPIMFAALIASLLFPMLATVLWYLLDIFLVPLSWSVRYAIGTWQPISAEWTFVIAVVSAVLVLRHVMPRYVWMFVC
VIVVVTGLFPKQDNQTWRIDVLDVGHGLAVLVEKEGRVLLYDTGKAWHNGSIAEQVITPVLHRRGYSSVDTMILSHADND
HAGGRKVIEQYFSPKHKLSSQSFLYYQPCIAAEKWKWQGLNIEVLWPPKPVVRAYNPHSCVISLEDPSTGFKMLFTGDIE
AISEWILLREPEKLRSDVMLVPHHGSKTSSNPKFINVVEPSLAIASTAKLNQWGMPAPEVVQAYTDSGVSWLDTGSDGQI
TILLDGNNWRFESKRRETIEPWYRQMLRNRVE

Nucleotide


Download         Length: 2259 bp        

>NTDB_id=408555 GPY55_RS20070 WP_031846967.1 2086216..2088474(-) (comEC) [Vibrio parahaemolyticus strain 2012AW-0224]
ATGACTCTCTTAGAAAAAAGTTGGACCTTGGCGTTATTTGTCGCGAGCGTAATTTCGTCTGCATGGTGGCCGACGATGCC
GGATTGGCGTTGGTTGCTGCTGGGAATAATTACCACTGGCTCAATAATTAAATTACGTCGAGGCTTAATTAGCATAGGCG
TAATTGTGGGCTTTATAGTTGTCATCGTCCACGGCAATATTATGGAGTATCAGAGACAAGCCCTTTTTCAAGCAGGTGAG
AATAGTACCATAATTGGTAGAGTTGACAGCTCTTTTACGCAAATAAGTCACGGATATGAAGGTGTCGTGGCGATAAAACA
AGTGAATACTCACACTCTGTTACCTTTTCTTAAACCTAAGGTCCGCCTTATAACGCCATTCCCACTAGCTGTTAACAGTG
AGTTTACGACTAACGTTTTGATTAAACCCATTATCGGCCTCAGAAATGAAGCGGGTTTCGACGCAGAAAAGCAGTCAATG
GGAAGTGGTGTTGTTGCAAGAGCGGTCGTAACTAAAGATTCACACTGGGTCATTCGTACATCTTCGTCTTGGCGCGAGTC
GATTATTCAAACTGTTGAGCGTGATATTTCTCGCCTTGAGCACTTTGCTCTAATCAAAGCACTGGCTTTTGCTGATCGCA
CTGGGCTTACCAAAGAGGATTGGCAGTCTCTGCGTGACAGCGGATTACTGCATCTCGTATCGATTTCTGGTTTACACATT
GGGATTGCACTGACGTTTGGGCTCGTCCTTGGTGGGCTTATTCGGCTTTCTATGCCGCGATATTGGTTTATATCATCAGT
GAGTGGGTTGGCGTTTGCCATTGTTTACGCTTGGCTTGCTGATTTTTCTTTGCCGACGACCCGAGCCGTTTCTGTTTGTA
TCATCTATATTGCTTTGAAGTATTGGCTTGTTCATTGGAGCCCATGGCGCGTACTGTTACTGGCCGTGGCATTGCAACTT
TTCTTCCAACCTTTTGCTTCTTTTAGTTTGAGTTTTTGGCTGTCTTATTTATCCGTTAGTGCCGTTTTGTTTGCGGTTAA
CACAGTGCAAGACTCGAAGGAAGGTCGTTTGGGAAAGTTGCGGATACTTCTGTTGACTCAACTGATACTGAGCTTATTGA
TTGTCCCGATCAGTGGCTATTTTTTCTCTGGATTTAGCTGGTCTTCTTTAGTCTACAATTTGGTTTTCATTCCTTGGTTT
GGCTTTGTTGTCGTTCCAATCATGTTTGCTGCTTTAATTGCGTCATTGCTCTTTCCTATGTTGGCGACGGTTCTATGGTA
TTTGCTTGACATATTTCTGGTGCCACTAAGTTGGTCGGTTCGATATGCCATAGGGACTTGGCAACCTATTAGTGCCGAGT
GGACATTTGTTATTGCTGTAGTGAGTGCAGTGCTCGTTTTAAGACATGTTATGCCTCGTTACGTTTGGATGTTTGTCTGT
GTAATTGTTGTAGTGACAGGGTTGTTTCCTAAGCAAGATAACCAAACTTGGCGTATTGATGTGCTTGATGTCGGGCATGG
GTTGGCGGTGCTGGTCGAAAAAGAGGGGAGAGTTTTACTCTATGATACGGGCAAGGCTTGGCACAACGGCAGTATAGCTG
AGCAAGTGATTACGCCAGTACTGCACCGCAGAGGCTACTCAAGTGTCGATACGATGATTTTAAGTCATGCCGATAATGAC
CATGCTGGCGGCCGAAAAGTGATAGAGCAGTACTTTTCACCTAAACACAAACTAAGCAGCCAGAGCTTTTTATATTATCA
GCCTTGTATTGCTGCTGAGAAATGGAAGTGGCAAGGGTTGAACATTGAGGTACTTTGGCCTCCTAAACCTGTTGTACGTG
CATACAACCCCCACTCTTGTGTCATCAGTCTGGAAGACCCTAGTACAGGTTTTAAAATGTTGTTTACTGGTGATATCGAA
GCCATCAGCGAATGGATACTGCTTCGAGAGCCAGAAAAGCTGCGCAGTGATGTAATGTTAGTGCCGCATCATGGAAGCAA
AACCTCTTCTAACCCTAAGTTTATCAATGTTGTTGAGCCTAGCTTGGCTATTGCTTCAACGGCAAAACTAAACCAGTGGG
GAATGCCCGCACCTGAAGTCGTTCAGGCCTATACCGATAGTGGTGTTAGTTGGTTAGACACTGGAAGTGATGGCCAAATA
ACAATCTTACTTGATGGCAATAACTGGCGTTTTGAAAGTAAACGTCGTGAGACAATTGAGCCTTGGTATAGGCAGATGCT
GCGTAACCGAGTAGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio parahaemolyticus RIMD 2210633

97.739

100

0.977

  comEC Vibrio campbellii strain DS40M4

66.09

100

0.661

  comEC Vibrio cholerae strain A1552

41.301

100

0.414


Multiple sequence alignment