Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEC
Type: Machinery gene
Locus tag: GPY31_RS18460
Genome accession: NZ_CP046760
Coordinates: 2053972..2056230 (-)
Length: 752 a.a.
NCBI ID: WP_069546138.1
UniProt ID: -
Organism: Vibrio parahaemolyticus strain AM51552
Function: ssDNA transport through the inner membrane (predicted from homology)
DNA binding and uptake
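
The coordinates and the protein length listed above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence includes a single stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(2053972, 2056230)
print(cds_bp, protein_length(cds_bp))  # 2259 752
```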

Genomic Context


Location: 2048972..2061230
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GPY31_RS18440 kdsB 2050264..2051019 (-) 756 WP_031818521.1 3-deoxy-manno-octulosonate cytidylyltransferase -
  GPY31_RS18445 - 2051019..2051198 (-) 180 WP_005378451.1 Trm112 family protein -
  GPY31_RS18450 lpxK 2051179..2052186 (-) 1008 WP_042772663.1 tetraacyldisaccharide 4'-kinase -
  GPY31_RS18455 msbA 2052192..2053940 (-) 1749 WP_015296464.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  GPY31_RS18460 comEC 2053972..2056230 (-) 2259 WP_069546138.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  GPY31_RS18465 - 2056239..2056748 (+) 510 WP_158111835.1 DUF2062 domain-containing protein -
  GPY31_RS18470 lolE 2056953..2058197 (-) 1245 WP_025500585.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  GPY31_RS18475 lolD 2058200..2058907 (-) 708 WP_020904029.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  GPY31_RS18480 lolC 2058900..2060108 (-) 1209 WP_029785989.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  GPY31_RS18485 - 2060373..2060942 (+) 570 WP_042772671.1 PilZ domain-containing protein -
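
The Location range above matches the comEC coordinates extended by 5,000 bp on each side. The sketch below reproduces that window and checks that two of the listed neighbours fall inside it. The 5,000 bp flank is inferred from the numbers on this page, not from documented database behaviour, and the helper names are hypothetical.

```python
def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Widen a 1-based range by `flank` bp on each side, clamping the start at 1."""
    return max(1, start - flank), end + flank


def within(window: tuple[int, int], start: int, end: int) -> bool:
    """True when a feature lies entirely inside the window."""
    return window[0] <= start and end <= window[1]


window = context_window(2053972, 2056230)
print(window)                             # (2048972, 2061230)
print(within(window, 2050264, 2051019))   # kdsB -> True
print(within(window, 2060373, 2060942))   # GPY31_RS18485 -> True
```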

Sequence


Protein


Length: 752 a.a.        Molecular weight: 84676.16 Da        Isoelectric point: 9.5987

>NTDB_id=407290 GPY31_RS18460 WP_069546138.1 2053972..2056230(-) (comEC) [Vibrio parahaemolyticus strain AM51552]
MTLLEKSWTLALFVASVISSAWWPTMPDWRWLLLGIITTGSIIKLRRGLISIGVIVGFMVVIVHGNIMEYQRQALFQAGE
NSTIIGRVDSSFTQISHGYEGVIAIKQVNSHTLLPFLKPKVRLITPFPLAVNSEFTTNVLIKPIIGLRNEAGFDAEKQSM
GSGVVARAVVTKDSHWVIRTSSSWRESIIQTVERDISRLEHFALIKALAFADRTGLTKEDWQSLRDSGLLHLVSISGLHI
GMALTFGLALGGLIRLVMPRYWFLPSVSGLAFAIVYAWLADFSLPTTRAVSVCIIYIALKYWLVHWSPWRVLLLAVALQL
FFQPFASFSLSFWLSYLSVGAVLFAVNTVQDSKEGRLGKLRILLLTQLILSLLIVPISGYFFSGFSWSSLVYNLVFIPWF
GFVVVPIMFAALIASLLFPMLAKVLWYLLDIFLVPLSWSVRYAIGTWQPISAEWTFVIAVVSAVLVLRHVMPRYVWMFVC
VIVVVTGLFPKQDNQTWRIDALDVGHGLAVLVEKEGRVLLYDTGKAWQNGSIAEQVITPVLHRRGYSSVDTMILSHADND
HAGGRKVIEQYFSPKHKLSSQSFLHYQPCIAAEKWKWQGLNMEVLWPPKPVVRAYNPHSCVISLEDPSSGFKMLFTGDIE
AISEWILLREPEKLRSDVMLVPHHGSKTSSNPKFINVVEPSLAIASTAKLNQWGMPAPEVVQAYTDSGVSWLDTGSDGQI
TILLDGNNWRFESKRRETIEPWYRQMLRNRVE

Nucleotide


Length: 2259 bp

>NTDB_id=407290 GPY31_RS18460 WP_069546138.1 2053972..2056230(-) (comEC) [Vibrio parahaemolyticus strain AM51552]
ATGACTCTCTTAGAAAAAAGTTGGACCTTGGCGTTATTTGTCGCGAGCGTAATTTCGTCTGCATGGTGGCCGACGATGCC
GGATTGGCGTTGGTTGCTGCTGGGAATAATTACCACTGGCTCAATAATTAAATTACGTCGAGGCTTAATTAGCATAGGCG
TAATTGTGGGCTTTATGGTTGTCATCGTTCACGGCAATATTATGGAGTATCAGAGACAAGCCCTTTTTCAAGCAGGTGAG
AATAGTACCATAATTGGTAGAGTTGACAGCTCTTTTACGCAAATAAGTCACGGATATGAAGGTGTCATAGCGATAAAACA
AGTGAATTCTCACACTCTGTTACCTTTTCTTAAACCTAAAGTCCGCCTTATAACGCCATTCCCACTAGCTGTTAACAGTG
AGTTTACGACTAACGTTTTGATTAAGCCCATTATCGGCCTCAGAAATGAAGCGGGTTTCGACGCAGAAAAGCAGTCAATG
GGAAGTGGTGTTGTTGCTAGAGCGGTCGTAACTAAAGATTCACACTGGGTCATTCGTACTTCTTCGTCTTGGCGCGAGTC
GATAATTCAAACTGTTGAGCGTGATATTTCTCGCCTTGAGCACTTTGCTCTAATCAAAGCACTGGCTTTTGCTGATCGCA
CTGGGCTTACCAAAGAGGATTGGCAGTCTCTGCGTGACAGCGGATTACTGCATCTTGTATCGATTTCTGGTTTACACATT
GGGATGGCATTGACGTTTGGGCTCGCCCTTGGTGGGCTTATTCGGCTTGTTATGCCGCGATATTGGTTTCTACCATCAGT
GAGTGGGTTGGCGTTTGCCATTGTTTACGCTTGGCTTGCTGATTTTTCTTTGCCGACGACCCGAGCCGTTTCTGTTTGTA
TCATCTATATTGCTTTGAAGTATTGGCTTGTTCATTGGAGTCCATGGCGCGTACTGTTATTGGCTGTGGCGTTGCAACTT
TTCTTCCAACCTTTTGCTTCTTTTAGTTTGAGTTTTTGGCTGTCTTATTTATCCGTTGGTGCCGTTTTGTTTGCGGTTAA
CACAGTGCAAGACTCGAAGGAAGGTCGTTTGGGAAAGTTGCGGATACTTTTGTTGACTCAACTGATACTGAGCTTATTGA
TTGTCCCGATCAGTGGCTATTTTTTCTCTGGATTTAGCTGGTCTTCTTTAGTCTACAATTTGGTTTTCATTCCTTGGTTT
GGCTTTGTTGTCGTTCCAATCATGTTTGCTGCTTTAATTGCGTCATTGCTCTTTCCTATGTTGGCGAAGGTTCTATGGTA
TTTGCTTGACATATTCCTGGTTCCACTAAGTTGGTCGGTTCGATATGCCATAGGAACTTGGCAACCCATTAGTGCCGAGT
GGACATTTGTTATTGCTGTAGTGAGTGCCGTGCTCGTTTTAAGACATGTTATGCCTCGTTACGTTTGGATGTTTGTCTGT
GTAATTGTTGTGGTGACAGGGTTGTTTCCTAAGCAAGATAACCAAACTTGGCGTATTGATGCGCTTGATGTCGGGCATGG
GTTGGCGGTACTGGTTGAAAAAGAGGGGAGAGTTTTACTCTATGATACGGGCAAGGCTTGGCAAAACGGCAGTATAGCTG
AGCAAGTGATTACGCCAGTACTGCACCGCAGAGGTTACTCAAGTGTCGATACGATGATTTTAAGTCATGCCGATAATGAC
CATGCTGGCGGCCGAAAAGTGATAGAGCAGTACTTCTCACCTAAACACAAACTAAGCAGCCAGAGCTTTTTACATTATCA
GCCTTGTATTGCTGCTGAGAAATGGAAGTGGCAAGGGTTGAACATGGAGGTACTTTGGCCTCCTAAACCTGTTGTACGTG
CATACAACCCCCACTCTTGTGTCATCAGTCTGGAAGACCCTAGTTCAGGTTTTAAAATGTTGTTCACTGGTGATATCGAA
GCCATCAGCGAATGGATACTGCTTCGAGAGCCAGAAAAGCTGCGCAGTGATGTAATGCTAGTGCCGCATCATGGAAGCAA
AACCTCTTCTAATCCTAAGTTTATCAATGTTGTTGAGCCTAGCTTGGCTATTGCGTCAACGGCAAAACTAAACCAGTGGG
GAATGCCTGCACCTGAAGTCGTTCAGGCCTATACCGATAGTGGTGTTAGTTGGTTAGACACTGGAAGTGATGGCCAAATA
ACAATCTTACTTGATGGCAATAACTGGCGTTTTGAAAGTAAACGTCGTGAGACAATTGAGCCTTGGTATAGGCAGATGCT
GCGTAACCGAGTAGAATAA
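
The nucleotide record is given in coding orientation (it begins with ATG), so it can be translated directly and compared with the protein record. The following is a minimal sketch using the standard codon assignments, which the bacterial genetic code shares for elongation codons. The `translate` helper is illustrative only and does not handle ambiguous bases or alternative start codons.

```python
from itertools import product

# Standard codon table, built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the comEC nucleotide record above.
print(translate("ATGACTCTCTTAGAAAAAAGT"))  # MTLLEKS
```

The output matches the first seven residues of the protein record (MTLLEKS).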


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio parahaemolyticus RIMD 2210633 98.537 100 0.985
  comEC Vibrio campbellii strain DS40M4 66.755 100 0.668
  comEC Vibrio cholerae strain A1552 41.169 100 0.412
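
The three Ha-values listed above are consistent with the product of identity and coverage, each expressed as a fraction and rounded to three decimals. The sketch below reproduces them under that assumption. The formula is inferred from these rows only and is not a documented definition, and `ha_value` is a hypothetical name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed Ha-value: identity fraction times coverage fraction, 3 decimals."""
    return round(identity_pct / 100 * coverage_pct / 100, 3)


# (organism, identities %, coverage %) taken from the rows above.
hits = [
    ("Vibrio parahaemolyticus RIMD 2210633", 98.537, 100),
    ("Vibrio campbellii strain DS40M4", 66.755, 100),
    ("Vibrio cholerae strain A1552", 41.169, 100),
]
for organism, identity, coverage in hits:
    print(organism, ha_value(identity, coverage))
```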


Multiple sequence alignment