Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   GSS20_RS04765 Genome accession   NZ_CP047995
Coordinates   1012848..1015106 (+) Length   752 a.a.
NCBI ID   WP_140391732.1    Uniprot ID   -
Organism   Vibrio parahaemolyticus strain 20150710009     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1010172..1034757 1012848..1015106 within 0


Gene organization within MGE regions


Location: 1010172..1034757
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GSS20_RS04750 (GSS20_05070) lolD 1010172..1010879 (+) 708 WP_031846041.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  GSS20_RS04755 (GSS20_05075) lolE 1010882..1012126 (+) 1245 WP_025533743.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  GSS20_RS04760 (GSS20_05080) - 1012330..1012839 (-) 510 WP_005456245.1 DUF2062 domain-containing protein -
  GSS20_RS04765 (GSS20_05085) comEC 1012848..1015106 (+) 2259 WP_140391732.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  GSS20_RS04770 (GSS20_05090) msbA 1015138..1016886 (+) 1749 WP_140391731.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  GSS20_RS04775 (GSS20_05095) lpxK 1016892..1017899 (+) 1008 WP_031846043.1 tetraacyldisaccharide 4'-kinase -
  GSS20_RS04780 (GSS20_05100) - 1017880..1018059 (+) 180 WP_005378451.1 Trm112 family protein -
  GSS20_RS04785 (GSS20_05105) kdsB 1018059..1018814 (+) 756 WP_025540306.1 3-deoxy-manno-octulosonate cytidylyltransferase -
  GSS20_RS04790 (GSS20_05110) - 1018892..1020451 (-) 1560 WP_005456192.1 SpoVR family protein -
  GSS20_RS04795 (GSS20_05115) - 1020463..1021734 (-) 1272 WP_005481863.1 YeaH/YhbH family protein -
  GSS20_RS04800 (GSS20_05120) - 1021781..1023715 (-) 1935 WP_005456210.1 PrkA family serine protein kinase -
  GSS20_RS04805 (GSS20_05130) - 1024204..1024704 (-) 501 WP_005456271.1 YfbU family protein -
  GSS20_RS04810 (GSS20_05135) - 1024859..1025527 (-) 669 WP_005495910.1 energy-coupling factor ABC transporter permease -
  GSS20_RS04815 (GSS20_05140) pflA 1025688..1026428 (-) 741 WP_005456250.1 pyruvate formate lyase 1-activating protein -
  GSS20_RS04820 (GSS20_05145) - 1026575..1027552 (-) 978 WP_005481800.1 lipid A deacylase LpxR family protein -
  GSS20_RS04825 (GSS20_05150) pflB 1027705..1029981 (-) 2277 WP_005456189.1 formate C-acetyltransferase -
  GSS20_RS04830 (GSS20_05155) - 1030288..1031835 (-) 1548 WP_005456280.1 DUF3360 family protein -
  GSS20_RS04835 (GSS20_05160) - 1032292..1033752 (+) 1461 WP_140391730.1 flagellar sheath protein A -
  GSS20_RS04840 (GSS20_05170) - 1033987..1034757 (+) 771 WP_025532753.1 ABC transporter ATP-binding protein -

Sequence


Protein


Download         Length: 752 a.a.        Molecular weight: 84708.14 Da        Isoelectric Point: 9.2105

>NTDB_id=419129 GSS20_RS04765 WP_140391732.1 1012848..1015106(+) (comEC) [Vibrio parahaemolyticus strain 20150710009]
MTLLEKSWTLALFVASVISSAWWPTMPDWRWLLLGIITTGSIIKLRRGLISIGVIVGFMVVIVHGNIMEYQRQALFQAGE
NSTIIGRVDSSFTQISHGYEGVVAIKQVNTHTLLPFLKPKVRLITPFPLAVNSEFTTNVLIKPIIGLRNEAGFDAEKQSM
GSGVVARAVVTKDSYWVIRTSSSWREAIIQTVERDISRLEHFALIKALAFADRTGLTKEDWQSLRDSGLLHLVSISGLHI
GMALTFGLALGGLIRLAMPRYWFLPSVSGLAFAIVYAWLADFSLPTTRAVSVCIIYIALKYWLVHWSPWRVLLLAVALQL
FFQPFASFSLSFWLSYLSVGAVLFAVNTVQDSKEGRLGKLRILLLTQLILSLLIVPISGYFFSGFSWSSLVYNLVFIPWF
GFVVVPIMFAALIASLLFPMLATVLWCLLDVFLVPLSWSVRYAIGTWQPISTEWTFVIAVVSAVLVLRHVMPRYVWMFVC
VIVVMTGLFPKQDNQTWRIDVLDVGHGLAVLVEKEGRVLLYDTGKAWQNGSIAEQVITPVLHRRGYSSVDTMILSHADND
HAGGRKVIEQYFSPKHKLSSQSFLHYQPCIAEETWKWQGLNMEVLWPPKPVVRAYNPHSCVISLEDPSTGFKMLFTGDIE
AISEWILLREPEKLRSDVMLVPHHGSKSSSNPKFINVVEPSLAIASTAKLNQWGMPAPEVVQTYTDSGVSWLDTGSDGQI
TILLDGNNWRFESKRRETIEPWYRQMLRNRVE

Nucleotide


Download         Length: 2259 bp        

>NTDB_id=419129 GSS20_RS04765 WP_140391732.1 1012848..1015106(+) (comEC) [Vibrio parahaemolyticus strain 20150710009]
ATGACTCTCTTAGAAAAAAGTTGGACCTTGGCGTTATTTGTCGCGAGCGTAATTTCGTCTGCATGGTGGCCGACGATGCC
GGATTGGCGTTGGTTGCTGCTGGGAATAATTACCACTGGCTCAATAATTAAATTACGTCGAGGCTTAATTAGCATAGGCG
TAATTGTGGGCTTTATGGTTGTCATCGTCCACGGCAATATTATGGAGTATCAGAGACAAGCCCTTTTTCAAGCAGGTGAG
AATAGTACCATAATTGGTAGAGTTGACAGCTCTTTTACGCAAATAAGTCACGGATATGAAGGTGTCGTAGCGATAAAACA
AGTGAATACTCACACTCTGTTACCTTTTCTTAAACCTAAAGTCCGCCTTATAACGCCATTCCCACTAGCTGTTAACAGTG
AGTTTACGACTAACGTTTTGATTAAACCTATTATCGGCCTCAGAAATGAAGCGGGTTTCGACGCAGAAAAGCAGTCAATG
GGAAGTGGTGTTGTTGCAAGGGCGGTCGTAACTAAAGATTCATACTGGGTCATTCGTACTTCTTCGTCTTGGCGCGAAGC
TATTATTCAAACTGTTGAGCGTGATATTTCTCGCCTTGAGCATTTTGCTCTAATCAAAGCACTGGCTTTTGCTGATCGCA
CTGGGCTTACCAAAGAGGATTGGCAGTCTCTGCGTGACAGCGGACTACTGCATCTTGTATCGATTTCTGGTTTACACATT
GGGATGGCATTGACGTTTGGGCTCGCCCTTGGTGGGCTTATTCGGCTTGCTATGCCGCGATATTGGTTTCTACCATCAGT
GAGTGGGTTGGCGTTTGCCATTGTTTACGCTTGGCTTGCTGATTTTTCTTTGCCGACGACCCGAGCCGTTTCTGTTTGTA
TCATCTATATTGCTTTGAAGTATTGGCTTGTTCATTGGAGTCCATGGCGCGTACTGTTATTGGCTGTGGCGTTGCAACTT
TTCTTCCAACCTTTTGCTTCTTTTAGTTTGAGTTTTTGGCTGTCTTATTTATCCGTTGGTGCCGTTTTGTTTGCGGTTAA
CACAGTGCAAGACTCGAAGGAAGGTCGTTTGGGAAAGTTGCGGATACTTCTGTTGACTCAACTGATACTGAGCTTATTGA
TTGTCCCGATCAGTGGCTATTTTTTCTCTGGATTTAGCTGGTCTTCTTTAGTCTACAATTTGGTTTTCATTCCTTGGTTT
GGCTTTGTTGTCGTTCCAATCATGTTTGCTGCTTTAATTGCGTCATTGCTCTTTCCTATGTTGGCGACGGTTCTATGGTG
TTTGCTTGACGTGTTCCTTGTACCACTAAGTTGGTCGGTTCGATATGCCATAGGAACTTGGCAACCCATTAGTACCGAGT
GGACATTTGTTATTGCTGTAGTGAGTGCCGTGCTCGTTTTAAGACATGTTATGCCTCGTTACGTTTGGATGTTTGTCTGT
GTAATTGTTGTGATGACAGGGTTGTTTCCTAAGCAAGATAACCAAACTTGGCGTATTGATGTGCTTGATGTCGGGCATGG
GTTGGCGGTGCTGGTTGAAAAAGAGGGTAGAGTTTTACTCTATGATACGGGCAAAGCTTGGCAAAACGGCAGTATAGCTG
AGCAAGTGATTACGCCAGTACTGCACCGCAGAGGCTACTCAAGTGTCGATACGATGATTTTAAGTCATGCCGATAATGAC
CATGCTGGCGGCCGAAAAGTGATAGAGCAGTACTTTTCACCTAAACACAAACTAAGCAGCCAGAGCTTTTTACATTATCA
GCCTTGCATCGCTGAGGAGACATGGAAGTGGCAAGGGTTGAACATGGAGGTACTTTGGCCTCCTAAGCCTGTTGTACGTG
CATACAACCCCCACTCTTGTGTCATCAGTCTGGAAGACCCTAGTACAGGTTTTAAAATGTTGTTTACTGGTGATATCGAA
GCCATCAGCGAATGGATACTGCTTCGAGAGCCAGAAAAGCTGCGCAGTGATGTAATGCTAGTGCCGCATCATGGAAGCAA
AAGCTCTTCTAACCCTAAGTTTATCAACGTTGTTGAGCCTAGCTTGGCTATTGCTTCAACGGCAAAACTAAACCAGTGGG
GAATGCCCGCACCTGAAGTCGTTCAGACCTATACCGATAGTGGTGTTAGTTGGTTAGACACTGGAAGTGATGGCCAAATA
ACAATCTTACTTGATGGCAATAACTGGCGTTTTGAAAGTAAACGTCGTGAGACAATTGAGCCTTGGTATAGGCAGATGCT
GCGTAACCGAGTAGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio parahaemolyticus RIMD 2210633

98.803

100

0.988

  comEC Vibrio campbellii strain DS40M4

66.622

100

0.666

  comEC Vibrio cholerae strain A1552

41.347

100

0.416


Multiple sequence alignment