Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   EKH72_RS05030 Genome accession   NZ_CP034565
Coordinates   997733..999991 (+) Length   752 a.a.
NCBI ID   WP_129829923.1    Uniprot ID   -
Organism   Vibrio parahaemolyticus strain D3112     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 995056..1019656 997733..999991 within 0


Gene organization within MGE regions


Location: 995056..1019656
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EKH72_RS05015 lolD 995056..995763 (+) 708 WP_020840312.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  EKH72_RS05020 lolE 995766..997010 (+) 1245 WP_129829922.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  EKH72_RS05025 - 997215..997724 (-) 510 WP_005456245.1 DUF2062 domain-containing protein -
  EKH72_RS05030 comEC 997733..999991 (+) 2259 WP_129829923.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  EKH72_RS05035 msbA 1000023..1001771 (+) 1749 WP_025542099.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  EKH72_RS05040 lpxK 1001777..1002784 (+) 1008 WP_129829924.1 tetraacyldisaccharide 4'-kinase -
  EKH72_RS05045 - 1002765..1002944 (+) 180 WP_023585387.1 Trm112 family protein -
  EKH72_RS05050 kdsB 1002944..1003699 (+) 756 WP_021449339.1 3-deoxy-manno-octulosonate cytidylyltransferase -
  EKH72_RS05055 - 1003777..1005336 (-) 1560 WP_005456192.1 SpoVR family protein -
  EKH72_RS05060 - 1005348..1006619 (-) 1272 WP_005481863.1 YeaH/YhbH family protein -
  EKH72_RS05065 - 1006667..1008601 (-) 1935 WP_005456210.1 PrkA family serine protein kinase -
  EKH72_RS05075 - 1009089..1009589 (-) 501 WP_005456271.1 YfbU family protein -
  EKH72_RS05080 - 1009744..1010412 (-) 669 WP_025520632.1 energy-coupling factor ABC transporter permease -
  EKH72_RS05085 pflA 1010573..1011313 (-) 741 WP_005456250.1 pyruvate formate lyase 1-activating protein -
  EKH72_RS05090 - 1011460..1012437 (-) 978 WP_025499395.1 lipid A deacylase LpxR family protein -
  EKH72_RS05095 pflB 1012590..1014866 (-) 2277 WP_005456189.1 formate C-acetyltransferase -
  EKH72_RS05100 - 1015173..1016720 (-) 1548 WP_005456280.1 DUF3360 family protein -
  EKH72_RS05105 - 1017177..1018643 (+) 1467 WP_005456213.1 hypothetical protein -
  EKH72_RS05115 - 1018886..1019656 (+) 771 WP_005456207.1 ABC transporter ATP-binding protein -

Sequence


Protein


Download         Length: 752 a.a.        Molecular weight: 84699.17 Da        Isoelectric Point: 9.4163

>NTDB_id=331937 EKH72_RS05030 WP_129829923.1 997733..999991(+) (comEC) [Vibrio parahaemolyticus strain D3112]
MTLLEKSWTLALFVASVISSAWWPTMPDWRWLLLGIITTGSIIKLRRGLISIGVIVGFMVVIVHGNIMEYQRQALFQAGE
NSTIIGRVDSSFTQISHGYEGVVAIKQVNTHTLLPFLKPKVRFITPFPLAVNSEFTTNVLIKPIIGLRNEAGFDAEKQSM
GSGVVARAVVTKDSYWVIRTSSSWREAIIQTVERDISRLEHFALIKALVFADRTGLTKEDWQSLRDSGLLHLVSISGLHI
GMALTFGLALGGLIRLAMPRYWFLPSVSGLAFAIVYAWLADFSLPTTRAVSVCIIYLALKYWLVHWSPWRVVLLAVALQL
FFQPFASFSLSFWLSYLSVGAVLFAVNTVEDSKEGRLGKLRILLLTQLILSLLIVPISGYFFSGFSWSSLVYNLVFISWF
GFVVVPIMFAALIASLLFPMLATVLWCLLDVFLVPLSWSVRYAIGTWQPISTEWTFVIAVVSAVLVLRHVMPRYVWMFVC
VIVVITGLFPKQYNQTWRIDVLDVGHGLAVLVEKEGRVLLYDTGKAWHNGSIAEQVITPVLHRRGYSSVDTMILSHADND
HAGGRKVIEQYFSPKHKLSSQSFLHYQPCIAGEKWKWQGLNMEVLWPPKPVVRAYNPHSCVISLEDPSTGFKMLFTGDIE
AISEWILLREPEKLRSDVMLVPHHGSKSSSNLKFINAVEPSLAIASTAKLNQWGMPAPEVVQAYTDSGVSWLDTGSDGQI
TILLDGNNWRFESKRRETIEPWYRQMLRNRVE

Nucleotide


Download         Length: 2259 bp        

>NTDB_id=331937 EKH72_RS05030 WP_129829923.1 997733..999991(+) (comEC) [Vibrio parahaemolyticus strain D3112]
ATGACTCTCTTAGAAAAAAGTTGGACCTTGGCGTTATTTGTCGCGAGCGTAATTTCGTCTGCATGGTGGCCGACGATGCC
GGATTGGCGTTGGTTGCTGCTGGGAATAATTACCACTGGCTCAATAATTAAATTACGTCGAGGCTTAATTAGCATAGGCG
TAATTGTGGGCTTTATGGTTGTCATCGTCCACGGCAATATTATGGAGTATCAGAGACAAGCCCTTTTTCAAGCAGGTGAG
AATAGTACCATAATTGGTAGAGTTGACAGCTCTTTTACGCAAATAAGTCACGGATATGAAGGTGTCGTAGCGATAAAACA
AGTGAATACTCACACTCTGTTACCTTTTCTTAAACCTAAAGTTCGCTTTATAACGCCATTCCCACTAGCTGTTAACAGTG
AGTTTACGACTAACGTTTTGATTAAGCCCATTATCGGCCTCAGAAATGAAGCGGGTTTCGACGCAGAAAAGCAGTCAATG
GGAAGTGGTGTTGTTGCAAGAGCGGTCGTAACTAAAGATTCATACTGGGTCATTCGTACTTCTTCGTCTTGGCGCGAAGC
TATTATTCAAACTGTTGAGCGTGATATTTCTCGCCTTGAGCATTTTGCTCTAATCAAAGCACTGGTTTTTGCTGATCGCA
CTGGGCTTACCAAAGAGGATTGGCAGTCTCTGCGTGACAGCGGACTACTGCATCTTGTATCAATTTCTGGTTTACACATT
GGGATGGCATTGACGTTTGGGCTCGCCCTTGGTGGGCTTATTCGGCTTGCTATGCCGCGATATTGGTTTCTGCCATCAGT
GAGTGGGTTGGCGTTTGCCATTGTTTACGCTTGGCTTGCTGATTTTTCTTTGCCGACGACCCGAGCCGTTTCTGTTTGTA
TCATCTATCTTGCTTTGAAGTATTGGCTTGTTCATTGGAGTCCATGGCGCGTAGTGTTACTGGCCGTGGCGTTGCAACTT
TTCTTCCAACCTTTTGCTTCTTTTAGTTTGAGTTTTTGGCTGTCTTATTTATCCGTTGGTGCCGTTTTATTTGCGGTTAA
CACAGTGGAAGACTCGAAGGAAGGTCGTTTGGGAAAGTTGCGGATACTTCTGTTGACTCAACTGATACTGAGCTTATTGA
TTGTCCCGATCAGTGGCTATTTTTTCTCTGGATTTAGCTGGTCTTCTTTAGTCTACAATTTGGTTTTCATTTCTTGGTTT
GGCTTTGTTGTCGTTCCAATCATGTTTGCTGCTTTAATTGCGTCATTGCTCTTTCCTATGTTGGCGACGGTTCTATGGTG
TTTGCTTGACGTGTTCCTTGTACCACTAAGTTGGTCGGTTCGATATGCCATAGGAACTTGGCAACCCATTAGTACCGAGT
GGACATTTGTTATTGCTGTAGTGAGTGCCGTGCTCGTTTTAAGACATGTTATGCCTCGTTACGTTTGGATGTTTGTCTGT
GTAATTGTTGTGATTACAGGGTTGTTTCCTAAGCAATATAACCAAACTTGGCGTATTGATGTGCTTGATGTCGGGCATGG
GTTGGCGGTGCTGGTCGAAAAAGAGGGGAGAGTTTTACTCTATGATACGGGCAAGGCTTGGCACAACGGCAGTATAGCTG
AGCAAGTGATTACGCCAGTACTGCACCGCAGAGGCTACTCAAGTGTCGATACGATGATTTTAAGTCATGCTGATAATGAC
CATGCTGGTGGCCGAAAAGTGATCGAACAGTACTTTTCACCTAAACACAAACTAAGCAGCCAGAGCTTTTTACATTATCA
GCCTTGCATCGCTGGTGAAAAATGGAAGTGGCAAGGGTTGAACATGGAGGTACTTTGGCCTCCTAAACCTGTAGTACGTG
CATACAACCCCCACTCTTGTGTCATCAGTCTGGAAGACCCTAGTACAGGTTTTAAAATGTTGTTTACTGGTGATATCGAA
GCCATCAGCGAATGGATACTGCTTCGAGAGCCAGAAAAGCTGCGCAGTGATGTAATGCTAGTGCCGCATCATGGAAGCAA
AAGCTCTTCTAACCTTAAGTTTATCAATGCTGTTGAGCCTAGCTTGGCTATTGCTTCAACGGCAAAACTAAACCAGTGGG
GAATGCCCGCACCTGAAGTCGTTCAAGCCTATACCGACAGTGGTGTTAGTTGGTTAGACACTGGAAGTGATGGCCAAATA
ACAATCTTACTTGATGGCAATAACTGGCGTTTTGAAAGTAAACGTCGTGAGACAATTGAGCCTTGGTATAGGCAGATGCT
GCGTAACCGAGTAGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio parahaemolyticus RIMD 2210633

97.872

100

0.979

  comEC Vibrio campbellii strain DS40M4

66.223

100

0.662

  comEC Vibrio cholerae strain A1552

40.903

100

0.41


Multiple sequence alignment