Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEC
Type: Machinery gene
Locus tag: RHS39_RS04995
Genome accession: NZ_CP133900
Coordinates: 1051405..1053663 (+)
Length: 752 a.a.
NCBI ID: WP_025789083.1
UniProt ID: -
Organism: Vibrio parahaemolyticus strain TW01
Function: ssDNA transport through the inner membrane (predicted from homology); DNA binding and uptake
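
The coordinates, the nucleotide length (2259 bp) and the protein length (752 a.a.) reported in this record can be checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function name `cds_lengths` is illustrative and not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    aa_len = nt_len // 3 - 1  # the stop codon is not translated
    return nt_len, aa_len


# comEC coordinates from the Overview section
nt_len, aa_len = cds_lengths(1051405, 1053663)
print(nt_len, aa_len)  # 2259 752
```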

Genomic Context


Location: 1046405..1058663
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
--- | --- | --- | --- | --- | --- | ---
RHS39_RS04970 (RHS39_04970) | - | 1046693..1047262 (-) | 570 | WP_023624107.1 | PilZ domain-containing protein | -
RHS39_RS04975 (RHS39_04975) | lolC | 1047527..1048735 (+) | 1209 | WP_031501594.1 | lipoprotein-releasing ABC transporter permease subunit LolC | -
RHS39_RS04980 (RHS39_04980) | lolD | 1048728..1049435 (+) | 708 | WP_005481886.1 | lipoprotein-releasing ABC transporter ATP-binding protein LolD | -
RHS39_RS04985 (RHS39_04985) | lolE | 1049438..1050682 (+) | 1245 | WP_025504804.1 | lipoprotein-releasing ABC transporter permease subunit LolE | -
RHS39_RS04990 (RHS39_04990) | - | 1050887..1051396 (-) | 510 | WP_005456245.1 | DUF2062 domain-containing protein | -
RHS39_RS04995 (RHS39_04995) | comEC | 1051405..1053663 (+) | 2259 | WP_025789083.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene
RHS39_RS05000 (RHS39_05000) | msbA | 1053695..1055443 (+) | 1749 | WP_015296464.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | -
RHS39_RS05005 (RHS39_05005) | lpxK | 1055449..1056456 (+) | 1008 | WP_025789084.1 | tetraacyldisaccharide 4'-kinase | -
RHS39_RS05010 (RHS39_05010) | - | 1056437..1056616 (+) | 180 | WP_005378451.1 | Trm112 family protein | -
RHS39_RS05015 (RHS39_05015) | kdsB | 1056616..1057371 (+) | 756 | WP_025789085.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | -
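
The Location range shown for this context appears to be the comEC coordinates extended by 5,000 bp on each side. That is inferred from the numbers in this record, not from documented database behaviour. The sketch below reproduces the window under that assumption and computes the spacing between neighbouring genes; `flanking_window` and `gap_bp` are hypothetical helper names, and only a few rows of the table are copied in.

```python
# (gene label, start, end) copied from the Genomic Context table
LOLC = ("lolC", 1047527, 1048735)
LOLD = ("lolD", 1048728, 1049435)
DUF2062 = ("RHS39_RS04990", 1050887, 1051396)
COMEC = ("comEC", 1051405, 1053663)
MSBA = ("msbA", 1053695, 1055443)


def flanking_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a 1-based inclusive interval by `flank` bp on both sides.

    The 5,000 bp default is an assumption inferred from this record.
    """
    return max(1, start - flank), end + flank


def gap_bp(left: tuple, right: tuple) -> int:
    """Bases between two genes ordered left to right.

    A negative value means the two intervals overlap by that many bases.
    """
    return right[1] - left[2] - 1


print(flanking_window(COMEC[1], COMEC[2]))  # (1046405, 1058663)
print(gap_bp(DUF2062, COMEC))               # 8
print(gap_bp(COMEC, MSBA))                  # 31
print(gap_bp(LOLC, LOLD))                   # -8, i.e. an 8 bp overlap
```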

Sequence


Protein


Length: 752 a.a.
Molecular weight: 84742.34 Da
Isoelectric point: 9.5260

>NTDB_id=877420 RHS39_RS04995 WP_025789083.1 1051405..1053663(+) (comEC) [Vibrio parahaemolyticus strain TW01]
MTLLEKSWTLALFVASVISSAWWPTMPDWRWLLLGIITTGSIIKLRRGLISIGVIVGFMVVIVHGNIMEYQRQALFQAGE
NSTIIGRVDSSFTQISHGYEGVVAIKQVNTHTLLPFLKPKVRLITPFPLAVNSEFTTNVLIKPIIGLRNEAGFDAEKQSM
GSGVVARAVVTKDSYWVIRTSSSWRESIIQTVERDISRLEHFALIKALAFADRTGLTKEDWQSLRDSGLLHLVSISGLHI
GMALTFGLALGGLIRLVMPRYWFLPSVSGLAFAIVYAWLADFSLPTTRAVSVCIIYIALKYWLVHWSPWRVLLLAVALQL
FFQPFASFSLSFWLSYLSVGAVLFAVNTVQDSKEGRLGKLRILLLTQLILSLLIVPISGYFFSGFSWSSLVYNLVFIPWF
GFVVVPIMFAALIASLLFPMLATVLWCLLDVFLVPLSWSVRYAIGTWQPISAEWTFVIAVVSAVLVLRHVMPRYVWMFVC
VIVVMTGLFPKQDNQTWRIDVLDVGHGLAVLVEKEGRVLLYDTGKAWQNGSIAEQVIRPVLHRRGYSIVDTMILSHADND
HAGGRKVIEQYFSPKHKLSSQSFLHYQPCIAGEKWKWQGLNMEVLWPPKPVVRAYNPHSCVISLEDPSTGFKMLFTGDIE
AISEWILLREPEKLRSDVMLVPHHGSKTSSNPKFINVVEPSLAIASTAKLNQWGMPAPEVVQAYTDSGVSWLDTGSDGQI
TILLDGNNWRFESKRRETIEPWYRQMLRNRVE

Nucleotide


Length: 2259 bp

>NTDB_id=877420 RHS39_RS04995 WP_025789083.1 1051405..1053663(+) (comEC) [Vibrio parahaemolyticus strain TW01]
ATGACTCTCTTAGAAAAAAGTTGGACCTTGGCGTTATTTGTCGCGAGCGTAATTTCGTCTGCATGGTGGCCGACGATGCC
GGATTGGCGTTGGTTGCTGCTGGGAATAATTACCACTGGCTCAATAATTAAATTACGTCGAGGCTTAATTAGCATAGGCG
TAATTGTGGGCTTTATGGTTGTCATCGTCCACGGCAATATTATGGAGTATCAGAGACAAGCCCTTTTTCAAGCAGGTGAG
AATAGTACCATAATTGGTAGAGTTGACAGCTCTTTTACGCAAATAAGTCACGGATATGAAGGTGTCGTAGCGATAAAACA
AGTGAATACTCACACTCTGTTACCTTTTCTTAAACCTAAAGTCCGCCTTATAACGCCATTCCCACTAGCTGTTAACAGTG
AGTTTACGACTAACGTTTTGATTAAGCCCATTATCGGCCTCAGAAATGAAGCGGGGTTCGACGCAGAAAAACAGTCAATG
GGAAGTGGTGTTGTTGCAAGAGCGGTCGTAACTAAAGATTCATACTGGGTCATTCGTACTTCTTCGTCTTGGCGCGAGTC
GATTATTCAAACTGTTGAGCGTGATATTTCTCGCCTTGAGCACTTTGCTCTAATCAAAGCACTGGCTTTTGCTGATCGCA
CTGGGCTTACCAAAGAGGATTGGCAGTCTCTGCGTGACAGCGGATTACTGCATCTTGTATCGATTTCTGGTTTACACATT
GGGATGGCATTGACGTTTGGGCTCGCCCTTGGTGGGCTTATTCGGCTTGTTATGCCGCGATATTGGTTTCTGCCATCAGT
GAGTGGGTTGGCGTTTGCCATTGTTTACGCTTGGCTTGCTGATTTTTCTTTGCCGACGACCCGAGCCGTTTCTGTTTGTA
TTATCTATATTGCTTTGAAGTATTGGCTTGTTCATTGGAGTCCATGGCGCGTACTGTTATTGGCTGTGGCGTTGCAACTT
TTCTTCCAACCTTTTGCTTCTTTTAGTTTGAGTTTTTGGCTGTCTTATTTATCCGTTGGTGCCGTTTTGTTTGCGGTTAA
CACAGTGCAAGACTCGAAGGAAGGTCGTTTGGGAAAGTTGCGGATACTTCTGTTGACTCAACTGATACTGAGCTTATTGA
TTGTCCCGATCAGTGGCTATTTTTTCTCTGGATTTAGCTGGTCTTCTTTAGTCTACAATTTGGTTTTCATTCCTTGGTTT
GGCTTTGTTGTCGTTCCAATCATGTTTGCTGCTTTAATTGCGTCATTGCTCTTTCCTATGTTGGCGACGGTTCTATGGTG
TTTGCTTGACGTGTTCCTTGTACCACTAAGTTGGTCGGTTCGATATGCCATAGGAACTTGGCAACCCATTAGTGCCGAGT
GGACATTTGTTATTGCTGTAGTGAGTGCCGTGCTCGTTTTAAGACATGTTATGCCTCGTTACGTTTGGATGTTTGTCTGT
GTAATTGTTGTGATGACAGGGTTGTTTCCTAAGCAAGATAACCAAACTTGGCGTATTGATGTGCTTGATGTCGGGCATGG
GTTGGCGGTGCTGGTTGAAAAAGAGGGTAGAGTTTTACTCTATGATACGGGCAAAGCTTGGCAAAACGGCAGTATAGCTG
AGCAAGTGATTAGGCCAGTACTGCATCGCAGAGGCTACTCAATTGTCGATACGATGATTTTAAGTCATGCTGATAATGAC
CATGCTGGTGGCCGAAAAGTGATCGAACAGTACTTTTCACCTAAACACAAACTAAGCAGCCAGAGCTTTTTACATTATCA
GCCTTGCATCGCTGGTGAAAAATGGAAGTGGCAAGGATTGAACATGGAGGTACTTTGGCCTCCTAAACCTGTAGTACGTG
CATACAACCCCCACTCTTGTGTCATCAGTCTGGAAGACCCTAGTACAGGTTTTAAAATGTTGTTTACTGGTGATATCGAA
GCCATCAGCGAATGGATACTGCTTCGAGAGCCAGAAAAGCTGCGCAGTGATGTAATGCTAGTGCCGCATCATGGAAGCAA
AACCTCTTCTAATCCTAAGTTTATCAATGTTGTTGAGCCTAGCTTGGCTATTGCGTCAACGGCAAAACTAAACCAGTGGG
GAATGCCTGCACCTGAAGTCGTTCAGGCCTATACCGATAGTGGTGTTAGTTGGTTAGACACTGGAAGTGATGGCCAAATA
ACAATCTTACTTGATGGCAATAACTGGCGTTTTGAAAGTAAACGTCGTGAGACAATTGAGCCTTGGTATAGGCAGATGCT
GCGTAACCGAGTAGAATAA
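
Both FASTA headers in this record pack the database ID, locus tag, protein ID, coordinates, strand, gene name and organism into one line. A minimal parsing sketch follows. The regular expression is an assumption based only on the two headers shown here and may not cover other records; `parse_header` is an illustrative name, not a database API.

```python
import re

# Layout assumed from the headers in this record:
# >NTDB_id=<id> <locus_tag> <protein_id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split one FASTA header line into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (
    ">NTDB_id=877420 RHS39_RS04995 WP_025789083.1 "
    "1051405..1053663(+) (comEC) [Vibrio parahaemolyticus strain TW01]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```

Parsing the coordinates as integers makes it straightforward to compare the header against the sequence length that follows it.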


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
--- | --- | --- | --- | ---
comEC | Vibrio parahaemolyticus RIMD 2210633 | 98.537 | 100 | 0.985
comEC | Vibrio campbellii strain DS40M4 | 66.755 | 100 | 0.668
comEC | Vibrio cholerae strain A1552 | 41.744 | 100 | 0.42