Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   HK86_RS06725 Genome accession   NC_021847
Coordinates   1420577..1422835 (+) Length   752 a.a.
NCBI ID   WP_020840314.1    Uniprot ID   -
Organism   Vibrio parahaemolyticus O1:Kuk str. FDA_R31     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1415577..1427835
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  HK86_RS06700 (M634_06890) - 1415864..1416433 (-) 570 WP_020840310.1 PilZ domain-containing protein -
  HK86_RS06705 (M634_06895) lolC 1416698..1417906 (+) 1209 WP_041170631.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  HK86_RS06710 (M634_06900) lolD 1417899..1418606 (+) 708 WP_020840312.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  HK86_RS06715 (M634_06905) lolE 1418609..1419853 (+) 1245 WP_020840313.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  HK86_RS06720 (M634_06910) - 1420059..1420568 (-) 510 WP_005456245.1 DUF2062 domain-containing protein -
  HK86_RS06725 (M634_06915) comEC 1420577..1422835 (+) 2259 WP_020840314.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  HK86_RS06730 (M634_06920) msbA 1422867..1424615 (+) 1749 WP_020840315.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  HK86_RS06735 (M634_06925) lpxK 1424621..1425628 (+) 1008 WP_020840316.1 tetraacyldisaccharide 4'-kinase -
  HK86_RS06740 (M634_06930) - 1425609..1425788 (+) 180 WP_005378451.1 Trm112 family protein -
  HK86_RS06745 (M634_06935) kdsB 1425788..1426543 (+) 756 WP_020840317.1 3-deoxy-manno-octulosonate cytidylyltransferase -

Sequence


Protein


Download         Length: 752 a.a.        Molecular weight: 84679.24 Da        Isoelectric Point: 9.5262

>NTDB_id=60532 HK86_RS06725 WP_020840314.1 1420577..1422835(+) (comEC) [Vibrio parahaemolyticus O1:Kuk str. FDA_R31]
MTLLEKSWTLALFVASVISSAWWPTMPDWRWLLLGIITTGSIIKLRRGLISIGVIVGFMVVIVHGNIMEYQRQALFQAGE
NSTIIGRVDSSFTQISHGYEGVIAIKQVNTHTLLPFLKPKVRLITPLPLAVNSEFTTNVLIKPIIGLRNEAGFDAEKQSM
GSGVVARAVVTKDSHWVIRTSSSWREAIIQTVERDISRLEHFALIKALAFADRTGLTKEDWQSLRDSGLLHLVSISGLHI
GMALTFGLALGGLIRLAMPRYWFLPSVSGLAFAIVYAWLADFSLPTTRAVSVCIIYLALKYWLVHWSPWRVLLLAVALQL
FFQPFASFSLSFWLSYLSVGAVLFAVNTVQDSKEGRLGKLRILLLTQLMLSLLIVPISGYFFSGFSWSSLVYNLVFIPWF
GFVVVPIMFAALIASLLFPMLATVLWCLLDVFLVPLSWSVRYAIGTWQPISTEWTFVIAVVSAVLVLRHVMPRYVWMFVC
VIVVMTGLFPKQYNQTWRIDVLDVGHGLAVLVEKEGRVLLYDTGKAWQNGSIAEQVITPVLHRRGYSSVDTMILSHADND
HAGGRKVIEQYFSPKHKLSSQSFLHYQPCIAAEKWKWQGVNMEVLWPPKPVVRAYNPHSCVISLEDPSTGFKMLFTSDIE
AISEWILLREPEKLRSDVILVPHHGSKTSSNPKFINVVEPSLAIASTAKLNQWGMPAPEVVQAYTDSGVSWLDTGSDGQI
TILLDGNNWRFESKRRETIEPWYRQMLRNRVE

Nucleotide


Download         Length: 2259 bp        

>NTDB_id=60532 HK86_RS06725 WP_020840314.1 1420577..1422835(+) (comEC) [Vibrio parahaemolyticus O1:Kuk str. FDA_R31]
ATGACTCTCTTAGAAAAAAGTTGGACCTTGGCGTTATTTGTCGCGAGCGTAATTTCGTCTGCATGGTGGCCGACGATGCC
GGATTGGCGTTGGTTGCTGCTGGGAATAATTACCACTGGCTCAATAATTAAATTACGTCGAGGCTTAATTAGCATAGGCG
TAATTGTGGGCTTTATGGTTGTCATCGTTCACGGCAATATTATGGAGTATCAGAGACAAGCCCTTTTTCAAGCAGGTGAG
AATAGTACCATAATTGGTAGAGTTGACAGCTCTTTTACGCAAATAAGTCACGGATATGAAGGTGTCATAGCGATAAAACA
AGTGAATACTCACACTCTGTTACCTTTTCTTAAACCTAAAGTCCGCCTTATAACGCCACTCCCACTAGCTGTTAACAGTG
AGTTTACGACTAACGTTTTGATTAAGCCCATTATCGGCCTCAGAAATGAAGCGGGTTTCGACGCAGAAAAGCAGTCAATG
GGAAGTGGTGTTGTTGCAAGAGCGGTCGTAACTAAAGATTCACACTGGGTCATTCGTACTTCTTCGTCTTGGCGCGAGGC
GATTATTCAAACCGTTGAGCGTGATATTTCTCGCCTTGAGCACTTTGCTCTAATCAAAGCACTGGCTTTTGCTGATCGCA
CTGGGCTTACCAAAGAGGATTGGCAGTCTCTGCGTGACAGCGGACTACTGCATCTTGTATCAATTTCTGGTCTACACATT
GGGATGGCATTGACGTTTGGGCTCGCCCTTGGTGGGCTTATTCGGCTTGCTATGCCGCGATATTGGTTTCTGCCATCAGT
GAGTGGGTTGGCGTTTGCCATTGTTTACGCTTGGCTTGCTGATTTTTCTTTGCCGACGACCCGAGCCGTTTCTGTTTGTA
TCATCTATCTTGCTTTGAAGTATTGGCTTGTTCATTGGAGTCCATGGCGCGTACTGTTATTGGCTGTGGCGTTGCAACTT
TTCTTCCAACCTTTTGCTTCTTTTAGTTTGAGTTTTTGGCTGTCTTATTTATCCGTTGGTGCCGTTTTGTTTGCGGTTAA
CACAGTGCAAGACTCGAAGGAAGGTCGTTTGGGAAAGTTGCGGATACTTCTGTTGACTCAACTGATGCTGAGCTTATTGA
TTGTCCCGATCAGTGGCTATTTTTTCTCTGGATTTAGCTGGTCTTCTTTAGTCTACAATTTGGTTTTCATTCCTTGGTTT
GGCTTTGTTGTCGTTCCAATCATGTTTGCTGCTTTAATTGCGTCATTGCTCTTTCCCATGTTGGCGACGGTTCTATGGTG
CTTGCTTGACGTGTTCCTTGTACCACTAAGTTGGTCGGTTCGATATGCCATAGGAACTTGGCAACCCATTAGTACCGAGT
GGACATTTGTTATTGCTGTAGTGAGTGCCGTGCTCGTTTTAAGACATGTTATGCCTCGTTACGTTTGGATGTTTGTCTGT
GTAATTGTTGTGATGACCGGGTTGTTTCCTAAGCAATATAACCAAACTTGGCGTATTGATGTACTTGATGTCGGGCATGG
GTTGGCGGTACTGGTTGAAAAAGAGGGTAGAGTTTTACTCTATGATACGGGCAAGGCTTGGCAAAACGGCAGTATAGCTG
AGCAAGTGATTACGCCAGTACTGCACCGCAGAGGTTACTCAAGTGTCGATACGATGATTTTAAGTCATGCCGATAATGAC
CATGCTGGCGGCCGAAAAGTGATAGAACAGTACTTTTCACCTAAACACAAACTAAGCAGCCAGAGCTTTTTACATTATCA
GCCTTGTATTGCTGCTGAGAAATGGAAGTGGCAAGGGGTGAACATGGAGGTACTTTGGCCTCCAAAACCTGTTGTACGTG
CATACAACCCCCACTCTTGTGTCATCAGTCTGGAAGACCCTAGTACAGGTTTTAAAATGTTGTTTACTAGTGATATCGAA
GCCATCAGTGAATGGATACTGCTTCGAGAGCCAGAAAAGCTGCGCAGTGATGTAATACTAGTGCCGCATCATGGAAGCAA
AACCTCTTCTAACCCTAAGTTTATCAATGTTGTTGAGCCTAGCTTGGCTATTGCTTCAACGGCAAAACTAAACCAGTGGG
GAATGCCCGCACCTGAAGTCGTTCAGGCCTATACCGATAGTGGTGTTAGTTGGTTAGACACTGGAAGTGATGGCCAAATA
ACAATCTTACTTGATGGCAATAACTGGCGTTTTGAAAGTAAACGTCGTGAGACAATTGAGCCTTGGTATAGGCAGATGCT
GCGTAACCGAGTAGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio parahaemolyticus RIMD 2210633

98.138

100

0.981

  comEC Vibrio campbellii strain DS40M4

66.223

100

0.662

  comEC Vibrio cholerae strain A1552

41.083

100

0.414


Multiple sequence alignment