Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   WJ038_RS04710 Genome accession   NZ_CP149505
Coordinates   996734..998992 (+) Length   752 a.a.
NCBI ID   WP_029804268.1    Uniprot ID   -
Organism   Vibrio parahaemolyticus strain EB101     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 991734..1003992
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  WJ038_RS04685 (WJ038_04680) - 992022..992591 (-) 570 WP_029804270.1 PilZ domain-containing protein -
  WJ038_RS04690 (WJ038_04685) lolC 992856..994064 (+) 1209 WP_029785234.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  WJ038_RS04695 (WJ038_04690) lolD 994057..994764 (+) 708 WP_023585383.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  WJ038_RS04700 (WJ038_04695) lolE 994767..996011 (+) 1245 WP_015296461.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  WJ038_RS04705 (WJ038_04700) - 996216..996725 (-) 510 WP_005456245.1 DUF2062 domain-containing protein -
  WJ038_RS04710 (WJ038_04705) comEC 996734..998992 (+) 2259 WP_029804268.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  WJ038_RS04715 (WJ038_04710) msbA 999024..1000772 (+) 1749 WP_015296464.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  WJ038_RS04720 (WJ038_04715) lpxK 1000778..1001785 (+) 1008 WP_021822159.1 tetraacyldisaccharide 4'-kinase -
  WJ038_RS04725 (WJ038_04720) - 1001766..1001945 (+) 180 WP_005378451.1 Trm112 family protein -
  WJ038_RS04730 (WJ038_04725) kdsB 1001945..1002700 (+) 756 WP_005456221.1 3-deoxy-manno-octulosonate cytidylyltransferase -

Sequence


Protein


Download         Length: 752 a.a.        Molecular weight: 84795.41 Da        Isoelectric Point: 9.6367

>NTDB_id=959741 WJ038_RS04710 WP_029804268.1 996734..998992(+) (comEC) [Vibrio parahaemolyticus strain EB101]
MTLLEKSWTLALFVASVISSAWWPTMPDWRWLLLGIITTGSIIKLRRGLISIGVIVGFMVVIVHGNIMEYQRQALFQAGE
NSTIIGRVDSSFTQISHGYEGVVAIKQVNTHTLLPFLKPKVRLITPFPLAVNSEFTTNVLIKPIIGLRNEAGFDAEKQSM
GNGVVARAVVTKDSYWVIRTSSSWREAIIQTVERDISRLEHFALIKALVFADRTGLTKEDWQSLRDSGLLHLVSISGLHI
GMALTFGLALGGLIRLAMPRYWFLPSVSGLAFAIVYAWLADFSLPTTRAVSVCIIYLALKYWLVHWSPWRVLLLAVALQL
FFQPFASFSLSFWLSYLSVGAVLFAVNTVQDSKEGRLGKLRILLLTQLILSLLIVPISGYFFSGFSWSSLVYNLVFIPWF
GFVVVPIMFAALIASLLFPMLATVLWYLLDVFLVPLSWSVRYAIGTWQPISAEWTFVIAVVSVVLVLRHVMPRYVWMFVC
VIVVMTGLFPKQYNQTWRIDVLDVGHGLAVLVEKEGRVLLYDTGKAWQNGSIAEQVITPVLHRRGYSSVDTMILSHADND
HAGGRKVIEQYFSPKNKLSSQSFLHYQPCIAAEKWKWQGLNIEVLWPPKPVVRAYNPHSCVISLEDPSTGFKMLFTGDIE
AISEWILLREPEKLRSDVMLVPHHGSKTSSNPKFIKVVEPSLAIASTAKLNQWGMPAPEVVQAYTDSGVSWLDTGSDGQI
TILLDGNNWRFESKRRETIEPWYRQMLRNRVE

Nucleotide


Download         Length: 2259 bp        

>NTDB_id=959741 WJ038_RS04710 WP_029804268.1 996734..998992(+) (comEC) [Vibrio parahaemolyticus strain EB101]
ATGACTCTCTTAGAAAAAAGTTGGACCTTGGCGTTATTTGTCGCGAGCGTAATTTCGTCTGCATGGTGGCCGACGATGCC
GGATTGGCGTTGGTTGCTGCTGGGAATAATTACCACTGGCTCAATAATTAAATTACGTCGAGGCTTAATTAGCATAGGCG
TAATTGTGGGCTTTATGGTTGTCATCGTCCACGGCAATATTATGGAGTATCAGAGACAAGCCCTTTTTCAAGCAGGTGAG
AATAGTACCATAATTGGTAGAGTTGACAGCTCTTTTACGCAAATAAGTCACGGATATGAAGGTGTCGTAGCGATAAAACA
AGTGAATACTCACACTCTGTTACCTTTTCTTAAACCTAAAGTCCGCCTTATAACGCCATTCCCACTAGCTGTTAACAGTG
AGTTTACGACTAACGTTTTGATTAAGCCCATTATCGGCCTCAGAAATGAAGCGGGTTTCGACGCAGAAAAGCAGTCAATG
GGAAATGGTGTTGTTGCAAGAGCGGTCGTAACTAAAGATTCATACTGGGTCATTCGTACTTCTTCGTCTTGGCGCGAAGC
TATTATTCAAACTGTTGAGCGTGATATTTCTCGCCTTGAGCATTTTGCTCTAATTAAAGCACTGGTTTTTGCTGATCGCA
CTGGGCTTACCAAAGAGGATTGGCAGTCTCTGCGTGACAGCGGATTACTGCATCTTGTATCAATTTCTGGTCTACACATT
GGGATGGCATTGACGTTTGGGCTCGCCCTTGGTGGGCTTATTCGGCTTGCTATGCCGCGATATTGGTTTCTGCCATCAGT
GAGTGGGTTGGCGTTTGCCATTGTTTACGCTTGGCTTGCTGATTTTTCTTTGCCGACGACCCGAGCCGTTTCTGTTTGTA
TCATCTATCTTGCTTTGAAGTATTGGCTTGTTCATTGGAGTCCATGGCGCGTACTGTTATTGGCTGTGGCGTTGCAACTT
TTCTTCCAACCTTTTGCTTCTTTTAGTTTGAGTTTTTGGCTGTCTTATTTATCCGTTGGTGCCGTTTTGTTTGCGGTTAA
CACAGTGCAAGACTCGAAGGAAGGTCGTTTGGGAAAGTTGCGGATACTTCTGTTGACTCAACTGATACTGAGCTTATTGA
TTGTCCCGATCAGTGGCTATTTTTTCTCTGGATTTAGCTGGTCTTCTTTAGTCTACAATTTGGTTTTCATTCCTTGGTTT
GGCTTTGTTGTCGTTCCAATCATGTTTGCTGCTTTAATCGCGTCATTGCTCTTTCCTATGTTGGCGACGGTTCTATGGTA
TTTGCTTGACGTATTCCTTGTGCCACTAAGTTGGTCGGTTCGATATGCCATAGGAACTTGGCAACCCATTAGTGCCGAGT
GGACATTTGTTATTGCTGTAGTGAGTGTCGTGCTCGTTTTAAGACATGTTATGCCTCGTTACGTTTGGATGTTTGTCTGT
GTAATTGTTGTGATGACCGGGTTGTTTCCTAAGCAATATAACCAAACTTGGCGTATTGATGTACTTGATGTCGGGCATGG
GTTGGCGGTACTGGTTGAAAAAGAAGGGAGAGTTTTACTCTATGATACGGGCAAGGCTTGGCAAAACGGCAGTATAGCTG
AGCAAGTGATTACGCCAGTACTGCACCGCAGAGGCTACTCAAGTGTCGATACGATGATTTTAAGTCATGCCGATAATGAC
CATGCTGGCGGCCGAAAAGTGATAGAACAGTACTTTTCACCTAAAAACAAACTAAGTAGCCAGAGCTTTTTACATTATCA
GCCTTGTATTGCTGCTGAGAAGTGGAAGTGGCAAGGGTTGAACATTGAGGTACTTTGGCCTCCTAAACCTGTTGTACGTG
CATACAACCCCCACTCTTGTGTCATCAGTCTGGAAGACCCTAGTACAGGTTTTAAAATGTTGTTTACTGGTGATATCGAA
GCCATCAGCGAATGGATACTGCTTCGAGAGCCAGAAAAGCTGCGCAGTGATGTAATGCTAGTGCCGCATCATGGAAGCAA
AACCTCTTCTAACCCTAAGTTTATCAAAGTTGTTGAGCCTAGTTTGGCTATTGCTTCAACGGCAAAACTAAACCAGTGGG
GAATGCCCGCACCTGAAGTCGTTCAGGCCTATACCGACAGTGGTGTTAGTTGGTTAGACACTGGAAGTGATGGCCAAATA
ACAATCTTACTTGATGGCAATAACTGGCGTTTTGAAAGTAAACGTCGTGAGACAATTGAGCCTTGGTATAGGCAGATGCT
GCGTAACCGAGTAGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio parahaemolyticus RIMD 2210633

98.537

100

0.985

  comEC Vibrio campbellii strain DS40M4

66.622

100

0.666

  comEC Vibrio cholerae strain A1552

41.347

100

0.416