Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   PY372_RS04770 Genome accession   NZ_CP119301
Coordinates   1012895..1015153 (+) Length   752 a.a.
NCBI ID   WP_062851295.1    Uniprot ID   -
Organism   Vibrio parahaemolyticus strain LC     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1007895..1020153
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  PY372_RS04745 (PY372_04745) - 1008183..1008752 (-) 570 WP_062851292.1 PilZ domain-containing protein -
  PY372_RS04750 (PY372_04750) lolC 1009017..1010225 (+) 1209 WP_062851293.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  PY372_RS04755 (PY372_04755) lolD 1010218..1010925 (+) 708 WP_005481886.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  PY372_RS04760 (PY372_04760) lolE 1010928..1012172 (+) 1245 WP_062851294.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  PY372_RS04765 (PY372_04765) - 1012377..1012886 (-) 510 WP_005456245.1 DUF2062 domain-containing protein -
  PY372_RS04770 (PY372_04770) comEC 1012895..1015153 (+) 2259 WP_062851295.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  PY372_RS04775 (PY372_04775) msbA 1015185..1016933 (+) 1749 WP_025610353.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  PY372_RS04780 (PY372_04780) lpxK 1016939..1017946 (+) 1008 WP_047732974.1 tetraacyldisaccharide 4'-kinase -
  PY372_RS04785 (PY372_04785) - 1017927..1018106 (+) 180 WP_005378451.1 Trm112 family protein -
  PY372_RS04790 (PY372_04790) kdsB 1018106..1018861 (+) 756 WP_062851296.1 3-deoxy-manno-octulosonate cytidylyltransferase -

Sequence


Protein


Download         Length: 752 a.a.        Molecular weight: 84615.15 Da        Isoelectric Point: 9.4335

>NTDB_id=798386 PY372_RS04770 WP_062851295.1 1012895..1015153(+) (comEC) [Vibrio parahaemolyticus strain LC]
MTLLEKSWTLALFVASVISSAWWPTMPDWRWLLLGIITTGSIIKLRRGLISIGVIVGFMVVIVHGNIMEYQRQALFQAGE
NSTIIGRVDSSFTQISHGYEGVIAIKQVNSHTLLPFLKPKVRLITPFPLAVNSEFTTNVLIKPIIGLRNEAGFDAEKQSM
GSGVVARAVVTKDSHWVIRTSSSWRESIIQTVERDISRLEHFALIKALAFADRTGLTKEDWQSLRDSGLLHLVSISGLHI
GMALTFGLALGGLIRLAMPRYWFLPSVSGLAFAIVYAWLADFSLPTTRAVSVCIIYIALKYWLVHWSPWRVLLLAVALQL
FFQPFASFSLSFWLSYLSVGAVLFAVNTVQDSKEGRLGKLRILLLTQLMLSLLIVPISGYFFSGFSWSSLVYNLVFIPWF
GFVVVPIMFAALIASLLFPMLATVLWCLLDVFLVPLSWSVRYAIGTWQPISTEWTFVIAVVSAVLVLRHVMPRYVWMFVC
VIVVVTGLFPKQYSQTWRIDVLDVGHGLAVLVEKEGRVLLYDTGKAWQNGSIAEQVITPVLHRRGYSSVDTMILSHADND
HAGGRKVIEQYFSPKHKLSSQSFLHYQPCIAAEKWKWQGLNMEVLWPPKPVVRAYNPHSCVISLEDPSTGFKMLFTGDIE
AISEWILLREPEKLRSDVMLVPHHGSKTSSNPKFINVVEPSLAIASTAKLNQWGMPAPEVVQAYTDSGVSWLDTGSDGQI
TILLDGNNWRFESKRLETIEPWYRQMLRNRVE

Nucleotide


Download         Length: 2259 bp        

>NTDB_id=798386 PY372_RS04770 WP_062851295.1 1012895..1015153(+) (comEC) [Vibrio parahaemolyticus strain LC]
ATGACTCTCTTAGAAAAAAGTTGGACCTTGGCGTTATTTGTCGCGAGCGTAATTTCGTCTGCATGGTGGCCGACGATGCC
GGATTGGCGTTGGTTGCTGCTGGGAATAATTACCACTGGCTCAATAATTAAATTACGTCGAGGCTTAATTAGCATAGGCG
TAATTGTGGGCTTTATGGTTGTCATCGTTCACGGCAATATTATGGAGTATCAGAGACAAGCCCTTTTTCAAGCAGGTGAG
AATAGTACCATAATTGGTAGAGTTGACAGCTCTTTTACGCAAATAAGTCACGGATATGAAGGTGTCATAGCGATAAAACA
AGTGAATTCTCACACTCTGTTACCTTTTCTTAAACCTAAAGTCCGCCTTATAACGCCATTCCCACTAGCTGTTAACAGTG
AGTTTACGACTAACGTTTTGATTAAGCCCATTATCGGCCTCAGAAATGAAGCGGGTTTCGACGCAGAAAAGCAGTCAATG
GGAAGTGGTGTTGTTGCTAGAGCGGTCGTAACTAAAGATTCACACTGGGTCATTCGTACTTCTTCGTCTTGGCGCGAGTC
GATAATTCAAACTGTTGAGCGTGATATTTCTCGCCTTGAGCACTTTGCTCTAATCAAAGCACTGGCTTTTGCTGATCGCA
CTGGGCTTACCAAAGAGGATTGGCAGTCTCTGCGTGACAGCGGATTACTGCATCTTGTATCGATTTCTGGTTTACACATT
GGGATGGCATTGACGTTTGGGCTCGCCCTTGGTGGGCTTATTCGGCTTGCTATGCCGCGATATTGGTTTCTACCATCAGT
GAGTGGGTTGGCGTTTGCCATTGTTTACGCTTGGCTTGCTGATTTTTCTTTGCCGACGACCCGAGCCGTTTCTGTTTGTA
TCATCTATATTGCTTTGAAGTATTGGCTTGTTCATTGGAGCCCATGGCGCGTACTGTTACTGGCCGTGGCGTTGCAACTT
TTCTTCCAACCTTTTGCTTCTTTTAGTTTGAGTTTTTGGCTGTCTTATTTATCCGTTGGTGCCGTTTTGTTTGCGGTTAA
CACAGTGCAAGACTCGAAGGAAGGTCGTTTGGGAAAGTTGCGGATACTTCTGTTGACTCAACTGATGCTGAGCTTATTGA
TTGTCCCGATCAGTGGCTATTTTTTCTCTGGATTTAGCTGGTCTTCTTTAGTCTACAATTTGGTTTTCATTCCTTGGTTT
GGCTTTGTTGTCGTTCCAATCATGTTTGCTGCTTTAATTGCGTCATTGCTCTTTCCTATGTTGGCGACGGTTCTATGGTG
CTTGCTTGACGTGTTCCTTGTACCACTAAGTTGGTCGGTTCGATATGCCATAGGAACTTGGCAACCCATTAGTACCGAGT
GGACATTTGTTATTGCTGTAGTGAGTGCCGTGCTCGTTTTAAGACATGTTATGCCTCGTTACGTTTGGATGTTTGTCTGT
GTAATTGTTGTGGTGACAGGGTTGTTTCCTAAGCAATATAGCCAAACTTGGCGTATTGATGTACTTGATGTCGGGCATGG
GTTGGCGGTGCTGGTCGAAAAAGAGGGGAGAGTTTTACTCTATGATACGGGCAAGGCTTGGCAAAACGGCAGTATAGCTG
AGCAAGTGATTACGCCAGTACTGCACCGCAGAGGTTACTCAAGTGTCGATACGATGATTTTAAGTCATGCCGATAATGAC
CATGCTGGCGGCCGAAAAGTGATAGAGCAGTACTTTTCACCTAAACACAAACTAAGCAGCCAGAGCTTTTTACATTATCA
GCCTTGTATTGCTGCTGAGAAATGGAAGTGGCAAGGGTTGAACATGGAGGTACTTTGGCCTCCTAAACCTGTTGTACGTG
CATACAACCCCCACTCTTGTGTCATCAGTCTGGAAGACCCTAGTACAGGTTTTAAAATGTTGTTTACTGGTGATATCGAA
GCCATCAGCGAATGGATACTGCTTCGAGAGCCAGAAAAGCTGCGCAGTGATGTAATGCTAGTGCCACATCATGGAAGCAA
AACCTCTTCTAACCCTAAGTTTATCAATGTTGTTGAGCCTAGCTTGGCTATTGCTTCAACGGCAAAACTAAACCAGTGGG
GAATGCCCGCACCTGAAGTCGTTCAGGCCTATACCGATAGTGGTGTTAGTTGGTTAGACACTGGAAGTGATGGCCAAATA
ACAATCTTACTTGATGGCAATAACTGGCGTTTTGAAAGTAAACGTCTTGAGACAATTGAGCCTTGGTATAGGCAGATGCT
GCGTAACCGAGTAGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio parahaemolyticus RIMD 2210633

98.404

100

0.984

  comEC Vibrio campbellii strain DS40M4

66.755

100

0.668

  comEC Vibrio cholerae strain A1552

41.169

100

0.412