Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   Q4437_RS04850 Genome accession   NZ_CP130651
Coordinates   1035367..1037625 (+) Length   752 a.a.
NCBI ID   WP_005481856.1    Uniprot ID   Q87R17
Organism   Vibrio parahaemolyticus strain HZ     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1030367..1042625
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  Q4437_RS04825 (Q4437_04825) - 1030655..1031224 (-) 570 WP_005481874.1 PilZ domain-containing protein -
  Q4437_RS04830 (Q4437_04830) lolC 1031489..1032697 (+) 1209 WP_029787805.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  Q4437_RS04835 (Q4437_04835) lolD 1032690..1033397 (+) 708 WP_005481886.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  Q4437_RS04840 (Q4437_04840) lolE 1033400..1034644 (+) 1245 WP_005481897.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  Q4437_RS04845 (Q4437_04845) - 1034849..1035358 (-) 510 WP_005481854.1 DUF2062 domain-containing protein -
  Q4437_RS04850 (Q4437_04850) comEC 1035367..1037625 (+) 2259 WP_005481856.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  Q4437_RS04855 (Q4437_04855) msbA 1037657..1039405 (+) 1749 WP_005456206.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  Q4437_RS04860 (Q4437_04860) lpxK 1039411..1040418 (+) 1008 WP_005456276.1 tetraacyldisaccharide 4'-kinase -
  Q4437_RS04865 (Q4437_04865) - 1040399..1040578 (+) 180 WP_005378451.1 Trm112 family protein -
  Q4437_RS04870 (Q4437_04870) kdsB 1040578..1041333 (+) 756 WP_005456221.1 3-deoxy-manno-octulosonate cytidylyltransferase -

Sequence


Protein


Download         Length: 752 a.a.        Molecular weight: 84699.17 Da        Isoelectric Point: 9.5661

>NTDB_id=860903 Q4437_RS04850 WP_005481856.1 1035367..1037625(+) (comEC) [Vibrio parahaemolyticus strain HZ]
MTLLEKSWTLALFVASVISSAWWPTMPDWRWLLLGIITTGSIIKLRRGLISIGVIVGFMVVIVHGNIMEYQRQALFQAGE
NSTIIGRVDSSFTQISHGYEGVVAIKQVNSHTLLPFLKPKVRLITPFPLAVNSEFTTNVLIKPIIGLRNEAGFDAEKQSM
GSGVVARAVVTKDSYWVIRTSSSWREAIIQTVERDISRLEHFALIKALAFADRTGLTKEDWQSLRDSGLLHLVSISGLHI
GMALTFGLALGGLIRLAMPRYWFLPSVSGLAFAIVYAWLADFSLPTTRAVSVCIIYIALKYWLVHWSPWRVLLLAVALQL
FFQPFASFSLSFWLSYLSVGAVLFAVNTVQDSKEGRLGKLRILLLTQLILSLLIVPISGYFFSGFSWSSLVYNLVFIPWF
GFVVVPIMFAALIASLLFPMLATVLWYLLDIFLVPLSWSVRYAIGTWQPISAEWTFVIAVVSAVLVSRHVMPRYVWMFVC
VIVVMTGLFPKQYNQTWRIDVLDVGHGLAVLVEKEGRVLLYDTGKAWQNGSIAEQVITPVLHRRGYSSVDTMILSHADND
HAGGRKVIEQYFSPKHKLSSQSFLHYQPCIAAEKWKWQGLNMEVLWPPKPVVRAYNPHSCVISLEDPSTGFKMLFTGDIE
AISEWILLREPEKLRSDVMLVPHHGSKSSSNPKFINVVEPSLAIASTAKLNQWGMPAPEVVQAYTDSGVSWLDTGSDGQI
TILLDGNNWRFESKRRETIEPWYRQMLRNRVE

Nucleotide


Download         Length: 2259 bp        

>NTDB_id=860903 Q4437_RS04850 WP_005481856.1 1035367..1037625(+) (comEC) [Vibrio parahaemolyticus strain HZ]
ATGACTCTCTTAGAAAAAAGTTGGACCTTGGCGTTATTTGTCGCGAGCGTAATTTCGTCTGCATGGTGGCCGACGATGCC
GGATTGGCGTTGGTTGCTGCTGGGAATAATTACCACTGGCTCAATAATTAAATTACGTCGAGGCTTAATTAGCATAGGCG
TAATTGTGGGCTTTATGGTTGTCATCGTCCACGGCAATATTATGGAGTATCAGAGACAAGCCCTTTTTCAAGCAGGTGAG
AATAGTACCATAATTGGTAGAGTTGACAGCTCTTTTACGCAAATAAGTCACGGATATGAAGGTGTCGTAGCGATAAAACA
AGTGAATTCTCACACTCTGTTACCTTTTCTTAAACCTAAAGTCCGCCTTATAACGCCATTCCCACTAGCTGTTAACAGTG
AGTTTACGACTAACGTTTTGATTAAGCCCATTATCGGCCTCAGAAATGAAGCGGGTTTCGACGCAGAAAAGCAGTCAATG
GGAAGTGGTGTTGTTGCAAGGGCGGTCGTTACTAAAGATTCATACTGGGTCATTCGTACTTCTTCGTCTTGGCGCGAAGC
TATTATTCAAACTGTTGAGCGTGATATTTCTCGCCTTGAGCATTTTGCTCTAATCAAAGCACTGGCTTTTGCTGATCGCA
CTGGGCTTACCAAAGAGGATTGGCAGTCTCTGCGTGACAGCGGACTACTGCATCTTGTATCGATTTCTGGTTTACACATT
GGGATGGCACTGACGTTTGGGCTCGCCCTTGGTGGGCTTATTCGGCTTGCTATGCCGCGATATTGGTTTCTGCCATCAGT
TAGTGGGTTGGCGTTTGCCATTGTTTACGCTTGGCTTGCTGATTTTTCTTTGCCGACGACCCGAGCCGTTTCTGTTTGTA
TCATCTATATTGCTTTGAAGTATTGGCTTGTTCATTGGAGCCCATGGCGCGTACTGTTACTGGCCGTGGCATTGCAACTT
TTCTTCCAACCTTTTGCTTCTTTTAGTTTGAGTTTTTGGCTGTCTTATTTATCCGTTGGTGCCGTTTTGTTTGCGGTTAA
CACAGTGCAAGACTCGAAGGAAGGTCGTTTGGGAAAGTTGCGGATACTTCTGTTGACTCAACTGATACTGAGCTTATTGA
TTGTCCCGATCAGTGGCTATTTTTTCTCTGGATTTAGCTGGTCTTCTTTAGTCTACAATTTGGTTTTCATTCCTTGGTTT
GGCTTTGTTGTCGTTCCAATCATGTTTGCTGCTTTAATTGCGTCATTGCTCTTTCCTATGTTGGCGACGGTCCTATGGTA
TTTGCTTGACATATTCCTGGTTCCACTAAGTTGGTCGGTTCGATATGCCATAGGAACTTGGCAACCCATTAGTGCCGAGT
GGACATTTGTTATTGCTGTAGTGAGTGCCGTGCTCGTTTCAAGACATGTTATGCCTCGTTACGTTTGGATGTTTGTCTGT
GTAATTGTTGTGATGACCGGGTTGTTTCCTAAGCAATATAACCAAACTTGGCGTATTGATGTGCTTGATGTCGGGCATGG
GTTGGCGGTGCTGGTCGAAAAAGAAGGGAGAGTTTTACTCTATGATACGGGCAAGGCTTGGCAAAACGGCAGTATAGCTG
AGCAAGTGATTACGCCAGTACTGCACCGCAGAGGTTACTCAAGTGTCGATACGATGATTTTAAGTCATGCTGATAATGAC
CATGCTGGTGGCCGAAAAGTGATAGAACAGTACTTTTCACCTAAACACAAACTAAGCAGCCAGAGCTTTTTACATTATCA
GCCTTGTATTGCTGCTGAGAAATGGAAGTGGCAAGGGTTGAACATGGAGGTACTTTGGCCTCCTAAACCTGTAGTACGAG
CATACAACCCCCACTCTTGTGTCATCAGTCTGGAAGACCCTAGTACAGGTTTTAAAATGTTGTTTACTGGTGATATCGAA
GCCATCAGCGAATGGATACTGCTTCGAGAGCCAGAAAAGCTGCGCAGTGATGTAATGCTAGTGCCGCATCATGGAAGCAA
AAGCTCTTCTAACCCTAAGTTTATCAATGTTGTTGAGCCTAGCTTGGCTATTGCTTCAACGGCAAAACTAAACCAGTGGG
GAATGCCCGCACCTGAAGTCGTTCAAGCCTATACCGACAGTGGTGTTAGTTGGTTAGACACTGGAAGTGATGGCCAAATA
ACAATCTTACTTGATGGCAATAACTGGCGTTTTGAAAGTAAACGTCGTGAGACAATTGAGCCTTGGTATAGGCAGATGCT
GCGTAACCGAGTAGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q87R17

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio parahaemolyticus RIMD 2210633

100

100

1

  comEC Vibrio campbellii strain DS40M4

66.755

100

0.668

  comEC Vibrio cholerae strain A1552

41.215

100

0.415