Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   GP469_RS02025 Genome accession   NZ_CP046808
Coordinates   425156..427414 (-) Length   752 a.a.
NCBI ID   WP_005495920.1    Uniprot ID   A0A249W7Z7
Organism   Vibrio parahaemolyticus strain 2013V-1146     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 405457..430091 425156..427414 within 0


Gene organization within MGE regions


Location: 405457..430091
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GP469_RS01940 - 405457..406227 (-) 771 WP_005456207.1 ABC transporter ATP-binding protein -
  GP469_RS01950 - 406493..407968 (-) 1476 WP_005495905.1 hypothetical protein -
  GP469_RS01955 - 408425..409972 (+) 1548 WP_005456280.1 DUF3360 family protein -
  GP469_RS01960 pflB 410279..412555 (+) 2277 WP_005495907.1 formate C-acetyltransferase -
  GP469_RS01965 - 412708..413685 (+) 978 WP_005481800.1 lipid A deacylase LpxR family protein -
  GP469_RS01970 pflA 413832..414572 (+) 741 WP_005456250.1 pyruvate formate lyase 1-activating protein -
  GP469_RS01975 - 414733..415401 (+) 669 WP_005495910.1 energy-coupling factor ABC transporter permease -
  GP469_RS01980 - 415556..416056 (+) 501 WP_005456271.1 YfbU family protein -
  GP469_RS01990 - 416546..418480 (+) 1935 WP_005456210.1 PrkA family serine protein kinase -
  GP469_RS01995 - 418528..419799 (+) 1272 WP_005481863.1 YeaH/YhbH family protein -
  GP469_RS02000 - 419811..421370 (+) 1560 WP_005495912.1 SpoVR family protein -
  GP469_RS02005 kdsB 421448..422203 (-) 756 WP_005495914.1 3-deoxy-manno-octulosonate cytidylyltransferase -
  GP469_RS02010 - 422203..422382 (-) 180 WP_005378451.1 Trm112 family protein -
  GP469_RS02015 lpxK 422363..423370 (-) 1008 WP_005495917.1 tetraacyldisaccharide 4'-kinase -
  GP469_RS02020 msbA 423376..425124 (-) 1749 WP_005495919.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  GP469_RS02025 comEC 425156..427414 (-) 2259 WP_005495920.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  GP469_RS02030 - 427423..427932 (+) 510 WP_005456245.1 DUF2062 domain-containing protein -
  GP469_RS02035 lolE 428137..429381 (-) 1245 WP_005495922.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  GP469_RS02040 lolD 429384..430091 (-) 708 WP_005495923.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -

Sequence


Protein


Download         Length: 752 a.a.        Molecular weight: 84677.16 Da        Isoelectric Point: 9.5001

>NTDB_id=408128 GP469_RS02025 WP_005495920.1 425156..427414(-) (comEC) [Vibrio parahaemolyticus strain 2013V-1146]
MTLLEKSWTLALFVASVISSAWWPTMPDWRWLLLGIITTGSIIKLRRGLISIGVIVGFMVVIVHGNIMEYQRQALFQAGE
NSTIIGRVDSSFTQISHGYEGVVAIKQVNSHTLLPFLKPKVRLITPFPLAVNSEFTTNVLIKPIIGLRNEAGFDAEKQSM
GSGVVARAVVTKDSYWVIRTSSSWREAIIQTVERDISRLEHFALIKALAFADRTGLTKEDWQSLRDSGLLHLVSISGLHI
GMALTFGLALGGLIRLAMPRYWFLPSVSGLAFAIVYAWLADFSLPTTRAVSVCIIYLALKYWLVHWSPWRVLLLAVALQL
FFQPFASFSLSFWLSYLSVGAVLFAVNTVQDSKEGRLGKLRILLLTQLILSLLIVPISGYFFSGFSWSSLVYNLVFIPWF
GFVVVPIMFAALIASLLFPMLATVLWYLLDIFLVPLSWSVRYAIGTWQPISAEWTFVIAVVSAVLVLRHVMPRYVWMFVC
VIVVMTGLFPKQDNQTWRIDVLDVGHGLAVLVEKEGRVLLYDTGKAWQNGSIAEQVITPVLHRRGYSSVDTMILSHADND
HAGGRKVIEQYFSPKHKLSSQSFLHYQPCIAAEKWKWQGLNMEVLWPPKPVVRAYNPHSCVISLEDPSSGFKMLFTGDIE
AISEWILLREPEKLRSDVMLVPHHGSKTSSNPKFINVVEPSLAIASTAKLNQWGMPAPEVVQAYTDSGVSWLDTGSDGQI
TILLDGNNWRFESKRRETIEPWYRQMLRNRVE

Nucleotide


Download         Length: 2259 bp        

>NTDB_id=408128 GP469_RS02025 WP_005495920.1 425156..427414(-) (comEC) [Vibrio parahaemolyticus strain 2013V-1146]
ATGACTCTCTTAGAAAAAAGTTGGACCTTGGCGTTATTTGTCGCGAGCGTAATTTCGTCTGCATGGTGGCCGACGATGCC
AGATTGGCGTTGGTTGCTGCTGGGAATAATTACCACTGGCTCAATAATTAAATTACGTCGAGGCTTAATTAGCATAGGCG
TAATTGTGGGCTTTATGGTTGTCATCGTCCACGGCAATATTATGGAGTATCAGAGACAAGCCCTTTTTCAAGCAGGTGAG
AATAGTACCATAATTGGTAGAGTTGACAGCTCTTTTACGCAAATAAGTCACGGATATGAAGGTGTCGTAGCGATAAAACA
AGTGAATTCTCACACTCTGTTACCTTTTCTTAAACCTAAAGTCCGCCTTATAACGCCATTCCCACTAGCTGTTAACAGTG
AGTTTACGACTAACGTTTTGATTAAGCCCATTATCGGCCTCAGAAATGAAGCGGGTTTCGACGCAGAAAAGCAGTCAATG
GGAAGTGGTGTTGTTGCAAGGGCGGTCGTAACTAAAGATTCATACTGGGTCATTCGTACTTCTTCGTCTTGGCGCGAAGC
TATTATTCAAACTGTTGAGCGTGATATTTCTCGCCTTGAGCATTTTGCTCTAATCAAAGCACTGGCTTTTGCTGATCGCA
CTGGGCTTACCAAAGAGGATTGGCAGTCTCTGCGTGACAGCGGACTACTGCATCTTGTATCGATTTCTGGTTTACACATT
GGGATGGCACTGACGTTTGGGCTCGCCCTTGGTGGGCTTATTCGGCTTGCTATGCCGCGATATTGGTTTCTGCCATCAGT
GAGTGGGTTGGCGTTTGCCATTGTTTACGCTTGGCTTGCTGATTTTTCTTTGCCGACGACCCGAGCCGTTTCTGTTTGTA
TCATCTATCTTGCTTTGAAGTATTGGCTTGTTCATTGGAGTCCATGGCGCGTACTGTTATTGGCTGTGGCGTTGCAACTT
TTCTTCCAACCTTTTGCTTCTTTTAGTTTGAGTTTTTGGCTGTCTTATTTATCCGTTGGTGCCGTTTTGTTTGCGGTTAA
CACAGTGCAAGACTCGAAGGAAGGTCGTTTGGGAAAGTTGCGGATACTTCTGTTGACTCAACTGATACTGAGCTTATTGA
TTGTCCCGATCAGTGGCTATTTTTTCTCTGGATTTAGCTGGTCTTCTTTAGTCTACAATTTGGTTTTCATTCCTTGGTTT
GGCTTTGTTGTCGTTCCAATCATGTTTGCTGCTTTAATTGCGTCATTGCTCTTTCCTATGTTGGCGACGGTTCTATGGTA
TTTGCTTGACATATTCCTAGTGCCACTAAGTTGGTCGGTTCGATATGCCATAGGAACTTGGCAACCCATTAGTGCCGAGT
GGACATTTGTTATTGCTGTAGTGAGTGCCGTGCTCGTTTTAAGACATGTTATGCCTCGTTACGTTTGGATGTTTGTCTGT
GTAATTGTTGTGATGACAGGGTTGTTTCCTAAGCAAGATAACCAAACTTGGCGTATTGATGTGCTTGATGTCGGGCATGG
GTTGGCGGTGCTGGTCGAAAAAGAGGGGAGAGTTTTACTCTATGATACGGGCAAGGCTTGGCAAAACGGCAGTATAGCTG
AGCAAGTGATTACGCCAGTACTGCACCGCAGAGGTTACTCAAGTGTCGATACGATGATTTTAAGTCATGCCGATAATGAC
CATGCTGGCGGCCGAAAAGTGATAGAGCAGTACTTTTCACCTAAACACAAACTAAGCAGCCAGAGCTTTTTACATTATCA
GCCTTGTATTGCTGCTGAGAAATGGAAGTGGCAAGGGTTGAACATGGAGGTACTTTGGCCTCCTAAACCTGTTGTACGTG
CATACAACCCCCACTCTTGTGTCATCAGTCTGGAAGACCCTAGTTCAGGTTTTAAAATGTTGTTCACTGGTGATATCGAA
GCCATCAGCGAATGGATCCTGCTTCGAGAGCCAGAAAAGCTGCGCAGTGATGTAATGCTAGTGCCGCATCATGGAAGCAA
AACCTCTTCTAATCCTAAGTTTATCAATGTTGTTGAGCCTAGCTTGGCTATTGCGTCAACGGCAAAACTAAACCAGTGGG
GAATGCCTGCACCTGAAGTCGTTCAGGCCTATACCGATAGTGGTGTTAGTTGGTTAGACACTGGAAGTGATGGCCAAATA
ACAATCTTACTTGATGGCAATAACTGGCGTTTTGAAAGTAAACGTCGTGAGACAATTGAGCCTTGGTATAGGCAGATGCT
GCGTAACCGAGTAGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A249W7Z7

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio parahaemolyticus RIMD 2210633

99.335

100

0.993

  comEC Vibrio campbellii strain DS40M4

67.021

100

0.67

  comEC Vibrio cholerae strain A1552

41.612

100

0.419


Multiple sequence alignment