Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   FORC4_RS04835 Genome accession   NZ_CP009847
Coordinates   1039235..1041493 (+) Length   752 a.a.
NCBI ID   WP_057619250.1    Uniprot ID   -
Organism   Vibrio parahaemolyticus strain FORC_004     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1034235..1046493
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FORC4_RS04810 (FORC4_0903) - 1034523..1035092 (-) 570 WP_015296459.1 PilZ domain-containing protein -
  FORC4_RS04815 (FORC4_0904) lolC 1035357..1036565 (+) 1209 WP_057619248.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  FORC4_RS04820 (FORC4_0905) lolD 1036558..1037265 (+) 708 WP_005481886.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  FORC4_RS04825 (FORC4_0906) lolE 1037268..1038512 (+) 1245 WP_057619249.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  FORC4_RS04830 (FORC4_0907) - 1038717..1039226 (-) 510 WP_005456245.1 DUF2062 domain-containing protein -
  FORC4_RS04835 (FORC4_0908) comEC 1039235..1041493 (+) 2259 WP_057619250.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  FORC4_RS04840 (FORC4_0909) msbA 1041525..1043273 (+) 1749 WP_015296464.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  FORC4_RS04845 (FORC4_0910) lpxK 1043279..1044286 (+) 1008 WP_057619251.1 tetraacyldisaccharide 4'-kinase -
  FORC4_RS04850 (FORC4_0911) - 1044267..1044446 (+) 180 WP_005378451.1 Trm112 family protein -
  FORC4_RS04855 (FORC4_0912) kdsB 1044446..1045201 (+) 756 WP_057619252.1 3-deoxy-manno-octulosonate cytidylyltransferase -

Sequence


Protein


Download         Length: 752 a.a.        Molecular weight: 84801.39 Da        Isoelectric Point: 9.5661

>NTDB_id=132547 FORC4_RS04835 WP_057619250.1 1039235..1041493(+) (comEC) [Vibrio parahaemolyticus strain FORC_004]
MTLLEKSWTLALFVASVISSAWWPTMPDWRWLLLGIITTGSIIKLRRGLISIGVIVGFMVVIVHGNIMEYQRQALFQAGE
NSTIIGRVDSSFTQISHGYEGVVAIKQVNSHTLLPFLKPKVRLITPFPLAVNSEFTTNVLIKPIIGLRNEAGFDAEKQSM
GSGVVARAVVTKDSYWVIRTSSSWREAIIQTVERDIFRLEHFALIKALAFADRTGLTKEDWQSLRDSGLLHLVSISGLHI
GMALTFGLAVGGLIRLVMPRYWFLPSVSGLAFAIVYAWLADFSLPTTRAVSVCIIYIALKYWLVHWSPWRVLLLAVALQL
FFQPFASFSLSFWLSYLSVGAVLFAVNTVQDSKEGRLGKLRILLLTQLILSLLIVPISGYFFSGFSWSSLVYNLVFIPWF
GFVVVPIMFAALIASLLFPMLATVLWYLLDVFLVPLSWSVRYAIGTWQPISAEWTFVIAVVSVVLVLRHVMPRYVWMFVC
VIVVMTGLFPKQYNQTWRIDVLDVGHGLAVLVEKEGRVLLYDTGKAWQNGSIAEQVITPVLHRRGYSSVDTMILSHADND
HAGGRKVIEQYFSPKHKLSSQSFLHYQPCIAAEKWKWQGLNMEVLWPPKPVVRAYNPHSCVISLEDPSTGFKMLFTGDIE
AISEWILLREPEKLRSDVMLVPHHGSKSSSNLKFINAVEPSLAIASTAKLNQWGMPAPEVVQAYTDSGVSWLDTGSDGQI
TILLDGNNWRFESKRRETIEPWYRQMLRNRVE

Nucleotide


Download         Length: 2259 bp        

>NTDB_id=132547 FORC4_RS04835 WP_057619250.1 1039235..1041493(+) (comEC) [Vibrio parahaemolyticus strain FORC_004]
ATGACTCTCTTAGAAAAAAGTTGGACCTTGGCGTTATTTGTCGCGAGCGTAATTTCGTCTGCATGGTGGCCGACGATGCC
GGATTGGCGTTGGTTGCTGCTGGGAATAATTACCACTGGCTCAATAATTAAATTACGTCGAGGCTTAATTAGCATAGGCG
TAATTGTGGGCTTTATGGTTGTCATCGTCCACGGCAATATTATGGAGTATCAGAGACAAGCCCTTTTTCAAGCAGGTGAG
AATAGTACCATAATTGGTAGAGTTGACAGCTCTTTTACGCAAATAAGTCACGGATATGAAGGTGTCGTAGCGATAAAACA
AGTGAATTCTCACACTCTGTTACCTTTTCTTAAACCTAAAGTCCGCCTTATAACGCCATTCCCACTAGCTGTTAACAGTG
AGTTTACGACTAACGTTTTGATTAAGCCCATTATCGGCCTCAGAAATGAAGCGGGTTTCGACGCAGAAAAGCAGTCAATG
GGAAGTGGTGTTGTTGCAAGGGCGGTCGTAACTAAAGATTCATACTGGGTCATTCGTACTTCTTCGTCTTGGCGCGAAGC
TATTATTCAAACTGTTGAGCGTGATATTTTTCGCCTTGAGCATTTTGCTCTAATTAAAGCACTGGCTTTTGCTGATCGCA
CTGGGCTTACCAAAGAGGATTGGCAGTCTCTGCGTGACAGCGGATTACTGCATCTTGTATCGATTTCTGGTTTACACATT
GGGATGGCATTGACGTTTGGGCTCGCCGTTGGTGGGCTTATTCGGCTTGTTATGCCGCGATATTGGTTTCTGCCATCAGT
GAGTGGGTTGGCGTTTGCCATTGTTTACGCTTGGCTTGCTGATTTTTCTTTGCCGACGACCCGAGCCGTTTCTGTTTGTA
TCATCTATATTGCTTTGAAGTATTGGCTTGTTCATTGGAGTCCATGGCGCGTACTGCTACTGGCCGTGGCTTTGCAACTT
TTCTTCCAACCTTTTGCTTCTTTTAGTTTGAGTTTTTGGCTGTCTTATTTATCCGTTGGTGCCGTTTTGTTTGCAGTTAA
CACAGTGCAAGACTCGAAGGAAGGTCGTTTAGGAAAGTTGCGGATACTTCTGTTGACTCAACTGATACTGAGCTTATTGA
TTGTCCCGATCAGTGGCTATTTTTTCTCTGGATTTAGCTGGTCTTCTTTAGTCTACAATTTGGTTTTCATTCCTTGGTTT
GGCTTTGTTGTCGTTCCAATCATGTTTGCTGCTTTAATCGCGTCATTGCTCTTTCCTATGTTGGCGACGGTTCTATGGTA
TTTGCTTGACGTATTCCTTGTGCCACTAAGTTGGTCGGTTCGATATGCCATAGGAACTTGGCAACCCATTAGTGCCGAGT
GGACATTTGTTATTGCTGTAGTGAGTGTCGTGCTCGTTTTAAGACATGTTATGCCTCGTTACGTTTGGATGTTTGTCTGT
GTAATTGTTGTGATGACCGGGTTGTTTCCTAAGCAATATAACCAAACTTGGCGTATTGATGTGCTTGATGTCGGGCATGG
GTTGGCGGTGCTGGTCGAAAAAGAGGGGAGAGTTTTACTCTATGATACGGGCAAGGCTTGGCAAAACGGCAGTATAGCTG
AGCAAGTGATTACGCCAGTACTGCACCGCAGAGGCTATTCAAGTGTCGATACGATGATTTTAAGTCATGCCGATAATGAC
CATGCTGGCGGCCGAAAAGTGATAGAGCAGTACTTTTCACCTAAACACAAACTAAGCAGCCAGAGCTTTTTACATTATCA
GCCTTGTATTGCTGCTGAGAAATGGAAGTGGCAAGGGTTGAACATGGAGGTACTTTGGCCTCCTAAACCTGTTGTACGTG
CATACAATCCCCACTCTTGTGTCATCAGTCTGGAAGACCCTAGTACAGGTTTTAAAATGTTGTTTACTGGTGATATCGAA
GCCATCAGTGAATGGATACTGCTTCGAGAGCCAGAAAAGCTGCGCAGTGATGTAATGCTAGTGCCGCATCATGGAAGCAA
AAGCTCTTCTAACCTTAAGTTTATCAATGCTGTTGAGCCTAGCTTAGCTATTGCTTCAACGGCAAAACTAAACCAGTGGG
GAATGCCCGCACCTGAAGTCGTTCAGGCCTATACCGATAGTGGTGTTAGTTGGTTAGACACTGGAAGTGATGGCCAAATA
ACAATCTTACTTGATGGCAATAACTGGCGTTTTGAAAGTAAACGTCGTGAGACAATTGAGCCTTGGTATAGGCAGATGCT
GCGTAACCGAGTAGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio parahaemolyticus RIMD 2210633

98.936

100

0.989

  comEC Vibrio campbellii strain DS40M4

66.622

100

0.666

  comEC Vibrio cholerae strain A1552

41.083

100

0.414


Multiple sequence alignment