Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   BA891_RS04845 Genome accession   NZ_CP016347
Coordinates   1053983..1056241 (+) Length   752 a.a.
NCBI ID   WP_065302141.1    Uniprot ID   -
Organism   Vibrio natriegens strain CCUG 16371     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1048983..1061241
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  BA891_RS04820 (BA891_04820) - 1049336..1049905 (-) 570 WP_065302138.1 PilZ domain-containing protein -
  BA891_RS04825 (BA891_04825) lolC 1050157..1051365 (+) 1209 WP_065296581.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  BA891_RS04830 (BA891_04830) lolD 1051358..1052065 (+) 708 WP_065302139.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  BA891_RS04835 (BA891_04835) lolE 1052068..1053312 (+) 1245 WP_065302140.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  BA891_RS04840 (BA891_04840) - 1053465..1053974 (-) 510 WP_014231286.1 DUF2062 domain-containing protein -
  BA891_RS04845 (BA891_04845) comEC 1053983..1056241 (+) 2259 WP_065302141.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  BA891_RS04850 (BA891_04850) msbA 1056273..1058021 (+) 1749 WP_014231288.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  BA891_RS04855 (BA891_04855) lpxK 1058025..1059032 (+) 1008 WP_065302142.1 tetraacyldisaccharide 4'-kinase -
  BA891_RS04860 (BA891_04860) - 1059013..1059192 (+) 180 WP_014231290.1 Trm112 family protein -
  BA891_RS04865 (BA891_04865) kdsB 1059192..1059950 (+) 759 WP_014231291.1 3-deoxy-manno-octulosonate cytidylyltransferase -

Sequence


Protein


Download         Length: 752 a.a.        Molecular weight: 84631.65 Da        Isoelectric Point: 8.5414

>NTDB_id=187155 BA891_RS04845 WP_065302141.1 1053983..1056241(+) (comEC) [Vibrio natriegens strain CCUG 16371]
MTLSEKSCTLALFVASIVSSAWWPIMPDWRWLLLGIITTGSIIKLRRGLISIGVILGLMVVIIHGNVLEYQRQALFNVGE
NSTITGKVDSSFNQISHGYEGLVTIKQVGDQSLLPFLKPKVRFITTFPIPVNSEFTTTAQVKPIIGLRNEAGFDAEKQAM
GNGILARMIVTGDAHWIIRNGRSVRQSIINLVMDDISHLDHFPLISALVFADRSWLSGDDWQGLRDSGLLHLASISGLHI
GMAFSFGFLLGVSVRSVFPRYQVVPSITGLTVALCYAWLADFSLPTTRAVSVCVIYIVLKYFLVHWSTWRVLLLAVAFQL
LIQPFASFSMSFWLSYLSVGMVLLVINFVRFNNSSWRGKLRTLFVTQLALSVFVIPISGYFFSGFSLSSIAYNLVFIPWF
GFVVVPLMFIALSTSLLLPTFSPIVWQLMDFSLWPLSESLQYAFGTWQPLSIELTWILFSLCIFLVLKRLLLWQGWLLLC
VITIIVSGLNGRKSTSWRIDVLDVGHGLAVLVEKEGKALIYDTGKSWPEGSVAEQVIIPVLHRRGFREVDTLVLSHSDND
HAGGRQVIETHLKPSYKRSSQDFSGYQPCVSGDNWTWQQLEFEVLWPPQMVTRAYNPHSCVFRLRDRESDFSMLFTGDIE
SISEWILLREPEKLSSDVMLVPHHGSKSSSNPLFIHAVSPILAVASLAKNNQWGMPADKVVTSYLNAGSLWLDTGEGGQV
TIRVTKDKWDFVTKRNDTFEPWYRQMLRKGLE

Nucleotide


Download         Length: 2259 bp        

>NTDB_id=187155 BA891_RS04845 WP_065302141.1 1053983..1056241(+) (comEC) [Vibrio natriegens strain CCUG 16371]
ATGACTCTCTCAGAAAAAAGTTGCACCTTGGCGTTATTTGTCGCAAGCATTGTCTCATCGGCATGGTGGCCGATAATGCC
AGATTGGCGCTGGCTGCTGCTGGGAATAATTACCACTGGTTCAATAATTAAATTACGTCGTGGCTTAATTAGCATAGGCG
TAATTTTGGGCTTAATGGTAGTCATCATCCACGGCAATGTATTGGAGTATCAGCGACAAGCCCTTTTTAATGTAGGGGAG
AATAGTACCATAACTGGCAAAGTTGACAGCTCTTTTAATCAAATAAGTCATGGATATGAAGGACTCGTGACGATAAAACA
AGTGGGTGATCAAAGTCTGTTACCTTTTCTTAAACCTAAAGTCCGTTTTATCACCACCTTTCCTATCCCTGTTAACAGTG
AATTTACGACCACGGCTCAGGTAAAGCCCATTATTGGGTTACGTAACGAAGCGGGTTTCGACGCTGAAAAGCAGGCAATG
GGAAATGGTATTTTAGCCAGAATGATTGTGACTGGCGATGCGCACTGGATTATTCGTAACGGACGTTCTGTACGTCAGTC
GATAATCAACCTGGTAATGGATGATATCTCCCATCTAGACCACTTTCCGTTAATCAGTGCTCTGGTATTTGCTGATAGAA
GCTGGCTTTCTGGTGATGACTGGCAAGGACTGAGAGACAGTGGATTATTGCATCTGGCCTCTATCTCCGGGCTGCACATT
GGAATGGCGTTTAGCTTTGGTTTTTTGCTCGGTGTAAGTGTTCGCTCCGTTTTTCCGCGTTATCAAGTGGTGCCTTCAAT
TACGGGGTTAACCGTCGCGCTATGTTATGCCTGGCTCGCTGATTTTTCTCTGCCAACCACTCGCGCCGTATCGGTTTGTG
TCATCTACATTGTTCTAAAATACTTTTTGGTTCACTGGAGTACCTGGCGAGTGTTGCTCCTGGCGGTTGCTTTTCAGTTA
CTTATTCAGCCTTTCGCATCTTTTAGTATGAGTTTCTGGTTGTCTTATCTCTCTGTGGGTATGGTTCTGTTGGTCATTAA
CTTTGTGCGGTTCAATAACAGTAGCTGGCGAGGAAAACTGCGAACCTTGTTTGTGACTCAACTTGCCTTGAGTGTATTTG
TCATACCTATCAGTGGCTATTTTTTCTCTGGTTTCAGCTTATCGTCCATCGCTTATAACCTTGTGTTTATTCCATGGTTT
GGGTTTGTCGTCGTTCCTTTAATGTTTATCGCGTTATCTACTTCTTTATTGCTACCCACATTCTCGCCAATAGTTTGGCA
ATTGATGGATTTCTCTTTATGGCCGTTAAGCGAATCTCTTCAGTATGCCTTTGGTACGTGGCAACCATTGTCGATAGAAC
TTACGTGGATACTGTTTTCTCTCTGTATTTTCTTGGTTTTGAAGCGGTTATTGCTTTGGCAGGGTTGGCTATTGCTGTGC
GTTATTACCATCATCGTTTCAGGTCTAAATGGACGAAAAAGTACCAGTTGGCGAATCGATGTTCTCGATGTCGGTCATGG
TCTTGCCGTTCTGGTAGAAAAAGAGGGTAAAGCTCTGATTTACGATACTGGAAAATCCTGGCCGGAGGGCAGTGTTGCGG
AGCAGGTGATTATACCTGTGTTACATCGCAGAGGATTTAGAGAGGTTGATACCTTGGTTCTCAGTCATTCGGACAATGAC
CACGCAGGAGGAAGGCAAGTGATTGAAACGCACCTCAAGCCGAGCTATAAACGCAGCAGCCAAGATTTTAGTGGGTATCA
ACCTTGTGTGAGCGGTGATAATTGGACGTGGCAGCAACTCGAATTCGAAGTGCTCTGGCCACCCCAAATGGTCACTCGAG
CCTACAACCCTCACTCGTGTGTCTTTAGACTCAGAGACCGAGAATCTGATTTTTCCATGCTGTTCACGGGTGATATTGAG
TCGATTAGTGAGTGGATACTTCTTCGTGAGCCTGAAAAATTATCTAGTGATGTGATGCTCGTTCCTCATCACGGTAGTAA
AAGTTCCTCCAATCCTCTTTTTATCCATGCAGTCTCTCCAATATTAGCGGTAGCATCACTGGCGAAAAACAATCAGTGGG
GAATGCCTGCTGATAAGGTAGTAACCTCGTATCTCAATGCAGGGTCACTATGGCTTGATACTGGTGAGGGTGGACAAGTT
ACTATCCGAGTAACGAAAGATAAGTGGGATTTTGTAACCAAACGCAATGATACATTTGAGCCTTGGTATAGGCAGATGCT
GCGTAAGGGGTTAGAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio parahaemolyticus RIMD 2210633

66.489

100

0.665

  comEC Vibrio campbellii strain DS40M4

63.697

100

0.637

  comEC Vibrio cholerae strain A1552

38.492

100

0.387


Multiple sequence alignment