Detailed information    

in silico (bioinformatically predicted)

Overview


Name: comF
Type: Machinery gene
Locus tag: ACGHCW_RS15510
Genome accession: NZ_AP031614
Coordinates: 3352587..3353312 (+)
Length: 241 a.a.
NCBI ID: WP_009700712.1
UniProt ID: A0A454CZW2
Organism: Vibrio harveyi strain TUMSAT-2019
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake

Genomic Context


Location: 3347587..3358312
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
ACGHCW_RS15495 (VHTUMSATKI_29640) | - | 3348712..3351033 (+) | 2322 | WP_390502083.1 | Tex family protein | -
ACGHCW_RS15500 (VHTUMSATKI_29650) | - | 3351137..3351601 (-) | 465 | WP_390502084.1 | ATP-dependent Lon protease | -
ACGHCW_RS15505 (VHTUMSATKI_29660) | bioH | 3351738..3352502 (-) | 765 | WP_005448476.1 | pimeloyl-ACP methyl ester esterase BioH | -
ACGHCW_RS15510 (VHTUMSATKI_29670) | comF | 3352587..3353312 (+) | 726 | WP_009700712.1 | ComF family protein | Machinery gene
ACGHCW_RS15515 (VHTUMSATKI_29680) | nfuA | 3353410..3353994 (+) | 585 | WP_005448480.1 | Fe-S biogenesis protein NfuA | -
ACGHCW_RS15520 (VHTUMSATKI_29690) | nudE | 3354170..3354736 (+) | 567 | WP_009698975.1 | ADP compounds hydrolase NudE | -
ACGHCW_RS15525 (VHTUMSATKI_29700) | cysQ | 3354784..3355611 (+) | 828 | WP_005448488.1 | 3'(2'),5'-bisphosphate nucleotidase CysQ | -
ACGHCW_RS15530 (VHTUMSATKI_29710) | - | 3355901..3356662 (-) | 762 | WP_005448490.1 | type II secretion system protein N | -
ACGHCW_RS15535 (VHTUMSATKI_29720) | - | 3356664..3357155 (-) | 492 | WP_005448492.1 | type II secretion system protein M | -
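
The coordinates in this record are 1-based and inclusive, so each Size (bp) value equals end minus start plus one. The sketch below checks that arithmetic for comF and its neighbours. The helper names are hypothetical, and the 5000 bp flank is an assumption inferred from the Location window (3347587..3358312), not a value stated by the record.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate span."""
    return end - start + 1


def flank_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Window around a gene; the 5000 bp default is an inferred assumption."""
    return (max(1, start - flank), end + flank)


def intergenic_gap(upstream_end: int, downstream_start: int) -> int:
    """Number of bases strictly between two neighbouring features."""
    return downstream_start - upstream_end - 1


comf_start, comf_end = 3352587, 3353312

print(span_length(comf_start, comf_end))    # size of comF in bp
print(flank_window(comf_start, comf_end))   # window shown under "Location"
print(intergenic_gap(3352502, comf_start))  # gap between bioH and comF
print(intergenic_gap(comf_end, 3353410))    # gap between comF and nfuA
```

With the values from the table, the comF span is 726 bp and the window reproduces the Location line, which is what motivates the 5000 bp assumption.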

Sequence


Protein


Length: 241 a.a.; Molecular weight: 27713.80 Da; Isoelectric point: 8.4332

>NTDB_id=110684 ACGHCW_RS15510 WP_009700712.1 3352587..3353312(+) (comF) [Vibrio harveyi strain TUMSAT-2019]
MLSHQWQNIMHRVLSSQCGLCRFPIPHRQTPNLMRWCDSCFDHLTPIKRCSRCGLKMTEEESQSSAECGECLKEPPPWRR
LYTLGDYDFPLSQQVQRFKDHGEAWQVSTLTQLLAERIEQPAPVITSVPVHWQRYIKRGFNQSHILTKHLARHLDVRYES
KIFKRVKSAQSQRGNKKASREQNLQGAFALQGEVDFSHVAIMDDVVTTGSTVRQLCHLLLEVGVESIDIYCICRTPAPGS
I

Nucleotide


Length: 726 bp

>NTDB_id=110684 ACGHCW_RS15510 WP_009700712.1 3352587..3353312(+) (comF) [Vibrio harveyi strain TUMSAT-2019]
ATGTTATCTCATCAATGGCAAAACATCATGCATCGCGTACTCAGTAGCCAATGCGGATTATGCCGATTTCCCATACCTCA
CCGCCAAACGCCCAACCTAATGCGTTGGTGTGACAGCTGTTTTGACCACCTCACACCAATCAAACGCTGCTCACGTTGTG
GTTTAAAAATGACAGAAGAGGAATCACAATCCTCCGCAGAGTGCGGAGAATGCTTGAAAGAGCCACCACCATGGCGGCGT
TTGTATACCTTAGGTGACTATGACTTTCCGCTCTCTCAACAAGTACAGCGCTTTAAAGATCACGGCGAGGCATGGCAGGT
GAGTACGCTTACTCAACTATTAGCAGAACGAATTGAGCAGCCTGCCCCCGTCATCACCAGCGTGCCCGTTCACTGGCAGC
GCTACATAAAACGAGGCTTCAATCAGAGCCATATCTTAACCAAACACCTCGCCCGCCATTTAGATGTGCGATATGAAAGC
AAGATATTTAAACGTGTCAAAAGCGCTCAGTCGCAGCGAGGAAACAAAAAAGCCAGTCGCGAGCAAAATCTGCAAGGGGC
ATTTGCCCTACAAGGTGAGGTGGACTTTTCCCACGTCGCGATTATGGATGATGTGGTTACAACAGGTAGTACGGTCCGAC
AATTATGTCATTTACTACTTGAAGTTGGCGTAGAAAGCATCGATATTTACTGCATCTGCAGAACTCCTGCCCCCGGCTCA
ATCTAG
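
The nucleotide and protein lengths are related by the reading frame: 726 bp is 242 codons, and dropping the terminal stop codon leaves 241 residues. A minimal sketch of that check follows. It assumes the standard codon assignments (NCBI translation table 11, used for bacteria, has the same codon-to-residue mapping), and for brevity it translates only the first 30 and last 12 nucleotides copied from the sequence above. The function names are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nt: str) -> str:
    """Translate in frame 1; '*' marks a stop codon. Trailing partial codons are ignored."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS that ends in one stop codon."""
    return cds_length_bp // 3 - 1


first_30 = "ATGTTATCTCATCAATGGCAAAACATCATG"  # start of the comF CDS
last_12 = "GGCTCAATCTAG"                     # end of the comF CDS

print(translate(first_30))   # should match the start of the protein sequence
print(translate(last_12))    # should end in the stop codon
print(protein_length(726))   # should match the stated protein length
```

The same `translate` function can be applied to the full 726 bp sequence to compare against the protein FASTA entry.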


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source | ID
AlphaFold DB | A0A454CZW2

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comF | Vibrio campbellii strain DS40M4 | 91.286 | 100 | 0.913
comF | Vibrio cholerae strain A1552 | 47.059 | 98.755 | 0.465
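
The record does not define the Ha-value, but both listed values are numerically consistent with the product of identity and coverage, each expressed as a fraction. The sketch below reproduces them under that assumption. The formula and the function name `ha_value` are inferred for illustration, not taken from the database's documentation.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100), rounded to 3 decimals."""
    return round((identity_pct / 100.0) * (coverage_pct / 100.0), 3)


# (organism, identities %, coverage %) as listed in the table above
hits = [
    ("Vibrio campbellii strain DS40M4", 91.286, 100.0),
    ("Vibrio cholerae strain A1552", 47.059, 98.755),
]

for organism, identity, coverage in hits:
    print(f"{organism}: {ha_value(identity, coverage)}")
```

Under this reading, a hit scores highly only when it is both similar and aligned over most of the query, which is why the V. cholerae homolog scores about half of the V. campbellii one despite near-complete coverage.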


Multiple sequence alignment