Detailed information    

In silico: bioinformatically predicted

Overview


Name: comF
Type: Machinery gene
Locus tag: CU052_RS04815
Genome accession: NZ_CP025537
Coordinates: 1014940..1015665 (+)
Length: 241 a.a.
NCBI ID: WP_033007443.1
UniProt ID: A0A8B3DV78
Organism: Vibrio harveyi strain 345
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
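
The coordinates and the protein length above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the span includes the stop codon; the function name `gene_lengths` is hypothetical and not part of the database.

```python
def gene_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding span.

    Assumes 1-based inclusive coordinates and a span that ends with a stop codon.
    """
    nt_len = end - start + 1              # inclusive coordinates
    if nt_len % 3 != 0:
        raise ValueError("coding span is not a multiple of 3")
    aa_len = nt_len // 3 - 1              # the stop codon encodes no residue
    return nt_len, aa_len


print(gene_lengths(1014940, 1015665))     # (726, 241)
```

Under these assumptions the span is 726 bp, which corresponds to 242 codons, or 241 residues plus the stop codon, in agreement with the lengths reported in this record.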

Genomic Context


Location: 1009940..1020665
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| CU052_RS04800 (CU052_05205) | - | 1011065..1013386 (+) | 2322 | WP_017189965.1 | Tex family protein | - |
| CU052_RS04805 (CU052_05210) | - | 1013490..1013954 (-) | 465 | WP_009698973.1 | hypothetical protein | - |
| CU052_RS04810 (CU052_05215) | bioH | 1014091..1014855 (-) | 765 | WP_199828274.1 | pimeloyl-ACP methyl ester esterase BioH | - |
| CU052_RS04815 (CU052_05220) | comF | 1014940..1015665 (+) | 726 | WP_033007443.1 | ComF family protein | Machinery gene |
| CU052_RS04820 (CU052_05225) | nfuA | 1015763..1016347 (+) | 585 | WP_005448480.1 | Fe-S biogenesis protein NfuA | - |
| CU052_RS04825 (CU052_05230) | nudE | 1016523..1017089 (+) | 567 | WP_009698975.1 | ADP compounds hydrolase NudE | - |
| CU052_RS04830 (CU052_05235) | cysQ | 1017137..1017964 (+) | 828 | WP_005448488.1 | 3'(2'),5'-bisphosphate nucleotidase CysQ | - |
| CU052_RS04835 (CU052_05240) | - | 1018254..1019015 (-) | 762 | WP_101904323.1 | type II secretion system protein N | - |
| CU052_RS04840 (CU052_05245) | - | 1019017..1019508 (-) | 492 | WP_005448492.1 | type II secretion system protein M | - |
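
The Location window (1009940..1020665) is numerically consistent with the comF coordinates extended by 5000 bp on each side. That flank size is inferred from the numbers in this record, not stated by the database, so the sketch below treats it as an assumption. It also checks that each listed size equals end - start + 1. The names `FLANK`, `context_window` and `NEIGHBORS` are hypothetical.

```python
FLANK = 5000  # assumption: inferred from the Location line, not documented


def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Extend a 1-based inclusive span by `flank` bp on each side."""
    return max(1, start - flank), end + flank


# (locus tag, start, end, size in bp) copied from the genomic context listing
NEIGHBORS = [
    ("CU052_RS04800", 1011065, 1013386, 2322),
    ("CU052_RS04805", 1013490, 1013954, 465),
    ("CU052_RS04810", 1014091, 1014855, 765),
    ("CU052_RS04815", 1014940, 1015665, 726),
    ("CU052_RS04820", 1015763, 1016347, 585),
    ("CU052_RS04825", 1016523, 1017089, 567),
    ("CU052_RS04830", 1017137, 1017964, 828),
    ("CU052_RS04835", 1018254, 1019015, 762),
    ("CU052_RS04840", 1019017, 1019508, 492),
]

window = context_window(1014940, 1015665)
for tag, start, end, size in NEIGHBORS:
    inside = window[0] <= start and end <= window[1]
    size_ok = (end - start + 1) == size
    print(tag, "inside window:", inside, "size consistent:", size_ok)
```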

Sequence


Protein


Length: 241 a.a.; Molecular weight: 27681.74 Da; Isoelectric point: 8.4332

>NTDB_id=262387 CU052_RS04815 WP_033007443.1 1014940..1015665(+) (comF) [Vibrio harveyi strain 345]
MLSHQWQNIMHRVLSSQCGLCRFPIPHRQTPNLMRWCDSCFDHLTPIKRCSRCGLKMTEEESQSSAECGECLKEPPPWRR
LYTLGDYDFPLSQQVQRFKDHGEAWQVSTLTQLLAERIEQPAPVITSVPVHWQRYIKRGFNQSHILTKHLARHLDVRYES
KIFKRVKSAQSQRGNKKASREQNLQGAFALQGEVDFSHVAIVDDVVTTGSTVRQLCHLLLEVGVESIDIYCICRTPAPGS
I
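
The FASTA header above packs several fields into one line. A minimal parsing sketch follows; the field order is read off this single record, so the pattern is an assumption about the header layout and not a documented format. `HEADER_RE` and `parse_header` are hypothetical names.

```python
import re

# Pattern inferred from one record; other records may deviate from it.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognized header layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=262387 CU052_RS04815 WP_033007443.1 "
    "1014940..1015665(+) (comF) [Vibrio harveyi strain 345]"
)
print(parse_header(header))
```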

Nucleotide


Length: 726 bp

>NTDB_id=262387 CU052_RS04815 WP_033007443.1 1014940..1015665(+) (comF) [Vibrio harveyi strain 345]
ATGTTATCTCATCAATGGCAAAACATCATGCATCGCGTACTCAGTAGCCAATGCGGATTATGCCGATTTCCCATACCTCA
CCGCCAAACGCCCAACCTAATGCGTTGGTGTGACAGCTGTTTTGACCACCTCACACCAATCAAACGCTGCTCACGTTGTG
GTTTAAAAATGACAGAAGAGGAATCACAATCCTCCGCAGAGTGCGGAGAATGCTTGAAAGAGCCACCACCATGGCGGCGT
TTGTATACCTTAGGTGACTATGACTTTCCGCTCTCTCAACAAGTACAGCGCTTTAAAGATCACGGCGAGGCATGGCAGGT
GAGTACGCTTACTCAACTATTAGCAGAACGAATTGAGCAGCCTGCCCCCGTCATCACCAGCGTGCCCGTTCACTGGCAGC
GCTACATAAAACGAGGATTCAATCAGAGCCATATCTTAACCAAACACCTCGCCCGCCATTTAGATGTGCGATATGAAAGC
AAGATATTTAAACGTGTCAAAAGCGCTCAGTCGCAGCGAGGAAACAAAAAAGCCAGTCGCGAGCAAAATCTGCAAGGGGC
ATTTGCCCTACAAGGTGAGGTGGACTTTTCCCACGTCGCGATTGTGGATGATGTGGTTACAACAGGTAGTACGGTCCGAC
AATTATGTCATTTACTACTTGAAGTTGGCGTAGAAAGCATCGATATTTACTGCATCTGCAGAACTCCTGCCCCCGGCTCA
ATCTAG
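
The nucleotide record can be checked against the protein record by translating it. The sketch below builds the standard codon table; for sense and stop codons the bacterial table (NCBI table 11) assigns the same amino acids, so it is assumed adequate here. Only the first 30 nt and the last 12 nt of the sequence above are used as examples, and `translate` is a hypothetical helper.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (standard genetic code); '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1; trailing partial codons are ignored, unknown codons give 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


print(translate("ATGTTATCTCATCAATGGCAAAACATCATG"))  # MLSHQWQNIM
print(translate("GGCTCAATCTAG"))                    # GSI*
```

The two outputs match the first ten residues and the last three residues (followed by the stop) of the protein sequence in this record.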


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source: AlphaFold DB
ID: A0A8B3DV78

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



[Plot: predicted probability of transmembrane helices]


Similar proteins


Only experimentally validated proteins are listed.

| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comF | Vibrio campbellii strain DS40M4 | 91.701 | 100 | 0.917 |
| comF | Vibrio cholerae strain A1552 | 47.479 | 98.755 | 0.469 |
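
This record does not define the Ha-value. The two rows listed are numerically consistent with identity multiplied by coverage, both taken as fractions. The sketch below treats that formula as an inference from these numbers, not as the database's documented definition; `ha_value` is a hypothetical name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed formula: (identity / 100) * (coverage / 100), rounded to 3 places."""
    return round(identity_pct * coverage_pct / 10_000, 3)


print(ha_value(91.701, 100))     # 0.917
print(ha_value(47.479, 98.755))  # 0.469
```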


Multiple sequence alignment