Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comF
Type: Machinery gene
Locus tag: OCU78_RS00690
Genome accession: NZ_AP025490
Coordinates: 157396..158112 (-)
Length: 238 a.a.
NCBI ID: WP_137373969.1
UniProt ID: -
Organism: Vibrio gallaecicus strain CECT 7244
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
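
The coordinates and the protein length in the overview can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the helper name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and that the reported protein length excludes the stop codon.

```python
def cds_lengths(start, end):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_length_bp = end - start + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length_aa = gene_length_bp // 3 - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


# Coordinates of comF (OCU78_RS00690) as listed in the overview
print(cds_lengths(157396, 158112))  # (717, 238)
```

Under these assumptions the result agrees with the 717 bp and 238 a.a. reported for this entry.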

Genomic Context


Location: 152396..163112
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
OCU78_RS00670 | - | 152564..154756 (-) | 2193 | WP_240701752.1 | VWA domain-containing protein | -
OCU78_RS00675 | cysQ | 155049..155876 (-) | 828 | WP_137373966.1 | 3'(2'),5'-bisphosphate nucleotidase CysQ | -
OCU78_RS00680 | nudE | 155888..156457 (-) | 570 | WP_137373967.1 | ADP compounds hydrolase NudE | -
OCU78_RS00685 | nfuA | 156703..157287 (-) | 585 | WP_137373968.1 | Fe-S biogenesis protein NfuA | -
OCU78_RS00690 | comF | 157396..158112 (-) | 717 | WP_137373969.1 | ComF family protein | Machinery gene
OCU78_RS00695 | bioH | 158198..158983 (+) | 786 | WP_137373970.1 | pimeloyl-ACP methyl ester esterase BioH | -
OCU78_RS00700 | - | 159307..159780 (+) | 474 | WP_137373971.1 | ATP-dependent Lon protease | -
OCU78_RS00705 | - | 159917..162250 (-) | 2334 | WP_137373972.1 | Tex family protein | -
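
The sizes in the table above follow from the coordinates, and the distances between neighbouring genes can be derived the same way. The following is a minimal sketch with hypothetical helper names (`size_mismatches`, `intergenic_gaps`), assuming 1-based inclusive coordinates. It also checks an observation that is inferred from the numbers rather than stated in the record: the displayed window appears to be the comF interval extended by 5000 bp on each side.

```python
# (locus_tag, start, end, strand, size_bp) copied from the table above
GENES = [
    ("OCU78_RS00670", 152564, 154756, "-", 2193),
    ("OCU78_RS00675", 155049, 155876, "-", 828),
    ("OCU78_RS00680", 155888, 156457, "-", 570),
    ("OCU78_RS00685", 156703, 157287, "-", 585),
    ("OCU78_RS00690", 157396, 158112, "-", 717),
    ("OCU78_RS00695", 158198, 158983, "+", 786),
    ("OCU78_RS00700", 159307, 159780, "+", 474),
    ("OCU78_RS00705", 159917, 162250, "-", 2334),
]


def size_mismatches(genes):
    """Return locus tags whose listed size differs from end - start + 1."""
    return [tag for tag, start, end, _strand, size in genes
            if end - start + 1 != size]


def intergenic_gaps(genes):
    """Return (left_tag, right_tag, gap_bp) for each adjacent gene pair."""
    ordered = sorted(genes, key=lambda gene: gene[1])
    return [(left[0], right[0], right[1] - left[2] - 1)
            for left, right in zip(ordered, ordered[1:])]


# Window shown under "Location" and the comF interval
window_start, window_end = 152396, 163112
comf_start, comf_end = 157396, 158112
flanks = (comf_start - window_start, window_end - comf_end)

print(size_mismatches(GENES))  # [] means every size matches its coordinates
for left, right, gap in intergenic_gaps(GENES):
    print(f"{left} -> {right}: {gap} bp")
print(flanks)
```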

Sequence


Protein


Length: 238 a.a.; Molecular weight: 27351.81 Da; Isoelectric point: 9.5998

>NTDB_id=92036 OCU78_RS00690 WP_137373969.1 157396..158112(-) (comF) [Vibrio gallaecicus strain CECT 7244]
MLSDWLQKNTQRLFTPLCPLCQLKKSDADSASIFCNTCMEFIASTTRCLRCGLETPSTIEQCGSCLSDPPLWDRLYCVSD
YTFPTSSYIHKFKYTKQFWLARDLASLISNRIDQPAPLITSVPLHWRRYLQRGFNQSDLLARYTAKTLTTKEMKIKSKTL
FKRIRATATQQGLSKRLRQQNLANAFTLIKTSLPKHIAIMDDVVTTGSTVNYLCQLLRERGVERIDIYCICRTPEPSK
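
The FASTA header above packs several fields into one line. The sketch below shows one way to split it. The pattern is an assumption based only on the header shown in this record, not a documented format, and `parse_header` is a hypothetical helper name.

```python
import re

# Field layout inferred from the header shown in this record (an assumption)
HEADER_PATTERN = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line):
    """Split a header line into a dict of named fields."""
    match = HEADER_PATTERN.match(line)
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    # Convert numeric fields from strings to integers
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (">NTDB_id=92036 OCU78_RS00690 WP_137373969.1 "
          "157396..158112(-) (comF) [Vibrio gallaecicus strain CECT 7244]")
record = parse_header(header)
print(record["gene"], record["locus_tag"], record["strand"])
```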

Nucleotide


Length: 717 bp

>NTDB_id=92036 OCU78_RS00690 WP_137373969.1 157396..158112(-) (comF) [Vibrio gallaecicus strain CECT 7244]
ATGTTATCTGATTGGCTACAAAAGAACACCCAACGCCTGTTCACGCCTCTGTGCCCATTATGCCAACTAAAAAAGTCGGA
CGCAGATAGTGCTTCAATCTTTTGCAATACGTGTATGGAGTTTATTGCCTCAACGACTCGCTGCTTACGCTGTGGGTTAG
AAACACCTTCAACCATTGAGCAATGCGGCAGTTGCTTATCTGACCCTCCGCTATGGGACAGGCTATATTGTGTGAGTGAC
TATACCTTCCCTACGTCTTCTTATATTCATAAATTCAAATACACCAAACAGTTTTGGCTAGCTCGGGATTTAGCTTCACT
CATTTCAAACCGTATAGACCAACCCGCACCGCTGATCACAAGCGTCCCTTTGCATTGGCGACGCTATCTACAACGCGGAT
TTAATCAGAGCGATTTACTTGCCCGTTACACAGCGAAAACACTGACGACAAAAGAAATGAAAATTAAAAGCAAAACACTT
TTCAAACGAATAAGAGCGACGGCAACTCAGCAAGGCTTATCTAAGCGCCTTCGACAGCAGAACCTCGCCAATGCATTCAC
ACTAATAAAAACGTCATTACCTAAACACATTGCCATTATGGATGATGTGGTCACAACAGGCAGTACCGTGAACTATTTAT
GCCAGTTACTACGAGAAAGAGGGGTTGAAAGAATTGATATCTATTGTATTTGTCGGACGCCCGAGCCAAGTAAATAA
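
The nucleotide sequence above is given in the coding orientation (it starts with ATG and ends with TAA), even though the gene lies on the (-) strand of the genome. A minimal translation sketch follows, using the standard codon assignments and no external libraries. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 24 nucleotides of the record are used as input, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Excerpts copied from the nucleotide record above
first_30 = "ATGTTATCTGATTGGCTACAAAAGAACACC"
last_24 = "CGGACGCCCGAGCCAAGTAAATAA"

print(translate(first_30))  # MLSDWLQKNT
print(translate(last_24))   # RTPEPSK*
```

Both outputs match the start and end of the protein sequence listed for this entry. The terminal `*` corresponds to the TAA stop codon, which is why 717 bp encode 238 residues.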


Secondary structure


Protein secondary structure was predicted with S4PRED and visualized with seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted with TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comF | Vibrio cholerae strain A1552 | 52.966 | 99.16 | 0.525
comF | Vibrio campbellii strain DS40M4 | 43.621 | 100 | 0.445


Multiple sequence alignment