Detailed information    

insolico Bioinformatically predicted

Overview


Name   comF   Type   Machinery gene
Locus tag   GPW86_RS19940 Genome accession   NZ_CP046742
Coordinates   1311175..1312035 (+) Length   286 a.a.
NCBI ID   WP_123011488.1    Uniprot ID   -
Organism   Vibrio cholerae strain 3523-03     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1306175..1317035
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GPW86_RS11815 greB 1306709..1307191 (-) 483 WP_000850492.1 transcription elongation factor GreB -
  GPW86_RS11820 - 1307436..1309757 (+) 2322 WP_123011187.1 Tex family protein -
  GPW86_RS11825 - 1309852..1310322 (-) 471 WP_000961393.1 hypothetical protein -
  GPW86_RS11830 bioH 1310478..1311239 (-) 762 WP_184476266.1 pimeloyl-ACP methyl ester esterase BioH -
  GPW86_RS19940 comF 1311175..1312035 (+) 861 WP_123011488.1 ComF family protein Machinery gene
  GPW86_RS11840 nfuA 1312123..1312710 (+) 588 WP_000070178.1 Fe-S biogenesis protein NfuA -
  GPW86_RS11845 nudE 1312849..1313397 (+) 549 WP_123011189.1 ADP compounds hydrolase NudE -
  GPW86_RS11850 cysQ 1313420..1314247 (+) 828 WP_000106799.1 3'(2'),5'-bisphosphate nucleotidase CysQ -
  GPW86_RS11855 - 1314299..1315057 (-) 759 WP_123011190.1 type II secretion system protein N -
  GPW86_RS11860 - 1315059..1315556 (-) 498 WP_000661347.1 type II secretion system protein M -
  GPW86_RS11865 gspL 1315566..1316777 (-) 1212 WP_184476140.1 type II secretion system protein GspL -

Sequence


Protein


Download         Length: 286 a.a.        Molecular weight: 32522.43 Da        Isoelectric Point: 7.9387

>NTDB_id=406987 GPW86_RS19940 WP_123011488.1 1311175..1312035(+) (comF) [Vibrio cholerae strain 3523-03]
MPQPCTNTRSCPCPLTCQYNAMVTVSPSISIFIHSTNWQDEAHYPDSWLAMLTHWLRKTTAPLLTPECHLCRLALDTNSP
FGVCSACQAWLEHGYRCARCGLPTLTPVDQCGQCLGQPPPWRKLICVGDYRFPLSDAVHQLKYQRQFWQAPRLAKLLATQ
INEPAPLLCSVPLHWQRRWLRGFNQSDLLARELANVLNIEYDYQLFARRRATPHQQGLSKAQRIHNLRDAFVLNHPPNQP
HVAIVDDVVTTGSTIRHLCDLLLDVGVQSIDIYCICRTPEPKDSHG

Nucleotide


Download         Length: 861 bp        

>NTDB_id=406987 GPW86_RS19940 WP_123011488.1 1311175..1312035(+) (comF) [Vibrio cholerae strain 3523-03]
ATGCCCCAGCCGTGTACCAATACCAGATCTTGCCCTTGTCCACTGACTTGCCAATACAACGCCATGGTCACAGTTTCTCC
GTCTATTTCCATCTTCATCCACTCTACCAATTGGCAAGATGAAGCACATTACCCCGATTCTTGGCTCGCTATGTTAACCC
ATTGGCTCAGAAAAACTACCGCACCACTGCTGACTCCGGAATGTCATCTCTGCCGACTGGCGCTCGATACAAATTCGCCG
TTTGGCGTCTGCTCTGCCTGCCAAGCTTGGCTTGAGCACGGTTATCGCTGCGCACGCTGCGGTTTACCGACTCTTACCCC
AGTTGATCAGTGCGGGCAGTGTTTGGGTCAGCCTCCGCCGTGGCGAAAACTGATATGTGTCGGCGATTACCGCTTTCCAC
TTAGTGATGCGGTGCATCAACTCAAATACCAACGCCAGTTTTGGCAAGCGCCGCGCTTGGCTAAGCTACTCGCCACCCAG
ATTAACGAGCCAGCCCCGTTGCTGTGCAGTGTACCGCTACATTGGCAGCGACGTTGGCTACGCGGCTTTAATCAAAGCGA
TCTGCTGGCGCGCGAGTTAGCCAATGTGCTGAACATTGAATATGACTACCAACTCTTTGCACGCCGCCGAGCCACACCGC
ATCAACAAGGACTCAGCAAGGCGCAGCGAATCCACAACCTACGCGATGCATTTGTGCTCAATCATCCGCCAAACCAACCA
CATGTTGCGATTGTGGACGATGTGGTGACCACAGGCAGTACGATCCGACATTTATGCGATTTACTGCTTGATGTCGGTGT
GCAAAGCATTGATATTTACTGCATATGCCGCACTCCCGAGCCGAAAGATAGCCACGGTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comF Vibrio cholerae strain A1552

97.552

100

0.976

  comF Vibrio campbellii strain DS40M4

47.059

83.217

0.392


Multiple sequence alignment