Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comF
Type: Machinery gene
Locus tag: EZZ25_RS19710
Genome accession: NZ_CP036499
Coordinates: 1001597..1002457 (-)
Length: 286 a.a.
NCBI ID: WP_088124522.1
UniProt ID: -
Organism: Vibrio cholerae strain 20000
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
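
The coordinates and the protein length reported above can be cross-checked against each other. The following is a minimal sketch in Python, assuming inclusive 1-based coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS that ends in one stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(1001597, 1002457)
print(cds_bp, protein_length(cds_bp))  # 861 286
```

Both values agree with the 861 bp and 286 a.a. lengths given in this record.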

Genomic Context


Location: 996597..1007457
Locus tag                    Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                   Description
EZZ25_RS04865 (EZZ25_04865)  gspL       996855..998066 (+)    1212       WP_001288747.1  type II secretion system protein GspL     -
EZZ25_RS04870 (EZZ25_04870)  gspM       998076..998573 (+)    498        WP_000661339.1  type II secretion system protein GspM     -
EZZ25_RS04875 (EZZ25_04875)  -          998575..999333 (+)    759        WP_088124520.1  type II secretion system protein N        -
EZZ25_RS04880 (EZZ25_04880)  cysQ       999385..1000212 (-)   828        WP_088124519.1  3'(2'),5'-bisphosphate nucleotidase CysQ  -
EZZ25_RS04885 (EZZ25_04885)  nudE       1000235..1000783 (-)  549        WP_000095812.1  ADP compounds hydrolase NudE              -
EZZ25_RS04890 (EZZ25_04890)  nfuA       1000922..1001509 (-)  588        WP_000070178.1  Fe-S biogenesis protein NfuA              -
EZZ25_RS19710 (EZZ25_04895)  comF       1001597..1002457 (-)  861        WP_088124522.1  ComF family protein                       Machinery gene
EZZ25_RS04900 (EZZ25_04900)  bioH       1002393..1003154 (+)  762        WP_176398116.1  pimeloyl-ACP methyl ester esterase BioH   -
EZZ25_RS04905 (EZZ25_04905)  -          1003317..1003787 (+)  471        WP_000961393.1  hypothetical protein                      -
EZZ25_RS04910 (EZZ25_04910)  -          1003882..1006203 (-)  2322       WP_000030085.1  Tex family protein                        -
EZZ25_RS04915 (EZZ25_04915)  greB       1006448..1006930 (+)  483        WP_000850499.1  transcription elongation factor GreB      -
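
The Location range equals the comF coordinates extended by 5,000 bp on each side. This is an observation from the numbers in this record, not a documented rule of the database. A minimal Python sketch under that assumption, using hypothetical helper names and three of the genes listed above:

```python
FLANK = 5000  # assumed flank size, inferred from the Location range


def context_window(start: int, end: int, flank: int = FLANK) -> tuple:
    """Extend an inclusive gene range by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def within(gene: tuple, window: tuple) -> bool:
    """True if the gene (name, start, end) lies entirely inside the window."""
    _, g_start, g_end = gene
    return window[0] <= g_start and g_end <= window[1]


genes = [
    ("gspL", 996855, 998066),
    ("comF", 1001597, 1002457),
    ("greB", 1006448, 1006930),
]
window = context_window(1001597, 1002457)
print(window)  # (996597, 1007457)
print([name for name, *_ in genes if within((name, *_), window)])
```

The first and last genes of the table both fall inside this window, which is consistent with the table listing only genes fully contained in the displayed region.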

Sequence


Protein


Length: 286 a.a.        Molecular weight: 32493.43 Da        Isoelectric point: 8.0275

>NTDB_id=349244 EZZ25_RS19710 WP_088124522.1 1001597..1002457(-) (comF) [Vibrio cholerae strain 20000]
MPQPCTNTRSCPCPLTCQYNAMVTVSPSISIFIRSANWQDEAHYPDSWLAMLTHWLRKTTAPLLTPECHLCRLALDTNSP
FGVCSACQAWLEHGYRCARCGLPTLTPVDQCGQCLGQPPPWRKLMCVGDYRFPLSDAVHQLKYQCQFWQAPRLAKLLATQ
INEPAPLLCSVPLHWRRRWQRGFNQSDLLARELANVLNIEYDHQLFARRRATPHQQGLSKAQRIHNLRDAFVLNHPPNQP
HVAIVDDVVTTGSTIRHLCDLLLDVGVQSIDIYCICRTPEPKDSHG

Nucleotide


Length: 861 bp

>NTDB_id=349244 EZZ25_RS19710 WP_088124522.1 1001597..1002457(-) (comF) [Vibrio cholerae strain 20000]
ATGCCCCAGCCGTGTACCAATACCAGATCTTGCCCTTGTCCACTGACTTGCCAATACAACGCCATGGTCACAGTTTCTCC
GTCTATTTCCATCTTCATCCGCTCAGCCAATTGGCAAGATGAAGCACATTACCCCGATTCTTGGCTCGCTATGTTAACCC
ATTGGCTCAGAAAAACTACCGCACCACTGCTGACTCCGGAATGTCATCTCTGCCGACTGGCGCTCGATACAAATTCGCCG
TTTGGCGTCTGCTCTGCCTGCCAAGCTTGGCTTGAGCACGGTTATCGCTGCGCACGCTGCGGTTTACCGACTCTTACCCC
AGTTGATCAGTGCGGACAGTGTTTGGGTCAGCCTCCGCCGTGGCGAAAACTGATGTGTGTCGGCGATTACCGCTTCCCAC
TCAGTGATGCGGTACATCAACTCAAATACCAATGCCAGTTTTGGCAAGCGCCGCGCTTGGCTAAGCTACTCGCCACCCAG
ATTAACGAGCCAGCCCCGTTGCTGTGCAGTGTACCGCTGCATTGGCGGCGACGTTGGCAACGCGGCTTTAATCAAAGCGA
TCTGCTGGCACGTGAATTAGCCAATGTGCTGAACATTGAATATGACCACCAACTCTTTGCACGCCGCCGAGCCACACCGC
ATCAACAAGGACTCAGCAAGGCGCAGCGTATCCACAACCTACGTGATGCGTTTGTGCTGAATCATCCGCCAAACCAACCA
CATGTTGCGATTGTGGACGATGTGGTGACCACAGGCAGTACGATCCGACATTTATGCGATTTACTGCTTGATGTCGGTGT
GCAAAGCATTGATATTTACTGCATATGCCGCACTCCCGAGCCGAAAGATAGCCACGGTTGA
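
Although the gene lies on the minus strand, the nucleotide record above is already given in coding orientation (it starts with ATG and ends with TGA), so it can be translated directly. The sketch below is a minimal illustration in Python, assuming the standard amino-acid assignments (shared by the bacterial translation table 11); `translate` and `CODON_TABLE` are names chosen here, not part of any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the record above.
print(translate("ATGCCCCAGCCGTGTACCAATACCAGATCT"))  # MPQPCTNTRS
print(translate("AAAGATAGCCACGGTTGA"))              # KDSHG
```

Both fragments match the start and end of the protein sequence listed in this record.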


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                         Identities (%)  Coverage (%)  Ha-value
comF     Vibrio cholerae strain A1552     98.601          100           0.986
comF     Vibrio campbellii strain DS40M4  46.639          83.217        0.388
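
The record does not define the Ha-value. The two rows above are consistent with the product of identity and coverage, each expressed as a fraction; the Python sketch below assumes that relationship, and `ha_value` is a name chosen here for illustration.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction times coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


rows = [
    ("Vibrio cholerae strain A1552", 98.601, 100.0),
    ("Vibrio campbellii strain DS40M4", 46.639, 83.217),
]
for organism, identity, coverage in rows:
    print(organism, round(ha_value(identity, coverage), 3))
```

Rounded to three decimals, this gives 0.986 and 0.388, matching the listed values.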


Multiple sequence alignment