Detailed information    

insolico Bioinformatically predicted

Overview


Name   comF   Type   Machinery gene
Locus tag   GPY01_RS19110 Genome accession   NZ_CP046839
Coordinates   2137312..2138172 (-) Length   286 a.a.
NCBI ID   WP_001908542.1    Uniprot ID   -
Organism   Vibrio cholerae strain 2011EL-1271     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2132312..2143172
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GPY01_RS14355 gspL 2132570..2133781 (+) 1212 WP_001288741.1 type II secretion system protein GspL -
  GPY01_RS14360 gspM 2133791..2134288 (+) 498 WP_000661339.1 type II secretion system protein GspM -
  GPY01_RS14365 - 2134290..2135048 (+) 759 WP_000816415.1 type II secretion system protein N -
  GPY01_RS14370 cysQ 2135100..2135927 (-) 828 WP_000106798.1 3'(2'),5'-bisphosphate nucleotidase CysQ -
  GPY01_RS14375 nudE 2135950..2136498 (-) 549 WP_000095812.1 ADP compounds hydrolase NudE -
  GPY01_RS14380 nfuA 2136637..2137224 (-) 588 WP_000070178.1 Fe-S biogenesis protein NfuA -
  GPY01_RS19110 comF 2137312..2138172 (-) 861 WP_001908542.1 ComF family protein Machinery gene
  GPY01_RS14390 bioH 2138108..2138869 (+) 762 WP_184480209.1 pimeloyl-ACP methyl ester esterase BioH -
  GPY01_RS14395 - 2139030..2139500 (+) 471 WP_000961393.1 hypothetical protein -
  GPY01_RS14400 - 2139595..2141916 (-) 2322 WP_000030079.1 Tex family protein -
  GPY01_RS14405 greB 2142161..2142643 (+) 483 WP_000850492.1 transcription elongation factor GreB -

Sequence


Protein


Download         Length: 286 a.a.        Molecular weight: 32401.36 Da        Isoelectric Point: 7.9437

>NTDB_id=408706 GPY01_RS19110 WP_001908542.1 2137312..2138172(-) (comF) [Vibrio cholerae strain 2011EL-1271]
MPQPCTNTRSCPCPLTCQYNAMVTVSPSISIFIHSANWQDEAHYPDSWLAMLTHWLRKTTAPLLTPECHLCRLALDTNSP
FGVCSACQAWLEHGYRCARCGLPTLTPVDQCGQCLGQPPPWRKLMCVGDYRFPLSDVVHQLKYQRQFWQAPRLAKLLATQ
INEPAPLLCSVPLHWQRRWLRGFNQSDLLARELANVLNIEYDHQLFARRRATPHQQGLSKAQRIHNLSGAFVLNHLPNQP
HVAIVDDVVTTGSTIRHLCDLLLDVGVQSIDIYCICRTPEPKDSHG

Nucleotide


Download         Length: 861 bp        

>NTDB_id=408706 GPY01_RS19110 WP_001908542.1 2137312..2138172(-) (comF) [Vibrio cholerae strain 2011EL-1271]
ATGCCCCAGCCGTGTACCAATACCAGATCTTGCCCTTGTCCACTGACTTGCCAATACAACGCCATGGTCACAGTTTCTCC
GTCTATTTCCATCTTCATCCACTCTGCCAATTGGCAAGATGAAGCACATTACCCCGATTCTTGGCTCGCTATGTTAACCC
ATTGGCTCAGAAAAACTACCGCACCACTGCTGACTCCGGAATGTCATCTCTGCCGACTGGCGCTCGATACAAATTCGCCG
TTTGGCGTCTGCTCTGCCTGCCAAGCTTGGCTTGAGCACGGTTATCGCTGCGCACGCTGCGGTTTACCGACTCTTACCCC
AGTTGATCAGTGCGGGCAGTGTTTGGGTCAGCCTCCGCCGTGGCGAAAACTGATGTGTGTCGGCGATTACCGCTTTCCAC
TTAGTGATGTGGTGCATCAACTCAAATACCAACGCCAGTTTTGGCAAGCGCCGCGCTTGGCTAAGCTACTCGCCACCCAG
ATTAACGAGCCAGCCCCGTTGCTGTGCAGTGTACCGCTACATTGGCAGCGACGTTGGCTACGCGGCTTTAATCAAAGCGA
TCTGCTGGCGCGCGAGTTAGCCAATGTGCTGAACATTGAATATGACCACCAACTCTTTGCACGCCGCCGAGCCACACCGC
ATCAACAAGGACTCAGCAAGGCGCAGCGAATCCACAACCTGAGCGGTGCGTTTGTGCTCAATCACCTACCAAACCAACCA
CATGTTGCGATTGTGGACGATGTGGTGACCACAGGCAGTACGATCCGACATTTATGCGATTTACTGCTTGATGTCGGTGT
GCAAAGCATTGATATTTACTGCATATGCCGCACTCCCGAGCCGAAAGATAGCCACGGTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comF Vibrio cholerae strain A1552

97.203

100

0.972

  comF Vibrio campbellii strain DS40M4

47.479

83.217

0.395


Multiple sequence alignment