Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comF
Type: Machinery gene
Locus tag: WDM00_RS01105
Genome accession: NZ_CP147726
Coordinates: 225691..226551 (-)
Length: 286 a.a.
NCBI ID: WP_367007816.1
UniProt ID: -
Organism: Vibrio cholerae strain CNCTC 6536
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake
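
The coordinates and the protein length in this overview can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the usual GenBank convention) and that the nucleotide span includes one terminal stop codon that is not counted in the protein length. The function name `cds_lengths` is illustrative and not part of the database.

```python
def cds_lengths(start, end):
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    aa_len = nt_len // 3 - 1  # the stop codon is not part of the protein
    return nt_len, aa_len


nt_len, aa_len = cds_lengths(225691, 226551)
print(nt_len, aa_len)  # 861 286
```

The result agrees with the 861 bp and 286 a.a. reported in this record.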

Genomic Context


Location: 220691..231551
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
WDM00_RS01075 (WDM00_01075) | gspL | 220949..222160 (+) | 1212 | WP_001288737.1 | type II secretion system protein GspL | -
WDM00_RS01080 (WDM00_01080) | gspM | 222170..222667 (+) | 498 | WP_185841318.1 | type II secretion system protein GspM | -
WDM00_RS01085 (WDM00_01085) | - | 222669..223427 (+) | 759 | WP_057555112.1 | type II secretion system protein N | -
WDM00_RS01090 (WDM00_01090) | cysQ | 223479..224306 (-) | 828 | WP_000106799.1 | 3'(2'),5'-bisphosphate nucleotidase CysQ | -
WDM00_RS01095 (WDM00_01095) | nudE | 224329..224877 (-) | 549 | WP_114974483.1 | ADP compounds hydrolase NudE | -
WDM00_RS01100 (WDM00_01100) | nfuA | 225016..225603 (-) | 588 | WP_000070178.1 | Fe-S biogenesis protein NfuA | -
WDM00_RS01105 (WDM00_01105) | comF | 225691..226551 (-) | 861 | WP_367007816.1 | ComF family protein | Machinery gene
WDM00_RS01110 (WDM00_01110) | bioH | 226487..227248 (+) | 762 | WP_185841320.1 | pimeloyl-ACP methyl ester esterase BioH | -
WDM00_RS01115 (WDM00_01115) | - | 227371..227841 (+) | 471 | WP_000961393.1 | hypothetical protein | -
WDM00_RS01120 (WDM00_01120) | - | 227936..230257 (-) | 2322 | WP_142737551.1 | Tex family protein | -
WDM00_RS01125 (WDM00_01125) | greB | 230502..230984 (+) | 483 | WP_000850492.1 | transcription elongation factor GreB | -
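
The displayed location 220691..231551 matches the comF coordinates extended by 5000 bp on each side. That flank size is inferred from the numbers in this record and is not stated by the source, so it is an assumption in the sketch below. The same coordinates also show that comF and the downstream bioH gene overlap on opposite strands. The helper names are illustrative.

```python
def context_window(start, end, flank=5000):
    """Extend a 1-based inclusive interval by `flank` bp on both sides.

    The 5000 bp default is an assumption inferred from this record.
    """
    return max(1, start - flank), end + flank


def overlap_bp(a, b):
    """Number of shared positions between two 1-based inclusive intervals."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)


comF = (225691, 226551)
bioH = (226487, 227248)

print(context_window(*comF))   # (220691, 231551)
print(overlap_bp(comF, bioH))  # 65
```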

Sequence


Protein


Length: 286 a.a.
Molecular weight: 32498.48 Da
Isoelectric point: 8.0586

>NTDB_id=948215 WDM00_RS01105 WP_367007816.1 225691..226551(-) (comF) [Vibrio cholerae strain CNCTC 6536]
MPQPCTNTRSCPCPLTCQYNAMVTVSPSISIFIHSANWQDEAHYPDSWLAMLTHWLRKTTAPLLTPECHLCRLALDTNSP
FGVCSACQAWLEHGYRCARCGLPTLTPVDQCGQCLGQPPPWRKLMCVGDYRFPLSDAVHQLKYQRQFWQAPRLAKLLATQ
IKEPAPLLCSVPLHWQRRWLRGFNQSDLLARELANVLNIEYDHQLFARRRATPHQQGLSKAQRIHNLRDAFVLNHPPNQP
HVAIVDDVVTTGSTIRHLCDLLLDVGVQSIDIYCICRTPEPKDSHG

Nucleotide


Length: 861 bp

>NTDB_id=948215 WDM00_RS01105 WP_367007816.1 225691..226551(-) (comF) [Vibrio cholerae strain CNCTC 6536]
ATGCCCCAGCCGTGTACCAATACCAGATCTTGCCCTTGTCCACTGACTTGCCAATACAACGCCATGGTCACAGTTTCTCC
GTCTATTTCCATCTTCATCCACTCTGCCAATTGGCAAGATGAAGCACATTACCCCGATTCTTGGCTCGCTATGTTAACCC
ATTGGCTCAGAAAAACTACCGCACCACTGCTGACTCCGGAATGTCATCTCTGCCGACTGGCGCTCGATACAAATTCGCCG
TTTGGCGTCTGCTCTGCCTGCCAAGCTTGGCTTGAGCACGGTTATCGCTGCGCACGCTGCGGTTTACCGACTCTTACCCC
TGTTGATCAGTGCGGGCAGTGTTTGGGTCAGCCTCCGCCGTGGCGAAAACTGATGTGTGTCGGCGATTACCGCTTCCCAC
TCAGTGATGCGGTGCATCAACTCAAATACCAACGCCAGTTTTGGCAAGCGCCGCGCTTGGCTAAGCTACTCGCCACCCAG
ATTAAAGAGCCAGCCCCGTTGCTGTGCAGTGTACCGCTACATTGGCAGCGACGTTGGCTACGCGGCTTTAATCAAAGCGA
TCTGCTGGCGCGCGAGTTAGCCAATGTGCTGAACATTGAATATGACCACCAACTCTTTGCACGCCGCCGAGCCACACCGC
ATCAACAAGGACTCAGCAAGGCGCAGCGAATCCACAACCTACGTGATGCGTTTGTGCTGAATCATCCGCCAAACCAACCA
CATGTTGCGATTGTGGACGATGTGGTGACCACAGGCAGTACGATCCGACATTTATGCGATTTACTGCTTGATGTCGGTGT
GCAAAGCATTGATATTTACTGCATATGCCGTACTCCCGAGCCGAAAGATAGCCACGGTTGA
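
Although comF lies on the minus strand, the nucleotide record above is already given in coding orientation: it starts with ATG and ends with the stop codon TGA. As a quick consistency check, the sketch below translates the first 30 and last 15 nucleotides with the standard genetic code and compares them with the ends of the protein record. The function `translate` is a minimal illustration written for this purpose, not a tool used by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


first_30 = "ATGCCCCAGCCGTGTACCAATACCAGATCT"  # start of the nucleotide record
last_15 = "GATAGCCACGGTTGA"                  # end of the nucleotide record

print(translate(first_30))  # MPQPCTNTRS
print(translate(last_15))   # DSHG*
```

Both fragments agree with the protein record, which begins with MPQPCTNTRS and ends with DSHG.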


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comF | Vibrio cholerae strain A1552 | 98.252 | 100 | 0.983
comF | Vibrio campbellii strain DS40M4 | 47.059 | 83.217 | 0.392
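
The Ha-values listed here are numerically consistent with the product of identity and coverage, each expressed as a fraction. This relationship is inferred from the two rows in this record rather than taken from the database documentation, so the sketch below should be read as an assumption. The function name `ha_value` is illustrative.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed definition: (identity / 100) * (coverage / 100), 3 decimals."""
    return round(identity_pct / 100 * coverage_pct / 100, 3)


print(ha_value(98.252, 100))     # 0.983
print(ha_value(47.059, 83.217))  # 0.392
```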