Detailed information    

insolico Bioinformatically predicted

Overview


Name   comF   Type   Machinery gene
Locus tag   GPY31_RS22800 Genome accession   NZ_CP046760
Coordinates   2925050..2925775 (+) Length   241 a.a.
NCBI ID   WP_005459005.1    Uniprot ID   Q87TC3
Organism   Vibrio parahaemolyticus strain AM51552     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2920050..2930775
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GPY31_RS22785 - 2921153..2923474 (+) 2322 WP_025522200.1 Tex family protein -
  GPY31_RS22790 - 2923610..2924074 (-) 465 WP_005459035.1 hypothetical protein -
  GPY31_RS22795 bioH 2924198..2924965 (-) 768 WP_029817328.1 pimeloyl-ACP methyl ester esterase BioH -
  GPY31_RS22800 comF 2925050..2925775 (+) 726 WP_005459005.1 amidophosphoribosyltransferase Machinery gene
  GPY31_RS22805 nfuA 2925873..2926457 (+) 585 WP_005458964.1 Fe-S biogenesis protein NfuA -
  GPY31_RS22810 nudE 2926678..2927247 (+) 570 WP_005459019.1 ADP compounds hydrolase NudE -
  GPY31_RS22815 cysQ 2927289..2928116 (+) 828 WP_017449059.1 3'(2'),5'-bisphosphate nucleotidase CysQ -
  GPY31_RS22820 - 2928402..2929163 (-) 762 WP_005496730.1 type II secretion system protein N -
  GPY31_RS22825 - 2929165..2929656 (-) 492 WP_005458943.1 type II secretion system protein M -

Sequence


Protein


Download         Length: 241 a.a.        Molecular weight: 27770.91 Da        Isoelectric Point: 8.9991

>NTDB_id=407298 GPY31_RS22800 WP_005459005.1 2925050..2925775(+) (comF) [Vibrio parahaemolyticus strain AM51552]
MLSHHWQNIMHRVLSSQCGLCRFPIQATAQPNALRWCDHCYQYLTPVKRCQRCGLSLKAEEANIESICGECLSEPPPWQR
LFTLGDYDFPLSREVQRFKDHGQTWHVRALTQLLAQRISTPAPLITTVPLHWQRYFYRGFNQSDILARHLAGHLNVRFDN
HVFRRVKHVQSQRGYKKSSREQNLKGAFTLNQPPKYNHVAIVDDVVTTGSTVRQLCHLLLEVGVETVDIYCICRTPAPGA
V

Nucleotide


Download         Length: 726 bp        

>NTDB_id=407298 GPY31_RS22800 WP_005459005.1 2925050..2925775(+) (comF) [Vibrio parahaemolyticus strain AM51552]
ATGTTATCTCATCACTGGCAAAACATCATGCATCGTGTGCTCAGCAGTCAATGCGGTTTATGTCGCTTCCCGATTCAGGC
TACCGCTCAACCCAATGCGCTGCGTTGGTGTGATCACTGTTATCAATATCTTACGCCAGTAAAACGCTGCCAACGTTGTG
GATTGAGCTTAAAAGCAGAGGAAGCGAATATAGAGAGTATTTGCGGCGAGTGCCTCTCCGAGCCTCCACCGTGGCAACGG
CTATTTACCTTGGGAGACTACGATTTTCCGCTGTCTCGAGAAGTACAACGCTTCAAAGATCACGGACAAACATGGCATGT
TCGCGCTTTAACGCAATTGCTTGCCCAGCGCATTTCAACTCCCGCTCCGCTTATCACCACAGTGCCATTGCACTGGCAAC
GCTACTTTTATCGAGGCTTTAATCAGAGCGACATACTGGCGCGACATTTGGCTGGTCACCTTAATGTGAGGTTTGATAAT
CACGTGTTTCGCCGCGTAAAACACGTCCAGTCGCAGCGTGGGTACAAGAAATCCAGCCGAGAACAGAATTTAAAAGGCGC
TTTCACCTTAAATCAGCCACCAAAGTATAACCACGTCGCAATCGTAGATGATGTGGTCACGACGGGAAGCACGGTTCGAC
AATTATGTCATTTACTACTTGAAGTTGGCGTAGAAACCGTCGATATTTACTGCATCTGCAGAACCCCTGCTCCTGGTGCT
GTCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q87TC3

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comF Vibrio campbellii strain DS40M4

72.199

100

0.722

  comF Vibrio cholerae strain A1552

49.16

98.755

0.485


Multiple sequence alignment