Detailed information    

insolico Bioinformatically predicted

Overview


Name   comF   Type   Machinery gene
Locus tag   N5E84_RS11995 Genome accession   NZ_CP104554
Coordinates   2596992..2597693 (+) Length   233 a.a.
NCBI ID   WP_019282843.1    Uniprot ID   A0A233HRJ8
Organism   Vibrio sp. J502     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2591992..2602693
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  N5E84_RS11975 (N5E84_11975) greB 2592039..2592521 (-) 483 WP_019282841.1 transcription elongation factor GreB -
  N5E84_RS11980 (N5E84_11980) - 2592865..2595195 (+) 2331 WP_019282842.1 Tex family protein -
  N5E84_RS11985 (N5E84_11985) - 2595345..2595815 (-) 471 WP_026027513.1 hypothetical protein -
  N5E84_RS11990 (N5E84_11990) bioH 2596081..2596845 (-) 765 WP_038148688.1 pimeloyl-ACP methyl ester esterase BioH -
  N5E84_RS11995 (N5E84_11995) comF 2596992..2597693 (+) 702 WP_019282843.1 amidophosphoribosyltransferase Machinery gene
  N5E84_RS12000 (N5E84_12000) nfuA 2597793..2598380 (+) 588 WP_013855425.1 Fe-S biogenesis protein NfuA -
  N5E84_RS12005 (N5E84_12005) nudE 2598536..2599081 (+) 546 WP_013855426.1 ADP compounds hydrolase NudE -
  N5E84_RS12010 (N5E84_12010) cysQ 2599106..2599933 (+) 828 WP_010319313.1 3'(2'),5'-bisphosphate nucleotidase CysQ -
  N5E84_RS12015 (N5E84_12015) - 2600015..2600773 (-) 759 WP_019282844.1 type II secretion system protein N -
  N5E84_RS12020 (N5E84_12020) - 2600770..2601267 (-) 498 WP_013855429.1 type II secretion system protein M -
  N5E84_RS12025 (N5E84_12025) gspL 2601278..2602489 (-) 1212 WP_017049386.1 type II secretion system protein GspL -

Sequence


Protein


Download         Length: 233 a.a.        Molecular weight: 26630.04 Da        Isoelectric Point: 9.4459

>NTDB_id=730452 N5E84_RS11995 WP_019282843.1 2596992..2597693(+) (comF) [Vibrio sp. J502]
MLSHWVQKTITRLVSRQCLLCRLPIETHQTGAWCKTCLTYFAPQPRCQQCGLPTLITVPQCGQCLANPPPWQRLYCVGDY
IFPLSHTIHQLKYEGQFWQSRHLSALLTPRIDTPAPLITSVPLHWQRRLKRGFNQSALLASQLSQQLGASCDNQLIKRNR
ATPQQQGLSKLQRKQNLKHAFTLRHLPTHKHIALVDDVVTTGSTVQQICQLLLEVGVERIDIYCICRTPEPKD

Nucleotide


Download         Length: 702 bp        

>NTDB_id=730452 N5E84_RS11995 WP_019282843.1 2596992..2597693(+) (comF) [Vibrio sp. J502]
ATGTTATCGCATTGGGTTCAAAAAACCATCACTCGCTTGGTAAGCCGTCAGTGTCTACTGTGTCGTTTACCGATTGAAAC
TCACCAAACTGGGGCTTGGTGTAAGACCTGCCTAACCTATTTTGCACCGCAACCTCGCTGCCAACAGTGCGGCCTGCCAA
CGCTTATTACCGTGCCTCAATGTGGGCAATGTTTAGCGAATCCTCCGCCTTGGCAACGCCTATACTGTGTCGGCGACTAT
ATTTTTCCGCTCAGTCACACCATTCATCAACTCAAATATGAAGGTCAATTTTGGCAATCTCGCCATTTGAGTGCCTTACT
GACGCCACGTATCGACACTCCCGCCCCGCTCATCACTAGCGTTCCGTTACACTGGCAACGGCGCTTAAAACGCGGCTTCA
ATCAAAGTGCGCTACTTGCCTCTCAATTAAGTCAACAACTGGGGGCTAGCTGCGATAACCAGTTGATCAAACGCAATCGC
GCAACACCCCAGCAGCAAGGGCTTTCTAAATTACAGCGCAAACAAAATCTCAAACATGCATTCACACTACGACACCTTCC
CACGCACAAGCACATTGCACTTGTTGACGATGTCGTGACCACGGGCAGCACCGTGCAGCAAATCTGTCAATTACTGCTTG
AAGTCGGTGTCGAAAGGATTGATATTTACTGCATATGCCGCACTCCTGAACCTAAAGATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A233HRJ8

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comF Vibrio cholerae strain A1552

58.369

100

0.584

  comF Vibrio campbellii strain DS40M4

51.681

100

0.528