Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comF
Type: Machinery gene
Locus tag: ITG12_RS10265
Genome accession: NZ_CP080238
Coordinates: 2218773..2219498 (+)
Length: 241 a.a.
NCBI ID: WP_038870079.1
UniProt ID: A0A2A2N5E6
Organism: Vibrio sp. ED002
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
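
The coordinates and the protein length listed here can be cross-checked with simple arithmetic: an inclusive range of 2218773..2219498 spans 726 bp, which is 242 codons, i.e. 241 amino acids plus one stop codon. A minimal sketch follows; the helper name `span_length` is illustrative only and is not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


gene_bp = span_length(2218773, 2219498)
codons = gene_bp // 3
# Assumption: the last codon of the coding sequence is a stop codon.
protein_aa = codons - 1

print(gene_bp, codons, protein_aa)
```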

Genomic Context


Location: 2213773..2224498
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
ITG12_RS10250 (ITG12_10170) | - | 2214898..2217219 (+) | 2322 | WP_039975908.1 | Tex family protein | -
ITG12_RS10255 (ITG12_10175) | - | 2217324..2217788 (-) | 465 | WP_038870081.1 | hypothetical protein | -
ITG12_RS10260 (ITG12_10180) | bioH | 2217924..2218688 (-) | 765 | WP_248873049.1 | pimeloyl-ACP methyl ester esterase BioH | -
ITG12_RS10265 (ITG12_10185) | comF | 2218773..2219498 (+) | 726 | WP_038870079.1 | amidophosphoribosyltransferase | Machinery gene
ITG12_RS10270 (ITG12_10190) | nfuA | 2219596..2220180 (+) | 585 | WP_038870077.1 | Fe-S biogenesis protein NfuA | -
ITG12_RS10275 (ITG12_10195) | nudE | 2220371..2220940 (+) | 570 | WP_248873050.1 | ADP compounds hydrolase NudE | -
ITG12_RS10280 (ITG12_10200) | cysQ | 2220985..2221812 (+) | 828 | WP_038870073.1 | 3'(2'),5'-bisphosphate nucleotidase CysQ | -
ITG12_RS10285 (ITG12_10205) | - | 2221980..2222741 (-) | 762 | WP_248873051.1 | type II secretion system protein N | -
ITG12_RS10290 (ITG12_10210) | - | 2222743..2223234 (-) | 492 | WP_248873959.1 | type II secretion system protein M | -
ITG12_RS10295 (ITG12_10215) | gspL | 2223242..2224444 (-) | 1203 | WP_045422166.1 | type II secretion system protein GspL | -
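
The Location range appears to be the comF coordinates extended by 5000 bp on each side. That flank size is inferred from the numbers and is not stated by the source. The sketch below checks that inference and also confirms that each listed size equals the inclusive span of its coordinates; the variable names are illustrative.

```python
FLANK = 5000  # assumption: inferred from the listed coordinates, not documented

gene_start, gene_end = 2218773, 2219498
window = (gene_start - FLANK, gene_end + FLANK)

# (locus tag, start, end, listed size in bp), copied from the genomic context table
rows = [
    ("ITG12_RS10250", 2214898, 2217219, 2322),
    ("ITG12_RS10255", 2217324, 2217788, 465),
    ("ITG12_RS10260", 2217924, 2218688, 765),
    ("ITG12_RS10265", 2218773, 2219498, 726),
    ("ITG12_RS10270", 2219596, 2220180, 585),
    ("ITG12_RS10275", 2220371, 2220940, 570),
    ("ITG12_RS10280", 2220985, 2221812, 828),
    ("ITG12_RS10285", 2221980, 2222741, 762),
    ("ITG12_RS10290", 2222743, 2223234, 492),
    ("ITG12_RS10295", 2223242, 2224444, 1203),
]

# Collect any locus whose listed size disagrees with end - start + 1.
mismatches = [tag for tag, start, end, size in rows if end - start + 1 != size]

print(window, mismatches)
```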

Sequence


Protein


Length: 241 a.a.; Molecular weight: 27504.50 Da; Isoelectric point: 8.7164

>NTDB_id=591564 ITG12_RS10265 WP_038870079.1 2218773..2219498(+) (comF) [Vibrio sp. ED002]
MLSDQWQNIMHRVLGNQCGLCRFPISAEQKPNPMRWCDHCFEHLTPIKRCTRCGLKMTEKASQTSTECGECLKEPPPWQR
LYTLGDYDFPLSHQVQRFKDNGESWHVTALTQLLAERIEHPAPIITSVPLHWQRYLKRGFNQSHVLATHLAHHLNSNYRN
RVFKRVKSAQSQRGNKKAGREQNLKAAFALHGEVNFSHIAIVDDVVTTGSTVRQLCHLLLEVGVESIDIYCICRTPAPGS
G

Nucleotide


Length: 726 bp

>NTDB_id=591564 ITG12_RS10265 WP_038870079.1 2218773..2219498(+) (comF) [Vibrio sp. ED002]
ATGTTATCTGATCAATGGCAAAACATCATGCACCGTGTACTTGGCAATCAATGCGGTTTGTGTCGATTTCCGATTTCCGC
TGAACAGAAACCAAACCCAATGCGTTGGTGCGACCATTGTTTTGAACACCTTACACCCATCAAACGGTGCACTCGCTGTG
GTTTAAAAATGACAGAGAAGGCGTCACAAACCTCCACGGAGTGCGGTGAGTGCCTGAAAGAGCCACCGCCTTGGCAACGT
TTGTATACGTTGGGAGATTATGATTTCCCCCTTTCTCATCAAGTACAACGTTTTAAGGATAATGGTGAATCTTGGCATGT
TACCGCCCTAACTCAATTGTTAGCGGAACGAATCGAGCATCCTGCCCCTATCATTACTAGTGTCCCTTTGCATTGGCAAC
GTTACTTAAAGCGAGGATTTAATCAGAGTCATGTCTTGGCGACACATTTAGCGCACCACTTGAATAGCAACTATCGCAAT
CGCGTTTTTAAACGCGTCAAAAGCGCACAATCTCAACGTGGAAATAAAAAAGCAGGTCGCGAGCAAAATTTGAAAGCAGC
ATTTGCCCTACACGGTGAGGTAAACTTTTCCCACATTGCTATCGTGGACGATGTTGTCACAACAGGCAGTACCGTCCGAC
AATTATGTCATTTACTACTTGAAGTTGGCGTAGAAAGCATCGATATTTACTGCATCTGCAGAACTCCTGCCCCTGGTTCA
GGCTAG
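
As a consistency check between the nucleotide and protein records, the start and end of the coding sequence can be translated by hand. The sketch below builds the standard codon table (whose amino acid assignments are the same as those of the bacterial table 11) and translates only the first 78 and the last 6 nucleotides of the sequence above; `translate` is an illustrative helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1; '*' marks a stop codon, trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 78 nt and last 6 nt of the comF nucleotide record
head = "ATGTTATCTGATCAATGGCAAAACATCATGCACCGTGTACTTGGCAATCAATGCGGTTTGTGTCGATTTCCGATTTCC"
tail = "GGCTAG"

head_aa = translate(head)
tail_aa = translate(tail)
print(head_aa)
print(tail_aa)
```

The translated head matches the first 26 residues of the protein record, and the tail ends in a stop codon preceded by the final glycine.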


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source: AlphaFold DB
ID: A0A2A2N5E6

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



[Figure: visualization of predicted transmembrane-helix probability]


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comF | Vibrio campbellii strain DS40M4 | 82.917 | 99.585 | 0.826
comF | Vibrio cholerae strain A1552 | 47.059 | 98.755 | 0.465
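
The source does not define the Ha-value. For the two hits listed, however, the values are numerically consistent with the product of the identity and coverage fractions. The sketch below checks this; treat the formula as an assumption inferred from these two rows rather than as the database's documented definition.

```python
# (organism, identities %, coverage %, listed Ha-value), copied from the hits above
hits = [
    ("Vibrio campbellii strain DS40M4", 82.917, 99.585, 0.826),
    ("Vibrio cholerae strain A1552", 47.059, 98.755, 0.465),
]

computed = []
for organism, identity, coverage, listed in hits:
    # Assumed formula: Ha = (identity / 100) * (coverage / 100)
    ha = round((identity / 100) * (coverage / 100), 3)
    computed.append(ha)
    print(organism, ha, listed)
```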