Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comF
Type: Machinery gene
Locus tag: ITG10_RS08520
Genome accession: NZ_CP066149
Coordinates: 1884405..1885118 (-)
Length: 237 a.a.
NCBI ID: WP_017629488.1
UniProt ID: -
Organism: Vibrio sp. ED004
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
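The coordinates and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; the helper name `span_length` is illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive interval."""
    return end - start + 1

# Coordinates of comF (ITG10_RS08520) as listed in the record.
gene_start, gene_end = 1884405, 1885118

gene_bp = span_length(gene_start, gene_end)  # nucleotides in the CDS
codons = gene_bp // 3                        # includes the stop codon
protein_aa = codons - 1                      # assumption: one stop codon, not translated

print(gene_bp, codons, protein_aa)  # 714 238 237
```

Under these assumptions the 714 bp interval corresponds to 237 amino acids plus one stop codon, which agrees with the stated protein length.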

Genomic Context


Location: 1879405..1890118
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
ITG10_RS08495 (ITG10_08415) | - | 1880599..1881087 (+) | 489 | WP_017629483.1 | type II secretion system protein M | -
ITG10_RS08500 (ITG10_08420) | - | 1881089..1881868 (+) | 780 | WP_017629484.1 | type II secretion system protein N | -
ITG10_RS08505 (ITG10_08425) | cysQ | 1881999..1882826 (-) | 828 | WP_017629485.1 | 3'(2'),5'-bisphosphate nucleotidase CysQ | -
ITG10_RS08510 (ITG10_08430) | nudE | 1882861..1883412 (-) | 552 | WP_017629486.1 | ADP compounds hydrolase NudE | -
ITG10_RS08515 (ITG10_08435) | nfuA | 1883711..1884295 (-) | 585 | WP_026084098.1 | Fe-S biogenesis protein NfuA | -
ITG10_RS08520 (ITG10_08440) | comF | 1884405..1885118 (-) | 714 | WP_017629488.1 | ComF family protein | Machinery gene
ITG10_RS08525 (ITG10_08445) | bioH | 1885212..1885988 (+) | 777 | WP_017629489.1 | pimeloyl-ACP methyl ester esterase BioH | -
ITG10_RS08530 (ITG10_08450) | - | 1886328..1886801 (+) | 474 | WP_017063078.1 | hypothetical protein | -
ITG10_RS08535 (ITG10_08455) | - | 1887054..1889384 (-) | 2331 | WP_017629490.1 | Tex family protein | -
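The Location range shown for the genomic context appears to be the comF coordinates padded by 5,000 bp on each side. This is an observation from the numbers in this record, not a documented rule of the database. A minimal sketch, with a hypothetical helper `flank`:

```python
def flank(start: int, end: int, pad: int) -> tuple[int, int]:
    """Pad a 1-based inclusive interval by `pad` bp on each side, clamping at position 1."""
    return max(1, start - pad), end + pad

# comF coordinates from the record; the 5000 bp padding is an assumption
# inferred from the listed Location range.
window = flank(1884405, 1885118, 5000)
print(window)  # (1879405, 1890118)
```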

Sequence


Protein


Length: 237 a.a.        Molecular weight: 27124.46 Da        Isoelectric point: 9.0366

>NTDB_id=517884 ITG10_RS08520 WP_017629488.1 1884405..1885118(-) (comF) [Vibrio sp. ED004]
MLSDWLQKHTPRLVTPQCHLCKLDKCPSDTHPRWCNPCLKLFEPVPRCQRCGLKTVTTVEQCGECLSKPPPWHRLYCVGD
YTFPTAGYIQQMKYADKFWFARDLSKILASRIEEPASLLTSVPLHWQRYIDRGFNQSQLLARYTAHELNIKNAVLFRRTR
STISQQGLTKSARKSNLKGAFVLKNLNFSATDYSHVAIIDDVVTTGSTVYQLCQLLLEVGVKRIDIYCICRTPEPSG

Nucleotide


Length: 714 bp

>NTDB_id=517884 ITG10_RS08520 WP_017629488.1 1884405..1885118(-) (comF) [Vibrio sp. ED004]
ATGTTATCCGATTGGCTACAAAAACACACACCACGTCTGGTCACACCTCAATGCCACCTGTGTAAGCTAGATAAATGTCC
TAGTGATACACATCCTCGATGGTGCAATCCTTGTCTCAAACTCTTTGAGCCAGTACCTCGCTGCCAACGATGTGGCTTAA
AAACTGTCACCACCGTTGAGCAGTGTGGTGAGTGTTTATCAAAGCCACCACCTTGGCATCGCCTCTATTGTGTTGGCGAT
TACACCTTTCCAACCGCTGGTTATATTCAACAAATGAAGTACGCGGATAAGTTCTGGTTTGCTCGCGACTTGTCGAAAAT
ATTAGCCTCACGTATTGAAGAGCCCGCGTCGTTGCTCACGAGTGTTCCACTGCATTGGCAAAGGTACATTGATCGTGGTT
TTAATCAGAGCCAGTTATTAGCACGTTACACCGCTCATGAGCTCAACATCAAAAATGCCGTTTTATTTAGACGAACCCGC
TCAACGATTTCCCAACAAGGGCTCACCAAGTCGGCAAGAAAAAGTAATCTGAAAGGCGCTTTCGTTCTGAAAAATTTAAA
CTTTTCAGCGACGGATTATTCGCACGTCGCGATAATTGATGATGTTGTAACCACAGGCAGTACTGTGTATCAATTATGCC
AATTACTACTTGAAGTAGGGGTGAAAAGGATTGATATTTACTGCATCTGCCGCACTCCTGAGCCCTCTGGATAA
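The nucleotide record is given in coding orientation (it starts with ATG even though the gene lies on the minus strand), so it can be translated directly and compared with the protein record. The sketch below is a hedged illustration using only the Python standard library: the function name `translate` and the two short excerpts are chosen for this example, and it assumes the standard codon assignments apply to this gene.

```python
from itertools import product

# Standard genetic code in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# Excerpts copied from the nucleotide record above: first 30 nt and last 18 nt.
head = "ATGTTATCCGATTGGCTACAAAAACACACA"
tail = "CCTGAGCCCTCTGGATAA"

print(translate(head))  # MLSDWLQKHT
print(translate(tail))  # PEPSG*
```

The two excerpts translate to the first ten and last five residues of the protein record, followed by the stop codon. Passing the full 714 bp sequence to `translate` would allow the same comparison over the whole protein.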


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comF | Vibrio cholerae strain A1552 | 55.042 | 100 | 0.553
comF | Vibrio campbellii strain DS40M4 | 49.18 | 100 | 0.506