Detailed information    

insolico Bioinformatically predicted

Overview


Name   comF   Type   Machinery gene
Locus tag   LC814_RS05025 Genome accession   NZ_CP084528
Coordinates   1112993..1113646 (+) Length   217 a.a.
NCBI ID   WP_226065386.1    Uniprot ID   -
Organism   Kaistella polysaccharea strain GW4-15     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1107993..1118646
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LC814_RS05010 aceA 1108029..1109315 (-) 1287 WP_125023425.1 isocitrate lyase -
  LC814_RS05015 aceB 1109554..1111131 (-) 1578 WP_226065383.1 malate synthase A -
  LC814_RS05020 - 1111310..1112779 (+) 1470 WP_226065385.1 helix-turn-helix domain-containing protein -
  LC814_RS05025 comF 1112993..1113646 (+) 654 WP_226065386.1 ComF family protein Machinery gene
  LC814_RS05030 upp 1113797..1114447 (-) 651 WP_226065388.1 uracil phosphoribosyltransferase -
  LC814_RS05035 der 1114529..1115836 (-) 1308 WP_226065390.1 ribosome biogenesis GTPase Der -
  LC814_RS05040 - 1116029..1116580 (+) 552 WP_226065392.1 LemA family protein -
  LC814_RS05045 - 1116585..1118510 (+) 1926 WP_226065394.1 DUF2207 domain-containing protein -

Sequence


Protein


Download         Length: 217 a.a.        Molecular weight: 25404.40 Da        Isoelectric Point: 7.3078

>NTDB_id=611662 LC814_RS05025 WP_226065386.1 1112993..1113646(+) (comF) [Kaistella polysaccharea strain GW4-15]
MFLVDLLFPNRCLECNRIIHHEEVVCELCFDQIHFTHHHHSESNLLLEKCRLLFPIENAFALMQFEEESTSQKIIHQLKY
GSREKIGKIIANWTLEKVDFNAIKIDLLATVPLHPKKEKERGYNQLHSYAEELSKQLKTPCDHTLIKRNFYKKAQAKKNR
DQRTFNEDLFSITKMISDQHILLIDDVFTTGNTMSAVAWQILKDGHNKVSVLVMAMD

Nucleotide


Download         Length: 654 bp        

>NTDB_id=611662 LC814_RS05025 WP_226065386.1 1112993..1113646(+) (comF) [Kaistella polysaccharea strain GW4-15]
ATGTTTTTAGTCGACTTACTTTTTCCGAATCGCTGTTTGGAATGCAACCGAATCATTCATCATGAAGAAGTGGTTTGTGA
ACTGTGTTTTGATCAGATTCATTTTACACATCATCACCATTCCGAAAGCAATCTTTTACTGGAGAAATGTAGGTTACTTT
TTCCGATTGAAAATGCTTTTGCGCTGATGCAGTTTGAGGAAGAAAGCACCAGTCAAAAAATTATTCATCAGTTAAAATAC
GGCAGTCGGGAGAAAATTGGAAAAATCATCGCGAACTGGACTTTAGAAAAAGTTGATTTCAATGCTATTAAAATTGATCT
TTTAGCAACCGTTCCACTTCATCCGAAAAAAGAGAAAGAACGTGGATATAATCAGCTGCATTCGTATGCTGAGGAACTGT
CGAAACAATTAAAAACACCTTGTGATCACACTTTAATAAAAAGAAATTTCTATAAAAAAGCTCAGGCCAAAAAAAATAGA
GATCAGCGAACCTTTAACGAAGATTTATTTTCAATTACGAAAATGATTTCCGATCAGCACATTTTATTAATTGACGATGT
TTTTACCACAGGCAATACGATGAGCGCTGTAGCCTGGCAAATTTTAAAGGACGGACATAACAAAGTGAGTGTTTTGGTCA
TGGCAATGGATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comF Riemerella anatipestifer ATCC 11845 = DSM 15868

48.848

100

0.488