Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comFC/cflB
Type: Machinery gene
Locus tag: EQH31_RS10820
Genome accession: NZ_CP035249
Coordinates: 2106756..2107418 (-)
Length: 220 a.a.
NCBI ID: WP_000649965.1
UniProt ID: -
Organism: Streptococcus pneumoniae strain TVO_1901938
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
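
The coordinates and the protein length listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch; the function names are illustrative and not part of any database API, and it assumes 1-based inclusive coordinates and a single terminal stop codon.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates of EQH31_RS10820 as listed in the overview.
cds_bp = span_length(2106756, 2107418)
print(cds_bp, protein_length(cds_bp))  # 663 220
```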

Genomic Context


Location: 2101756..2112418
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQH31_RS10795 (EQH31_11475) - 2101906..2102172 (-) 267 WP_001278167.1 Veg family protein -
  EQH31_RS10800 (EQH31_11480) dnaB 2102174..2103526 (-) 1353 WP_000852486.1 replicative DNA helicase -
  EQH31_RS10805 (EQH31_11485) rplI 2103570..2104022 (-) 453 WP_000864220.1 50S ribosomal protein L9 -
  EQH31_RS10810 (EQH31_11490) - 2104019..2105992 (-) 1974 WP_000715107.1 DHH family phosphoesterase -
  EQH31_RS10815 (EQH31_11495) hpf 2106128..2106676 (-) 549 WP_000599104.1 ribosome hibernation-promoting factor, HPF/YfiA family -
  EQH31_RS10820 (EQH31_11500) comFC/cflB 2106756..2107418 (-) 663 WP_000649965.1 ComF family protein Machinery gene
  EQH31_RS10825 (EQH31_11505) comFA/cflA 2107415..2108713 (-) 1299 WP_000867610.1 DEAD/DEAH box helicase Machinery gene
  EQH31_RS10830 (EQH31_11510) - 2108769..2109404 (+) 636 WP_000395740.1 YigZ family protein -
  EQH31_RS10835 (EQH31_11515) cysK 2109689..2110609 (+) 921 WP_000029860.1 cysteine synthase A -
  EQH31_RS10840 (EQH31_11525) - 2110684..2111222 (-) 539 Protein_2122 transposase -
  EQH31_RS10845 (EQH31_11530) - 2111195..2111482 (+) 288 Protein_2123 transposase -
  EQH31_RS10850 (EQH31_11535) tnpB 2111618..2111851 (-) 234 WP_000376908.1 IS66 family insertion sequence element accessory protein TnpB -
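
The coordinates in this table show that comFC/cflB and the adjacent comFA/cflA share a few bases on the minus strand. A minimal sketch of the overlap calculation follows, assuming 1-based inclusive coordinates; the helper name `overlap_bp` is hypothetical.

```python
def overlap_bp(a, b):
    """Number of shared bases between two (start, end) inclusive ranges."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Coordinates copied from the genomic context table.
hpf = (2106128, 2106676)
comFC = (2106756, 2107418)
comFA = (2107415, 2108713)

print(overlap_bp(comFC, comFA))  # 4
print(overlap_bp(hpf, comFC))    # 0
```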

Sequence


Protein


Length: 220 a.a.        Molecular weight: 25231.11 Da        Isoelectric Point: 7.7116

>NTDB_id=337715 EQH31_RS10820 WP_000649965.1 2106756..2107418(-) (comFC/cflB) [Streptococcus pneumoniae strain TVO_1901938]
MKCLLCGQTMKTVLTFSSLLLLRNDDSCLCSDCDSTFERIGEENCPNCMKTELSTKCQACQFWCKEGVEVSHRAIFTYNQ
AMKDFFSRYKFDGDFLLRKVFASFLSEELKKYKEYQFVVIPLSPDRYANRGFNQVEGLVEAAGFEYLDLLEKREERASSS
KNRSERLGTELPFFIKSGVTIPKKILLIDDIYTTGATINRVKKLLEEAGAKDVKTFSLVR

Nucleotide


Length: 663 bp

>NTDB_id=337715 EQH31_RS10820 WP_000649965.1 2106756..2107418(-) (comFC/cflB) [Streptococcus pneumoniae strain TVO_1901938]
ATGAAGTGCTTGTTATGTGGGCAGACTATGAAGACTGTTTTAACTTTTAGTAGTCTCTTACTTCTGAGGAATGATGACTC
TTGTCTTTGTTCAGACTGTGATTCTACTTTTGAAAGAATTGGGGAAGAGAACTGTCCAAATTGTATGAAAACAGAGTTGT
CAACAAAGTGTCAAGCTTGTCAATTTTGGTGTAAAGAAGGAGTTGAAGTCAGTCATAGAGCGATTTTTACTTACAATCAA
GCTATGAAGGATTTTTTCAGTCGGTATAAGTTTGATGGAGACTTCCTGTTAAGAAAAGTTTTCGCTTCATTTTTAAGTGA
GGAGTTGAAAAAGTACAAAGAGTATCAATTTGTTGTAATTCCCCTAAGTCCTGATAGATATGCTAACAGAGGATTTAATC
AGGTTGAGGGCTTGGTAGAGGCAGCAGGCTTTGAGTATCTAGATTTATTAGAGAAAAGAGAAGAGAGAGCCAGTTCTTCT
AAAAATCGTTCAGAGCGCTTGGGGACAGAACTTCCTTTCTTTATTAAAAGTGGAGTCACTATTCCTAAAAAAATCCTACT
TATAGATGATATCTATACTACAGGAGCAACTATAAATCGTGTTAAGAAACTGTTGGAAGAAGCTGGTGCTAAGGATGTAA
AAACATTTTCCCTTGTAAGATGA
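
As a sanity check, the nucleotide record should translate to the protein record. The sketch below is a dependency-free Python translation using the standard codon assignments (which bacterial translation table 11 shares). `translate` is an illustrative helper, not a database function, and only the first 27 bases of the sequence above are used as sample input.

```python
from itertools import product

# Standard codon table, built in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the nucleotide record.
print(translate("ATGAAGTGCTTGTTATGTGGGCAGACT"))  # MKCLLCGQT
```

Passing the full 663 bp sequence to the same function allows a comparison with the 220 a.a. protein record.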


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFC/cflB Streptococcus pneumoniae TIGR4 99.091 100 0.991
  comFC/cflB Streptococcus pneumoniae Rx1 98.636 100 0.986
  comFC/cflB Streptococcus pneumoniae D39 98.636 100 0.986
  comFC/cflB Streptococcus pneumoniae R6 98.636 100 0.986
  comFC/cflB Streptococcus mitis SK321 91.364 100 0.914
  comFC/cflB Streptococcus mitis NCTC 12261 90.909 100 0.909
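
The Ha-value column is not defined on this page. The listed values are numerically consistent with the product of fractional identity and fractional coverage, rounded to three decimals. The sketch below checks that reading against the listed rows; it should be treated as an assumption rather than the database's stated definition, and `ha_value` is a hypothetical helper name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100), 3 decimals."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# (organism, identity %, coverage %, listed Ha-value) from the table above.
rows = [
    ("Streptococcus pneumoniae TIGR4", 99.091, 100, 0.991),
    ("Streptococcus pneumoniae Rx1", 98.636, 100, 0.986),
    ("Streptococcus pneumoniae D39", 98.636, 100, 0.986),
    ("Streptococcus pneumoniae R6", 98.636, 100, 0.986),
    ("Streptococcus mitis SK321", 91.364, 100, 0.914),
    ("Streptococcus mitis NCTC 12261", 90.909, 100, 0.909),
]

for organism, identity, coverage, listed in rows:
    # Print the computed value next to the listed one for comparison.
    print(organism, ha_value(identity, coverage), listed)
```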


Multiple sequence alignment