Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFC/cflB   Type   Machinery gene
Locus tag   DQK98_RS11045 Genome accession   NZ_LS483449
Coordinates   2071186..2071848 (-) Length   220 a.a.
NCBI ID   WP_000649971.1    Uniprot ID   A0A0B7MFB3
Organism   Streptococcus pneumoniae strain 4041STDY6836170     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2066186..2076848
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQK98_RS11020 - 2066336..2066602 (-) 267 WP_001278167.1 Veg family protein -
  DQK98_RS11025 dnaB 2066604..2067956 (-) 1353 WP_000852486.1 replicative DNA helicase -
  DQK98_RS11030 rplI 2068000..2068452 (-) 453 WP_000864220.1 50S ribosomal protein L9 -
  DQK98_RS11035 - 2068449..2070422 (-) 1974 WP_000715129.1 DHH family phosphoesterase -
  DQK98_RS11040 hpf 2070558..2071106 (-) 549 WP_000599107.1 ribosome hibernation-promoting factor, HPF/YfiA family -
  DQK98_RS11045 comFC/cflB 2071186..2071848 (-) 663 WP_000649971.1 ComF family protein Machinery gene
  DQK98_RS11050 comFA/cflA 2071845..2073143 (-) 1299 WP_000867601.1 DEAD/DEAH box helicase Machinery gene
  DQK98_RS11055 - 2073199..2073834 (+) 636 WP_000395738.1 YigZ family protein -
  DQK98_RS11060 cysK 2074119..2075039 (+) 921 WP_000029867.1 cysteine synthase A -
  DQK98_RS11070 - 2075114..2075652 (-) 539 Protein_2043 transposase -
  DQK98_RS11075 - 2075625..2075933 (+) 309 Protein_2044 transposase -
  DQK98_RS11080 tnpB 2076088..2076342 (-) 255 WP_061747476.1 IS66 family insertion sequence element accessory protein TnpB -

Sequence


Protein


Download         Length: 220 a.a.        Molecular weight: 25241.10 Da        Isoelectric Point: 7.3688

>NTDB_id=1141945 DQK98_RS11045 WP_000649971.1 2071186..2071848(-) (comFC/cflB) [Streptococcus pneumoniae strain 4041STDY6836170]
MKCLLCGQTMKTVLTFSSLLLLRNDDSCLCSDCDSTFERIGEENCPNCMKTELSTKCQDCQLWCKEGVEVSHRAIFTYNQ
AMKDFFSRYKFDGDFLLRKVFASFLSEELKKYKEYQFVVIPLSPDRYANRGFNQVEGLVEAAGFEYLDLLEKREERASSS
KNRSERLGTELPFFIKSGVTIPKKILLIDDIYTTGATINRVKKLLEEAGAKDVKTFSLVR

Nucleotide


Download         Length: 663 bp        

>NTDB_id=1141945 DQK98_RS11045 WP_000649971.1 2071186..2071848(-) (comFC/cflB) [Streptococcus pneumoniae strain 4041STDY6836170]
ATGAAGTGCTTGTTATGTGGGCAGACTATGAAGACTGTTTTAACTTTTAGTAGTCTCTTACTTCTGAGGAATGATGACTC
TTGTCTTTGTTCAGACTGTGATTCTACTTTTGAAAGAATTGGGGAAGAGAACTGTCCAAATTGTATGAAAACAGAGTTGT
CGACAAAGTGTCAAGATTGTCAACTTTGGTGTAAAGAGGGAGTTGAAGTCAGTCATAGAGCGATTTTTACTTACAATCAA
GCTATGAAGGATTTTTTCAGTCGGTATAAGTTTGATGGAGACTTCCTGTTAAGAAAAGTTTTCGCTTCATTTTTAAGTGA
GGAGTTGAAAAAGTACAAAGAGTATCAATTTGTTGTAATTCCCCTAAGTCCTGATAGATATGCTAATAGAGGATTTAATC
AGGTTGAGGGCTTGGTAGAGGCAGCAGGCTTTGAGTATCTGGATTTATTAGAGAAAAGAGAAGAGAGAGCCAGTTCTTCT
AAAAATCGTTCAGAGCGCTTGGGGACAGAACTTCCTTTCTTTATTAAAAGTGGAGTCACTATTCCTAAAAAAATCCTACT
TATAGATGATATCTATACTACAGGAGCAACTATAAATCGTGTTAAGAAACTGTTGGAAGAAGCTGGTGCTAAGGATGTAA
AAACATTTTCCCTTGTAAGATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0B7MFB3

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFC/cflB Streptococcus pneumoniae TIGR4

100

100

1

  comFC/cflB Streptococcus pneumoniae Rx1

99.545

100

0.995

  comFC/cflB Streptococcus pneumoniae D39

99.545

100

0.995

  comFC/cflB Streptococcus pneumoniae R6

99.545

100

0.995

  comFC/cflB Streptococcus mitis NCTC 12261

91.818

100

0.918

  comFC/cflB Streptococcus mitis SK321

91.364

100

0.914


Multiple sequence alignment