Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEC/celB
Type: Machinery gene
Locus tag: GPW68_RS06805
Genome accession: NZ_LR738723
Coordinates: 1361921..1364158 (-)
Length: 745 a.a.
NCBI ID: WP_074389299.1
UniProt ID: A0AAP6DW18
Organism: Streptococcus suis isolate GD-0088
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake
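
The coordinates and lengths listed in this overview are mutually consistent, which a little arithmetic can confirm. The sketch below assumes inclusive, 1-based coordinates and that the nucleotide span includes the stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length(1361921, 1364158)
print(cds_bp, protein_length(cds_bp))  # 2238 745
```

Both values agree with the record: 2238 bp for the gene and 745 a.a. for the protein.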

Genomic Context


Location: 1356921..1369158
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GPW68_RS06785 - 1358044..1358871 (-) 828 WP_011922194.1 hypothetical protein -
  GPW68_RS06790 - 1359076..1360797 (+) 1722 WP_074389404.1 IS1634 family transposase -
  GPW68_RS06795 - 1360940..1361596 (-) 657 WP_011922763.1 CBS domain-containing protein -
  GPW68_RS06800 - 1361779..1361904 (-) 126 Protein_1283 IS982 family transposase -
  GPW68_RS06805 comEC/celB 1361921..1364158 (-) 2238 WP_074389299.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  GPW68_RS06810 comEA/celA/cilE 1364142..1364804 (-) 663 WP_023370712.1 helix-hairpin-helix domain-containing protein Machinery gene
  GPW68_RS06815 - 1364871..1365617 (-) 747 WP_023370714.1 lysophospholipid acyltransferase family protein -
  GPW68_RS06820 - 1365763..1367208 (+) 1446 WP_023370716.1 UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase -
  GPW68_RS06825 - 1367360..1368160 (+) 801 WP_023370718.1 ABC transporter ATP-binding protein -
  GPW68_RS06830 - 1368157..1369086 (+) 930 WP_074411828.1 ABC transporter substrate-binding protein -
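
Two relationships in the table above can be checked numerically. First, the Location range appears to be the gene's coordinates extended by 5,000 bp on each side; that flank size is inferred from the numbers in this record, not stated by the database. Second, comEC/celB and comEA/celA/cilE share a few positions. A minimal sketch, with illustrative helper names:

```python
FLANK = 5000  # assumed flank size, inferred from this record


def context_window(start: int, end: int, flank: int = FLANK) -> tuple:
    """Extend an inclusive range by `flank` bp on both sides (not below 1)."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple, b: tuple) -> int:
    """Number of positions shared by two inclusive ranges."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


comEC = (1361921, 1364158)
comEA = (1364142, 1364804)

print(context_window(*comEC))    # (1356921, 1369158)
print(overlap_bp(comEC, comEA))  # 17
```

The 17 bp overlap means the annotated start of comEC/celB lies inside the 3' end of comEA/celA/cilE, since both genes are on the minus strand.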

Sequence


Protein


Length: 745 a.a.        Molecular weight: 84149.21 Da        Isoelectric point: 7.9211

>NTDB_id=1129557 GPW68_RS06805 WP_074389299.1 1361921..1364158(-) (comEC/celB) [Streptococcus suis isolate GD-0088]
MSRLSKLPCQPIHLALLAVAAYFVVHSFSLLTMGLLSLLLLVFGFRQGKTVFIRTLPLIALCGLFFGYQKVQWERADQLA
PEQVTTVQVIPDTIEVNGDSLSFRGRADGQTYQVFYKLTSQEEQTYFQKLTGLVQLEVEAEISLPAGQRNFKGFDYQAYL
KTQGIYLTVKITAIKKIVPVQSWNVFDWLSNWRRQALVYVKTNFPVPMSHYMTGLLFGDLDSDFDQMSDLYSSLGIIHLF
ALSGMQVGFFIDKFRWILLRLGLTKETVDKLQIPFSLVYASLTGFSVSVVRSLVQKILGNLGLRKLDNFAVTVFVCLLIL
PRFLLTAGGVLTFTYAFLLTVFDFEDLGQVKKAAVESLSISLGILPVLMTYFYAFQPLSILLTFAFSFVFDVLLLPGLSV
IFLLSPIVKITWVNGFFVFMEKIIVWVADLGLRPWILGKPSELVLLLLLVSLFLLYDFHRRKKWLLGLSLVLALLFFITK
HPLENEVTVVDIGQGDSIFLRDMRGRTVLIDVGGRVDFAAKEAWQERSRQANAERTLIPYLHSRGVDRIDSLVLTHTDTD
HVGDVLEVAKQVQIGRIVVSPGSLTVPDFVATLKKINVPVHVVKVGDRLPIFDSYLEVLYPDGTGDGGNNDSIVLYGRLL
ETNFLFTGDLEQGELDLIETYPNLPVDVLKAGHHGSKGSSYPEFLDHIGAKIALVSAGENNRYKHPHQETLERFDSRNIQ
VYRTDQQGAIRFRGWKEWKIETVRR
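
The FASTA header of this record packs several fields into one line. The sketch below splits it into named fields with a regular expression. The field layout is an assumption inferred from this single header, and the field names (`ntdb_id`, `locus_tag`, and so on) are illustrative rather than an official schema.

```python
import re

# Layout assumed from this record:
# >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Return the header fields as a dict, with coordinates as integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=1129557 GPW68_RS06805 WP_074389299.1 "
    "1361921..1364158(-) (comEC/celB) [Streptococcus suis isolate GD-0088]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```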

Nucleotide


Length: 2238 bp

>NTDB_id=1129557 GPW68_RS06805 WP_074389299.1 1361921..1364158(-) (comEC/celB) [Streptococcus suis isolate GD-0088]
ATGTCACGGTTGAGTAAGCTCCCCTGCCAGCCCATCCACTTGGCTCTTTTGGCGGTGGCAGCCTACTTTGTAGTCCACTC
TTTTTCCCTCTTGACAATGGGCCTGCTGAGTTTGTTGTTGCTGGTCTTTGGATTTCGACAAGGTAAGACAGTTTTCATCA
GAACGCTGCCGCTTATAGCCTTATGTGGTCTGTTTTTCGGCTACCAAAAGGTCCAATGGGAGCGGGCAGACCAGTTAGCC
CCAGAGCAAGTGACGACTGTGCAGGTCATTCCAGATACGATTGAAGTCAACGGGGACAGCCTGTCCTTCCGCGGTCGGGC
TGACGGACAAACCTATCAGGTTTTCTATAAATTAACCAGTCAGGAGGAACAGACCTATTTTCAAAAGCTGACAGGTTTGG
TCCAGCTTGAAGTGGAAGCAGAGATCAGTCTGCCAGCTGGTCAGCGGAATTTCAAGGGTTTTGACTATCAGGCCTACCTG
AAAACACAGGGTATTTATCTGACGGTCAAGATTACGGCAATCAAGAAGATTGTCCCAGTCCAGTCTTGGAATGTCTTTGA
CTGGTTGTCAAACTGGCGGAGGCAGGCTTTGGTTTATGTCAAAACCAATTTTCCGGTTCCCATGAGCCACTACATGACAG
GGCTCTTGTTTGGGGACTTGGATAGTGATTTTGACCAGATGAGTGACCTCTATTCCAGTTTGGGTATCATTCATCTCTTT
GCCCTGTCTGGTATGCAGGTGGGCTTTTTCATCGACAAGTTTCGCTGGATTTTACTGCGTTTGGGCTTGACCAAAGAAAC
GGTTGATAAATTACAAATTCCCTTTTCCCTTGTCTATGCGAGCTTGACAGGTTTTTCCGTATCGGTGGTGCGGTCTTTGG
TTCAAAAAATTTTGGGTAATCTGGGCTTGCGGAAGTTGGATAACTTTGCGGTGACAGTTTTTGTCTGTCTGTTGATACTG
CCCCGATTTCTGCTGACAGCTGGTGGTGTCCTGACCTTTACCTATGCTTTTTTGCTGACGGTTTTTGATTTTGAGGACTT
GGGGCAGGTCAAAAAGGCTGCGGTGGAAAGTCTCAGTATTTCACTTGGGATTTTGCCAGTGCTCATGACCTATTTCTATG
CCTTTCAGCCCCTGTCTATCCTCTTGACCTTTGCCTTTTCTTTTGTCTTTGATGTCCTACTCTTGCCAGGCTTGTCAGTC
ATTTTTCTCCTGTCGCCAATCGTCAAGATTACTTGGGTCAACGGATTTTTTGTCTTTATGGAAAAAATCATTGTCTGGGT
GGCGGATTTGGGATTGCGACCTTGGATACTGGGCAAGCCGTCTGAGCTGGTCCTTTTGCTCTTGCTGGTCAGCCTCTTCT
TACTCTACGATTTCCACAGGAGAAAGAAATGGCTTCTAGGACTGAGCTTGGTCCTCGCTCTGCTATTTTTCATAACCAAA
CACCCGCTGGAAAACGAGGTGACGGTGGTGGACATCGGGCAGGGGGACAGCATCTTTTTGCGGGATATGCGGGGGCGGAC
GGTCCTGATTGATGTGGGCGGACGAGTTGATTTTGCGGCCAAGGAAGCGTGGCAGGAGCGGTCTCGGCAGGCAAATGCAG
AGCGGACTTTGATTCCCTATCTGCATAGTCGGGGTGTGGATAGGATTGATAGTTTAGTCCTGACCCACACCGACACAGAC
CATGTGGGGGATGTGCTGGAAGTGGCTAAGCAGGTGCAAATTGGTCGGATTGTCGTGTCGCCAGGAAGTCTGACGGTGCC
TGACTTTGTAGCAACTCTGAAAAAAATCAATGTGCCTGTCCATGTGGTAAAAGTGGGCGACCGCCTACCAATTTTTGACT
CCTATCTGGAGGTCCTCTATCCAGATGGGACAGGCGACGGTGGCAATAATGACTCCATTGTCCTTTACGGTCGCTTGTTG
GAAACTAACTTTCTCTTTACAGGAGACCTGGAGCAAGGGGAGTTGGACTTGATAGAAACCTATCCCAACCTGCCTGTCGA
TGTCCTCAAAGCCGGTCACCATGGCTCCAAAGGCTCTTCTTATCCTGAATTTTTAGACCATATCGGTGCCAAGATTGCCT
TGGTATCTGCTGGAGAAAACAACCGCTACAAGCATCCTCACCAAGAAACCTTGGAACGATTTGACAGCCGGAATATCCAA
GTCTATCGCACAGACCAGCAAGGAGCCATTCGCTTCCGAGGTTGGAAGGAGTGGAAGATTGAGACGGTGAGGAGGTAA
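
The nucleotide sequence is given in coding orientation (it starts with ATG and ends with TAA), so it can be translated directly and compared with the protein sequence. The sketch below spot-checks the first 30 and last 18 nucleotides using the standard codon assignments, which bacterial translation table 11 shares. The `translate` helper is illustrative and is not the database's own pipeline.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = "ATGTCACGGTTGAGTAAGCTCCCCTGCCAG"  # first 30 nt of the gene
tail = "GAGACGGTGAGGAGGTAA"              # last 18 nt, including the stop codon

print(translate(head))  # MSRLSKLPCQ
print(translate(tail))  # ETVRR
```

Both fragments match the start and end of the 745-residue protein sequence in this record.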


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted with TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comEC/celB   Streptococcus mitis SK321   54.105   99.732   0.54
  comEC/celB   Streptococcus mitis NCTC 12261   53.036   99.463   0.528
  comEC/celB   Streptococcus pneumoniae TIGR4   52.156   99.597   0.519
  comEC/celB   Streptococcus pneumoniae Rx1   51.887   99.597   0.517
  comEC/celB   Streptococcus pneumoniae D39   51.887   99.597   0.517
  comEC/celB   Streptococcus pneumoniae R6   51.887   99.597   0.517
  comEC   Lactococcus lactis subsp. cremoris KW2   49.13   100   0.493


Multiple sequence alignment