Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   RJW51_RS09725 Genome accession   NZ_CP134491
Coordinates   1910465..1912702 (+) Length   745 a.a.
NCBI ID   WP_172089386.1    Uniprot ID   -
Organism   Streptococcus parasuis strain 1628469     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1900616..1909552 1910465..1912702 flank 913


Gene organization within MGE regions


Location: 1900616..1912702
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  RJW51_RS09665 (RJW51_09650) - 1900616..1901476 (-) 861 WP_172016138.1 helix-turn-helix domain-containing protein -
  RJW51_RS09670 (RJW51_09655) - 1901690..1901866 (+) 177 WP_172016139.1 hypothetical protein -
  RJW51_RS09675 (RJW51_09660) - 1901921..1904140 (+) 2220 WP_172016140.1 peptide cleavage/export ABC transporter -
  RJW51_RS09680 (RJW51_09665) - 1904133..1904600 (+) 468 WP_172016141.1 AraC family transcriptional regulator -
  RJW51_RS09685 (RJW51_09670) - 1904597..1905997 (+) 1401 WP_172016142.1 HlyD family efflux transporter periplasmic adaptor subunit -
  RJW51_RS09690 (RJW51_09675) - 1906143..1906517 (+) 375 WP_172016143.1 SPH_0224 family bacteriocin-like peptide -
  RJW51_RS09695 (RJW51_09680) - 1906524..1906709 (+) 186 WP_397610462.1 hypothetical protein -
  RJW51_RS09700 (RJW51_09685) - 1907581..1908186 (+) 606 WP_172036421.1 zinc metallopeptidase -
  RJW51_RS09705 (RJW51_09690) - 1908228..1908806 (+) 579 Protein_1871 HlyD family efflux transporter periplasmic adaptor subunit -
  RJW51_RS09710 (RJW51_09695) - 1908955..1909326 (+) 372 WP_172036422.1 hypothetical protein -
  RJW51_RS09715 (RJW51_09700) - 1909331..1909552 (+) 222 WP_397610463.1 hypothetical protein -
  RJW51_RS09720 (RJW51_09705) - 1909804..1910481 (+) 678 WP_172089387.1 helix-hairpin-helix domain-containing protein -
  RJW51_RS09725 (RJW51_09710) comEC/celB 1910465..1912702 (+) 2238 WP_172089386.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene

Sequence


Protein


Download         Length: 745 a.a.        Molecular weight: 84455.70 Da        Isoelectric Point: 6.8517

>NTDB_id=880401 RJW51_RS09725 WP_172089386.1 1910465..1912702(+) (comEC/celB) [Streptococcus parasuis strain 1628469]
MSRLIRLPCQPIHYAVLAVLTYFAVHSFSLLTMSLLSLLLAVFRLRQGKVVFIKTLPLLALCGLFFGCQKIQWERTNLWA
PEQVTTVQVIPDTIDVNGDSLSFRGRAEGQVFQVFYKLASQEEQTYFQELTDLVQLEVDAEVCQPADQRNFNGFDYQAYL
KTQGIYRTVKISTINNILPIHSWNIFDWLSTWRRQALVYIKSHFPAPMSHYMTGLLLGELDSDFDQMSDLYSSLGIIHLF
ALSGMQVGFFIDKFRWLLLRLGLTMETVDKLQIPFSLVYAGLTGFSVSVVRSLVQKILGNLGLRKLDNFAATVFVCLLLM
PRFLLTAGGVLTFTYALLLTVFDFEELGQLKKIAVESLSISLGILPVLMSYFFAFQPLSILLTFVFSFVFDVLLLPGLSV
ILLLSPFIKITWVNGFFILMEKIIVWVAELGIRPWILGKPTGLVFLLLLFCLFLLYDFHREKKWLLGLSLILALLFFITK
HPLENEVTVVDVGQGDSIFLRDIRGRTVLIDVGGRVNFATKEAWQERSRQANAERTLIPYLHSRGVDWIDSLVLTHTDAD
HVGDVLEVAKQIQIGKIYVSPGSLTVPDFVATLRKINVPVHVVKAGDQLPIFDSYLEVLYPNGIGDGGNNDSIVLYGRLL
KMNFLFTGDLEQGELDLITSYPQLPVDVLKAGHHGSKGSSYPEFLDHIGAKIALISAGENNRYQHPHQETLERFDRQNIQ
VYRTDLQGAIRFRGWKQWSIETVKE

Nucleotide


Download         Length: 2238 bp        

>NTDB_id=880401 RJW51_RS09725 WP_172089386.1 1910465..1912702(+) (comEC/celB) [Streptococcus parasuis strain 1628469]
ATGTCACGGTTGATTAGACTCCCCTGTCAGCCCATTCACTATGCAGTATTGGCGGTGTTAACCTACTTTGCAGTCCACTC
TTTTTCCCTTTTGACAATGAGCCTGCTGAGTCTGTTACTAGCAGTCTTTAGGCTTCGGCAAGGAAAGGTGGTCTTCATCA
AAACGCTACCGCTTCTAGCCTTATGTGGTCTCTTCTTCGGATGTCAGAAGATACAATGGGAGCGGACAAATCTATGGGCT
CCAGAGCAAGTGACAACTGTGCAGGTTATTCCTGATACCATTGATGTCAACGGAGACAGTCTATCTTTTCGTGGTCGGGC
TGAAGGTCAAGTTTTTCAGGTTTTCTATAAACTTGCAAGTCAGGAAGAACAAACCTACTTTCAGGAGCTTACGGACTTGG
TGCAGTTAGAGGTAGATGCAGAAGTTTGCCAACCAGCAGATCAACGTAATTTCAATGGTTTTGATTATCAGGCTTATCTC
AAAACCCAGGGCATCTATCGGACAGTAAAAATAAGTACCATTAACAATATTCTCCCTATTCATTCTTGGAATATCTTTGA
CTGGTTGTCAACCTGGCGGAGGCAGGCTCTCGTTTATATCAAATCACATTTTCCTGCTCCCATGAGCCACTATATGACTG
GATTACTATTGGGAGAGTTAGATAGTGACTTTGACCAAATGAGTGACCTCTATTCTAGTCTAGGGATCATTCATCTTTTT
GCCCTGTCTGGGATGCAGGTTGGTTTTTTCATTGACAAATTTCGCTGGCTTTTATTGCGTTTGGGTTTAACAATGGAAAC
TGTCGATAAGCTTCAAATTCCGTTTTCTCTTGTTTATGCAGGATTAACAGGATTTTCAGTATCAGTCGTGCGGTCCTTGG
TCCAGAAAATTCTTGGTAATCTCGGGCTACGAAAATTGGATAATTTTGCAGCAACTGTCTTTGTTTGTCTCTTGCTTATG
CCACGTTTTCTTCTGACAGCAGGAGGTGTGCTGACATTTACCTATGCTTTGTTATTGACAGTCTTTGATTTTGAAGAGCT
AGGGCAGCTAAAGAAGATAGCAGTGGAGAGTCTGAGTATTTCTCTTGGAATTTTACCGGTCTTGATGTCCTATTTTTTTG
CCTTTCAGCCCTTATCTATCCTTTTAACGTTTGTTTTTTCCTTTGTTTTTGATGTGTTGTTGTTACCTGGGCTATCTGTC
ATTCTTTTACTATCGCCCTTCATTAAAATTACGTGGGTCAACGGATTCTTTATCCTTATGGAAAAGATTATTGTATGGGT
GGCAGAATTGGGGATTCGCCCTTGGATTTTAGGAAAACCTACGGGCCTTGTCTTTTTGCTCTTACTGTTCTGCCTTTTCT
TGCTTTATGATTTTCACAGAGAGAAGAAATGGCTCCTTGGATTGAGTCTGATCCTTGCTCTGCTATTTTTCATAACCAAA
CACCCGCTGGAAAATGAGGTGACGGTGGTAGACGTAGGGCAGGGGGATAGTATCTTTTTGCGGGACATTCGGGGGCGGAC
GGTTCTGATTGATGTGGGTGGTCGGGTTAACTTTGCTACAAAGGAAGCGTGGCAGGAGCGATCTCGGCAGGCAAATGCAG
AGCGGACTTTGATTCCCTATCTGCATAGTCGAGGTGTGGATTGGATTGATAGCCTAGTGCTGACTCACACCGATGCAGAT
CATGTGGGTGATGTGCTAGAAGTGGCTAAGCAGATTCAAATAGGTAAGATTTACGTTTCTCCAGGTAGTTTAACTGTTCC
AGATTTTGTTGCGACTTTGAGGAAAATCAATGTCCCTGTTCATGTTGTAAAAGCTGGGGATCAATTGCCCATTTTTGATT
CCTATCTAGAAGTTCTATATCCCAATGGAATCGGAGATGGAGGCAATAATGACTCAATTGTACTCTATGGTCGTTTGTTA
AAAATGAATTTCCTCTTTACCGGTGACTTGGAGCAAGGGGAATTAGATTTAATCACTTCTTATCCGCAGTTACCAGTCGA
TGTGCTGAAAGCAGGTCACCATGGCTCCAAGGGCTCTTCATATCCAGAATTTTTAGACCATATTGGAGCAAAAATTGCTC
TGATTTCTGCTGGTGAAAATAATCGCTATCAACATCCGCATCAGGAAACTCTGGAACGTTTCGACAGGCAAAATATACAG
GTTTACCGAACGGATCTGCAAGGAGCAATCCGTTTCCGAGGTTGGAAACAGTGGAGTATTGAAACGGTAAAAGAGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus mitis SK321

53.815

100

0.54

  comEC/celB Streptococcus mitis NCTC 12261

52.965

99.597

0.528

  comEC/celB Streptococcus pneumoniae TIGR4

52.086

99.732

0.519

  comEC/celB Streptococcus pneumoniae Rx1

51.548

99.732

0.514

  comEC/celB Streptococcus pneumoniae D39

51.548

99.732

0.514

  comEC/celB Streptococcus pneumoniae R6

51.548

99.732

0.514

  comEC Lactococcus lactis subsp. cremoris KW2

47.931

100

0.482