Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEC/celB
Type: Machinery gene
Locus tag: E0F34_RS04655
Genome accession: NZ_LR536843
Coordinates: 856120..858360 (+)
Length: 746 a.a.
NCBI ID: WP_061384506.1
UniProt ID: -
Organism: Streptococcus pneumoniae strain GPSC55 substr. ST3774 isolate b04a6400-1f66-11e7-b93e-3c4a9275d6c8
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
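The coordinates and the protein length in the overview can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene ends with a single stop codon; `parse_coordinates` is a hypothetical helper written for this illustration, not part of any database tooling.

```python
def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    span, strand = text.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")

start, end, strand = parse_coordinates("856120..858360 (+)")

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end - start + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 2241 746
```

Under these assumptions the result agrees with the 2241 bp nucleotide record and the 746 a.a. protein record listed for this gene.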

Genomic Context


Location: 851120..863360
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E0F34_RS04610 (SAMEA104035599_00875) - 851280..852248 (+) 969 WP_000658183.1 PhoH family protein -
  E0F34_RS04620 (SAMEA104035599_00876) - 852441..852941 (+) 501 WP_000566989.1 GNAT family N-acetyltransferase -
  E0F34_RS04625 - 852944..853269 (+) 326 Protein_876 TfoX/Sxy family protein -
  E0F34_RS11870 ald 853570..854681 (-) 1112 Protein_877 alanine dehydrogenase -
  E0F34_RS04645 (SAMEA104035599_00880) - 854858..855427 (+) 570 WP_044814470.1 GNAT family N-acetyltransferase -
  E0F34_RS04650 (SAMEA104035599_00881) comEA/celA/cilE 855495..856136 (+) 642 WP_044814472.1 helix-hairpin-helix domain-containing protein Machinery gene
  E0F34_RS04655 (SAMEA104035599_00882) comEC/celB 856120..858360 (+) 2241 WP_061384506.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  E0F34_RS04660 - 858539..858727 (+) 189 WP_044812623.1 hypothetical protein -
  E0F34_RS04665 (SAMEA104035599_00883) - 858760..859347 (+) 588 WP_000939884.1 ATP-binding cassette domain-containing protein -
  E0F34_RS04670 (SAMEA104035599_00884) - 859351..860532 (+) 1182 WP_000655951.1 membrane protein -
  E0F34_RS04675 infC 860839..861369 (+) 531 WP_000848180.1 translation initiation factor IF-3 -
  E0F34_RS04680 (SAMEA104035599_00886) rpmI 861402..861602 (+) 201 WP_001125943.1 50S ribosomal protein L35 -
  E0F34_RS04685 (SAMEA104035599_00887) rplT 861654..862013 (+) 360 WP_000124836.1 50S ribosomal protein L20 -
  E0F34_RS04690 (SAMEA104035599_00888) - 862071..862451 (+) 381 WP_000157154.1 VOC family protein -
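Two relationships can be read off the table above: the Location window appears to extend 5000 bp on each side of comEC/celB, and the upstream comEA/celA/cilE gene ends after comEC/celB begins. The sketch below checks both from the listed coordinates, assuming 1-based inclusive intervals; `overlap_bp` is a hypothetical helper introduced for illustration.

```python
def overlap_bp(a, b):
    """Return the number of bases shared by two inclusive (start, end) intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

# Coordinates copied from the Genomic Context table.
comEA = (855495, 856136)
comEC = (856120, 858360)
window = (851120, 863360)

# Distance from the gene to each edge of the displayed window.
flank_left = comEC[0] - window[0]
flank_right = window[1] - comEC[1]

# Bases shared by the two adjacent competence genes.
shared = overlap_bp(comEA, comEC)

print(flank_left, flank_right, shared)  # 5000 5000 17
```

The 17 bp overlap follows directly from the coordinates; whether it has any functional significance is not stated in this record.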

Sequence


Protein


Length: 746 a.a.
Molecular weight: 84698.36 Da
Isoelectric point: 9.3594

>NTDB_id=1126550 E0F34_RS04655 WP_061384506.1 856120..858360(+) (comEC/celB) [Streptococcus pneumoniae strain GPSC55 substr. ST3774 isolate b04a6400-1f66-11e7-b93e-3c4a9275d6c8]
MLQWIKNFPIPLIYMSFLLLWLYYAIFSVSYFALLGFVFLLVCLFIQFPWKSAGKVLVICGVFGFWFLFQTWQQSQVSQN
LVDSVERVRILPDTIKVNGDSLSFRGKSDGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFDYQGY
LKTQGIYQTLNIKRIQSLQKVGSWDIGENLSSLRRKAVVWIKMHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFALSFLYPVIQLNFIFEWLEGMIRLVSKVASRPLVFGQPNAWLLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDIGQGESIFLRDMTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEVTKAFHVGEILVSKDSLKQKEFVAELQVTQTKVRNVTAGENLPIFGSQLEVLSPRKIGDGDHEDSLVLYGKL
LDKYFLFTGNLEEKGERDLLKHYPDLKVNVLKASQQGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR

Nucleotide


Length: 2241 bp

>NTDB_id=1126550 E0F34_RS04655 WP_061384506.1 856120..858360(+) (comEC/celB) [Streptococcus pneumoniae strain GPSC55 substr. ST3774 isolate b04a6400-1f66-11e7-b93e-3c4a9275d6c8]
ATGTTACAGTGGATTAAAAATTTCCCCATTCCCCTGATTTACATGAGCTTTCTGTTACTTTGGCTTTACTACGCCATTTT
CTCAGTATCCTATTTTGCTTTGTTGGGTTTTGTTTTTCTGCTAGTCTGCCTCTTTATCCAATTTCCTTGGAAATCAGCAG
GTAAAGTTCTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAACTTGGCAACAGAGTCAAGTGAGTCAAAAT
CTGGTGGATTCTGTTGAAAGGGTACGGATTTTACCAGACACTATTAAGGTTAACGGTGACAGTCTGTCCTTTCGTGGTAA
GTCTGATGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACTGACC
TTCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTGACTACCAAGGCTAT
CTGAAGACTCAGGGAATTTACCAGACACTCAATATTAAAAGAATCCAGTCACTCCAAAAGGTTGGCAGTTGGGATATAGG
TGAAAACCTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGATGCACTTCCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTAGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTTTAGGAATTATTCACCTT
TTTGCCTTGTCAGGTATGCAGGTAGGGTTTTTCATGGACGGATTTAAGAAACTACTTTTACGATTGGGCTTGACACAAGA
AAAGTTGAAGTGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGTGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTGTCCTTCTATTTTGCGG
AATTTCAACCTTGGTCCATCCTCTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGCCCTTTCCTTTCTATATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGAATGATTCGCTTGGT
ATCAAAGGTGGCAAGTAGGCCTCTAGTCTTTGGACAACCCAATGCATGGCTTTTAATCTTGTTGTTAATTTCCTTGGCTT
TAGTATATGATTTGAGAAAAAATATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATATTGGGCAAGGAGAAAGTATTTTCCTACGGGATATGACTGGTAAAAC
CATTCTCATAGATGTCGGTGGCAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGTAACTCAAACCAAGGTGCGCAATGTGACAGCAGGAGAGAACTTGCCAATTTTTG
GAAGTCAGTTAGAAGTCCTATCTCCAAGGAAAATTGGAGATGGAGATCATGAAGATTCCCTTGTTCTATACGGGAAACTC
TTGGATAAATACTTTCTCTTCACAGGAAATTTGGAGGAGAAAGGAGAGAGGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACAAGGAAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
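As a quick consistency check between the two records, the first and last codons of the nucleotide sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch: the codon table is deliberately partial (only the codons needed here, taken from the standard genetic code), and `translate` is a hypothetical helper, not a library function.

```python
# Partial codon table: only the codons needed for this check.
CODONS = {"ATG": "M", "TTA": "L", "CAG": "Q", "TGG": "W",
          "GAA": "E", "AGT": "S", "GTT": "V", "CGA": "R", "TAG": "*"}

def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))

first_12_nt = "ATGTTACAGTGG"     # start of the nucleotide record
last_15_nt = "GAAAGTGTTCGATAG"   # end of the nucleotide record

n_term = translate(first_12_nt)
c_term = translate(last_15_nt)

print(n_term, c_term)  # MLQW ESVR*
```

The protein record begins with MLQW and ends with ESVR, so both ends agree with the nucleotide record, and the final TAG codon accounts for the three bases not represented in the 746-residue protein.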


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus pneumoniae TIGR4 94.102 100 0.941
  comEC/celB Streptococcus pneumoniae D39 93.968 100 0.94
  comEC/celB Streptococcus pneumoniae R6 93.968 100 0.94
  comEC/celB Streptococcus pneumoniae Rx1 93.968 100 0.94
  comEC/celB Streptococcus mitis SK321 91.957 100 0.92
  comEC/celB Streptococcus mitis NCTC 12261 91.946 99.866 0.918
  comEC Lactococcus lactis subsp. cremoris KW2 44.265 99.33 0.44
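The Ha-value column is not defined on this page. The listed figures are, however, consistent with the product of the identity and coverage fractions rounded to three decimals. The sketch below tests that reading against every row; the formula is an assumption inferred from the numbers, not a definition taken from the source, and `ha_value` is a hypothetical name.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed formula: identity fraction times coverage fraction, 3 decimals."""
    return round(identity_pct / 100 * coverage_pct / 100, 3)

# (organism, identities %, coverage %, listed Ha-value) copied from the table.
rows = [
    ("Streptococcus pneumoniae TIGR4", 94.102, 100, 0.941),
    ("Streptococcus pneumoniae D39", 93.968, 100, 0.94),
    ("Streptococcus pneumoniae R6", 93.968, 100, 0.94),
    ("Streptococcus pneumoniae Rx1", 93.968, 100, 0.94),
    ("Streptococcus mitis SK321", 91.957, 100, 0.92),
    ("Streptococcus mitis NCTC 12261", 91.946, 99.866, 0.918),
    ("Lactococcus lactis subsp. cremoris KW2", 44.265, 99.33, 0.44),
]

# True for each row where the assumed formula reproduces the listed value.
matches = [abs(ha_value(ident, cov) - listed) < 1e-9
           for _, ident, cov, listed in rows]

print(all(matches))
```

If the assumption holds, the value penalizes hits that are highly identical over only part of the query, which is why the NCTC 12261 row scores slightly below its identity percentage.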

