Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   SPP_RS04745 Genome accession   NC_012467
Coordinates   880236..882476 (+) Length   746 a.a.
NCBI ID   WP_000942420.1    Uniprot ID   A0A4J1ZPJ8
Organism   Streptococcus pneumoniae P1031     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 875236..887476
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SPP_RS04710 (SPP_0955) - 875399..876367 (+) 969 WP_000658180.1 PhoH family protein -
  SPP_RS04715 (SPP_0956) - 876547..877047 (+) 501 WP_000566988.1 GNAT family N-acetyltransferase -
  SPP_RS04720 (SPP_0957) - 877050..877376 (+) 327 Protein_901 TfoX/Sxy family protein -
  SPP_RS11905 (SPP_0958) ald 877677..878788 (-) 1112 Protein_902 alanine dehydrogenase -
  SPP_RS04735 (SPP_0960) - 878965..879534 (+) 570 WP_000443745.1 GNAT family N-acetyltransferase -
  SPP_RS04740 (SPP_0961) comEA/celA/cilE 879602..880252 (+) 651 WP_000387352.1 helix-hairpin-helix domain-containing protein Machinery gene
  SPP_RS04745 (SPP_0962) comEC/celB 880236..882476 (+) 2241 WP_000942420.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  SPP_RS04750 - 882655..882843 (+) 189 WP_001812421.1 hypothetical protein -
  SPP_RS04755 (SPP_0963) - 882876..883463 (+) 588 WP_000939895.1 ATP-binding cassette domain-containing protein -
  SPP_RS04760 (SPP_0964) - 883467..884651 (+) 1185 WP_000655953.1 hypothetical protein -
  SPP_RS04765 (SPP_0965) infC 884964..885494 (+) 531 WP_000848180.1 translation initiation factor IF-3 -
  SPP_RS04770 (SPP_0966) rpmI 885527..885727 (+) 201 WP_001125943.1 50S ribosomal protein L35 -
  SPP_RS04775 (SPP_0967) rplT 885779..886138 (+) 360 WP_000124836.1 50S ribosomal protein L20 -
  SPP_RS04780 (SPP_0968) - 886196..886576 (+) 381 WP_000157154.1 VOC family protein -

Sequence


Protein


Download         Length: 746 a.a.        Molecular weight: 84639.13 Da        Isoelectric Point: 9.4311

>NTDB_id=33226 SPP_RS04745 WP_000942420.1 880236..882476(+) (comEC/celB) [Streptococcus pneumoniae P1031]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSVSYFALLGFVFLLVCLFIQFPWKSAGKVLVICGVFGFWFLFQTWQQSQVSQN
LVDSVERVRILPDTIKVNGDSLSFRGKSDGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFDYQGY
LKTQGIYQTLNIKRIQSFQKVGSWDIGENLSSLRRKAVVWIKMHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVARRPLVFGQPNAWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
ENVGDLSEVTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQQGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGVDSWKIESVR

Nucleotide


Download         Length: 2241 bp        

>NTDB_id=33226 SPP_RS04745 WP_000942420.1 880236..882476(+) (comEC/celB) [Streptococcus pneumoniae P1031]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTATGGCTTTACTACGCCATTTT
CTCAGTATCCTATTTTGCTTTGTTGGGTTTTGTTTTTCTGCTAGTCTGCCTCTTTATCCAATTTCCTTGGAAATCAGCAG
GTAAAGTTCTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAACTTGGCAACAGAGTCAAGTGAGTCAAAAT
TTGGTGGATTCTGTTGAAAGGGTACGGATTTTACCAGACACTATTAAGGTTAACGGTGACAGTCTGTCCTTTCGTGGTAA
GTCTGATGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACTGACC
TTCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTGACTACCAAGGCTAT
CTGAAGACTCAGGGAATTTACCAGACACTCAATATTAAAAGAATCCAGTCATTCCAAAAGGTTGGCAGTTGGGATATAGG
TGAAAACCTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGATGCACTTCCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTAGGACATCTGGACACCGATTTTGAGGAGATGAATGAGCTTTATTCCAGTTTAGGAATTATTCACCTT
TTTGCCTTGTCAGGTATGCAGGTAGGGTTTTTCATGGACGGATTTAAGAAACTACTTTTACGATTGGGCTTGACACAAGA
AAAGTTGAAGTGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGTGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGGAGACCGCTTGTCTTTGGTCAACCCAACGCATGGCTTTTAATCTTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
TATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGAATGTTGGAGATTTGTCAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACAAGGAAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGGTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A4J1ZPJ8

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus pneumoniae TIGR4

96.917

100

0.969

  comEC/celB Streptococcus pneumoniae Rx1

95.174

100

0.952

  comEC/celB Streptococcus pneumoniae D39

95.174

100

0.952

  comEC/celB Streptococcus pneumoniae R6

95.174

100

0.952

  comEC/celB Streptococcus mitis SK321

92.091

100

0.921

  comEC/celB Streptococcus mitis NCTC 12261

91.812

99.866

0.917

  comEC Lactococcus lactis subsp. cremoris KW2

44.534

99.33

0.442


Multiple sequence alignment