Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   R8619_RS04430 Genome accession   NZ_AP026918
Coordinates   863598..865838 (+) Length   746 a.a.
NCBI ID   WP_000942403.1    Uniprot ID   B1IBB4
Organism   Streptococcus pneumoniae strain PZ900700009     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 858598..870838
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  R8619_RS04395 (PC0009_08620) pyrH 858688..859425 (+) 738 WP_000002997.1 UMP kinase -
  R8619_RS04400 (PC0009_08630) frr 859434..859991 (+) 558 WP_000024409.1 ribosome recycling factor -
  R8619_RS04405 (PC0009_08640) cvfB 860051..860905 (+) 855 WP_001095445.1 RNA-binding virulence regulatory protein CvfB -
  R8619_RS04410 (PC0009_08650) - 860914..861129 (+) 216 WP_001232085.1 YozE family protein -
  R8619_RS04415 (PC0009_08660) - 861215..862219 (+) 1005 WP_000658185.1 PhoH family protein -
  R8619_RS04420 (PC0009_08670) - 862327..862896 (+) 570 WP_000443751.1 GNAT family N-acetyltransferase -
  R8619_RS04425 (PC0009_08680) comEA/celA/cilE 862964..863614 (+) 651 WP_000387336.1 ComEA family DNA-binding protein Machinery gene
  R8619_RS04430 (PC0009_08690) comEC/celB 863598..865838 (+) 2241 WP_000942403.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  R8619_RS04435 (PC0009_08700) - 866017..866205 (+) 189 WP_001809102.1 hypothetical protein -
  R8619_RS04440 (PC0009_08710) - 866237..866824 (+) 588 WP_000933542.1 ATP-binding cassette domain-containing protein -
  R8619_RS04445 (PC0009_08720) - 866828..868009 (+) 1182 WP_000655950.1 membrane protein -
  R8619_RS04450 (PC0009_08730) infC 868316..868846 (+) 531 WP_000848180.1 translation initiation factor IF-3 -
  R8619_RS04455 (PC0009_08740) rpmI 868879..869079 (+) 201 WP_001125943.1 50S ribosomal protein L35 -
  R8619_RS04460 (PC0009_08750) rplT 869131..869490 (+) 360 WP_000124836.1 50S ribosomal protein L20 -
  R8619_RS04465 (PC0009_08760) - 869548..869928 (+) 381 WP_000157154.1 VOC family protein -

Sequence


Protein


Download         Length: 746 a.a.        Molecular weight: 84530.99 Da        Isoelectric Point: 9.5181

>NTDB_id=97790 R8619_RS04430 WP_000942403.1 863598..865838(+) (comEC/celB) [Streptococcus pneumoniae strain PZ900700009]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWFLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
ENVGDLSEVTKAFHVGEILVSKDSLKQKEFVAELQTTQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR

Nucleotide


Download         Length: 2241 bp        

>NTDB_id=97790 R8619_RS04430 WP_000942403.1 863598..865838(+) (comEC/celB) [Streptococcus pneumoniae strain PZ900700009]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGTATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGTTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGAAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGAATGTTGGAGATTTGTCAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGACGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGACACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB B1IBB4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus pneumoniae TIGR4

98.123

100

0.981

  comEC/celB Streptococcus pneumoniae D39

97.855

100

0.979

  comEC/celB Streptococcus pneumoniae R6

97.855

100

0.979

  comEC/celB Streptococcus pneumoniae Rx1

97.855

100

0.979

  comEC/celB Streptococcus mitis NCTC 12261

91.946

99.866

0.918

  comEC/celB Streptococcus mitis SK321

91.689

100

0.917

  comEC Lactococcus lactis subsp. cremoris KW2

44.13

99.33

0.438


Multiple sequence alignment