Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEC/celB
Type: Machinery gene
Locus tag: EL115_RS04050
Genome accession: NZ_LR134307
Coordinates: 812893..815121 (+)
Length: 742 a.a.
NCBI ID: WP_126441358.1
UniProt ID: -
Organism: Streptococcus milleri strain NCTC10708
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake
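The coordinates and the protein length in the overview are related by simple arithmetic. The sketch below checks that relationship. It assumes that the coordinates are 1-based and inclusive and that the coding sequence ends with a stop codon; the function names are illustrative and not part of the database.

```python
import re


def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    start, end, strand = match.groups()
    return int(start), int(end), strand


def gene_length_bp(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


def protein_length_aa(length_bp):
    # Assumption: complete coding sequence whose last codon is a stop codon.
    if length_bp % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    return length_bp // 3 - 1


start, end, strand = parse_coordinates("812893..815121 (+)")
length_bp = gene_length_bp(start, end)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 2229 742
```

Under these assumptions the result agrees with the 2229 bp and 742 a.a. reported for this entry.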

Genomic Context


Location: 807893..820121
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EL115_RS04025 rpmG 808753..808902 (+) 150 WP_080979224.1 50S ribosomal protein L33 -
  EL115_RS04030 (NCTC10708_00812) secG 808942..809175 (+) 234 WP_003024744.1 preprotein translocase subunit SecG -
  EL115_RS04035 (NCTC10708_00813) rnr 809268..811404 (+) 2137 Protein_773 ribonuclease R -
  EL115_RS04040 (NCTC10708_00814) smpB 811570..812037 (+) 468 WP_003034458.1 SsrA-binding protein SmpB -
  EL115_RS04045 (NCTC10708_00815) comEA/celA/cilE 812205..812909 (+) 705 WP_126441357.1 helix-hairpin-helix domain-containing protein Machinery gene
  EL115_RS04050 (NCTC10708_00816) comEC/celB 812893..815121 (+) 2229 WP_126441358.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  EL115_RS04055 (NCTC10708_00817) - 815201..815920 (+) 720 WP_126441359.1 DUF805 domain-containing protein -
  EL115_RS04060 (NCTC10708_00819) - 816365..817351 (+) 987 WP_126441360.1 Gfo/Idh/MocA family protein -
  EL115_RS04065 (NCTC10708_00820) - 817376..818029 (+) 654 WP_006269516.1 uracil-DNA glycosylase -
  EL115_RS04070 (NCTC10708_00821) - 818038..818508 (+) 471 WP_126441361.1 NUDIX hydrolase -
  EL115_RS04075 (NCTC10708_00822) - 818520..819794 (+) 1275 WP_126441362.1 dihydroorotase -
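The coordinates in the table also show how closely neighbouring genes are spaced. The sketch below computes the gap between consecutive genes for a few rows copied from the table, assuming 1-based inclusive coordinates; a negative value means the two genes overlap. The helper name `neighbor_gaps` is illustrative.

```python
# (label, start, end) copied from the table above; all on the + strand.
GENES = [
    ("smpB", 811570, 812037),
    ("comEA/celA/cilE", 812205, 812909),
    ("comEC/celB", 812893, 815121),
    ("EL115_RS04055", 815201, 815920),
]


def neighbor_gaps(genes):
    """Return (upstream, downstream, gap_bp) for each pair of adjacent genes.

    gap_bp counts the bases strictly between the two genes; a negative
    value is the size of the overlap in bp.
    """
    ordered = sorted(genes, key=lambda gene: gene[1])
    gaps = []
    for (name_a, _, end_a), (name_b, start_b, _) in zip(ordered, ordered[1:]):
        gaps.append((name_a, name_b, start_b - end_a - 1))
    return gaps


for upstream, downstream, gap in neighbor_gaps(GENES):
    print(f"{upstream} -> {downstream}: {gap} bp")
```

For these rows, comEA/celA/cilE and comEC/celB come out with a gap of -17, meaning the two annotated coding regions overlap by 17 bp.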

Sequence


Protein


Length: 742 a.a.; Molecular weight: 85443.50 Da; Isoelectric point: 10.0290

>NTDB_id=1120886 EL115_RS04050 WP_126441358.1 812893..815121(+) (comEC/celB) [Streptococcus milleri strain NCTC10708]
MSQWIKIFPIKPIYIAFLLVWLYFAIYQSSWLAWFCFIFLIIRLFVLYSPKKCFITLMFLACFAAFFFVRREMAEWQTKA
EPSSVRQVAVLPDTIKVNGDSLSFRGKANRQTYQIYYKLKSKEEQSAFQNLSSLVTLTVEGEFDIPEKQRNFSGFDYQAY
LKTQGIYRILKVDQILSSQDRVSFQPFEWLSSWRRKALVFIKRNFPNPMNHYMTGLLFGALETDFDEMSDLYSSLGIIHL
FALSGMQVGFFMEGFRKLLLKLGLTKEMVHKCQYPFSFFYAGMTGFSVSVVRSLIQKLLSQHGITKLDNFALTIMVLSLL
MPSFLLTAGGVLSCAYAFVISVIDFESLTSWRKVVVESSIISLGVLPILIFYFGEFQPWSILLTFVFSLIFDIVMLPGLT
LIFLISPFIKLIQVNFLFEGLENSIRWIANVFGRPIVFGQPSSLLLVVMLLVLAILYDVRKNKKWVILLSLCLAILFFIT
KFPLQNEITMVDVGQGDSLFLRDWKGRNVLIDVGGREEIRTKESWQKRTTKSNAEKTLIPYLKSRGIDTIDTLILTNPNA
DYAGDVLKVVKKFAVKKIFISRSSLNDADFLNKLKETKTFVHVVKQGDKLPIFDHHLQVLSGTNKNDQSLVLYGQFFRTR
FLFMSNLTEEDEVKLMQLYPKLKTDVLKVGQHGAKNSSHSKFLQQIEPAVALISVGKNNQSKSPNQEMIERLNRLNAKIY
RTDERGAIKFSGWTKWQLETVQ

Nucleotide


Length: 2229 bp

>NTDB_id=1120886 EL115_RS04050 WP_126441358.1 812893..815121(+) (comEC/celB) [Streptococcus milleri strain NCTC10708]
ATGTCACAGTGGATTAAGATATTTCCGATTAAACCAATTTACATTGCTTTCTTACTTGTCTGGCTATATTTTGCAATCTA
TCAAAGTAGCTGGTTGGCTTGGTTTTGTTTTATTTTTCTAATAATCCGTCTTTTTGTCCTTTATTCACCGAAAAAATGTT
TCATAACTTTAATGTTTCTCGCTTGTTTTGCTGCTTTTTTCTTTGTTCGTAGAGAAATGGCAGAGTGGCAAACAAAAGCA
GAACCTTCTTCTGTAAGACAAGTAGCAGTTCTACCTGACACGATTAAGGTAAATGGAGATTCGCTTTCTTTTCGCGGTAA
AGCAAATAGGCAGACTTATCAAATTTACTACAAATTGAAATCAAAAGAAGAACAGTCGGCTTTTCAAAATCTATCCAGTC
TTGTTACATTGACTGTTGAAGGAGAATTTGATATTCCTGAAAAGCAGCGCAATTTTTCTGGTTTTGATTATCAAGCCTAT
CTAAAAACACAAGGGATTTATCGAATTTTAAAGGTGGATCAAATTTTGTCTAGTCAAGATAGGGTTAGCTTCCAACCGTT
TGAGTGGCTGTCTAGCTGGCGAAGAAAGGCATTGGTTTTCATTAAGAGAAATTTTCCAAATCCGATGAATCATTACATGA
CAGGGCTCTTATTTGGCGCATTAGAGACAGATTTTGATGAAATGAGTGATCTTTATTCTAGCTTGGGAATTATTCATTTA
TTTGCGTTATCTGGCATGCAAGTTGGCTTTTTTATGGAAGGATTTCGTAAGTTGCTTTTGAAGCTGGGACTTACGAAGGA
AATGGTTCATAAATGCCAATATCCGTTTTCTTTCTTTTATGCAGGAATGACTGGATTTTCAGTATCCGTTGTACGGAGCT
TAATTCAGAAATTATTGTCGCAACATGGTATCACTAAATTAGATAATTTTGCTTTAACAATAATGGTGTTGTCCTTGCTT
ATGCCATCTTTTCTTTTAACGGCAGGAGGAGTTCTCTCCTGTGCCTATGCTTTTGTTATTAGTGTTATAGATTTTGAAAG
TCTGACTTCTTGGAGAAAAGTTGTGGTAGAGAGTAGTATCATTTCACTTGGTGTTTTACCGATTCTAATCTTTTATTTTG
GAGAATTTCAACCTTGGTCTATTTTATTGACATTTGTTTTTTCACTAATTTTTGATATAGTGATGCTACCGGGTTTAACA
CTGATTTTTCTCATTTCACCTTTCATAAAGCTCATTCAAGTAAATTTTCTTTTTGAAGGCTTAGAAAATAGTATTCGTTG
GATAGCAAATGTCTTTGGCAGACCAATCGTTTTTGGGCAACCTAGCTCGCTTTTATTAGTTGTTATGCTGCTTGTACTGG
CTATTTTGTATGATGTTAGAAAAAATAAAAAATGGGTGATATTGCTCAGCTTGTGTCTTGCTATACTATTTTTTATAACT
AAATTTCCTTTGCAAAATGAGATCACAATGGTTGATGTTGGACAGGGAGACAGTCTTTTTTTGAGAGACTGGAAAGGTAG
AAATGTATTAATTGACGTTGGAGGACGTGAAGAAATTAGAACAAAAGAATCTTGGCAAAAACGAACAACTAAATCAAATG
CGGAGAAAACTTTGATTCCTTACCTAAAAAGTCGTGGTATAGATACGATTGATACTCTAATCTTAACAAATCCTAATGCA
GATTATGCGGGAGATGTATTAAAAGTGGTTAAAAAGTTTGCGGTAAAGAAAATTTTTATTTCCAGAAGTAGTTTGAATGA
TGCAGACTTTTTAAATAAATTAAAGGAGACAAAGACGTTTGTTCATGTCGTAAAACAAGGAGACAAACTTCCTATTTTTG
ATCATCATTTGCAAGTTCTTTCTGGTACGAATAAGAATGATCAATCGCTAGTTTTATATGGTCAATTTTTTCGTACAAGG
TTTTTATTTATGAGCAATTTAACAGAAGAGGATGAAGTAAAGCTAATGCAACTTTATCCAAAGCTAAAAACAGATGTCTT
GAAGGTTGGGCAACATGGAGCCAAAAATTCATCACATTCAAAATTTTTACAGCAAATAGAACCGGCAGTTGCACTCATTT
CTGTTGGGAAAAATAACCAATCCAAATCTCCGAATCAAGAAATGATAGAGCGATTGAATCGCTTAAATGCGAAGATTTAT
CGAACAGATGAACGAGGGGCGATCAAGTTTTCAGGATGGACAAAATGGCAATTAGAAACTGTTCAATAG
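The nucleotide record can be checked against the protein record by translating it. The sketch below is a minimal translator that assumes the standard codon assignments (which the bacterial genetic code, NCBI table 11, shares) and a sequence that starts in frame. Only the first 48 and last 21 bases of the record are used here; the function and variable names are illustrative.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 48 bases and last 21 bases of the nucleotide record above.
first_bases = "ATGTCACAGTGGATTAAGATATTTCCGATTAAACCAATTTACATTGCT"
last_bases = "CAATTAGAAACTGTTCAATAG"

print(translate(first_bases))  # MSQWIKIFPIKPIYIA
print(translate(last_bases))   # QLETVQ*
```

Both fragments match the start and end of the protein sequence listed above, with the final codon TAG acting as the stop.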


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus mitis NCTC 12261 54.826 100 0.551
  comEC/celB Streptococcus mitis SK321 54.752 100 0.551
  comEC/celB Streptococcus pneumoniae TIGR4 53.949 100 0.543
  comEC/celB Streptococcus pneumoniae Rx1 53.146 100 0.535
  comEC/celB Streptococcus pneumoniae D39 53.146 100 0.535
  comEC/celB Streptococcus pneumoniae R6 53.146 100 0.535
  comEC Lactococcus lactis subsp. cremoris KW2 46.703 100 0.468


Multiple sequence alignment