Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   LK450_RS02095 Genome accession   NZ_CP085939
Coordinates   418625..420853 (+) Length   742 a.a.
NCBI ID   WP_003031687.1    Uniprot ID   A0A3S4QLI5
Organism   Streptococcus anginosus subsp. anginosus strain FDAARGOS_1569     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 413625..425853
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LK450_RS02070 (LK450_02070) rpmG 414479..414628 (+) 150 WP_003024746.1 50S ribosomal protein L33 -
  LK450_RS02075 (LK450_02075) secG 414668..414901 (+) 234 WP_003024744.1 preprotein translocase subunit SecG -
  LK450_RS02080 (LK450_02080) rnr 414993..417326 (+) 2334 WP_018543505.1 ribonuclease R -
  LK450_RS02085 (LK450_02085) smpB 417289..417756 (+) 468 WP_003031704.1 SsrA-binding protein SmpB -
  LK450_RS02090 (LK450_02090) comEA/celA/cilE 417937..418641 (+) 705 WP_003031685.1 helix-hairpin-helix domain-containing protein Machinery gene
  LK450_RS02095 (LK450_02095) comEC/celB 418625..420853 (+) 2229 WP_003031687.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  LK450_RS02100 (LK450_02100) - 420936..421502 (+) 567 WP_223349562.1 DUF805 domain-containing protein -
  LK450_RS02105 (LK450_02105) - 422133..423119 (+) 987 WP_003024723.1 Gfo/Idh/MocA family protein -
  LK450_RS02110 (LK450_02110) - 423141..423794 (+) 654 WP_018543504.1 uracil-DNA glycosylase -
  LK450_RS02115 (LK450_02115) - 423831..424301 (+) 471 WP_003031692.1 8-oxo-dGTP diphosphatase -
  LK450_RS02120 (LK450_02120) - 424314..425591 (+) 1278 WP_018543503.1 dihydroorotase -

Sequence


Protein


Download         Length: 742 a.a.        Molecular weight: 85366.30 Da        Isoelectric Point: 10.0727

>NTDB_id=621384 LK450_RS02095 WP_003031687.1 418625..420853(+) (comEC/celB) [Streptococcus anginosus subsp. anginosus strain FDAARGOS_1569]
MSQWIKRFPIKPIYIAFLLVWLYFAIYQSSWLGWLGFIFLVICLFRFYSPKECFMTFMILSCFAGFFFVRREIAEQKTKV
EPSPIRQVAVLPDTIKVNGDSLSFRGKANRQTYQVYYKLKSKEEQLAFQNLSSLVTLTVEGEFEIPEKKRNFAGFDYQSY
LKTQGIYRILKVDTILSSQDRISLHPFEWLSSWRRKALVFIKNHFPNPMSNYMTGLLFGALDTSFDEMSNLYSSLGIIHL
FALSGMQVGFFMEGFRKLLLRLGFTQEMVRKCQYPFSFFYAGMTGFSVSVVRSLVQKLLSQHGITKLDNFALTMMILSLI
MPSFLLSAGGVLSCAYAFVISVIDFESLTSWRKVVVESSVISLGVLPILIFYFGEFQPWSILLTFVFSLIFDTMMLPGLM
FIFLFSPLIKLTQVNLLFEGLENSIRWIASVFGKPIVFGQPSPLLLIVMLLVLAILYDIRQNKKWVIFLSLFLSLLFFIN
KFPLQNEITMVDVGQGDSIFLRDWKGNNVLIDVGGREEIRIKEAWQKRATSSNAEKTLIPYLKSRGIDTIDTLVLTNPNP
DYAGDVLKVVKKFAVKKIFISRSSLNDADFLNKLKETRTFVHVVKQGDKLPIFDHHLQVLSGTNKNDQSLVLYGQFFRTR
FLFMSNLTEEDEIKLMQLYPKLKTDVLKVGQHGAKNSSHSKFLQQIEPAVALISVGKNNQSKSPNQETIERLNRFNAKIY
RTDKQGAIKLSGWTKWQLETVQ

Nucleotide


Download         Length: 2229 bp        

>NTDB_id=621384 LK450_RS02095 WP_003031687.1 418625..420853(+) (comEC/celB) [Streptococcus anginosus subsp. anginosus strain FDAARGOS_1569]
ATGTCACAGTGGATTAAAAGATTTCCGATTAAGCCGATTTACATTGCTTTTTTGCTCGTCTGGTTGTATTTTGCAATCTA
TCAAAGTAGCTGGTTAGGCTGGTTGGGTTTTATCTTTCTGGTGATTTGTCTTTTTCGCTTTTATTCACCGAAAGAATGTT
TCATGACCTTCATGATCCTCTCTTGTTTTGCTGGTTTTTTCTTTGTTCGTAGGGAAATAGCAGAGCAGAAGACGAAAGTA
GAACCTTCTCCCATAAGACAAGTGGCAGTTCTACCTGACACGATTAAGGTAAATGGTGATTCGCTTTCTTTTCGTGGTAA
AGCTAATAGGCAGACTTATCAAGTTTACTACAAATTGAAATCAAAGGAAGAACAGTTGGCTTTTCAAAATCTCTCTAGTC
TGGTTACATTGACCGTTGAAGGGGAATTTGAAATCCCTGAGAAGAAGCGTAATTTTGCTGGTTTTGATTACCAATCCTAT
TTAAAAACGCAAGGGATTTATCGAATTTTAAAAGTGGATACCATTTTATCGAGCCAAGATAGAATCAGCTTGCACCCTTT
TGAGTGGCTTTCTAGCTGGCGAAGAAAAGCACTGGTATTTATCAAGAACCATTTTCCAAATCCGATGAGTAATTACATGA
CAGGACTCTTATTTGGTGCCTTGGATACATCCTTTGACGAAATGAGCAATCTTTATTCTAGCTTGGGAATTATTCATTTA
TTTGCGCTGTCTGGCATGCAAGTTGGCTTTTTTATGGAGGGATTTCGCAAGTTACTGTTAAGGCTGGGGTTTACACAAGA
AATGGTTCGTAAATGCCAATATCCATTTTCTTTCTTTTATGCGGGAATGACTGGATTTTCAGTATCCGTTGTACGGAGCT
TAGTTCAGAAATTATTATCGCAACATGGTATCACTAAGTTAGATAATTTTGCTTTAACGATGATGATATTGTCCTTGATT
ATGCCGTCCTTTCTTTTGTCGGCAGGAGGAGTTCTCTCCTGTGCTTATGCTTTTGTCATTAGTGTGATAGATTTTGAAAG
TCTGACTTCTTGGCGAAAAGTTGTTGTAGAGAGTAGCGTTATTTCACTTGGTGTTTTACCAATTCTAATCTTTTACTTTG
GTGAATTTCAACCTTGGTCTATTTTGTTGACATTTGTTTTTTCACTAATTTTCGATACAATGATGCTGCCAGGCTTGATG
TTTATTTTTCTTTTTTCGCCTTTGATAAAGCTGACTCAAGTCAATCTTTTATTTGAAGGTTTAGAAAATAGTATTCGTTG
GATAGCAAGTGTCTTTGGTAAACCAATCGTTTTTGGGCAGCCCAGTCCGCTTTTGCTGATTGTCATGTTACTTGTACTAG
CTATTTTGTACGATATTCGGCAAAATAAAAAGTGGGTAATATTTCTTAGTCTGTTTCTTTCATTACTATTTTTCATAAAT
AAATTTCCTTTGCAAAACGAGATCACAATGGTTGATGTCGGACAGGGAGATAGTATTTTTCTGAGAGATTGGAAAGGAAA
TAATGTATTGATTGATGTTGGTGGACGTGAGGAAATTAGAATAAAAGAAGCTTGGCAAAAACGAGCAACTAGCTCAAATG
CAGAGAAAACTTTGATTCCTTACCTAAAAAGTCGTGGCATAGATACGATTGATACTTTAGTCTTAACAAATCCTAATCCA
GATTATGCAGGAGATGTATTAAAAGTGGTTAAAAAGTTTGCGGTAAAGAAAATTTTTATTTCCAGAAGTAGTTTGAATGA
TGCAGACTTTTTAAATAAATTAAAGGAGACAAGGACGTTTGTTCATGTCGTAAAACAAGGAGACAAACTTCCTATTTTTG
ATCATCATTTGCAAGTTCTTTCTGGTACGAATAAGAATGATCAATCGCTAGTTTTATATGGTCAATTTTTTCGTACAAGG
TTTTTATTTATGAGCAATTTAACAGAAGAGGATGAAATAAAGCTAATGCAACTTTATCCAAAGCTAAAAACAGATGTCTT
GAAGGTTGGGCAACATGGAGCCAAAAATTCATCACATTCAAAATTTTTACAGCAAATAGAACCGGCAGTTGCACTCATTT
CTGTTGGGAAAAATAACCAATCCAAATCTCCGAATCAAGAGACGATAGAGCGATTGAATCGCTTCAATGCGAAGATTTAT
CGAACGGATAAACAAGGCGCTATCAAACTTTCAGGATGGACAAAATGGCAATTAGAAACAGTTCAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A3S4QLI5

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus mitis SK321

55.154

100

0.555

  comEC/celB Streptococcus mitis NCTC 12261

54.424

100

0.547

  comEC/celB Streptococcus pneumoniae TIGR4

53.949

100

0.543

  comEC/celB Streptococcus pneumoniae Rx1

53.146

100

0.535

  comEC/celB Streptococcus pneumoniae D39

53.146

100

0.535

  comEC/celB Streptococcus pneumoniae R6

53.146

100

0.535

  comEC Lactococcus lactis subsp. cremoris KW2

46.64

100

0.468