Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comEC/celB
Type: Machinery gene
Locus tag: SSA_RS03620
Genome accession: NC_009009
Coordinates: 701530..703770 (+)
Length: 746 a.a.
NCBI ID: WP_011836678.1
UniProt ID: A3CLU7
Organism: Streptococcus sanguinis SK36
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
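
The coordinates, gene length, and protein length reported for this entry can be cross-checked with a short Python sketch. The helper name `parse_coordinates` is hypothetical, and the check assumes inclusive 1-based coordinates and a nucleotide span that includes the stop codon.

```python
import re


def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand).

    Hypothetical helper for illustration; assumes 1-based inclusive coordinates.
    """
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


start, end, strand = parse_coordinates("701530..703770 (+)")

# Inclusive span on the genome.
gene_length_bp = end - start + 1

# Assumption: the span covers every codon, including the terminal stop codon,
# so the protein is one residue shorter than the codon count.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(f"strand={strand} gene={gene_length_bp} bp protein={protein_length_aa} a.a.")
```

Under these assumptions the span gives 2241 bp and 746 residues, matching the lengths listed for the nucleotide and protein sequences.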

Genomic Context


Location: 696530..708770
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
SSA_RS03600 (SSA_0712) | - | 696851..699190 (-) | 2340 | WP_002912007.1 | cation-translocating P-type ATPase | -
SSA_RS03605 (SSA_0713) | - | 699348..700088 (+) | 741 | WP_002912008.1 | lysophospholipid acyltransferase family protein | -
SSA_RS03610 (SSA_0714) | - | 700161..700709 (+) | 549 | WP_002912009.1 | HXXEE domain-containing protein | -
SSA_RS03615 (SSA_0715) | comEA/celA/cilE | 700866..701546 (+) | 681 | WP_011836677.1 | helix-hairpin-helix domain-containing protein | Machinery gene
SSA_RS03620 (SSA_0716) | comEC/celB | 701530..703770 (+) | 2241 | WP_011836678.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene
SSA_RS03625 (SSA_0718) | - | 703901..704995 (+) | 1095 | WP_011836679.1 | DUF805 domain-containing protein | -
SSA_RS03630 (SSA_0720) | holA | 705246..706292 (+) | 1047 | WP_011836680.1 | DNA polymerase III subunit delta | -
SSA_RS03635 (SSA_0721) | sodA | 706355..706960 (+) | 606 | WP_002896081.1 | superoxide dismutase SodA | -
SSA_RS03640 (SSA_0722) | - | 707096..708175 (+) | 1080 | WP_317627780.1 | serine hydrolase | -
SSA_RS11705 (SSA_0723) | - | 708386..708526 (+) | 141 | WP_002897970.1 | hypothetical protein | -
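
The coordinates in the table show that comEA/celA/cilE and comEC/celB overlap slightly, while the next downstream gene is separated from comEC/celB by an intergenic gap. A minimal Python sketch of that arithmetic follows; the tuple layout and the function name `spacing_bp` are illustrative assumptions, and coordinates are treated as 1-based and inclusive.

```python
# (locus tag, gene name, start, end) for three adjacent '+' strand genes,
# copied from the genomic context table.
genes = [
    ("SSA_RS03615", "comEA/celA/cilE", 700866, 701546),
    ("SSA_RS03620", "comEC/celB", 701530, 703770),
    ("SSA_RS03625", "-", 703901, 704995),
]


def spacing_bp(upstream, downstream):
    """Bases strictly between two genes; a negative value is an overlap."""
    return downstream[2] - upstream[3] - 1


spacings = {}
for upstream, downstream in zip(genes, genes[1:]):
    gap = spacing_bp(upstream, downstream)
    spacings[(upstream[0], downstream[0])] = gap
    kind = "overlap" if gap < 0 else "gap"
    print(f"{upstream[0]} -> {downstream[0]}: {kind} of {abs(gap)} bp")
```

With the listed coordinates, the two machinery genes share 17 bp, and 130 bp separate comEC/celB from SSA_RS03625.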

Sequence


Protein


Length: 746 a.a.; Molecular weight: 85043.18 Da; Isoelectric point: 8.0480

>NTDB_id=27516 SSA_RS03620 WP_011836678.1 701530..703770(+) (comEC/celB) [Streptococcus sanguinis SK36]
MSQLIEKLPLAPIYLAFILVWAYFAVYSGSSLAYFGLGVLLLRLFFSYPLKKSLAALAFLSLFVLFFLLRREMAEQAFRQ
EPPSASSVQVLPDTIKVNGDSLSFRGRTNGRLYQVFYRIQSASEKEAFQQLTDLVVLDIEAEFEQAQQQRNFSGFDYQAY
LKSQGIYRTLKINKIQSLRPISSLNPLDWLSVWRRKALIYIRSNFLSPMSHYMTGLLFGDLDTEFAEMSNLYSSLGIIHL
FALSGMQVGFFLDGFRRILLRLGMKRETVNALQLPFSFIYAGLTGFSVSVVRSLVQKLLAQQGLTRLDNFALTLMILFII
MPNFLLTAGGVLSCAYAFLISMMDFEKLPPIRKLLTESLTISLGILPILIYYFSEFQPWSVILTFLFSLLFDLLILPGLT
LIFILSPLLKITQVNFFFDLLEDVIRWVADFAPRPLIFGKPNLWLLFALLLVLALIYDFRRKKSWLLSFSLLALLLFFLT
KHPLQNEITVVDIGQGDSIFLREWQGRTILIDVGGRVEIGKKEAWQERQTSSNAEKTLIPYLKSRGVASLDVLVLTHTDT
DHMGDMLEVAKHFSIKEIYVSKGSLTQPDFVQKLEQIESSVHVVEVGDEIPVFDSALQVLYPSGTGDGGNDDSIVLYGEF
FQTKFLFTGDLERQGEIELLQRFPQLKADVLKAGHHGSKGSSSPEFLEQIQPKLALISAGQNNRYRHPHQETLQRFEKFN
TTIYRTDQQGAIRLIGWDSWRIETVR

Nucleotide


Length: 2241 bp

>NTDB_id=27516 SSA_RS03620 WP_011836678.1 701530..703770(+) (comEC/celB) [Streptococcus sanguinis SK36]
ATGTCACAGTTGATTGAAAAGCTACCCCTCGCCCCTATCTATCTGGCTTTCATACTTGTATGGGCCTATTTTGCAGTCTA
TTCTGGCAGTTCCTTGGCTTATTTTGGGCTAGGTGTCCTTCTGCTTCGTCTCTTTTTCAGCTATCCTCTCAAAAAAAGTC
TAGCTGCTTTGGCATTCCTGTCCCTTTTTGTCCTCTTTTTCTTGCTTCGTCGGGAAATGGCAGAGCAGGCCTTTAGGCAG
GAACCTCCTTCAGCTAGCTCGGTTCAGGTCTTGCCAGATACGATCAAAGTCAATGGAGATTCCCTATCCTTTCGAGGGCG
AACGAACGGTCGGCTTTATCAGGTCTTCTACAGGATTCAATCAGCCAGTGAAAAGGAAGCTTTTCAGCAACTGACAGATT
TGGTCGTCCTTGATATTGAAGCAGAGTTTGAACAGGCCCAGCAGCAACGAAATTTTTCGGGCTTTGACTATCAGGCTTAT
TTAAAGAGTCAGGGAATTTACCGTACCTTAAAAATCAATAAGATTCAGTCACTAAGACCAATTTCCAGCCTGAATCCACT
GGATTGGCTGTCTGTCTGGCGCAGAAAAGCGCTGATCTATATTCGAAGTAATTTTCTCAGCCCTATGAGTCATTATATGA
CTGGACTCTTGTTTGGTGACTTAGATACGGAGTTTGCAGAAATGAGCAATCTGTATTCTAGCTTGGGTATCATCCACTTG
TTTGCCTTGTCTGGTATGCAGGTAGGTTTCTTTTTGGATGGTTTTCGCAGAATTCTTCTACGTTTGGGGATGAAGAGGGA
GACAGTAAACGCCTTGCAGCTGCCTTTTTCATTTATCTACGCTGGACTGACAGGCTTTTCTGTTTCTGTCGTTAGAAGCT
TAGTTCAAAAGCTATTGGCTCAGCAAGGACTGACCAGACTGGATAATTTTGCCCTGACACTGATGATTCTCTTTATCATC
ATGCCCAATTTTCTTCTGACAGCAGGCGGTGTCTTGTCCTGCGCTTATGCTTTTTTGATTTCTATGATGGATTTTGAAAA
GTTGCCGCCGATTCGGAAACTTTTAACGGAAAGTCTGACTATTTCATTGGGGATTTTGCCTATACTAATCTACTATTTTT
CTGAATTTCAGCCCTGGTCCGTGATTTTAACTTTTCTATTTTCACTACTCTTTGACCTGCTGATACTGCCAGGGTTAACC
CTCATTTTTATTTTGTCACCACTTTTAAAAATCACTCAAGTCAATTTTTTCTTTGACTTACTCGAAGATGTGATTCGTTG
GGTAGCTGATTTTGCTCCGCGCCCTTTGATTTTTGGGAAGCCCAACCTCTGGTTACTGTTCGCTTTGCTTTTGGTTTTGG
CTTTAATCTATGACTTTCGGAGAAAGAAAAGCTGGCTTTTATCTTTTAGTTTGCTTGCTCTTCTCTTATTTTTCCTAACA
AAGCATCCGCTGCAAAATGAGATTACAGTAGTGGATATTGGTCAGGGAGATAGCATTTTTCTCCGGGAATGGCAGGGCAG
GACGATTTTGATTGACGTTGGGGGACGAGTCGAGATAGGAAAGAAAGAGGCTTGGCAGGAGCGACAGACTTCATCCAATG
CAGAAAAGACCTTAATTCCCTATCTCAAAAGTCGTGGTGTTGCTAGTCTGGATGTATTGGTCCTGACTCATACCGATACC
GATCACATGGGAGATATGCTGGAGGTAGCTAAGCATTTTTCCATCAAGGAAATCTATGTTTCCAAAGGAAGTCTCACTCA
GCCTGATTTTGTTCAAAAGCTGGAGCAGATAGAAAGTTCTGTCCATGTCGTAGAAGTGGGAGATGAGATTCCGGTATTTG
ACTCAGCTTTGCAGGTTCTCTATCCTAGTGGGACAGGCGACGGTGGAAATGATGATTCGATTGTACTCTACGGGGAGTTC
TTTCAGACAAAATTTTTGTTTACAGGTGATTTGGAAAGGCAGGGAGAAATTGAACTGCTGCAGCGATTCCCTCAGCTGAA
AGCAGATGTCTTAAAAGCAGGACACCATGGCTCCAAAGGCTCTTCCAGTCCAGAGTTTTTAGAGCAAATTCAGCCGAAAC
TGGCCTTGATATCAGCTGGCCAGAATAATCGTTATCGACATCCGCATCAGGAAACTTTGCAAAGATTTGAGAAGTTTAAT
ACAACTATCTATCGAACTGACCAGCAGGGGGCCATTCGTTTGATTGGTTGGGATTCCTGGAGGATTGAAACAGTTCGTTG
A
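
As a consistency check between the two records, the ends of the nucleotide sequence can be translated and compared with the ends of the protein sequence. The sketch below builds the standard genetic code table in Python and translates only the first 21 and last 15 nucleotides; the function name `translate` is an illustrative choice, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 21 and last 15 nucleotides of the comEC/celB record above.
first_codons = "ATGTCACAGTTGATTGAAAAG"
last_codons = "GAAACAGTTCGTTGA"

print(translate(first_codons))
print(translate(last_codons))
```

The translated fragments agree with the start (MSQLIEK) and end (ETVR) of the protein record, and the final TGA is a stop codon.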


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source | ID
AlphaFold DB | A3CLU7

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comEC/celB | Streptococcus mitis SK321 | 57.028 | 100 | 0.571
comEC/celB | Streptococcus pneumoniae TIGR4 | 56.493 | 100 | 0.566
comEC/celB | Streptococcus mitis NCTC 12261 | 56.434 | 100 | 0.564
comEC/celB | Streptococcus pneumoniae Rx1 | 55.689 | 100 | 0.558
comEC/celB | Streptococcus pneumoniae D39 | 55.689 | 100 | 0.558
comEC/celB | Streptococcus pneumoniae R6 | 55.689 | 100 | 0.558
comEC | Lactococcus lactis subsp. cremoris KW2 | 47.745 | 100 | 0.483


Multiple sequence alignment