Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEC/celB
Type: Machinery gene
Locus tag: SGO_RS07845
Genome accession: NC_009785
Coordinates: 1655520..1657760 (-)
Length: 746 a.a.
NCBI ID: WP_012130665.1
UniProt ID: A8AYL9
Organism: Streptococcus gordonii str. Challis substr. CH1
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
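
The coordinates and the protein length in this overview can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the helper names `span_length` and `protein_length` are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends with exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length(1655520, 1657760)
print(cds_bp, protein_length(cds_bp))  # 2241 746
```

Both values agree with the lengths reported in this record (2241 bp and 746 a.a.).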

Genomic Context


Location: 1650520..1662760
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SGO_RS07815 (SGO_1595) - 1650793..1651290 (-) 498 WP_012130659.1 DUF1648 domain-containing protein -
  SGO_RS07820 (SGO_1596) - 1651277..1651558 (-) 282 WP_012130660.1 autorepressor SdpR family transcription factor -
  SGO_RS07825 (SGO_1597) - 1651536..1652324 (-) 789 WP_012130661.1 YhfC family intramembrane metalloprotease -
  SGO_RS07830 (SGO_1598) - 1652529..1653602 (-) 1074 WP_012130662.1 serine hydrolase -
  SGO_RS07835 (SGO_1599) - 1653714..1654319 (-) 606 WP_012130663.1 superoxide dismutase -
  SGO_RS07840 (SGO_1600) holA 1654392..1655429 (-) 1038 WP_012130664.1 DNA polymerase III subunit delta -
  SGO_RS07845 (SGO_1601) comEC/celB 1655520..1657760 (-) 2241 WP_012130665.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  SGO_RS07850 (SGO_1602) comEA/celA/cilE 1657744..1658409 (-) 666 WP_012130666.1 helix-hairpin-helix domain-containing protein Machinery gene
  SGO_RS07855 (SGO_1603) - 1658506..1659033 (-) 528 WP_012130667.1 HXXEE domain-containing protein -
  SGO_RS07860 (SGO_1604) - 1659072..1659812 (-) 741 WP_041789898.1 lysophospholipid acyltransferase family protein -
  SGO_RS07865 (SGO_1605) - 1659942..1662311 (+) 2370 WP_012130669.1 cation-translocating P-type ATPase -
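
The Location window and the spacing between neighbouring genes can be derived from the coordinates in the table. The sketch below assumes 1-based inclusive coordinates and a 5,000 bp flank on each side of the gene; the flank size is inferred from the numbers shown here rather than stated by the source, and the function names are hypothetical.

```python
def flanking_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Gene coordinates extended by `flank` bp on each side (flank size is an assumption)."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two 1-based inclusive ranges (0 if disjoint)."""
    lo, hi = max(a[0], b[0]), min(a[1], b[1])
    return max(0, hi - lo + 1)


# Coordinates copied from the genomic context table
COMEC = (1655520, 1657760)
COMEA = (1657744, 1658409)
HOLA = (1654392, 1655429)

print(flanking_window(*COMEC))   # (1650520, 1662760)
print(overlap_bp(COMEC, COMEA))  # 17
print(overlap_bp(HOLA, COMEC))   # 0
```

Under these assumptions the comEA and comEC coding regions share 17 bp, while holA and comEC do not overlap.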

Sequence


Protein


Length: 746 a.a.        Molecular weight: 85524.30 Da        Isoelectric point: 9.8278

>NTDB_id=29081 SGO_RS07845 WP_012130665.1 1655520..1657760(-) (comEC/celB) [Streptococcus gordonii str. Challis substr. CH1]
MSQWIKKLPLAPIYLCFLLVWLYFAIYSGERLAYLGYSLLIARLIWHYPRKKWLPTLAILIAFSFFFYARRELAERAFQT
QPAPARQVLVLPDTVKVNGDSLSFRGRIDGRLYQLYYKLASPREKKTFQKLTDLVTLEIEAEFNLAEERRNFSGFDYQAY
LKSQGIYRTVKISRIMSSRSSQSTNPFDWLSVWRRKALVFIKSTFPSPMSHYMTGLLFGDLDIDFAEMNGLYSSLGIIHL
FALSGMQVGFFMDVFRKILLRIGLRMETVDWLQFPLSFLYAGLTGFSVSVVRSLVQKLLSQFGVRRLDNFAMTMLVLMLL
MPSFLLTTGGVLSCAYAFIITMLDFEDSSGFRKAVLESLSISLGILPLLIYYFAEFQPWSILLTFLFSIVFDTFMLPLLS
LIFLISPLFAFTQVNVFFQWLEMVIRWVASLSTRPIILGQPNLPLLIILLLVLALLYDFRKQKKIRAGLSLLAILLFFLS
KHPLQNEITVVDIGQGDSIFLRDVRGRTIVIDTGGRVEIGKKEAWQERVRKSNAETTLIPYLKSRGVDHLDQLVLTHTDT
DHMGDMLELAKHFSIREIYVSKGSMTQVDFVDKLKKMKAKVHVVEVGDRLPIFDSALEVLYPLGQGDGGNDDSIVLYGEF
FRTKFLFTGDLEAPGEGQMVTAYPDLRVDVLKAGHHGSKGSSSPEFLEHIKPKLALISAGKNNRYQHPHKETLDRFEKIQ
TKIFRTDEQGAIRFKGWNSWQIETVR

Nucleotide


Length: 2241 bp

>NTDB_id=29081 SGO_RS07845 WP_012130665.1 1655520..1657760(-) (comEC/celB) [Streptococcus gordonii str. Challis substr. CH1]
ATGTCACAGTGGATTAAAAAACTTCCCCTTGCTCCAATCTACTTGTGCTTTCTTTTGGTTTGGCTATACTTTGCTATTTA
TAGTGGTGAAAGACTCGCTTATCTAGGTTATTCCCTTCTTATAGCTCGCCTAATCTGGCATTATCCAAGAAAGAAATGGT
TGCCAACTTTAGCTATTCTAATTGCTTTCTCTTTCTTTTTTTATGCTAGACGTGAGTTGGCGGAACGAGCTTTTCAGACT
CAACCAGCTCCTGCAAGACAAGTTCTTGTTTTACCAGATACTGTTAAAGTGAATGGAGATTCCCTATCTTTCCGTGGTAG
AATAGACGGAAGACTTTATCAGCTCTACTACAAATTAGCAAGTCCAAGAGAGAAAAAAACTTTTCAAAAACTAACTGACT
TGGTCACTTTAGAGATTGAAGCAGAGTTCAATCTAGCAGAAGAACGGCGCAATTTCTCTGGCTTTGACTATCAGGCTTAT
TTAAAAAGTCAGGGAATCTATCGGACTGTTAAGATCAGTAGGATTATGTCTAGTCGTTCTAGTCAGTCTACTAATCCATT
TGATTGGCTGTCTGTTTGGCGTAGGAAGGCCTTGGTTTTCATTAAGTCTACTTTTCCAAGCCCGATGAGTCACTACATGA
CAGGACTCTTGTTTGGTGATTTAGATATTGATTTTGCAGAGATGAATGGCTTGTACTCAAGTTTAGGAATTATCCATCTT
TTTGCTTTATCAGGAATGCAAGTTGGTTTTTTTATGGATGTCTTTCGGAAAATTCTTCTGCGCATAGGCTTAAGGATGGA
AACGGTAGATTGGTTGCAATTCCCGTTGTCCTTTCTTTATGCAGGTTTGACTGGATTTTCTGTGTCAGTAGTGAGGAGTT
TAGTGCAAAAGTTGTTGTCTCAATTTGGAGTGAGGCGCTTGGATAATTTTGCGATGACCATGCTGGTCCTAATGCTTCTT
ATGCCAAGCTTTCTCCTGACAACAGGCGGAGTCCTATCTTGCGCTTATGCCTTTATCATCACTATGTTGGATTTTGAGGA
TTCTAGTGGTTTTCGTAAAGCCGTGCTAGAGAGTTTAAGTATCTCGTTAGGCATTTTACCACTTCTTATCTATTATTTTG
CAGAATTCCAGCCTTGGTCTATCCTCTTGACCTTCCTTTTTTCAATCGTCTTTGATACATTTATGTTACCTCTCTTGAGT
CTAATTTTTCTCATTTCGCCTTTGTTTGCCTTCACTCAAGTTAATGTCTTCTTCCAATGGCTGGAAATGGTGATTCGTTG
GGTAGCTAGTTTGTCAACAAGGCCGATAATTTTAGGACAGCCAAATCTGCCTTTGCTTATTATTCTCCTGCTGGTCTTAG
CCCTGCTCTATGACTTTAGAAAACAAAAAAAGATTAGAGCTGGCCTGAGTCTCTTAGCAATCTTACTATTTTTCCTAAGT
AAACATCCTTTACAAAATGAAATCACAGTGGTAGATATTGGGCAGGGAGACAGTATATTTCTGAGGGATGTCAGAGGTAG
GACTATCGTAATTGATACAGGGGGTCGGGTGGAGATTGGTAAAAAAGAGGCTTGGCAAGAGCGAGTTAGAAAAAGCAATG
CGGAAACGACCTTGATCCCTTATCTAAAAAGCCGAGGAGTGGACCACTTGGATCAATTAGTCCTGACTCACACAGATACA
GACCATATGGGAGATATGTTGGAGTTGGCCAAGCACTTTTCTATCCGAGAAATTTATGTTTCCAAAGGAAGTATGACTCA
AGTTGATTTTGTAGACAAGTTAAAGAAGATGAAAGCTAAAGTTCATGTGGTCGAGGTGGGAGATCGGCTTCCTATCTTTG
ATTCGGCCCTTGAAGTGCTCTATCCACTAGGTCAAGGAGATGGTGGTAATGATGACTCGATTGTCTTGTATGGTGAATTT
TTCCGGACCAAGTTTCTTTTCACAGGAGATTTAGAAGCTCCAGGTGAAGGTCAGATGGTGACAGCCTATCCAGATTTAAG
AGTAGATGTGCTCAAAGCTGGGCATCATGGTTCTAAAGGATCTTCTAGTCCAGAATTTCTAGAGCATATTAAGCCTAAGT
TGGCCTTGATTTCAGCTGGTAAAAACAACCGTTACCAACATCCCCATAAGGAAACTTTAGATAGATTCGAAAAAATCCAG
ACTAAGATTTTTCGGACAGATGAGCAGGGAGCTATTCGATTTAAAGGCTGGAATTCTTGGCAGATAGAAACGGTTCGCTA
G
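
The nucleotide record is given in coding orientation (it begins with ATG and ends with TAG) even though the gene lies on the minus strand, so it can be translated directly and compared with the protein sequence above. The following is a minimal sketch using the standard codon assignments; `translate` is an illustrative helper, not a tool used by the database, and unrecognised codons are rendered as `X` by assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...)
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()   # drop line breaks and spaces from FASTA text
    usable = len(cds) - len(cds) % 3     # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First and last codons of the record above
print(translate("ATGTCACAGTGGATTAAAAAACTT"))  # MSQWIKKL
print(translate("GAAACGGTTCGCTAG"))           # ETVR*
```

The two fragments match the start (MSQWIKKL) and end (ETVR) of the listed protein sequence; passing the full 2241 bp sequence to the same function allows a complete comparison.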


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A8AYL9

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



[Figure: predicted transmembrane probability]


Similar proteins


Only experimentally validated proteins are listed.

Protein      Organism                                 Identities (%)   Coverage (%)   Ha-value
comEC/celB   Streptococcus mitis SK321                58.233           100            0.583
comEC/celB   Streptococcus mitis NCTC 12261           57.641           100            0.576
comEC/celB   Streptococcus pneumoniae TIGR4           56.894           100            0.57
comEC/celB   Streptococcus pneumoniae Rx1             56.627           100            0.567
comEC/celB   Streptococcus pneumoniae D39             56.627           100            0.567
comEC/celB   Streptococcus pneumoniae R6              56.627           100            0.567
comEC        Lactococcus lactis subsp. cremoris KW2   50.269           99.732         0.501


Multiple sequence alignment