Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   SANR_RS04795 Genome accession   NC_022239
Coordinates   967321..969549 (+) Length   742 a.a.
NCBI ID   WP_020999585.1    Uniprot ID   -
Organism   Streptococcus anginosus C238     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 962321..974549
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SANR_RS11130 (SANR_0936) rpmG 963183..963332 (+) 150 WP_003034401.1 50S ribosomal protein L33 -
  SANR_RS04775 (SANR_0937) secG 963372..963605 (+) 234 WP_003024744.1 preprotein translocase subunit SecG -
  SANR_RS04780 (SANR_0938) rnr 963696..966035 (+) 2340 WP_003034491.1 ribonuclease R -
  SANR_RS04785 (SANR_0939) smpB 965998..966465 (+) 468 WP_003034458.1 SsrA-binding protein SmpB -
  SANR_RS04790 (SANR_0940) comEA/celA/cilE 966633..967337 (+) 705 WP_020999584.1 helix-hairpin-helix domain-containing protein Machinery gene
  SANR_RS04795 (SANR_0941) comEC/celB 967321..969549 (+) 2229 WP_020999585.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  SANR_RS04800 (SANR_0942) - 969630..970322 (+) 693 WP_020999586.1 DUF805 domain-containing protein -
  SANR_RS04805 (SANR_0943) - 970827..971813 (+) 987 WP_003034384.1 Gfo/Idh/MocA family protein -
  SANR_RS04810 (SANR_0944) - 971835..972488 (+) 654 WP_003034489.1 uracil-DNA glycosylase -
  SANR_RS04815 (SANR_0945) - 972525..972995 (+) 471 WP_003031692.1 NUDIX hydrolase -
  SANR_RS04820 (SANR_0946) - 973008..974285 (+) 1278 WP_003034545.1 dihydroorotase -

Sequence


Protein


Download         Length: 742 a.a.        Molecular weight: 85066.91 Da        Isoelectric Point: 9.9699

>NTDB_id=61675 SANR_RS04795 WP_020999585.1 967321..969549(+) (comEC/celB) [Streptococcus anginosus C238]
MSQWIKKFPIKPIYIAFLLVWLYFAIYQSSWLGWLGFIFLVIRLFRFYSPKECFMTFMILSCFVGFFFIRREMAERQANA
EPSPVKQIAVLPDTIKVNGDSLSFRGKTNGQTYQVYYKLKSKEEQLAFQNLSSLVILTVEGEFEIPEKKRNFAGFDYQSY
LKTQGIYRILKVDTILSSQDRISLHPFEWLSSWRRKALVFIKKHFPNPMSNYMTGLLFGALDTDFDEMNNLYSGLGIIHL
FALSGMQVGFFMEGFRKLLLRLGLTQEMVHKCQYPFSFFYAGMTGFSVSVVRSLVQKLLSQHGITKLDNFALTMMILSLI
MPSFLLTAGGVLSCAYAFVISVIDFESLTSWRKIVVESSVISLGVLPILIFYFGEFQPWSILLTFVFSLIFDTMMLPGLT
LIFLSSPLIKLTQVNFLFEGLENSIRWLASVFGRPIVFGQPSPLLLIVMLLVLAILYDIRKNKKWVIFLSLLLSLLFFIN
KFPLQNEITMVDVGQGDSIFLRDWKGRNVLIDVGGREEIRTKEAWQKRATSSNAEKTLIPYLKSRGIDTIDALVLTNPNP
DYAGDVLEVAKKFAIKKIYISRSSLSNADFLEKLRKINTFIHVVKQGDKLPIFDYHLQVLSGASKNDHSIVLYGQFFRTR
FLFASDLKEEGEAKLMQHYPKLKTDVLKVGQHGAKDSSSSKFLQQIEPTVALISVGKNNQSKQPSQDIIERFAQLPAKVY
RTDEQGAVKFSGWTNWRLEMVK

Nucleotide


Download         Length: 2229 bp        

>NTDB_id=61675 SANR_RS04795 WP_020999585.1 967321..969549(+) (comEC/celB) [Streptococcus anginosus C238]
ATGTCACAGTGGATTAAAAAATTTCCGATTAAGCCGATTTACATTGCTTTCTTACTCGTCTGGTTATATTTTGCAATCTA
TCAAAGTAGCTGGTTAGGCTGGTTGGGTTTTATCTTTCTGGTGATTCGTCTTTTTCGCTTTTATTCACCAAAAGAATGTT
TCATGACCTTCATGATTCTCTCTTGTTTTGTTGGTTTTTTCTTTATTCGTAGGGAAATGGCAGAGCGGCAAGCAAATGCA
GAACCTTCTCCCGTAAAACAAATAGCAGTTCTACCTGACACGATTAAGGTAAATGGAGATTCACTTTCTTTCCGTGGTAA
AACAAATGGGCAGACTTATCAAGTTTACTACAAATTGAAATCAAAGGAAGAACAGTTGGCTTTTCAAAATCTCTCTAGTC
TGGTTATATTGACCGTTGAAGGGGAATTTGAAATCCCTGAGAAGAAGCGTAATTTTGCTGGTTTTGATTACCAATCCTAT
TTAAAAACGCAAGGGATTTATCGAATTTTAAAAGTGGATACCATTTTATCGAGCCAAGACAGAATCAGCTTGCACCCTTT
TGAGTGGCTTTCTAGCTGGCGAAGAAAAGCACTGGTATTTATCAAGAAGCATTTTCCAAATCCGATGAGTAATTACATGA
CAGGACTCTTATTTGGTGCCTTGGATACAGACTTCGATGAAATGAACAATCTTTACTCTGGCTTGGGGATTATTCATTTA
TTTGCGCTGTCTGGCATGCAAGTTGGCTTTTTTATGGAGGGTTTTCGTAAGTTGCTGTTAAGGTTGGGTCTTACACAGGA
AATGGTCCATAAATGCCAATATCCATTTTCTTTCTTCTATGCGGGAATGACTGGATTTTCAGTATCCGTTGTACGAAGCT
TAGTTCAGAAATTATTGTCGCAACATGGTATCACTAAGTTAGATAATTTTGCTTTAACGATGATGATATTGTCCTTGATT
ATGCCTTCCTTTCTTTTAACGGCAGGAGGAGTTCTCTCTTGCGCCTATGCTTTTGTTATTAGTGTGATAGATTTTGAAAG
TCTGACTTCTTGGCGAAAGATTGTTGTAGAGAGCAGTGTCATTTCGCTTGGTGTTTTACCGATTCTAATCTTTTATTTTG
GTGAATTTCAACCTTGGTCTATTTTGTTGACATTTGTTTTTTCACTAATTTTCGATACAATGATGCTGCCAGGCTTGACG
TTGATTTTTCTTTCTTCGCCTTTGATAAAGCTAACTCAGGTCAATTTTCTATTTGAAGGGTTAGAAAATAGCATTCGTTG
GCTAGCAAGTGTCTTTGGCAGACCAATCGTTTTTGGGCAACCCAGTCCGCTTTTGCTGATTGTTATGCTACTTGTACTGG
CTATTTTGTATGATATTAGAAAAAATAAAAAATGGGTAATATTTCTTAGTCTGCTTCTTTCATTACTCTTTTTCATAAAT
AAATTTCCCTTGCAAAATGAGATCACAATGGTTGATGTTGGGCAGGGAGATAGTATTTTTCTGAGAGACTGGAAAGGACG
AAATGTATTGATTGATGTTGGGGGACGTGAGGAAATTAGAACAAAAGAAGCTTGGCAAAAACGAGCGACTAGCTCAAATG
CAGAGAAAACTTTGATTCCTTACTTAAAAAGTCGTGGCATAGATACGATTGATGCTTTAGTCTTAACAAATCCTAATCCA
GATTATGCAGGAGATGTATTAGAAGTTGCTAAAAAATTTGCAATAAAGAAAATTTATATTTCCAGAAGTAGTCTAAGCAA
TGCAGATTTTCTAGAGAAATTAAGGAAAATAAACACATTCATTCATGTTGTAAAACAAGGCGACAAACTTCCTATTTTTG
ATTATCATTTGCAAGTTCTTTCGGGTGCTAGCAAGAACGACCATTCGATCGTTTTATATGGTCAGTTTTTCCGTACAAGA
TTTTTATTTGCGAGCGACTTGAAAGAAGAGGGCGAAGCAAAGTTAATGCAGCATTATCCAAAGTTGAAAACAGATGTTTT
GAAAGTCGGACAGCATGGAGCCAAGGACTCATCAAGTTCAAAATTTTTACAACAAATAGAACCAACGGTCGCTCTCATTT
CTGTTGGGAAAAATAATCAATCTAAGCAACCAAGTCAAGATATAATAGAGCGATTTGCACAATTGCCTGCTAAGGTCTAT
CGGACAGATGAACAAGGGGCAGTTAAATTTTCAGGGTGGACAAACTGGCGATTAGAGATGGTCAAGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus mitis SK321

55.615

100

0.561

  comEC/celB Streptococcus mitis NCTC 12261

54.886

100

0.553

  comEC/celB Streptococcus pneumoniae TIGR4

54.545

100

0.55

  comEC/celB Streptococcus pneumoniae Rx1

53.61

100

0.54

  comEC/celB Streptococcus pneumoniae D39

53.61

100

0.54

  comEC/celB Streptococcus pneumoniae R6

53.61

100

0.54

  comEC Lactococcus lactis subsp. cremoris KW2

47.383

100

0.476


Multiple sequence alignment