Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   ANG_RS05060 Genome accession   NZ_AP013072
Coordinates   996790..999018 (+) Length   742 a.a.
NCBI ID   WP_025271751.1    Uniprot ID   -
Organism   Streptococcus anginosus subsp. whileyi MAS624     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 991790..1004018
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ANG_RS10915 rpmG 992652..992801 (+) 150 WP_003034401.1 50S ribosomal protein L33 -
  ANG_RS05040 (ANG_1013) secG 992841..993074 (+) 234 WP_003024744.1 preprotein translocase subunit SecG -
  ANG_RS05045 (ANG_1014) rnr 993165..995504 (+) 2340 WP_003034491.1 ribonuclease R -
  ANG_RS05050 (ANG_1015) smpB 995467..995934 (+) 468 WP_003034458.1 SsrA-binding protein SmpB -
  ANG_RS05055 (ANG_1016) comEA/celA/cilE 996102..996806 (+) 705 WP_003034448.1 helix-hairpin-helix domain-containing protein Machinery gene
  ANG_RS05060 (ANG_1017) comEC/celB 996790..999018 (+) 2229 WP_025271751.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  ANG_RS05065 (ANG_1018) - 999099..999791 (+) 693 WP_020999586.1 DUF805 domain-containing protein -
  ANG_RS05070 (ANG_1020) - 1000296..1001282 (+) 987 WP_003034384.1 Gfo/Idh/MocA family protein -
  ANG_RS05075 (ANG_1021) - 1001304..1001957 (+) 654 WP_003034489.1 uracil-DNA glycosylase -
  ANG_RS05080 (ANG_1022) - 1001994..1002464 (+) 471 WP_003031692.1 NUDIX hydrolase -
  ANG_RS05085 (ANG_1023) - 1002477..1003754 (+) 1278 WP_003034545.1 dihydroorotase -

Sequence


Protein


Download         Length: 742 a.a.        Molecular weight: 85115.00 Da        Isoelectric Point: 9.9901

>NTDB_id=65518 ANG_RS05060 WP_025271751.1 996790..999018(+) (comEC/celB) [Streptococcus anginosus subsp. whileyi MAS624]
MSQWIKKFPIKPIYIAFLLVWLYFAIYQSSWLGWLGFIFLVIRLFRFYSPKECFMTFMILSCFVGFFFIRREMAERQANA
EPSPVKQIAVLPDTIKVNGDSLSFRGKTNGQTYQVYYKLKSKEEQLAFQNLSSLVILTVEGEFEIPEKKRNFAGFDYQSY
LKTQGIYRILKVDTILSSQYRISLHPFEWLSSWRRKALVFIKKHFPNPMSNYMTGLLFGALDTDFDEMNNLYSGLGIIHL
FALSGMQVGFFMEGFRKLLLRLGLTQEMVHKCQYPFSFFYAGMTGFSVSVVRSLVQKLLSQHGITKLDNFALTMMILSLI
MPSFLLTAGGVLSCAYAFVISVIDFESLTSWRKIVVESSVISLGVLPILIFYFGEFQPWSILLTFVFSLIFDTMMLPGLT
LIFLSSPLIKLTQVNFLFEGLENSIRWIASVFGRPIVFGQPSPLLLIVMLLVLAILYDIRKNKKWVIFLSLLLSLLFFIN
KFPLQNEITMVDVGQGDSIFLRDWKGRNVLIDVGGREEIRTKEAWQKRATSSNAEKTLIPYLKSRGIDTIDALVLTNPNP
DYAGDVLEVAKKFAIKKIYISRSSLSNADFLEKLRKINTFIHVVKQGDKLPIFDYHLQVLSGASKNDHSIVLYGQFFRTR
FLFASDLKEEGEAKLMQHYPKLKTDVLKVGQHGAKDSSSSKFLQQIEPTVALISVGKNNQSKQPSQDIIERFAQLPAKVY
RTDEQGAVKFSGWTNWRLEMVK

Nucleotide


Download         Length: 2229 bp        

>NTDB_id=65518 ANG_RS05060 WP_025271751.1 996790..999018(+) (comEC/celB) [Streptococcus anginosus subsp. whileyi MAS624]
ATGTCACAGTGGATTAAAAAATTTCCGATTAAGCCGATTTACATTGCTTTCTTACTCGTCTGGTTATATTTTGCAATCTA
TCAAAGTAGCTGGTTAGGCTGGTTGGGTTTTATCTTTCTGGTGATTCGTCTTTTTCGCTTTTATTCACCAAAAGAATGTT
TCATGACCTTCATGATTCTCTCTTGTTTTGTTGGTTTTTTCTTTATTCGTAGGGAAATGGCAGAGCGGCAAGCAAATGCA
GAACCTTCTCCCGTAAAACAAATAGCAGTTCTACCTGACACGATTAAGGTAAATGGAGATTCACTTTCTTTCCGTGGTAA
AACAAATGGGCAGACTTATCAAGTTTACTACAAATTGAAATCAAAGGAAGAACAGTTGGCTTTTCAAAATCTCTCTAGTC
TGGTTATATTGACCGTTGAAGGGGAATTTGAAATCCCTGAGAAGAAGCGTAATTTTGCTGGTTTTGATTACCAATCCTAT
TTAAAAACGCAAGGGATTTATCGAATTTTAAAAGTGGATACCATTTTATCGAGCCAATACAGAATCAGCTTGCACCCTTT
TGAGTGGCTTTCTAGCTGGCGAAGAAAAGCACTGGTATTTATCAAGAAGCATTTTCCAAATCCGATGAGTAATTACATGA
CAGGACTCTTATTTGGTGCCTTGGATACAGACTTCGATGAAATGAACAATCTTTACTCTGGCTTGGGGATTATTCATTTA
TTTGCGCTGTCTGGCATGCAAGTTGGCTTTTTTATGGAGGGTTTTCGTAAGTTGCTGTTAAGGTTGGGTCTTACACAGGA
AATGGTCCATAAATGCCAATATCCATTTTCTTTCTTCTATGCGGGAATGACTGGATTTTCAGTATCCGTTGTACGAAGCT
TAGTTCAGAAATTATTGTCGCAACATGGTATCACTAAGTTAGATAATTTTGCTTTAACGATGATGATATTGTCCTTGATT
ATGCCTTCCTTTCTTTTAACGGCAGGAGGAGTTCTCTCTTGCGCCTATGCTTTTGTTATTAGTGTGATAGATTTTGAAAG
TCTGACTTCTTGGCGAAAGATTGTTGTAGAGAGCAGTGTCATTTCGCTTGGTGTTTTACCGATTCTAATCTTTTATTTTG
GTGAATTTCAACCTTGGTCTATTTTGTTGACATTTGTTTTTTCACTAATTTTCGATACAATGATGCTGCCAGGCTTGACG
TTGATTTTTCTTTCTTCGCCTTTGATAAAGCTAACTCAGGTCAATTTTCTATTTGAAGGGTTAGAAAATAGCATTCGTTG
GATAGCAAGTGTCTTTGGCAGACCAATCGTTTTTGGGCAACCCAGTCCGCTTTTGCTGATTGTTATGCTACTTGTACTGG
CTATTTTGTATGATATTAGAAAAAATAAAAAATGGGTAATATTTCTTAGTCTGCTTCTTTCATTACTCTTTTTCATAAAT
AAATTTCCCTTGCAAAATGAGATCACAATGGTTGATGTTGGGCAGGGAGATAGTATTTTTCTGAGAGACTGGAAAGGACG
AAATGTATTGATTGATGTTGGGGGACGTGAGGAAATTAGAACAAAAGAAGCTTGGCAAAAACGAGCGACTAGCTCAAATG
CAGAGAAAACTTTGATTCCTTACTTAAAAAGTCGTGGCATAGATACGATTGATGCTTTAGTCTTAACAAATCCTAATCCA
GATTATGCAGGAGATGTATTAGAAGTTGCTAAAAAATTTGCAATAAAGAAAATTTATATTTCCAGAAGTAGTCTAAGCAA
TGCAGATTTTCTAGAGAAATTAAGGAAAATAAACACATTCATTCATGTTGTAAAACAAGGCGACAAACTTCCTATTTTTG
ATTATCATTTGCAAGTTCTTTCGGGTGCTAGCAAGAACGACCATTCGATCGTTTTATATGGTCAGTTTTTCCGTACAAGA
TTTTTATTTGCGAGCGACTTGAAAGAAGAGGGCGAAGCAAAGTTAATGCAGCATTATCCAAAGTTGAAAACAGATGTTTT
GAAAGTCGGACAGCATGGAGCCAAGGACTCATCAAGTTCAAAATTTTTACAACAAATAGAACCAACGGTCGCTCTCATTT
CTGTTGGGAAAAATAATCAATCTAAGCAACCAAGTCAAGATATAATAGAGCGATTTGCACAATTGCCTGCTAAGGTCTAT
CGGACAGATGAACAAGGGGCAGTTAAATTTTCAGGGTGGACAAACTGGCGATTAGAGATGGTCAAGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus mitis SK321

55.615

100

0.561

  comEC/celB Streptococcus mitis NCTC 12261

54.886

100

0.553

  comEC/celB Streptococcus pneumoniae TIGR4

54.545

100

0.55

  comEC/celB Streptococcus pneumoniae Rx1

53.61

100

0.54

  comEC/celB Streptococcus pneumoniae D39

53.61

100

0.54

  comEC/celB Streptococcus pneumoniae R6

53.61

100

0.54

  comEC Lactococcus lactis subsp. cremoris KW2

47.383

100

0.476


Multiple sequence alignment