Detailed information    

In silico: bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   SAIN_RS04470   Genome accession   NC_022244
Coordinates   904250..906478 (+)   Length   742 a.a.
NCBI ID   WP_037613581.1   UniProt ID   A0AAP6BQ17
Organism   Streptococcus anginosus C1051
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with a putative mobile genetic element (MGE) that VRprofile2 predicted in the genome, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
IScluster/Tn   902080..903431   904250..906478   flank   819
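
The distance in the summary above is consistent with a simple difference between the end of the MGE and the start of the gene. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates; the function name `gap_bp` is hypothetical, and the rule VRprofile2 uses to assign the "flank" label is not stated in this record, so it is not modelled here.

```python
def gap_bp(a_start, a_end, b_start, b_end):
    """Distance between two genomic intervals, or 0 if they overlap.

    Assumption: 1-based inclusive coordinates with start <= end, and
    distance measured as the difference between the nearest ends.
    """
    if a_end < b_start:
        return b_start - a_end
    if b_end < a_start:
        return a_start - b_end
    return 0


mge = (902080, 903431)    # IScluster/Tn coordinates from the table
gene = (904250, 906478)   # comEC/celB coordinates from the table

distance = gap_bp(*mge, *gene)
print(distance)  # 819, matching the Distance (bp) column
```

The function is symmetric, so the order in which the gene and the MGE are passed does not matter.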


Gene organization within MGE regions


Location: 902080..906478
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
SAIN_RS04465 (SAIN_0867)   comEA/celA/cilE   903562..904266 (+)   705   WP_003024731.1   helix-hairpin-helix domain-containing protein   Machinery gene
SAIN_RS04470 (SAIN_0868)   comEC/celB   904250..906478 (+)   2229   WP_037613581.1   DNA internalization-related competence protein ComEC/Rec2   Machinery gene
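
The Size (bp) column and the protein length reported for comEC/celB follow directly from the coordinates. This is a short sketch of the check, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the helper names `span_bp` and `protein_length` are hypothetical.

```python
def span_bp(start, end):
    """Length in bp of a 1-based inclusive interval."""
    return end - start + 1


def protein_length(start, end):
    """Amino-acid count of a CDS, assuming one terminal stop codon."""
    n = span_bp(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n // 3 - 1  # drop the stop codon


print(span_bp(903562, 904266))         # 705 (comEA/celA/cilE)
print(span_bp(904250, 906478))         # 2229 (comEC/celB)
print(protein_length(904250, 906478))  # 742, the reported a.a. length
```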

Sequence


Protein


Length: 742 a.a.   Molecular weight: 85164.95 Da   Isoelectric point: 10.0221

>NTDB_id=61767 SAIN_RS04470 WP_037613581.1 904250..906478(+) (comEC/celB) [Streptococcus anginosus C1051]
MSQWIKRFPIKPIYIAFLLVWLYFAIYQSSWLGWLGFIFLVICLFRFYSPKECFMTFMILSCFAGFFFVRREIAEQKTKV
EPSPIRQVAVLPDTIKVNGDSLSFRGKANRQTYQVYYKLKSKEEQLAFQNLSSLVTLTVEGEFEIPEKKRNFAGFDYQSY
LKTQGIYRILKVDTILSSQDRISLHPFEWLSSWRRKALVFIKNHFPNPMSNYMTGLLFGALDTSFDEMSNLYSSLGIIHL
FALSGMQVGFFMEGFRKLLLRLGFTQEMVRKCQYPFSFFYAGMTGFSVSVVRSLVQKLLSQHGITKLDNFALTMMILSLI
MPSFLLSAGGVLSCAYAFVISVIDFESLTSWRKVVVESSVISLGVLPILIFYFGEFQPWSILLTFVFSLIFDTMMLPGLM
FIFLFSPLIKLTQVNFLFEGLENSIRWIASVFGKPIVFGQPSPLLLIVMLLVLAILYDIRQNKKWVIFLSLFLSLLFFIN
KFPLQNEITMVDVGQGDSIFLRDWKGKNVLIDVGGREEIRTKEAWQKRATSSNAEKTLIPYLKSRGVDTIDTLVLTNPNP
DYAGDVLGVAKKFAIKKIYISRSSLSNADFLKKLRETNTFIHVIKQGDKLPIFDHHLQVLSGASKNDHSIVLYGQFFRTR
FLFASDLKEEEEAKLMQHYPKLKTDILKVGQHGAKSSSSSKFLQQIEPTVALISVGKNNQSKQPSQDTIERFAQLPAKVY
RTDEQGAVKFSGWTNWRLEMVK
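
The FASTA header of this record packs several fields into one line. The sketch below shows one way to split it, assuming every header follows the same layout as the one shown here (ID, locus tag, protein ID, coordinates with strand, gene name in parentheses, organism in square brackets); the regular expression and the `parse_header` helper are illustrative and are not part of the database.

```python
import re

# Assumed layout, inferred only from the header shown in this record.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line):
    """Split a header line into a dict of fields; raise if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognised header layout: " + line)
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=61767 SAIN_RS04470 WP_037613581.1 "
          "904250..906478(+) (comEC/celB) [Streptococcus anginosus C1051]")
record = parse_header(header)
print(record["locus_tag"], record["gene"], record["start"], record["end"])
```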

Nucleotide


Length: 2229 bp

>NTDB_id=61767 SAIN_RS04470 WP_037613581.1 904250..906478(+) (comEC/celB) [Streptococcus anginosus C1051]
ATGTCACAGTGGATTAAAAGATTTCCGATTAAGCCGATTTACATTGCTTTTTTGCTCGTCTGGTTGTATTTTGCAATCTA
TCAAAGTAGCTGGTTAGGCTGGTTGGGTTTTATCTTTCTGGTGATTTGTCTTTTTCGCTTTTATTCACCGAAAGAATGTT
TCATGACCTTCATGATCCTCTCTTGTTTTGCTGGTTTTTTCTTTGTTCGTAGGGAAATAGCAGAGCAGAAGACGAAAGTA
GAACCTTCTCCCATAAGACAAGTGGCAGTTCTACCTGACACGATTAAGGTAAATGGTGATTCGCTTTCTTTTCGTGGTAA
AGCTAATAGGCAGACTTATCAAGTTTACTACAAATTGAAATCAAAGGAAGAACAGTTGGCTTTTCAAAATCTCTCTAGTC
TGGTTACATTGACCGTTGAAGGGGAATTTGAAATCCCTGAGAAGAAGCGTAATTTTGCTGGTTTTGATTACCAATCCTAT
TTAAAAACGCAAGGGATTTATCGAATTTTAAAAGTGGATACCATTTTATCGAGCCAAGATAGAATCAGCTTGCACCCTTT
TGAGTGGCTTTCTAGCTGGCGAAGAAAAGCACTGGTATTTATCAAGAACCATTTTCCAAATCCGATGAGTAATTACATGA
CAGGACTCTTATTTGGTGCCTTGGATACATCCTTTGACGAAATGAGCAATCTTTATTCTAGCTTGGGAATTATTCATTTA
TTTGCGCTGTCTGGCATGCAAGTTGGCTTTTTTATGGAGGGATTTCGCAAGTTACTGTTAAGGCTGGGGTTTACACAAGA
AATGGTTCGTAAATGCCAATATCCATTTTCTTTCTTTTATGCGGGAATGACTGGATTTTCAGTATCCGTTGTACGGAGCT
TAGTTCAGAAATTATTATCGCAACATGGTATCACTAAGTTAGATAATTTTGCTTTAACGATGATGATATTGTCCTTGATT
ATGCCGTCCTTTCTTTTGTCGGCAGGAGGAGTTCTCTCCTGTGCTTATGCTTTTGTCATTAGTGTGATAGATTTTGAAAG
TCTGACTTCTTGGCGAAAAGTTGTTGTAGAGAGTAGCGTTATTTCACTTGGTGTTTTACCAATTCTAATCTTTTACTTTG
GTGAATTTCAACCTTGGTCTATTTTGTTGACATTTGTTTTTTCACTAATTTTCGATACAATGATGCTGCCAGGCTTGATG
TTTATTTTTCTTTTTTCGCCTTTGATAAAGCTGACTCAAGTCAATTTTTTATTTGAAGGTTTAGAAAATAGTATTCGTTG
GATAGCAAGTGTCTTTGGTAAACCAATCGTTTTTGGGCAGCCCAGTCCGCTTTTGCTGATTGTCATGTTACTTGTACTAG
CTATTTTGTACGATATTCGGCAAAATAAAAAGTGGGTAATATTTCTTAGTCTGTTTCTTTCATTACTATTTTTCATAAAT
AAATTTCCTTTGCAAAACGAGATCACAATGGTTGATGTTGGACAAGGAGATAGTATTTTTCTGAGAGACTGGAAAGGAAA
AAATGTATTGATTGATGTTGGCGGACGTGAGGAAATTAGAACAAAAGAAGCTTGGCAAAAACGAGCAACTAGCTCAAATG
CAGAGAAAACTTTGATTCCTTACCTAAAAAGTCGTGGCGTAGATACGATTGATACTTTAGTCTTAACAAATCCTAATCCA
GATTATGCAGGAGATGTATTAGGAGTTGCTAAAAAGTTTGCAATAAAGAAAATTTATATTTCCAGAAGTAGTCTAAGCAA
TGCAGATTTCCTAAAGAAATTAAGGGAGACAAATACATTCATTCATGTCATAAAACAAGGTGACAAACTTCCTATTTTTG
ATCATCATTTGCAAGTTCTTTCGGGTGCTAGCAAGAACGACCATTCGATTGTTTTATATGGTCAGTTTTTCCGTACAAGA
TTTTTATTTGCAAGCGATTTGAAAGAAGAAGAGGAAGCAAAGTTGATGCAACATTATCCAAAGTTGAAAACAGATATCTT
GAAAGTCGGGCAACATGGAGCTAAAAGTTCATCAAGTTCAAAGTTTTTGCAACAAATAGAACCAACGGTTGCGCTCATTT
CTGTTGGGAAAAATAATCAATCTAAGCAACCTAGTCAAGATACAATAGAGCGATTTGCACAATTGCCTGCTAAAGTCTAT
CGGACAGATGAACAAGGGGCAGTTAAATTTTCAGGGTGGACAAACTGGCGATTAGAGATGGTCAAGTAA
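
The nucleotide record can be checked against the protein record by translating it. The sketch below assumes the standard codon assignments and translates only the first 30 and last 15 bases copied from the sequence above, not the full gene; the `translate` helper is hypothetical, and a library routine such as Biopython's `Seq.translate` would be the usual choice in practice.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


five_prime = "ATGTCACAGTGGATTAAAAGATTTCCGATT"  # first 30 nt of the record
three_prime = "GAGATGGTCAAGTAA"                # last 15 nt, ending in TAA

print(translate(five_prime))   # MSQWIKRFPI, the start of the protein record
print(translate(three_prime))  # EMVK, the end of the protein record
```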


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comEC/celB   Streptococcus mitis SK321   54.618   100   0.55
comEC/celB   Streptococcus mitis NCTC 12261   54.155   100   0.544
comEC/celB   Streptococcus pneumoniae TIGR4   53.681   100   0.54
comEC/celB   Streptococcus pneumoniae Rx1   52.878   100   0.532
comEC/celB   Streptococcus pneumoniae D39   52.878   100   0.532
comEC/celB   Streptococcus pneumoniae R6   52.878   100   0.532
comEC   Lactococcus lactis subsp. cremoris KW2   46.774   100   0.469


Multiple sequence alignment