Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   C2E56_RS04140 Genome accession   NZ_CP026084
Coordinates   769570..771807 (+) Length   745 a.a.
NCBI ID   WP_000939914.1    Uniprot ID   -
Organism   Streptococcus agalactiae strain NJ1606     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 764570..776807
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C2E56_RS04115 (C2E56_04240) - 765165..766751 (+) 1587 WP_000673086.1 DEAD/DEAH box helicase -
  C2E56_RS04120 (C2E56_04245) - 766936..767202 (-) 267 WP_000598736.1 GIY-YIG nuclease family protein -
  C2E56_RS04125 (C2E56_04250) - 767195..767959 (-) 765 WP_000567424.1 tRNA1(Val) (adenine(37)-N6)-methyltransferase -
  C2E56_RS04130 (C2E56_04255) - 768094..768834 (+) 741 WP_000500221.1 lysophospholipid acyltransferase family protein -
  C2E56_RS04135 (C2E56_04260) - 768933..769586 (+) 654 WP_000461740.1 helix-hairpin-helix domain-containing protein -
  C2E56_RS04140 (C2E56_04265) comEC/celB 769570..771807 (+) 2238 WP_000939914.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  C2E56_RS04145 (C2E56_04270) - 771932..772741 (+) 810 WP_000153216.1 Cof-type HAD-IIB family hydrolase -
  C2E56_RS04150 (C2E56_04275) - 772752..773696 (+) 945 WP_000200826.1 LacI family DNA-binding transcriptional regulator -
  C2E56_RS04155 (C2E56_04280) - 773755..774747 (-) 993 WP_000800991.1 alpha/beta hydrolase fold domain-containing protein -
  C2E56_RS04160 (C2E56_04285) - 774923..775651 (+) 729 WP_000468967.1 methyltransferase domain-containing protein -
  C2E56_RS04165 (C2E56_04290) holA 775705..776742 (+) 1038 WP_017648383.1 DNA polymerase III subunit delta -

Sequence


Protein


Download         Length: 745 a.a.        Molecular weight: 85398.25 Da        Isoelectric Point: 9.7775

>NTDB_id=266951 C2E56_RS04140 WP_000939914.1 769570..771807(+) (comEC/celB) [Streptococcus agalactiae strain NJ1606]
MLQLTKYFPLKPIYLALLVFQIYLLVFSWTMLGCAFLLFTFIFLIYQYDRETIFKTIAIVIFFLFYFLWQNHNMNVQYQR
VPNHISQIEVRIDTISINGDVLSFQADASGNTYQAFYTLKNKSEKDYFQNLDNNIMIIADIKLEEAEERRHFNGFDYRQY
LKRHGIYRIAKVTKIKQIRLSQHRSFFALMSKWRRSAIVISQTFPNPMRHYMSGLLFGYLDKTFDNMSDLYSSLGIIHLF
ALSGMQVGFFLGIFRYICLRIGLRLDHVWLLQIPFSLIYAGLTGFSISVVRALIQSLLSHSGVKKDENFALCLLICLISL
PHSLLTTGGVLSFAYAFILTMTSFDHFSSIKKVAIESLTVSVGILPILTYYFSGFQPISIILTALLSFAFDIIFLPLLTV
IFVLSPIVKLGCINSLFEILEVLLKWTGQLFPRPLIFGKPSLFLLIVMIIILGLLYDYYHSKCFRYCSLLIIFTLFFITK
NPITNEVAILDVGQGDSILVRDWLGKTILIDTGGRLRFEQPEEWKQKVNQSNAERTLIPYLKSRGISKIDDLVITHTDTD
HMGDMEVISKHFKVARLITSSGSLTNSQYVKRLSKIGVAVKSIEAGDKLAVMGSYLQVLYPWHKGDGKNNDSIVLYGHLL
GKGFLFTGDLEEEGEKQLLEAYPNLSVDILKAGHHGSKGSSSLSFLKKLSPSVVLVSAGKNNRYQHPHQETLQRFQKIKS
KIFRTDQSGTIRLTGWWKWHIQTVR

Nucleotide


Download         Length: 2238 bp        

>NTDB_id=266951 C2E56_RS04140 WP_000939914.1 769570..771807(+) (comEC/celB) [Streptococcus agalactiae strain NJ1606]
ATGTTACAATTGACTAAGTATTTTCCTCTAAAACCTATTTATTTAGCATTGTTGGTCTTCCAAATTTACTTACTAGTGTT
TTCTTGGACAATGCTTGGTTGTGCCTTTCTTTTATTTACTTTTATTTTTCTGATTTATCAATATGATCGTGAAACTATTT
TTAAAACAATAGCAATAGTAATTTTTTTCTTATTTTATTTTTTATGGCAAAATCACAATATGAATGTCCAATATCAAAGA
GTACCGAATCATATTAGCCAGATTGAAGTGCGTATTGATACTATTTCTATCAATGGTGATGTTTTATCATTCCAGGCAGA
TGCTTCAGGTAACACTTATCAAGCTTTTTACACATTAAAAAATAAAAGTGAGAAAGATTATTTTCAAAATCTTGATAATA
ATATAATGATCATTGCAGATATCAAACTCGAAGAAGCAGAGGAGAGAAGGCATTTTAATGGCTTTGATTATCGTCAGTAT
TTAAAAAGACATGGAATTTATCGTATCGCCAAAGTGACAAAGATAAAACAGATACGCTTATCTCAACATAGGTCTTTCTT
TGCTCTTATGTCTAAGTGGCGTAGAAGTGCAATTGTTATTAGTCAAACTTTTCCAAATCCTATGCGTCACTATATGTCAG
GACTTTTGTTTGGATATCTAGATAAGACCTTTGATAACATGTCCGATTTATATAGTAGTCTAGGTATTATACATTTATTT
GCTTTGTCAGGTATGCAAGTAGGTTTTTTTCTCGGTATTTTTCGTTATATCTGTCTACGTATTGGCTTACGTCTAGACCA
TGTTTGGTTACTTCAAATACCATTCTCGCTAATTTATGCTGGTTTAACAGGCTTTAGTATCTCAGTCGTTAGGGCACTTA
TTCAATCTTTATTATCACATAGCGGTGTCAAGAAAGATGAGAACTTTGCTCTCTGCTTGTTAATTTGTCTTATCTCTCTC
CCCCACTCACTTTTGACTACGGGAGGAGTTCTTAGCTTTGCTTATGCTTTTATACTTACGATGACCTCCTTTGATCATTT
TTCGAGTATAAAAAAAGTAGCTATCGAATCTTTGACAGTCTCTGTAGGAATTCTTCCCATACTAACCTACTATTTTTCGG
GTTTTCAACCAATATCAATTATATTAACAGCACTTTTATCTTTTGCATTTGATATTATATTTTTGCCTTTATTAACTGTT
ATATTTGTCTTATCGCCTATCGTTAAATTAGGTTGTATTAATAGTTTGTTTGAAATCCTAGAAGTGTTATTAAAATGGAC
TGGGCAACTGTTTCCAAGGCCACTTATTTTTGGAAAGCCCAGCCTTTTTCTTTTAATAGTCATGATTATAATTTTGGGAT
TACTTTATGATTATTATCATTCTAAATGTTTTCGTTATTGCTCCCTTCTTATTATCTTTACCTTGTTTTTTATCACTAAG
AATCCAATTACTAACGAGGTCGCGATTTTAGATGTTGGACAAGGAGATAGTATTTTAGTGAGGGATTGGTTAGGAAAAAC
AATTTTAATTGATACTGGGGGAAGGTTGAGATTTGAACAGCCTGAAGAATGGAAACAAAAAGTAAATCAGTCTAATGCTG
AGAGAACGCTCATTCCTTACTTGAAAAGCAGAGGCATTAGCAAGATAGATGATTTAGTGATAACTCACACCGATACAGAT
CATATGGGGGATATGGAAGTTATCTCAAAGCATTTTAAAGTTGCACGTTTGATTACAAGTTCAGGTTCTTTAACGAATTC
GCAGTACGTTAAGCGTTTATCAAAGATAGGTGTAGCGGTAAAATCTATAGAAGCCGGTGATAAACTTGCTGTCATGGGAA
GTTATTTACAAGTACTTTACCCATGGCACAAGGGTGATGGAAAAAATAATGATTCAATTGTTTTATATGGACATTTATTA
GGAAAAGGCTTCTTATTTACCGGTGATTTGGAGGAAGAGGGAGAAAAGCAGTTATTAGAAGCTTATCCTAATTTATCAGT
AGATATCCTTAAAGCAGGACATCATGGTTCTAAGGGCTCATCAAGTCTATCCTTTCTGAAAAAGTTGTCTCCTAGTGTGG
TTCTAGTGTCAGCTGGTAAAAATAATCGTTACCAGCATCCTCATCAAGAGACTTTACAAAGGTTCCAAAAGATTAAAAGC
AAGATTTTCCGAACGGATCAATCAGGCACAATTAGGCTAACAGGATGGTGGAAGTGGCATATTCAGACAGTTCGTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus mitis SK321

48.602

100

0.49

  comEC/celB Streptococcus mitis NCTC 12261

48.4

100

0.487

  comEC/celB Streptococcus pneumoniae TIGR4

47.137

100

0.475

  comEC/celB Streptococcus pneumoniae Rx1

46.667

100

0.47

  comEC/celB Streptococcus pneumoniae D39

46.667

100

0.47

  comEC/celB Streptococcus pneumoniae R6

46.667

100

0.47

  comEC Lactococcus lactis subsp. cremoris KW2

44.145

99.732

0.44


Multiple sequence alignment