Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   UKS_RS04375 Genome accession   NZ_AP021887
Coordinates   847576..849816 (+) Length   746 a.a.
NCBI ID   WP_156011949.1    Uniprot ID   -
Organism   Streptococcus sp. 116-D4     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 842576..854816
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  UKS_RS04345 (UKS_08110) - 843079..843636 (+) 558 WP_049495610.1 GrpB family protein -
  UKS_RS04350 (UKS_08120) - 843682..843897 (+) 216 WP_001232084.1 YozE family protein -
  UKS_RS04355 (UKS_08130) - 843982..844956 (+) 975 WP_156011945.1 PhoH family protein -
  UKS_RS04360 (UKS_08140) ald 845020..846132 (-) 1113 WP_156011946.1 alanine dehydrogenase -
  UKS_RS04365 (UKS_08150) - 846306..846875 (+) 570 WP_156011947.1 GNAT family N-acetyltransferase -
  UKS_RS04370 (UKS_08160) comEA/celA/cilE 846942..847592 (+) 651 WP_156011948.1 helix-hairpin-helix domain-containing protein Machinery gene
  UKS_RS04375 (UKS_08170) comEC/celB 847576..849816 (+) 2241 WP_156011949.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  UKS_RS04380 (UKS_08180) - 849947..850195 (+) 249 WP_332066893.1 hypothetical protein -
  UKS_RS04385 (UKS_08190) - 850228..850815 (+) 588 WP_156011951.1 ABC transporter ATP-binding protein -
  UKS_RS04390 (UKS_08200) - 850819..852003 (+) 1185 WP_156011952.1 hypothetical protein -
  UKS_RS04395 (UKS_08210) - 852103..853359 (-) 1257 WP_173020495.1 ISL3 family transposase -
  UKS_RS04400 (UKS_08220) infC 853732..854262 (+) 531 WP_000848184.1 translation initiation factor IF-3 -
  UKS_RS04405 (UKS_08230) rpmI 854295..854495 (+) 201 WP_049496086.1 50S ribosomal protein L35 -

Sequence


Protein


Download         Length: 746 a.a.        Molecular weight: 84507.81 Da        Isoelectric Point: 9.3783

>NTDB_id=75354 UKS_RS04375 WP_156011949.1 847576..849816(+) (comEC/celB) [Streptococcus sp. 116-D4]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSTSKVLAICGIFGFWFLFQTWQQTQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSDGRIFQVYYKLQSEEEKETFQALTALHDLELEGKLSEPEGRRNFGGFDYQSY
LKTQGIYQTLNIKRIQSLQKAGSWDIGENLSSLRRKAVVWIKTKFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWMTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGFKGLDNFALTVLVLFIA
MPNFFLTAGGILSCAYAFILTMTSKEGEGLKAVTRESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNVIFEWLEGIIRLVSQVASRPLVFGQPTTWLLILLLVSLALLYDMRKNIKRLAGFSLFIVGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKVESDKKIEKWQEKATTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKEFVAELEASQTKVRSVTAGENLPIFGSQLEVLSPGKIGEVGSNDSLVLYGKL
LDKHFLFTENLEEKGEKDLLKQYPDLEVDVLKAGQHGSKKSSSSAFLEQLKPEITLISVGKSNRTKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGWKSWKIESIR

Nucleotide


Download         Length: 2241 bp        

>NTDB_id=75354 UKS_RS04375 WP_156011949.1 847576..849816(+) (comEC/celB) [Streptococcus sp. 116-D4]
ATGTTGCAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTTTGGCTTTACTACGCCATTTT
TTCAGCATCCTATCTTGCTTTATTGGGCTTTGTTTTTCTGCTAGTTTGTCTCTTTATTCAATTTCCTTGGAAATCTACTA
GCAAAGTTCTAGCAATTTGTGGAATCTTTGGATTTTGGTTTCTATTTCAAACTTGGCAGCAGACACAAGCTAGTCAGAAC
CTAGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGACACTATTAAGGTCAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAACCTTTCAAGCCTTAACAGCTC
TTCATGATTTGGAACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGAGGAGAAATTTTGGTGGCTTTGACTACCAATCCTAT
CTGAAAACTCAGGGAATTTACCAGACTCTCAATATCAAAAGAATCCAGTCGCTTCAAAAGGCTGGCAGTTGGGATATAGG
TGAAAACCTATCCAGTTTACGTCGAAAGGCTGTAGTTTGGATTAAGACAAAGTTTCCAGATCCTATGCGCAATTACATGA
CGGGGCTTCTATTAGGACATCTCGACACCGACTTCGAGGAAATGAATGAACTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCTTGTCGGGTATGCAGGTAGGCTTTTTCATGGATGGCTTTAAGAAACTTCTTTTGCGACTGGGGTTGACTCAAGA
AAAGTTGAAATGGATGACTTATCCCTTTTCTCTTATTTATGCAGGTCTGACAGGATTTTCAGCTTCGGTCATTCGCAGTC
TCTTGCAAAAGTTACTGGCTCAACATGGTTTTAAGGGCTTGGATAATTTTGCCTTGACAGTCCTTGTCCTCTTTATCGCC
ATGCCCAACTTTTTCCTGACGGCGGGAGGTATTTTGTCTTGTGCCTACGCTTTTATCTTGACCATGACCAGCAAAGAAGG
AGAGGGGCTTAAGGCTGTGACCAGAGAAAGTCTGGTTATTTCTTTGGGCATATTACCCATCCTATCCTTCTATTTTGCAG
AATTTCAACCTTGGTCCATCCTCTTGACCTTTGTCTTTTCCTTTCTATTTGACTTAGTCTTCTTACCGCTCTTGTCCATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAACTGAATGTTATCTTTGAATGGTTGGAAGGCATTATTCGCTTGGT
ATCACAGGTGGCAAGTAGACCCCTGGTCTTTGGTCAACCCACCACATGGCTTTTGATTCTTCTCTTAGTTTCATTAGCCT
TGCTCTATGATATGAGAAAAAATATCAAAAGACTAGCAGGATTTAGTCTCTTTATCGTGGGGCTCTTTTTCTTGACCAAG
CATCCACTGGAAAATGAAATTACCATGCTGGATGTGGGGCAAGGCGAAAGTATTTTCCTAAGGGATGTAACTGGTAAGAC
CATTCTCATAGATGTCGGTGGCAAGGTAGAATCTGATAAGAAAATCGAAAAATGGCAAGAAAAAGCGACAACCAGTAATG
CGCAGAGAACCTTGATTCCCTATCTTAAAAGTCGCGGAGTAGCCAAGATTGACCAGCTAATTTTGACCAACACAGACAAG
GAACATGTTGGAGATTTGTTAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTTTGGTATCAAAAGGAAGTTTGAAACA
GAAGGAATTTGTGGCGGAACTAGAAGCAAGCCAAACCAAGGTACGCAGTGTGACAGCAGGGGAGAATTTACCGATTTTTG
GCAGTCAGTTAGAAGTCCTATCTCCAGGGAAGATTGGAGAAGTTGGTTCCAATGATTCCTTGGTTCTTTATGGGAAACTC
TTGGATAAGCACTTTCTCTTCACGGAAAATTTGGAGGAGAAAGGAGAGAAGGATCTTCTAAAGCAATATCCTGACCTAGA
GGTGGATGTTTTGAAAGCTGGCCAACATGGCTCTAAAAAATCATCAAGTTCGGCCTTTTTAGAACAGCTTAAACCGGAGA
TCACTCTCATCTCAGTTGGAAAGAGCAATCGAACGAAACTCCCCCATCAGGAAACCCTGACCCGACTGGAAGGTATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGAGCTATACGATTTAAAGGTTGGAAGAGTTGGAAGATCGAAAGTATTCGATA
G


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus mitis SK321

92.091

100

0.921

  comEC/celB Streptococcus mitis NCTC 12261

90.201

99.866

0.901

  comEC/celB Streptococcus pneumoniae TIGR4

90.08

100

0.901

  comEC/celB Streptococcus pneumoniae D39

89.544

100

0.895

  comEC/celB Streptococcus pneumoniae Rx1

89.544

100

0.895

  comEC/celB Streptococcus pneumoniae R6

89.544

100

0.895

  comEC Lactococcus lactis subsp. cremoris KW2

44.669

99.33

0.444


Multiple sequence alignment