Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   H020_RS0104750 Genome accession   NZ_AKVY01000001
Coordinates   904546..906786 (+) Length   746 a.a.
NCBI ID   WP_000942394.1    Uniprot ID   A0A166WJ67
Organism   Streptococcus pneumoniae TIGR4     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 899546..911786
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  H020_RS13425 - 899872..900686 (+) 815 Protein_890 IS5-like element IS1515 family transposase -
  H020_RS0104725 - 900869..901369 (+) 501 WP_000566988.1 GNAT family N-acetyltransferase -
  H020_RS0104730 - 901372..901698 (+) 327 Protein_892 TfoX/Sxy family protein -
  H020_RS12725 ald 901999..903098 (-) 1100 Protein_893 alanine dehydrogenase -
  H020_RS0104740 - 903275..903844 (+) 570 WP_000443750.1 GNAT family N-acetyltransferase -
  H020_RS0104745 comEA/celA/cilE 903912..904562 (+) 651 WP_000387344.1 helix-hairpin-helix domain-containing protein Machinery gene
  H020_RS0104750 comEC/celB 904546..906786 (+) 2241 WP_000942394.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  H020_RS11820 - 906965..907153 (+) 189 WP_001808514.1 hypothetical protein -
  H020_RS0104755 - 907186..907773 (+) 588 WP_000939895.1 ATP-binding cassette domain-containing protein -
  H020_RS0104760 - 907777..908961 (+) 1185 WP_000655960.1 ABC transporter permease -
  H020_RS0104765 infC 909272..909802 (+) 531 WP_000848180.1 translation initiation factor IF-3 -
  H020_RS0104770 rpmI 909835..910035 (+) 201 WP_001125943.1 50S ribosomal protein L35 -
  H020_RS0104775 rplT 910087..910446 (+) 360 WP_000124836.1 50S ribosomal protein L20 -
  H020_RS0104780 - 910504..910884 (+) 381 WP_000157154.1 VOC family protein -

Sequence


Protein


Download         Length: 746 a.a.        Molecular weight: 84444.00 Da        Isoelectric Point: 9.6202

>NTDB_id=64927 H020_RS0104750 WP_000942394.1 904546..906786(+) (comEC/celB) [Streptococcus pneumoniae TIGR4]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMGNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVARRPLVFGQPNAWLLILLLISLALVYDLRKNIKGLTVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR

Nucleotide


Download         Length: 2241 bp        

>NTDB_id=64927 H020_RS0104750 WP_000942394.1 904546..906786(+) (comEC/celB) [Streptococcus pneumoniae TIGR4]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACTGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGGGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
CTCGCAGGTGGCAAGGAGACCACTTGTCTTTGGTCAACCCAACGCATGGCTTTTAATCTTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
TATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A166WJ67

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus pneumoniae TIGR4

100

100

1

  comEC/celB Streptococcus pneumoniae Rx1

97.587

100

0.976

  comEC/celB Streptococcus pneumoniae D39

97.587

100

0.976

  comEC/celB Streptococcus pneumoniae R6

97.587

100

0.976

  comEC/celB Streptococcus mitis SK321

92.225

100

0.922

  comEC/celB Streptococcus mitis NCTC 12261

91.812

99.866

0.917

  comEC Lactococcus lactis subsp. cremoris KW2

44.399

99.33

0.441


Multiple sequence alignment