Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comYB
Type: Machinery gene
Locus tag: D2E16_RS00860
Genome accession: NZ_CP032064
Coordinates: 143433..144470 (+)
Length: 345 a.a.
NCBI ID: WP_225620794.1
UniProt ID: -
Organism: Streptococcus suis strain YSJ17
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
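
The coordinates, the 1038 bp gene size, and the 345 a.a. protein length in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the helper name `span_length` is hypothetical and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


# Coordinates of comYB (D2E16_RS00860) as listed in this record.
gene_bp = span_length(143433, 144470)

# Assumption: the reading frame covers the whole feature and ends in a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_bp // 3
protein_aa = codons - 1

print(gene_bp, codons, protein_aa)
```

Under these assumptions the result agrees with the lengths reported for the nucleotide and protein sequences.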

Genomic Context


Location: 138433..149470
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  D2E16_RS00830 (D2E16_00860) - 138794..139153 (+) 360 WP_192369874.1 DUF1033 family protein -
  D2E16_RS00835 (D2E16_00865) - 139457..140809 (-) 1353 WP_194472476.1 IS66-like element short variant transposase -
  D2E16_RS00840 (D2E16_00870) - 140763..140969 (-) 207 WP_002941576.1 hypothetical protein -
  D2E16_RS00845 (D2E16_00875) tnpB 141021..141371 (-) 351 WP_002941575.1 IS66 family insertion sequence element accessory protein TnpB -
  D2E16_RS00850 (D2E16_00880) - 141622..142485 (-) 864 Protein_133 S66 family peptidase -
  D2E16_RS00855 (D2E16_00885) comYA 142571..143521 (+) 951 WP_100664710.1 competence type IV pilus ATPase ComGA Machinery gene
  D2E16_RS00860 (D2E16_00890) comYB 143433..144470 (+) 1038 WP_225620794.1 competence type IV pilus assembly protein ComGB Machinery gene
  D2E16_RS00865 (D2E16_00895) comYC 144472..144753 (+) 282 WP_044765768.1 competence type IV pilus major pilin ComGC Machinery gene
  D2E16_RS00870 (D2E16_00900) comGD 144734..145141 (+) 408 WP_029173989.1 competence type IV pilus minor pilin ComGD -
  D2E16_RS00875 (D2E16_00905) comYE 145113..145406 (+) 294 WP_192369520.1 competence type IV pilus minor pilin ComGE Machinery gene
  D2E16_RS00880 (D2E16_00910) comGF/cglF 145393..145827 (+) 435 WP_044765773.1 competence type IV pilus minor pilin ComGF Machinery gene
  D2E16_RS00885 (D2E16_00915) comGG 145805..146332 (+) 528 WP_192369521.1 competence type IV pilus minor pilin ComGG -
  D2E16_RS00890 (D2E16_00920) comYH 146388..147341 (+) 954 WP_192369522.1 class I SAM-dependent methyltransferase Machinery gene
  D2E16_RS00895 (D2E16_00925) - 147392..148579 (+) 1188 WP_192369523.1 acetate kinase -
  D2E16_RS00900 (D2E16_00930) - 148839..149390 (+) 552 WP_099806699.1 folate family ECF transporter S component -
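
Several adjacent genes in this table have overlapping coordinates (for example comYA and comYB), and the listed location window appears to be the comYB coordinates extended by 5000 bp on each side. The sketch below checks both observations on a few rows copied from the table; the reduced row format, the regular expression, and the helper `overlap_bp` are illustrative assumptions rather than anything provided by the database.

```python
import re

# A few rows from the genomic context table, reduced to name, coordinates, strand.
rows = """\
comYA 142571..143521 (+)
comYB 143433..144470 (+)
comYC 144472..144753 (+)
comGD 144734..145141 (+)
"""

pattern = re.compile(r"(\S+) (\d+)\.\.(\d+) \(([+-])\)")
genes = [(m[1], int(m[2]), int(m[3])) for m in pattern.finditer(rows)]


def overlap_bp(a, b):
    """Number of shared positions between two 1-based, inclusive intervals."""
    return max(0, min(a[2], b[2]) - max(a[1], b[1]) + 1)


# Overlap between each gene and its downstream neighbour.
overlaps = {(a[0], b[0]): overlap_bp(a, b) for a, b in zip(genes, genes[1:])}

# Assumed flank size; it reproduces the "Location" line of this record.
flank = 5000
window = (143433 - flank, 144470 + flank)

print(overlaps)
print(window)
```

On these rows, comYA and comYB share 89 bp and comYC and comGD share 20 bp, while comYB and comYC do not overlap.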

Sequence


Protein


Length: 345 a.a.        Molecular weight: 39007.99 Da        Isoelectric point: 8.2334

>NTDB_id=313508 D2E16_RS00860 WP_225620794.1 143433..144470(+) (comYB) [Streptococcus suis strain YSJ17]
MNKLIAFLQQDISVFGRQKQKKLPLARQRKVIELFNNLFASGFHLGEIVDFLKRSQLLADPYTQVLSDGLLAGKPFSSLL
ADLRFSDAVVTQVALAEVHGNTSLSLSHIQSYLENVSKVRKKMIEVATYPIILLVFLLLIMLGLKNYLLPQLEEGNAATV
LINHLPTIFLSLCGLSLVAVLAGMVWFRKTSKIKVFSCLAALPFFGKLIQTYLTAYYAREWGSLIGQGLDLPQIVGLMQE
QQSQLFREIGQDLEQSLSNGQDFHEHLKTYAFFKRELSLIVEYGQVKSKLGSELTVYAAECWEDFFSWVNRAMQLIQPLV
FLFVALMVVLIYAAMLLPIYQNMEL

Nucleotide


Length: 1038 bp

>NTDB_id=313508 D2E16_RS00860 WP_225620794.1 143433..144470(+) (comYB) [Streptococcus suis strain YSJ17]
ATGAACAAATTGATCGCCTTTTTGCAGCAGGACATATCAGTCTTCGGCAGGCAGAAACAGAAAAAATTGCCCTTGGCTCG
CCAGCGTAAGGTCATTGAGCTTTTCAATAATCTTTTTGCTAGTGGTTTTCATCTGGGGGAGATTGTTGATTTCCTCAAAC
GCAGTCAGCTTCTGGCAGATCCCTATACCCAGGTCTTGTCGGATGGCCTTCTTGCAGGAAAGCCATTTTCAAGCCTATTA
GCGGATTTGCGGTTTTCAGATGCGGTGGTCACACAGGTGGCTTTGGCAGAGGTTCATGGCAATACTAGTCTGAGTTTGAG
CCATATCCAGTCCTATCTGGAAAATGTCAGCAAGGTTCGTAAAAAAATGATTGAGGTGGCGACCTATCCGATTATCTTAC
TGGTTTTTTTGCTCTTGATTATGCTGGGTTTGAAAAACTATCTTCTGCCCCAGTTGGAGGAAGGCAATGCAGCGACCGTG
CTAATTAATCATCTACCGACCATCTTTTTATCCCTCTGTGGACTTAGTTTGGTAGCGGTCTTGGCTGGTATGGTTTGGTT
TCGCAAAACTAGCAAAATCAAGGTCTTTTCCTGCTTAGCTGCTCTGCCATTTTTCGGAAAACTCATCCAAACCTATCTGA
CGGCCTATTACGCCAGGGAGTGGGGGAGTTTGATTGGGCAAGGTTTGGACCTGCCGCAGATTGTGGGCTTGATGCAAGAG
CAGCAGTCGCAGCTCTTTCGGGAGATTGGCCAGGACTTGGAGCAGTCGCTTTCCAATGGACAGGATTTTCACGAACACCT
CAAGACCTACGCCTTTTTCAAGCGAGAGCTGAGCCTCATCGTGGAGTATGGTCAGGTCAAGTCCAAGTTGGGGAGCGAGT
TGACAGTTTATGCAGCCGAGTGTTGGGAGGATTTTTTCTCTTGGGTCAATAGAGCCATGCAGCTGATTCAACCGCTGGTC
TTTCTCTTTGTGGCCTTAATGGTCGTTCTTATCTACGCAGCCATGTTGCTGCCGATTTATCAAAATATGGAGTTATAA
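
The nucleotide and protein records can be cross-checked by translating the coding sequence. The following is a minimal sketch that builds the standard genetic code (identical to the bacterial table for non-start codons) and translates the first 18 and last 12 nucleotides of the sequence above; the function name `translate` and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nt and last 12 nt of the comYB nucleotide sequence in this record.
head = translate("ATGAACAAATTGATCGCC")
tail = translate("ATGGAGTTATAA")

print(head, tail)
```

The translated fragments match the start (MNKLIA) and end (MEL) of the protein sequence listed above. Passing the full 1038 bp sequence to the same function would be the natural next step.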


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYB Streptococcus gordonii str. Challis substr. CH1 62.573 99.13 0.62
  comGB/cglB Streptococcus mitis NCTC 12261 60 97.101 0.583
  comYB Streptococcus mutans UA140 58.944 98.841 0.583
  comYB Streptococcus mutans UA159 58.944 98.841 0.583
  comGB/cglB Streptococcus mitis SK321 59.701 97.101 0.58
  comGB/cglB Streptococcus pneumoniae Rx1 59.403 97.101 0.577
  comGB/cglB Streptococcus pneumoniae D39 59.403 97.101 0.577
  comGB/cglB Streptococcus pneumoniae R6 59.403 97.101 0.577
  comGB/cglB Streptococcus pneumoniae TIGR4 59.403 97.101 0.577
  comGB Lactococcus lactis subsp. cremoris KW2 50.742 97.681 0.496
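
The Ha-values listed above appear to be consistent with the product of the identity and coverage fractions, rounded to three decimals. This is an observation from the numbers in this record, not a documented definition, and the helper `identity_coverage_product` below is hypothetical. A minimal sketch of the check on three of the hits:

```python
# (protein and organism, identity %, coverage %, listed Ha-value), copied from the table.
hits = [
    ("comYB, Streptococcus gordonii str. Challis substr. CH1", 62.573, 99.13, 0.62),
    ("comGB/cglB, Streptococcus mitis NCTC 12261", 60.0, 97.101, 0.583),
    ("comGB, Lactococcus lactis subsp. cremoris KW2", 50.742, 97.681, 0.496),
]


def identity_coverage_product(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage, both converted from percent to fractions."""
    return round(identity_pct / 100 * coverage_pct / 100, 3)


computed = [identity_coverage_product(i, c) for _, i, c, _ in hits]
listed = [ha for _, _, _, ha in hits]

for (name, _, _, ha), value in zip(hits, computed):
    print(f"{name}: listed {ha}, computed {value}")
```

If the assumption holds, the computed and listed values agree for every row, which makes the score a convenient single number for ranking hits by both similarity and alignment span.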

