Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comYB
Type: Machinery gene
Locus tag: GU333_RS10465
Genome accession: NZ_CP047630
Coordinates: 2103900..2104985 (-)
Length: 361 a.a.
NCBI ID: WP_342591238.1
UniProt ID: -
Organism: Lactococcus raffinolactis strain Lr_18_12S
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 2098900..2109985
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GU333_RS10420 (GU333_10385) - 2099096..2099866 (-) 771 WP_245328130.1 metal ABC transporter ATP-binding protein -
  GU333_RS10425 (GU333_10390) - 2099866..2100315 (-) 450 WP_061774324.1 zinc-dependent MarR family transcriptional regulator -
  GU333_RS10430 (GU333_10395) - 2100392..2101240 (-) 849 WP_061774323.1 aminotransferase class IV -
  GU333_RS10435 (GU333_10400) ispE 2101324..2102175 (-) 852 WP_061774322.1 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase -
  GU333_RS10440 (GU333_10405) comGG 2102180..2102464 (-) 285 WP_096039677.1 competence type IV pilus minor pilin ComGG -
  GU333_RS10445 (GU333_10410) comGF 2102476..2102919 (-) 444 WP_244165991.1 competence type IV pilus minor pilin ComGF -
  GU333_RS10450 (GU333_10415) comGE 2102894..2103199 (-) 306 WP_082785376.1 competence type IV pilus minor pilin ComGE -
  GU333_RS10455 (GU333_10420) comGD 2103171..2103620 (-) 450 WP_167838308.1 competence type IV pilus minor pilin ComGD -
  GU333_RS10460 (GU333_10425) comYC 2103571..2103891 (-) 321 WP_061774318.1 competence type IV pilus major pilin ComGC Machinery gene
  GU333_RS10465 (GU333_10430) comYB 2103900..2104985 (-) 1086 WP_342591238.1 competence type IV pilus assembly protein ComGB Machinery gene
  GU333_RS10470 (GU333_10435) comGA/cglA/cilD 2104867..2105808 (-) 942 WP_061774317.1 competence type IV pilus ATPase ComGA Machinery gene
  GU333_RS10475 (GU333_10440) - 2105871..2107715 (-) 1845 WP_167842442.1 acyltransferase family protein -
  GU333_RS10480 (GU333_10445) - 2107712..2108074 (-) 363 WP_061774315.1 hypothetical protein -
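
The sizes in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which the listed sizes support) and that each CDS ends with a single stop codon; the function names are illustrative and not part of the database.

```python
# Rows copied from the Genomic Context table: (gene, start, end, listed size in bp).
rows = [
    ("comYC", 2103571, 2103891, 321),
    ("comYB", 2103900, 2104985, 1086),
    ("comGA/cglA/cilD", 2104867, 2105808, 942),
]


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    return cds_bp // 3 - 1


# Every listed size should equal end - start + 1.
for gene, start, end, size in rows:
    assert span_length(start, end) == size, gene

comyb_bp = span_length(2103900, 2104985)
comyb_aa = protein_length(comyb_bp)

# As listed, the comYB and comGA ranges overlap on the minus strand.
overlap_bp = span_length(2104867, 2104985)

print(comyb_bp, comyb_aa, overlap_bp)
```

With the values listed here, the comYB span works out to 1086 bp and 361 residues, matching the Overview, and the listed comYB and comGA ranges share 119 bp.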

Sequence


Protein


Length: 361 a.a.        Molecular weight: 41560.47 Da        Isoelectric point: 10.1508

>NTDB_id=414756 GU333_RS10465 WP_342591238.1 2103900..2104985(-) (comYB) [Lactococcus raffinolactis strain Lr_18_12S]
MLKKHSKGMPTTSGMPKLKNWLQKDISHLIRRQPKRLAIKQQVKLMQLMNNLFLSGFHLAEIVNFLSRSKLVEERFVATM
RDGLSNGQNLSQILNDLRFSNLVVTQVALADYHGDTQQKLTLIIDNLTKIQKVRKKLVAVSTYPIILLTFLIIIVLGLKT
YLLPQVETDSFAGTVINALPLIFIGLSLVLTFFALGLIRYFKSTSPLANVQWLGRLPLIRWYLKLYLTAFYAREWGNLVK
QGLEMRQILEMMTSQQNLLFREIGQDIRRAMQVGEEFHDKIKQYSFFTPELALIIEYGEMKGKLGDELLIYSDESWTKFF
EKVESAMNVIQPLVFLFVALMIVLIYAAMLLPIYAQMDLGM

Nucleotide


Length: 1086 bp

>NTDB_id=414756 GU333_RS10465 WP_342591238.1 2103900..2104985(-) (comYB) [Lactococcus raffinolactis strain Lr_18_12S]
GTGCTGAAAAAGCATTCGAAAGGCATGCCAACGACAAGTGGCATGCCAAAATTGAAAAATTGGTTACAGAAGGACATCTC
ACACCTGATCAGGCGACAGCCGAAAAGGTTAGCGATTAAGCAACAAGTGAAGCTGATGCAGTTGATGAATAACCTCTTTT
TGAGTGGTTTTCATTTAGCAGAAATTGTTAATTTTTTGAGCCGCTCGAAGCTAGTGGAGGAGAGATTTGTTGCGACTATG
CGAGATGGCTTATCGAATGGTCAAAATCTCTCACAAATCTTAAATGATTTACGTTTTTCAAACTTAGTCGTCACACAAGT
TGCTTTAGCAGATTACCATGGTGATACCCAGCAAAAACTGACATTAATAATAGATAATTTAACAAAAATTCAGAAGGTTC
GAAAAAAATTGGTGGCTGTATCGACCTATCCCATCATCTTGTTAACTTTTCTAATCATCATCGTACTGGGATTAAAAACC
TATTTGCTACCCCAAGTTGAAACGGATAGTTTTGCGGGCACTGTGATCAATGCCTTACCTTTGATTTTTATCGGGTTGAG
CCTCGTGTTAACATTTTTTGCGTTAGGACTGATTCGATATTTCAAATCAACATCACCATTGGCTAATGTCCAATGGTTAG
GTCGTCTTCCCTTGATCCGATGGTACTTAAAGCTTTATCTGACAGCGTTTTATGCGCGAGAGTGGGGAAACTTGGTTAAG
CAAGGTTTGGAGATGCGACAAATATTGGAGATGATGACCAGCCAACAGAATCTACTTTTTCGTGAAATTGGTCAAGATAT
AAGGCGGGCTATGCAAGTGGGGGAAGAATTTCACGACAAGATTAAGCAATACAGCTTTTTCACACCGGAGTTGGCCCTCA
TCATCGAATATGGTGAAATGAAGGGCAAATTAGGTGATGAATTACTCATCTATTCGGATGAATCCTGGACAAAATTTTTT
GAAAAGGTTGAAAGCGCCATGAATGTCATTCAGCCATTGGTTTTTTTGTTTGTCGCTTTGATGATTGTTTTAATCTATGC
AGCTATGTTGCTACCTATTTATGCTCAGATGGATCTGGGCATGTAG
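
The nucleotide record starts with GTG rather than ATG, yet the protein record starts with M. The sketch below illustrates why, assuming the standard codon assignments and the usual bacterial convention that an alternative start codon (GTG or TTG) is read as methionine at the initiation position. It translates only the first 24 nt and the last 15 nt of the sequence above; `translate` is an illustrative helper, not a database function.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: these alternative start codons initiate with methionine.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str, is_start: bool = False) -> str:
    """Translate an in-frame coding fragment; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if is_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"  # initiator codon is read as Met
    return "".join(residues)


head = "GTGCTGAAAAAGCATTCGAAAGGC"  # first 24 nt of the record
tail = "GATCTGGGCATGTAG"           # last 15 nt of the record

print(translate(head, is_start=True))
print(translate(tail))
```

The two fragments translate to MLKKHSKG and DLGM followed by a stop, which agrees with the first eight and last four residues of the protein sequence listed above.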


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.




Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein       Organism                                           Identities (%)   Coverage (%)   Ha-value
  comYB         Streptococcus mutans UA140                         50.725           95.568         0.485
  comYB         Streptococcus mutans UA159                         50.725           95.568         0.485
  comYB         Streptococcus gordonii str. Challis substr. CH1    50.294           94.183         0.474
  comGB         Lactococcus lactis subsp. cremoris KW2             48.68            94.46          0.46
  comGB/cglB    Streptococcus pneumoniae R6                        47.164           92.798         0.438
  comGB/cglB    Streptococcus pneumoniae TIGR4                     47.164           92.798         0.438
  comGB/cglB    Streptococcus mitis NCTC 12261                     47.164           92.798         0.438
  comGB/cglB    Streptococcus pneumoniae Rx1                       47.164           92.798         0.438
  comGB/cglB    Streptococcus pneumoniae D39                       47.164           92.798         0.438
  comGB/cglB    Streptococcus mitis SK321                          46.567           92.798         0.432
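
This page does not define the Ha-value. The listed values are numerically consistent with the product of the identity and coverage fractions, so the sketch below checks that relationship for the distinct rows above. Treat the formula as an assumption inferred from the numbers, not as the database's documented definition; `ha_value` is an illustrative name.

```python
# Distinct rows from the similar-proteins list:
# (protein, organism, identity %, coverage %, listed Ha-value)
hits = [
    ("comYB", "Streptococcus mutans UA140", 50.725, 95.568, 0.485),
    ("comYB", "Streptococcus gordonii str. Challis substr. CH1", 50.294, 94.183, 0.474),
    ("comGB", "Lactococcus lactis subsp. cremoris KW2", 48.68, 94.46, 0.46),
    ("comGB/cglB", "Streptococcus pneumoniae R6", 47.164, 92.798, 0.438),
    ("comGB/cglB", "Streptococcus mitis SK321", 46.567, 92.798, 0.432),
]


def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relationship: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100) * (coverage_pct / 100)


# Allow for the listed values being rounded to three decimals.
TOLERANCE = 0.001
mismatches = [
    hit for hit in hits
    if abs(ha_value(hit[2], hit[3]) - hit[4]) > TOLERANCE
]

for protein, organism, identity, coverage, listed in hits:
    print(f"{protein:12s} {ha_value(identity, coverage):.3f} (listed {listed})")
```

Under this assumption every row agrees with its listed value after rounding, so `mismatches` stays empty.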


Multiple sequence alignment