Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGC   Type   Machinery gene
Locus tag   CEQ33_RS02030 Genome accession   NZ_CP022093
Coordinates   303210..303521 (+) Length   103 a.a.
NCBI ID   WP_011303023.1    Uniprot ID   -
Organism   Staphylococcus saprophyticus strain FDAARGOS_355     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 298210..308521
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CEQ33_RS02000 (CEQ33_02000) - 298948..299145 (+) 198 WP_041080619.1 YqgQ family protein -
  CEQ33_RS02005 (CEQ33_02005) - 299149..300135 (+) 987 WP_011303019.1 glucokinase -
  CEQ33_RS02010 (CEQ33_02010) - 300135..300461 (+) 327 WP_002483185.1 MTH1187 family thiamine-binding protein -
  CEQ33_RS02015 (CEQ33_02015) - 300461..301084 (+) 624 WP_011303020.1 MBL fold metallo-hydrolase -
  CEQ33_RS02020 (CEQ33_02020) comGA 301172..302146 (+) 975 WP_011303021.1 competence type IV pilus ATPase ComGA Machinery gene
  CEQ33_RS02025 (CEQ33_02025) comGB 302118..303182 (+) 1065 WP_011303022.1 competence type IV pilus assembly protein ComGB -
  CEQ33_RS02030 (CEQ33_02030) comGC 303210..303521 (+) 312 WP_011303023.1 competence type IV pilus major pilin ComGC Machinery gene
  CEQ33_RS02035 (CEQ33_02035) comGD 303502..303957 (+) 456 WP_225791589.1 competence type IV pilus minor pilin ComGD -
  CEQ33_RS02045 (CEQ33_02045) comGF 304215..304658 (+) 444 WP_011303025.1 competence type IV pilus minor pilin ComGF -
  CEQ33_RS02055 (CEQ33_02055) - 304862..305335 (+) 474 WP_002483192.1 shikimate kinase -
  CEQ33_RS02060 (CEQ33_02060) gcvT 305519..306610 (+) 1092 WP_011303027.1 glycine cleavage system aminomethyltransferase GcvT -
  CEQ33_RS02065 (CEQ33_02065) gcvPA 306628..307980 (+) 1353 WP_011303028.1 aminomethyl-transferring glycine dehydrogenase subunit GcvPA -

Sequence


Protein


Download         Length: 103 a.a.        Molecular weight: 11372.45 Da        Isoelectric Point: 9.1187

>NTDB_id=236603 CEQ33_RS02030 WP_011303023.1 303210..303521(+) (comGC) [Staphylococcus saprophyticus strain FDAARGOS_355]
MKKLKINRNAFTLIEMLLVLLIISLLLILIIPNIAKQSSHLQNTGCEAQLKMIDSQIEAYALKFNKKPSSIEDLVTEGYI
KENQKSCKSGAAITINNGEAVAN

Nucleotide


Download         Length: 312 bp        

>NTDB_id=236603 CEQ33_RS02030 WP_011303023.1 303210..303521(+) (comGC) [Staphylococcus saprophyticus strain FDAARGOS_355]
ATGAAAAAACTAAAAATTAATAGAAATGCTTTTACATTAATAGAAATGTTACTTGTATTATTAATAATTAGCTTATTATT
AATATTAATTATACCCAATATAGCTAAACAATCGTCTCATTTGCAAAATACTGGATGTGAGGCACAACTTAAAATGATTG
ATAGCCAAATTGAAGCGTATGCTTTAAAGTTTAATAAAAAGCCATCGTCTATAGAAGACTTAGTTACAGAAGGATACATT
AAAGAAAATCAAAAGTCATGTAAATCTGGTGCAGCCATCACTATTAACAATGGTGAAGCTGTTGCAAATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGC Staphylococcus aureus N315

78.723

91.262

0.718

  comGC Staphylococcus aureus MW2

78.723

91.262

0.718

  comYC Streptococcus gordonii str. Challis substr. CH1

42.857

100

0.437

  comYC Streptococcus mutans UA159

48.837

83.495

0.408

  comYC Streptococcus mutans UA140

48.837

83.495

0.408

  comGC/cglC Streptococcus mitis NCTC 12261

39.806

100

0.398

  comGC Bacillus subtilis subsp. subtilis str. 168

41.489

91.262

0.379

  comYC Streptococcus suis isolate S10

44.706

82.524

0.369

  comGC/cglC Streptococcus mitis SK321

45.238

81.553

0.369


Multiple sequence alignment