Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGC   Type   Machinery gene
Locus tag   SE1039_RS06850 Genome accession   NZ_CP013114
Coordinates   1425673..1425990 (-) Length   105 a.a.
NCBI ID   WP_056936206.1    Uniprot ID   -
Organism   Staphylococcus equorum strain KS1039     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1420673..1430990
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SE1039_RS06815 (SE1039_13680) gcvPA 1421229..1422581 (-) 1353 WP_056935676.1 aminomethyl-transferring glycine dehydrogenase subunit GcvPA -
  SE1039_RS06820 (SE1039_13690) gcvT 1422599..1423690 (-) 1092 WP_021339246.1 glycine cleavage system aminomethyltransferase GcvT -
  SE1039_RS06825 (SE1039_13700) - 1423868..1424341 (-) 474 WP_230456625.1 shikimate kinase -
  SE1039_RS06835 (SE1039_13720) comGF 1424548..1424991 (-) 444 WP_002507508.1 competence type IV pilus minor pilin ComGF -
  SE1039_RS06840 (SE1039_13730) - 1424963..1425262 (-) 300 WP_056935677.1 hypothetical protein -
  SE1039_RS14100 (SE1039_13740) comGD 1425249..1425695 (-) 447 WP_056935678.1 competence type IV pilus minor pilin ComGD -
  SE1039_RS06850 (SE1039_13750) comGC 1425673..1425990 (-) 318 WP_056936206.1 competence type IV pilus major pilin ComGC Machinery gene
  SE1039_RS06855 (SE1039_13760) comGB 1426015..1427082 (-) 1068 WP_056935679.1 competence type IV pilus assembly protein ComGB -
  SE1039_RS06860 (SE1039_13770) comGA 1427054..1428028 (-) 975 WP_056935680.1 competence type IV pilus ATPase ComGA Machinery gene
  SE1039_RS06865 (SE1039_13780) - 1428128..1428751 (-) 624 WP_002507502.1 MBL fold metallo-hydrolase -
  SE1039_RS06870 (SE1039_13790) - 1428753..1429076 (-) 324 WP_002507501.1 MTH1187 family thiamine-binding protein -
  SE1039_RS06875 (SE1039_13800) - 1429076..1430062 (-) 987 WP_002507500.1 glucokinase -
  SE1039_RS06880 - 1430653..1430853 (-) 201 WP_002512453.1 YqgQ family protein -

Sequence


Protein


Download         Length: 105 a.a.        Molecular weight: 11683.74 Da        Isoelectric Point: 8.4835

>NTDB_id=160006 SE1039_RS06850 WP_056936206.1 1425673..1425990(-) (comGC) [Staphylococcus equorum strain KS1039]
MKNLKKLFNKEAFTLIEMLLVLLIISLLLILIIPNIAKQSSHIQTAGCEAQIKMIDSQIEAYSLKYNKKPTSIDELVSEG
YINESQKRCKTGSTISINNGEAVAN

Nucleotide


Download         Length: 318 bp        

>NTDB_id=160006 SE1039_RS06850 WP_056936206.1 1425673..1425990(-) (comGC) [Staphylococcus equorum strain KS1039]
TTGAAAAACTTAAAAAAACTATTTAATAAAGAAGCATTTACACTTATAGAAATGTTGCTTGTTTTATTAATCATAAGTTT
ATTGCTTATTTTAATCATACCTAACATTGCAAAACAATCATCACATATACAAACTGCTGGTTGTGAAGCTCAAATAAAAA
TGATAGACAGTCAAATTGAAGCTTATTCATTAAAATATAATAAAAAGCCAACATCAATCGATGAACTCGTCTCAGAAGGA
TATATTAACGAAAGTCAAAAGCGATGTAAAACTGGTTCAACAATTTCAATAAATAATGGTGAAGCAGTTGCTAACTAA

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGC Staphylococcus aureus N315

68.627

97.143

0.667

  comGC Staphylococcus aureus MW2

68.627

97.143

0.667

  comYC Streptococcus gordonii str. Challis substr. CH1

44.444

100

0.457

  comGC/cglC Streptococcus mitis NCTC 12261

41.053

90.476

0.371

  comGC Bacillus subtilis subsp. subtilis str. 168

41.489

89.524

0.371

  comYC Streptococcus mutans UA140

44.186

81.905

0.362

  comYC Streptococcus mutans UA159

44.186

81.905

0.362


Multiple sequence alignment