Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   SAP2_RS06580 Genome accession   NZ_AP019698
Coordinates   1367066..1368040 (+) Length   324 a.a.
NCBI ID   WP_142019638.1    Uniprot ID   -
Organism   Staphylococcus arlettae strain P2     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1362066..1373040
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SAP2_RS06545 (SAP2_12980) rpmG 1362157..1362306 (+) 150 WP_002509766.1 50S ribosomal protein L33 -
  SAP2_RS06550 (SAP2_12990) - 1362431..1362970 (+) 540 WP_021459927.1 5-formyltetrahydrofolate cyclo-ligase -
  SAP2_RS06555 (SAP2_13000) - 1362982..1364424 (+) 1443 WP_142019636.1 rhomboid family intramembrane serine protease -
  SAP2_RS06560 (SAP2_13010) - 1364426..1364620 (+) 195 WP_142019637.1 YqgQ family protein -
  SAP2_RS06565 (SAP2_13020) - 1365085..1366071 (+) 987 WP_002509762.1 glucokinase -
  SAP2_RS06570 (SAP2_13030) - 1366071..1366409 (+) 339 WP_002509761.1 MTH1187 family thiamine-binding protein -
  SAP2_RS06575 (SAP2_13040) - 1366396..1367013 (+) 618 WP_002509760.1 MBL fold metallo-hydrolase -
  SAP2_RS06580 (SAP2_13050) comGA 1367066..1368040 (+) 975 WP_142019638.1 competence type IV pilus ATPase ComGA Machinery gene
  SAP2_RS06585 (SAP2_13060) comGB 1368012..1369064 (+) 1053 WP_002509758.1 competence type IV pilus assembly protein ComGB -
  SAP2_RS06590 (SAP2_13070) comGC 1369096..1369404 (+) 309 WP_103387866.1 competence type IV pilus major pilin ComGC Machinery gene
  SAP2_RS06595 (SAP2_13080) - 1369427..1369831 (+) 405 WP_232190723.1 competence protein -
  SAP2_RS06600 (SAP2_13090) - 1369812..1370111 (+) 300 WP_002509755.1 hypothetical protein -
  SAP2_RS06605 (SAP2_13100) - 1370086..1370514 (+) 429 WP_142019639.1 competence type IV pilus minor pilin ComGF -
  SAP2_RS06610 (SAP2_13120) - 1370682..1371194 (+) 513 WP_002509752.1 shikimate kinase -
  SAP2_RS06615 (SAP2_13130) gcvT 1371362..1372453 (+) 1092 WP_021459931.1 glycine cleavage system aminomethyltransferase GcvT -

Sequence


Protein


Download         Length: 324 a.a.        Molecular weight: 37369.35 Da        Isoelectric Point: 7.7750

>NTDB_id=73315 SAP2_RS06580 WP_142019638.1 1367066..1368040(+) (comGA) [Staphylococcus arlettae strain P2]
MKLLFEDIMTKAIKQKTSDIHFIPVENEVVIKLRVNESLIDFQKMTKHTYAKLLTYMKYQACLDVSTHHIAQSGRYIYNY
KQLYYLRISTLPLSLGIESCVIRIIPQYFYQSTEEMQLDYLYHLMTKKQGLVLFTGPTGSGKSTSMYQMSLYAKEQLNLN
VITIEDPVERIIDGITQISVNTKAGIDYHNSFKAILRCDPDVILIGEIRNAEIAKQVIHASLSGHLVLTTLHANSCEGAL
LRLLEMGLTNQELVQTIINITNQRLITSNDKHRYLVAESLSYEDIHYFFNNNYRLPEDFQTLPNILQRLSQKGVICEKVV
QKYI

Nucleotide


Download         Length: 975 bp        

>NTDB_id=73315 SAP2_RS06580 WP_142019638.1 1367066..1368040(+) (comGA) [Staphylococcus arlettae strain P2]
TTGAAATTGTTATTTGAAGATATTATGACCAAGGCCATCAAACAGAAAACGTCTGATATTCATTTTATACCAGTCGAAAA
TGAAGTAGTTATTAAATTGCGGGTGAATGAAAGTTTAATAGATTTTCAAAAAATGACTAAGCATACCTATGCTAAACTTT
TAACTTATATGAAATATCAAGCATGCCTAGATGTTTCTACACACCACATTGCACAAAGTGGCCGTTATATTTATAACTAT
AAGCAGTTATATTACTTACGTATTTCCACGTTGCCGTTATCATTAGGAATTGAAAGCTGTGTTATTAGAATAATTCCACA
ATACTTCTACCAATCAACTGAAGAAATGCAATTGGATTATTTATATCATTTAATGACGAAAAAACAAGGCTTGGTGTTAT
TCACAGGTCCAACAGGCTCGGGGAAAAGTACATCAATGTATCAAATGAGTTTGTATGCAAAAGAACAATTAAATTTAAAT
GTGATTACTATTGAAGACCCAGTTGAAAGAATTATAGATGGTATTACGCAAATTTCGGTAAACACTAAAGCTGGCATTGA
CTATCACAATTCTTTTAAAGCTATTTTAAGATGCGATCCAGACGTAATCTTAATTGGTGAAATAAGAAATGCTGAAATAG
CTAAACAAGTCATACACGCGAGTTTAAGTGGACATCTTGTTTTGACAACATTACATGCCAATAGTTGTGAAGGGGCATTA
CTTCGTTTATTAGAAATGGGGCTCACAAATCAGGAATTAGTACAAACTATTATTAATATTACTAACCAACGACTCATTAC
TTCTAATGACAAGCATCGATATTTAGTAGCAGAAAGCTTATCATATGAAGATATACATTACTTCTTTAACAATAATTATA
GACTACCAGAAGATTTCCAAACTTTGCCGAATATATTACAACGATTATCGCAAAAGGGGGTTATTTGTGAAAAAGTGGTA
CAAAAATACATTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Staphylococcus aureus MW2

61.111

100

0.611

  comGA Staphylococcus aureus N315

61.111

100

0.611


Multiple sequence alignment