Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   FGL66_RS04820 Genome accession   NZ_CP040781
Coordinates   1015115..1016089 (+) Length   324 a.a.
NCBI ID   WP_180808697.1    Uniprot ID   A0A7H9DKT4
Organism   Staphylococcus sp. 17KM0847     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1010115..1021089
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FGL66_RS04785 (FGL66_04760) rpmG 1010352..1010501 (+) 150 WP_180808690.1 50S ribosomal protein L33 -
  FGL66_RS04790 (FGL66_04765) - 1010896..1011435 (+) 540 WP_180808691.1 5-formyltetrahydrofolate cyclo-ligase -
  FGL66_RS04795 (FGL66_04770) - 1011456..1012895 (+) 1440 WP_180808692.1 rhomboid family intramembrane serine protease -
  FGL66_RS04800 (FGL66_04775) - 1012898..1013101 (+) 204 WP_180808693.1 YqgQ family protein -
  FGL66_RS04805 (FGL66_04780) - 1013098..1014078 (+) 981 WP_180808694.1 ROK family glucokinase -
  FGL66_RS04810 (FGL66_04785) - 1014086..1014397 (+) 312 WP_180808695.1 MTH1187 family thiamine-binding protein -
  FGL66_RS04815 (FGL66_04790) - 1014426..1015049 (+) 624 WP_180808696.1 MBL fold metallo-hydrolase -
  FGL66_RS04820 (FGL66_04795) comGA 1015115..1016089 (+) 975 WP_180808697.1 competence type IV pilus ATPase ComGA Machinery gene
  FGL66_RS04825 (FGL66_04800) comGB 1016058..1017125 (+) 1068 WP_180808698.1 competence type IV pilus assembly protein ComGB -
  FGL66_RS04830 (FGL66_04805) comGC 1017137..1017457 (+) 321 WP_180808699.1 competence type IV pilus major pilin ComGC Machinery gene
  FGL66_RS04835 (FGL66_04810) comGD 1017438..1017881 (+) 444 WP_258007317.1 competence type IV pilus minor pilin ComGD -
  FGL66_RS04840 (FGL66_04815) - 1018058..1018561 (+) 504 WP_180808701.1 competence type IV pilus minor pilin ComGF -
  FGL66_RS04845 (FGL66_04820) - 1018733..1019257 (+) 525 WP_374757653.1 shikimate kinase -
  FGL66_RS04850 (FGL66_04825) gcvT 1019422..1020513 (+) 1092 WP_180808702.1 glycine cleavage system aminomethyltransferase GcvT -

Sequence


Protein


Download         Length: 324 a.a.        Molecular weight: 37065.85 Da        Isoelectric Point: 7.8201

>NTDB_id=367093 FGL66_RS04820 WP_180808697.1 1015115..1016089(+) (comGA) [Staphylococcus sp. 17KM0847]
MQLLLDTVVQFALQKQASDIHFIPSQSQVEVKLRIKDELSLYDTLNLETYQRLLTLLKFQAGLDISSRHKAQSGRYIYHY
ENLYYLRVSTLPLNLGTESCVIRITPQYFQSETSDHAHDMGYIMEKKQGLILFSGPTGSGKSTLMYQMVLYAKQKLNLNV
ITIEDPVERILNGVTQISVNEKADITYNASLKAILRCDPDVILIGEIRDRTIAKEVLQASLSGHLVLSTIHANDCAGVLL
RLIEMGISKQDLIQSLNLIINQRLITNTHRQRQLVYETMTQPQIEHFLCNQFQLPENFTSLDDKLKKLEQEGVIHATTLR
KYKK

Nucleotide


Download         Length: 975 bp        

>NTDB_id=367093 FGL66_RS04820 WP_180808697.1 1015115..1016089(+) (comGA) [Staphylococcus sp. 17KM0847]
ATGCAATTACTGTTAGATACTGTCGTACAGTTTGCGTTACAGAAACAAGCATCTGACATTCATTTTATTCCGTCTCAGTC
ACAAGTAGAAGTCAAATTACGCATTAAAGACGAATTATCTTTATATGACACATTAAACCTAGAAACCTACCAACGCTTAT
TAACATTACTTAAATTTCAAGCAGGACTGGATATCTCATCTCGTCACAAAGCACAAAGTGGGCGTTACATTTATCATTAT
GAAAATTTATACTATTTACGCGTTTCGACATTACCTTTAAATCTTGGCACTGAGAGTTGTGTTATTCGCATTACACCACA
ATATTTTCAATCGGAGACTTCAGATCATGCACATGATATGGGGTATATTATGGAGAAAAAACAGGGACTTATACTCTTTA
GTGGTCCAACAGGCTCAGGTAAAAGCACACTTATGTATCAAATGGTACTTTATGCGAAACAAAAACTTAATTTAAACGTC
ATTACCATTGAAGATCCTGTGGAAAGAATATTAAATGGCGTGACGCAAATATCAGTCAATGAAAAAGCAGATATCACTTA
CAATGCATCACTCAAAGCAATATTACGTTGTGATCCAGATGTAATTTTAATTGGAGAAATCCGCGATCGTACTATTGCTA
AAGAAGTATTACAAGCGAGTTTAAGCGGGCATCTCGTGTTATCTACAATACATGCCAATGACTGCGCAGGGGTATTACTT
CGCTTGATTGAGATGGGCATATCGAAGCAAGACTTAATTCAATCATTAAATCTGATTATCAATCAAAGACTTATCACGAA
TACACATCGTCAACGTCAATTAGTATATGAAACGATGACACAACCACAAATTGAACATTTTTTATGCAATCAATTTCAAC
TACCTGAAAATTTTACAAGTTTAGATGATAAACTCAAAAAGTTAGAACAAGAAGGTGTAATTCATGCAACGACACTTCGC
AAATATAAAAAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7H9DKT4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Staphylococcus aureus MW2

58.514

99.691

0.583

  comGA Staphylococcus aureus N315

58.514

99.691

0.583


Multiple sequence alignment