Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   GAN20_RS01410 Genome accession   NZ_CP045085
Coordinates   298990..299964 (-) Length   324 a.a.
NCBI ID   WP_014613820.1    Uniprot ID   -
Organism   Staphylococcus pseudintermedius strain KCTC 43135     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 293990..304964
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GAN20_RS01375 gcvT 294564..295655 (-) 1092 WP_110160319.1 glycine cleavage system aminomethyltransferase GcvT -
  GAN20_RS01380 - 295813..296340 (-) 528 WP_014613826.1 shikimate kinase -
  GAN20_RS01385 - 296504..296989 (-) 486 WP_081378286.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  GAN20_RS01390 - 296895..297095 (-) 201 WP_020219623.1 hypothetical protein -
  GAN20_RS01395 comGD 297200..297640 (-) 441 WP_014613823.1 competence type IV pilus minor pilin ComGD -
  GAN20_RS01400 comGC 297618..297941 (-) 324 WP_015729165.1 competence type IV pilus major pilin ComGC Machinery gene
  GAN20_RS01405 comGB 297954..299021 (-) 1068 WP_037542939.1 competence type IV pilus assembly protein ComGB -
  GAN20_RS01410 comGA 298990..299964 (-) 975 WP_014613820.1 competence type IV pilus ATPase ComGA Machinery gene
  GAN20_RS01415 - 300125..300754 (-) 630 WP_037542865.1 MBL fold metallo-hydrolase -
  GAN20_RS01420 - 300767..301078 (-) 312 WP_014613818.1 MTH1187 family thiamine-binding protein -
  GAN20_RS01425 - 301079..302065 (-) 987 WP_014613817.1 ROK family glucokinase -
  GAN20_RS01430 - 302062..302265 (-) 204 WP_014613816.1 YqgQ family protein -
  GAN20_RS01435 - 302267..303706 (-) 1440 WP_110161176.1 rhomboid family intramembrane serine protease -
  GAN20_RS01440 - 303725..304264 (-) 540 WP_014613814.1 5-formyltetrahydrofolate cyclo-ligase -
  GAN20_RS01445 rpmG 304416..304565 (-) 150 WP_014613813.1 50S ribosomal protein L33 -

Sequence


Protein


Download         Length: 324 a.a.        Molecular weight: 36846.58 Da        Isoelectric Point: 7.5584

>NTDB_id=391742 GAN20_RS01410 WP_014613820.1 298990..299964(-) (comGA) [Staphylococcus pseudintermedius strain KCTC 43135]
MQLLLDHVIQFALQNEASDIHFIPSQSQVEVKLRVKDQLVLYDTLNLETYQKLLTLLKYQAGLDISTRHKAQSGRYIYEH
KALYYLRVSTLPLNLGTESCVIRITPQYFQNAKAHSVLDIKSNMEKKQGLILFSGPTGAGKSTLMYQMVVYAKQKLNLNI
ITIEDPVEQLVNGITQISVNEKADITYGSSLKAILRCDPDVILIGEIRDSHIARDVIQASLSGHLVLSTIHANDCQGVLL
RLIEMGLSHQDLSQSINLIINQRLITTKNDQRQLVYETMDQQEIHYFLNNNFQLPTAFTSLSHKLQLLNKEGVIDEKILR
KYTI

Nucleotide


Download         Length: 975 bp        

>NTDB_id=391742 GAN20_RS01410 WP_014613820.1 298990..299964(-) (comGA) [Staphylococcus pseudintermedius strain KCTC 43135]
ATGCAACTACTATTAGATCATGTAATACAATTTGCACTTCAAAATGAGGCATCAGACATTCATTTTATCCCATCACAATC
ACAAGTCGAAGTGAAACTCAGAGTGAAAGATCAACTTGTATTATATGACACATTAAATCTAGAAACGTATCAAAAATTGT
TAACTTTGTTGAAATACCAAGCTGGATTAGATATTTCGACACGCCACAAAGCGCAAAGTGGGCGCTATATTTATGAACAT
AAAGCGTTATATTATTTAAGAGTGTCCACGTTACCTTTAAACTTAGGTACTGAAAGTTGTGTTATTCGTATTACACCCCA
ATATTTTCAAAATGCGAAAGCTCATAGTGTATTAGACATCAAATCTAATATGGAGAAAAAGCAAGGTTTAATTCTGTTTA
GCGGGCCTACAGGTGCAGGAAAGAGCACGTTAATGTATCAAATGGTCGTCTATGCTAAACAAAAGTTAAACCTGAATATT
ATCACAATAGAAGACCCGGTCGAGCAGCTTGTGAACGGCATTACGCAAATTTCCGTTAACGAAAAAGCTGATATTACGTA
TGGATCGTCGTTAAAAGCAATTCTTCGTTGTGATCCAGATGTCATCTTAATTGGTGAGATTCGTGACAGTCATATTGCAA
GGGACGTTATACAAGCGAGTTTAAGTGGGCATTTAGTATTGTCGACGATACACGCCAATGATTGTCAAGGGGTGCTCCTT
AGACTGATTGAAATGGGACTCTCTCACCAAGATTTATCTCAATCTATCAATCTCATCATCAATCAAAGACTCATTACAAC
AAAAAACGACCAGCGACAACTCGTCTATGAAACGATGGATCAACAGGAGATTCATTACTTTTTGAATAATAACTTCCAAC
TACCTACTGCTTTCACAAGCCTTTCTCATAAGCTTCAACTTCTGAATAAAGAGGGGGTTATAGATGAAAAGATTTTACGA
AAGTATACGATTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Staphylococcus aureus MW2

58.514

99.691

0.583

  comGA Staphylococcus aureus N315

58.514

99.691

0.583


Multiple sequence alignment