Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   EL116_RS07965 Genome accession   NZ_LR134304
Coordinates   1625132..1626106 (-) Length   324 a.a.
NCBI ID   WP_047425712.1    Uniprot ID   A0A2K4AGH2
Organism   Staphylococcus schweitzeri strain NCTC13712     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1620132..1631106
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EL116_RS07925 (NCTC13712_01535) gcvT 1620633..1621724 (-) 1092 WP_047532011.1 glycine cleavage system aminomethyltransferase GcvT -
  EL116_RS07930 - 1621881..1622405 (-) 525 WP_047549198.1 shikimate kinase -
  EL116_RS07935 - 1622392..1622541 (-) 150 WP_078101697.1 hypothetical protein -
  EL116_RS07940 (NCTC13712_01537) comGF 1622640..1623095 (-) 456 WP_078101690.1 competence type IV pilus minor pilin ComGF Machinery gene
  EL116_RS07945 (NCTC13712_01538) comGE 1623055..1623354 (-) 300 WP_047532015.1 hypothetical protein Machinery gene
  EL116_RS07950 (NCTC13712_01539) comGD 1623341..1623787 (-) 447 WP_078101689.1 competence type IV pilus minor pilin ComGD Machinery gene
  EL116_RS07955 (NCTC13712_01540) comGC 1623765..1624076 (-) 312 WP_047532019.1 competence type IV pilus major pilin ComGC Machinery gene
  EL116_RS07960 (NCTC13712_01541) comGB 1624090..1625160 (-) 1071 WP_047532021.1 competence type IV pilus assembly protein ComGB Machinery gene
  EL116_RS07965 (NCTC13712_01542) comGA 1625132..1626106 (-) 975 WP_047425712.1 competence type IV pilus ATPase ComGA Machinery gene
  EL116_RS07970 (NCTC13712_01543) - 1626158..1626781 (-) 624 WP_047549195.1 MBL fold metallo-hydrolase -
  EL116_RS07975 (NCTC13712_01544) - 1626778..1627107 (-) 330 WP_047532027.1 MTH1187 family thiamine-binding protein -
  EL116_RS07980 (NCTC13712_01545) - 1627107..1628093 (-) 987 WP_047532029.1 ROK family glucokinase -
  EL116_RS07985 - 1628093..1628293 (-) 201 WP_047425719.1 YqgQ family protein -
  EL116_RS07990 (NCTC13712_01546) - 1628274..1629737 (-) 1464 WP_047532032.1 rhomboid family protein -
  EL116_RS07995 (NCTC13712_01547) - 1629749..1630288 (-) 540 WP_047549191.1 5-formyltetrahydrofolate cyclo-ligase -
  EL116_RS08000 (NCTC13712_01548) rpmG 1630473..1630622 (-) 150 WP_001265709.1 50S ribosomal protein L33 -

Sequence


Protein


Download         Length: 324 a.a.        Molecular weight: 36881.01 Da        Isoelectric Point: 8.8423

>NTDB_id=1120781 EL116_RS07965 WP_047425712.1 1625132..1626106(-) (comGA) [Staphylococcus schweitzeri strain NCTC13712]
MKILFQEIINKAIEMKASDVHFIPVHNEVSIKFRINDNLEQYEVIGNSIYQKLLVYMKFQAGLDVSTQQVAQSGRYSYHF
NKIYFLRISTLPLSLGQESCVIRIVPQYFQQTKSTYKFNDFKHLMNKKQGLLLFSGPTGSGKSTLMYQMVSYANKALNLN
VISIEDPVEMQIPGIVQINLNEKAGINYVNSFKAILRCDPDVILIGEIRDKEVAKCVIQASLSGHLVLTTLHATDCKGAI
LRLLEMGISVQELIQATNLIINQRLVTTIKQQRQLVCEILSQQQLQYFFSHNHSLPTSFKKLETKLDDMTKAGVICESTM
DKYI

Nucleotide


Download         Length: 975 bp        

>NTDB_id=1120781 EL116_RS07965 WP_047425712.1 1625132..1626106(-) (comGA) [Staphylococcus schweitzeri strain NCTC13712]
ATGAAGATTTTATTTCAAGAAATAATCAATAAAGCGATAGAAATGAAAGCGAGTGATGTACATTTTATTCCTGTTCATAA
TGAAGTGAGTATTAAATTTCGTATAAATGATAACTTGGAACAGTATGAGGTAATTGGAAATAGTATTTATCAAAAGTTAT
TAGTATATATGAAATTTCAAGCAGGTCTCGATGTTTCTACGCAACAAGTTGCACAGAGTGGTCGATATAGCTATCACTTT
AATAAAATTTATTTTTTAAGAATTTCAACATTGCCATTATCACTTGGCCAAGAAAGTTGTGTGATAAGAATAGTTCCTCA
ATATTTCCAACAAACTAAATCAACGTATAAATTTAATGATTTTAAACATTTAATGAATAAGAAACAAGGCTTATTGTTAT
TTAGTGGTCCGACTGGTTCGGGTAAAAGTACCTTGATGTATCAAATGGTCTCATACGCTAACAAAGCGTTAAATTTAAAT
GTTATATCAATTGAAGACCCTGTAGAAATGCAAATTCCTGGAATAGTTCAAATAAACCTAAATGAAAAAGCTGGGATTAA
CTATGTAAATTCATTTAAAGCGATATTAAGGTGCGATCCAGATGTTATTCTTATAGGTGAAATTAGAGATAAGGAGGTAG
CTAAATGTGTTATCCAAGCGAGTTTAAGTGGTCATCTAGTACTAACTACCTTACATGCAACAGATTGCAAAGGTGCAATT
TTAAGATTACTAGAAATGGGCATTTCAGTACAAGAACTAATACAAGCAACTAATTTAATTATAAATCAAAGACTTGTTAC
TACAATTAAACAGCAACGACAATTAGTGTGTGAGATTTTATCACAACAACAACTTCAGTATTTCTTCTCGCATAATCATT
CATTACCTACCTCATTTAAAAAACTAGAAACAAAGCTTGATGATATGACAAAAGCAGGTGTCATTTGTGAAAGTACAATG
GATAAGTATATTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A2K4AGH2

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Staphylococcus aureus MW2

96.296

100

0.963

  comGA Staphylococcus aureus N315

96.296

100

0.963

  comYA Streptococcus mutans UA159

36.677

98.457

0.361

  comYA Streptococcus mutans UA140

36.677

98.457

0.361


Multiple sequence alignment