Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   EGX82_RS04900 Genome accession   NZ_CP033822
Coordinates   933182..934156 (-) Length   324 a.a.
NCBI ID   WP_001008570.1    Uniprot ID   A0AAW6XQQ3
Organism   Streptococcus agalactiae strain FDAARGOS_512     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 888765..934445 933182..934156 within 0


Gene organization within MGE regions


Location: 888765..934445
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EGX82_RS04640 (EGX82_04640) - 888886..889176 (-) 291 WP_000158581.1 DUF5962 family protein -
  EGX82_RS04645 (EGX82_04645) - 889187..890041 (-) 855 WP_000005759.1 phage replisome organizer N-terminal domain-containing protein -
  EGX82_RS04650 (EGX82_04650) - 890053..890337 (-) 285 WP_001287945.1 hypothetical protein -
  EGX82_RS04655 (EGX82_04655) - 890484..891002 (+) 519 WP_000181342.1 helix-turn-helix transcriptional regulator -
  EGX82_RS04660 (EGX82_04660) - 891057..892211 (+) 1155 WP_000110711.1 site-specific integrase -
  EGX82_RS04665 (EGX82_04665) rpsI 892357..892749 (-) 393 WP_000035940.1 30S ribosomal protein S9 -
  EGX82_RS04670 (EGX82_04670) rplM 892770..893216 (-) 447 WP_001867156.1 50S ribosomal protein L13 -
  EGX82_RS04675 (EGX82_04675) - 893517..894683 (+) 1167 WP_000160598.1 IS30-like element ISSag9 family transposase -
  EGX82_RS04680 (EGX82_04680) - 895001..895120 (-) 120 Protein_868 helix-turn-helix domain-containing protein -
  EGX82_RS04690 (EGX82_04690) - 895658..896518 (-) 861 WP_000143135.1 DegV family protein -
  EGX82_RS04695 (EGX82_04695) - 896611..897129 (-) 519 WP_000716636.1 NYN domain-containing protein -
  EGX82_RS04700 (EGX82_04700) rlmB 897126..897881 (-) 756 WP_000178023.1 23S rRNA (guanosine(2251)-2'-O)-methyltransferase RlmB -
  EGX82_RS04705 (EGX82_04705) - 897984..898370 (-) 387 WP_000568029.1 Mini-ribonuclease 3 -
  EGX82_RS04710 (EGX82_04710) cysS 898363..899706 (-) 1344 WP_000591129.1 cysteine--tRNA ligase -
  EGX82_RS04715 (EGX82_04715) - 899703..899885 (-) 183 WP_000656477.1 hypothetical protein -
  EGX82_RS04720 (EGX82_04720) cysE 899895..900479 (-) 585 WP_000539954.1 serine O-acetyltransferase -
  EGX82_RS04725 (EGX82_04725) - 900488..901240 (-) 753 WP_000204780.1 SseB family protein -
  EGX82_RS04730 (EGX82_04730) pnp 901242..903371 (-) 2130 WP_000043857.1 polyribonucleotide nucleotidyltransferase -
  EGX82_RS04735 (EGX82_04735) rpsO 903752..904021 (-) 270 WP_001018249.1 30S ribosomal protein S15 -
  EGX82_RS04740 (EGX82_04740) - 904109..905368 (-) 1260 WP_001203074.1 ferric reductase-like transmembrane domain-containing protein -
  EGX82_RS04745 (EGX82_04745) - 905477..906406 (-) 930 WP_001203828.1 transketolase C-terminal domain-containing protein -
  EGX82_RS04750 (EGX82_04750) - 906403..907260 (-) 858 WP_000203492.1 transketolase -
  EGX82_RS04755 (EGX82_04755) - 907263..908618 (-) 1356 WP_000677351.1 PTS ascorbate transporter subunit IIC -
  EGX82_RS04760 (EGX82_04760) - 908631..908915 (-) 285 WP_000944235.1 PTS sugar transporter subunit IIB -
  EGX82_RS04765 (EGX82_04765) - 908918..910954 (-) 2037 WP_000228178.1 BglG family transcription antiterminator -
  EGX82_RS04770 (EGX82_04770) treC 911174..912799 (-) 1626 WP_000151014.1 alpha,alpha-phosphotrehalase -
  EGX82_RS04775 (EGX82_04775) treP 913021..915051 (-) 2031 WP_000434610.1 PTS system trehalose-specific EIIBC component -
  EGX82_RS04780 (EGX82_04780) - 915333..915959 (-) 627 WP_000171304.1 ABC transporter ATP-binding protein -
  EGX82_RS04785 (EGX82_04785) - 915943..916746 (-) 804 WP_000140979.1 ABC transporter ATP-binding protein -
  EGX82_RS04790 (EGX82_04790) - 916758..917579 (-) 822 WP_000603397.1 ABC transporter permease -
  EGX82_RS04795 (EGX82_04795) - 917576..918553 (-) 978 WP_000680644.1 ABC transporter permease -
  EGX82_RS04800 (EGX82_04800) - 918666..920294 (-) 1629 WP_000170504.1 ABC transporter substrate-binding protein -
  EGX82_RS04810 (EGX82_04810) lrgB 920537..921265 (-) 729 WP_000421727.1 antiholin-like protein LrgB -
  EGX82_RS04815 (EGX82_04815) - 921267..921722 (-) 456 WP_000683316.1 CidA/LrgA family protein -
  EGX82_RS04820 (EGX82_04820) - 921892..922632 (-) 741 WP_000697630.1 LytTR family transcriptional regulator DNA-binding domain-containing protein -
  EGX82_RS04825 (EGX82_04825) - 922613..924358 (-) 1746 WP_000930334.1 LytS/YhcK type 5TM receptor domain-containing protein -
  EGX82_RS04830 (EGX82_04830) - 924385..925029 (-) 645 WP_000416612.1 HAD family hydrolase -
  EGX82_RS04835 (EGX82_04835) ssbA 925152..925547 (-) 396 WP_000282450.1 single-stranded DNA-binding protein Machinery gene
  EGX82_RS04840 (EGX82_04840) - 925628..926344 (+) 717 WP_000186183.1 class I SAM-dependent methyltransferase -
  EGX82_RS04845 (EGX82_04845) ytpR 926398..927024 (-) 627 WP_000578331.1 YtpR family tRNA-binding protein -
  EGX82_RS04850 (EGX82_04850) - 927057..927380 (-) 324 WP_000601792.1 thioredoxin family protein -
  EGX82_RS04855 (EGX82_04855) - 927377..927661 (-) 285 WP_000791272.1 DUF4651 domain-containing protein -
  EGX82_RS04860 (EGX82_04860) - 927822..928061 (+) 240 WP_000660181.1 hypothetical protein -
  EGX82_RS04865 (EGX82_04865) pepA 928246..929313 (+) 1068 WP_001281321.1 glutamyl aminopeptidase -
  EGX82_RS04870 (EGX82_04870) proC 929383..930153 (+) 771 WP_001867096.1 pyrroline-5-carboxylate reductase -
  EGX82_RS04875 (EGX82_04875) - 930174..930839 (-) 666 WP_000008111.1 type II CAAX endopeptidase family protein -
  EGX82_RS04880 (EGX82_04880) - 930908..931363 (-) 456 WP_000905674.1 hypothetical protein -
  EGX82_RS04885 (EGX82_04885) - 931404..931541 (-) 138 WP_001867090.1 hypothetical protein -
  EGX82_RS04890 (EGX82_04890) - 931600..931806 (-) 207 WP_000798242.1 helix-turn-helix transcriptional regulator -
  EGX82_RS04895 (EGX82_04895) - 931957..933150 (-) 1194 WP_000047535.1 acetate kinase -
  EGX82_RS04900 (EGX82_04900) comYH 933182..934156 (-) 975 WP_001008570.1 class I SAM-dependent methyltransferase Machinery gene

Sequence


Protein


Download         Length: 324 a.a.        Molecular weight: 37100.99 Da        Isoelectric Point: 4.4041

>NTDB_id=326118 EGX82_RS04900 WP_001008570.1 933182..934156(-) (comYH) [Streptococcus agalactiae strain FDAARGOS_512]
MNFEKIETAYELILENIQTIENQLKTHIYDALIEQNSYYLGSSCDLDIVVVNNQKLRQLDLSQEEWRRTFQFIFIKSAQT
EQLQANHQFTPDSIGFILLFLLEELTSQETVDVLEIGSGTGNLAQTLLNNSSKELNYMGIEVDDLLIDLSASIAEIIGSS
AQFIQEDAVRPQILKESDVIISDLPIGYYPNDDIAKRYAVSSSKEHTYAHHLLMEQSLKYLKKDGIAIFLAPENLLTSPQ
SDLLKEWLKGYADVIAVLTLPETIFGSRQNAKSIFVLKKQAEQKPETFVYPLTDLQNRENMANFIENFQKWSRENSHYSK
NMIE

Nucleotide


Download         Length: 975 bp        

>NTDB_id=326118 EGX82_RS04900 WP_001008570.1 933182..934156(-) (comYH) [Streptococcus agalactiae strain FDAARGOS_512]
ATGAATTTTGAAAAAATTGAGACAGCCTATGAGCTGATTTTAGAAAATATCCAAACGATTGAGAACCAATTAAAAACTCA
TATTTATGATGCCTTAATTGAACAGAACTCTTATTACCTTGGTTCAAGTTGTGATTTAGATATTGTTGTGGTGAATAACC
AAAAATTACGTCAACTTGACTTAAGTCAAGAAGAATGGCGTCGCACTTTCCAGTTCATTTTTATCAAATCTGCGCAAACA
GAGCAATTACAAGCTAATCATCAGTTTACGCCAGATAGTATTGGTTTTATCTTGTTATTTCTTTTGGAAGAATTAACGAG
TCAAGAGACAGTGGATGTCTTGGAAATTGGAAGTGGAACTGGGAATTTAGCTCAGACTCTCCTCAATAACAGCTCGAAAG
AGTTAAATTATATGGGCATTGAAGTTGATGATCTTTTGATTGATCTATCAGCAAGCATTGCTGAAATTATAGGTTCTAGT
GCCCAATTTATCCAAGAGGATGCCGTTAGACCACAAATTTTGAAAGAAAGCGATGTAATCATTAGTGATTTACCAATTGG
CTATTATCCTAATGATGATATTGCTAAACGATATGCTGTATCAAGTTCTAAAGAGCACACCTATGCTCACCATCTATTGA
TGGAGCAATCTCTTAAATATTTGAAAAAAGATGGAATCGCTATATTTTTAGCACCCGAAAACCTTTTAACAAGTCCACAA
AGTGATTTGCTGAAGGAGTGGTTAAAAGGATATGCAGATGTCATTGCCGTTTTAACTCTACCAGAAACTATTTTTGGAAG
TCGTCAAAATGCGAAATCTATATTTGTTCTCAAGAAGCAAGCAGAACAAAAACCAGAAACCTTTGTATATCCGCTGACAG
ATTTGCAAAATCGTGAGAATATGGCAAACTTCATTGAAAATTTTCAAAAATGGAGCAGAGAAAATAGTCATTACTCAAAA
AATATGATAGAATAG

Domains


Predicted by InterproScan.

(70-303)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA159

66.984

97.222

0.651

  comYH Streptococcus mutans UA140

66.984

97.222

0.651


Multiple sequence alignment