Detailed information    

in silico (bioinformatically predicted)

Overview


Name   comYH
Type   Machinery gene
Locus tag   SE864_RS01115
Genome accession   NZ_CP138369
Coordinates   190588..191562 (+)
Length   324 a.a.
NCBI ID   WP_001008570.1
Uniprot ID   A0AAW6XQQ3
Organism   Streptococcus agalactiae strain SagR31
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
ICE        190299..235979    190588..191562     within              0
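
The relative position and distance in the summary can be reproduced from the two coordinate pairs. The following is a minimal Python sketch, assuming closed 1-based intervals and that a gene lying entirely inside the MGE is reported as "within" with distance 0. The function name and every label other than "within" are hypothetical and are not taken from VRprofile2.

```python
def relative_position(gene, mge):
    """Classify a gene interval relative to an MGE interval.

    Both arguments are (start, end) tuples with 1-based inclusive
    coordinates. Returns (label, distance_bp). Only the "within" label
    appears in the record; the other labels are illustrative assumptions.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0                      # gene fully inside the MGE
    if g_end < m_start:
        return "upstream", m_start - g_end      # gene ends before the MGE starts
    if g_start > m_end:
        return "downstream", g_start - m_end    # gene starts after the MGE ends
    return "overlap", 0                         # partial overlap with an MGE boundary


# comYH (190588..191562) against the ICE region (190299..235979)
label, distance = relative_position((190588, 191562), (190299, 235979))
print(label, distance)
```

With the coordinates from the table, the gene starts 289 bp after the ICE start and ends well before the ICE end, so it falls inside the region.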


Gene organization within MGE regions


Location: 190299..235979
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SE864_RS01115 (SE864_01115) comYH 190588..191562 (+) 975 WP_001008570.1 class I SAM-dependent methyltransferase Machinery gene
  SE864_RS01120 (SE864_01120) - 191594..192787 (+) 1194 WP_000047535.1 acetate kinase -
  SE864_RS01125 (SE864_01125) - 192938..193144 (+) 207 WP_000798242.1 helix-turn-helix transcriptional regulator -
  SE864_RS01130 (SE864_01130) - 193203..193340 (+) 138 WP_001867090.1 hypothetical protein -
  SE864_RS01135 (SE864_01135) - 193381..193836 (+) 456 WP_000905674.1 hypothetical protein -
  SE864_RS01140 (SE864_01140) - 193905..194570 (+) 666 WP_000008111.1 CPBP family intramembrane glutamic endopeptidase -
  SE864_RS01145 (SE864_01145) proC 194591..195361 (-) 771 WP_001867096.1 pyrroline-5-carboxylate reductase -
  SE864_RS01150 (SE864_01150) pepA 195431..196498 (-) 1068 WP_001281321.1 glutamyl aminopeptidase -
  SE864_RS01155 (SE864_01155) - 196683..196922 (-) 240 WP_000660181.1 hypothetical protein -
  SE864_RS01160 (SE864_01160) - 197083..197367 (+) 285 WP_000791272.1 DUF4651 domain-containing protein -
  SE864_RS01165 (SE864_01165) - 197364..197687 (+) 324 WP_000601792.1 thioredoxin family protein -
  SE864_RS01170 (SE864_01170) ytpR 197720..198346 (+) 627 WP_000578331.1 YtpR family tRNA-binding protein -
  SE864_RS01175 (SE864_01175) - 198400..199116 (-) 717 WP_000186183.1 class I SAM-dependent methyltransferase -
  SE864_RS01180 (SE864_01180) ssbA 199197..199592 (+) 396 WP_000282450.1 single-stranded DNA-binding protein Machinery gene
  SE864_RS01185 (SE864_01185) - 199715..200359 (+) 645 WP_000416612.1 HAD family phosphatase -
  SE864_RS01190 (SE864_01190) - 200386..202131 (+) 1746 WP_047198532.1 LytS/YhcK type 5TM receptor domain-containing protein -
  SE864_RS01195 (SE864_01195) - 202112..202852 (+) 741 WP_000697630.1 LytTR family DNA-binding domain-containing protein -
  SE864_RS01200 (SE864_01200) - 203022..203477 (+) 456 WP_000683316.1 CidA/LrgA family protein -
  SE864_RS01205 (SE864_01205) lrgB 203479..204207 (+) 729 WP_000421727.1 antiholin-like protein LrgB -
  SE864_RS01210 (SE864_01210) - 204450..206078 (+) 1629 WP_000170504.1 ABC transporter substrate-binding protein -
  SE864_RS01215 (SE864_01215) - 206191..207168 (+) 978 WP_000680644.1 ABC transporter permease -
  SE864_RS01220 (SE864_01220) - 207165..207986 (+) 822 WP_319099080.1 ABC transporter permease -
  SE864_RS01225 (SE864_01225) - 207998..208801 (+) 804 WP_000140979.1 ABC transporter ATP-binding protein -
  SE864_RS01230 (SE864_01230) - 208785..209411 (+) 627 WP_000171304.1 ABC transporter ATP-binding protein -
  SE864_RS01235 (SE864_01235) treP 209693..211723 (+) 2031 WP_000434610.1 PTS system trehalose-specific EIIBC component -
  SE864_RS01240 (SE864_01240) treC 211945..213570 (+) 1626 WP_000151014.1 alpha,alpha-phosphotrehalase -
  SE864_RS01245 (SE864_01245) - 213790..215826 (+) 2037 WP_000228178.1 BglG family transcription antiterminator -
  SE864_RS01250 (SE864_01250) - 215829..216113 (+) 285 WP_000944235.1 PTS sugar transporter subunit IIB -
  SE864_RS01255 (SE864_01255) - 216126..217481 (+) 1356 WP_000677351.1 PTS ascorbate transporter subunit IIC -
  SE864_RS01260 (SE864_01260) - 217484..218341 (+) 858 WP_000203492.1 transketolase -
  SE864_RS01265 (SE864_01265) - 218338..219267 (+) 930 WP_001203828.1 transketolase family protein -
  SE864_RS01270 (SE864_01270) - 219376..220635 (+) 1260 WP_001203074.1 ferric reductase-like transmembrane domain-containing protein -
  SE864_RS01275 (SE864_01275) rpsO 220723..220992 (+) 270 WP_001018249.1 30S ribosomal protein S15 -
  SE864_RS01280 (SE864_01280) pnp 221373..223502 (+) 2130 WP_000043857.1 polyribonucleotide nucleotidyltransferase -
  SE864_RS01285 (SE864_01285) - 223504..224256 (+) 753 WP_000204780.1 SseB family protein -
  SE864_RS01290 (SE864_01290) cysE 224265..224849 (+) 585 WP_000539954.1 serine O-acetyltransferase -
  SE864_RS01295 (SE864_01295) - 224859..225041 (+) 183 WP_000656477.1 hypothetical protein -
  SE864_RS01300 (SE864_01300) cysS 225038..226381 (+) 1344 WP_000591129.1 cysteine--tRNA ligase -
  SE864_RS01305 (SE864_01305) - 226374..226760 (+) 387 WP_000568029.1 Mini-ribonuclease 3 -
  SE864_RS01310 (SE864_01310) rlmB 226863..227618 (+) 756 WP_000178023.1 23S rRNA (guanosine(2251)-2'-O)-methyltransferase RlmB -
  SE864_RS01315 (SE864_01315) - 227615..228133 (+) 519 WP_000716636.1 NYN domain-containing protein -
  SE864_RS01320 (SE864_01320) - 228226..229086 (+) 861 WP_000143135.1 DegV family protein -
  SE864_RS01325 (SE864_01325) - 229624..229743 (+) 120 Protein_208 helix-turn-helix transcriptional regulator -
  SE864_RS01330 (SE864_01330) - 230061..231227 (-) 1167 WP_000160598.1 IS30-like element ISSag9 family transposase -
  SE864_RS01335 (SE864_01335) rplM 231528..231974 (+) 447 WP_001867156.1 50S ribosomal protein L13 -
  SE864_RS01340 (SE864_01340) rpsI 231995..232387 (+) 393 WP_000035940.1 30S ribosomal protein S9 -
  SE864_RS01345 (SE864_01345) - 232533..233687 (-) 1155 WP_000110711.1 site-specific integrase -
  SE864_RS01350 (SE864_01350) - 233742..234260 (-) 519 WP_000181342.1 helix-turn-helix domain-containing protein -
  SE864_RS01355 (SE864_01355) - 234407..234691 (+) 285 WP_001287945.1 hypothetical protein -
  SE864_RS01360 (SE864_01360) - 234703..235557 (+) 855 WP_000005759.1 phage replisome organizer N-terminal domain-containing protein -
  SE864_RS01365 (SE864_01365) - 235568..235858 (+) 291 WP_000158581.1 DUF5962 family protein -

Sequence


Protein


Length: 324 a.a.   Molecular weight: 37100.99 Da   Isoelectric point: 4.4041

>NTDB_id=902335 SE864_RS01115 WP_001008570.1 190588..191562(+) (comYH) [Streptococcus agalactiae strain SagR31]
MNFEKIETAYELILENIQTIENQLKTHIYDALIEQNSYYLGSSCDLDIVVVNNQKLRQLDLSQEEWRRTFQFIFIKSAQT
EQLQANHQFTPDSIGFILLFLLEELTSQETVDVLEIGSGTGNLAQTLLNNSSKELNYMGIEVDDLLIDLSASIAEIIGSS
AQFIQEDAVRPQILKESDVIISDLPIGYYPNDDIAKRYAVSSSKEHTYAHHLLMEQSLKYLKKDGIAIFLAPENLLTSPQ
SDLLKEWLKGYADVIAVLTLPETIFGSRQNAKSIFVLKKQAEQKPETFVYPLTDLQNRENMANFIENFQKWSRENSHYSK
NMIE

Nucleotide


Length: 975 bp

>NTDB_id=902335 SE864_RS01115 WP_001008570.1 190588..191562(+) (comYH) [Streptococcus agalactiae strain SagR31]
ATGAATTTTGAAAAAATTGAGACAGCCTATGAGCTGATTTTAGAAAATATCCAAACGATTGAGAACCAATTAAAAACTCA
TATTTATGATGCCTTAATTGAACAGAACTCTTATTACCTTGGTTCAAGTTGTGATTTAGATATTGTTGTGGTGAATAACC
AAAAATTACGTCAACTTGACTTAAGTCAAGAAGAATGGCGTCGCACTTTCCAGTTCATTTTTATCAAATCTGCGCAAACA
GAGCAATTACAAGCTAATCATCAGTTTACGCCAGATAGTATTGGTTTTATCTTGTTATTTCTTTTGGAAGAATTAACGAG
TCAAGAGACAGTGGATGTCTTGGAAATTGGAAGTGGAACTGGGAATTTAGCTCAGACTCTCCTCAATAACAGCTCGAAAG
AGTTAAATTATATGGGCATTGAAGTTGATGATCTTTTGATTGATCTATCAGCAAGCATTGCTGAAATTATAGGTTCTAGT
GCCCAATTTATCCAAGAGGATGCCGTTAGACCACAAATTTTGAAAGAAAGCGATGTAATCATTAGTGATTTACCAATTGG
CTATTATCCTAATGATGATATTGCTAAACGATATGCTGTATCAAGTTCTAAAGAGCACACCTATGCTCACCATCTATTGA
TGGAGCAATCTCTTAAATATTTGAAAAAAGATGGAATCGCTATATTTTTAGCACCCGAAAACCTTTTAACAAGTCCACAA
AGTGATTTGCTGAAGGAGTGGTTAAAAGGATATGCAGATGTCATTGCCGTTTTAACTCTACCAGAAACTATTTTTGGAAG
TCGTCAAAATGCGAAATCTATATTTGTTCTCAAGAAGCAAGCAGAACAAAAACCAGAAACCTTTGTATATCCGCTGACAG
ATTTGCAAAATCGTGAGAATATGGCAAACTTCATTGAAAATTTTCAAAAATGGAGCAGAGAAAATAGTCATTACTCAAAA
AATATGATAGAATAG
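
The coordinates, nucleotide length, and protein length of this record are mutually consistent, which a short check makes explicit. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and the standard codon assignments; the helper names are hypothetical. It translates only the first 30 and last 15 nucleotides shown above rather than the full sequence.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(start, end):
    """Length in bp of a 1-based inclusive coordinate range."""
    return end - start + 1


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


gene_bp = span_length(190588, 191562)   # coordinates from the record
codons = gene_bp // 3                   # includes the terminal stop codon
protein_len = codons - 1                # residues in the translated product

first = translate("ATGAATTTTGAAAAAATTGAGACAGCCTAT")  # first 30 nt of the gene
last = translate("AATATGATAGAATAG")                  # last 15 nt of the gene

print(gene_bp, codons, protein_len, first, last)
```

The 975 bp span corresponds to 325 codons, that is, 324 residues plus the TAG stop codon, and the translated ends match the start (MNFEKIETAY) and end (NMIE) of the protein sequence listed above.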

Domains


Predicted by InterProScan.

(70-303)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comYH   Streptococcus mutans UA159   66.984   97.222   0.651
comYH   Streptococcus mutans UA140   66.984   97.222   0.651
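
The three numeric values listed for each hit appear to be related: the Ha-value matches the identity fraction multiplied by the coverage fraction, rounded to three decimals. This is an observation about the listed numbers, not a definition stated in this record, and the function name below is hypothetical. A minimal Python sketch under that assumption:

```python
def ha_value(identity_pct, coverage_pct):
    """Product of identity and coverage, both given as percentages.

    Assumed relationship inferred from the listed values; the record
    itself does not define how the Ha-value is computed.
    """
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# Values listed for the Streptococcus mutans comYH hits
print(ha_value(66.984, 97.222))
```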