Detailed information    

in silico (bioinformatically predicted)

Overview


Name: comYC
Type: Machinery gene
Locus tag: PHA78_RS00810
Genome accession: NZ_CP116604
Coordinates: 139334..139615 (+)
Length: 93 a.a.
NCBI ID: WP_272158321.1
Uniprot ID: -
Organism: Streptococcus sp. HN38
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
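
The coordinates and the protein length in this overview agree with each other, and the arithmetic can be sketched in a few lines. The helper name `feature_length` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, in the `start..end (strand)` form used on this page.

```python
import re

def feature_length(coords: str) -> int:
    """Length in bp of a 1-based, inclusive 'start..end (strand)' interval."""
    match = re.fullmatch(r"(\d+)\.\.(\d+)\s*\(([+-])\)", coords.strip())
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {coords!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1

gene_bp = feature_length("139334..139615 (+)")
# The final codon of the coding sequence is the stop codon, so it adds no residue.
protein_aa = gene_bp // 3 - 1
print(gene_bp, protein_aa)
```

Under these assumptions the sketch reports 282 bp and 93 residues, matching the lengths given in this entry.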

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicted in the genome, as detailed below.

Gene-MGE association summary

MGE type  MGE coordinates  Gene coordinates  Relative position  Distance (bp)
ICE       107951..138383   139334..139615    flank              951
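
The distance in the summary matches the gap between the ICE end coordinate and the gene start coordinate. That reading is inferred from the numbers on this page, not from a documented VRprofile2 definition, and the function name `flank_distance` is hypothetical. A minimal sketch under that assumption:

```python
def flank_distance(mge: tuple[int, int], gene: tuple[int, int]) -> int:
    """Gap in bp between two coordinate intervals; 0 if they overlap."""
    mge_start, mge_end = mge
    gene_start, gene_end = gene
    if gene_start > mge_end:   # gene lies downstream of the MGE
        return gene_start - mge_end
    if mge_start > gene_end:   # gene lies upstream of the MGE
        return mge_start - gene_end
    return 0                   # intervals overlap: gene is inside or spans the MGE

distance = flank_distance((107951, 138383), (139334, 139615))
print(distance)
```

With the coordinates from the table, the sketch gives 951 bp, the value reported for this gene.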


Gene organization within MGE regions


Location: 107951..139615
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  PHA78_RS00680 - 107951..109099 (-) 1149 WP_024377200.1 site-specific integrase -
  PHA78_RS00685 - 109151..109492 (-) 342 WP_024376319.1 DUF771 domain-containing protein -
  PHA78_RS00690 - 109550..110788 (-) 1239 WP_024376318.1 replication initiation factor domain-containing protein -
  PHA78_RS00695 - 110882..111649 (-) 768 WP_099806850.1 AAA family ATPase -
  PHA78_RS00700 - 111649..112134 (-) 486 WP_024376316.1 hypothetical protein -
  PHA78_RS00705 - 112697..113404 (+) 708 WP_024376314.1 hypothetical protein -
  PHA78_RS00710 - 113461..113796 (-) 336 WP_306508887.1 helix-turn-helix transcriptional regulator -
  PHA78_RS00715 - 113943..116426 (+) 2484 WP_272158316.1 SEC10/PgrA surface exclusion domain-containing protein -
  PHA78_RS00720 - 116730..117350 (+) 621 WP_024376311.1 hypothetical protein -
  PHA78_RS00725 - 117520..117663 (+) 144 WP_099806715.1 putative holin-like toxin -
  PHA78_RS00730 - 118369..118812 (+) 444 WP_024377807.1 zinc-dependent MarR family transcriptional regulator -
  PHA78_RS00735 - 118813..119517 (+) 705 WP_024377806.1 metal ABC transporter ATP-binding protein -
  PHA78_RS00740 - 119510..120322 (+) 813 WP_002936593.1 metal ABC transporter permease -
  PHA78_RS00745 - 120332..121846 (+) 1515 WP_099778505.1 zinc ABC transporter substrate-binding protein AdcA -
  PHA78_RS00750 - 121949..122395 (+) 447 WP_024377804.1 CopY/TcrY family copper transport repressor -
  PHA78_RS00755 - 122392..122601 (+) 210 WP_024377803.1 heavy metal-associated domain-containing protein -
  PHA78_RS00760 - 122629..123006 (-) 378 WP_272158317.1 HIT family protein -
  PHA78_RS00765 - 123060..123842 (-) 783 WP_024377801.1 TMEM175 family protein -
  PHA78_RS00770 tyrS 123846..125102 (-) 1257 WP_024415510.1 tyrosine--tRNA ligase -
  PHA78_RS00775 pbp1b 125256..127679 (+) 2424 WP_272158318.1 penicillin-binding protein PBP1B -
  PHA78_RS00780 rpoB 128344..131916 (+) 3573 WP_002936570.1 DNA-directed RNA polymerase subunit beta -
  PHA78_RS00785 rpoC 132094..135741 (+) 3648 WP_272158319.1 DNA-directed RNA polymerase subunit beta' -
  PHA78_RS00790 - 135893..136252 (+) 360 WP_024377796.1 DUF1033 family protein -
  PHA78_RS00795 - 136397..137347 (-) 951 WP_272158320.1 S66 peptidase family protein -
  PHA78_RS00800 comYA 137433..138383 (+) 951 WP_043026808.1 competence type IV pilus ATPase ComGA Machinery gene
  PHA78_RS00805 comYB 138295..139332 (+) 1038 WP_272158556.1 competence type IV pilus assembly protein ComGB Machinery gene
  PHA78_RS00810 comYC 139334..139615 (+) 282 WP_272158321.1 competence type IV pilus major pilin ComGC Machinery gene

Sequence


Protein


Length: 93 a.a.        Molecular weight: 10292.25 Da        Isoelectric Point: 8.4205

>NTDB_id=778745 PHA78_RS00810 WP_272158321.1 139334..139615(+) (comYC) [Streptococcus sp. HN38]
MKKLIEKKVKAFTLVEMLVVLGIISLLLLLFVPNLSKQKEAIKESGGTAVVKVVESQMELYALEHDKEATVADLQAAGYI
TEKQAEEYAKAKK

Nucleotide


Length: 282 bp

>NTDB_id=778745 PHA78_RS00810 WP_272158321.1 139334..139615(+) (comYC) [Streptococcus sp. HN38]
ATGAAAAAATTAATTGAAAAGAAGGTAAAAGCGTTCACTCTGGTGGAAATGTTAGTCGTTTTGGGGATCATTAGCCTGCT
CTTGCTCCTCTTTGTGCCAAATTTGAGCAAACAAAAAGAAGCGATTAAAGAGTCCGGGGGTACAGCTGTCGTTAAAGTCG
TAGAAAGCCAGATGGAACTTTATGCATTAGAGCATGATAAGGAAGCAACGGTGGCAGATTTACAGGCGGCTGGCTATATT
ACTGAGAAACAAGCAGAAGAGTATGCTAAGGCGAAAAAATAA
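
The nucleotide and protein records can be cross-checked by translating the coding sequence. The sketch below builds the standard codon table by hand (the standard code and the bacterial code share the same codon-to-amino-acid assignments); the names `CODON_TABLE` and `translate` are illustrative and are not part of any tool used by this database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# Sequences copied from the FASTA records in this entry.
cds = (
    "ATGAAAAAATTAATTGAAAAGAAGGTAAAAGCGTTCACTCTGGTGGAAATGTTAGTCGTTTTGGGGATCATTAGCCTGCT"
    "CTTGCTCCTCTTTGTGCCAAATTTGAGCAAACAAAAAGAAGCGATTAAAGAGTCCGGGGGTACAGCTGTCGTTAAAGTCG"
    "TAGAAAGCCAGATGGAACTTTATGCATTAGAGCATGATAAGGAAGCAACGGTGGCAGATTTACAGGCGGCTGGCTATATT"
    "ACTGAGAAACAAGCAGAAGAGTATGCTAAGGCGAAAAAATAA"
)
protein = (
    "MKKLIEKKVKAFTLVEMLVVLGIISLLLLLFVPNLSKQKEAIKESGGTAVVKVVESQMELYALEHDKEATVADLQAAGYI"
    "TEKQAEEYAKAKK"
)

translated = translate(cds)
print(len(cds), len(translated), translated == protein)
```

For the sequences above this prints `282 93 True`: the 282 bp coding sequence translates to the 93-residue protein shown in the Protein section.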


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein     Organism                                         Identities (%)  Coverage (%)  Ha-value
comYC       Streptococcus suis isolate S10                   82.796          100           0.828
comGC/cglC  Streptococcus mitis SK321                        64.13           98.925        0.634
comGC/cglC  Streptococcus mitis NCTC 12261                   64.13           98.925        0.634
comYC       Streptococcus gordonii str. Challis substr. CH1  64.045          95.699        0.613
comGC/cglC  Streptococcus pneumoniae R6                      69.136          87.097        0.602
comGC/cglC  Streptococcus pneumoniae TIGR4                   69.136          87.097        0.602
comGC/cglC  Streptococcus pneumoniae D39                     69.136          87.097        0.602
comGC/cglC  Streptococcus pneumoniae Rx1                     69.136          87.097        0.602
comYC       Streptococcus mutans UA140                       62.5            94.624        0.591
comYC       Streptococcus mutans UA159                       62.5            94.624        0.591
comGC       Lactococcus lactis subsp. cremoris KW2           56.818          94.624        0.538
comGC       Staphylococcus aureus MW2                        53.165          84.946        0.452
comGC       Staphylococcus aureus N315                       53.165          84.946        0.452
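
The Ha-values listed here are numerically consistent with the product of the identity and coverage fractions, rounded to three decimals. This relationship is inferred from the listed numbers; the page itself does not define Ha-value, so the function `ha_estimate` below is a hypothetical reconstruction. The sketch checks it against the distinct identity/coverage pairs in the list.

```python
def ha_estimate(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relationship: (identity / 100) * (coverage / 100)."""
    return identity_pct * coverage_pct / 10_000

# (identity %, coverage %, listed Ha-value) for each distinct row in the list.
rows = [
    (82.796, 100.0, 0.828),
    (64.13, 98.925, 0.634),
    (64.045, 95.699, 0.613),
    (69.136, 87.097, 0.602),
    (62.5, 94.624, 0.591),
    (56.818, 94.624, 0.538),
    (53.165, 84.946, 0.452),
]

# Largest gap between the estimate and the listed value across all rows.
max_error = max(abs(ha_estimate(i, c) - ha) for i, c, ha in rows)
print(f"largest deviation from listed Ha-value: {max_error:.5f}")
```

Every listed value falls within rounding error (less than 0.0005) of the estimate, so the score appears to reward hits that are both similar and cover most of the query.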