Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGC/cglC
Type: Machinery gene
Locus tag: I6I78_RS16415
Genome accession: NZ_CP068128
Coordinates: 3492164..3492394 (-)
Length: 76 a.a.
NCBI ID: WP_201705387.1
UniProt ID: -
Organism: Enterococcus casseliflavus strain FDAARGOS_1121
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
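
The coordinates and the protein length in the overview can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    aa_len = nt_len // 3 - 1  # drop the stop codon
    return nt_len, aa_len


# Coordinates of I6I78_RS16415 as listed in the overview.
nt_len, aa_len = cds_lengths(3492164, 3492394)
print(nt_len, aa_len)  # 231 76
```

Under these assumptions the result agrees with the 231 bp and 76 a.a. reported for this gene.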

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 3489500..3511799 3492164..3492394 within 0
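
The relative position and distance in the summary can be reproduced from the two coordinate ranges. How VRprofile2 or the database derives these columns is not stated on this page, so the following is only a minimal sketch of one plausible rule. The function name `relative_position` and the labels other than "within" are hypothetical.

```python
def relative_position(gene: tuple[int, int], mge: tuple[int, int]) -> tuple[str, int]:
    """Classify a gene relative to an MGE and return (label, distance in bp).

    Both ranges are (start, end) with 1-based inclusive coordinates.
    Only the "within" label appears in the source table; the other
    labels are illustrative assumptions.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    return "overlapping", 0


# comGC/cglC versus the predicted prophage region.
print(relative_position((3492164, 3492394), (3489500, 3511799)))  # ('within', 0)
```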


Gene organization within MGE regions


Location: 3489500..3511799
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I6I78_RS16390 (I6I78_16390) - 3489500..3490504 (-) 1005 WP_096741836.1 class I SAM-dependent methyltransferase -
  I6I78_RS16395 (I6I78_16395) comGG 3490672..3491043 (-) 372 WP_015510479.1 competence type IV pilus minor pilin ComGG -
  I6I78_RS16400 (I6I78_16400) comGF 3491006..3491431 (-) 426 WP_005227741.1 competence type IV pilus minor pilin ComGF -
  I6I78_RS16405 (I6I78_16405) - 3491421..3491735 (-) 315 WP_151195730.1 hypothetical protein -
  I6I78_RS16410 (I6I78_16410) comGD 3491716..3492180 (-) 465 WP_236598536.1 competence type IV pilus minor pilin ComGD -
  I6I78_RS16415 (I6I78_16415) comGC/cglC 3492164..3492394 (-) 231 WP_201705387.1 competence type IV pilus major pilin ComGC Machinery gene
  I6I78_RS16420 (I6I78_16420) - 3492481..3492885 (+) 405 WP_201705388.1 hypothetical protein -
  I6I78_RS16430 (I6I78_16430) - 3493358..3493765 (+) 408 WP_144771966.1 hypothetical protein -
  I6I78_RS17690 - 3493811..3494830 (-) 1020 WP_236598537.1 CHAP domain-containing protein -
  I6I78_RS16440 (I6I78_16440) - 3494962..3495216 (-) 255 Protein_3234 phage holin -
  I6I78_RS16445 (I6I78_16445) - 3495219..3495572 (-) 354 WP_201705390.1 hypothetical protein -
  I6I78_RS16450 (I6I78_16450) - 3495614..3495814 (-) 201 WP_201705391.1 hypothetical protein -
  I6I78_RS16455 (I6I78_16455) - 3495830..3497197 (-) 1368 WP_201705392.1 hypothetical protein -
  I6I78_RS16460 (I6I78_16460) - 3497365..3499623 (-) 2259 WP_201705393.1 phage tail protein -
  I6I78_RS16465 (I6I78_16465) - 3499625..3501130 (-) 1506 WP_201705394.1 distal tail protein Dit -
  I6I78_RS16470 (I6I78_16470) - 3501134..3503206 (-) 2073 Protein_3240 phage tail tape measure protein -
  I6I78_RS17795 - 3504052..3504180 (-) 129 WP_269116838.1 hypothetical protein -
  I6I78_RS16480 (I6I78_16480) - 3504255..3504623 (-) 369 WP_201705396.1 hypothetical protein -
  I6I78_RS16485 (I6I78_16485) - 3504636..3505223 (-) 588 WP_201705397.1 major tail protein -
  I6I78_RS16490 (I6I78_16490) - 3505239..3505559 (-) 321 WP_201705398.1 hypothetical protein -
  I6I78_RS16495 (I6I78_16495) - 3505556..3505903 (-) 348 WP_201705399.1 HK97 gp10 family phage protein -
  I6I78_RS16500 (I6I78_16500) - 3505903..3506208 (-) 306 WP_201705400.1 phage head closure protein -
  I6I78_RS16505 (I6I78_16505) - 3506210..3506476 (-) 267 WP_201705401.1 hypothetical protein -
  I6I78_RS16510 (I6I78_16510) - 3506652..3507854 (-) 1203 WP_201705402.1 phage major capsid protein -
  I6I78_RS16515 (I6I78_16515) - 3507844..3508425 (-) 582 WP_201705403.1 HK97 family phage prohead protease -
  I6I78_RS16520 (I6I78_16520) - 3508425..3509345 (-) 921 WP_236598538.1 phage portal protein -
  I6I78_RS17695 - 3509342..3509629 (-) 288 WP_236598539.1 phage portal protein -
  I6I78_RS16525 (I6I78_16525) - 3509654..3511435 (-) 1782 WP_201705404.1 terminase large subunit -
  I6I78_RS16530 (I6I78_16530) - 3511440..3511799 (-) 360 WP_201705405.1 P27 family phage terminase small subunit -
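
The rows above can be checked programmatically. The sketch below parses the coordinates, strand, and size from a few rows of the comG cluster, confirms that each listed size equals `end - start + 1`, and reports how many base pairs each gene shares with the next one. The regular expression and the helper name `parse_row` are assumptions made for this illustration and rely only on the `start..end (strand) size` pattern visible in the table.

```python
import re

# Matches "3490672..3491043 (-) 372": start, end, strand, size in bp.
ROW = re.compile(r"(\d+)\.\.(\d+) \(([+-])\) (\d+)")


def parse_row(line: str) -> tuple[int, int, str, int]:
    """Extract (start, end, strand, size) from one table row."""
    m = ROW.search(line)
    if m is None:
        raise ValueError(f"no coordinates found in: {line!r}")
    return int(m[1]), int(m[2]), m[3], int(m[4])


rows = [
    "I6I78_RS16395 (I6I78_16395) comGG 3490672..3491043 (-) 372 WP_015510479.1 competence type IV pilus minor pilin ComGG -",
    "I6I78_RS16400 (I6I78_16400) comGF 3491006..3491431 (-) 426 WP_005227741.1 competence type IV pilus minor pilin ComGF -",
    "I6I78_RS16405 (I6I78_16405) - 3491421..3491735 (-) 315 WP_151195730.1 hypothetical protein -",
    "I6I78_RS16410 (I6I78_16410) comGD 3491716..3492180 (-) 465 WP_236598536.1 competence type IV pilus minor pilin ComGD -",
    "I6I78_RS16415 (I6I78_16415) comGC/cglC 3492164..3492394 (-) 231 WP_201705387.1 competence type IV pilus major pilin ComGC Machinery gene",
]

genes = [parse_row(r) for r in rows]

# Each size should equal the inclusive span of its coordinates.
sizes_ok = all(end - start + 1 == size for start, end, _, size in genes)

# Shared bases between consecutive genes; 0 would mean directly adjacent.
overlaps = [prev[1] - nxt[0] + 1 for prev, nxt in zip(genes, genes[1:])]

print(sizes_ok)   # True
print(overlaps)   # [38, 11, 20, 17]
```

For these five rows, each gene overlaps its neighbor by a short stretch, with comGD and comGC/cglC sharing 17 bp.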

Sequence


Protein


Length: 76 a.a.        Molecular weight: 8630.00 Da        Isoelectric Point: 4.2572

>NTDB_id=527335 I6I78_RS16415 WP_201705387.1 3492164..3492394(-) (comGC/cglC) [Enterococcus casseliflavus strain FDAARGOS_1121]
MLVVLLVISILVLLFVPNLANQRGIIDEKGNAAIVKVVETQIELFRLNENREPSREELIAEGYVSEEQYAIYEQGK

Nucleotide


Length: 231 bp

>NTDB_id=527335 I6I78_RS16415 WP_201705387.1 3492164..3492394(-) (comGC/cglC) [Enterococcus casseliflavus strain FDAARGOS_1121]
ATGTTGGTTGTTTTATTAGTGATCAGTATTTTGGTTTTATTGTTTGTCCCAAATCTTGCCAATCAAAGAGGAATCATTGA
TGAGAAAGGAAATGCGGCGATCGTGAAAGTAGTGGAAACCCAGATCGAACTCTTCCGTCTGAATGAAAATCGGGAGCCTT
CGAGAGAAGAATTGATTGCTGAAGGGTATGTATCTGAGGAACAATATGCGATTTATGAGCAGGGAAAGTGA
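
The nucleotide record can be translated to confirm that it encodes the protein sequence listed above. The sketch below uses a hand-built standard codon table rather than any particular library, and it assumes the FASTA sequence is already given in the coding orientation (it begins with ATG even though the gene lies on the minus strand). The names `CODON_TABLE` and `translate` are chosen for this illustration.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


NUCLEOTIDE = (
    "ATGTTGGTTGTTTTATTAGTGATCAGTATTTTGGTTTTATTGTTTGTCCCAAATCTTGCCAATCAAAGAGGAATCATTGA"
    "TGAGAAAGGAAATGCGGCGATCGTGAAAGTAGTGGAAACCCAGATCGAACTCTTCCGTCTGAATGAAAATCGGGAGCCTT"
    "CGAGAGAAGAATTGATTGCTGAAGGGTATGTATCTGAGGAACAATATGCGATTTATGAGCAGGGAAAGTGA"
)
PROTEIN = (
    "MLVVLLVISILVLLFVPNLANQRGIIDEKGNAAIVKVVETQIELFRLNENREPSREELIAEGYVSEEQYAIYEQGK"
)

translated = translate(NUCLEOTIDE)
print(len(NUCLEOTIDE), len(translated))  # 231 76
print(translated == PROTEIN)             # True
```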

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comGC/cglC   Streptococcus pneumoniae Rx1   52.703   97.368   0.513
  comGC/cglC   Streptococcus pneumoniae D39   52.703   97.368   0.513
  comGC/cglC   Streptococcus pneumoniae R6   52.703   97.368   0.513
  comGC/cglC   Streptococcus pneumoniae TIGR4   52.703   97.368   0.513
  comYC   Streptococcus suis isolate S10   51.316   100   0.513
  comYC   Streptococcus gordonii str. Challis substr. CH1   50   97.368   0.487
  comGC   Staphylococcus aureus MW2   49.275   90.789   0.447
  comGC   Staphylococcus aureus N315   49.275   90.789   0.447
  comGC/cglC   Streptococcus mitis NCTC 12261   47.541   80.263   0.382
  comGC/cglC   Streptococcus mitis SK321   47.541   80.263   0.382
  comGC   Bacillus subtilis subsp. subtilis str. 168   37.662   100   0.382