Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   IEC97_RS03050 Genome accession   NZ_JAEHGC010000001
Coordinates   578757..579836 (-) Length   359 a.a.
NCBI ID   WP_286179865.1    Uniprot ID   -
Organism   Neobacillus cucumis strain T4S4     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 573757..584836
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  IEC97_RS03010 (IEC97_03010) - 574747..575538 (+) 792 WP_198309174.1 YqhG family protein -
  IEC97_RS03015 (IEC97_03015) - 575571..575756 (-) 186 WP_198309175.1 YqzE family protein -
  IEC97_RS03020 (IEC97_03020) comGG 575822..576208 (-) 387 WP_198309176.1 competence type IV pilus minor pilin ComGG -
  IEC97_RS03025 (IEC97_03025) comGF 576205..576684 (-) 480 WP_198309177.1 competence type IV pilus minor pilin ComGF -
  IEC97_RS03030 (IEC97_03030) comGE 576629..576967 (-) 339 WP_198309178.1 competence type IV pilus minor pilin ComGE -
  IEC97_RS03035 (IEC97_03035) comGD 576951..577391 (-) 441 WP_198309179.1 competence type IV pilus minor pilin ComGD -
  IEC97_RS03040 (IEC97_03040) comGC 577388..577720 (-) 333 WP_198309180.1 competence type IV pilus major pilin ComGC -
  IEC97_RS03045 (IEC97_03045) comGB 577739..578770 (-) 1032 WP_198309181.1 competence type IV pilus assembly protein ComGB -
  IEC97_RS03050 (IEC97_03050) comGA 578757..579836 (-) 1080 WP_286179865.1 competence type IV pilus ATPase ComGA Machinery gene
  IEC97_RS03055 (IEC97_03055) - 580097..580792 (+) 696 WP_198309182.1 helix-turn-helix domain-containing protein -
  IEC97_RS03060 (IEC97_03060) - 580877..581119 (+) 243 WP_198309183.1 DUF2626 domain-containing protein -
  IEC97_RS03065 (IEC97_03065) - 581160..582245 (-) 1086 WP_198309184.1 SAM-dependent methyltransferase -
  IEC97_RS03070 (IEC97_03070) - 582399..583034 (-) 636 WP_198309185.1 MBL fold metallo-hydrolase -
  IEC97_RS03075 (IEC97_03075) - 583283..583459 (+) 177 WP_026572464.1 DUF2759 domain-containing protein -
  IEC97_RS03080 (IEC97_03080) - 583500..583814 (-) 315 WP_198309186.1 MTH1187 family thiamine-binding protein -

Sequence


Protein


Download         Length: 359 a.a.        Molecular weight: 40582.92 Da        Isoelectric Point: 9.0763

>NTDB_id=1112718 IEC97_RS03050 WP_286179865.1 578757..579836(-) (comGA) [Neobacillus cucumis strain T4S4]
MLIVSAIEILANRIIKDATRNQATDIHIIPRKNDTLVQIRLTNKLIPRLSLPKEECDKLISHFKFTANMDIGERRRPQSG
SIFCDVNGQLMGLRLSTLPSNNRESLVIRLLPQQEQIPFHQLSLFPSMTRKLLALLKHAHGLIIFTGPTGSGKTTTLYSL
LNETAHLFHRNVITLEDPIEKNYDSVLQVQVNEKAGVTYATGLKAILRHDPDIIMVGEIRDSETAKIAVRAALTGHLVLS
TMHTRDAKGAVYRLHEFGVNWLEVEQTLIAVTAQRLVELTCPFCQDDCSPFCYSYGRWKRASVFELLSGRNLQSAMRAAK
GENIEPRYKTINQVINKGIALGYIKESEYERLVYNYETS

Nucleotide


Download         Length: 1080 bp        

>NTDB_id=1112718 IEC97_RS03050 WP_286179865.1 578757..579836(-) (comGA) [Neobacillus cucumis strain T4S4]
ATTCTCATTGTCAGCGCTATCGAAATTTTAGCCAATCGAATTATTAAAGATGCCACTCGAAACCAAGCAACCGATATTCA
TATCATTCCGCGAAAGAATGATACACTCGTTCAAATCCGCCTTACGAATAAACTTATTCCTCGGTTATCCCTTCCAAAAG
AAGAATGCGACAAATTAATTTCACATTTTAAATTTACCGCAAACATGGATATTGGGGAAAGAAGACGCCCGCAAAGCGGT
TCAATTTTTTGCGATGTCAACGGTCAATTGATGGGGCTCAGATTATCCACCCTGCCTTCAAACAATCGAGAAAGCCTTGT
TATCCGGCTTTTACCTCAACAAGAACAGATTCCATTTCACCAGCTATCCTTATTCCCTTCAATGACGAGAAAATTACTTG
CCCTTCTTAAACACGCCCACGGGTTAATTATTTTTACCGGGCCGACAGGCAGTGGAAAAACCACTACCTTATATTCACTC
CTCAATGAAACCGCACACTTATTCCATCGGAACGTCATTACACTAGAAGACCCTATCGAAAAAAATTATGATTCCGTCCT
GCAGGTCCAAGTAAACGAAAAAGCGGGGGTCACCTATGCCACCGGATTAAAAGCTATTTTGCGGCATGATCCGGATATCA
TTATGGTAGGAGAAATAAGGGACAGTGAAACAGCAAAAATTGCTGTTCGGGCTGCGCTTACGGGGCATTTAGTTCTGTCC
ACAATGCATACAAGGGACGCGAAGGGAGCGGTTTATCGACTTCATGAATTCGGAGTGAATTGGCTGGAAGTAGAGCAAAC
GCTTATTGCCGTAACGGCACAGCGCTTGGTGGAATTGACCTGTCCATTTTGCCAGGACGACTGTTCGCCATTTTGTTATT
CATATGGTAGATGGAAAAGAGCAAGTGTGTTTGAACTCTTATCAGGGAGGAATTTACAGTCAGCAATGAGGGCTGCAAAG
GGGGAAAATATTGAACCACGCTATAAGACGATCAATCAAGTAATCAATAAGGGAATTGCACTTGGTTATATAAAAGAGTC
AGAATATGAACGGCTGGTATACAATTATGAAACATCGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

56.091

98.329

0.552

  pilB Glaesserella parasuis strain SC1401

38.997

100

0.39


Multiple sequence alignment