Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   FAY30_RS16935 Genome accession   NZ_CP039727
Coordinates   3489614..3490732 (-) Length   372 a.a.
NCBI ID   WP_223820795.1    Uniprot ID   -
Organism   Bacillus sp. S3     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3484614..3495732
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FAY30_RS16895 (FAY30_16895) - 3485573..3486364 (+) 792 WP_149870967.1 YqhG family protein -
  FAY30_RS16900 (FAY30_16900) - 3486447..3486632 (-) 186 WP_149870968.1 YqzE family protein -
  FAY30_RS16905 (FAY30_16905) comGG 3486694..3487077 (-) 384 WP_190284673.1 competence type IV pilus minor pilin ComGG -
  FAY30_RS16910 (FAY30_16910) comGF 3487074..3487550 (-) 477 WP_223820794.1 competence type IV pilus minor pilin ComGF -
  FAY30_RS16915 (FAY30_16915) - 3487498..3487836 (-) 339 WP_190284674.1 hypothetical protein -
  FAY30_RS16920 (FAY30_16920) comGD 3487820..3488260 (-) 441 WP_149870971.1 competence type IV pilus minor pilin ComGD -
  FAY30_RS16925 (FAY30_16925) comGC 3488260..3488580 (-) 321 WP_149870972.1 competence type IV pilus major pilin ComGC -
  FAY30_RS16930 (FAY30_16930) comGB 3488596..3489627 (-) 1032 WP_149870973.1 competence type IV pilus assembly protein ComGB -
  FAY30_RS16935 (FAY30_16935) comGA 3489614..3490732 (-) 1119 WP_223820795.1 competence type IV pilus ATPase ComGA Machinery gene
  FAY30_RS16940 (FAY30_16940) - 3490995..3491693 (+) 699 WP_149870974.1 helix-turn-helix transcriptional regulator -
  FAY30_RS16945 (FAY30_16945) - 3491778..3492020 (+) 243 WP_007086339.1 DUF2626 domain-containing protein -
  FAY30_RS16950 (FAY30_16950) - 3492059..3493150 (-) 1092 WP_149870975.1 class I SAM-dependent methyltransferase -
  FAY30_RS16955 (FAY30_16955) - 3493304..3493939 (-) 636 WP_149870976.1 MBL fold metallo-hydrolase -
  FAY30_RS16960 (FAY30_16960) - 3494147..3494323 (+) 177 WP_149870977.1 DUF2759 domain-containing protein -
  FAY30_RS16965 (FAY30_16965) - 3494354..3495493 (-) 1140 WP_190284675.1 hypothetical protein -

Sequence


Protein


Download         Length: 372 a.a.        Molecular weight: 41885.65 Da        Isoelectric Point: 9.5642

>NTDB_id=360465 FAY30_RS16935 WP_223820795.1 3489614..3490732(-) (comGA) [Bacillus sp. S3]
MSKYMQRQNKKVVLPIVSAIEILANRIITDAARNQATDIHIIPRKKDTLVQIRLTNKLIPRLSLPKDECDRLISHFKFTA
NMDIGERRRPQSGAIFCEVDGQLMGLRLSTLPSNNRESLVIRLLPQQEQIPFHQLSLFPAMTRKLLALLKHAHGLIIFTG
PTGSGKTTTLYSLLNETAHLFHRNVITLEDPIEKNYDSVLQVQVNEKAGVTYAAGLKAILRHDPDIIMVGEIRDAETAKI
AVRAALTGHLVLSTMHTRDAKGAVYRLREFGVNWLEVEQTLIAVTAQRLVELTCPFCEGECSPLCYSYGRWKRASVFELL
SGRNLNTAMKAAKGEKVETHYRTLSKVINKGIALGYIQESEYERLVFADETT

Nucleotide


Download         Length: 1119 bp        

>NTDB_id=360465 FAY30_RS16935 WP_223820795.1 3489614..3490732(-) (comGA) [Bacillus sp. S3]
ATGTCGAAATATATGCAAAGACAAAATAAAAAGGTGGTGTTACCAATTGTTAGTGCGATTGAAATTTTAGCGAATCGGAT
TATCACAGATGCAGCACGGAACCAAGCGACAGATATTCACATAATACCGCGAAAGAAGGACACACTTGTTCAAATCCGTT
TAACCAACAAACTCATTCCCCGGTTATCCCTTCCAAAAGATGAATGCGACAGATTAATCTCACACTTTAAATTTACAGCA
AATATGGATATCGGTGAAAGAAGACGCCCCCAAAGCGGTGCCATTTTTTGTGAGGTAGACGGACAATTAATGGGGCTCAG
GCTTTCAACACTCCCCTCTAACAACAGAGAAAGCCTCGTCATCAGGTTATTACCCCAACAAGAACAGATTCCATTCCACC
AGCTTTCATTATTCCCGGCAATGACGCGGAAATTGCTGGCCCTGCTCAAGCACGCCCATGGCTTAATCATCTTTACCGGT
CCCACCGGCAGTGGGAAGACGACTACTCTCTATTCTCTTTTAAATGAAACAGCCCACTTATTTCATCGCAATGTCATCAC
ATTAGAAGATCCTATCGAAAAAAACTATGACTCTGTTCTTCAGGTACAAGTGAATGAAAAAGCAGGCGTAACCTATGCTG
CTGGTTTAAAGGCGATTCTTCGCCACGATCCCGACATTATCATGGTTGGAGAAATCAGGGATGCTGAAACCGCTAAAATT
GCCGTCAGAGCGGCCCTTACAGGGCATTTAGTACTCTCAACTATGCATACAAGAGATGCCAAGGGAGCAGTTTACCGCCT
CCGTGAATTTGGTGTGAATTGGCTGGAGGTCGAGCAAACATTAATTGCGGTAACCGCTCAAAGACTAGTAGAACTTACGT
GCCCATTCTGCGAAGGTGAATGTTCGCCCTTATGTTATTCCTATGGAAGGTGGAAACGAGCGAGTGTGTTTGAACTGTTG
TCTGGAAGAAATTTAAACACAGCGATGAAAGCGGCAAAAGGGGAAAAAGTCGAAACACATTACAGAACCCTTAGTAAGGT
GATTAACAAGGGGATTGCACTTGGATACATTCAAGAGTCGGAGTATGAACGGCTGGTGTTTGCTGATGAAACCACGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

55.775

95.43

0.532

  pilB Glaesserella parasuis strain SC1401

40.057

94.624

0.379


Multiple sequence alignment