Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGA
Type: Machinery gene
Locus tag: GRQ40_RS12930
Genome accession: NZ_CP047158
Coordinates: 2684596..2685660 (-)
Length: 354 a.a.
NCBI ID: WP_044744511.1
UniProt ID: -
Organism: Anoxybacillus sp. PDR2
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
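
The coordinates and the protein length in the overview are related by simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the span includes a single terminal stop codon; both are assumptions, not statements from the record. The variable names are illustrative only.

```python
# Coordinates as listed in the overview (assumed 1-based, inclusive).
start, end = 2684596, 2685660

# Inclusive span length in base pairs.
gene_length_bp = end - start + 1

# Assumption: the span ends with one stop codon, which is not counted
# in the protein length.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the span length agrees with the nucleotide length and the protein length given elsewhere in the record.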

Genomic Context


Location: 2679596..2690660
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
GRQ40_RS12890 (GRQ40_12890)   -   2680521..2680700 (-)   180   WP_044744503.1   YqzE family protein   -
GRQ40_RS12895 (GRQ40_12895)   -   2680725..2681240 (-)   516   WP_044744504.1   shikimate kinase   -
GRQ40_RS12900 (GRQ40_12900)   comGG   2681702..2682088 (-)   387   WP_066145817.1   competence type IV pilus minor pilin ComGG   -
GRQ40_RS12905 (GRQ40_12905)   comGF   2682085..2682543 (-)   459   WP_231734572.1   competence type IV pilus minor pilin ComGF   -
GRQ40_RS12910 (GRQ40_12910)   comGE   2682494..2682847 (-)   354   WP_159720136.1   competence type IV pilus minor pilin ComGE   -
GRQ40_RS12915 (GRQ40_12915)   comGD   2682831..2683277 (-)   447   WP_066145823.1   competence type IV pilus minor pilin ComGD   -
GRQ40_RS12920 (GRQ40_12920)   comGC   2683261..2683557 (-)   297   WP_080862312.1   competence type IV pilus major pilin ComGC   Machinery gene
GRQ40_RS12925 (GRQ40_12925)   comGB   2683575..2684603 (-)   1029   WP_066145828.1   competence type IV pilus assembly protein ComGB   -
GRQ40_RS12930 (GRQ40_12930)   comGA   2684596..2685660 (-)   1065   WP_044744511.1   competence type IV pilus ATPase ComGA   Machinery gene
GRQ40_RS12935 (GRQ40_12935)   -   2685902..2686609 (+)   708   WP_066145834.1   metalloregulator ArsR/SmtB family transcription factor   -
GRQ40_RS12940 (GRQ40_12940)   -   2686748..2686990 (+)   243   WP_044744513.1   DUF2626 domain-containing protein   -
GRQ40_RS12945 (GRQ40_12945)   -   2687224..2688351 (-)   1128   WP_159720137.1   class I SAM-dependent methyltransferase   -
GRQ40_RS12950 (GRQ40_12950)   -   2688510..2689142 (-)   633   WP_159720138.1   MBL fold metallo-hydrolase   -
GRQ40_RS12955 (GRQ40_12955)   -   2689229..2689651 (+)   423   WP_066145840.1   hypothetical protein   -
GRQ40_RS12960 (GRQ40_12960)   -   2689714..2689890 (+)   177   WP_044744516.1   DUF2759 domain-containing protein   -
GRQ40_RS12965 (GRQ40_12965)   -   2689954..2690310 (-)   357   WP_159720139.1   MTH1187 family thiamine-binding protein   -
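
The listed location appears to be the comGA span extended by 5000 bp on each side, and several adjacent comG genes share a few bases. The sketch below checks both from the coordinates in the table. The 5000 bp flank is inferred from the numbers rather than stated in the record, and `overlap_bp` is a hypothetical helper.

```python
# comGA span from the table (assumed 1-based, inclusive coordinates).
gene_start, gene_end = 2684596, 2685660

# Assumption: the displayed window is the gene plus a fixed flank on each side.
flank = 5000
window = (gene_start - flank, gene_end + flank)

# comG genes from the table, all on the (-) strand, ordered by start coordinate.
comg = [
    ("comGG", 2681702, 2682088),
    ("comGF", 2682085, 2682543),
    ("comGE", 2682494, 2682847),
    ("comGD", 2682831, 2683277),
    ("comGC", 2683261, 2683557),
    ("comGB", 2683575, 2684603),
    ("comGA", 2684596, 2685660),
]

def overlap_bp(a, b):
    """Number of bases shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[2], b[2]) - max(a[1], b[1]) + 1)

# Overlap between each pair of neighbouring genes.
overlaps = {f"{a[0]}/{b[0]}": overlap_bp(a, b) for a, b in zip(comg, comg[1:])}

print(window)
for pair, n in overlaps.items():
    print(pair, n)
```

Short overlaps between neighbouring coding sequences on the same strand are common in operon-like arrangements, although the record itself makes no claim about operon structure.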

Sequence


Protein


Length: 354 a.a.        Molecular weight: 39881.62 Da        Isoelectric point: 8.6943

>NTDB_id=411209 GRQ40_RS12930 WP_044744511.1 2684596..2685660(-) (comGA) [Anoxybacillus sp. PDR2]
MNEIEQLADRLVKEASEIGASDIHIVPRRDDALIQFRMDGALVVKGMLEKGLYERLLVYFKFLADMDIGERRRPQSGAME
ITQNGVHVSLRLSTLPTLYDESLVIRLLPHNSLLPLSHLALFPSTTRILLSLLHHSHGLLIFTGPTGSGKTTTLYTLLAA
CQKKWPRNVITLEDPVEKRIENMLQVQVNEKAGITYATGLKAILRHDPDVVIVGEIRDEETAKIAVRAALTGHLILSTMH
TKNAIGAIYRLLEFGVSWREIEQTMLAVTAQRLVELMCPFCKEACSAFCRRQRKIRRAAVYECLHGEALARAIQAARGEE
VEYRYRTLQHMMKKGVALGFLSPSVFKREMIEDA
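
The FASTA header packs several fields into one line. A minimal sketch of splitting it into named fields follows. The field names and the regular expression are assumptions based on this single header, not a documented format, and `parse_header` is a hypothetical helper.

```python
import re

header = (
    ">NTDB_id=411209 GRQ40_RS12930 WP_044744511.1 "
    "2684596..2685660(-) (comGA) [Anoxybacillus sp. PDR2]"
)

# Assumed layout: id, locus tag, protein accession, coordinates with strand,
# gene name in parentheses, organism in square brackets.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)

def parse_header(line):
    """Return the header fields as a dict; raise ValueError if the layout differs."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header layout: {line!r}")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record

record = parse_header(header)
print(record)
```

Raising on an unrecognized layout is a deliberate choice here, so that headers with a different field order fail loudly instead of being parsed incorrectly.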

Nucleotide


Length: 1065 bp

>NTDB_id=411209 GRQ40_RS12930 WP_044744511.1 2684596..2685660(-) (comGA) [Anoxybacillus sp. PDR2]
ATGAACGAAATTGAACAATTAGCGGATCGTCTCGTAAAAGAGGCGAGCGAAATCGGTGCCTCAGATATTCATATCGTTCC
AAGACGAGATGATGCCCTCATCCAGTTCCGCATGGATGGAGCGCTCGTTGTGAAGGGCATGCTGGAAAAGGGGTTATACG
AGCGGCTGCTTGTTTATTTTAAGTTTTTGGCGGATATGGATATTGGAGAGCGGCGTCGCCCGCAAAGCGGAGCCATGGAA
ATCACTCAAAACGGAGTTCATGTCAGCTTGCGCTTATCTACGCTTCCGACGCTTTATGATGAAAGCCTCGTGATTCGTCT
CCTTCCTCACAACTCACTTCTTCCTCTTTCACATCTTGCTTTATTTCCAAGCACAACGCGAATATTGCTGTCGCTTTTGC
ATCATTCCCACGGTTTGCTTATTTTTACGGGACCGACCGGTTCAGGAAAGACGACCACTTTATATACGCTGTTAGCAGCA
TGTCAGAAAAAGTGGCCGCGCAATGTCATTACCTTGGAAGATCCTGTGGAAAAACGGATCGAAAATATGTTGCAAGTTCA
GGTCAATGAAAAGGCGGGGATCACGTATGCAACAGGGCTAAAGGCGATTTTGCGCCATGATCCCGATGTCGTGATCGTCG
GCGAAATCCGCGATGAAGAAACGGCGAAAATCGCGGTACGCGCCGCTTTGACAGGACATTTAATCTTATCGACGATGCAT
ACAAAAAATGCTATCGGTGCGATTTATCGCTTATTGGAGTTTGGCGTTTCTTGGCGGGAAATCGAACAAACGATGCTGGC
TGTGACAGCCCAACGCCTTGTTGAACTGATGTGCCCATTTTGCAAAGAAGCGTGTTCGGCTTTTTGCCGCCGGCAAAGGA
AGATCAGACGAGCGGCCGTTTACGAATGTTTGCATGGAGAGGCGCTGGCAAGAGCGATTCAAGCGGCAAGAGGCGAAGAG
GTGGAGTATCGGTATCGCACGCTGCAGCACATGATGAAAAAAGGAGTGGCGCTCGGCTTTTTATCACCTTCCGTTTTCAA
AAGAGAGATGATAGAAGATGCGTAA
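
The nucleotide record starts with ATG and ends with TAA, which suggests it is given in coding orientation even though the gene lies on the (-) strand. As a spot check, the sketch below translates the first line of the nucleotide record and its last three codons, using the standard codon assignments, and compares the results with the protein record. The `translate` helper is hypothetical and covers only unambiguous A/C/G/T codons.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(nt):
    """Translate in frame 1, ignoring any trailing partial codon."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))

# First line of the nucleotide record (80 nt: 26 codons plus 2 leftover bases).
first_line_nt = (
    "ATGAACGAAATTGAACAATTAGCGGATCGTCTCGTAAAAGAGGCGAGCGAAATCGGTGCC"
    "TCAGATATTCATATCGTTCC"
)
# Last three codons of the nucleotide record.
last_codons_nt = "GATGCGTAA"

# Start and end of the protein record, copied from the Protein section.
protein_start = "MNEIEQLADRLVKEASEIGASDIHIV"
protein_end = "DA"

print(translate(first_line_nt) == protein_start)
print(translate(last_codons_nt) == protein_end + "*")
```

Only two fragments are checked here. Translating the full 1065 bp record the same way would be the complete test.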


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



[Figure: predicted transmembrane helix probability]


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comGA   Bacillus subtilis subsp. subtilis str. 168   57.02   98.588   0.562
pilB   Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539   39.21   92.938   0.364
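
The record does not define the Ha-value. For the two hits listed, it is numerically consistent with the identity fraction multiplied by the coverage fraction, rounded to three decimals. The sketch below checks that observation. The formula is an assumption drawn from these two rows, not a documented definition, and `identity_times_coverage` is a hypothetical helper.

```python
# (protein, identities %, coverage %, listed Ha-value) from the table.
rows = [
    ("comGA", 57.02, 98.588, 0.562),
    ("pilB", 39.21, 92.938, 0.364),
]

def identity_times_coverage(identity_pct, coverage_pct):
    """Assumed relation: product of the two fractions, rounded to 3 decimals."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)

# Compare the assumed relation with the listed value for each row.
checks = {
    name: identity_times_coverage(ident, cov) == listed
    for name, ident, cov, listed in rows
}
print(checks)
```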


Multiple sequence alignment