Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGA
Type: Machinery gene
Locus tag: F7984_RS13510
Genome accession: NZ_CP044545
Coordinates: 2820413..2821474 (-)
Length: 353 a.a.
NCBI ID: WP_140462021.1
UniProt ID: -
Organism: Pradoshia sp. D12
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
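
As a quick consistency check on the figures above, the gene length follows from the inclusive coordinates, and the protein length from the codon count minus the stop codon. A minimal sketch in Python; the helper name `span_length` is illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate span."""
    return end - start + 1

# Coordinates of comGA (F7984_RS13510) as listed in the overview.
gene_start, gene_end = 2820413, 2821474

length_bp = span_length(gene_start, gene_end)
codons = length_bp // 3          # includes the terminal stop codon
protein_length = codons - 1      # residues in the translated product

print(length_bp, protein_length)  # 1062 353
```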

Genomic Context


Location: 2815413..2826474
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  F7984_RS13470 (F7984_13470) - 2815531..2816325 (+) 795 WP_140462026.1 YqhG family protein -
  F7984_RS13475 (F7984_13475) - 2816739..2816924 (-) 186 WP_225983607.1 YqzE family protein -
  F7984_RS13480 (F7984_13480) - 2816987..2817490 (-) 504 WP_140462025.1 shikimate kinase -
  F7984_RS13485 (F7984_13485) comGG 2817508..2817891 (-) 384 WP_181162039.1 competence type IV pilus minor pilin ComGG -
  F7984_RS13490 (F7984_13490) comGF 2817878..2818324 (-) 447 WP_192796851.1 competence type IV pilus minor pilin ComGF -
  F7984_RS19140 - 2818302..2818532 (-) 231 WP_192796812.1 hypothetical protein -
  F7984_RS13495 (F7984_13495) comGD 2818615..2819058 (-) 444 WP_066105059.1 competence type IV pilus minor pilin ComGD -
  F7984_RS13500 (F7984_13500) comGC 2819061..2819369 (-) 309 WP_139892543.1 competence type IV pilus major pilin ComGC -
  F7984_RS13505 (F7984_13505) comGB 2819383..2820423 (-) 1041 WP_180350004.1 competence type IV pilus assembly protein ComGB -
  F7984_RS13510 (F7984_13510) comGA 2820413..2821474 (-) 1062 WP_140462021.1 competence type IV pilus ATPase ComGA Machinery gene
  F7984_RS13515 (F7984_13515) - 2821816..2822523 (+) 708 WP_066105070.1 helix-turn-helix transcriptional regulator -
  F7984_RS13520 (F7984_13520) - 2822625..2822867 (+) 243 WP_066105073.1 DUF2626 domain-containing protein -
  F7984_RS13525 (F7984_13525) - 2822900..2823976 (-) 1077 WP_180350005.1 class I SAM-dependent methyltransferase -
  F7984_RS13530 (F7984_13530) - 2824141..2824773 (-) 633 WP_066105078.1 MBL fold metallo-hydrolase -
  F7984_RS13535 (F7984_13535) - 2824994..2825167 (+) 174 WP_066105081.1 DUF2759 domain-containing protein -
  F7984_RS13540 (F7984_13540) - 2825248..2826438 (-) 1191 WP_181162038.1 M14 family metallocarboxypeptidase -
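
The location window and the gene spacing in the table can be reproduced from the coordinates alone. The sketch below assumes the window is the comGA span extended by a fixed flank on each side (5,000 bp fits the numbers shown, though the rule the database actually uses is not stated here) and computes the overlap between adjacent genes. `overlap_bp` and `FLANK` are illustrative names.

```python
from typing import Tuple

Span = Tuple[int, int]  # inclusive (start, end) coordinates

def overlap_bp(a: Span, b: Span) -> int:
    """Number of bases shared by two inclusive spans (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

# Coordinates copied from the table above.
comGA = (2820413, 2821474)
comGB = (2819383, 2820423)
comGC = (2819061, 2819369)

FLANK = 5000  # assumed flank size; consistent with the Location line
window = (comGA[0] - FLANK, comGA[1] + FLANK)

print(window)                    # (2815413, 2826474)
print(overlap_bp(comGB, comGA))  # 11
print(overlap_bp(comGC, comGB))  # 0
```

The 11 bp overlap between comGB and comGA follows directly from the listed coordinates, while comGC and comGB do not overlap.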

Sequence


Protein


Length: 353 a.a.        Molecular weight: 39899.19 Da        Isoelectric Point: 8.1048

>NTDB_id=390916 F7984_RS13510 WP_140462021.1 2820413..2821474(-) (comGA) [Pradoshia sp. D12]
MLTIDKLAEKVVTEAVKFKASDIHIVPQRKKTHLYYRIHNRLFSEGSYQRDRAQRLISHFKFMAGMDIGEKRRPQSGALT
MFVSNKWIGLRLSTLPTPYSESLVIRIIPSLQTLPIEEISLFPKMSTTLISMLKHSHGLMIICGPTGSGKTTTLYSMLHH
AKNTVNRNIITLEDPVEHQSDDAIQVQINEKAGISYSVGLKAALRHDPDIIMIGEIRDAETARIAVRAALTGHLILTTMH
ASDVLGAVRRLLEFGIDKEEAKQTLLGVTSQRLLELHCPFCKGECSPYCLMEKSRGRVSVHEMIYGNELAKVWAEMEGDD
VNFHYPELSSLISKAVAYGFVNQKEYERWVHEE
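
The FASTA header above packs several fields into one line. The following sketch shows one way to split it into named fields. The pattern is inferred from this single record, so the field layout is an assumption rather than a documented format, and `parse_header` is a hypothetical helper.

```python
import re

# Pattern inferred from the header shown in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) "
    r"\[(?P<organism>[^\]]+)\]$"
)

def parse_header(line: str) -> dict:
    """Split a header of the form shown above into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields

header = (">NTDB_id=390916 F7984_RS13510 WP_140462021.1 "
          "2820413..2821474(-) (comGA) [Pradoshia sp. D12]")
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
# comGA - 1062
```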

Nucleotide


Length: 1062 bp

>NTDB_id=390916 F7984_RS13510 WP_140462021.1 2820413..2821474(-) (comGA) [Pradoshia sp. D12]
TTGCTCACCATCGATAAGTTAGCTGAAAAAGTTGTAACAGAAGCAGTAAAATTCAAAGCCTCAGATATCCACATTGTTCC
CCAACGCAAGAAAACCCATCTTTACTATCGGATTCACAATCGCTTATTTTCAGAAGGCAGTTACCAAAGAGACAGGGCCC
AACGACTCATCTCTCATTTTAAATTTATGGCCGGCATGGATATAGGAGAAAAAAGACGTCCTCAGTCAGGGGCCTTAACG
ATGTTTGTTTCAAATAAATGGATTGGTCTCCGATTATCCACCTTGCCTACCCCATACTCTGAATCTCTCGTCATTCGTAT
TATTCCAAGTTTACAAACTCTCCCAATCGAAGAAATCTCCTTATTTCCAAAAATGTCAACTACGTTAATTTCCATGCTTA
AACATTCCCATGGATTAATGATCATCTGTGGACCGACTGGAAGTGGTAAAACTACAACTTTATATTCCATGCTCCATCAT
GCCAAAAATACCGTTAACCGAAATATTATTACACTGGAAGATCCTGTTGAACATCAGTCCGATGATGCCATTCAAGTCCA
GATTAATGAAAAAGCCGGTATCAGTTATTCAGTAGGCTTGAAAGCTGCCCTGCGCCACGATCCCGATATTATCATGATTG
GAGAAATCAGGGATGCTGAAACGGCCAGAATAGCGGTGCGCGCCGCTTTAACAGGCCACTTAATCTTAACTACGATGCAT
GCCAGTGATGTATTGGGTGCAGTTAGAAGGCTTTTGGAATTTGGGATAGACAAGGAGGAGGCAAAGCAAACTCTTTTAGG
TGTAACCTCCCAAAGATTATTGGAATTGCACTGTCCTTTTTGTAAGGGTGAATGCTCACCGTACTGCCTTATGGAAAAGA
GCAGAGGGCGGGTAAGTGTGCATGAAATGATTTATGGCAATGAATTGGCGAAGGTTTGGGCAGAGATGGAGGGAGATGAT
GTGAACTTTCATTATCCGGAATTATCTTCTCTCATCAGCAAGGCTGTTGCCTATGGATTTGTGAATCAAAAAGAGTATGA
GAGGTGGGTGCATGAGGAGTAA
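
The nucleotide record can be checked against the protein record by translating it. The sketch below assumes the sequence is listed in coding orientation (5' to 3', already reverse-complemented for this minus-strand gene) and that an initial TTG is read as methionine, as bacterial translation tables allow. `translate_cds` is an illustrative helper, and only the first and last codons of the record are used as input.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; the bacterial table (NCBI table 11)
# assigns the same amino acids and differs only in permitted start codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Common bacterial start codons (a subset of those in table 11).
START_CODONS = {"ATG", "GTG", "TTG"}

def translate_cds(nt: str) -> str:
    """Translate a coding sequence; a leading start codon is read as Met,
    and translation stops at the first stop codon."""
    nt = nt.upper()
    protein = []
    for i in range(0, len(nt) - 2, 3):
        codon = nt[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate_cds("TTGCTCACCATCGATAAG"))        # MLTIDK
print(translate_cds("GAGAGGTGGGTGCATGAGGAGTAA"))  # ERWVHEE
```

Both fragments agree with the start (MLTIDK) and end (ERWVHEE) of the protein sequence listed above.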


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168 51.13 100 0.513
  pilB Haemophilus influenzae 86-028NP 36.963 98.867 0.365


Multiple sequence alignment