Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGC   Type   Machinery gene
Locus tag   GFC28_RS18190 Genome accession   NZ_CP015435
Coordinates   3750281..3750577 (+) Length   98 a.a.
NCBI ID   WP_080862312.1    Uniprot ID   -
Organism   Anoxybacillus sp. B2M1     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 3750561..3768057 3750281..3750577 flank -16


Gene organization within MGE regions


Location: 3750281..3768057
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GFC28_RS18190 (GFC28_3737) comGC 3750281..3750577 (+) 297 WP_080862312.1 competence type IV pilus major pilin ComGC Machinery gene
  GFC28_RS18195 (GFC28_3738) comGD 3750561..3751007 (+) 447 WP_044744508.1 competence type IV pilus minor pilin ComGD -
  GFC28_RS18200 (GFC28_3739) comGE 3750991..3751344 (+) 354 WP_044744507.1 competence type IV pilus minor pilin ComGE -
  GFC28_RS18205 (GFC28_3740) comGF 3751295..3751753 (+) 459 WP_052660993.1 competence type IV pilus minor pilin ComGF -
  GFC28_RS18210 (GFC28_3741) comGG 3751750..3752136 (+) 387 WP_044744505.1 competence type IV pilus minor pilin ComGG -
  GFC28_RS18215 (GFC28_3742) - 3752489..3753004 (+) 516 WP_044744504.1 shikimate kinase -
  GFC28_RS18220 (GFC28_3743) - 3753029..3753208 (+) 180 WP_044744503.1 YqzE family protein -
  GFC28_RS18225 (GFC28_3744) - 3753502..3754296 (-) 795 WP_044744502.1 YqhG family protein -
  GFC28_RS18230 (GFC28_3745) - 3754283..3755947 (-) 1665 WP_044744501.1 SNF2-related protein -
  GFC28_RS18235 (GFC28_3746) gcvT 3756432..3757526 (+) 1095 WP_044744500.1 glycine cleavage system aminomethyltransferase GcvT -
  GFC28_RS18240 (GFC28_3747) gcvPA 3757550..3758896 (+) 1347 WP_044744499.1 aminomethyl-transferring glycine dehydrogenase subunit GcvPA -
  GFC28_RS18245 (GFC28_3748) gcvPB 3758889..3760343 (+) 1455 WP_044744498.1 aminomethyl-transferring glycine dehydrogenase subunit GcvPB -
  GFC28_RS18250 (GFC28_3749) - 3760954..3761328 (-) 375 WP_044744497.1 rhodanese-like domain-containing protein -
  GFC28_RS18255 (GFC28_3750) - 3761473..3762309 (+) 837 WP_044744496.1 biotin/lipoate A/B protein ligase family protein -
  GFC28_RS18260 (GFC28_3751) - 3762604..3763389 (+) 786 Protein_3576 ribonucleotide reductase N-terminal alpha domain-containing protein -
  GFC28_RS18265 (GFC28_3752) - 3763747..3764913 (+) 1167 Protein_3577 ribonucleotide-diphosphate reductase subunit alpha -
  GFC28_RS19905 - 3765547..3765837 (+) 291 WP_044744495.1 HNH endonuclease -
  GFC28_RS18275 (GFC28_3754) - 3765923..3766537 (+) 615 Protein_3579 hypothetical protein -
  GFC28_RS18280 (GFC28_3755) - 3766648..3768057 (+) 1410 WP_044744493.1 LysM peptidoglycan-binding domain-containing protein -

Sequence


Protein


Download         Length: 98 a.a.        Molecular weight: 11044.94 Da        Isoelectric Point: 6.7075

>NTDB_id=179702 GFC28_RS18190 WP_080862312.1 3750281..3750577(+) (comGC) [Anoxybacillus sp. B2M1]
MQEKGFTLIEMLIVLMVISVLLLIAIPNIVKHNQMINDKGCEAFVKTVQAQVKAYQMEHDTLPTIRQLVDGNYIKSNKCP
NGRTIYIDSSGEVHEGEK

Nucleotide


Download         Length: 297 bp        

>NTDB_id=179702 GFC28_RS18190 WP_080862312.1 3750281..3750577(+) (comGC) [Anoxybacillus sp. B2M1]
ATGCAAGAAAAAGGATTCACATTGATCGAGATGCTTATCGTTTTGATGGTGATTTCAGTTTTATTGCTTATTGCCATTCC
CAATATCGTCAAGCATAACCAAATGATTAACGATAAAGGATGCGAAGCATTTGTTAAAACCGTGCAAGCACAGGTTAAGG
CCTATCAGATGGAGCATGATACCCTTCCAACTATCCGACAGCTTGTTGATGGAAATTATATTAAATCGAATAAATGCCCT
AACGGCCGCACCATTTATATCGATAGCAGCGGGGAAGTGCATGAGGGTGAGAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGC Bacillus subtilis subsp. subtilis str. 168

51.579

96.939

0.5

  comGC Staphylococcus aureus MW2

45.161

94.898

0.429

  comGC Staphylococcus aureus N315

45.161

94.898

0.429


Multiple sequence alignment