Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGC   Type   Machinery gene
Locus tag   GFC29_RS08230 Genome accession   NZ_CP015436
Coordinates   1709259..1709555 (-) Length   98 a.a.
NCBI ID   WP_080862312.1    Uniprot ID   -
Organism   Anoxybacillus sp. B7M1     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1691779..1709275 1709259..1709555 flank -16


Gene organization within MGE regions


Location: 1691779..1709555
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GFC29_RS08140 (GFC29_1678) - 1691779..1693188 (-) 1410 WP_044744493.1 LysM peptidoglycan-binding domain-containing protein -
  GFC29_RS08145 (GFC29_1679) - 1693299..1693913 (-) 615 Protein_1648 hypothetical protein -
  GFC29_RS20470 (GFC29_1680) - 1693999..1694529 (-) 531 WP_082063735.1 HNH endonuclease -
  GFC29_RS08155 (GFC29_1681) - 1694923..1696047 (-) 1125 Protein_1650 ribonucleotide-diphosphate reductase subunit alpha -
  GFC29_RS08160 (GFC29_1682) - 1696447..1697232 (-) 786 Protein_1651 ribonucleotide reductase N-terminal alpha domain-containing protein -
  GFC29_RS08165 (GFC29_1683) - 1697527..1698363 (-) 837 WP_044744496.1 biotin/lipoate A/B protein ligase family protein -
  GFC29_RS08170 (GFC29_1684) - 1698559..1698882 (+) 324 WP_275900426.1 rhodanese-like domain-containing protein -
  GFC29_RS08175 (GFC29_1685) gcvPB 1699493..1700947 (-) 1455 WP_044744498.1 aminomethyl-transferring glycine dehydrogenase subunit GcvPB -
  GFC29_RS08180 (GFC29_1686) gcvPA 1700940..1702286 (-) 1347 WP_044744499.1 aminomethyl-transferring glycine dehydrogenase subunit GcvPA -
  GFC29_RS08185 (GFC29_1687) gcvT 1702310..1703404 (-) 1095 WP_044744500.1 glycine cleavage system aminomethyltransferase GcvT -
  GFC29_RS08190 (GFC29_1688) - 1703892..1705553 (+) 1662 WP_369818753.1 DEAD/DEAH box helicase -
  GFC29_RS08195 (GFC29_1689) - 1705540..1706334 (+) 795 WP_044744502.1 YqhG family protein -
  GFC29_RS08200 (GFC29_1690) - 1706628..1706807 (-) 180 WP_044744503.1 YqzE family protein -
  GFC29_RS08205 (GFC29_1691) - 1706832..1707347 (-) 516 WP_044744504.1 shikimate kinase -
  GFC29_RS08210 (GFC29_1692) comGG 1707700..1708086 (-) 387 WP_044744505.1 competence type IV pilus minor pilin ComGG -
  GFC29_RS08215 (GFC29_1693) comGF 1708083..1708541 (-) 459 WP_052660993.1 competence type IV pilus minor pilin ComGF -
  GFC29_RS08220 (GFC29_1694) comGE 1708492..1708845 (-) 354 WP_044744507.1 competence type IV pilus minor pilin ComGE -
  GFC29_RS08225 (GFC29_1695) comGD 1708829..1709275 (-) 447 WP_044744508.1 competence type IV pilus minor pilin ComGD -
  GFC29_RS08230 (GFC29_1696) comGC 1709259..1709555 (-) 297 WP_080862312.1 competence type IV pilus major pilin ComGC Machinery gene

Sequence


Protein


Download         Length: 98 a.a.        Molecular weight: 11044.94 Da        Isoelectric Point: 6.7075

>NTDB_id=179716 GFC29_RS08230 WP_080862312.1 1709259..1709555(-) (comGC) [Anoxybacillus sp. B7M1]
MQEKGFTLIEMLIVLMVISVLLLIAIPNIVKHNQMINDKGCEAFVKTVQAQVKAYQMEHDTLPTIRQLVDGNYIKSNKCP
NGRTIYIDSSGEVHEGEK

Nucleotide


Download         Length: 297 bp        

>NTDB_id=179716 GFC29_RS08230 WP_080862312.1 1709259..1709555(-) (comGC) [Anoxybacillus sp. B7M1]
ATGCAAGAAAAAGGATTCACATTGATCGAGATGCTTATCGTTTTGATGGTGATTTCAGTTTTATTGCTTATTGCCATTCC
CAATATCGTCAAGCATAACCAAATGATTAACGATAAAGGATGCGAAGCATTTGTTAAAACCGTGCAAGCACAGGTTAAGG
CCTATCAGATGGAGCATGATACCCTTCCAACTATCCGACAGCTTGTTGATGGAAATTATATTAAATCGAATAAATGCCCT
AACGGCCGCACCATTTATATCGATAGCAGCGGGGAAGTGCATGAGGGTGAGAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGC Bacillus subtilis subsp. subtilis str. 168

51.579

96.939

0.5

  comGC Staphylococcus aureus MW2

45.161

94.898

0.429

  comGC Staphylococcus aureus N315

45.161

94.898

0.429


Multiple sequence alignment