Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGA
Type: Machinery gene
Locus tag: NSQ41_RS13095
Genome accession: NZ_CP150172
Coordinates: 2704598..2705665 (-)
Length: 355 a.a.
NCBI ID: WP_130156359.1
UniProt ID: -
Organism: Aeribacillus sp. FSL K6-8210
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
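
The coordinates and the protein length in this overview are mutually consistent: the inclusive span 2704598..2705665 covers 1068 bp, which is 356 codons, and the final codon is the stop codon. The following is a minimal Python sketch of that arithmetic. The function names are illustrative and are not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate span."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a complete CDS whose last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length(2704598, 2705665)
print(cds_bp, protein_length(cds_bp))  # 1068 355
```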

Genomic Context


Location: 2699598..2710665
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NSQ41_RS13055 (NSQ41_13055) - 2700542..2701339 (+) 798 WP_063388101.1 YqhG family protein -
  NSQ41_RS13060 (NSQ41_13060) - 2701430..2701609 (-) 180 WP_063388100.1 YqzE family protein -
  NSQ41_RS13065 (NSQ41_13065) comGG 2701729..2702100 (-) 372 WP_339210105.1 competence type IV pilus minor pilin ComGG -
  NSQ41_RS13070 (NSQ41_13070) comGF 2702097..2702588 (-) 492 WP_130156367.1 competence type IV pilus minor pilin ComGF -
  NSQ41_RS13075 (NSQ41_13075) - 2702518..2702844 (-) 327 WP_063388097.1 hypothetical protein -
  NSQ41_RS13080 (NSQ41_13080) comGD 2702828..2703289 (-) 462 WP_158640149.1 competence type IV pilus minor pilin ComGD -
  NSQ41_RS13085 (NSQ41_13085) comGC 2703258..2703569 (-) 312 WP_063388095.1 competence type IV pilus major pilin ComGC Machinery gene
  NSQ41_RS13090 (NSQ41_13090) comGB 2703574..2704605 (-) 1032 WP_066247584.1 competence type IV pilus assembly protein ComGB -
  NSQ41_RS13095 (NSQ41_13095) comGA 2704598..2705665 (-) 1068 WP_130156359.1 competence type IV pilus ATPase ComGA Machinery gene
  NSQ41_RS13100 (NSQ41_13100) - 2705982..2706227 (+) 246 WP_063388092.1 DUF2626 domain-containing protein -
  NSQ41_RS13105 (NSQ41_13105) - 2706282..2706926 (-) 645 WP_063388091.1 MBL fold metallo-hydrolase -
  NSQ41_RS13110 (NSQ41_13110) - 2707173..2707346 (+) 174 WP_065095533.1 DUF2759 domain-containing protein -
  NSQ41_RS13115 (NSQ41_13115) - 2707463..2708581 (-) 1119 WP_339210106.1 hypothetical protein -
  NSQ41_RS13120 (NSQ41_13120) - 2708638..2709759 (-) 1122 WP_339210107.1 M14 family metallocarboxypeptidase -
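
The displayed location (2699598..2710665) matches the comGA span extended by 5000 bp on each side. That flank size is inferred from the numbers on this page and is not stated by the source. A small Python sketch of the window calculation under that assumption, with a hypothetical function name, and without handling circular-genome wraparound:

```python
def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Pad an inclusive span by `flank` bp on both sides (assumed flank size).

    The lower bound is clamped to position 1; the upper bound is not clamped
    because the replicon length is not given in this record.
    """
    return max(1, start - flank), end + flank


print(context_window(2704598, 2705665))  # (2699598, 2710665)
```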

Sequence


Protein


Length: 355 a.a.
Molecular weight: 40517.38 Da
Isoelectric point: 8.6397

>NTDB_id=964794 NSQ41_RS13095 WP_130156359.1 2704598..2705665(-) (comGA) [Aeribacillus sp. FSL K6-8210]
MIEIEKLSEKLIELACSFSASDIHIVPKKKKAVIQFRIDDDIKEIFHLPKAICERLISHFKFMAQMDIGEKRRPQNGAYS
FHTIYGEVHLRLSTLPTVFDESLVIRILLEQSIQTPEKLSLFPSESKKMLSFLKHSHGLLILTGPTGSGKTTTLYSLLHY
AKKHFKRNIITLEDPVETKSDEMLQVQINEKAGITYANGLKAILRHDPDVIMVGEIRDHETARIAVRAGLSGHLVLTTMH
TRDAKGAIYRLLEFGVTFSEIEQTLIAVAAQRLVHLICPFCKGNCPPYCKKLKEIRRIGVYELLYGKNLTAAIREAKGEE
QSELDYMKLKDVIKKGIALGYLYPDIYDRWVFEYE

Nucleotide


Length: 1068 bp

>NTDB_id=964794 NSQ41_RS13095 WP_130156359.1 2704598..2705665(-) (comGA) [Aeribacillus sp. FSL K6-8210]
ATGATTGAAATTGAAAAGCTTAGCGAAAAACTGATTGAACTGGCGTGTTCATTTAGTGCCTCAGACATCCATATTGTTCC
GAAAAAGAAAAAAGCAGTTATTCAATTTCGAATTGACGATGACATCAAAGAAATATTTCATCTGCCAAAAGCTATTTGTG
AACGCTTAATTTCTCATTTTAAATTTATGGCCCAAATGGATATTGGCGAAAAAAGGCGTCCGCAAAACGGTGCTTACTCC
TTCCATACGATTTACGGAGAGGTGCATCTTCGTTTATCGACATTGCCTACAGTATTCGATGAAAGTCTAGTCATCCGAAT
ATTGCTAGAGCAATCGATTCAAACGCCTGAAAAACTTTCATTGTTCCCGTCAGAATCCAAAAAAATGCTTTCATTTTTAA
AGCATTCTCACGGTCTCCTGATCCTTACCGGTCCGACCGGTTCGGGAAAAACGACCACTCTTTATTCGCTTCTCCATTAT
GCAAAAAAACATTTTAAAAGAAACATCATCACACTTGAAGATCCTGTGGAAACGAAAAGCGATGAAATGCTTCAAGTGCA
GATCAATGAAAAAGCTGGAATTACGTATGCAAATGGGTTAAAAGCAATATTAAGACATGATCCGGATGTCATTATGGTCG
GTGAAATTCGAGATCATGAGACAGCAAGGATTGCGGTCCGGGCTGGGCTTAGCGGACATTTAGTACTGACCACCATGCAT
ACGCGGGATGCAAAGGGAGCTATTTATCGATTATTAGAATTTGGTGTAACCTTTTCCGAAATTGAGCAAACTTTGATCGC
TGTTGCTGCGCAGCGTCTCGTTCATTTAATATGCCCGTTTTGCAAAGGGAATTGCCCGCCTTATTGCAAAAAATTAAAGG
AAATTAGGAGAATTGGGGTCTATGAACTTCTTTACGGCAAAAATTTAACCGCCGCTATTCGGGAAGCAAAAGGAGAGGAG
CAGAGTGAGTTAGATTATATGAAGTTAAAAGATGTAATTAAAAAAGGAATCGCTCTTGGGTATTTATATCCGGACATATA
TGATCGTTGGGTGTTTGAATATGAATAG
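
Although the gene lies on the minus strand, the nucleotide record above is already in coding orientation: it begins with ATG and ends with TAG. Translating it directly should therefore reproduce the protein record. The sketch below checks this on the first 33 and last 12 nucleotides only, using the standard codon table (its amino-acid assignments are the same as in bacterial translation table 11). The `translate` helper is illustrative and is not a database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding-strand DNA string in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


head = "ATGATTGAAATTGAAAAGCTTAGCGAAAAACTG"  # first 33 nt of the record
tail = "GAATATGAATAG"                       # last 12 nt of the record
print(translate(head))  # MIEIEKLSEKL
print(translate(tail))  # EYE*
```

The two fragments match the start (MIEIEKLSEKL) and end (EYE) of the protein sequence listed above.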


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168 62.323 99.437 0.62
  pilB Glaesserella parasuis strain SC1401 40.058 96.338 0.386
  pilF Thermus thermophilus HB27 42.809 84.225 0.361
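
This page does not define the Ha-value. The three listed values are, however, reproduced to three decimal places by multiplying identity and coverage expressed as fractions. The Python sketch below checks that assumption against the numbers above. It should be read as an inferred relationship, not as the database's documented formula, and `ha_value` is a hypothetical name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# (identity %, coverage %, Ha-value as listed on this page)
hits = {
    "comGA (B. subtilis 168)": (62.323, 99.437, 0.62),
    "pilB (G. parasuis SC1401)": (40.058, 96.338, 0.386),
    "pilF (T. thermophilus HB27)": (42.809, 84.225, 0.361),
}

for name, (identity, coverage, listed) in hits.items():
    computed = round(ha_value(identity, coverage), 3)
    print(f"{name}: computed {computed}, listed {listed}")
```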


Multiple sequence alignment