Detailed information    

In silico: bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   M654_RS15835 Genome accession   NZ_CP048273
Coordinates   3033928..3034995 (-) Length   355 a.a.
NCBI ID   WP_026587854.1    Uniprot ID   A0A6I7FCH5
Organism   Bacillus sp. NSP9.1     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake
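The coordinates and lengths listed above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS of cds_bp bases, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


# comGA coordinates from the Overview: 3033928..3034995 (-)
cds_bp = span_length(3033928, 3034995)
print(cds_bp, protein_length(cds_bp))
```

For this record the range spans 1068 bp, which corresponds to 355 residues plus one stop codon.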

Genomic Context


Location: 3028928..3039995
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  M654_RS15790 (M654_015740) tapA 3029387..3030115 (-) 729 WP_026587845.1 amyloid fiber anchoring/assembly protein TapA -
  M654_RS15795 (M654_015745) - 3030385..3030702 (+) 318 WP_026587846.1 YqzG/YhdC family protein -
  M654_RS15800 (M654_015750) - 3030789..3030974 (-) 186 WP_026587847.1 YqzE family protein -
  M654_RS15805 (M654_015755) comGG 3031046..3031411 (-) 366 WP_026587848.1 competence type IV pilus minor pilin ComGG -
  M654_RS24145 (M654_015760) comGF 3031425..3031913 (-) 489 WP_077736493.1 competence type IV pilus minor pilin ComGF -
  M654_RS15815 (M654_015765) comGE 3031822..3032169 (-) 348 WP_026587850.1 competence type IV pilus minor pilin ComGE -
  M654_RS15820 (M654_015770) comGD 3032153..3032596 (-) 444 WP_236251143.1 competence type IV pilus minor pilin ComGD -
  M654_RS15825 (M654_015775) comGC 3032596..3032889 (-) 294 WP_026587852.1 competence type IV pilus major pilin ComGC Machinery gene
  M654_RS15830 (M654_015780) comGB 3032904..3033941 (-) 1038 WP_026587853.1 competence type IV pilus assembly protein ComGB Machinery gene
  M654_RS15835 (M654_015785) comGA 3033928..3034995 (-) 1068 WP_026587854.1 competence type IV pilus ATPase ComGA Machinery gene
  M654_RS15840 (M654_015790) - 3035165..3036016 (-) 852 WP_026587855.1 STAS domain-containing protein -
  M654_RS15850 (M654_015800) - 3036267..3036644 (-) 378 WP_035428189.1 Spx/MgsR family RNA polymerase-binding regulatory protein -
  M654_RS15855 (M654_015805) - 3036853..3037098 (+) 246 WP_026587856.1 DUF2626 domain-containing protein -
  M654_RS15860 (M654_015810) - 3037140..3037778 (-) 639 WP_026587857.1 MBL fold metallo-hydrolase -
  M654_RS15865 (M654_015815) - 3037937..3038110 (+) 174 WP_003183486.1 DUF2759 domain-containing protein -
  M654_RS15870 (M654_015820) - 3038186..3038497 (-) 312 WP_026587858.1 MTH1187 family thiamine-binding protein -
  M654_RS15875 (M654_015825) - 3038514..3039623 (-) 1110 WP_026587859.1 hypothetical protein -
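
The Location range matches the comGA coordinates extended by 5000 bp on each side, and several neighbouring genes in the table overlap by a few bases. The sketch below reproduces both observations. The 5000 bp flank is inferred from the numbers shown here rather than stated by the record, and the helper names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from the Location range, not documented here


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Gene coordinates extended by `flank` bp on both sides."""
    return (start - flank, end + flank)


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of shared bases between two 1-based inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


comGB = (3032904, 3033941)
comGA = (3033928, 3034995)

print(context_window(*comGA))    # window around comGA
print(overlap_bp(comGB, comGA))  # bases shared by comGB and comGA
```

With the coordinates from the table, the window is 3028928..3039995 and comGB overlaps comGA by 14 bp.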

Sequence


Protein


Length: 355 a.a.        Molecular weight: 39967.43 Da        Isoelectric Point: 9.1233

>NTDB_id=420463 M654_RS15835 WP_026587854.1 3033928..3034995(-) (comGA) [Bacillus sp. NSP9.1]
MQTTESFSGKLIEEAFMMKASDIHIVPGEKDALIRFRIDGELFRKHRLTKNECSRLISHFKFLSSMDIGERRQPQSGSLT
LHMKGEPVHLRMSTLPTINEESLVIRVLPQTSRQPLKKLSLFPSATFKLLSFLKHAHGLMIFTGPTGSGKSTTLYSLIEY
AKRHFNRNIITLEDPVESRSENILQVQVNEKAGMSYSAGLKAVLRHDPDMIILGEIRDAETAETAVRAALTGHLVLTSMH
AKNAKGAIYRLLEFGIDITEIEQTLIAVSAQRLVGLVCPFCGDRCSLFCRLSRPVRRASIFELLYGKRLHQCVKEAKGEY
AVVRDETLGMLIRKGIALGYLPAEAYERWVDHESE

Nucleotide


Length: 1068 bp

>NTDB_id=420463 M654_RS15835 WP_026587854.1 3033928..3034995(-) (comGA) [Bacillus sp. NSP9.1]
TTGCAGACGACTGAATCATTTAGCGGGAAATTAATCGAAGAGGCATTCATGATGAAAGCTTCGGATATCCATATTGTTCC
GGGAGAAAAAGACGCGCTGATACGCTTCAGAATTGATGGTGAGCTGTTTAGAAAGCATCGATTAACGAAAAATGAATGCT
CGAGGCTTATTTCTCATTTTAAATTTTTGTCTTCCATGGATATCGGTGAACGCAGGCAGCCGCAAAGCGGGTCCTTGACC
TTGCACATGAAGGGGGAGCCCGTTCATTTAAGAATGTCCACATTGCCAACCATAAATGAAGAAAGCCTGGTGATCCGCGT
TTTGCCGCAGACAAGCAGACAGCCGCTCAAAAAGCTTTCTTTGTTTCCAAGCGCAACATTCAAGCTCCTGTCTTTCCTGA
AACATGCTCACGGTTTAATGATCTTTACCGGTCCGACCGGATCAGGCAAGTCGACAACACTTTATTCACTGATCGAATAT
GCCAAACGCCACTTTAACCGCAACATCATTACATTGGAGGATCCGGTTGAATCGAGAAGCGAAAATATTTTGCAAGTACA
GGTGAATGAAAAAGCGGGAATGTCGTATTCGGCCGGTTTAAAGGCGGTTCTCCGCCATGATCCGGACATGATCATACTGG
GGGAAATCCGTGATGCCGAAACAGCCGAAACCGCAGTCAGAGCTGCGCTGACAGGACATCTCGTCCTCACGAGCATGCAT
GCGAAAAATGCAAAGGGAGCGATTTACAGGCTTCTTGAGTTTGGTATTGACATAACAGAAATTGAACAGACGCTCATCGC
AGTAAGCGCTCAGCGCCTGGTAGGGCTTGTCTGCCCGTTTTGCGGCGACAGATGCTCTTTGTTTTGCCGATTGTCCAGGC
CTGTGAGAAGAGCAAGCATTTTTGAGCTGCTTTATGGGAAAAGGCTTCACCAATGTGTAAAAGAAGCAAAAGGTGAATAT
GCCGTTGTCCGCGATGAAACTTTAGGCATGCTGATTCGAAAAGGAATTGCGCTCGGCTATCTGCCGGCGGAAGCCTATGA
GCGCTGGGTTGATCATGAGTCGGAATAA
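
The nucleotide record starts with TTG, while the protein record starts with M. A minimal sketch of the translation, assuming the standard codon assignments and that an alternative start codon (TTG or GTG) in first position is read as methionine, as is usual for bacterial coding sequences. The function name and the short test fragments are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
START_CODONS = {"ATG", "TTG", "GTG"}  # assumption: common bacterial initiation codons


def translate_cds(cds: str) -> str:
    """Translate a CDS given 5'->3' on the coding strand; the start codon becomes M."""
    cds = cds.upper().replace("\n", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"  # initiator tRNA inserts Met regardless of the codon
    if protein and protein[-1] == "*":
        protein.pop()     # drop the terminal stop codon
    return "".join(protein)


head = "TTGCAGACGACTGAATCATTTAGCGGGAAA"  # first 30 nt of the record above
tail = "GAGTCGGAATAA"                    # last 12 nt of the record above
print(translate_cds(head))
print(translate_cds(head + tail))
```

The first fragment translates to MQTTESFSGK, matching the start of the protein sequence, and the final codons give the C-terminal ESE followed by the TAA stop.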


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6I7FCH5

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168 69.663 100 0.699


Multiple sequence alignment