Detailed information    

In silico: bioinformatically predicted

Overview


Name   comGA
Type   Machinery gene
Locus tag   C3496_RS26295
Genome accession   NZ_CP026608
Coordinates   5139296..5140339 (-)
Length   347 a.a.
NCBI ID   WP_136444660.1
UniProt ID   A0A4S4HVD2
Organism   Bacillus anthracis strain HDZK-BYSB7
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
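
The coordinates and the two lengths reported for this gene are consistent with each other, which can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the helper names are hypothetical and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS of cds_bp bases, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(5139296, 5140339)   # comGA coordinates from this record
residues = protein_length(cds_bp)
print(cds_bp, residues)
```

The two values can be compared with the 1044 bp and 347 a.a. given in the record.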

Genomic Context


Location: 5134296..5145339
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C3496_RS26250 (C3496_27600) - 5134723..5134923 (-) 201 WP_000106081.1 YqzE family protein -
  C3496_RS26255 (C3496_27605) aroK 5134962..5135459 (-) 498 WP_071728422.1 shikimate kinase AroK -
  C3496_RS26260 (C3496_27610) - 5135580..5136230 (-) 651 WP_071728423.1 2OG-Fe(II) oxygenase -
  C3496_RS26265 (C3496_27615) comGG 5136406..5136777 (-) 372 WP_071728424.1 competence type IV pilus minor pilin ComGG -
  C3496_RS26270 (C3496_27620) comGF 5136774..5137244 (-) 471 WP_136444718.1 competence type IV pilus minor pilin ComGF -
  C3496_RS26275 (C3496_27625) comGE 5137214..5137516 (-) 303 WP_000829458.1 competence type IV pilus minor pilin ComGE -
  C3496_RS26280 (C3496_27630) comGD 5137509..5137964 (-) 456 WP_000810395.1 comG operon protein ComGD -
  C3496_RS26285 (C3496_27635) comGC 5137961..5138260 (-) 300 WP_001178696.1 comG operon protein ComGC -
  C3496_RS26290 (C3496_27640) comGB 5138272..5139303 (-) 1032 WP_088313315.1 comG operon protein ComGB -
  C3496_RS26295 (C3496_27645) comGA 5139296..5140339 (-) 1044 WP_136444660.1 competence protein ComGA Machinery gene
  C3496_RS26300 (C3496_27650) - 5140544..5141239 (+) 696 WP_071728428.1 metalloregulator ArsR/SmtB family transcription factor -
  C3496_RS26305 (C3496_27655) - 5141365..5141607 (+) 243 WP_000440721.1 DUF2626 domain-containing protein -
  C3496_RS26310 (C3496_27660) - 5141715..5143109 (+) 1395 WP_001094339.1 L-cystine transporter -
  C3496_RS26315 (C3496_27665) - 5143326..5143541 (-) 216 WP_001008322.1 DUF3912 family protein -
  C3496_RS26320 (C3496_27670) - 5143794..5144105 (+) 312 WP_001093243.1 hypothetical protein -
  C3496_RS26325 (C3496_27675) - 5144142..5144630 (-) 489 WP_136444659.1 hypothetical protein -
  C3496_RS26330 (C3496_27680) - 5144710..5145084 (-) 375 WP_136444658.1 nucleoside 2-deoxyribosyltransferase -
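
The rows of the table above can be sanity-checked programmatically. The following sketch parses two rows copied from the table, confirms that Size (bp) equals end - start + 1, measures the overlap between comGB and comGA, and shows that the Location window extends 5000 bp on each side of comGA. The regular expression and helper name are assumptions made for illustration, and the 5000 bp flank is an observation from the numbers in this record, not a documented database setting.

```python
import re

# Matches "start..end (strand) size " inside a flattened table row.
ROW = re.compile(r"(\d+)\.\.(\d+) \(([+-])\) (\d+) ")

rows = [
    "C3496_RS26290 (C3496_27640) comGB 5138272..5139303 (-) 1032 "
    "WP_088313315.1 comG operon protein ComGB -",
    "C3496_RS26295 (C3496_27645) comGA 5139296..5140339 (-) 1044 "
    "WP_136444660.1 competence protein ComGA Machinery gene",
]


def parse_row(row: str) -> tuple[int, int, str, int]:
    """Return (start, end, strand, size_bp) for one table row."""
    m = ROW.search(row)
    if m is None:
        raise ValueError(f"no coordinates found in row: {row!r}")
    return int(m[1]), int(m[2]), m[3], int(m[4])


parsed = [parse_row(r) for r in rows]

# Size (bp) should equal the inclusive span of the coordinates.
sizes_ok = all(end - start + 1 == size for start, end, _, size in parsed)

# comGB ends after comGA starts, so the two reading frames overlap.
overlap_bp = parsed[0][1] - parsed[1][0] + 1

# Flanks of the displayed window relative to comGA.
window_start, window_end = 5134296, 5145339
flank_left = parsed[1][0] - window_start
flank_right = window_end - parsed[1][1]

print(sizes_ok, overlap_bp, flank_left, flank_right)
```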

Sequence


Protein


Length: 347 a.a.        Molecular weight: 39242.74 Da        Isoelectric point: 9.3736

>NTDB_id=270718 C3496_RS26295 WP_136444660.1 5139296..5140339(-) (comGA) [Bacillus anthracis strain HDZK-BYSB7]
MNGIESFANTILKEACRVQASDLHIVPRQKDVVVQLRIGKNLMTKQCIEKGFGEKLVSHFKFLASMDIGERRKPQNGSLY
LQMDGQEVYLRLSTLPTVYQESLVIRLHLQASIQPLSHLSLFPSTAKKLLSFLHYSHGLLVFTGPTGSGKTTTMYALLEV
IRKKKTRRIVTLEDPVEKRSDDLLQIQINEKAGITYEAGLKAILRHDPDVILVGEIRDEETAKIAIRASLTGHLVMTTLH
TNDARGAILRFMDFGITRQEIEQSLLAIAAQRLVELKCPFCKRKCSTLCKSMRQVRQASIYELLYGYELKQAIKEANGEC
VTYKHETLQSSIRKGYALGFLEEDVYV
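
The FASTA header above packs several fields into one line. The sketch below shows one way to split it into named fields and join the wrapped sequence lines. The regular expression and the function name are hypothetical; the header layout is inferred only from this record, so other records may need a different pattern.

```python
import re

# Header layout inferred from this record only (an assumption):
# >NTDB_id=<id> <locus tag> <protein id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>[^\]]+)\]$"
)

FASTA = """\
>NTDB_id=270718 C3496_RS26295 WP_136444660.1 5139296..5140339(-) (comGA) [Bacillus anthracis strain HDZK-BYSB7]
MNGIESFANTILKEACRVQASDLHIVPRQKDVVVQLRIGKNLMTKQCIEKGFGEKLVSHFKFLASMDIGERRKPQNGSLY
LQMDGQEVYLRLSTLPTVYQESLVIRLHLQASIQPLSHLSLFPSTAKKLLSFLHYSHGLLVFTGPTGSGKTTTMYALLEV
IRKKKTRRIVTLEDPVEKRSDDLLQIQINEKAGITYEAGLKAILRHDPDVILVGEIRDEETAKIAIRASLTGHLVMTTLH
TNDARGAILRFMDFGITRQEIEQSLLAIAAQRLVELKCPFCKRKCSTLCKSMRQVRQASIYELLYGYELKQAIKEANGEC
VTYKHETLQSSIRKGYALGFLEEDVYV
"""


def parse_record(text: str) -> dict:
    """Parse a single-record FASTA string into header fields plus sequence."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    m = HEADER.match(lines[0])
    if m is None:
        raise ValueError("unrecognised header layout")
    record = m.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    record["sequence"] = "".join(lines[1:])  # undo the line wrapping
    return record


rec = parse_record(FASTA)
print(rec["gene"], rec["locus_tag"], rec["strand"], len(rec["sequence"]))
```

The printed sequence length can be compared with the Length field shown above the record.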

Nucleotide


Length: 1044 bp

>NTDB_id=270718 C3496_RS26295 WP_136444660.1 5139296..5140339(-) (comGA) [Bacillus anthracis strain HDZK-BYSB7]
ATGAATGGAATTGAAAGCTTTGCGAATACGATTTTGAAAGAAGCGTGTAGGGTACAAGCGTCGGACTTACATATTGTGCC
CCGACAGAAGGATGTAGTGGTTCAACTGCGTATAGGAAAAAATTTAATGACGAAACAATGCATTGAAAAGGGGTTTGGAG
AAAAACTTGTTTCACACTTTAAATTTTTAGCATCCATGGATATAGGGGAGAGGCGGAAGCCACAAAATGGTTCACTGTAT
TTACAAATGGATGGACAGGAAGTGTATTTACGTCTTTCCACGCTTCCAACTGTATATCAAGAAAGTCTCGTTATTCGTCT
TCATTTACAAGCATCTATTCAGCCGTTATCTCATCTTTCGTTATTTCCAAGTACAGCGAAGAAACTACTTTCTTTTTTAC
ATTATTCCCATGGATTACTCGTATTTACTGGACCGACTGGTTCGGGGAAGACAACAACAATGTATGCATTATTAGAGGTA
ATTAGAAAAAAGAAAACACGCCGCATCGTTACATTGGAGGATCCAGTTGAAAAAAGAAGTGACGATTTATTACAAATTCA
AATAAATGAAAAAGCAGGTATCACATATGAGGCTGGACTAAAGGCTATTTTGCGTCATGATCCAGATGTTATTTTAGTCG
GTGAAATTCGTGATGAAGAAACAGCGAAAATTGCTATAAGAGCAAGTTTGACTGGACATTTAGTAATGACGACACTGCAT
ACGAATGATGCGAGAGGGGCGATACTCAGGTTCATGGATTTTGGCATAACAAGGCAAGAAATCGAACAATCTTTATTAGC
TATAGCTGCACAGCGACTTGTCGAATTAAAGTGTCCGTTTTGCAAAAGAAAGTGCTCAACTTTATGCAAATCAATGAGGC
AAGTAAGGCAAGCGAGTATTTATGAGTTGTTATATGGATATGAGTTAAAACAAGCGATTAAAGAAGCAAACGGGGAATGT
GTCACATACAAGCACGAAACATTACAATCTTCGATACGAAAAGGATACGCTTTAGGGTTTTTAGAAGAAGATGTTTATGT
TTAA
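
Although comGA lies on the minus strand, the nucleotide record above is written in coding orientation: it begins with ATG and ends with the stop codon TAA. It can therefore be translated directly, without reverse-complementing. The sketch below is a minimal translator using the standard codon assignments (which bacterial translation table 11 shares, differing only in permitted start codons); the function name is hypothetical and the two test fragments are the first 18 and last 15 bases of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAATGGAATTGAAAGC"))   # first six codons of the record
print(translate("GATGTTTATGTTTAA"))      # last four codons plus the stop
```

Both fragments agree with the start (MNGIES) and end (DVYV) of the protein sequence listed in this record.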


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A4S4HVD2

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168 57.349 100 0.573
  pilB Vibrio campbellii strain DS40M4 35.977 100 0.366


Multiple sequence alignment