Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   DN409_RS21635 Genome accession   NZ_CP031071
Coordinates   4201852..4202895 (-) Length   347 a.a.
NCBI ID   WP_002138337.1    Uniprot ID   -
Organism   Bacillus mycoides strain BPN401     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 4196852..4207895
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DN409_RS21590 (DN409_21595) - 4197251..4197451 (-) 201 WP_002015009.1 YqzE family protein -
  DN409_RS21595 (DN409_21600) - 4197490..4197987 (-) 498 WP_061689624.1 shikimate kinase -
  DN409_RS21600 (DN409_21605) - 4198139..4198789 (-) 651 WP_002034016.1 2OG-Fe(II) oxygenase -
  DN409_RS21605 (DN409_21610) comGG 4198963..4199334 (-) 372 WP_033709423.1 competence type IV pilus minor pilin ComGG -
  DN409_RS21610 (DN409_21615) comGF 4199331..4199792 (-) 462 WP_061689623.1 competence type IV pilus minor pilin ComGF -
  DN409_RS21615 (DN409_21620) - 4199776..4200072 (-) 297 WP_016100615.1 hypothetical protein -
  DN409_RS21620 (DN409_21625) comGD 4200065..4200520 (-) 456 WP_002129089.1 competence type IV pilus minor pilin ComGD -
  DN409_RS21625 (DN409_21630) comGC 4200517..4200816 (-) 300 WP_002088623.1 competence type IV pilus major pilin ComGC -
  DN409_RS21630 (DN409_21635) comGB 4200828..4201865 (-) 1038 WP_016096112.1 competence type IV pilus assembly protein ComGB -
  DN409_RS21635 (DN409_21640) comGA 4201852..4202895 (-) 1044 WP_002138337.1 competence type IV pilus ATPase ComGA Machinery gene
  DN409_RS21640 (DN409_21645) - 4203101..4203796 (+) 696 WP_002014995.1 metalloregulator ArsR/SmtB family transcription factor -
  DN409_RS21645 (DN409_21650) - 4204076..4204318 (+) 243 WP_002015110.1 DUF2626 domain-containing protein -
  DN409_RS21650 (DN409_21655) - 4204426..4205820 (+) 1395 WP_002138338.1 L-cystine transporter -
  DN409_RS21655 (DN409_21660) - 4205901..4206293 (-) 393 WP_149190916.1 hypothetical protein -
  DN409_RS21660 (DN409_21665) - 4206381..4206596 (-) 216 WP_002015115.1 DUF3912 family protein -
  DN409_RS21665 (DN409_21670) - 4206849..4207160 (+) 312 WP_002165070.1 hypothetical protein -
  DN409_RS21670 (DN409_21675) - 4207197..4207685 (-) 489 WP_002015119.1 hypothetical protein -

Sequence


Protein


Download         Length: 347 a.a.        Molecular weight: 39093.53 Da        Isoelectric Point: 9.3369

>NTDB_id=303221 DN409_RS21635 WP_002138337.1 4201852..4202895(-) (comGA) [Bacillus mycoides strain BPN401]
MNSVELFANMIMKEACGVQASDLHIVPRQKDVAIQLRIGKDLITKRCIEKGFGEKLVSHFKFLASMDIGERRKPQNGSLY
LQIDGQEVYLRLSTLPTVYQESLVIRLHLQASVQPLSHLSLFPSSAEKLLSFLKHSHGLLVFTGPTGSGKTTTMYALLEV
ARKWQTRRIITLEDPVEQRKDGLLQIQINEKAGITYKTGLKAILRHDPDIILVGEIRDEETAKVAVRASLTGHLVMTTLH
TNDAKGAILRFMDYGITRQEIEQSLLAVAAQRLVELKCPFCRGKCATLCKSMRKVRQASIYELLYGYELKQAIKEASGEH
VTYHYKTLESSVRKGYALGFLEEDVYV

Nucleotide


Download         Length: 1044 bp        

>NTDB_id=303221 DN409_RS21635 WP_002138337.1 4201852..4202895(-) (comGA) [Bacillus mycoides strain BPN401]
ATGAATAGTGTCGAGCTTTTTGCAAATATGATTATGAAAGAAGCTTGTGGGGTGCAAGCATCGGACTTACATATTGTGCC
CAGGCAGAAGGATGTGGCGATTCAATTACGTATTGGAAAAGATTTAATTACGAAACGGTGTATTGAAAAGGGATTTGGAG
AAAAGCTTGTTTCGCACTTTAAATTTTTAGCATCAATGGATATAGGAGAGAGGAGAAAACCTCAAAATGGTTCGTTGTAC
TTACAAATTGATGGACAAGAAGTGTATTTACGCCTTTCAACACTTCCAACAGTATATCAAGAAAGTCTCGTTATTCGCCT
CCATTTACAAGCATCTGTACAGCCGTTGTCTCACCTTTCGTTATTTCCAAGTTCAGCGGAAAAATTACTCTCTTTTTTAA
AGCATTCGCATGGGTTACTCGTATTTACTGGGCCGACTGGTTCTGGAAAAACAACAACAATGTATGCGTTATTAGAAGTA
GCTAGAAAATGGCAAACACGTCGCATCATTACACTAGAAGATCCAGTTGAGCAAAGAAAAGACGGTTTATTACAAATTCA
AATAAATGAAAAAGCTGGTATCACATATAAAACAGGATTAAAGGCTATTTTGCGTCATGATCCAGATATTATTTTAGTTG
GCGAAATTCGTGATGAAGAAACAGCAAAAGTAGCTGTAAGGGCCAGTTTGACGGGACATTTAGTAATGACAACCTTGCAT
ACAAATGATGCGAAAGGAGCGATACTACGATTTATGGATTATGGTATTACAAGGCAAGAAATTGAACAATCATTATTAGC
AGTAGCTGCTCAGCGGCTCGTCGAATTAAAATGTCCATTTTGCAGAGGGAAGTGTGCAACTTTATGTAAATCAATGAGGA
AAGTGAGACAGGCAAGCATTTATGAACTGTTATATGGATATGAATTAAAACAAGCGATTAAAGAAGCAAGTGGAGAACAT
GTTACGTATCACTATAAAACGTTGGAATCGTCGGTTCGAAAAGGGTATGCTTTAGGCTTTTTAGAAGAAGATGTATATGT
TTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

57.061

100

0.571

  pilB Glaesserella parasuis strain SC1401

42.955

83.862

0.36


Multiple sequence alignment