Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGA
Type: Machinery gene
Locus tag: CQJ30_RS12195
Genome accession: NZ_CP023704
Coordinates: 2472724..2473809 (+)
Length: 361 a.a.
NCBI ID: WP_034771006.1
UniProt ID: A0A090IWD5
Organism: Caldibacillus thermoamylovorans strain SSBM
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
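The coordinates and the protein length in this overview can be cross-checked against each other. The snippet below is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start, end = 2472724, 2473809

gene_length_bp = end - start + 1      # inclusive interval length
codon_count = gene_length_bp // 3     # this count includes the stop codon
protein_length_aa = codon_count - 1   # the stop codon is not translated

print(gene_length_bp, protein_length_aa)  # prints "1086 361"
```

Both values agree with the lengths reported in the Sequence section (1086 bp and 361 a.a.).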

Genomic Context


Location: 2467724..2478809
Locus tag                    Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                          Description
CQJ30_RS12170 (CQJ30_12135)  -          2468544..2470238 (+)  1695       WP_108898277.1  SNF2-related protein                             -
CQJ30_RS12175 (CQJ30_12140)  -          2470207..2471010 (+)  804        WP_108898278.1  YqhG family protein                              -
CQJ30_RS12180 (CQJ30_12145)  -          2471189..2471359 (-)  171        WP_034770998.1  DUF2759 domain-containing protein                -
CQJ30_RS12185 (CQJ30_12150)  -          2471430..2471672 (-)  243        WP_034771001.1  DUF2626 domain-containing protein                -
CQJ30_RS12190 (CQJ30_12155)  -          2471812..2472513 (-)  702        WP_034771004.1  helix-turn-helix domain-containing protein       -
CQJ30_RS12195 (CQJ30_12160)  comGA      2472724..2473809 (+)  1086       WP_034771006.1  competence type IV pilus ATPase ComGA            Machinery gene
CQJ30_RS12200 (CQJ30_12165)  comGB      2473784..2474815 (+)  1032       WP_161522412.1  competence type IV pilus assembly protein ComGB  -
CQJ30_RS12205 (CQJ30_12170)  comGC      2474842..2475141 (+)  300        WP_108898280.1  competence type IV pilus major pilin ComGC       Machinery gene
CQJ30_RS12210 (CQJ30_12175)  comGD      2475092..2475619 (+)  528        WP_108898281.1  competence type IV pilus minor pilin ComGD       -
CQJ30_RS12215 (CQJ30_12180)  -          2475594..2475950 (+)  357        WP_108898282.1  hypothetical protein                             -
CQJ30_RS12220 (CQJ30_12185)  comGF      2475922..2476386 (+)  465        WP_041846279.1  competence type IV pilus minor pilin ComGF       -
CQJ30_RS12225 (CQJ30_12190)  comGG      2476277..2476807 (+)  531        WP_081912341.1  competence type IV pilus minor pilin ComGG       -
CQJ30_RS12230 (CQJ30_12195)  -          2476909..2477094 (+)  186        WP_161522413.1  YqzE family protein                              -
CQJ30_RS12235 (CQJ30_12200)  -          2477125..2478249 (-)  1125       WP_108898284.1  hypothetical protein                             -
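The Location range appears to be the comGA coordinates extended by 5000 bp on each side, and several adjacent com genes share a few bases at their boundaries. The sketch below checks both observations from the listed coordinates. The 5000 bp flank is an assumption inferred from the numbers, not something the record states, and the helper name overlap_bp is hypothetical.

```python
# comGA coordinates from the record (assumed 1-based, inclusive).
gene_start, gene_end = 2472724, 2473809

# Assumption: the context window is the gene plus a fixed flank on each side.
FLANK_BP = 5000
window = (gene_start - FLANK_BP, gene_end + FLANK_BP)
print(window)  # (2467724, 2478809)

# (name, start, end) for the first four com genes, all on the (+) strand.
com_genes = [
    ("comGA", 2472724, 2473809),
    ("comGB", 2473784, 2474815),
    ("comGC", 2474842, 2475141),
    ("comGD", 2475092, 2475619),
]

def overlap_bp(upstream, downstream):
    """Bases shared by two consecutive genes; 0 if they do not overlap."""
    return max(0, upstream[2] - downstream[1] + 1)

overlaps = {
    (up[0], down[0]): overlap_bp(up, down)
    for up, down in zip(com_genes, com_genes[1:])
}
for pair, shared in overlaps.items():
    print(pair, shared)
```

With the listed coordinates, comGA and comGB share 26 bp, comGB and comGC do not overlap, and comGC and comGD share 50 bp.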

Sequence


Protein


Length: 361 a.a.        Molecular weight: 40615.22 Da        Isoelectric point: 7.5768

>NTDB_id=249399 CQJ30_RS12195 WP_034771006.1 2472724..2473809(+) (comGA) [Caldibacillus thermoamylovorans strain SSBM]
MPNSIETRAREILEHALSLHATDIHIIPKSKHASVLFRLSHKLIPIMTIELEETERLISYMKFQAAMDIGEKRKPQNGSF
QIDIGGLSVGLRLSTLPSIFAESLVIRILPQESYIPFQQLSLFPKSLRLLLAMLKFSHGLIIFTGPTGSGKTTTLYSLLQ
HSTKSLGRNVITLEDPVEKNSEDLLQVQVNEKAGITYNTGLKAILRHDPDIIMVGEIRDSETAHIAIRAALTGHLVLTTM
HTKDSKGALYRLIEFGVNWHEIEQTLVAVTAQRLVELICPYCLEEECPVYCNQNKNKRTAVYEILYGRALKEALLEMKGE
AFSARYPTLGQLIAKGIALGFIKKSEYERWVHDIETKSLEN

Nucleotide


Length: 1086 bp

>NTDB_id=249399 CQJ30_RS12195 WP_034771006.1 2472724..2473809(+) (comGA) [Caldibacillus thermoamylovorans strain SSBM]
TTGCCGAACTCGATTGAAACACGGGCAAGGGAAATTTTGGAGCACGCCCTTTCTTTACATGCAACAGATATTCATATCAT
CCCCAAATCAAAACATGCTTCCGTTCTATTTCGATTGTCTCATAAACTCATCCCCATAATGACAATCGAACTTGAGGAAA
CGGAAAGATTAATTTCATATATGAAGTTTCAGGCAGCAATGGATATCGGCGAAAAAAGAAAACCCCAGAACGGTTCTTTT
CAAATAGATATCGGGGGGCTATCTGTCGGACTTCGATTATCCACTTTGCCTTCCATATTCGCTGAAAGTTTGGTCATCCG
AATATTACCACAAGAATCATACATCCCATTTCAACAATTATCCTTATTCCCAAAATCACTGAGACTACTGTTAGCCATGT
TAAAGTTTTCCCACGGTTTAATCATATTTACAGGACCAACCGGATCGGGCAAAACAACAACTTTATATTCTCTTCTCCAA
CATTCGACAAAATCTCTTGGCAGGAATGTCATCACCTTGGAAGATCCTGTTGAAAAAAATAGTGAAGATTTACTTCAAGT
TCAAGTAAATGAAAAAGCCGGCATTACGTATAATACCGGTTTAAAAGCGATTTTAAGACATGATCCCGATATTATTATGG
TCGGCGAAATTCGTGATAGTGAGACAGCACATATTGCCATTCGAGCAGCATTAACCGGTCATTTAGTATTGACAACGATG
CATACGAAAGATTCAAAAGGTGCATTGTACCGGTTAATCGAATTTGGTGTTAATTGGCATGAAATTGAACAAACTTTGGT
GGCAGTGACGGCCCAAAGATTAGTCGAATTAATTTGTCCTTACTGTTTAGAAGAAGAATGTCCCGTCTATTGCAACCAGA
ATAAAAATAAGCGTACTGCCGTCTATGAAATTTTGTACGGTCGTGCATTAAAGGAAGCACTGTTGGAAATGAAGGGTGAA
GCATTTTCGGCGCGCTATCCGACTTTGGGCCAATTGATTGCAAAAGGTATTGCTCTCGGATTTATCAAGAAAAGTGAGTA
TGAAAGATGGGTGCATGATATTGAGACAAAGTCGCTGGAAAATTGA
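The nucleotide sequence begins with TTG rather than ATG, while the protein sequence begins with M. The sketch below translates the first 30 and last 12 nucleotides to show how the two records line up. It assumes the standard bacterial genetic code (NCBI translation table 11), which the record does not state explicitly, and the function name translate_cds is hypothetical.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: bacterial initiation codons, which are read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}

def translate_cds(fragment, is_start=False):
    """Translate an in-frame DNA fragment; trailing partial codons are ignored."""
    codons = [fragment[i:i + 3] for i in range(0, len(fragment) - 2, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if is_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)

first_30_nt = "TTGCCGAACTCGATTGAAACACGGGCAAGG"  # start of the nucleotide record
last_12_nt = "CTGGAAAATTGA"                     # end of the nucleotide record

n_terminus = translate_cds(first_30_nt, is_start=True)
c_terminus = translate_cds(last_12_nt)
print(n_terminus)  # MPNSIETRAR
print(c_terminus)  # LEN*
```

The translated ends match the first ten residues (MPNSIETRAR) and the last three residues (LEN) of the protein record, followed by the TGA stop codon.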


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A090IWD5

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                                    Identities (%)  Coverage (%)  Ha-value
comGA    Bacillus subtilis subsp. subtilis str. 168  56.695          97.23         0.551
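The record does not define the Ha-value. The listed figure is numerically consistent with the product of the identity and coverage fractions, so the sketch below treats that relationship as an assumption rather than as the database's documented formula.

```python
# Values from the Similar proteins table.
identity_pct = 56.695
coverage_pct = 97.23

# Assumption: Ha-value = identity fraction * coverage fraction.
ha_estimate = (identity_pct / 100) * (coverage_pct / 100)

print(round(ha_estimate, 3))  # 0.551
```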


Multiple sequence alignment