Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   N288_RS16715 Genome accession   NC_022524
Coordinates   3292736..3293803 (-) Length   355 a.a.
NCBI ID   WP_022544183.1    Uniprot ID   U5LDK5
Organism   Bacillus infantis NRRL B-14911     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3287736..3298803
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  N288_RS16665 (N288_18000) - 3288018..3288812 (+) 795 WP_009794922.1 YqhG family protein -
  N288_RS16670 (N288_18005) - 3288871..3289032 (-) 162 WP_156917017.1 hypothetical protein -
  N288_RS16675 (N288_18010) - 3289104..3289298 (-) 195 WP_009794920.1 YqzE family protein -
  N288_RS16680 (N288_18015) - 3289337..3289837 (-) 501 WP_009794919.1 shikimate kinase -
  N288_RS16685 (N288_18020) comGG 3289853..3290242 (-) 390 WP_009794918.1 competence type IV pilus minor pilin ComGG -
  N288_RS24360 (N288_18025) comGF 3290229..3290711 (-) 483 WP_022544181.1 competence type IV pilus minor pilin ComGF -
  N288_RS16695 (N288_18030) - 3290656..3290976 (-) 321 WP_009794916.1 hypothetical protein -
  N288_RS16700 (N288_18035) comGD 3290963..3291400 (-) 438 WP_009794915.1 competence type IV pilus minor pilin ComGD -
  N288_RS16705 (N288_18040) comGC 3291393..3291728 (-) 336 WP_009794914.1 competence type IV pilus major pilin ComGC -
  N288_RS16710 (N288_18045) comGB 3291718..3292749 (-) 1032 WP_009794913.1 competence type IV pilus assembly protein ComGB -
  N288_RS16715 (N288_18050) comGA 3292736..3293803 (-) 1068 WP_022544183.1 competence type IV pilus ATPase ComGA Machinery gene
  N288_RS16725 (N288_18060) - 3294055..3294435 (-) 381 WP_009794911.1 Spx/MgsR family RNA polymerase-binding regulatory protein -
  N288_RS16730 (N288_18065) - 3294636..3295337 (+) 702 WP_009794910.1 helix-turn-helix transcriptional regulator -
  N288_RS16735 (N288_18070) - 3295422..3295667 (+) 246 WP_009794909.1 DUF2626 domain-containing protein -
  N288_RS16740 (N288_18075) - 3295788..3296852 (-) 1065 WP_009794908.1 class I SAM-dependent methyltransferase -
  N288_RS16745 (N288_18080) - 3296881..3297513 (-) 633 WP_009794907.1 MBL fold metallo-hydrolase -
  N288_RS16750 (N288_18085) - 3297624..3298112 (+) 489 WP_009794906.1 hypothetical protein -
  N288_RS24810 (N288_18090) - 3298196..3298369 (+) 174 WP_009794905.1 DUF2759 domain-containing protein -
  N288_RS16760 (N288_18095) - 3298457..3298777 (-) 321 WP_009794904.1 MTH1187 family thiamine-binding protein -

Sequence


Protein


Download         Length: 355 a.a.        Molecular weight: 39215.23 Da        Isoelectric Point: 7.7373

>NTDB_id=62511 N288_RS16715 WP_022544183.1 3292736..3293803(-) (comGA) [Bacillus infantis NRRL B-14911]
MSKIERLADSIVCDAVRRNASDIHILPRRHDTLIQLRIADRLLSRQVLPTDDCERLIAHFKFSASMDIGDRRRPQSGAYS
LEIDQKLIGLRLSTLPSSHHESLVIRILPQEDNIHISKISLFPAMSNKLLALLKHAHGLVLFTGPTGSGKTTTLYSLLHE
TSSLFKRNVITLEDPIEKDSDLVLQVQVNEKAGVSYASGLKAILRHDPDIIMVGEIRDAETAEIAVRAALTGHLVLSTMH
TRDAKGAIYRLCEFGVGWQEIEQTLIAVTAQRLVELSCPYCSGTCPPHCRGAAAGKRASVFELLTGKALSAVMKEARGEG
SAYQYPTLKNAILKGIALGYIKEAEMDRWVYNEDS

Nucleotide


Download         Length: 1068 bp        

>NTDB_id=62511 N288_RS16715 WP_022544183.1 3292736..3293803(-) (comGA) [Bacillus infantis NRRL B-14911]
TTGAGTAAAATTGAAAGATTGGCCGACAGTATCGTATGTGATGCGGTCAGGAGAAATGCCTCGGATATCCACATCCTCCC
GCGCAGGCATGATACCCTGATCCAGCTCAGGATCGCTGACAGGCTCTTATCGCGGCAGGTTCTGCCGACTGATGACTGTG
AGAGGCTGATTGCCCATTTCAAGTTCTCTGCTTCAATGGATATCGGCGACCGCAGGCGCCCGCAAAGCGGCGCGTATTCA
CTGGAGATTGATCAGAAGCTGATCGGCCTCCGATTGTCCACCCTTCCTTCTTCCCATCATGAAAGCCTGGTCATCCGCAT
CCTTCCCCAGGAAGACAATATCCACATTTCAAAGATCTCCCTGTTTCCGGCTATGTCGAATAAGCTCCTGGCACTGCTGA
AGCATGCCCATGGGCTGGTCCTGTTTACCGGGCCGACCGGCAGCGGAAAAACAACTACCCTTTATTCCCTCCTGCATGAA
ACTTCATCCTTATTCAAAAGAAATGTCATCACGCTTGAGGATCCGATCGAGAAGGACAGCGACCTCGTACTTCAGGTGCA
GGTAAATGAAAAGGCGGGCGTGTCATATGCTTCAGGGCTGAAGGCCATCCTGAGGCATGATCCGGACATCATTATGGTCG
GTGAGATACGCGATGCAGAGACAGCTGAAATTGCCGTACGGGCAGCGCTGACAGGCCATCTTGTGCTGAGTACGATGCAC
ACCAGGGATGCCAAGGGTGCTATATACCGGCTTTGTGAGTTCGGGGTAGGATGGCAGGAAATTGAGCAGACCCTGATAGC
TGTAACAGCGCAGCGTCTCGTGGAGCTGTCATGCCCGTATTGCAGCGGTACCTGTCCGCCTCACTGCCGGGGTGCCGCTG
CGGGAAAAAGGGCGAGTGTGTTTGAATTATTGACAGGCAAAGCCCTTTCTGCGGTCATGAAGGAGGCCAGAGGGGAAGGG
AGCGCCTATCAGTATCCTACTCTGAAGAACGCCATTCTCAAAGGAATTGCGCTGGGGTATATTAAGGAGGCTGAAATGGA
TCGGTGGGTGTATAATGAGGACTCGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB U5LDK5

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

56.497

99.718

0.563

  pilB Glaesserella parasuis strain SC1401

37.176

97.746

0.363


Multiple sequence alignment