Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   K7887_RS14405 Genome accession   NZ_CP082918
Coordinates   2786143..2787213 (-) Length   356 a.a.
NCBI ID   WP_223490131.1    Uniprot ID   -
Organism   Sutcliffiella horikoshii strain BAT     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2781143..2792213
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  K7887_RS14355 (K7887_14330) - 2781392..2781658 (-) 267 WP_204415634.1 phosphocarrier protein HPr -
  K7887_RS14360 (K7887_14335) - 2782069..2782257 (-) 189 WP_223490115.1 YqzE family protein -
  K7887_RS14365 (K7887_14340) - 2782289..2782804 (-) 516 WP_223490117.1 shikimate kinase -
  K7887_RS14370 (K7887_14345) - 2782917..2783147 (-) 231 WP_223490119.1 YuzF family protein -
  K7887_RS14375 (K7887_14350) comGG 2783210..2783638 (-) 429 WP_223490121.1 competence type IV pilus minor pilin ComGG -
  K7887_RS14380 (K7887_14355) comGF 2783592..2784038 (-) 447 WP_223490123.1 competence type IV pilus minor pilin ComGF -
  K7887_RS14385 (K7887_14360) - 2784022..2784357 (-) 336 WP_223490125.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  K7887_RS14390 (K7887_14365) comGD 2784341..2784799 (-) 459 WP_223490126.1 competence type IV pilus minor pilin ComGD -
  K7887_RS14395 (K7887_14370) comGC 2784786..2785121 (-) 336 WP_223490128.1 competence type IV pilus major pilin ComGC -
  K7887_RS14400 (K7887_14375) comGB 2785122..2786159 (-) 1038 WP_223490130.1 competence type IV pilus assembly protein ComGB -
  K7887_RS14405 (K7887_14380) comGA 2786143..2787213 (-) 1071 WP_223490131.1 competence type IV pilus ATPase ComGA Machinery gene
  K7887_RS14410 (K7887_14385) - 2787441..2788268 (-) 828 WP_223490133.1 serine hydrolase -
  K7887_RS14415 (K7887_14390) - 2788255..2789274 (-) 1020 WP_223490135.1 ABC transporter ATP-binding protein -
  K7887_RS14420 (K7887_14395) - 2789277..2790200 (-) 924 WP_223490137.1 C40 family peptidase -
  K7887_RS14425 (K7887_14400) - 2790193..2791293 (-) 1101 WP_223490139.1 dipeptide epimerase -

Sequence


Protein


Download         Length: 356 a.a.        Molecular weight: 40813.30 Da        Isoelectric Point: 9.0484

>NTDB_id=603504 K7887_RS14405 WP_223490131.1 2786143..2787213(-) (comGA) [Sutcliffiella horikoshii strain BAT]
MNIEKKCEEIIRQAIRLRVSDIHIKPHETSAKVLFRLDHYLYDQEDLPLEIYERILSHLKFQAEMDIGETRKPQNGALNL
FIDSKHINLRLSTLPTVNQESLVIRILPHDDNQFPLKRLSLFPNSTRKLFSLMKHSHGLVLFTGPTGSGKTTTLYSILEE
SKGMLQRNIITLEDPVERRSKNVLQVQVNEKAGITYATGLKAILRHDPDIIMVGEIRDEETAKIAIRASLTGHLVLSTLH
TRDAKGAVHRLLEFGVTQQELEQTLIAISAQRLVELKCPYCHGDCTSFCRKYRQHRLASVYELLYGRELSKVMEECKGAK
VELRYPTLKEVIKKGIALGFIHQKEYEKWVNDGKGQ

Nucleotide


Download         Length: 1071 bp        

>NTDB_id=603504 K7887_RS14405 WP_223490131.1 2786143..2787213(-) (comGA) [Sutcliffiella horikoshii strain BAT]
ATGAATATTGAAAAAAAGTGTGAAGAGATCATTCGCCAAGCTATCCGTTTGCGTGTATCAGATATTCACATCAAACCACA
TGAAACGTCCGCCAAAGTACTTTTCCGTTTGGACCACTACCTCTATGATCAAGAAGATCTCCCACTAGAAATCTATGAAC
GGATTTTATCTCACCTTAAATTCCAAGCCGAAATGGACATAGGTGAAACAAGAAAGCCCCAGAATGGCGCATTAAACCTT
TTTATCGACTCCAAACATATCAATCTGCGACTCTCGACCTTGCCCACTGTTAATCAAGAAAGTCTAGTCATCAGAATACT
GCCGCATGATGACAACCAATTCCCTTTAAAACGTTTATCCCTGTTTCCGAACTCCACAAGAAAACTATTTTCATTAATGA
AGCACTCCCACGGGCTTGTTCTGTTCACTGGTCCGACCGGCTCTGGCAAAACCACCACTCTGTATTCGATTTTGGAAGAG
TCTAAGGGGATGCTGCAACGAAATATTATTACACTTGAGGACCCGGTGGAGCGGAGAAGCAAAAATGTGCTTCAGGTGCA
GGTGAATGAAAAGGCGGGGATCACCTATGCGACAGGTTTGAAGGCTATCCTCCGACATGATCCAGACATAATTATGGTCG
GGGAAATCAGGGATGAAGAAACGGCGAAGATCGCTATAAGGGCATCGTTAACGGGTCATTTAGTATTAAGCACGTTGCAT
ACGCGCGATGCTAAAGGTGCTGTGCATCGGCTGTTGGAGTTTGGGGTCACGCAACAGGAATTGGAACAAACATTAATCGC
TATTTCGGCACAAAGGCTTGTGGAGTTGAAATGTCCATATTGCCATGGGGATTGCACATCTTTTTGCAGAAAGTACAGGC
AACATCGATTGGCCAGTGTATATGAACTTTTATATGGGCGGGAACTGTCCAAGGTGATGGAAGAGTGTAAAGGGGCAAAG
GTGGAATTACGCTATCCCACACTAAAGGAAGTGATTAAAAAGGGGATAGCACTGGGCTTTATTCATCAAAAGGAATATGA
AAAGTGGGTGAACGATGGCAAGGGGCAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

56.857

98.315

0.559