Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGA
Type: Machinery gene
Locus tag: LG321_RS14455
Genome accession: NZ_CP085250
Coordinates: 2800514..2801584 (-)
Length: 356 a.a.
NCBI ID: WP_404458551.1
UniProt ID: -
Organism: Sutcliffiella horikoshii strain ABH-541
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 2795514..2806584
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LG321_RS14405 (LG321_14360) - 2795763..2796029 (-) 267 WP_204415634.1 phosphocarrier protein HPr -
  LG321_RS14410 (LG321_14365) - 2796440..2796628 (-) 189 WP_223490115.1 YqzE family protein -
  LG321_RS14415 (LG321_14370) - 2796660..2797175 (-) 516 WP_404458542.1 shikimate kinase -
  LG321_RS14420 (LG321_14375) - 2797288..2797518 (-) 231 WP_223490119.1 YuzF family protein -
  LG321_RS14425 (LG321_14380) comGG 2797581..2798009 (-) 429 WP_404458544.1 competence type IV pilus minor pilin ComGG -
  LG321_RS14430 (LG321_14385) comGF 2797963..2798409 (-) 447 WP_404458545.1 competence type IV pilus minor pilin ComGF -
  LG321_RS14435 (LG321_14390) - 2798393..2798728 (-) 336 WP_404458547.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  LG321_RS14440 (LG321_14395) comGD 2798712..2799170 (-) 459 WP_404458549.1 competence type IV pilus minor pilin ComGD -
  LG321_RS14445 (LG321_14400) comGC 2799157..2799492 (-) 336 WP_404346869.1 competence type IV pilus major pilin ComGC -
  LG321_RS14450 (LG321_14405) comGB 2799493..2800530 (-) 1038 WP_404458550.1 competence type IV pilus assembly protein ComGB -
  LG321_RS14455 (LG321_14410) comGA 2800514..2801584 (-) 1071 WP_404458551.1 competence type IV pilus ATPase ComGA Machinery gene
  LG321_RS14460 (LG321_14415) - 2801813..2802640 (-) 828 WP_404458553.1 serine hydrolase -
  LG321_RS14465 (LG321_14420) - 2802627..2803646 (-) 1020 WP_404458555.1 ABC transporter ATP-binding protein -
  LG321_RS14470 (LG321_14425) - 2803649..2804572 (-) 924 WP_404458557.1 NlpC/P60 family protein -
  LG321_RS14475 (LG321_14430) - 2804565..2805665 (-) 1101 WP_404458558.1 mandelate racemase/muconate lactonizing enzyme family protein -
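The coordinate fields in this record are related by simple arithmetic, which the following minimal Python sketch checks. It assumes inclusive, 1-based coordinates. The 5,000 bp flank is inferred from the numbers shown here (the Location window minus the gene coordinates), not stated by the source, and the names `span_length` and `FLANK` are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate span."""
    return end - start + 1


# Values copied from the record above.
gene_start, gene_end = 2800514, 2801584

gene_bp = span_length(gene_start, gene_end)   # 1071 bp, matches "Size (bp)"
codons = gene_bp // 3                         # 357 codons, including the stop codon
protein_aa = codons - 1                       # 356 a.a., matches "Length"

# Assumption: the genomic context window is the gene plus a fixed flank
# on each side. The flank size is inferred, not documented in the record.
FLANK = 5000
window = (gene_start - FLANK, gene_end + FLANK)

print(gene_bp, protein_aa, window)
```

Under these assumptions the window evaluates to 2795514..2806584, which agrees with the Location line.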

Sequence


Protein


Length: 356 a.a.        Molecular weight: 40864.35 Da        Isoelectric Point: 9.0520

>NTDB_id=616412 LG321_RS14455 WP_404458551.1 2800514..2801584(-) (comGA) [Sutcliffiella horikoshii strain ABH-541]
MNIEKKCEEIIRQAIRLRVSDIHIKPHETSAKVLFRLDHYLYDQEDLPLEIYERILSHLKFQAEMDIGETRKPQNGALNL
FIDSKHINLRLSTLPTVNQESLVIRILPHDDNQFPLKRLSLFPNSTRKLFSLMKHSHGLVLFTGPTGSGKTTTLYSILEE
SKGMLQRNIITLEDPVERRSKNVLQVQVNEKAGITYATGLKAILRHDPDIIMVGEIRDEETAKIAIRASLTGHLVLSTLH
TRDAKGAVHRLLEFGVTQQELEQTLIAISAQRLVELKCPYCHGECTSFCRKYRHHRLASVYELLYGRELSKVMEECKGAK
VELRYPTLKEVIKKGIALGFIHQREYEKWVNDGKGQ

Nucleotide


Length: 1071 bp

>NTDB_id=616412 LG321_RS14455 WP_404458551.1 2800514..2801584(-) (comGA) [Sutcliffiella horikoshii strain ABH-541]
ATGAATATTGAAAAAAAGTGTGAAGAGATCATTCGCCAAGCTATCCGTTTGCGTGTATCAGATATTCACATCAAACCACA
TGAAACGTCCGCCAAAGTACTTTTCCGTTTGGACCACTACCTCTATGATCAAGAAGATCTCCCACTAGAAATCTATGAAC
GGATTTTATCTCACCTTAAATTCCAAGCCGAAATGGACATAGGTGAAACAAGAAAGCCCCAAAATGGCGCTTTAAACCTT
TTTATCGACTCCAAACATATCAATCTGCGACTCTCGACCTTGCCCACTGTTAACCAAGAAAGTCTGGTCATCAGAATACT
GCCGCATGATGACAACCAATTCCCTTTAAAACGGTTATCCCTGTTTCCGAACTCCACAAGAAAACTATTTTCATTAATGA
AGCACTCCCACGGCCTTGTTCTGTTCACTGGTCCGACCGGCTCTGGCAAAACCACCACTCTGTATTCGATTTTGGAAGAG
TCTAAGGGGATGCTGCAACGAAATATTATTACACTTGAGGATCCGGTGGAGCGGAGAAGCAAAAATGTGCTTCAGGTGCA
GGTGAATGAAAAGGCGGGGATCACCTATGCGACAGGTTTGAAGGCTATCCTCCGACATGATCCAGACATAATTATGGTCG
GGGAAATCAGGGATGAAGAAACGGCGAAGATCGCTATAAGGGCATCGTTAACGGGTCATTTAGTATTAAGCACGTTGCAT
ACACGAGATGCCAAAGGGGCGGTGCATCGACTGTTGGAGTTTGGGGTCACGCAACAGGAATTGGAACAAACATTAATCGC
TATTTCGGCACAAAGGCTTGTGGAATTGAAATGTCCATATTGCCACGGGGAATGTACATCTTTTTGCAGAAAATACAGGC
ATCATCGATTGGCCAGTGTATATGAACTTTTATATGGGCGGGAACTGTCCAAGGTGATGGAAGAATGTAAAGGGGCAAAG
GTGGAATTACGCTATCCCACACTAAAGGAAGTAATTAAAAAGGGGATAGCACTGGGCTTTATACATCAAAGGGAATATGA
AAAATGGGTGAACGATGGCAAGGGGCAATAG
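The nucleotide record above begins with ATG and ends with TAG, so it can be read directly as the coding strand. As a rough consistency check between the two FASTA records, the sketch below translates the first 30 and last 15 nucleotides with the standard codon assignments (the bacterial table 11 uses the same codon-to-amino-acid mapping and differs only in permitted start codons). The function name `translate` and the choice of test fragments are illustrative assumptions, not part of the source database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the comGA nucleotide record above.
head = "ATGAATATTGAAAAAAAGTGTGAAGAGATC"
tail = "GGCAAGGGGCAATAG"

print(translate(head))  # MNIEKKCEEI
print(translate(tail))  # GKGQ
```

Both fragments agree with the start (MNIEKKCEEI) and end (GKGQ) of the protein record.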


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168 56.857 98.315 0.559
  pilF Thermus thermophilus HB27 40.123 91.011 0.365
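The record does not define the Ha-value. For the two hits listed, the numbers are consistent with the product of the identity and coverage fractions, so the sketch below uses that formula as an assumption, not as the database's documented definition. The function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed Ha-value: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# (protein, identities %, coverage %) copied from the table above.
hits = [
    ("comGA", 56.857, 98.315),
    ("pilF", 40.123, 91.011),
]

for name, identity, coverage in hits:
    print(name, round(ha_value(identity, coverage), 3))
# comGA 0.559
# pilF 0.365
```

Under this assumption, the higher score for the Bacillus subtilis comGA hit reflects both its higher identity and its near-complete coverage.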