Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   LG279_RS14115 Genome accession   NZ_CP085249
Coordinates   2730196..2731266 (-) Length   356 a.a.
NCBI ID   WP_404428674.1    Uniprot ID   -
Organism   Sutcliffiella horikoshii strain ABH-543     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2725196..2736266
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LG279_RS14065 (LG279_14025) - 2725445..2725711 (-) 267 WP_010194663.1 phosphocarrier protein HPr -
  LG279_RS14070 (LG279_14030) - 2726122..2726310 (-) 189 WP_404428668.1 YqzE family protein -
  LG279_RS14075 (LG279_14035) - 2726342..2726857 (-) 516 WP_223490117.1 shikimate kinase -
  LG279_RS14080 (LG279_14040) - 2726970..2727200 (-) 231 WP_223490119.1 YuzF family protein -
  LG279_RS14085 (LG279_14045) comGG 2727263..2727691 (-) 429 WP_404346865.1 competence type IV pilus minor pilin ComGG -
  LG279_RS14090 (LG279_14050) comGF 2727645..2728091 (-) 447 WP_404428669.1 competence type IV pilus minor pilin ComGF -
  LG279_RS14095 (LG279_14055) - 2728075..2728410 (-) 336 WP_404428670.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  LG279_RS14100 (LG279_14060) comGD 2728394..2728852 (-) 459 WP_404428671.1 competence type IV pilus minor pilin ComGD -
  LG279_RS14105 (LG279_14065) comGC 2728839..2729174 (-) 336 WP_404428672.1 competence type IV pilus major pilin ComGC -
  LG279_RS14110 (LG279_14070) comGB 2729175..2730212 (-) 1038 WP_404428673.1 competence type IV pilus assembly protein ComGB -
  LG279_RS14115 (LG279_14075) comGA 2730196..2731266 (-) 1071 WP_404428674.1 competence type IV pilus ATPase ComGA Machinery gene
  LG279_RS14120 (LG279_14080) - 2731492..2732319 (-) 828 WP_404428675.1 serine hydrolase -
  LG279_RS14125 (LG279_14085) - 2732306..2733325 (-) 1020 WP_404428676.1 ABC transporter ATP-binding protein -
  LG279_RS14130 (LG279_14090) - 2733328..2734251 (-) 924 WP_404428677.1 NlpC/P60 family protein -
  LG279_RS14135 (LG279_14095) - 2734244..2735344 (-) 1101 WP_404428678.1 dipeptide epimerase -

Sequence


Protein


Download         Length: 356 a.a.        Molecular weight: 40812.32 Da        Isoelectric Point: 9.2194

>NTDB_id=616356 LG279_RS14115 WP_404428674.1 2730196..2731266(-) (comGA) [Sutcliffiella horikoshii strain ABH-543]
MNIEKKCEEIIRQAIRLRVSDIHIKPHETSAKVLFRLDHYLYDQEDLPLEIYERILSHLKFQAEMDIGETRKPQNGALNL
FINSKHINLRLSTLPTVNQESLVIRILPHDDNQFPLKRLSLFPNSTRKLFSLMKHSHGLVLFTGPTGSGKTTTLYSILEE
SKGMLQRNIITLEDPVERRSKNVLQVQVNEKAGITYATGLKAILRHDPDIIMVGEIRDEETAKIAIRASLTGHLVLSTLH
TRDAKGGVHRLLEFGVTQQELEQTLIAISAQRLVELKCPYCHGECTSFCRKYRQHRLASVYELLYGRELSKVMEECKGAK
VELRYPTLKEVIKKGIALGFIHQKEYEKWVNDGKGQ

Nucleotide


Download         Length: 1071 bp        

>NTDB_id=616356 LG279_RS14115 WP_404428674.1 2730196..2731266(-) (comGA) [Sutcliffiella horikoshii strain ABH-543]
ATGAATATTGAAAAAAAGTGTGAAGAGATCATTCGCCAAGCAATCCGTTTGCGTGTATCAGATATTCACATCAAACCACA
TGAAACGTCCGCCAAAGTACTTTTCCGTTTGGACCACTACCTCTATGATCAAGAAGATCTCCCACTAGAAATCTATGAAC
GGATTTTATCTCACCTTAAATTCCAGGCCGAAATGGACATAGGTGAAACAAGAAAGCCCCAAAATGGTGCATTAAATCTT
TTTATCAACTCCAAACATATCAATCTGCGACTCTCGACCTTGCCCACTGTTAACCAAGAAAGTCTGGTCATCAGAATACT
GCCTCATGATGACAACCAATTCCCTTTAAAACGTTTATCCCTGTTTCCGAACTCCACAAGAAAACTATTTTCATTGATGA
AGCACTCCCACGGGCTTGTTCTGTTCACTGGTCCGACTGGCTCTGGCAAAACCACCACTCTGTATTCGATTTTGGAAGAG
TCTAAGGGGATGTTGCAACGAAATATTATTACGCTTGAGGATCCGGTGGAGCGGAGAAGCAAAAATGTGCTTCAGGTGCA
GGTGAATGAAAAGGCAGGGATCACCTATGCGACAGGTTTGAAGGCTATCCTCCGGCATGATCCAGATATAATTATGGTCG
GGGAAATCAGGGATGAAGAAACGGCGAAGATCGCTATAAGGGCATCGTTAACGGGTCATTTAGTTTTAAGTACCTTGCAT
ACACGCGATGCTAAAGGGGGTGTGCATCGGCTGTTGGAGTTTGGGGTCACGCAACAGGAATTGGAACAAACATTAATCGC
TATTTCGGCACAAAGGCTTGTGGAGTTGAAATGTCCATATTGCCATGGGGAATGCACATCTTTCTGCAGAAAGTACAGGC
AACATCGATTGGCCAGTGTATATGAACTTTTATATGGGCGGGAACTGTCCAAGGTGATGGAAGAGTGTAAAGGGGCTAAG
GTGGAATTACGCTATCCCACACTTAAGGAAGTGATTAAAAAGGGGATAGCACTGGGCTTTATTCATCAAAAGGAATATGA
AAAATGGGTGAACGATGGCAAGGGGCAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

56.571

98.315

0.556