Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   EUC40_RS05850 Genome accession   NZ_CP040336
Coordinates   1136032..1137075 (+) Length   347 a.a.
NCBI ID   WP_071713011.1    Uniprot ID   -
Organism   Bacillus luti strain FJ     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1131032..1142075
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EUC40_RS05815 - 1131448..1131936 (+) 489 WP_156574052.1 hypothetical protein -
  EUC40_RS05820 - 1131972..1132283 (-) 312 WP_063222956.1 hypothetical protein -
  EUC40_RS05825 - 1132487..1132702 (+) 216 WP_001008335.1 DUF3912 family protein -
  EUC40_RS05830 - 1132790..1133182 (+) 393 WP_156574054.1 hypothetical protein -
  EUC40_RS05835 - 1133260..1134654 (-) 1395 WP_156574056.1 L-cystine transporter -
  EUC40_RS05840 - 1134762..1135004 (-) 243 WP_000440709.1 DUF2626 domain-containing protein -
  EUC40_RS05845 - 1135131..1135826 (-) 696 WP_071713010.1 helix-turn-helix domain-containing protein -
  EUC40_RS05850 comGA 1136032..1137075 (+) 1044 WP_071713011.1 competence protein ComGA Machinery gene
  EUC40_RS05855 comGB 1137068..1138099 (+) 1032 WP_156574058.1 competence type IV pilus assembly protein ComGB -
  EUC40_RS05860 comGC 1138111..1138410 (+) 300 WP_001178698.1 competence type IV pilus major pilin ComGC -
  EUC40_RS05865 comGD 1138407..1138862 (+) 456 WP_071713013.1 competence type IV pilus minor pilin ComGD -
  EUC40_RS05870 - 1138855..1139157 (+) 303 WP_000829520.1 hypothetical protein -
  EUC40_RS05875 comGF 1139127..1139594 (+) 468 WP_071713071.1 competence type IV pilus minor pilin ComGF -
  EUC40_RS05880 comGG 1139591..1139962 (+) 372 WP_156574060.1 competence type IV pilus minor pilin ComGG -
  EUC40_RS05885 - 1140139..1140789 (+) 651 WP_156574062.1 2OG-Fe(II) oxygenase -
  EUC40_RS05890 - 1140910..1141407 (+) 498 WP_156574065.1 shikimate kinase -
  EUC40_RS05895 - 1141446..1141646 (+) 201 WP_000106081.1 YqzE family protein -

Sequence


Protein


Download         Length: 347 a.a.        Molecular weight: 39178.53 Da        Isoelectric Point: 9.0215

>NTDB_id=363343 EUC40_RS05850 WP_071713011.1 1136032..1137075(+) (comGA) [Bacillus luti strain FJ]
MNGIESFANTILKEACRVQASDLHIVPRQKDVVVQLRIGKDLMTKQCIEKEFGEKLVSHFKFLASMDIGERRKPQNGSLY
LQIDGQEAYLRLSTLPTVYQESLVIRLHLQASVQPLSHLSLFPSTVKKLLSFLHYSHGLLVFTGPTGSGKTTTMYALLEV
IRKKKTRRIVTLEDPVEKRNDDLLQIQINEKAGITYEAGLKAILRHDPDVILVGEIRDEETAKIAIRASLTGHLVMTTLH
TNDTKGAILRFMDYGITRQEIEQSLLAIAAQRLVELKCPFCKGKCSTVCKSMRQVRQASIYELLYGHELKQAIKGANGEY
VTYKHETIESSIRKGYALGFLEEDVYV

Nucleotide


Download         Length: 1044 bp        

>NTDB_id=363343 EUC40_RS05850 WP_071713011.1 1136032..1137075(+) (comGA) [Bacillus luti strain FJ]
ATGAATGGAATTGAAAGTTTTGCGAATACGATTTTGAAAGAAGCATGTAGGGTACAAGCATCGGATTTACATATTGTACC
CCGGCAGAAGGATGTAGTGGTTCAACTGCGTATAGGGAAAGATTTAATGACGAAACAATGCATTGAAAAGGAATTTGGAG
AAAAACTTGTTTCACACTTTAAATTTTTAGCATCTATGGATATAGGGGAGAGGCGGAAGCCACAAAATGGTTCATTGTAT
CTACAAATTGATGGGCAAGAAGCATATTTACGCCTTTCAACACTCCCAACAGTATATCAAGAAAGTCTCGTTATTCGTCT
TCATTTACAAGCATCTGTTCAGCCATTATCTCATCTTTCGTTATTTCCAAGTACAGTTAAAAAACTGCTCTCTTTCTTAC
ATTATTCTCATGGATTACTCGTATTTACTGGGCCGACTGGTTCTGGGAAAACAACAACAATGTATGCATTATTAGAGGTA
ATTAGAAAAAAGAAAACACGCCGTATCGTTACACTGGAGGATCCAGTTGAAAAGAGAAATGATGATTTATTACAAATTCA
AATTAATGAAAAAGCAGGGATCACATATGAAGCTGGTTTAAAGGCAATTTTACGTCATGATCCAGATGTTATCTTAGTGG
GAGAAATCCGTGATGAAGAAACAGCGAAAATAGCTATAAGGGCAAGTCTCACGGGGCATTTAGTAATGACGACATTGCAT
ACCAATGATACGAAAGGAGCGATACTGAGGTTTATGGATTATGGCATTACGAGGCAAGAGATTGAACAATCTTTATTGGC
TATAGCTGCACAGCGACTTGTTGAATTGAAATGTCCATTTTGCAAAGGGAAGTGCTCAACTGTATGTAAATCAATGAGGC
AAGTAAGACAAGCGAGCATTTATGAGTTGTTATATGGACATGAGTTAAAACAAGCGATTAAAGGAGCGAACGGAGAATAT
GTTACGTATAAGCATGAAACAATAGAATCTTCAATACGAAAAGGATACGCTTTAGGGTTTTTAGAAGAAGATGTTTATGT
TTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

56.484

100

0.565


Multiple sequence alignment