Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGA
Type: Machinery gene
Locus tag: FNL97_RS05390
Genome accession: NZ_CP041632
Coordinates: 1054927..1055973 (+)
Length: 348 a.a.
NCBI ID: WP_143415172.1
UniProt ID: -
Organism: Geobacillus sp. E263
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
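
The coordinates and the protein length above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def parse_coordinates(text):
    """Parse a 'start..end' string into two integers (assumed 1-based, inclusive)."""
    start, end = text.split("..")
    return int(start), int(end)


def gene_length_bp(start, end):
    """Length of an inclusive coordinate range in base pairs."""
    return end - start + 1


def protein_length_aa(length_bp):
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


start, end = parse_coordinates("1054927..1055973")
length_bp = gene_length_bp(start, end)
print(length_bp, protein_length_aa(length_bp))  # 1047 348
```

Both values agree with the lengths reported in this record (1047 bp and 348 a.a.).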

Genomic Context


Location: 1049927..1060973
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FNL97_RS05365 - 1050961..1051137 (-) 177 WP_015864488.1 DUF2759 domain-containing protein -
  FNL97_RS05370 - 1051498..1052130 (+) 633 WP_062678741.1 MBL fold metallo-hydrolase -
  FNL97_RS05375 - 1052410..1053552 (+) 1143 WP_201030673.1 SAM-dependent methyltransferase -
  FNL97_RS05380 - 1053601..1053843 (-) 243 WP_008879854.1 DUF2626 family protein -
  FNL97_RS05385 - 1054028..1054726 (-) 699 WP_062678740.1 helix-turn-helix domain-containing protein -
  FNL97_RS05390 comGA 1054927..1055973 (+) 1047 WP_143415172.1 competence type IV pilus ATPase ComGA Machinery gene
  FNL97_RS05395 comGB 1056066..1057091 (+) 1026 WP_062678738.1 competence type IV pilus assembly protein ComGB -
  FNL97_RS05400 comGC 1057106..1057402 (+) 297 WP_015864482.1 competence type IV pilus major pilin ComGC Machinery gene
  FNL97_RS05405 comGD 1057386..1057829 (+) 444 WP_062678737.1 competence type IV pilus minor pilin ComGD -
  FNL97_RS05410 comGE 1057813..1058136 (+) 324 WP_062678736.1 competence type IV pilus minor pilin ComGE -
  FNL97_RS05415 comGF 1058133..1058555 (+) 423 WP_143415173.1 competence type IV pilus minor pilin ComGF -
  FNL97_RS05420 comGG 1058572..1058958 (+) 387 WP_015864478.1 competence type IV pilus minor pilin ComGG -
  FNL97_RS05425 - 1059153..1059332 (+) 180 WP_015864477.1 YqzE family protein -
  FNL97_RS05430 - 1059393..1060184 (-) 792 WP_015864476.1 YqhG family protein -
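
The displayed location is consistent with the comGA coordinates extended by 5000 bp on each side; the flank size is inferred from the numbers in this record, not stated by the source. The sketch below reproduces that window and also computes the spacing between consecutive comG genes from the table above, where a negative value means the two genes overlap. The helper names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from the displayed location, not documented here


def context_window(start, end, flank=FLANK_BP):
    """Return the region covering a gene plus a fixed flank on each side."""
    return start - flank, end + flank


def intergenic_gaps(genes):
    """Distance in bp between consecutive genes; negative values are overlaps."""
    return [
        next_start - prev_end - 1
        for (_, _, prev_end), (_, next_start, _) in zip(genes, genes[1:])
    ]


# (gene name, start, end) copied from the genomic context table
comg_genes = [
    ("comGA", 1054927, 1055973),
    ("comGB", 1056066, 1057091),
    ("comGC", 1057106, 1057402),
    ("comGD", 1057386, 1057829),
    ("comGE", 1057813, 1058136),
    ("comGF", 1058133, 1058555),
    ("comGG", 1058572, 1058958),
]

print(context_window(1054927, 1055973))  # (1049927, 1060973)
print(intergenic_gaps(comg_genes))       # [92, 14, -17, -17, -4, 16]
```

The short gaps and overlaps between comGA and comGG reflect how tightly these seven same-strand genes are packed in the listed coordinates.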

Sequence


Protein


Length: 348 a.a.        Molecular weight: 39564.09 Da        Isoelectric point: 9.1071

>NTDB_id=372621 FNL97_RS05390 WP_143415172.1 1054927..1055973(+) (comGA) [Geobacillus sp. E263]
MNEIEYVADRLIKEASLLHVSDIHIVPRKDDAIVRFRLDGLLMEKEALTKEMCERLITHFKFLAGMDIGERRRPQSGAME
ARHQEEIIHLRLSTLPTSYDESLVIRLLPQNFFIPRSQLSLFANATKTLLSLFRQPQGLIIFTGPTGSGKTSTLYTLLRI
CQYEWHRNVITLEDPVEKRIDNILQVQINEKAGITYTTGLKAVLRHDPDVIMIGEIRDAETAKIAVRSAMTGHLIATTMH
TKNAVGAIYRLREFGIPLGDIEQTLLAVVAQRLVDLVCPFCGEHCSIFCRKHRSIRRAAVHELLYGNALSNAIQSVQTKE
KTHHYYTLQHVIRKGVALGFLPAHLLYR

Nucleotide


Length: 1047 bp

>NTDB_id=372621 FNL97_RS05390 WP_143415172.1 1054927..1055973(+) (comGA) [Geobacillus sp. E263]
ATGAATGAAATTGAATATGTTGCCGATCGTCTCATAAAAGAAGCGAGTTTGCTTCATGTATCTGACATTCATATCGTTCC
GCGCAAAGACGATGCGATTGTGCGTTTCCGGTTAGATGGATTGCTGATGGAAAAGGAAGCGCTGACAAAAGAAATGTGCG
AGCGGCTTATTACGCATTTTAAATTTTTAGCAGGGATGGACATTGGCGAACGCCGCCGTCCGCAAAGCGGAGCGATGGAA
GCAAGGCATCAGGAAGAAATCATTCACTTACGCCTCTCCACATTACCGACATCGTATGATGAAAGCCTCGTTATCCGGCT
TCTTCCGCAGAATTTTTTTATTCCTCGATCACAACTTTCTCTATTTGCAAATGCCACGAAAACGTTACTTTCCCTTTTTC
GGCAGCCCCAAGGATTAATTATTTTTACAGGACCAACTGGATCAGGCAAAACGTCAACATTATATACGTTATTGCGCATT
TGTCAATATGAGTGGCATCGCAATGTCATCACATTGGAAGACCCTGTTGAAAAGCGAATCGACAACATATTGCAAGTGCA
AATTAATGAGAAAGCGGGAATTACGTATACAACCGGTTTAAAAGCTGTTTTGCGCCATGATCCGGATGTGATTATGATCG
GCGAAATTCGCGACGCCGAGACCGCGAAAATTGCGGTACGCTCAGCAATGACGGGACATTTGATTGCTACGACCATGCAT
ACAAAAAACGCTGTTGGTGCGATTTACCGTTTGCGTGAATTCGGGATTCCGCTTGGAGATATTGAGCAAACATTGCTCGC
CGTTGTCGCACAGCGGCTCGTGGATTTAGTATGCCCGTTTTGCGGTGAACATTGCTCCATATTTTGCCGTAAACATCGCT
CCATTCGCCGCGCTGCTGTCCATGAACTGCTGTATGGGAATGCTTTGTCGAATGCCATTCAATCCGTACAAACAAAGGAA
AAGACGCATCACTACTATACGTTGCAACACGTTATTCGAAAAGGAGTTGCTCTTGGATTTTTGCCAGCACACCTTCTTTA
CAGGTAG
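
As a quick consistency check between the two sequences, the start and end of the nucleotide record can be translated and compared with the protein record. This is a minimal sketch assuming the standard codon assignments; the table is built in plain Python in the conventional TCAG order, and only the first 30 and last 15 nucleotides are used to keep it short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_30_nt = "ATGAATGAAATTGAATATGTTGCCGATCGT"  # start of the nucleotide record
last_15_nt = "CTTCTTTACAGGTAG"                  # end of the nucleotide record

print(translate(first_30_nt))  # MNEIEYVADR
print(translate(last_15_nt))   # LLYR*
```

The output matches the first ten and last four residues of the protein sequence, followed by the stop codon.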


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted with TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comGA   Bacillus subtilis subsp. subtilis str. 168   57.726   98.563   0.569
  pilB   Glaesserella parasuis strain SC1401   41.83   87.931   0.368
  pilF   Thermus thermophilus HB27   42.568   85.057   0.362
  pilB   Haemophilus influenzae 86-028NP   41.86   86.494   0.362
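
The Ha-values listed above are numerically consistent with the product of identity and coverage, each expressed as a fraction. This relationship is an observation from the four rows in this record, not a definition given by the source, so the sketch below should be read as a plausibility check only.

```python
def ha_value(identity_pct, coverage_pct):
    """Product of identity and coverage as fractions, rounded to 3 decimals.

    Assumption: this formula is inferred from the listed values, not documented here.
    """
    return round(identity_pct / 100 * coverage_pct / 100, 3)


# (protein, identity %, coverage %, listed Ha-value) copied from the table
hits = [
    ("comGA", 57.726, 98.563, 0.569),
    ("pilB", 41.83, 87.931, 0.368),
    ("pilF", 42.568, 85.057, 0.362),
    ("pilB", 41.86, 86.494, 0.362),
]

for name, identity, coverage, listed in hits:
    print(name, ha_value(identity, coverage), listed)
```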


Multiple sequence alignment