Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   DQN84_RS08380 Genome accession   NZ_LS483476
Coordinates   1741196..1742254 (+) Length   352 a.a.
NCBI ID   WP_066137453.1    Uniprot ID   -
Organism   Lederbergia lenta strain NCTC4824     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1736196..1747254
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQN84_RS08350 (NCTC4824_01719) - 1736271..1737404 (+) 1134 WP_066137448.1 hypothetical protein -
  DQN84_RS08355 (NCTC4824_01721) - 1737455..1737631 (-) 177 WP_082788578.1 DUF2759 domain-containing protein -
  DQN84_RS08360 (NCTC4824_01722) - 1737861..1738487 (+) 627 WP_066137449.1 MBL fold metallo-hydrolase -
  DQN84_RS08365 (NCTC4824_01723) - 1738664..1739482 (-) 819 WP_066137450.1 ABC transporter permease -
  DQN84_RS08370 (NCTC4824_01724) - 1739479..1740423 (-) 945 WP_066137451.1 ABC transporter ATP-binding protein -
  DQN84_RS08375 (NCTC4824_01725) - 1740689..1740931 (-) 243 WP_066137452.1 DUF2626 domain-containing protein -
  DQN84_RS08380 (NCTC4824_01726) comGA 1741196..1742254 (+) 1059 WP_066137453.1 competence type IV pilus ATPase ComGA Machinery gene
  DQN84_RS08385 (NCTC4824_01727) comGB 1742256..1743269 (+) 1014 WP_145981924.1 competence type IV pilus assembly protein ComGB -
  DQN84_RS08390 (NCTC4824_01728) comGC 1743285..1743599 (+) 315 WP_066137455.1 competence type IV pilus major pilin ComGC -
  DQN84_RS08395 (NCTC4824_01729) comGD 1743610..1744065 (+) 456 WP_066137456.1 competence type IV pilus minor pilin ComGD -
  DQN84_RS08400 (NCTC4824_01730) - 1744052..1744363 (+) 312 WP_066137457.1 type II secretion system protein -
  DQN84_RS08405 (NCTC4824_01731) comGF 1744308..1744790 (+) 483 WP_145981925.1 competence type IV pilus minor pilin ComGF -
  DQN84_RS08410 (NCTC4824_01732) - 1745035..1745538 (+) 504 WP_066137459.1 shikimate kinase -
  DQN84_RS08415 (NCTC4824_01733) - 1745579..1745755 (+) 177 WP_082788580.1 YqzE family protein -
  DQN84_RS08420 (NCTC4824_01734) - 1745792..1746586 (-) 795 WP_066137460.1 YqhG family protein -

Sequence


Protein


Download         Length: 352 a.a.        Molecular weight: 39263.94 Da        Isoelectric Point: 8.8213

>NTDB_id=1142437 DQN84_RS08380 WP_066137453.1 1741196..1742254(+) (comGA) [Lederbergia lenta strain NCTC4824]
MSIEKIADLLLKQAIQLTATDIHITPRKHNYHIQFRLHGLLTTIQSIPFQAGERLISHLKFMSSIDISEKRKPQSGSFEL
NLNNQLIALRISTLPTTLAKESLVIRILPQDHSFILEKMSLFPSTSPILKSFMLHAHGMLIFTGPTGSGKSTTMYAVAEH
CAGTLNRRVVTLEDPVEKQSDMLLQVQLNDKAGITYSTGLKAILRHDPDVIIIGEIRDAETAHIAIRAALTGHLVLTSLH
TRDAKGAIYRLLEFGVKLQDIEQTLIGIMAQRLIALLCPLCGESCSKFCTRRVAKRTGVYEILYGQALVGALEESKGGKE
VYHYPLIKDLIRKGIALGYVPVEEYKHWVLEE

Nucleotide


Download         Length: 1059 bp        

>NTDB_id=1142437 DQN84_RS08380 WP_066137453.1 1741196..1742254(+) (comGA) [Lederbergia lenta strain NCTC4824]
ATGTCTATTGAAAAAATAGCAGATCTCCTTCTTAAACAAGCAATACAACTCACGGCAACAGATATCCATATTACTCCTCG
GAAACATAACTATCACATTCAGTTTAGACTCCATGGTTTACTAACGACAATTCAATCCATTCCTTTCCAAGCAGGGGAGA
GACTAATTTCACATTTGAAGTTTATGTCATCAATCGACATAAGTGAAAAAAGAAAGCCTCAAAGTGGATCCTTCGAACTA
AATCTCAATAATCAGTTAATCGCACTAAGAATTTCTACACTTCCAACTACATTGGCAAAAGAAAGTCTTGTCATTCGTAT
CTTGCCCCAAGATCACTCCTTTATCCTAGAAAAGATGTCTCTATTTCCGTCTACTTCTCCTATACTTAAATCCTTTATGT
TACACGCTCATGGAATGCTAATTTTCACAGGTCCAACCGGCAGCGGTAAATCGACGACAATGTACGCAGTTGCAGAGCAT
TGCGCAGGTACGTTAAATCGCCGGGTAGTAACGTTAGAAGACCCTGTAGAGAAACAAAGTGATATGTTATTACAGGTACA
GTTAAATGATAAAGCTGGTATTACCTACAGTACTGGGTTAAAAGCAATCTTACGGCATGACCCAGATGTTATTATCATCG
GAGAGATTCGTGATGCGGAAACTGCTCATATTGCTATAAGAGCTGCATTAACCGGACATCTCGTATTGACAAGCTTACAC
ACCCGTGATGCCAAAGGAGCAATCTATCGGTTACTTGAATTTGGTGTAAAACTGCAAGATATTGAACAAACCTTAATTGG
AATTATGGCACAGCGTTTAATAGCGCTTTTATGCCCTTTATGTGGGGAAAGCTGTTCGAAATTTTGTACGCGTAGGGTAG
CGAAAAGAACTGGTGTGTATGAAATTCTTTATGGGCAAGCATTAGTAGGTGCACTAGAAGAATCGAAGGGAGGAAAAGAG
GTATATCATTATCCATTGATCAAAGATTTGATTAGAAAGGGGATTGCGCTTGGCTACGTTCCTGTCGAAGAATATAAACA
CTGGGTACTTGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

55.241

100

0.554


Multiple sequence alignment