Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGA/cglA/cilD
Type: Machinery gene
Locus tag: FD735_RS00845
Genome accession: NZ_CP040231
Coordinates: 148890..149831 (+)
Length: 313 a.a.
NCBI ID: WP_061421835.1
UniProt ID: -
Organism: Streptococcus sp. 1643
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
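
The coordinates and the protein length in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for an inclusive, 1-based CDS span."""
    nt_len = end - start + 1           # inclusive coordinates
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    aa_len = nt_len // 3 - 1           # assumption: the last codon is a stop codon
    return nt_len, aa_len


print(cds_lengths(148890, 149831))     # (942, 313)
```

Under these assumptions the span 148890..149831 gives 942 bp and 313 a.a., which agrees with the lengths listed in this record.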

Genomic Context


Location: 143890..154831
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  FD735_RS00825 (FD735_00825)   tgt   143926..145068 (+)   1143   WP_139658259.1   tRNA guanosine(34) transglycosylase Tgt   -
  FD735_RS00830 (FD735_00830)   -   145202..147019 (+)   1818   WP_061421837.1   acyltransferase family protein   -
  FD735_RS00835 (FD735_00835)   nagA   147160..148311 (+)   1152   WP_001134437.1   N-acetylglucosamine-6-phosphate deacetylase   -
  FD735_RS00840 (FD735_00840)   -   148463..148828 (+)   366   WP_061421836.1   DUF1033 family protein   -
  FD735_RS00845 (FD735_00845)   comGA/cglA/cilD   148890..149831 (+)   942   WP_061421835.1   competence type IV pilus ATPase ComGA   Machinery gene
  FD735_RS00850 (FD735_00850)   comGB/cglB   149779..150795 (+)   1017   WP_081102569.1   competence type IV pilus assembly protein ComGB   Machinery gene
  FD735_RS00855 (FD735_00855)   comGC/cglC   150797..151120 (+)   324   WP_061421834.1   competence type IV pilus major pilin ComGC   Machinery gene
  FD735_RS00860 (FD735_00860)   comGD/cglD   151083..151517 (+)   435   WP_176553041.1   competence type IV pilus minor pilin ComGD   Machinery gene
  FD735_RS00865 (FD735_00865)   comGE/cglE   151480..151782 (+)   303   WP_139658261.1   competence type IV pilus minor pilin ComGE   Machinery gene
  FD735_RS00870 (FD735_00870)   comGF/cglF   151745..152206 (+)   462   WP_139658262.1   competence type IV pilus minor pilin ComGF   Machinery gene
  FD735_RS00875 (FD735_00875)   comGG/cglG   152184..152597 (+)   414   WP_139658263.1   competence type IV pilus minor pilin ComGG   Machinery gene
  FD735_RS00880 (FD735_00880)   -   152630..153220 (+)   591   WP_176553042.1   class I SAM-dependent methyltransferase   -
  FD735_RS00885 (FD735_00885)   comYH   153279..154232 (+)   954   WP_139658264.1   class I SAM-dependent methyltransferase   Machinery gene
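
Several adjacent comG genes in this table have overlapping coordinates (for example, comGA ends at 149831 while comGB starts at 149779). The sketch below computes the gap or overlap at each junction from the coordinates listed above. It assumes inclusive coordinates on the same strand; the helper name `junctions` is hypothetical.

```python
# (gene, start, end) for the comGA..comGG genes, copied from the Genomic Context table
genes = [
    ("comGA", 148890, 149831),
    ("comGB", 149779, 150795),
    ("comGC", 150797, 151120),
    ("comGD", 151083, 151517),
    ("comGE", 151480, 151782),
    ("comGF", 151745, 152206),
    ("comGG", 152184, 152597),
]


def junctions(genes):
    """Yield (upstream, downstream, gap_bp) for adjacent genes.

    gap_bp > 0 is the number of intergenic bases; gap_bp < 0 is an overlap.
    """
    for (up, _, up_end), (down, down_start, _) in zip(genes, genes[1:]):
        yield up, down, down_start - up_end - 1


for up, down, gap in junctions(genes):
    kind = "overlap" if gap < 0 else "gap"
    print(f"{up} -> {down}: {kind} of {abs(gap)} bp")
```

With the listed coordinates, comGA and comGB overlap by 53 bp, and comGB and comGC are separated by a single base.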

Sequence


Protein


Length: 313 a.a.   Molecular weight: 35638.56 Da   Isoelectric Point: 5.6069

>NTDB_id=362612 FD735_RS00845 WP_061421835.1 148890..149831(+) (comGA/cglA/cilD) [Streptococcus sp. 1643]
MVQEIAQKIIATAKEKKAQDIYFIPKEKSYELHMRVGDERCLIDSYEFEVLAAVISHFKFVAGMNVGEKRRSQLGSCDYQ
HGEKKSSLRLSTVGDYRGHESLVIRLLHDEEQELHFWFQDMNELGEQYRQRGLYLFAGPVGSGKTTLMHELAKSLFKGQQ
VMSIEDPVEIKQEDMLQLQLNEAIGLTYENLIKLSLRHRPDLLIIGEIRDSETARAVFRASLTGATVFSTIHAKSIRGVY
ERLLELGVTEEELAVVLQGVCYQRLIGGGGIVDFANKDYQEHQPTSWNEQIDQLLKDGHITSLQAETEKISYS
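
Because the product is annotated as an ATPase, one quick sanity check is to look for a Walker A (P-loop) nucleotide-binding motif in the protein sequence. The sketch below uses the commonly cited consensus G-x(4)-G-K-[ST]. That pattern is general background knowledge and an assumption here, not something stated in this record, and a pattern hit alone is not evidence of function.

```python
import re

# Protein sequence copied from the FASTA record above (WP_061421835.1)
protein = (
    "MVQEIAQKIIATAKEKKAQDIYFIPKEKSYELHMRVGDERCLIDSYEFEVLAAVISHFKFVAGMNVGEKRRSQLGSCDYQ"
    "HGEKKSSLRLSTVGDYRGHESLVIRLLHDEEQELHFWFQDMNELGEQYRQRGLYLFAGPVGSGKTTLMHELAKSLFKGQQ"
    "VMSIEDPVEIKQEDMLQLQLNEAIGLTYENLIKLSLRHRPDLLIIGEIRDSETARAVFRASLTGATVFSTIHAKSIRGVY"
    "ERLLELGVTEEELAVVLQGVCYQRLIGGGGIVDFANKDYQEHQPTSWNEQIDQLLKDGHITSLQAETEKISYS"
)

# Assumed Walker A consensus: G, any four residues, G, K, then S or T
WALKER_A = re.compile(r"G.{4}GK[ST]")

match = WALKER_A.search(protein)
if match:
    # Report 1-based residue positions, as is usual for protein coordinates
    print(match.group(), match.start() + 1, match.end())
else:
    print("no Walker A-like motif found")
```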

Nucleotide


Length: 942 bp

>NTDB_id=362612 FD735_RS00845 WP_061421835.1 148890..149831(+) (comGA/cglA/cilD) [Streptococcus sp. 1643]
ATGGTACAAGAAATTGCACAGAAAATTATTGCTACTGCGAAAGAAAAGAAGGCTCAGGATATCTATTTTATCCCCAAGGA
AAAGTCCTACGAGCTTCACATGCGGGTTGGAGACGAACGGTGTCTAATTGATTCCTATGAGTTTGAGGTTTTAGCTGCAG
TGATTAGTCATTTTAAGTTTGTAGCGGGTATGAATGTAGGAGAGAAGAGACGTAGTCAGCTGGGATCTTGCGACTATCAG
CATGGGGAGAAGAAGTCTTCTCTGCGTTTGTCTACCGTGGGAGATTATCGGGGACATGAGAGTTTGGTCATTCGTTTGTT
GCACGATGAGGAGCAGGAGCTGCATTTCTGGTTTCAGGATATGAACGAACTGGGTGAGCAGTACAGGCAACGGGGCCTCT
ATCTCTTTGCAGGTCCAGTCGGCAGTGGCAAGACGACCTTGATGCACGAATTAGCCAAGTCCCTCTTTAAGGGGCAGCAA
GTTATGTCTATCGAAGACCCAGTAGAAATCAAGCAGGAAGACATGCTCCAATTGCAGTTGAATGAGGCGATTGGATTGAC
CTATGAAAATCTGATCAAACTGTCTCTCCGACATCGTCCGGATCTCCTGATTATCGGTGAAATTCGTGACAGCGAGACGG
CGCGTGCAGTGTTCAGAGCTAGTTTGACAGGTGCGACGGTCTTTTCAACCATTCATGCCAAAAGTATCCGAGGTGTTTAT
GAACGCCTTCTGGAGTTGGGTGTGACGGAGGAGGAACTAGCAGTTGTTCTGCAAGGAGTCTGCTACCAGAGATTAATCGG
GGGAGGAGGAATCGTTGACTTTGCAAACAAAGACTATCAAGAACACCAGCCAACTAGCTGGAATGAGCAGATTGATCAGC
TTCTTAAAGATGGACATATCACAAGTCTTCAGGCTGAAACGGAAAAAATTAGCTACAGCTAA
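
The nucleotide and protein records can be checked against each other by translating the coding sequence. The sketch below builds the standard codon table and translates only the first 30 and last 12 nucleotides of the record above, to keep the example short. The function name `translate` is hypothetical, and the code assumes an unambiguous, in-frame DNA string.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3   # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


head = "ATGGTACAAGAAATTGCACAGAAAATTATT"   # first 30 nt of the record
tail = "AGCTACAGCTAA"                     # last 12 nt of the record

print(translate(head))   # MVQEIAQKII
print(translate(tail))   # SYS*
```

The two fragments translate to the start (MVQEIAQKII) and end (SYS, followed by a stop codon) of the protein sequence listed above.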


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comGA/cglA/cilD   Streptococcus mitis NCTC 12261   89.776   100   0.898
  comGA/cglA/cilD   Streptococcus pneumoniae Rx1   88.818   100   0.888
  comGA/cglA/cilD   Streptococcus pneumoniae D39   88.818   100   0.888
  comGA/cglA/cilD   Streptococcus pneumoniae R6   88.818   100   0.888
  comGA/cglA/cilD   Streptococcus pneumoniae TIGR4   88.818   100   0.888
  comYA   Streptococcus gordonii str. Challis substr. CH1   78.387   99.042   0.776
  comYA   Streptococcus mutans UA159   66.238   99.361   0.658
  comYA   Streptococcus mutans UA140   66.238   99.361   0.658
  comGA/cglA   Streptococcus sobrinus strain NIDR 6715-7   61.613   99.042   0.61
  comGA   Lactococcus lactis subsp. cremoris KW2   56.09   99.681   0.559
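
This page does not define the Ha-value. The listed numbers are, however, consistent with the product of identity and coverage (both taken as fractions), rounded to three decimals. The sketch below checks that assumption against a few rows from the table; the formula and the function name `ha_value` are inferred from the data, not taken from the database documentation.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction times coverage fraction."""
    return (identity_pct / 100) * (coverage_pct / 100)


# (organism, identity %, coverage %, listed Ha-value), copied from the table above
rows = [
    ("Streptococcus mitis NCTC 12261", 89.776, 100.0, 0.898),
    ("Streptococcus gordonii str. Challis substr. CH1", 78.387, 99.042, 0.776),
    ("Streptococcus mutans UA159", 66.238, 99.361, 0.658),
    ("Streptococcus sobrinus strain NIDR 6715-7", 61.613, 99.042, 0.61),
    ("Lactococcus lactis subsp. cremoris KW2", 56.09, 99.681, 0.559),
]

for organism, identity, coverage, listed in rows:
    computed = round(ha_value(identity, coverage), 3)
    print(f"{organism}: computed {computed}, listed {listed}")
```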


Multiple sequence alignment