Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   RCG20_RS05120 Genome accession   NZ_CP133266
Coordinates   1007956..1009038 (+) Length   360 a.a.
NCBI ID   WP_308183167.1    Uniprot ID   -
Organism   Neobacillus sp. PS3-40     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1002956..1014038
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  RCG20_RS05105 (RCG20_05105) gcvT 1003422..1004528 (-) 1107 WP_308183164.1 glycine cleavage system aminomethyltransferase GcvT -
  RCG20_RS05110 (RCG20_05110) - 1005059..1006729 (+) 1671 WP_308183165.1 SNF2-related protein -
  RCG20_RS05115 (RCG20_05115) - 1006875..1007666 (+) 792 WP_308183166.1 YqhG family protein -
  RCG20_RS05120 (RCG20_05120) comGA 1007956..1009038 (+) 1083 WP_308183167.1 competence type IV pilus ATPase ComGA Machinery gene
  RCG20_RS05125 (RCG20_05125) comGB 1009004..1010056 (+) 1053 WP_308183168.1 competence type IV pilus assembly protein ComGB -
  RCG20_RS05130 (RCG20_05130) comGC 1010166..1010483 (+) 318 WP_308183169.1 competence type IV pilus major pilin ComGC -
  RCG20_RS05135 (RCG20_05135) comGD 1010483..1010923 (+) 441 WP_308183170.1 competence type IV pilus minor pilin ComGD -
  RCG20_RS05140 (RCG20_05140) - 1010907..1011245 (+) 339 WP_308183171.1 hypothetical protein -
  RCG20_RS05145 (RCG20_05145) comGF 1011193..1011669 (+) 477 WP_308183172.1 competence type IV pilus minor pilin ComGF -
  RCG20_RS05150 (RCG20_05150) comGG 1011666..1012052 (+) 387 WP_308183173.1 competence type IV pilus minor pilin ComGG -
  RCG20_RS05155 (RCG20_05155) - 1012265..1012450 (+) 186 WP_308183174.1 YqzE family protein -
  RCG20_RS05160 (RCG20_05160) - 1012709..1013410 (+) 702 WP_308183175.1 helix-turn-helix domain-containing protein -
  RCG20_RS05165 (RCG20_05165) - 1013492..1013734 (+) 243 WP_308183176.1 DUF2626 domain-containing protein -

Sequence


Protein


Download         Length: 360 a.a.        Molecular weight: 40362.78 Da        Isoelectric Point: 8.0922

>NTDB_id=873000 RCG20_RS05120 WP_308183167.1 1007956..1009038(+) (comGA) [Neobacillus sp. PS3-40]
MFFIVSAIEILADRIISDAARNQATDIHIVPRKKDTLIQLRITNMLVPRLTLPKDECDRLISHFKFTANMDIGERRRPQS
GAIVSEVNGNLLSLRLSTLPSNNRESLVIRLLPQQEQIPFHQLSLFPAITRKLLALLKHAHGLIILTGPTGSGKSTTLYS
LLNETAHLFHRNIITLEDPIEKNYDSVLQVQVNEKAGVTYAAGLKAILRHDPDIIMVGEIRDAETAKIAVRAALTGHLVL
STMHTRDAKGAVFRLHEFGVDWLEVEQTLIAVTAQRLVELTCPYCLEHCSPHCYSIGNWKRASVFEILSGRNLSSIMKAA
KGEKVGTDYQLLKDVIRKGIALGYIKETEYERLVYDEETS

Nucleotide


Download         Length: 1083 bp        

>NTDB_id=873000 RCG20_RS05120 WP_308183167.1 1007956..1009038(+) (comGA) [Neobacillus sp. PS3-40]
GTGTTTTTTATTGTAAGTGCTATTGAAATATTAGCAGATCGAATTATTTCAGATGCAGCACGAAACCAAGCAACAGATAT
TCATATAGTTCCCAGGAAAAAGGACACACTTATCCAGCTCCGGATAACAAACATGTTAGTTCCTCGGCTAACTTTGCCAA
AAGACGAATGTGACAGATTGATCTCCCATTTTAAATTCACAGCAAATATGGATATCGGAGAAAGAAGGCGCCCCCAAAGT
GGTGCAATCGTTAGTGAAGTGAACGGAAATTTATTGAGCCTTAGGCTTTCTACTCTTCCATCTAACAATCGAGAAAGCCT
CGTGATCCGACTCTTACCACAACAAGAACAGATTCCTTTCCACCAGCTCTCACTATTCCCTGCAATAACCCGAAAATTAC
TTGCCCTCTTAAAACATGCCCATGGTTTAATCATACTAACAGGCCCAACCGGTAGCGGAAAATCGACTACTCTTTACTCT
CTTCTAAATGAAACAGCACACCTATTTCATCGCAATATCATTACTCTTGAAGATCCCATCGAAAAAAATTATGATTCTGT
CCTGCAGGTCCAAGTAAACGAAAAAGCAGGGGTTACCTATGCTGCCGGATTAAAAGCGATCCTTCGACACGATCCAGATA
TTATAATGGTTGGAGAAATAAGAGATGCTGAGACAGCTAAAATTGCTGTAAGAGCAGCACTTACTGGGCATTTAGTATTA
TCAACAATGCATACAAGAGATGCAAAGGGTGCAGTTTTTCGCCTTCACGAATTTGGGGTCGATTGGCTTGAGGTAGAACA
AACACTAATCGCCGTGACAGCACAACGATTAGTTGAACTTACATGCCCTTATTGCCTAGAACACTGTTCACCACACTGTT
ATTCAATCGGGAATTGGAAAAGAGCAAGCGTTTTCGAGATTTTATCAGGGAGAAACTTAAGTTCTATTATGAAAGCAGCA
AAAGGCGAAAAGGTAGGAACAGACTATCAGTTATTAAAAGACGTAATTCGGAAAGGAATTGCGCTCGGTTACATTAAAGA
GACTGAATATGAACGGCTTGTTTATGATGAAGAAACATCGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

55.775

98.611

0.55

  pilB Glaesserella parasuis strain SC1401

40.115

96.944

0.389

  pilB Haemophilus influenzae 86-028NP

36.932

97.778

0.361