Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   LC087_RS07375 Genome accession   NZ_CP129013
Coordinates   1433790..1434857 (-) Length   355 a.a.
NCBI ID   WP_226539955.1    Uniprot ID   -
Organism   Bacillus carboniphilus strain SaN35-3     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1428790..1439857
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LC087_RS07335 (LC087_07335) murB 1428854..1429772 (-) 919 Protein_1454 UDP-N-acetylmuramate dehydrogenase -
  LC087_RS07340 (LC087_07340) - 1430396..1430551 (-) 156 WP_226539965.1 hypothetical protein -
  LC087_RS07345 (LC087_07345) comGG 1430711..1431085 (-) 375 WP_226540124.1 competence type IV pilus minor pilin ComGG -
  LC087_RS07350 (LC087_07350) comGF 1431085..1431570 (-) 486 WP_226539963.1 competence type IV pilus minor pilin ComGF -
  LC087_RS07355 (LC087_07355) - 1431512..1431853 (-) 342 WP_226539961.1 hypothetical protein -
  LC087_RS07360 (LC087_07360) comGD 1431819..1432208 (-) 390 WP_226539959.1 competence type IV pilus minor pilin ComGD -
  LC087_RS19620 - 1432373..1432486 (-) 114 WP_371932658.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  LC087_RS07365 (LC087_07365) comGC 1432458..1432751 (-) 294 WP_226540123.1 competence type IV pilus major pilin ComGC Machinery gene
  LC087_RS07370 (LC087_07370) comGB 1432766..1433800 (-) 1035 WP_226539956.1 competence type IV pilus assembly protein ComGB -
  LC087_RS07375 (LC087_07375) comGA 1433790..1434857 (-) 1068 WP_226539955.1 competence type IV pilus ATPase ComGA Machinery gene
  LC087_RS07380 (LC087_07380) - 1435442..1435687 (+) 246 WP_226539954.1 DUF2626 domain-containing protein -
  LC087_RS07385 (LC087_07385) - 1435888..1436760 (-) 873 WP_306020494.1 SAM-dependent methyltransferase -
  LC087_RS07390 (LC087_07390) - 1436757..1437392 (-) 636 WP_226539952.1 MBL fold metallo-hydrolase -
  LC087_RS07395 (LC087_07395) - 1437610..1437783 (+) 174 WP_226539951.1 DUF2759 domain-containing protein -

Sequence


Protein


Download         Length: 355 a.a.        Molecular weight: 40191.55 Da        Isoelectric Point: 8.7907

>NTDB_id=850513 LC087_RS07375 WP_226539955.1 1433790..1434857(-) (comGA) [Bacillus carboniphilus strain SaN35-3]
MLSIEQLSEQLIEEACVVNASDLHIIPRDDDALIQLRIDDDIIEKHTMNKKRAERMIAYFKFLSSMDIGEKRKPQNGSLS
FKTKAIQVNLRFSTLPTINDESLVIRIFPQKMVPALQKLSLFPTTTNKLLSLINYSHGLIIFTGPTGSGKTTTLYSLLHY
AKEHFHRNIITLEDPVETKSDQVLQVQVNEKAGMTYSAGLKAILRHDPDMIMVGEIRDSETAKIAIRASLTGHLVLSTMH
TRDARGAIYRLLEFGVGLSEIEQTLIAVSAQRLVELVCPFCEGRCTPFCKSLRKTRRLGVFELLYGKNLTSSIREVRGER
VSYNYLTLKDAMKKGVSLGYLTQSSYNRWIYADEE

Nucleotide


Download         Length: 1068 bp        

>NTDB_id=850513 LC087_RS07375 WP_226539955.1 1433790..1434857(-) (comGA) [Bacillus carboniphilus strain SaN35-3]
ATGTTATCAATTGAACAATTGAGTGAACAATTGATTGAGGAAGCTTGTGTAGTAAATGCATCCGATCTTCACATTATTCC
ACGTGATGATGATGCCCTTATTCAACTAAGAATCGATGACGACATCATTGAAAAGCACACAATGAACAAAAAAAGAGCAG
AAAGAATGATTGCTTATTTTAAATTTTTGTCATCCATGGATATTGGGGAAAAAAGAAAGCCTCAAAATGGAAGTCTTTCA
TTTAAGACAAAAGCAATACAAGTAAACCTTCGCTTTTCCACACTTCCTACCATAAATGATGAAAGTTTAGTCATTCGCAT
TTTTCCACAAAAAATGGTCCCTGCGCTTCAAAAACTATCGTTATTTCCAACAACAACGAACAAGCTTTTATCTTTAATCA
ATTATTCACACGGTCTCATCATTTTTACAGGTCCTACAGGGTCTGGAAAAACGACAACTCTATATTCATTGTTACATTAT
GCAAAGGAGCATTTTCATCGTAACATTATTACCCTAGAAGATCCTGTGGAAACAAAAAGTGATCAAGTGTTGCAAGTTCA
AGTGAATGAAAAGGCGGGTATGACTTATTCTGCTGGACTTAAAGCAATATTAAGACACGATCCGGATATGATTATGGTTG
GTGAGATAAGAGACAGTGAAACAGCTAAAATAGCTATTAGGGCGAGCTTAACAGGCCATTTAGTTCTATCAACGATGCAT
ACGAGAGATGCAAGAGGAGCCATTTATCGTCTTTTAGAGTTTGGTGTAGGTTTATCTGAAATTGAACAAACGTTAATTGC
TGTAAGTGCGCAAAGGCTTGTGGAATTAGTATGCCCTTTTTGTGAAGGGCGTTGTACACCATTTTGCAAATCTTTAAGAA
AAACAAGAAGATTGGGTGTGTTTGAGCTATTATATGGGAAAAATTTAACGTCCTCAATTCGGGAAGTAAGAGGTGAAAGA
GTTTCGTATAACTATTTAACATTAAAGGATGCGATGAAGAAAGGAGTGTCATTAGGGTATTTAACACAATCATCCTATAA
CCGCTGGATTTATGCAGATGAAGAGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

62.921

100

0.631

  pilB Glaesserella parasuis strain SC1401

39.429

98.592

0.389

  pilF Thermus thermophilus HB27

42.857

84.789

0.363