Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGC   Type   Machinery gene
Locus tag   N5069_RS14875 Genome accession   NZ_CP104728
Coordinates   3032009..3032341 (-) Length   110 a.a.
NCBI ID   WP_008181040.1    Uniprot ID   -
Organism   Lysinibacillus fusiformis strain HJ.T1     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3027009..3037341
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  N5069_RS14845 (N5069_14845) gcvPA 3027145..3028491 (-) 1347 WP_150907159.1 aminomethyl-transferring glycine dehydrogenase subunit GcvPA -
  N5069_RS14850 (N5069_14850) gcvT 3028517..3029620 (-) 1104 WP_096365269.1 glycine cleavage system aminomethyltransferase GcvT -
  N5069_RS14855 (N5069_14855) - 3030124..3030639 (-) 516 WP_025116055.1 shikimate kinase -
  N5069_RS14860 (N5069_14860) comGF 3030838..3031266 (-) 429 WP_150907158.1 competence type IV pilus minor pilin ComGF -
  N5069_RS14865 (N5069_14865) - 3031259..3031576 (-) 318 WP_036127066.1 hypothetical protein -
  N5069_RS14870 (N5069_14870) comGD 3031569..3032009 (-) 441 WP_036127068.1 competence type IV pilus minor pilin ComGD -
  N5069_RS14875 (N5069_14875) comGC 3032009..3032341 (-) 333 WP_008181040.1 competence type IV pilus major pilin ComGC Machinery gene
  N5069_RS14880 (N5069_14880) comGB 3032343..3033425 (-) 1083 WP_036127072.1 competence type IV pilus assembly protein ComGB -
  N5069_RS14885 (N5069_14885) comGA 3033400..3034419 (-) 1020 WP_036149075.1 competence type IV pilus ATPase ComGA -
  N5069_RS14890 (N5069_14890) - 3034744..3035454 (+) 711 WP_036127077.1 helix-turn-helix transcriptional regulator -
  N5069_RS14895 (N5069_14895) - 3035572..3035814 (+) 243 WP_004231035.1 DUF2626 family protein -
  N5069_RS14900 (N5069_14900) - 3035861..3036505 (-) 645 WP_217581507.1 MBL fold metallo-hydrolase -
  N5069_RS14905 (N5069_14905) - 3036675..3036854 (+) 180 WP_008181026.1 DUF2759 domain-containing protein -

Sequence


Protein


Download         Length: 110 a.a.        Molecular weight: 12241.32 Da        Isoelectric Point: 5.1145

>NTDB_id=731635 N5069_RS14875 WP_008181040.1 3032009..3032341(-) (comGC) [Lysinibacillus fusiformis strain HJ.T1]
MKHIKQQNGFTLIEMLIVLLIISILILITIPNVTKHFATIDEKGCAAYINMVQGQVEAYRVDFMAYPTLEDLVKEGYLKE
NETTCPNKEEIVITTKGEVRLANVGAGTNG

Nucleotide


Download         Length: 333 bp        

>NTDB_id=731635 N5069_RS14875 WP_008181040.1 3032009..3032341(-) (comGC) [Lysinibacillus fusiformis strain HJ.T1]
ATGAAACATATTAAACAACAAAATGGTTTTACTTTAATCGAAATGCTCATTGTTTTGTTAATCATTTCCATCCTTATTTT
AATTACAATTCCAAATGTTACGAAGCACTTCGCTACGATTGATGAGAAGGGCTGTGCTGCCTATATTAATATGGTACAGG
GGCAAGTAGAAGCCTATAGAGTGGATTTTATGGCATATCCAACTTTAGAGGATTTAGTGAAAGAAGGCTATTTGAAAGAA
AATGAAACAACTTGTCCTAATAAAGAAGAAATAGTTATTACGACTAAGGGAGAGGTACGTTTAGCAAATGTCGGTGCAGG
TACGAATGGTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGC Bacillus subtilis subsp. subtilis str. 168

55.208

87.273

0.482

  comGC Staphylococcus aureus MW2

45.263

86.364

0.391

  comGC Staphylococcus aureus N315

45.263

86.364

0.391