Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   P3U44_RS06850 Genome accession   NZ_CP120180
Coordinates   1350292..1351194 (+) Length   300 a.a.
NCBI ID   WP_058611612.1    Uniprot ID   -
Organism   Mammaliicoccus sciuri strain Dog016     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1345292..1356194
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  P3U44_RS06810 (P3U44_06815) - 1345376..1345930 (+) 555 WP_058611616.1 5-formyltetrahydrofolate cyclo-ligase -
  P3U44_RS06815 (P3U44_06820) - 1345931..1347379 (+) 1449 WP_058611615.1 rhomboid family protein -
  P3U44_RS06820 (P3U44_06825) - 1347384..1347575 (+) 192 WP_078099923.1 YqgQ family protein -
  P3U44_RS06825 (P3U44_06830) - 1347572..1348543 (+) 972 WP_025904167.1 ROK family glucokinase -
  P3U44_RS06830 (P3U44_06835) - 1348543..1348857 (+) 315 WP_058611614.1 MTH1187 family thiamine-binding protein -
  P3U44_RS06835 (P3U44_06840) - 1348887..1349126 (-) 240 WP_016911885.1 DUF2759 domain-containing protein -
  P3U44_RS06840 (P3U44_06845) - 1349224..1349856 (+) 633 WP_058611613.1 MBL fold metallo-hydrolase -
  P3U44_RS06845 (P3U44_06850) - 1349890..1350138 (-) 249 WP_025904960.1 DUF2626 family protein -
  P3U44_RS06850 (P3U44_06855) comGA 1350292..1351194 (+) 903 WP_058611612.1 competence type IV pilus ATPase ComGA Machinery gene
  P3U44_RS06855 (P3U44_06860) comGB 1351184..1352224 (+) 1041 WP_323706194.1 competence type IV pilus assembly protein ComGB -
  P3U44_RS14185 - 1352239..1352304 (+) 66 WP_204236796.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  P3U44_RS06860 (P3U44_06865) - 1352392..1353591 (-) 1200 WP_323706195.1 site-specific integrase -
  P3U44_RS06865 (P3U44_06870) - 1353604..1354065 (-) 462 WP_204236787.1 ImmA/IrrE family metallo-endopeptidase -
  P3U44_RS06870 (P3U44_06875) - 1354081..1354452 (-) 372 WP_204236786.1 hypothetical protein -
  P3U44_RS06875 (P3U44_06880) - 1354548..1355126 (-) 579 WP_323706196.1 DUF5067 domain-containing protein -
  P3U44_RS06880 (P3U44_06885) - 1355173..1355784 (-) 612 WP_204236784.1 OB-fold protein -

Sequence


Protein


Download         Length: 300 a.a.        Molecular weight: 34405.30 Da        Isoelectric Point: 6.1198

>NTDB_id=804249 P3U44_RS06850 WP_058611612.1 1350292..1351194(+) (comGA) [Mammaliicoccus sciuri strain Dog016]
MEKLFKSIINDAVNSNASDIHFVPLNDDKVFVQMRINGDNIPYDLDLDFNLYSRLLIYMKFIAHLDVSERNKAQSGQYLF
EHDIHYYLRISTIPISLGIESSSIRITPQHFNHSQVKSEHQSYNVLQNRMYQSEGLFLFTGPTGSGKTTLMYELMYALKR
EKNKHIITIEDPVEQTLNGIVQVSVNEKANITYQQSFKAILRCDPDVIMIGEIRDDITAKQVIQASLSGHLVLSTMHASD
TIGAIHRLVELGVTVEQIKQGISCITNQRLVKKCDEARTLDMSYIFKDDIIKYMENDYET

Nucleotide


Download         Length: 903 bp        

>NTDB_id=804249 P3U44_RS06850 WP_058611612.1 1350292..1351194(+) (comGA) [Mammaliicoccus sciuri strain Dog016]
ATGGAGAAATTATTTAAGTCAATTATTAACGATGCTGTCAATTCTAATGCGAGTGATATTCATTTTGTACCTTTAAACGA
TGATAAAGTCTTTGTCCAAATGAGGATAAATGGCGATAATATACCTTATGATTTAGATTTAGACTTTAATTTGTATAGTA
GGCTACTTATTTATATGAAATTTATCGCTCATCTAGATGTAAGTGAACGAAATAAAGCACAAAGTGGTCAATATTTATTT
GAACATGACATACATTACTATTTAAGAATTTCCACGATACCTATCAGTTTAGGGATAGAATCTAGTTCCATTAGAATAAC
CCCTCAACATTTTAATCATTCACAAGTGAAATCAGAGCATCAATCATATAACGTTCTACAAAACAGAATGTATCAATCAG
AAGGGCTATTTTTATTTACGGGTCCAACTGGATCAGGTAAGACTACATTAATGTATGAACTTATGTACGCATTAAAGAGA
GAGAAGAATAAACATATTATTACGATTGAAGACCCTGTAGAACAAACGTTAAATGGCATCGTTCAAGTTTCAGTCAATGA
AAAAGCTAACATAACTTATCAACAATCCTTTAAAGCCATTTTAAGATGTGATCCGGACGTTATTATGATTGGAGAGATAA
GAGATGATATTACTGCTAAGCAAGTTATTCAAGCTAGTTTAAGTGGTCATTTAGTACTTTCAACAATGCATGCGAGTGAT
ACAATCGGGGCAATCCACCGACTAGTTGAGCTAGGTGTAACAGTCGAACAAATTAAACAAGGTATTTCATGTATTACGAA
CCAACGTCTTGTGAAGAAATGTGATGAAGCAAGAACATTAGATATGTCCTATATTTTTAAAGATGACATTATTAAATATA
TGGAGAATGATTATGAAACGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Staphylococcus aureus MW2

52.143

93.333

0.487

  comGA Staphylococcus aureus N315

52.143

93.333

0.487

  comGA Bacillus subtilis subsp. subtilis str. 168

40.58

92

0.373

  pilB Haemophilus influenzae Rd KW20

35.032

100

0.367