Detailed information    

insolico Bioinformatically predicted

Overview


Name   comP   Type   Machinery gene
Locus tag   ABZN20_RS02480 Genome accession   NZ_CP160399
Coordinates   524723..525136 (+) Length   137 a.a.
NCBI ID   WP_218806547.1    Uniprot ID   -
Organism   Methylococcus sp. ANG     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 519723..530136
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ABZN20_RS02465 (ABZN20_02465) - 520352..521968 (+) 1617 WP_367026644.1 tetratricopeptide repeat protein -
  ABZN20_RS02470 (ABZN20_02470) istB 522050..522805 (-) 756 WP_367025276.1 IS21-like element helper ATPase IstB -
  ABZN20_RS02475 (ABZN20_02475) istA 522832..524370 (-) 1539 WP_367025277.1 IS21 family transposase -
  ABZN20_RS02480 (ABZN20_02480) comP 524723..525136 (+) 414 WP_218806547.1 pilin Machinery gene
  ABZN20_RS02485 (ABZN20_02485) - 525187..527220 (-) 2034 WP_218806548.1 tetratricopeptide repeat protein -
  ABZN20_RS02490 (ABZN20_02490) - 527251..528273 (-) 1023 WP_218806549.1 glycosyltransferase -
  ABZN20_RS02495 (ABZN20_02495) - 528345..529463 (-) 1119 WP_218806550.1 DegT/DnrJ/EryC1/StrS aminotransferase family protein -
  ABZN20_RS02500 (ABZN20_02500) - 529451..529861 (-) 411 WP_218806551.1 FdtA/QdtA family cupin domain-containing protein -

Sequence


Protein


Download         Length: 137 a.a.        Molecular weight: 14002.26 Da        Isoelectric Point: 8.8409

>NTDB_id=1020885 ABZN20_RS02480 WP_218806547.1 524723..525136(+) (comP) [Methylococcus sp. ANG]
MKAVQKGFTLIELMIVVAIIGVLAAVALPAYQDYTVRAKVSELLLTAAKYRTDITEKCQLAGTCTASGTSLTVQIGGKIT
GGSVSDDGIVKIKGSTATDSVGAAVSITLTPSWNSAMGSAVWSCTGAPARYVPGSCR

Nucleotide


Download         Length: 414 bp        

>NTDB_id=1020885 ABZN20_RS02480 WP_218806547.1 524723..525136(+) (comP) [Methylococcus sp. ANG]
ATGAAAGCGGTGCAAAAAGGTTTTACATTGATCGAGCTGATGATCGTCGTGGCGATCATCGGCGTCTTGGCCGCGGTCGC
CTTGCCGGCCTACCAGGACTATACGGTTCGGGCCAAGGTTTCAGAACTGCTTCTCACGGCTGCGAAATATCGAACCGACA
TCACTGAAAAATGCCAGTTGGCTGGTACTTGTACCGCTTCTGGGACCAGTCTGACAGTCCAAATTGGTGGAAAGATAACG
GGCGGTAGCGTTAGCGACGATGGAATTGTCAAAATAAAGGGAAGTACGGCAACTGATAGCGTAGGTGCGGCTGTAAGTAT
TACGCTTACTCCAAGCTGGAACTCGGCAATGGGGTCGGCTGTTTGGTCGTGCACCGGTGCGCCGGCCAGATACGTACCAG
GCTCGTGCCGCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comP Acinetobacter baylyi ADP1

42.581

100

0.482

  pilA Ralstonia pseudosolanacearum GMI1000

36.364

100

0.438

  pilA2 Legionella pneumophila str. Paris

42.446

100

0.431

  pilA2 Legionella pneumophila strain ERS1305867

42.446

100

0.431

  pilE Neisseria gonorrhoeae strain FA1090

36.478

100

0.423

  pilA/pilAI Pseudomonas stutzeri DSM 10701

40

100

0.423

  pilA/pilAII Pseudomonas stutzeri DSM 10701

35.333

100

0.387

  pilA/pilA1 Eikenella corrodens VA1

34.211

100

0.38

  pilA Haemophilus influenzae 86-028NP

36.364

100

0.38

  pilA Pseudomonas aeruginosa PAK

33.548

100

0.38


Multiple sequence alignment