Detailed information    

insolico Bioinformatically predicted

Overview


Name   comP   Type   Machinery gene
Locus tag   C1H20_RS05445 Genome accession   NZ_CP025750
Coordinates   1239892..1240311 (+) Length   139 a.a.
NCBI ID   WP_102650932.1    Uniprot ID   -
Organism   Xanthomonas campestris pv. campestris strain 3811     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1234892..1245311
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C1H20_RS05410 - 1234988..1235479 (-) 492 WP_011038220.1 GNAT family N-acetyltransferase -
  C1H20_RS05415 - 1235476..1235766 (-) 291 WP_011038219.1 DUF1778 domain-containing protein -
  C1H20_RS05420 - 1235841..1236242 (-) 402 WP_012437599.1 SymE family type I addiction module toxin -
  C1H20_RS05430 coaE 1236774..1237397 (-) 624 WP_011038217.1 dephospho-CoA kinase -
  C1H20_RS05435 - 1237411..1238274 (-) 864 WP_012437600.1 A24 family peptidase -
  C1H20_RS05440 pilC 1238281..1239540 (-) 1260 WP_012437601.1 type II secretion system F family protein Machinery gene
  C1H20_RS05445 comP 1239892..1240311 (+) 420 WP_102650932.1 pilin Machinery gene
  C1H20_RS05450 - 1240530..1242041 (+) 1512 WP_256729490.1 phosphoethanolamine transferase -
  C1H20_RS05455 - 1242052..1242681 (+) 630 WP_012437604.1 hypothetical protein -
  C1H20_RS05460 - 1242651..1244177 (+) 1527 WP_016944997.1 membrane protein -

Sequence


Protein


Download         Length: 139 a.a.        Molecular weight: 14230.39 Da        Isoelectric Point: 8.4883

>NTDB_id=263617 C1H20_RS05445 WP_102650932.1 1239892..1240311(+) (comP) [Xanthomonas campestris pv. campestris strain 3811]
MKKQQGFTLIELMIVVAIIAILAAIALPAYQDYTIRSRVSEMAVLASGAKATIGENIANENAINANACRGVATFNTATTN
TASLACANGTVTVTVTGTAKAANVVLTYAPTLVDAGIRWTCSSASAAKYLPAECRGSGT

Nucleotide


Download         Length: 420 bp        

>NTDB_id=263617 C1H20_RS05445 WP_102650932.1 1239892..1240311(+) (comP) [Xanthomonas campestris pv. campestris strain 3811]
ATGAAGAAGCAGCAAGGCTTTACACTTATCGAACTGATGATCGTGGTCGCGATCATCGCTATCCTGGCCGCCATCGCGCT
GCCGGCTTATCAGGATTACACCATTCGCTCGCGTGTCTCTGAAATGGCTGTGCTCGCCTCCGGTGCCAAGGCGACAATTG
GTGAGAATATTGCCAATGAGAATGCGATTAATGCTAATGCGTGCCGCGGAGTTGCTACTTTTAACACCGCCACGACGAAC
ACTGCTTCCCTCGCGTGTGCCAACGGTACTGTCACTGTCACTGTCACTGGTACGGCTAAGGCTGCGAACGTCGTTCTGAC
CTACGCTCCGACCCTTGTGGACGCTGGCATCAGGTGGACCTGTTCGTCTGCCAGTGCGGCTAAGTACCTGCCGGCCGAAT
GCCGTGGTTCGGGCACCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comP Acinetobacter baylyi ADP1

48.993

100

0.525

  pilA Ralstonia pseudosolanacearum GMI1000

44.172

100

0.518

  pilA2 Legionella pneumophila str. Paris

50.735

97.842

0.496

  pilA2 Legionella pneumophila strain ERS1305867

49.265

97.842

0.482

  pilA/pilAI Pseudomonas stutzeri DSM 10701

40.845

100

0.417

  pilE Neisseria gonorrhoeae strain FA1090

37.662

100

0.417

  pilA/pilA1 Eikenella corrodens VA1

37.908

100

0.417

  pilE Neisseria gonorrhoeae MS11

36.129

100

0.403

  pilA Pseudomonas aeruginosa PAK

36

100

0.388

  pilA Vibrio parahaemolyticus RIMD 2210633

43.2

89.928

0.388

  pilA/pilAII Pseudomonas stutzeri DSM 10701

38.971

97.842

0.381

  pilA Haemophilus influenzae Rd KW20

37.857

100

0.381

  pilA Acinetobacter baumannii strain A118

36.301

100

0.381

  pilA Haemophilus influenzae 86-028NP

37.143

100

0.374


Multiple sequence alignment