Detailed information    

insolico Bioinformatically predicted

Overview


Name   comP   Type   Machinery gene
Locus tag   D0B54_RS13925 Genome accession   NZ_CP031704
Coordinates   3059846..3060259 (-) Length   137 a.a.
NCBI ID   WP_117291902.1    Uniprot ID   A0A346N2C0
Organism   Solimonas sp. K1W22B-7     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 3059846..3071468 3059846..3060259 within 0


Gene organization within MGE regions


Location: 3059846..3071468
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  D0B54_RS13925 (D0B54_13925) comP 3059846..3060259 (-) 414 WP_117291902.1 pilin Machinery gene
  D0B54_RS24330 - 3060718..3061029 (-) 312 WP_162932419.1 hypothetical protein -
  D0B54_RS13930 (D0B54_13930) - 3061625..3062332 (-) 708 WP_117291903.1 transposase -
  D0B54_RS25210 (D0B54_13935) - 3062521..3062919 (-) 399 Protein_2756 transposase -
  D0B54_RS24335 - 3063080..3063985 (-) 906 WP_162932420.1 hypothetical protein -
  D0B54_RS13940 (D0B54_13940) - 3063996..3068072 (-) 4077 WP_117291904.1 RHS repeat-associated core domain-containing protein -

Sequence


Protein


Download         Length: 137 a.a.        Molecular weight: 14213.43 Da        Isoelectric Point: 8.1161

>NTDB_id=310629 D0B54_RS13925 WP_117291902.1 3059846..3060259(-) (comP) [Solimonas sp. K1W22B-7]
MKHAQRAFTLIELMIVVAIIGILAIIALPIYQEYTVRSRVAELVVLASGGRATVTENIANRGNAIDAASCDGVTTAITAT
NSTASLTCTAGVLVITGTDKAKSVALTYTPSIGSGGVVAWKCTADPSNQRYVPAECR

Nucleotide


Download         Length: 414 bp        

>NTDB_id=310629 D0B54_RS13925 WP_117291902.1 3059846..3060259(-) (comP) [Solimonas sp. K1W22B-7]
ATGAAACACGCACAACGGGCTTTTACCTTGATCGAGTTGATGATCGTGGTGGCCATCATCGGCATCCTGGCGATCATTGC
ACTGCCGATCTACCAGGAATACACGGTGCGCAGCCGCGTCGCGGAGCTGGTGGTCCTGGCAAGCGGGGGCAGGGCAACGG
TCACCGAGAACATCGCCAACCGGGGCAATGCCATCGATGCCGCTTCCTGCGACGGGGTGACCACGGCGATCACTGCCACC
AACAGCACTGCCTCGCTCACCTGTACCGCAGGCGTGCTGGTGATCACCGGCACCGACAAGGCCAAGAGCGTGGCGCTGAC
GTATACCCCGTCCATCGGGTCCGGCGGCGTCGTGGCCTGGAAGTGCACTGCCGATCCTTCGAATCAGCGCTACGTTCCGG
CCGAGTGTCGCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A346N2C0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comP Acinetobacter baylyi ADP1

48.98

100

0.526

  pilA Ralstonia pseudosolanacearum GMI1000

36.842

100

0.46

  pilA2 Legionella pneumophila str. Paris

44.776

97.81

0.438

  pilA2 Legionella pneumophila strain ERS1305867

44.03

97.81

0.431

  pilA Acinetobacter baumannii strain A118

38.298

100

0.394

  pilA/pilAI Pseudomonas stutzeri DSM 10701

37.324

100

0.387

  pilA Pseudomonas aeruginosa PAK

33.987

100

0.38

  pilA/pilAII Pseudomonas stutzeri DSM 10701

35.417

100

0.372


Multiple sequence alignment