Detailed information    

In silico: bioinformatically predicted

Overview


Name: pilA
Type: Machinery gene
Locus tag: THIVI_RS23710
Genome accession: NC_018012
Coordinates: 2730940..2731434 (+)
Length: 164 a.a.
NCBI ID: WP_014778869.1
UniProt ID: I3YBQ7
Organism: Thiocystis violascens DSM 198
Function: type IV pilus biogenesis and function (predicted from homology)
DNA binding and uptake
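
The coordinates and the protein length listed above are consistent with each other, which can be checked with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def feature_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


cds_bp = feature_length_bp(2730940, 2731434)
print(cds_bp, protein_length_aa(cds_bp))  # 495 164
```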

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) predicted in the genome by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 2723432..2734595 2730940..2731434 within 0
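
The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate ranges. The following is a minimal sketch, not VRprofile2's actual procedure: only the label "within" appears in this record, so the other labels and the distance convention (gap between the nearest ends) are assumptions made for illustration.

```python
def relate_gene_to_mge(gene: tuple[int, int], mge: tuple[int, int]) -> tuple[str, int]:
    """Classify a gene relative to an MGE; both are (start, end), 1-based inclusive."""
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        # Hypothetical label: gene lies entirely before the MGE
        return "upstream", m_start - g_end
    if g_start > m_end:
        # Hypothetical label: gene lies entirely after the MGE
        return "downstream", g_start - m_end
    # Hypothetical label: gene straddles an MGE boundary
    return "overlapping", 0


print(relate_gene_to_mge((2730940, 2731434), (2723432, 2734595)))  # ('within', 0)
```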


Gene organization within MGE regions


Location: 2723432..2734595
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  THIVI_RS12005 (Thivi_2477) - 2723432..2724313 (+) 882 WP_014778861.1 hypothetical protein -
  THIVI_RS23705 - 2724334..2724702 (+) 369 Protein_2457 ATP-binding cassette domain-containing protein -
  THIVI_RS12015 (Thivi_2479) - 2724851..2725288 (+) 438 WP_014778863.1 DUF2721 domain-containing protein -
  THIVI_RS12020 (Thivi_2480) - 2725332..2726411 (-) 1080 WP_014778864.1 glycosyltransferase -
  THIVI_RS12025 (Thivi_2481) - 2726453..2727541 (-) 1089 WP_014778865.1 glycosyltransferase family 4 protein -
  THIVI_RS12030 (Thivi_2482) - 2727538..2728572 (-) 1035 WP_014778866.1 glycosyltransferase family 9 protein -
  THIVI_RS12035 (Thivi_2483) - 2728694..2730067 (+) 1374 WP_245537263.1 O-antigen ligase family protein -
  THIVI_RS12040 (Thivi_2484) - 2730059..2730535 (-) 477 WP_014778868.1 SEL1-like repeat protein -
  THIVI_RS23710 (Thivi_2485) pilA 2730940..2731434 (+) 495 WP_014778869.1 pilin Machinery gene
  THIVI_RS12050 (Thivi_2486) - 2731467..2732450 (-) 984 WP_014778870.1 IS110 family transposase -
  THIVI_RS12055 (Thivi_2487) - 2733256..2733525 (+) 270 WP_014778871.1 DUF4160 domain-containing protein -
  THIVI_RS12060 (Thivi_2488) - 2733536..2733775 (+) 240 WP_014778872.1 DUF2442 domain-containing protein -
  THIVI_RS12065 - 2734368..2734595 (-) 228 WP_052315021.1 hypothetical protein -

Sequence


Protein


Length: 164 a.a.; Molecular weight: 16172.53 Da; Isoelectric point: 8.4687

>NTDB_id=51664 THIVI_RS23710 WP_014778869.1 2730940..2731434(+) (pilA) [Thiocystis violascens DSM 198]
MKKYQQGFTLIELMIVVAIIGILAAIALPAYQDYMVRARVTEGLTAASAAKVNVVDVLASGNPSAAAGYGNGYTSPTATE
NVTSVAIAAGTGVITVTMTAAAGNGTLTLTPNAPSGTALPVGTAAFTPPGDSVAWRCGAAGATSTFAGFTAGTLPARFAP
SACK

Nucleotide


Length: 495 bp

>NTDB_id=51664 THIVI_RS23710 WP_014778869.1 2730940..2731434(+) (pilA) [Thiocystis violascens DSM 198]
ATGAAAAAGTATCAACAAGGCTTTACTTTGATCGAACTGATGATCGTGGTGGCGATTATTGGGATTTTGGCGGCGATTGC
GTTGCCAGCGTATCAGGATTATATGGTCAGGGCTCGCGTCACCGAAGGTTTGACGGCTGCATCTGCTGCAAAGGTAAATG
TCGTCGATGTGCTAGCTTCCGGTAATCCAAGTGCTGCTGCTGGCTATGGAAATGGTTACACAAGCCCAACGGCAACGGAA
AATGTCACCAGCGTTGCGATCGCTGCTGGTACGGGCGTTATTACCGTGACCATGACCGCTGCGGCTGGCAACGGAACCCT
GACACTGACACCGAATGCACCATCTGGCACCGCCCTTCCTGTTGGTACAGCCGCCTTTACTCCTCCGGGTGATTCCGTAG
CGTGGCGCTGCGGTGCTGCCGGTGCTACTTCTACTTTTGCTGGCTTCACTGCCGGTACGCTTCCAGCACGCTTTGCGCCT
TCCGCGTGTAAATAG
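
The nucleotide record above translates to the protein record listed earlier. A minimal sketch of that check follows, using the standard genetic code amino acid assignments and plain Python; the names `CODON_TABLE`, `translate`, and `PILA_CDS` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# pilA coding sequence copied from the nucleotide record (495 bp).
PILA_CDS = (
    "ATGAAAAAGTATCAACAAGGCTTTACTTTGATCGAACTGATGATCGTGGTGGCGATTATTGGGATTTTGGCGGCGATTGC"
    "GTTGCCAGCGTATCAGGATTATATGGTCAGGGCTCGCGTCACCGAAGGTTTGACGGCTGCATCTGCTGCAAAGGTAAATG"
    "TCGTCGATGTGCTAGCTTCCGGTAATCCAAGTGCTGCTGCTGGCTATGGAAATGGTTACACAAGCCCAACGGCAACGGAA"
    "AATGTCACCAGCGTTGCGATCGCTGCTGGTACGGGCGTTATTACCGTGACCATGACCGCTGCGGCTGGCAACGGAACCCT"
    "GACACTGACACCGAATGCACCATCTGGCACCGCCCTTCCTGTTGGTACAGCCGCCTTTACTCCTCCGGGTGATTCCGTAG"
    "CGTGGCGCTGCGGTGCTGCCGGTGCTACTTCTACTTTTGCTGGCTTCACTGCCGGTACGCTTCCAGCACGCTTTGCGCCT"
    "TCCGCGTGTAAATAG"
)


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


protein = translate(PILA_CDS)
print(len(protein))  # 164
```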


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source: AlphaFold DB
ID: I3YBQ7

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  pilA    Ralstonia pseudosolanacearum GMI1000                    50.588   100      0.524
  pilA2   Legionella pneumophila strain ERS1305867                50.625   97.561   0.494
  pilA2   Legionella pneumophila str. Paris                       50.625   97.561   0.494
  pilE    Neisseria elongata subsp. glycolytica ATCC 29315        37.949   100      0.451
  comP    Acinetobacter baylyi ADP1                               41.52    100      0.433
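
For downstream use, the hits listed above can be held as simple records and filtered. This is a minimal sketch: the `Hit` type, the `filter_hits` helper, and the thresholds are hypothetical. Sorting by descending Ha-value mirrors the order of the rows in this record, but that this is the database's own ranking rule is an assumption.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    protein: str
    organism: str
    identity_pct: float
    coverage_pct: float
    ha_value: float


# Values copied from the "Similar proteins" listing.
HITS = [
    Hit("pilA", "Ralstonia pseudosolanacearum GMI1000", 50.588, 100.0, 0.524),
    Hit("pilA2", "Legionella pneumophila strain ERS1305867", 50.625, 97.561, 0.494),
    Hit("pilA2", "Legionella pneumophila str. Paris", 50.625, 97.561, 0.494),
    Hit("pilE", "Neisseria elongata subsp. glycolytica ATCC 29315", 37.949, 100.0, 0.451),
    Hit("comP", "Acinetobacter baylyi ADP1", 41.52, 100.0, 0.433),
]


def filter_hits(hits, min_identity=0.0, min_coverage=0.0):
    """Keep hits meeting both thresholds, sorted by descending Ha-value (assumed ranking)."""
    kept = [
        h for h in hits
        if h.identity_pct >= min_identity and h.coverage_pct >= min_coverage
    ]
    return sorted(kept, key=lambda h: h.ha_value, reverse=True)


for hit in filter_hits(HITS, min_identity=50.0):
    print(hit.protein, hit.organism, hit.ha_value)
```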


Multiple sequence alignment