Detailed information    

in silico (bioinformatically predicted)

Overview


Name: pilA/pilA1
Type: Machinery gene
Locus tag: KFB96_RS04740
Genome accession: NZ_CP073760
Coordinates: 1051698..1052096 (+)
Length: 132 a.a.
NCBI ID: WP_213459423.1
UniProt ID: -
Organism: MAG: Thiocapsa sp. isolate M50B4
Function: assembly of type IV pilus (predicted from homology)
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicted in the genome, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1052320..1078749 1051698..1052096 flank 224
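
The distance in the summary row can be reproduced from the two coordinate ranges. The following is a minimal sketch, assuming that ranges are 1-based and inclusive and that the reported distance is the start of the downstream range minus the end of the upstream range. The convention VRprofile2 actually uses is not stated on this page, and the helper names `parse_range` and `gap_bp` are hypothetical.

```python
def parse_range(text):
    """Parse a 'start..end' coordinate string into a (start, end) tuple of ints."""
    start, end = text.split("..")
    return int(start), int(end)


def gap_bp(a, b):
    """Return the gap between two ranges, or 0 if they touch or overlap.

    Assumed convention: downstream start minus upstream end.
    """
    (_, first_end), (second_start, _) = sorted([a, b])
    return max(0, second_start - first_end)


gene = parse_range("1051698..1052096")
island = parse_range("1052320..1078749")

print(gap_bp(gene, island))    # 224, matching the Distance (bp) column
print(gene[1] - gene[0] + 1)   # 399, the gene length in bp
```

Under this convention the gene lies 224 bp upstream of the genomic island, which is consistent with the "flank" label.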


Gene organization within MGE regions


Location: 1051698..1078749
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  KFB96_RS04740 (KFB96_04740) pilA/pilA1 1051698..1052096 (+) 399 WP_213459423.1 pilin Machinery gene
  KFB96_RS04745 (KFB96_04745) - 1052320..1054371 (+) 2052 WP_213501773.1 tetratricopeptide repeat protein -
  KFB96_RS04750 (KFB96_04750) - 1054636..1055583 (+) 948 WP_213459897.1 glycosyltransferase family 2 protein -
  KFB96_RS04755 (KFB96_04755) - 1055663..1056874 (+) 1212 WP_213459427.1 glycosyltransferase family 4 protein -
  KFB96_RS04760 (KFB96_04760) - 1057380..1058513 (-) 1134 WP_213459429.1 glycosyltransferase -
  KFB96_RS04765 (KFB96_04765) wecB 1058600..1059721 (-) 1122 WP_300971308.1 non-hydrolyzing UDP-N-acetylglucosamine 2-epimerase -
  KFB96_RS04770 (KFB96_04770) - 1059728..1061119 (-) 1392 WP_213459431.1 polysaccharide deacetylase family protein -
  KFB96_RS04775 (KFB96_04775) - 1061313..1062272 (-) 960 WP_300971309.1 GNAT family N-acetyltransferase -
  KFB96_RS04780 (KFB96_04780) - 1062585..1063550 (-) 966 WP_213459433.1 sulfotransferase -
  KFB96_RS04785 (KFB96_04785) - 1063599..1064246 (-) 648 WP_213459435.1 methyltransferase domain-containing protein -
  KFB96_RS04790 (KFB96_04790) - 1064252..1065709 (-) 1458 WP_213459437.1 oligosaccharide flippase family protein -
  KFB96_RS04795 (KFB96_04795) - 1065814..1066389 (-) 576 WP_213459439.1 acyltransferase -
  KFB96_RS04800 (KFB96_04800) - 1066401..1067348 (-) 948 WP_213459441.1 Gfo/Idh/MocA family oxidoreductase -
  KFB96_RS04805 (KFB96_04805) - 1067581..1069620 (-) 2040 WP_213459443.1 acyltransferase family protein -
  KFB96_RS04810 (KFB96_04810) - 1069949..1071025 (+) 1077 WP_213459445.1 cytosolic protein -
  KFB96_RS04815 (KFB96_04815) - 1071611..1072627 (-) 1017 WP_213459447.1 IS110 family transposase -
  KFB96_RS04820 (KFB96_04820) - 1072999..1073295 (-) 297 WP_213459449.1 HigA family addiction module antitoxin -
  KFB96_RS04825 (KFB96_04825) - 1073310..1073591 (-) 282 WP_213459451.1 type II toxin-antitoxin system RelE/ParE family toxin -
  KFB96_RS04830 (KFB96_04830) - 1073876..1074049 (+) 174 WP_213459453.1 hypothetical protein -
  KFB96_RS04835 (KFB96_04835) - 1075235..1075444 (-) 210 WP_213459455.1 hypothetical protein -
  KFB96_RS04840 (KFB96_04840) - 1075575..1075853 (+) 279 WP_120798096.1 type II toxin-antitoxin system RelE/ParE family toxin -
  KFB96_RS04845 (KFB96_04845) - 1075866..1076171 (+) 306 WP_300971310.1 HigA family addiction module antitoxin -
  KFB96_RS04850 (KFB96_04850) - 1076222..1076941 (-) 720 WP_213459457.1 transposase -
  KFB96_RS04855 (KFB96_04855) - 1077123..1078165 (-) 1043 Protein_977 IS630 family transposase -
  KFB96_RS04860 (KFB96_04860) - 1078225..1078749 (+) 525 WP_213459459.1 pilin -

Sequence


Protein


Length: 132 a.a.        Molecular weight: 13540.58 Da        Isoelectric Point: 4.4012

>NTDB_id=560534 KFB96_RS04740 WP_213459423.1 1051698..1052096(+) (pilA/pilA1) [MAG: Thiocapsa sp. isolate M50B4]
MKKQQSGFTLIELMIVVAIIGILAAIALPAYQDYTARAQAVEALSLTGGARADLAVAQAEGAVFDDTPLADLAGKYIAAG
GVTAASSVLSVEFSSGALSGETMEISPVISGSQIQGWKCKELESKYLPSGCK

Nucleotide


Length: 399 bp

>NTDB_id=560534 KFB96_RS04740 WP_213459423.1 1051698..1052096(+) (pilA/pilA1) [MAG: Thiocapsa sp. isolate M50B4]
ATGAAAAAGCAGCAATCCGGTTTTACACTTATCGAGCTGATGATCGTCGTGGCGATCATTGGTATTTTGGCGGCGATTGC
GTTGCCGGCGTATCAGGATTATACGGCTAGGGCTCAAGCGGTCGAGGCATTGTCGCTTACGGGAGGGGCACGAGCCGACT
TGGCCGTTGCACAAGCCGAAGGCGCAGTGTTCGATGACACCCCTTTAGCAGATCTGGCTGGCAAATACATCGCGGCAGGT
GGGGTCACTGCTGCTAGTAGTGTATTGTCTGTGGAATTCTCCAGCGGTGCGTTGAGCGGTGAGACAATGGAGATTTCCCC
CGTAATCTCTGGATCACAGATTCAGGGCTGGAAATGCAAAGAACTTGAATCCAAATACCTCCCTTCGGGCTGTAAGTAG
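
The nucleotide and protein records can be cross-checked against each other. The sketch below assumes the standard codon assignments (the bacterial table, NCBI translation table 11, uses the same assignments for elongation codons) and translates only the first 30 and last 15 bases of the sequence above. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


head = "ATGAAAAAGCAGCAATCCGGTTTTACACTT"  # first 30 bases of the record
tail = "TCGGGCTGTAAGTAG"                 # last 15 bases of the record

print(translate(head))        # MKKQQSGFTL
print(translate(tail))        # SGCK*
print(399 == 3 * (132 + 1))   # True: 132 residues plus one stop codon
```

Both fragments agree with the start (MKKQQSGFTL) and end (SGCK) of the protein sequence, and the 399 bp length accounts for 132 codons plus the terminal TAG stop.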


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.




Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilA/pilA1 Eikenella corrodens VA1 44.805 100 0.523
  pilA Ralstonia pseudosolanacearum GMI1000 36.81 100 0.455
  pilA/pilAI Pseudomonas stutzeri DSM 10701 40.268 100 0.455
  pilE Neisseria gonorrhoeae MS11 34.783 100 0.424
  pilA/pilAII Pseudomonas stutzeri DSM 10701 39.437 100 0.424
  pilA2 Legionella pneumophila strain ERS1305867 38.849 100 0.409
  pilA2 Legionella pneumophila str. Paris 38.849 100 0.409
  pilE Neisseria gonorrhoeae strain FA1090 33.75 100 0.409
  pilA Acinetobacter baumannii strain A118 37.063 100 0.402
  comP Acinetobacter baylyi ADP1 35.616 100 0.394
  pilA Pseudomonas aeruginosa PAK 33.784 100 0.379
  pilA Vibrio cholerae C6706 35.211 100 0.379
  pilA Vibrio cholerae strain A1552 35.211 100 0.379
  pilA Vibrio cholerae O1 biovar El Tor strain E7946 35.211 100 0.379
  pilA Acinetobacter nosocomialis M2 42.609 87.121 0.371