Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilD   Type   Machinery gene
Locus tag   NMUL_RS11120 Genome accession   NC_007614
Coordinates   2424846..2425754 (-) Length   302 a.a.
NCBI ID   WP_011381433.1    Uniprot ID   -
Organism   Nitrosospira multiformis ATCC 25196     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2419846..2430754
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NMUL_RS11095 (Nmul_A2126) ispF 2420408..2420941 (-) 534 WP_011381428.1 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase -
  NMUL_RS11100 (Nmul_A2127) ispD 2421068..2421766 (-) 699 WP_011381429.1 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase -
  NMUL_RS11105 (Nmul_A2128) - 2422046..2423020 (+) 975 WP_011381430.1 sigma 54-interacting transcriptional regulator -
  NMUL_RS11110 (Nmul_A2129) zapD 2423358..2424113 (-) 756 WP_041353205.1 cell division protein ZapD -
  NMUL_RS11115 (Nmul_A2130) coaE 2424239..2424841 (-) 603 WP_011381432.1 dephospho-CoA kinase -
  NMUL_RS11120 (Nmul_A2131) pilD 2424846..2425754 (-) 909 WP_011381433.1 prepilin peptidase Machinery gene
  NMUL_RS11125 (Nmul_A2132) - 2425775..2427073 (-) 1299 WP_011381434.1 HlyC/CorC family transporter -
  NMUL_RS11130 (Nmul_A2133) - 2427207..2428034 (-) 828 WP_011381435.1 cytochrome C assembly family protein -
  NMUL_RS11135 (Nmul_A2134) ffh 2428110..2429459 (+) 1350 WP_011381436.1 signal recognition particle protein -
  NMUL_RS11140 (Nmul_A2135) - 2429516..2429947 (-) 432 WP_011381437.1 DUF192 domain-containing protein -
  NMUL_RS11145 (Nmul_A2136) dut 2429947..2430396 (-) 450 WP_011381438.1 dUTP diphosphatase -

Sequence


Protein


Download         Length: 302 a.a.        Molecular weight: 32513.22 Da        Isoelectric Point: 7.1138

>NTDB_id=25117 NMUL_RS11120 WP_011381433.1 2424846..2425754(-) (pilD) [Nitrosospira multiformis ATCC 25196]
MSFISVLQYSPVFFASFCALIGLVAGSFLNVVIYRLPRMLEREWRQQCAELQAELSSGTNGIAPAHEPHEALAAEPAFNL
ITPSSTCPHCGHRITALENIPLISYIALRGRCSQCRTAISMRYPVVEGLTAALSGLVAWHFGYGVIAFAALALVWAMVAL
AFIDLDTQLLPNDITIPLLWGGLLINLSGGFADIHSAVIGAVVGYLALWSVYWGYKLLTGREGMGYGDFKLLAAIGAWLG
WQMLPLVILSSSLVGSMAGLGLMLAAKHGRHVPIPFGPYLVCGGIVALFWGNEINRAYLGSF

Nucleotide


Download         Length: 909 bp        

>NTDB_id=25117 NMUL_RS11120 WP_011381433.1 2424846..2425754(-) (pilD) [Nitrosospira multiformis ATCC 25196]
ATGTCTTTTATCTCCGTGCTGCAATACTCTCCGGTGTTCTTTGCCTCGTTCTGCGCGCTCATTGGTCTGGTCGCCGGCAG
TTTTTTGAATGTGGTTATCTACCGCCTGCCGAGAATGCTGGAGCGGGAATGGCGACAACAGTGCGCCGAACTGCAGGCAG
AACTGTCTTCCGGAACAAACGGGATAGCGCCAGCACACGAACCGCATGAAGCCCTTGCAGCGGAGCCGGCTTTCAATCTC
ATTACGCCGTCTTCCACTTGTCCCCATTGCGGGCACAGGATTACGGCGCTCGAAAATATTCCGCTCATCAGCTACATCGC
ATTGAGGGGACGCTGCTCGCAGTGCCGCACCGCGATTTCCATGCGCTATCCCGTTGTGGAAGGGTTAACCGCAGCGCTGA
GCGGCCTTGTCGCGTGGCATTTTGGTTATGGGGTTATCGCTTTTGCGGCGCTTGCGCTTGTATGGGCCATGGTTGCGCTT
GCTTTCATCGACCTGGATACTCAGTTGCTGCCCAATGACATTACCATCCCCCTGTTATGGGGAGGGTTATTGATTAACCT
GAGCGGTGGTTTTGCGGATATCCATTCTGCCGTGATTGGCGCCGTAGTGGGATATCTTGCCCTGTGGTCGGTGTATTGGG
GTTATAAACTCCTGACCGGCCGGGAGGGAATGGGTTATGGGGACTTCAAGTTGCTGGCCGCCATTGGCGCCTGGCTGGGG
TGGCAAATGCTGCCCCTGGTGATTCTGTCTTCATCCCTCGTGGGAAGCATGGCAGGGCTTGGCCTGATGCTGGCAGCAAA
GCACGGGCGTCATGTCCCTATTCCGTTCGGGCCTTATCTGGTGTGTGGGGGAATCGTGGCTCTGTTTTGGGGAAACGAGA
TCAATAGAGCTTACCTGGGGTCATTTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilD Vibrio cholerae strain A1552

49

99.338

0.487

  pilD Vibrio campbellii strain DS40M4

48.495

99.007

0.48

  pilD Neisseria gonorrhoeae MS11

46.309

98.675

0.457

  pilD Acinetobacter nosocomialis M2

45.035

93.377

0.421

  pilD Acinetobacter baumannii D1279779

44.326

93.377

0.414


Multiple sequence alignment