Detailed information    

In silico (bioinformatically predicted)

Overview


Name: pilC
Type: Machinery gene
Locus tag: B1781_RS18100
Genome accession: NZ_CP019936
Coordinates: 3818686..3819933 (+)
Length: 415 a.a.
NCBI ID: WP_078121001.1
UniProt ID: -
Organism: Thiosocius teredinicola strain PMS-2146H.STBD.0c.01a
Function: assembly of type IV pilus (predicted from homology); DNA binding and uptake

Genomic Context


Location: 3813686..3824933
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
B1781_RS18080 | pilR | 3814139..3815518 (+) | 1380 | WP_078120998.1 | sigma-54 dependent transcriptional regulator | Regulator
B1781_RS23645 | - | 3815788..3815887 (+) | 100 | Protein_3590 | prepilin-type N-terminal cleavage/methylation domain-containing protein | -
B1781_RS18090 | - | 3816373..3816795 (+) | 423 | WP_078120999.1 | prepilin-type N-terminal cleavage/methylation domain-containing protein | -
B1781_RS18095 | pilB | 3816958..3818673 (+) | 1716 | WP_078121000.1 | type IV-A pilus assembly ATPase PilB | Machinery gene
B1781_RS18100 | pilC | 3818686..3819933 (+) | 1248 | WP_078121001.1 | type II secretion system F family protein | Machinery gene
B1781_RS18105 | pilD | 3819975..3820811 (+) | 837 | WP_334223777.1 | A24 family peptidase | Machinery gene
B1781_RS18110 | coaE | 3820804..3821409 (+) | 606 | WP_334223778.1 | dephospho-CoA kinase | -
B1781_RS18115 | zapD | 3821465..3822250 (+) | 786 | WP_164513453.1 | cell division protein ZapD | -
B1781_RS18120 | yacG | 3822247..3822438 (+) | 192 | WP_078121004.1 | DNA gyrase inhibitor YacG | -
B1781_RS18125 | - | 3822466..3822888 (-) | 423 | WP_078121005.1 | hypothetical protein | -
B1781_RS18130 | - | 3822872..3823828 (-) | 957 | WP_078121006.1 | Nudix family hydrolase | -
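
The coordinates listed for pilC are consistent with one another: the inclusive gene span gives the nucleotide length, dropping the stop codon gives the protein length, and the context window shown under "Location" matches the gene span padded by 5,000 bp on each side. That padding is inferred from the numbers on this page, not documented here, and the function names below are illustrative only. A minimal Python sketch of the arithmetic:

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


def context_window(start: int, end: int, flank: int = 5000) -> tuple:
    """Gene span padded by `flank` bp per side (flank size is an assumption)."""
    return max(1, start - flank), end + flank


# pilC coordinates as listed on this page
PILC_START, PILC_END = 3818686, 3819933

pilc_bp = gene_length_bp(PILC_START, PILC_END)
pilc_aa = protein_length_aa(pilc_bp)
window = context_window(PILC_START, PILC_END)

print(pilc_bp)  # 1248
print(pilc_aa)  # 415
print(window)   # (3813686, 3824933)
```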

Sequence


Protein


Length: 415 a.a.; Molecular weight: 45090.20 Da; Isoelectric point: 10.0094

>NTDB_id=218973 B1781_RS18100 WP_078121001.1 3818686..3819933(+) (pilC) [Thiosocius teredinicola strain PMS-2146H.STBD.0c.01a]
MATKAAAAPKPEKALIFSWEGTDRKGNRVKGETRASTVAMARAELRRQGINPLKVRKKASSIFSNRKKKITSKDIAIFSR
QLATMMAAGVPMVQAFDIVGRGHNNPSMQEMILSIKADVEGGTSLTGSLRKHPLYFDDLFCNLVEAGEQAGVLETLLDKI
ATYKEKTESLKAKIKKALFYPTAVILVAILITSIIMIFVIPQFKDLFSSFGADLPAFTLVVIKISDFVAGWWWAILGVVV
IAVFTAANVWKRSPKFRETLDKLLLKVPVIGMIMHKAALARFCRTTATMFAAGVPLVEALQSVAGATGSAVYEKAVLKMR
DDVATGQSLTLSMRQQGLFPHMVIQMVTIGEESGSLDDMLSKVADFYEEEVDNAVDALSSLLEPLIMVVLGTVIGGLVIA
LYLPIFQLGSVVSGN

Nucleotide


Length: 1248 bp

>NTDB_id=218973 B1781_RS18100 WP_078121001.1 3818686..3819933(+) (pilC) [Thiosocius teredinicola strain PMS-2146H.STBD.0c.01a]
ATGGCTACCAAAGCAGCGGCAGCGCCGAAACCAGAAAAGGCATTGATTTTTTCCTGGGAAGGGACGGATCGGAAAGGCAA
CCGGGTAAAGGGTGAGACGCGCGCATCGACCGTTGCGATGGCACGTGCCGAACTGCGCCGACAGGGCATCAATCCTCTGA
AAGTACGCAAGAAAGCTTCTTCGATATTCTCTAATCGCAAAAAGAAGATCACCAGTAAAGACATCGCCATATTTTCACGC
CAACTCGCAACCATGATGGCGGCCGGCGTGCCGATGGTGCAGGCCTTCGATATTGTCGGGCGAGGACACAATAACCCATC
GATGCAAGAGATGATTTTGTCGATTAAAGCCGACGTCGAAGGCGGTACTTCGCTGACCGGCTCGCTAAGAAAGCACCCTT
TGTATTTCGACGACCTGTTCTGTAATTTGGTCGAAGCCGGAGAGCAAGCAGGCGTGCTTGAAACCCTGCTCGACAAAATT
GCAACCTATAAAGAGAAAACCGAGTCGCTGAAGGCAAAAATCAAGAAAGCCCTCTTCTATCCAACTGCTGTCATACTCGT
GGCGATACTAATCACATCGATCATCATGATTTTTGTGATTCCGCAGTTTAAGGACTTATTCAGCAGCTTCGGTGCCGATC
TCCCCGCCTTCACCCTGGTCGTGATCAAGATTTCTGACTTCGTTGCCGGATGGTGGTGGGCAATCCTCGGCGTCGTGGTA
ATTGCCGTCTTCACAGCGGCCAATGTCTGGAAACGATCACCCAAGTTCCGCGAAACGCTGGACAAGCTACTGCTCAAGGT
GCCGGTGATTGGGATGATCATGCACAAAGCTGCACTGGCGCGGTTCTGTCGTACCACTGCCACAATGTTTGCGGCGGGTG
TTCCGTTGGTCGAGGCCCTTCAGTCCGTGGCCGGTGCTACCGGCAGCGCGGTCTACGAAAAGGCTGTCCTCAAGATGCGT
GATGACGTCGCCACCGGTCAATCACTGACCTTATCAATGCGCCAACAAGGCCTGTTTCCGCACATGGTGATCCAGATGGT
CACCATCGGCGAAGAGTCGGGTTCGCTCGATGACATGCTGTCGAAAGTGGCGGACTTCTACGAAGAAGAGGTCGACAATG
CGGTTGATGCACTGAGCAGCCTGCTCGAGCCGCTGATCATGGTTGTGCTGGGCACCGTGATCGGTGGTCTCGTCATCGCT
CTGTATCTACCAATCTTCCAGTTGGGTTCCGTCGTTTCTGGAAACTGA
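
The nucleotide record can be checked against the protein record by translating it codon by codon. The sketch below assumes the standard codon assignments (bacterial translation table 11 uses the same assignments as table 1 and differs only in permitted start codons). The helper name `translate` is illustrative, and only the first 30 and last 48 nucleotides of the sequence above are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 48 nt of the pilC nucleotide record above.
# Both fragments are in frame because 1200 nt precede the final line.
head = "ATGGCTACCAAAGCAGCGGCAGCGCCGAAA"
tail = "CTGTATCTACCAATCTTCCAGTTGGGTTCCGTCGTTTCTGGAAACTGA"

print(translate(head))  # MATKAAAAPK
print(translate(tail))  # LYLPIFQLGSVVSGN
```

Both fragments agree with the start and end of the protein sequence listed above.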


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted with TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
pilC | Pseudomonas stutzeri DSM 10701 | 57.778 | 97.59 | 0.564
pilC | Legionella pneumophila strain ERS1305867 | 57.107 | 96.627 | 0.552
pilC | Acinetobacter baylyi ADP1 | 56.02 | 98.072 | 0.549
pilC | Acinetobacter baumannii D1279779 | 56.675 | 95.663 | 0.542
pilG | Neisseria meningitidis 44/76-A | 50.493 | 97.831 | 0.494
pilG | Neisseria gonorrhoeae MS11 | 50.493 | 97.831 | 0.494
pilC | Vibrio cholerae strain A1552 | 43.434 | 95.422 | 0.414
pilC | Vibrio campbellii strain DS40M4 | 41.562 | 95.663 | 0.398
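
The Ha-value is not defined on this page. The listed values are, however, reproduced by multiplying the identity and coverage fractions and rounding to three decimals. The sketch below checks that assumption against every hit in the list; treat the formula as an inference from these numbers, not as the database's documented definition. The function name `ha_value` is illustrative.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed formula: identity fraction times coverage fraction."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# (protein, organism, identities %, coverage %, listed Ha-value)
HITS = [
    ("pilC", "Pseudomonas stutzeri DSM 10701", 57.778, 97.59, 0.564),
    ("pilC", "Legionella pneumophila strain ERS1305867", 57.107, 96.627, 0.552),
    ("pilC", "Acinetobacter baylyi ADP1", 56.02, 98.072, 0.549),
    ("pilC", "Acinetobacter baumannii D1279779", 56.675, 95.663, 0.542),
    ("pilG", "Neisseria meningitidis 44/76-A", 50.493, 97.831, 0.494),
    ("pilG", "Neisseria gonorrhoeae MS11", 50.493, 97.831, 0.494),
    ("pilC", "Vibrio cholerae strain A1552", 43.434, 95.422, 0.414),
    ("pilC", "Vibrio campbellii strain DS40M4", 41.562, 95.663, 0.398),
]

# True for each hit whose recomputed value matches the listed one.
matches = [
    ha_value(identity, coverage) == listed
    for _, _, identity, coverage, listed in HITS
]

print(all(matches))  # True
```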


Multiple sequence alignment