Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   DQN58_RS09770 Genome accession   NZ_LS483443
Coordinates   2011509..2012732 (+) Length   407 a.a.
NCBI ID   WP_006719142.1    Uniprot ID   E6KZA8
Organism   Aggregatibacter segnis ATCC 33393 strain NCTC 10977     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2006509..2017732
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQN58_RS09745 (NCTC10977_01952) - 2006854..2007648 (-) 795 WP_006719133.1 formate/nitrite transporter family protein -
  DQN58_RS09750 (NCTC10977_01953) - 2007812..2008357 (-) 546 WP_006719135.1 carboxymuconolactone decarboxylase family protein -
  DQN58_RS09755 (NCTC10977_01954) ampD 2008945..2009499 (-) 555 WP_006719137.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  DQN58_RS09760 (NCTC10977_01955) pilA 2009626..2010078 (+) 453 WP_006719138.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  DQN58_RS09765 (NCTC10977_01956) pilB 2010107..2011516 (+) 1410 WP_006719140.1 GspE/PulE family protein Machinery gene
  DQN58_RS09770 (NCTC10977_01957) pilC 2011509..2012732 (+) 1224 WP_006719142.1 type II secretion system F family protein Machinery gene
  DQN58_RS09775 (NCTC10977_01958) - 2012729..2013421 (+) 693 WP_006719144.1 A24 family peptidase -
  DQN58_RS09780 (NCTC10977_01959) coaE 2013471..2014094 (+) 624 WP_006719146.1 dephospho-CoA kinase -
  DQN58_RS09785 (NCTC10977_01960) yacG 2014084..2014299 (+) 216 WP_006719148.1 DNA gyrase inhibitor YacG -
  DQN58_RS09790 (NCTC10977_01961) - 2014296..2014568 (+) 273 WP_006719149.1 GNAT family N-acetyltransferase -
  DQN58_RS09795 (NCTC10977_01963) - 2014947..2016323 (+) 1377 WP_006719150.1 sodium:alanine symporter family protein -
  DQN58_RS09800 (NCTC10977_01964) raiA 2016557..2016880 (+) 324 WP_006719151.1 ribosome-associated translation inhibitor RaiA -
  DQN58_RS09805 (NCTC10977_01965) - 2016961..2017686 (+) 726 WP_006719152.1 hypothetical protein -

Sequence


Protein


Download         Length: 407 a.a.        Molecular weight: 46108.70 Da        Isoelectric Point: 10.5161

>NTDB_id=1141706 DQN58_RS09770 WP_006719142.1 2011509..2012732(+) (pilC) [Aggregatibacter segnis ATCC 33393 strain NCTC 10977]
MTKLKLFRWRAINRLQQKQKGLIVAESEAKARQQLMARGLQSIALQQNWQLSNKPKNAEICALLSQLATLLQASVPLKHS
LQILLQNCTNIALNQWLRLLLRDIERGLAFSQALEQQGLYLTYQERQLIQVGEMTGKLAAVCHEIAQHKQQALALQRKMQ
KILLYPVLVLGISLTLTVLLLLFIVPQFAAMYDNNSAQLPAFTQLLLTLSQGLQNYWLALLIVVVLTVILIRFQLKHSPW
LHRQKTRLINSIPFLNHIIQLSRLVSFSRSLFLMLQAGIPLNQALQSFLPKQQSWQTKPQLQGDLVLIAEVQSTLHWIQQ
GYPFSASVSGQIFPPAAQQMLQVGEQSGQLPKMLQFIANDHQQQLDHQIDLLSQMLEPLLMVIIGGLIGLIMLGMYLPIF
NIGSLVQ

Nucleotide


Download         Length: 1224 bp        

>NTDB_id=1141706 DQN58_RS09770 WP_006719142.1 2011509..2012732(+) (pilC) [Aggregatibacter segnis ATCC 33393 strain NCTC 10977]
ATGACTAAATTAAAATTATTTCGCTGGCGAGCCATCAATCGGTTACAACAAAAACAAAAAGGACTCATCGTAGCAGAAAG
CGAAGCAAAGGCACGGCAGCAGTTAATGGCGCGAGGTTTGCAAAGCATTGCCTTGCAACAAAACTGGCAACTGAGCAATA
AGCCGAAAAATGCCGAGATCTGTGCCTTACTTTCACAGCTCGCGACATTATTACAAGCCTCGGTCCCCTTAAAACATAGT
CTGCAAATTCTGTTGCAAAACTGCACCAATATCGCACTGAATCAATGGCTCCGTCTCTTATTACGAGACATTGAAAGGGG
TTTAGCCTTTTCTCAAGCCTTAGAGCAGCAAGGGTTATACCTTACCTATCAAGAACGCCAACTTATTCAAGTAGGTGAAA
TGACCGGCAAACTTGCTGCAGTTTGTCATGAAATAGCCCAACATAAACAACAAGCACTGGCGTTGCAGCGTAAAATGCAA
AAGATCTTGCTTTACCCTGTGTTGGTACTGGGTATTTCATTAACATTGACCGTCCTGCTATTGTTATTCATCGTGCCACA
GTTTGCCGCAATGTATGACAACAACAGCGCCCAACTCCCCGCCTTTACACAACTTCTGCTTACACTCTCTCAAGGGTTAC
AAAATTATTGGTTGGCTCTACTCATCGTTGTTGTACTCACCGTCATACTCATTCGCTTTCAGCTGAAACACTCGCCCTGG
CTTCATCGACAAAAAACGCGCCTAATCAATAGTATTCCGTTTCTTAATCACATCATTCAGCTTTCCCGTTTAGTGAGTTT
CAGCCGTAGTTTATTTCTGATGTTACAGGCTGGCATTCCACTTAATCAGGCGCTTCAATCTTTCTTGCCGAAACAACAAA
GTTGGCAAACTAAACCTCAATTACAAGGTGATTTAGTATTAATTGCAGAGGTGCAATCGACACTCCATTGGATTCAACAA
GGCTACCCGTTTTCCGCCAGCGTGAGTGGACAAATTTTTCCCCCTGCAGCGCAACAAATGCTACAAGTAGGAGAACAAAG
CGGACAATTACCCAAGATGCTGCAATTTATCGCCAACGATCATCAACAACAGCTAGATCACCAAATCGATCTGTTGTCAC
AAATGCTTGAACCTTTATTAATGGTGATTATCGGCGGACTTATCGGTTTAATTATGCTCGGCATGTACCTACCGATTTTC
AACATAGGTTCGCTAGTACAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB E6KZA8

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Haemophilus influenzae Rd KW20

62.162

100

0.622

  pilC Haemophilus influenzae 86-028NP

61.916

100

0.619

  pilC Glaesserella parasuis strain SC1401

40.541

100

0.405


Multiple sequence alignment