Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   I6H44_RS06215 Genome accession   NZ_CP065983
Coordinates   1258498..1259721 (-) Length   407 a.a.
NCBI ID   WP_006719142.1    Uniprot ID   E6KZA8
Organism   Aggregatibacter segnis strain FDAARGOS_987     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1253498..1264721
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I6H44_RS06180 (I6H44_06180) - 1253544..1254269 (-) 726 WP_006719152.1 hypothetical protein -
  I6H44_RS06185 (I6H44_06185) raiA 1254350..1254673 (-) 324 WP_006719151.1 ribosome-associated translation inhibitor RaiA -
  I6H44_RS06190 (I6H44_06190) - 1254907..1256283 (-) 1377 WP_006719150.1 sodium:alanine symporter family protein -
  I6H44_RS06195 (I6H44_06195) - 1256662..1256934 (-) 273 WP_006719149.1 GNAT family N-acetyltransferase -
  I6H44_RS06200 (I6H44_06200) yacG 1256931..1257146 (-) 216 WP_006719148.1 DNA gyrase inhibitor YacG -
  I6H44_RS06205 (I6H44_06205) coaE 1257136..1257759 (-) 624 WP_006719146.1 dephospho-CoA kinase -
  I6H44_RS06210 (I6H44_06210) - 1257809..1258501 (-) 693 WP_006719144.1 A24 family peptidase -
  I6H44_RS06215 (I6H44_06215) pilC 1258498..1259721 (-) 1224 WP_006719142.1 type II secretion system F family protein Machinery gene
  I6H44_RS06220 (I6H44_06220) pilB 1259714..1261123 (-) 1410 WP_006719140.1 GspE/PulE family protein Machinery gene
  I6H44_RS06225 (I6H44_06225) pilA 1261152..1261604 (-) 453 WP_006719138.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  I6H44_RS06230 (I6H44_06230) ampD 1261731..1262285 (+) 555 WP_006719137.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  I6H44_RS06235 (I6H44_06235) - 1262873..1263418 (+) 546 WP_006719135.1 carboxymuconolactone decarboxylase family protein -
  I6H44_RS06240 (I6H44_06240) - 1263582..1264376 (+) 795 WP_006719133.1 formate/nitrite transporter family protein -

Sequence


Protein


Download         Length: 407 a.a.        Molecular weight: 46108.70 Da        Isoelectric Point: 10.5161

>NTDB_id=515938 I6H44_RS06215 WP_006719142.1 1258498..1259721(-) (pilC) [Aggregatibacter segnis strain FDAARGOS_987]
MTKLKLFRWRAINRLQQKQKGLIVAESEAKARQQLMARGLQSIALQQNWQLSNKPKNAEICALLSQLATLLQASVPLKHS
LQILLQNCTNIALNQWLRLLLRDIERGLAFSQALEQQGLYLTYQERQLIQVGEMTGKLAAVCHEIAQHKQQALALQRKMQ
KILLYPVLVLGISLTLTVLLLLFIVPQFAAMYDNNSAQLPAFTQLLLTLSQGLQNYWLALLIVVVLTVILIRFQLKHSPW
LHRQKTRLINSIPFLNHIIQLSRLVSFSRSLFLMLQAGIPLNQALQSFLPKQQSWQTKPQLQGDLVLIAEVQSTLHWIQQ
GYPFSASVSGQIFPPAAQQMLQVGEQSGQLPKMLQFIANDHQQQLDHQIDLLSQMLEPLLMVIIGGLIGLIMLGMYLPIF
NIGSLVQ

Nucleotide


Download         Length: 1224 bp        

>NTDB_id=515938 I6H44_RS06215 WP_006719142.1 1258498..1259721(-) (pilC) [Aggregatibacter segnis strain FDAARGOS_987]
ATGACTAAATTAAAATTATTTCGCTGGCGAGCCATCAATCGGTTACAACAAAAACAAAAAGGACTCATCGTAGCAGAAAG
CGAAGCAAAGGCACGGCAGCAGTTAATGGCGCGAGGTTTGCAAAGCATTGCCTTGCAACAAAACTGGCAACTGAGCAATA
AGCCGAAAAATGCCGAGATCTGTGCCTTACTTTCACAGCTCGCGACATTATTACAAGCCTCGGTCCCCTTAAAACATAGT
CTGCAAATTCTGTTGCAAAACTGCACCAATATCGCACTGAATCAATGGCTCCGTCTCTTATTACGAGACATTGAAAGGGG
TTTAGCCTTTTCTCAAGCCTTAGAGCAGCAAGGGTTATACCTTACCTATCAAGAACGCCAACTTATTCAAGTAGGTGAAA
TGACCGGCAAACTTGCTGCAGTTTGTCATGAAATAGCCCAACATAAACAACAAGCACTGGCGTTGCAGCGTAAAATGCAA
AAGATCTTGCTTTACCCTGTGTTGGTACTGGGTATTTCATTAACATTGACCGTCCTGCTATTGTTATTCATCGTGCCACA
GTTTGCCGCAATGTATGACAACAACAGCGCCCAACTCCCCGCCTTTACACAACTTCTGCTTACACTCTCTCAAGGGTTAC
AAAATTATTGGTTGGCTCTACTCATCGTTGTTGTACTCACCGTCATACTCATTCGCTTTCAGCTGAAACACTCGCCCTGG
CTTCATCGACAAAAAACGCGCCTAATCAATAGTATTCCGTTTCTTAATCACATCATTCAGCTTTCCCGTTTAGTGAGTTT
CAGCCGTAGTTTATTTCTGATGTTACAGGCTGGCATTCCACTTAATCAGGCGCTTCAATCTTTCTTGCCGAAACAACAAA
GTTGGCAAACTAAACCTCAATTACAAGGTGATTTAGTATTAATTGCAGAGGTGCAATCGACACTCCATTGGATTCAACAA
GGCTACCCGTTTTCCGCCAGCGTGAGTGGACAAATTTTTCCCCCTGCAGCGCAACAAATGCTACAAGTAGGAGAACAAAG
CGGACAATTACCCAAGATGCTGCAATTTATCGCCAACGATCATCAACAACAGCTAGATCACCAAATCGATCTGTTGTCAC
AAATGCTTGAACCTTTATTAATGGTGATTATCGGCGGACTTATCGGTTTAATTATGCTCGGCATGTACCTACCGATTTTC
AACATAGGTTCGCTAGTACAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB E6KZA8

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Haemophilus influenzae Rd KW20

62.162

100

0.622

  pilC Haemophilus influenzae 86-028NP

61.916

100

0.619

  pilC Glaesserella parasuis strain SC1401

40.541

100

0.405