Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilA   Type   Machinery gene
Locus tag   CEP45_RS05630 Genome accession   NZ_CP022010
Coordinates   1247259..1247726 (-) Length   155 a.a.
NCBI ID   WP_261920343.1    Uniprot ID   A0A8E3MDR2
Organism   Mergibacter septicus strain 27643-20-I1     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1242259..1252726
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CEP45_RS05600 (CEP45_05585) - 1242660..1242932 (-) 273 WP_261920967.1 GNAT family N-acetyltransferase -
  CEP45_RS05605 (CEP45_05590) yacG 1242972..1243178 (-) 207 WP_265490388.1 DNA gyrase inhibitor YacG -
  CEP45_RS05610 (CEP45_05595) coaE 1243171..1243809 (-) 639 WP_261920347.1 dephospho-CoA kinase -
  CEP45_RS05615 (CEP45_05600) - 1243848..1244537 (-) 690 WP_265490389.1 prepilin peptidase -
  CEP45_RS05620 (CEP45_05605) - 1244578..1245804 (-) 1227 WP_265490390.1 type II secretion system F family protein -
  CEP45_RS05625 (CEP45_05610) pilB 1245819..1247237 (-) 1419 WP_265490391.1 GspE/PulE family protein Machinery gene
  CEP45_RS05630 (CEP45_05615) pilA 1247259..1247726 (-) 468 WP_261920343.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  CEP45_RS05635 (CEP45_05620) ampD 1247854..1248453 (+) 600 WP_409240816.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  CEP45_RS05640 (CEP45_05625) - 1248617..1249684 (+) 1068 WP_265490392.1 LysM-like peptidoglycan-binding domain-containing protein -
  CEP45_RS05645 (CEP45_05630) cgtA 1249748..1250908 (-) 1161 WP_265490393.1 Obg family GTPase CgtA -
  CEP45_RS05650 (CEP45_05635) - 1250952..1251884 (-) 933 WP_265490394.1 DMT family transporter -
  CEP45_RS05655 (CEP45_05640) rpmA 1252056..1252313 (-) 258 WP_265490395.1 50S ribosomal protein L27 -
  CEP45_RS05660 (CEP45_05645) rplU 1252334..1252645 (-) 312 WP_261920338.1 50S ribosomal protein L21 -

Sequence


Protein


Download         Length: 155 a.a.        Molecular weight: 16212.65 Da        Isoelectric Point: 7.5827

>NTDB_id=235828 CEP45_RS05630 WP_261920343.1 1247259..1247726(-) (pilA) [Mergibacter septicus strain 27643-20-I1]
MKRSTLTQAALSRVGLSQGFTLIELMIVIAIIAVLATVAIPSYQNYTKKAAMSELIQAASPYKAEVELCIYNSGSKTNCS
AGQNGIQAAVTESEKVKYLKTVGVNAGVISVTGKGTLENIGYTLTPNGGENGEVIDWAVNCTGDKGIFPVGFCAN

Nucleotide


Download         Length: 468 bp        

>NTDB_id=235828 CEP45_RS05630 WP_261920343.1 1247259..1247726(-) (pilA) [Mergibacter septicus strain 27643-20-I1]
ATGAAAAGATCAACCTTAACTCAAGCAGCTTTATCTCGAGTTGGATTATCTCAAGGCTTTACTTTAATTGAATTGATGAT
TGTCATCGCAATCATTGCTGTATTAGCCACGGTGGCTATTCCTTCCTATCAAAACTATACCAAGAAAGCTGCAATGTCAG
AATTGATTCAAGCTGCTTCGCCGTATAAGGCTGAAGTTGAGTTATGTATTTATAATTCAGGTTCTAAGACAAATTGCTCA
GCAGGGCAAAATGGTATTCAAGCGGCGGTTACTGAGAGTGAGAAAGTAAAATATTTGAAAACGGTCGGAGTAAATGCAGG
TGTTATTTCAGTAACAGGTAAAGGAACATTGGAGAATATTGGATATACACTGACGCCAAATGGTGGCGAAAATGGAGAGG
TCATTGACTGGGCTGTAAACTGTACGGGGGATAAAGGAATATTTCCAGTAGGTTTTTGTGCTAATTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilA Haemophilus influenzae 86-028NP

55.396

89.677

0.497

  pilA Glaesserella parasuis strain SC1401

53.901

90.968

0.49

  pilA Haemophilus influenzae Rd KW20

53.957

89.677

0.484

  pilA Vibrio campbellii strain DS40M4

38.356

94.194

0.361


Multiple sequence alignment