Detailed information    

in silico   Bioinformatically predicted

Overview


Name: comC
Type: Machinery gene
Locus tag: C3363_RS03355
Genome accession: NZ_CP029150
Coordinates: 677017..677547 (+)
Length: 176 a.a.
NCBI ID: WP_015940159.1
Uniprot ID: A0A837B133
Organism: Glaesserella parasuis strain GZ20170512
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
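
The coordinates and the protein length in this overview are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. The function names are hypothetical and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


gene_bp = span_length(677017, 677547)
print(gene_bp)                  # 531
print(protein_length(gene_bp))  # 176
```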

Genomic Context


Location: 672017..682547
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
C3363_RS03335 (C3363_03360) | aroK | 672386..672907 (-) | 522 | WP_015940157.1 | shikimate kinase AroK | -
C3363_RS03340 (C3363_03365) | - | 673137..675692 (-) | 2556 | WP_159179850.1 | penicillin-binding protein 1A | -
C3363_RS03345 (C3363_03370) | comA | 675846..676523 (+) | 678 | WP_010785957.1 | hypothetical protein | Machinery gene
C3363_RS03350 (C3363_03375) | comB | 676499..677020 (+) | 522 | WP_010785956.1 | hypothetical protein | Machinery gene
C3363_RS03355 (C3363_03380) | comC | 677017..677547 (+) | 531 | WP_015940159.1 | hypothetical protein | Machinery gene
C3363_RS03360 (C3363_03385) | comD | 677592..677933 (+) | 342 | WP_232422128.1 | hypothetical protein | Machinery gene
C3363_RS03365 (C3363_03390) | comE | 677943..679205 (+) | 1263 | WP_026917118.1 | type IV pilus secretin PilQ | Machinery gene
C3363_RS03370 (C3363_03395) | nusB | 679284..679697 (+) | 414 | WP_010785952.1 | transcription antitermination factor NusB | -
C3363_RS03375 (C3363_03400) | thiL | 679706..680674 (+) | 969 | WP_021110497.1 | thiamine-phosphate kinase | -
C3363_RS03380 (C3363_03405) | - | 680677..681147 (+) | 471 | WP_005712497.1 | phosphatidylglycerophosphatase A | -
C3363_RS03385 (C3363_03410) | - | 681149..681778 (+) | 630 | WP_015940164.1 | LysE family transporter | -
C3363_RS03390 (C3363_03415) | trxA | 682075..682389 (+) | 315 | WP_015940165.1 | thioredoxin | -
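
The spacing between the neighbouring com genes can be read from the coordinates above. The sketch below computes the gap (or overlap, when negative) between consecutive genes, and checks that the displayed location matches the comC coordinates extended by 5000 bp on each side. The 5000 bp flank is inferred from the numbers on this page, not stated by the database, and the helper names are hypothetical.

```python
# (gene, start, end) for the com genes, 1-based inclusive, all on the + strand
genes = [
    ("comA", 675846, 676523),
    ("comB", 676499, 677020),
    ("comC", 677017, 677547),
    ("comD", 677592, 677933),
    ("comE", 677943, 679205),
]


def gap_bp(prev_end: int, next_start: int) -> int:
    """Bases between two genes; a negative value means they overlap."""
    return next_start - prev_end - 1


def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Gene coordinates extended by `flank` bp on both sides (assumed layout)."""
    return start - flank, end + flank


gaps = [gap_bp(a[2], b[1]) for a, b in zip(genes, genes[1:])]
for (left, _, _), (right, _, _), g in zip(genes, genes[1:], gaps):
    kind = "overlap" if g < 0 else "gap"
    print(f"{left} -> {right}: {kind} of {abs(g)} bp")

print(context_window(677017, 677547))  # (672017, 682547)
```

Under these coordinates, comA/comB and comB/comC overlap (25 bp and 4 bp), while comC/comD and comD/comE are separated by 44 bp and 9 bp.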

Sequence


Protein


Length: 176 a.a.        Molecular weight: 20865.35 Da        Isoelectric point: 9.8029

>NTDB_id=290757 C3363_RS03355 WP_015940159.1 677017..677547(+) (comC) [Glaesserella parasuis strain GZ20170512]
MKLSLAKYYLMPEHPLYRGLYYLNQNKGFILSGIFLLIVAFPVGNYFYLNQKIEQQQKQLTEVKQIIEHKQQQLKLLQQR
YQLAQDKSELLTKINQQIQLILDKNSVEIDSIQWNMEERKIYLLISQSTQKIFNVIAELNQLTTVKFQEIHLTKKTKQKY
IQLNATLLFQADTGEP

Nucleotide


Length: 531 bp

>NTDB_id=290757 C3363_RS03355 WP_015940159.1 677017..677547(+) (comC) [Glaesserella parasuis strain GZ20170512]
ATGAAATTGTCCTTAGCCAAGTATTATTTAATGCCAGAACATCCATTATATCGTGGCTTATATTATTTAAACCAAAATAA
AGGATTTATTCTATCGGGAATATTTTTATTGATTGTGGCATTTCCAGTAGGAAATTATTTTTATCTTAATCAGAAAATAG
AACAACAACAAAAACAGCTCACTGAAGTTAAACAGATTATTGAGCATAAACAGCAACAATTAAAATTATTACAACAACGC
TATCAACTTGCTCAAGATAAAAGTGAATTATTAACCAAAATAAATCAACAAATTCAGCTGATTTTAGATAAAAATAGCGT
AGAAATTGACAGTATTCAATGGAATATGGAAGAAAGAAAAATCTATTTATTAATTAGCCAATCTACACAAAAGATATTTA
ATGTGATTGCTGAGCTTAATCAATTAACAACAGTTAAATTTCAGGAAATTCATTTAACTAAAAAAACAAAACAGAAATAT
ATTCAACTCAATGCCACTTTACTTTTTCAAGCTGACACAGGAGAACCGTAA
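
The nucleotide record translates to the protein record under the standard codon assignments. The following is a minimal, standard-library-only sketch of that translation, shown on the first 30 nucleotides and the last 12 nucleotides of the sequence above. The `translate` helper is hypothetical and stops at the first stop codon; it is not the tool used by the database.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, ordered TTT, TTC, TTA, TTG, TCT, ... GGG
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAAATTGTCCTTAGCCAAGTATTATTTA"))  # MKLSLAKYYL
print(translate("GGAGAACCGTAA"))                    # GEP
```

The two outputs match the first ten and last three residues of the protein sequence listed above.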

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source | ID
AlphaFold DB | A0A837B133

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comC | Glaesserella parasuis strain SC1401 | 91.477 | 100 | 0.915
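
The listed Ha-value is numerically consistent with the product of the identity and coverage fractions. The sketch below reproduces it under that assumption; the page itself does not state the formula, and the `ha_value` function is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Identity fraction times coverage fraction (assumed definition)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


print(round(ha_value(91.477, 100), 3))  # 0.915
```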


Multiple sequence alignment