Detailed information    

insolico Bioinformatically predicted

Overview


Name   comN   Type   Machinery gene
Locus tag   LK431_RS03830 Genome accession   NZ_CP085951
Coordinates   755345..755857 (+) Length   170 a.a.
NCBI ID   WP_054249351.1    Uniprot ID   -
Organism   Haemophilus influenzae strain FDAARGOS_1561     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 750345..760857
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LK431_RS03810 (LK431_03810) nrfF 750653..751798 (-) 1146 WP_111695217.1 heme lyase NrfEFG subunit NrfF -
  LK431_RS03815 (LK431_03815) - 751795..752325 (-) 531 WP_021034773.1 DsbE family thiol:disulfide interchange protein -
  LK431_RS03820 (LK431_03820) nrfE 752325..754232 (-) 1908 WP_021034774.1 heme lyase NrfEFG subunit NrfE -
  LK431_RS03825 (LK431_03825) suhB 754342..755145 (-) 804 WP_005631453.1 inositol-1-monophosphatase -
  LK431_RS03830 (LK431_03830) comN 755345..755857 (+) 513 WP_054249351.1 pilus assembly FimT family protein Machinery gene
  LK431_RS03835 (LK431_03835) comO 755857..756576 (+) 720 WP_111695216.1 PulJ/GspJ family protein Machinery gene
  LK431_RS03840 (LK431_03840) comP 756573..757256 (+) 684 WP_054249353.1 DUF2572 family protein Machinery gene
  LK431_RS03845 (LK431_03845) comQ 757249..757536 (+) 288 WP_054249354.1 DUF5374 domain-containing protein Machinery gene

Sequence


Protein


Download         Length: 170 a.a.        Molecular weight: 19876.10 Da        Isoelectric Point: 8.7271

>NTDB_id=621556 LK431_RS03830 WP_054249351.1 755345..755857(+) (comN) [Haemophilus influenzae strain FDAARGOS_1561]
MQKGVTLVELLIGLAIISIVLSFVVPLWQTDSPKTILTKEQHRLYLFLRQIQARAENSSEVWFLLINRNLATQQWCLTAQ
VKNNQTCDCLNPINCPKEVYAHFYYPYFPNKTMIQSHHIYPKEITRFDGIRNTIVTRCFILQAENERTLFLFFNVGSIRL
KTNQFDSACN

Nucleotide


Download         Length: 513 bp        

>NTDB_id=621556 LK431_RS03830 WP_054249351.1 755345..755857(+) (comN) [Haemophilus influenzae strain FDAARGOS_1561]
ATGCAGAAAGGTGTGACATTAGTGGAATTATTGATTGGGTTAGCAATCATCAGCATTGTGCTGAGTTTTGTCGTGCCATT
ATGGCAAACCGATTCACCTAAAACGATTTTAACCAAAGAGCAACATCGCCTGTATTTATTTCTACGACAAATTCAGGCTC
GGGCAGAAAATTCATCGGAAGTGTGGTTTTTACTTATCAATCGGAACCTTGCGACACAACAATGGTGCTTAACGGCACAA
GTAAAAAATAACCAAACTTGTGATTGTTTAAATCCGATAAATTGCCCGAAAGAGGTTTATGCGCATTTTTACTATCCTTA
TTTTCCTAATAAAACGATGATTCAAAGCCATCATATTTATCCAAAAGAAATCACGAGATTTGATGGCATTCGTAATACTA
TCGTTACTCGTTGCTTTATTTTGCAAGCAGAAAATGAACGTACGTTATTTTTATTTTTCAATGTTGGCAGTATTCGTTTA
AAAACCAATCAATTTGATAGTGCTTGTAATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comN Haemophilus influenzae Rd KW20

95.882

100

0.959