Detailed information    

In silico: bioinformatically predicted

Overview


Name: comN
Type: Machinery gene
Locus tag: O1Q81_RS04310
Genome accession: NZ_CP170385
Coordinates: 888613..889137 (+)
Length: 174 a.a.
NCBI ID: WP_386698405.1
UniProt ID: -
Organism: Lonepinella sp. MS14436
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
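
The coordinates and the protein length listed above are linked by simple arithmetic: the span is inclusive at both ends, and the last codon is a stop codon that is not counted in the protein. A minimal sketch of that check follows; the helper name `span_lengths` is hypothetical, and the code assumes a complete coding sequence ending in a single stop codon.

```python
import re

def span_lengths(coords: str) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a 'start..end (strand)' string."""
    start, end = map(int, re.match(r"(\d+)\.\.(\d+)", coords).groups())
    nt_len = end - start + 1   # coordinates are 1-based and inclusive
    aa_len = nt_len // 3 - 1   # assumption: complete CDS, final codon is a stop
    return nt_len, aa_len

print(span_lengths("888613..889137 (+)"))  # (525, 174)
```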

Genomic Context


Location: 883613..894137
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
O1Q81_RS04270 (O1Q81_00849) | tolR | 884360..884785 (+) | 426 | WP_386689760.1 | colicin uptake protein TolR | -
O1Q81_RS04275 (O1Q81_00850) | tolA | 884807..885958 (+) | 1152 | WP_386689763.1 | cell envelope integrity protein TolA | -
O1Q81_RS04280 (O1Q81_00851) | tolB | 885996..887288 (+) | 1293 | WP_386689766.1 | Tol-Pal system beta propeller repeat protein TolB | -
O1Q81_RS04285 (O1Q81_00852) | pal | 887311..887769 (+) | 459 | WP_386689769.1 | peptidoglycan-associated lipoprotein Pal | -
O1Q81_RS04310 (O1Q81_00857) | comN | 888613..889137 (+) | 525 | WP_386698405.1 | prepilin-type N-terminal cleavage/methylation domain-containing protein | Machinery gene
O1Q81_RS04315 (O1Q81_00858) | - | 889134..889865 (+) | 732 | WP_386698407.1 | hypothetical protein | -
O1Q81_RS04320 (O1Q81_00859) | - | 889871..890557 (+) | 687 | WP_386698408.1 | DUF2572 family protein | -
O1Q81_RS04325 (O1Q81_00860) | comQ | 890541..890837 (+) | 297 | WP_386695942.1 | DUF5374 domain-containing protein | Machinery gene
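
The genomic context above can be checked numerically. The sketch below recomputes the displayed window and the spacing between neighbouring genes from the listed coordinates. The 5000 bp flank is an assumption inferred from the numbers (it is not stated on the page), and the helper name `gaps` is hypothetical.

```python
# (locus_tag, start, end) copied from the table above; all genes are on the + strand
genes = [
    ("O1Q81_RS04270", 884360, 884785),
    ("O1Q81_RS04275", 884807, 885958),
    ("O1Q81_RS04280", 885996, 887288),
    ("O1Q81_RS04285", 887311, 887769),
    ("O1Q81_RS04310", 888613, 889137),
    ("O1Q81_RS04315", 889134, 889865),
    ("O1Q81_RS04320", 889871, 890557),
    ("O1Q81_RS04325", 890541, 890837),
]

FLANK = 5000  # assumption: flank size inferred from the Location line

# Window shown around the target gene (comN)
target = next(g for g in genes if g[0] == "O1Q81_RS04310")
window = (target[1] - FLANK, target[2] + FLANK)
print(window)  # (883613, 894137)

def gaps(gene_list):
    """Distance in bp between consecutive genes; a negative value means overlap."""
    return [(a[0], b[0], b[1] - a[2] - 1) for a, b in zip(gene_list, gene_list[1:])]

for left, right, gap in gaps(genes):
    print(left, right, gap)
```

With these coordinates, comN overlaps the downstream gene O1Q81_RS04315 by 4 bp (gap of -4).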

Sequence


Protein


Length: 174 a.a.    Molecular weight: 19963.89 Da    Isoelectric point: 8.4799

>NTDB_id=1056492 O1Q81_RS04310 WP_386698405.1 888613..889137(+) (comN) [Lonepinella sp. MS14436]
MKSGFTLFEMLIVLLIISIISLISLPAWQSDSQRILAKEQHRLYLFLRQIQARVENSTQIWYLILNRDVGQQRWCVTAQI
KSDTTCDCFNPSFCPDNLSAQFYYPFSSQKTMITAKAYYPQKLISFNGTRNSSDSGCFMLQNSQSRILFSFSNLGRIRLK
SDQAENACNAGVEE

Nucleotide


Length: 525 bp

>NTDB_id=1056492 O1Q81_RS04310 WP_386698405.1 888613..889137(+) (comN) [Lonepinella sp. MS14436]
ATGAAAAGTGGTTTTACCTTATTTGAAATGCTTATTGTTTTGCTGATTATTAGCATCATATCTTTAATTAGCCTGCCTGC
ATGGCAAAGTGATAGCCAACGAATTTTAGCCAAAGAACAACACCGTTTATATCTATTCTTACGCCAAATTCAAGCCAGAG
TAGAAAATTCTACTCAAATCTGGTATCTCATTCTCAATCGAGATGTTGGTCAGCAACGCTGGTGTGTTACCGCTCAAATT
AAGTCAGATACGACTTGCGATTGCTTTAATCCCAGTTTTTGTCCTGATAACCTTTCGGCTCAGTTTTATTATCCTTTTAG
TTCACAAAAAACCATGATTACTGCTAAAGCCTATTATCCACAAAAATTAATCAGTTTTAATGGTACTCGCAATTCGTCAG
ATTCAGGCTGTTTTATGTTGCAAAATTCGCAATCTCGCATCTTATTTTCTTTTTCTAACTTAGGACGAATCCGCTTAAAA
AGTGATCAAGCTGAGAATGCCTGCAATGCTGGAGTAGAAGAATGA
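
The nucleotide and protein records above should agree under the standard genetic code. The sketch below translates only the first 30 and last 15 nucleotides of the sequence as a spot check. The function name `translate` is hypothetical, and the standard codon table is assumed to apply (the page does not name a translation table).

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest)
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(nt: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))

head = "ATGAAAAGTGGTTTTACCTTATTTGAAATG"  # first 30 nt of the record
tail = "GGAGTAGAAGAATGA"                 # last 15 nt of the record (in frame)

print(translate(head))  # MKSGFTLFEM
print(translate(tail))  # GVEE*
```

Both fragments match the start (MKSGFTLFEM) and end (GVEE) of the protein sequence listed above.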


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comN | Haemophilus influenzae Rd KW20 | 52.353 | 97.701 | 0.511
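
The Ha-value listed above is numerically consistent with the product of the identity and coverage fractions. This relationship is an assumption inferred from the three numbers, not a definition given on this page. A minimal check:

```python
identity_pct = 52.353   # Identities (%) from the row above
coverage_pct = 97.701   # Coverage (%) from the row above

# Assumption: Ha-value = identity fraction * coverage fraction
ha = (identity_pct / 100) * (coverage_pct / 100)
print(round(ha, 3))  # 0.511
```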


Multiple sequence alignment