Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   K8O89_RS04555 Genome accession   NZ_CP082852
Coordinates   1019075..1021294 (+) Length   739 a.a.
NCBI ID   WP_040522858.1    Uniprot ID   A0AAX0WR66
Organism   Legionella anisa strain FDAARGOS_1480     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1014075..1026294
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  K8O89_RS04520 (K8O89_04525) - 1014546..1015538 (+) 993 WP_019234245.1 polysaccharide deacetylase family protein -
  K8O89_RS04525 - 1015724..1015801 (+) 78 Protein_892 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  K8O89_RS04530 (K8O89_04530) fimT 1015829..1016248 (+) 420 WP_223499628.1 GspH/FimT family protein Machinery gene
  K8O89_RS04535 (K8O89_04535) pilV 1016260..1016802 (+) 543 WP_019234243.1 type IV pilus modification protein PilV -
  K8O89_RS04540 (K8O89_04540) - 1016799..1017866 (+) 1068 WP_019234242.1 PilW family protein -
  K8O89_RS04545 (K8O89_04545) - 1017983..1018504 (+) 522 WP_051045716.1 pilus assembly protein -
  K8O89_RS04550 (K8O89_04550) pilE 1018514..1018939 (+) 426 WP_019234240.1 type IV pilin protein Machinery gene
  K8O89_RS04555 (K8O89_04555) comEC 1019075..1021294 (+) 2220 WP_040522858.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  K8O89_RS04560 (K8O89_04560) - 1021499..1022548 (+) 1050 WP_019234238.1 bifunctional transcriptional activator/DNA repair enzyme AdaA -
  K8O89_RS04565 (K8O89_04565) - 1022665..1023456 (-) 792 WP_019234237.1 hypothetical protein -
  K8O89_RS04570 (K8O89_04570) - 1023552..1023833 (-) 282 WP_019234236.1 hypothetical protein -
  K8O89_RS04575 (K8O89_04575) - 1024094..1024405 (-) 312 WP_019234234.1 hypothetical protein -
  K8O89_RS04580 (K8O89_04580) - 1024564..1025040 (+) 477 WP_019234233.1 Rrf2 family transcriptional regulator -
  K8O89_RS04585 (K8O89_04585) - 1025009..1025365 (+) 357 WP_019234232.1 DUF488 domain-containing protein -
  K8O89_RS04590 (K8O89_04590) - 1025449..1025829 (+) 381 WP_019234231.1 group III truncated hemoglobin -
  K8O89_RS04595 (K8O89_04595) cydP 1025835..1026008 (+) 174 WP_019234230.1 cytochrome oxidase putative small subunit CydP -

Sequence


Protein


Download         Length: 739 a.a.        Molecular weight: 83771.01 Da        Isoelectric Point: 9.8030

>NTDB_id=603042 K8O89_RS04555 WP_040522858.1 1019075..1021294(+) (comEC) [Legionella anisa strain FDAARGOS_1480]
MEIFCFFIGILYLHTFNHFLLIITLLFFFLSPKYSLILFFILGVTMAGMHQALVAPKGIPNATVLPKVTLQGTIASIPNQ
DFTKTQFLFALEQYNHHPAQGLIQLSWYNKAPKLHVGQRWQFTVKLKKPRNYLNPGSSDYVGILAARHIQWTGYILSKNN
YQVYPESQPRFNWLLLREHLGNKLSQLAPNQSTAGVVEALTLNLTTHINQKNWDLFRRTGTTHLFGISGEHIALVSGMIY
WLVRWLWSKSSRCCLFIPAPYVASISGLLAALFYAFLAGFAPPVQRALIGCFFYTLYRLGKQRFSTWQIWRYALFGVLCI
EPHAVFMQGFYFSFLAVACLLLTQQRWRLKGYKGNLALQLSCLIGLMPLTLYWYSYGSINGFIANLFAIPLVGLLIVPLA
LTTMILCSWNIAALLMKLLSFLIALLFKGLYLVEHLAIMNINWSISHIELVVILMGALLMWVLLPIKPFQWIALLWILLP
FFPPRVAPPIGGALIDILDVGQGLAIVIRSQHHTLIYDTGDRFFQGNDLGKMVIVPYLKTLGIKKIDFVVISHPDKDHRG
GLNSLEKEMPVDQLLVNDPHYYDHGVTCHDYPAWVWDGVSFRFLPITAHFKNKNNNSCILQISTKAGKILLTGDIEKIAE
DYLIKTYEAKLASDVLIVPHHGSKTSSSYRFLLEVAPHYAIASLGFDNRFHFPHAKTLANMNSLNIPFFRTDQCGMVRLA
LPAQGKIKKPICFSGLKTT

Nucleotide


Download         Length: 2220 bp        

>NTDB_id=603042 K8O89_RS04555 WP_040522858.1 1019075..1021294(+) (comEC) [Legionella anisa strain FDAARGOS_1480]
ATGGAAATTTTTTGCTTTTTCATAGGCATACTCTATCTACATACCTTTAATCATTTTTTACTCATAATTACCTTGCTCTT
CTTTTTTCTTAGTCCAAAGTATTCCTTAATTCTGTTTTTTATTTTAGGGGTTACGATGGCAGGAATGCATCAAGCCCTTG
TTGCACCCAAAGGGATACCCAACGCTACCGTTCTACCTAAAGTAACCCTGCAAGGAACAATTGCATCGATTCCTAACCAA
GATTTTACCAAAACCCAATTTTTATTTGCCCTTGAACAATACAATCATCATCCAGCACAAGGATTAATCCAATTATCCTG
GTACAACAAGGCGCCAAAATTACACGTAGGACAACGCTGGCAATTCACGGTCAAATTAAAAAAACCTCGAAACTACCTTA
ATCCTGGAAGTTCAGATTATGTAGGCATACTTGCTGCACGACATATTCAGTGGACTGGTTATATTCTCTCCAAAAACAAT
TATCAAGTGTACCCAGAATCACAGCCGCGTTTTAATTGGCTTCTATTGCGCGAACATTTGGGAAATAAACTCAGCCAATT
AGCGCCAAACCAATCCACTGCAGGAGTTGTTGAAGCATTGACTTTAAATTTAACTACCCATATTAACCAGAAAAATTGGG
ATCTATTTAGACGTACAGGCACAACTCACCTTTTTGGGATTTCAGGAGAACACATTGCATTGGTATCGGGGATGATTTAT
TGGCTGGTACGCTGGCTATGGTCTAAAAGTTCCCGATGTTGTTTGTTTATCCCAGCCCCCTATGTTGCTAGCATTAGTGG
TCTTTTGGCTGCTTTATTCTATGCCTTTTTAGCAGGATTTGCTCCACCGGTACAAAGAGCATTAATCGGTTGTTTCTTTT
ATACGCTCTACCGCTTAGGGAAACAACGTTTTTCCACTTGGCAAATATGGCGTTATGCCTTATTTGGCGTTTTATGTATT
GAGCCGCATGCCGTATTTATGCAAGGATTTTATTTTTCATTTCTTGCAGTAGCATGTTTGCTTTTAACCCAACAACGTTG
GCGATTAAAGGGTTATAAAGGAAATTTGGCCTTACAATTAAGCTGCTTAATTGGACTCATGCCTTTAACCCTGTACTGGT
ATTCTTATGGTTCAATTAATGGCTTTATTGCAAATTTATTTGCTATTCCTCTCGTTGGTCTTTTGATCGTCCCTTTAGCA
TTAACCACTATGATACTTTGTTCATGGAATATTGCTGCTCTATTAATGAAACTCTTATCTTTCTTGATCGCCCTGTTATT
TAAAGGATTGTACCTGGTTGAGCATTTAGCAATCATGAACATTAATTGGTCTATTTCCCATATTGAGTTGGTTGTTATCT
TAATGGGGGCTTTACTGATGTGGGTTCTATTACCAATTAAACCTTTTCAATGGATTGCACTGTTGTGGATACTGCTCCCA
TTTTTTCCGCCTCGTGTCGCACCACCCATAGGAGGGGCATTGATTGATATTTTGGATGTTGGCCAAGGTTTGGCAATTGT
CATTAGAAGCCAACATCATACCCTGATTTATGATACTGGAGATCGATTTTTTCAAGGTAATGATCTAGGGAAAATGGTGA
TTGTGCCCTATCTTAAAACCTTAGGAATAAAAAAAATTGACTTTGTGGTGATTAGTCATCCAGATAAAGATCATCGTGGT
GGACTCAACTCGCTTGAAAAAGAAATGCCAGTGGATCAGTTATTAGTCAATGATCCTCATTACTATGATCATGGTGTGAC
ATGTCATGATTACCCAGCGTGGGTTTGGGATGGTGTTTCTTTTCGTTTTTTGCCGATTACAGCTCACTTTAAGAATAAAA
ATAATAATTCGTGTATTCTACAAATAAGCACCAAGGCTGGGAAAATATTATTAACAGGGGATATTGAAAAAATAGCCGAA
GATTATTTAATAAAAACTTATGAGGCAAAGCTTGCCTCCGACGTTTTAATTGTTCCCCATCATGGCAGTAAAACTTCGTC
TTCTTATCGATTTTTACTTGAAGTTGCGCCACACTATGCCATCGCTTCTTTAGGCTTTGATAATCGTTTTCACTTTCCTC
ATGCTAAAACCTTGGCAAACATGAACTCATTGAATATCCCCTTTTTTAGAACAGATCAGTGTGGCATGGTACGACTCGCC
TTGCCGGCCCAAGGGAAAATAAAAAAACCGATTTGTTTTAGCGGACTCAAAACAACCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Legionella pneumophila strain Lp02

64.946

99.594

0.647

  comEC Legionella pneumophila strain ERS1305867

64.49

99.459

0.641