Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   FOLKNPGA_RS13700 Genome accession   NZ_CP059400
Coordinates   3136686..3138902 (+) Length   738 a.a.
NCBI ID   WP_182350426.1    Uniprot ID   -
Organism   Legionella sp. PC1000     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3131686..3143902
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FOLKNPGA_RS13670 (FOLKNPGA_02776) - 3132141..3133133 (+) 993 WP_232891287.1 polysaccharide deacetylase family protein -
  FOLKNPGA_RS13675 (FOLKNPGA_02777) fimT 3133379..3133903 (+) 525 WP_182350421.1 GspH/FimT family protein Machinery gene
  FOLKNPGA_RS13680 (FOLKNPGA_02778) pilV 3133916..3134458 (+) 543 WP_182350422.1 type IV pilus modification protein PilV -
  FOLKNPGA_RS13685 (FOLKNPGA_02779) - 3134455..3135522 (+) 1068 WP_182350423.1 PilW family protein -
  FOLKNPGA_RS13690 (FOLKNPGA_02780) - 3135636..3136157 (+) 522 WP_232891309.1 pilus assembly protein -
  FOLKNPGA_RS13695 (FOLKNPGA_02781) pilE 3136167..3136592 (+) 426 WP_182350425.1 type IV pilin protein Machinery gene
  FOLKNPGA_RS13700 (FOLKNPGA_02782) comEC 3136686..3138902 (+) 2217 WP_182350426.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  FOLKNPGA_RS13705 (FOLKNPGA_02783) - 3139078..3140115 (+) 1038 WP_182350427.1 bifunctional transcriptional activator/DNA repair enzyme AdaA -
  FOLKNPGA_RS13710 (FOLKNPGA_02784) - 3140212..3141015 (-) 804 WP_182350428.1 hypothetical protein -
  FOLKNPGA_RS13715 (FOLKNPGA_02785) - 3141108..3141389 (-) 282 WP_182350429.1 hypothetical protein -
  FOLKNPGA_RS19030 (FOLKNPGA_02786) - 3141675..3141803 (-) 129 WP_255465984.1 hypothetical protein -
  FOLKNPGA_RS13720 (FOLKNPGA_02787) - 3141982..3142290 (-) 309 WP_182350430.1 hypothetical protein -
  FOLKNPGA_RS13725 (FOLKNPGA_02788) - 3142449..3142925 (+) 477 WP_182350431.1 Rrf2 family transcriptional regulator -
  FOLKNPGA_RS13730 (FOLKNPGA_02789) - 3142894..3143253 (+) 360 WP_182350432.1 DUF488 domain-containing protein -
  FOLKNPGA_RS13735 (FOLKNPGA_02790) - 3143333..3143725 (+) 393 WP_182350433.1 group III truncated hemoglobin -
  FOLKNPGA_RS13740 (FOLKNPGA_02791) cydP 3143712..3143891 (+) 180 WP_182352905.1 cytochrome oxidase putative small subunit CydP -

Sequence


Protein


Download         Length: 738 a.a.        Molecular weight: 83363.28 Da        Isoelectric Point: 9.4795

>NTDB_id=469391 FOLKNPGA_RS13700 WP_182350426.1 3136686..3138902(+) (comEC) [Legionella sp. PC1000]
MEIFCFFTGILYLYTFNHSLPLITLLFFFLSPKYSLILFFILGVAMAGMHQALVSPKGIPNLTLLPKVTVQGTIASTPNQ
DFTKTQFLFALERYNHHPAQGLIQLSWYTNAPKLHAGQRWQFTVKLKKPRNYLNPGSSDYVGMLAARHIQWTGYILSKNN
QVFPEAEPHFNWLLLREHLGAKLSQLAPNQSTAGVVEALTLNLTTHISQKNWDLFRRTGTTHLFGISGEHIALVSGIIYW
LVRWLWSKSSRCCLFIPAPYVASMSGLWAALFYAFLAGFAPPVQRALIGCFFYTLYSLGKQRFSTWQIWRYALFGVLCIE
PHAVFMQGFYFSFLAVACLLLTQQRWRLKGYKGKLALQLSCLIGLMPLTLYWYSYGSINGFIANLFAIPLVGLLIVPLAL
ITMTLCSCRIAALLMKLLSLLIALLFKGLYLVEHIAIMNINGSISHIGLVVILMGALLMWVLLPIKPFQWIALLWILLPF
FPPRTVSSPGEALIDILDVGQGLAIVIRSQHHTLIYDTGDRFFQGNDLGKMVILPYLKTLGIKKIDFVVISHPDKDHRGG
LDSLEKEIPVDQLLVNDPHYYDHGVRCHDYPQWDWNGVSFRFLPITAHFKNKNNNSCILQISTKAGKILLTGDIEKIAED
YLVKTYEAELASDVLIVPHHGSKTSSSYRFLLEVAPHYAIASLGFDNRFHFPHAKTLANMKSLNIPFFRTDQCGMVRLAL
PAQGEIKKPICFSGLDVI

Nucleotide


Download         Length: 2217 bp        

>NTDB_id=469391 FOLKNPGA_RS13700 WP_182350426.1 3136686..3138902(+) (comEC) [Legionella sp. PC1000]
ATGGAAATTTTTTGCTTTTTCACAGGCATACTTTATCTATATACCTTTAATCATTCTTTACCCTTAATTACCTTGCTCTT
CTTTTTTCTCAGTCCAAAGTATTCCTTAATTCTGTTTTTTATTTTAGGTGTTGCGATGGCAGGAATGCATCAAGCCCTTG
TTTCACCCAAAGGAATACCCAATCTCACTCTACTCCCTAAAGTAACCGTGCAAGGAACAATTGCATCGACTCCTAACCAA
GATTTTACCAAAACCCAATTTTTATTTGCCCTTGAACGATATAATCATCATCCGGCACAAGGATTAATCCAATTATCCTG
GTATACTAATGCACCAAAATTGCATGCAGGACAACGCTGGCAATTCACGGTCAAATTAAAAAAACCTCGGAATTACCTTA
ATCCTGGAAGTTCAGATTATGTAGGCATGCTTGCCGCACGACATATTCAGTGGACTGGCTATATACTCTCCAAAAACAAT
CAAGTGTTCCCTGAGGCAGAACCACATTTTAATTGGCTTCTCTTGCGTGAACATTTGGGAGCTAAACTCAGCCAATTAGC
ACCCAACCAATCCACTGCAGGCGTTGTTGAAGCATTGACTTTAAATTTAACTACCCATATTAGCCAGAAAAATTGGGATC
TATTTAGACGTACGGGCACCACCCACCTTTTTGGGATCTCGGGAGAACACATTGCATTGGTATCGGGGATAATTTATTGG
CTGGTTCGCTGGCTATGGTCTAAAAGCTCCCGATGTTGTTTGTTTATCCCAGCTCCCTATGTGGCCAGCATGAGTGGTCT
TTGGGCTGCTTTATTTTATGCCTTTTTAGCAGGGTTTGCTCCACCGGTACAAAGGGCATTAATCGGGTGTTTCTTTTATA
CGCTCTACAGCTTGGGGAAACAACGTTTTTCCACTTGGCAAATATGGCGTTATGCCTTATTTGGCGTTTTATGTATTGAA
CCACATGCCGTATTTATGCAAGGATTTTATTTTTCATTTCTTGCGGTAGCGTGTTTGCTTTTAACTCAACAACGTTGGCG
ATTAAAGGGTTACAAAGGAAAGTTAGCCTTACAATTAAGCTGCTTAATTGGGCTCATGCCCTTAACCCTATACTGGTATT
CTTATGGTTCAATTAACGGCTTTATTGCAAATTTATTTGCTATTCCTCTCGTTGGTCTTTTGATCGTCCCCTTAGCATTA
ATCACGATGACCCTTTGTTCATGCAGGATTGCTGCTCTTTTAATGAAACTCTTATCGTTGTTGATTGCCTTGTTATTTAA
AGGATTGTACTTGGTTGAACATATAGCAATCATGAACATTAATGGGTCTATTTCCCATATTGGGTTGGTTGTTATATTAA
TGGGGGCTTTGCTGATGTGGGTTCTACTGCCAATTAAACCCTTTCAATGGATTGCACTGTTATGGATACTGCTCCCATTT
TTTCCGCCCCGTACTGTATCATCTCCAGGAGAAGCACTGATTGATATTTTGGATGTTGGCCAAGGTTTGGCAATTGTCAT
TAGAAGCCAACATCATACCCTGATTTATGATACTGGAGATCGATTTTTTCAAGGTAATGATCTAGGGAAAATGGTGATTT
TACCCTATCTTAAAACCTTAGGAATAAAAAAAATTGACTTTGTGGTCATTAGCCATCCGGATAAAGATCATCGTGGCGGA
CTCGACTCGCTTGAAAAAGAAATACCAGTGGATCAGTTATTAGTCAATGATCCACATTACTATGATCATGGTGTGAGATG
TCATGATTACCCGCAGTGGGATTGGAATGGTGTTTCTTTTCGTTTCTTGCCGATTACTGCTCACTTTAAGAATAAAAATA
ATAATTCGTGTATTCTACAAATAAGCACCAAGGCTGGAAAAATATTATTAACAGGGGATATTGAAAAAATAGCAGAAGAT
TATTTAGTAAAAACTTATGAAGCAGAGCTTGCTTCGGATGTTCTGATTGTCCCTCATCATGGCAGTAAAACCTCGTCTTC
TTATCGATTTCTACTTGAAGTTGCGCCACACTATGCCATAGCTTCTTTAGGTTTTGATAATCGTTTTCACTTTCCTCATG
CTAAAACTTTGGCAAACATGAAATCATTGAATATCCCCTTTTTTAGAACAGATCAGTGTGGCATGGTACGACTTGCCTTG
CCGGCTCAAGGTGAAATAAAAAAACCTATTTGTTTTAGCGGTCTTGATGTAATCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Legionella pneumophila strain Lp02

64.49

99.593

0.642

  comEC Legionella pneumophila strain ERS1305867

64.169

99.458

0.638