Detailed information    

In silico: bioinformatically predicted

Overview


Name: rec2
Type: Machinery gene
Locus tag: DV401_RS06080
Genome accession: NZ_CP031254
Coordinates: 1200620..1202986 (+)
Length: 788 a.a.
NCBI ID: WP_103708077.1
UniProt ID: -
Organism: Haemophilus influenzae strain M25588
Function: ssDNA transport through the inner membrane (predicted from homology)
DNA binding and uptake
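
The coordinates and lengths listed above are tied together by simple arithmetic: the interval 1200620..1202986 spans 2367 bp, which is 789 codons, and dropping the terminal stop codon leaves 788 amino acids. A minimal sketch of that check follows. The helper name `parse_location` is hypothetical, and the 1-based inclusive coordinate convention is an assumption inferred from the numbers on this page.

```python
import re


def parse_location(location):
    """Split a 'start..end (strand)' string into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


start, end, strand = parse_location("1200620..1202986 (+)")

# Assumption: coordinates are 1-based and inclusive, so add 1.
length_bp = end - start + 1

# A complete coding sequence is a whole number of codons;
# the final codon is the stop codon and is not translated.
codons, remainder = divmod(length_bp, 3)
protein_length = codons - 1

print(length_bp, codons, remainder, protein_length)
```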

Genomic Context


Location: 1195620..1207986
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| DV401_RS06055 | - | 1195969..1197267 (-) | 1299 | WP_071162683.1 | LysM peptidoglycan-binding domain-containing protein | - |
| DV401_RS06060 | tsaE | 1197275..1197751 (-) | 477 | WP_071162682.1 | tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex ATPase subunit type 1 TsaE | - |
| DV401_RS06065 | folK | 1197827..1198309 (-) | 483 | WP_071162681.1 | 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine diphosphokinase | - |
| DV401_RS06070 | pcnB | 1198318..1199760 (-) | 1443 | WP_373364797.1 | polynucleotide adenylyltransferase PcnB | - |
| DV401_RS06075 | dksA | 1199924..1200361 (-) | 438 | WP_005630220.1 | RNA polymerase-binding protein DksA | - |
| DV401_RS06080 | rec2 | 1200620..1202986 (+) | 2367 | WP_103708077.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| DV401_RS06085 | msbA | 1203027..1204790 (+) | 1764 | WP_103708078.1 | lipid A ABC transporter ATP-binding protein/permease MsbA | - |
| DV401_RS06090 | lpxK | 1204863..1205861 (+) | 999 | WP_103708079.1 | tetraacyldisaccharide 4'-kinase | - |
| DV401_RS06095 | kdsB | 1205932..1206696 (+) | 765 | WP_038439601.1 | 3-deoxy-manno-octulosonate cytidylyltransferase | - |
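
The genomic context figures can be cross-checked the same way. The sketch below recomputes the displayed window and the intergenic gaps around rec2 from the coordinates listed above. The 5000 bp flank is an assumption inferred from the Location line (it is not stated on the page), and the variable names are illustrative only.

```python
# Coordinates copied from the genomic context listing (1-based, inclusive).
genes = [
    ("dksA", 1199924, 1200361, "-", 438),
    ("rec2", 1200620, 1202986, "+", 2367),
    ("msbA", 1203027, 1204790, "+", 1764),
]

# Assumption: the window is the target gene padded by 5000 bp on each side.
FLANK = 5000
_, rec2_start, rec2_end, _, _ = genes[1]
window = (rec2_start - FLANK, rec2_end + FLANK)

# Each listed size should equal end - start + 1.
sizes_ok = all(end - start + 1 == size for _, start, end, _, size in genes)

# Intergenic gap = bases strictly between one gene's end and the next start.
gaps = {
    f"{left[0]}-{right[0]}": right[1] - left[2] - 1
    for left, right in zip(genes, genes[1:])
}

print(window, sizes_ok, gaps)
```

Under these assumptions the window reproduces the Location line, and rec2 sits 258 bp downstream of the end of dksA and 40 bp upstream of msbA.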

Sequence


Protein


Length: 788 a.a.; Molecular weight: 89371.00 Da; Isoelectric point: 10.0340

>NTDB_id=305110 DV401_RS06080 WP_103708077.1 1200620..1202986(+) (rec2) [Haemophilus influenzae strain M25588]
MKLNLITLAVLLIVADLTLLFLPQPLLLPWQVALVIALVLIFLFIFLRRNFLVSLAFFVASLGYFHYSALSLLQQAQNIT
AQKQVVTFKIQEILHQQDYQTLIATATLANNLQEQRIFLNWKAKEVPQLSEIWQAEISLRPLSARLNFGGFDRQQWYFSK
GITAVGTVKSAVKIADVSSLRAEKLQQVKKQTEGLSLQGLLIALAFGERAWLDKTTWSIYQQTNTAHLIAISGLHIGLAM
GIGFCLARVVQVFFPTRFIHPYFPLVFGVLFALIYAYLAGFSVPTFRAISALVFVFFVQIMRRHYSPLQLFTLVVGFLLF
CDPLMPLSVSFWLSCGAVGCLLLWYRYVPFSLFQWKNRPFSPKVRWILSLFHLQFGLLLFFTPLQLFLFNGLSLSGFLAN
LMAVPIYSFLLVPLILFAVFTNGTMFSWQLANKLAEGITGLISVFQGNWFNVSFNLALVLTALCAGIFMLIIWSIYREPE
VSSSTWKIKRPRFFTLNLSKPLLKTDRINVLRCSFGIILMCFMILLFKQLSKPTWQVDTLDVGQGLATLIVKNGKGILYD
TGSSWRGGSMAELEILPYLQREGIVLEKLILSHDDNDHAGGASTILKAYPNVELITPSRKNYGENYRTFCTAGRDWHWQG
LHFQILSPHNVVTRADNPHSCVILVDDGKNNVLLTGDAEAKNEQIFARTLGKIDVLQVGHHGSKTSTSEYLLSQVRPDVA
IISSGRWNPWKFPHYSVMERLHRYKSAVENTAVSGQVRVNFFQDRLEIQQARTKFSPWYARVIGLSKE

Nucleotide


Length: 2367 bp

>NTDB_id=305110 DV401_RS06080 WP_103708077.1 1200620..1202986(+) (rec2) [Haemophilus influenzae strain M25588]
ATGAAATTAAACTTAATAACTTTAGCTGTCTTGTTAATTGTCGCGGATTTAACGTTGTTATTTCTACCGCAACCGTTGCT
ATTGCCTTGGCAAGTTGCTCTCGTTATTGCGCTTGTTTTGATTTTTCTTTTTATTTTCTTGCGTAGAAATTTCTTAGTTA
GCCTTGCTTTTTTTGTTGCCTCTCTTGGCTATTTTCATTATTCGGCTTTGAGTTTATTACAACAAGCTCAAAATATTACC
GCTCAAAAGCAAGTGGTAACTTTTAAGATTCAAGAAATTTTGCACCAACAGGATTATCAAACGCTTATCGCCACAGCAAC
ATTGGCGAATAATTTGCAAGAACAACGAATTTTCTTAAATTGGAAAGCGAAAGAGGTGCCTCAATTATCGGAAATTTGGC
AAGCTGAAATTTCTTTACGTCCCCTTTCTGCACGATTAAATTTCGGTGGGTTTGATCGGCAACAATGGTATTTTTCAAAA
GGAATTACGGCTGTTGGAACGGTAAAAAGTGCGGTGAAAATTGCGGATGTTTCATCATTGCGAGCAGAAAAATTGCAACA
AGTAAAGAAGCAAACGGAAGGATTATCTCTACAAGGTTTATTGATTGCCTTAGCTTTTGGCGAACGGGCTTGGTTAGATA
AAACCACTTGGTCAATTTACCAACAAACCAATACCGCGCATCTTATTGCTATTTCTGGCTTACATATTGGGTTGGCTATG
GGAATTGGATTTTGCTTGGCGCGTGTTGTGCAAGTGTTCTTTCCTACGCGTTTTATTCATCCTTATTTTCCTTTAGTTTT
TGGTGTTTTATTTGCTTTAATTTATGCGTATTTGGCTGGTTTTAGTGTGCCAACTTTTCGTGCCATTTCAGCACTTGTTT
TCGTTTTCTTCGTTCAAATAATGAGGAGACATTATTCGCCCCTTCAGCTTTTTACGTTGGTTGTCGGATTCTTGCTTTTC
TGCGATCCATTAATGCCGCTTTCGGTCAGTTTTTGGCTTTCTTGTGGGGCGGTTGGTTGTTTGCTCCTCTGGTATCGTTA
TGTGCCTTTTTCACTTTTTCAATGGAAAAATCGCCCTTTTTCTCCAAAAGTGCGGTGGATTTTGAGTTTATTTCATTTGC
AATTTGGGTTATTGCTCTTTTTTACACCTTTGCAACTTTTTCTATTTAATGGCTTATCGTTGAGTGGATTTTTAGCCAAT
CTTATGGCGGTTCCAATTTATAGTTTTTTGCTTGTGCCATTAATTTTATTTGCCGTTTTTACTAACGGCACAATGTTTTC
TTGGCAACTAGCAAACAAGTTAGCCGAAGGAATTACTGGGTTAATTTCTGTTTTTCAAGGGAATTGGTTTAATGTTTCAT
TTAATTTAGCATTGGTTTTAACCGCACTTTGTGCAGGAATTTTTATGTTAATTATTTGGAGTATTTATCGAGAACCAGAG
GTTTCATCATCAACTTGGAAAATTAAACGACCAAGATTTTTTACATTGAATCTCAGTAAACCTTTGCTAAAAACTGATCG
AATCAACGTTTTGCGATGTTCTTTCGGCATTATTCTTATGTGTTTTATGATTTTGTTGTTTAAACAATTGAGCAAGCCAA
CTTGGCAGGTAGATACTTTAGATGTGGGGCAGGGCTTGGCAACGCTGATTGTGAAAAATGGCAAAGGGATTCTTTATGAT
ACGGGTTCTTCTTGGCGAGGTGGAAGTATGGCTGAATTGGAAATTTTGCCTTATTTACAAAGAGAAGGGATTGTTTTGGA
AAAATTGATTTTAAGCCACGACGATAACGATCACGCAGGTGGTGCTTCGACAATTTTAAAGGCGTATCCCAATGTGGAAT
TGATTACCCCTTCACGGAAAAACTATGGGGAAAATTACCGCACTTTTTGTACTGCTGGGCGTGATTGGCATTGGCAAGGG
TTGCATTTTCAAATACTTTCTCCTCACAACGTTGTGACACGAGCTGATAATCCCCATTCTTGTGTGATTTTAGTCGATGA
TGGAAAGAATAACGTTTTGCTAACTGGCGATGCTGAAGCAAAAAATGAGCAAATTTTTGCCCGCACTTTAGGCAAAATCG
ATGTGTTGCAAGTGGGGCATCATGGGAGTAAAACATCGACAAGTGAATACTTGCTTTCTCAGGTTAGACCAGATGTAGCG
ATTATTTCTAGTGGGCGTTGGAATCCGTGGAAATTCCCTCATTATTCGGTTATGGAAAGGCTTCATCGCTATAAAAGTGC
GGTAGAAAATACCGCTGTTTCGGGGCAAGTGCGGGTAAATTTTTTTCAAGACCGATTAGAAATCCAGCAAGCTCGCACAA
AATTTTCCCCTTGGTATGCACGTGTAATTGGATTATCAAAGGAATAA
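
As a consistency check between the two records, the nucleotide sequence should translate to the protein sequence. The sketch below translates only the first 30 and last 21 nucleotides of the record above, using the standard codon assignments (the bacterial table, NCBI translation table 11, assigns the same amino acids to these codons). The function name `translate` and the choice to stop at the first stop codon are illustrative assumptions, not the database's own procedure.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 21 nt of the rec2 nucleotide record.
n_terminus = translate("ATGAAATTAAACTTAATAACTTTAGCTGTC")
c_terminus = translate("ATTGGATTATCAAAGGAATAA")

print(n_terminus, c_terminus)
```

Both fragments agree with the start (MKLNLITLAV) and end (IGLSKE) of the protein record.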


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



[Figure: visualization of predicted probability]


Similar proteins


Only experimentally validated proteins are listed.

| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| rec2 | Haemophilus influenzae Rd KW20 | 96.827 | 100 | 0.968 |
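
The page does not define the Ha-value. The listed numbers are, however, consistent with it being the product of fractional identity and fractional coverage, rounded to three decimals. The sketch below illustrates that reading only; the formula is an assumption, and the function name `ha_value` is hypothetical.

```python
def ha_value(identity_percent, coverage_percent):
    """Assumed definition: (identity / 100) * (coverage / 100)."""
    return (identity_percent / 100.0) * (coverage_percent / 100.0)


# Values from the rec2 (Haemophilus influenzae Rd KW20) row above.
score = round(ha_value(96.827, 100), 3)
print(score)
```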


Multiple sequence alignment