Detailed information    

insolico Bioinformatically predicted

Overview


Name   rec2   Type   Machinery gene
Locus tag   H733_RS03525 Genome accession   NZ_CP007805
Coordinates   718418..720772 (+) Length   784 a.a.
NCBI ID   WP_038439598.1    Uniprot ID   -
Organism   Haemophilus influenzae CGSHiCZ412602     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 713418..725772
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  H733_RS03500 (H733_0670) - 713766..715064 (-) 1299 WP_038439597.1 N-acetylmuramoyl-L-alanine amidase -
  H733_RS03505 (H733_0671) tsaE 715072..715548 (-) 477 WP_005657736.1 tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex ATPase subunit type 1 TsaE -
  H733_RS03510 (H733_0672) folK 715624..716106 (-) 483 WP_005657739.1 2-amino-4-hydroxy-6- hydroxymethyldihydropteridine diphosphokinase -
  H733_RS03515 (H733_0673) pcnB 716115..717473 (-) 1359 WP_005657742.1 polynucleotide adenylyltransferase PcnB -
  H733_RS03520 (H733_0674) dksA 717721..718158 (-) 438 WP_005657744.1 RNA polymerase-binding protein DksA -
  H733_RS03525 (H733_0675) rec2 718418..720772 (+) 2355 WP_038439598.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  H733_RS03530 (H733_0676) msbA 720813..722576 (+) 1764 WP_038439599.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  H733_RS03535 (H733_0677) lpxK 722649..723647 (+) 999 WP_038439600.1 tetraacyldisaccharide 4'-kinase -
  H733_RS03540 (H733_0678) kdsB 723718..724482 (+) 765 WP_038439601.1 3-deoxy-manno-octulosonate cytidylyltransferase -

Sequence


Protein


Download         Length: 784 a.a.        Molecular weight: 89029.53 Da        Isoelectric Point: 10.0133

>NTDB_id=123494 H733_RS03525 WP_038439598.1 718418..720772(+) (rec2) [Haemophilus influenzae CGSHiCZ412602]
MKLNLITLAVLLIVADLTLLFLPQPLLLPWQVALVIALVLIFLFIFLRRNFLVSLAFFVASLGYFHYSALSLLQQAQNIT
AQKQVVTFKIQEILHQQDYQTLIATATLANNLQEQRIFLNWKAKEVPQLSEIWQAEISLRPLSARLNFGGFDRQQWYFSK
GITAVGTVKSAVKIADVSSLRAEKLQQVKKQTEGLSLQGLLIALAFGERAWLDKTTWSIYQQSNTAHLIAISGLHIGLAM
GIGFYLARVVQVFFIHPYFPLAFGVLFALIYAYLAGFSVPTFRAISALVFVLFIQIMRRHYSPLQLFTVVVGFLLFCNPL
MPLSVSFWLSCEAVGCLILWYRYVPFSLFQWKNRPFSPKVRWILSLFHLQFGLLLFFTPLQLFLFNGLSLSGFLANLMAV
PIYSFLLVPLILFAVFTNGTMFFWQLANKLAEGSTGLISVFQGNWLTVSFNLALFLTALCAGIFMLIIWRIYREPEASSS
TWKIKRPRFFTLNLSKPLLKNERINVLRCSFGIILMCFMILLFKQFSKPTWQVDTLDVGQGLATLIVKNGKGILYDTGSS
WRGGSMAELEILPYLQREGIVLEKLILSHDDNDHAGGASTILKAYPNVELITPSQKNYGENYRTFCTAGRDWHWQGLHFQ
ILSPHNVVTRADNPHSCVILVDDGKHSVLLTGDAEAKNEQIFARTLGKIDVLQVGHHGSKTSTSEYLLSQVRPDVAIISS
GRWNPWKFPHYSVMERLHRYKSAVENTAVSGQVRVNFFQDRLEIQQARTKFSPWYARVIGLSKE

Nucleotide


Download         Length: 2355 bp        

>NTDB_id=123494 H733_RS03525 WP_038439598.1 718418..720772(+) (rec2) [Haemophilus influenzae CGSHiCZ412602]
ATGAAATTAAACTTAATAACTTTAGCTGTCTTGTTAATTGTCGCGGATTTAACGTTGTTATTTCTACCGCAACCGTTGCT
ATTGCCTTGGCAAGTTGCTCTCGTTATTGCGCTTGTTTTGATTTTTCTTTTTATTTTCTTGCGTAGAAATTTCTTAGTTA
GCCTTGCTTTTTTTGTTGCCTCTCTTGGCTATTTTCATTATTCGGCTTTGAGTTTATTACAACAAGCTCAAAATATTACC
GCTCAAAAGCAAGTGGTAACTTTTAAGATTCAAGAAATTTTGCACCAACAGGATTATCAAACGCTTATCGCCACAGCAAC
ATTGGCGAATAATTTGCAAGAACAACGAATTTTCTTAAATTGGAAAGCGAAAGAGGTGCCTCAATTATCGGAAATTTGGC
AAGCTGAAATTTCTTTACGTCCCCTTTCTGCGCGATTAAATTTCGGTGGGTTTGATCGGCAACAATGGTATTTTTCAAAA
GGAATTACGGCTGTTGGAACGGTAAAAAGTGCGGTGAAAATTGCGGATGTTTCATCATTGCGAGCAGAAAAATTGCAACA
AGTGAAAAAACAAACAGAAGGGTTGTCCTTACAAGGTTTATTGATTGCCTTAGCTTTTGGCGAACGGGCTTGGTTAGATA
AAACCACTTGGTCAATTTACCAACAAAGCAATACCGCGCATCTTATTGCTATTTCTGGCTTACATATTGGGTTGGCTATG
GGGATTGGATTTTACTTGGCGCGTGTTGTGCAAGTATTTTTTATTCATCCTTATTTTCCTTTAGCTTTTGGTGTTTTATT
TGCTTTAATTTATGCGTATTTGGCTGGTTTTAGCGTGCCAACTTTTCGTGCCATTTCAGCACTTGTTTTCGTTTTATTTA
TTCAAATAATGAGGCGACATTATTCGCCCCTTCAGCTTTTTACGGTGGTTGTCGGATTCTTGCTTTTCTGCAATCCATTA
ATGCCGCTTTCGGTCAGTTTTTGGCTTTCTTGTGAGGCAGTTGGGTGTTTGATCCTCTGGTATCGTTATGTGCCTTTTTC
ACTTTTTCAATGGAAAAATCGCCCCTTTTCACCAAAAGTGCGGTGGATTTTGAGTTTATTTCATTTGCAATTTGGGTTAT
TGCTCTTTTTTACACCTTTGCAACTTTTTCTATTTAATGGCTTATCGTTGAGTGGATTTTTAGCCAATCTTATGGCGGTT
CCAATTTATAGTTTTTTGCTTGTGCCATTAATTTTATTTGCCGTTTTTACTAACGGCACAATGTTTTTTTGGCAACTAGC
AAACAAGTTAGCCGAAGGAAGTACTGGGTTAATTTCTGTTTTTCAAGGAAATTGGCTCACGGTTTCATTTAATTTAGCAT
TATTTTTAACCGCACTTTGTGCAGGAATTTTTATGTTAATTATTTGGCGTATTTATCGAGAACCAGAGGCTTCATCATCA
ACTTGGAAAATTAAACGACCAAGATTTTTTACATTGAATCTCAGTAAACCTTTGCTAAAAAATGAACGAATCAACGTTTT
GCGATGTTCTTTCGGCATTATTCTTATGTGTTTTATGATTTTGTTGTTTAAACAATTTAGTAAGCCAACTTGGCAGGTAG
ATACTTTAGATGTGGGGCAGGGCTTGGCAACGCTGATTGTGAAAAATGGCAAAGGGATTCTTTATGATACGGGTTCTTCT
TGGCGAGGTGGAAGTATGGCTGAGTTGGAAATTTTGCCTTATTTACAAAGAGAAGGGATTGTTTTGGAAAAATTGATTTT
AAGCCACGACGATAACGATCACGCAGGTGGTGCTTCAACAATTTTAAAGGCGTATCCCAATGTGGAATTAATTACCCCTT
CGCAAAAAAATTATGGGGAAAATTACCGCACTTTTTGTACTGCTGGGCGTGATTGGCATTGGCAAGGGTTGCATTTTCAA
ATACTTTCTCCTCACAACGTTGTGACACGAGCTGATAATCCCCATTCTTGTGTAATTTTAGTCGATGATGGAAAGCATAG
CGTTTTGCTAACTGGCGACGCTGAAGCAAAAAATGAGCAAATTTTTGCCCGCACTTTAGGCAAAATCGATGTGTTGCAAG
TGGGGCATCATGGGAGTAAAACATCGACAAGTGAATACTTGCTTTCTCAGGTTAGACCAGATGTAGCGATTATTTCTAGT
GGACGTTGGAATCCGTGGAAATTCCCTCATTATTCGGTTATGGAAAGGCTTCATCGCTATAAAAGTGCGGTAGAAAATAC
CGCTGTTTCGGGGCAAGTGCGGGTAAATTTTTTTCAAGACCGATTAGAAATCCAGCAAGCTCGCACAAAATTTTCCCCTT
GGTATGCGCGTGTAATTGGATTATCAAAGGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  rec2 Haemophilus influenzae Rd KW20

95.685

100

0.962


Multiple sequence alignment