Detailed information    

insolico Bioinformatically predicted

Overview


Name   rec2   Type   Machinery gene
Locus tag   ACHWYH_RS04115 Genome accession   NZ_CP172084
Coordinates   836531..838897 (-) Length   788 a.a.
NCBI ID   WP_112103420.1    Uniprot ID   -
Organism   Haemophilus influenzae strain GA81666     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 831531..843897
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACHWYH_RS04100 kdsB 832821..833585 (-) 765 WP_112095948.1 3-deoxy-manno-octulosonate cytidylyltransferase -
  ACHWYH_RS04105 lpxK 833656..834654 (-) 999 WP_112103421.1 tetraacyldisaccharide 4'-kinase -
  ACHWYH_RS04110 msbA 834727..836490 (-) 1764 WP_011271803.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  ACHWYH_RS04115 rec2 836531..838897 (-) 2367 WP_112103420.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  ACHWYH_RS04120 dksA 839156..839593 (+) 438 WP_011271805.1 RNA polymerase-binding protein DksA -
  ACHWYH_RS04125 pcnB 839841..841199 (+) 1359 WP_011271806.1 polynucleotide adenylyltransferase PcnB -
  ACHWYH_RS04130 folK 841208..841690 (+) 483 WP_011271807.1 2-amino-4-hydroxy-6- hydroxymethyldihydropteridine diphosphokinase -
  ACHWYH_RS04135 tsaE 841766..842242 (+) 477 WP_005693857.1 tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex ATPase subunit type 1 TsaE -
  ACHWYH_RS04140 - 842250..843548 (+) 1299 WP_032824196.1 N-acetylmuramoyl-L-alanine amidase -

Sequence


Protein


Download         Length: 788 a.a.        Molecular weight: 89201.70 Da        Isoelectric Point: 9.8996

>NTDB_id=1065044 ACHWYH_RS04115 WP_112103420.1 836531..838897(-) (rec2) [Haemophilus influenzae strain GA81666]
MKLNLITLAVLLIVADLTLLFLPQPLLLPWQVALVIALVLIFLFIFLRRNFLVSLAFFVASLGYFHYSALSLSQQAQNIT
AQNQVVTFKIQEILHQQDYQTLIATATLENNLQEQRIFLNWKAKEVPQLSEIWQAEISLRPLSARLNFGGFDRQQWYFSK
GITAVGTVKSAVKIADVSSLRAEKLQQVKKQTEGLSLQGLLIALAFGERAWLDKTTWSIYQQTNTAHLIAISGLHIGLAM
GIGVCLARVVQVFCPTRFIHPYFPLVFGVLFALIYAYLAGFSVPTFRAISALVFVLFIQIMRRHYSPIQLFTLVVGFLLF
CDPLMPLSVSFWLSCGAVGCLLLWYRYVPFSLFQWKNRPFSPKVRWIFSLFHLQFGLLLFFTPLQLFLFNGLSLSGFLAN
LMAVPIYSFLLVPLILFAVFTNGTMFSWQLANKLAEGITGLISVFQGNWFNVSFNLALVLTALCAGIFMLIIWSIYREPE
VSSSTWKIKRAKFFTLNLSKPLLKTDRINVLRCSFGIILMCFMILLFKQLSKPTWQVDTLDVGQGLATLIVKNGKGILYD
TGSSWRGGSMAELEILPYLQREGIVLEKLILSHDDNDHAGGASTILKAYPNVELITPSRKNYGENYRTFCTAGRDWHWQG
LHFQILSPHNVATRADNPHSCVILVDDGKNSVLLTGDAEAKNEQIFARTLGKIDVLQVGHHGSKTSTSEYLLSQVRPDVA
IISSGRWNPWKFPHYSVMERLHRYKSAVENTAVSGQVRVNFFQDRLEIQQARTKFSPWYARVIGLSKE

Nucleotide


Download         Length: 2367 bp        

>NTDB_id=1065044 ACHWYH_RS04115 WP_112103420.1 836531..838897(-) (rec2) [Haemophilus influenzae strain GA81666]
ATGAAATTAAACTTAATAACTTTAGCTGTCTTGTTAATTGTCGCGGATTTAACGTTGTTATTTCTACCGCAACCGTTGCT
ATTGCCTTGGCAAGTTGCTCTCGTTATTGCGCTTGTTTTGATTTTTCTTTTTATTTTCTTGCGTAGAAATTTCTTAGTTA
GCCTTGCTTTTTTTGTTGCCTCTCTTGGCTATTTTCATTATTCGGCTTTGAGTTTATCACAACAAGCTCAAAATATTACC
GCTCAAAATCAAGTGGTAACTTTTAAGATTCAAGAAATTTTGCACCAACAGGATTATCAAACGCTTATCGCCACAGCAAC
ATTGGAGAATAATTTGCAAGAACAACGAATTTTCTTAAATTGGAAAGCGAAAGAGGTGCCTCAATTATCGGAAATTTGGC
AAGCTGAAATTTCTTTACGTCCCCTTTCTGCGCGATTAAATTTTGGTGGGTTTGATCGGCAACAATGGTATTTTTCAAAA
GGAATTACGGCTGTTGGAACGGTAAAAAGTGCGGTGAAAATTGCGGATGTTTCATCATTGCGTGCAGAAAAATTGCAACA
AGTGAAAAAACAAACAGAAGGGTTGTCCTTACAAGGTTTATTGATTGCCTTAGCGTTTGGCGAACGGGCTTGGTTAGATA
AAACCACTTGGTCAATTTACCAACAAACCAATACTGCGCATCTTATTGCTATTTCTGGCTTACATATTGGGTTGGCTATG
GGAATTGGAGTTTGCTTGGCGCGTGTTGTGCAAGTCTTTTGCCCCACCCGTTTTATTCATCCTTATTTTCCTTTAGTTTT
TGGTGTTTTATTTGCTTTAATTTATGCGTATTTGGCTGGTTTTAGTGTGCCAACTTTTCGTGCCATTTCAGCACTTGTTT
TCGTTTTATTTATTCAAATAATGAGGCGACATTATTCGCCCATTCAGCTTTTTACGTTGGTTGTCGGATTCTTGCTTTTC
TGCGATCCATTAATGCCGCTTTCGGTCAGTTTTTGGCTTTCTTGTGGAGCGGTTGGGTGTTTGCTCCTCTGGTATCGTTA
TGTGCCTTTTTCACTTTTTCAATGGAAAAATCGCCCTTTTTCTCCAAAAGTGCGGTGGATTTTTAGTTTATTTCATTTGC
AATTTGGGTTATTGCTCTTTTTTACGCCTTTGCAACTTTTTCTATTTAATGGCTTATCGTTAAGTGGATTTTTAGCCAAT
CTTATGGCGGTTCCAATTTATAGTTTTTTGCTTGTGCCATTAATTTTATTTGCCGTTTTTACTAACGGCACAATGTTTTC
TTGGCAACTAGCAAACAAGTTAGCCGAAGGAATTACTGGGTTAATTTCTGTTTTTCAAGGGAATTGGTTTAATGTTTCAT
TTAATTTAGCATTGGTTTTAACCGCACTTTGTGCAGGAATTTTTATGTTAATTATTTGGAGTATTTATCGAGAACCAGAG
GTTTCATCATCAACTTGGAAAATTAAACGAGCAAAATTTTTTACATTGAATCTCAGTAAACCTTTGCTAAAAACTGATCG
AATCAACGTTTTGCGATGTTCTTTCGGCATTATTCTTATGTGTTTTATGATTTTGTTGTTTAAACAATTGAGCAAGCCAA
CTTGGCAGGTAGATACTTTAGATGTGGGGCAGGGCTTGGCAACGCTGATTGTGAAAAATGGCAAAGGGATTCTTTATGAT
ACGGGTTCTTCTTGGCGAGGTGGAAGTATGGCTGAGTTGGAAATTTTGCCTTATTTACAAAGAGAAGGGATTGTTTTGGA
AAAATTGATTTTAAGCCACGACGATAACGATCACGCAGGTGGTGCTTCGACAATTTTAAAGGCGTATCCCAATGTGGAAT
TGATTACCCCTTCACGGAAAAACTATGGGGAAAATTACCGCACTTTTTGTACTGCTGGGCGTGATTGGCATTGGCAAGGG
TTGCATTTTCAAATACTTTCTCCTCACAACGTTGCGACACGAGCTGATAATCCCCATTCTTGTGTGATTTTAGTCGATGA
TGGAAAGAATAGCGTTTTGCTAACTGGCGATGCTGAAGCAAAAAATGAGCAAATTTTTGCCCGCACTTTAGGCAAAATCG
ATGTGTTGCAAGTGGGGCATCATGGGAGTAAAACATCGACAAGTGAATACTTGCTTTCTCAGGTTAGACCAGATGTAGCG
ATTATTTCTAGTGGGCGTTGGAATCCGTGGAAATTCCCTCATTATTCGGTTATGGAAAGGCTTCATCGCTATAAAAGTGC
GGTAGAAAATACCGCTGTTTCGGGGCAAGTGCGGGTAAATTTTTTTCAAGACCGATTAGAAATCCAGCAAGCTCGCACAA
AATTTTCCCCTTGGTATGCGCGTGTAATTGGATTATCAAAGGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  rec2 Haemophilus influenzae Rd KW20

97.462

100

0.975


Multiple sequence alignment