Detailed information    

insolico Bioinformatically predicted

Overview


Name   rec2   Type   Machinery gene
Locus tag   INP91_RS04620 Genome accession   NZ_CP063123
Coordinates   941999..944401 (+) Length   800 a.a.
NCBI ID   WP_197544420.1    Uniprot ID   -
Organism   Haemophilus parainfluenzae strain M1C120_2     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 936999..949401
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  INP91_RS04600 (INP91_04600) dcuC 937789..939147 (+) 1359 WP_197546310.1 anaerobic C4-dicarboxylate transporter DcuC -
  INP91_RS04605 (INP91_04605) folK 939201..939704 (-) 504 WP_197546311.1 2-amino-4-hydroxy-6- hydroxymethyldihydropteridine diphosphokinase -
  INP91_RS04610 (INP91_04610) pcnB 939704..940963 (-) 1260 WP_232086156.1 polynucleotide adenylyltransferase PcnB -
  INP91_RS04615 (INP91_04615) dksA 941298..941735 (-) 438 WP_049384440.1 RNA polymerase-binding protein DksA -
  INP91_RS04620 (INP91_04620) rec2 941999..944401 (+) 2403 WP_197544420.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  INP91_RS04625 (INP91_04625) msbA 944452..946197 (+) 1746 WP_197546312.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  INP91_RS04630 (INP91_04630) lpxK 946251..947231 (+) 981 WP_197544421.1 tetraacyldisaccharide 4'-kinase -
  INP91_RS04635 (INP91_04635) - 947248..947427 (+) 180 WP_049372406.1 Trm112 family protein -
  INP91_RS04640 (INP91_04640) kdsB 947437..948195 (+) 759 WP_049362145.1 3-deoxy-manno-octulosonate cytidylyltransferase -

Sequence


Protein


Download         Length: 800 a.a.        Molecular weight: 90058.56 Da        Isoelectric Point: 9.1530

>NTDB_id=493171 INP91_RS04620 WP_197544420.1 941999..944401(+) (rec2) [Haemophilus parainfluenzae strain M1C120_2]
MKITLFQLCMMLIVSALSLLIVPDFLLFTWQQAGLSILMLVMTALCFHWFNYFKARNFCISGILFLITLAYAHSPALSLL
GQAEKISSLPNKITLDLHISEILHQQDYQTLVATTSLFDGKEQQIFINWKALEKPQVGEIWQAEVKLRPISARLNHGGFD
RQQWYFSKRIIAVGYVKSAVKIEDDFSYRTHFLQNSLKQTEGLSLQGLLIALAFGERAWLDNKTWLIYQQTNTAHLIAIS
GLHIGLAMGIGFFFARLLQLVLPTRFISPWFPLCFGVLIALGYAYLAGFSLPTFRAMMALLFIAILQCSRRYYTPSKMLC
LVVAFLLFCDSIMPLSVSLWLSVGAVTCLIVWYRYVPLSIFEWRHQKLSRKVRWILGLFHLQFGLLILFTPIQLFFFNGF
ALNGFLANLIAVPLYSFLLVPLILFAVLTNGAFSSWPLSNAIAQGITQCLSFFQGSYMPISMNLSLVLTALLSFVFGSML
CGLYFASQTQTKNTPLKSSRFFTLNYTKILSPKAYKQALLGVILIFSVCMSTLGYRYFTKPKWQLDTLDVGQGLATLIVK
EGKGVLYDTGPAWQSGTGGSSMAELEILPYLQREGIELETLILSHDDNDHSGGAKTILTAYPEIELITPSRKSYGEMHRT
FCLQGKQWQWRGLDFKVLSPAQITDRAENPQSCVILLTDGQYQILLTGDVDVATERSFAENLGKINVLQVGHHGSKTSTG
EFLLGQTKPDIALISSGRWNAWNFPHPTVIERLNRYQSAVENSAISGHIRLNFTEKGIEIEKARGDFSPWFARVIGLSPK

Nucleotide


Download         Length: 2403 bp        

>NTDB_id=493171 INP91_RS04620 WP_197544420.1 941999..944401(+) (rec2) [Haemophilus parainfluenzae strain M1C120_2]
ATGAAAATCACGCTATTTCAACTGTGCATGATGCTCATTGTCTCAGCCTTATCCTTGCTTATCGTGCCCGATTTTTTACT
GTTCACTTGGCAACAGGCGGGACTGAGCATTTTAATGTTGGTTATGACCGCACTTTGTTTCCATTGGTTCAATTATTTTA
AAGCCAGAAATTTTTGCATAAGTGGTATCTTGTTTTTGATTACTTTGGCTTATGCGCATTCACCAGCTTTATCGTTATTG
GGACAAGCAGAGAAAATTTCAAGTTTACCGAATAAAATTACCTTAGATCTCCACATCTCAGAAATTCTTCATCAGCAAGA
TTATCAAACGTTGGTTGCGACAACCTCTTTATTTGATGGAAAAGAACAACAGATTTTTATTAATTGGAAAGCGCTGGAAA
AGCCACAAGTAGGTGAAATTTGGCAGGCAGAAGTAAAACTTCGACCTATATCTGCAAGATTAAATCATGGAGGATTTGAT
CGACAGCAATGGTATTTTTCCAAAAGGATTATTGCCGTTGGATATGTGAAAAGTGCGGTCAAAATTGAAGACGATTTTTC
TTATCGCACACATTTTCTACAGAATAGCCTCAAACAGACAGAAGGGCTTTCTTTACAAGGTTTGCTTATTGCATTGGCAT
TTGGTGAGCGGGCTTGGTTGGATAATAAAACGTGGTTGATTTACCAACAAACGAATACAGCACATTTAATTGCGATTTCA
GGGCTTCATATTGGTTTAGCCATGGGGATAGGCTTTTTCTTTGCTCGATTATTGCAGCTCGTACTGCCTACACGCTTTAT
TTCACCCTGGTTTCCACTTTGCTTTGGTGTGTTGATTGCCTTAGGTTATGCCTATTTAGCCGGATTTAGTTTACCGACTT
TCCGTGCAATGATGGCGTTGCTTTTTATTGCAATCCTTCAATGTTCTCGACGATATTACACGCCTTCGAAAATGCTTTGT
TTAGTGGTGGCATTTTTGTTGTTTTGCGATTCGATTATGCCACTTTCGGTGAGCCTTTGGCTTTCAGTTGGTGCTGTTAC
TTGTTTGATTGTGTGGTACCGATATGTTCCCTTATCGATCTTTGAATGGCGACATCAAAAATTATCCCGCAAAGTGCGGT
GGATTTTGGGCTTGTTTCATTTGCAGTTCGGGCTTCTAATTTTATTCACACCTATTCAGCTTTTCTTTTTTAATGGCTTT
GCGTTAAATGGATTTTTGGCGAATTTAATCGCCGTGCCCTTGTATAGTTTCTTACTGGTACCCTTGATTCTTTTTGCGGT
GTTAACGAATGGCGCATTTTCTTCTTGGCCTCTTTCAAATGCGATTGCTCAAGGTATTACGCAATGCCTCAGTTTTTTCC
AGGGCTCGTATATGCCAATTTCGATGAATTTATCCCTCGTTTTGACTGCACTTTTAAGCTTTGTTTTTGGCAGCATGTTA
TGTGGATTGTATTTTGCATCTCAAACTCAAACAAAAAATACACCGTTAAAATCAAGTCGATTTTTTACATTAAATTACAC
AAAAATCTTATCGCCAAAAGCTTATAAGCAAGCTCTTTTAGGCGTTATTCTTATTTTTAGTGTTTGTATGAGTACGCTAG
GGTATCGATATTTTACGAAACCCAAATGGCAGTTAGATACGTTAGATGTTGGTCAAGGTTTAGCAACCTTAATTGTAAAA
GAAGGGAAAGGCGTGTTGTATGATACGGGCCCAGCTTGGCAGAGTGGAACAGGGGGCTCTTCCATGGCGGAATTGGAAAT
ATTGCCTTATCTTCAACGGGAAGGGATTGAACTTGAAACCTTGATTTTAAGCCATGATGATAATGACCATTCAGGCGGTG
CTAAAACGATTTTAACCGCATACCCAGAGATTGAGCTGATTACACCTTCTCGCAAAAGCTATGGCGAAATGCACCGCACT
TTTTGTCTTCAGGGGAAACAATGGCAATGGCGAGGACTTGATTTTAAAGTTCTTTCACCTGCACAAATTACAGACAGGGC
GGAAAATCCGCAATCTTGCGTAATTTTACTAACGGATGGGCAGTATCAGATCCTTTTAACTGGTGATGTTGATGTGGCAA
CAGAAAGATCTTTTGCTGAAAACCTCGGCAAGATTAACGTGTTACAGGTGGGGCATCATGGTAGTAAAACTTCAACAGGC
GAGTTTTTACTGGGACAGACAAAACCGGATATAGCCTTAATTTCAAGTGGACGATGGAATGCGTGGAATTTTCCTCATCC
AACCGTGATTGAACGGTTAAATCGTTATCAAAGTGCGGTCGAAAATAGCGCTATTTCAGGTCATATAAGGTTAAATTTTA
CTGAAAAGGGCATTGAAATTGAAAAAGCGAGAGGAGATTTTTCACCTTGGTTTGCCCGTGTAATTGGATTATCCCCGAAA
TAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  rec2 Haemophilus influenzae Rd KW20

60.424

100

0.605