Detailed information    

In silico: bioinformatically predicted

Overview


Name: rec2
Type: Machinery gene
Locus tag: LSO74_RS05735
Genome accession: NZ_OV040719
Coordinates: 1103211..1105577 (+)
Length: 788 a.a.
NCBI ID: WP_005657747.1
UniProt ID: A0A0H3PCV2
Organism: Haemophilus influenzae strain 3655 isolate 3655
Function: ssDNA transport through the inner membrane (predicted from homology)
DNA binding and uptake
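
The coordinates and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene is a single uninterrupted reading frame ending in one stop codon; the variable names are illustrative only.

```python
# Values copied from the overview above.
start, end = 1103211, 1105577   # assumed 1-based, inclusive coordinates
protein_length_aa = 788

# Inclusive coordinates: length = end - start + 1.
gene_length_bp = end - start + 1

# Assumption: one continuous reading frame whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)            # 2367
print(expected_protein_length)   # 788
```

Under these assumptions the 2367 bp gene gives 789 codons, which matches the 788-residue protein plus a stop codon.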

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates     Gene coordinates    Relative position   Distance (bp)
Prophage   1049730..1107381    1103211..1105577    within              0
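
The "within" label and the distance of 0 follow from comparing the two coordinate ranges. The sketch below is one way to derive them, assuming 1-based inclusive coordinates. The function name and the labels other than "within" are hypothetical and are not taken from VRprofile2 or from this entry.

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval (1-based, inclusive).

    Returns (label, distance_bp). Only the "within" label appears in the
    entry; the other labels are hypothetical and purely coordinate-based.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "before", m_start - g_end
    if g_start > m_end:
        return "after", g_start - m_end
    return "overlapping", 0


label, distance = relative_position((1103211, 1105577), (1049730, 1107381))
print(label, distance)  # within 0
```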


Gene organization within MGE regions


Location: 1049730..1107381
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LSO74_RS05405 (KRLU3655_LOCUS1010) - 1049730..1049990 (-) 261 WP_005658398.1 hypothetical protein -
  LSO74_RS05410 (KRLU3655_LOCUS1011) - 1050003..1051055 (-) 1053 WP_005658399.1 WG repeat-containing protein -
  LSO74_RS05415 (KRLU3655_LOCUS1012) - 1051056..1052417 (-) 1362 WP_005658402.1 hypothetical protein -
  LSO74_RS05420 (KRLU3655_LOCUS1013) - 1052417..1053565 (-) 1149 WP_005658404.1 TRAFAC clade GTPase domain-containing protein -
  LSO74_RS05425 (KRLU3655_LOCUS1014) - 1053574..1054419 (-) 846 WP_005661139.1 hypothetical protein -
  LSO74_RS05430 (KRLU3655_LOCUS1015) - 1054661..1054924 (+) 264 WP_005689965.1 hypothetical protein -
  LSO74_RS05435 (KRLU3655_LOCUS1016) - 1054964..1055809 (-) 846 WP_105896994.1 hypothetical protein -
  LSO74_RS05440 (KRLU3655_LOCUS1017) - 1055866..1055985 (-) 120 WP_005641687.1 Com family DNA-binding transcriptional regulator -
  LSO74_RS05445 (KRLU3655_LOCUS1018) - 1056161..1056661 (-) 501 WP_005659026.1 hypothetical protein -
  LSO74_RS05450 (KRLU3655_LOCUS1019) - 1056658..1057260 (-) 603 WP_005661624.1 hypothetical protein -
  LSO74_RS05455 (KRLU3655_LOCUS1020) - 1057272..1058981 (-) 1710 WP_005658853.1 phage tail protein -
  LSO74_RS05460 (KRLU3655_LOCUS1021) - 1059013..1059582 (-) 570 WP_005658855.1 phage tail protein -
  LSO74_RS05465 (KRLU3655_LOCUS1022) - 1059575..1060681 (-) 1107 WP_005658857.1 baseplate J/gp47 family protein -
  LSO74_RS05470 (KRLU3655_LOCUS1023) - 1060681..1061046 (-) 366 WP_005658860.1 GPW/gp25 family protein -
  LSO74_RS05475 (KRLU3655_LOCUS1024) - 1061101..1061709 (-) 609 WP_005658861.1 phage baseplate assembly protein V -
  LSO74_RS05480 (KRLU3655_LOCUS1025) - 1061696..1062760 (-) 1065 WP_005658863.1 phage late control D family protein -
  LSO74_RS05485 (KRLU3655_LOCUS1026) - 1062753..1062983 (-) 231 WP_005658865.1 tail protein X -
  LSO74_RS05490 (KRLU3655_LOCUS1027) - 1062964..1063896 (-) 933 WP_005658868.1 phage tail protein -
  LSO74_RS05495 (KRLU3655_LOCUS1028) - 1063896..1066244 (-) 2349 WP_005658870.1 phage tail tape measure protein -
  LSO74_RS05500 (KRLU3655_LOCUS1029) - 1066282..1066515 (+) 234 WP_005658872.1 hypothetical protein -
  LSO74_RS05505 - 1066531..1066671 (-) 141 WP_005658873.1 GpE family phage tail protein -
  LSO74_RS05510 (KRLU3655_LOCUS1030) - 1066692..1066967 (-) 276 WP_005658875.1 phage tail assembly protein -
  LSO74_RS05515 (KRLU3655_LOCUS1031) - 1067061..1067579 (-) 519 WP_005661614.1 phage major tail tube protein -
  LSO74_RS05520 (KRLU3655_LOCUS1032) - 1067590..1068975 (-) 1386 WP_005658882.1 phage tail sheath family protein -
  LSO74_RS05525 (KRLU3655_LOCUS1033) - 1068985..1069482 (-) 498 WP_005658884.1 Gp37 family protein -
  LSO74_RS05530 (KRLU3655_LOCUS1034) - 1069494..1069928 (-) 435 WP_005658887.1 gp436 family protein -
  LSO74_RS05535 (KRLU3655_LOCUS1035) - 1069928..1070245 (-) 318 WP_005658889.1 hypothetical protein -
  LSO74_RS05540 (KRLU3655_LOCUS1036) - 1070311..1071236 (-) 926 Protein_1046 hypothetical protein -
  LSO74_RS05545 (KRLU3655_LOCUS1037) - 1071255..1072358 (-) 1104 WP_005658895.1 hypothetical protein -
  LSO74_RS05550 (KRLU3655_LOCUS1038) - 1072601..1073101 (-) 501 WP_005658896.1 phage virion morphogenesis protein -
  LSO74_RS05555 (KRLU3655_LOCUS1040) - 1073304..1073492 (-) 189 WP_005658899.1 hypothetical protein -
  LSO74_RS05560 (KRLU3655_LOCUS1041) - 1073679..1074956 (-) 1278 WP_005658901.1 phage minor head protein -
  LSO74_RS05565 (KRLU3655_LOCUS1042) - 1074957..1076372 (-) 1416 WP_005658903.1 DUF935 domain-containing protein -
  LSO74_RS05570 (KRLU3655_LOCUS1043) - 1076374..1077696 (-) 1323 WP_005658905.1 terminase large subunit domain-containing protein -
  LSO74_RS05575 (KRLU3655_LOCUS1044) - 1077680..1078228 (-) 549 WP_005658907.1 DUF3486 family protein -
  LSO74_RS05580 (KRLU3655_LOCUS1045) - 1078238..1078540 (-) 303 WP_005658909.1 hypothetical protein -
  LSO74_RS05585 (KRLU3655_LOCUS1046) - 1078537..1078845 (-) 309 WP_005658911.1 hypothetical protein -
  LSO74_RS05590 (KRLU3655_LOCUS1047) - 1078969..1079226 (-) 258 WP_005658913.1 DUF2681 domain-containing protein -
  LSO74_RS05595 (KRLU3655_LOCUS1048) - 1079223..1079453 (-) 231 WP_005658916.1 DUF2644 domain-containing protein -
  LSO74_RS05600 (KRLU3655_LOCUS1051) - 1079693..1080229 (-) 537 WP_005658923.1 N-acetylmuramoyl-L-alanine amidase -
  LSO74_RS05605 (KRLU3655_LOCUS1052) - 1080316..1080735 (-) 420 WP_005658925.1 hypothetical protein -
  LSO74_RS05610 (KRLU3655_LOCUS1053) - 1080809..1081183 (-) 375 WP_005658927.1 Mor transcription activator family protein -
  LSO74_RS05615 (KRLU3655_LOCUS1054) - 1081187..1081675 (-) 489 WP_005658929.1 gp16 family protein -
  LSO74_RS05620 (KRLU3655_LOCUS1055) - 1081784..1082752 (-) 969 WP_005658930.1 hypothetical protein -
  LSO74_RS05625 (KRLU3655_LOCUS1056) - 1082834..1083034 (-) 201 WP_005658932.1 hypothetical protein -
  LSO74_RS05630 (KRLU3655_LOCUS1057) - 1083031..1083195 (-) 165 WP_006996622.1 hypothetical protein -
  LSO74_RS05635 (KRLU3655_LOCUS1058) - 1083206..1083403 (-) 198 WP_005660802.1 ANR family transcriptional regulator -
  LSO74_RS05640 (KRLU3655_LOCUS1059) - 1083508..1084026 (-) 519 WP_005660799.1 host-nuclease inhibitor Gam family protein -
  LSO74_RS05645 (KRLU3655_LOCUS1060) - 1084048..1084251 (-) 204 WP_005657708.1 hypothetical protein -
  LSO74_RS05650 (KRLU3655_LOCUS1061) - 1084251..1085168 (-) 918 WP_005657710.1 AAA family ATPase -
  LSO74_RS05655 (KRLU3655_LOCUS1062) - 1085195..1087204 (-) 2010 WP_005657712.1 transposase domain-containing protein -
  LSO74_RS05660 (KRLU3655_LOCUS1063) - 1087215..1087442 (-) 228 WP_005657714.1 transcriptional regulator -
  LSO74_RS05665 (KRLU3655_LOCUS1064) - 1087653..1088153 (+) 501 WP_005657715.1 DNA-binding protein -
  LSO74_RS05670 (KRLU3655_LOCUS1065) - 1088483..1088758 (-) 276 WP_005657717.1 hypothetical protein -
  LSO74_RS09725 - 1088954..1089116 (-) 163 Protein_1073 DUF417 family protein -
  LSO74_RS05680 (KRLU3655_LOCUS1067) grpE 1089317..1089913 (-) 597 WP_136427515.1 nucleotide exchange factor GrpE -
  LSO74_RS05685 (KRLU3655_LOCUS1068) - 1090013..1090903 (+) 891 WP_005657722.1 NAD(+) kinase -
  LSO74_RS05690 (KRLU3655_LOCUS1069) recN 1091015..1092691 (+) 1677 WP_005657724.1 DNA repair protein RecN -
  LSO74_RS05695 (KRLU3655_LOCUS1070) glnE 1092775..1095720 (-) 2946 WP_005657726.1 bifunctional [glutamate--ammonia ligase]-adenylyl-L-tyrosine phosphorylase/[glutamate--ammonia-ligase] adenylyltransferase -
  LSO74_RS05700 (KRLU3655_LOCUS1071) miaA 1095726..1096661 (-) 936 WP_005657728.1 tRNA (adenosine(37)-N6)-dimethylallyltransferase MiaA -
  LSO74_RS05705 (KRLU3655_LOCUS1072) mutL 1096669..1098558 (-) 1890 WP_005657730.1 DNA mismatch repair endonuclease MutL -
  LSO74_RS05710 (KRLU3655_LOCUS1073) - 1098559..1099857 (-) 1299 WP_005657733.1 N-acetylmuramoyl-L-alanine amidase -
  LSO74_RS05715 (KRLU3655_LOCUS1074) tsaE 1099865..1100341 (-) 477 WP_005657736.1 tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex ATPase subunit type 1 TsaE -
  LSO74_RS05720 (KRLU3655_LOCUS1075) folK 1100417..1100899 (-) 483 WP_005657739.1 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine diphosphokinase -
  LSO74_RS05725 (KRLU3655_LOCUS1076) pcnB 1100908..1102266 (-) 1359 WP_005657742.1 polynucleotide adenylyltransferase PcnB -
  LSO74_RS05730 (KRLU3655_LOCUS1077) dksA 1102514..1102951 (-) 438 WP_005657744.1 RNA polymerase-binding protein DksA -
  LSO74_RS05735 (KRLU3655_LOCUS1078) rec2 1103211..1105577 (+) 2367 WP_005657747.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  LSO74_RS05740 (KRLU3655_LOCUS1079) msbA 1105618..1107381 (+) 1764 WP_005657750.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
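
Rows of the table above can be parsed and checked for internal consistency (size equals end minus start plus one). The sketch below uses three rows copied from the table with leading whitespace trimmed. The regular expression is a hypothetical pattern written for this layout, not a format specification from the database.

```python
import re

# Three rows copied from the table above (leading whitespace trimmed).
rows = [
    "LSO74_RS05540 (KRLU3655_LOCUS1036) - 1070311..1071236 (-) 926 Protein_1046 hypothetical protein -",
    "LSO74_RS09725 - 1088954..1089116 (-) 163 Protein_1073 DUF417 family protein -",
    "LSO74_RS05735 (KRLU3655_LOCUS1078) rec2 1103211..1105577 (+) 2367 WP_005657747.1 "
    "DNA internalization-related competence protein ComEC/Rec2 Machinery gene",
]

# Hypothetical pattern: locus tag, optional second locus tag in parentheses,
# gene name, coordinates, strand, size in bp, protein ID.
ROW = re.compile(
    r"^(?P<locus>\S+)(?: \((?P<old_locus>[^)]+)\))? (?P<gene>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+) \((?P<strand>[+-])\) "
    r"(?P<size>\d+) (?P<protein_id>\S+) "
)

parsed = []
for row in rows:
    rec = ROW.match(row).groupdict()
    for key in ("start", "end", "size"):
        rec[key] = int(rec[key])
    # Inclusive coordinates should reproduce the listed size.
    rec["size_matches"] = rec["end"] - rec["start"] + 1 == rec["size"]
    # A complete coding sequence has a length divisible by three.
    rec["in_frame"] = rec["size"] % 3 == 0
    parsed.append(rec)

for rec in parsed:
    print(rec["locus"], rec["gene"], rec["size"], rec["size_matches"], rec["in_frame"])
```

In this sample the two rows with a `Protein_` identifier have sizes that are not multiples of three, while the rec2 row does. Whether those rows are pseudogenes or partial features is an interpretation, not something the entry states.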

Sequence


Protein


Length: 788 a.a.        Molecular weight: 89341.04 Da        Isoelectric point: 10.0762

>NTDB_id=1151780 LSO74_RS05735 WP_005657747.1 1103211..1105577(+) (rec2) [Haemophilus influenzae strain 3655 isolate 3655]
MKLNLITLAVLLIVADLTLLFLPQPLLLPWQVALVIALVLIFLFIFLRRNFLVSLAFFVASLGYFHYSALSLLQQAQNIT
AQKQVVTFKIQEILHQQDYQTLIATATLANNLQEQRIFLNWKAKEVPQLSEIWQAEISLRPLSARLNFGGFDRQQWYFSK
GITVVGTVKSAVKIADVSSLRAEKLQQVKKQTEGLSLQGLLIALAFGERAWLDKTTWSIYQQTNTAHLIAISGLHIGLAM
GIGFCLVRVMQVFLPTRFIHPYFPLVFGVLLALIYAYLAGFSVPTFRAISALVFVLFIQIMRRHYSPIQLFTVVVGFLLF
CNPLMPLSVSFWLSCGAVGCLILWYRYVPFSLFQWKNRPFSPKVRWILSLFHLQFGLLLFFTPLQLFLFNGLSLSGFLAN
LMAVPIYSFLLVPLILFAVFTNGTMFSWQLANKLAEGITGLISVFQGNWLTVSFNLALFLTALCAGIFMLIIWRIYREPE
ASSSTWKIKRPRFFTLNLSKPLLKNDRINVLRCSFGIILLCFTILLFKQFSKPAWQVDTLDVGQGLATLIVKNGKGILYD
TGSSWRGGSMAELEILPYLQREGIVLEKLILSHDDNDHAGGASTILKAYPNVELITPSRKNYGENYRTFCTAGRDWHWQG
LHFQILSPHNVVTRADNPHSCVILVDDGKNSVLLTGDAEAKNEQIFARTLGKIDVLQVGHHGSKTSTSEYLLSQVRPDVA
IISSGRWNPWKFPHYSVMERLHRYKSAVENTAVSGQVRVNFFKDRLEIQQARTEFSPWYARVIGLSKE

Nucleotide


Length: 2367 bp

>NTDB_id=1151780 LSO74_RS05735 WP_005657747.1 1103211..1105577(+) (rec2) [Haemophilus influenzae strain 3655 isolate 3655]
ATGAAATTAAACTTAATAACTTTAGCTGTCTTGTTAATTGTCGCGGATTTAACGTTGTTATTTCTACCGCAACCGTTGCT
ATTGCCTTGGCAAGTTGCTCTCGTTATTGCGCTTGTTTTGATTTTTCTTTTTATTTTCTTGCGTAGAAATTTCTTAGTTA
GCCTTGCTTTTTTTGTTGCCTCTCTTGGCTATTTTCATTATTCGGCTTTGAGTTTATTACAACAAGCTCAAAATATTACC
GCTCAAAAGCAAGTGGTAACTTTTAAGATTCAAGAAATTTTGCACCAACAGGATTATCAAACGCTTATCGCCACAGCAAC
ATTGGCGAATAATTTGCAAGAACAACGAATTTTCTTAAATTGGAAAGCGAAAGAGGTGCCTCAATTATCGGAAATTTGGC
AAGCTGAAATTTCTTTACGTCCCCTTTCTGCACGATTAAATTTTGGTGGGTTTGATCGGCAACAATGGTATTTTTCAAAA
GGAATTACTGTTGTTGGAACGGTAAAAAGTGCGGTGAAAATTGCGGATGTTTCATCATTGCGTGCAGAAAAATTGCAACA
AGTAAAGAAGCAAACGGAAGGATTATCTCTACAAGGTTTATTGATTGCCTTAGCTTTTGGCGAACGGGCTTGGTTAGATA
AAACCACTTGGTCAATTTACCAACAAACCAATACCGCACATCTTATTGCTATTTCTGGCTTACATATTGGGTTGGCTATG
GGAATTGGATTTTGCTTGGTGCGTGTTATGCAAGTATTTTTACCCACCCGTTTTATTCATCCTTATTTTCCTTTAGTTTT
TGGTGTTTTATTGGCTTTAATTTATGCGTATTTGGCTGGTTTTAGCGTGCCAACTTTTCGTGCCATTTCAGCACTTGTTT
TCGTTTTATTTATTCAAATAATGAGGCGACATTATTCGCCCATTCAGCTTTTTACGGTGGTTGTCGGATTCTTGCTTTTC
TGCAATCCATTAATGCCGCTTTCGGTCAGTTTTTGGCTTTCTTGTGGGGCGGTTGGGTGTTTGATCCTCTGGTATCGTTA
TGTGCCTTTTTCTCTTTTTCAATGGAAAAATCGCCCCTTTTCACCAAAAGTGCGGTGGATTTTGAGTTTATTTCATTTGC
AATTTGGGTTATTGCTCTTTTTTACACCTTTGCAACTTTTTCTATTTAATGGCTTATCGTTGAGTGGATTTTTAGCCAAT
CTTATGGCGGTTCCAATTTATAGTTTTTTGCTTGTGCCATTAATTTTATTTGCCGTTTTTACTAACGGCACAATGTTTTC
TTGGCAACTAGCAAACAAGTTAGCCGAAGGAATTACTGGGTTAATTTCTGTTTTTCAAGGAAATTGGCTCACGGTTTCAT
TTAATTTAGCATTATTTTTAACCGCACTTTGTGCAGGAATTTTTATGTTAATTATTTGGCGTATTTATCGAGAACCAGAG
GCTTCATCATCAACTTGGAAAATTAAACGACCAAGATTTTTTACATTAAATCTCAGTAAACCTTTGCTAAAAAATGATCG
AATCAACGTTTTGCGATGTTCTTTCGGCATTATCTTACTGTGTTTTACGATTTTGTTGTTTAAACAATTTAGTAAGCCAG
CTTGGCAGGTAGATACTTTAGATGTGGGGCAGGGCTTGGCAACGCTGATTGTGAAAAATGGCAAAGGGATTCTTTATGAT
ACGGGTTCTTCTTGGCGAGGTGGAAGTATGGCTGAGTTGGAAATTTTGCCTTATTTACAAAGAGAAGGGATTGTTTTGGA
AAAATTGATTTTAAGCCACGACGATAACGATCACGCAGGTGGTGCTTCGACAATTTTAAAGGCGTATCCCAATGTGGAAT
TGATTACCCCTTCACGGAAAAACTATGGGGAAAATTACCGCACTTTTTGTACTGCTGGGCGTGATTGGCATTGGCAAGGG
TTGCATTTTCAAATACTTTCTCCTCACAATGTTGTGACACGAGCTGATAATCCCCATTCTTGTGTGATTTTAGTCGATGA
TGGAAAGAATAGTGTTTTGCTAACTGGCGATGCTGAAGCAAAAAATGAGCAAATTTTTGCCCGCACTTTAGGCAAAATCG
ATGTGTTGCAAGTGGGGCATCATGGGAGTAAAACATCGACAAGTGAATACTTGCTTTCTCAGGTTAGACCAGATGTAGCG
ATTATTTCTAGTGGGCGTTGGAATCCGTGGAAATTCCCTCATTATTCGGTTATGGAAAGGCTTCATCGCTATAAAAGTGC
GGTAGAAAATACCGCTGTTTCAGGGCAAGTGCGGGTAAATTTTTTTAAAGACCGATTAGAAATCCAACAAGCACGCACAG
AATTTTCCCCTTGGTATGCGCGTGTAATTGGATTATCAAAGGAATAA
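
The nucleotide and protein records above can be checked against each other by translating the start of the coding sequence. The sketch below translates the first line of the nucleotide record and compares it with the start of the protein record. It assumes the standard codon assignments, which as far as is known are shared by the bacterial translation table; the entry itself does not name a translation table, and the helper names are illustrative.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumption:
# the entry does not state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate complete codons in frame 1; trailing 1-2 bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the nucleotide record and start of the protein record above.
first_line = (
    "ATGAAATTAAACTTAATAACTTTAGCTGTCTTGTTAATTGTCGCGGATTTAACGTTGTTATTTCTACCGCAACCGTTGCT"
)
protein_start = "MKLNLITLAVLLIVADLTLLFLPQPLLLPW"

peptide = translate(first_line)
print(peptide)                            # MKLNLITLAVLLIVADLTLLFLPQPL
print(protein_start.startswith(peptide))  # True
print(translate("TAA"))                   # * (the record ends with TAA)
```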


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source         ID
AlphaFold DB   A0A0H3PCV2

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism                         Identities (%)   Coverage (%)   Ha-value
rec2      Haemophilus influenzae Rd KW20   96.447           100            0.964
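
The entry does not define the Ha-value. One reading that is consistent with the numbers shown is the product of fractional identity and fractional coverage. The sketch below treats that formula as an assumption, not as the database's documented definition; the function name is hypothetical.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed definition: (identity / 100) * (coverage / 100), rounded to 3 places."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# Values from the similar-protein row above.
print(ha_value(96.447, 100))  # 0.964
```

Under this assumed definition, a hit with full coverage has an Ha-value equal to its fractional identity, which is what the row shows.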


Multiple sequence alignment