Detailed information    

insolico Bioinformatically predicted

Overview


Name   HI0659   Type   Machinery gene
Locus tag   MOUSESFB_RS00790 Genome accession   NC_017294
Coordinates   155360..155596 (-) Length   78 a.a.
NCBI ID   WP_005807464.1    Uniprot ID   A0AAV3EI49
Organism   Candidatus Arthromitus sp. SFB-mouse-Yit     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 155360..173284 155360..155596 within 0
Prophage 110438..155059 155360..155596 flank 301


Gene organization within MGE regions


Location: 110438..173284
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MOUSESFB_RS07895 rlmD 110438..111217 (+) 780 WP_007443020.1 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD -
  MOUSESFB_RS00505 (MOUSESFB_0085) - 111270..112460 (-) 1191 WP_005807580.1 site-specific integrase -
  MOUSESFB_RS00510 (MOUSESFB_0086) - 112695..113999 (+) 1305 WP_005807578.1 ATP/GTP-binding protein -
  MOUSESFB_RS00515 (MOUSESFB_0087) - 114000..114587 (+) 588 WP_005807576.1 hypothetical protein -
  MOUSESFB_RS00520 - 115245..115496 (-) 252 WP_005807574.1 LexA family transcriptional regulator -
  MOUSESFB_RS00525 (MOUSESFB_0088) - 115839..116573 (+) 735 WP_005807572.1 DUF3644 domain-containing protein -
  MOUSESFB_RS00530 (MOUSESFB_0089) - 117316..117822 (-) 507 WP_005807569.1 hypothetical protein -
  MOUSESFB_RS00535 (MOUSESFB_0090) - 117859..118350 (-) 492 WP_242821485.1 LexA family transcriptional regulator -
  MOUSESFB_RS07900 - 118357..118473 (-) 117 Protein_93 LexA family transcriptional regulator -
  MOUSESFB_RS00540 (MOUSESFB_0091) - 118648..118866 (+) 219 WP_005807564.1 helix-turn-helix transcriptional regulator -
  MOUSESFB_RS00545 (MOUSESFB_0092) - 118873..119088 (-) 216 WP_005807562.1 helix-turn-helix domain-containing protein -
  MOUSESFB_RS00550 (MOUSESFB_0093) - 119207..119407 (+) 201 WP_005807560.1 helix-turn-helix transcriptional regulator -
  MOUSESFB_RS00555 - 119462..119680 (+) 219 WP_005807558.1 helix-turn-helix domain-containing protein -
  MOUSESFB_RS00560 - 119685..119864 (+) 180 WP_005807556.1 hypothetical protein -
  MOUSESFB_RS00565 - 119950..120216 (+) 267 WP_005807554.1 hypothetical protein -
  MOUSESFB_RS00570 (MOUSESFB_0094) - 120233..120532 (+) 300 WP_005807552.1 hypothetical protein -
  MOUSESFB_RS00575 (MOUSESFB_0095) - 120532..121668 (+) 1137 WP_005807550.1 DUF2800 domain-containing protein -
  MOUSESFB_RS00580 (MOUSESFB_0096) - 121684..122244 (+) 561 WP_005807548.1 DUF2815 family protein -
  MOUSESFB_RS00585 (MOUSESFB_0097) - 122244..124163 (+) 1920 WP_005807546.1 DNA polymerase -
  MOUSESFB_RS00590 (MOUSESFB_0098) - 124179..126674 (+) 2496 WP_005807544.1 virulence-associated E family protein -
  MOUSESFB_RS00595 (MOUSESFB_0099) - 126969..127277 (+) 309 WP_005807543.1 VRR-NUC domain-containing protein -
  MOUSESFB_RS00600 (MOUSESFB_0100) - 127229..128569 (+) 1341 WP_005807541.1 SNF2-related protein -
  MOUSESFB_RS00605 (MOUSESFB_0101) - 128573..129007 (+) 435 WP_005807538.1 hypothetical protein -
  MOUSESFB_RS00610 (MOUSESFB_0102) - 128994..129206 (+) 213 WP_005807536.1 helix-turn-helix domain-containing protein -
  MOUSESFB_RS00615 (MOUSESFB_0103) - 129284..130000 (+) 717 WP_007440106.1 SHOCT domain-containing protein -
  MOUSESFB_RS00620 (MOUSESFB_0104) - 130164..130586 (+) 423 WP_005807533.1 terminase small subunit -
  MOUSESFB_RS00625 (MOUSESFB_0105) - 130583..131014 (+) 432 WP_005807531.1 dATP/dGTP diphosphohydrolase domain-containing protein -
  MOUSESFB_RS00630 (MOUSESFB_0106) - 131029..132276 (+) 1248 WP_005807529.1 PBSX family phage terminase large subunit -
  MOUSESFB_RS00635 (MOUSESFB_0107) - 132317..132796 (-) 480 WP_007440341.1 DUF2442 domain-containing protein -
  MOUSESFB_RS00640 (MOUSESFB_0108) - 132837..133091 (-) 255 WP_007440342.1 DUF4160 domain-containing protein -
  MOUSESFB_RS00645 (MOUSESFB_0109) - 133187..134575 (+) 1389 WP_005807523.1 phage portal protein -
  MOUSESFB_RS00650 (MOUSESFB_0110) - 134568..136151 (+) 1584 WP_005807521.1 minor capsid protein -
  MOUSESFB_RS00655 - 136155..136361 (+) 207 WP_005807519.1 hypothetical protein -
  MOUSESFB_RS00660 (MOUSESFB_0111) - 136385..136735 (+) 351 WP_005806954.1 YjcQ family protein -
  MOUSESFB_RS00665 (MOUSESFB_0112) - 136821..137498 (+) 678 WP_005807517.1 Panacea domain-containing protein -
  MOUSESFB_RS00670 (MOUSESFB_0113) - 137695..138294 (+) 600 WP_005807516.1 phage scaffolding protein -
  MOUSESFB_RS00675 (MOUSESFB_0114) - 138303..139361 (+) 1059 WP_005807514.1 hypothetical protein -
  MOUSESFB_RS00680 (MOUSESFB_0115) - 139368..139748 (+) 381 WP_005807512.1 hypothetical protein -
  MOUSESFB_RS00685 (MOUSESFB_0116) - 139742..140098 (+) 357 WP_014017809.1 hypothetical protein -
  MOUSESFB_RS00690 (MOUSESFB_0117) - 140098..140505 (+) 408 WP_005807507.1 HK97 gp10 family phage protein -
  MOUSESFB_RS00695 (MOUSESFB_0118) - 140502..140930 (+) 429 WP_005807505.1 DUF6838 family protein -
  MOUSESFB_RS00700 - 140935..141123 (+) 189 WP_005807504.1 hypothetical protein -
  MOUSESFB_RS00705 (MOUSESFB_0119) - 141125..142426 (+) 1302 WP_005807500.1 phage tail sheath family protein -
  MOUSESFB_RS00710 (MOUSESFB_0120) - 142438..142914 (+) 477 WP_005807498.1 phage tail tube protein -
  MOUSESFB_RS00715 (MOUSESFB_0121) - 142916..143347 (+) 432 WP_005807496.1 hypothetical protein -
  MOUSESFB_RS00720 - 143542..145419 (+) 1878 WP_005807492.1 tape measure protein -
  MOUSESFB_RS07905 - 145535..146203 (+) 669 WP_242821478.1 phage tail protein -
  MOUSESFB_RS00730 (MOUSESFB_0122) - 146203..146904 (+) 702 WP_005807488.1 hypothetical protein -
  MOUSESFB_RS00735 (MOUSESFB_0123) - 146907..147866 (+) 960 WP_005807486.1 hypothetical protein -
  MOUSESFB_RS00740 (MOUSESFB_0124) - 147859..148179 (+) 321 WP_005807484.1 DUF2577 domain-containing protein -
  MOUSESFB_RS00745 (MOUSESFB_0125) - 148176..148583 (+) 408 WP_005807482.1 DUF2634 domain-containing protein -
  MOUSESFB_RS00750 (MOUSESFB_0126) - 148583..149653 (+) 1071 WP_005807480.1 baseplate J/gp47 family protein -
  MOUSESFB_RS00755 (MOUSESFB_0127) - 149640..150329 (+) 690 WP_005807479.1 putative phage tail protein -
  MOUSESFB_RS00760 (MOUSESFB_0128) - 150316..151080 (+) 765 WP_005807478.1 hypothetical protein -
  MOUSESFB_RS00765 (MOUSESFB_0129) - 151085..151930 (+) 846 WP_007441037.1 hypothetical protein -
  MOUSESFB_RS00770 (MOUSESFB_0130) - 151930..153639 (+) 1710 WP_005807474.1 phage tail protein -
  MOUSESFB_RS00775 (MOUSESFB_0131) - 153833..154264 (+) 432 WP_007439697.1 holin family protein -
  MOUSESFB_RS00780 (MOUSESFB_0132) - 154409..155059 (+) 651 WP_005807468.1 N-acetylmuramoyl-L-alanine amidase -
  MOUSESFB_RS00785 (MOUSESFB_0133) - 155106..155354 (-) 249 WP_005807466.1 DUF4160 domain-containing protein -
  MOUSESFB_RS00790 (MOUSESFB_0134) HI0659 155360..155596 (-) 237 WP_005807464.1 helix-turn-helix domain-containing protein Machinery gene
  MOUSESFB_RS00795 (MOUSESFB_0135) - 155604..155861 (-) 258 WP_005807462.1 DUF2442 domain-containing protein -
  MOUSESFB_RS00800 - 156054..156266 (+) 213 WP_005807460.1 DUF4214 domain-containing protein -
  MOUSESFB_RS00805 (MOUSESFB_0136) - 156687..158219 (+) 1533 WP_005807458.1 hypothetical protein -
  MOUSESFB_RS00810 (MOUSESFB_0137) lepB 158289..158981 (+) 693 WP_005807456.1 signal peptidase I -
  MOUSESFB_RS00815 (MOUSESFB_0138) - 159064..161151 (+) 2088 WP_005807454.1 UvrD-helicase domain-containing protein -
  MOUSESFB_RS00820 (MOUSESFB_0139) - 161148..162134 (-) 987 WP_005807452.1 hypothetical protein -
  MOUSESFB_RS00825 (MOUSESFB_0140) - 162207..163421 (-) 1215 WP_014518655.1 cation diffusion facilitator family transporter -
  MOUSESFB_RS00830 (MOUSESFB_0141) - 163527..164726 (+) 1200 WP_005807447.1 aspartate kinase -
  MOUSESFB_RS00835 (MOUSESFB_0142) lysA 164741..166021 (+) 1281 WP_005807445.1 diaminopimelate decarboxylase -
  MOUSESFB_RS00840 (MOUSESFB_0143) - 166089..167228 (+) 1140 WP_005807444.1 hypothetical protein -
  MOUSESFB_RS00845 (MOUSESFB_0144) - 167250..168530 (+) 1281 WP_005807442.1 M18 family aminopeptidase -
  MOUSESFB_RS00850 (MOUSESFB_0145) ispF 168543..169028 (-) 486 WP_005807440.1 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase -
  MOUSESFB_RS00855 (MOUSESFB_0146) - 169042..169773 (-) 732 WP_005807438.1 pseudouridine synthase -
  MOUSESFB_RS00860 (MOUSESFB_0147) - 169822..170757 (-) 936 WP_005807437.1 hypothetical protein -
  MOUSESFB_RS00865 (MOUSESFB_0148) - 171011..171763 (+) 753 WP_005807435.1 DUF3298 and DUF4163 domain-containing protein -
  MOUSESFB_RS00870 - 171859..172176 (+) 318 Protein_160 ArsC/Spx/MgsR family protein -
  MOUSESFB_RS00875 (MOUSESFB_0149) - 172196..172825 (-) 630 WP_005807431.1 hypothetical protein -
  MOUSESFB_RS00880 (MOUSESFB_0150) - 172925..173284 (-) 360 WP_005807428.1 DUF1540 domain-containing protein -

Sequence


Protein


Download         Length: 78 a.a.        Molecular weight: 8873.63 Da        Isoelectric Point: 10.5513

>NTDB_id=45950 MOUSESFB_RS00790 WP_005807464.1 155360..155596(-) (HI0659) [Candidatus Arthromitus sp. SFB-mouse-Yit]
MIEEKIKEIVSEIIKLRQENGLTQKQLEELSNIKQPIIARMEKGTNIPQLNTILKLLKPLGKTIAIVPLETTKPTKSR

Nucleotide


Download         Length: 237 bp        

>NTDB_id=45950 MOUSESFB_RS00790 WP_005807464.1 155360..155596(-) (HI0659) [Candidatus Arthromitus sp. SFB-mouse-Yit]
ATGATTGAAGAAAAGATAAAAGAAATTGTATCCGAAATAATAAAACTTAGACAGGAAAATGGACTCACACAGAAACAACT
TGAAGAGTTAAGCAACATAAAACAACCAATTATAGCGAGGATGGAAAAAGGAACAAATATACCACAACTTAACACTATTT
TAAAATTACTAAAACCCTTAGGGAAAACAATTGCAATAGTTCCTCTTGAAACAACAAAACCAACTAAATCGAGGTGA

Domains


Predicted by InterproScan.

(13-65)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  HI0659 Haemophilus influenzae Rd KW20

57.534

93.59

0.538


Multiple sequence alignment