Benchmark/rf use case #294

Draft
wants to merge 39 commits into main
Conversation

cxzhang4 (Collaborator)

No description provided.

library(data.table)
library(tidytable)

cc18_collection = ocl(99)
Member

Suggested change:
- cc18_collection = ocl(99)
+ options(mlr3oml.cache = TRUE)
+ cc18_collection = ocl(99)

Member

You can also add this to your .Rprofile.
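For illustration, a minimal sketch of what that could look like in a user-level .Rprofile (the file location is the usual default, not something specified in this thread):

# ~/.Rprofile: enable the mlr3oml download cache in every R session
options(mlr3oml.cache = TRUE)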


library(here)

# define the tasks
cxzhang4 (Collaborator, Author)

tsk("oml", task_id = 1067)

Similarly hard-code the task ID for every task. Note that this will ignore the resampling defined on OpenML.
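A rough sketch of that hard-coding, assuming the tasks are built one by one via mlr3oml (only task 1067 is mentioned in this thread; the remaining CC-18 IDs would be added the same way):

library(mlr3)
library(mlr3oml)

# hard-coded OpenML task IDs (only 1067 appears in this discussion)
task_ids = c(1067)
tasks = lapply(task_ids, function(id) tsk("oml", task_id = id))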

# define the learners
mlp = lrn("classif.mlp",
  activation = nn_relu,
  neurons = to_tune(
sebffischer (Member), Oct 15, 2024

Suggested change:
- neurons = to_tune(
+ neurons = to_tune(ps(
+   n_layers = p_int(lower = 1, upper = 10),
+   latent = p_int(10, 500),
+   .extra_trafo = function(x, param_set) {
+     list(neurons = rep(x$latent, x$n_layers))
+   }
+ ))

cxzhang4 (Collaborator, Author)

I think this won't work, because parameter transformations are not allowed in combination with inner tuning. When I try to run the experiment I get this error:

Error: Inner tuning and parameter transformations are currently not supported.

cxzhang4 (Collaborator, Author), Oct 17, 2024

My solution for now:

n_layers_values <- 1:10
latent_dim_values <- seq(10, 500, by = 10)

# helper that expands one (n_layers, latent_dim) pair into a concrete neurons vector
neurons <- function(n_layers, latent_dim) rep(latent_dim, n_layers)

# every combination of layer count and layer width
neuron_grid <- expand.grid(n_layers = n_layers_values, latent_dim = latent_dim_values)
neurons_search_space <- mapply(
  neurons,
  neuron_grid$n_layers,
  neuron_grid$latent_dim,
  SIMPLIFY = FALSE
)

mlp = lrn("classif.mlp",
  activation = nn_relu,
  # neurons = to_tune(ps(
  #   n_layers = p_int(lower = 1, upper = 10), latent = p_int(10, 500),
  #   .extra_trafo = function(x, param_set) {
  #     list(neurons = rep(x$latent, x$n_layers))
  #   })
  # ),
  neurons = to_tune(neurons_search_space)
  # ... remaining arguments (batch_size, p, epochs, ...) unchanged
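With these grids this produces 10 × 50 = 500 candidate neurons vectors, so a grid search over this dimension alone already needs far more than 10 evaluations.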

      c(10, 10), c(10, 20), c(20, 10), c(20, 20)
    )
  ),
  batch_size = to_tune(16, 32, 64),
cxzhang4 (Collaborator, Author)

Go bigger.

Suggested change:
- batch_size = to_tune(16, 32, 64),
+ batch_size = to_tune(16, 32, 64, 128, 256),

tuner = tnr("grid_search"),
resampling = rsmp("cv"),
measure = msr("classif.acc"),
term_evals = 10
cxzhang4 (Collaborator, Author)

We likely need more than 10 evaluations.

Suggested change:
- term_evals = 10
+ term_evals = 100

cxzhang4 (Collaborator, Author)

Run on the GPU server (but without using all cores).

# define an AutoTuner that wraps the classif.mlp
at = auto_tuner(
  learner = mlp,
  tuner = tnr("grid_search"),
cxzhang4 (Collaborator, Author)

Use MBO instead: it is more efficient than grid search. A possible sketch is below.
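A rough sketch of that swap, assuming the mlr3mbo package with its default surrogate and acquisition settings (none of this is part of the PR, and MBO works best if the search space is numeric rather than a list of hand-built neuron vectors):

library(mlr3mbo)

at = auto_tuner(
  learner = mlp,
  tuner = tnr("mbo"),              # model-based optimization instead of grid search
  resampling = rsmp("cv"),
  measure = msr("classif.acc"),
  term_evals = 100
)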

  ),
  batch_size = to_tune(16, 32, 64),
  p = to_tune(0.1, 0.9),
  epochs = to_tune(upper = 1000L, internal = TRUE),
cxzhang4 (Collaborator, Author)

Consider reducing the maximum number of epochs, e.g. as sketched below.
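One possible adjustment (the 500 is purely illustrative, not a value agreed on in this thread; internal = TRUE still lets the number of epochs be chosen internally below that cap):

epochs = to_tune(upper = 500L, internal = TRUE),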


bmrdt = as.data.table(bmr)

fwrite(bmrdt, here("R", "rf_Use_case", "results", "bmrdt.csv"))
cxzhang4 (Collaborator, Author)

Typo in the directory name (capitalization).

Suggested change:
- fwrite(bmrdt, here("R", "rf_Use_case", "results", "bmrdt.csv"))
+ fwrite(bmrdt, here("R", "rf_use_case", "results", "bmrdt.csv"))

cxzhang4 (Collaborator, Author) left a comment

Run this with 10 evaluations on the GPU server (without using all of the cores) and report how long it takes.

Only parallelize the learners (one thread per learner) using future; see the sketch below.

Run this experiment using the GitHub installation (main branch) of mlr3torch. This will properly handle the interop threads.
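A minimal sketch of that setup (the worker count of 8 is a placeholder, and set_threads() is assumed to reach the learner's threading parameter):

# development version from GitHub (main branch)
# remotes::install_github("mlr-org/mlr3torch")

library(future)

# one R process per learner/resampling iteration, not one per core
plan(multisession, workers = 8)  # placeholder worker count, not all cores

# keep each learner single-threaded so the future workers do not oversubscribe the CPU
set_threads(mlp, n = 1)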
