Compare optimized models vs. transformers models #194

Merged: 31 commits merged into huggingface:main on May 25, 2022

Conversation

@fxmarty (Contributor) commented May 17, 2022

Feedback is welcome, notably on the design, code quality, etc.

This PR aims at introducing a unified way to benchmark transformers models vs. optimized models. The approach is backend-independent (any backend can be plugged in for inference and evaluation) and code-free (the user does not need to write code to start runs and evaluate them).

The main contribution is to introduce helper classes and methods for data preprocessing, inference, and evaluation, split across several files:
* optimum/runs_base.py: general methods; this should be backend-agnostic.
* optimum/utils/preprocessing/: handles loading and preprocessing datasets, running inference with pipelines, and running evaluation. This should be backend-agnostic.
* optimum/onnxruntime/runs/: ONNX Runtime-specific methods
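
To make the intended layering more concrete, here is a rough sketch of the pattern. Everything below other than the RunConfig/Run names is hypothetical and only illustrates the backend-agnostic vs. backend-specific split; it is not the actual Optimum API:

    # Hypothetical sketch of the backend-agnostic / backend-specific split.
    # Names and signatures are illustrative, not the real Optimum classes.
    from abc import ABC, abstractmethod

    class Run(ABC):
        """Backend-agnostic run: holds the config and drives the whole evaluation."""

        def __init__(self, run_config: dict):
            self.config = run_config

        @abstractmethod
        def _load_model(self):
            """Backend-specific model loading."""

        def launch(self):
            model = self._load_model()
            # The real implementation would preprocess the dataset, run inference
            # through pipelines, compute metrics and measure latency/throughput here.
            return {"backend_model": type(model).__name__, "task": self.config["task"]}

    class OnnxRuntimeRun(Run):
        """ONNX Runtime-specific run: only what differs per backend lives here."""

        def _load_model(self):
            return object()  # stand-in for an ORTModel built from self.config

    print(OnnxRuntimeRun({"task": "text-classification"}).launch())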

For now, dataset preprocessing and evaluation are task-specific; the supported tasks are:

  • text-classification
  • token-classification
  • question-answering

As for the evaluation of transformers models, I believe there is some overlap with what exists in the AutoTrain backend and with what is being done in https://github.com/huggingface/evaluate. However, since my understanding is that supporting Optimum-based inference within AutoTrain is not a priority, it makes sense to me to have a common implementation here to evaluate transformers and optimized models so that they are comparable. I hope we can minimize duplicate efforts.

For the general metrics I used pipelines for inference, and I used ORTModel.forward() to measure latency/throughput.
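
For illustration, the pipeline-based metric computation looks roughly like the sketch below. This is a minimal example, not the actual run code: the model name, the toy inputs, and the use of the evaluate library are assumptions.

    from transformers import pipeline
    from evaluate import load

    # A vanilla transformers pipeline; an optimized backend would plug its own
    # model into an equivalent pipeline while keeping the same evaluation loop.
    pipe = pipeline(
        "text-classification",
        model="distilbert-base-uncased-finetuned-sst-2-english",
    )

    texts = ["A great movie.", "A complete waste of time."]
    references = [1, 0]  # toy labels: POSITIVE = 1, NEGATIVE = 0 for this model

    outputs = pipe(texts)
    label2id = pipe.model.config.label2id
    predictions = [label2id[out["label"]] for out in outputs]

    accuracy = load("accuracy")
    print(accuracy.compute(predictions=predictions, references=references))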

Tasks before (or after) merge

  • Documentation (Added in an "API Reference" section)
  • Test on several datasets
  • See if it would make sense to use Trainer.evaluate() instead of an explicit loop for evaluation --> I don't think it does; there is already a lot of abstraction in pipelines, and we should make use of it.
  • Make use of the train-eval-index metadata from datasets to auto-infer the data and label columns (see e.g. Autoeval config datasets#4234) (left to a next PR)
  • Support multi-column data (two columns would be sufficient, I guess; see Bert that receives text triplet as an input transformers#8573)
  • Document node exclusion (left to next PR)
  • Support node exclusion for dynamic quantization for OnnxRuntime (implemented in Allow onnxruntime quantization preprocessor for dynamic quantization #196)
  • Avoid including the PyTorch-to-NumPy conversion in the time measurements (left to a next PR), as in:
    # converts pytorch inputs into numpy inputs for onnx
    onnx_inputs = {
        "input_ids": input_ids.cpu().detach().numpy(),
        "attention_mask": attention_mask.cpu().detach().numpy(),
    }
  • Still some work to distinguish backend-agnostic code vs. backend-specific code
  • Clean up all remaining # TODO comments
  • Unit tests / github workflows (left to next PR)

With some additional work, this should close #128.

@HuggingFaceDocBuilderDev commented:

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@fxmarty (Contributor, Author) commented May 19, 2022

@lhoestq FYI, regarding the latency/throughput measurement, here is what I ended up with: https://github.com/fxmarty/optimum/blob/a111cfee49afc9bed68e18865442f2454d2556c3/optimum/runs_base.py#L172-L242 . It is borrowed from https://github.com/huggingface/tune.
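
In spirit, the measurement is a warm-up phase followed by a timed loop around the forward call, roughly as in the sketch below (simplified, with a dummy forward pass and placeholder input shapes standing in for ORTModel.forward; the actual implementation is in the runs_base.py link above). As noted in the task list, input conversions such as PyTorch-to-NumPy should ideally happen outside the timed region:

    import time
    import numpy as np
    import torch

    def measure(forward_fn, inputs, num_runs=30, warmup_runs=5):
        """Return per-call latencies (in seconds) of repeated forward calls."""
        # Any input conversion (e.g. torch tensors -> numpy for ONNX Runtime)
        # should be done before calling this function so it is not timed.
        for _ in range(warmup_runs):
            forward_fn(**inputs)
        latencies = []
        for _ in range(num_runs):
            start = time.perf_counter()
            forward_fn(**inputs)
            latencies.append(time.perf_counter() - start)
        return latencies

    def dummy_forward(input_ids, attention_mask):
        return input_ids.sum() + attention_mask.sum()  # stand-in for a model

    inputs = {
        "input_ids": torch.ones(1, 128, dtype=torch.long),
        "attention_mask": torch.ones(1, 128, dtype=torch.long),
    }
    latencies = measure(dummy_forward, inputs)
    print(
        f"mean latency: {np.mean(latencies) * 1e3:.2f} ms, "
        f"throughput: {len(latencies) / np.sum(latencies):.1f} samples/s"
    )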

@fxmarty force-pushed the runs-only branch 2 times, most recently from bf9bd2e to 2a120e4 on May 20, 2022.

## RunConfig

[[autodoc]] optimum.utils.runs.RunConfig
Member commented:

To generate these docs, you'll need to:

  • add pydantic to the required deps
  • update the __init__.py file under utils

For the second point, this works (if you also exclude optimum.runs_base.Run):

from .runs import RunConfig, Calibration, DatasetArgs, TaskArgs
from .preprocessing.base import DatasetProcessing

@fxmarty (Contributor, Author) replied:

Actually the first point was enough, and the doc for optimum.runs_base.Run is generated correctly as well.

Just one doubt: is adding an additional dependency a good idea? I think keeping install_requires to the bare minimum is best.

Member replied:

Generally, we try to keep external dependencies to a minimum, so ideally we would drop pydantic as a requirement if possible.

If not, you could add a new extras dep, e.g. benchmarks, that users can install with pip install optimum[benchmarks].
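
For reference, such an extras entry would look roughly like this in setup.py (a sketch; the exact extra name and version pin are up to the PR):

    # setup.py sketch: keep pydantic out of the core deps, expose it as an extra
    from setuptools import find_packages, setup

    setup(
        name="optimum",
        packages=find_packages(),
        install_requires=[
            # core dependencies only
        ],
        extras_require={
            # installed via: pip install optimum[benchmarks]
            "benchmarks": ["pydantic"],
        },
    )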

@fxmarty (Contributor, Author) replied:

Understood. For now I went with your latter suggestion for lack of time; in a next PR I will replace pydantic with dataclasses altogether.
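
As a rough idea of what dropping pydantic could look like, a dataclass-based config might be along these lines (the field names are hypothetical, for illustration only; the validation that pydantic provided would have to be done manually):

    from dataclasses import dataclass, field
    from typing import List, Optional

    @dataclass
    class RunConfig:
        """Plain-dataclass alternative to the pydantic model (illustrative fields)."""
        task: str
        dataset: str
        metrics: List[str] = field(default_factory=list)
        max_eval_samples: Optional[int] = None

        def __post_init__(self):
            # Manual validation replacing what pydantic did automatically.
            supported = ("text-classification", "token-classification", "question-answering")
            if self.task not in supported:
                raise ValueError(f"Unsupported task: {self.task}")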

(A review thread on docs/source/benchmark.mdx was marked outdated and resolved.)
@mfuntowicz self-requested a review on May 25, 2022 13:08.
@mfuntowicz (Member) left a comment:

LGTM, thanks @lewtun @fxmarty for tracking the issue with the doc, we'll do the necessary things.

@mfuntowicz merged commit 1b98940 into huggingface:main on May 25, 2022.
Successfully merging this pull request may close the following issue: Create benchmarking suite for optimised models.