[server] Update server routes to be compliant with MLServer #1237
Conversation
LGTM, clean
LGTM - as mentioned before, we need to sync with QA before landing.
This seems like it would cause breaking changes for any application built against the old endpoint structure. A README or document showing how to migrate current usage examples would be helpful, so that other teams and users have a summary of the changes.
For instance, how should we update this DigitalOcean getting-started guide? https://marketplace.digitalocean.com/apps/deepsparse-inference-runtime
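To make the migration concern concrete, here is a minimal sketch of how a client's request URL might change. The exact paths are assumptions: the old flat `/predict` endpoint and the new MLServer-style `/v2/models/{name}/infer` path are inferred from MLServer's KServe-V2-style routing, not quoted from this PR.

```python
# Hypothetical client-side view of the route change.
# BASE, the old "/predict" path, and the new "/v2/models/{name}/infer"
# path are illustrative assumptions, not values confirmed by this PR.

BASE = "http://localhost:5543"

def old_infer_url() -> str:
    # Pre-change: a single flat prediction endpoint (assumed).
    return f"{BASE}/predict"

def new_infer_url(model_name: str) -> str:
    # Post-change: per-model, MLServer-compatible path (assumed).
    return f"{BASE}/v2/models/{model_name}/infer"

print(old_infer_url())
print(new_infer_url("question_answering"))
```

A migration doc could pair each old route with its new equivalent in a table like this, which would also make updating the DigitalOcean guide straightforward.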
* refactor server for different integrations; additional functionality for chat completion streaming and non-streaming
* further refactor server
* add support such that openai can host multiple models
* update all tests
* fix output for n > 1
* add inline comment explaining ProxyPipeline
* [server] Update OpenAI Model Support (#1300)
  * update server
  * allow users to send requests with new models
  * use v1; move around baseroutes
  * add openai path
  * PR comments
  * clean-up output classes to be dataclasses, add docstrings, clean up generation kwargs
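Since the commit log mentions an OpenAI-compatible path with streaming and non-streaming chat completions, a request against it would presumably follow the public OpenAI chat-completions shape. Below is a hedged sketch that only builds such a request body; the model name is a placeholder and nothing here is taken from the PR's actual code.

```python
import json

# Illustrative only: the request shape follows the public OpenAI
# chat-completions format; the model name below is a placeholder,
# not a value from this PR.

def build_chat_request(model: str, prompt: str, stream: bool = False) -> str:
    """Serialize an OpenAI-style chat-completions request body."""
    payload = {
        "model": model,  # one of the models hosted by the server
        "messages": [{"role": "user", "content": prompt}],
        "stream": stream,  # streaming vs. non-streaming completion
    }
    return json.dumps(payload)

body = build_chat_request("placeholder-model", "Hello!", stream=True)
print(body)
```

Keeping `stream` as an explicit flag mirrors the commit log's distinction between streaming and non-streaming completion support.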
Summary:
Testing:
Sample Config:
This will now produce the following endpoints: