You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Port Seldon Core's GPT-2 example to Tempo. This would include defining the model through Tempo directives, as well as coding a pipeline that implements beam or greedy search in order to generate full texts from successive calls to the model.
Port Seldon Core's GPT-2 example to Tempo. This would include defining the model through Tempo directives, as well as coding a pipeline that implements beam or greedy search in order to generate full texts from successive calls to the model.
As part of this issue we could explore using the
FasterTransformer
backend of Triton.The text was updated successfully, but these errors were encountered: