
o1-preview model not working on chat #1026

Open
perinm opened this issue Sep 30, 2024 · 4 comments

Comments


perinm commented Sep 30, 2024

It shows buttons to click, but they don't actually execute, unlike with gpt-4o.

Tested with bring-your-own-model, with both o1-preview and gpt-4o set; the latter works fine.


perinm commented Sep 30, 2024

This is probably not a bug, but a new feature, because o1-preview changes a lot of requirements.

kgilpin (Contributor) commented Sep 30, 2024

Thanks @perinm. There are differences in the API for o1, such as not allowing system messages, so it's not working with Navie yet!
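For anyone curious, here is a minimal sketch of that difference using the OpenAI Python SDK. The prompts are made up, and the workaround shown (folding the system prompt into the user message) is just one common approach, not necessarily what Navie will do:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# Hypothetical prompts, purely for illustration; this is not Navie code.
system_prompt = "You are a helpful coding assistant."
user_prompt = "Explain what this stack trace means."

response = client.chat.completions.create(
    model="o1-preview",
    messages=[
        # {"role": "system", "content": system_prompt},  # rejected by o1-preview
        # A common workaround is to fold the system prompt into the user message:
        {"role": "user", "content": f"{system_prompt}\n\n{user_prompt}"},
    ],
)
print(response.choices[0].message.content)
```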

dustinbyrne (Contributor) commented

Hi @perinm, thanks for the report.

You're right that o1-preview is not currently functional due to changes in the model's requirements. That said, there are a few things worth mentioning here:

  1. Work on initial support for o1-preview in Navie has been completed, but it has not yet been shipped. I imagine this will be included in the next release, though I don't have a hard date for that.
  2. The buttons work using a separate model, which by default is gpt-4o-mini if you're using OpenAI endpoints. This is configurable by setting the APPMAP_NAVIE_MINI_MODEL environment variable (see the sketch after this list). I only mention this because it's not (yet) documented.
  3. We've done some initial testing with o1-preview on SWE Bench, though our findings are that it currently benchmarks lower than gpt-4o due to changes and limitations of the model.
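To make item 2 concrete, here is a minimal sketch of the override, assuming the variable is simply read from the process environment at startup. This is illustrative only, not Navie's actual implementation:

```python
import os

# The mini model defaults to gpt-4o-mini unless APPMAP_NAVIE_MINI_MODEL is set
# in the environment before the editor / Navie process starts.
mini_model = os.environ.get("APPMAP_NAVIE_MINI_MODEL", "gpt-4o-mini")
print(f"Mini model in use: {mini_model}")
```

For example, launching your editor with APPMAP_NAVIE_MINI_MODEL=gpt-4o in the environment should, per the above, route the button actions through gpt-4o instead of gpt-4o-mini.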

In regard to using o1-preview, is there a particular aspect of the model that you feel would be beneficial to bring into Navie?


perinm commented Sep 30, 2024

@dustinbyrne I'm excited for that.

Actually, I was having great success copying and pasting specific files into o1-preview and briefly explaining what I need the code to do.

I apply the suggested changes, modify them a little, copy and paste all the files again, and repeat. It feels roughly 10x more productive than my normal use of GPT for help, and with surprisingly few bugs.

Answers can go up to 32k tokens, and the context window goes up to 128k tokens.

Then I searched for ways to streamline the process and found this project you guys built with great passion and talent. They feel head-to-head, but overall I prefer o1-preview; it showed fewer inconsistencies in my tests, though the playgrounds were different:

gpt-4o I tested using Navie.
o1-preview I tested using open-webui

I'll keep testing gpt-4o with Navie because this is such a gem!
