Skip to content

Commit

Permalink
Add benchmark results for gpt3.5 on 8358b60 (AntonOsika#625)
Browse files Browse the repository at this point in the history
  • Loading branch information
alvaromat authored and 70ziko committed Oct 25, 2023
1 parent 7b68e46 commit 631807a
Showing 1 changed file with 23 additions and 0 deletions.
23 changes: 23 additions & 0 deletions benchmark/RESULTS.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,29 @@
python scripts/benchmark.py
```

## 2023-08-20 (8358b60e1c6dcfc517c47c15708422d9a7d1d83a)
| Benchmark | Version | Ran | Works | Perfect |
|--------------------|---------------|-----|-------|---------|
| currency_converter | GPT3.5 default||||
| image_resizer | GPT3.5 default||||
| pomodoro_timer | GPT3.5 default||||
| url_shortener | GPT3.5 default||||
| file_explorer | GPT3.5 default||||
| markdown_editor | GPT3.5 default||||
| timer_app | GPT3.5 default||||
| file_organizer | GPT3.5 default||||
| password_generator | GPT3.5 default||||
| todo_list | GPT3.5 default||||

### Notes on the errors

#### GPT3.5
- `pomodoro_timer`: notifications didn't work.
- `file_explorer`: deletion didn't work.
- `file_organizer`: only handled a very small set of formats.
- `todo_list`: tasks couldn't be marked as completed.
- `url_shortener`: file names were wrong. Nothing could be run.

## 2023-06-21

| Benchmark | Ran | Works | Perfect |
Expand Down

0 comments on commit 631807a

Please sign in to comment.