[please test] BYOK with ollama #342

Open
olegklimov opened this issue Oct 2, 2024 · 6 comments

Comments

@olegklimov
Contributor

With the ollama project it's easy to host our own AI models.

You can set up bring-your-own-key (BYOK) to connect to an ollama server, and see if you can use StarCoder2 for code completion and llama models for chat.

Does it work at all? What do we need to fix to make it better?
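
A minimal bring-your-own-key sketch for a local ollama server could look roughly like this (the completion_* key names and endpoint below are an assumption, so check https://docs.refact.ai/byok/ for the exact field names):

chat_endpoint: "http://localhost:11434/v1/chat/completions"   # ollama's OpenAI-compatible chat API
chat_model: "llama3.2:1b-instruct-q8_0"                       # a llama chat model pulled into ollama
completion_endpoint: "http://localhost:11434/v1/completions"  # assumed key name and endpoint for code completion
completion_model: "starcoder2:latest"                         # StarCoder2 for code completion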

@pardeep-singh

@olegklimov I would like to take this up. Could you please share some docs or an example of how this can be done? Do we need to just test the integration here, or also make changes to get it working?

@olegklimov
Contributor Author

Oh, here: https://docs.refact.ai/byok/ you can also test whether our documentation is any good :D

@avie66
Collaborator

avie66 commented Oct 21, 2024

Hi @pardeep-singh
Did you have a look at Oleg's approach?

@ukrolelo

ukrolelo commented Feb 5, 2025

chat_endpoint: "http://localhost:11434/v1/chat/completions"
chat_model: "starcoder2:latest"

mainThreadExtensionService.ts:79 Error: write after end
	at _write (node:internal/streams/writable:489:11)
	at Writable.write (node:internal/streams/writable:510:10)
	at c:\Users\ukro\.vscode\extensions\smallcloud.codify-6.0.4-win32-x64\node_modules\vscode-jsonrpc\lib\node\ril.js:90:29
	at new Promise (<anonymous>)
	at WritableStreamWrapper.write (c:\Users\ukro\.vscode\extensions\smallcloud.codify-6.0.4-win32-x64\node_modules\vscode-jsonrpc\lib\node\ril.js:80:16)
	at StreamMessageWriter.doWrite (c:\Users\ukro\.vscode\extensions\smallcloud.codify-6.0.4-win32-x64\node_modules\vscode-jsonrpc\lib\common\messageWriter.js:100:33)
	at c:\Users\ukro\.vscode\extensions\smallcloud.codify-6.0.4-win32-x64\node_modules\vscode-jsonrpc\lib\common\messageWriter.js:91:29
$onExtensionRuntimeError @ mainThreadExtensionService.ts:79
127.0.0.1:9084/v1/at-command-preview:1  Failed to load resource: the server responded with a status of 417 (Expectation Failed)
127.0.0.1:9084/v1/chat:1  Failed to load resource: the server responded with a status of 400 (Bad Request)
127.0.0.1:9084/v1/at-command-preview:1  Failed to load resource: the server responded with a status of 417 (Expectation Failed)
127.0.0.1:9084/v1/at-command-preview:1  Failed to load resource: the server responded with a status of 417 (Expectation Failed)
127.0.0.1:9084/v1/at-command-preview:1  Failed to load resource: the server responded with a status of 417 (Expectation Failed)
127.0.0.1:9084/v1/chat:1  Failed to load resource: the server responded with a status of 400 (Bad Request)

@ukrolelo

ukrolelo commented Feb 5, 2025

chat_endpoint: "http://localhost:11434/v1/chat/completions"
chat_model: "llama3.2:1b-instruct-q8_0"

Error: Bad Request
Click to retry

index.umd.cjs:1  POST http://127.0.0.1:9099/v1/at-command-preview 417 (Expectation Failed)
index.umd.cjs:1  POST http://127.0.0.1:9099/v1/at-command-preview 417 (Expectation Failed)
index.umd.cjs:1  POST http://127.0.0.1:9099/v1/at-command-preview 417 (Expectation Failed)
index.umd.cjs:1  POST http://127.0.0.1:9099/v1/at-command-preview 417 (Expectation Failed)
index.umd.cjs:1  POST http://127.0.0.1:9099/v1/at-command-preview 417 (Expectation Failed)
index.umd.cjs:1  POST http://127.0.0.1:9099/v1/at-command-preview 417 (Expectation Failed)
index.umd.cjs:1  POST http://127.0.0.1:9099/v1/at-command-preview 417 (Expectation Failed)
index.umd.cjs:17  POST http://127.0.0.1:9099/v1/chat 400 (Bad Request)



Why is it sending to a different port?

@olegklimov
Contributor Author

Why is it sending to a different port?

The VSCode extension talks to refact-lsp, which in turn talks to the inference server.

I think the fastest way we can fix this is to reproduce your setup. So you have Windows and ollama with llama3.2:1b-instruct-q8_0, right? We'll try it.
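
To make the two ports concrete, here is your chat config again with comments on who calls what (9099 is just the port from your console output; refact-lsp can use a different local port):

chat_endpoint: "http://localhost:11434/v1/chat/completions"  # ollama; refact-lsp forwards chat requests here
chat_model: "llama3.2:1b-instruct-q8_0"
# The extension never calls ollama directly: it POSTs to refact-lsp on a local
# port (127.0.0.1:9099/v1/chat and /v1/at-command-preview in your log), and
# refact-lsp then proxies the chat to chat_endpoint above.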
