Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add WasmEdge project idea for lfx 2025 term 1 #1324

Merged
merged 2 commits into from
Jan 22, 2025
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
71 changes: 70 additions & 1 deletion programs/lfx-mentorship/2025/01-Mar-May/project_ideas.md
Original file line number Diff line number Diff line change
Expand Up @@ -38,7 +38,6 @@ security posture of Envoy Gateway through comprehensive automated testing.
- Teju Nareddy (@nareddyt, [email protected])
- Upstream Issue: https://github.com/envoyproxy/gateway/issues/3124


### Jaeger

#### Jaeger: Upgrade Storage Backends to V2 Storage API
Expand Down Expand Up @@ -132,4 +131,74 @@ security posture of Envoy Gateway through comprehensive automated testing.

- Repository
GitHub Repository: kubestellar/ui

By implementing these enhancements, KubeStellar UI will evolve into a comprehensive tool for cluster management, empowering users to efficiently deploy and manage resources while offering an intuitive and modern interface.

### WasmEdge

#### Implement a new WasmEdge installer in Rust

- Description: Create a new tool in Rust that provides: Support cross-operating systems, including Linux(amd64 and aarch64), macOS(Intel models and Apple Silicon models), and Windows(amd64); Simplifies installation of the WasmEdge runtime and its plugins in a single tool called wasmedgeup; Automatically handles versioning, dependencies, OS/ARCH detection, and ensure the same user experience across operating systems and architectures. For more details, please refer to the upstream issue.
- Expected Outcome:
- A Rust implemented installer in [wasmedgeup](https://github.com/WasmEdge/wasmedgeup).
- A document to describe how to use.
- A CI workflow to build and test on Linux(Ubuntu, Fedora), macOS, and Windows.
- Recommended Skills:
- Mentor(s):
- Hung-Ying Tai (@hydai, [email protected])
- Upstream Issue: https://github.com/WasmEdge/WasmEdge/issues/3990

#### Implement component model's validator

- Description: The current validator of component model inside of WasmEdge only check nested module and ensure VM can run the nested modules without problem, but the validations from component model are mostly skipped.
- Expected Outcome:
- One should create a workable (merged into upstream) implementation of validator by working on
- `include/validator/validator_component.h`
- `lib/validator/validator_component.cpp`
- The visitor pattern are already setup.
- Recommended Skills:
- Since component model proposal separate their validation spec, one should able to
find requirements from https://github.com/WebAssembly/component-model/tree/main/design/mvp
- Implements it in C++.
- Mentor(s):
- Lîm Tsú-thuàn (@dannypsnl, [email protected])
- Upstream Issue: https://github.com/WasmEdge/WasmEdge/issues/3966

#### Improve the WasmEdge-based Rust coding assistant for inference-time scaling

- Description: In a [previous LFX mentorship project](https://github.com/WasmEdge/WasmEdge/issues/3495), we have created an [LLM-based coding assistant grounded in Rust programming language skills](https://huggingface.co/datasets/gaianet/learn-rust). We aim to further improve the Rust coding assistant by incorporating inference-time compute that utilizes the Rust compiler for feedback. One of the greatest advantages of Rust is its powerful and strict compiler, and the detailed error message generated by the compiler. The Rust compiler could give valuable feedbacks to code generating LLMs to improve the code quality.
- Expected Outcome:
- Run a [Qwen Coder 2.5 LLM locally](https://github.com/GaiaNet-AI/node-configs/tree/main/qwen-2.5-coder-7b-instruct) or access it via an API.
- Create an LLM system prompt that describes the structure and key elements of a `cargo` project. It will guide the LLM to generate multiple files (artifacts) for a complete project.
- Create a Python program to send user requests to the LLM and parse the generated result into locally cached files.
- Use a local Rust compiler to build the generated project. Sends the error messages back to the LLM to re-generate.
- Iterate until there is no more errors.
- Build a web API for the Python program that takes OpenAI compatible requests and return OpenAI compatible results.
- Recommended Skills:
- Rust
- [LlamaEdge](https://llamaedge.com/docs/user-guide/llm/full-openai)
- LLMs like ChatGPT
- Coding assistant like GitHub Copilot
- Mentor(s):
- Michael Yuan (@juntao, [email protected])
- Upstream Issue: https://github.com/WasmEdge/WasmEdge/issues/3985

#### Create a Japanese translation agent for CNCF videos

- Description: WasmEdge is a cross-platform and lightweight runtime for AI models. It can run a variety of GenAI models, such as [LLM](https://llamaedge.com/docs/user-guide/llm/get-started-with-llamaedge), [whisper](https://llamaedge.com/docs/user-guide/speech-to-text/quick-start-whisper) (voice to text), and [GPT-SoVITS](https://llamaedge.com/docs/user-guide/text-to-speech/gpt-sovits) (text to voice) on your own computers. By combining those 3 models together, developers in the WasmEdge community has created “video translation” applications that can translate video and audio content into another language. One such application is [VideoLangua.com](http://videolangua.com/) In this mentorship, we would like to build a Japanese translator agent that are specifically tailored to CNCF technical content.
- Expected Outcome:
- Use whisper to extract a time-stamped English transcript from a sample of CNCF videos. Develop whisper prompt that are suitable for CNCF technical content.
- Evaluate and select LLMs that are good at English to Japanese translation.
- Develop LLM prompts that are suitable for CNCF technical content.
- Train Japanese TTS actor models for GPT-SoVITS using PyTorch.
- Create dictionaries for how to pronounce CNCF technical words in Japanese.
- Evaluate the synthesized Japanese voice.
- Recommended Skills:
- The mentee must speak Japanese fluently.
- He or she needs to be familiar with technical content in CNCF videos.
- He or she should also be familiar with GenAI APIs (eg OpenAI API) and be able use PyTorch.
- Mentor(s):
- Michael Yuan (@juntao, [email protected])
- Miley Fu (@MileyFu, [email protected])
- Vivian Hu (@alabulei1, [email protected])
- Upstream Issue: https://github.com/WasmEdge/WasmEdge/issues/3986
Loading