-
Notifications
You must be signed in to change notification settings - Fork 216
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Documentation updates #3012
Documentation updates #3012
Conversation
|
||
data:image/s3,"s3://crabby-images/267f5/267f5e9e3a1fc1378b595f2df6ad21ddf90fb46a" alt="OVMS picture" | ||
|
||
The models used by the server need to be stored locally or hosted remotely by object storage services. For more details, refer to [Preparing Model Repository](docs/models_repository.md) documentation. Model server works inside [Docker containers](docs/deploying_server.md#deploying-model-server-in-docker-container), on [Bare Metal](docs/deploying_server.md#deploying-model-server-on-baremetal-without-container), and in [Kubernetes environment](docs/deploying_server.md#deploying-model-server-in-kubernetes). | ||
Start using OpenVINO Model Server with a fast-forward serving example from the [Quickstart guide](docs/ovms_quickstart.md) or explore [Model Server features](docs/features.md). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we want to drop reference to complete feature list in our main readme?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it is included below the key features.
|
||
* [gRPC](https://grpc.io/) | ||
* [Simplified Deployments with OpenVINO™ Model Server and TensorFlow Serving](https://community.intel.com/t5/Blogs/Tech-Innovation/Artificial-Intelligence-AI/Simplified-Deployments-with-OpenVINO-Model-Server-and-TensorFlow/post/1353218) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I suppose this blog introduces ovmsclient
that is no longer recommended way to prepare a client. Perhaps we should drop this reference?
docs/build_from_source.md
Outdated
1. make | ||
1. bash | ||
|
||
> **Note**: Building Windows server is covered in [Developer Guide for Windows](windows_developer_guide.md). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
> **Note**: Building Windows server is covered in [Developer Guide for Windows](windows_developer_guide.md). | |
> **Note**: Building on Windows server is covered in [Developer Guide for Windows](windows_developer_guide.md). |
docs/build_from_source.md
Outdated
Read more detailed usage in [developer guide](https://github.com/openvinotoolkit/model_server/blob/main/docs/developer_guide.md). | ||
|
||
## Building ovms.exe on Windows | ||
Read more detailed about building and testing changes in [developer guide](https://github.com/openvinotoolkit/model_server/blob/main/docs/developer_guide.md). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Read more detailed about building and testing changes in [developer guide](https://github.com/openvinotoolkit/model_server/blob/main/docs/developer_guide.md). | |
Read more details about building and testing changes in [developer guide](https://github.com/openvinotoolkit/model_server/blob/main/docs/developer_guide.md). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we use relative path? This file is not visible to sphinx anyway.
docs/genai.md
Outdated
@@ -0,0 +1,46 @@ | |||
# Endpoints for Generative Use Cases {#ovms_docs_genai} | |||
|
|||
OpenVINO Model Server allow extending the REST API interface to support arbitrary input format and execute arbitrary pipeline implemented as a MediaPipe graph. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
OpenVINO Model Server allow extending the REST API interface to support arbitrary input format and execute arbitrary pipeline implemented as a MediaPipe graph. | |
OpenVINO Model Server allows extending the REST API interface to support arbitrary input format and execute arbitrary pipeline implemented as a MediaPipe graph. |
docs/genai.md
Outdated
It supports a wide range of text generation models from Hugging Faces Hub. | ||
Internally it employs continuous batching and paged attention algorithms for efficient execution both on CPU and GPU. | ||
|
||
Learn model about the [LLM graph configuration](./llm/reference.md) and [exporting the models from Hugging Faces for serving](../demos/common/export_models/README.md). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Learn model about the [LLM graph configuration](./llm/reference.md) and [exporting the models from Hugging Faces for serving](../demos/common/export_models/README.md). | |
Learn more about the [LLM graph configuration](./llm/reference.md) and [exporting the models from Hugging Face for serving](../demos/common/export_models/README.md). |
docs/genai.md
Outdated
|
||
## Text embeddings | ||
|
||
Text embeddings transforming the semantic meaning of the text into a numerical vector. This operation is crucial for text searching and algorithms like RAG (Retrieval Augmented Generation). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Text embeddings transforming the semantic meaning of the text into a numerical vector. This operation is crucial for text searching and algorithms like RAG (Retrieval Augmented Generation). | |
Text embeddings transform the semantic meaning of the text into a numerical vector. This operation is crucial for text searching and algorithms like RAG (Retrieval Augmented Generation). |
docs/genai.md
Outdated
|
||
## Documents reranking | ||
|
||
Reranking process is used to sort the list of documented based on relevance in the context of a query. Just like text generation and embeddings, it is essential element or RAG chains. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reranking process is used to sort the list of documented based on relevance in the context of a query. Just like text generation and embeddings, it is essential element or RAG chains. | |
Reranking process is used to sort the list of documents based on relevance in the context of a query. Just like text generation and embeddings, it is essential element or RAG chains. |
docs/starting_server.md
Outdated
@@ -26,6 +17,10 @@ Start the model server by running the following command with your parameters: | |||
docker run -d --rm -v <models_repository>:/models -p 9000:9000 -p 8000:8000 openvino/model_server:latest \ | |||
--model_path <path_to_model> --model_name <model_name> --port 9000 --rest_port 8000 --log_level DEBUG | |||
``` | |||
or for binary package: | |||
```c |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this marking intentional? Or should it be console
?
docs/genai.md
Outdated
@@ -0,0 +1,46 @@ | |||
# Endpoints for Generative Use Cases {#ovms_docs_genai} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This title is descriptive, but from the menu point of view it's quite long (at the first sight you don't see what it means).
Maybe we could shorten it and start with GenAI
phrase so it catches the eye of the viewer like:
GenAI Endpoints
or GenAI Use Cases
?
Co-authored-by: Damian Kalinowski <[email protected]>
* structure adjustments * deepseek demo
* Documentation updates (#3012) * structure adjustments * deepseek demo * set folder context for demo flow (#3011) Co-authored-by: Dariusz Trawinski <[email protected]> --------- Co-authored-by: Trawinski, Dariusz <[email protected]> Co-authored-by: Dariusz Trawinski <[email protected]>
🛠 Summary
Documentation updates
Staging version http://openvino-doc.iotg.sclab.intel.com/ovms/index.html
🧪 Checklist
``