Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Documentation updates #3012

Merged
merged 5 commits into from
Feb 4, 2025
Merged

Documentation updates #3012

merged 5 commits into from
Feb 4, 2025

Conversation

dtrawins
Copy link
Collaborator

@dtrawins dtrawins commented Feb 3, 2025

🛠 Summary

Documentation updates
Staging version http://openvino-doc.iotg.sclab.intel.com/ovms/index.html

🧪 Checklist

  • Unit tests added.
  • The documentation updated.
  • Change follows security best practices.
    ``

@dtrawins dtrawins requested review from mzegla and rasapala February 3, 2025 08:06

![OVMS picture](docs/ovms_high_level.png)

The models used by the server need to be stored locally or hosted remotely by object storage services. For more details, refer to [Preparing Model Repository](docs/models_repository.md) documentation. Model server works inside [Docker containers](docs/deploying_server.md#deploying-model-server-in-docker-container), on [Bare Metal](docs/deploying_server.md#deploying-model-server-on-baremetal-without-container), and in [Kubernetes environment](docs/deploying_server.md#deploying-model-server-in-kubernetes).
Start using OpenVINO Model Server with a fast-forward serving example from the [Quickstart guide](docs/ovms_quickstart.md) or explore [Model Server features](docs/features.md).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we want to drop reference to complete feature list in our main readme?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it is included below the key features.


* [gRPC](https://grpc.io/)
* [Simplified Deployments with OpenVINO™ Model Server and TensorFlow Serving](https://community.intel.com/t5/Blogs/Tech-Innovation/Artificial-Intelligence-AI/Simplified-Deployments-with-OpenVINO-Model-Server-and-TensorFlow/post/1353218)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suppose this blog introduces ovmsclient that is no longer recommended way to prepare a client. Perhaps we should drop this reference?

1. make
1. bash

> **Note**: Building Windows server is covered in [Developer Guide for Windows](windows_developer_guide.md).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
> **Note**: Building Windows server is covered in [Developer Guide for Windows](windows_developer_guide.md).
> **Note**: Building on Windows server is covered in [Developer Guide for Windows](windows_developer_guide.md).

Read more detailed usage in [developer guide](https://github.com/openvinotoolkit/model_server/blob/main/docs/developer_guide.md).

## Building ovms.exe on Windows
Read more detailed about building and testing changes in [developer guide](https://github.com/openvinotoolkit/model_server/blob/main/docs/developer_guide.md).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Read more detailed about building and testing changes in [developer guide](https://github.com/openvinotoolkit/model_server/blob/main/docs/developer_guide.md).
Read more details about building and testing changes in [developer guide](https://github.com/openvinotoolkit/model_server/blob/main/docs/developer_guide.md).

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we use relative path? This file is not visible to sphinx anyway.

docs/genai.md Outdated
@@ -0,0 +1,46 @@
# Endpoints for Generative Use Cases {#ovms_docs_genai}

OpenVINO Model Server allow extending the REST API interface to support arbitrary input format and execute arbitrary pipeline implemented as a MediaPipe graph.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
OpenVINO Model Server allow extending the REST API interface to support arbitrary input format and execute arbitrary pipeline implemented as a MediaPipe graph.
OpenVINO Model Server allows extending the REST API interface to support arbitrary input format and execute arbitrary pipeline implemented as a MediaPipe graph.

docs/genai.md Outdated
It supports a wide range of text generation models from Hugging Faces Hub.
Internally it employs continuous batching and paged attention algorithms for efficient execution both on CPU and GPU.

Learn model about the [LLM graph configuration](./llm/reference.md) and [exporting the models from Hugging Faces for serving](../demos/common/export_models/README.md).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Learn model about the [LLM graph configuration](./llm/reference.md) and [exporting the models from Hugging Faces for serving](../demos/common/export_models/README.md).
Learn more about the [LLM graph configuration](./llm/reference.md) and [exporting the models from Hugging Face for serving](../demos/common/export_models/README.md).

docs/genai.md Outdated

## Text embeddings

Text embeddings transforming the semantic meaning of the text into a numerical vector. This operation is crucial for text searching and algorithms like RAG (Retrieval Augmented Generation).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Text embeddings transforming the semantic meaning of the text into a numerical vector. This operation is crucial for text searching and algorithms like RAG (Retrieval Augmented Generation).
Text embeddings transform the semantic meaning of the text into a numerical vector. This operation is crucial for text searching and algorithms like RAG (Retrieval Augmented Generation).

docs/genai.md Outdated

## Documents reranking

Reranking process is used to sort the list of documented based on relevance in the context of a query. Just like text generation and embeddings, it is essential element or RAG chains.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Reranking process is used to sort the list of documented based on relevance in the context of a query. Just like text generation and embeddings, it is essential element or RAG chains.
Reranking process is used to sort the list of documents based on relevance in the context of a query. Just like text generation and embeddings, it is essential element or RAG chains.

@@ -26,6 +17,10 @@ Start the model server by running the following command with your parameters:
docker run -d --rm -v <models_repository>:/models -p 9000:9000 -p 8000:8000 openvino/model_server:latest \
--model_path <path_to_model> --model_name <model_name> --port 9000 --rest_port 8000 --log_level DEBUG
```
or for binary package:
```c
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this marking intentional? Or should it be console ?

docs/genai.md Outdated
@@ -0,0 +1,46 @@
# Endpoints for Generative Use Cases {#ovms_docs_genai}
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This title is descriptive, but from the menu point of view it's quite long (at the first sight you don't see what it means).
Maybe we could shorten it and start with GenAI phrase so it catches the eye of the viewer like:
GenAI Endpoints or GenAI Use Cases ?

@dtrawins dtrawins requested a review from dkalinowski February 4, 2025 11:16
docs/genai.md Outdated Show resolved Hide resolved
Co-authored-by: Damian Kalinowski <[email protected]>
@dtrawins dtrawins merged commit f0814fb into releases/2025/0 Feb 4, 2025
dkalinowski pushed a commit that referenced this pull request Feb 6, 2025
* structure adjustments
* deepseek demo
dkalinowski added a commit that referenced this pull request Feb 6, 2025
* Documentation updates (#3012)

* structure adjustments
* deepseek demo

* set folder context for demo flow (#3011)

Co-authored-by: Dariusz Trawinski <[email protected]>

---------

Co-authored-by: Trawinski, Dariusz <[email protected]>
Co-authored-by: Dariusz Trawinski <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants