
Python: #6761 Onnx Connector #8106

Merged
merged 67 commits into microsoft:main on Oct 10, 2024

Conversation

nmoeller
Contributor

@nmoeller nmoeller commented Aug 14, 2024

Motivation and Context

  1. Why is this change required?
    To enable ONNX models with Semantic Kernel; issue Python: Add support for local models via ONNX #6761 was in the backlog to add an ONNX connector.
  2. What problem does it solve?
    It solves the problem that Semantic Kernel is not yet integrated with the ONNX GenAI runtime.
  3. What scenario does it contribute to?
    Using a connector other than Hugging Face, OpenAI, or Azure OpenAI. When users want to use ONNX, they can now integrate it easily.
  4. If it fixes an open issue, please link to the issue here.
    Python: Add support for local models via ONNX #6761

Description

The changes are my own design, based on the other connectors; I tried to stay as close as possible to their structure.
For the integration I added the onnxruntime-genai Python package to the repository.

I added the following classes (a short usage sketch follows the list below):

  • OnnxCompletionBase --> Responsible for controlling the inference
  • OnnxTextCompletion --> Inherits from OnnxCompletionBase
    • Support for Text Completion with and without Images
    • Ready for Multimodal Inference
    • Ready for Text Only Inference
    • Supports all Models on onnxruntime-genai
  • OnnxChatCompletion --> Inherits from OnnxCompletionBase
    • Support for Chat Completion with and without Images
    • The user needs to provide the corresponding chat template to use this class
    • Ready for Multimodal Inference
    • Ready for Text Only Inference
    • Supports all Models on onnxruntime-genai
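For illustration, here is a minimal sketch of how such a chat service would typically be registered and called through the Semantic Kernel Python SDK. The class name OnnxGenAIChatCompletion, its import path, and the constructor arguments (ai_model_path, template) are assumptions inferred from this PR's file layout and description, not a confirmed API; only the generic Semantic Kernel calling pattern is meant to be authoritative.

```python
# Hypothetical usage sketch; the ONNX class name and constructor arguments are assumptions.
import asyncio

from semantic_kernel import Kernel
from semantic_kernel.connectors.ai.onnx import OnnxGenAIChatCompletion  # assumed import path
from semantic_kernel.connectors.ai.prompt_execution_settings import PromptExecutionSettings
from semantic_kernel.contents import ChatHistory


async def main() -> None:
    kernel = Kernel()
    # Assumed arguments: a local ONNX model folder and the chat template to apply.
    chat_service = OnnxGenAIChatCompletion(ai_model_path="./models/phi3", template="phi3")
    kernel.add_service(chat_service)

    history = ChatHistory()
    history.add_system_message("You are a helpful assistant.")
    history.add_user_message("What is Semantic Kernel?")

    # Standard SK chat-completion entry point shared by all connectors.
    reply = await chat_service.get_chat_message_content(
        chat_history=history,
        settings=PromptExecutionSettings(),
    )
    print(reply)


if __name__ == "__main__":
    asyncio.run(main())
```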

What is integrated so far:

  • OnnxCompletionBase Class
  • OnnxChatCompletionBase Class with Dynamic Template Support
  • OnnxTextCompletionBase Class
  • Sample Multimodal Inference with Phi3-Vision
  • Sample of OnnxChatCompletions with Phi3
  • Sample of OnnxTextCompletions with Phi3 (a text-completion sketch follows this list)
  • Integration Tests
  • Unit Tests
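For completeness, a similar sketch for plain text completion. Again, the class name OnnxGenAITextCompletion and its ai_model_path argument are assumptions based on the file names in this PR; get_text_contents is the generic SK text-completion entry point.

```python
# Hypothetical text-completion sketch; the ONNX class name and its arguments are assumptions.
import asyncio

from semantic_kernel.connectors.ai.onnx import OnnxGenAITextCompletion  # assumed import path
from semantic_kernel.connectors.ai.prompt_execution_settings import PromptExecutionSettings


async def main() -> None:
    # Assumed argument: path to a local ONNX model folder (e.g. a Phi-3 build).
    text_service = OnnxGenAITextCompletion(ai_model_path="./models/phi3")

    # Generic SK text-completion entry point; returns a list of completions.
    results = await text_service.get_text_contents(
        prompt="The capital of France is",
        settings=PromptExecutionSettings(),
    )
    print(results[0])


if __name__ == "__main__":
    asyncio.run(main())
```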

Some Notes

Contribution Checklist

  • The code builds clean without any errors or warnings
  • The PR follows the SK Contribution Guidelines and the pre-submission formatting script raises no violations
  • All unit tests pass, and I have added new tests where possible
  • I didn't break anyone 😄

@markwallace-microsoft markwallace-microsoft added the python Pull requests for the Python Semantic Kernel label Aug 14, 2024
@nmoeller nmoeller changed the title Python : Issue-6761-Onnx-Connector Python: Issue-6761-Onnx-Connector Aug 14, 2024
@nmoeller nmoeller changed the title Python: Issue-6761-Onnx-Connector Python: #6761 Onnx Connector Aug 14, 2024
@nmoeller nmoeller marked this pull request as ready for review September 17, 2024 14:32
@nmoeller nmoeller requested a review from a team as a code owner September 17, 2024 14:32
Merge commit resolving conflicts in:
	python/tests/integration/completions/chat_completion_test_base.py
	python/uv.lock
@TaoChenOSU
Contributor

Regarding our offline conversation on the prompt template: is using a prompt template to parse the chat history into some format overkill? A prompt template can do much more than substitute arguments. Is it possible to override the _prepare_chat_history_for_request method to get what Onnx wants?
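For reference, a rough sketch of what such an override could look like. The class and the Phi-3 style tags below are purely illustrative assumptions, not the implementation that was merged; only the _prepare_chat_history_for_request hook itself comes from ChatCompletionClientBase.

```python
# Illustrative sketch only: shows the shape of overriding _prepare_chat_history_for_request
# (a hook inherited from ChatCompletionClientBase) to render the chat history in the
# format an ONNX model expects. The Phi-3 style tags are an assumption, not the merged code.
from typing import Any

from semantic_kernel.connectors.ai.chat_completion_client_base import ChatCompletionClientBase
from semantic_kernel.contents import ChatHistory


class OnnxChatCompletionSketch(ChatCompletionClientBase):
    """Partial sketch; only the history-formatting hook is shown."""

    def _prepare_chat_history_for_request(
        self,
        chat_history: ChatHistory,
        role_key: str = "role",
        content_key: str = "content",
    ) -> Any:
        # Instead of the default list of role/content dicts, build one prompt string
        # in a Phi-3 style chat format that the ONNX runtime can consume directly.
        prompt = ""
        for message in chat_history.messages:
            prompt += f"<|{message.role.value}|>\n{message.content}<|end|>\n"
        return prompt + "<|assistant|>\n"
```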

@TaoChenOSU
Contributor

Thanks for the contribution! Will approve once the unit tests pass.

@TaoChenOSU TaoChenOSU added this pull request to the merge queue Oct 9, 2024
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Oct 9, 2024
@moonbox3 moonbox3 added this pull request to the merge queue Oct 9, 2024
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Oct 9, 2024
@moonbox3 moonbox3 added this pull request to the merge queue Oct 10, 2024
github-merge-queue bot pushed a commit that referenced this pull request Oct 10, 2024
Co-authored-by: Tao Chen
Co-authored-by: Eduard van Valkenburg
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Oct 10, 2024
@markwallace-microsoft
Member

Python Test Coverage

Python Test Coverage Report
| File | Stmts | Miss | Cover | Missing |
|------|------:|-----:|------:|---------|
| **semantic_kernel** | | | | |
| kernel.py | 199 | 47 | 76% | 148, 159, 163, 313–316, 423, 437–480 |
| **semantic_kernel/agents/group_chat** | | | | |
| agent_chat.py | 124 | 2 | 98% | 78, 171 |
| agent_group_chat.py | 100 | 2 | 98% | 151, 201 |
| broadcast_queue.py | 72 | 1 | 99% | 35 |
| **semantic_kernel/agents/open_ai** | | | | |
| assistant_content_generation.py | 133 | 9 | 93% | 96–97, 281, 291–294, 335, 337 |
| open_ai_assistant_base.py | 449 | 8 | 98% | 259, 337–338, 746, 867, 870, 932, 990 |
| **semantic_kernel/connectors/ai** | | | | |
| chat_completion_client_base.py | 116 | 2 | 98% | 382, 392 |
| completion_usage.py | 8 | 1 | 88% | 17 |
| **semantic_kernel/connectors/ai/anthropic/services** | | | | |
| anthropic_chat_completion.py | 176 | 5 | 97% | 147, 165, 169, 223, 419 |
| **semantic_kernel/connectors/ai/azure_ai_inference/services** | | | | |
| azure_ai_inference_chat_completion.py | 119 | 7 | 94% | 120, 146–149, 159, 180, 202 |
| azure_ai_inference_text_embedding.py | 41 | 1 | 98% | 87 |
| **semantic_kernel/connectors/ai/embeddings** | | | | |
| embedding_generator_base.py | 8 | 1 | 88% | 50 |
| **semantic_kernel/connectors/ai/google** | | | | |
| shared_utils.py | 26 | 1 | 96% | 56 |
| **semantic_kernel/connectors/ai/google/google_ai/services** | | | | |
| google_ai_chat_completion.py | 119 | 4 | 97% | 127, 153, 176, 178 |
| google_ai_text_completion.py | 63 | 2 | 97% | 98, 121 |
| utils.py | 65 | 3 | 95% | 140, 160–165 |
| **semantic_kernel/connectors/ai/google/vertex_ai/services** | | | | |
| utils.py | 66 | 3 | 95% | 141, 161–166 |
| vertex_ai_chat_completion.py | 119 | 4 | 97% | 121, 147, 170, 172 |
| vertex_ai_text_completion.py | 62 | 2 | 97% | 95, 116 |
| **semantic_kernel/connectors/ai/hugging_face/services** | | | | |
| hf_text_completion.py | 60 | 3 | 95% | 103, 112, 127 |
| hf_text_embedding.py | 32 | 5 | 84% | 79–83 |
| **semantic_kernel/connectors/ai/mistral_ai/services** | | | | |
| mistral_ai_chat_completion.py | 118 | 7 | 94% | 118–121, 307–310 |
| **semantic_kernel/connectors/ai/ollama/services** | | | | |
| ollama_chat_completion.py | 60 | 5 | 92% | 95–98, 108, 143 |
| ollama_text_completion.py | 57 | 6 | 89% | 72, 90–93, 103, 131 |
| **semantic_kernel/connectors/ai/onnx** | | | | |
| utils.py | 53 | 3 | 94% | 50–51, 112 |
| **semantic_kernel/connectors/ai/onnx/services** | | | | |
| onnx_gen_ai_chat_completion.py | 72 | 7 | 90% | 67–68, 98, 122, 167, 173, 179 |
| onnx_gen_ai_completion_base.py | 58 | 21 | 64% | 59–71, 79–90 |
| onnx_gen_ai_text_completion.py | 46 | 5 | 89% | 54–55, 87, 117, 133 |
| **semantic_kernel/connectors/ai/open_ai/prompt_execution_settings** | | | | |
| open_ai_prompt_execution_settings.py | 94 | 1 | 99% | 112 |
| **semantic_kernel/connectors/ai/open_ai/services** | | | | |
| azure_chat_completion.py | 107 | 5 | 95% | 118, 123, 157, 166, 169 |
| azure_text_completion.py | 28 | 2 | 93% | 82, 87 |
| azure_text_embedding.py | 30 | 2 | 93% | 84, 89 |
| open_ai_chat_completion_base.py | 127 | 5 | 96% | 71, 121, 141, 177, 287 |
| open_ai_handler.py | 63 | 3 | 95% | 86, 95–96 |
| open_ai_text_completion_base.py | 80 | 2 | 98% | 56, 161 |
| **semantic_kernel/connectors/ai/open_ai/settings** | | | | |
| azure_open_ai_settings.py | 22 | 1 | 95% | 99 |
| **semantic_kernel/connectors/memory/azure_ai_search** | | | | |
| azure_ai_search_collection.py | 87 | 2 | 98% | 150, 152 |
| **semantic_kernel/connectors/memory/redis** | | | | |
| redis_collection.py | 160 | 2 | 99% | 146, 316 |
| utils.py | 45 | 11 | 76% | 145–146, 164, 166, 173–188 |
| **semantic_kernel/connectors/openapi_plugin** | | | | |
| openapi_manager.py | 58 | 2 | 97% | 110–111 |
| openapi_parser.py | 88 | 2 | 98% | 71, 128 |
| openapi_runner.py | 105 | 2 | 98% | 181–182 |
| **semantic_kernel/connectors/openapi_plugin/models** | | | | |
| rest_api_operation.py | 129 | 1 | 99% | 242 |
| **semantic_kernel/contents** | | | | |
| function_call_content.py | 97 | 1 | 99% | 201 |
| streaming_chat_message_content.py | 68 | 1 | 99% | 210 |
| streaming_content_mixin.py | 39 | 2 | 95% | 37, 64 |
| **semantic_kernel/core_plugins/sessions_python_tool** | | | | |
| sessions_python_plugin.py | 134 | 8 | 94% | 69, 82–91, 99 |
| sessions_python_settings.py | 39 | 4 | 90% | 84–87 |
| **semantic_kernel/data** | | | | |
| vector_store_record_collection.py | 249 | 19 | 92% | 410, 470–474, 482–486, 526–529, 536–539 |
| vector_store_record_utils.py | 26 | 2 | 92% | 50, 52 |
| **semantic_kernel/functions** | | | | |
| kernel_function_decorator.py | 98 | 1 | 99% | 102 |
| kernel_function_from_method.py | 96 | 1 | 99% | 153 |
| kernel_function_from_prompt.py | 154 | 7 | 95% | 165–166, 180, 201, 219, 239, 322 |
| kernel_function_log_messages.py | 36 | 6 | 83% | 37–43 |
| kernel_plugin.py | 187 | 2 | 99% | 472, 475 |
| **semantic_kernel/planners** | | | | |
| plan.py | 234 | 45 | 81% | 54, 163–165, 197, 214–227, 264, 269, 277–278, 288–291, 308, 313, 329, 332–337, 355, 360, 363, 365, 372, 386–388, 393–397 |
| **semantic_kernel/planners/function_calling_stepwise_planner** | | | | |
| function_calling_stepwise_planner.py | 116 | 4 | 97% | 145, 189–190, 198 |
| **semantic_kernel/planners/sequential_planner** | | | | |
| sequential_planner.py | 64 | 6 | 91% | 71, 75, 109, 125, 134–135 |
| sequential_planner_extensions.py | 50 | 9 | 82% | 31–32, 56, 110–124 |
| sequential_planner_parser.py | 77 | 12 | 84% | 66–74, 93, 117–120 |
| **semantic_kernel/schema** | | | | |
| kernel_json_schema_builder.py | 129 | 9 | 93% | 53, 90, 186, 194, 205, 213, 228, 232–233 |
| **semantic_kernel/services** | | | | |
| ai_service_client_base.py | 22 | 1 | 95% | 64 |
| **semantic_kernel/template_engine/blocks** | | | | |
| code_block.py | 77 | 1 | 99% | 119 |
| named_arg_block.py | 43 | 1 | 98% | 98 |
| **semantic_kernel/utils/authentication** | | | | |
| entra_id_authentication.py | 15 | 2 | 87% | 26, 38 |
| **semantic_kernel/utils/telemetry** | | | | |
| user_agent.py | 16 | 2 | 88% | 18–19 |
| **semantic_kernel/utils/telemetry/model_diagnostics** | | | | |
| decorators.py | 171 | 4 | 98% | 372–375 |
| **TOTAL** | 11692 | 360 | 97% | |

Python Unit Test Overview

| Tests | Skipped | Failures | Errors | Time |
|------:|--------:|---------:|-------:|-----:|
| 2560 | 4 💤 | 0 ❌ | 0 🔥 | 1m 5s ⏱️ |

@moonbox3 moonbox3 added this pull request to the merge queue Oct 10, 2024
Merged via the queue into microsoft:main with commit b9e1133 Oct 10, 2024
25 checks passed