Python: Add Intel Gaudi HPU Support to Semantic Kernel #11064
base: main
Conversation
@microsoft-github-policy-service agree
We're starting to move towards maintaining fewer files, so please put all this stuff into a single `_gaudi.py` file inside the hugging_face services folder.
```python
@dataclass
class Config:
```
I think it would be better to split this into a settings setup, similar to what we use for a bunch of other things, like `OpenAISettings`; that would hold things like model names and auth tokens, which can be passed in or read from the ENV. The rest looks like execution settings, but I guess more of this is needed in the constructor, correct? We also use pydantic for a lot of things, and considering what happens in the init, I would use that, so you can just call `model_validate` with kwargs in the setup.
```python
# Update config with model_kwargs if provided
if model_kwargs:
    # Extract Gaudi-specific parameters from model_kwargs
    for key, value in model_kwargs.items():
```
leveraging pydantic features here would make this much simpler.
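For instance, assuming pydantic v2, the key-by-key extraction loop above could become a single call; `Config`, its fields, and `model_kwargs` here are stand-ins for the PR's own names:

```python
# Hedged sketch: Config and its fields are placeholders for the PR's config.
from pydantic import BaseModel


class Config(BaseModel):
    use_hpu: bool = False
    static_shapes: bool = True


model_kwargs = {"use_hpu": True}

config = Config()
if model_kwargs:
    # Replaces the manual `for key, value in model_kwargs.items()` loop.
    config = config.model_copy(update=model_kwargs)

print(config.use_hpu)  # True
```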
```python
from .config import Config

SCRIPT_DIR = os.path.dirname(os.path.abspath(__file__))
sys.path.append(os.path.dirname(SCRIPT_DIR))
```
what is this for?
```shell
cd semantic-kernel

# Run the Gaudi installation script
./install_for_gaudi.sh
```
this isn't in here...
```markdown
## Files

- `config.py`: Contains the `Config` dataclass that defines all configuration options
```
this seems AI-generated, and we do not use `.py` files for config anywhere!
```markdown
For optimal performance on Gaudi HPUs:

1. Use the `optimum-habana` package (automatically installed with the `gaudi` extras)
```
add the install command
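Something along these lines, presumably; the `gaudi` extras name is taken from this PR's own description and may not match the final packaging:

```shell
# Assumed commands: install the Gaudi optimization package directly,
pip install optimum-habana
# or pull it in via the extras this PR defines for semantic-kernel.
pip install "semantic-kernel[gaudi]"
```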
```python
    **pipeline_kwargs or {},
)
resolved_device = f"cuda:{device}" if device >= 0 and torch.cuda.is_available() else "cpu"
if use_hpu:
```
considering the number of parameters, I think it would be better to allow people to pass in the pipeline; then they can construct their own GaudiPipeline with those parameters, instead of this very hard-to-discover set of configs
that has been something I wanted to do anyway, since that also gives extra capabilities for just HF settings.
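The pipeline-injection idea could look roughly like this; the class name and `complete` method are illustrative placeholders, not the PR's or Semantic Kernel's actual API:

```python
# Hedged sketch of the reviewer's suggestion: the service accepts a pre-built
# pipeline object (e.g. one the caller constructed with optimum-habana's
# GaudiPipeline) instead of exposing dozens of constructor parameters.
from typing import Any, Callable


class GaudiTextCompletion:
    def __init__(self, pipeline: Callable[..., Any]) -> None:
        # The caller owns pipeline construction; this class stays parameter-light.
        self._pipeline = pipeline

    def complete(self, prompt: str) -> str:
        result = self._pipeline(prompt)
        # transformers-style pipelines return a list of dicts for text generation.
        return result[0]["generated_text"] if isinstance(result, list) else str(result)


# Usage with any callable that mimics a transformers pipeline:
fake_pipeline = lambda prompt: [{"generated_text": prompt + " world"}]
service = GaudiTextCompletion(pipeline=fake_pipeline)
print(service.complete("hello"))  # hello world
```

This also keeps the Gaudi path symmetric with plain Hugging Face usage, since any compatible pipeline can be swapped in.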
Summary

This PR introduces support for Intel Gaudi HPUs (Habana Processing Units) to Semantic Kernel, enabling optimized AI workloads on specialized hardware.

Changes

- Added an `install-gaudi` target to handle installation on Gaudi HPU instances, excluding PyTorch.
- Added the `optimum-habana` dependency for Gaudi optimization.
- Added a configuration (`config.py`), a Gaudi-specific pipeline (`pipeline.py`), and helper functions (`utils.py`) to streamline integration and performance optimization.

How to Test

Notes

Contribution Checklist