diff --git a/playbooks/core/comfyui-image-gen/README.md b/playbooks/core/comfyui-image-gen/README.md index 2645f74d..61792b91 100644 --- a/playbooks/core/comfyui-image-gen/README.md +++ b/playbooks/core/comfyui-image-gen/README.md @@ -22,7 +22,17 @@ This tutorial teaches you how to use ComfyUI with the Z Image Turbo model on you - Generating images and tuning generation parameters - Saving and sharing workflows -## Installing Dependencies +## Setting the Memory Configuration + + + + +## Check for Software Updates + + + + +## Installing Software Prerequisites @@ -31,18 +41,24 @@ This tutorial teaches you how to use ComfyUI with the Z Image Turbo model on you +**Grant your user access to GPU devices** (log out and back in for this to take effect): + +```bash +sudo usermod -aG render,video $LOGNAME +``` + #### Create a Virtual Environment On Linux, open a terminal in the directory of your choice and run the following prompt to create a venv: ```bash sudo apt update sudo apt install -y python3-venv -python3 -m venv llm-env -source llm-env/bin/activate +python3 -m venv comfyui-env +source comfyui-env/bin/activate ``` - + @@ -271,8 +287,6 @@ echo "OK: ComfyUI server is reachable!" - - ## Launching ComfyUI diff --git a/playbooks/core/comfyui-image-gen/playbook.json b/playbooks/core/comfyui-image-gen/playbook.json index 13cf0d38..851d8759 100644 --- a/playbooks/core/comfyui-image-gen/playbook.json +++ b/playbooks/core/comfyui-image-gen/playbook.json @@ -1,6 +1,6 @@ { "id": "comfyui-image-gen", - "title": "Generating images with ComfyUI and Z Image Turbo", + "title": "Generate Images with ComfyUI", "description": "Create stunning AI-generated images using ComfyUI with Z Image Turbo.", "time": 15, "unsupported_platforms": { diff --git a/playbooks/core/lmstudio-rocm-llms/README.md b/playbooks/core/lmstudio-rocm-llms/README.md index b5a20992..5b9fdb44 100644 --- a/playbooks/core/lmstudio-rocm-llms/README.md +++ b/playbooks/core/lmstudio-rocm-llms/README.md @@ -20,9 +20,20 @@ LM Studio is a powerful GUI-based wrapper for [llama.cpp](https://github.com/ggm - Serve models via OpenAI Compatible API to power custom workflows and apps -## Installing Dependencies +## Setting the Memory Configuration - + + + +## Check for Software Updates +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. + + + + +## Installing Software Prerequisites + + ## Downloading Models @@ -120,54 +131,60 @@ This model will now be accessible through the LM Studio Server endpoint and will Having just created the OpenAI Compatible endpoint, let's look at how to integrate this into a Python developer environment (such as VSCode) and use your system as a local API Provider. 1. Create a Python virtual environment: + + - - On Windows, open a terminal in the directory of your choice and follow the commands to create a venv. + On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. ```bash - python -m venv llm-env --system-site-packages - llm-env\Scripts\activate + sudo apt update + sudo apt install -y python3-venv + python3 -m venv lmstudio-env --system-site-packages + source lmstudio-env/bin/activate ``` + - > **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. - > setting it to RemoteSigned or Unrestricted) before running some Powershell commands. + +**Grant your user access to GPU devices** (log out and back in for this to take effect): - +```bash +sudo usermod -aG render,video $LOGNAME +``` - On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. ```bash sudo apt update sudo apt install -y python3-venv - python3 -m venv llm-env --system-site-packages - source llm-env/bin/activate + python3 -m venv lmstudio-env + source lmstudio-env/bin/activate ``` - + - - + On Windows, open a terminal in the directory of your choice and follow the commands to create a venv. ```bash - python -m venv llm-env - llm-env\Scripts\activate + python -m venv lmstudio-env --system-site-packages + lmstudio-env\Scripts\activate ``` > **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. > setting it to RemoteSigned or Unrestricted) before running some Powershell commands. - + - - On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. + + On Windows, open a terminal in the directory of your choice and follow the commands to create a venv. ```bash - sudo apt update - sudo apt install -y python3-venv - python3 -m venv llm-env - source llm-env/bin/activate + python -m venv lmstudio-env + lmstudio-env\Scripts\activate ``` - + + > **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. + > setting it to RemoteSigned or Unrestricted) before running some Powershell commands. + + 2. Install the OpenAI package ```bash diff --git a/playbooks/core/lmstudio-rocm-llms/playbook.json b/playbooks/core/lmstudio-rocm-llms/playbook.json index 0891089c..470ec007 100644 --- a/playbooks/core/lmstudio-rocm-llms/playbook.json +++ b/playbooks/core/lmstudio-rocm-llms/playbook.json @@ -1,6 +1,6 @@ { "id": "lmstudio-rocm-llms", - "title": "Running and serving LLMs with LM Studio", + "title": "Run and Serve LLMs with LM Studio", "description": "Set up LM Studio and LM Studio Server to run and serve large language models locally.", "time": 30, "supported_platforms": { diff --git a/playbooks/core/n8n-automation-gpt-oss/README.md b/playbooks/core/n8n-automation-gpt-oss/README.md index 4e9e6caa..26bd8b71 100644 --- a/playbooks/core/n8n-automation-gpt-oss/README.md +++ b/playbooks/core/n8n-automation-gpt-oss/README.md @@ -35,7 +35,17 @@ In this playbook, we use Lemonade to serve a local LLM that n8n connects to for n8n includes a **native Lemonade node** (`Lemonade Chat Model`) that provides a first-class integration - no need for manual configuration. This makes connecting your local LLM to automation workflows straightforward. -## Prerequisites +## Setting the Memory Configuration + + + + +## Check for Software Updates + + + + +## Installing Software Prerequisites diff --git a/playbooks/core/n8n-automation-gpt-oss/assets/compose.yml b/playbooks/core/n8n-automation-gpt-oss/assets/compose.yml index 6fddb137..61d8ddb3 100644 --- a/playbooks/core/n8n-automation-gpt-oss/assets/compose.yml +++ b/playbooks/core/n8n-automation-gpt-oss/assets/compose.yml @@ -1,6 +1,6 @@ services: n8n: - image: docker.n8n.io/n8nio/n8n + image: docker.io/n8nio/n8n container_name: n8n restart: unless-stopped network_mode: host diff --git a/playbooks/core/n8n-automation-gpt-oss/playbook.json b/playbooks/core/n8n-automation-gpt-oss/playbook.json index bc04c738..41269177 100644 --- a/playbooks/core/n8n-automation-gpt-oss/playbook.json +++ b/playbooks/core/n8n-automation-gpt-oss/playbook.json @@ -2,7 +2,7 @@ "id": "n8n-automation-gpt-oss", "title": "Automating Workflows with n8n and Local LLMs", "description": "Build an AI-powered news summarizer using n8n and Lemonade.", - "time": 30, + "time": 45, "supported_platforms": { "halo": [ "linux", @@ -59,7 +59,7 @@ "windows" ] }, - "difficulty": "beginner", + "difficulty": "intermediate", "isNew": false, "isFeatured": false, "developed": true, diff --git a/playbooks/core/pytorch-rocm-llms/README.md b/playbooks/core/pytorch-rocm-llms/README.md index d71a49b1..2b5e1606 100644 --- a/playbooks/core/pytorch-rocm-llms/README.md +++ b/playbooks/core/pytorch-rocm-llms/README.md @@ -24,74 +24,90 @@ This tutorial uses PyTorch powered by AMD ROCm™ software to run models that ca - Run LLMs like gpt-oss-20b and qwen3.5-4B locally using PyTorch and ROCm - Create a document summarization tool using LLMs -## Initial Setup +## Setting the Memory Configuration + + + + +## Check for Software Updates +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. + + + + +## Installing Software Prerequisites ### Create a Virtual Environment + - -On Windows, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed. - +On Linux, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed. + ```bash -python -m venv llm-env --system-site-packages -llm-env\Scripts\activate +sudo apt update +sudo apt install -y python3-venv +python3 -m venv pytorch-env --system-site-packages +source pytorch-env/bin/activate ``` + + -> **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. -> setting it to RemoteSigned or Unrestricted) before running some Powershell commands. + +**Grant your user access to GPU devices** (log out and back in for this to take effect): - - +```bash +sudo usermod -aG render,video $LOGNAME +``` - -On Linux, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed. +On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. ```bash sudo apt update sudo apt install -y python3-venv -python3 -m venv llm-env --system-site-packages -source llm-env/bin/activate +python3 -m venv pytorch-env +source pytorch-env/bin/activate ``` - - + + - - -On Windows, open a terminal in the directory of your choice and follow the commands to create a venv. + +On Windows, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed. ```bash -python -m venv llm-env -llm-env\Scripts\activate +python -m venv pytorch-env --system-site-packages +pytorch-env\Scripts\activate ``` > **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. > setting it to RemoteSigned or Unrestricted) before running some Powershell commands. - - + + - -On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. - + +On Windows, open a terminal in the directory of your choice and follow the commands to create a venv. + ```bash -sudo apt update -sudo apt install -y python3-venv -python3 -m venv llm-env -source llm-env/bin/activate +python -m venv pytorch-env +pytorch-env\Scripts\activate ``` - - + +> **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. +> setting it to RemoteSigned or Unrestricted) before running some Powershell commands. + + + ### Installing Basic Dependencies - + diff --git a/playbooks/core/pytorch-rocm-llms/playbook.json b/playbooks/core/pytorch-rocm-llms/playbook.json index b3f98f06..865631e0 100644 --- a/playbooks/core/pytorch-rocm-llms/playbook.json +++ b/playbooks/core/pytorch-rocm-llms/playbook.json @@ -1,6 +1,6 @@ { "id": "pytorch-rocm-llms", - "title": "Running LLMs with PyTorch and AMD ROCm™ software", + "title": "Run LLMs with PyTorch and ROCm™", "description": "Learn to run powerful language models on your PC with PyTorch and AMD ROCm™ software to summarize documents quickly and easily.", "time": 30, "supported_platforms": { diff --git a/playbooks/core/vscode-qwen3-coder/README.md b/playbooks/core/vscode-qwen3-coder/README.md index 677e14a9..ce8c96f2 100644 --- a/playbooks/core/vscode-qwen3-coder/README.md +++ b/playbooks/core/vscode-qwen3-coder/README.md @@ -9,8 +9,6 @@ SPDX-License-Identifier: MIT > This playbook uses special tags that GitHub cannot render. Please visit [amd.com/playbooks](https://amd.com/playbooks) to correctly preview this content. -# Local LLM coding with VSCode and Qwen3-Coder-30B - > [!NOTE] > This playbook requires a minimum of **32GB** of system memory. @@ -28,7 +26,18 @@ This tutorial demonstrates how to use Cline, VS Code, and LM Studio to run a cod * How to configure Cline to communicate with LM Studio for local inference of coding agents. * How to use local coding agents to solve real-world software engineering tasks. -## Core Dependencies +## Setting the Memory Configuration + + + + +## Check for Software Updates + +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. + + + +## Installing Software Prerequisites diff --git a/playbooks/core/vscode-qwen3-coder/playbook.json b/playbooks/core/vscode-qwen3-coder/playbook.json index 38aeab9a..25a1cca3 100644 --- a/playbooks/core/vscode-qwen3-coder/playbook.json +++ b/playbooks/core/vscode-qwen3-coder/playbook.json @@ -2,7 +2,7 @@ "id": "vscode-qwen3-coder", "title": "Local LLM Coding with VS Code and Qwen3-Coder", "description": "Use VS Code with locally-running Qwen3-Coder for private code assistance.", - "time": 60, + "time": 30, "supported_platforms": { "halo": [ "linux", @@ -59,7 +59,7 @@ "linux" ] }, - "difficulty": "intermediate", + "difficulty": "beginner", "isNew": false, "isFeatured": false, "developed": true, diff --git a/playbooks/dependencies/assets/memory-config/adrenalin_vram_new.png b/playbooks/dependencies/assets/memory-config/adrenalin_vram_new.png new file mode 100644 index 00000000..be26aaa9 Binary files /dev/null and b/playbooks/dependencies/assets/memory-config/adrenalin_vram_new.png differ diff --git a/playbooks/dependencies/assets/memory-config/linux_mem_new.png b/playbooks/dependencies/assets/memory-config/linux_mem_new.png new file mode 100644 index 00000000..e4a691f8 Binary files /dev/null and b/playbooks/dependencies/assets/memory-config/linux_mem_new.png differ diff --git a/playbooks/dependencies/assets/software-update/linux.png b/playbooks/dependencies/assets/software-update/linux.png new file mode 100644 index 00000000..99ea1465 Binary files /dev/null and b/playbooks/dependencies/assets/software-update/linux.png differ diff --git a/playbooks/dependencies/assets/software-update/windows.png b/playbooks/dependencies/assets/software-update/windows.png new file mode 100644 index 00000000..24654d11 Binary files /dev/null and b/playbooks/dependencies/assets/software-update/windows.png differ diff --git a/playbooks/dependencies/driver.md b/playbooks/dependencies/driver.md index 232a88ec..b2b16f1e 100644 --- a/playbooks/dependencies/driver.md +++ b/playbooks/dependencies/driver.md @@ -4,6 +4,8 @@ Copyright Advanced Micro Devices, Inc. SPDX-License-Identifier: MIT --> + + ### AMD GPU Driver @@ -31,3 +33,5 @@ Download and install the latest AMD GPU driver for Linux: + + diff --git a/playbooks/dependencies/memoryconfig.md b/playbooks/dependencies/memoryconfig.md index 807c4b63..598115ad 100644 --- a/playbooks/dependencies/memoryconfig.md +++ b/playbooks/dependencies/memoryconfig.md @@ -4,47 +4,79 @@ Copyright Advanced Micro Devices, Inc. SPDX-License-Identifier: MIT --> -### Memory configuration for running large models - -On Windows, to run larger models that require higher memory, we need to use the AMD Variable Graphics Memory (iGPU VRAM) allocation. Although 64 GB is adequate for most workloads, running the largest models with high context may require 96 GB. +On Windows, to run larger models, increase the **dedicated GPU memory** allocation. + + + +On the Ryzen AI Halo, the default is 64GB dedicated. To modify this, open **AMD Software: Adrenalin Edition™** and navigate to **Performance → Tuning → AMD Variable Graphics Memory**. Reboot for the changes to take effect. + +

+ AMD Software Adrenalin Edition — AMD Variable Graphics Memory panel +

+ + + + + +Open **AMD Software: Adrenalin Edition™** and navigate to **Performance → Tuning → AMD Variable Graphics Memory**. Reboot for the changes to take effect. + +

+ AMD Software Adrenalin Edition — AMD Variable Graphics Memory panel +

-This can be done by opening AMD Software: Adrenalin™ Edition control panel and navigating to: `Performance > Tuning > AMD Variable Graphics Memory`. Please reboot the system for the changes to take effect. + -On Linux, ROCm utilizes a shared system memory pool, and this pool is configured by default to half the system memory. +On Linux, to run larger models, increase the **shared memory** pool available to the GPU. -This amount can be increased by changing the kernel’s Translation Table Manager (TTM) page setting, with the following instructions. AMD recommends setting the minimum dedicated VRAM in the BIOS (0.5GB) + -* Install the pipx utility and add the path for pipx installed wheels into the system search path. +On the AMD Ryzen™ AI Halo, the default is 96GB shared. To modify this, open the **AMD Ryzen™ AI Developer Center** and go to the **Settings** tab. Under **Graphics Performance Settings**, increase the **Shared Video Memory** slider, then click **Apply Changes** and reboot for the changes to take effect. - ```bash - sudo apt install pipx - pipx ensurepath - ``` +

+ AMD Ryzen AI Developer Center — Graphics Performance Settings with Shared Video Memory slider +

-* Install the amd-debug-tools wheel from PyPi. - ```bash - pipx install amd-debug-tools - ``` + -* Run the amd-ttm tool to query the current settings for shared memory. - ```bash - amd-ttm - ``` + -* Reconfigure shared memory settings by using the --set argument (units in GB). - ```bash - amd-ttm --set - ``` +Increase the shared memory pool by changing the kernel's Translation Table Manager (TTM) page setting. AMD recommends setting the minimum dedicated VRAM in the BIOS (0.5 GB) so the maximum amount is available as shared memory. -* Reboot the system for changes to take effect. +1. Install the `pipx` utility and add the path for pipx-installed wheels to the system search path: + ```bash + sudo apt install pipx + pipx ensurepath + ``` + +2. Install the `amd-debug-tools` wheel from PyPI: + + ```bash + pipx install amd-debug-tools + ``` + +3. Query the current shared memory settings: + + ```bash + amd-ttm + ``` + +4. Increase the shared memory allocation (units in GB): + + ```bash + amd-ttm --set + ``` + +5. Reboot for the changes to take effect. For `amd-ttm` usage examples, see the [ROCm documentation](https://rocm.docs.amd.com/projects/radeon-ryzen/en/docs-7.0.2/docs/install/installryz/native_linux/install-ryzen.html#amd-ttm-usage-examples). + + diff --git a/playbooks/dependencies/pytorch.md b/playbooks/dependencies/pytorch.md index 72356df9..dfa0be88 100644 --- a/playbooks/dependencies/pytorch.md +++ b/playbooks/dependencies/pytorch.md @@ -5,6 +5,7 @@ SPDX-License-Identifier: MIT --> #### PyTorch + **Install PyTorch with AMD ROCm™ software support** in the created virtual environment: @@ -31,5 +32,9 @@ python -m pip install --index-url https://repo.amd.com/rocm/whl/gfx1150/ "torch= -See [this link](https://rocm.docs.amd.com/projects/radeon-ryzen/en/latest/docs/install/installryz/native_linux/install-ryzen.html) for details. - \ No newline at end of file +```bash +python -m pip install --index-url https://repo.amd.com/rocm/whl/gfx1200-all/ torch torchvision torchaudio +``` + + +For other devices, please refer to [this link](https://rocm.docs.amd.com/en/7.13.0-preview/frameworks/pytorch/install.html?fam=ryzen&os=windows&pytorch-ver=2.11.0&w=compute&gpu=max-pro-395&gfx=gfx1151) for full instructions. diff --git a/playbooks/dependencies/registry.json b/playbooks/dependencies/registry.json index 6c4baa59..f5886e02 100644 --- a/playbooks/dependencies/registry.json +++ b/playbooks/dependencies/registry.json @@ -179,6 +179,12 @@ "linux": ["halo_box"], "windows": ["halo_box"] } + }, + "software-update": { + "name": "Software Updates", + "description": "Check for and install software updates on the AMD Ryzen AI Halo", + "platforms": ["windows", "linux"], + "file": "softwareupdate.md" } } } diff --git a/playbooks/dependencies/rocm.md b/playbooks/dependencies/rocm.md index 81db11d9..3f6a57b4 100644 --- a/playbooks/dependencies/rocm.md +++ b/playbooks/dependencies/rocm.md @@ -18,7 +18,6 @@ sudo reboot > **Note**: Ensure the virtual environment is active before proceeding. ```bash -python -m pip install --upgrade pip python -m pip install --index-url https://repo.amd.com/rocm/whl/gfx1151/ "rocm[libraries,devel]" ``` @@ -27,7 +26,6 @@ python -m pip install --index-url https://repo.amd.com/rocm/whl/gfx1151/ "rocm[l ```bash -python -m pip install --upgrade pip python -m pip install --index-url https://repo.amd.com/rocm/whl/gfx1152/ "rocm[libraries,devel]" ``` @@ -36,7 +34,6 @@ python -m pip install --index-url https://repo.amd.com/rocm/whl/gfx1152/ "rocm[l ```bash -python -m pip install --upgrade pip python -m pip install --index-url https://repo.amd.com/rocm/whl/gfx1150/ "rocm[libraries,devel]" ``` @@ -44,7 +41,6 @@ python -m pip install --index-url https://repo.amd.com/rocm/whl/gfx1150/ "rocm[l ```bash -python -m pip install --upgrade pip python -m pip install --index-url https://repo.amd.com/rocm/whl/gfx120x-all/ "rocm[libraries,devel]" ``` diff --git a/playbooks/dependencies/softwareupdate.md b/playbooks/dependencies/softwareupdate.md new file mode 100644 index 00000000..fe83aac3 --- /dev/null +++ b/playbooks/dependencies/softwareupdate.md @@ -0,0 +1,31 @@ + + + + +Before starting, ensure your Ryzen AI Halo has the latest software installed. Open the **AMD Ryzen™ AI Developer Center** and check for available updates, both to the app itself and additional software. + + + +Go to the **Updates** tab. If updates are available, install them and reboot before continuing. + +

+ AMD Ryzen AI Developer Center — Updates tab on Windows +

+ + + + + +Go to the **Manage** tab. If updates are available, install them and reboot before continuing. + +

+ AMD Ryzen AI Developer Center — Manage tab on Linux +

+ + + + diff --git a/playbooks/supplemental/amd-sync/playbook.json b/playbooks/supplemental/amd-sync/playbook.json index 12d9ee76..10e86c49 100644 --- a/playbooks/supplemental/amd-sync/playbook.json +++ b/playbooks/supplemental/amd-sync/playbook.json @@ -2,7 +2,7 @@ "id": "amd-sync", "title": "Remote Development with AMD Sync", "description": "Install AMD Sync on your laptop and get one-click remote VS Code, Terminal, JupyterLab, and Live Metrics on your AMD Ryzen AI Halo.", - "time": 10, + "time": 15, "supported_platforms": { "halo_box": [ "windows", diff --git a/playbooks/supplemental/clustering-rpc-server/README.md b/playbooks/supplemental/clustering-rpc-server/README.md index 43e5e1b7..f1ef544f 100644 --- a/playbooks/supplemental/clustering-rpc-server/README.md +++ b/playbooks/supplemental/clustering-rpc-server/README.md @@ -24,7 +24,19 @@ This playbook teaches you how to cluster two STX Halo™ systems using llama.cpp - Configuring an RPC worker and launching distributed inference across two nodes - Running a 358B parameter model across two networked STX Halo™ systems -## Prerequisites +## Setting the Memory Configuration + +> **Note**: Complete this step on both Machine 1 and Machine 2. + + + + +## Check for Software Updates + + + + +## Installing Software Prerequisites @@ -40,54 +52,6 @@ sudo apt install git cmake python3 python3-pip ``` -## Extending VRAM Allocation - -> **Note**: Complete this step on both Machine 1 and Machine 2. - - -### Memory configuration for running large models - -On Windows, to run larger models that require higher memory, we need to use the AMD Variable Graphics Memory (iGPU VRAM) allocation. - -This can be done by opening AMD Software: Adrenalin Edition control panel and navigating to: `Performance > Tuning > AMD Variable Graphics Memory`. Set the value to **96 GB**. Please reboot the system for the changes to take effect. - - - - -### Memory configuration for running large models - -On Linux, ROCm utilizes a shared system memory pool, and this pool is configured by default to half the system memory. - -This amount can be increased by changing the kernel's Translation Table Manager (TTM) page setting, with the following instructions. AMD recommends setting the minimum dedicated VRAM in the BIOS (0.5GB) - -* Install the pipx utility and add the path for pipx installed wheels into the system search path. - - ```bash - sudo apt install pipx - pipx ensurepath - ``` - -* Install the amd-debug-tools wheel from PyPi. - ```bash - pipx install amd-debug-tools - ``` - -* Run the amd-ttm tool to query the current settings for shared memory. - ```bash - amd-ttm - ``` - -* Reconfigure shared memory settings to **120 GB**: - ```bash - amd-ttm --set 120 - ``` - -* Reboot the system for changes to take effect. - -For `amd-ttm` usage examples, see the [ROCm documentation](https://rocm.docs.amd.com/projects/radeon-ryzen/en/docs-7.0.2/docs/install/installryz/native_linux/install-ryzen.html#amd-ttm-usage-examples). - - - ## Installing llama.cpp > **Note**: Complete this step on both Machine 1 and Machine 2. diff --git a/playbooks/supplemental/cvml/README.md b/playbooks/supplemental/cvml/README.md index 472dcbe0..1d9b6cf5 100644 --- a/playbooks/supplemental/cvml/README.md +++ b/playbooks/supplemental/cvml/README.md @@ -9,8 +9,6 @@ SPDX-License-Identifier: MIT > This playbook uses special tags that GitHub cannot render. Please visit [amd.com/playbooks](https://amd.com/playbooks) to correctly preview this content. -# Local Computer Vision with AMD Ryzen™ AI NPU - ## Overview The [Ryzen AI CVML Library](https://ryzenai.docs.amd.com/en/latest/ryzen_ai_libraries.html#ryzen-ai-cvml-library) is an AMD C++ computer vision and machine learning toolkit that provides powerful, on-device perception capabilities — including depth estimation, face detection, and face mesh tracking. Built on top of the Ryzen AI drivers, the library automatically selects the best available hardware (GPU or NPU) for inference, letting you add AI features to C++ applications without worrying about model training or framework integration. All processing happens locally on your system, making it ideal for privacy-sensitive, low-latency applications. @@ -25,7 +23,13 @@ This playbook teaches you how to set up the Ryzen AI CVML Library, build the inc - How to run face detection on an image with bounding boxes and landmarks - How to integrate CVML features into your own C++ applications -## Installing Basic Dependencies + +## Check for Software Updates + + + + +## Installing Software Prerequisites ## Additional Dependencies diff --git a/playbooks/supplemental/cvml/playbook.json b/playbooks/supplemental/cvml/playbook.json index 3b127332..679c2dee 100644 --- a/playbooks/supplemental/cvml/playbook.json +++ b/playbooks/supplemental/cvml/playbook.json @@ -2,7 +2,7 @@ "id": "cvml", "title": "Local Computer Vision with AMD Ryzen\u2122 AI NPU", "description": "Build local perception capabilities using the CVML SDK on top of Ryzen AI and AMD ROCm\u2122 software", - "time": 60, + "time": 45, "supported_platforms": { "halo": [ "windows" diff --git a/playbooks/supplemental/gaia-agents/README.md b/playbooks/supplemental/gaia-agents/README.md index 0f523ea4..c475149a 100644 --- a/playbooks/supplemental/gaia-agents/README.md +++ b/playbooks/supplemental/gaia-agents/README.md @@ -9,9 +9,7 @@ SPDX-License-Identifier: MIT > This playbook uses special tags that GitHub cannot render. Please visit [amd.com/playbooks](https://amd.com/playbooks) to correctly preview this content. -# Overview - -## Getting Started Creating Agents with GAIA +## Overview GAIA agents are AI assistants that use a local LLM to reason and call tools you define — like chatbots that can take action. They run **100% locally** with no cloud APIs, no data leaving your machine, and no API keys required. @@ -25,7 +23,18 @@ In this playbook, you'll build a Hardware Advisor Agent that detects your system - Memory-based model sizing using the 70% rule - Building an interactive CLI for natural language hardware queries -## Installing Dependencies +## Setting the Memory Configuration + + + + +## Check for Software Updates +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. + + + + +## Installing Software Prerequisites diff --git a/playbooks/supplemental/gaia-agents/playbook.json b/playbooks/supplemental/gaia-agents/playbook.json index 07812fc6..0f92a896 100644 --- a/playbooks/supplemental/gaia-agents/playbook.json +++ b/playbooks/supplemental/gaia-agents/playbook.json @@ -2,7 +2,7 @@ "id": "gaia-agents", "title": "Building Your First Agent with GAIA", "description": "Build a 100% local AI agent \u2014 no cloud APIs needed. Use the GAIA SDK to create a hardware advisor on your AMD Ryzen\u2122 AI", - "time": 20, + "time": 15, "supported_platforms": { "halo": [ "linux", @@ -55,7 +55,7 @@ "linux" ] }, - "difficulty": "intermediate", + "difficulty": "beginner", "isNew": false, "isFeatured": false, "developed": true, diff --git a/playbooks/supplemental/lemonade-getting-started/README.md b/playbooks/supplemental/lemonade-getting-started/README.md index a16e338f..e85ef31c 100644 --- a/playbooks/supplemental/lemonade-getting-started/README.md +++ b/playbooks/supplemental/lemonade-getting-started/README.md @@ -9,8 +9,6 @@ SPDX-License-Identifier: MIT > This playbook uses special tags that GitHub cannot render. Please visit [amd.com/playbooks](https://amd.com/playbooks) to correctly preview this content. -# Run Generative AI Locally with Lemonade - ## Overview 🍋 **Lemonade** is an open-source local AI server that lets you run large language models (LLMs), image generators, and audio models directly on your own hardware. It exposes the models through the industry-standard **OpenAI API**, so any app that works with OpenAI can instantly work with Lemonade. By the end of the playbook, you'll be using Lemonade to run models locally on your machine. @@ -26,7 +24,17 @@ By the end of this playbook you will be able to: * **Build a Python app** powered by a local LLM using the OpenAI-compatible API. * **Run models on the AMD Neural Processing Unit (NPU)** using Hybrid and FLM execution modes on AMD Ryzen™ AI hardware. -## Prerequisites +## Setting the Memory Configuration + + + + +## Check for Software Updates + + + + +## Installing Software Prerequisites Before you begin, make sure you have: diff --git a/playbooks/supplemental/lemonade-getting-started/playbook.json b/playbooks/supplemental/lemonade-getting-started/playbook.json index c54ee12c..f285eefe 100644 --- a/playbooks/supplemental/lemonade-getting-started/playbook.json +++ b/playbooks/supplemental/lemonade-getting-started/playbook.json @@ -1,8 +1,8 @@ { "id": "lemonade-getting-started", - "title": "Using Lemonade Across CPU, GPU, and NPU", + "title": "Getting Started with Lemonade", "description": "Learn to run Gen AI models locally with Lemonade, an open-source local AI server.", - "time": 60, + "time": 30, "supported_platforms": { "halo": [ "linux", @@ -55,7 +55,7 @@ "windows" ] }, - "difficulty": "intermediate", + "difficulty": "beginner", "isNew": false, "isFeatured": false, "developed": true, diff --git a/playbooks/supplemental/llama-factory-finetuning/README.md b/playbooks/supplemental/llama-factory-finetuning/README.md index 66c3a4e5..6fc2e5c2 100644 --- a/playbooks/supplemental/llama-factory-finetuning/README.md +++ b/playbooks/supplemental/llama-factory-finetuning/README.md @@ -1,12 +1,10 @@ -# LLM Fine-Tuning with LLaMA Factory - ## Overview Efficient fine-tuning is vital for adapting large language models (LLMs) to downstream tasks. LLaMA Factory is an open-source and user-friendly platform that streamlines the training and fine-tuning of large language models and multimodal models. It allows users to customize hundreds of pre-trained models locally with minimal coding. This playbook teaches you how to fine-tune LLMs using LLaMA Factory on your local AMD hardware. -## What you'll learn +## What You'll Learn - How to set up LLaMA Factory with AMD ROCm™ software - How to configure LLM fine-tuning parameters (using Qwen/Qwen3-4B-Instruct-2507 as an example) @@ -19,7 +17,17 @@ This playbook teaches you how to fine-tune LLMs using LLaMA Factory on your loca - Duration: It will take about 60 minutes to run this playbook (depending on your model/dataset size and network speed). - View the [LLaMA Factory GitHub](https://github.com/hiyouga/LlamaFactory) for more information. -## Setting up the Environment +## Setting the Memory Configuration + + + + +## Check for Software Updates + + + + +## Installing Software Prerequisites @@ -39,55 +47,73 @@ pip --version - #### Create a Virtual Environment + + +```bash +sudo apt update +sudo apt install -y python3-venv +python3 -m venv llamafactory-env --system-site-packages +source llamafactory-env/bin/activate +``` + + + + + +**Grant your user access to GPU devices** (log out and back in for this to take effect): + +```bash +sudo usermod -aG render,video $LOGNAME +``` + ```bash sudo apt update sudo apt install -y python3-venv -python3 -m venv venv -source venv/bin/activate +python3 -m venv llamafactory-env +source llamafactory-env/bin/activate ``` - + + + ```powershell -python -m venv venv -venv\Scripts\activate +python -m venv llamafactory-env --system-site-packages +llamafactory-env\Scripts\activate ``` - - + + + +```powershell +python -m venv llamafactory-env +llamafactory-env\Scripts\activate +``` + + + + + ### Installing Basic Dependencies - - - - - ### Installing Additional Dependencies -- **Python**: ensure minimum version is 3.11 - -```bash -pip install huggingface_hub --break-system-packages -``` - +> **Note**: Ensure Python version is 3.11, 3.12, or 3.13 - ```bash -pip install huggingface_hub +pip install huggingface_hub ``` - diff --git a/playbooks/supplemental/ollama-getting-started/README.md b/playbooks/supplemental/ollama-getting-started/README.md index a1541c73..6dedc5b7 100644 --- a/playbooks/supplemental/ollama-getting-started/README.md +++ b/playbooks/supplemental/ollama-getting-started/README.md @@ -9,8 +9,6 @@ SPDX-License-Identifier: MIT > This playbook uses special tags that GitHub cannot render. Please visit [amd.com/playbooks](https://amd.com/playbooks) to correctly preview this content. -# Getting Started with Ollama - ## Overview Ollama is a popular lightweight tool for running large language models locally. It handles model downloading, quantization, and serving behind a simple command-line interface and desktop app, so you can go from zero to chatting with an LLM in minutes. @@ -24,7 +22,18 @@ This playbook walks you through installing Ollama, pulling the GPT-OSS 20B model - Chat with models using the CLI - Query models programmatically through the REST API -## Installing Dependencies +## Setting the Memory Configuration + + + + +## Check for Software Updates +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. + + + + +## Installing Software Prerequisites diff --git a/playbooks/supplemental/open-webui-chat/README.md b/playbooks/supplemental/open-webui-chat/README.md index 1adae8ac..4acd50f9 100644 --- a/playbooks/supplemental/open-webui-chat/README.md +++ b/playbooks/supplemental/open-webui-chat/README.md @@ -66,6 +66,16 @@ If a backend supports those endpoints, Open WebUI can talk to it with minimal se --- +## Setting the Memory Configuration + + + + +## Check for Software Updates + + + + ## One-Time Setup This section establishes a stable local environment: Lemonade running, Open WebUI running, and a working connection between them. diff --git a/playbooks/supplemental/open-webui-chat/playbook.json b/playbooks/supplemental/open-webui-chat/playbook.json index 878df1d9..47715126 100644 --- a/playbooks/supplemental/open-webui-chat/playbook.json +++ b/playbooks/supplemental/open-webui-chat/playbook.json @@ -1,6 +1,6 @@ { "id": "open-webui-chat", - "title": "How to Chat with LLMs in Open WebUI", + "title": "Chat with LLMs in Open WebUI", "description": "Use Open WebUI to chat with LLMs locally.", "time": 30, "supported_platforms": { diff --git a/playbooks/supplemental/openclaw-lemonade-server/README.md b/playbooks/supplemental/openclaw-lemonade-server/README.md index 4571aefb..986c8657 100644 --- a/playbooks/supplemental/openclaw-lemonade-server/README.md +++ b/playbooks/supplemental/openclaw-lemonade-server/README.md @@ -27,26 +27,27 @@ By the end of this playbook you will be able to: --- -## Prerequisites +## Setting the Memory Configuration + + + + +## Check for Software Updates + + + + +## Installing Software Prerequisites - A PC running **Ubuntu 24.04+** or a compatible Debian-based Linux distribution with `apt-get` - At least **24 GB of RAM** (64 GB+ recommended for larger models) -- Depending on the size of the model you're running, set the minimum possible dedicated VRAM in the BIOS. -- Next, install the amd-debug-tools wheel from PyPI, and run the amd-ttm tool to reconfigure shared memory settings to the maximum: -```bash - sudo apt install pipx - pipx ensurepath - pipx install amd-debug-tools - amd-ttm --set 96 # Strix Halos can be set to 96GB. Set the shared memory value for other devices accordingly. -``` - **~10–30 GB of free disk space** for model weights - A PC running **Windows 10/11** - Visual Studio Community Edition [2022](https://aka.ms/vs/17/release/vs_community.exe) - At least **24 GB of RAM** (64 GB+ recommended for larger models) -- You could increase the dedicated GPU memory using [AMD Software: Adrenalin Edition™](https://www.amd.com/en/support/download/drivers.html) to try out larger models - **~10–30 GB of free disk space** for model weights diff --git a/playbooks/supplemental/pytorch-finetuning/README.md b/playbooks/supplemental/pytorch-finetuning/README.md index 1e035e06..49446c86 100644 --- a/playbooks/supplemental/pytorch-finetuning/README.md +++ b/playbooks/supplemental/pytorch-finetuning/README.md @@ -9,8 +9,6 @@ SPDX-License-Identifier: MIT > This playbook uses special tags that GitHub cannot render. Please visit [amd.com/playbooks](https://amd.com/playbooks) to correctly preview this content. -# Fine-tune LLMs with Pytorch and AMD ROCm™ Software - ## Overview This tutorial provides step-by-step examples for fine-tuning a large language model (LLM) with PyTorch and ROCm. It covers several techniques, from standard fine-tuning to memory-efficient Parameter-Efficient Fine-Tuning (PEFT) strategies, so you can easily adapt models for your needs. @@ -27,11 +25,27 @@ This tutorial provides step-by-step examples for fine-tuning a large language mo > **Note:** Some of the fine-tuning techniques in this playbook may require more than 64GB of system RAM. -## Quick Start +## What You'll Learn + +- How to fine-tune an LLM using LoRA, QLoRA, and full fine-tuning with PyTorch and ROCm +- How to save and deploy your fine-tuned model +- How to monitor training and debug common issues + +## Setting the Memory Configuration + + + + +## Check for Software Updates +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. + + + -### 1. Setup +## Installing Software Prerequisites #### Create a Virtual Environment + @@ -42,8 +56,26 @@ python3 -m venv finetune-venv --system-site-packages source finetune-venv/bin/activate ``` + + + +**Grant your user access to GPU devices** (log out and back in for this to take effect): + +```bash +sudo usermod -aG render,video $LOGNAME +``` + + +```bash +sudo apt update +sudo apt install -y python3-venv +python3 -m venv finetune-venv +source finetune-venv/bin/activate +``` + + @@ -54,12 +86,9 @@ python -m venv finetune-venv --system-site-packages finetune-venv\Scripts\activate ``` + - - - - ```powershell @@ -67,22 +96,8 @@ python -m venv finetune-venv finetune-venv\Scripts\activate ``` - - - - - - -```bash -sudo apt update -sudo apt install -y python3-venv -python3 -m venv finetune-venv -source finetune-venv/bin/activate -``` - - #### Installing Basic Dependencies @@ -197,6 +212,33 @@ r = subprocess.run([sys.executable, "train_full_finetuning.py"], timeout=600) sys.exit(r.returncode) ``` +## Quick Start +### 1. Choose Your Method + +| Method | Memory | Speed | Quality | Best For | +|--------|--------|-------|---------|----------| +| **QLoRA** | 12-16GB | Fastest | 90-95% | Low Memory Usage | +| **LoRA** | 24-32GB | Fast | 95-98% | Balanced approach | +| **Full** | 80GB+ | Slowest | 100% | Maximum quality | + +### 2. Run Training + +**Dataset and what the model learns** +The scripts turn the dataset into chat examples. For example, the QLoRA script uses **Abirate/english_quotes**: each example becomes a user–assistant pair like: + +- **User:** “Give me a quote about: <tag>” +- **Assistant:** “<quote> – <author>” + +Fine-tuning teaches the model to respond to prompts asking for quotes about a topic and to return them in the format ` - `. The LoRA and full fine-tuning scripts use **databricks/databricks-dolly-15k** (general instruction/response pairs), so the exact task varies by script; the idea is the same - adapt the model to your chosen dataset and format. + +Below is a summary of the available training methods. Each method links to its script and provides a brief description for choosing the right approach. + +| Script | Method | Description | Typical VRAM | Recommended For | +|-----------------------------------|-------------------|---------------------------------------------------------------------------------------------------------------------|--------------|-------------------------------------------------| +| [`train_lora.py`](assets/train_lora.py) | **LoRA** | Trains small adapter matrices while freezing base model. 3–5x faster; ~95–98% full quality. | 24–32GB | Advanced users; multiple adapters; more VRAM | +| [`train_qlora.py`](assets/train_qlora.py) *(Linux only)* | **QLoRA** | 4-bit quantization + LoRA adapters. Lowest memory use, fastest, small quality trade-off. Requires `bitsandbytes` (Linux only). | 12–16GB | Most users; fast experiments; limited VRAM | +| [`train_full_finetuning.py`](assets/train_full_finetuning.py) | **Full Fine-tuning** | Updates all model parameters. Maximum quality; highest memory and compute usage. | 40GB+ | Maximum quality; research; large VRAM | + --- ## Understanding the Techniques diff --git a/playbooks/supplemental/pytorch-kernels/README.md b/playbooks/supplemental/pytorch-kernels/README.md index 4c6ecd9b..47ac7053 100644 --- a/playbooks/supplemental/pytorch-kernels/README.md +++ b/playbooks/supplemental/pytorch-kernels/README.md @@ -9,8 +9,6 @@ SPDX-License-Identifier: MIT > This playbook uses special tags that GitHub cannot render. Please visit [amd.com/playbooks](https://amd.com/playbooks) to correctly preview this content. -# Compile your own GPU kernels for PyTorch + AMD ROCm™ Software - ## Overview Write a GPU kernel from scratch, compile it, launch it on an AMD GPU, and watch utilization spike. This playbook shows how GPU computation actually works: write the kernel code, and execute it in parallel across thousands of threads. @@ -103,24 +101,34 @@ AMD GPUs execute threads in groups of **32** called **wavefronts**. All threads --- -## Installing Software Prerequisites +### PyTorch + AMD/HIP -### Create a Virtual Environment +PyTorch ships a ROCm build where the CUDA API surface (`torch.cuda.*`) is transparently backed by HIP. This means: + +- `torch.cuda.is_available()` works on AMD GPUs with ROCm +- `tensor.to("cuda")` allocates on the AMD GPU +- `torch.version.hip` exposes the HIP version + +PyTorch also exposes `torch.cuda._compile_kernel()`, a high-level shortcut to JIT-compile a raw kernel string and get back a callable, without needing a separate build step. + +--- - -On Windows, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed. -```bash -python -m venv kernel-env --system-site-packages -kernel-env\Scripts\activate -``` +## Check for Software Updates -> **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. -> setting it to RemoteSigned or Unrestricted) before running some Powershell commands. + + +## Installing Software Prerequisites + +### Prerequisites - Windows +- Install latest: [AMD Adrenalin Software](https://www.amd.com/en/products/software/adrenalin.html) +### Create a Virtual Environment + + On Linux, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed. ```bash sudo apt update @@ -128,10 +136,25 @@ sudo apt install -y python3-venv python3 -m venv kernel-env --system-site-packages source kernel-env/bin/activate ``` - +**Grant your user access to GPU devices** (log out and back in for this to take effect): + +```bash +sudo usermod -aG render,video $LOGNAME +``` + +On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. +```bash +sudo apt update +sudo apt install -y python3-venv +python3 -m venv kernel-env +source kernel-env/bin/activate +``` + + + On Windows, open a terminal in the directory of your choice and follow the commands to create a venv. ```powershell @@ -144,17 +167,6 @@ kernel-env\Scripts\activate - -On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. -```bash -sudo apt update -sudo apt install -y python3-venv -python3 -m venv kernel-env -source kernel-env/bin/activate -``` - - - ### Installing Basic Dependencies diff --git a/playbooks/supplemental/pytorch-kernels/playbook.json b/playbooks/supplemental/pytorch-kernels/playbook.json index 3389fdac..69bfcd28 100644 --- a/playbooks/supplemental/pytorch-kernels/playbook.json +++ b/playbooks/supplemental/pytorch-kernels/playbook.json @@ -2,7 +2,7 @@ "id": "pytorch-kernels", "title": "Custom GPU Kernels with PyTorch and AMD ROCm\u2122", "description": "Write and optimize custom GPU kernels using PyTorch and AMD ROCm\u2122 software on AMD Ryzen\u2122 AI", - "time": 120, + "time": 60, "supported_platforms": { "halo": [ "linux" diff --git a/playbooks/supplemental/speech2speech-translation/README.md b/playbooks/supplemental/speech2speech-translation/README.md index 5fcc05ca..5a5725ae 100644 --- a/playbooks/supplemental/speech2speech-translation/README.md +++ b/playbooks/supplemental/speech2speech-translation/README.md @@ -9,8 +9,6 @@ SPDX-License-Identifier: MIT > This playbook uses special tags that GitHub cannot render. Please visit [amd.com/playbooks](https://amd.com/playbooks) to correctly preview this content. -# Live Speech2Speech Translation on AMD Radeon™ GPU - ## Overview The AMD ROCm™ software and PyTorch stack create a unified ecosystem for on-device AI. It works for both Windows and Linux with official support for a wide range of devices including Ryzen™ AI APUs and Radeon™ GPUs. @@ -29,28 +27,23 @@ This playbook will teach you how to run low-latency, expressive, and private spe - Conveys tone, emotion, and intent without awkward pauses - Enables global collaboration and faster decision-making -## Setting Up Your Environment +## Setting the Memory Configuration -### Create a Virtual Environment + - -On Windows, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed: +## Check for Software Updates +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. - -```bash -python -m venv s2st-env --system-site-packages -s2st-env\Scripts\activate -``` - - + + -> **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. -> setting it to RemoteSigned or Unrestricted) before running some Powershell commands. +## Installing Software Prerequisites - +### Create a Virtual Environment + On Linux, open a terminal and run the following prompt to create a venv with ROCm+Pytorch already installed: @@ -62,16 +55,36 @@ source s2st-env/bin/activate ``` - +**Grant your user access to GPU devices** (log out and back in for this to take effect): + +```bash +sudo usermod -aG render,video $LOGNAME +``` + +On Linux, open a terminal and run the following prompt to create a venv: + + +```bash +sudo apt update +sudo apt install -y python3-venv +python3 -m venv s2st-env +source s2st-env/bin/activate +``` + + + + + -On Windows, open a terminal in the directory of your choice and follow the commands to create a venv: + +On Windows, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed: ```bash -python -m venv s2st-env +python -m venv s2st-env --system-site-packages s2st-env\Scripts\activate ``` @@ -80,26 +93,28 @@ s2st-env\Scripts\activate > **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. > setting it to RemoteSigned or Unrestricted) before running some Powershell commands. - + - -On Linux, open a terminal and run the following prompt to create a venv: + +On Windows, open a terminal in the directory of your choice and follow the commands to create a venv: - + ```bash -sudo apt update -sudo apt install -y python3-venv -python3 -m venv s2st-env -source s2st-env/bin/activate +python -m venv s2st-env +s2st-env\Scripts\activate ``` - - + + +> **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. +> setting it to RemoteSigned or Unrestricted) before running some Powershell commands. + + ### Installing Basic Dependencies - + ### Additional Dependencies diff --git a/playbooks/supplemental/speech2speech-translation/playbook.json b/playbooks/supplemental/speech2speech-translation/playbook.json index 08c82cf9..711c95f1 100644 --- a/playbooks/supplemental/speech2speech-translation/playbook.json +++ b/playbooks/supplemental/speech2speech-translation/playbook.json @@ -1,8 +1,8 @@ { "id": "speech2speech-translation", - "title": "Speech-to-Speech Translation", + "title": "Real-Time Speech-to-Speech Translation", "description": "Build a real-time speech-to-speech translation on your local hardware.", - "time": 20, + "time": 30, "supported_platforms": { "halo": [ "linux", diff --git a/playbooks/supplemental/unsloth-llms-finetuning/README.md b/playbooks/supplemental/unsloth-llms-finetuning/README.md index 74b3f281..f1300582 100644 --- a/playbooks/supplemental/unsloth-llms-finetuning/README.md +++ b/playbooks/supplemental/unsloth-llms-finetuning/README.md @@ -31,6 +31,17 @@ In this playbook, we use Unsloth together with **LoRA-based SFT**. That means th Unsloth also supports other training approaches, including QLoRA and reinforcement learning workflows. This playbook focuses on the simplest path first: a small LoRA fine-tuning example that users can run, understand, and extend. +## Setting the Memory Configuration + + + + +## Check for Software Updates +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. + + + + ## Installing Software Prerequisites ### Create a Virtual Environment @@ -71,16 +82,26 @@ source unsloth-env/bin/activate > **Note:** Python 3.13 is required for Windows. + +Open a PowerShell terminal and create a virtual environment: + +```powershell +python -m venv unsloth-env --system-site-packages +.\unsloth-env\Scripts\activate +``` + + + Open a PowerShell terminal and create a virtual environment: ```powershell -python -m venv unsloth_env -.\unsloth_env\Scripts\activate +python -m venv unsloth-env +.\unsloth-env\Scripts\activate ``` + ### Installing Basic Dependencies - diff --git a/playbooks/supplemental/unsloth-llms-finetuning/playbook.json b/playbooks/supplemental/unsloth-llms-finetuning/playbook.json index 0196e58b..46a84f05 100644 --- a/playbooks/supplemental/unsloth-llms-finetuning/playbook.json +++ b/playbooks/supplemental/unsloth-llms-finetuning/playbook.json @@ -1,8 +1,8 @@ { "id": "unsloth-llms-finetuning", - "title": "Optimized LLMs Fine-tuning with Unsloth", + "title": "Fine-Tune LLMs with Unsloth", "description": "Use Unsloth for memory-efficient fine-tuned LLMs™", - "time": 60, + "time": 45, "unsupported_platforms": { }, "supported_platforms": { @@ -37,7 +37,7 @@ "windows" ] }, - "difficulty": "advanced", + "difficulty": "intermediate", "isNew": false, "isFeatured": false, "developed": true, diff --git a/playbooks/supplemental/vllm-inference/README.md b/playbooks/supplemental/vllm-inference/README.md index b685da88..cdebd0e6 100644 --- a/playbooks/supplemental/vllm-inference/README.md +++ b/playbooks/supplemental/vllm-inference/README.md @@ -10,21 +10,31 @@ SPDX-License-Identifier: MIT -# High-Performance LLM Inference with vLLM - ## Overview vLLM is a high-performance inference engine designed for large language models (LLMs). It provides optimized serving with continuous batching for high throughput and an OpenAI-compatible API for seamless application integration. This makes vLLM great for production deployments where speed and resource efficiency are critical. This playbook teaches you how to serve LLMs using containerized vLLM on the integrated GPU and interact with models through the OpenAI Python API. -## In This Playbook, You Will Learn +## What You'll Learn - How to set up and start a vLLM server with ROCm support - How to interact with models via OpenAI-compatible API endpoints - How to send prompts to the local server with `vllm-prompt` -## Starting vLLM +## Setting the Memory Configuration + + + + +## Check for Software Updates + +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. + + + + +## Installing Software Prerequisites This playbook uses a prebuilt container image that includes vLLM, ROCm support, and the helper scripts needed to launch the server. You do not need to install PyTorch, vLLM, or local playbook scripts manually. diff --git a/playbooks/supplemental/vllm-inference/playbook.json b/playbooks/supplemental/vllm-inference/playbook.json index 373fe4e4..e92cc9e8 100644 --- a/playbooks/supplemental/vllm-inference/playbook.json +++ b/playbooks/supplemental/vllm-inference/playbook.json @@ -1,8 +1,8 @@ { "id": "vllm-inference", - "title": "Quick Start on vLLM", + "title": "Getting Started with vLLM", "description": "Learn how to run inference and serving using containerized vLLM on the integrated GPU", - "time": 90, + "time": 30, "unsupported_platforms": { "halo": [ { "platform": "windows", "reason": "vLLM is not supported on Windows" } diff --git a/website/src/app/playbooks/[id]/page.tsx b/website/src/app/playbooks/[id]/page.tsx index bab83c37..d9f3a1f6 100644 --- a/website/src/app/playbooks/[id]/page.tsx +++ b/website/src/app/playbooks/[id]/page.tsx @@ -213,7 +213,7 @@ function HaloPreinstalledDropdown({ - Already pre-installed on your AMD Halo Developer Platform! + Already pre-installed on your AMD Ryzen AI Developer Platform! @@ -223,7 +223,7 @@ function HaloPreinstalledDropdown({ {isOpen && (
- This software comes pre-installed and configured on your AMD Halo Developer Platform. + This software comes pre-installed and configured on your AMD Ryzen AI Developer Platform. If you need to reinstall or configure it manually, follow the instructions below:
)} - {isHaloSelected && featuredPlaybook.category === "core" && ( + {isHaloSelected && featuredPlaybook.category === "core" && featuredPlaybook.id !== "lmstudio-rocm-llms" && ( )} @@ -322,7 +322,7 @@ export default function PlaybooksSection({ activeDevice, selectedDevice, onSelec
- {isHaloSelected && playbook.category === "core" && ( + {isHaloSelected && playbook.category === "core" && playbook.id !== "lmstudio-rocm-llms" && ( )} {playbook.isNew && ( diff --git a/website/src/types/playbook.ts b/website/src/types/playbook.ts index 8860875c..9ac2784d 100644 --- a/website/src/types/playbook.ts +++ b/website/src/types/playbook.ts @@ -52,7 +52,7 @@ * * ## Pre-installed Software Dropdowns * - * For software pre-installed on AMD Halo Developer Platform, use the `@require` tag + * For software pre-installed on AMD Ryzen AI Developer Platform, use the `@require` tag * to reference installation instructions from the central `dependencies/` folder: * * ```markdown