diff --git a/playbooks/core/comfyui-image-gen/README.md b/playbooks/core/comfyui-image-gen/README.md index 2645f74d..61792b91 100644 --- a/playbooks/core/comfyui-image-gen/README.md +++ b/playbooks/core/comfyui-image-gen/README.md @@ -22,7 +22,17 @@ This tutorial teaches you how to use ComfyUI with the Z Image Turbo model on you - Generating images and tuning generation parameters - Saving and sharing workflows -## Installing Dependencies +## Setting the Memory Configuration + + + + +## Check for Software Updates + + + + +## Installing Software Prerequisites @@ -31,18 +41,24 @@ This tutorial teaches you how to use ComfyUI with the Z Image Turbo model on you +**Grant your user access to GPU devices** (log out and back in for this to take effect): + +```bash +sudo usermod -aG render,video $LOGNAME +``` + #### Create a Virtual Environment On Linux, open a terminal in the directory of your choice and run the following prompt to create a venv: ```bash sudo apt update sudo apt install -y python3-venv -python3 -m venv llm-env -source llm-env/bin/activate +python3 -m venv comfyui-env +source comfyui-env/bin/activate ``` - + @@ -271,8 +287,6 @@ echo "OK: ComfyUI server is reachable!" - - ## Launching ComfyUI diff --git a/playbooks/core/comfyui-image-gen/playbook.json b/playbooks/core/comfyui-image-gen/playbook.json index 13cf0d38..851d8759 100644 --- a/playbooks/core/comfyui-image-gen/playbook.json +++ b/playbooks/core/comfyui-image-gen/playbook.json @@ -1,6 +1,6 @@ { "id": "comfyui-image-gen", - "title": "Generating images with ComfyUI and Z Image Turbo", + "title": "Generate Images with ComfyUI", "description": "Create stunning AI-generated images using ComfyUI with Z Image Turbo.", "time": 15, "unsupported_platforms": { diff --git a/playbooks/core/lmstudio-rocm-llms/README.md b/playbooks/core/lmstudio-rocm-llms/README.md index b5a20992..5b9fdb44 100644 --- a/playbooks/core/lmstudio-rocm-llms/README.md +++ b/playbooks/core/lmstudio-rocm-llms/README.md @@ -20,9 +20,20 @@ LM Studio is a powerful GUI-based wrapper for [llama.cpp](https://github.com/ggm - Serve models via OpenAI Compatible API to power custom workflows and apps -## Installing Dependencies +## Setting the Memory Configuration - + + + +## Check for Software Updates +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. + + + + +## Installing Software Prerequisites + + ## Downloading Models @@ -120,54 +131,60 @@ This model will now be accessible through the LM Studio Server endpoint and will Having just created the OpenAI Compatible endpoint, let's look at how to integrate this into a Python developer environment (such as VSCode) and use your system as a local API Provider. 1. Create a Python virtual environment: + + - - On Windows, open a terminal in the directory of your choice and follow the commands to create a venv. + On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. ```bash - python -m venv llm-env --system-site-packages - llm-env\Scripts\activate + sudo apt update + sudo apt install -y python3-venv + python3 -m venv lmstudio-env --system-site-packages + source lmstudio-env/bin/activate ``` + - > **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. - > setting it to RemoteSigned or Unrestricted) before running some Powershell commands. + +**Grant your user access to GPU devices** (log out and back in for this to take effect): - +```bash +sudo usermod -aG render,video $LOGNAME +``` - On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. ```bash sudo apt update sudo apt install -y python3-venv - python3 -m venv llm-env --system-site-packages - source llm-env/bin/activate + python3 -m venv lmstudio-env + source lmstudio-env/bin/activate ``` - + - - + On Windows, open a terminal in the directory of your choice and follow the commands to create a venv. ```bash - python -m venv llm-env - llm-env\Scripts\activate + python -m venv lmstudio-env --system-site-packages + lmstudio-env\Scripts\activate ``` > **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. > setting it to RemoteSigned or Unrestricted) before running some Powershell commands. - + - - On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. + + On Windows, open a terminal in the directory of your choice and follow the commands to create a venv. ```bash - sudo apt update - sudo apt install -y python3-venv - python3 -m venv llm-env - source llm-env/bin/activate + python -m venv lmstudio-env + lmstudio-env\Scripts\activate ``` - + + > **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. + > setting it to RemoteSigned or Unrestricted) before running some Powershell commands. + + 2. Install the OpenAI package ```bash diff --git a/playbooks/core/lmstudio-rocm-llms/playbook.json b/playbooks/core/lmstudio-rocm-llms/playbook.json index 0891089c..470ec007 100644 --- a/playbooks/core/lmstudio-rocm-llms/playbook.json +++ b/playbooks/core/lmstudio-rocm-llms/playbook.json @@ -1,6 +1,6 @@ { "id": "lmstudio-rocm-llms", - "title": "Running and serving LLMs with LM Studio", + "title": "Run and Serve LLMs with LM Studio", "description": "Set up LM Studio and LM Studio Server to run and serve large language models locally.", "time": 30, "supported_platforms": { diff --git a/playbooks/core/n8n-automation-gpt-oss/README.md b/playbooks/core/n8n-automation-gpt-oss/README.md index 4e9e6caa..26bd8b71 100644 --- a/playbooks/core/n8n-automation-gpt-oss/README.md +++ b/playbooks/core/n8n-automation-gpt-oss/README.md @@ -35,7 +35,17 @@ In this playbook, we use Lemonade to serve a local LLM that n8n connects to for n8n includes a **native Lemonade node** (`Lemonade Chat Model`) that provides a first-class integration - no need for manual configuration. This makes connecting your local LLM to automation workflows straightforward. -## Prerequisites +## Setting the Memory Configuration + + + + +## Check for Software Updates + + + + +## Installing Software Prerequisites diff --git a/playbooks/core/n8n-automation-gpt-oss/assets/compose.yml b/playbooks/core/n8n-automation-gpt-oss/assets/compose.yml index 6fddb137..61d8ddb3 100644 --- a/playbooks/core/n8n-automation-gpt-oss/assets/compose.yml +++ b/playbooks/core/n8n-automation-gpt-oss/assets/compose.yml @@ -1,6 +1,6 @@ services: n8n: - image: docker.n8n.io/n8nio/n8n + image: docker.io/n8nio/n8n container_name: n8n restart: unless-stopped network_mode: host diff --git a/playbooks/core/n8n-automation-gpt-oss/playbook.json b/playbooks/core/n8n-automation-gpt-oss/playbook.json index bc04c738..41269177 100644 --- a/playbooks/core/n8n-automation-gpt-oss/playbook.json +++ b/playbooks/core/n8n-automation-gpt-oss/playbook.json @@ -2,7 +2,7 @@ "id": "n8n-automation-gpt-oss", "title": "Automating Workflows with n8n and Local LLMs", "description": "Build an AI-powered news summarizer using n8n and Lemonade.", - "time": 30, + "time": 45, "supported_platforms": { "halo": [ "linux", @@ -59,7 +59,7 @@ "windows" ] }, - "difficulty": "beginner", + "difficulty": "intermediate", "isNew": false, "isFeatured": false, "developed": true, diff --git a/playbooks/core/pytorch-rocm-llms/README.md b/playbooks/core/pytorch-rocm-llms/README.md index d71a49b1..2b5e1606 100644 --- a/playbooks/core/pytorch-rocm-llms/README.md +++ b/playbooks/core/pytorch-rocm-llms/README.md @@ -24,74 +24,90 @@ This tutorial uses PyTorch powered by AMD ROCm™ software to run models that ca - Run LLMs like gpt-oss-20b and qwen3.5-4B locally using PyTorch and ROCm - Create a document summarization tool using LLMs -## Initial Setup +## Setting the Memory Configuration + + + + +## Check for Software Updates +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. + + + + +## Installing Software Prerequisites ### Create a Virtual Environment + - -On Windows, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed. - +On Linux, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed. + ```bash -python -m venv llm-env --system-site-packages -llm-env\Scripts\activate +sudo apt update +sudo apt install -y python3-venv +python3 -m venv pytorch-env --system-site-packages +source pytorch-env/bin/activate ``` + + -> **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. -> setting it to RemoteSigned or Unrestricted) before running some Powershell commands. + +**Grant your user access to GPU devices** (log out and back in for this to take effect): - - +```bash +sudo usermod -aG render,video $LOGNAME +``` - -On Linux, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed. +On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. ```bash sudo apt update sudo apt install -y python3-venv -python3 -m venv llm-env --system-site-packages -source llm-env/bin/activate +python3 -m venv pytorch-env +source pytorch-env/bin/activate ``` - - + + - - -On Windows, open a terminal in the directory of your choice and follow the commands to create a venv. + +On Windows, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed. ```bash -python -m venv llm-env -llm-env\Scripts\activate +python -m venv pytorch-env --system-site-packages +pytorch-env\Scripts\activate ``` > **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. > setting it to RemoteSigned or Unrestricted) before running some Powershell commands. - - + + - -On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. - + +On Windows, open a terminal in the directory of your choice and follow the commands to create a venv. + ```bash -sudo apt update -sudo apt install -y python3-venv -python3 -m venv llm-env -source llm-env/bin/activate +python -m venv pytorch-env +pytorch-env\Scripts\activate ``` - - + +> **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. +> setting it to RemoteSigned or Unrestricted) before running some Powershell commands. + + + ### Installing Basic Dependencies - + diff --git a/playbooks/core/pytorch-rocm-llms/playbook.json b/playbooks/core/pytorch-rocm-llms/playbook.json index b3f98f06..865631e0 100644 --- a/playbooks/core/pytorch-rocm-llms/playbook.json +++ b/playbooks/core/pytorch-rocm-llms/playbook.json @@ -1,6 +1,6 @@ { "id": "pytorch-rocm-llms", - "title": "Running LLMs with PyTorch and AMD ROCm™ software", + "title": "Run LLMs with PyTorch and ROCm™", "description": "Learn to run powerful language models on your PC with PyTorch and AMD ROCm™ software to summarize documents quickly and easily.", "time": 30, "supported_platforms": { diff --git a/playbooks/core/vscode-qwen3-coder/README.md b/playbooks/core/vscode-qwen3-coder/README.md index 677e14a9..ce8c96f2 100644 --- a/playbooks/core/vscode-qwen3-coder/README.md +++ b/playbooks/core/vscode-qwen3-coder/README.md @@ -9,8 +9,6 @@ SPDX-License-Identifier: MIT > This playbook uses special tags that GitHub cannot render. Please visit [amd.com/playbooks](https://amd.com/playbooks) to correctly preview this content. -# Local LLM coding with VSCode and Qwen3-Coder-30B - > [!NOTE] > This playbook requires a minimum of **32GB** of system memory. @@ -28,7 +26,18 @@ This tutorial demonstrates how to use Cline, VS Code, and LM Studio to run a cod * How to configure Cline to communicate with LM Studio for local inference of coding agents. * How to use local coding agents to solve real-world software engineering tasks. -## Core Dependencies +## Setting the Memory Configuration + + + + +## Check for Software Updates + +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. + + + +## Installing Software Prerequisites diff --git a/playbooks/core/vscode-qwen3-coder/playbook.json b/playbooks/core/vscode-qwen3-coder/playbook.json index 38aeab9a..25a1cca3 100644 --- a/playbooks/core/vscode-qwen3-coder/playbook.json +++ b/playbooks/core/vscode-qwen3-coder/playbook.json @@ -2,7 +2,7 @@ "id": "vscode-qwen3-coder", "title": "Local LLM Coding with VS Code and Qwen3-Coder", "description": "Use VS Code with locally-running Qwen3-Coder for private code assistance.", - "time": 60, + "time": 30, "supported_platforms": { "halo": [ "linux", @@ -59,7 +59,7 @@ "linux" ] }, - "difficulty": "intermediate", + "difficulty": "beginner", "isNew": false, "isFeatured": false, "developed": true, diff --git a/playbooks/dependencies/assets/memory-config/adrenalin_vram_new.png b/playbooks/dependencies/assets/memory-config/adrenalin_vram_new.png new file mode 100644 index 00000000..be26aaa9 Binary files /dev/null and b/playbooks/dependencies/assets/memory-config/adrenalin_vram_new.png differ diff --git a/playbooks/dependencies/assets/memory-config/linux_mem_new.png b/playbooks/dependencies/assets/memory-config/linux_mem_new.png new file mode 100644 index 00000000..e4a691f8 Binary files /dev/null and b/playbooks/dependencies/assets/memory-config/linux_mem_new.png differ diff --git a/playbooks/dependencies/assets/software-update/linux.png b/playbooks/dependencies/assets/software-update/linux.png new file mode 100644 index 00000000..99ea1465 Binary files /dev/null and b/playbooks/dependencies/assets/software-update/linux.png differ diff --git a/playbooks/dependencies/assets/software-update/windows.png b/playbooks/dependencies/assets/software-update/windows.png new file mode 100644 index 00000000..24654d11 Binary files /dev/null and b/playbooks/dependencies/assets/software-update/windows.png differ diff --git a/playbooks/dependencies/driver.md b/playbooks/dependencies/driver.md index 232a88ec..b2b16f1e 100644 --- a/playbooks/dependencies/driver.md +++ b/playbooks/dependencies/driver.md @@ -4,6 +4,8 @@ Copyright Advanced Micro Devices, Inc. SPDX-License-Identifier: MIT --> + + ### AMD GPU Driver @@ -31,3 +33,5 @@ Download and install the latest AMD GPU driver for Linux: + + diff --git a/playbooks/dependencies/memoryconfig.md b/playbooks/dependencies/memoryconfig.md index 807c4b63..598115ad 100644 --- a/playbooks/dependencies/memoryconfig.md +++ b/playbooks/dependencies/memoryconfig.md @@ -4,47 +4,79 @@ Copyright Advanced Micro Devices, Inc. SPDX-License-Identifier: MIT --> -### Memory configuration for running large models - -On Windows, to run larger models that require higher memory, we need to use the AMD Variable Graphics Memory (iGPU VRAM) allocation. Although 64 GB is adequate for most workloads, running the largest models with high context may require 96 GB. +On Windows, to run larger models, increase the **dedicated GPU memory** allocation. + + + +On the Ryzen AI Halo, the default is 64GB dedicated. To modify this, open **AMD Software: Adrenalin Edition™** and navigate to **Performance → Tuning → AMD Variable Graphics Memory**. Reboot for the changes to take effect. + +
+
+
+
+
+
+
+
+
+
+
-`. The LoRA and full fine-tuning scripts use **databricks/databricks-dolly-15k** (general instruction/response pairs), so the exact task varies by script; the idea is the same - adapt the model to your chosen dataset and format. + +Below is a summary of the available training methods. Each method links to its script and provides a brief description for choosing the right approach. + +| Script | Method | Description | Typical VRAM | Recommended For | +|-----------------------------------|-------------------|---------------------------------------------------------------------------------------------------------------------|--------------|-------------------------------------------------| +| [`train_lora.py`](assets/train_lora.py) | **LoRA** | Trains small adapter matrices while freezing base model. 3–5x faster; ~95–98% full quality. | 24–32GB | Advanced users; multiple adapters; more VRAM | +| [`train_qlora.py`](assets/train_qlora.py) *(Linux only)* | **QLoRA** | 4-bit quantization + LoRA adapters. Lowest memory use, fastest, small quality trade-off. Requires `bitsandbytes` (Linux only). | 12–16GB | Most users; fast experiments; limited VRAM | +| [`train_full_finetuning.py`](assets/train_full_finetuning.py) | **Full Fine-tuning** | Updates all model parameters. Maximum quality; highest memory and compute usage. | 40GB+ | Maximum quality; research; large VRAM | + --- ## Understanding the Techniques diff --git a/playbooks/supplemental/pytorch-kernels/README.md b/playbooks/supplemental/pytorch-kernels/README.md index 4c6ecd9b..47ac7053 100644 --- a/playbooks/supplemental/pytorch-kernels/README.md +++ b/playbooks/supplemental/pytorch-kernels/README.md @@ -9,8 +9,6 @@ SPDX-License-Identifier: MIT > This playbook uses special tags that GitHub cannot render. Please visit [amd.com/playbooks](https://amd.com/playbooks) to correctly preview this content. -# Compile your own GPU kernels for PyTorch + AMD ROCm™ Software - ## Overview Write a GPU kernel from scratch, compile it, launch it on an AMD GPU, and watch utilization spike. This playbook shows how GPU computation actually works: write the kernel code, and execute it in parallel across thousands of threads. @@ -103,24 +101,34 @@ AMD GPUs execute threads in groups of **32** called **wavefronts**. All threads --- -## Installing Software Prerequisites +### PyTorch + AMD/HIP -### Create a Virtual Environment +PyTorch ships a ROCm build where the CUDA API surface (`torch.cuda.*`) is transparently backed by HIP. This means: + +- `torch.cuda.is_available()` works on AMD GPUs with ROCm +- `tensor.to("cuda")` allocates on the AMD GPU +- `torch.version.hip` exposes the HIP version + +PyTorch also exposes `torch.cuda._compile_kernel()`, a high-level shortcut to JIT-compile a raw kernel string and get back a callable, without needing a separate build step. + +--- - -On Windows, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed. -```bash -python -m venv kernel-env --system-site-packages -kernel-env\Scripts\activate -``` +## Check for Software Updates -> **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. -> setting it to RemoteSigned or Unrestricted) before running some Powershell commands. + + +## Installing Software Prerequisites + +### Prerequisites - Windows +- Install latest: [AMD Adrenalin Software](https://www.amd.com/en/products/software/adrenalin.html) +### Create a Virtual Environment + + On Linux, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed. ```bash sudo apt update @@ -128,10 +136,25 @@ sudo apt install -y python3-venv python3 -m venv kernel-env --system-site-packages source kernel-env/bin/activate ``` - +**Grant your user access to GPU devices** (log out and back in for this to take effect): + +```bash +sudo usermod -aG render,video $LOGNAME +``` + +On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. +```bash +sudo apt update +sudo apt install -y python3-venv +python3 -m venv kernel-env +source kernel-env/bin/activate +``` + + + On Windows, open a terminal in the directory of your choice and follow the commands to create a venv. ```powershell @@ -144,17 +167,6 @@ kernel-env\Scripts\activate - -On Linux, open a terminal in the directory of your choice and follow the commands to create a venv. -```bash -sudo apt update -sudo apt install -y python3-venv -python3 -m venv kernel-env -source kernel-env/bin/activate -``` - - - ### Installing Basic Dependencies diff --git a/playbooks/supplemental/pytorch-kernels/playbook.json b/playbooks/supplemental/pytorch-kernels/playbook.json index 3389fdac..69bfcd28 100644 --- a/playbooks/supplemental/pytorch-kernels/playbook.json +++ b/playbooks/supplemental/pytorch-kernels/playbook.json @@ -2,7 +2,7 @@ "id": "pytorch-kernels", "title": "Custom GPU Kernels with PyTorch and AMD ROCm\u2122", "description": "Write and optimize custom GPU kernels using PyTorch and AMD ROCm\u2122 software on AMD Ryzen\u2122 AI", - "time": 120, + "time": 60, "supported_platforms": { "halo": [ "linux" diff --git a/playbooks/supplemental/speech2speech-translation/README.md b/playbooks/supplemental/speech2speech-translation/README.md index 5fcc05ca..5a5725ae 100644 --- a/playbooks/supplemental/speech2speech-translation/README.md +++ b/playbooks/supplemental/speech2speech-translation/README.md @@ -9,8 +9,6 @@ SPDX-License-Identifier: MIT > This playbook uses special tags that GitHub cannot render. Please visit [amd.com/playbooks](https://amd.com/playbooks) to correctly preview this content. -# Live Speech2Speech Translation on AMD Radeon™ GPU - ## Overview The AMD ROCm™ software and PyTorch stack create a unified ecosystem for on-device AI. It works for both Windows and Linux with official support for a wide range of devices including Ryzen™ AI APUs and Radeon™ GPUs. @@ -29,28 +27,23 @@ This playbook will teach you how to run low-latency, expressive, and private spe - Conveys tone, emotion, and intent without awkward pauses - Enables global collaboration and faster decision-making -## Setting Up Your Environment +## Setting the Memory Configuration -### Create a Virtual Environment + - -On Windows, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed: +## Check for Software Updates +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. - -```bash -python -m venv s2st-env --system-site-packages -s2st-env\Scripts\activate -``` - - + + -> **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. -> setting it to RemoteSigned or Unrestricted) before running some Powershell commands. +## Installing Software Prerequisites - +### Create a Virtual Environment + On Linux, open a terminal and run the following prompt to create a venv with ROCm+Pytorch already installed: @@ -62,16 +55,36 @@ source s2st-env/bin/activate ``` - +**Grant your user access to GPU devices** (log out and back in for this to take effect): + +```bash +sudo usermod -aG render,video $LOGNAME +``` + +On Linux, open a terminal and run the following prompt to create a venv: + + +```bash +sudo apt update +sudo apt install -y python3-venv +python3 -m venv s2st-env +source s2st-env/bin/activate +``` + + + + + -On Windows, open a terminal in the directory of your choice and follow the commands to create a venv: + +On Windows, open a terminal in the directory of your choice and follow the commands to create a venv with ROCm+Pytorch already installed: ```bash -python -m venv s2st-env +python -m venv s2st-env --system-site-packages s2st-env\Scripts\activate ``` @@ -80,26 +93,28 @@ s2st-env\Scripts\activate > **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. > setting it to RemoteSigned or Unrestricted) before running some Powershell commands. - + - -On Linux, open a terminal and run the following prompt to create a venv: + +On Windows, open a terminal in the directory of your choice and follow the commands to create a venv: - + ```bash -sudo apt update -sudo apt install -y python3-venv -python3 -m venv s2st-env -source s2st-env/bin/activate +python -m venv s2st-env +s2st-env\Scripts\activate ``` - - + + +> **Tip**: Windows users may need to modify their PowerShell Execution Policy (e.g. +> setting it to RemoteSigned or Unrestricted) before running some Powershell commands. + + ### Installing Basic Dependencies - + ### Additional Dependencies diff --git a/playbooks/supplemental/speech2speech-translation/playbook.json b/playbooks/supplemental/speech2speech-translation/playbook.json index 08c82cf9..711c95f1 100644 --- a/playbooks/supplemental/speech2speech-translation/playbook.json +++ b/playbooks/supplemental/speech2speech-translation/playbook.json @@ -1,8 +1,8 @@ { "id": "speech2speech-translation", - "title": "Speech-to-Speech Translation", + "title": "Real-Time Speech-to-Speech Translation", "description": "Build a real-time speech-to-speech translation on your local hardware.", - "time": 20, + "time": 30, "supported_platforms": { "halo": [ "linux", diff --git a/playbooks/supplemental/unsloth-llms-finetuning/README.md b/playbooks/supplemental/unsloth-llms-finetuning/README.md index 74b3f281..f1300582 100644 --- a/playbooks/supplemental/unsloth-llms-finetuning/README.md +++ b/playbooks/supplemental/unsloth-llms-finetuning/README.md @@ -31,6 +31,17 @@ In this playbook, we use Unsloth together with **LoRA-based SFT**. That means th Unsloth also supports other training approaches, including QLoRA and reinforcement learning workflows. This playbook focuses on the simplest path first: a small LoRA fine-tuning example that users can run, understand, and extend. +## Setting the Memory Configuration + + + + +## Check for Software Updates +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. + + + + ## Installing Software Prerequisites ### Create a Virtual Environment @@ -71,16 +82,26 @@ source unsloth-env/bin/activate > **Note:** Python 3.13 is required for Windows. + +Open a PowerShell terminal and create a virtual environment: + +```powershell +python -m venv unsloth-env --system-site-packages +.\unsloth-env\Scripts\activate +``` + + + Open a PowerShell terminal and create a virtual environment: ```powershell -python -m venv unsloth_env -.\unsloth_env\Scripts\activate +python -m venv unsloth-env +.\unsloth-env\Scripts\activate ``` + ### Installing Basic Dependencies - diff --git a/playbooks/supplemental/unsloth-llms-finetuning/playbook.json b/playbooks/supplemental/unsloth-llms-finetuning/playbook.json index 0196e58b..46a84f05 100644 --- a/playbooks/supplemental/unsloth-llms-finetuning/playbook.json +++ b/playbooks/supplemental/unsloth-llms-finetuning/playbook.json @@ -1,8 +1,8 @@ { "id": "unsloth-llms-finetuning", - "title": "Optimized LLMs Fine-tuning with Unsloth", + "title": "Fine-Tune LLMs with Unsloth", "description": "Use Unsloth for memory-efficient fine-tuned LLMs™", - "time": 60, + "time": 45, "unsupported_platforms": { }, "supported_platforms": { @@ -37,7 +37,7 @@ "windows" ] }, - "difficulty": "advanced", + "difficulty": "intermediate", "isNew": false, "isFeatured": false, "developed": true, diff --git a/playbooks/supplemental/vllm-inference/README.md b/playbooks/supplemental/vllm-inference/README.md index b685da88..cdebd0e6 100644 --- a/playbooks/supplemental/vllm-inference/README.md +++ b/playbooks/supplemental/vllm-inference/README.md @@ -10,21 +10,31 @@ SPDX-License-Identifier: MIT -# High-Performance LLM Inference with vLLM - ## Overview vLLM is a high-performance inference engine designed for large language models (LLMs). It provides optimized serving with continuous batching for high throughput and an OpenAI-compatible API for seamless application integration. This makes vLLM great for production deployments where speed and resource efficiency are critical. This playbook teaches you how to serve LLMs using containerized vLLM on the integrated GPU and interact with models through the OpenAI Python API. -## In This Playbook, You Will Learn +## What You'll Learn - How to set up and start a vLLM server with ROCm support - How to interact with models via OpenAI-compatible API endpoints - How to send prompts to the local server with `vllm-prompt` -## Starting vLLM +## Setting the Memory Configuration + + + + +## Check for Software Updates + +> **Note**: If VSCode is not installed, you can install it with Ryzen AI Developer Center. + + + + +## Installing Software Prerequisites This playbook uses a prebuilt container image that includes vLLM, ROCm support, and the helper scripts needed to launch the server. You do not need to install PyTorch, vLLM, or local playbook scripts manually. diff --git a/playbooks/supplemental/vllm-inference/playbook.json b/playbooks/supplemental/vllm-inference/playbook.json index 373fe4e4..e92cc9e8 100644 --- a/playbooks/supplemental/vllm-inference/playbook.json +++ b/playbooks/supplemental/vllm-inference/playbook.json @@ -1,8 +1,8 @@ { "id": "vllm-inference", - "title": "Quick Start on vLLM", + "title": "Getting Started with vLLM", "description": "Learn how to run inference and serving using containerized vLLM on the integrated GPU", - "time": 90, + "time": 30, "unsupported_platforms": { "halo": [ { "platform": "windows", "reason": "vLLM is not supported on Windows" } diff --git a/website/src/app/playbooks/[id]/page.tsx b/website/src/app/playbooks/[id]/page.tsx index bab83c37..d9f3a1f6 100644 --- a/website/src/app/playbooks/[id]/page.tsx +++ b/website/src/app/playbooks/[id]/page.tsx @@ -213,7 +213,7 @@ function HaloPreinstalledDropdown({ - Already pre-installed on your AMD Halo Developer Platform! + Already pre-installed on your AMD Ryzen AI Developer Platform!