Chat template from chat_template.jinja for all possible paths #4055

dkalinowski wants to merge 3 commits into main
Conversation
694d391 to 1a7437e
Pull request overview
This PR updates the LLM/VLM servable initialization flow to allow overriding the tokenizer chat template from a chat_template.jinja file located in the model path, making that override available across multiple pipeline initializers.
Changes:
- Add logic to detect and read `chat_template.jinja` from the model path and call `tokenizer.set_chat_template(...)`.
- Add the `<fstream>` include where needed to support reading the template file.
- Apply the same override behavior across the legacy LM, continuous batching LM, and legacy VLM initializers.
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 6 comments.
| File | Description |
|---|---|
| src/llm/visual_language_model/legacy/servable_initializer.cpp | Reads chat_template.jinja (if present) after creating the VLM pipeline/tokenizer and applies it to the tokenizer. |
| src/llm/language_model/legacy/servable_initializer.cpp | Reads chat_template.jinja (if present) after creating the legacy LLM pipeline/tokenizer and applies it to the tokenizer. |
| src/llm/language_model/continuous_batching/servable_initializer.cpp | Reads chat_template.jinja (if present) after creating the CB pipeline/tokenizer and applies it to the tokenizer. |
```cpp
if (chatTemplateFile.is_open()) {
    std::string chatTemplateContent((std::istreambuf_iterator<char>(chatTemplateFile)),
                                    std::istreambuf_iterator<char>());
    if (!chatTemplateContent.empty()) {
```
This block uses std::istreambuf_iterator (and std::filesystem::path), but the file doesn’t include the corresponding standard headers (<iterator> / <filesystem>). Please add the missing includes to avoid relying on transitive headers (can break on different libstdc++/libc++ versions).
```cpp
// Override chat template from chat_template.jinja file if present in model directory
std::filesystem::path chatTemplateJinjaPath = std::filesystem::path(parsedModelsPath) / "chat_template.jinja";
if (std::filesystem::exists(chatTemplateJinjaPath)) {
    std::ifstream chatTemplateFile(chatTemplateJinjaPath);
    if (chatTemplateFile.is_open()) {
        std::string chatTemplateContent((std::istreambuf_iterator<char>(chatTemplateFile)),
                                        std::istreambuf_iterator<char>());
        if (!chatTemplateContent.empty()) {
            properties->tokenizer.set_chat_template(chatTemplateContent);
            SPDLOG_LOGGER_DEBUG(llm_calculator_logger, "Loaded custom chat template from: {}", chatTemplateJinjaPath.string());
        }
    } else {
        SPDLOG_LOGGER_WARN(llm_calculator_logger, "Failed to open chat template file: {}", chatTemplateJinjaPath.string());
    }
}
```
The same “read chat_template.jinja and set tokenizer chat template” logic is duplicated across multiple servable initializers in this PR. To prevent drift (e.g., different logging/edge-case handling), consider extracting it into a shared helper (e.g., on GenAiServableInitializer).
```cpp
properties->tokenizer = properties->pipeline->get_tokenizer();

// Override chat template from chat_template.jinja file if present in model directory
std::filesystem::path chatTemplateJinjaPath = std::filesystem::path(parsedModelsPath) / "chat_template.jinja";
```
chatTemplateJinjaPath is derived as parsedModelsPath / "chat_template.jinja". Since parseModelsPath() accepts .gguf files as a valid models_path, this won’t locate a template stored next to a GGUF file (it will incorrectly search under <file>.gguf/chat_template.jinja). Consider basing the search directory on parsedModelsPath if it’s a directory, otherwise use parent_path().
Suggested change:

```diff
-std::filesystem::path chatTemplateJinjaPath = std::filesystem::path(parsedModelsPath) / "chat_template.jinja";
+std::filesystem::path modelsPathFs(parsedModelsPath);
+std::filesystem::path chatTemplateDir = modelsPathFs;
+if (!chatTemplateDir.empty() && !std::filesystem::is_directory(chatTemplateDir)) {
+    chatTemplateDir = chatTemplateDir.parent_path();
+}
+std::filesystem::path chatTemplateJinjaPath = chatTemplateDir / "chat_template.jinja";
```
```cpp
if (chatTemplateFile.is_open()) {
    std::string chatTemplateContent((std::istreambuf_iterator<char>(chatTemplateFile)),
                                    std::istreambuf_iterator<char>());
    if (!chatTemplateContent.empty()) {
```
This code uses std::istreambuf_iterator, but there is no explicit #include <iterator> in this translation unit. Please add it to ensure the file compiles without relying on indirect includes.
```cpp
std::filesystem::path chatTemplateJinjaPath = std::filesystem::path(parsedModelsPath) / "chat_template.jinja";
if (std::filesystem::exists(chatTemplateJinjaPath)) {
    std::ifstream chatTemplateFile(chatTemplateJinjaPath);
    if (chatTemplateFile.is_open()) {
        std::string chatTemplateContent((std::istreambuf_iterator<char>(chatTemplateFile)),
                                        std::istreambuf_iterator<char>());
        if (!chatTemplateContent.empty()) {
```
This block uses std::istreambuf_iterator / std::filesystem::path, but the file doesn’t include the corresponding standard headers (<iterator> / <filesystem>). Please add the missing includes to avoid relying on transitive headers.
```cpp
properties->tokenizer = properties->pipeline->get_tokenizer();

// Override chat template from chat_template.jinja file if present in model directory
std::filesystem::path chatTemplateJinjaPath = std::filesystem::path(parsedModelsPath) / "chat_template.jinja";
```
chatTemplateJinjaPath is built by appending chat_template.jinja to parsedModelsPath. However parseModelsPath() allows models_path to be either a directory or a .gguf file; when it’s a file, this will look for <file>.gguf/chat_template.jinja and never find the template next to the model. Consider resolving the “template directory” as parsedModelsPath when it’s a directory, otherwise parent_path() (and use that for the lookup).
Suggested change:

```diff
-std::filesystem::path chatTemplateJinjaPath = std::filesystem::path(parsedModelsPath) / "chat_template.jinja";
+std::filesystem::path modelsPathFs(parsedModelsPath);
+std::filesystem::path chatTemplateDir =
+    std::filesystem::is_directory(modelsPathFs) ? modelsPathFs : modelsPathFs.parent_path();
+std::filesystem::path chatTemplateJinjaPath = chatTemplateDir / "chat_template.jinja";
```