Skip to content

agents-workshop: fix agent_setup failing on deprecated claude-3-7-sonnet PPT#33

Open
Incognico-o wants to merge 1 commit into
mainfrom
fix/agent-setup-ppt-model-swap
Open

agents-workshop: fix agent_setup failing on deprecated claude-3-7-sonnet PPT#33
Incognico-o wants to merge 1 commit into
mainfrom
fix/agent-setup-ppt-model-swap

Conversation

@Incognico-o

Copy link
Copy Markdown
Contributor

Summary

  • databricks-claude-3-7-sonnet was removed from pay-per-token serving, causing the agents-workshop/agent_setup/agent.py notebook to fail at deploy time with:
    NotFoundError: 404 - {'error_code': 'NOT_FOUND', 'message': 'Pay-per-token for this model is disabled. Please see Databricks documentation for supported Foundation Models.'}
    
  • Swapped the active LLM_ENDPOINT_NAME to databricks-gpt-oss-120b — the same model used in Part 2 (02_agent_eval/agent.py), so the full workshop now runs on a single consistent model.
  • Left databricks-claude-sonnet-4-6 commented out as a quick Claude fallback alongside the existing databricks-meta-llama-3-3-70b-instruct fallback.

Why gpt-oss-120b

  • Confirmed live on PPT in e2-demo-field-eng (and across the FE workspaces lab attendees use).
  • Matches the model already shipping in 02_agent_eval/agent.py — students see one model end-to-end.
  • Supports tool calling, which the LangGraph bind_tools(tools) path requires.

Test plan

  • Run agents-workshop/agent_setup/agent_setup.py end-to-end and confirm mlflow.pyfunc.log_model(...) succeeds.
  • Confirm agents.deploy(...) produces a working agents_agents_lab-product-product_agent endpoint.
  • Sanity-query the deployed endpoint with the existing input example ("What color options are available for the Aria Modern Bookshelf?") and confirm it tool-calls into the vector search retriever.

This pull request and its description were written by Isaac.

databricks-claude-3-7-sonnet was removed from pay-per-token serving,
causing the agent_setup notebook to fail with:
  NotFoundError: 404 Pay-per-token for this model is disabled.

Switched to databricks-gpt-oss-120b to match the model used in Part 2
of the workshop (02_agent_eval) — keeps the whole lab on one model.
Left databricks-claude-sonnet-4-6 commented as a Claude fallback
alongside the existing meta-llama-3-3-70b-instruct fallback.

Co-authored-by: Isaac
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant