Hotel Receptionist Scenario Expansion by ShayneP · Pull Request #6186 · livekit/agents

ShayneP · 2026-06-22T18:43:13Z

This PR:

Extends the simulation set to 100 scenarios
- This also adds policies and tools to accomplish the tasks set out in these scenarios
Splits tools and instructions out into their own files so they're easier to grok
Optimizes the agent persona to answer as many of the scenarios correctly as possible

Comprehensive README explaining the architecture and design choices of the Hotel Receptionist
Performance tuning

Adds 11 new scenarios (check-out early, dinner move/cancel, wake-up move, red-eye hold, valuables/liability, local-area, callback-to-finish, hostile free-night, can't-verify change, room/floor confirm) plus the example logic they exercise: view-based room moves (agent.py, hotel_db.py, modify_booking.py) and restaurant-reservation modification (hotel_db.py), and two new policy docs (local_area.md, safe_deposit.md). Ported on top of the benchmark PR branch so the gradeable expected_state versions of shared scenarios are preserved.

devin-ai-integration

Devin Review found 1 new potential issue.

devin-ai-integration · 2026-06-24T19:19:16Z

+    try:
+        report_dict = report.to_dict()
+        report_dict["tags"] = sorted(ctx.tagger.tags)
+        report_dict["evaluations"] = ctx.tagger.evaluations
+        report_dict["outcome"] = ctx.tagger.outcome
+        report_dict["outcome_reason"] = ctx.tagger.outcome_reason
+        with open(os.path.join(report_dir, f"session_report-{room}.json"), "w") as f:
+            json.dump(report_dict, f, indent=2)
+    except Exception:
+        logger.exception("error dumping session report")


🚩 run_artifacts.py references potentially new SessionReport/Tagger API surface

dump_run_artifacts calls report.to_dict(), ctx.tagger.evaluations, ctx.tagger.outcome, and ctx.tagger.outcome_reason (run_artifacts.py:40-44). These may be newer SDK APIs not present in older versions. The entire block is wrapped in a try/except so a missing attribute wouldn't crash the session, but the artifact dump would silently fail. Worth verifying these APIs exist in the targeted SDK version.

Was this helpful? React with 👍 or 👎 to provide feedback.

devin-ai-integration

Devin Review found 1 new potential issue.

devin-ai-integration · 2026-06-24T19:39:18Z

+    async def start_restaurant_booking(self, ctx: RunContext[Userdata]) -> str | None:
+        """Start the restaurant-reservation flow. Call it the moment the caller wants a table - the flow collects date, party size, time, name, and phone itself. Its return is the FINAL result of the reservation: relay it and move on - nothing further to confirm or call afterwards."""
+        reservation = await BookRestaurantTask(
+            db=ctx.userdata.db, chat_ctx=speech_only(self.chat_ctx)
+        )
+        return (
+            f"You're set for {speak_time(reservation.time)} on "
+            f"{reservation.date.strftime('%A, %B %-d')} for "
+            f"{reservation.party_size} guest{'s' if reservation.party_size != 1 else ''}. "
+            f"Confirmation code: {_speak_code(reservation.code)}. "
+            "| reservation complete - relay this to the caller; no further tool call is needed."
+        )


🚩 Asymmetric duplicate-prevention: room bookings guarded but restaurant bookings are not

The PR adds a duplicate-prevention guard for start_room_booking (tools_rooms.py:183-195) using last_room_booking and caller_turns_at_last_booking in Userdata. No equivalent guard exists for start_restaurant_booking (tools_restaurant.py:50-61). The Userdata class in common.py has no last_restaurant_booking field. This is presumably intentional — the room booking flow is longer and more prone to model re-entry than the restaurant flow — but it creates an asymmetry. If the same re-entry problem occurs with restaurant bookings, it would silently double-book a table.

Was this helpful? React with 👍 or 👎 to provide feedback.

ShayneP added 3 commits June 16, 2026 16:08

Add scenarios

cbdc1ea

Scenarios up to 100

4caaf69

ShayneP requested a review from a team as a code owner June 22, 2026 18:43

This comment was marked as resolved.

Sign in to view

Fix Devin feedback

9f921d1

This comment was marked as resolved.

Sign in to view

More fixes from Devin

d10882d

This comment was marked as resolved.

Sign in to view

Add VAD

f4dea99

devin-ai-integration Bot reviewed Jun 24, 2026

View reviewed changes

Fix error type

5b28e2e

devin-ai-integration Bot reviewed Jun 24, 2026

View reviewed changes

tinalenguyen approved these changes Jun 24, 2026

View reviewed changes

ShayneP and others added 2 commits June 24, 2026 16:27

Ruff fixes

f744660

Merge branch 'main' into ShayneP/hotel-scenarios-2

a5ab976

tinalenguyen merged commit 04f3fdd into main Jun 24, 2026
22 of 23 checks passed

tinalenguyen deleted the ShayneP/hotel-scenarios-2 branch June 24, 2026 20:33

rosetta-livekit-bot Bot mentioned this pull request Jun 24, 2026

Add hotel receptionist scenario example livekit/agents-js#1877

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Hotel Receptionist Scenario Expansion#6186

Hotel Receptionist Scenario Expansion#6186
tinalenguyen merged 9 commits into
mainfrom
ShayneP/hotel-scenarios-2

ShayneP commented Jun 22, 2026

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot Jun 24, 2026

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot Jun 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

ShayneP commented Jun 22, 2026

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants