Dev/eric/agent #43

KeplerC · 2025-10-11T00:15:24Z

Summary 📝

Write an overview about it.

Details

Describe more what you did on changes.

(...)
(...)

Bugfixes 🐛 (delete if didn't have any)

Checks

Closed #798
Tested Changes
Stakeholder Approval

…ng, implement parallel loading, and enhance filtering with Executor. Update VLM model to Qwen/Qwen2.5-VL-7B-Instruct across relevant components.

… input/output file handling, and creating a dedicated output directory for analysis results.

…o RoboDM format using Ray. Introduce DROIDProcessor class for managing trajectory processing, and update the main execution flow for improved efficiency. Modify droid_vlm_demo to streamline VLM prompt for success detection.

…omponents, streamline droid_vlm_demo by removing unused methods and comments, and enhance dataset processing with lazy loading. Adjust filtering logic and improve output handling for better performance.

…cy instead of F1 score. Update method names, print statements, and summary output accordingly for clarity and consistency.

…gging Face cache. Modify VLM prompt in droid_vlm_demo for improved specificity in task description.

…by adding checks for TFDS data availability and ensuring valid trajectory data before file creation. Update camera serial mapping logic and adjust dataset loading to use 'droid_100'. Modify .gitignore to include 'droid_100' directory.

…ant text labels and enhancing the VLM prompt for clarity in calibration analysis.

… pipeline usage examples, including auto-scan and quick mode options for trajectory processing.

…cess DROID trajectory directories. Update README to reflect new usage instructions, including automatic ground truth generation and enhanced VLM processing capabilities.

- Updated `.gitignore` to include `eval_runs/`.
- Introduced `make_image_grid` function for creating tiled grid images from a list of RGB images.
- Enhanced `process_single_trajectory` to support different methods for passing frames to VLM: either as a stream or as a concatenated grid.
- Modified `VLMService` to analyze multiple images together with a single prompt.
- Updated command-line arguments to allow configuration of frame sampling and passing method.
- Improved documentation and comments for clarity on new functionalities.
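For readers unfamiliar with the grid approach, here is a minimal sketch of what a `make_image_grid`-style helper could look like. It is not the code from this PR: the signature, the `cols` and `pad_value` parameters, the square-ish default layout, and the use of NumPy arrays are all assumptions made for illustration.

```python
import math

import numpy as np


def make_image_grid(images, cols=None, pad_value=0):
    """Tile equally sized HxWx3 RGB arrays into one grid image, row by row.

    Hypothetical sketch: the parameters and layout rule are assumptions,
    not the signature used in this PR.
    """
    if not images:
        raise ValueError("images must be a non-empty list")
    h, w, c = images[0].shape
    if any(img.shape != (h, w, c) for img in images):
        raise ValueError("all images must share the same shape")

    n = len(images)
    # Default to a roughly square layout when no column count is given.
    cols = cols or math.ceil(math.sqrt(n))
    rows = math.ceil(n / cols)

    # Unused cells in the last row keep the padding colour.
    grid = np.full((rows * h, cols * w, c), pad_value, dtype=images[0].dtype)
    for i, img in enumerate(images):
        r, col = divmod(i, cols)
        grid[r * h:(r + 1) * h, col * w:(col + 1) * w] = img
    return grid


# Five tiny dummy frames, each filled with its own index value.
frames = [np.full((2, 3, 3), i, dtype=np.uint8) for i in range(5)]
grid = make_image_grid(frames)
```

With five frames the sketch picks a 3-column, 2-row layout and leaves the sixth cell as padding. Passing one grid instead of a stream of frames trades per-frame resolution for a single image input.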

…eters
- Removed `image_key` and `language_key` parameters from VLM processing functions as they are not applicable for DROID directories with MP4 files.
- Updated the pipeline to handle multiple trials for VLM evaluations, including saving per-trial metrics and aggregate results.
- Simplified the example usage in `simple_vlm_processing.py` to reflect the new input format and removed state visualization functionality.
- Enhanced documentation to clarify the new input requirements and processing methods.
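As a rough illustration of the multi-trial evaluation described above, the sketch below computes one accuracy value per trial and then a mean and standard deviation across trials. The function names, the choice of accuracy as the per-trial metric, and the shape of the result dictionary are assumptions; the PR does not state which aggregates are saved.

```python
from statistics import mean, pstdev


def accuracy(predictions, labels):
    """Fraction of predictions that match the ground-truth labels."""
    if not labels or len(predictions) != len(labels):
        raise ValueError("predictions and labels must be non-empty and equal length")
    return sum(p == y for p, y in zip(predictions, labels)) / len(labels)


def aggregate_trials(trial_predictions, labels):
    """Hypothetical helper: per-trial accuracy plus mean/std over all trials."""
    per_trial = [accuracy(preds, labels) for preds in trial_predictions]
    return {
        "per_trial_accuracy": per_trial,
        "mean_accuracy": mean(per_trial),
        # Population standard deviation, so a single trial yields 0.0.
        "std_accuracy": pstdev(per_trial),
    }


# Two trials of success/failure predictions over four trajectories.
labels = [True, False, True, True]
trials = [
    [True, False, True, True],
    [True, True, True, True],
]
summary = aggregate_trials(trials, labels)
```

Reporting the spread alongside the mean makes it easier to tell whether a difference between configurations exceeds the run-to-run variation of the VLM.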

KeplerC and others added 30 commits June 28, 2025 16:21

add metadata

4a8af81

attempt to fix tests

5af162a

agent + tools

722a1a6

demo tool

afe588c

update test cases

3583345

droid example starter

adaecd6

add tests and move example

5308310

format

1e1cdce

fix linting

51c8e7e

refactor dataset design

d5b215e

add parquet backend

0c0d026

comment out for now

15326f3

at least it runs

e78a921

vlm intiial code

a25e2d4

successfully classify

df82bb7

Refactor droid_vlm_demo to utilize VLADataset for trajectory processi…

074267b

…ng, implement parallel loading, and enhance filtering with Executor. Update VLM model to Qwen/Qwen2.5-VL-7B-Instruct across relevant components.

integrate into sequence

d625daa

update instruction

3a5b4ec

Enhance droid_vlm_demo by updating camera selection logic, adding VLM…

079be41

… input/output file handling, and creating a dedicated output directory for analysis results.

calculate f1 score

16e7774

debug

d325cf2

Update VLM model to Qwen/Qwen2.5-VL-7B-Instruct across all relevant c…

13b0e7c

…omponents, streamline droid_vlm_demo by removing unused methods and comments, and enhance dataset processing with lazy loading. Adjust filtering logic and improve output handling for better performance.

increate posssible trajectories

4e90dcb

update camera views

57a14f6

lerobot first attempt

05077b7

commit before working on other stuff

658d87b

vlm captioning

28deddd

additional fixes on the captioning results

69502fa

Refactor trajectory captioning metrics calculation to focus on accura…

e22e81c

…cy instead of F1 score. Update method names, print statements, and summary output accordingly for clarity and consistency.

Your Name added 20 commits July 10, 2025 06:16

Update .gitignore to include new directories for combined data and Hu…

8896032

…gging Face cache. Modify VLM prompt in droid_vlm_demo for improved specificity in task description.

make droid calibration working

4a01ffe

update to make benchmarks work

9185215

caption performance improvement

dca9b71

Refactor visualization in benchmark_calibration.py by removing redund…

4723d22

…ant text labels and enhancing the VLM prompt for clarity in calibration analysis.

quality score

c58bb07

t

b067f81

seems to work

017cc44

Remove deprecated DROID conversion scripts and update README with new…

5f3acf7

… pipeline usage examples, including auto-scan and quick mode options for trajectory processing.

Remove deprecated DROID scripts and refactor pipeline to directly pro…

3e3e82c

…cess DROID trajectory directories. Update README to reflect new usage instructions, including automatic ground truth generation and enhanced VLM processing capabilities.

d

06ffe9e

frame by frame

3c9890e

d

10b489a

d

bf481ba

dd

d2d5d5a

remove droid for adding them back

5edfc1a

minimal example

978ffed

KeplerC merged commit f60b3f9 into main Oct 11, 2025
1 of 4 checks passed
