Support Torch Cpp Runtime for HSTU by geoffreyQiu · Pull Request #294 · NVIDIA/recsys-examples

geoffreyQiu · 2026-02-05T04:53:56Z

Description

Support Torch Cpp Runtime for HSTU

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

shijieliu · 2026-02-11T02:11:42Z

examples/hstu/modules/inference_embedding.py

+        for idx, embc in enumerate(embc_modulelist):
+            emb_backend = embc_configs[idx].backend
+            create_embedding_collection = select_embedding(emb_backend)
+            embc_modulelist[idx] = create_embedding_collection(


do we need to uset setattr here? like reference

shijieliu · 2026-02-11T02:14:08Z

the overall arch looks good to me. I think you can create a inference directory under dynamicemb and move the inference embedding related code under it.

shijieliu · 2026-02-11T02:15:37Z

examples/hstu/modules/inference_embedding.py

-        embedding_configs,
-        embedding_backend,
-        sparse_shareables,
+def create_torchrec_embedding(


maybe we dont need to support this.

shijieliu · 2026-02-11T02:16:18Z

examples/hstu/modules/inference_embedding.py

-            )
-            model_state_dict = torch.load(model_state_dict_path)["model_state_dict"]
-        self.load_state_dict(model_state_dict, strict=False)
+        pass


for load dump, we can reuse DynamicEmbLoad/DynamicEmbDump for dynamicemb checkpoint

Init apply_inference_embedding implementation

fd46fad

geoffreyQiu changed the title ~~Init apply_inference_embedding implementation~~ Support Torch Cpp Runtime for HSTU Feb 5, 2026

geoffreyQiu mentioned this pull request Feb 5, 2026

[FEA] HSTU Inference in Torch Cpp Runtime #221

Open

Implement apply_inference_embedding based on simple wrapper

1518d9c

shijieliu reviewed Feb 11, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support Torch Cpp Runtime for HSTU#294

Support Torch Cpp Runtime for HSTU#294
geoffreyQiu wants to merge 2 commits intoNVIDIA:mainfrom
geoffreyQiu:cpp_runtime

geoffreyQiu commented Feb 5, 2026

Uh oh!

shijieliu Feb 11, 2026

Uh oh!

shijieliu commented Feb 11, 2026

Uh oh!

shijieliu Feb 11, 2026

Uh oh!

shijieliu Feb 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

geoffreyQiu commented Feb 5, 2026

Description

Checklist

Uh oh!

shijieliu Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

shijieliu commented Feb 11, 2026

Uh oh!

shijieliu Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

shijieliu Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants