Release inference code for OpenBEATs by Shikhar-S · Pull Request #32 · wavlab-speech/versa

Shikhar-S · 2025-05-24T23:12:56Z

Two new features in VERSA:

Sound class prediction from a fine-tuned OpenBEATs checkpoint.
Sound embedding extraction from pre-trained or fine-tuned checkpoint.

ftshijt · 2025-05-27T07:10:55Z

Thanks a lot for the great contribution. May I ask how you plan to store the embedding space in versa?

Shikhar-S · 2025-06-16T01:58:23Z

@ftshijt As discussed, added changes for

Storing embeddings as npy files
Similarity computation with a reference audio
Class prediction to output class names and log probabilty.

ftshijt · 2025-06-16T04:20:10Z

Thanks a lot. I think the interface is good to go. Before further steps, I would like to check the following items with you first:

I'm okay with directly putting the architecture in versa, but it might be easier if we simply utilize it from espnet since we do have the dependency there? Mostly, I want to make it connect to your future versions (if any), which might reduce your effort to push the updated code twice. (It's up to you~)
We definitely want to keep the options of using local checkpoints, but at the same time, do you mind smoothing the usage with automatic download of the model (e.g., from huggingface etc.)?
For the class prediction cases, it would be super helpful if you could provide more examples with different downstream tasks~

Shikhar Bharadwaj added 3 commits May 24, 2025 18:02

add openbeats implementation

be47101

register inference functions

7b3c658

add doc

f64f19b

Shikhar Bharadwaj added 2 commits June 15, 2025 18:18

add metric to compute similarity

725f926

add class prediction capability

79739e9

capability to download ckpts, important ckpt list

3edd7f2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release inference code for OpenBEATs#32

Release inference code for OpenBEATs#32
Shikhar-S wants to merge 6 commits intowavlab-speech:mainfrom
Shikhar-S:main

Shikhar-S commented May 24, 2025

Uh oh!

ftshijt commented May 27, 2025

Uh oh!

Shikhar-S commented Jun 16, 2025

Uh oh!

ftshijt commented Jun 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Shikhar-S commented May 24, 2025

Uh oh!

ftshijt commented May 27, 2025

Uh oh!

Shikhar-S commented Jun 16, 2025

Uh oh!

ftshijt commented Jun 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants