Add Audio (Multi Label) Classification Abstask, Baseline Audio model, FSD50k Dataset and Task #2082

anime-sh · 2025-02-17T06:35:35Z

Implements #2071 #2066 #2070 #2056

Code Quality

Code Formatted: Format the code using make lint to maintain consistent style.

Documentation

Updated Documentation: Add or update documentation to reflect the changes introduced in this PR.

Testing

New Tests Added: Write tests to cover new functionality. Validate with make test-with-coverage.
Tests Passed: Run tests locally using make test or make test-with-coverage to ensure no existing functionality is broken.

Adding datasets checklist

Reason for dataset addition: ...

I have run the following models on the task (adding the results to the pr). These can be run using the mteb -m {model_name} -t {task_name} command.
- sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
- intfloat/multilingual-e5-small
I have checked that the performance is neither trivial (both models gain close to perfect scores) nor random (both models gain close to random scores).
If the dataset is too big (e.g. >2048 examples), considering using self.stratified_subsampling() under dataset_transform()
I have filled out the metadata object in the dataset file (find documentation on it here).
Run tests locally to make sure nothing is broken using make test.
Run the formatter to format the code using make lint.

Adding a model checklist

I have filled out the ModelMeta object to the extent possible
I have ensured that my model can be loaded using
- mteb.get_model(model_name, revision) and
- mteb.get_model_meta(model_name, revision)
I have tested the implementation works on a representative set of tasks.

Co-authored-by: rahulschand <rahulsc@stanford.edu>

anime-sh changed the title ~~Init MAEB [WIP]~~ [WIP] Init MAEB Feb 17, 2025

anime-sh marked this pull request as draft February 17, 2025 06:39

This was linked to issues Feb 18, 2025

Define an encoder interface for audio #2071

Open

Create audio classification AbsTask and Evaluator #2066

Open

Create multilabel audio classification AbsTask and Evaluator #2070

Open

isaac-chung mentioned this pull request Feb 19, 2025

Create audio clustering AbsTask and Evaluator #2093

Open

anime-sh and others added 7 commits February 20, 2025 19:00

init audio

c5744bf

some encoder related changes

64ccf50

some more abs task defs

1a744c0

Co-authored-by: rahulschand <rahulsc@stanford.edu>

evaluators and classification

c26ebae

remove rahul changes to generate first PR

1289d9b

make lint

bb2b4d0

add dataset/tasks skeleton

705664e

anime-sh force-pushed the maeb branch from bb73ae8 to 705664e Compare February 21, 2025 03:06

anime-sh and others added 8 commits February 20, 2025 19:17

readd changes lost in rebase

07eda3c

add fsd50k

ebae179

add task categories for audio

d51c5d1

slight updates to fsd50k

e3b89fa

make lint

849323c

wav2vec2 model

395b833

add fsd50k metadata

efd7095

rename folder

f97f9a3

anime-sh changed the title ~~[WIP] Init MAEB~~ Add Audio (Multi Label) Classification Abstask, Baseline Audio model, FSD50k Dataset and Task Feb 21, 2025

anime-sh marked this pull request as ready for review February 21, 2025 03:48

silky1708 and others added 2 commits February 20, 2025 19:48

add metric

6d61f3a

add torchaudio in req

fa61ea6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Audio (Multi Label) Classification Abstask, Baseline Audio model, FSD50k Dataset and Task #2082

Add Audio (Multi Label) Classification Abstask, Baseline Audio model, FSD50k Dataset and Task #2082

anime-sh commented Feb 17, 2025 •

edited

Loading

Add Audio (Multi Label) Classification Abstask, Baseline Audio model, FSD50k Dataset and Task #2082

Are you sure you want to change the base?

Add Audio (Multi Label) Classification Abstask, Baseline Audio model, FSD50k Dataset and Task #2082

Conversation

anime-sh commented Feb 17, 2025 • edited Loading

Code Quality

Documentation

Testing

Adding datasets checklist

Adding a model checklist

anime-sh commented Feb 17, 2025 •

edited

Loading