LitSearchRetrieval added #1150

Merged
merged 10 commits into from
Aug 13, 2024
dokato
Collaborator

@dokato dokato commented Aug 11, 2024

As asked in #1129.

Additionally, I fixed one error in SlovakSum dataset.

Checklist

  • Run tests locally to make sure nothing is broken using make test.
  • Run the formatter to format the code using make lint.

Adding datasets checklist

Reason for dataset addition: ...

  • I have run the following models on the task (adding the results to the pr). These can be run using the mteb -m {model_name} -t {task_name} command.
    • sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
    • intfloat/multilingual-e5-small
  • I have checked that the performance is neither trivial (both models gain close to perfect scores) nor random (both models gain close to random scores).
  • If the dataset is too big (e.g. >2048 examples), consider using self.stratified_subsampling() under dataset_transform()
  • I have filled out the metadata object in the dataset file (find documentation on it here).
  • Run tests locally to make sure nothing is broken using make test.
  • Run the formatter to format the code using make lint.
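
For reference, a minimal sketch of how the two checklist models could be queued against the new task with the `mteb -m {model_name} -t {task_name}` command quoted above. The task name `LitSearchRetrieval` is assumed from the PR title and file name, and the helper names `build_mteb_command` and `render_commands` are hypothetical, not part of mteb. The script only prints the commands; it does not run them.

```python
from __future__ import annotations

import shlex

# Models listed in the dataset checklist of this PR.
MODELS = [
    "sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2",
    "intfloat/multilingual-e5-small",
]

# Assumption: the registered task name matches the PR title / file name.
TASK = "LitSearchRetrieval"


def build_mteb_command(model_name: str, task_name: str) -> list[str]:
    """Return the argv list for `mteb -m {model_name} -t {task_name}`."""
    return ["mteb", "-m", model_name, "-t", task_name]


def render_commands(models: list[str], task_name: str) -> list[str]:
    """Render one shell-safe command line per model."""
    return [shlex.join(build_mteb_command(m, task_name)) for m in models]


if __name__ == "__main__":
    # Print the commands so they can be reviewed before running them by hand.
    for line in render_commands(MODELS, TASK):
        print(line)
```

Each argv list can also be passed to `subprocess.run(..., check=True)` if the runs should be launched from the script. That step is left out here because it downloads models and data.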

mteb/tasks/Retrieval/eng/LitSearchRetrieval.py (2 review comments, outdated and resolved)
@dokato dokato merged commit a6458aa into embeddings-benchmark:main Aug 13, 2024
9 checks passed
@dokato dokato deleted the litsearch branch August 13, 2024 19:51
isaac-chung added a commit that referenced this pull request Aug 14, 2024
* fix: when create meta merge results with existing README.md (#1117)
* init version
* merge scores
* add test without frontmatter
* lint
* 1.12.91
Automatically generated by python-semantic-release
* fix: IWSLT2017BitextMining loading dataset. (#1132)
fix: the way of loading and transform dataset and typo of filename on IWSLT2017BitextMining
* 1.12.92
Automatically generated by python-semantic-release
* fix: Allow more linient TaskMetadata (#1131)
* fix: Allow more linient TaskMetadata
This fix allows for more lenient TaskMetadata when using the package, while maintaining validation for the package within the tests.
This is intended to ensure flexibility and ease of use while maintaining high-quality documentation of tasks.
* Added fixes based on corrections
* Update points table
* 1.12.93
Automatically generated by python-semantic-release
* Added pyupgrade (using ruff) to lint CI (#1137)
* Added pyupgrade (using ruff) to lint CI
* Added FA ruleset
* Added C4 ruleset for simplifying comprehensions
* Added a few exceptions for points
Co-authored-by: bryant1410 <[email protected]>
* Added required import
* Add co-author
Co-authored-by: bryant1410 <[email protected]>
* format
---------
Co-authored-by: bryant1410 <[email protected]>
Co-authored-by: bryant1410 <[email protected]>
* fix: add CoIR tasks (#1130)
* add CoIR tasks
* change to mteb datasets
* Update points table
* Update tasks table
* 1.12.94
Automatically generated by python-semantic-release
* feat: Added in functionality to allow loading outdated results (#1141)
* 1.13.0
Automatically generated by python-semantic-release
* fix: add CoIR as Benchmark (#1142)
* add CoIR as Benchmark
* lint
* 1.13.1
Automatically generated by python-semantic-release
* Simplify models (#1118)
* Merge
* Adapt
* Simplify
* Check for rev again
* Rmv cmmnt
* Simplify
* simplify
* Rmv comment
Co-authored-by: Kenneth Enevoldsen <[email protected]>
* Use logging; change try except; add info
* Lint
* Rmv results
* Update rev
* Simplify models; Allow instructions
* Jobs
* Fix merge
* Format
* Adapt models
* Fix task types
* Update
* Fix syntax
* Simplify
* Add comparison
* Format
* Fix double comment
Co-authored-by: Kenneth Enevoldsen <[email protected]>
* Move example
* Format
* Rmv outdated instructions
---------
Co-authored-by: Kenneth Enevoldsen <[email protected]>
* Missing import for SadeemQuestionRetrieval (#1146)
Fixes #1143
* fix: Remove unused tests (#1148)
These tests are no longer used since results moved to the results repo.
* Update tasks table
* 1.13.2
Automatically generated by python-semantic-release
* feat: Cleaned up scripts folder  (#1144)
* fix: Move benchmarks from script to benchmarks.py
- A few of the benchmarks weren't implemented in the benchmark list. The best choice seems to be to move them to benchmarks.py and remove the scripts (there were already a few issues with them), so that they are maintained with the rest.
- Removed trivial (e.g. running one task) or outdated examples. Removing them ensures that we have less to maintain.
- Moved mteb-specific scripts into an mteb folder.
* format
* fix: Convert arbitrary imports to absolute imports. (#1145)
This avoids loading in classes as modules (see PR ...)
* restructure scripts folder
* 1.14.0
Automatically generated by python-semantic-release
* Fix: llm2vec for retrieval (#1152)
* fix: remove  kwargs LLM2Vec doesn't have
* fix: linter error
* fix: an item automatically added by linter
* fix: removed test set for AFQMC with no gold labels (#1153)
The test set of AFQMC is hidden, so it has been removed from the default suite. This affects C-MTEB. Who is the best person to contact about this? (@Muennighoff)
* 1.14.1
Automatically generated by python-semantic-release
* LitSearchRetrieval added (#1150)
* LitSearchRetrieval added
* points added
* fixed descriptive_stats in LitSearchRetrieval
* fix lint
* metadata fixed for LitSearchRetrieval
* metadata fixed for LitSearchRetrieval again
* metadata fixed for LitSearchRetrieval again
* points 1150 updated
* metadata fixed for LitSearchRetrieval again
* lint fix
* Update points table
* Update tasks table
* ruff check fixed
* make lint
* fix imports
* make lint
---------
Co-authored-by: Roman Solomatin <[email protected]>
Co-authored-by: github-actions <[email protected]>
Co-authored-by: Hiroki Iida <[email protected]>
Co-authored-by: Kenneth Enevoldsen <[email protected]>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Roman Solomatin <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]>
Co-authored-by: dokato <[email protected]>