Adds support for Stella_en_v5 embedding model - 1.5B variant #2551

AnubhabB · 2024-10-08T07:26:16Z

Stella_en_1.5B_v5 is a top ranking model (top 5) in Retrieval and Reranking tasks as of 8th October 2024.

Model Card

This PR adds support for the model along with some examples.

Tasks:

Check license: Model is licensed MIT
Check numerical accuracy - Verified against source pytorch implementation. At f32 candle implementation yields mathematically equivalent up to a 3-decimal places rounded .. beyond which we observe some variations. Code used to verify numeric equivalence can be found here.
Authors example from the model card added and reproduced.

Note:

Stella_400M seems to follow a significantly different implementation. Have not implemented it in this PR.

Closes #2525

…d would be documented in the readme

LaurentMazare · 2024-10-13T21:09:19Z

Thanks!

…face#2551) * Stella_en_1.5B_v5 * Separated creation. This is a critical step for numerical accuracy and would be documented in the readme * EmbedDim would require clone and copy * WIP: example * Examples added * a litte more in README

AnubhabB added 7 commits October 7, 2024 17:54

Stella_en_1.5B_v5

cae5915

Merge branch 'main' of github.com:AnubhabB/candle into stella

28c953f

Separated creation. This is a critical step for numerical accuracy an…

18edaeb

…d would be documented in the readme

EmbedDim would require clone and copy

4155377

WIP: example

4a33e8a

Examples added

f5b5d55

a litte more in README

d74695a

LaurentMazare approved these changes Oct 13, 2024

View reviewed changes

LaurentMazare merged commit f553ab5 into huggingface:main Oct 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds support for Stella_en_v5 embedding model - 1.5B variant #2551

Adds support for Stella_en_v5 embedding model - 1.5B variant #2551

AnubhabB commented Oct 8, 2024 •

edited

Loading

LaurentMazare commented Oct 13, 2024

Adds support for Stella_en_v5 embedding model - 1.5B variant #2551

Adds support for Stella_en_v5 embedding model - 1.5B variant #2551

Conversation

AnubhabB commented Oct 8, 2024 • edited Loading

Tasks:

Note:

LaurentMazare commented Oct 13, 2024

AnubhabB commented Oct 8, 2024 •

edited

Loading