v2.4.0
🔆 Release highlights
Import Hub datasets from the UI
import_hub_dataset.mp4
In this release, we’ve focused all of our efforts in bringing you a new feature to import datasets from the Hugging Face Hub directly within our UI, making it easier and faster to get started with your AI projects.
To get started, click on the “Import dataset from Hugging Face” button and paste the repo id of the dataset you want to use. Argilla will process the columns of the dataset and map them to Fields or Questions. Then, you can add more questions or remove any unnecessary fields by selecting the “No mapping” options. All the changes you make will be automatically reflected in the preview.
Once you’re happy with the result you simply need to provide a name for your dataset, select a workspace and (if applicable) a split. Then, Argilla will start importing the dataset.
Note
If your dataset is bigger than 10k records, at this stage Argilla will only import the first 10k. You can import the rest of the dataset using the Argilla SDK: simply click on the “Import data” button in the dataset and use the code snippet provided.
If you want to make extra changes, like customizing the titles of your fields and questions, don’t worry, you can always go back to the Dataset Settings page after the dataset has been created.
Learn more about this new feature in our docs.
Deploy an Argilla Space directly from the SDK
If you're working from the SDK and don't want to leave to start your Argilla server, you can start an Argilla deployment on Spaces with a simple line of code:
import argilla as rg
client = rg.Argilla.deploy_on_spaces(api_key="12345678")
Learn more in our docs.
Changelog v2.4.0
- Enhancement/improve-error-messaging-for-role-forbidden by @burtenshaw in #5554
- refactor: add
DatasetPublishValidator
class by @jfcalvo in #5568 - feat: set CREATOR_USER_ID to avoid difficulties with creation in orga… by @davidberenstein1957 in #5556
- [Refactor] remove name validations for dataset workspaces and usernames by @frascuchon in #5575
- fix: SPACES_CREATOR_USER_ID -> SPACE_CREATOR_USER_ID by @davidberenstein1957 in #5590
- [FIX] Prevent duplicated field text by @leiyre in #5592
- feat: Add basic support to bool features by @frascuchon in #5576
- feat: Add support to other than str values for terms metadata properties by @frascuchon in #5594
- [BUGFIX] argilla server: parse fields for record schemas by @frascuchon in #5600
- correct phrase on docs: "a recod question" -> "a question" by @HeAndres in #5599
- docs: update filter_dataset.md by @eltociear in #5571
- feat: 5108 feature add method to deploy on spaces through huggingface hub by @davidberenstein1957 in #5547
- docs: add quickstart update for deploy on spaces by @davidberenstein1957 in #5550
- Typo: missing comma by @ACMCMC in #5565
- Typo fix by @ACMCMC in #5566
- Fix typo by @ACMCMC in #5567
- [REFACTOR] argilla server: moving all record validators by @frascuchon in #5603
- [BUGFIX] argilla server: Prevent convert
ChatFieldValue
objects by @frascuchon in #5605 - Introducing Argilla Guru on Gurubase.io by @kursataktas in #5608
- [PERF][IMPROVEMENT] argilla server: improve computation for dataset progress and metrics by @frascuchon in #5618
- [PERF] argilla server: Reduce general transaction time by @frascuchon in #5609
- fix: Prevent compute metrics for draft datasets by @frascuchon in #5624
- Refine German translations and update non-localized UI elements by @paulbauriegel in #5632
- [BUGFIX] Catch None in image feature columns by @burtenshaw in #5626
- feat: added support for
with_vectors
with query filter in sdk by @bharath97-git in #5638 - perf: Using search engine to compute the total number of records for user metrics by @frascuchon in #5641
- [IMPROVEMENT] feat(helm): add support for default storage class in PVCs by @dme86 in #5628
- Feature - Improve Accessibility for Screenreaders by @paulbauriegel in #5634
- [FEATURE-BRANCH] Argilla direct import from Hub by @jfcalvo in #5572
- fix: remove unnecesary exposed ports for Argilla Docker compose file by @jfcalvo in #5644
- Dataset creation feature final QA by @leiyre in #5646
- [CI] argilla frontend: Remove invalid workflow permissions by @frascuchon in #5647
- [CI] Configure workflow permissions by @frascuchon in #5648
- chore: update changelogs for release
2.4.0
by @jfcalvo in #5650 - chore: small improvement installing dependencies for HF Spaces Dockerfile by @jfcalvo in #5651
- fix: skip
helmlint
pre-commit hook on CI becausehelm
command is not available by @jfcalvo in #5654 - Import from hub docs by @nataliaElv in #5631
- [RELEASE] 2.4.0 by @frascuchon in #5643
New Contributors
- @HeAndres made their first contribution in #5599
- @ACMCMC made their first contribution in #5565
- @kursataktas made their first contribution in #5608
- @bharath97-git made their first contribution in #5638
- @dme86 made their first contribution in #5628
Full Changelog: v2.3.1...v2.4.0