Commit b261b23

Merge pull request #1100 from argilla-io/develop
`1.5.0`
2 parents 27b5db2 + dd6eda5 commit b261b23

305 files changed, +13958 -2500 lines changed

@@ -0,0 +1,26 @@
+name: "\U0001F4DA Add a documentation report"
+description: "Have you spotted a typo or mistake in our docs?"
+title: "[DOCS]"
+labels: ["documentation"]
+assignees: []
+
+body:
+  - type: markdown
+    attributes:
+      value: "Thank you for reporting a documentation mistake! Before you get started, please [search to see](https://github.com/argilla-io/distilabel/issues) if an issue already exists for the bug you encountered."
+
+  - type: textarea
+    id: doc_report
+    attributes:
+      label: "Which page or section is this issue related to?"
+      description: "Please include the URL and/or source."
+    validations:
+      required: false
+
+  - type: textarea
+    id: doc_review
+    attributes:
+      label: "What are you documenting, or what change are you making in the documentation?"
+      description: "If a documentation needs to be created, please specify its coverage.\n If there's a typo or something needs revisiting, please indicate it and show code/text/screenshots."
+    validations:
+      required: false
+70
@@ -0,0 +1,70 @@
+name: "\U0001FAB2 Bug report"
+description: "Report bugs and unexpected behavior."
+title: "[BUG]"
+labels: ["bug", "ml-internal"]
+assignees: []
+
+body:
+  - type: markdown
+    attributes:
+      value: "Thank you for reporting a bug! Before you get started, please [search to see](https://github.com/argilla-io/distilabel/issues) if an issue already exists for the bug you encountered."
+
+  - type: textarea
+    id: bug_description
+    attributes:
+      label: "Describe the bug"
+      description: "A clear and concise description of the bug."
+    validations:
+      required: true
+
+  - type: textarea
+    id: stacktrace
+    attributes:
+      label: "To reproduce"
+      description: "The code to reproduce the behavior."
+      placeholder: |
+        ```python
+        my_python_code
+        ```
+    validations:
+      required: false
+
+  - type: textarea
+    id: expected_behavior
+    attributes:
+      label: "Expected behavior"
+      description: "A clear and concise description of what you expected to happen."
+    validations:
+      required: false
+
+  - type: textarea
+    id: screenshots
+    attributes:
+      label: "Screenshots"
+      description: "If applicable, add screenshots to help explain your problem."
+    validations:
+      required: false
+
+  - type: textarea
+    id: environment
+    attributes:
+      label: "Environment"
+      description: "Since version 1.16.0 you can use `python -m argilla info` command to easily get the used versions."
+      value: |
+        - Distilabel Version [e.g. 1.0.0]:
+        - Python Version [e.g. 3.11]:
+    validations:
+      required: false
+
+  - type: textarea
+    id: additional_context
+    attributes:
+      label: "Additional context"
+      description: "Add any other relevant information."
+    validations:
+      required: false
+
+  - type: markdown
+    attributes:
+      value: |
+        📌 Make sure you have provided all the required information in each section so we can support you properly.
@@ -0,0 +1,44 @@
+name: "\U0001F195 Feature request"
+description: "Share cool new ideas for the project."
+title: "[FEATURE]"
+labels: ["enhancement", "ml-internal"]
+assignees: []
+
+
+body:
+  - type: markdown
+    attributes:
+      value: "Thank you for sharing your feature request! Please fill out the sections below."
+
+  - type: textarea
+    id: feature_request
+    attributes:
+      label: "Is your feature request related to a problem? Please describe."
+      description: "A clear and concise description of what the problem is."
+      placeholder: "I'm always frustrated when..."
+    validations:
+      required: false
+
+  - type: textarea
+    id: feature_description
+    attributes:
+      label: "Describe the solution you'd like"
+      description: "A clear and concise description of what you want to happen."
+    validations:
+      required: false
+
+  - type: textarea
+    id: feature_alternatives
+    attributes:
+      label: "Describe alternatives you've considered"
+      description: "A clear and concise description of any alternative solutions or features you've considered."
+    validations:
+      required: false
+
+  - type: textarea
+    id: additional_context
+    attributes:
+      label: "Additional context"
+      description: "Add any other context or screenshots about the feature request here."
+    validations:
+      required: false

.github/ISSUE_TEMPLATE/config.yml

+5
@@ -0,0 +1,5 @@
+blank_issues_enabled: false
+contact_links:
+  - name: 🗯 Community Discussions
+    url: http://hf.co/join/discord
+    about: Our Discord Community loves to discuss distilabel and NLP topics

.github/ISSUE_TEMPLATE/🆕-feature-request.md

-20
This file was deleted.

.github/ISSUE_TEMPLATE/🐛-bug-report.md

-30
This file was deleted.

.github/ISSUE_TEMPLATE/📚-documentation-update.md

-16
This file was deleted.

.github/workflows/test.yml

+6
@@ -50,6 +50,12 @@ jobs:
         if: steps.cache.outputs.cache-hit != 'true'
         run: ./scripts/install_dependencies.sh

+      - name: Setup tmate session
+        uses: mxschmitt/action-tmate@v3
+        if: ${{ matrix.python-version == '3.12' && github.event_name == 'workflow_dispatch' && inputs.tmate_session }}
+        with:
+          limit-access-to-actor: true
+
       - name: Lint
         run: make lint
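The `if:` expression on the new tmate step combines three checks, so the debug session only starts for one narrow case. A minimal Python sketch of the same gate follows. The function and its parameter names are hypothetical and only mirror the workflow contexts `matrix.python-version`, `github.event_name`, and `inputs.tmate_session`. The hunk does not show where `tmate_session` is declared, so treating it as a boolean `workflow_dispatch` input is an assumption.

```python
def should_start_tmate(python_version: str, event_name: str, tmate_session: bool) -> bool:
    """Mirror of the step's `if:` expression (hypothetical helper, not part of the repo)."""
    return (
        python_version == "3.12"
        and event_name == "workflow_dispatch"
        and bool(tmate_session)
    )


# Only a manually dispatched run on the 3.12 matrix entry with the flag set qualifies.
print(should_start_tmate("3.12", "workflow_dispatch", True))
print(should_start_tmate("3.12", "push", True))
print(should_start_tmate("3.11", "workflow_dispatch", True))
```

Under these assumptions, pushes and pull requests never open a session, and a manual run opens one only on the 3.12 job.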

.gitignore

+1-1
@@ -77,4 +77,4 @@ venv.bak/
 # Other
 *.log
 *.swp
-.DS_Store
+.DS_Store

.pre-commit-config.yaml

+3-3
@@ -1,6 +1,6 @@
 repos:
   - repo: https://github.com/Lucas-C/pre-commit-hooks
-    rev: v1.5.4
+    rev: v1.5.5
     hooks:
       - id: insert-license
         name: "Insert license header in Python source files"
@@ -10,8 +10,8 @@ repos:
           - LICENSE_HEADER
           - --fuzzy-match-generates-todo

-  - repo: https://github.com/charliermarsh/ruff-pre-commit
-    rev: v0.6.2
+  - repo: https://github.com/astral-sh/ruff-pre-commit
+    rev: v0.8.1
     hooks:
       - id: ruff
         args: [--fix]

README.md

+2-1
@@ -94,6 +94,7 @@ In addition, the following extras are available:
 - `vertexai`: for using [Google Vertex AI](https://cloud.google.com/vertex-ai) proprietary models via the `VertexAILLM` integration.
 - `vllm`: for using [vllm](https://github.com/vllm-project/vllm) serving engine via the `vLLM` integration.
 - `sentence-transformers`: for generating sentence embeddings using [sentence-transformers](https://github.com/UKPLab/sentence-transformers).
+- `mlx`: for using [MLX](https://github.com/ml-explore/mlx) models via the `MlxLLM` integration.

 ### Structured generation

@@ -118,7 +119,7 @@ pip install "distilabel[hf-inference-endpoints]" --upgrade
 Then run:

 ```python
-from distilabel.llms import InferenceEndpointsLLM
+from distilabel.models import InferenceEndpointsLLM
 from distilabel.pipeline import Pipeline
 from distilabel.steps import LoadDataFromHub
 from distilabel.steps.tasks import TextGeneration
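
The README hunk moves the import from `distilabel.llms` to `distilabel.models`. For code that has to run against releases on both sides of this change, one option is to try the new path first and fall back to the old one. The helper below is a hypothetical sketch using only the standard library, not something the commit provides. The diff does not show whether `distilabel.llms` remains importable after this change, so the fallback order is an assumption.

```python
import importlib
from typing import Any, Sequence


def import_first(module_names: Sequence[str], attr: str) -> Any:
    """Return `attr` from the first importable module that defines it."""
    for name in module_names:
        try:
            module = importlib.import_module(name)
        except ImportError:
            continue  # this module path does not exist in the current install
        if hasattr(module, attr):
            return getattr(module, attr)
    raise ImportError(f"{attr!r} not found in any of {list(module_names)}")


# Usage (requires distilabel to be installed): prefer the new path, fall back to the old.
# InferenceEndpointsLLM = import_first(
#     ["distilabel.models", "distilabel.llms"], "InferenceEndpointsLLM"
# )
```

Pinning a single distilabel version and using the plain `from distilabel.models import ...` line is simpler when supporting older releases is not a requirement.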

docs/api/embedding/embedding_gallery.md

-8
This file was deleted.

docs/api/llm/index.md

-7
This file was deleted.

docs/api/llm/llm_gallery.md

-10
This file was deleted.
@@ -0,0 +1,8 @@
+# Embedding Gallery
+
+This section contains the existing [`Embeddings`][distilabel.models.embeddings] subclasses implemented in `distilabel`.
+
+::: distilabel.models.embeddings
+    options:
+        filters:
+        - "!^Embeddings$"

docs/api/embedding/index.md → docs/api/models/embedding/index.md

+1-1
@@ -4,4 +4,4 @@ This section contains the API reference for the `distilabel` embeddings.

 For more information on how the [`Embeddings`][distilabel.steps.tasks.Task] works and see some examples.

-::: distilabel.embeddings.base
+::: distilabel.models.embeddings.base
@@ -0,0 +1,10 @@
+# ImageGenerationModel Gallery
+
+This section contains the existing [`ImageGenerationModel`][distilabel.models.image_generation] subclasses implemented in `distilabel`.
+
+::: distilabel.models.image_generation
+    options:
+        filters:
+        - "!^ImageGenerationModel$"
+        - "!^AsyngImageGenerationModel$"
+        - "!typing"
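
The `filters` entries in these gallery pages are regular expressions, and a leading `!` marks an exclusion. The sketch below approximates how such a list decides whether a name is documented: the last matching filter wins, and an unmatched name is kept unless the list contains only include rules. This is a simplified model written for illustration, not the mkdocstrings implementation, and `keep_object` is a hypothetical name.

```python
import re
from typing import Optional, Sequence


def keep_object(name: str, filters: Sequence[str]) -> bool:
    """Approximate mkdocstrings-style filtering: the last matching filter wins."""
    keep: Optional[bool] = None
    saw_include = False
    saw_exclude = False
    for raw in filters:
        exclude = raw.startswith("!")
        pattern = raw[1:] if exclude else raw
        if exclude:
            saw_exclude = True
        else:
            saw_include = True
        if re.search(pattern, name):
            keep = not exclude
    if keep is None:
        # No filter matched: reject only when every rule was an include rule.
        return not (saw_include and not saw_exclude)
    return keep


filters = ["!^ImageGenerationModel$", "!^AsyngImageGenerationModel$", "!typing"]
for name in ["ImageGenerationModel", "AsyncImageGenerationModel", "typing"]:
    print(name, keep_object(name, filters))
```

Under this model the second filter, spelled `Asyng`, does not match `AsyncImageGenerationModel`, so that name is not excluded from the gallery. Whether that matters depends on what the module actually exports.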
@@ -0,0 +1,7 @@
+# ImageGenerationModel
+
+This section contains the API reference for the `distilabel` image generation models, both for the [`ImageGenerationModel`][distilabel.models.image_generation.ImageGenerationModel] synchronous implementation, and for the [`AsyncImageGenerationModel`][distilabel.models.image_generation.AsyncImageGenerationModel] asynchronous one.
+
+For more information and examples on how to use existing LLMs or create custom ones, please refer to [Tutorial - ImageGenerationModel](../../../sections/how_to_guides/basic/task/image_task.md).
+
+::: distilabel.models.image_generation.base

docs/api/models/llm/index.md

+7
@@ -0,0 +1,7 @@
+# LLM
+
+This section contains the API reference for the `distilabel` LLMs, both for the [`LLM`][distilabel.models.llms.LLM] synchronous implementation, and for the [`AsyncLLM`][distilabel.models.llms.AsyncLLM] asynchronous one.
+
+For more information and examples on how to use existing LLMs or create custom ones, please refer to [Tutorial - LLM](../../../sections/how_to_guides/basic/llm/index.md).
+
+::: distilabel.models.llms.base

docs/api/models/llm/llm_gallery.md

+10
@@ -0,0 +1,10 @@
+# LLM Gallery
+
+This section contains the existing [`LLM`][distilabel.models.llms] subclasses implemented in `distilabel`.
+
+::: distilabel.models.llms
+    options:
+        filters:
+        - "!^LLM$"
+        - "!^AsyncLLM$"
+        - "!typing"

docs/api/pipeline/typing.md

-3
This file was deleted.

docs/api/step/typing.md

-3
This file was deleted.

docs/api/task/image_task.md

+7
@@ -0,0 +1,7 @@
+# ImageTask
+
+This section contains the API reference for the `distilabel` image generation tasks.
+
+For more information on how the [`ImageTask`][distilabel.steps.tasks.ImageTask] works and see some examples, check the [Tutorial - Task - ImageTask](../../sections/how_to_guides/basic/task/generator_task.md) page.
+
+::: distilabel.steps.tasks.base.ImageTask

docs/api/task/task_gallery.md

+1
@@ -8,5 +8,6 @@ This section contains the existing [`Task`][distilabel.steps.tasks.Task] subclas
         - "!Task"
         - "!_Task"
         - "!GeneratorTask"
+        - "!ImageTask"
         - "!ChatType"
         - "!typing"

docs/api/task/typing.md

-3
This file was deleted.
