[FEATURE] Create a direct way of passing a dataset as part of a pipeline #779

Closed
@plaguss

Is your feature request related to a problem? Please describe.
Some users want to process a dataset on their own and then use it directly in a pipeline, without first writing it to disk or uploading it to the Hugging Face Hub and downloading it again afterwards. We should offer this option.

Describe the solution you'd like
A simple way of passing an already loaded dataset, for example a datasets.Dataset or a pandas.DataFrame, to a Pipeline.
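
To make the request concrete, here is a minimal sketch of the kind of normalisation such a feature might need: turning an in-memory object into rows and handing them out in batches. The helper names `to_rows` and `iter_batches` are hypothetical and are not part of the library's API; the sketch only assumes that datasets.Dataset exposes `to_list()` and that pandas.DataFrame supports `to_dict(orient="records")`.

```python
from typing import Any, Dict, Iterator, List

Row = Dict[str, Any]


def to_rows(data: Any) -> List[Row]:
    """Hypothetical helper: normalise an in-memory dataset into a list of row dicts."""
    if isinstance(data, list):
        # Already a list of row dictionaries; copy each row defensively.
        return [dict(row) for row in data]
    if hasattr(data, "to_list"):
        # Assumption: a datasets.Dataset, which exposes to_list().
        return data.to_list()
    if hasattr(data, "to_dict"):
        # Assumption: a pandas.DataFrame, converted to one dict per row.
        return data.to_dict(orient="records")
    raise TypeError(f"Unsupported dataset type: {type(data).__name__}")


def iter_batches(rows: List[Row], batch_size: int = 2) -> Iterator[List[Row]]:
    """Hypothetical helper: yield consecutive batches of rows, as a first step might."""
    for start in range(0, len(rows), batch_size):
        yield rows[start:start + batch_size]


# Plain list of dicts used here so the sketch runs without extra dependencies.
rows = to_rows([{"instruction": "a"}, {"instruction": "b"}, {"instruction": "c"}])
batches = list(iter_batches(rows, batch_size=2))
```

With something like this in place, the first step of a pipeline could consume the batches directly instead of reading from disk or the Hub.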

Describe alternatives you've considered
Write the dataset to disk first, or upload it to the HF Hub.



Labels: enhancement (New feature or request)
