You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
**Goal:** find a way to version code, data and pipelines.
49
47
50
-
#### Existing project
48
+
###Initial project
51
49
52
-
Starting from an existing project composed of Python 3 module(s) and a set of **Jupyter notebooks**,
53
-
we want to create an automated pipeline in order to version, share and reproduce experiments.
50
+
Starting from an existing project composed of Python module(s) and a set of Jupyter
51
+
notebooks, we want to create an automated pipeline in order to version, share and
52
+
reproduce experiments.
54
53
55
54
56
55
│── classifier
@@ -70,16 +69,18 @@ we want to create an automated pipeline in order to version, share and reproduce
70
69
│── requirements.yml
71
70
│── setup.cfg
72
71
│── setup.py
73
-
74
-
The data flow is processed by applying steps and intermediary results are versioned using metadata files. These steps are defined in **Jupyter notebooks**, which are then converted to Python scripts.
72
+
73
+
The data flow is processed by applying steps and intermediary results are versioned
74
+
using metadata files. These steps are defined in Jupyter notebooks, which are then
75
+
converted to Python scripts.
75
76
76
77
Keep in mind that:
77
78
78
-
- The reference for the code of the step remains in **Jupyter notebook**
79
+
- The reference for the code of the step remains in the Jupyter notebook
79
80
- Pipelines are structured according to their inputs and outputs
Create and a Python virtual environment, and activate it:
190
190
191
-
make develop
192
-
193
-
All other steps are explain in each use cases.
191
+
```shell
192
+
virtualenv --python python3 venv
193
+
source venv/bin/activate
194
+
```
194
195
195
-
#### Cases
196
+
Install requirements:
196
197
197
-
-[How DVC works](./tutorial/dvc_overview.md)
198
+
```shell
199
+
make develop
200
+
```
198
201
199
-
-[MLV-tools pipeline features (on simple cases)](./tutorial/pipeline_features.md)
202
+
All other steps are explained in each use case.
200
203
201
-
- Going further with more realistic use cases:
204
+
### Cases
202
205
203
-
-[Use Case 1: Build and Reproduce a Pipeline](./tutorial/use_case1.md)
204
-
-[Use Case 2: Create a new version of a pipeline](./tutorial/use_case2.md) (Run an experiment)
205
-
-[Use Case 3: Build a Pipeline from an Existing Pipeline](./tutorial/use_case3.md)
206
-
-[Use Case 4: Hyperparameter optimisation and fine-tuning](./tutorial/use_case4.md)
206
+
*[How DVC works](./tutorial/dvc_overview.md)
207
+
*[mlvtools pipeline features (on simple cases)](./tutorial/pipeline_features.md)
208
+
* Going further with more realistic use cases:
209
+
*[Use Case 1: Build and Reproduce a Pipeline](./tutorial/use_case1.md)
210
+
*[Use Case 2: Create a new version of a pipeline](./tutorial/use_case2.md) (Run an
211
+
experiment)
212
+
*[Use Case 3: Build a Pipeline from an Existing Pipeline](./tutorial/use_case3.md)
213
+
*[Use Case 4: Hyperparameter optimisation and fine-tuning](./tutorial/use_case4.md)
207
214
208
215
209
216
## Talks
210
217
211
-
-[PyData Paris - March 2019 Meetup](https://www.meetup.com/fr-FR/PyData-Paris/events/259187805/): [talk](https://peopledoc.github.io/mlv-tools-tutorial/talks/pyData/presentation.html)
212
-
-[PyData Amsterdam - May 2019](https://pydata.org/amsterdam2019/schedule/presentation/32/): [tutorial](https://peopledoc.github.io/mlv-tools-tutorial/talks/workshop/presentation.html)
0 commit comments