dask tutorial #20

Closed
orbeckst opened this issue Oct 22, 2018 · 5 comments
Assignees
Labels
notebook Create a Jupyter notebook as primary deliverable. tutorial session

Comments

@orbeckst
Member

introductory tutorial on dask

  • background: task graph and execution
  • parallelizing with dask
  • using distributed (setting up a dask cluster)
  • using the bokeh status web interface
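To illustrate the first two bullets, here is a minimal sketch of building a task graph with `dask.delayed` and then executing it. The functions `load` and `count` are hypothetical stand-ins for real work (they are not part of PMDA or MDAnalysis), and the synchronous scheduler is chosen here only to keep the example deterministic and easy to debug.

```python
from dask import delayed


def load(i):
    # Hypothetical stand-in for reading one block of data.
    return list(range(i))


def count(block):
    # Hypothetical stand-in for a per-block analysis step.
    return len(block)


# Wrapping calls in delayed() records them as tasks instead of running them.
parts = [delayed(count)(delayed(load)(i)) for i in range(4)]
total = delayed(sum)(parts)

# Nothing has executed yet; compute() walks the graph and runs the tasks.
result = total.compute(scheduler="synchronous")
print(result)
```

If graphviz is installed, `total.visualize()` should render the graph, which may be a helpful picture for the "task graph and execution" part of the tutorial.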
@orbeckst orbeckst added the notebook Create a Jupyter notebook as primary deliverable. label Oct 22, 2018
@orbeckst
Member Author

@VOD555 please coordinate with @dotsdl for your PMDA tutorial so that the two tutorials fit together.

@orbeckst
Member Author

With the changes in MDAnalysis/pmda#66 and the new way to globally define the scheduler, it becomes even more useful to have a solid introduction to dask.

@dotsdl please can you make sure that the new idiom to set the scheduler, which @kain88-de explained in MDAnalysis/pmda#66 (comment), is also covered?

@orbeckst
Member Author

orbeckst commented Nov 1, 2018

With PMDA now using dask's preferred way to select a scheduler, we have to teach people to explicitly set the multiprocessing or distributed scheduler instead of the default threaded scheduler. The threaded scheduler does not work well with our largely Python-based code, because pure-Python code holds the GIL and threads cannot run it in parallel.

EDIT: As @kain88-de says below, we still use multiprocessing as the default. It will still be useful to show how to set up distributed and talk about the schedulers, though. It is also worth pointing out that the threaded scheduler will not work well with PMDA.
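For the tutorial, selecting a scheduler could be sketched roughly as follows, using `dask.config.set`. This is a generic dask example, not PMDA's actual code: `square` and `sum_of_squares` are hypothetical helpers introduced only for illustration.

```python
import dask
from dask import delayed


def square(x):
    # Hypothetical pure-Python task; code like this is GIL-bound under threads.
    return x * x


def sum_of_squares(values, scheduler):
    # Build a small graph, then run it with the requested scheduler.
    tasks = [delayed(square)(v) for v in values]
    total = delayed(sum)(tasks)
    # The context manager sets the scheduler for everything computed inside it.
    with dask.config.set(scheduler=scheduler):
        return total.compute()


if __name__ == "__main__":
    # "processes" selects the multiprocessing scheduler; "threads" and
    # "synchronous" are the other local choices.
    print(sum_of_squares(range(5), "processes"))
```

Calling `dask.config.set(scheduler="processes")` outside a `with` block sets the choice globally. For distributed, creating a `dask.distributed.Client` registers it as the default scheduler, so subsequent `compute()` calls go to the cluster without further configuration.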

@kain88-de
Member

kain88-de commented Nov 2, 2018 via email

@orbeckst
Member Author

orbeckst commented Nov 2, 2018

Yes, you're right.

(I didn't think of adding a note regarding the threaded scheduler in MDAnalysis/pmda#78.)
