Added implementation of zapline for power noise removal #1032

ariguiba · 2024-12-17T16:56:42Z

Information about this PR:

This adds the option of using the Zapline algorithm to filter power noise data using the MEEGKIT implementation https://nbara.github.io/python-meegkit/auto_examples/example_dss_line.html

Current issues:

The algorithm takes too long to run for even a small dataset
Some artifacts are still visible

welcome · 2024-12-17T16:56:45Z

Hello! 👋 Thanks for opening your first pull request here! ❤️ We will try to get back to you soon. 🚴🏽‍♂️

for more information, see https://pre-commit.ci

ariguiba · 2024-12-17T16:58:18Z

@behinger

behinger · 2024-12-18T13:03:04Z

Thanks Boshra!

this looks already good to me - I think zapline is at the conceptually right place (a "replacement" to notch-filtering).
meegkit as a requirement, here someone from mne-bids-pipeline time has to chim in for sure, is that too large? is it ok? can it be made optionally, or how does the dependency-management work?
the failing unittests because of deprecated use of numpy.core.numerictype are a problem to be still fixed. Maybe this is something to update upstream to the pyriemann package, can you check? I'm also wondering if we can use meegkit without ASR etc. - just the dss.py importants - but I dont know enough about python

larsoner · 2024-12-18T17:38:13Z

meegkit as a requirement, here someone from mne-bids-pipeline time has to chim in for sure, is that too large? is it ok? can it be made optionally, or how does the dependency-management work?

We could make it optional but really:

$ pip show meegkit
...
Requires: joblib, matplotlib, numpy, pandas, pyriemann, scikit-learn, scipy, statsmodels, tqdm
...

...we already require all of these except statsmodels and pyriemann so I think it's okay just to add it, assuming it's on PyPI and conda-forge, and it does appear to be both places.

the failing unittests because of deprecated use of numpy.core.numerictype are a problem to be still fixed. Maybe this is something to update upstream to the pyriemann package, can you check? I'm also wondering if we can use meegkit without ASR etc. - just the dss.py importants - but I dont know enough about python

Either meegkit could make some of these imports optional, or we can just ignore the dtype issue locally in our tests. It would be okay to add another ignore to mne_bids_pipeline/tests/conftest.py

hoechenberger · 2024-12-18T18:13:23Z

I'm okay with depending on meegkit. If it ever starts to cause trouble, we can simply drop the functionality again -- it's not a "core" functionality we critically depend on.

@agramfort WDYT?

for more information, see https://pre-commit.ci

behinger · 2025-02-12T20:34:42Z

is there an update on this? How should we move this forward?

larsoner · 2025-02-13T20:09:20Z

@ariguiba do you still want to work on this? If so I'm happy to do a quick review, looks like it might be a few small tweaks then we could get it in!

@behinger if there is no response for a little bit (maybe a week?) then you could take over if you want

ariguiba · 2025-02-14T15:48:03Z

So I would be done with my part, I don't know what more to tweak honestly. I think a decision needs to be made about the following:
As I understand it, the errors are caused because the code we're using from MEEGKit is using some deprecated or problematic numpy method.
Also, in my opinion using the dss_line method may not be the best idea also because it seems to be super slow even on a not-so-big dataset.
So I think the best choice would be to take the source code and adapt it to our use-case 1. to remove the problematic numpy method and 2. maybe make it faster when integrated in our pipeline.
But I don't know if it's possible to just reuse the code, what do you think?
Or how would you move forward with this? Is there some small tweaks I can still do?

larsoner · 2025-02-14T16:36:55Z

So I think the best choice would be to take the source code and adapt it to our use-case 1. to remove the problematic numpy method and 2. maybe make it faster when integrated in our pipeline.

I think it would be better to improve meegkit directly if possible -- have you raised the issue over there yet? Better to improve the upstream package rather than start maintaining a parallel implementation

In the meantime I can hopefully push some commits next week to make CIs happy

larsoner · 2025-02-14T16:44:01Z

... actually meegkit 0.1.9 landed three days ago, I'll restart CIs to see if it's fixed already

larsoner · 2025-02-14T17:39:18Z

Looks like it ran out of memory, I'll try an 8GB machine but if that dies, too, then the implementation will need to be improved before this can proceed I think

larsoner · 2025-02-14T22:12:47Z

Looking at the CircleCI example output from https://output.circle-artifacts.com/output/job/22f9deac-3a86-4049-b3c3-4d05693364f6/artifacts/0/site/examples/eeg_matchingpennies.html#generated-output things look okay, WDYT @ariguiba ?

ariguiba · 2025-02-17T09:47:13Z

Looks good to me too! As I said maybe the upstream implementation can be improved in matters of performance but otherwise it seems to be doing what it should thank you for the tweaks 👍

larsoner · 2025-02-18T15:52:44Z

@drammock feel free to merge if you're happy

drammock

I don't think this is actually getting tested adequately. I spot-checked several dataset test logs on the CIs, and all of them under 04 frequency filter said "computation unnecessary (cached)" so I think our CIs didn't actually hit any case where zapline_fline=None (which, see below for why I'm concerned about that).

mne_bids_pipeline/steps/preprocessing/_04_frequency_filter.py

larsoner

The only dataset where zapline is enabled is eeg_matchingpennies so you should see it used here:

https://app.circleci.com/pipelines/github/mne-tools/mne-bids-pipeline/4911/workflows/2955b461-eadd-49d4-b353-6a031601b6b3/jobs/76007?invite=true#step-104-1870_63

Note if you ever look at GHA logs also make sure that you're looking at the correct one. I suspect you maybe looked at the bottommost one, which should show all steps cached. You need to go one above that to see the first run. (The second one is actually a caching test!) See for example:

I'll push a little commit to name the runs to help with that part

larsoner · 2025-02-19T15:06:32Z

mne_bids_pipeline/steps/preprocessing/_04_frequency_filter.py

+    if fline is None:
+        return


@drammock there is a short-circuit here for the None case

there is a short-circuit here for the None case

🤦🏻 sorry, how did I miss that.

mne_bids_pipeline/steps/preprocessing/_04_frequency_filter.py

drammock · 2025-02-19T15:49:17Z

if you ever look at GHA logs also make sure that you're looking at the correct one. I suspect you maybe looked at the bottommost one, which should show all steps cached. You need to go one above that to see the first run. (The second one is actually a caching test!)

I was looking at the CircleCI runs, but thanks for the GHA tip. The problem was that I somehow missed the short-circuit.

welcome · 2025-02-19T16:10:36Z

🎉 Congrats on merging your first pull request! 🥳 Looking forward to seeing more from you in the future! 💪

added implementation of zapline for power noise removal

3891583

[pre-commit.ci] auto fixes from pre-commit.com hooks

7b9088b

for more information, see https://pre-commit.ci

ariguiba and others added 3 commits January 10, 2025 13:48

fixed zapline integration error

3806be4

small fixes

c795d9d

[pre-commit.ci] auto fixes from pre-commit.com hooks

e87e821

for more information, see https://pre-commit.ci

larsoner added 3 commits February 14, 2025 11:44

Merge branch 'main' into zapline

3c9153e

FIX: pre-commit

ed336b0

FIX: Tweaks

8e8ed58

FIX: Mem

1ce4e5a

larsoner marked this pull request as ready for review February 18, 2025 15:52

drammock requested changes Feb 18, 2025

View reviewed changes

mne_bids_pipeline/steps/preprocessing/_04_frequency_filter.py Show resolved Hide resolved

larsoner reviewed Feb 19, 2025

View reviewed changes

FIX: Type

3252cc7

drammock approved these changes Feb 19, 2025

View reviewed changes

drammock enabled auto-merge (squash) February 19, 2025 15:49

drammock merged commit c9b79e0 into mne-tools:main Feb 19, 2025
55 of 56 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added implementation of zapline for power noise removal #1032

Added implementation of zapline for power noise removal #1032

ariguiba commented Dec 17, 2024

welcome bot commented Dec 17, 2024

ariguiba commented Dec 17, 2024

behinger commented Dec 18, 2024

larsoner commented Dec 18, 2024

hoechenberger commented Dec 18, 2024

behinger commented Feb 12, 2025

larsoner commented Feb 13, 2025

ariguiba commented Feb 14, 2025

larsoner commented Feb 14, 2025

larsoner commented Feb 14, 2025

larsoner commented Feb 14, 2025

larsoner commented Feb 14, 2025

ariguiba commented Feb 17, 2025

larsoner commented Feb 18, 2025

drammock left a comment

larsoner left a comment

larsoner Feb 19, 2025

drammock Feb 19, 2025

drammock commented Feb 19, 2025

welcome bot commented Feb 19, 2025

Added implementation of zapline for power noise removal #1032

Added implementation of zapline for power noise removal #1032

Conversation

ariguiba commented Dec 17, 2024

welcome bot commented Dec 17, 2024

ariguiba commented Dec 17, 2024

behinger commented Dec 18, 2024

larsoner commented Dec 18, 2024

hoechenberger commented Dec 18, 2024

behinger commented Feb 12, 2025

larsoner commented Feb 13, 2025

ariguiba commented Feb 14, 2025

larsoner commented Feb 14, 2025

larsoner commented Feb 14, 2025

larsoner commented Feb 14, 2025

larsoner commented Feb 14, 2025

ariguiba commented Feb 17, 2025

larsoner commented Feb 18, 2025

drammock left a comment

Choose a reason for hiding this comment

larsoner left a comment

Choose a reason for hiding this comment

larsoner Feb 19, 2025

Choose a reason for hiding this comment

drammock Feb 19, 2025

Choose a reason for hiding this comment

drammock commented Feb 19, 2025

welcome bot commented Feb 19, 2025