-
Notifications
You must be signed in to change notification settings - Fork 523
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Karcher Mean Merge Method #546
Conversation
All contributors have signed the CLA ✍️ ✅ |
Warning: abstract math I think this may be coincident with the "Geometric Median" in #345, which has been included in a merged PR in other repo. Anyway nice work! |
"Slightly different", this algorithm assumes that the model's intrinsic space is nonlinear. |
This is my recent work, and there is no paper yet. |
Thanks for the PR, this is quite interesting! I'd love to merge it in. Could you run the pre-commit hook to get the formatting standardized? |
Could you please guide me? This is my first time submitting a PR. |
First make sure you have the dev dependencies installed, like so: pip install -e .[dev] And then run this command: pre-commit run --all-files That will autoformat the code, then you can add those changes and push them. |
What should I do when I encounter 1 workflow awaiting approval? |
|
@cg123 Could you take a look? It should work now. |
Thanks for the PR! Merged. |
Add Karcher Mean Merge Method
Description
This PR adds a new merge method based on the Riemannian (Karcher) mean concept. The Karcher mean provides a geometrically meaningful way to average points on a manifold, which is particularly useful for merging model weights that can be interpreted as points on a hypersphere.
Features
max_iter
: Maximum iterations for the Karcher mean algorithm (default: 10)tol
: Convergence tolerance (default: 1e-5)Implementation Details
References