Skip to content

Updated review of Web Neural Network API #1072

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
1 task done
anssiko opened this issue Mar 20, 2025 · 0 comments
Open
1 task done

Updated review of Web Neural Network API #1072

anssiko opened this issue Mar 20, 2025 · 0 comments
Assignees
Labels
Focus: API design (pending) Progress: in progress Topic: Design Principles we believe this design review will inform our work on the Web Platform Design Principles document. Topic: Machine Learning Not AI

Comments

@anssiko
Copy link

anssiko commented Mar 20, 2025

Hello TAG!

I'm requesting an updated TAG review of WebNN API - previous TAG review: #933

Further details:

  • I have reviewed the TAG's Web Platform Design Principles
  • Previous early design review, if any: 2000, 2022, 2024
  • Relevant time constraints or deadlines: We plan to publish a new CRS during Q2 2025
  • The group where the work on this specification is currently being done: W3C Web Machine Learning Working Group
  • Major unresolved issues with or opposition to this specification: none
  • This work is being funded by:

You should also know that...

Below is a summary of substantive changes since the previous TAG review in 2024 to assist you in this delta review.

Significant new features:

  • Add 12 new operators ("Wave 3") to support well-known transformer models based on an analysis of top 20 downloaded Hugging Face models. The models comprise of a diverse set of tasks, including Small Language Models and Large Language Models, object detection, sentence similarity, feature extraction, speech recognition, image segmentation, text classification, and text generation. See issue, PR, presentation for details on these operators and motivation.
  • Add MLTensor interface that represents a tensor, an input or output to an MLGraph. Update MLContext interface with new createTensor(), readTensor(), writeTensor() methods, change compute() to dispatch() for scheduling the workload, add new destroy() method for MLContext, MLGraph, MLTensor. These changes allow buffer reuse, chained inference, better resource management with explicit destruction, and the opportunity to share memory with WebGPU, minimizing copies. See MLTensor Explainer for details.

Other substantial changes:

The group would like to seek TAG's explicit advise on the following:

@jyasskin jyasskin added Progress: in progress Topic: Design Principles we believe this design review will inform our work on the Web Platform Design Principles document. Topic: Machine Learning Not AI Focus: API design (pending) and removed Progress: untriaged labels Apr 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Focus: API design (pending) Progress: in progress Topic: Design Principles we believe this design review will inform our work on the Web Platform Design Principles document. Topic: Machine Learning Not AI
Projects
None yet
Development

No branches or pull requests

4 participants