
Query supported devices before graph compilation #815

@anssiko

Description

[ Spun off from PR #809 review comment by @fdwr and a discussion with @inexorabletash, @reillyeon, @RafaelCintron, @philloooo -- thanks! ]

(Emphasis mine:)

This afternoon we were talking with {@inexorabletash, @reillyeon, @RafaelCintron, Phillis} about the likely need for a caller to know whether a particular device is supported, because an app may prefer a different, more performant fallback over having WebNN silently fall back to CPU. For example, if the GPU was unavailable (even though high performance was preferred), it might be faster to execute the model with WebGPU shaders than with WebNN on CPU; or CPU might be acceptable, but the app could load a different, more CPU-friendly model if it knew that was the case.

This need is the reverse of what we currently have: querying the context for what kinds of devices it actually supports, rather than prescribing a preferred device type. Preferred device type prescription may return someday as a weaker hint (or a list of hints, or a list of antihints...) as we learn more about usage and needs, but in either case, prescription or post-creation query, having device enums could be useful. 🤔

To kick off this discussion, I'll start with a simple and naive API further abstracted from the GPUDevice.features setlike interface:

// only MLPowerPreference supported at context creation time
const context = await navigator.ml.createContext({ powerPreference: 'high-performance' });

// query the newly created context
if (context.devices.has('cpu') && context.devices.has('npu')) {
  // context supports both CPU and NPU, choose the model accordingly
  // how the work is split across CPU and NPU is an implementation detail
} else if (!context.devices.has('gpu')) {
  // no GPU available, perhaps try WebGPU?
} else {
  // ...
}
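To make the intended decision logic concrete, the branching above can be factored into a pure helper that is testable without a browser. The device names mirror the imaginary context.devices setlike sketched above, and the returned strategy labels are purely illustrative, not part of any proposal:

```typescript
// Hypothetical fallback chooser mirroring the branching in the example
// above. 'cpu' | 'gpu' | 'npu' follow the imaginary context.devices
// setlike; the returned labels are illustrative only.
type MLDeviceName = 'cpu' | 'gpu' | 'npu';

function chooseExecutionPath(devices: Set<MLDeviceName>): string {
  if (devices.has('cpu') && devices.has('npu')) {
    // Context supports both CPU and NPU; pick an NPU-friendly model.
    return 'webnn-cpu-npu';
  } else if (!devices.has('gpu')) {
    // No GPU available; a WebGPU shader path may beat WebNN on CPU.
    return 'webgpu-fallback';
  }
  // GPU is available; use the default model.
  return 'webnn-default';
}
```

The point of the factoring is that the app, not the implementation, decides the fallback order once it can observe what the context actually supports.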

To help tease out more informed proposals, I'd like folks to test this imaginary API against the following:

@mwyrzykowski would this work with CoreML MLComputeUnits? Only a subset of the possible MLDeviceType combinations is supported, but there's also a catch-all all?
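For reference, one way to frame the CoreML question is as a mapping from MLComputeUnits cases to device sets. The case names below (cpuOnly, cpuAndGPU, cpuAndNeuralEngine, all) are CoreML's actual enum values, but the mapping itself is only my assumption of how a Core ML backend might report devices, not anything a user agent has committed to:

```typescript
// Sketch: how a Core ML backend *might* translate its MLComputeUnits
// configuration into the device set a context.devices query would
// report. The mapping is an assumption for discussion purposes.
type ComputeUnits = 'cpuOnly' | 'cpuAndGPU' | 'cpuAndNeuralEngine' | 'all';

function devicesForComputeUnits(units: ComputeUnits): Set<string> {
  switch (units) {
    case 'cpuOnly':
      return new Set(['cpu']);
    case 'cpuAndGPU':
      return new Set(['cpu', 'gpu']);
    case 'cpuAndNeuralEngine':
      return new Set(['cpu', 'npu']);
    case 'all':
      // The catch-all case: all available compute units.
      return new Set(['cpu', 'gpu', 'npu']);
  }
}
```

Notably, under this framing the catch-all all is unproblematic for a query API: it simply reports every device, and the split across them stays an implementation detail.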

How about other backends?

This issue welcomes further discussion and proposals on a post-context-creation query mechanism for supported devices.
