|
async
|
zml: reduce memory usage of sdpaMemEfficient by using zml.ops.while instead of zml.ops.for, avoiding concatenation of intermediate results.
|
2023-08-14 14:24:11 +00:00 |
|
bazel
|
Add Bazel build rules and runtime implementation for AWS Neuron/Trainium/Inferentia support.
|
2023-08-18 17:11:27 +00:00 |
|
docs
|
Update docs (deploy_on_server, dockerize_models, getting_started) and example Bazel files to include AWS Neuron/Trainium/Inferentia deployment guidance.
|
2023-08-21 09:15:48 +00:00 |
|
examples
|
Update docs (deploy_on_server, dockerize_models, getting_started) and example Bazel files to include AWS Neuron/Trainium/Inferentia deployment guidance.
|
2023-08-21 09:15:48 +00:00 |
|
mlir
|
Update workspace dependencies to newer LLVM, XLA, StableHLO, and PJRT versions and expose new pjrt plugin attribute APIs and stablehlo version APIs in build and runtime configurations.
|
2023-08-07 12:28:36 +00:00 |
|
pjrt
|
Update workspace dependencies to newer LLVM, XLA, StableHLO, and PJRT versions and expose new pjrt plugin attribute APIs and stablehlo version APIs in build and runtime configurations.
|
2023-08-07 12:28:36 +00:00 |
|
platforms
|
Add initial Bazel build configuration, async runtime implementation, and core MLIR dialect definitions for ZML.
|
2023-01-02 14:28:25 +00:00 |
|
runtimes
|
Add Bazel build rules and runtime implementation for AWS Neuron/Trainium/Inferentia support.
|
2023-08-18 17:11:27 +00:00 |
|
stdx
|
Rework async runtime with coroutine support, rename async API (async_→asyncc, await_→awaitt), improve type inference, bump libxev (default epoll) and update related stdx and zml modules.
|
2023-08-01 11:35:04 +00:00 |
|
third_party
|
Update workspace dependencies to newer LLVM, XLA, StableHLO, and PJRT versions and expose new pjrt plugin attribute APIs and stablehlo version APIs in build and runtime configurations.
|
2023-08-07 12:28:36 +00:00 |
|
tools
|
Fix CollectionOver scope error in ActivationCollector and clean dead code/comments in zml_utils.py
|
2023-01-10 09:43:03 +00:00 |
|
zml
|
zml/platform: increase maximum device limit to support up to 32 devices per platform.
|
2023-08-24 12:23:07 +00:00 |
|
BUILD.bazel
|
Rework async runtime with coroutine support, rename async API (async_→asyncc, await_→awaitt), improve type inference, bump libxev (default epoll) and update related stdx and zml modules.
|
2023-08-01 11:35:04 +00:00 |
|
build.zig
|
Add initial Bazel build configuration, async runtime implementation, and core MLIR dialect definitions for ZML.
|
2023-01-02 14:28:25 +00:00 |
|
MODULE.bazel
|
Add Bazel build rules and runtime implementation for AWS Neuron/Trainium/Inferentia support.
|
2023-08-18 17:11:27 +00:00 |
|
MODULE.bazel.lock
|
Add Bazel build rules and runtime implementation for AWS Neuron/Trainium/Inferentia support.
|
2023-08-18 17:11:27 +00:00 |
|
platform_mappings
|
Add initial Bazel build configuration, async runtime implementation, and core MLIR dialect definitions for ZML.
|
2023-01-02 14:28:25 +00:00 |
|
zls.build.json
|
zml: reduce memory usage of sdpaMemEfficient by using zml.ops.while instead of zml.ops.for, avoiding concatenation of intermediate results.
|
2023-08-14 14:24:11 +00:00 |