async
zml: reduce memory usage of sdpaMemEfficient by using zml.ops.while instead of zml.ops.for, avoiding concatenation of intermediate results.
2023-08-14 14:24:11 +00:00
bazel
Add Bazel build rules and runtime implementation for AWS Neuron/Trainium/Inferentia support.
2023-08-18 17:11:27 +00:00
docs
Update docs (deploy_on_server, dockerize_models, getting_started) and example Bazel files to include AWS Neuron/Trainium/Inferentia deployment guidance.
2023-08-21 09:15:48 +00:00
examples
Update llama example BUILD to use jax-cuda-pjrt plugin and bump CUDA (12.6.2) / CuDNN (9.5.1) versions.
2023-09-12 15:40:21 +00:00
mlir
zml/pjrtx: prefer the built‑in stablehlo version when a plugin reports a newer version, ensuring artifact serialization uses the correct stablehlo version.
2023-09-07 17:06:19 +00:00
pjrt
workspace: log diagnostic message before returning NotFound to aid debugging.
2023-09-04 13:34:37 +00:00
platforms
Add initial Bazel build configuration, async runtime implementation, and core MLIR dialect definitions for ZML.
2023-01-02 14:28:25 +00:00
runtimes
runtimes/cuda: obtain NCCL from the pip package, matching XLA behavior.
2023-09-20 17:41:44 +00:00
stdx
Rework async runtime with coroutine support, rename async API (async_→asyncc, await_→awaitt), improve type inference, bump libxev (default epoll) and update related stdx and zml modules.
2023-08-01 11:35:04 +00:00
third_party
Update workspace dependencies to newer LLVM, XLA, StableHLO, and PJRT versions and expose new pjrt plugin attribute APIs and stablehlo version APIs in build and runtime configurations.
2023-08-07 12:28:36 +00:00
tools
Fix CollectionOver scope error in ActivationCollector and clean dead code/comments in zml_utils.py
2023-01-10 09:43:03 +00:00
zml
zml: fix float8 <-> float32 conversions, support for Tensor.constant(.{}, .{ .f8 = 1.0})
2023-09-21 11:15:50 +00:00
BUILD.bazel
Rework async runtime with coroutine support, rename async API (async_→asyncc, await_→awaitt), improve type inference, bump libxev (default epoll) and update related stdx and zml modules.
2023-08-01 11:35:04 +00:00
build.zig
Add initial Bazel build configuration, async runtime implementation, and core MLIR dialect definitions for ZML.
2023-01-02 14:28:25 +00:00
MODULE.bazel
Add Bazel build rules and runtime implementation for AWS Neuron/Trainium/Inferentia support.
2023-08-18 17:11:27 +00:00
MODULE.bazel.lock
Add Bazel build rules and runtime implementation for AWS Neuron/Trainium/Inferentia support.
2023-08-18 17:11:27 +00:00
platform_mappings
Add initial Bazel build configuration, async runtime implementation, and core MLIR dialect definitions for ZML.
2023-01-02 14:28:25 +00:00
zls.build.json
zml: reduce memory usage of sdpaMemEfficient by using zml.ops.while instead of zml.ops.for, avoiding concatenation of intermediate results.
2023-08-14 14:24:11 +00:00