|
|
0709b1b32f
|
zml: reduce memory usage of sdpaMemEfficient by using zml.ops.while instead of zml.ops.for, avoiding concatenation of intermediate results.
|
2023-08-14 14:24:11 +00:00 |
|
|
|
f675a203c2
|
zml.ops.makeBlock now returns the tensors produced by the supplied function alongside the created mlir.Block, so shapes and tags propagate without exposing mlir.Values. Updated tests to run on non‑CPU platforms.
|
2023-07-21 09:01:01 +00:00 |
|
|
|
9b7eea8ac2
|
Add stdx utilities and rework async signature inference; tidy executable logging.
|
2023-06-21 14:45:14 +00:00 |
|
|
|
2f54e2a5f3
|
zml.tensor: add a triangular operator that zeroes out the upper‑right region of a matrix with a configurable offset, and toDiagonal (diag_embed), which embeds a vector as a diagonal matrix and corrects the previous diag naming. Also add ELU activation under zml.nn.Activation.
|
2023-05-18 16:39:21 +00:00 |
|
|
|
021111d07d
|
Extend tests to handle all float types, preventing crashes with bfloat16 tensors.
|
2023-04-27 10:34:27 +00:00 |
|
|
|
66881899ca
|
Fix testLayer by removing the unnecessary compile_options argument and updating the testing logic for the new sharded output, ensuring proper usage by llama.zig.
|
2023-03-31 14:23:45 +00:00 |
|
|
|
a4f0fc96c0
|
Integrate user sharding hints and HLO sharding annotations across MLIR dialects and ZML core, and remove the now‑unused module options arguments.
|
2023-03-21 10:50:39 +00:00 |
|
|
|
266da6d4be
|
Add initial Bazel build configuration, async runtime implementation, and core MLIR dialect definitions for ZML.
|
2023-01-02 14:28:25 +00:00 |
|