|
|
6d720126ac
|
Add PJRT custom call integration with generic zmlHostBufferCallback to copy tensors to host and invoke user callbacks. Introduce Tensor.print() method to output runtime tensor values (CUDA‑specific, uses a pre‑allocated host buffer).
|
2023-06-05 13:42:45 +00:00 |
|
|
|
05faa5021e
|
zml.tensor: add cumulativeSum operator and refactor maxPoolND. Introduce cumulative sum using reduceWindow. Simplify reduceWindow signature by merging padding_shape and padding_value. Update maxPool1D/2D to accept tuple arguments. Revise pad to use tagged or AOS syntax; remove SOA syntax.
|
2023-05-17 09:01:27 +00:00 |
|
|
|
24a7c98476
|
Implement scatterSlices functionality.
|
2023-02-14 13:52:49 +00:00 |
|
|
|
be6328813d
|
zml: clean up dead and commented code; note that copyslice is currently broken and pending reimplementation
|
2023-02-08 17:13:47 +00:00 |
|
|
|
ebdb8db213
|
zml/tests: re‑enable all Zig tests, fix precision issue by switching to f32, and add refAllDecls to ensure all declarations are tested
|
2023-01-23 16:28:19 +00:00 |
|
|
|
266da6d4be
|
Add initial Bazel build configuration, async runtime implementation, and core MLIR dialect definitions for ZML.
|
2023-01-02 14:28:25 +00:00 |
|