b58f7ced3d
Fix $ORIGIN handling in runtimes/neuron by escaping $ since zigopts does not expand Make variables.
2025-11-24 12:04:56 +00:00
993cbffd22
Upgrade workspace Zig version to 0.15.2.
2025-11-21 12:12:36 +00:00
d3baeed676
Add -fllvm flag to the MNIST example BUILD configuration.
2025-11-20 15:28:47 +00:00
3d3a0ea463
Enable -fllvm flag for CUDA runtime, upb, and ZML Bazel targets.
2025-11-11 13:02:40 +00:00
8f50d19cba
Add missing C++ toolchain to zls.
2025-11-10 09:44:11 +00:00
94c5bbcfa6
Update workspace to use rules_zig archive_override before version 0.12.2, fixing shared_library and PJRT loading and ensuring os/log.h inclusion.
2025-11-06 12:36:22 +00:00
cd1b66f615
chore: remove unused target
2025-11-04 12:15:20 +00:00
91f1c3b7aa
pjrt/cpu: fix compilation when CPU is absent by ensuring CcCompilationContext is non-empty for Zig modules accessing the c module
2025-10-30 09:01:36 +00:00
b67a3f9a0a
workspace: switch to libxev from Bazel Central Registry, completing migration away from custom registry
2025-10-29 17:43:22 +00:00
7bdce4c078
Add Llama 3.1 example showcasing optional head‑dimension support.
2025-10-23 17:54:48 +00:00
675cefaf26
Patch http_deb_archive rule and improve stdx utilities and ZML callback handling
2025-10-20 10:24:27 +00:00
20ca4a6648
Update MNIST example BUILD file to use zigopts and align with new upstreamed rules_zig configuration.
2025-10-14 16:57:38 +00:00
bcd43314a4
Migrate BUILD and MODULE files to upstreamed rules_zig (rename copts to zigopts) and adjust ZLS integration accordingly.
2025-10-13 15:26:42 +00:00
7d7c124ada
runtimes/cpu: update Darwin arm64 PJRT tarball checksum to match v13.0.0, enabling Bazel fetch on macOS arm64
2025-10-07 10:18:40 +00:00
d056fd3511
zml: cleanup some todos
2025-10-06 15:29:57 +00:00
77cd21d2b2
Add gpt-oss model support to core ZML components: formatting, utility functions, safetensors I/O, host buffer management, NN layer handling, and tensor operations.
2025-10-01 14:20:32 +00:00
e1b7fc5781
Add example implementation and Bazel build for OpenAI gpt-oss models (GptOss.zig, main.zig, and BUILD.bazel).
2025-09-30 16:17:00 +00:00
d45a667ee5
Revamp gather API with named indices (and add gather_ variant), improve topK handling, and add Yarn rope embedding support across core modules (buffer, nn, pjrtx, quantization, shape, tensor, testing, tokenizer, torch).
2025-09-26 13:38:11 +00:00
7264fff493
Update MNIST and ModernBERT example scripts to use the new gather and topk APIs.
2025-09-24 15:42:09 +00:00
e641d05dd2
mlir, pjrt, zml: expose missing data types (u2, i1, i2, f4e2m1fn, f8e3m4, f8e4m3, f8e8m0fnu); fix Float32 conversion bug that truncated values
2025-09-19 12:13:32 +00:00
29bd1242ba
Add platform tag to buffers for target identification and safety checks; include workaround for PJRT uninitialized memory handling.
2025-09-16 17:43:18 +00:00
9aeb4e9cd0
Update docs and example to demonstrate adding the platform tag to buffers when adding weights.
2025-09-11 11:15:37 +00:00
12efc763d5
Update modernbert example to reflect removal of old writer interface APIs.
2025-09-09 10:54:06 +00:00
3ed9bca5ad
Remove deprecated writer interface APIs from core ZML modules (async, MLIR, PJRT, runtime, fmt, aio, buffer, exe, hostbuffer, meta, mlirx).
2025-09-04 14:03:09 +00:00
090d7748d5
Update docs and example scripts (including test files) to remove async helpers asynk, asyncc, awaitt, and await_
2025-09-02 10:27:40 +00:00
6e15123fb3
Remove obsolete async symbols (asynk, asyncc, awaitt, await_) from core, runtime, and aio modules.
2025-08-29 11:03:59 +00:00
7913c00d70
Implement buffer‑ID based loading by moving tensor ID handling into BufferStore, fix zml.call tag hashing, and expose CPU device count.
2025-08-28 14:39:21 +00:00
6e7617918d
Update llama and simple_layer examples to use BufferStore tensor IDs, new CPU device count API, and fix zml.call tag hashing.
2025-08-22 17:55:03 +00:00
cc969bd532
Add experimental zml.callback API (renamed from custom_call) and fix tensor.print(); update PJRT bindings, host buffer utilities, and related core ZML modules.
2025-08-20 10:27:54 +00:00
1fa056a790
Add example and Bazel build for zml.callback demonstrating a manual CUDA kernel invocation.
2025-08-13 12:14:53 +00:00
01da2184fe
xla: bump to commit b3fbfee, temporarily disable libnvptxcompiler due to missing support in PjRT CUDA plugin v13.0, add nvshmem to sandbox for PjRT CUDA plugin
2025-08-12 13:32:18 +00:00
f35119f768
ci: revamp commit workflow, deduplicate workflows, fix Linux linking issue introduced by upb, resolve duplicated _start symbol in test targets, temporarily use -c opt to bypass static archive bug
2025-08-08 13:51:27 +00:00
9e3cd6d616
bump runtimes/* code to Zig 0.15.1, restore PyTorch loader using std.fs.File, update CI zig fmt, remove stdx.io, note remaining issues with Neuron and CUDA debug builds
2025-08-07 15:09:27 +00:00
0ed7f5c907
Add Zig example programs for LLaMA, ModernBERT, and SimpleLayer, including a Bazel BUILD file for the LLaMA example.
2025-07-29 16:07:11 +00:00
488a844a0f
Add Zig 0.15 compatibility: update BUILD files, async primitives, stdx utilities, MLIR dialects, and PJRT FFI.
2025-07-28 13:54:28 +00:00
e3b7705e3d
Remove gguf and torch loader implementations and related BUILD and test assets.
2025-07-24 17:19:43 +00:00
01db09c24b
Update MNIST example (BUILD.bazel and mnist.zig) to remove torch loader references.
2025-07-23 12:53:46 +00:00
1cf26756a1
workspace: run buildifier, drop rules_uv, refactor tools/hf dependencies
2025-07-16 10:01:41 +00:00
1427286716
runtimes/neuron: fix neuron runtime
...
This PR fixes the neuron runtime with the following:
Proxy the PJRT Api method to enforce the client struct sizes since the
neuron PJRT plugin doesn't use `>=` but `==` to assert them, breaking
PJRT compatibility guarantees.
Fixes https://github.com/aws-neuron/aws-neuron-sdk/issues/1095
Reimplement `libneuronxla` in Zig to control neuronx-cc sandboxing and
invocation.
Implement a python bootstrapper in Zig to create a full blown
`neuronx-cc` executable, avoiding the infamous chicken and egg problem
of python executables boostrapping when sandboxed (due to fixed path
shebangs).
---------
Co-authored-by: Corentin Kerisit <corentin.kerisit@gmail.com>
2025-07-15 15:26:03 +00:00
e1ee340306
runtimes/cuda: implement zmlxcuda in Zig
2025-07-08 09:25:25 +00:00
c488b634fc
runtimes/rocm: implement zmlxrocm in Zig
...
Also, sandbox `amdgpu.ids` and restore safetensors json parsing.
2025-07-07 16:48:07 +00:00
a512b9c8a2
Update docs and example BUILD files to reflect the switch from zig_cc_binary to zig_binary.
2025-07-04 15:10:46 +00:00
cf00506dbb
Switch workspace build rules from zig_cc_binary to zig_binary, removing the hack and using the C linker directly.
2025-07-03 15:10:36 +00:00
78679817df
zml: remove nemo, sentencepiece loaders, remove zig-yaml
...
Progress towards
2025-06-24 15:39:55 +00:00
fe56f03f5d
third_party/xla: de-modularize xla
...
This PR intends to remove XLA as a bzlmod and transfer it as a non
bzlmod dep.
This is because this module will never be upstreamed as is, so keep it
private.
Also, we fetch llvm-raw and stablehlo from it, which is fine.
While there, dummify the various local_config XLA symbols to please the
imports, as we don't use those parts in ZML itself.
Closes
2025-06-23 09:13:43 +00:00
a540564744
Remove example workspace and related documentation files.
2025-06-20 13:23:06 +00:00
e789e26008
Remove examples workspace and clean up related Bazel BUILD/MODULE files and Zig build scripts.
2025-06-19 09:30:29 +00:00
0716f2dc0e
Update examples/MODULE.bazel.lock to reflect the switch from zig‑protobuf to upb.
2025-06-10 16:04:53 +00:00
1b58c34b8c
Update ZML build configuration to replace zig-protobuf with upb library.
2025-06-09 16:34:20 +00:00
cba3be4859
pjrt,zml: remove profiler
...
Closes
Progress towards
2025-06-06 12:24:42 +00:00