Commit Graph

328 Commits

Author SHA1 Message Date
7e81d022ff Add implicit dependency on hf_transfer for tools/hf. 2025-12-08 09:24:52 +00:00
fa886db3bb Disable ZML aio torch pickle test pending implementation fix due to protocol support limitation in version 0.15.2. 2025-12-03 10:57:27 +00:00
ba5043ddf8 Switch ZML to use .host_pinned memory on ROCm, addressing missing host_unpinned memory error. 2025-12-01 14:34:13 +00:00
7894335dc9 Correct compilation errors in examples/callback. 2025-11-26 15:23:41 +00:00
b58f7ced3d Fix $ORIGIN handling in runtimes/neuron by escaping $ since zigopts does not expand Make variables. 2025-11-24 12:04:56 +00:00
993cbffd22 Upgrade workspace Zig version to 0.15.2. 2025-11-21 12:12:36 +00:00
d3baeed676 Add -fllvm flag to the MNIST example BUILD configuration. 2025-11-20 15:28:47 +00:00
3d3a0ea463 Enable -fllvm flag for CUDA runtime, upb, and ZML Bazel targets. 2025-11-11 13:02:40 +00:00
8f50d19cba Add missing C++ toolchain to zls. 2025-11-10 09:44:11 +00:00
94c5bbcfa6 Update workspace to use rules_zig archive_override before version 0.12.2, fixing shared_library and PJRT loading and ensuring os/log.h inclusion. 2025-11-06 12:36:22 +00:00
cd1b66f615 chore: remove unused target 2025-11-04 12:15:20 +00:00
91f1c3b7aa pjrt/cpu: fix compilation when CPU is absent by ensuring CcCompilationContext is non-empty for Zig modules accessing the c module 2025-10-30 09:01:36 +00:00
b67a3f9a0a workspace: switch to libxev from Bazel Central Registry, completing migration away from custom registry 2025-10-29 17:43:22 +00:00
7bdce4c078 Add Llama 3.1 example showcasing optional head‑dimension support. 2025-10-23 17:54:48 +00:00
675cefaf26 Patch http_deb_archive rule and improve stdx utilities and ZML callback handling 2025-10-20 10:24:27 +00:00
20ca4a6648 Update MNIST example BUILD file to use zigopts and align with new upstreamed rules_zig configuration. 2025-10-14 16:57:38 +00:00
bcd43314a4 Migrate BUILD and MODULE files to upstreamed rules_zig (rename copts to zigopts) and adjust ZLS integration accordingly. 2025-10-13 15:26:42 +00:00
7d7c124ada runtimes/cpu: update Darwin arm64 PJRT tarball checksum to match v13.0.0, enabling Bazel fetch on macOS arm64 2025-10-07 10:18:40 +00:00
d056fd3511 zml: cleanup some todos 2025-10-06 15:29:57 +00:00
77cd21d2b2 Add gpt-oss model support to core ZML components: formatting, utility functions, safetensors I/O, host buffer management, NN layer handling, and tensor operations. 2025-10-01 14:20:32 +00:00
e1b7fc5781 Add example implementation and Bazel build for OpenAI gpt-oss models (GptOss.zig, main.zig, and BUILD.bazel). 2025-09-30 16:17:00 +00:00
d45a667ee5 Revamp gather API with named indices (and add gather_ variant), improve topK handling, and add Yarn rope embedding support across core modules (buffer, nn, pjrtx, quantization, shape, tensor, testing, tokenizer, torch). 2025-09-26 13:38:11 +00:00
7264fff493 Update MNIST and ModernBERT example scripts to use the new gather and topk APIs. 2025-09-24 15:42:09 +00:00
e641d05dd2 mlir, pjrt, zml: expose missing data types (u2, i1, i2, f4e2m1fn, f8e3m4, f8e4m3, f8e8m0fnu); fix Float32 conversion bug that truncated values 2025-09-19 12:13:32 +00:00
29bd1242ba Add platform tag to buffers for target identification and safety checks; include workaround for PJRT uninitialized memory handling. 2025-09-16 17:43:18 +00:00
9aeb4e9cd0 Update docs and example to demonstrate adding the platform tag to buffers when adding weights. 2025-09-11 11:15:37 +00:00
12efc763d5 Update modernbert example to reflect removal of old writer interface APIs. 2025-09-09 10:54:06 +00:00
3ed9bca5ad Remove deprecated writer interface APIs from core ZML modules (async, MLIR, PJRT, runtime, fmt, aio, buffer, exe, hostbuffer, meta, mlirx). 2025-09-04 14:03:09 +00:00
090d7748d5 Update docs and example scripts (including test files) to remove async helpers asynk, asyncc, awaitt, and await_ 2025-09-02 10:27:40 +00:00
6e15123fb3 Remove obsolete async symbols (asynk, asyncc, awaitt, await_) from core, runtime, and aio modules. 2025-08-29 11:03:59 +00:00
7913c00d70 Implement buffer‑ID based loading by moving tensor ID handling into BufferStore, fix zml.call tag hashing, and expose CPU device count. 2025-08-28 14:39:21 +00:00
6e7617918d Update llama and simple_layer examples to use BufferStore tensor IDs, new CPU device count API, and fix zml.call tag hashing. 2025-08-22 17:55:03 +00:00
cc969bd532 Add experimental zml.callback API (renamed from custom_call) and fix tensor.print(); update PJRT bindings, host buffer utilities, and related core ZML modules. 2025-08-20 10:27:54 +00:00
1fa056a790 Add example and Bazel build for zml.callback demonstrating a manual CUDA kernel invocation. 2025-08-13 12:14:53 +00:00
01da2184fe xla: bump to commit b3fbfee, temporarily disable libnvptxcompiler due to missing support in PjRT CUDA plugin v13.0, add nvshmem to sandbox for PjRT CUDA plugin 2025-08-12 13:32:18 +00:00
f35119f768 ci: revamp commit workflow, deduplicate workflows, fix Linux linking issue introduced by upb, resolve duplicated _start symbol in test targets, temporarily use -c opt to bypass static archive bug 2025-08-08 13:51:27 +00:00
9e3cd6d616 bump runtimes/* code to Zig 0.15.1, restore PyTorch loader using std.fs.File, update CI zig fmt, remove stdx.io, note remaining issues with Neuron and CUDA debug builds 2025-08-07 15:09:27 +00:00
0ed7f5c907 Add Zig example programs for LLaMA, ModernBERT, and SimpleLayer, including a Bazel BUILD file for the LLaMA example. 2025-07-29 16:07:11 +00:00
488a844a0f Add Zig 0.15 compatibility: update BUILD files, async primitives, stdx utilities, MLIR dialects, and PJRT FFI. 2025-07-28 13:54:28 +00:00
e3b7705e3d Remove gguf and torch loader implementations and related BUILD and test assets. 2025-07-24 17:19:43 +00:00
01db09c24b Update MNIST example (BUILD.bazel and mnist.zig) to remove torch loader references. 2025-07-23 12:53:46 +00:00
1cf26756a1 workspace: run buildifier, drop rules_uv, refactor tools/hf dependencies 2025-07-16 10:01:41 +00:00
1427286716 runtimes/neuron: fix neuron runtime
This PR fixes the neuron runtime with the following:

Proxy the PJRT Api method to enforce the client struct sizes since the
neuron PJRT plugin doesn't use `>=` but `==` to assert them, breaking
PJRT compatibility guarantees.
Fixes https://github.com/aws-neuron/aws-neuron-sdk/issues/1095

Reimplement `libneuronxla` in Zig to control neuronx-cc sandboxing and
invocation.

Implement a python bootstrapper in Zig to create a full blown
`neuronx-cc` executable, avoiding the infamous chicken and egg problem
of python executables boostrapping when sandboxed (due to fixed path
shebangs).

---------

Co-authored-by: Corentin Kerisit <corentin.kerisit@gmail.com>
2025-07-15 15:26:03 +00:00
e1ee340306 runtimes/cuda: implement zmlxcuda in Zig 2025-07-08 09:25:25 +00:00
c488b634fc runtimes/rocm: implement zmlxrocm in Zig
Also, sandbox `amdgpu.ids` and restore safetensors json parsing.
2025-07-07 16:48:07 +00:00
a512b9c8a2 Update docs and example BUILD files to reflect the switch from zig_cc_binary to zig_binary. 2025-07-04 15:10:46 +00:00
cf00506dbb Switch workspace build rules from zig_cc_binary to zig_binary, removing the hack and using the C linker directly. 2025-07-03 15:10:36 +00:00
78679817df zml: remove nemo, sentencepiece loaders, remove zig-yaml
Progress towards
2025-06-24 15:39:55 +00:00
fe56f03f5d third_party/xla: de-modularize xla
This PR intends to remove XLA as a bzlmod and transfer it as a non
bzlmod dep.

This is because this module will never be upstreamed as is, so keep it
private.

Also, we fetch llvm-raw and stablehlo from it, which is fine.

While there, dummify the various local_config XLA symbols to please the
imports, as we don't use those parts in ZML itself.

Closes
2025-06-23 09:13:43 +00:00
a540564744 Remove example workspace and related documentation files. 2025-06-20 13:23:06 +00:00