111afcdd95
examples/llama: switch to --hf-model-path flag
...
Instead of giving config, model weights and tokenizer paths, rely on
`huggingface-cli` download.
2025-05-28 13:21:00 +00:00
5a49a3e8ca
Update examples MODULE.bazel.lock to reflect the added sandbox neuron dependencies.
2025-05-22 10:10:14 +00:00
1a2b862ec2
Add sandbox neuron dependencies: define a trampoline PJRT, create an empty repository for distroless deps, and update Bazel build files and Zig/C sources accordingly.
2025-05-19 17:35:33 +00:00
9e66d5a06a
Update examples/MODULE.bazel.lock to reflect new dependencies.
2025-05-16 11:26:13 +00:00
55c5b540f8
Add XLA 20250718.0‑6319f0d with ROCm 6.4.1 support, update Bazel module files and runtime configs, and apply migration, FFI‑handler and header‑cleanup patches.
2025-05-12 12:10:27 +00:00
cba9ce9615
protobuf, libxev: remove usingnamespace
...
remove usingnamespace from libxev and zig-protobuf
relevant PRs:
* libxev:
7f803181b1
* zig-protobuf: https://github.com/zml/zig-protobuf/pull/1/files
This means we can now build ZML with `-fincremental` (in debug mode only
and linux x86-64)
---------
Signed-off-by: Guillaume Wenzek <wzk@zml.ai>
2025-05-06 11:58:20 +00:00
db303a6010
remove usingnamespace from asyncio
...
Eliminate the final usingnamespace usage, which will be deprecated in version 0.15.0.
2025-05-05 09:03:06 +00:00
ed5ae31338
runtimes/rocm: fetch libdrm from amdgpu repository and add amdgpu.ids layer
2025-04-30 15:53:51 +00:00
47146fc088
workspace: remove glibc version constraint, update rules_rust, and re-enable frame pointers following sandboxing changes.
2025-04-29 12:09:13 +00:00
e7323be10b
runtimes/rocm: switch to in-process LLD, removing the need for sandboxed lld.
2025-04-23 11:43:18 +00:00
02ba41ea57
Update examples/MODULE.bazel to bump rules_zig version and fix static‑lib inclusion for non‑exe targets.
2025-04-21 13:51:52 +00:00
f5f3dc6b1b
Update rules_zig to 20250714.0-b14a4f1 and fix static‑lib inclusion in runfiles for non‑exe targets.
2025-04-18 12:09:38 +00:00
7d9fdf94e7
runtimes/rocm: sandbox ROCm dependencies and ensure they load on the main thread due to TLS usage in static C++ destructors.
2025-04-14 16:38:15 +00:00
eba0e72532
runtimes/tpu: sandbox TPU PJRT plugin; no external dependencies.
2025-04-10 14:47:16 +00:00
8073e45894
Update examples/MODULE.bazel.lock to reflect bumped hftokenizers dependency.
2025-04-09 10:21:44 +00:00
4294a4d08f
Bump hftokenizers dependency versions in Bazel and Cargo lockfiles (MODULE.bazel.lock, Cargo.toml, Cargo.lock)
2025-04-04 12:54:33 +00:00
78d7b672e7
runtimes/cpu: sandbox CPU PJRT plugin, simplifying as there are no additional NEEDED dependencies.
2025-04-03 11:57:46 +00:00
2d321d232d
runtimes/cuda: sandbox CUDA dependencies by removing them from the leaf binary, sandboxing the dependency graph, marking dlopen direct dependencies as NEEDED, setting RPATH to the sandbox, loading the PJRT plugin from the sandbox, and enabling weak CUDA symbols without direct linking.
2025-03-26 11:18:29 +00:00
a5420068b1
pjrt: emit warning instead of panic when FFI Extension is missing (e.g., on TPU).
2025-03-24 09:40:44 +00:00
dc121fce4f
Update example MODULE.bazel and lockfile to reflect toolchains_llvm_bootstrapped bump to 0.2.4.
2025-03-20 12:17:30 +00:00
907577525f
Update MODULE.bazel and lockfile to bump toolchains_llvm_bootstrapped to version 0.2.4.
2025-03-18 11:47:22 +00:00
f27a524f31
Update rules_zig: add zig_srcs target, fix source handling bug, clean up BUILD files, adjust async/coro.zig tests, and disable nemo and yaml model loaders.
2025-03-13 12:27:21 +00:00
6fc1148206
async/coro: make coroutines unwindable by zeroing the initial stack region, preventing random unwinding behavior and SIGSEGV during _Unwind_Backtrace.
2025-03-10 16:25:45 +00:00
f63c673f45
bazel: add RPATH manipulation to patchelf
2025-03-05 11:56:40 +00:00
9488672d4b
workspace: bump xla to version 20250710.0-22ea002
...
Also:
- Bump XLA deps : `com_github_grpc_grpc` and `com_google_protobuf`
- Inject `rules_ml_toolchain`
- Fix `zig_proto_library` rule
2025-03-04 17:12:34 +00:00
fa0ed045ef
runtimes/cuda: downgrade cuda and cudnn
...
This commit reverts part of https://github.com/zml/zml/pull/238/files
This is required because XLA has a strong dependencies on CUDA 12.8 and
upgrading to 12.9 is impossible due to
https://github.com/NVIDIA/cccl/issues/4967
2025-02-28 17:36:12 +00:00
ff1433d998
pjrt: bind PJRT_Client_CreateUninitializedBuffer.
2025-02-25 10:37:45 +00:00
8456a0d073
zml/pjrt: add binding for PJRT_Device_MemoryStats.
2025-02-19 12:14:05 +00:00
a580f2a398
Async: use stronger memory ordering to prevent potential segfaults due to ordering issues.
2025-02-18 11:38:56 +00:00
4d6d975dc0
Patch aio.zig: update loadBuffersWithPrefix argument type to match the conditional type of loadBuffers init_args.
2025-02-13 09:48:13 +00:00
af8844c1f1
Add model prefix support when loading a model from safetensors, enabling use of a specific model prefix (e.g., ModernBertModel) instead of the full model. Tested with the text embeddings server project.
2025-02-12 13:18:27 +00:00
1cafcc3c60
Workspace: bump XLA to newer version.
2025-02-05 17:35:27 +00:00
9ef838be25
Update neuron runtime BUILD.bazel to use Bazel manual tag and S3 cache integration.
2025-02-03 14:03:33 +00:00
dd52e988b4
Update example Bazel build files (MODULE.bazel, llama, modernbert) to test the revamped commit workflow.
2025-01-31 16:28:38 +00:00
0a2ab7c8cb
Remove usingnamespace from MLIR.
2025-01-28 09:35:58 +00:00
f8ab0d7b2a
Remove dead imports.
2025-01-22 10:45:04 +00:00
51a6cab753
Wire has_side_effect field in zml/ops.
2025-01-20 16:45:13 +00:00
99a2001e63
Rename PJRT BufferType to follow Zig and ZML naming conventions.
2025-01-16 13:00:47 +00:00
7324a49da3
Remove .print() calls from globalAttnMask() and localAttnMask() in ModernBERT example to resolve compilation sharding error.
2025-01-15 16:59:26 +00:00
09c43b8759
Add customCall operation to zml/ops.
2025-01-09 15:01:33 +00:00
9f1cc762cd
Fix map tests in zml/meta.
2025-01-06 17:49:50 +00:00
fbf1ecb8b7
Introduce Executable.getCompiledMemoryStats in PJRT.
2025-01-02 16:36:13 +00:00
4b1a3ff48a
Add union support to mapping helpers in zml/meta.zig.
2025-01-01 13:35:17 +00:00
c961d705f1
Set default values for operand_layouts and result_layouts in StableHLO dialect.
2024-12-26 09:29:45 +00:00
e6286b6097
Update Buffer.from to be blocking by default and add options for async loading and memory placement, adjusting aio, hostbuffer, pjrtx, and tensor implementations.
2024-12-25 17:14:44 +00:00
da1fd2d9dc
Add examples demonstrating Buffer.from options, non‑blocking loading, and memory copy behavior.
2024-12-20 09:30:35 +00:00
bb2b77d7de
Correctly set model.norm.eps in Llama examples.
2024-12-18 11:48:23 +00:00
6aa9aa5a7b
Add preliminary implementation for custom call support.
2024-12-10 09:36:37 +00:00
1d5b79111a
modernbert: set default epsilon value for embeddings layernorm.
2024-12-09 16:43:29 +00:00
a63d0a4aa3
Update example MODULE.bazel and lockfile to use the toolchains_llvm_bootstrapped configuration.
2024-12-04 11:30:44 +00:00