|
|
47146fc088
|
workspace: remove glibc version constraint, update rules_rust, and re-enable frame pointers following sandboxing changes.
|
2025-04-29 12:09:13 +00:00 |
|
|
|
e7323be10b
|
runtimes/rocm: switch to in-process LLD, removing the need for sandboxed lld.
|
2025-04-23 11:43:18 +00:00 |
|
|
|
02ba41ea57
|
Update examples/MODULE.bazel to bump rules_zig version and fix static‑lib inclusion for non‑exe targets.
|
2025-04-21 13:51:52 +00:00 |
|
|
|
f5f3dc6b1b
|
Update rules_zig to 20250714.0-b14a4f1 and fix static‑lib inclusion in runfiles for non‑exe targets.
|
2025-04-18 12:09:38 +00:00 |
|
|
|
7d9fdf94e7
|
runtimes/rocm: sandbox ROCm dependencies and ensure they load on the main thread due to TLS usage in static C++ destructors.
|
2025-04-14 16:38:15 +00:00 |
|
|
|
eba0e72532
|
runtimes/tpu: sandbox TPU PJRT plugin; no external dependencies.
|
2025-04-10 14:47:16 +00:00 |
|
|
|
8073e45894
|
Update examples/MODULE.bazel.lock to reflect bumped hftokenizers dependency.
|
2025-04-09 10:21:44 +00:00 |
|
|
|
4294a4d08f
|
Bump hftokenizers dependency versions in Bazel and Cargo lockfiles (MODULE.bazel.lock, Cargo.toml, Cargo.lock)
|
2025-04-04 12:54:33 +00:00 |
|
|
|
78d7b672e7
|
runtimes/cpu: sandbox CPU PJRT plugin, simplifying as there are no additional NEEDED dependencies.
|
2025-04-03 11:57:46 +00:00 |
|
|
|
2d321d232d
|
runtimes/cuda: sandbox CUDA dependencies by removing them from the leaf binary, sandboxing the dependency graph, marking dlopen direct dependencies as NEEDED, setting RPATH to the sandbox, loading the PJRT plugin from the sandbox, and enabling weak CUDA symbols without direct linking.
|
2025-03-26 11:18:29 +00:00 |
|
|
|
a5420068b1
|
pjrt: emit warning instead of panic when FFI Extension is missing (e.g., on TPU).
|
2025-03-24 09:40:44 +00:00 |
|
|
|
dc121fce4f
|
Update example MODULE.bazel and lockfile to reflect toolchains_llvm_bootstrapped bump to 0.2.4.
|
2025-03-20 12:17:30 +00:00 |
|
|
|
907577525f
|
Update MODULE.bazel and lockfile to bump toolchains_llvm_bootstrapped to version 0.2.4.
|
2025-03-18 11:47:22 +00:00 |
|
|
|
f27a524f31
|
Update rules_zig: add zig_srcs target, fix source handling bug, clean up BUILD files, adjust async/coro.zig tests, and disable nemo and yaml model loaders.
|
2025-03-13 12:27:21 +00:00 |
|
|
|
6fc1148206
|
async/coro: make coroutines unwindable by zeroing the initial stack region, preventing random unwinding behavior and SIGSEGV during _Unwind_Backtrace.
|
2025-03-10 16:25:45 +00:00 |
|
|
|
f63c673f45
|
bazel: add RPATH manipulation to patchelf
|
2025-03-05 11:56:40 +00:00 |
|
|
|
9488672d4b
|
workspace: bump xla to version 20250710.0-22ea002
Also:
- Bump XLA deps : `com_github_grpc_grpc` and `com_google_protobuf`
- Inject `rules_ml_toolchain`
- Fix `zig_proto_library` rule
|
2025-03-04 17:12:34 +00:00 |
|
|
|
fa0ed045ef
|
runtimes/cuda: downgrade cuda and cudnn
This commit reverts part of https://github.com/zml/zml/pull/238/files
This is required because XLA has a strong dependencies on CUDA 12.8 and
upgrading to 12.9 is impossible due to
https://github.com/NVIDIA/cccl/issues/4967
|
2025-02-28 17:36:12 +00:00 |
|
|
|
ff1433d998
|
pjrt: bind PJRT_Client_CreateUninitializedBuffer.
|
2025-02-25 10:37:45 +00:00 |
|
|
|
8456a0d073
|
zml/pjrt: add binding for PJRT_Device_MemoryStats.
|
2025-02-19 12:14:05 +00:00 |
|
|
|
a580f2a398
|
Async: use stronger memory ordering to prevent potential segfaults due to ordering issues.
|
2025-02-18 11:38:56 +00:00 |
|
|
|
4d6d975dc0
|
Patch aio.zig: update loadBuffersWithPrefix argument type to match the conditional type of loadBuffers init_args.
|
2025-02-13 09:48:13 +00:00 |
|
|
|
af8844c1f1
|
Add model prefix support when loading a model from safetensors, enabling use of a specific model prefix (e.g., ModernBertModel) instead of the full model. Tested with the text embeddings server project.
|
2025-02-12 13:18:27 +00:00 |
|
|
|
1cafcc3c60
|
Workspace: bump XLA to newer version.
|
2025-02-05 17:35:27 +00:00 |
|
|
|
9ef838be25
|
Update neuron runtime BUILD.bazel to use Bazel manual tag and S3 cache integration.
|
2025-02-03 14:03:33 +00:00 |
|
|
|
dd52e988b4
|
Update example Bazel build files (MODULE.bazel, llama, modernbert) to test the revamped commit workflow.
|
2025-01-31 16:28:38 +00:00 |
|
|
|
0a2ab7c8cb
|
Remove usingnamespace from MLIR.
|
2025-01-28 09:35:58 +00:00 |
|
|
|
f8ab0d7b2a
|
Remove dead imports.
|
2025-01-22 10:45:04 +00:00 |
|
|
|
51a6cab753
|
Wire has_side_effect field in zml/ops.
|
2025-01-20 16:45:13 +00:00 |
|
|
|
99a2001e63
|
Rename PJRT BufferType to follow Zig and ZML naming conventions.
|
2025-01-16 13:00:47 +00:00 |
|
|
|
7324a49da3
|
Remove .print() calls from globalAttnMask() and localAttnMask() in ModernBERT example to resolve compilation sharding error.
|
2025-01-15 16:59:26 +00:00 |
|
|
|
09c43b8759
|
Add customCall operation to zml/ops.
|
2025-01-09 15:01:33 +00:00 |
|
|
|
9f1cc762cd
|
Fix map tests in zml/meta.
|
2025-01-06 17:49:50 +00:00 |
|
|
|
fbf1ecb8b7
|
Introduce Executable.getCompiledMemoryStats in PJRT.
|
2025-01-02 16:36:13 +00:00 |
|
|
|
4b1a3ff48a
|
Add union support to mapping helpers in zml/meta.zig.
|
2025-01-01 13:35:17 +00:00 |
|
|
|
c961d705f1
|
Set default values for operand_layouts and result_layouts in StableHLO dialect.
|
2024-12-26 09:29:45 +00:00 |
|
|
|
e6286b6097
|
Update Buffer.from to be blocking by default and add options for async loading and memory placement, adjusting aio, hostbuffer, pjrtx, and tensor implementations.
|
2024-12-25 17:14:44 +00:00 |
|
|
|
da1fd2d9dc
|
Add examples demonstrating Buffer.from options, non‑blocking loading, and memory copy behavior.
|
2024-12-20 09:30:35 +00:00 |
|
|
|
bb2b77d7de
|
Correctly set model.norm.eps in Llama examples.
|
2024-12-18 11:48:23 +00:00 |
|
|
|
6aa9aa5a7b
|
Add preliminary implementation for custom call support.
|
2024-12-10 09:36:37 +00:00 |
|
|
|
1d5b79111a
|
modernbert: set default epsilon value for embeddings layernorm.
|
2024-12-09 16:43:29 +00:00 |
|
|
|
a63d0a4aa3
|
Update example MODULE.bazel and lockfile to use the toolchains_llvm_bootstrapped configuration.
|
2024-12-04 11:30:44 +00:00 |
|
|
|
5464281c91
|
Update workspace configuration to use the toolchains_llvm_bootstrapped toolchain for Zig builds.
|
2024-12-03 13:50:58 +00:00 |
|
|
|
f5ab2c3a55
|
zml: eliminate compile-time fields from Bufferized, removing the need to pass undefined to exe.call for inlined arguments. Introduce BufferizedWithArgs in zml.testing for compileAndCall utility.
|
2024-11-28 12:24:39 +00:00 |
|
|
|
364a222dc1
|
Update example MODULE.bazel and lockfile to target XLA version 20250527.0‑cb67f2f.
|
2024-11-25 17:57:45 +00:00 |
|
|
|
95453c7242
|
Update XLA dependency to version 20250527.0‑cb67f2f and refresh related Bazel BUILD, MODULE, overlay and patch files.
|
2024-11-22 16:50:20 +00:00 |
|
|
|
fa13287931
|
workspace: upgrade to Zig 0.14.1 and handle empty tuple syntax &.{} being detected as *const @TypeOf(.{}).
|
2024-11-19 11:45:36 +00:00 |
|
|
|
d8a83830e8
|
runtimes: switch to Cloudflare Debian snapshots for more reliable dependency pinning.
|
2024-11-15 09:40:58 +00:00 |
|
|
|
ea3ce685a9
|
runtimes/neuron: bump runtime version and expose nrt.h header to Zig.
|
2024-11-14 13:37:47 +00:00 |
|
|
|
09da9c2982
|
Make zls.sh example explicitly set the ZLS runner target.
|
2024-11-06 16:22:44 +00:00 |
|