300 Commits

Author SHA1 Message Date
6fc1148206 async/coro: make coroutines unwindable by zeroing the initial stack region, preventing random unwinding behavior and SIGSEGV during _Unwind_Backtrace. 2025-03-10 16:25:45 +00:00
f63c673f45 bazel: add RPATH manipulation to patchelf 2025-03-05 11:56:40 +00:00
9488672d4b workspace: bump xla to version 20250710.0-22ea002
Also:
- Bump XLA deps: `com_github_grpc_grpc` and `com_google_protobuf`
- Inject `rules_ml_toolchain`
- Fix `zig_proto_library` rule
2025-03-04 17:12:34 +00:00
fa0ed045ef runtimes/cuda: downgrade cuda and cudnn
This commit reverts part of https://github.com/zml/zml/pull/238/files
This is required because XLA has a strong dependency on CUDA 12.8 and
upgrading to 12.9 is impossible due to
https://github.com/NVIDIA/cccl/issues/4967
2025-02-28 17:36:12 +00:00
ff1433d998 pjrt: bind PJRT_Client_CreateUninitializedBuffer. 2025-02-25 10:37:45 +00:00
8456a0d073 zml/pjrt: add binding for PJRT_Device_MemoryStats. 2025-02-19 12:14:05 +00:00
a580f2a398 Async: use stronger memory ordering to prevent potential segfaults due to ordering issues. 2025-02-18 11:38:56 +00:00
4d6d975dc0 Patch aio.zig: update loadBuffersWithPrefix argument type to match the conditional type of loadBuffers init_args. 2025-02-13 09:48:13 +00:00
af8844c1f1 Add model prefix support when loading a model from safetensors, enabling use of a specific model prefix (e.g., ModernBertModel) instead of the full model. Tested with the text embeddings server project. 2025-02-12 13:18:27 +00:00
1cafcc3c60 Workspace: bump XLA to newer version. 2025-02-05 17:35:27 +00:00
9ef838be25 Update neuron runtime BUILD.bazel to use Bazel manual tag and S3 cache integration. 2025-02-03 14:03:33 +00:00
dd52e988b4 Update example Bazel build files (MODULE.bazel, llama, modernbert) to test the revamped commit workflow. 2025-01-31 16:28:38 +00:00
0a2ab7c8cb Remove usingnamespace from MLIR. 2025-01-28 09:35:58 +00:00
f8ab0d7b2a Remove dead imports. 2025-01-22 10:45:04 +00:00
51a6cab753 Wire has_side_effect field in zml/ops. 2025-01-20 16:45:13 +00:00
99a2001e63 Rename PJRT BufferType to follow Zig and ZML naming conventions. 2025-01-16 13:00:47 +00:00
7324a49da3 Remove .print() calls from globalAttnMask() and localAttnMask() in ModernBERT example to resolve compilation sharding error. 2025-01-15 16:59:26 +00:00
09c43b8759 Add customCall operation to zml/ops. 2025-01-09 15:01:33 +00:00
9f1cc762cd Fix map tests in zml/meta. 2025-01-06 17:49:50 +00:00
fbf1ecb8b7 Introduce Executable.getCompiledMemoryStats in PJRT. 2025-01-02 16:36:13 +00:00
4b1a3ff48a Add union support to mapping helpers in zml/meta.zig. 2025-01-01 13:35:17 +00:00
c961d705f1 Set default values for operand_layouts and result_layouts in StableHLO dialect. 2024-12-26 09:29:45 +00:00
e6286b6097 Update Buffer.from to be blocking by default and add options for async loading and memory placement, adjusting aio, hostbuffer, pjrtx, and tensor implementations. 2024-12-25 17:14:44 +00:00
da1fd2d9dc Add examples demonstrating Buffer.from options, non‑blocking loading, and memory copy behavior. 2024-12-20 09:30:35 +00:00
bb2b77d7de Correctly set model.norm.eps in Llama examples. 2024-12-18 11:48:23 +00:00
6aa9aa5a7b Add preliminary implementation for custom call support. 2024-12-10 09:36:37 +00:00
1d5b79111a modernbert: set default epsilon value for embeddings layernorm. 2024-12-09 16:43:29 +00:00
a63d0a4aa3 Update example MODULE.bazel and lockfile to use the toolchains_llvm_bootstrapped configuration. 2024-12-04 11:30:44 +00:00
5464281c91 Update workspace configuration to use the toolchains_llvm_bootstrapped toolchain for Zig builds. 2024-12-03 13:50:58 +00:00
f5ab2c3a55 zml: eliminate compile-time fields from Bufferized, removing the need to pass undefined to exe.call for inlined arguments. Introduce BufferizedWithArgs in zml.testing for compileAndCall utility. 2024-11-28 12:24:39 +00:00
364a222dc1 Update example MODULE.bazel and lockfile to target XLA version 20250527.0‑cb67f2f. 2024-11-25 17:57:45 +00:00
95453c7242 Update XLA dependency to version 20250527.0‑cb67f2f and refresh related Bazel BUILD, MODULE, overlay and patch files. 2024-11-22 16:50:20 +00:00
fa13287931 workspace: upgrade to Zig 0.14.1 and handle empty tuple syntax &.{} being detected as *const @TypeOf(.{}). 2024-11-19 11:45:36 +00:00
d8a83830e8 runtimes: switch to Cloudflare Debian snapshots for more reliable dependency pinning. 2024-11-15 09:40:58 +00:00
ea3ce685a9 runtimes/neuron: bump runtime version and expose nrt.h header to Zig. 2024-11-14 13:37:47 +00:00
09da9c2982 Make zls.sh example explicitly set the ZLS runner target. 2024-11-06 16:22:44 +00:00
948c577205 Make ZLS runner target explicit in workspace BUILD files and update the zls.sh script accordingly. 2024-11-04 13:57:59 +00:00
47a4eda5f6 runtimes/cuda: expose cuda.h in the C namespace for CUDA runtimes, enabling custom calls to CUDA functions. 2024-11-01 13:27:24 +00:00
3849eb10b7 Add buffer and hostbuffer utilities with precise f32→bf16 conversion, type inference for loadBuffers, store expected input shapes, enhance meta.visit and JSON TaggedUnion support, and improve logging. 2024-10-28 11:21:46 +00:00
1540c6e85e Update loader example to demonstrate new HostBuffer helpers and type‑inferred buffer loading. 2024-10-25 10:20:04 +00:00
048d7eb38e third_party/sentencepiece: add missing protobuf_lite dependency and bump version. 2024-10-22 16:41:52 +00:00
4ef81b89ea stdx.fmt: add slice formatting support, improving on previous prettyPrinter implementation by leveraging internal fmt mechanisms. 2024-10-18 15:05:08 +00:00
fe55c600d4 Add/refresh how‑to docs and example loader for deployment, Docker, HuggingFace token, and getting‑started tutorials. 2024-10-14 11:27:41 +00:00
59f99c4501 Enable Bazel debug/release Zig builds and add stdx.time.Duration.hz and stdx.math.roundeven utilities. 2024-10-08 09:39:00 +00:00
aacbf2ee04 Fix Llama3 rope scaling implementation in the neural network module (zml/nn.zig). 2024-10-07 12:53:03 +00:00
9f61a8aacb Update example Zig code for llama3 rope scaling and modernbert usage. 2024-10-04 17:49:07 +00:00
d0cf5d3042 Update example MODULE.bazel and its lock file to incorporate workspace fixes and version bumps. 2024-09-30 14:00:16 +00:00
4a0b1cce50 Update Bazel workspace and XLA overlay (MODULE.bazel, BUILD files, patches) to prevent dual LLVM builds and apply migration/bump patches. 2024-09-27 14:00:44 +00:00
8b53ebbd9b pjrt: bind PJRT_Device_AddressableMemories. 2024-09-26 12:02:04 +00:00
2863c1f5e0 zml/tensor: fix returned value in Tensor.toMemory – ensure _output_memory_kind is set correctly in the result. 2024-09-18 13:18:08 +00:00