a5e588f53b
async: detect xev only when the API is dynamic to fix macOS
2024-08-20 13:51:50 +00:00
8a8d3e0b15
workspace: fix incorrect checksum for ZLS macOS x86_64
2024-08-16 14:54:11 +00:00
4d809118d2
async: detect and select the most suitable xev backend, enabling use of io_uring when supported
2024-08-14 14:23:57 +00:00
12713ea8c7
Update Bazel MODULE files to fetch LLVM, StableHLO, and Triton dependencies, aligning versions for XLA.
2024-08-07 17:13:03 +00:00
7df89301dc
Bump XLA version and import llvm, stablehlo, triton, and zig‑protobuf modules in workspace BUILD files.
2024-08-06 10:28:43 +00:00
cbe6e730bd
stablehlo: fix forwarding of optional operand and result layout attributes in custom call
2024-07-31 17:53:18 +00:00
44933c9b89
Add macOS x86_64 support to example Bazel MODULE and platform mappings.
2024-07-29 10:31:51 +00:00
ca4e061ad5
Add Bazel build configurations for macOS x86_64 CPU runtime and ZLS third‑party integration.
2024-07-25 15:58:14 +00:00
3f36506f1c
zml: remove usingnamespace from floats.zig and related dependencies; note that incremental compilation does not improve overall build time due to linking overhead
2024-07-23 17:43:43 +00:00
42dee5d0e0
mlir: rework stablehlo custom call implementation and add a Triton example
2024-07-16 13:23:07 +00:00
aec1d96e6d
mlir: rework DenseElementsAttribute to correctly slice inputs and modify .as() to return a concrete value instead of an optional
2024-07-15 12:32:24 +00:00
201f5245c1
mlir: add MLIR index type
2024-07-11 11:14:22 +00:00
efcf955a4e
workspace, third_party/rules_zig: adjust ZLS to require --version as the first parameter and add missing keys to the BuildConfig object for code completion
2024-07-10 15:20:12 +00:00
b67fd2367d
Add benchmark, Llama, and ModernBERT example programs plus third‑party non‑module dependencies.
2024-07-03 11:30:49 +00:00
30f6be0e2f
Update core Zig modules (async, mlir, pjrt, stdx) and third‑party Bazel definitions for the Zig 0.14.0 release.
2024-07-02 14:19:04 +00:00
16cc8c6658
Update examples MODULE.bazel.lock to reflect sandboxing rework and upgraded CUDA/ROCm version dependencies.
2024-06-27 12:32:07 +00:00
967eeb928f
Update Bazel workspace and runtime configs: rework sandboxing, bump PJRT to 7.0.0, and upgrade CUDA (12.8), cuDNN (9.8), and ROCm (6.3.4).
2024-06-25 11:00:29 +00:00
8613a6da7d
workspace: update rules_cc to 0.1.1 and Bazel to 8.1.1 to resolve yanked version issue; adjust MODULE.bazel and .bazelversion accordingly; verified successful build and execution
2024-06-19 12:06:53 +00:00
5bcaf374c8
workspace: bump libxev to 20250222.0-07bcffa and introduce Dynamic to select between IO Uring and Epoll on Linux based on availability
2024-06-18 14:26:44 +00:00
18eb0e5a7b
Add async I/O, SentencePiece, NN, and tensor utilities for ModernBERT support and update Bazel build configuration.
2024-06-14 15:27:06 +00:00
17d02621e7
Add tests for the ModernBERT example, covering activation utilities, build setup, and example Zig code.
2024-06-11 17:33:22 +00:00
ab5ad874c3
workspace: fix libxev version
2024-06-07 14:25:44 +00:00
72263aa9e3
workspace: fix mistakes in modules bumps
...
Fix missing = in stbalehlo integrity.
Changes back - by . in xla module name and folders.
Correctly depend on `xla@20250204.1-6789523`
2024-06-06 09:56:17 +00:00
f7450a2104
stablehlo: bump to head and use new dialect capi
...
This drastically reduce the number of build steps (from 3589 to 2553 steps)
2024-05-31 13:02:46 +00:00
221ece647d
zml/ops.zig: Added zml.ops.case operation
...
This can be used to select which branch will be run at runtime.
It wraps the `stablehlo.case` operation.
2024-05-30 14:11:08 +00:00
27aabf9beb
Add Bazel build rules and a test for the benchmark, llama, mnist, and simple_layer examples.
2024-05-23 15:52:34 +00:00
3aac788544
Update Bazel build configurations (zig.bzl, BUILD files) for MLIR, PJRT, Neuron, ROCm, tokenizer, and tools, fixing broken dependencies.
2024-05-20 11:28:25 +00:00
05944b5cc9
Update FnCache to copy and reuse non‑tensor fields in fixed‑size structs, preventing undefined memory in core modules.
2024-05-15 17:54:52 +00:00
dfe55b0d34
Update Bazel lock file for examples to reflect FnCache non‑tensor handling changes.
2024-05-13 16:59:37 +00:00
8d795dd676
pjrt: profiler support std writer API
...
expose a more low-level function to customize where to write profile
reports
2024-05-09 11:09:29 +00:00
26558d6201
Update examples MODULE.bazel and lockfile to use XLA 20250204.0-6789523 and ensure Bazel 8 compatibility.
2024-05-08 14:03:45 +00:00
f5ab6ff2c6
Update XLA to version 20250204.0-6789523 and adjust Bazel module and runtime files for Bazel 8 compatibility.
2024-05-03 15:57:56 +00:00
a34190679b
Fix llama token handling and remove redundant prompt token reuse in core Zig modules (aio, module, nn, pjrtx, tensor)
2024-05-02 17:10:11 +00:00
394e63e273
Fix llama example to correctly handle token output and avoid re‑feeding the last prompt token.
2024-04-24 16:44:25 +00:00
5a2171793d
workspace: MODULE.bazel cleanup
...
Title says it all !
2024-04-22 09:27:44 +00:00
bafe13f546
Update examples/MODULE.bazel.lock to reflect libxev version bump.
2024-04-18 12:53:16 +00:00
65c28111a9
Update libxev to version 20252401.0‑31eed4e and apply patches and.
2024-04-15 13:03:25 +00:00
13eff4e661
pjrt,zml: add memory bindings
...
This preliminary PR binds PJRT memory endpoints and adds them to
`zml.Buffer`.
A follow up PR will properly integrate it inside `zml.Buffer`
2024-04-11 15:43:24 +00:00
190c6978d2
llama: simplify llama3 prompt template encoding by removing redundant newline re-encoding and ensuring a trailing newline.
2024-04-10 09:36:28 +00:00
d4db5ccc6b
Integrate TinyLlama support, restore the homemade tokenizer, and align Zig API naming across stdx and zml tokenizer modules.
2024-04-05 15:07:29 +00:00
b67685b941
Add example Bazel build files and tokenizer test for tinyllama, including tigerbeetle integration and flags.
2024-04-01 17:40:18 +00:00
567210d1d7
bazel: depend on prebuilt protoc binaries to eliminate ~1300 build steps. Note: integration is currently blocked due to version constraints in rules_proto and toolchains_protoc.
2024-03-29 09:54:57 +00:00
e0c8eecb79
bazel: use OID as sha256 for Git LFS files to prevent unnecessary HuggingFace redownloads.
2024-03-28 17:52:52 +00:00
a811b2e1e3
llama: fix dimensions and data types
...
Removed unnecessary batching dimension introduced by recent changes. Converted index outputs from i32 to u32 for token indices. Ensures Llama runs on CUDA and RoCM. Tested on CUDA.
2024-03-20 13:37:19 +00:00
602757e7a9
Update examples to use the corrected logFn API.
2024-03-18 13:11:14 +00:00
754656f2f0
Replace real mutex with async Mutex for logFn, add fallback logger support outside coroutines, and fix ResetCondition handling.
2024-03-14 11:43:33 +00:00
980f1b17fb
Ensure all runtime plugins have correct SONAME values, fixing issues with prebuilt PJRT plugins.
2024-03-11 10:15:22 +00:00
8a25b1eb74
Revert CUDA PJRT plugin version to 0.4.38 to address performance regression on XLA master.
2024-03-05 17:04:42 +00:00
76e314db9b
Update Llama example docs and Bazel build files, and add tests for the new HuggingFace tokenizer integration.
2024-03-04 12:11:13 +00:00
959bc48c42
Add HuggingFace tokenizer bindings and SentencePiece integration; update BUILD files, async utilities, and FFI modules to support the new tokenizers.
2024-02-28 15:47:37 +00:00