331 Commits

Author SHA1 Message Date
201f5245c1 mlir: add MLIR index type 2024-07-11 11:14:22 +00:00
efcf955a4e workspace, third_party/rules_zig: adjust ZLS to require --version as the first parameter and add missing keys to the BuildConfig object for code completion 2024-07-10 15:20:12 +00:00
b67fd2367d Add benchmark, Llama, and ModernBERT example programs plus third‑party non‑module dependencies. 2024-07-03 11:30:49 +00:00
30f6be0e2f Update core Zig modules (async, mlir, pjrt, stdx) and third‑party Bazel definitions for the Zig 0.14.0 release. 2024-07-02 14:19:04 +00:00
16cc8c6658 Update examples MODULE.bazel.lock to reflect sandboxing rework and upgraded CUDA/ROCm version dependencies. 2024-06-27 12:32:07 +00:00
967eeb928f Update Bazel workspace and runtime configs: rework sandboxing, bump PJRT to 7.0.0, and upgrade CUDA (12.8), cuDNN (9.8), and ROCm (6.3.4). 2024-06-25 11:00:29 +00:00
8613a6da7d workspace: update rules_cc to 0.1.1 and Bazel to 8.1.1 to resolve yanked version issue; adjust MODULE.bazel and .bazelversion accordingly; verified successful build and execution 2024-06-19 12:06:53 +00:00
5bcaf374c8 workspace: bump libxev to 20250222.0-07bcffa and introduce Dynamic to select between IO Uring and Epoll on Linux based on availability 2024-06-18 14:26:44 +00:00
18eb0e5a7b Add async I/O, SentencePiece, NN, and tensor utilities for ModernBERT support and update Bazel build configuration. 2024-06-14 15:27:06 +00:00
17d02621e7 Add tests for the ModernBERT example, covering activation utilities, build setup, and example Zig code. 2024-06-11 17:33:22 +00:00
ab5ad874c3 workspace: fix libxev version 2024-06-07 14:25:44 +00:00
72263aa9e3 workspace: fix mistakes in module bumps
Fix missing = in stablehlo integrity.
Change `-` back to `.` in the xla module name and folders.
Correctly depend on `xla@20250204.1-6789523`
2024-06-06 09:56:17 +00:00
f7450a2104 stablehlo: bump to head and use new dialect capi
This drastically reduces the number of build steps (from 3589 to 2553 steps)
2024-05-31 13:02:46 +00:00
221ece647d zml/ops.zig: Added zml.ops.case operation
This can be used to select which branch will be run at runtime.

It wraps the `stablehlo.case` operation.
2024-05-30 14:11:08 +00:00
27aabf9beb Add Bazel build rules and a test for the benchmark, llama, mnist, and simple_layer examples. 2024-05-23 15:52:34 +00:00
3aac788544 Update Bazel build configurations (zig.bzl, BUILD files) for MLIR, PJRT, Neuron, ROCm, tokenizer, and tools, fixing broken dependencies. 2024-05-20 11:28:25 +00:00
05944b5cc9 Update FnCache to copy and reuse non‑tensor fields in fixed‑size structs, preventing undefined memory in core modules. 2024-05-15 17:54:52 +00:00
dfe55b0d34 Update Bazel lock file for examples to reflect FnCache non‑tensor handling changes. 2024-05-13 16:59:37 +00:00
8d795dd676 pjrt: profiler supports std writer API
Expose a lower-level function to customize where profile reports are written.
2024-05-09 11:09:29 +00:00
26558d6201 Update examples MODULE.bazel and lockfile to use XLA 20250204.0-6789523 and ensure Bazel 8 compatibility. 2024-05-08 14:03:45 +00:00
f5ab6ff2c6 Update XLA to version 20250204.0-6789523 and adjust Bazel module and runtime files for Bazel 8 compatibility. 2024-05-03 15:57:56 +00:00
a34190679b Fix llama token handling and remove redundant prompt token reuse in core Zig modules (aio, module, nn, pjrtx, tensor) 2024-05-02 17:10:11 +00:00
394e63e273 Fix llama example to correctly handle token output and avoid re‑feeding the last prompt token. 2024-04-24 16:44:25 +00:00
5a2171793d workspace: MODULE.bazel cleanup
Title says it all!
2024-04-22 09:27:44 +00:00
bafe13f546 Update examples/MODULE.bazel.lock to reflect libxev version bump. 2024-04-18 12:53:16 +00:00
65c28111a9 Update libxev to version 20252401.0‑31eed4e and apply patches. 2024-04-15 13:03:25 +00:00
13eff4e661 pjrt,zml: add memory bindings
This preliminary PR binds PJRT memory endpoints and adds them to
`zml.Buffer`.

A follow-up PR will properly integrate them into `zml.Buffer`.
2024-04-11 15:43:24 +00:00
190c6978d2 llama: simplify llama3 prompt template encoding by removing redundant newline re-encoding and ensuring a trailing newline. 2024-04-10 09:36:28 +00:00
d4db5ccc6b Integrate TinyLlama support, restore the homemade tokenizer, and align Zig API naming across stdx and zml tokenizer modules. 2024-04-05 15:07:29 +00:00
b67685b941 Add example Bazel build files and tokenizer test for tinyllama, including tigerbeetle integration and flags. 2024-04-01 17:40:18 +00:00
567210d1d7 bazel: depend on prebuilt protoc binaries to eliminate ~1300 build steps. Note: integration is currently blocked due to version constraints in rules_proto and toolchains_protoc. 2024-03-29 09:54:57 +00:00
e0c8eecb79 bazel: use OID as sha256 for Git LFS files to prevent unnecessary HuggingFace redownloads. 2024-03-28 17:52:52 +00:00
a811b2e1e3 llama: fix dimensions and data types
Removed unnecessary batching dimension introduced by recent changes. Converted index outputs from i32 to u32 for token indices. Ensures Llama runs on CUDA and ROCm. Tested on CUDA.
2024-03-20 13:37:19 +00:00
602757e7a9 Update examples to use the corrected logFn API. 2024-03-18 13:11:14 +00:00
754656f2f0 Replace real mutex with async Mutex for logFn, add fallback logger support outside coroutines, and fix ResetCondition handling. 2024-03-14 11:43:33 +00:00
980f1b17fb Ensure all runtime plugins have correct SONAME values, fixing issues with prebuilt PJRT plugins. 2024-03-11 10:15:22 +00:00
8a25b1eb74 Revert CUDA PJRT plugin version to 0.4.38 to address performance regression on XLA master. 2024-03-05 17:04:42 +00:00
76e314db9b Update Llama example docs and Bazel build files, and add tests for the new HuggingFace tokenizer integration. 2024-03-04 12:11:13 +00:00
959bc48c42 Add HuggingFace tokenizer bindings and SentencePiece integration; update BUILD files, async utilities, and FFI modules to support the new tokenizers. 2024-02-28 15:47:37 +00:00
5048e7dc89 Update example lock file for rules_distroless 0.4.2 upgrade and verify MNIST image build works. 2024-02-26 15:30:13 +00:00
b4b2490690 Upgrade rules_distroless to 0.4.2 in MODULE.bazel and refresh MODULE.bazel.lock accordingly. 2024-02-21 17:48:10 +00:00
c109b12e1b Various minor fixes: rewrite tinyllama tokenizer newline token, prevent HostBuffer.isContiguous false trigger on 1‑dim axes, improve HostBuffer.slice1d error messages, simplify module.zig output to show .mlir file path, correct setFlags handling of comptime int/float, make tokenizer.zig return <oob> for out‑of‑range detokenization, and speed up Buffer.constant creation up to 2.5 GB/s on CUDA. 2024-02-19 12:34:18 +00:00
3970df5b48 Update getting_started tutorial and example Bazel files for Bazel 8 migration. 2024-02-14 10:44:47 +00:00
169a24307c Migrate workspace and XLA module definitions to Bazel 8, updating MODULE.bazel files, BUILD rules, and related migration patches. 2024-02-12 12:43:23 +00:00
7e6103d876 Upgrade XLA to version 20250122.0-cc075be, switch to nvptx compiler and nvlink with nvjitlink support, add warning for CUDA path in LD_LIBRARY_PATH, and revert the previous CUDA sandbox fix. 2024-02-06 09:31:48 +00:00
b8a0aaee5a Update tokenizer to handle byte_fallback for Llama3 GPT2 vocab and add a Llama3‑specific normalizer; adjust tinyllama.zig and hostbuffer.zig to use the new tokenization logic. 2024-02-05 15:22:44 +00:00
b643f7bc53 Add Bazel build rule and test for Llama3 tokenizer’s byte fallback and unknown token handling. 2024-02-02 10:25:48 +00:00
5120fe00dc Update libxev epoll patch to resolve crashes and hangs in epoll and kqueue implementations. 2024-01-29 17:15:11 +00:00
edc2ac26f8 Adjust ROCm runtime sandboxing to hook only the PJRT plugin and make hipblaslt bytecodes optional. 2024-01-26 13:02:23 +00:00
0ce36599da Update example build config and Llama demo to support the new async epoll backend and zigcoro scheduler. 2024-01-22 12:17:01 +00:00