Commit Graph

68 Commits

Author SHA1 Message Date
7fb02e1888 Update docs: add deployment guide, Hugging Face token instructions, getting‑started tutorial, and include a Bazel lock example. 2025-06-05 13:18:14 +00:00
111afcdd95 examples/llama: switch to --hf-model-path flag
Instead of giving config, model weights and tokenizer paths, rely on
`huggingface-cli` download.
2025-05-28 13:21:00 +00:00
5a49a3e8ca Update examples MODULE.bazel.lock to reflect the added sandbox neuron dependencies. 2025-05-22 10:10:14 +00:00
9e66d5a06a Update examples/MODULE.bazel.lock to reflect new dependencies. 2025-05-16 11:26:13 +00:00
02ba41ea57 Update examples/MODULE.bazel to bump rules_zig version and fix static‑lib inclusion for non‑exe targets. 2025-04-21 13:51:52 +00:00
8073e45894 Update examples/MODULE.bazel.lock to reflect bumped hftokenizers dependency. 2025-04-09 10:21:44 +00:00
dc121fce4f Update example MODULE.bazel and lockfile to reflect toolchains_llvm_bootstrapped bump to 0.2.4. 2025-03-20 12:17:30 +00:00
dd52e988b4 Update example Bazel build files (MODULE.bazel, llama, modernbert) to test the revamped commit workflow. 2025-01-31 16:28:38 +00:00
7324a49da3 Remove .print() calls from globalAttnMask() and localAttnMask() in ModernBERT example to resolve compilation sharding error. 2025-01-15 16:59:26 +00:00
da1fd2d9dc Add examples demonstrating Buffer.from options, non‑blocking loading, and memory copy behavior. 2024-12-20 09:30:35 +00:00
bb2b77d7de Correctly set model.norm.eps in Llama examples. 2024-12-18 11:48:23 +00:00
1d5b79111a modernbert: set default epsilon value for embeddings layernorm. 2024-12-09 16:43:29 +00:00
a63d0a4aa3 Update example MODULE.bazel and lockfile to use the toolchains_llvm_bootstrapped configuration. 2024-12-04 11:30:44 +00:00
364a222dc1 Update example MODULE.bazel and lockfile to target XLA version 20250527.0‑cb67f2f. 2024-11-25 17:57:45 +00:00
09da9c2982 Make zls.sh example explicitly set the ZLS runner target. 2024-11-06 16:22:44 +00:00
1540c6e85e Update loader example to demonstrate new HostBuffer helpers and type‑inferred buffer loading. 2024-10-25 10:20:04 +00:00
fe55c600d4 Add/refresh how‑to docs and example loader for deployment, Docker, HuggingFace token, and getting‑started tutorials. 2024-10-14 11:27:41 +00:00
9f61a8aacb Update example Zig code for llama3 rope scaling and modernbert usage. 2024-10-04 17:49:07 +00:00
d0cf5d3042 Update example MODULE.bazel and its lock file to incorporate workspace fixes and version bumps 2024-09-30 14:00:16 +00:00
12713ea8c7 Update Bazel MODULE files to fetch LLVM, StableHLO, and Triton dependencies, aligning versions for XLA. 2024-08-07 17:13:03 +00:00
44933c9b89 Add macOS x86_64 support to example Bazel MODULE and platform mappings. 2024-07-29 10:31:51 +00:00
b67fd2367d Add benchmark, Llama, and ModernBERT example programs plus third‑party non‑module dependencies. 2024-07-03 11:30:49 +00:00
16cc8c6658 Update examples MODULE.bazel.lock to reflect sandboxing rework and upgraded CUDA/ROCm version dependencies. 2024-06-27 12:32:07 +00:00
17d02621e7 Add tests for the ModernBERT example, covering activation utilities, build setup, and example Zig code. 2024-06-11 17:33:22 +00:00
27aabf9beb Add Bazel build rules and a test for the benchmark, llama, mnist, and simple_layer examples. 2024-05-23 15:52:34 +00:00
dfe55b0d34 Update Bazel lock file for examples to reflect FnCache non‑tensor handling changes. 2024-05-13 16:59:37 +00:00
26558d6201 Update examples MODULE.bazel and lockfile to use XLA 20250204.0-6789523 and ensure Bazel 8 compatibility. 2024-05-08 14:03:45 +00:00
394e63e273 Fix llama example to correctly handle token output and avoid re‑feeding the last prompt token. 2024-04-24 16:44:25 +00:00
bafe13f546 Update examples/MODULE.bazel.lock to reflect libxev version bump. 2024-04-18 12:53:16 +00:00
190c6978d2 llama: simplify llama3 prompt template encoding by removing redundant newline re-encoding and ensuring a trailing newline. 2024-04-10 09:36:28 +00:00
b67685b941 Add example Bazel build files and tokenizer test for tinyllama, including tigerbeetle integration and flags. 2024-04-01 17:40:18 +00:00
a811b2e1e3 llama: fix dimensions and data types
Removed unnecessary batching dimension introduced by recent changes. Converted index outputs from i32 to u32 for token indices. Ensures Llama runs on CUDA and RoCM. Tested on CUDA.
2024-03-20 13:37:19 +00:00
602757e7a9 Update examples to use the corrected logFn API. 2024-03-18 13:11:14 +00:00
76e314db9b Update Llama example docs and Bazel build files, and add tests for the new HuggingFace tokenizer integration. 2024-03-04 12:11:13 +00:00
5048e7dc89 Update example lock file for rules_distroless 0.4.2 upgrade and verify MNIST image build works. 2024-02-26 15:30:13 +00:00
3970df5b48 Update getting_started tutorial and example Bazel files for Bazel 8 migration. 2024-02-14 10:44:47 +00:00
b643f7bc53 Add Bazel build rule and test for Llama3 tokenizer’s byte fallback and unknown token handling. 2024-02-02 10:25:48 +00:00
0ce36599da Update example build config and Llama demo to support the new async epoll backend and zigcoro scheduler. 2024-01-22 12:17:01 +00:00
223857251d Update MNIST example to use new operator source locations and reflect recent API changes (sdpa bias removal, gatherSlices shape literals, Shape.outer, Tensor.all, and argMax dtype inference) 2023-12-26 10:45:52 +00:00
8a031bd4c8 Update Llama example to use the simplified transpose implementation and increase default profiler size to 1,000,000 events. 2023-12-15 12:06:42 +00:00
22a846de72 Update llama example to use per‑target output folders and call profiler.dumpDataAsJson for testing the new compilation layout. 2023-12-01 16:05:59 +00:00
46fbbf43a2 Update tutorial documentation in write_first_model.md with quick fixes. 2023-11-30 12:14:33 +00:00
737f7cbdee Add example build runner scripts and config for Zig code completion. 2023-11-21 14:55:34 +00:00
cb6fcbbb1a Update docs and Zig examples to demonstrate the new client creation flags API. 2023-11-09 12:31:11 +00:00
237a877a29 zml: Add support for Llama 3.2 text-only models. Implement transpose over embed_tokens as a replacement for missing lm_head and make lm_head optional for compatibility. Add repositories and executions to Bazel and update README. 2023-11-01 10:16:48 +00:00
37de7b9613 Add Llama example showcasing the new func.call emission and function caching behavior. 2023-10-17 11:00:37 +00:00
35395c13f8 Update example programs (benchmark, llama, mnist, simple_layer) to use the new Exe API and reflect BaseExe allocation changes. 2023-10-10 11:12:34 +00:00
474f76cd75 Enable buffer donation in the Llama example, donating all buffers except the token_index buffer. 2023-10-03 16:32:40 +00:00
06865f5876 Update Llama example to use the new direct rope IR implementation. 2023-09-25 10:22:05 +00:00
4abdd32f0d Update llama example BUILD to use jax-cuda-pjrt plugin and bump CUDA (12.6.2) / CuDNN (9.5.1) versions. 2023-09-12 15:40:21 +00:00