Commit Graph

25 Commits

Author SHA1 Message Date
cb6fcbbb1a Update docs and Zig examples to demonstrate the new client creation flags API. 2023-11-09 12:31:11 +00:00
237a877a29 zml: Add support for Llama 3.2 text-only models. Implement transpose over embed_tokens as a replacement for missing lm_head and make lm_head optional for compatibility. Add repositories and executions to Bazel and update README. 2023-11-01 10:16:48 +00:00
37de7b9613 Add Llama example showcasing the new func.call emission and function caching behavior. 2023-10-17 11:00:37 +00:00
35395c13f8 Update example programs (benchmark, llama, mnist, simple_layer) to use the new Exe API and reflect BaseExe allocation changes. 2023-10-10 11:12:34 +00:00
474f76cd75 Enable buffer donation in the Llama example, donating all buffers except the token_index buffer. 2023-10-03 16:32:40 +00:00
06865f5876 Update Llama example to use the new direct rope IR implementation. 2023-09-25 10:22:05 +00:00
4abdd32f0d Update llama example BUILD to use jax-cuda-pjrt plugin and bump CUDA (12.6.2) / CuDNN (9.5.1) versions. 2023-09-12 15:40:21 +00:00
937cdec324 examples/loader: add missing stdx dependency. 2023-08-30 13:03:59 +00:00
af0630616c Update docs (deploy_on_server, dockerize_models, getting_started) and example Bazel files to include AWS Neuron/Trainium/Inferentia deployment guidance. 2023-08-21 09:15:48 +00:00
022baf782b Update examples/MODULE.bazel to reference the bumped LLVM, XLA, StableHLO, and PJRT plugin versions. 2023-08-11 16:57:15 +00:00
726a2d0691 Update docs and examples to showcase the new async runtime with coroutines and cross‑thread signaling. 2023-08-03 11:35:24 +00:00
0fa258cd88 Update examples to reflect recent async module changes, renaming asyncGeneric to asyncc. 2023-07-24 09:34:35 +00:00
f7bac1af10 Update example programs (llama and loader) with hotfixes for issue. 2023-07-04 13:40:05 +00:00
7985716562 Add new Zig example programs (benchmark, llama, loader, mnist, simple_layer) and include a test for the llama example. 2023-06-27 14:23:22 +00:00
bf23eef0d9 examples: clean up inconsistencies in asynk usage across the codebase. 2023-06-01 16:11:58 +00:00
672df8fa2f Update tutorial and example code to use the new asyncc name and Generic slugs. 2023-05-08 16:58:45 +00:00
837f8fb111 Add support for the Llama 3.1 70B Instruct model to facilitate testing on high‑performance accelerators. 2023-04-19 10:23:44 +00:00
fdb7da5c9b Introduce sharding attributes to Llama weights to enable Tensor Parallelism. 2023-04-13 12:35:27 +00:00
aea23c720e Update Llama example to use renamed zml.aio.Metadata (formerly Value) and reflect torch loader changes. 2023-04-05 14:09:59 +00:00
aae37738a5 Update loader example to demonstrate handling of empty JSON arrays and improved torch loader readability 2023-03-22 14:52:33 +00:00
e30e35deeb Update benchmark example to use new user sharding hints and drop deprecated module options. 2023-03-20 15:31:44 +00:00
cad1a688da Add sharding usage to the benchmark and simple_layer example programs. 2023-02-23 11:18:27 +00:00
7e131a106b Update examples/MODULE.bazel.lock to reflect XLA version bump. 2023-02-03 14:13:21 +00:00
16e066ec69 Add llama example demonstrating the new gatherValues functionality. 2023-01-11 09:58:09 +00:00
eded305649 Add initial documentation and example projects for ZML, covering how‑to guides, tutorials, and benchmark examples. 2023-01-03 10:21:07 +00:00