Commit Graph

14 Commits

Author SHA1 Message Date
35395c13f8 Update example programs (benchmark, llama, mnist, simple_layer) to use the new Exe API and reflect BaseExe allocation changes. 2023-10-10 11:12:34 +00:00
474f76cd75 Enable buffer donation in the Llama example, donating all buffers except the token_index buffer. 2023-10-03 16:32:40 +00:00
06865f5876 Update Llama example to use the new direct rope IR implementation. 2023-09-25 10:22:05 +00:00
4abdd32f0d Update llama example BUILD to use jax-cuda-pjrt plugin and bump CUDA (12.6.2) / CuDNN (9.5.1) versions. 2023-09-12 15:40:21 +00:00
af0630616c Update docs (deploy_on_server, dockerize_models, getting_started) and example Bazel files to include AWS Neuron/Trainium/Inferentia deployment guidance. 2023-08-21 09:15:48 +00:00
726a2d0691 Update docs and examples to showcase the new async runtime with coroutines and cross‑thread signaling. 2023-08-03 11:35:24 +00:00
f7bac1af10 Update example programs (llama and loader) with hotfixes for issue. 2023-07-04 13:40:05 +00:00
7985716562 Add new Zig example programs (benchmark, llama, loader, mnist, simple_layer) and include a test for the llama example. 2023-06-27 14:23:22 +00:00
672df8fa2f Update tutorial and example code to use the new asyncc name and Generic slugs. 2023-05-08 16:58:45 +00:00
837f8fb111 Add support for the Llama 3.1 70B Instruct model to facilitate testing on high‑performance accelerators. 2023-04-19 10:23:44 +00:00
fdb7da5c9b Introduce sharding attributes to Llama weights to enable Tensor Parallelism. 2023-04-13 12:35:27 +00:00
aea23c720e Update Llama example to use renamed zml.aio.Metadata (formerly Value) and reflect torch loader changes. 2023-04-05 14:09:59 +00:00
16e066ec69 Add llama example demonstrating the new gatherValues functionality. 2023-01-11 09:58:09 +00:00
eded305649 Add initial documentation and example projects for ZML, covering how‑to guides, tutorials, and benchmark examples. 2023-01-03 10:21:07 +00:00