|
|
4abdd32f0d
|
Update llama example BUILD to use jax-cuda-pjrt plugin and bump CUDA (12.6.2) / CuDNN (9.5.1) versions.
|
2023-09-12 15:40:21 +00:00 |
|
|
|
af0630616c
|
Update docs (deploy_on_server, dockerize_models, getting_started) and example Bazel files to include AWS Neuron/Trainium/Inferentia deployment guidance.
|
2023-08-21 09:15:48 +00:00 |
|
|
|
726a2d0691
|
Update docs and examples to showcase the new async runtime with coroutines and cross‑thread signaling.
|
2023-08-03 11:35:24 +00:00 |
|
|
|
f7bac1af10
|
Update example programs (llama and loader) with hotfixes for issue.
|
2023-07-04 13:40:05 +00:00 |
|
|
|
7985716562
|
Add new Zig example programs (benchmark, llama, loader, mnist, simple_layer) and include a test for the llama example.
|
2023-06-27 14:23:22 +00:00 |
|
|
|
672df8fa2f
|
Update tutorial and example code to use the new asyncc name and Generic slugs.
|
2023-05-08 16:58:45 +00:00 |
|
|
|
837f8fb111
|
Add support for the Llama 3.1 70B Instruct model to facilitate testing on high‑performance accelerators.
|
2023-04-19 10:23:44 +00:00 |
|
|
|
fdb7da5c9b
|
Introduce sharding attributes to Llama weights to enable Tensor Parallelism.
|
2023-04-13 12:35:27 +00:00 |
|
|
|
aea23c720e
|
Update Llama example to use renamed zml.aio.Metadata (formerly Value) and reflect torch loader changes.
|
2023-04-05 14:09:59 +00:00 |
|
|
|
16e066ec69
|
Add llama example demonstrating the new gatherValues functionality.
|
2023-01-11 09:58:09 +00:00 |
|
|
|
eded305649
|
Add initial documentation and example projects for ZML, covering how‑to guides, tutorials, and benchmark examples.
|
2023-01-03 10:21:07 +00:00 |
|