b67685b941
Add example Bazel build files and tokenizer test for tinyllama, including tigerbeetle integration and flags.
2024-04-01 17:40:18 +00:00
a811b2e1e3
llama: fix dimensions and data types
...
Removed unnecessary batching dimension introduced by recent changes. Converted index outputs from i32 to u32 for token indices. Ensures Llama runs on CUDA and RoCM. Tested on CUDA.
2024-03-20 13:37:19 +00:00
76e314db9b
Update Llama example docs and Bazel build files, and add tests for the new HuggingFace tokenizer integration.
2024-03-04 12:11:13 +00:00
8a031bd4c8
Update Llama example to use the simplified transpose implementation and increase default profiler size to 1,000,000 events.
2023-12-15 12:06:42 +00:00
237a877a29
zml: Add support for Llama 3.2 text-only models. Implement transpose over embed_tokens as a replacement for missing lm_head and make lm_head optional for compatibility. Add repositories and executions to Bazel and update README.
2023-11-01 10:16:48 +00:00
37de7b9613
Add Llama example showcasing the new func.call emission and function caching behavior.
2023-10-17 11:00:37 +00:00
474f76cd75
Enable buffer donation in the Llama example, donating all buffers except the token_index buffer.
2023-10-03 16:32:40 +00:00
06865f5876
Update Llama example to use the new direct rope IR implementation.
2023-09-25 10:22:05 +00:00
7985716562
Add new Zig example programs (benchmark, llama, loader, mnist, simple_layer) and include a test for the llama example.
2023-06-27 14:23:22 +00:00
fdb7da5c9b
Introduce sharding attributes to Llama weights to enable Tensor Parallelism.
2023-04-13 12:35:27 +00:00
16e066ec69
Add llama example demonstrating the new gatherValues functionality.
2023-01-11 09:58:09 +00:00
eded305649
Add initial documentation and example projects for ZML, covering how‑to guides, tutorials, and benchmark examples.
2023-01-03 10:21:07 +00:00