Commit Graph

  • 82882cfd3e Add Qwen3VL bf16 example implementation and tutorial docs, including BMP/JPG/PNG support via zignal library. master Foke Singh 2025-12-31 12:46:11 +0000
  • e659dc8fa3 Add Qwen3VL bf16 example implementation and integrate zignal image format support; update Bazel build files and core ZML modules. Tarry Singh 2025-12-29 16:17:11 +0000
  • b8b4d33379 Update XLA to latest version Tarry Singh 2025-12-23 17:24:34 +0000
  • 57bef8d66c Switch workspace to use apple_support C++ toolchains instead of rules_cc, aligning with the recommended toolchain. Tarry Singh 2025-12-22 17:33:18 +0000
  • 6c80f2f394 Set LLVM backend as default for all Zig compile actions via rules_zig zigopt global setting, due to stability concerns with the self-hosted backend. Tarry Singh 2025-12-18 10:04:04 +0000
  • a3abf148b4 Fix pickle loader to use takeDelimiterInclusive for Zig 0.15.2 and update ZLS runner, buffer, callback, and tracer utilities accordingly. Tarry Singh 2025-12-15 11:08:19 +0000
  • 1b8d0ac627 Update callback example to use takeDelimiterInclusive for Zig 0.15.2 delimiter behavior change. Foke Singh 2025-12-12 14:00:50 +0000
  • 7e81d022ff Add implicit dependency on hf_transfer for tools/hf. Tarry Singh 2025-12-08 09:24:52 +0000
  • fa886db3bb Disable ZML aio torch pickle test pending implementation fix due to protocol support limitation in version 0.15.2. Tarry Singh 2025-12-03 10:57:27 +0000
  • ba5043ddf8 Switch ZML to use .host_pinned memory on ROCm, addressing missing host_unpinned memory error. Tarry Singh 2025-12-01 14:34:13 +0000
  • 7894335dc9 Correct compilation errors in examples/callback. Foke Singh 2025-11-26 15:23:41 +0000
  • b58f7ced3d Fix $ORIGIN handling in runtimes/neuron by escaping $ since zigopts does not expand Make variables. Tarry Singh 2025-11-24 12:04:56 +0000
  • 993cbffd22 Upgrade workspace Zig version to 0.15.2. Tarry Singh 2025-11-21 12:12:36 +0000
  • d3baeed676 Add -fllvm flag to the MNIST example BUILD configuration. Foke Singh 2025-11-20 15:28:47 +0000
  • 3d3a0ea463 Enable -fllvm flag for CUDA runtime, upb, and ZML Bazel targets. Tarry Singh 2025-11-11 13:02:40 +0000
  • 8f50d19cba Add missing C++ toolchain to zls. Tarry Singh 2025-11-10 09:44:11 +0000
  • 94c5bbcfa6 Update workspace to use rules_zig archive_override before version 0.12.2, fixing shared_library and PJRT loading and ensuring os/log.h inclusion. Tarry Singh 2025-11-06 12:36:22 +0000
  • cd1b66f615 chore: remove unused target Tarry Singh 2025-11-04 12:15:20 +0000
  • 91f1c3b7aa pjrt/cpu: fix compilation when CPU is absent by ensuring CcCompilationContext is non-empty for Zig modules accessing the c module Tarry Singh 2025-10-30 09:01:36 +0000
  • b67a3f9a0a workspace: switch to libxev from Bazel Central Registry, completing migration away from custom registry Tarry Singh 2025-10-29 17:43:22 +0000
  • 7bdce4c078 Add Llama 3.1 example showcasing optional head-dimension support. Foke Singh 2025-10-23 17:54:48 +0000
  • 675cefaf26 Patch http_deb_archive rule and improve stdx utilities and ZML callback handling Tarry Singh 2025-10-20 10:24:27 +0000
  • 20ca4a6648 Update MNIST example BUILD file to use zigopts and align with new upstreamed rules_zig configuration. Foke Singh 2025-10-14 16:57:38 +0000
  • bcd43314a4 Migrate BUILD and MODULE files to upstreamed rules_zig (rename copts to zigopts) and adjust ZLS integration accordingly. Tarry Singh 2025-10-13 15:26:42 +0000
  • 7d7c124ada runtimes/cpu: update Darwin arm64 PJRT tarball checksum to match v13.0.0, enabling Bazel fetch on macOS arm64 Tarry Singh 2025-10-07 10:18:40 +0000
  • d056fd3511 zml: cleanup some todos Tarry Singh 2025-10-06 15:29:57 +0000
  • 77cd21d2b2 Add gpt-oss model support to core ZML components: formatting, utility functions, safetensors I/O, host buffer management, NN layer handling, and tensor operations. Tarry Singh 2025-10-01 14:20:32 +0000
  • e1b7fc5781 Add example implementation and Bazel build for OpenAI gpt-oss models (GptOss.zig, main.zig, and BUILD.bazel). Foke Singh 2025-09-30 16:17:00 +0000
  • d45a667ee5 Revamp gather API with named indices (and add gather_ variant), improve topK handling, and add Yarn rope embedding support across core modules (buffer, nn, pjrtx, quantization, shape, tensor, testing, tokenizer, torch). Tarry Singh 2025-09-26 13:38:11 +0000
  • 7264fff493 Update MNIST and ModernBERT example scripts to use the new gather and topk APIs. Foke Singh 2025-09-24 15:42:09 +0000
  • e641d05dd2 mlir, pjrt, zml: expose missing data types (u2, i1, i2, f4e2m1fn, f8e3m4, f8e4m3, f8e8m0fnu); fix Float32 conversion bug that truncated values Tarry Singh 2025-09-19 12:13:32 +0000
  • 29bd1242ba Add platform tag to buffers for target identification and safety checks; include workaround for PJRT uninitialized memory handling. Tarry Singh 2025-09-16 17:43:18 +0000
  • 9aeb4e9cd0 Update docs and example to demonstrate adding the platform tag to buffers when adding weights. Foke Singh 2025-09-11 11:15:37 +0000
  • 12efc763d5 Update modernbert example to reflect removal of old writer interface APIs. Foke Singh 2025-09-09 10:54:06 +0000
  • 3ed9bca5ad Remove deprecated writer interface APIs from core ZML modules (async, MLIR, PJRT, runtime, fmt, aio, buffer, exe, hostbuffer, meta, mlirx). Tarry Singh 2025-09-04 14:03:09 +0000
  • 090d7748d5 Update docs and example scripts (including test files) to remove async helpers asynk, asyncc, awaitt, and await_ Foke Singh 2025-09-02 10:27:40 +0000
  • 6e15123fb3 Remove obsolete async symbols (asynk, asyncc, awaitt, await_) from core, runtime, and aio modules. Tarry Singh 2025-08-29 11:03:59 +0000
  • 7913c00d70 Implement buffer-ID based loading by moving tensor ID handling into BufferStore, fix zml.call tag hashing, and expose CPU device count. Tarry Singh 2025-08-28 14:39:21 +0000
  • 6e7617918d Update llama and simple_layer examples to use BufferStore tensor IDs, new CPU device count API, and fix zml.call tag hashing. Foke Singh 2025-08-22 17:55:03 +0000
  • cc969bd532 Add experimental zml.callback API (renamed from custom_call) and fix tensor.print(); update PJRT bindings, host buffer utilities, and related core ZML modules. Tarry Singh 2025-08-20 10:27:54 +0000
  • 1fa056a790 Add example and Bazel build for zml.callback demonstrating a manual CUDA kernel invocation. Foke Singh 2025-08-13 12:14:53 +0000
  • 01da2184fe xla: bump to commit b3fbfee, temporarily disable libnvptxcompiler due to missing support in PjRT CUDA plugin v13.0, add nvshmem to sandbox for PjRT CUDA plugin Tarry Singh 2025-08-12 13:32:18 +0000
  • f35119f768 ci: revamp commit workflow, deduplicate workflows, fix Linux linking issue introduced by upb, resolve duplicated _start symbol in test targets, temporarily use -c opt to bypass static archive bug Tarry Singh 2025-08-08 13:51:27 +0000
  • 9e3cd6d616 bump runtimes/* code to Zig 0.15.1, restore PyTorch loader using std.fs.File, update CI zig fmt, remove stdx.io, note remaining issues with Neuron and CUDA debug builds Tarry Singh 2025-08-07 15:09:27 +0000
  • 0ed7f5c907 Add Zig example programs for LLaMA, ModernBERT, and SimpleLayer, including a Bazel BUILD file for the LLaMA example. Foke Singh 2025-07-29 16:07:11 +0000
  • 488a844a0f Add Zig 0.15 compatibility: update BUILD files, async primitives, stdx utilities, MLIR dialects, and PJRT FFI. Tarry Singh 2025-07-28 13:54:28 +0000
  • e3b7705e3d Remove gguf and torch loader implementations and related BUILD and test assets. Tarry Singh 2025-07-24 17:19:43 +0000
  • 01db09c24b Update MNIST example (BUILD.bazel and mnist.zig) to remove torch loader references. Foke Singh 2025-07-23 12:53:46 +0000
  • 1cf26756a1 workspace: run buildifier, drop rules_uv, refactor tools/hf dependencies Tarry Singh 2025-07-16 10:01:41 +0000
  • 1427286716 runtimes/neuron: fix neuron runtime Tarry Singh 2025-07-15 15:26:03 +0000
  • e1ee340306 runtimes/cuda: implement zmlxcuda in Zig Tarry Singh 2025-07-08 09:25:25 +0000
  • c488b634fc runtimes/rocm: implement zmlxrocm in Zig Tarry Singh 2025-07-07 16:48:07 +0000
  • a512b9c8a2 Update docs and example BUILD files to reflect the switch from zig_cc_binary to zig_binary. Foke Singh 2025-07-04 15:10:46 +0000
  • cf00506dbb Switch workspace build rules from zig_cc_binary to zig_binary, removing the hack and using the C linker directly. Tarry Singh 2025-07-03 15:10:36 +0000
  • 78679817df zml: remove nemo, sentencepiece loaders, remove zig-yaml Tarry Singh 2025-06-24 15:39:55 +0000
  • fe56f03f5d third_party/xla: de-modularize xla Tarry Singh 2025-06-23 09:13:43 +0000
  • a540564744 Remove example workspace and related documentation files. Foke Singh 2025-06-20 13:23:06 +0000
  • e789e26008 Remove examples workspace and clean up related Bazel BUILD/MODULE files and Zig build scripts. Tarry Singh 2025-06-19 09:30:29 +0000
  • 0716f2dc0e Update examples/MODULE.bazel.lock to reflect the switch from zig-protobuf to upb. Foke Singh 2025-06-10 16:04:53 +0000
  • 1b58c34b8c Update ZML build configuration to replace zig-protobuf with upb library. Tarry Singh 2025-06-09 16:34:20 +0000
  • cba3be4859 pjrt,zml: remove profiler Tarry Singh 2025-06-06 12:24:42 +0000
  • 7fb02e1888 Update docs: add deployment guide, Hugging Face token instructions, getting-started tutorial, and include a Bazel lock example. Foke Singh 2025-06-05 13:18:14 +0000
  • f9280b1069 Add Bazel MODULE and tool build configuration (BUILD, requirements) for packaging the HF CLI. Tarry Singh 2025-05-30 16:42:57 +0000
  • 111afcdd95 examples/llama: switch to --hf-model-path flag Foke Singh 2025-05-28 13:21:00 +0000
  • 5a49a3e8ca Update examples MODULE.bazel.lock to reflect the added sandbox neuron dependencies. Foke Singh 2025-05-22 10:10:14 +0000
  • 1a2b862ec2 Add sandbox neuron dependencies: define a trampoline PJRT, create an empty repository for distroless deps, and update Bazel build files and Zig/C sources accordingly. Tarry Singh 2025-05-19 17:35:33 +0000
  • 9e66d5a06a Update examples/MODULE.bazel.lock to reflect new dependencies. Foke Singh 2025-05-16 11:26:13 +0000
  • 55c5b540f8 Add XLA 20250718.0-6319f0d with ROCm 6.4.1 support, update Bazel module files and runtime configs, and apply migration, FFI-handler and header-cleanup patches. Tarry Singh 2025-05-12 12:10:27 +0000
  • cba9ce9615 protobuf, libxev: remove usingnamespace Tarry Singh 2025-05-06 11:58:20 +0000
  • db303a6010 remove usingnamespace from asyncio Tarry Singh 2025-05-05 09:03:06 +0000
  • ed5ae31338 runtimes/rocm: fetch libdrm from amdgpu repository and add amdgpu.ids layer Tarry Singh 2025-04-30 15:53:51 +0000
  • 47146fc088 workspace: remove glibc version constraint, update rules_rust, and re-enable frame pointers following sandboxing changes. Tarry Singh 2025-04-29 12:09:13 +0000
  • e7323be10b runtimes/rocm: switch to in-process LLD, removing the need for sandboxed lld. Tarry Singh 2025-04-23 11:43:18 +0000
  • 02ba41ea57 Update examples/MODULE.bazel to bump rules_zig version and fix static-lib inclusion for non-exe targets. Foke Singh 2025-04-21 13:51:52 +0000
  • f5f3dc6b1b Update rules_zig to 20250714.0-b14a4f1 and fix static-lib inclusion in runfiles for non-exe targets. Tarry Singh 2025-04-18 12:09:38 +0000
  • 7d9fdf94e7 runtimes/rocm: sandbox ROCm dependencies and ensure they load on the main thread due to TLS usage in static C++ destructors. Tarry Singh 2025-04-14 16:38:15 +0000
  • eba0e72532 runtimes/tpu: sandbox TPU PJRT plugin; no external dependencies. Tarry Singh 2025-04-10 14:47:16 +0000
  • 8073e45894 Update examples/MODULE.bazel.lock to reflect bumped hftokenizers dependency. Foke Singh 2025-04-09 10:21:44 +0000
  • 4294a4d08f Bump hftokenizers dependency versions in Bazel and Cargo lockfiles (MODULE.bazel.lock, Cargo.toml, Cargo.lock) Tarry Singh 2025-04-04 12:54:33 +0000
  • 78d7b672e7 runtimes/cpu: sandbox CPU PJRT plugin, simplifying as there are no additional NEEDED dependencies. Tarry Singh 2025-04-03 11:57:46 +0000
  • 2d321d232d runtimes/cuda: sandbox CUDA dependencies by removing them from the leaf binary, sandboxing the dependency graph, marking dlopen direct dependencies as NEEDED, setting RPATH to the sandbox, loading the PJRT plugin from the sandbox, and enabling weak CUDA symbols without direct linking. Tarry Singh 2025-03-26 11:18:29 +0000
  • a5420068b1 pjrt: emit warning instead of panic when FFI Extension is missing (e.g., on TPU). Tarry Singh 2025-03-24 09:40:44 +0000
  • dc121fce4f Update example MODULE.bazel and lockfile to reflect toolchains_llvm_bootstrapped bump to 0.2.4. Foke Singh 2025-03-20 12:17:30 +0000
  • 907577525f Update MODULE.bazel and lockfile to bump toolchains_llvm_bootstrapped to version 0.2.4. Tarry Singh 2025-03-18 11:47:22 +0000
  • f27a524f31 Update rules_zig: add zig_srcs target, fix source handling bug, clean up BUILD files, adjust async/coro.zig tests, and disable nemo and yaml model loaders. Tarry Singh 2025-03-13 12:27:21 +0000
  • 6fc1148206 async/coro: make coroutines unwindable by zeroing the initial stack region, preventing random unwinding behavior and SIGSEGV during _Unwind_Backtrace. Tarry Singh 2025-03-10 16:25:45 +0000
  • f63c673f45 bazel: add RPATH manipulation to patchelf Tarry Singh 2025-03-05 11:56:40 +0000
  • 9488672d4b workspace: bump xla to version 20250710.0-22ea002 Tarry Singh 2025-03-04 17:12:34 +0000
  • fa0ed045ef runtimes/cuda: downgrade cuda and cudnn Tarry Singh 2025-02-28 17:36:12 +0000
  • ff1433d998 pjrt: bind PJRT_Client_CreateUninitializedBuffer. Tarry Singh 2025-02-25 10:37:45 +0000
  • 8456a0d073 zml/pjrt: add binding for PJRT_Device_MemoryStats. Tarry Singh 2025-02-19 12:14:05 +0000
  • a580f2a398 Async: use stronger memory ordering to prevent potential segfaults due to ordering issues. Tarry Singh 2025-02-18 11:38:56 +0000
  • 4d6d975dc0 Patch aio.zig: update loadBuffersWithPrefix argument type to match the conditional type of loadBuffers init_args. Tarry Singh 2025-02-13 09:48:13 +0000
  • af8844c1f1 Add model prefix support when loading a model from safetensors, enabling use of a specific model prefix (e.g., ModernBertModel) instead of the full model. Tested with the text embeddings server project. Tarry Singh 2025-02-12 13:18:27 +0000
  • 1cafcc3c60 Workspace: bump XLA to newer version. Tarry Singh 2025-02-05 17:35:27 +0000
  • 9ef838be25 Update neuron runtime BUILD.bazel to use Bazel manual tag and S3 cache integration. Tarry Singh 2025-02-03 14:03:33 +0000
  • dd52e988b4 Update example Bazel build files (MODULE.bazel, llama, modernbert) to test the revamped commit workflow. Foke Singh 2025-01-31 16:28:38 +0000
  • 0a2ab7c8cb Remove usingnamespace from MLIR. Tarry Singh 2025-01-28 09:35:58 +0000
  • f8ab0d7b2a Remove dead imports. Tarry Singh 2025-01-22 10:45:04 +0000
  • 51a6cab753 Wire has_side_effect field in zml/ops. Tarry Singh 2025-01-20 16:45:13 +0000