Commit Graph

  • 99a2001e63 Rename PJRT BufferType to follow Zig and ZML naming conventions. Tarry Singh 2025-01-16 13:00:47 +0000
  • 7324a49da3 Remove .print() calls from globalAttnMask() and localAttnMask() in ModernBERT example to resolve compilation sharding error. Foke Singh 2025-01-15 16:59:26 +0000
  • 09c43b8759 Add customCall operation to zml/ops. Tarry Singh 2025-01-09 15:01:33 +0000
  • 9f1cc762cd Fix map tests in zml/meta. Tarry Singh 2025-01-06 17:49:50 +0000
  • fbf1ecb8b7 Introduce Executable.getCompiledMemoryStats in PJRT. Tarry Singh 2025-01-02 16:36:13 +0000
  • 4b1a3ff48a Add union support to mapping helpers in zml/meta.zig. Tarry Singh 2025-01-01 13:35:17 +0000
  • c961d705f1 Set default values for operand_layouts and result_layouts in StableHLO dialect. Tarry Singh 2024-12-26 09:29:45 +0000
  • e6286b6097 Update Buffer.from to be blocking by default and add options for async loading and memory placement, adjusting aio, hostbuffer, pjrtx, and tensor implementations. Tarry Singh 2024-12-25 17:14:44 +0000
  • da1fd2d9dc Add examples demonstrating Buffer.from options, non‑blocking loading, and memory copy behavior. Foke Singh 2024-12-20 09:30:35 +0000
  • bb2b77d7de Correctly set model.norm.eps in Llama examples. Foke Singh 2024-12-18 11:48:23 +0000
  • 6aa9aa5a7b Add preliminary implementation for custom call support. Tarry Singh 2024-12-10 09:36:37 +0000
  • 1d5b79111a modernbert: set default epsilon value for embeddings layernorm. Foke Singh 2024-12-09 16:43:29 +0000
  • a63d0a4aa3 Update example MODULE.bazel and lockfile to use the toolchains_llvm_bootstrapped configuration. Foke Singh 2024-12-04 11:30:44 +0000
  • 5464281c91 Update workspace configuration to use the toolchains_llvm_bootstrapped toolchain for Zig builds. Tarry Singh 2024-12-03 13:50:58 +0000
  • f5ab2c3a55 zml: eliminate compile-time fields from Bufferized, removing the need to pass undefined to exe.call for inlined arguments. Introduce BufferizedWithArgs in zml.testing for compileAndCall utility. Tarry Singh 2024-11-28 12:24:39 +0000
  • 364a222dc1 Update example MODULE.bazel and lockfile to target XLA version 20250527.0‑cb67f2f. Foke Singh 2024-11-25 17:57:45 +0000
  • 95453c7242 Update XLA dependency to version 20250527.0‑cb67f2f and refresh related Bazel BUILD, MODULE, overlay and patch files. Tarry Singh 2024-11-22 16:50:20 +0000
  • fa13287931 workspace: upgrade to Zig 0.14.1 and handle empty tuple syntax &.{} being detected as *const @TypeOf(.{}). Tarry Singh 2024-11-19 11:45:36 +0000
  • d8a83830e8 runtimes: switch to Cloudflare Debian snapshots for more reliable dependency pinning. Tarry Singh 2024-11-15 09:40:58 +0000
  • ea3ce685a9 runtimes/neuron: bump runtime version and expose nrt.h header to Zig. Tarry Singh 2024-11-14 13:37:47 +0000
  • 09da9c2982 Make zls.sh example explicitly set the ZLS runner target. Foke Singh 2024-11-06 16:22:44 +0000
  • 948c577205 Make ZLS runner target explicit in workspace BUILD files and update the zls.sh script accordingly. Tarry Singh 2024-11-04 13:57:59 +0000
  • 47a4eda5f6 runtimes/cuda: expose cuda.h in the C namespace for CUDA runtimes, enabling custom calls to CUDA functions. Tarry Singh 2024-11-01 13:27:24 +0000
  • 3849eb10b7 Add buffer and hostbuffer utilities with precise f32→bf16 conversion, type inference for loadBuffers, store expected input shapes, enhance meta.visit and JSON TaggedUnion support, and improve logging. Tarry Singh 2024-10-28 11:21:46 +0000
  • 1540c6e85e Update loader example to demonstrate new HostBuffer helpers and type‑inferred buffer loading. Foke Singh 2024-10-25 10:20:04 +0000
  • 048d7eb38e third_party/sentencepiece: add missing protobuf_lite dependency and bump version. Tarry Singh 2024-10-22 16:41:52 +0000
  • 4ef81b89ea stdx.fmt: add slice formatting support, improving on previous prettyPrinter implementation by leveraging internal fmt mechanisms. Tarry Singh 2024-10-18 15:05:08 +0000
  • fe55c600d4 Add/refresh how‑to docs and example loader for deployment, Docker, HuggingFace token, and getting‑started tutorials. Foke Singh 2024-10-14 11:27:41 +0000
  • 59f99c4501 Enable Bazel debug/release Zig builds and add stdx.time.Duration.hz and stdx.math.roundeven utilities. Tarry Singh 2024-10-08 09:39:00 +0000
  • aacbf2ee04 Fix Llama3 rope scaling implementation in the neural network module (zml/nn.zig) Tarry Singh 2024-10-07 12:53:03 +0000
  • 9f61a8aacb Update example Zig code for llama3 rope scaling and modernbert usage. Foke Singh 2024-10-04 17:49:07 +0000
  • d0cf5d3042 Update example MODULE.bazel and its lock file to incorporate workspace fixes and version bumps Foke Singh 2024-09-30 14:00:16 +0000
  • 4a0b1cce50 Update Bazel workspace and XLA overlay (MODULE.bazel, BUILD files, patches) to prevent dual LLVM builds and apply migration/bump patches. Tarry Singh 2024-09-27 14:00:44 +0000
  • 8b53ebbd9b pjrt: bind PJRT_Device_AddressableMemories. Tarry Singh 2024-09-26 12:02:04 +0000
  • 2863c1f5e0 zml/tensor: fix returned value in Tensor.toMemory – ensure _output_memory_kind is set correctly in the result. Tarry Singh 2024-09-18 13:18:08 +0000
  • fa4a8d8de4 async: enforce synchronous logging by routing async.zig logs through std.log.defaultLog to avoid panics when using async.LogFn Tarry Singh 2024-09-17 09:03:48 +0000
  • aec7072837 pjrt: add FFI bindings for custom calls Tarry Singh 2024-09-10 09:14:28 +0000
  • 1f5ff96c10 zml/ops: add wiring for operand output alias in zml.ops.triton Tarry Singh 2024-09-09 15:00:28 +0000
  • 7e0fcecfc9 async: adjust implementation for xev version bump Tarry Singh 2024-09-06 16:00:58 +0000
  • 4b7e618b43 zml/aio: add bool handling in struct population within populateStruct Tarry Singh 2024-09-02 14:11:47 +0000
  • 1068d0bb33 async: fix TCP connection handling Tarry Singh 2024-08-29 17:10:30 +0000
  • ac63c30e12 add mini-DSL for creating MLIR common attributes and types, leveraging Zig 0.14 to simplify mlir.Type and mlir.Attribute creation Tarry Singh 2024-08-26 14:19:00 +0000
  • 63ef78efcc zml: add support for NVTX tracing Tarry Singh 2024-08-21 14:41:40 +0000
  • a5e588f53b async: detect xev only when the API is dynamic to fix macOS Tarry Singh 2024-08-20 13:51:50 +0000
  • 8a8d3e0b15 workspace: fix incorrect checksum for ZLS macOS x86_64 Tarry Singh 2024-08-16 14:54:11 +0000
  • 4d809118d2 async: detect and select the most suitable xev backend, enabling use of io_uring when supported Tarry Singh 2024-08-14 14:23:57 +0000
  • 12713ea8c7 Update Bazel MODULE files to fetch LLVM, StableHLO, and Triton dependencies, aligning versions for XLA. Foke Singh 2024-08-07 17:13:03 +0000
  • 7df89301dc Bump XLA version and import llvm, stablehlo, triton, and zig‑protobuf modules in workspace BUILD files. Tarry Singh 2024-08-06 10:28:43 +0000
  • cbe6e730bd stablehlo: fix forwarding of optional operand and result layout attributes in custom call Tarry Singh 2024-07-31 17:53:18 +0000
  • 44933c9b89 Add macOS x86_64 support to example Bazel MODULE and platform mappings. Foke Singh 2024-07-29 10:31:51 +0000
  • ca4e061ad5 Add Bazel build configurations for macOS x86_64 CPU runtime and ZLS third‑party integration. Tarry Singh 2024-07-25 15:58:14 +0000
  • 3f36506f1c zml: remove usingnamespace from floats.zig and related dependencies; note that incremental compilation does not improve overall build time due to linking overhead Tarry Singh 2024-07-23 17:43:43 +0000
  • 42dee5d0e0 mlir: rework stablehlo custom call implementation and add a Triton example Tarry Singh 2024-07-16 13:23:07 +0000
  • aec1d96e6d mlir: rework DenseElementsAttribute to correctly slice inputs and modify .as() to return a concrete value instead of an optional Tarry Singh 2024-07-15 12:32:24 +0000
  • 201f5245c1 mlir: add MLIR index type Tarry Singh 2024-07-11 11:14:22 +0000
  • efcf955a4e workspace, third_party/rules_zig: adjust ZLS to require --version as the first parameter and add missing keys to the BuildConfig object for code completion Tarry Singh 2024-07-10 15:20:12 +0000
  • b67fd2367d Add benchmark, Llama, and ModernBERT example programs plus third‑party non‑module dependencies. Foke Singh 2024-07-03 11:30:49 +0000
  • 30f6be0e2f Update core Zig modules (async, mlir, pjrt, stdx) and third‑party Bazel definitions for the Zig 0.14.0 release. Tarry Singh 2024-07-02 14:19:04 +0000
  • 16cc8c6658 Update examples MODULE.bazel.lock to reflect sandboxing rework and upgraded CUDA/ROCm version dependencies. Foke Singh 2024-06-27 12:32:07 +0000
  • 967eeb928f Update Bazel workspace and runtime configs: rework sandboxing, bump PJRT to 7.0.0, and upgrade CUDA (12.8), cuDNN (9.8), and ROCm (6.3.4). Tarry Singh 2024-06-25 11:00:29 +0000
  • 8613a6da7d workspace: update rules_cc to 0.1.1 and Bazel to 8.1.1 to resolve yanked version issue; adjust MODULE.bazel and .bazelversion accordingly; verified successful build and execution Tarry Singh 2024-06-19 12:06:53 +0000
  • 5bcaf374c8 workspace: bump libxev to 20250222.0-07bcffa and introduce Dynamic to select between IO Uring and Epoll on Linux based on availability Tarry Singh 2024-06-18 14:26:44 +0000
  • 18eb0e5a7b Add async I/O, SentencePiece, NN, and tensor utilities for ModernBERT support and update Bazel build configuration. Tarry Singh 2024-06-14 15:27:06 +0000
  • 17d02621e7 Add tests for the ModernBERT example, covering activation utilities, build setup, and example Zig code. Foke Singh 2024-06-11 17:33:22 +0000
  • ab5ad874c3 workspace: fix libxev version Tarry Singh 2024-06-07 14:25:44 +0000
  • 72263aa9e3 workspace: fix mistakes in modules bumps Tarry Singh 2024-06-06 09:56:17 +0000
  • f7450a2104 stablehlo: bump to head and use new dialect capi Tarry Singh 2024-05-31 13:02:46 +0000
  • 221ece647d zml/ops.zig: Added zml.ops.case operation Tarry Singh 2024-05-30 14:11:08 +0000
  • 27aabf9beb Add Bazel build rules and a test for the benchmark, llama, mnist, and simple_layer examples. Foke Singh 2024-05-23 15:52:34 +0000
  • 3aac788544 Update Bazel build configurations (zig.bzl, BUILD files) for MLIR, PJRT, Neuron, ROCm, tokenizer, and tools, fixing broken dependencies. Tarry Singh 2024-05-20 11:28:25 +0000
  • 05944b5cc9 Update FnCache to copy and reuse non‑tensor fields in fixed‑size structs, preventing undefined memory in core modules. Tarry Singh 2024-05-15 17:54:52 +0000
  • dfe55b0d34 Update Bazel lock file for examples to reflect FnCache non‑tensor handling changes. Foke Singh 2024-05-13 16:59:37 +0000
  • 8d795dd676 pjrt: profiler support std writer API Tarry Singh 2024-05-09 11:09:29 +0000
  • 26558d6201 Update examples MODULE.bazel and lockfile to use XLA 20250204.0-6789523 and ensure Bazel 8 compatibility. Foke Singh 2024-05-08 14:03:45 +0000
  • f5ab6ff2c6 Update XLA to version 20250204.0-6789523 and adjust Bazel module and runtime files for Bazel 8 compatibility. Tarry Singh 2024-05-03 15:57:56 +0000
  • a34190679b Fix llama token handling and remove redundant prompt token reuse in core Zig modules (aio, module, nn, pjrtx, tensor) Tarry Singh 2024-05-02 17:10:11 +0000
  • 394e63e273 Fix llama example to correctly handle token output and avoid re‑feeding the last prompt token. Foke Singh 2024-04-24 16:44:25 +0000
  • 5a2171793d workspace: MODULE.bazel cleanup Tarry Singh 2024-04-22 09:27:44 +0000
  • bafe13f546 Update examples/MODULE.bazel.lock to reflect libxev version bump. Foke Singh 2024-04-18 12:53:16 +0000
  • 65c28111a9 Update libxev to version 20252401.0‑31eed4e and apply patches and. Tarry Singh 2024-04-15 13:03:25 +0000
  • 13eff4e661 pjrt,zml: add memory bindings Tarry Singh 2024-04-11 15:43:24 +0000
  • 190c6978d2 llama: simplify llama3 prompt template encoding by removing redundant newline re-encoding and ensuring a trailing newline. Foke Singh 2024-04-10 09:36:28 +0000
  • d4db5ccc6b Integrate TinyLlama support, restore the homemade tokenizer, and align Zig API naming across stdx and zml tokenizer modules. Tarry Singh 2024-04-05 15:07:29 +0000
  • b67685b941 Add example Bazel build files and tokenizer test for tinyllama, including tigerbeetle integration and flags. Foke Singh 2024-04-01 17:40:18 +0000
  • 567210d1d7 bazel: depend on prebuilt protoc binaries to eliminate ~1300 build steps. Note: integration is currently blocked due to version constraints in rules_proto and toolchains_protoc. Tarry Singh 2024-03-29 09:54:57 +0000
  • e0c8eecb79 bazel: use OID as sha256 for Git LFS files to prevent unnecessary HuggingFace redownloads. Tarry Singh 2024-03-28 17:52:52 +0000
  • a811b2e1e3 llama: fix dimensions and data types Removed unnecessary batching dimension introduced by recent changes. Converted index outputs from i32 to u32 for token indices. Ensures Llama runs on CUDA and RoCM. Tested on CUDA. Foke Singh 2024-03-20 13:37:19 +0000
  • 602757e7a9 Update examples to use the corrected logFn API. Foke Singh 2024-03-18 13:11:14 +0000
  • 754656f2f0 Replace real mutex with async Mutex for logFn, add fallback logger support outside coroutines, and fix ResetCondition handling. Tarry Singh 2024-03-14 11:43:33 +0000
  • 980f1b17fb Ensure all runtime plugins have correct SONAME values, fixing issues with prebuilt PJRT plugins. Tarry Singh 2024-03-11 10:15:22 +0000
  • 8a25b1eb74 Revert CUDA PJRT plugin version to 0.4.38 to address performance regression on XLA master. Tarry Singh 2024-03-05 17:04:42 +0000
  • 76e314db9b Update Llama example docs and Bazel build files, and add tests for the new HuggingFace tokenizer integration. Foke Singh 2024-03-04 12:11:13 +0000
  • 959bc48c42 Add HuggingFace tokenizer bindings and SentencePiece integration; update BUILD files, async utilities, and FFI modules to support the new tokenizers. Tarry Singh 2024-02-28 15:47:37 +0000
  • 5048e7dc89 Update example lock file for rules_distroless 0.4.2 upgrade and verify MNIST image build works. Foke Singh 2024-02-26 15:30:13 +0000
  • b4b2490690 Upgrade rules_distroless to 0.4.2 in MODULE.bazel and refresh MODULE.bazel.lock accordingly. Tarry Singh 2024-02-21 17:48:10 +0000
  • c109b12e1b Various minor fixes: rewrite tinyllama tokenizer newline token, prevent HostBuffer.isContiguous false trigger on 1‑dim axes, improve HostBuffer.slice1d error messages, simplify module.zig output to show .mlir file path, correct setFlags handling of comptime int/float, make tokenizer.zig return <oob> for out‑of‑range detokenization, and speed up Buffer.constant creation up to 2.5 GB/s on CUDA. Tarry Singh 2024-02-19 12:34:18 +0000
  • 3970df5b48 Update getting_started tutorial and example Bazel files for Bazel 8 migration. Foke Singh 2024-02-14 10:44:47 +0000
  • 169a24307c Migrate workspace and XLA module definitions to Bazel 8, updating MODULE.bazel files, BUILD rules, and related migration patches. Tarry Singh 2024-02-12 12:43:23 +0000
  • 7e6103d876 Upgrade XLA to version 20250122.0-cc075be, switch to nvptx compiler and nvlink with nvjitlink support, add warning for CUDA path in LD_LIBRARY_PATH, and revert the previous CUDA sandbox fix. Tarry Singh 2024-02-06 09:31:48 +0000
  • b8a0aaee5a Update tokenizer to handle byte_fallback for Llama3 GPT2 vocab and add a Llama3‑specific normalizer; adjust tinyllama.zig and hostbuffer.zig to use the new tokenization logic. Tarry Singh 2024-02-05 15:22:44 +0000