|
|
f63c673f45
|
bazel: add RPATH manipulation to patchelf
|
2025-03-05 11:56:40 +00:00 |
|
|
|
9488672d4b
|
workspace: bump XLA to version 20250710.0-22ea002
Also:
- Bump XLA deps: `com_github_grpc_grpc` and `com_google_protobuf`
- Inject `rules_ml_toolchain`
- Fix `zig_proto_library` rule
|
2025-03-04 17:12:34 +00:00 |
|
|
|
fa0ed045ef
|
runtimes/cuda: downgrade CUDA and cuDNN
This commit reverts part of https://github.com/zml/zml/pull/238/files
This is required because XLA has a strong dependency on CUDA 12.8 and
upgrading to 12.9 is impossible due to
https://github.com/NVIDIA/cccl/issues/4967
|
2025-02-28 17:36:12 +00:00 |
|
|
|
ff1433d998
|
pjrt: bind PJRT_Client_CreateUninitializedBuffer.
|
2025-02-25 10:37:45 +00:00 |
|
|
|
8456a0d073
|
zml/pjrt: add binding for PJRT_Device_MemoryStats.
|
2025-02-19 12:14:05 +00:00 |
|
|
|
a580f2a398
|
Async: use stronger memory ordering to prevent potential segfaults due to ordering issues.
|
2025-02-18 11:38:56 +00:00 |
|
|
|
4d6d975dc0
|
Patch aio.zig: update loadBuffersWithPrefix argument type to match the conditional type of loadBuffers init_args.
|
2025-02-13 09:48:13 +00:00 |
|
|
|
af8844c1f1
|
Add model prefix support when loading a model from safetensors, enabling use of a specific model prefix (e.g., ModernBertModel) instead of the full model. Tested with the text embeddings server project.
|
2025-02-12 13:18:27 +00:00 |
|
|
|
1cafcc3c60
|
Workspace: bump XLA to newer version.
|
2025-02-05 17:35:27 +00:00 |
|
|
|
9ef838be25
|
Update neuron runtime BUILD.bazel to use Bazel manual tag and S3 cache integration.
|
2025-02-03 14:03:33 +00:00 |
|
|
|
dd52e988b4
|
Update example Bazel build files (MODULE.bazel, llama, modernbert) to test the revamped commit workflow.
|
2025-01-31 16:28:38 +00:00 |
|
|
|
0a2ab7c8cb
|
Remove usingnamespace from MLIR.
|
2025-01-28 09:35:58 +00:00 |
|
|
|
f8ab0d7b2a
|
Remove dead imports.
|
2025-01-22 10:45:04 +00:00 |
|
|
|
51a6cab753
|
Wire has_side_effect field in zml/ops.
|
2025-01-20 16:45:13 +00:00 |
|
|
|
99a2001e63
|
Rename PJRT BufferType to follow Zig and ZML naming conventions.
|
2025-01-16 13:00:47 +00:00 |
|
|
|
7324a49da3
|
Remove .print() calls from globalAttnMask() and localAttnMask() in ModernBERT example to resolve compilation sharding error.
|
2025-01-15 16:59:26 +00:00 |
|
|
|
09c43b8759
|
Add customCall operation to zml/ops.
|
2025-01-09 15:01:33 +00:00 |
|
|
|
9f1cc762cd
|
Fix map tests in zml/meta.
|
2025-01-06 17:49:50 +00:00 |
|
|
|
fbf1ecb8b7
|
Introduce Executable.getCompiledMemoryStats in PJRT.
|
2025-01-02 16:36:13 +00:00 |
|
|
|
4b1a3ff48a
|
Add union support to mapping helpers in zml/meta.zig.
|
2025-01-01 13:35:17 +00:00 |
|
|
|
c961d705f1
|
Set default values for operand_layouts and result_layouts in StableHLO dialect.
|
2024-12-26 09:29:45 +00:00 |
|
|
|
e6286b6097
|
Update Buffer.from to be blocking by default and add options for async loading and memory placement, adjusting aio, hostbuffer, pjrtx, and tensor implementations.
|
2024-12-25 17:14:44 +00:00 |
|
|
|
da1fd2d9dc
|
Add examples demonstrating Buffer.from options, non‑blocking loading, and memory copy behavior.
|
2024-12-20 09:30:35 +00:00 |
|
|
|
bb2b77d7de
|
Correctly set model.norm.eps in Llama examples.
|
2024-12-18 11:48:23 +00:00 |
|
|
|
6aa9aa5a7b
|
Add preliminary implementation for custom call support.
|
2024-12-10 09:36:37 +00:00 |
|
|
|
1d5b79111a
|
modernbert: set default epsilon value for embeddings layernorm.
|
2024-12-09 16:43:29 +00:00 |
|
|
|
a63d0a4aa3
|
Update example MODULE.bazel and lockfile to use the toolchains_llvm_bootstrapped configuration.
|
2024-12-04 11:30:44 +00:00 |
|
|
|
5464281c91
|
Update workspace configuration to use the toolchains_llvm_bootstrapped toolchain for Zig builds.
|
2024-12-03 13:50:58 +00:00 |
|
|
|
f5ab2c3a55
|
zml: eliminate compile-time fields from Bufferized, removing the need to pass undefined to exe.call for inlined arguments. Introduce BufferizedWithArgs in zml.testing for compileAndCall utility.
|
2024-11-28 12:24:39 +00:00 |
|
|
|
364a222dc1
|
Update example MODULE.bazel and lockfile to target XLA version 20250527.0-cb67f2f.
|
2024-11-25 17:57:45 +00:00 |
|
|
|
95453c7242
|
Update XLA dependency to version 20250527.0-cb67f2f and refresh related Bazel BUILD, MODULE, overlay and patch files.
|
2024-11-22 16:50:20 +00:00 |
|
|
|
fa13287931
|
workspace: upgrade to Zig 0.14.1 and handle empty tuple syntax &.{} being detected as *const @TypeOf(.{}).
|
2024-11-19 11:45:36 +00:00 |
|
|
|
d8a83830e8
|
runtimes: switch to Cloudflare Debian snapshots for more reliable dependency pinning.
|
2024-11-15 09:40:58 +00:00 |
|
|
|
ea3ce685a9
|
runtimes/neuron: bump runtime version and expose nrt.h header to Zig.
|
2024-11-14 13:37:47 +00:00 |
|
|
|
09da9c2982
|
Make zls.sh example explicitly set the ZLS runner target.
|
2024-11-06 16:22:44 +00:00 |
|
|
|
948c577205
|
Make ZLS runner target explicit in workspace BUILD files and update the zls.sh script accordingly.
|
2024-11-04 13:57:59 +00:00 |
|
|
|
47a4eda5f6
|
runtimes/cuda: expose cuda.h in the C namespace for CUDA runtimes, enabling custom calls to CUDA functions.
|
2024-11-01 13:27:24 +00:00 |
|
|
|
3849eb10b7
|
Add buffer and hostbuffer utilities with precise f32→bf16 conversion, type inference for loadBuffers, store expected input shapes, enhance meta.visit and JSON TaggedUnion support, and improve logging.
|
2024-10-28 11:21:46 +00:00 |
|
|
|
1540c6e85e
|
Update loader example to demonstrate new HostBuffer helpers and type‑inferred buffer loading.
|
2024-10-25 10:20:04 +00:00 |
|
|
|
048d7eb38e
|
third_party/sentencepiece: add missing protobuf_lite dependency and bump version.
|
2024-10-22 16:41:52 +00:00 |
|
|
|
4ef81b89ea
|
stdx.fmt: add slice formatting support, improving on previous prettyPrinter implementation by leveraging internal fmt mechanisms.
|
2024-10-18 15:05:08 +00:00 |
|
|
|
fe55c600d4
|
Add/refresh how‑to docs and example loader for deployment, Docker, HuggingFace token, and getting‑started tutorials.
|
2024-10-14 11:27:41 +00:00 |
|
|
|
59f99c4501
|
Enable Bazel debug/release Zig builds and add stdx.time.Duration.hz and stdx.math.roundeven utilities.
|
2024-10-08 09:39:00 +00:00 |
|
|
|
aacbf2ee04
|
Fix Llama3 rope scaling implementation in the neural network module (zml/nn.zig).
|
2024-10-07 12:53:03 +00:00 |
|
|
|
9f61a8aacb
|
Update example Zig code for llama3 rope scaling and modernbert usage.
|
2024-10-04 17:49:07 +00:00 |
|
|
|
d0cf5d3042
|
Update example MODULE.bazel and its lock file to incorporate workspace fixes and version bumps.
|
2024-09-30 14:00:16 +00:00 |
|
|
|
4a0b1cce50
|
Update Bazel workspace and XLA overlay (MODULE.bazel, BUILD files, patches) to prevent dual LLVM builds and apply migration/bump patches.
|
2024-09-27 14:00:44 +00:00 |
|
|
|
8b53ebbd9b
|
pjrt: bind PJRT_Device_AddressableMemories.
|
2024-09-26 12:02:04 +00:00 |
|
|
|
2863c1f5e0
|
zml/tensor: fix returned value in Tensor.toMemory – ensure _output_memory_kind is set correctly in the result.
|
2024-09-18 13:18:08 +00:00 |
|
|
|
fa4a8d8de4
|
async: enforce synchronous logging by routing async.zig logs through std.log.defaultLog to avoid panics when using async.LogFn.
|
2024-09-17 09:03:48 +00:00 |
|