Commit Graph

17 Commits

Author SHA1 Message Date
edc2ac26f8 Adjust ROCm runtime sandboxing to hook only the PJRT plugin and make hipblastlt bytecodes optional. 2024-01-26 13:02:23 +00:00
a7b7ae0180 Fix async hangs by reworking the libxev epoll backend and using callBlocking for PJRT plugin loading, improving performance across async and runtime modules. 2024-01-16 14:13:45 +00:00
434cee3a6c Fix CUDA and ROCm sandbox discovery, update epoll libxev patch to prevent high CPU usage, enable XLA GPU latency‑hiding scheduler, and upgrade cuDNN to 9.6.0. 2024-01-15 09:41:42 +00:00
145e60b4dd workspace: Update LLVM, XLA, StableHLO, and PJRT plugins to latest versions. 2023-12-13 10:10:32 +00:00
37725cdaa6 Update PJRT, runtime, and ZML modules to use per‑target output folders and expose profiler.dumpDataAsJson for JSON profiling output. 2023-12-04 10:38:10 +00:00
455bb3877f runtimes/cuda: obtain NCCL from the pip package, matching XLA behavior. 2023-09-20 17:41:44 +00:00
0d5389ceda Update CUDA runtime sandboxing and dynamic symbol renaming, switch to pre‑built jax‑cuda‑pjrt plugin, and bump CUDA to 12.6.2 and cuDNN to 9.5.1. 2023-09-14 13:28:25 +00:00
7d24329d0a Add Bazel build rules and runtime implementation for AWS Neuron/Trainium/Inferentia support. 2023-08-18 17:11:27 +00:00
01eff33fa0 Update workspace dependencies to newer LLVM, XLA, StableHLO, and PJRT versions and expose new pjrt plugin attribute APIs and stablehlo version APIs in build and runtime configurations. 2023-08-07 12:28:36 +00:00
54e7eb30b4 Introduce a thin abstraction layer between ZML and PJRT to manage plugin loading decisions, enable compile‑time detection of linked runtimes, and handle cases such as libtpu blocking metadata access. 2023-05-15 09:36:41 +00:00
cfe38f27ca Switch ROCm dlopen handling to patchelf's rename_dynamic_symbols for more robust dynamic symbol import. 2023-05-03 17:33:46 +00:00
833ff5f28d Upgrade PJRT CUDA Plugin to version 0.2.3, adding NCCL support for correct sharding. 2023-04-12 15:47:06 +00:00
70d40208a2 runtimes/cuda: Fix version variable definitions in the build script to enable successful CUDA builds. 2023-03-09 11:31:02 +00:00
0c126c2e12 runtimes/cuda: Upgrade CUDA to 12.6.2 and cuDNN to 9.4.0. 2023-03-03 15:17:26 +00:00
f595d22134 runtimes/rocm: Upgrade ROCm to version 6.2.2. 2023-03-01 13:15:50 +00:00
0606ea1d7c Update Bazel workspace and runtime BUILD files to newer XLA, StableHLO, and LLVM versions, enabling batching‑dims support for the gather operator. 2023-02-01 15:58:30 +00:00
266da6d4be Add initial Bazel build configuration, async runtime implementation, and core MLIR dialect definitions for ZML. 2023-01-02 14:28:25 +00:00