Commit Graph

6 Commits

Author SHA1 Message Date
1427286716 runtimes/neuron: fix neuron runtime
This PR fixes the neuron runtime with the following:

Proxy the PJRT Api method to enforce the client struct sizes since the
neuron PJRT plugin doesn't use `>=` but `==` to assert them, breaking
PJRT compatibility guarantees.
Fixes https://github.com/aws-neuron/aws-neuron-sdk/issues/1095

Reimplement `libneuronxla` in Zig to control neuronx-cc sandboxing and
invocation.

Implement a python bootstrapper in Zig to create a full blown
`neuronx-cc` executable, avoiding the infamous chicken and egg problem
of python executables boostrapping when sandboxed (due to fixed path
shebangs).

---------

Co-authored-by: Corentin Kerisit <corentin.kerisit@gmail.com>
2025-07-15 15:26:03 +00:00
1a2b862ec2 Add sandbox neuron dependencies: define a trampoline PJRT, create an empty repository for distroless deps, and update Bazel build files and Zig/C sources accordingly. 2025-05-19 17:35:33 +00:00
ea3ce685a9 runtimes/neuron: bump runtime version and expose nrt.h header to Zig. 2024-11-14 13:37:47 +00:00
4a0b1cce50 Update Bazel workspace and XLA overlay (MODULE.bazel, BUILD files, patches) to prevent dual LLVM builds and apply migration/bump patches. 2024-09-27 14:00:44 +00:00
5a2171793d workspace: MODULE.bazel cleanup
Title says it all !
2024-04-22 09:27:44 +00:00
7d24329d0a Add Bazel build rules and runtime implementation for AWS Neuron/Trainium/Inferentia support. 2023-08-18 17:11:27 +00:00