//! KernelLowering: wraps scalar TIR programs into GPU compute kernels.
//!
//! Each GPU target implements `KernelLowering` to emit kernel source
//! that runs N instances of the same Trident program in parallel.
//! The program itself stays scalar; parallelism is across instances,
//! not within a single execution.
//!
//! This is the data-parallel counterpart of:
//! - `tir::lower::StackLowering`: stack targets → assembly text
//! - `lir::lower::RegisterLowering`: register targets → machine code

use crate::TIROp;

/// Lowers TIR operations into a GPU compute kernel (source text).
///
/// The kernel wraps one Trident program for batch execution:
/// each GPU thread runs one instance with its own inputs/outputs.
/// Create a kernel-lowering backend for the given target name.