#

pytorch

PyTorch deep learning framework: tensors, autograd, and nn modules.

ProgrammingHow PyTorch backward() Propagates Gradients to Params

Technical explanation of how PyTorch autograd and backward() build the dynamic graph and accumulate gradients into linear layer weights and biases for SGD.

1 answer 1 view
ProgrammingPyTorch Dataset/DataLoader for Multivariate Time Series

Fix PyTorch Dataset and DataLoader for multivariate time series preprocessing from CSV. Ensure (B, V, L) shapes, avoid data leakage with proper scaling, and validate sliding windows for MAMBA models.

1 answer 1 view
ProgrammingDGL with PyTorch 2.8 on RTX 50 - Compatibility Guide

DGL compatibility with PyTorch 2.8 and RTX 50 GPUs: recommended DGL/PyTorch/CUDA combos, Docker, prebuilt wheels, or build from source for Blackwell support.

1 answer 1 view
ProgrammingFix OOM Qwen3-0.6B Training on A100 32k Seq Length

Diagnose why OOM hits training Qwen3-0.6B (16 heads) on A100 48GB at 32k sequence with FlashAttention 2. Correct attention matrix estimates, quick fixes like windowed attention, ZeRO-3 offload, and scaling strategies for long sequences.

1 answer 1 view