From Crash and Burn to a Smooth Landing: A Reinforcement Learning Journey
A chronicle of a reinforcement learning project, from initial struggles with exploding losses to a successful pixel-based agent for LunarLander.
Lead Machine Learning Engineer | Victoria, BC
A chronicle of a reinforcement learning project, from initial struggles with exploding losses to a successful pixel-based agent for LunarLander.
A very approachable jumping off point for video captioning. If you're GPU-poor (<24GB vram) this is for you.
A small contribution to the community. Adds caption-like variety samples to SSV2 dataset.
Some thoughts about the potential power of Meta's V-Jepa 2.