Loading...
Karpathy-inspired: micrograd → MLP → attention — applied to price data
Inspired by karpathy/nn-zero-to-hero, adapted to crypto. Build a one-neuron classifier from scratch, then an MLP for oversold-bounce prediction, then a tiny attention module. Ends with the hard lessons: distribution shift, leakage, and why your backtest Sharpe of 3.0 turns into 0.4 live.
Before you wire a neural net to your PnL, know the failure modes. This lesson saves you months.
Karpathy's micrograd is 150 lines that teach you every line of PyTorch. We adapt it to predict next-bar direction.
Stack three neurons + a hidden layer. Train to classify 'this oversold bar will bounce >1%' vs 'it won't'.
90% of ML-for-trading quality is features. Learn what works and what leaks the future.
Karpathy's makemore-style transformer, adapted to 'what comes next after these 20 bars?'
You trained a model. Sharpe 2.5 on test set. Now what? This quest is the gap between lab and live.