- 🔭 I’m currently working on: Scaling models, post-training and agents at Google DeepMind
- 🌱 I’m currently learning: post-training techniques, GPU kernels, and ML training parallelization approaches
- 💬 Ask me about: machine learning, LLMs, computer vision, startups
- 📫 How to reach me: Twitter, Email
Pinned
- awesome-mlss/awesome-mlss (Public): 🤖 Machine Learning Summer School Guide
- safeguarding-llms (Public): TMLS 2024 Workshop: A Practitioner's Guide To Safeguarding Your LLM Applications
- mla-pytorch (Public): minimal PyTorch implementation of DeepSeek's Multi-Head Latent Attention + benchmarks (Python · 4)




