I'm a postdoc at UC Berkeley doing deep learning research in Pieter Abbeel's group. My primary focus areas are unsupervised learning and reinforcement learning. Before this, I founded a venture-backed startup (YC W17, F30<30) and before that got a PhD in theoretical physics from UChicago where I was a Bloomenthal Fellow.

Links: Twitter, Google Scholar, Email

2EgA2NGC_400x400.jpeg


Blog posts

Multi-head Attention, GPT, and BERT

Efficient Patch Extraction

Research

https://s3-us-west-2.amazonaws.com/secure.notion-static.com/c8a8a6ab-bb68-4536-bcce-7a799f095c68/curl_im.png

https://s3-us-west-2.amazonaws.com/secure.notion-static.com/b7c25432-f129-44f2-8aa7-0a11727bbe18/rad.png

https://s3-us-west-2.amazonaws.com/secure.notion-static.com/0111fc60-62df-4944-a586-72af074a18a4/atc.png

https://s3-us-west-2.amazonaws.com/secure.notion-static.com/9b22512a-6e75-4dce-af6d-cae9f9a096b7/ferm_gif.gif

https://s3-us-west-2.amazonaws.com/secure.notion-static.com/97dae42e-c02a-47d2-b4e5-7359d5ea49b6/sgm.gif

In a series of papers on representation learning (CURL, RAD, ATC) we showed that RL from pixels can be as efficient as RL from state and even learn real-robot control policies from pixels in just 30 mins of training (FERM). These days I work on self-supervised exploration and skill extraction.


Pre-prints

* indicates equal contribution

Hierarchical Few-Shot Imitation with Skill Transition Models Kourosh Hakhamaneshi*, Ruihan Zhao*, Albert Zhan*, Pieter Abbeel, Michael Laskin, 2021

Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL, Catherine Cang, Aravind Rajeswaran, Pieter Abbeel, Michael Laskin, 2021

A Framework for Efficient Robotic Manipulation Albert Zhan*, Philip Zhao*, Lerrel Pinto, Pieter Abbeel, Michael Laskin, 2021


Publications

URLB: Unsupervised Reinforcement Learning Benchmark Michael Laskin*, Denis Yarats*, Hao Liu, Kimin Lee, Albert Zhan, Kevin Lu, Catherine Cang, Lerrel Pinto, Pieter Abbeel, NeurIPS, 2021

Reinforcement Learning with Latent Flow Wenling Shang*, Xiaofei Wang*, Aravind Srinivas, Aravind Rajeswaran, Yang Gao, Pieter Abbeel, Michael Laskin, NeurIPS, 2021

Decision Transformer: Reinforcement Learning via Sequence Modeling Lili Chen, Kevin Lu, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas, Igor Mordatch, NeurIPS, 2021

Hosted at Hostnotion – custom domains for Notion