Biography
I am a machine learning research scientist at the Toyota Research Institute, where I work on generative models, with a focus on efficiency and geometric grounding. I have contributed to large-scale training infrastructure and open-source efforts, our open language model training library (openlm) and pre-training of models up to 7B (including TRI-ML/mamba-7b). I also advise a number of research productization efforts. During my PhD at Toyota Technological Institute at Chicago, I worked with Greg Shakhnarovich and Matt Walter, focusing on geometric 3D vision and self-supervised learning.