Job description
- Conduct detailed failure analysis on large multimodal models and machine learning models
- Prototype novel evaluation and benchmark methods informed by research in the foundation model literature
- Develop tools for analyzing and visualizing data
- Design and implement experiments (DOE) for engineering studies and large-scale user studies
- Facilitate and support data collection and analysis in collaboration with other groups
- Contribute to defining feature specifications and the anticipated user experience based on data insights