PhD Thesis Proposal: Mateusz Nowak

PhD Thesis Proposal: Mateusz Nowak

PhD Thesis Proposal: Mateusz Nowak

by
5 5 people viewed this event.

“Less can be More and More can be Less: Strategic Data Utilization and The Consequences of Data Overload”

ZOOM LINK Abstract: While machine learning has seen massive gains through large-scale data ingestion, the relationship between data volume, network architecture, and model reasoning remains underexplored. This thesis investigates the limitations of unstructured data scaling and demonstrates how targeted network design can enhance reasoning capabilities more effectively than sheer data volume. The research is structured into two main parts. First, we propose architectural modifications that allow models to better exploit intrinsic data structures without increasing dataset size. We demonstrate that tailoring architectures to the underlying data structure significantly improves performance on complex tasks, such as 3D reconstruction and world modeling. Second, we analyze the failure modes of high-capacity pre-trained models, quantifying how data overload degrades multiple-choice question understanding. We further examine how varying conceptual representations impact model performance across image and text modalities, focusing on image classification and sentiment analysis. Ultimately, this work provides a comprehensive framework for optimizing machine learning models through targeted augmentation and data-aware design, offering a structured evaluation of when ‘less can be more’ and ‘more can be less’ in the context of data scaling. Thesis Committee: Prof. Peter Chin (Chair), Prof. Xiaoyao Fan, Prof. Geoffrey P. Luke

Sponsored by: Thayer School of Engineering

To register for this event please visit the following URL: https://home.dartmouth.edu/events/event?event=81522 →

 

Date And Time

May 13, 2026 @ 01:00 PM
 

Event Category

 
Download Events iCal
Add to Calendar

Share With Friends


Subscribe for event notifcations!
Subscribe to Event Notifications Categories