Random Utility with Aggregated Alternatives
Abstract
This paper studies when discrete choice data involving aggregated alternatives such as categorical data or an outside option can be rationalized by a random utility model (RUM). Aggregation introduces ambiguity in composition: the underlying alternatives may differ across individuals and remain unobserved by the analyst. We characterize the observable implications of RUMs under such ambiguity and show that they are surprisingly weak, implying only monotonicity with respect to adding aggregated alternatives and standard RUM consistency on unaggregated menus. These are insufficient to justify the use of an aggregated RUM. We identify two sufficient conditions that restore full rationalizability: non-overlapping preferences and menu-independent aggregation. Simulations show that violations of these conditions generate estimation bias, highlighting the practical importance of how aggregated alternatives are defined.