Skip to main content
OCC Flag

An official website of the United States government

August 2009

Classifying Applicants for Fair Lending Analyses - What Do the Data Have to Say? (WP 2009-4)

This publication is part of:

Collection: Economics Working Papers Archive


Testing for discrimination in mortgage lending requires classifying consumers into treatment groups and control groups. Although this may seem like a straightforward task, it is actually quite complicated. Home Mortgage Disclosure Act (HMDA) data, the primary source of data for these analyses, contain information on the ethnicity, race, and gender for both primary and co-applicants. In addition, applicants have the option of reporting up to five races. Using these detailed data to construct the standard groups, such as "Black," "Hispanic," and " White," requires subjective decisions on how to appropriately aggregate applications.

This study uses a data-driven approach to classify applications, minimizing subjectivity. Using HMDA data, as well as data from a recent examination conducted by the Office of the Comptroller of the Currency, we disaggregated applications into the most basic subsets the HMDA data allowed. Our objectives are to better understand the characteristics of applicants, analyze variation in denial rates across underlying subsets of applications, and develop a data-driven classification strategy that could be used during fair lending analyses.


Jason Dietrich