Isn’t the Zip Code Level Good Enough—Why Look at More Granular Housing Market Data?

by Guest Contributor, October 7, 2011

By: John Straka

For many purposes, national home-price averages, MSA figures, or even zip code data cannot adequately gauge local housing markets. The higher the level of the aggregate, the less it reflects the true variety and constant change in prices and conditions across local neighborhood home markets. Financial institutions, investors, and regulators that seek out and learn how to use local housing market data will generally be much closer to true housing markets.

When houses are not good substitutes from the viewpoint of most market participants, they are not part of the same housing market.  Different sizes and types and ages of homes, for example, may be in the same county, zip code, block, or even right next door to each other, but they are generally not in the same housing market when they are not good substitutes.  This highlights the importance of starting with detailed granular information on local-neighborhood home markets and homes. 

To be sure, greater granularity in neighborhood home-market evaluation requires analysts and modelers to deal with much more data on literally hundreds of thousands of neighborhoods in the U.S. It is fair to ask if zip-code level data, for example, might not be generally sufficient. Most housing analysts and portfolio modelers, in fact, have traditionally assumed this, believing that reasonable insights can be gleaned from zip code, county-level, or even MSA data. But this is fully adequate, strictly speaking, only if neighborhood home markets and outcomes are homogeneous, or at least reasonably so, within the level of aggregation used. Unfortunately, even at the zip-code level, the data suggest otherwise.

Examples

All of the home-price and home-valuation data for this report was supplied by Collateral Analytics. I have focused on zip7s (i.e., zip+2s), which are a more granular neighborhood measure than zip codes. A Hodrick-Prescott (H-P) Filter was applied by Collateral Analytics to the raw home-price data in order to attenuate short-term variation and isolate the six-year trends. But as we’ll see, this dampening still leaves an unrealistically high range of variation within zip codes, for reasons discussed below. Fortunately, there is an easy way to control for this, which we’ll apply for final estimates of the range of within-zip variation in home-price outcomes.
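As a rough illustration of the smoothing step, the sketch below computes a textbook Hodrick-Prescott trend in plain Python. It is not Collateral Analytics' implementation. The function name `hp_trend`, the smoothing parameter `lamb=1600` (a conventional choice for quarterly data) and the sample price series are all assumptions made for this example.

```python
def hp_trend(series, lamb=1600.0):
    """Hodrick-Prescott trend: solve (I + lamb * D'D) trend = series,
    where D is the second-difference operator."""
    n = len(series)
    if n < 3:
        return list(series)  # too short to have any second differences
    # Build A = I + lamb * D'D from the (1, -2, 1) second-difference stencil.
    a = [[0.0] * n for _ in range(n)]
    for i in range(n):
        a[i][i] = 1.0
    stencil = (1.0, -2.0, 1.0)
    for r in range(n - 2):  # row r of D touches columns r, r+1, r+2
        for p in range(3):
            for q in range(3):
                a[r + p][r + q] += lamb * stencil[p] * stencil[q]
    # Gaussian elimination with partial pivoting.
    b = [float(v) for v in series]
    for col in range(n):
        pivot = max(range(col, n), key=lambda k: abs(a[k][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, n):
            f = a[row][col] / a[col][col]
            if f == 0.0:
                continue
            for k in range(col, n):
                a[row][k] -= f * a[col][k]
            b[row] -= f * b[col]
    # Back substitution.
    trend = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = b[i] - sum(a[i][k] * trend[k] for k in range(i + 1, n))
        trend[i] = s / a[i][i]
    return trend


# Made-up price-per-square-foot series for one imaginary zip7 (NOT real data).
raw = [310, 322, 305, 298, 280, 291, 262, 255, 240, 248]
smooth = hp_trend(raw)
pct_change = 100.0 * (smooth[-1] - smooth[0]) / smooth[0]
print([round(v, 1) for v in smooth], round(pct_change, 1))
```

A larger `lamb` gives a smoother trend. A perfectly linear series passes through unchanged, because its second differences are already zero.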

The three charts below show the H-P filtered 2005-2011 percent changes in home price per square foot of living area within three different types of zip codes in San Diego County. Within the first type of zip code, 92139 in this case, the home-price changes in recent years have been relatively homogeneous, with a range of -56% to -40% across the zip7s (i.e., zip+2s) in 92139. But the second type of zip code, illustrated by 92078, is more typical. In this type of case the home-price changes across the zip7s have varied much more. The 2005-2011 zip7 percent changes in home prices within 92078 have varied by over 40 percentage points, from -51% to -10%. In the third type of zip code, less frequent but surprisingly common, the home-price changes across the zip7s have had a truly remarkable range of variation. This is illustrated here by zip code 92024, in which the home-price outcomes have varied from -51% to +21%, or a 71 percentage point range of difference. And this is not the zip code with the maximum range of variation observed!
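The within-zip range used here is simply the gap between the best and worst zip7 outcome inside each five-digit zip code. A minimal sketch of that calculation follows. It uses only the rounded extremes quoted above, since the zip7 values in between are not reported. The function name `ranges_by_zip` and the `-lo`/`-hi` key suffixes are placeholders, not real zip+2 identifiers.

```python
from collections import defaultdict


def ranges_by_zip(zip7_pct_change):
    """Group zip7 percent changes by their five-digit zip code and return
    the max-minus-min spread, in percentage points, for each zip code."""
    groups = defaultdict(list)
    for zip7, change in zip7_pct_change.items():
        groups[zip7[:5]].append(change)
    return {z: max(v) - min(v) for z, v in groups.items()}


# Rounded extremes quoted in the text; suffixes are placeholders only.
quoted_extremes = {
    "92139-lo": -56, "92139-hi": -40,
    "92078-lo": -51, "92078-hi": -10,
    "92024-lo": -51, "92024-hi": 21,
}

for zip_code, spread in ranges_by_zip(quoted_extremes).items():
    print(f"{zip_code}: {spread} percentage points")
```

With these rounded endpoints the spreads come out at 16, 41 and 72 points. The 71-point figure quoted for 92024 presumably reflects the unrounded values.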

House Price Trends - ZIP 92139
House Price Trends - ZIP 92078
House Price Trends - ZIP 92024

All of the San Diego County zip codes are summarized in the bar chart below. Nearly two-thirds of the zip codes, 65%, have a within-zip difference of more than 30 percentage points in the 2005-2011 zip7 percent changes in home prices. 40% have more than a 40 percentage point range of different home-price outcomes, 23% have more than a 50 percentage point range, and 13% have more than a 70 percentage point range of differences. The typical range of zip7 differences within a zip code is 37 percentage points at the median and 41 at the mean. These numbers are surprising, and are most likely unrealistically high.

Summary of Within-Zip (Zip+2 level) Ranges of Variation in Home-Price Changes in San Diego: Percentage of Zips by Range Across Zip+2s in Home Price/Living Area %Change 2005-2011

House Price Changes
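The summary figures in this section (the share of zip codes above each threshold, plus the median and mean range) can be reproduced from a list of within-zip ranges along the following lines. This is only a sketch: the helper names are hypothetical, and the ten ranges are invented for illustration. They are not the San Diego data.

```python
from statistics import mean, median


def share_above(ranges, threshold):
    """Percent of zip codes whose within-zip range exceeds `threshold` points."""
    return 100.0 * sum(r > threshold for r in ranges) / len(ranges)


def summarize(ranges, thresholds=(30, 40, 50, 60, 70)):
    """Median, mean and threshold shares for a list of within-zip ranges."""
    return {
        "median": median(ranges),
        "mean": mean(ranges),
        "share_above": {t: share_above(ranges, t) for t in thresholds},
    }


# Invented ranges for ten imaginary zip codes (NOT the San Diego data).
example_ranges = [12, 18, 25, 31, 33, 38, 42, 47, 55, 72]
summary = summarize(example_ranges)
print(summary["median"], summary["mean"])
for threshold, share in summary["share_above"].items():
    print(f"more than {threshold} points: {share:.0f}% of zips")
```

The shares are cumulative: a zip code with a 72-point range counts toward every threshold below 72.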

Controlling for Factors Inflating the Range of Variation

Such sizable differences within a typical single zip code clearly suggest materially different neighborhood home markets. While this qualitative conclusion is supported further below, the magnitudes of the within-zip variation in home-price changes shown above are quite likely inflated. A limited number of observations in various zip7s tends to create statistical “noise” outliers, and the inclusion of distressed property sales here can create further outliers. Both limited observations and distress sales are particularly capable of creating more negative outliers that are not representative of the true price changes for most homes or of their true range of variation within zip codes. (My earlier blog on June 29th discussed the biases from including distressed property sales while trying to gauge general price trends for most properties.)

Fortunately, I’ve been able to access a very convenient way to control for these factors by using the zip7 averages of Collateral Analytics’ AVM (Automated Valuation Model) values rather than simply the home price data summarized above. These industry-leading AVM home valuations have been designed, in part, to filter out statistical noise problems. 

The bar chart below shows the still significant zip7 ranges within San Diego County zip codes using the AVM values, but the distribution is now shifted considerably, and more realistically, to a much smaller share of the zip codes with remarkably high zip7 variation. Compared with the chart above, now just 1% of the zips have a zip7 range greater than 60 percentage points, 5% greater than 50, and 11% greater than 40, but there are still 36% greater than 30.

To be sure, this distribution and the average range of zip7 differences (now a 25 percentage-point median and a 26 percentage-point mean) do show a considerable range of local home market variation within zip codes. It seems fair to conclude that the typical zip code does not contain the uniformity in home price outcomes that most housing analysts and modelers have tended to simply assume. The difference between the effects on consumer wealth and behavior of a 10% home price decline, for example, vs. a 35 to 50% decline, would seem to be sizable in most cases. This kind of difference within a zip code is not at all unusual in these data.

Automated Valuation Model Price - San Diego County

How About a Different Type of Urban Area—More Uniform?

It might be thought that the diversity of topography, etc., across San Diego County (from the sea to the mountains) makes its variation of home market outcomes within zip codes unusually high. To take a quick gauge of this hypothesis, let’s look at a more topographically uniform urban area: Columbus, Ohio.

When I informally polled some of my colleagues asking what their prior belief would be about the within-zip code variation in home price outcomes in Columbus vs. San Diego County, there was unanimous agreement with my prior belief. We all expected greater within-zip uniformity in Columbus. I find it interesting to report here that we were wrong.

Both the H-P filtered raw home-price information and the AVM values from Collateral Analytics show relatively greater zip7 variation within Columbus (Franklin County) zip codes than in San Diego County. 

The bar chart below shows the best-filtered, most attenuated results: the AVM values. 5% of the Columbus zips have a zip7 range greater than 70 percentage points, 8% greater than 60, 23% greater than 50, 35% greater than 40, and 65% greater than 30. The average range of zip7 differences within Columbus zip codes is 35 percentage points at the median and 38 at the mean.
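To put the two counties side by side, the short sketch below tabulates the AVM-based figures quoted in this post and takes the Columbus-minus-San-Diego gap for each. The dictionary layout and the `compare` helper are assumptions for illustration. `None` marks a figure the text does not report.

```python
# AVM-based figures as quoted in the text (percentage points / percent of zips).
san_diego = {"median": 25, "mean": 26,
             "share_above": {30: 36, 40: 11, 50: 5, 60: 1, 70: None}}
columbus = {"median": 35, "mean": 38,
            "share_above": {30: 65, 40: 35, 50: 23, 60: 8, 70: 5}}


def gap(a, b):
    """Difference a - b, or None when either figure is not reported."""
    return None if a is None or b is None else a - b


def compare(first, second):
    """Gap (first minus second) for the median, mean and each threshold share."""
    rows = {"median": gap(first["median"], second["median"]),
            "mean": gap(first["mean"], second["mean"])}
    for t in sorted(first["share_above"]):
        rows[f">{t}"] = gap(first["share_above"][t], second["share_above"].get(t))
    return rows


for label, diff in compare(columbus, san_diego).items():
    print(f"{label:>6}: {'n/a' if diff is None else f'{diff:+d}'}")
```

Every reported gap is positive, which matches the finding that within-zip variation is greater in Columbus than in San Diego County.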

Conclusion

These data seem consistent with what experienced appraisers and real estate agents have been trying to tell economists and other housing analysts, investors, financial institutions, and policymakers for quite a long time. Although they have quite reasonable uses for aggregate time-series and forecasting purposes, models built on more aggregated data miss a lot of the very real and material variation in local neighborhood housing markets. For home valuation and many other purposes, even models that use data down to the zip code level of aggregation, which most analysts have assumed to be sufficiently disaggregated, are not really good enough. These models are not as good as they can or should be.

These facts are indicative of the greater challenge to properly define local housing markets empirically, in such a way that better data, models, and analytics can be more rapidly developed and deployed for greater profitability, and for sooner and more sustainable housing market recoveries.

I thank Michael Sklarz for providing the data for this report and for comments, and I thank Stacy Schulman for assistance in this post.
