Reject Inference and Underwriting: A Deep Dive

by Julie.JLee@experian.com 5 min read September 17, 2024

In the lending world, making precise underwriting decisions is key to minimizing risks and optimizing returns. One valuable yet often overlooked technique that can significantly enhance your credit underwriting process is reject inferencing. This blog post offers insights into what reject inference is, how it can improve underwriting, and various reject inference methods.

What is reject inference?

Reject inference is a statistical method used to predict the potential performance of applicants who were rejected for a loan or credit — or approved but did not book. In essence, it helps lenders and financial institutions gauge how rejected or non-booked applicants might have performed had they been accepted or booked. By incorporating reject inference, you gain a more comprehensive view of the applicant pool, which leads to more informed underwriting decisions.

Utilizing reject inference helps reduce biases in your models, as decisions are based on a complete set of data, including those who were initially rejected. This technique is crucial for refining credit risk models, leading to more accurate predictions and improved financial outcomes.

How can reject inference enhance underwriting?

Incorporating reject inference into your underwriting process offers several advantages:

  1. Identifying high-potential customers: By understanding the potential behavior of rejected applicants, you can uncover high-potential customers who might have been overlooked before.
  1. Improved risk assessment: Considering the full spectrum of applicants provides a clearer picture of the overall risk landscape, allowing for more informed lending decisions. This can help reduce default rates and enhance portfolio performance.
  1. Optimizing credit decisioning models: Including inferred data from rejected and non-booked applicants makes your credit scoring models more representative of the entire applicant population. This results in more robust and reliable predictions.

Techniques in reject inference

Several techniques are employed in reject inference, each with unique strengths and applications. Understanding these techniques is crucial for effectively implementing reject inference in your underwriting process. Let’s discuss three commonly used techniques:

  • Parceling: This technique involves segmenting rejected applicants based on their characteristics and behaviors, creating a more detailed view of the applicant pool for more precise predictions.
  • Augmentation: This method adds inferred data to the dataset of approved applicants, producing a more comprehensive model that includes both approved and inferred rejected applicants, leading to better predictions.
  • Reweighting: This technique adjusts the weights of approved applicants to reflect the characteristics of rejected applicants, minimizing bias towards the approved applicants and improving prediction accuracy.

Pre-diction method

The pre-diction method is a common approach in reject inference that uses data collected at the time of application to predict the performance of rejected applicants. The advantage of this method is its reliance on real-time data, making it highly relevant and current.

For example, pre-diction data can include credit bureau attributes from the time of application. This method helps develop a model that predicts the outcomes of rejected applicants based on performance data from approved applicants. However, it may not capture long-term trends and could be less effective for applicants with unique characteristics.

Post-diction method

The post-diction method uses data collected after the performance window to predict the performance of rejected applicants. Leveraging historical data, this method is ideal for capturing long-term trends and behaviors.

Post-diction data may include credit bureau attributes from the end of the performance window. This method helps develop a model based on historical performance data, which is beneficial for applicants with unique characteristics and can lead to higher performance metrics. However, it may be less timely and require more complex data processing compared to pre-diction.

Enhancing reject inference design for better classification

To optimize your reject inference design, focus on creating a model that accurately classifies the performance of rejected and non-booked applicants. Utilize a combination of pre-diction and post-diction data to capture both real-time and historical trends.

Start by developing a parceling model using pre-diction data, such as credit bureau attributes from the time of application, to predict rejected applicants’ outcomes. Regularly update your model with the latest data to maintain its relevance.

Next, incorporate post-diction data, including attributes from the end of the performance window, to capture long-term trends. Combining both data types will result in a more comprehensive model.

Consider leveraging advanced analytics techniques like machine learning and artificial intelligence to refine your model further, identifying hidden patterns and relationships for more accurate predictions.

How Experian can assist with reject inference

Reject inference is a powerful tool for enhancing your underwriting process. By predicting the potential performance of rejected and non-booked applicants, you can make more inclusive and accurate decisions, leading to improved risk assessment and optimized credit scoring models.

Experian offers various services and solutions to help financial institutions and lenders effectively implement reject inference into their decisioning strategy. Our solutions include comprehensive and high-quality datasets, which empower you to build models that are more representative of the entire applicant population. Additionally, our advanced analytics tools simplify data analysis and model development, enabling you to implement reject inference efficiently without extensive technical expertise.

Ready to elevate your underwriting process? Contact us today to learn more about our suite of advanced analytics solutions or hear what our experts have to say in this webinar

This article includes content created by an AI language model and is intended to provide general information.

Related Posts

Ask the Expert: A Closer Look at Modern Lending with Jeff Hops and Erin Haselkorn

In this first episode of Ask the Expert, Experian's Jeff Hops, Senior Director of Data Platform and Product, and Erin Haselkorn, Senior Director of Analyst Relations, explore how broader data and new signals can help lenders better understand today’s consumers, while maintaining responsible decisioning. Lending is changing  Interest rates, regulation, embedded finance and AI are reshaping the lending landscape. Consumer behavior is evolving just as quickly. But the core job hasn’t changed. Lenders are still making decisions about people they don’t fully know, and that makes data more important than ever. "There are periods where nothing changes, and periods where it seems like everything changes. We’re in the latter … but the core premise hasn’t changed. You’re still trying to lend to somebody you don’t know."Jeff Hops, Senior Director of Data Platform and Product To make those decisions with confidence, lenders need a strong foundation of identity, history and reliable signals. In a period of rapid change, the quality and completeness of that data become even more critical. A more complex view of today’s consumer What has changed is the consumer. Traditional credit data is foundational but can be further enhanced with visibility on how people earn, manage and move money. Income may come from multiple sources, and financial activity often spans bank accounts, applications (apps) and digital channels. Cash flow data, for example, can provide a clearer view of what’s actually coming into a consumer’s account, beyond what traditional records may show.These additional signals can help lenders better understand: Income variability across multiple earning sources Current financial behavior through cash flow activity Digital and identity-linked activity across channels These signals don’t replace traditional data; they expand it. The result is a more complete and current view of the consumer. From exploration to real-world application The conversation around broader data signals has moved beyond theory. Lenders are no longer just asking whether these signals are useful. They’re asking where, how and under what governance they can be applied across the lending lifecycle. Lenders are actively researching, testing and implementing new data sources across the lending lifecycle. What was once experimental is now operational. Institutions are progressing through a clear path: Research Understanding available signals and use cases Testing Evaluating performance in controlled environments Implementation Applying insights in production Today, alternative data is being used in areas like analytics, channel scoring and decisioning, often within governed environments that allow for safe testing and validation. AI may accelerate this shift by helping institutions identify patterns at scale, but its value depends on the strength of the underlying data: quality, governance, context and clear business use cases. More signal, more responsibility As data availability expands, lenders have access to more granular insights than ever before. That creates opportunity, but also responsibility. The institutions that lead won’t be the ones that use the most data. They’ll be the ones that know which signals to use, how to validate them and how to apply them in ways that are fair, explainable and aligned to consumer outcomes. “Institutions can unlock more granular and powerful decisions, but they have to do it responsibly.”Erin Haselkorn, Senior Director, Analyst Relations The future of lending will be shaped not just by how much data is available, but by how thoughtfully it’s applied. Keeping the consumer at the center of decisioning is essential to building trust and long-term success. Explore alternative data with us A more complete understanding of today’s consumers starts with better data. We help lenders responsibly incorporate broader data signals and advanced analytics into decisioning strategies, enhancing visibility into today’s consumers while strengthening risk assessment and expanding access to credit. Let’s work together to build more confident, more responsible lending decisions. Learn more Contact us About our experts Jeff Hops Senior Director, Data Platform and Product, Experian Jeff Hops is a Senior Director in Experian’s Financial Services and Data business with over eight years of experience driving innovation in credit and data solutions. He has led product development for Experian’s Credit Report and played a key role in launching Ascend Identity Platform™, a leading identity resolution platform. Erin Haselkorn Senior Director, Analyst Relations, Experian Erin Haselkorn is responsible for analyst relations for Experian. She has developed an understanding of key marketing trends across a broad range of verticals. Her market research around data strategy, AI, fraud, identity and data management, paired with her broad Experian product knowledge, gives her a unique understanding of business automation and data trends. Erin is a frequent spokesperson and guest blogger.

Published: June 22, 2026 by Julie.JLee@experian.com
How Consumer Vehicle Choices Are Shaping Automotive Loan Trends

Conversations about rising auto loan balances and higher monthly payments has often centered around increasing vehicle prices and elevated interest rates; and while those factors have undoubtedly played a role, another important piece of the puzzle is the type of vehicles consumers are choosing to purchase. According to Experian’s Automotive Consumer Trends Report: Q1 2026, consumers are continuing to opt for SUVs over other vehicle types, a trend that may be contributing to higher average loan amounts and monthly payments. SUVs accounted for 63.5% of all new retail vehicle registrations over the last 12 months, up from 62.8% a year ago. Additionally, more than 117 million SUVs were in operation across the United States in the first quarter of 2026, making up 42.2% of the market share. At the same time, traditional passenger cars continue to fall in share, coming in at 16.5%, a decrease from 18.4% last year. As consumers increasingly gravitate towards the larger vehicle segment, it reflects the ongoing desire for versatility, cargo capacity, and family-friendly functionality. Electrification’s growing role in consumer purchasing behavior Interestingly, electrified SUVs continue to gain traction, representing 27.7% of all new SUV registrations, these vehicles include battery-electric, hybrids, plug-in hybrids, and other alternative fuel types. Diving a bit deeper, the Tesla Model Y was the market share leader for new, retail electrified SUV registrations in the last 12 months, coming in at 15.8%. Rounding out the top five were Honda CR-V (9.6%), Toyota RAV4 (7.2%), Chevrolet Trax (7.2%), and Toyota Grand Highlander (3.4%). As model availability and familiarity with the electrification segment grows, the broader adoption of these vehicles are playing an increasingly important role in vehicle pricing and overall consumer demand. While average loan amounts and monthly payments are being driven by a combination of factors such as financing costs and consumer purchasing behavior, data in Q1 2026 demonstrates the continued interest in SUVs. This suggests that the industry’s shift toward larger vehicles is likely playing a meaningful role in today’s financing environment. To learn more about SUV insights, view the full Automotive Consumer Trends Report: Q1 2026 presentation.

Published: June 17, 2026 by Kirsten Von Busch
When New Data Impacts MBS Pricing: Student Loan Debt

In our previous post, we described the Current Second Lien Balance field, which is one of over 2,000 fields in the new Experian Mortgage Loan Performance (MLP) dataset. We showed that the Current Second Lien Balance field meets our three-pronged materiality standard for new data delivery: New: Provides information not available in existing datasets (i.e., orthogonal to currently available data). Material: Impacts a sizeable portion of the MBS universe. Significant: Differentiates collateral performance by a large enough margin to influence trading and risk management decisions. In this article, we discuss another field that satisfies the above criteria: Student Loan Balance.  We evaluate this field in the context of these criteria. First, however, we provide a summary of the MLP dataset and how it compares to standard GSE loan-level data available today. Standard GSE Data vs. Experian Mortgage Loan Performance (MLP) Data The MLP dataset contains thousands of fields describing mortgage performance from each borrower, loan, and property perspective, all refreshed monthly (including, amongst other things, new credit scores and refinance inquiry activity, loan performance, filed junior liens, and AVM values).  MLP differs from loan-level data provided by Freddie Mac, Fannie Mae, and Ginnie Mae, which the vast majority of market participants solely rely on, in a number of ways: Standard data provided by the GSEs and GNMA does not contain all the information necessary for accurate forecasting of mortgage prepayment and credit performance. Basic, critical fields like borrower’s current credit score and current junior liens on the property are missing. The new Mortgage Loan Performance (MLP) dataset from Experian contains borrower, loan, and property data fields covering the entire mortgage universe, including Agency, Non-Agency, and Esoteric mortgage products (CES, HELOC, Reverse), both securitized and non-securitized. MLP enables full three-dimensional (borrower + loan + property) tracking with persistent keys for borrower (before and after refinancing), loan (in securities/deals even after exit due to payoffs or buyouts, including before and after MSR sales), and property.  This enables end-to-end analysis of each borrower’s (and property’s) mortgage experience throughout their credit lifecycle. New, Material and Significant Field:  Student Loan Debt MLP contains a number of fields describing each mortgage borrower’s student debt load, including amounts in repayment, forbearance and collections; estimated interest rate, time remaining until forbearance expiration, and more. In the interest of simplicity, for this article we’ll focus on a single student loan-related field within MLP: Student Loans Balance, which is defined as the total balance on open non-deferred student trades reported in the last 3 months. Is Information Regarding Student Loans New to Markets? Standard loan-level data disclosed by the GSEs and GNMA contain no student-loan-specific fields. Theoretically, fields related to DTI at origination might capture some aspect of student loan debt. So, in the best-case scenario for an investor relying solely on standard disclosure, a DTI value as of origination is provided -- yet is never updated as the loan seasons and the borrower’s debt and income change (see more here).  But in the case of federal student loan debt attached to mortgages originated from early 2020 to late 2023, the level of detail provided by disclosure may be even more unknown due to COVID-era repayment and reporting moratoriums. The student loan repayment moratorium was a temporary federal policy that paused required payments, set interest rates to 0%, and suspended collections on most federally-held student loans. The moratorium began in March 2020, with payments resuming in October 2023, making it approximately 3.5 years in duration—the longest consumer credit payment pause in U.S. history. (Source: NCUA ) During the moratorium, student loan-related debt loads may have been understated as federal loans were in a temporary state of $0 repayment.  As an alternative to leaving student loan debt completely out of DTI calculations, an imputed payment equal to only 0.50% of the outstanding balance was often used as a placeholder for a borrower’s DTI calculation. As the Wall Street Journal recently reported, a new report from the Federal Reserve of New York shows a rise in student loan default rates by age group.  Student loan delinquencies have been broadly trending higher across all age groups.  Also, the average age of a borrower in default has risen to 40, and borrowers age 50 and older are now at a higher risk of default than younger groups. This 40 to 50-year-old age group represents prime home ownership years. Defaulted borrowers are also struggling to make other debt payments, too. The same report stated that almost 40% of past due student loan borrowers with auto loans are past due, 56% have at least one credit card past due, and 20% have a past due mortgage.  In addition to increased delinquency risk on their mortgage, borrowers with student loan debt also have fewer mortgage refinance options, as their elevated DTI may prevent qualification for a refinance, or increase the offered rate of a refinance and thereby reduce their incentive.  These dampening effects of student loan debt on mortgage CPR are clearly evident in the data, as described further below. Standard data only reports information related to the primary mortgage and does not include any details on the borrower’s other debts, with the exception of DTI at origination, which is never updated throughout the life of the loan. In contrast, MLP provides a comprehensive view of the borrower’s full credit profile, including other obligations such as credit cards, mortgages on other properties, student loan balances, and much more. Is Student Loan debt material to the residential mortgage market? Approximately $11 trillion of residential mortgage loans were originated during the student loan payment moratorium (Source: Experian MLP Dataset), a period marked by historically low mortgage rates during the COVID era.  As discussed above, DTI data contained in standard market disclosure may be particularly inaccurate for these loans.   As the Wall Street Journal recently reported, a new report from the Federal Reserve of New York shows a rise in student loan default rates by age group.  Student loan delinquencies have been broadly trending higher across all age groups.  Also, the average age of a borrower in default has risen to 40, and borrowers aged 50 and older are now at a higher risk of default than younger groups. This 40 to 50-year-old age group represents prime home ownership years.  Defaulted borrowers are also struggling to make other debt payments, too.   The same report stated that almost 40% of past due student loan borrowers with auto loans are past due, 56% have at least one credit card past due, and 20% have a past due mortgage.  In addition to increased delinquency risk on their mortgage, borrowers with student loan debt also have fewer mortgage refinance options, as their elevated DTI may prevent qualification for a refinance, or increase the offered rate of a refinance and thereby reduce their incentive.  These dampening effects of student loan debt on mortgage CPR are clearly evident in the data, as described further below.  Of today’s $13 trillion in outstanding mortgage debt, more than 10% of that debt ($1.5 trillion) is associated with borrowers who carry student loan debt.  For these borrowers, the average amount of student loan debt outstanding is approximately $50,000, versus a mortgage balance of approximately ~$289,000. In other words, the average student loan debt balance is almost 20% of the mortgage balance for the average borrower who carries both. For this set of borrowers, the average monthly payment is approximately $400 for student loan vs. approximately $2,200 for 1st lien mortgage—so that monthly student loan payments are a significant debt load, approximately 20% of the monthly mortgage payment.  (Source:  Experian MLP Dataset)  Is the effect of student loan debt a significant driver of performance? Figure 1 illustrates prepayments by student loan balance for a sample of loans drawn from MLP. The chart illustrates that borrowers with larger student loan balances prepay much more slowly, likely because some are effectively locked out of refinancing once student loan payments resume due to elevated DTI. The debt-to-income (DTI) ratio calculated using actual student loan payments may be significantly higher than the DTI calculated during the moratorium, in some cases exceeding GSE eligibility thresholds. As illustrated in Figure 1, for in-the-money (ITM) collateral, the differential between loans with material student loan balances (greater than $200,000) and loans with no student debt can reach up to 5 CPR. Notably, even for out-of-the-money (OTM) collateral, loans with student debt prepay 1 to 3 CPR slower, likely reflecting reduced mobility due to tighter financing constraints when purchasing a new home. Pools with otherwise similar prepayment characteristics may exhibit different prepayment behavior depending on the distribution of student loan exposure within their collateral. In addition, because loans with student debt tend to prepay more slowly, this effect increases over time due to burnout: loans without student debt prepay and exit the pools more quickly, leaving a higher concentration of slower-paying loans behind.  Given that 10% of the $13 trillion outstanding mortgage market is associated with borrowers who have student loans (Source:  Experian MLP dataset)—and that student loans have a meaningful impact on prepayments—many pools issued between March 2020 and October 2023 may be subject to this student loan debt CPR throttle, and therefore mispriced by investors relying exclusively on standard market data. Fig 1. Prepayment S-Curve: Student Loans Balance Source:  Experian MLP dataset hosted on IVolatility Data-Driven Platform _____________________________________________________ Michael Pyatski advises MBS traders, portfolio managers, quants, risk managers, loan originators, and technology professionals on making informed, data-driven business decisions that drive revenue growth, enhance risk management, and reduce trading costs. With more than 15 years of experience as an Agency RMBS trader—including serving as Head of the Proprietary Trading Desk at BNP Paribas—Michael developed and successfully implemented relative-value, data-driven profitable trading strategies to capture market opportunities embedded in data but not fully priced by the market. His trading experience, combined with a Ph.D. in econometrics, led him to found the Data-Driven Portal (https://datadrivenportal.com/), a platform that provides advanced technology for MBS trading and risk management. The platform’s No-Model Data-Driven technology leverages big data, econometric analysis, and AI to help traders identify relative-value opportunities in RMBS markets and generate above-market, risk-adjusted returns. _____________________________________________________

Published: June 17, 2026 by Perry DeFelice, Michael Pyatski