MDT's Transparent Decision Trees Drive All-Weather Quantitative Returns

Original Title: Daniel Mahr – Glass Box Quant at MDT Advisers (EP.472)

TL;DR

  • MDT's "glass box" decision tree framework, developed since 2001, blends machine learning sophistication with transparency, allowing for understanding of model drivers and avoiding opaque "black box" quant approaches.
  • The "company age" factor, unusual in quantitative models, provides context for other factors, enabling differentiated questions for young versus established companies, particularly regarding valuation importance.
  • MDT's quantitative strategy prioritizes analytical edge over informational edge by leveraging diversification and avoiding reliance on predicting market environments, aiming for all-weather portfolio returns.
  • The "glass box" approach allows MDT to identify and address human emotional biases, such as overriding trades on distressed stocks, which even quantitative investors struggle to overcome.
  • Overfitting and underfitting are critical pitfalls in machine learning for finance; MDT balances model complexity by using a forest of shallow decision trees rather than deep, data-sparse ones.
  • MDT's rigorous research process, driven by observations from over 30 years of strategy execution and academic literature, informs factor selection and model enhancements.
  • The portfolio optimizer balances alpha forecasts, risk management through hard constraints and statistical models, and trading costs, including market impact for less liquid securities.

Deep Dive

Daniel Mahr's early path into investing began with a fascination for numbers, evident from his childhood habit of poring over numerical data in newspapers. This interest evolved during his college years at Harvard in 1998, where he identified an opportunity in the dot-com bubble by flipping IPOs. He managed to secure allocations to these rapidly appreciating stocks, initially viewing the small share amounts as significant gains. This experience, however, also led to substantial losses when he deviated from his original investment theses, convincing him of the need for a more disciplined and systematic approach, which he found in quantitative investing.

The discussion then shifts to Mahr's entry into the quantitative investing space. He joined MDT Advisors in 2002 as a junior analyst, a firm that had been a pioneer in quant investing since 1991. After MDT was acquired by Federated Investors (now Federated Hermes) in 2006, Mahr took over leadership of the team in 2008. He has since guided the group through significant changes in data availability, computing power, and investment methodologies over two decades.

Mahr elaborates on the evolution of quantitative investing, noting that while some early strategies persist, the field has seen an explosion in processing power, data, and algorithms. This has led to a dramatic increase in the sophistication and variety of strategies that can be implemented, supported by these technological advancements.

He then details MDT's investment strategies, explaining that in 2002, their approach involved traditional factor tilting strategies using a limited number of characteristics. However, these strategies experienced difficulties during the 1998-1999 period when value and quality factors were not favored by the market. This led the firm to seek a more differentiated approach, resulting in the development of their decision tree framework, which they continue to refine today.

The concept of a decision tree approach to stock selection is explained by analogy to life insurance, where a series of yes/no questions about characteristics predict an outcome like longevity. In the stock market context, these questions pertain to company characteristics, with subsequent questions evolving based on relevance to specific company types. Mahr emphasizes that MDT's approach is not a "black box" but rather a "glass box," allowing for transparency into how the models operate and drive decisions.

Mahr articulates MDT's investment philosophy, which centers on the belief that a disciplined quantitative approach can yield an analytical advantage, leading to superior, all-weather portfolio returns. He contrasts this with a "global macro crystal ball" approach, stating that MDT's method relies on diversification across companies with differentiated alpha drivers rather than predicting market environments.

Regarding differentiation in a crowded market, Mahr highlights MDT's long-standing use of machine learning and artificial intelligence since 2001, giving them a significant head start. He notes that while many competitors are now exploring these technologies, MDT has accumulated extensive knowledge regarding their advantages and potential pitfalls in noisy data environments like stock return forecasting.

The critical challenges of overfitting and underfitting data in model building are discussed. Mahr explains that while a simple model might avoid overfitting, it risks underfitting and leaving explanatory power on the table. Striking the right balance between model complexity and simplicity is a key focus that MDT has evolved over decades.

Mahr describes how research ideas are generated, drawing from academic and practitioner literature, but more frequently from observations of their own strategies' behavior over their 30-plus years of investing. These observations across various market cycles have informed significant process enhancements.

He introduces "company age" as an unusual factor used in their models, measuring how long a company has been publicly traded. While company age alone does not predict performance, Mahr explains that the decision tree framework leverages it to ask different questions of young companies versus established ones, with valuation being more critical for older companies. The selection of factors is driven by the investment team's research, but their application within the algorithm is mechanically determined.

The process for retesting and potentially removing factors is detailed. Factors are removed if they no longer perform or if new factors capture a correlated effect. The decision to remove a factor is data-driven, stemming from observations of fewer trades being driven by that factor over time. Research projects then confirm whether removing the factor impacts portfolio outcomes negatively.

Mahr visualizes the internal workings of the "glass box" as a "forest of trees" rather than a single tree, a progression enabled by increased processing power and advanced algorithms. He explains that in the past, a single tree was printed and taped to the wall for trade review, but with thousands of trees, analytical tools are used to synthesize and summarize their operations. Each tree typically involves two to five decision points, as deeper trees operate on progressively smaller data pools, limiting their depth.

An example of a decision tree path is provided, starting with a question about a company's financing activities (issuing debt or shares versus buying them back). Companies with high financing levels tend to underperform, but the algorithm seeks out strong momentum companies within this group that can outperform despite financing. Down the non-financing branch, other differentiated company groups are identified. Subsequent questions might involve volatility or company age, with momentum being more meaningful for newer companies. The stopping rules for asking questions are mechanical, with a hard limit of five questions per tree or stopping if a branch's data pool becomes too small.

The aggregation of thousands of tree signals into a portfolio is managed by a portfolio optimizer. This optimizer considers alpha forecasts from the decision tree model, risk management through hard constraints and a statistical risk model, and trading costs, aiming for portfolios with consistent outcomes and efficient trading.

Mahr discusses the trade-off between alpha signals and market impact, noting that less liquid companies will have smaller positions and slower trading speeds. He also addresses human judgment in overriding model decisions, emphasizing a data-oriented approach. When a company reports earnings, the team analyzes how those earnings will impact the model's factors and subsequent tree decisions, aiming for precision in potential overrides.

The reflexivity of other quantitative participants is considered in the context of leverage. Mahr points to historical events like Long-Term Capital Management and the 2007 quant quake, attributing them to periods of underperformance magnified by leverage, rather than inherent flaws in quantitative strategies themselves. He notes that statistical arbitrage strategies had become crowded, leading some managers to increase leverage.

Mahr observes changes in market structure, suggesting that markets may have shifted from a continuous trend toward efficiency to something different in the last couple of years. He speculates that this could be due to the rise of "pod shops," passive management, retail trading, or a combination of factors, but stresses MDT's focus on having active strategies that can capitalize on inefficiencies regardless of their source.

A typical day involves downloading updated data, recalculating characteristics, running companies through the forest of trees for updated forecasts, and re-optimizing portfolios to generate a trade list. The day begins with a trade review process focused on verifying data accuracy, understanding model dynamics, and identifying any market news not yet captured by the data inputs. The rest of the day is dedicated to research, idea generation, and model improvement, utilizing proprietary tooling built in-house since 1991.

Action Items

  • Audit decision tree model: Identify 3-5 potential overfitting or underfitting risks by analyzing historical data usage and model complexity.
  • Implement automated factor re-evaluation: Develop a process to periodically test the efficacy of 5-10 core factors, removing those with diminishing explanatory power.
  • Design a "glass box" visualization tool: Create a dashboard to illustrate the decision-making process of 2-3 key decision trees, clarifying input drivers.
  • Track 3-5 "bad story" stock performance: Monitor companies with negative qualitative news that the model selects to understand the emotional override dynamics.
  • Evaluate 5-10 alternative data sources: Assess the analytical edge provided by new datasets versus the cost and potential for information arms races.

Key Quotes

"This is precisely why this strategy works is because even quantitative investors who are intentionally trying to buy these stocks find it hard to overcome the human emotions involved with buying a bad story."

Daniel Mahr explains that the effectiveness of their quantitative strategy lies in its ability to bypass the emotional biases that even sophisticated investors struggle with. This highlights a core challenge in investing: human psychology often conflicts with rational decision-making, and a systematic approach can provide an advantage by adhering to predefined rules.


"What differentiates us at MDT is the use of machine learning AI and machine learning are very hot topics right now... at MDT we've been using these machine learning tools since 2001 so we have a 24 year head start on someone who is new to the game."

Daniel Mahr asserts that MDT's long-standing use of machine learning and AI sets them apart from competitors. This longevity provides them with a significant advantage, allowing them to accumulate extensive experience and understand the nuances and potential pitfalls of these powerful tools in financial modeling.


"Our view in the machine learning space is that transparency is exceedingly important to understand precisely how these models are working a common epithet that gets thrown at us in the quant investment management space is that we're using black boxes and that MDT that is not the case we like to position our investment strategies as being a glass box there's a lot of machinery on the inside but we can see into it we can see how it's working and understand what's driving all of the decision making on a day to day basis."

Daniel Mahr emphasizes MDT's commitment to transparency in their machine learning models, contrasting their "glass box" approach with the "black boxes" often associated with quantitative investing. This focus on understandability allows them to monitor and comprehend the decision-making processes within their sophisticated systems.


"The questions they're asked in sequence and in context at the top of the tree you're going to ask a question of all companies you're going to want a question that is relevant to explaining returns for big companies small companies growth companies value companies a common question we'll ask at the top of the tree will be about a company's use of financing whether they're issuing debt and or shares or buying those things back."

Daniel Mahr illustrates the initial stages of their decision tree framework by explaining that the first questions are designed to be universally relevant across all company types. He highlights a common starting point: analyzing a company's financing activities, such as issuing debt or shares, as a factor that can influence returns.


"There are informational edges and there are analytical edges... we have intentionally focused not on that arms race but on the analytical piece through the use of decision trees and machine learning the data that feeds our models is oriented towards the longest historical data and the highest quality data sets that are out there in the quant space so financials prices analyst estimates."

Daniel Mahr distinguishes between informational and analytical edges in investing, stating that MDT prioritizes the latter through their use of decision trees and machine learning. He explains that their approach relies on high-quality, long-term historical data like financials and prices, rather than engaging in the competitive pursuit of new, potentially fleeting information sources.


"I wish I knew earlier on life is a journey and that no one wins everything often doors that seemed closed open in time sometimes the path that you end up on as an alternative ends up being the right path."

Daniel Mahr reflects on a life lesson learned later in his career, emphasizing the importance of viewing life as a journey rather than a series of wins and losses. He advises that setbacks are natural, and often, seemingly closed doors can lead to unexpected but ultimately beneficial alternative paths.

Resources

External Resources

Books

  • "The Theory of Investment Value" by John Burr Williams - Mentioned as a foundational text in investment valuation.

Research & Studies

  • Academic journals - Mentioned as a source for research ideas in quantitative investing.
  • Practitioner literature - Mentioned as a source for research ideas in quantitative investing.

People

  • Daniel Mahr - Head of MDT, quantitative equity investing group at Federated Hermes.
  • David Goldsmith - Founder of the quant group at MDT and Mahr's mentor.
  • Sarah Stall - Led analytical and portfolio attribution efforts at MDT and was a mentor to Mahr.
  • John Burr Williams - Author of "The Theory of Investment Value."
  • Andrew Lo - MIT professor, co-author of a paper on the quant quake.
  • Amir Kandanani - Co-author of a paper on the quant quake.
  • Dasha Burns - Shared a quote about life's journey.

Organizations & Institutions

  • MDT Advisers - Quantitative equity investing group.
  • Federated Hermes - Firm that acquired MDT Advisers.
  • Federated Investors - Previous name of Federated Hermes.
  • MIT - Institution where Andrew Lo is a professor.

Websites & Online Resources

  • Capital Allocators (capitalallocators.com) - Mailing list and premium membership access.
  • Twitter (@tseides) - Ted Seides' Twitter handle.
  • LinkedIn (Ted Seides) - Ted Seides' LinkedIn profile.
  • The Podcast Consultant (thepodcastconsultant.com) - Provided editing and post-production for the episode.

Other Resources

  • Decision tree framework - MDT's "glass box" approach to stock selection.
  • Machine learning (ML) - Technology used by MDT since 2001.
  • Artificial Intelligence (AI) - Technology area of interest for MDT.
  • Overfitting and underfitting - Related issues in data science models.
  • Statistical arbitrage strategies - Mentioned in the context of quant blow-ups.
  • Alpha - Measures excess returns relative to a benchmark.
  • Factor tilting strategies - Traditional early quant strategies.
  • Momentum effects - Found in price-based factors.
  • Reversal effects - Found in price-based factors for companies with significant price drops.
  • Company age - An unusual factor used in MDT's models.
  • Valuation - An important differentiator in investment.
  • Book to price - A factor previously used in MDT's models.
  • Intangible economy - Described as a factor in how companies trade.
  • Forest of trees - An advanced algorithmic approach used by MDT.
  • Portfolio optimizer - Technology used to construct portfolios.
  • Risk constraints - Used in portfolio construction.
  • Statistical risk model - Used to predict volatility and tracking error.
  • Trading costs - Considered in portfolio optimization.
  • Market impact - Considered in portfolio construction and trading speed.
  • Leverage - Mentioned in relation to quant blow-ups.
  • Quant quake - A market event in August 2007.
  • Long Term Capital Management - A famous quant blow-up.
  • Archigos - Mentioned in relation to traditional portfolio strategies.
  • Passive management - Mentioned as a potential factor in market changes.
  • Retail trading - Mentioned as a potential factor in market changes.
  • Meme stocks - Mentioned as a potential factor in market changes.
  • Robinhood - Mentioned as a potential factor in market changes.
  • Alternative data sets - Discussed in relation to informational edges.
  • Informational edge - Gaining returns from new information.
  • Analytical edge - Gaining returns through sophisticated analysis.
  • Large language models (LLMs) - Not currently used by MDT for stock picking.
  • Software development co-pilots - An appealing area of AI for MDT.
  • Stock ownership - A factor being researched by MDT.
  • Proprietary tooling - Used by MDT for their investment process.
  • Financials, prices, analyst estimates - High-quality data sets used by MDT.
  • 50 years of data - The approximate amount of historical data MDT trains its models on.
  • Glass box approach - MDT's investment strategy, emphasizing transparency.
  • Black box approach - A common criticism of quant strategies.
  • Overriding trades - Human judgment applied to model decisions.
  • Reflexivity - The impact of quantitative participants on markets.
  • Statistical arbitrage - A type of quant strategy.
  • Market efficiency - The trend towards markets becoming more efficient.
  • Pod shops - Mentioned in relation to market structure changes.
  • Homebrewing - Mahr's hobby.
  • Sour cherry beers - Beers brewed using cherries from his wife's tree.
  • Life is a journey - A life lesson learned by Mahr.

---
Handpicked links, AI-assisted summaries. Human judgment, machine efficiency.
This content is a personally curated review and synopsis derived from the original podcast episode.